OLAC Record oai:scholarspace.manoa.hawaii.edu:10125/4512 |
Metadata | ||
Title: | Language-specific encoding in endangered language corpora | |
Bibliographic Citation: | Gippert, Jost; 2012-08; Kaipuleohone University of Hawai'i Digital Language Archive;http://hdl.handle.net/10125/4512. | |
Creator: | Gippert, Jost | |
Date (W3CDTF): | 2012-08 | |
Description: | The paper addresses problems of corpus building and retrieval resulting from codeswitching, which is a characteristic feature of endangered language recordings. The typical appearance of code-switching phenomena is first outlined on the basis of data collected in the DoBeS ‘ECLinG’ project, which dealt with three endangered Caucasian languages spoken in Georgia: Tsova-Tush (Batsbi), Udi, and Svan. The problem of language-specific retrieval is illustrated with examples showing the usage of the word da in Tsova-Tush contexts, which represents, as a homonym, either a native copula form (‘it is’) or the Georgian conjunction ‘and’. The subsequent section discusses the annotation requirements that are necessary to automatically distinguish the languages involved in code-switching, with a focus on the emerging ISO standard 639-6. It is argued that the fine-grained distinction of varieties and subvarieties and their interrelationship – as aimed at in this standard – requires a thorough reconsideration if it is to be applied in the markup of corpus data. | |
National Foreign Language Resource Center | ||
Identifier: | Gippert, Jost. 2012. Language-specific encoding in endangered language corpora. In Frank Seifart, Geoffrey Haig, Nikolaus P. Himmelmann, Dagmar Jung, Anna Margetts, and Paul Trilsbeek (eds). 2012. Potentials of Language Documentation: Methods, Analyses, and Utilization. 25-31. Honolulu: University of Hawai'i Press. | |
978-0-9856211-0-0 | ||
Identifier (URI): | http://hdl.handle.net/10125/4512 | |
Publisher: | University of Hawai'i Press | |
Relation: | LD&C Special Publication | |
Rights: | Creative Commons Attribution Non-Commercial Share Alike License | |
Table Of Contents: | 03gippert.pdf | |
OLAC Info |
||
Archive: | Language Documentation and Conservation | |
Description: | http://www.language-archives.org/archive/ldc.scholarspace.manoa.hawaii.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:scholarspace.manoa.hawaii.edu:10125/4512 | |
DateStamp: | 2024-09-17 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Gippert, Jost. 2012. University of Hawai'i Press. |