OLAC Record oai:scholarspace.manoa.hawaii.edu:10125/24817 |
Metadata | ||
Title: | Reflections on documentary corpora | |
Bibliographic Citation: | Rice, Sally; 2018-12-01; Kaipuleohone University of Hawai'i Digital Language Archive; http://hdl.handle.net/10125/24817. | |
Creator: | Rice, Sally | |
Date (W3CDTF): | 2018-12-01 | |
Description: | For decades, language documentation proponents have argued for the separability of LD as its own sub-discipline. Many corpus linguists have made this same claim; thus, corpus linguistics shares the ethos of data over theorizing, whereby primary data represent authentic, connected discourse that is natural (not elicited), broadly sampled (across speakers, generations, dialects), and balanced (reflecting different usage contexts and genres). Nevertheless, many misconceptions remain about what a language corpus is, how it is formatted, how big or balanced it needs to be, and most importantly, how it is queried. In this reflection, I dispel some of these misconceptions, while reassuring community members and field linguists alike that a corpus is an exceedingly powerful tool for guiding the expansion of the documentary record, keeping precious language data in circulation, and helping to produce the classic descriptive by-products of LD such as dictionaries, phrasebooks, and grammars. Above all, the less-familiar but more direct by-products of corpus interrogation, such as word lists, frequency counts, concordance lines, N-grams, collocations, distribution, and dispersion plots, are so immediately interpretable and useful by speakers, learners, and linguists, that LD should give corpus linguistic training the same attention as project planning, ethics, recording, transcription, annotation, metadata, and archiving. | |
National Foreign Language Resource Center | ||
Identifier: | Rice, Sally. 2018. Reflections on documentary corpora. In McDonnell, Bradley, Andrea L. Berez-Kroeker, and Gary Holton. (Eds.) Reflections on Language Documentation 20 Years after Himmelmann 1998. Language Documentation & Conservation Special Publication no. 15. [PP 157-172] Honolulu: University of Hawai‘i Press. | |
978-0-9973295-3-7 | ||
Identifier (URI): | http://hdl.handle.net/10125/24817 | |
Publisher: | University of Hawai'i Press | |
Relation: | LD&C Special Publication | |
Rights: | Creative Commons Attribution Non-Commercial Share Alike License | |
Subject: | corpus linguistics, language corpora, corpus queries, concordancer, connected discourse | |
Table Of Contents: | ldc-sp15-srice.pdf | |
OLAC Info | ||
Archive: | Language Documentation and Conservation | |
Description: | http://www.language-archives.org/archive/ldc.scholarspace.manoa.hawaii.edu | |
OAI Info | ||
OaiIdentifier: | oai:scholarspace.manoa.hawaii.edu:10125/24817 | |
DateStamp: | 2024-08-09 | |
Search Info | ||
Citation: | Rice, Sally. 2018. University of Hawai'i Press. |