OLAC Record oai:www.ldc.upenn.edu:LDC2006T09 |
Metadata | ||
Title: | Korean Treebank Annotations Version 2.0 | |
Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
Bibliographic Citation: | Han, Na-Rae, et al. Korean Treebank Annotations Version 2.0 LDC2006T09. Web Download. Philadelphia: Linguistic Data Consortium, 2006 | |
Contributor: | Han, Na-Rae | |
Ryu, Shijong | ||
Chae, Sook-Hee | ||
Yang, Seung-yun | ||
Lee, Seunghun | ||
Palmer, Martha | ||
Date (W3CDTF): | 2006 | |
Date Issued (W3CDTF): | 2006-04-17 | |
Description: | *Introduction* The Korean Treebank Annotations Version 2.0 was developed by the Linguistic Data Consortium (LDC) and contains 647 articles of Korean newswire text annotated with morphological and syntactic information. It is an extension of the Korean English Treebank Annotations (LDC2002T26). *Data* The original texts for the Korean Treebank 2.0 were selected from the LDC corpus Korean Newswire (LDC2000T45), which is a collection of Korean Press Agency news articles from June 2, 1994 to March 20, 2000. Korean Treebank 2.0 is based on the March 2000 portion of the corpus. The articles were collected by means of a continuous feed from the news provider over a modem connection. The annotated corpus can find many uses, including training of morphological analyzers, part-of-speech taggers and syntactic parsers. The text is encoded as KSC-5601(EUC-KR). Version 1.1 of the treebank is included in this release. *Samples* For an example of the data in the corpus, please review this sample. *Updates* None at this time. | |
Extent: | Corpus size: 19456 KB | |
Identifier: | LDC2006T09 | |
https://catalog.ldc.upenn.edu/LDC2006T09 | ||
ISBN: 1-58563-381-X | ||
ISLRN: 365-025-522-700-1 | ||
DOI: 10.35111/02nk-p662 | ||
Language: | Korean | |
Language (ISO639): | kor | |
License: | Korean Treebank Annotations Version 2.0 Agreement: https://catalog.ldc.upenn.edu/license/korean-treebank-annotations-version-2.pdf | |
Medium: | Distribution: Web Download | |
Publisher: | Linguistic Data Consortium | |
Publisher (URI): | https://www.ldc.upenn.edu | |
Rights Holder: | © 2001-2002 CoGenTex, Inc., © 2000 Korean Press Agency, © 2000-2005, 2006 Trustees of the University of Pennsylvania | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | The LDC Corpus Catalog | |
Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:www.ldc.upenn.edu:LDC2006T09 | |
DateStamp: | 2021-09-28 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Han, Na-Rae; Ryu, Shijong; Chae, Sook-Hee; Yang, Seung-yun; Lee, Seunghun; Palmer, Martha. 2006. Linguistic Data Consortium. | |
Terms: | area_Asia country_KR dcmi_Text iso639_kor olac_primary_text |