OLAC Record oai:catalogue.elra.info:ELRA-W0062 |
Metadata | ||
Title: | CINTIL-DeepBank | |
Access Rights: | Rights available for: nonCommercialUse, commercialUse | |
Date Available (W3CDTF): | 2012-12-05 | |
Date Issued (W3CDTF): | 2012-12-05 | |
Date Modified (W3CDTF): | 2012-12-05 | |
Description: | The CINTIL-DeepBank (Branco et al., 2010) is a corpus of sentences annotated with their full-fledged deep grammatical representations, composed of 10,039 sentences and 110,166 tokens taken from different sources and domains: news (8,861 sentences; 101,430 tokens), and novels (399 sentences; 3,082 tokens). In addition, there are 779 sentences (5,654 tokens) used for regression testing of the computational grammar that supported the annotation of the corpus.For the creation of this DeepBank we adopted a semi-automatic analysis with a double-blind annotation followed by adjudication. The resulting dataset contains various levels of grammatical information, including morpho-syntactic information, phrase constituency, grammatical functions, and logical forms.The main motivation behind the creation of this resource was to build a high quality data set with grammatical information that could support the development of high-level processing tools for Portuguese.For more information see also: Branco, António, Costa, Francisco, João, Silva, Silveira, Sara, Castro, Sérgio, Avelãs, Mariana, Pinto, Clara and Graça, João, 2010, “Developing a Deep Linguistic Databank Supporting a Collection of Treebanks: the CINTIL DeepGramBank”, In Proceedings, LREC2010 – The 7th International Conference on Language Resources and Evaluation, La Valletta, Malta, May 19-21, 2010. | |
Identifier: | ELRA-W0062 | |
ISLRN: 368-672-631-502-0 | ||
Identifier (URI): | https://catalog.elra.info/en-us/repository/browse/ELRA-W0062/ | |
Language: | Portuguese | |
Language (ISO639): | por | |
Medium: | Not specified | |
Publisher: | ELRA (European Language Resources Association) | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | ELRA Catalogue of Language Resources | |
Description: | http://www.language-archives.org/archive/catalogue.elra.info | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:catalogue.elra.info:ELRA-W0062 | |
DateStamp: | 2012-12-05 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | n.a. 2012. ELRA (European Language Resources Association). | |
Terms: | area_Europe country_PT dcmi_Text iso639_por olac_primary_text |