OLAC Record
oai:lindat.mff.cuni.cz:11234/1-3185

Metadata
Title:Prague Dependency Treebank - Consolidated 1.0 (PDT-C 1.0)
Bibliographic Citation:http://hdl.handle.net/11234/1-3185
Creator:Hajič, Jan
Bejček, Eduard
Bémová, Alevtina
Buráňová, Eva
Fučíková, Eva
Hajičová, Eva
Havelka, Jiří
Hlaváčová, Jaroslava
Homola, Petr
Ircing, Pavel
Kárník, Jiří
Kettnerová, Václava
Klyueva, Natalia
Kolářová, Veronika
Kučová, Lucie
Lopatková, Markéta
Mareček, David
Mikulová, Marie
Mírovský, Jiří
Nedoluzhko, Anna
Novák, Michal
Pajas, Petr
Panevová, Jarmila
Peterek, Nino
Poláková, Lucie
Popel, Martin
Popelka, Jan
Romportl, Jan
Rysová, Magdaléna
Semecký, Jiří
Sgall, Petr
Spoustová, Johanka
Straka, Milan
Straňák, Pavel
Synková, Pavlína
Ševčíková, Magda
Šindlerová, Jana
Štěpánek, Jan
Štěpánková, Barbora
Toman, Josef
Urešová, Zdeňka
Vidová Hladká, Barbora
Zeman, Daniel
Zikánová, Šárka
Žabokrtský, Zdeněk
Date (W3CDTF):2021-01-11T10:09:56Z
Date Available:2021-01-11T10:09:56Z
Description:A richly annotated and genre-diversified language resource, The Prague Dependency Treebank – Consolidated 1.0 (PDT-C 1.0, or PDT-C in short in the sequel) is a consolidated release of the existing PDT-corpora of Czech data, uniformly annotated using the standard PDT scheme. PDT-corpora included in PDT-C: Prague Dependency Treebank (the original PDT contents, written newspaper and journal texts from three genres); Czech part of Prague Czech-English Dependency Treebank (translated financial texts, from English), Prague Dependency Treebank of Spoken Czech (spoken data, including audio and transcripts and multiple speech reconstruction annotation); PDT-Faust (user-generated texts). The difference from the separately published original treebanks can be briefly described as follows: it is published in one package, to allow easier data handling for all the datasets; the data is enhanced with a manual linguistic annotation at the morphological layer and new version of morphological dictionary is enclosed; a common valency lexicon for all four original parts is enclosed. Documentation provides two browsing and editing desktop tools (TrEd and MEd) and the corpus is also available online for searching using PML-TQ.
Identifier (URI):http://hdl.handle.net/11234/1-3185
Language:Czech
Language (ISO639):ces
Publisher:Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Replaces (URI):http://hdl.handle.net/11234/1-2621
http://hdl.handle.net/11234/1-1664
http://hdl.handle.net/11234/1-2375
Rights:Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
http://creativecommons.org/licenses/by-nc-sa/4.0/
Subject:treebank
dependency
tectogrammatics
topic-focus articulation
multiword expressions
coreference
bridging relations
discourse
morphology
syntax
tokenization
lemmatization
semantic relations
lexical semantics
lexicon
valency
speech reconstruction
clauses
speech recognition
spoken corpus
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11234/1-3185
DateStamp:  2021-06-29
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Hajič, Jan; Bejček, Eduard; Bémová, Alevtina; Buráňová, Eva; Fučíková, Eva; Hajičová, Eva; Havelka, Jiří; Hlaváčová, Jaroslava; Homola, Petr; Ircing, Pavel; Kárník, Jiří; Kettnerová, Václava; Klyueva, Natalia; Kolářová, Veronika; Kučová, Lucie; Lopatková, Markéta; Mareček, David; Mikulová, Marie; Mírovský, Jiří; Nedoluzhko, Anna; Novák, Michal; Pajas, Petr; Panevová, Jarmila; Peterek, Nino; Poláková, Lucie; Popel, Martin; Popelka, Jan; Romportl, Jan; Rysová, Magdaléna; Semecký, Jiří; Sgall, Petr; Spoustová, Johanka; Straka, Milan; Straňák, Pavel; Synková, Pavlína; Ševčíková, Magda; Šindlerová, Jana; Štěpánek, Jan; Štěpánková, Barbora; Toman, Josef; Urešová, Zdeňka; Vidová Hladká, Barbora; Zeman, Daniel; Zikánová, Šárka; Žabokrtský, Zdeněk. 2021. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL).
Terms: area_Europe country_CZ dcmi_Text iso639_ces olac_primary_text


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11234/1-3185
Up-to-date as of: Thu Oct 5 0:41:04 EDT 2023