OLAC Record oai:lindat.mff.cuni.cz:11234/1-3775 |
Metadata | ||
Title: | FAUST cs-en 0.5 | |
Bibliographic Citation: | http://hdl.handle.net/11234/1-3775 | |
Creator: | Hajič, Jan | |
Mareček, David | ||
Fučíková, Eva | ||
Cinková, Silvie | ||
Štěpánek, Jan | ||
Mikulová, Marie | ||
Popel, Martin | ||
Date (W3CDTF): | 2021-10-15T13:57:12Z | |
Date Available: | 2021-10-15T13:57:12Z | |
Description: | This machine translation test set contains 2223 Czech sentences collected within the FAUST project (https://ufal.mff.cuni.cz/grants/faust, http://hdl.handle.net/11234/1-3308). Each original (noisy) sentence was normalized (clean1 and clean2) and translated to English independently by two translators. | |
Identifier (URI): | http://hdl.handle.net/11234/1-3775 | |
Language: | English | |
Czech | ||
Language (ISO639): | eng | |
ces | ||
Publisher: | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) | |
Rights: | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) | |
http://creativecommons.org/licenses/by-nc-sa/4.0/ | ||
Subject: | noisy texts | |
parallel corpus | ||
machine translation | ||
Type: | corpus | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University | |
Description: | http://www.language-archives.org/archive/lindat.mff.cuni.cz | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:lindat.mff.cuni.cz:11234/1-3775 | |
DateStamp: | 2021-10-15 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Hajič, Jan; Mareček, David; Fučíková, Eva; Cinková, Silvie; Štěpánek, Jan; Mikulová, Marie; Popel, Martin. 2021. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL). | |
Terms: | area_Europe country_CZ country_GB dcmi_Text iso639_ces iso639_eng olac_primary_text |
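
The two GetRecord entries above correspond to OAI-PMH requests for this record in the OLAC and simple DC formats. As a rough sketch of how such request URLs are formed: the `verb`, `identifier`, and `metadataPrefix` parameters come from the OAI-PMH protocol, and the identifier is the OaiIdentifier listed above. The base URL below is a placeholder, since this record does not state the repository endpoint, and the `olac` prefix is assumed to be the one this repository exposes for the OLAC format.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: the record does not give the repository's
# OAI-PMH base URL, so substitute the real one before sending a request.
BASE_URL = "https://example.org/oai/request"

# Taken from the OaiIdentifier field of this record.
OAI_IDENTIFIER = "oai:lindat.mff.cuni.cz:11234/1-3775"


def get_record_url(base_url: str, identifier: str, metadata_prefix: str) -> str:
    """Build an OAI-PMH GetRecord request URL for one record and one format."""
    query = urlencode(
        {
            "verb": "GetRecord",
            "identifier": identifier,
            "metadataPrefix": metadata_prefix,
        }
    )
    return f"{base_url}?{query}"


# "olac" is assumed for the OLAC format; "oai_dc" is the simple DC prefix.
olac_url = get_record_url(BASE_URL, OAI_IDENTIFIER, "olac")
dc_url = get_record_url(BASE_URL, OAI_IDENTIFIER, "oai_dc")

print(olac_url)
print(dc_url)
```

The sketch only builds the URLs and performs no network access. Fetching either URL from the real endpoint should return an XML response containing the record.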