OLAC Record oai:www.ldc.upenn.edu:LDC2022S11 |
Metadata | ||
Title: | Samrómur Children Icelandic Speech 1.0 | |
Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
Bibliographic Citation: | Hernández Mena, Carlos Daniel, et al. Samrómur Children Icelandic Speech 1.0 LDC2022S11. Web Download. Philadelphia: Linguistic Data Consortium, 2022 | |
Contributor: | Hernández Mena, Carlos Daniel | |
Borsky, Michal | ||
Mollberg, David | ||
Guðmundsson, Smári Freyr | ||
Hedström, Staffan | ||
Pálsson, Ragnar | ||
Jónsson, Ólafur Helgi | ||
Þorsteinsdóttir, Sunneva | ||
Guðmundsdóttir, Jóhanna Vigdís | ||
Magnusdottir, Eydis Huld | ||
Þórhallsdóttir, Ragnheiður | ||
Gudnason, Jon | ||
Date (W3CDTF): | 2022 | |
Date Issued (W3CDTF): | 2022-11-15 | |
Description: | *Introduction* Samrómur Children Icelandic Speech 1.0 was developed by the Language and Voice Lab, Reykjavik University in cooperation with Almannarómur, Center for Language Technology. The corpus contains 131 hours of Icelandic prompted speech from 3,175 speakers (children, aged 4-17 years) representing 137,597 utterances. This version 1.0 is equivalent to "Samrómur Children Icelandic Speech 21.09" as used by the Language Technology Programme for Icelandic 2019-2023. *Data* Speech data was collected between October 2019 and September 2021 using the Samrómur website which displayed prompts to participants. The prompts were mainly from The Icelandic Gigaword Corpus, which includes text from novels, news, plays, and from a list of location names in Iceland. Additional prompts were taken from the Icelandic Web of Science and others were created by combining a name followed by a question or a demand. Prompts and speaker metadata are included in the corpus. The audio data is divided into train, dev, and test sets and is presented as flac compressed, single channel, 16 kHz, 16-bit linear PCM. *Samples* Please listen to this audio sample (FLAC). *Updates* None at this time. | |
Extent: | Corpus size: 7971738 KB | |
Format: | Sampling Rate: 16000 | |
Sampling Format: flac | ||
Identifier: | LDC2022S11 | |
https://catalog.ldc.upenn.edu/LDC2022S11 | ||
ISLRN: 228-981-226-601-4 | ||
DOI: 10.35111/frrj-qd60 | ||
Language: | Icelandic | |
Language (ISO639): | isl | |
License: | Samrómur Children Icelandic Speech 1.0 Agreement (For-Profit): https://catalog.ldc.upenn.edu/license/samromur-children-icelandic-speech-1-dot-0-agreement-for-profit.pdf | |
Samrómur Children Icelandic Speech 1.0 Agreement (Non-Member): https://catalog.ldc.upenn.edu/license/samromur-children-icelandic-speech-1-dot-0-agreement-non-member.pdf | ||
Samrómur Children Icelandic Speech 1.0 Agreement (Not-For-Profit): https://catalog.ldc.upenn.edu/license/samromur-children-icelandic-speech-1-dot-0-agreement-not-for-profit.pdf | ||
Medium: | Distribution: Web Download | |
Publisher: | Linguistic Data Consortium | |
Publisher (URI): | https://www.ldc.upenn.edu | |
Relation (URI): | https://catalog.ldc.upenn.edu/docs/LDC2022S11 | |
Rights Holder: | Portions © 2022 Reykjavik University, © 2022 Trustees of the University of Pennsylvania | |
Type (DCMI): | Sound | |
Text | ||
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | The LDC Corpus Catalog | |
Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:www.ldc.upenn.edu:LDC2022S11 | |
DateStamp: | 2023-12-05 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Hernández Mena, Carlos Daniel; Borsky, Michal; Mollberg, David; Guðmundsson, Smári Freyr; Hedström, Staffan; Pálsson, Ragnar; Jónsson, Ólafur Helgi; Þorsteinsdóttir, Sunneva; Guðmundsdóttir, Jóhanna Vigdís; Magnusdottir, Eydis Huld; Þórhallsdóttir, Ragnheiður; Gudnason, Jon. 2022. Linguistic Data Consortium. | |
Terms: | area_Europe country_IS dcmi_Sound dcmi_Text iso639_isl olac_primary_text |