Deutsch
 
Hilfe Datenschutzhinweis Impressum
  DetailsucheBrowse

Datensatz

DATENSATZ AKTIONENEXPORT
  The Nijmegen corpus of casual Czech

Ernestus, M., Kočková-Amortová, L., & Pollak, P. (2014). The Nijmegen corpus of casual Czech. In N. Calzolari, K. Choukri, T. Declerck, H. Loftsson, B. Maegaard, J. Mariani, et al. (Eds.), Proceedings of LREC 2014: 9th International Conference on Language Resources and Evaluation (pp. 365-370).

Item is

Dateien

einblenden: Dateien
ausblenden: Dateien
:
Ernestus_KockovaAmortova_Pollak_2014.pdf (Verlagsversion), 310KB
Name:
Ernestus_KockovaAmortova_Pollak_2014.pdf
Beschreibung:
-
OA-Status:
Sichtbarkeit:
Öffentlich
MIME-Typ / Prüfsumme:
application/pdf / [MD5]
Technische Metadaten:
Copyright Datum:
-
Copyright Info:
-

Externe Referenzen

einblenden:

Urheber

einblenden:
ausblenden:
 Urheber:
Ernestus, Mirjam1, 2, Autor           
Kočková-Amortová, Lucie1, Autor
Pollak, Petr3, Autor
Affiliations:
1Center for Language Studies, External Organization, ou_55238              
2Research Associates, MPI for Psycholinguistics, Max Planck Society, Wundtlaan 1, 6525 XD Nijmegen, NL, ou_2344700              
3Faculty of Electrical Engineering, Czech Technical University, Prague, ou_persistent22              

Inhalt

einblenden:
ausblenden:
Schlagwörter: -
 Zusammenfassung: This article introduces a new speech corpus, the Nijmegen Corpus of Casual Czech (NCCCz), which contains more than 30 hours of high-quality recordings of casual conversations in Common Czech, among ten groups of three male and ten groups of three female friends. All speakers were native speakers of Czech, raised in Prague or in the region of Central Bohemia, and were between 19 and 26 years old. Every group of speakers consisted of one confederate, who was instructed to keep the conversations lively, and two speakers naive to the purposes of the recordings. The naive speakers were engaged in conversations for approximately 90 minutes, while the confederate joined them for approximately the last 72 minutes. The corpus was orthographically annotated by experienced transcribers and this orthographic transcription was aligned with the speech signal. In addition, the conversations were videotaped. This corpus can form the basis for all types of research on casual conversations in Czech, including phonetic research and research on how to improve automatic speech recognition. The corpus will be freely available

Details

einblenden:
ausblenden:
Sprache(n): eng - English
 Datum: 20142014
 Publikationsstatus: Online veröffentlicht
 Seiten: -
 Ort, Verlag, Ausgabe: -
 Inhaltsverzeichnis: -
 Art der Begutachtung: -
 Identifikatoren: -
 Art des Abschluß: -

Veranstaltung

einblenden:
ausblenden:
Titel: LREC 2014: 9th International Conference on Language Resources and Evaluation
Veranstaltungsort: Reykjavik, Iceland
Start-/Enddatum: 2014-05-26 - 2014-05-31

Entscheidung

einblenden:

Projektinformation

einblenden:

Quelle 1

einblenden:
ausblenden:
Titel: Proceedings of LREC 2014: 9th International Conference on Language Resources and Evaluation
Genre der Quelle: Konferenzband
 Urheber:
Calzolari, Nicoletta, Herausgeber
Choukri, Khalid, Herausgeber
Declerck, Thierry, Herausgeber
Loftsson, Hrafn, Herausgeber
Maegaard, Bente, Herausgeber
Mariani, Joseph, Herausgeber
Moreno, Asuncion, Herausgeber
Odijk, Jan, Herausgeber
Piperidis, Stelios, Herausgeber
Affiliations:
-
Ort, Verlag, Ausgabe: -
Seiten: - Band / Heft: - Artikelnummer: - Start- / Endseite: 365 - 370 Identifikator: ISBN: 978-2-9517408-8-4