  Building and curating conversational corpora for diversity-aware language science and technology

Liesenfeld, A., & Dingemanse, M. (2022). Building and curating conversational corpora for diversity-aware language science and technology. In F. Béchet, P. Blache, K. Choukri, C. Cieri, T. DeClerck, S. Goggi, et al. (Eds.), Proceedings of the 13th Language Resources and Evaluation Conference (LREC 2022) (pp. 1178-1192). Marseille, France: European Language Resources Association.


Basic

Genre: Conference Paper

Files

Liesenfeld_Dingemanse_2022_Building and curating conversational corpora for diversity-aware language.pdf (Publisher version), 2MB
Name:
Liesenfeld_Dingemanse_2022_Building and curating conversational corpora for diversity-aware language.pdf
Description:
-
OA-Status:
Green
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
2022
Copyright Info:
-
License:
-

Creators

 Creators:
Liesenfeld, Andreas (1), Author
Dingemanse, Mark (1), Author
Affiliations:
(1) Center for Language Studies, External Organizations, ou_55238

Content

Free keywords: -
 Abstract: We present an analysis pipeline and best practice guidelines for building and curating corpora of everyday conversation in diverse languages. Surveying language documentation corpora and other resources that cover 67 languages and varieties from 28 phyla, we describe the compilation and curation process, specify minimal properties of a unified format for interactional data, and develop methods for quality control that take into account turn-taking and timing. Two case studies show the broad utility of conversational data for (i) charting human interactional infrastructure and (ii) tracing challenges and opportunities for current ASR solutions. Linguistically diverse conversational corpora can provide new insights for the language sciences and stronger empirical foundations for language technology.

Details

Language(s):
 Dates: 2022
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: Peer
 Degree: -

Event

Title: 13th Language Resources and Evaluation Conference (LREC 2022)
Place of Event: Marseille, France
Start-/End Date: 2022-06-20 - 2022-06-25

Source 1

Title: Proceedings of the 13th Language Resources and Evaluation Conference (LREC 2022)
Source Genre: Proceedings
 Creator(s):
Béchet, F., Editor
Blache, P., Editor
Choukri, K., Editor
Cieri, C., Editor
DeClerck, T., Editor
Goggi, S., Editor
Isahara, H., Editor
Maegaard, B., Editor
Mariani, J., Editor
Mazo, H., Editor
Odijk, J., Editor
Piperidis, S., Editor
Affiliations:
-
Publ. Info: Marseille, France : European Language Resources Association
Pages: -
Volume / Issue: -
Sequence Number: -
Start / End Page: 1178 - 1192
Identifier: -