English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
 
 
DownloadE-Mail
  Lexibank, a public repository of standardized wordlists with computed phonological and lexical features

List, J.-M., Forkel, R., Greenhill, S. J., Rzymski, C., Englisch, J., & Gray, R. D. (2022). Lexibank, a public repository of standardized wordlists with computed phonological and lexical features. Scientific Data, 9: 316. doi:10.1038/s41597-022-01432-0.

Item is

Files

show Files
hide Files
:
List_Lexibank_SciData_2022.pdf (Publisher version), 10MB
Name:
List_Lexibank_SciData_2022.pdf
Description:
-
OA-Status:
Gold
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
2022
Copyright Info:
This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Cre- ative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not per- mitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Locators

show
hide
Description:
-
OA-Status:
Green

Creators

show
hide
 Creators:
List, Johann-Mattis1, Author                 
Forkel, Robert1, Author                 
Greenhill, Simon J.1, Author                 
Rzymski, Christoph1, Author                 
Englisch, Johannes1, Author                 
Gray, Russell D.1, Author                 
Affiliations:
1Department of Linguistic and Cultural Evolution, Max Planck Institute for Evolutionary Anthropology, Max Planck Society, ou_3237541              

Content

show
hide
Free keywords: -
 Abstract: The past decades have seen substantial growth in digital data on the world’s languages. At the same time, the demand for cross-linguistic datasets has been increasing, as witnessed by numerous studies devoted to diverse questions on human prehistory, cultural evolution, and human cognition. Unfortunately, most published datasets lack standardization which makes their comparison difficult. Here, we present a new approach to increase the comparability of cross-linguistic lexical data. We have designed workflows for the computer-assisted lifting of datasets to Cross-Linguistic Data Formats, a collection of standards that make these datasets more Findable, Accessible, Interoperable, and Reusable (FAIR). We test the Lexibank workflow on 100 lexical datasets from which we derive an aggregated database of wordlists in unified phonetic transcriptions covering more than 2000 language varieties. We illustrate the benefits of our approach by showing how phonological and lexical features can be automatically inferred, complementing and expanding existing cross-linguistic datasets.

Details

show
hide
Language(s): eng - English
 Dates: 2022-06-16
 Publication Status: Published online
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: Peer
 Identifiers: DOI: 10.1038/s41597-022-01432-0
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: Scientific Data
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: 9 Sequence Number: 316 Start / End Page: - Identifier: ISSN: 2052-4463