English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Computer-assisted language comparison: state of the art

Wu, M.-S., Schweikhard, N. E., Bodt, T. A., Hill, N. W., & List, J.-M. (2020). Computer-assisted language comparison: state of the art. Journal of open humanities data, 6(2): 12. doi:10.5334/johd.12.

Item is

Basic

show hide
Genre: Journal Article

Files

show Files
hide Files
:
shh2604.pdf (Publisher version), 2MB
Name:
shh2604.pdf
Description:
OA
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
-

Locators

show

Creators

show
hide
 Creators:
Wu, Mei-Shin1, Author              
Schweikhard, Nathanael E.1, Author              
Bodt, Timotheus A., Author
Hill, Nathan W., Author
List, Johann-Mattis1, Author              
Affiliations:
1CALC, Max Planck Institute for the Science of Human History, Max Planck Society, ou_2385703              

Content

show
hide
Free keywords: computer-assisted; language comparison; historical linguistics; Hmong-Mien language family
 Abstract: Historical language comparison opens windows onto a human past, long before the availability of written records. Since traditional language comparison within the framework of the comparative method is largely based on manual data comparison, requiring the meticulous sifting through dictionaries, word lists, and grammars, the framework is difficult to apply, especially in times where more and more data have become available in digital form. Unfortunately, it is not possible to simply automate the process of historical language comparison, not only because computational solutions lag behind human judgments in historical linguistics, but also because they lack the flexibility that would allow them to integrate various types of information from various kinds of sources. A more promising approach is to integrate computational and classical approaches within a computer-assisted framework, “neither completely computer-driven nor ignorant of the assistance computers afford” [1, p. 4]. In this paper, we will illustrate what we consider the current state of the art of computer-assisted language comparison by presenting a workflow that starts with raw data and leads up to a stage where sound correspondence patterns across multiple languages have been identified and can be readily presented, inspected, and discussed. We illustrate this workflow with the help of a newly prepared dataset on Hmong-Mien languages. Our illustration is accompanied by Python code and instructions on how to use additional web-based tools we developed so that users can apply our workflow for their own purposes.

Details

show
hide
Language(s): eng - English
 Dates: 2020-05-22
 Publication Status: Published online
 Pages: 14
 Publishing info: -
 Table of Contents: -
 Rev. Type: Peer
 Identifiers: DOI: 10.5334/johd.12
Other: shh2604
 Degree: -

Event

show

Legal Case

show

Project information

show hide
Project name : CALC
Grant ID : 715618
Funding program : Horizon 2020 (H2020)
Funding organization : European Commission (EC)

Source 1

show
hide
Title: Journal of open humanities data
  Abbreviation : JOHD
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: London : Ubiquity Press
Pages: - Volume / Issue: 6 (2) Sequence Number: 12 Start / End Page: - Identifier: ISSN: 2059-481X
CoNE: https://pure.mpg.de/cone/journals/resource/2059-481X