Released

Book Chapter

Managing historical linguistic data for computational phylogenetics and computer-assisted language comparison

MPS-Authors
Tresoldi, Tiago
CALC, Max Planck Institute for the Science of Human History, Max Planck Society;
Department of Linguistic and Cultural Evolution, Max Planck Institute for Evolutionary Anthropology, Max Planck Society;

Rzymski, Christoph
Department of Linguistic and Cultural Evolution, Max Planck Institute for Evolutionary Anthropology, Max Planck Society;

Forkel, Robert
Department of Linguistic and Cultural Evolution, Max Planck Institute for Evolutionary Anthropology, Max Planck Society;

Greenhill, Simon J.
Department of Linguistic and Cultural Evolution, Max Planck Institute for Evolutionary Anthropology, Max Planck Society;

List, Johann-Mattis
CALC, Max Planck Institute for the Science of Human History, Max Planck Society;
Department of Linguistic and Cultural Evolution, Max Planck Institute for Evolutionary Anthropology, Max Planck Society;

Gray, Russell D.
Department of Linguistic and Cultural Evolution, Max Planck Institute for Evolutionary Anthropology, Max Planck Society;

Citation

Tresoldi, T., Rzymski, C., Forkel, R., Greenhill, S. J., List, J.-M., & Gray, R. D. (2022). Managing historical linguistic data for computational phylogenetics and computer-assisted language comparison. In A. L. Berez-Kroeker, B. McDonnell, & E. Koller (Eds.), The open handbook of linguistic data management (pp. 345-354). Massachusetts: The MIT Press.


Cite as: https://hdl.handle.net/21.11116/0000-0009-02C2-9
Abstract
The popularisation of computer-based methods in comparative linguistics has led to a greater awareness of the issues caused by limited data sustainability and of the need for proper data management. In this use case and its accompanying tutorial, we present principles of data management as applied to computational phylogenetics and computer-assisted language comparison, showcasing the solutions we recommend. Instead of enumerating the many ways to code and use linguistic data in a phylogenetic analysis, we illustrate our suggestions for phylogenetic data management with a workflow based on a concrete analysis, showing with the help of a published dataset how data should be managed. We explore the information, file formats, processes, and software involved, and we explain and show how to collect and store cross-linguistic information, how to guarantee that datasets are cross-linguistically comparable, how to store intermediate and final results of the analyses, and how to share data in a reusable form by relying on the tools and principles of the CLDF initiative.