  Linking norms, ratings, and relations of words and concepts across multiple language varieties [Preprint]

Tjuka, A., Forkel, R., & List, J.-M. (2020). Linking norms, ratings, and relations of words and concepts across multiple language varieties [Preprint]. PsyArXiv Preprints, tgw3z. doi:10.31234/osf.io/tgw3z.


Files

shh2672pre.pdf (Preprint), 2MB
Name:
shh2672pre.pdf
Description:
OA
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
-

Creators

 Creators:
Tjuka, Annika (1), Author
Forkel, Robert (1), Author
List, Johann-Mattis (2), Author
Affiliations:
(1) Linguistic and Cultural Evolution, Max Planck Institute for the Science of Human History, Max Planck Society, ou_2074311
(2) CALC, Max Planck Institute for the Science of Human History, Max Planck Society, ou_2385703

Content

Free keywords: Cross-linguistic Comparison, Interdisciplinary Database, Test-driven Data Curation, Word and Concept Properties
 Abstract: Psychologists and linguists have collected a great diversity of data for word and concept properties. In psychology, many studies accumulate norms and ratings such as word frequencies or age-of-acquisition often for a large number of words. Linguistics, on the other hand, provides valuable insights into relations of word meanings. We present a collection of those data sets for norms, ratings, and relations that cover different languages: ‘NoRaRe.’ To enable a comparison between the diverse data types, we established workflows that facilitate the expansion of the database. A web application allows convenient access to the data (https://digling.org/norare/). Furthermore, a software API ensures consistent data curation by providing tests to validate the data sets. The NoRaRe collection is linked to the database curated by the Concepticon project (https://concepticon.clld.org) which offers a reference catalog of unified concept sets. The link between words in the data sets and the Concepticon concept sets makes a cross-linguistic comparison possible. In three case studies, we test the validity of our approach, the accuracy of our workflow, and the applicability of our database. The results indicate that the NoRaRe database can be applied for the study of word properties across multiple languages. The data can be used by psychologists and linguists to benefit from the knowledge rooted in both research disciplines.

Details

Language(s): eng - English
 Dates: 2020-07-27
 Publication Status: Published online
 Pages: 24
 Publishing info: -
 Table of Contents: Introduction

Combing Forests of Data

Materials and Methods
Materials
Methods
- Manual Concept Mapping
- Automated Concept Mapping
- Semi-Automated Concept Mapping
- Labeling Word and Concept Properties

Validation
Descriptive Statistics of NoRaRe
Data Curation Workflow
Data Applicability
- Case Study 1: Replication of existing Findings
- Case Study 2: Comparison of Concept Mappings
- Case Study 3: Cross-Linguistic Comparison

Discussion and Conclusion
 Rev. Type: No review
 Identifiers: DOI: 10.31234/osf.io/tgw3z
Other: shh2672
 Degree: -

Source 1

Title: PsyArXiv Preprints
Abbreviation: PsyArXiv
Source Genre: Journal
 Creator(s):
Society for the Improvement of Psychological Science, Editor
Affiliations:
-
Publ. Info: London, Berlin : Society for the Improvement of Psychological Science
Pages: -
Volume / Issue: -
Sequence Number: tgw3z
Start / End Page: -
Identifier: URN: https://psyarxiv.com/