Linking norms, ratings, and relations of words and concepts across multiple language varieties

Psychologists and linguists collect various data on word and concept properties. In psychology, scholars have accumulated norms and ratings for a large number of words in languages with many speakers. In linguistics, scholars have accumulated cross-linguistic information about the relations between...

Bibliographic Details
Main Authors: Tjuka, Annika; Forkel, Robert; List, Johann-Mattis
Format: Online Article Text
Language: English
Published: Springer US 2021
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9046307/
https://www.ncbi.nlm.nih.gov/pubmed/34357536
http://dx.doi.org/10.3758/s13428-021-01650-1
author Tjuka, Annika
Forkel, Robert
List, Johann-Mattis
collection PubMed
description Psychologists and linguists collect various data on word and concept properties. In psychology, scholars have accumulated norms and ratings for a large number of words in languages with many speakers. In linguistics, scholars have accumulated cross-linguistic information about the relations between words and concepts. Until now, however, there have been no efforts to combine information from the two fields, which would allow comparison of psychological and linguistic properties across different languages. The Database of Cross-Linguistic Norms, Ratings, and Relations for Words and Concepts (NoRaRe) is the first attempt to close this gap. Building on a reference catalog that offers standardization of concepts used in historical and typological language comparison, it integrates data from psychology and linguistics, collected from 98 data sets, covering 65 unique properties for 40 languages. The database is curated with the help of manual, automated, and semi-automated workflows and uses a software API to control and access the data. The database is accessible via a web application, the software API, or scripting languages. In this study, we present how the database is structured, how it can be extended, and how we control the quality of the data curation process. To illustrate its application, we present three case studies that test the validity of our approach, the accuracy of our workflows, and the integrative potential of the database. Due to regular version updates, the NoRaRe database has the potential to advance research in psychology and linguistics by offering researchers an integrated perspective on both fields. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.3758/s13428-021-01650-1.
format Online
Article
Text
id pubmed-9046307
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-9046307 2022-05-07 Linking norms, ratings, and relations of words and concepts across multiple language varieties Tjuka, Annika Forkel, Robert List, Johann-Mattis Behav Res Methods Comment
Springer US 2021-08-06 2022 /pmc/articles/PMC9046307/ /pubmed/34357536 http://dx.doi.org/10.3758/s13428-021-01650-1 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/ Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
title Linking norms, ratings, and relations of words and concepts across multiple language varieties
topic Comment