Cargando…
Linking norms, ratings, and relations of words and concepts across multiple language varieties
Psychologists and linguists collect various data on word and concept properties. In psychology, scholars have accumulated norms and ratings for a large number of words in languages with many speakers. In linguistics, scholars have accumulated cross-linguistic information about the relations between...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer US
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9046307/ https://www.ncbi.nlm.nih.gov/pubmed/34357536 http://dx.doi.org/10.3758/s13428-021-01650-1 |
_version_ | 1784695494849593344 |
---|---|
author | Tjuka, Annika Forkel, Robert List, Johann-Mattis |
author_facet | Tjuka, Annika Forkel, Robert List, Johann-Mattis |
author_sort | Tjuka, Annika |
collection | PubMed |
description | Psychologists and linguists collect various data on word and concept properties. In psychology, scholars have accumulated norms and ratings for a large number of words in languages with many speakers. In linguistics, scholars have accumulated cross-linguistic information about the relations between words and concepts. Until now, however, there have been no efforts to combine information from the two fields, which would allow comparison of psychological and linguistic properties across different languages. The Database of Cross-Linguistic Norms, Ratings, and Relations for Words and Concepts (NoRaRe) is the first attempt to close this gap. Building on a reference catalog that offers standardization of concepts used in historical and typological language comparison, it integrates data from psychology and linguistics, collected from 98 data sets, covering 65 unique properties for 40 languages. The database is curated with the help of manual, automated, semi-automated workflows and uses a software API to control and access the data. The database is accessible via a web application, the software API, or using scripting languages. In this study, we present how the database is structured, how it can be extended, and how we control the quality of the data curation process. To illustrate its application, we present three case studies that test the validity of our approach, the accuracy of our workflows, and the integrative potential of the database. Due to regular version updates, the NoRaRe database has the potential to advance research in psychology and linguistics by offering researchers an integrated perspective on both fields. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.3758/s13428-021-01650-1. |
format | Online Article Text |
id | pubmed-9046307 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer US |
record_format | MEDLINE/PubMed |
spelling | pubmed-90463072022-05-07 Linking norms, ratings, and relations of words and concepts across multiple language varieties Tjuka, Annika Forkel, Robert List, Johann-Mattis Behav Res Methods Comment Psychologists and linguists collect various data on word and concept properties. In psychology, scholars have accumulated norms and ratings for a large number of words in languages with many speakers. In linguistics, scholars have accumulated cross-linguistic information about the relations between words and concepts. Until now, however, there have been no efforts to combine information from the two fields, which would allow comparison of psychological and linguistic properties across different languages. The Database of Cross-Linguistic Norms, Ratings, and Relations for Words and Concepts (NoRaRe) is the first attempt to close this gap. Building on a reference catalog that offers standardization of concepts used in historical and typological language comparison, it integrates data from psychology and linguistics, collected from 98 data sets, covering 65 unique properties for 40 languages. The database is curated with the help of manual, automated, semi-automated workflows and uses a software API to control and access the data. The database is accessible via a web application, the software API, or using scripting languages. In this study, we present how the database is structured, how it can be extended, and how we control the quality of the data curation process. To illustrate its application, we present three case studies that test the validity of our approach, the accuracy of our workflows, and the integrative potential of the database. Due to regular version updates, the NoRaRe database has the potential to advance research in psychology and linguistics by offering researchers an integrated perspective on both fields. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.3758/s13428-021-01650-1. Springer US 2021-08-06 2022 /pmc/articles/PMC9046307/ /pubmed/34357536 http://dx.doi.org/10.3758/s13428-021-01650-1 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Comment Tjuka, Annika Forkel, Robert List, Johann-Mattis Linking norms, ratings, and relations of words and concepts across multiple language varieties |
title | Linking norms, ratings, and relations of words and concepts across multiple language varieties |
title_full | Linking norms, ratings, and relations of words and concepts across multiple language varieties |
title_fullStr | Linking norms, ratings, and relations of words and concepts across multiple language varieties |
title_full_unstemmed | Linking norms, ratings, and relations of words and concepts across multiple language varieties |
title_short | Linking norms, ratings, and relations of words and concepts across multiple language varieties |
title_sort | linking norms, ratings, and relations of words and concepts across multiple language varieties |
topic | Comment |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9046307/ https://www.ncbi.nlm.nih.gov/pubmed/34357536 http://dx.doi.org/10.3758/s13428-021-01650-1 |
work_keys_str_mv | AT tjukaannika linkingnormsratingsandrelationsofwordsandconceptsacrossmultiplelanguagevarieties AT forkelrobert linkingnormsratingsandrelationsofwordsandconceptsacrossmultiplelanguagevarieties AT listjohannmattis linkingnormsratingsandrelationsofwordsandconceptsacrossmultiplelanguagevarieties |