Cargando…
Language statistics as a window into mental representations
Large-scale linguistic data is nowadays available in abundance. Using this source of data, previous research has identified redundancies between the statistical structure of natural language and properties of the (physical) world we live in. For example, it has been shown that we can gauge city size...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9110419/ https://www.ncbi.nlm.nih.gov/pubmed/35577887 http://dx.doi.org/10.1038/s41598-022-12027-5 |
_version_ | 1784709099417501696 |
---|---|
author | Günther, Fritz Rinaldi, Luca |
author_facet | Günther, Fritz Rinaldi, Luca |
author_sort | Günther, Fritz |
collection | PubMed |
description | Large-scale linguistic data is nowadays available in abundance. Using this source of data, previous research has identified redundancies between the statistical structure of natural language and properties of the (physical) world we live in. For example, it has been shown that we can gauge city sizes by analyzing their respective word frequencies in corpora. However, since natural language is always produced by human speakers, we point out that such redundancies can only come about indirectly and should necessarily be restricted cases where human representations largely retain characteristics of the physical world. To demonstrate this, we examine the statistical occurrence of words referring to body parts in very different languages, covering nearly 4 billions of native speakers. This is because the convergence between language and physical properties of the stimuli clearly breaks down for the human body (i.e., more relevant and functional body parts are not necessarily larger in size). Our findings indicate that the human body as extracted from language does not retain its actual physical proportions; instead, it resembles the distorted human-like figure known as the sensory homunculus, whose form depicts the amount of cortical area dedicated to sensorimotor functions of each body part (and, thus, their relative functional relevance). This demonstrates that the surface-level statistical structure of language opens a window into how humans represent the world they live in, rather than into the world itself. |
format | Online Article Text |
id | pubmed-9110419 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-91104192022-05-18 Language statistics as a window into mental representations Günther, Fritz Rinaldi, Luca Sci Rep Article Large-scale linguistic data is nowadays available in abundance. Using this source of data, previous research has identified redundancies between the statistical structure of natural language and properties of the (physical) world we live in. For example, it has been shown that we can gauge city sizes by analyzing their respective word frequencies in corpora. However, since natural language is always produced by human speakers, we point out that such redundancies can only come about indirectly and should necessarily be restricted cases where human representations largely retain characteristics of the physical world. To demonstrate this, we examine the statistical occurrence of words referring to body parts in very different languages, covering nearly 4 billions of native speakers. This is because the convergence between language and physical properties of the stimuli clearly breaks down for the human body (i.e., more relevant and functional body parts are not necessarily larger in size). Our findings indicate that the human body as extracted from language does not retain its actual physical proportions; instead, it resembles the distorted human-like figure known as the sensory homunculus, whose form depicts the amount of cortical area dedicated to sensorimotor functions of each body part (and, thus, their relative functional relevance). This demonstrates that the surface-level statistical structure of language opens a window into how humans represent the world they live in, rather than into the world itself. Nature Publishing Group UK 2022-05-16 /pmc/articles/PMC9110419/ /pubmed/35577887 http://dx.doi.org/10.1038/s41598-022-12027-5 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Günther, Fritz Rinaldi, Luca Language statistics as a window into mental representations |
title | Language statistics as a window into mental representations |
title_full | Language statistics as a window into mental representations |
title_fullStr | Language statistics as a window into mental representations |
title_full_unstemmed | Language statistics as a window into mental representations |
title_short | Language statistics as a window into mental representations |
title_sort | language statistics as a window into mental representations |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9110419/ https://www.ncbi.nlm.nih.gov/pubmed/35577887 http://dx.doi.org/10.1038/s41598-022-12027-5 |
work_keys_str_mv | AT guntherfritz languagestatisticsasawindowintomentalrepresentations AT rinaldiluca languagestatisticsasawindowintomentalrepresentations |