Cargando…
Principal Semantic Components of Language and the Measurement of Meaning
Metric systems for semantics, or semantic cognitive maps, are allocations of words or other representations in a metric space based on their meaning. Existing methods for semantic mapping, such as Latent Semantic Analysis and Latent Dirichlet Allocation, are based on paradigms involving dissimilarit...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2883995/ https://www.ncbi.nlm.nih.gov/pubmed/20552009 http://dx.doi.org/10.1371/journal.pone.0010921 |
_version_ | 1782182302731730944 |
---|---|
author | Samsonovic, Alexei V. Ascoli, Giorgio A. |
author_facet | Samsonovic, Alexei V. Ascoli, Giorgio A. |
author_sort | Samsonovic, Alexei V. |
collection | PubMed |
description | Metric systems for semantics, or semantic cognitive maps, are allocations of words or other representations in a metric space based on their meaning. Existing methods for semantic mapping, such as Latent Semantic Analysis and Latent Dirichlet Allocation, are based on paradigms involving dissimilarity metrics. They typically do not take into account relations of antonymy and yield a large number of domain-specific semantic dimensions. Here, using a novel self-organization approach, we construct a low-dimensional, context-independent semantic map of natural language that represents simultaneously synonymy and antonymy. Emergent semantics of the map principal components are clearly identifiable: the first three correspond to the meanings of “good/bad” (valence), “calm/excited” (arousal), and “open/closed” (freedom), respectively. The semantic map is sufficiently robust to allow the automated extraction of synonyms and antonyms not originally in the dictionaries used to construct the map and to predict connotation from their coordinates. The map geometric characteristics include a limited number (∼4) of statistically significant dimensions, a bimodal distribution of the first component, increasing kurtosis of subsequent (unimodal) components, and a U-shaped maximum-spread planar projection. Both the semantic content and the main geometric features of the map are consistent between dictionaries (Microsoft Word and Princeton's WordNet), among Western languages (English, French, German, and Spanish), and with previously established psychometric measures. By defining the semantics of its dimensions, the constructed map provides a foundational metric system for the quantitative analysis of word meaning. Language can be viewed as a cumulative product of human experiences. Therefore, the extracted principal semantic dimensions may be useful to characterize the general semantic dimensions of the content of mental states. This is a fundamental step toward a universal metric system for semantics of human experiences, which is necessary for developing a rigorous science of the mind. |
format | Text |
id | pubmed-2883995 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-28839952010-06-15 Principal Semantic Components of Language and the Measurement of Meaning Samsonovic, Alexei V. Ascoli, Giorgio A. PLoS One Research Article Metric systems for semantics, or semantic cognitive maps, are allocations of words or other representations in a metric space based on their meaning. Existing methods for semantic mapping, such as Latent Semantic Analysis and Latent Dirichlet Allocation, are based on paradigms involving dissimilarity metrics. They typically do not take into account relations of antonymy and yield a large number of domain-specific semantic dimensions. Here, using a novel self-organization approach, we construct a low-dimensional, context-independent semantic map of natural language that represents simultaneously synonymy and antonymy. Emergent semantics of the map principal components are clearly identifiable: the first three correspond to the meanings of “good/bad” (valence), “calm/excited” (arousal), and “open/closed” (freedom), respectively. The semantic map is sufficiently robust to allow the automated extraction of synonyms and antonyms not originally in the dictionaries used to construct the map and to predict connotation from their coordinates. The map geometric characteristics include a limited number (∼4) of statistically significant dimensions, a bimodal distribution of the first component, increasing kurtosis of subsequent (unimodal) components, and a U-shaped maximum-spread planar projection. Both the semantic content and the main geometric features of the map are consistent between dictionaries (Microsoft Word and Princeton's WordNet), among Western languages (English, French, German, and Spanish), and with previously established psychometric measures. By defining the semantics of its dimensions, the constructed map provides a foundational metric system for the quantitative analysis of word meaning. Language can be viewed as a cumulative product of human experiences. Therefore, the extracted principal semantic dimensions may be useful to characterize the general semantic dimensions of the content of mental states. This is a fundamental step toward a universal metric system for semantics of human experiences, which is necessary for developing a rigorous science of the mind. Public Library of Science 2010-06-11 /pmc/articles/PMC2883995/ /pubmed/20552009 http://dx.doi.org/10.1371/journal.pone.0010921 Text en Samsonovich, Ascoli. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Samsonovic, Alexei V. Ascoli, Giorgio A. Principal Semantic Components of Language and the Measurement of Meaning |
title | Principal Semantic Components of Language and the Measurement of Meaning |
title_full | Principal Semantic Components of Language and the Measurement of Meaning |
title_fullStr | Principal Semantic Components of Language and the Measurement of Meaning |
title_full_unstemmed | Principal Semantic Components of Language and the Measurement of Meaning |
title_short | Principal Semantic Components of Language and the Measurement of Meaning |
title_sort | principal semantic components of language and the measurement of meaning |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2883995/ https://www.ncbi.nlm.nih.gov/pubmed/20552009 http://dx.doi.org/10.1371/journal.pone.0010921 |
work_keys_str_mv | AT samsonovicalexeiv principalsemanticcomponentsoflanguageandthemeasurementofmeaning AT ascoligiorgioa principalsemanticcomponentsoflanguageandthemeasurementofmeaning |