Cargando…

Principal Semantic Components of Language and the Measurement of Meaning

Metric systems for semantics, or semantic cognitive maps, are allocations of words or other representations in a metric space based on their meaning. Existing methods for semantic mapping, such as Latent Semantic Analysis and Latent Dirichlet Allocation, are based on paradigms involving dissimilarit...

Descripción completa

Detalles Bibliográficos
Autores principales: Samsonovic, Alexei V., Ascoli, Giorgio A.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2883995/
https://www.ncbi.nlm.nih.gov/pubmed/20552009
http://dx.doi.org/10.1371/journal.pone.0010921
_version_ 1782182302731730944
author Samsonovic, Alexei V.
Ascoli, Giorgio A.
author_facet Samsonovic, Alexei V.
Ascoli, Giorgio A.
author_sort Samsonovic, Alexei V.
collection PubMed
description Metric systems for semantics, or semantic cognitive maps, are allocations of words or other representations in a metric space based on their meaning. Existing methods for semantic mapping, such as Latent Semantic Analysis and Latent Dirichlet Allocation, are based on paradigms involving dissimilarity metrics. They typically do not take into account relations of antonymy and yield a large number of domain-specific semantic dimensions. Here, using a novel self-organization approach, we construct a low-dimensional, context-independent semantic map of natural language that represents simultaneously synonymy and antonymy. Emergent semantics of the map principal components are clearly identifiable: the first three correspond to the meanings of “good/bad” (valence), “calm/excited” (arousal), and “open/closed” (freedom), respectively. The semantic map is sufficiently robust to allow the automated extraction of synonyms and antonyms not originally in the dictionaries used to construct the map and to predict connotation from their coordinates. The map geometric characteristics include a limited number (∼4) of statistically significant dimensions, a bimodal distribution of the first component, increasing kurtosis of subsequent (unimodal) components, and a U-shaped maximum-spread planar projection. Both the semantic content and the main geometric features of the map are consistent between dictionaries (Microsoft Word and Princeton's WordNet), among Western languages (English, French, German, and Spanish), and with previously established psychometric measures. By defining the semantics of its dimensions, the constructed map provides a foundational metric system for the quantitative analysis of word meaning. Language can be viewed as a cumulative product of human experiences. Therefore, the extracted principal semantic dimensions may be useful to characterize the general semantic dimensions of the content of mental states. This is a fundamental step toward a universal metric system for semantics of human experiences, which is necessary for developing a rigorous science of the mind.
format Text
id pubmed-2883995
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-28839952010-06-15 Principal Semantic Components of Language and the Measurement of Meaning Samsonovic, Alexei V. Ascoli, Giorgio A. PLoS One Research Article Metric systems for semantics, or semantic cognitive maps, are allocations of words or other representations in a metric space based on their meaning. Existing methods for semantic mapping, such as Latent Semantic Analysis and Latent Dirichlet Allocation, are based on paradigms involving dissimilarity metrics. They typically do not take into account relations of antonymy and yield a large number of domain-specific semantic dimensions. Here, using a novel self-organization approach, we construct a low-dimensional, context-independent semantic map of natural language that represents simultaneously synonymy and antonymy. Emergent semantics of the map principal components are clearly identifiable: the first three correspond to the meanings of “good/bad” (valence), “calm/excited” (arousal), and “open/closed” (freedom), respectively. The semantic map is sufficiently robust to allow the automated extraction of synonyms and antonyms not originally in the dictionaries used to construct the map and to predict connotation from their coordinates. The map geometric characteristics include a limited number (∼4) of statistically significant dimensions, a bimodal distribution of the first component, increasing kurtosis of subsequent (unimodal) components, and a U-shaped maximum-spread planar projection. Both the semantic content and the main geometric features of the map are consistent between dictionaries (Microsoft Word and Princeton's WordNet), among Western languages (English, French, German, and Spanish), and with previously established psychometric measures. By defining the semantics of its dimensions, the constructed map provides a foundational metric system for the quantitative analysis of word meaning. Language can be viewed as a cumulative product of human experiences. Therefore, the extracted principal semantic dimensions may be useful to characterize the general semantic dimensions of the content of mental states. This is a fundamental step toward a universal metric system for semantics of human experiences, which is necessary for developing a rigorous science of the mind. Public Library of Science 2010-06-11 /pmc/articles/PMC2883995/ /pubmed/20552009 http://dx.doi.org/10.1371/journal.pone.0010921 Text en Samsonovich, Ascoli. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Samsonovic, Alexei V.
Ascoli, Giorgio A.
Principal Semantic Components of Language and the Measurement of Meaning
title Principal Semantic Components of Language and the Measurement of Meaning
title_full Principal Semantic Components of Language and the Measurement of Meaning
title_fullStr Principal Semantic Components of Language and the Measurement of Meaning
title_full_unstemmed Principal Semantic Components of Language and the Measurement of Meaning
title_short Principal Semantic Components of Language and the Measurement of Meaning
title_sort principal semantic components of language and the measurement of meaning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2883995/
https://www.ncbi.nlm.nih.gov/pubmed/20552009
http://dx.doi.org/10.1371/journal.pone.0010921
work_keys_str_mv AT samsonovicalexeiv principalsemanticcomponentsoflanguageandthemeasurementofmeaning
AT ascoligiorgioa principalsemanticcomponentsoflanguageandthemeasurementofmeaning