Cargando…
Statistical inferences for polarity identification in natural language
Information forms the basis for all human behavior, including the ubiquitous decision-making that people constantly perform in their every day lives. It is thus the mission of researchers to understand how humans process information to reach decisions. In order to facilitate this task, this work pro...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6303018/ https://www.ncbi.nlm.nih.gov/pubmed/30576343 http://dx.doi.org/10.1371/journal.pone.0209323 |
_version_ | 1783382098542854144 |
---|---|
author | Pröllochs, Nicolas Feuerriegel, Stefan Neumann, Dirk |
author_facet | Pröllochs, Nicolas Feuerriegel, Stefan Neumann, Dirk |
author_sort | Pröllochs, Nicolas |
collection | PubMed |
description | Information forms the basis for all human behavior, including the ubiquitous decision-making that people constantly perform in their every day lives. It is thus the mission of researchers to understand how humans process information to reach decisions. In order to facilitate this task, this work proposes LASSO regularization as a statistical tool to extract decisive words from textual content in order to study the reception of granular expressions in natural language. This differs from the usual use of the LASSO as a predictive model and, instead, yields highly interpretable statistical inferences between the occurrences of words and an outcome variable. Accordingly, the method suggests direct implications for the social sciences: it serves as a statistical procedure for generating domain-specific dictionaries as opposed to frequently employed heuristics. In addition, researchers can now identify text segments and word choices that are statistically decisive to authors or readers and, based on this knowledge, test hypotheses from behavioral research. |
format | Online Article Text |
id | pubmed-6303018 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-63030182019-01-08 Statistical inferences for polarity identification in natural language Pröllochs, Nicolas Feuerriegel, Stefan Neumann, Dirk PLoS One Research Article Information forms the basis for all human behavior, including the ubiquitous decision-making that people constantly perform in their every day lives. It is thus the mission of researchers to understand how humans process information to reach decisions. In order to facilitate this task, this work proposes LASSO regularization as a statistical tool to extract decisive words from textual content in order to study the reception of granular expressions in natural language. This differs from the usual use of the LASSO as a predictive model and, instead, yields highly interpretable statistical inferences between the occurrences of words and an outcome variable. Accordingly, the method suggests direct implications for the social sciences: it serves as a statistical procedure for generating domain-specific dictionaries as opposed to frequently employed heuristics. In addition, researchers can now identify text segments and word choices that are statistically decisive to authors or readers and, based on this knowledge, test hypotheses from behavioral research. Public Library of Science 2018-12-21 /pmc/articles/PMC6303018/ /pubmed/30576343 http://dx.doi.org/10.1371/journal.pone.0209323 Text en © 2018 Pröllochs et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Pröllochs, Nicolas Feuerriegel, Stefan Neumann, Dirk Statistical inferences for polarity identification in natural language |
title | Statistical inferences for polarity identification in natural language |
title_full | Statistical inferences for polarity identification in natural language |
title_fullStr | Statistical inferences for polarity identification in natural language |
title_full_unstemmed | Statistical inferences for polarity identification in natural language |
title_short | Statistical inferences for polarity identification in natural language |
title_sort | statistical inferences for polarity identification in natural language |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6303018/ https://www.ncbi.nlm.nih.gov/pubmed/30576343 http://dx.doi.org/10.1371/journal.pone.0209323 |
work_keys_str_mv | AT prollochsnicolas statisticalinferencesforpolarityidentificationinnaturallanguage AT feuerriegelstefan statisticalinferencesforpolarityidentificationinnaturallanguage AT neumanndirk statisticalinferencesforpolarityidentificationinnaturallanguage |