Cargando…

Statistical inferences for polarity identification in natural language

Information forms the basis for all human behavior, including the ubiquitous decision-making that people constantly perform in their every day lives. It is thus the mission of researchers to understand how humans process information to reach decisions. In order to facilitate this task, this work pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Pröllochs, Nicolas, Feuerriegel, Stefan, Neumann, Dirk
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6303018/
https://www.ncbi.nlm.nih.gov/pubmed/30576343
http://dx.doi.org/10.1371/journal.pone.0209323
_version_ 1783382098542854144
author Pröllochs, Nicolas
Feuerriegel, Stefan
Neumann, Dirk
author_facet Pröllochs, Nicolas
Feuerriegel, Stefan
Neumann, Dirk
author_sort Pröllochs, Nicolas
collection PubMed
description Information forms the basis for all human behavior, including the ubiquitous decision-making that people constantly perform in their every day lives. It is thus the mission of researchers to understand how humans process information to reach decisions. In order to facilitate this task, this work proposes LASSO regularization as a statistical tool to extract decisive words from textual content in order to study the reception of granular expressions in natural language. This differs from the usual use of the LASSO as a predictive model and, instead, yields highly interpretable statistical inferences between the occurrences of words and an outcome variable. Accordingly, the method suggests direct implications for the social sciences: it serves as a statistical procedure for generating domain-specific dictionaries as opposed to frequently employed heuristics. In addition, researchers can now identify text segments and word choices that are statistically decisive to authors or readers and, based on this knowledge, test hypotheses from behavioral research.
format Online
Article
Text
id pubmed-6303018
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-63030182019-01-08 Statistical inferences for polarity identification in natural language Pröllochs, Nicolas Feuerriegel, Stefan Neumann, Dirk PLoS One Research Article Information forms the basis for all human behavior, including the ubiquitous decision-making that people constantly perform in their every day lives. It is thus the mission of researchers to understand how humans process information to reach decisions. In order to facilitate this task, this work proposes LASSO regularization as a statistical tool to extract decisive words from textual content in order to study the reception of granular expressions in natural language. This differs from the usual use of the LASSO as a predictive model and, instead, yields highly interpretable statistical inferences between the occurrences of words and an outcome variable. Accordingly, the method suggests direct implications for the social sciences: it serves as a statistical procedure for generating domain-specific dictionaries as opposed to frequently employed heuristics. In addition, researchers can now identify text segments and word choices that are statistically decisive to authors or readers and, based on this knowledge, test hypotheses from behavioral research. Public Library of Science 2018-12-21 /pmc/articles/PMC6303018/ /pubmed/30576343 http://dx.doi.org/10.1371/journal.pone.0209323 Text en © 2018 Pröllochs et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Pröllochs, Nicolas
Feuerriegel, Stefan
Neumann, Dirk
Statistical inferences for polarity identification in natural language
title Statistical inferences for polarity identification in natural language
title_full Statistical inferences for polarity identification in natural language
title_fullStr Statistical inferences for polarity identification in natural language
title_full_unstemmed Statistical inferences for polarity identification in natural language
title_short Statistical inferences for polarity identification in natural language
title_sort statistical inferences for polarity identification in natural language
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6303018/
https://www.ncbi.nlm.nih.gov/pubmed/30576343
http://dx.doi.org/10.1371/journal.pone.0209323
work_keys_str_mv AT prollochsnicolas statisticalinferencesforpolarityidentificationinnaturallanguage
AT feuerriegelstefan statisticalinferencesforpolarityidentificationinnaturallanguage
AT neumanndirk statisticalinferencesforpolarityidentificationinnaturallanguage