Cargando…

Predictive modeling for trustworthiness and other subjective text properties in online nutrition and health communication

While the internet has democratized and accelerated content creation and sharing, it has also made people more vulnerable to manipulation and misinformation. Also, the received information can be distorted by psychological biases. This is problematic especially in health-related communications which...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kauttonen, Janne, Hannukainen, Jenni, Tikka, Pia, Suomala, Jyrki
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2020
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7410325/ https://www.ncbi.nlm.nih.gov/pubmed/32760095 http://dx.doi.org/10.1371/journal.pone.0237144

_version_	1783568221238984704
author	Kauttonen, Janne Hannukainen, Jenni Tikka, Pia Suomala, Jyrki
author_facet	Kauttonen, Janne Hannukainen, Jenni Tikka, Pia Suomala, Jyrki
author_sort	Kauttonen, Janne
collection	PubMed
description	While the internet has democratized and accelerated content creation and sharing, it has also made people more vulnerable to manipulation and misinformation. Also, the received information can be distorted by psychological biases. This is problematic especially in health-related communications which can greatly affect the quality of life of individuals. We assembled and analyzed 364 texts related to nutrition and health from Finnish online sources, such as news, columns and blogs, and asked non-experts to subjectively evaluate the texts. Texts were rated for their trustworthiness, sentiment, logic, information, clarity, and neutrality properties. We then estimated individual biases and consensus ratings that were used in training regression models. Firstly, we found that trustworthiness was significantly correlated to the information, neutrality and logic of the texts. Secondly, individual ratings for information and logic were significantly biased by the age and diet of the raters. Our best regression models explained up to 70% of the total variance of consensus ratings based on the low-level properties of texts, such as semantic embeddings, presence of key-terms and part-of-speech tags, references, quotes and paragraphs. With a novel combination of crowdsourcing, behavioral analysis, natural language processing and predictive modeling, our study contributes to the automated identification of reliable and high-quality online information. While critical evaluation of truthfulness cannot be surrendered to the machine only, our findings provide new insights into automated evaluation of subjective text properties and analysis of morphologically-rich languages in regards to trustworthiness.
format	Online Article Text
id	pubmed-7410325
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-74103252020-08-13 Predictive modeling for trustworthiness and other subjective text properties in online nutrition and health communication Kauttonen, Janne Hannukainen, Jenni Tikka, Pia Suomala, Jyrki PLoS One Research Article While the internet has democratized and accelerated content creation and sharing, it has also made people more vulnerable to manipulation and misinformation. Also, the received information can be distorted by psychological biases. This is problematic especially in health-related communications which can greatly affect the quality of life of individuals. We assembled and analyzed 364 texts related to nutrition and health from Finnish online sources, such as news, columns and blogs, and asked non-experts to subjectively evaluate the texts. Texts were rated for their trustworthiness, sentiment, logic, information, clarity, and neutrality properties. We then estimated individual biases and consensus ratings that were used in training regression models. Firstly, we found that trustworthiness was significantly correlated to the information, neutrality and logic of the texts. Secondly, individual ratings for information and logic were significantly biased by the age and diet of the raters. Our best regression models explained up to 70% of the total variance of consensus ratings based on the low-level properties of texts, such as semantic embeddings, presence of key-terms and part-of-speech tags, references, quotes and paragraphs. With a novel combination of crowdsourcing, behavioral analysis, natural language processing and predictive modeling, our study contributes to the automated identification of reliable and high-quality online information. While critical evaluation of truthfulness cannot be surrendered to the machine only, our findings provide new insights into automated evaluation of subjective text properties and analysis of morphologically-rich languages in regards to trustworthiness. Public Library of Science 2020-08-06 /pmc/articles/PMC7410325/ /pubmed/32760095 http://dx.doi.org/10.1371/journal.pone.0237144 Text en © 2020 Kauttonen et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Kauttonen, Janne Hannukainen, Jenni Tikka, Pia Suomala, Jyrki Predictive modeling for trustworthiness and other subjective text properties in online nutrition and health communication
title	Predictive modeling for trustworthiness and other subjective text properties in online nutrition and health communication
title_full	Predictive modeling for trustworthiness and other subjective text properties in online nutrition and health communication
title_fullStr	Predictive modeling for trustworthiness and other subjective text properties in online nutrition and health communication
title_full_unstemmed	Predictive modeling for trustworthiness and other subjective text properties in online nutrition and health communication
title_short	Predictive modeling for trustworthiness and other subjective text properties in online nutrition and health communication
title_sort	predictive modeling for trustworthiness and other subjective text properties in online nutrition and health communication
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7410325/ https://www.ncbi.nlm.nih.gov/pubmed/32760095 http://dx.doi.org/10.1371/journal.pone.0237144
work_keys_str_mv	AT kauttonenjanne predictivemodelingfortrustworthinessandothersubjectivetextpropertiesinonlinenutritionandhealthcommunication AT hannukainenjenni predictivemodelingfortrustworthinessandothersubjectivetextpropertiesinonlinenutritionandhealthcommunication AT tikkapia predictivemodelingfortrustworthinessandothersubjectivetextpropertiesinonlinenutritionandhealthcommunication AT suomalajyrki predictivemodelingfortrustworthinessandothersubjectivetextpropertiesinonlinenutritionandhealthcommunication

Predictive modeling for trustworthiness and other subjective text properties in online nutrition and health communication

Ejemplares similares