Cargando…

Consumer Health Search on the Web: Study of Web Page Understandability and Its Integration in Ranking Algorithms

BACKGROUND: Understandability plays a key role in ensuring that people accessing health information are capable of gaining insights that can assist them with their health concerns and choices. The access to unclear or misleading information has been shown to negatively impact the health decisions of...

Descripción completa

Detalles Bibliográficos
Autores principales: Palotti, Joao, Zuccon, Guido, Hanbury, Allan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6372940/
https://www.ncbi.nlm.nih.gov/pubmed/30698536
http://dx.doi.org/10.2196/10986
_version_ 1783394866127962112
author Palotti, Joao
Zuccon, Guido
Hanbury, Allan
author_facet Palotti, Joao
Zuccon, Guido
Hanbury, Allan
author_sort Palotti, Joao
collection PubMed
description BACKGROUND: Understandability plays a key role in ensuring that people accessing health information are capable of gaining insights that can assist them with their health concerns and choices. The access to unclear or misleading information has been shown to negatively impact the health decisions of the general public. OBJECTIVE: The aim of this study was to investigate methods to estimate the understandability of health Web pages and use these to improve the retrieval of information for people seeking health advice on the Web. METHODS: Our investigation considered methods to automatically estimate the understandability of health information in Web pages, and it provided a thorough evaluation of these methods using human assessments as well as an analysis of preprocessing factors affecting understandability estimations and associated pitfalls. Furthermore, lessons learned for estimating Web page understandability were applied to the construction of retrieval methods, with specific attention to retrieving information understandable by the general public. RESULTS: We found that machine learning techniques were more suitable to estimate health Web page understandability than traditional readability formulae, which are often used as guidelines and benchmark by health information providers on the Web (larger difference found for Pearson correlation of .602 using gradient boosting regressor compared with .438 using Simple Measure of Gobbledygook Index with the Conference and Labs of the Evaluation Forum eHealth 2015 collection). CONCLUSIONS: The findings reported in this paper are important for specialized search services tailored to support the general public in seeking health advice on the Web, as they document and empirically validate state-of-the-art techniques and settings for this domain application.
format Online
Article
Text
id pubmed-6372940
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-63729402019-03-08 Consumer Health Search on the Web: Study of Web Page Understandability and Its Integration in Ranking Algorithms Palotti, Joao Zuccon, Guido Hanbury, Allan J Med Internet Res Original Paper BACKGROUND: Understandability plays a key role in ensuring that people accessing health information are capable of gaining insights that can assist them with their health concerns and choices. The access to unclear or misleading information has been shown to negatively impact the health decisions of the general public. OBJECTIVE: The aim of this study was to investigate methods to estimate the understandability of health Web pages and use these to improve the retrieval of information for people seeking health advice on the Web. METHODS: Our investigation considered methods to automatically estimate the understandability of health information in Web pages, and it provided a thorough evaluation of these methods using human assessments as well as an analysis of preprocessing factors affecting understandability estimations and associated pitfalls. Furthermore, lessons learned for estimating Web page understandability were applied to the construction of retrieval methods, with specific attention to retrieving information understandable by the general public. RESULTS: We found that machine learning techniques were more suitable to estimate health Web page understandability than traditional readability formulae, which are often used as guidelines and benchmark by health information providers on the Web (larger difference found for Pearson correlation of .602 using gradient boosting regressor compared with .438 using Simple Measure of Gobbledygook Index with the Conference and Labs of the Evaluation Forum eHealth 2015 collection). CONCLUSIONS: The findings reported in this paper are important for specialized search services tailored to support the general public in seeking health advice on the Web, as they document and empirically validate state-of-the-art techniques and settings for this domain application. JMIR Publications 2019-01-30 /pmc/articles/PMC6372940/ /pubmed/30698536 http://dx.doi.org/10.2196/10986 Text en ©Joao Palotti, Guido Zuccon, Allan Hanbury. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 30.01.2019. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Palotti, Joao
Zuccon, Guido
Hanbury, Allan
Consumer Health Search on the Web: Study of Web Page Understandability and Its Integration in Ranking Algorithms
title Consumer Health Search on the Web: Study of Web Page Understandability and Its Integration in Ranking Algorithms
title_full Consumer Health Search on the Web: Study of Web Page Understandability and Its Integration in Ranking Algorithms
title_fullStr Consumer Health Search on the Web: Study of Web Page Understandability and Its Integration in Ranking Algorithms
title_full_unstemmed Consumer Health Search on the Web: Study of Web Page Understandability and Its Integration in Ranking Algorithms
title_short Consumer Health Search on the Web: Study of Web Page Understandability and Its Integration in Ranking Algorithms
title_sort consumer health search on the web: study of web page understandability and its integration in ranking algorithms
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6372940/
https://www.ncbi.nlm.nih.gov/pubmed/30698536
http://dx.doi.org/10.2196/10986
work_keys_str_mv AT palottijoao consumerhealthsearchonthewebstudyofwebpageunderstandabilityanditsintegrationinrankingalgorithms
AT zucconguido consumerhealthsearchonthewebstudyofwebpageunderstandabilityanditsintegrationinrankingalgorithms
AT hanburyallan consumerhealthsearchonthewebstudyofwebpageunderstandabilityanditsintegrationinrankingalgorithms