Cargando…

Use of Machine Learning Algorithms to Predict the Understandability of Health Education Materials: Development and Evaluation Study

BACKGROUND: Improving the understandability of health information can significantly increase the cost-effectiveness and efficiency of health education programs for vulnerable populations. There is a pressing need to develop clinically informed computerized tools to enable rapid, reliable assessment...

Descripción completa

Detalles Bibliográficos
Autores principales: Ji, Meng, Liu, Yanmeng, Zhao, Mengdan, Lyu, Ziqing, Zhang, Boren, Luo, Xin, Li, Yanlin, Zhong, Yin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8138706/
https://www.ncbi.nlm.nih.gov/pubmed/33955834
http://dx.doi.org/10.2196/28413
_version_ 1783695864812797952
author Ji, Meng
Liu, Yanmeng
Zhao, Mengdan
Lyu, Ziqing
Zhang, Boren
Luo, Xin
Li, Yanlin
Zhong, Yin
author_facet Ji, Meng
Liu, Yanmeng
Zhao, Mengdan
Lyu, Ziqing
Zhang, Boren
Luo, Xin
Li, Yanlin
Zhong, Yin
author_sort Ji, Meng
collection PubMed
description BACKGROUND: Improving the understandability of health information can significantly increase the cost-effectiveness and efficiency of health education programs for vulnerable populations. There is a pressing need to develop clinically informed computerized tools to enable rapid, reliable assessment of the linguistic understandability of specialized health and medical education resources. This paper fills a critical gap in current patient-oriented health resource development, which requires reliable and accurate evaluation instruments to increase the efficiency and cost-effectiveness of health education resource evaluation. OBJECTIVE: We aimed to translate internationally endorsed clinical guidelines to machine learning algorithms to facilitate the evaluation of the understandability of health resources for international students at Australian universities. METHODS: Based on international patient health resource assessment guidelines, we developed machine learning algorithms to predict the linguistic understandability of health texts for Australian college students (aged 25-30 years) from non-English speaking backgrounds. We compared extreme gradient boosting, random forest, neural networks, and C5.0 decision tree for automated health information understandability evaluation. The 5 machine learning models achieved statistically better results compared to the baseline logistic regression model. We also evaluated the impact of each linguistic feature on the performance of each of the 5 models. RESULTS: We found that information evidentness, relevance to educational purposes, and logical sequence were consistently more important than numeracy skills and medical knowledge when assessing the linguistic understandability of health education resources for international tertiary students with adequate English skills (International English Language Testing System mean score 6.5) and high health literacy (mean 16.5 in the Short Assessment of Health Literacy-English test). Our results challenge the traditional views that lack of medical knowledge and numerical skills constituted the barriers to the understanding of health educational materials. CONCLUSIONS: Machine learning algorithms were developed to predict health information understandability for international college students aged 25-30 years. Thirteen natural language features and 5 evaluation dimensions were identified and compared in terms of their impact on the performance of the models. Health information understandability varies according to the demographic profiles of the target readers, and for international tertiary students, improving health information evidentness, relevance, and logic is critical.
format Online
Article
Text
id pubmed-8138706
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-81387062021-05-25 Use of Machine Learning Algorithms to Predict the Understandability of Health Education Materials: Development and Evaluation Study Ji, Meng Liu, Yanmeng Zhao, Mengdan Lyu, Ziqing Zhang, Boren Luo, Xin Li, Yanlin Zhong, Yin JMIR Med Inform Original Paper BACKGROUND: Improving the understandability of health information can significantly increase the cost-effectiveness and efficiency of health education programs for vulnerable populations. There is a pressing need to develop clinically informed computerized tools to enable rapid, reliable assessment of the linguistic understandability of specialized health and medical education resources. This paper fills a critical gap in current patient-oriented health resource development, which requires reliable and accurate evaluation instruments to increase the efficiency and cost-effectiveness of health education resource evaluation. OBJECTIVE: We aimed to translate internationally endorsed clinical guidelines to machine learning algorithms to facilitate the evaluation of the understandability of health resources for international students at Australian universities. METHODS: Based on international patient health resource assessment guidelines, we developed machine learning algorithms to predict the linguistic understandability of health texts for Australian college students (aged 25-30 years) from non-English speaking backgrounds. We compared extreme gradient boosting, random forest, neural networks, and C5.0 decision tree for automated health information understandability evaluation. The 5 machine learning models achieved statistically better results compared to the baseline logistic regression model. We also evaluated the impact of each linguistic feature on the performance of each of the 5 models. RESULTS: We found that information evidentness, relevance to educational purposes, and logical sequence were consistently more important than numeracy skills and medical knowledge when assessing the linguistic understandability of health education resources for international tertiary students with adequate English skills (International English Language Testing System mean score 6.5) and high health literacy (mean 16.5 in the Short Assessment of Health Literacy-English test). Our results challenge the traditional views that lack of medical knowledge and numerical skills constituted the barriers to the understanding of health educational materials. CONCLUSIONS: Machine learning algorithms were developed to predict health information understandability for international college students aged 25-30 years. Thirteen natural language features and 5 evaluation dimensions were identified and compared in terms of their impact on the performance of the models. Health information understandability varies according to the demographic profiles of the target readers, and for international tertiary students, improving health information evidentness, relevance, and logic is critical. JMIR Publications 2021-05-06 /pmc/articles/PMC8138706/ /pubmed/33955834 http://dx.doi.org/10.2196/28413 Text en ©Meng Ji, Yanmeng Liu, Mengdan Zhao, Ziqing Lyu, Boren Zhang, Xin Luo, Yanlin Li, Yin Zhong. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 06.05.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Ji, Meng
Liu, Yanmeng
Zhao, Mengdan
Lyu, Ziqing
Zhang, Boren
Luo, Xin
Li, Yanlin
Zhong, Yin
Use of Machine Learning Algorithms to Predict the Understandability of Health Education Materials: Development and Evaluation Study
title Use of Machine Learning Algorithms to Predict the Understandability of Health Education Materials: Development and Evaluation Study
title_full Use of Machine Learning Algorithms to Predict the Understandability of Health Education Materials: Development and Evaluation Study
title_fullStr Use of Machine Learning Algorithms to Predict the Understandability of Health Education Materials: Development and Evaluation Study
title_full_unstemmed Use of Machine Learning Algorithms to Predict the Understandability of Health Education Materials: Development and Evaluation Study
title_short Use of Machine Learning Algorithms to Predict the Understandability of Health Education Materials: Development and Evaluation Study
title_sort use of machine learning algorithms to predict the understandability of health education materials: development and evaluation study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8138706/
https://www.ncbi.nlm.nih.gov/pubmed/33955834
http://dx.doi.org/10.2196/28413
work_keys_str_mv AT jimeng useofmachinelearningalgorithmstopredicttheunderstandabilityofhealtheducationmaterialsdevelopmentandevaluationstudy
AT liuyanmeng useofmachinelearningalgorithmstopredicttheunderstandabilityofhealtheducationmaterialsdevelopmentandevaluationstudy
AT zhaomengdan useofmachinelearningalgorithmstopredicttheunderstandabilityofhealtheducationmaterialsdevelopmentandevaluationstudy
AT lyuziqing useofmachinelearningalgorithmstopredicttheunderstandabilityofhealtheducationmaterialsdevelopmentandevaluationstudy
AT zhangboren useofmachinelearningalgorithmstopredicttheunderstandabilityofhealtheducationmaterialsdevelopmentandevaluationstudy
AT luoxin useofmachinelearningalgorithmstopredicttheunderstandabilityofhealtheducationmaterialsdevelopmentandevaluationstudy
AT liyanlin useofmachinelearningalgorithmstopredicttheunderstandabilityofhealtheducationmaterialsdevelopmentandevaluationstudy
AT zhongyin useofmachinelearningalgorithmstopredicttheunderstandabilityofhealtheducationmaterialsdevelopmentandevaluationstudy