Cargando…

Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm

OBJECTIVE: Improving health literacy in infectious diseases is a direct manifestation of the solid advance in disease control and prevention. Our study is aimed at exploring applying synthetic minority oversampling technique (SMOTE) in the prediction assessment of whether residents and business empl...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhou, Rongsheng, Yin, Weihao, Li, Wenjin, Wang, Yingchun, Lu, Jing, Li, Zhong, Hu, Xinxin
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Hindawi 2022
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8975663/ https://www.ncbi.nlm.nih.gov/pubmed/35371281 http://dx.doi.org/10.1155/2022/8498159

_version_	1784680413121216512
author	Zhou, Rongsheng Yin, Weihao Li, Wenjin Wang, Yingchun Lu, Jing Li, Zhong Hu, Xinxin
author_facet	Zhou, Rongsheng Yin, Weihao Li, Wenjin Wang, Yingchun Lu, Jing Li, Zhong Hu, Xinxin
author_sort	Zhou, Rongsheng
collection	PubMed
description	OBJECTIVE: Improving health literacy in infectious diseases is a direct manifestation of the solid advance in disease control and prevention. Our study is aimed at exploring applying synthetic minority oversampling technique (SMOTE) in the prediction assessment of whether residents and business employees have infectious disease health literacy. METHODS: The Chinese resident infectious disease health literacy evaluation scale was used to investigate the associated variables. The screened variables were input variables and the presence or absence of infectious diseases health literacy as outcome variables. Logistic regression, random forest, and support vector machine (SVM) models were built in the data sets before and after treatment by the SMOTE algorithm, respectively, and the performance of the models was evaluated by receiver operating characteristic curves (ROC). RESULTS: Logistic regression, random forest, and SVM achieved accuracies of 0.828, 0.612, and 0.654 before SMOTE algorithm processing, and the areas under the ROC curves (AUCs) of the three models were 0.754, 0.817, and 0.759, respectively. The accuracies were 0.938, 0.911, and 0.894 after SMOTE algorithm processing, and the AUCs of the three models were 0.913, 0.925, and 0.910, respectively. CONCLUSIONS: The random forest model based on the SMOTE has high application value in assessing whether residents versus enterprise employees have infectious disease health literacy.
format	Online Article Text
id	pubmed-8975663
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Hindawi
record_format	MEDLINE/PubMed
spelling	pubmed-89756632022-04-02 Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm Zhou, Rongsheng Yin, Weihao Li, Wenjin Wang, Yingchun Lu, Jing Li, Zhong Hu, Xinxin Comput Math Methods Med Research Article OBJECTIVE: Improving health literacy in infectious diseases is a direct manifestation of the solid advance in disease control and prevention. Our study is aimed at exploring applying synthetic minority oversampling technique (SMOTE) in the prediction assessment of whether residents and business employees have infectious disease health literacy. METHODS: The Chinese resident infectious disease health literacy evaluation scale was used to investigate the associated variables. The screened variables were input variables and the presence or absence of infectious diseases health literacy as outcome variables. Logistic regression, random forest, and support vector machine (SVM) models were built in the data sets before and after treatment by the SMOTE algorithm, respectively, and the performance of the models was evaluated by receiver operating characteristic curves (ROC). RESULTS: Logistic regression, random forest, and SVM achieved accuracies of 0.828, 0.612, and 0.654 before SMOTE algorithm processing, and the areas under the ROC curves (AUCs) of the three models were 0.754, 0.817, and 0.759, respectively. The accuracies were 0.938, 0.911, and 0.894 after SMOTE algorithm processing, and the AUCs of the three models were 0.913, 0.925, and 0.910, respectively. CONCLUSIONS: The random forest model based on the SMOTE has high application value in assessing whether residents versus enterprise employees have infectious disease health literacy. Hindawi 2022-03-25 /pmc/articles/PMC8975663/ /pubmed/35371281 http://dx.doi.org/10.1155/2022/8498159 Text en Copyright © 2022 Rongsheng Zhou et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Zhou, Rongsheng Yin, Weihao Li, Wenjin Wang, Yingchun Lu, Jing Li, Zhong Hu, Xinxin Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm
title	Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm
title_full	Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm
title_fullStr	Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm
title_full_unstemmed	Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm
title_short	Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm
title_sort	prediction model for infectious disease health literacy based on synthetic minority oversampling technique algorithm
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8975663/ https://www.ncbi.nlm.nih.gov/pubmed/35371281 http://dx.doi.org/10.1155/2022/8498159
work_keys_str_mv	AT zhourongsheng predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm AT yinweihao predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm AT liwenjin predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm AT wangyingchun predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm AT lujing predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm AT lizhong predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm AT huxinxin predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm

Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm

Ejemplares similares