Cargando…

Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm

OBJECTIVE: Improving health literacy in infectious diseases is a direct manifestation of the solid advance in disease control and prevention. Our study is aimed at exploring applying synthetic minority oversampling technique (SMOTE) in the prediction assessment of whether residents and business empl...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Rongsheng, Yin, Weihao, Li, Wenjin, Wang, Yingchun, Lu, Jing, Li, Zhong, Hu, Xinxin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8975663/
https://www.ncbi.nlm.nih.gov/pubmed/35371281
http://dx.doi.org/10.1155/2022/8498159
_version_ 1784680413121216512
author Zhou, Rongsheng
Yin, Weihao
Li, Wenjin
Wang, Yingchun
Lu, Jing
Li, Zhong
Hu, Xinxin
author_facet Zhou, Rongsheng
Yin, Weihao
Li, Wenjin
Wang, Yingchun
Lu, Jing
Li, Zhong
Hu, Xinxin
author_sort Zhou, Rongsheng
collection PubMed
description OBJECTIVE: Improving health literacy in infectious diseases is a direct manifestation of the solid advance in disease control and prevention. Our study is aimed at exploring applying synthetic minority oversampling technique (SMOTE) in the prediction assessment of whether residents and business employees have infectious disease health literacy. METHODS: The Chinese resident infectious disease health literacy evaluation scale was used to investigate the associated variables. The screened variables were input variables and the presence or absence of infectious diseases health literacy as outcome variables. Logistic regression, random forest, and support vector machine (SVM) models were built in the data sets before and after treatment by the SMOTE algorithm, respectively, and the performance of the models was evaluated by receiver operating characteristic curves (ROC). RESULTS: Logistic regression, random forest, and SVM achieved accuracies of 0.828, 0.612, and 0.654 before SMOTE algorithm processing, and the areas under the ROC curves (AUCs) of the three models were 0.754, 0.817, and 0.759, respectively. The accuracies were 0.938, 0.911, and 0.894 after SMOTE algorithm processing, and the AUCs of the three models were 0.913, 0.925, and 0.910, respectively. CONCLUSIONS: The random forest model based on the SMOTE has high application value in assessing whether residents versus enterprise employees have infectious disease health literacy.
format Online
Article
Text
id pubmed-8975663
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-89756632022-04-02 Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm Zhou, Rongsheng Yin, Weihao Li, Wenjin Wang, Yingchun Lu, Jing Li, Zhong Hu, Xinxin Comput Math Methods Med Research Article OBJECTIVE: Improving health literacy in infectious diseases is a direct manifestation of the solid advance in disease control and prevention. Our study is aimed at exploring applying synthetic minority oversampling technique (SMOTE) in the prediction assessment of whether residents and business employees have infectious disease health literacy. METHODS: The Chinese resident infectious disease health literacy evaluation scale was used to investigate the associated variables. The screened variables were input variables and the presence or absence of infectious diseases health literacy as outcome variables. Logistic regression, random forest, and support vector machine (SVM) models were built in the data sets before and after treatment by the SMOTE algorithm, respectively, and the performance of the models was evaluated by receiver operating characteristic curves (ROC). RESULTS: Logistic regression, random forest, and SVM achieved accuracies of 0.828, 0.612, and 0.654 before SMOTE algorithm processing, and the areas under the ROC curves (AUCs) of the three models were 0.754, 0.817, and 0.759, respectively. The accuracies were 0.938, 0.911, and 0.894 after SMOTE algorithm processing, and the AUCs of the three models were 0.913, 0.925, and 0.910, respectively. CONCLUSIONS: The random forest model based on the SMOTE has high application value in assessing whether residents versus enterprise employees have infectious disease health literacy. Hindawi 2022-03-25 /pmc/articles/PMC8975663/ /pubmed/35371281 http://dx.doi.org/10.1155/2022/8498159 Text en Copyright © 2022 Rongsheng Zhou et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhou, Rongsheng
Yin, Weihao
Li, Wenjin
Wang, Yingchun
Lu, Jing
Li, Zhong
Hu, Xinxin
Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm
title Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm
title_full Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm
title_fullStr Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm
title_full_unstemmed Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm
title_short Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm
title_sort prediction model for infectious disease health literacy based on synthetic minority oversampling technique algorithm
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8975663/
https://www.ncbi.nlm.nih.gov/pubmed/35371281
http://dx.doi.org/10.1155/2022/8498159
work_keys_str_mv AT zhourongsheng predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm
AT yinweihao predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm
AT liwenjin predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm
AT wangyingchun predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm
AT lujing predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm
AT lizhong predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm
AT huxinxin predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm