Cargando…
Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm
OBJECTIVE: Improving health literacy in infectious diseases is a direct manifestation of the solid advance in disease control and prevention. Our study is aimed at exploring applying synthetic minority oversampling technique (SMOTE) in the prediction assessment of whether residents and business empl...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8975663/ https://www.ncbi.nlm.nih.gov/pubmed/35371281 http://dx.doi.org/10.1155/2022/8498159 |
_version_ | 1784680413121216512 |
---|---|
author | Zhou, Rongsheng Yin, Weihao Li, Wenjin Wang, Yingchun Lu, Jing Li, Zhong Hu, Xinxin |
author_facet | Zhou, Rongsheng Yin, Weihao Li, Wenjin Wang, Yingchun Lu, Jing Li, Zhong Hu, Xinxin |
author_sort | Zhou, Rongsheng |
collection | PubMed |
description | OBJECTIVE: Improving health literacy in infectious diseases is a direct manifestation of the solid advance in disease control and prevention. Our study is aimed at exploring applying synthetic minority oversampling technique (SMOTE) in the prediction assessment of whether residents and business employees have infectious disease health literacy. METHODS: The Chinese resident infectious disease health literacy evaluation scale was used to investigate the associated variables. The screened variables were input variables and the presence or absence of infectious diseases health literacy as outcome variables. Logistic regression, random forest, and support vector machine (SVM) models were built in the data sets before and after treatment by the SMOTE algorithm, respectively, and the performance of the models was evaluated by receiver operating characteristic curves (ROC). RESULTS: Logistic regression, random forest, and SVM achieved accuracies of 0.828, 0.612, and 0.654 before SMOTE algorithm processing, and the areas under the ROC curves (AUCs) of the three models were 0.754, 0.817, and 0.759, respectively. The accuracies were 0.938, 0.911, and 0.894 after SMOTE algorithm processing, and the AUCs of the three models were 0.913, 0.925, and 0.910, respectively. CONCLUSIONS: The random forest model based on the SMOTE has high application value in assessing whether residents versus enterprise employees have infectious disease health literacy. |
format | Online Article Text |
id | pubmed-8975663 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-89756632022-04-02 Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm Zhou, Rongsheng Yin, Weihao Li, Wenjin Wang, Yingchun Lu, Jing Li, Zhong Hu, Xinxin Comput Math Methods Med Research Article OBJECTIVE: Improving health literacy in infectious diseases is a direct manifestation of the solid advance in disease control and prevention. Our study is aimed at exploring applying synthetic minority oversampling technique (SMOTE) in the prediction assessment of whether residents and business employees have infectious disease health literacy. METHODS: The Chinese resident infectious disease health literacy evaluation scale was used to investigate the associated variables. The screened variables were input variables and the presence or absence of infectious diseases health literacy as outcome variables. Logistic regression, random forest, and support vector machine (SVM) models were built in the data sets before and after treatment by the SMOTE algorithm, respectively, and the performance of the models was evaluated by receiver operating characteristic curves (ROC). RESULTS: Logistic regression, random forest, and SVM achieved accuracies of 0.828, 0.612, and 0.654 before SMOTE algorithm processing, and the areas under the ROC curves (AUCs) of the three models were 0.754, 0.817, and 0.759, respectively. The accuracies were 0.938, 0.911, and 0.894 after SMOTE algorithm processing, and the AUCs of the three models were 0.913, 0.925, and 0.910, respectively. CONCLUSIONS: The random forest model based on the SMOTE has high application value in assessing whether residents versus enterprise employees have infectious disease health literacy. Hindawi 2022-03-25 /pmc/articles/PMC8975663/ /pubmed/35371281 http://dx.doi.org/10.1155/2022/8498159 Text en Copyright © 2022 Rongsheng Zhou et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Zhou, Rongsheng Yin, Weihao Li, Wenjin Wang, Yingchun Lu, Jing Li, Zhong Hu, Xinxin Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm |
title | Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm |
title_full | Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm |
title_fullStr | Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm |
title_full_unstemmed | Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm |
title_short | Prediction Model for Infectious Disease Health Literacy Based on Synthetic Minority Oversampling Technique Algorithm |
title_sort | prediction model for infectious disease health literacy based on synthetic minority oversampling technique algorithm |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8975663/ https://www.ncbi.nlm.nih.gov/pubmed/35371281 http://dx.doi.org/10.1155/2022/8498159 |
work_keys_str_mv | AT zhourongsheng predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm AT yinweihao predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm AT liwenjin predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm AT wangyingchun predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm AT lujing predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm AT lizhong predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm AT huxinxin predictionmodelforinfectiousdiseasehealthliteracybasedonsyntheticminorityoversamplingtechniquealgorithm |