Cargando…
An optimized machine learning model for identifying socio-economic, demographic and health-related variables associated with low vaccination levels that vary across ZIP codes in California
There is an urgent need for an in-depth and systematic assessment of a wide range of predictive factors related to populations most at risk for delaying and refusing COVID-19 vaccination as cases of the disease surge across the United States. Many studies have assessed a limited number of general so...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9186792/ https://www.ncbi.nlm.nih.gov/pubmed/35706686 http://dx.doi.org/10.1016/j.pmedr.2022.101858 |
_version_ | 1784725022797987840 |
---|---|
author | Avirappattu, George Pach III, Alfred Locklear, Clarence E. Briggs, Anthony Q. |
author_facet | Avirappattu, George Pach III, Alfred Locklear, Clarence E. Briggs, Anthony Q. |
author_sort | Avirappattu, George |
collection | PubMed |
description | There is an urgent need for an in-depth and systematic assessment of a wide range of predictive factors related to populations most at risk for delaying and refusing COVID-19 vaccination as cases of the disease surge across the United States. Many studies have assessed a limited number of general sociodemographic and health-related factors related to low vaccination rates. Machine learning methods were used to assess the association of 151 social and health-related risk factors derived from the American Community Survey 2019 and the Centers for Disease Control and Prevention (CDC) BRFSS with the response variables of vaccination rates and unvaccinated counts in 1,555 ZIP Codes in California. The performance of various analytical models was evaluated according to their ability to regress between predictive variables and vaccination levels. Machine learning modeling identified the Gradient Boosting Regressor (GBR) as the predictive model with a higher percentage of the explained variance than the variance identified through linear and generalized regression models. A set of 20 variables explained 72.90% of the variability of unvaccinated counts among ZIP Codes in California. ZIP Codes were shown to be a more meaningful geo-local unit of analysis than county-level assessments. Modeling vaccination rates was not as effective as modeling unvaccinated counts. The public health utility of this model provides for the analysis of state and local conditions related to COVID-19 vaccination use and future public health problems and pandemics. |
format | Online Article Text |
id | pubmed-9186792 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
record_format | MEDLINE/PubMed |
spelling | pubmed-91867922022-06-10 An optimized machine learning model for identifying socio-economic, demographic and health-related variables associated with low vaccination levels that vary across ZIP codes in California Avirappattu, George Pach III, Alfred Locklear, Clarence E. Briggs, Anthony Q. Prev Med Rep Regular Article There is an urgent need for an in-depth and systematic assessment of a wide range of predictive factors related to populations most at risk for delaying and refusing COVID-19 vaccination as cases of the disease surge across the United States. Many studies have assessed a limited number of general sociodemographic and health-related factors related to low vaccination rates. Machine learning methods were used to assess the association of 151 social and health-related risk factors derived from the American Community Survey 2019 and the Centers for Disease Control and Prevention (CDC) BRFSS with the response variables of vaccination rates and unvaccinated counts in 1,555 ZIP Codes in California. The performance of various analytical models was evaluated according to their ability to regress between predictive variables and vaccination levels. Machine learning modeling identified the Gradient Boosting Regressor (GBR) as the predictive model with a higher percentage of the explained variance than the variance identified through linear and generalized regression models. A set of 20 variables explained 72.90% of the variability of unvaccinated counts among ZIP Codes in California. ZIP Codes were shown to be a more meaningful geo-local unit of analysis than county-level assessments. Modeling vaccination rates was not as effective as modeling unvaccinated counts. The public health utility of this model provides for the analysis of state and local conditions related to COVID-19 vaccination use and future public health problems and pandemics. 2022-06-10 /pmc/articles/PMC9186792/ /pubmed/35706686 http://dx.doi.org/10.1016/j.pmedr.2022.101858 Text en © 2022 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Regular Article Avirappattu, George Pach III, Alfred Locklear, Clarence E. Briggs, Anthony Q. An optimized machine learning model for identifying socio-economic, demographic and health-related variables associated with low vaccination levels that vary across ZIP codes in California |
title | An optimized machine learning model for identifying socio-economic, demographic and health-related variables associated with low vaccination levels that vary across ZIP codes in California |
title_full | An optimized machine learning model for identifying socio-economic, demographic and health-related variables associated with low vaccination levels that vary across ZIP codes in California |
title_fullStr | An optimized machine learning model for identifying socio-economic, demographic and health-related variables associated with low vaccination levels that vary across ZIP codes in California |
title_full_unstemmed | An optimized machine learning model for identifying socio-economic, demographic and health-related variables associated with low vaccination levels that vary across ZIP codes in California |
title_short | An optimized machine learning model for identifying socio-economic, demographic and health-related variables associated with low vaccination levels that vary across ZIP codes in California |
title_sort | optimized machine learning model for identifying socio-economic, demographic and health-related variables associated with low vaccination levels that vary across zip codes in california |
topic | Regular Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9186792/ https://www.ncbi.nlm.nih.gov/pubmed/35706686 http://dx.doi.org/10.1016/j.pmedr.2022.101858 |
work_keys_str_mv | AT avirappattugeorge anoptimizedmachinelearningmodelforidentifyingsocioeconomicdemographicandhealthrelatedvariablesassociatedwithlowvaccinationlevelsthatvaryacrosszipcodesincalifornia AT pachiiialfred anoptimizedmachinelearningmodelforidentifyingsocioeconomicdemographicandhealthrelatedvariablesassociatedwithlowvaccinationlevelsthatvaryacrosszipcodesincalifornia AT locklearclarencee anoptimizedmachinelearningmodelforidentifyingsocioeconomicdemographicandhealthrelatedvariablesassociatedwithlowvaccinationlevelsthatvaryacrosszipcodesincalifornia AT briggsanthonyq anoptimizedmachinelearningmodelforidentifyingsocioeconomicdemographicandhealthrelatedvariablesassociatedwithlowvaccinationlevelsthatvaryacrosszipcodesincalifornia AT avirappattugeorge optimizedmachinelearningmodelforidentifyingsocioeconomicdemographicandhealthrelatedvariablesassociatedwithlowvaccinationlevelsthatvaryacrosszipcodesincalifornia AT pachiiialfred optimizedmachinelearningmodelforidentifyingsocioeconomicdemographicandhealthrelatedvariablesassociatedwithlowvaccinationlevelsthatvaryacrosszipcodesincalifornia AT locklearclarencee optimizedmachinelearningmodelforidentifyingsocioeconomicdemographicandhealthrelatedvariablesassociatedwithlowvaccinationlevelsthatvaryacrosszipcodesincalifornia AT briggsanthonyq optimizedmachinelearningmodelforidentifyingsocioeconomicdemographicandhealthrelatedvariablesassociatedwithlowvaccinationlevelsthatvaryacrosszipcodesincalifornia |