Cargando…
A Comparative Analysis of Multidimensional COVID-19 Poverty Determinants: An Observational Machine Learning Approach
Poverty is a glaring issue in the twenty-first century, even after concerted efforts of organizations to eliminate the same. Predicting poverty using machine learning can offer practical models for facilitating the process of elimination of poverty. This paper uses Multidimensional Poverty Index Dat...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Japan
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9889951/ https://www.ncbi.nlm.nih.gov/pubmed/36741502 http://dx.doi.org/10.1007/s00354-023-00203-8 |
_version_ | 1784880845801127936 |
---|---|
author | Satapathy, Sandeep Kumar Saravanan, Shreyaa Mishra, Shruti Mohanty, Sachi Nandan |
author_facet | Satapathy, Sandeep Kumar Saravanan, Shreyaa Mishra, Shruti Mohanty, Sachi Nandan |
author_sort | Satapathy, Sandeep Kumar |
collection | PubMed |
description | Poverty is a glaring issue in the twenty-first century, even after concerted efforts of organizations to eliminate the same. Predicting poverty using machine learning can offer practical models for facilitating the process of elimination of poverty. This paper uses Multidimensional Poverty Index Data from the Oxford Poverty and Human Development Initiative across the years 2019 and 2021 to make predictions of multidimensional poverty before and during the pandemic. Several poverty indicators under health, education and living standards are taken into consideration. The work implements several data analysis techniques like feature correlation and selection, and graphical visualizations to answer research questions about poverty. Various machine learning, such as Multiple Linear Regression, Decision Tree Regressor, Random Forest Regressor, XGBoost, AdaBoost, Gradient Boosting, Linear Support Vector Regressor (SVR), Ridge Regression, Lasso Regression, ElasticNet Regression, and K-Nearest Neighbor Regression algorithm, have been implemented to predict poverty across four datasets on a national and a subnational level. Regularization is used to increase the performance of the models, and cross-validation is used for estimation. Through a rigorous analysis and comparison of different models, this work identifies important poverty determinants and concludes that overall, Ridge Regression model performs the best with the highest R(2) score. |
format | Online Article Text |
id | pubmed-9889951 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer Japan |
record_format | MEDLINE/PubMed |
spelling | pubmed-98899512023-02-01 A Comparative Analysis of Multidimensional COVID-19 Poverty Determinants: An Observational Machine Learning Approach Satapathy, Sandeep Kumar Saravanan, Shreyaa Mishra, Shruti Mohanty, Sachi Nandan New Gener Comput Article Poverty is a glaring issue in the twenty-first century, even after concerted efforts of organizations to eliminate the same. Predicting poverty using machine learning can offer practical models for facilitating the process of elimination of poverty. This paper uses Multidimensional Poverty Index Data from the Oxford Poverty and Human Development Initiative across the years 2019 and 2021 to make predictions of multidimensional poverty before and during the pandemic. Several poverty indicators under health, education and living standards are taken into consideration. The work implements several data analysis techniques like feature correlation and selection, and graphical visualizations to answer research questions about poverty. Various machine learning, such as Multiple Linear Regression, Decision Tree Regressor, Random Forest Regressor, XGBoost, AdaBoost, Gradient Boosting, Linear Support Vector Regressor (SVR), Ridge Regression, Lasso Regression, ElasticNet Regression, and K-Nearest Neighbor Regression algorithm, have been implemented to predict poverty across four datasets on a national and a subnational level. Regularization is used to increase the performance of the models, and cross-validation is used for estimation. Through a rigorous analysis and comparison of different models, this work identifies important poverty determinants and concludes that overall, Ridge Regression model performs the best with the highest R(2) score. Springer Japan 2023-02-01 2023 /pmc/articles/PMC9889951/ /pubmed/36741502 http://dx.doi.org/10.1007/s00354-023-00203-8 Text en © The Author(s), under exclusive licence to The Japanese Society for Artificial Intelligence and Springer Nature Japan KK, part of Springer Nature 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Satapathy, Sandeep Kumar Saravanan, Shreyaa Mishra, Shruti Mohanty, Sachi Nandan A Comparative Analysis of Multidimensional COVID-19 Poverty Determinants: An Observational Machine Learning Approach |
title | A Comparative Analysis of Multidimensional COVID-19 Poverty Determinants: An Observational Machine Learning Approach |
title_full | A Comparative Analysis of Multidimensional COVID-19 Poverty Determinants: An Observational Machine Learning Approach |
title_fullStr | A Comparative Analysis of Multidimensional COVID-19 Poverty Determinants: An Observational Machine Learning Approach |
title_full_unstemmed | A Comparative Analysis of Multidimensional COVID-19 Poverty Determinants: An Observational Machine Learning Approach |
title_short | A Comparative Analysis of Multidimensional COVID-19 Poverty Determinants: An Observational Machine Learning Approach |
title_sort | comparative analysis of multidimensional covid-19 poverty determinants: an observational machine learning approach |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9889951/ https://www.ncbi.nlm.nih.gov/pubmed/36741502 http://dx.doi.org/10.1007/s00354-023-00203-8 |
work_keys_str_mv | AT satapathysandeepkumar acomparativeanalysisofmultidimensionalcovid19povertydeterminantsanobservationalmachinelearningapproach AT saravananshreyaa acomparativeanalysisofmultidimensionalcovid19povertydeterminantsanobservationalmachinelearningapproach AT mishrashruti acomparativeanalysisofmultidimensionalcovid19povertydeterminantsanobservationalmachinelearningapproach AT mohantysachinandan acomparativeanalysisofmultidimensionalcovid19povertydeterminantsanobservationalmachinelearningapproach AT satapathysandeepkumar comparativeanalysisofmultidimensionalcovid19povertydeterminantsanobservationalmachinelearningapproach AT saravananshreyaa comparativeanalysisofmultidimensionalcovid19povertydeterminantsanobservationalmachinelearningapproach AT mishrashruti comparativeanalysisofmultidimensionalcovid19povertydeterminantsanobservationalmachinelearningapproach AT mohantysachinandan comparativeanalysisofmultidimensionalcovid19povertydeterminantsanobservationalmachinelearningapproach |