Cargando…

Application of machine learning methods for predicting infant mortality in Rwanda: analysis of Rwanda demographic health survey 2014–15 dataset

BACKGROUND: Extensive research on infant mortality (IM) exists in developing countries; however, most of the methods applied thus far relied on conventional regression analyses with limited prediction capability. Advanced of Machine Learning (AML) methods provide accurate prediction of IM; however,...

Descripción completa

Detalles Bibliográficos
Autores principales: Mfateneza, Emmanuel, Rutayisire, Pierre Claver, Biracyaza, Emmanuel, Musafiri, Sanctus, Mpabuka, Willy Gasafari
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9066935/
https://www.ncbi.nlm.nih.gov/pubmed/35509018
http://dx.doi.org/10.1186/s12884-022-04699-8
_version_ 1784699900648226816
author Mfateneza, Emmanuel
Rutayisire, Pierre Claver
Biracyaza, Emmanuel
Musafiri, Sanctus
Mpabuka, Willy Gasafari
author_facet Mfateneza, Emmanuel
Rutayisire, Pierre Claver
Biracyaza, Emmanuel
Musafiri, Sanctus
Mpabuka, Willy Gasafari
author_sort Mfateneza, Emmanuel
collection PubMed
description BACKGROUND: Extensive research on infant mortality (IM) exists in developing countries; however, most of the methods applied thus far relied on conventional regression analyses with limited prediction capability. Advanced of Machine Learning (AML) methods provide accurate prediction of IM; however, there is no study conducted using ML methods in Rwanda. This study, therefore, applied Machine Learning Methods for predicting infant mortality in Rwanda.  METHODS: A cross-sectional study design was conducted using the 2014–15 Rwanda Demographic and Health Survey. Python software version 3.8 was employed to test and apply ML methods through Random Forest (RF), Decision Tree, Support Vector Machine and Logistic regression. STATA version 13 was used for analysing conventional methods. Evaluation metrics methods specifically confusion matrix, accuracy, precision, recall, F1 score, and Area under the Receiver Operating Characteristics (AUROC) were used to evaluate the performance of predictive models. RESULTS: Ability of prediction was between 68.6% and 61.5% for AML. We preferred with the RF model (61.5%) presenting the best performance. The RF model was the best predictive model of IM with accuracy (84.3%), recall (91.3%), precision (80.3%), F1 score (85.5%), and AUROC (84.2%); followed by decision tree model with model accuracy (83%), recall (91%), precision (79%), F1 score (84.67%) and AUROC(82.9%), followed by support vector machine with model accuracy (68.6%), recall (74.9%), precision(67%), F1 score (70.73%) and AUROC (68.6%) and last was a logistic regression with the low accuracy of prediction (61.5%), recall (61.1%), precision (62.2%), F1 score (61.6%) and AUROC (61.5%) compared to other predictive models. Our predictive models showed that marital status, children ever born, birth order and wealth index are the 4 top predictors of IM. CONCLUSIONS: In developing a predictive model, ML methods are used to classify certain hidden information that could not be detected by traditional statistical methods. Random Forest was classified as the best classifier to be used for the predictive models of IM.
format Online
Article
Text
id pubmed-9066935
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-90669352022-05-04 Application of machine learning methods for predicting infant mortality in Rwanda: analysis of Rwanda demographic health survey 2014–15 dataset Mfateneza, Emmanuel Rutayisire, Pierre Claver Biracyaza, Emmanuel Musafiri, Sanctus Mpabuka, Willy Gasafari BMC Pregnancy Childbirth Research BACKGROUND: Extensive research on infant mortality (IM) exists in developing countries; however, most of the methods applied thus far relied on conventional regression analyses with limited prediction capability. Advanced of Machine Learning (AML) methods provide accurate prediction of IM; however, there is no study conducted using ML methods in Rwanda. This study, therefore, applied Machine Learning Methods for predicting infant mortality in Rwanda.  METHODS: A cross-sectional study design was conducted using the 2014–15 Rwanda Demographic and Health Survey. Python software version 3.8 was employed to test and apply ML methods through Random Forest (RF), Decision Tree, Support Vector Machine and Logistic regression. STATA version 13 was used for analysing conventional methods. Evaluation metrics methods specifically confusion matrix, accuracy, precision, recall, F1 score, and Area under the Receiver Operating Characteristics (AUROC) were used to evaluate the performance of predictive models. RESULTS: Ability of prediction was between 68.6% and 61.5% for AML. We preferred with the RF model (61.5%) presenting the best performance. The RF model was the best predictive model of IM with accuracy (84.3%), recall (91.3%), precision (80.3%), F1 score (85.5%), and AUROC (84.2%); followed by decision tree model with model accuracy (83%), recall (91%), precision (79%), F1 score (84.67%) and AUROC(82.9%), followed by support vector machine with model accuracy (68.6%), recall (74.9%), precision(67%), F1 score (70.73%) and AUROC (68.6%) and last was a logistic regression with the low accuracy of prediction (61.5%), recall (61.1%), precision (62.2%), F1 score (61.6%) and AUROC (61.5%) compared to other predictive models. Our predictive models showed that marital status, children ever born, birth order and wealth index are the 4 top predictors of IM. CONCLUSIONS: In developing a predictive model, ML methods are used to classify certain hidden information that could not be detected by traditional statistical methods. Random Forest was classified as the best classifier to be used for the predictive models of IM. BioMed Central 2022-05-04 /pmc/articles/PMC9066935/ /pubmed/35509018 http://dx.doi.org/10.1186/s12884-022-04699-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Mfateneza, Emmanuel
Rutayisire, Pierre Claver
Biracyaza, Emmanuel
Musafiri, Sanctus
Mpabuka, Willy Gasafari
Application of machine learning methods for predicting infant mortality in Rwanda: analysis of Rwanda demographic health survey 2014–15 dataset
title Application of machine learning methods for predicting infant mortality in Rwanda: analysis of Rwanda demographic health survey 2014–15 dataset
title_full Application of machine learning methods for predicting infant mortality in Rwanda: analysis of Rwanda demographic health survey 2014–15 dataset
title_fullStr Application of machine learning methods for predicting infant mortality in Rwanda: analysis of Rwanda demographic health survey 2014–15 dataset
title_full_unstemmed Application of machine learning methods for predicting infant mortality in Rwanda: analysis of Rwanda demographic health survey 2014–15 dataset
title_short Application of machine learning methods for predicting infant mortality in Rwanda: analysis of Rwanda demographic health survey 2014–15 dataset
title_sort application of machine learning methods for predicting infant mortality in rwanda: analysis of rwanda demographic health survey 2014–15 dataset
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9066935/
https://www.ncbi.nlm.nih.gov/pubmed/35509018
http://dx.doi.org/10.1186/s12884-022-04699-8
work_keys_str_mv AT mfatenezaemmanuel applicationofmachinelearningmethodsforpredictinginfantmortalityinrwandaanalysisofrwandademographichealthsurvey201415dataset
AT rutayisirepierreclaver applicationofmachinelearningmethodsforpredictinginfantmortalityinrwandaanalysisofrwandademographichealthsurvey201415dataset
AT biracyazaemmanuel applicationofmachinelearningmethodsforpredictinginfantmortalityinrwandaanalysisofrwandademographichealthsurvey201415dataset
AT musafirisanctus applicationofmachinelearningmethodsforpredictinginfantmortalityinrwandaanalysisofrwandademographichealthsurvey201415dataset
AT mpabukawillygasafari applicationofmachinelearningmethodsforpredictinginfantmortalityinrwandaanalysisofrwandademographichealthsurvey201415dataset