Cargando…

Predicting the level of anemia among Ethiopian pregnant women using homogeneous ensemble machine learning algorithm

BACKGROUND: More than 115,000 maternal deaths and 591,000 prenatal deaths occurred in the world per year with anemia, the reduction of red blood cells or hemoglobin in the blood. The world health organization divides anemia in pregnancy into mild anemia (Hb 10–10.9 g/dl), moderate anemia (Hb 7.0–9.9...

Descripción completa

Detalles Bibliográficos
Autores principales: Dejene, Belayneh Endalamaw, Abuhay, Tesfamariam M., Bogale, Dawit Shibabaw
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9494842/
https://www.ncbi.nlm.nih.gov/pubmed/36138398
http://dx.doi.org/10.1186/s12911-022-01992-6
_version_ 1784793879330947072
author Dejene, Belayneh Endalamaw
Abuhay, Tesfamariam M.
Bogale, Dawit Shibabaw
author_facet Dejene, Belayneh Endalamaw
Abuhay, Tesfamariam M.
Bogale, Dawit Shibabaw
author_sort Dejene, Belayneh Endalamaw
collection PubMed
description BACKGROUND: More than 115,000 maternal deaths and 591,000 prenatal deaths occurred in the world per year with anemia, the reduction of red blood cells or hemoglobin in the blood. The world health organization divides anemia in pregnancy into mild anemia (Hb 10–10.9 g/dl), moderate anemia (Hb 7.0–9.9 g/dl), and severe anemia (Hb < 7 g/dl). This study aims to predict the level of anemia among pregnant women in the case of Ethiopia using homogeneous ensemble machine learning algorithms. METHODS: This study was conducted following a design science approach. The data were gathered from the Ethiopian demographic health survey and preprocessed to get quality data that are suitable for the machine learning algorithm to develop a model that predicts the levels of anemia among pregnant. Decision tree, random forest, cat boost, and extreme gradient boosting with class decomposition (one versus one and one versus rest) and without class decomposition were employed to build the predictive model. For constructing the proposed model, twelve experiments were conducted with a total of 29,104 instances with 23 features, and a training and testing dataset split ratio of 80/20. RESULTS: The overall accuracy of random forest, extreme gradient boosting, and cat boost without class decompositions is 91.34%, 94.26%, and 97.08.90%, respectively. The overall accuracy of random forest, extreme gradient boosting, and cat boost with one versus one is 94.4%, 95.21%, and 97.44%, respectively. The overall accuracy of random forest, extreme gradient boosting, and cat boost with one versus the rest are 94.4%, 94.54%, and 97.6%, respectively. CONCLUSION: Finally, the researcher decided to use cat boost algorithms with one versus the rest for further use in the development of artifacts, model deployment, risk factor analysis, and generating rules because it has registered better performance with 97.6% accuracy. The most determinant risk factors of anemia among pregnant women were identified using feature importance. Some of them are the duration of the current pregnancy, age, source of drinking water, respondent’s (pregnant women) occupation, number of household members, wealth index, husband/partner's education level, and birth history.
format Online
Article
Text
id pubmed-9494842
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-94948422022-09-23 Predicting the level of anemia among Ethiopian pregnant women using homogeneous ensemble machine learning algorithm Dejene, Belayneh Endalamaw Abuhay, Tesfamariam M. Bogale, Dawit Shibabaw BMC Med Inform Decis Mak Research BACKGROUND: More than 115,000 maternal deaths and 591,000 prenatal deaths occurred in the world per year with anemia, the reduction of red blood cells or hemoglobin in the blood. The world health organization divides anemia in pregnancy into mild anemia (Hb 10–10.9 g/dl), moderate anemia (Hb 7.0–9.9 g/dl), and severe anemia (Hb < 7 g/dl). This study aims to predict the level of anemia among pregnant women in the case of Ethiopia using homogeneous ensemble machine learning algorithms. METHODS: This study was conducted following a design science approach. The data were gathered from the Ethiopian demographic health survey and preprocessed to get quality data that are suitable for the machine learning algorithm to develop a model that predicts the levels of anemia among pregnant. Decision tree, random forest, cat boost, and extreme gradient boosting with class decomposition (one versus one and one versus rest) and without class decomposition were employed to build the predictive model. For constructing the proposed model, twelve experiments were conducted with a total of 29,104 instances with 23 features, and a training and testing dataset split ratio of 80/20. RESULTS: The overall accuracy of random forest, extreme gradient boosting, and cat boost without class decompositions is 91.34%, 94.26%, and 97.08.90%, respectively. The overall accuracy of random forest, extreme gradient boosting, and cat boost with one versus one is 94.4%, 95.21%, and 97.44%, respectively. The overall accuracy of random forest, extreme gradient boosting, and cat boost with one versus the rest are 94.4%, 94.54%, and 97.6%, respectively. CONCLUSION: Finally, the researcher decided to use cat boost algorithms with one versus the rest for further use in the development of artifacts, model deployment, risk factor analysis, and generating rules because it has registered better performance with 97.6% accuracy. The most determinant risk factors of anemia among pregnant women were identified using feature importance. Some of them are the duration of the current pregnancy, age, source of drinking water, respondent’s (pregnant women) occupation, number of household members, wealth index, husband/partner's education level, and birth history. BioMed Central 2022-09-22 /pmc/articles/PMC9494842/ /pubmed/36138398 http://dx.doi.org/10.1186/s12911-022-01992-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Dejene, Belayneh Endalamaw
Abuhay, Tesfamariam M.
Bogale, Dawit Shibabaw
Predicting the level of anemia among Ethiopian pregnant women using homogeneous ensemble machine learning algorithm
title Predicting the level of anemia among Ethiopian pregnant women using homogeneous ensemble machine learning algorithm
title_full Predicting the level of anemia among Ethiopian pregnant women using homogeneous ensemble machine learning algorithm
title_fullStr Predicting the level of anemia among Ethiopian pregnant women using homogeneous ensemble machine learning algorithm
title_full_unstemmed Predicting the level of anemia among Ethiopian pregnant women using homogeneous ensemble machine learning algorithm
title_short Predicting the level of anemia among Ethiopian pregnant women using homogeneous ensemble machine learning algorithm
title_sort predicting the level of anemia among ethiopian pregnant women using homogeneous ensemble machine learning algorithm
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9494842/
https://www.ncbi.nlm.nih.gov/pubmed/36138398
http://dx.doi.org/10.1186/s12911-022-01992-6
work_keys_str_mv AT dejenebelaynehendalamaw predictingthelevelofanemiaamongethiopianpregnantwomenusinghomogeneousensemblemachinelearningalgorithm
AT abuhaytesfamariamm predictingthelevelofanemiaamongethiopianpregnantwomenusinghomogeneousensemblemachinelearningalgorithm
AT bogaledawitshibabaw predictingthelevelofanemiaamongethiopianpregnantwomenusinghomogeneousensemblemachinelearningalgorithm