Cargando…
Predicting the level of anemia among Ethiopian pregnant women using homogeneous ensemble machine learning algorithm
BACKGROUND: More than 115,000 maternal deaths and 591,000 prenatal deaths occurred in the world per year with anemia, the reduction of red blood cells or hemoglobin in the blood. The world health organization divides anemia in pregnancy into mild anemia (Hb 10–10.9 g/dl), moderate anemia (Hb 7.0–9.9...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9494842/ https://www.ncbi.nlm.nih.gov/pubmed/36138398 http://dx.doi.org/10.1186/s12911-022-01992-6 |
_version_ | 1784793879330947072 |
---|---|
author | Dejene, Belayneh Endalamaw Abuhay, Tesfamariam M. Bogale, Dawit Shibabaw |
author_facet | Dejene, Belayneh Endalamaw Abuhay, Tesfamariam M. Bogale, Dawit Shibabaw |
author_sort | Dejene, Belayneh Endalamaw |
collection | PubMed |
description | BACKGROUND: More than 115,000 maternal deaths and 591,000 prenatal deaths occurred in the world per year with anemia, the reduction of red blood cells or hemoglobin in the blood. The world health organization divides anemia in pregnancy into mild anemia (Hb 10–10.9 g/dl), moderate anemia (Hb 7.0–9.9 g/dl), and severe anemia (Hb < 7 g/dl). This study aims to predict the level of anemia among pregnant women in the case of Ethiopia using homogeneous ensemble machine learning algorithms. METHODS: This study was conducted following a design science approach. The data were gathered from the Ethiopian demographic health survey and preprocessed to get quality data that are suitable for the machine learning algorithm to develop a model that predicts the levels of anemia among pregnant. Decision tree, random forest, cat boost, and extreme gradient boosting with class decomposition (one versus one and one versus rest) and without class decomposition were employed to build the predictive model. For constructing the proposed model, twelve experiments were conducted with a total of 29,104 instances with 23 features, and a training and testing dataset split ratio of 80/20. RESULTS: The overall accuracy of random forest, extreme gradient boosting, and cat boost without class decompositions is 91.34%, 94.26%, and 97.08.90%, respectively. The overall accuracy of random forest, extreme gradient boosting, and cat boost with one versus one is 94.4%, 95.21%, and 97.44%, respectively. The overall accuracy of random forest, extreme gradient boosting, and cat boost with one versus the rest are 94.4%, 94.54%, and 97.6%, respectively. CONCLUSION: Finally, the researcher decided to use cat boost algorithms with one versus the rest for further use in the development of artifacts, model deployment, risk factor analysis, and generating rules because it has registered better performance with 97.6% accuracy. The most determinant risk factors of anemia among pregnant women were identified using feature importance. Some of them are the duration of the current pregnancy, age, source of drinking water, respondent’s (pregnant women) occupation, number of household members, wealth index, husband/partner's education level, and birth history. |
format | Online Article Text |
id | pubmed-9494842 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-94948422022-09-23 Predicting the level of anemia among Ethiopian pregnant women using homogeneous ensemble machine learning algorithm Dejene, Belayneh Endalamaw Abuhay, Tesfamariam M. Bogale, Dawit Shibabaw BMC Med Inform Decis Mak Research BACKGROUND: More than 115,000 maternal deaths and 591,000 prenatal deaths occurred in the world per year with anemia, the reduction of red blood cells or hemoglobin in the blood. The world health organization divides anemia in pregnancy into mild anemia (Hb 10–10.9 g/dl), moderate anemia (Hb 7.0–9.9 g/dl), and severe anemia (Hb < 7 g/dl). This study aims to predict the level of anemia among pregnant women in the case of Ethiopia using homogeneous ensemble machine learning algorithms. METHODS: This study was conducted following a design science approach. The data were gathered from the Ethiopian demographic health survey and preprocessed to get quality data that are suitable for the machine learning algorithm to develop a model that predicts the levels of anemia among pregnant. Decision tree, random forest, cat boost, and extreme gradient boosting with class decomposition (one versus one and one versus rest) and without class decomposition were employed to build the predictive model. For constructing the proposed model, twelve experiments were conducted with a total of 29,104 instances with 23 features, and a training and testing dataset split ratio of 80/20. RESULTS: The overall accuracy of random forest, extreme gradient boosting, and cat boost without class decompositions is 91.34%, 94.26%, and 97.08.90%, respectively. The overall accuracy of random forest, extreme gradient boosting, and cat boost with one versus one is 94.4%, 95.21%, and 97.44%, respectively. The overall accuracy of random forest, extreme gradient boosting, and cat boost with one versus the rest are 94.4%, 94.54%, and 97.6%, respectively. CONCLUSION: Finally, the researcher decided to use cat boost algorithms with one versus the rest for further use in the development of artifacts, model deployment, risk factor analysis, and generating rules because it has registered better performance with 97.6% accuracy. The most determinant risk factors of anemia among pregnant women were identified using feature importance. Some of them are the duration of the current pregnancy, age, source of drinking water, respondent’s (pregnant women) occupation, number of household members, wealth index, husband/partner's education level, and birth history. BioMed Central 2022-09-22 /pmc/articles/PMC9494842/ /pubmed/36138398 http://dx.doi.org/10.1186/s12911-022-01992-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Dejene, Belayneh Endalamaw Abuhay, Tesfamariam M. Bogale, Dawit Shibabaw Predicting the level of anemia among Ethiopian pregnant women using homogeneous ensemble machine learning algorithm |
title | Predicting the level of anemia among Ethiopian pregnant women using homogeneous ensemble machine learning algorithm |
title_full | Predicting the level of anemia among Ethiopian pregnant women using homogeneous ensemble machine learning algorithm |
title_fullStr | Predicting the level of anemia among Ethiopian pregnant women using homogeneous ensemble machine learning algorithm |
title_full_unstemmed | Predicting the level of anemia among Ethiopian pregnant women using homogeneous ensemble machine learning algorithm |
title_short | Predicting the level of anemia among Ethiopian pregnant women using homogeneous ensemble machine learning algorithm |
title_sort | predicting the level of anemia among ethiopian pregnant women using homogeneous ensemble machine learning algorithm |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9494842/ https://www.ncbi.nlm.nih.gov/pubmed/36138398 http://dx.doi.org/10.1186/s12911-022-01992-6 |
work_keys_str_mv | AT dejenebelaynehendalamaw predictingthelevelofanemiaamongethiopianpregnantwomenusinghomogeneousensemblemachinelearningalgorithm AT abuhaytesfamariamm predictingthelevelofanemiaamongethiopianpregnantwomenusinghomogeneousensemblemachinelearningalgorithm AT bogaledawitshibabaw predictingthelevelofanemiaamongethiopianpregnantwomenusinghomogeneousensemblemachinelearningalgorithm |