Cargando…

Machine learning based efficient prediction of positive cases of waterborne diseases

BACKGROUND: Water quality has been compromised and endangered by different contaminants due to Pakistan’s rapid population development, which has resulted in a dramatic rise in waterborne infections and afflicted many regions of Pakistan. Because of this, modeling and predicting waterborne diseases...

Descripción completa

Detalles Bibliográficos
Autores principales: Hussain, Mushtaq, Cifci, Mehmet Akif, Sehar, Tayyaba, Nabi, Said, Cheikhrouhou, Omar, Maqsood, Hasaan, Ibrahim, Muhammad, Mohammad, Fida
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9848024/
https://www.ncbi.nlm.nih.gov/pubmed/36653779
http://dx.doi.org/10.1186/s12911-022-02092-1
_version_ 1784871609612369920
author Hussain, Mushtaq
Cifci, Mehmet Akif
Sehar, Tayyaba
Nabi, Said
Cheikhrouhou, Omar
Maqsood, Hasaan
Ibrahim, Muhammad
Mohammad, Fida
author_facet Hussain, Mushtaq
Cifci, Mehmet Akif
Sehar, Tayyaba
Nabi, Said
Cheikhrouhou, Omar
Maqsood, Hasaan
Ibrahim, Muhammad
Mohammad, Fida
author_sort Hussain, Mushtaq
collection PubMed
description BACKGROUND: Water quality has been compromised and endangered by different contaminants due to Pakistan’s rapid population development, which has resulted in a dramatic rise in waterborne infections and afflicted many regions of Pakistan. Because of this, modeling and predicting waterborne diseases has become a hot topic for researchers and is very important for controlling waterborne disease pollution. METHODS: In our study, first, we collected typhoid and malaria patient data for the years 2017–2020 from Ayub Medical Hospital. The collected data set has seven important input features. In the current study, different ML models were first trained and tested on the current study dataset using the tenfold cross-validation method. Second, we investigated the importance of input features in waterborne disease-positive case detection. The experiment results showed that Random Forest correctly predicted malaria-positive cases 60% of the time and typhoid-positive cases 77% of the time, which is better than other machine-learning models. In this research, we have also investigated the input features that are more important in the prediction and will help analyze positive cases of waterborne disease. The random forest feature selection technique has been used, and experimental results have shown that age, history, and test results play an important role in predicting waterborne disease-positive cases. In the end, we concluded that this interesting study could help health departments in different areas reduce the number of people who get sick from the water.
format Online
Article
Text
id pubmed-9848024
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-98480242023-01-19 Machine learning based efficient prediction of positive cases of waterborne diseases Hussain, Mushtaq Cifci, Mehmet Akif Sehar, Tayyaba Nabi, Said Cheikhrouhou, Omar Maqsood, Hasaan Ibrahim, Muhammad Mohammad, Fida BMC Med Inform Decis Mak Research BACKGROUND: Water quality has been compromised and endangered by different contaminants due to Pakistan’s rapid population development, which has resulted in a dramatic rise in waterborne infections and afflicted many regions of Pakistan. Because of this, modeling and predicting waterborne diseases has become a hot topic for researchers and is very important for controlling waterborne disease pollution. METHODS: In our study, first, we collected typhoid and malaria patient data for the years 2017–2020 from Ayub Medical Hospital. The collected data set has seven important input features. In the current study, different ML models were first trained and tested on the current study dataset using the tenfold cross-validation method. Second, we investigated the importance of input features in waterborne disease-positive case detection. The experiment results showed that Random Forest correctly predicted malaria-positive cases 60% of the time and typhoid-positive cases 77% of the time, which is better than other machine-learning models. In this research, we have also investigated the input features that are more important in the prediction and will help analyze positive cases of waterborne disease. The random forest feature selection technique has been used, and experimental results have shown that age, history, and test results play an important role in predicting waterborne disease-positive cases. In the end, we concluded that this interesting study could help health departments in different areas reduce the number of people who get sick from the water. BioMed Central 2023-01-18 /pmc/articles/PMC9848024/ /pubmed/36653779 http://dx.doi.org/10.1186/s12911-022-02092-1 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Hussain, Mushtaq
Cifci, Mehmet Akif
Sehar, Tayyaba
Nabi, Said
Cheikhrouhou, Omar
Maqsood, Hasaan
Ibrahim, Muhammad
Mohammad, Fida
Machine learning based efficient prediction of positive cases of waterborne diseases
title Machine learning based efficient prediction of positive cases of waterborne diseases
title_full Machine learning based efficient prediction of positive cases of waterborne diseases
title_fullStr Machine learning based efficient prediction of positive cases of waterborne diseases
title_full_unstemmed Machine learning based efficient prediction of positive cases of waterborne diseases
title_short Machine learning based efficient prediction of positive cases of waterborne diseases
title_sort machine learning based efficient prediction of positive cases of waterborne diseases
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9848024/
https://www.ncbi.nlm.nih.gov/pubmed/36653779
http://dx.doi.org/10.1186/s12911-022-02092-1
work_keys_str_mv AT hussainmushtaq machinelearningbasedefficientpredictionofpositivecasesofwaterbornediseases
AT cifcimehmetakif machinelearningbasedefficientpredictionofpositivecasesofwaterbornediseases
AT sehartayyaba machinelearningbasedefficientpredictionofpositivecasesofwaterbornediseases
AT nabisaid machinelearningbasedefficientpredictionofpositivecasesofwaterbornediseases
AT cheikhrouhouomar machinelearningbasedefficientpredictionofpositivecasesofwaterbornediseases
AT maqsoodhasaan machinelearningbasedefficientpredictionofpositivecasesofwaterbornediseases
AT ibrahimmuhammad machinelearningbasedefficientpredictionofpositivecasesofwaterbornediseases
AT mohammadfida machinelearningbasedefficientpredictionofpositivecasesofwaterbornediseases