Cargando…
Multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting
Anemia is one of the most pressing public health issues in the world with iron deficiency a major public health issue worldwide. The highest prevalence of anemia is in developing countries. The complete blood count is a blood test used to diagnose the prevalence of anemia. While earlier studies have...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9258850/ https://www.ncbi.nlm.nih.gov/pubmed/35793343 http://dx.doi.org/10.1371/journal.pone.0269685 |
_version_ | 1784741639819886592 |
---|---|
author | Vohra, Rajan Hussain, Abir Dudyala, Anil Kumar Pahareeya, Jankisharan Khan, Wasiq |
author_facet | Vohra, Rajan Hussain, Abir Dudyala, Anil Kumar Pahareeya, Jankisharan Khan, Wasiq |
author_sort | Vohra, Rajan |
collection | PubMed |
description | Anemia is one of the most pressing public health issues in the world with iron deficiency a major public health issue worldwide. The highest prevalence of anemia is in developing countries. The complete blood count is a blood test used to diagnose the prevalence of anemia. While earlier studies have framed the problem of diagnosis as a binary classification problem, this paper frames it as a multi class (three classes) classification problem with mild, moderate and severe classes. The three classes for the anemia classification (mild, moderate, severe) are so chosen as the world health organization (WHO) guidelines formalize this categorization based on the Haemoglobin (HGB) values of the chosen sample of patients in the Complete Blood Count (CBC) patient data set. Complete blood count test data was collected in an outpatient clinical setting in India. We used Feature selection with Majority voting to identify the key attributes in the input patient data set. In addition, since the original data set was imbalanced we used Synthetic Minority Oversampling Technique (SMOTE) to balance the data set. Four data sets including the original data set were used to perform the data experiments. Six standard machine learning algorithms were utilised to test our four data sets, performing multi class classification. Benchmarking these algorithms was performed and tabulated using both10 fold cross validation and hold out methods. The experimental results indicated that multilayer perceptron network was predominantly giving good recall values across mild and moderate class which are early and middle stages of the disease. With a good prediction model at early stages, medical intervention can provide preventive measure from further deterioration into severe stage or recommend the use of supplements to overcome this problem. |
format | Online Article Text |
id | pubmed-9258850 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-92588502022-07-07 Multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting Vohra, Rajan Hussain, Abir Dudyala, Anil Kumar Pahareeya, Jankisharan Khan, Wasiq PLoS One Research Article Anemia is one of the most pressing public health issues in the world with iron deficiency a major public health issue worldwide. The highest prevalence of anemia is in developing countries. The complete blood count is a blood test used to diagnose the prevalence of anemia. While earlier studies have framed the problem of diagnosis as a binary classification problem, this paper frames it as a multi class (three classes) classification problem with mild, moderate and severe classes. The three classes for the anemia classification (mild, moderate, severe) are so chosen as the world health organization (WHO) guidelines formalize this categorization based on the Haemoglobin (HGB) values of the chosen sample of patients in the Complete Blood Count (CBC) patient data set. Complete blood count test data was collected in an outpatient clinical setting in India. We used Feature selection with Majority voting to identify the key attributes in the input patient data set. In addition, since the original data set was imbalanced we used Synthetic Minority Oversampling Technique (SMOTE) to balance the data set. Four data sets including the original data set were used to perform the data experiments. Six standard machine learning algorithms were utilised to test our four data sets, performing multi class classification. Benchmarking these algorithms was performed and tabulated using both10 fold cross validation and hold out methods. The experimental results indicated that multilayer perceptron network was predominantly giving good recall values across mild and moderate class which are early and middle stages of the disease. With a good prediction model at early stages, medical intervention can provide preventive measure from further deterioration into severe stage or recommend the use of supplements to overcome this problem. Public Library of Science 2022-07-06 /pmc/articles/PMC9258850/ /pubmed/35793343 http://dx.doi.org/10.1371/journal.pone.0269685 Text en © 2022 Vohra et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Vohra, Rajan Hussain, Abir Dudyala, Anil Kumar Pahareeya, Jankisharan Khan, Wasiq Multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting |
title | Multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting |
title_full | Multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting |
title_fullStr | Multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting |
title_full_unstemmed | Multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting |
title_short | Multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting |
title_sort | multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9258850/ https://www.ncbi.nlm.nih.gov/pubmed/35793343 http://dx.doi.org/10.1371/journal.pone.0269685 |
work_keys_str_mv | AT vohrarajan multiclassclassificationalgorithmsforthediagnosisofanemiainanoutpatientclinicalsetting AT hussainabir multiclassclassificationalgorithmsforthediagnosisofanemiainanoutpatientclinicalsetting AT dudyalaanilkumar multiclassclassificationalgorithmsforthediagnosisofanemiainanoutpatientclinicalsetting AT pahareeyajankisharan multiclassclassificationalgorithmsforthediagnosisofanemiainanoutpatientclinicalsetting AT khanwasiq multiclassclassificationalgorithmsforthediagnosisofanemiainanoutpatientclinicalsetting |