Cargando…

Multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting

Anemia is one of the most pressing public health issues in the world with iron deficiency a major public health issue worldwide. The highest prevalence of anemia is in developing countries. The complete blood count is a blood test used to diagnose the prevalence of anemia. While earlier studies have...

Descripción completa

Detalles Bibliográficos
Autores principales: Vohra, Rajan, Hussain, Abir, Dudyala, Anil Kumar, Pahareeya, Jankisharan, Khan, Wasiq
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9258850/
https://www.ncbi.nlm.nih.gov/pubmed/35793343
http://dx.doi.org/10.1371/journal.pone.0269685
_version_ 1784741639819886592
author Vohra, Rajan
Hussain, Abir
Dudyala, Anil Kumar
Pahareeya, Jankisharan
Khan, Wasiq
author_facet Vohra, Rajan
Hussain, Abir
Dudyala, Anil Kumar
Pahareeya, Jankisharan
Khan, Wasiq
author_sort Vohra, Rajan
collection PubMed
description Anemia is one of the most pressing public health issues in the world with iron deficiency a major public health issue worldwide. The highest prevalence of anemia is in developing countries. The complete blood count is a blood test used to diagnose the prevalence of anemia. While earlier studies have framed the problem of diagnosis as a binary classification problem, this paper frames it as a multi class (three classes) classification problem with mild, moderate and severe classes. The three classes for the anemia classification (mild, moderate, severe) are so chosen as the world health organization (WHO) guidelines formalize this categorization based on the Haemoglobin (HGB) values of the chosen sample of patients in the Complete Blood Count (CBC) patient data set. Complete blood count test data was collected in an outpatient clinical setting in India. We used Feature selection with Majority voting to identify the key attributes in the input patient data set. In addition, since the original data set was imbalanced we used Synthetic Minority Oversampling Technique (SMOTE) to balance the data set. Four data sets including the original data set were used to perform the data experiments. Six standard machine learning algorithms were utilised to test our four data sets, performing multi class classification. Benchmarking these algorithms was performed and tabulated using both10 fold cross validation and hold out methods. The experimental results indicated that multilayer perceptron network was predominantly giving good recall values across mild and moderate class which are early and middle stages of the disease. With a good prediction model at early stages, medical intervention can provide preventive measure from further deterioration into severe stage or recommend the use of supplements to overcome this problem.
format Online
Article
Text
id pubmed-9258850
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-92588502022-07-07 Multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting Vohra, Rajan Hussain, Abir Dudyala, Anil Kumar Pahareeya, Jankisharan Khan, Wasiq PLoS One Research Article Anemia is one of the most pressing public health issues in the world with iron deficiency a major public health issue worldwide. The highest prevalence of anemia is in developing countries. The complete blood count is a blood test used to diagnose the prevalence of anemia. While earlier studies have framed the problem of diagnosis as a binary classification problem, this paper frames it as a multi class (three classes) classification problem with mild, moderate and severe classes. The three classes for the anemia classification (mild, moderate, severe) are so chosen as the world health organization (WHO) guidelines formalize this categorization based on the Haemoglobin (HGB) values of the chosen sample of patients in the Complete Blood Count (CBC) patient data set. Complete blood count test data was collected in an outpatient clinical setting in India. We used Feature selection with Majority voting to identify the key attributes in the input patient data set. In addition, since the original data set was imbalanced we used Synthetic Minority Oversampling Technique (SMOTE) to balance the data set. Four data sets including the original data set were used to perform the data experiments. Six standard machine learning algorithms were utilised to test our four data sets, performing multi class classification. Benchmarking these algorithms was performed and tabulated using both10 fold cross validation and hold out methods. The experimental results indicated that multilayer perceptron network was predominantly giving good recall values across mild and moderate class which are early and middle stages of the disease. With a good prediction model at early stages, medical intervention can provide preventive measure from further deterioration into severe stage or recommend the use of supplements to overcome this problem. Public Library of Science 2022-07-06 /pmc/articles/PMC9258850/ /pubmed/35793343 http://dx.doi.org/10.1371/journal.pone.0269685 Text en © 2022 Vohra et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Vohra, Rajan
Hussain, Abir
Dudyala, Anil Kumar
Pahareeya, Jankisharan
Khan, Wasiq
Multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting
title Multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting
title_full Multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting
title_fullStr Multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting
title_full_unstemmed Multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting
title_short Multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting
title_sort multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9258850/
https://www.ncbi.nlm.nih.gov/pubmed/35793343
http://dx.doi.org/10.1371/journal.pone.0269685
work_keys_str_mv AT vohrarajan multiclassclassificationalgorithmsforthediagnosisofanemiainanoutpatientclinicalsetting
AT hussainabir multiclassclassificationalgorithmsforthediagnosisofanemiainanoutpatientclinicalsetting
AT dudyalaanilkumar multiclassclassificationalgorithmsforthediagnosisofanemiainanoutpatientclinicalsetting
AT pahareeyajankisharan multiclassclassificationalgorithmsforthediagnosisofanemiainanoutpatientclinicalsetting
AT khanwasiq multiclassclassificationalgorithmsforthediagnosisofanemiainanoutpatientclinicalsetting