Cargando…

Prediction of [Formula: see text] -Thalassemia carriers using complete blood count features

[Formula: see text] -Thalassemia is one of the dangerous causes of the high mortality rate in the Mediterranean countries. Substantial resources are required to save a [Formula: see text] -Thalassemia carriers’ life and early detection of thalassemia patients can help appropriate treatment to increa...

Descripción completa

Detalles Bibliográficos
Autores principales: Rustam, Furqan, Ashraf, Imran, Jabbar, Shehbaz, Tutusaus, Kilian, Mazas, Cristina, Barrera, Alina Eugenia Pascual, de la Torre Diez, Isabel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9678892/
https://www.ncbi.nlm.nih.gov/pubmed/36411295
http://dx.doi.org/10.1038/s41598-022-22011-8
_version_ 1784834089183870976
author Rustam, Furqan
Ashraf, Imran
Jabbar, Shehbaz
Tutusaus, Kilian
Mazas, Cristina
Barrera, Alina Eugenia Pascual
de la Torre Diez, Isabel
author_facet Rustam, Furqan
Ashraf, Imran
Jabbar, Shehbaz
Tutusaus, Kilian
Mazas, Cristina
Barrera, Alina Eugenia Pascual
de la Torre Diez, Isabel
author_sort Rustam, Furqan
collection PubMed
description [Formula: see text] -Thalassemia is one of the dangerous causes of the high mortality rate in the Mediterranean countries. Substantial resources are required to save a [Formula: see text] -Thalassemia carriers’ life and early detection of thalassemia patients can help appropriate treatment to increase the carrier’s life expectancy. Being a genetic disease, it can not be prevented however the analysis of several indicators in parents’ blood can be used to detect disorders causing Thalassemia. Laboratory tests for Thalassemia are time-consuming and expensive like high-performance liquid chromatography, Complete Blood Count (CBC) with peripheral smear, genetic test, etc. Red blood indices from CBC can be used with machine learning models for the same task. Despite the available approaches for Thalassemia carriers from CBC data, gaps exist between the desired and achieved accuracy. Moreover, the data imbalance problem is studied well which makes the models less generalizable. This study proposes a highly accurate approach for [Formula: see text] -Thalassemia detection using red blood indices from CBC augmented by supervised machine learning. In view of the fact that all the features do not carry predictive information regarding the target variable, this study employs a unified framework of two features selection techniques including Principal Component Analysis (PCA) and Singular Vector Decomposition (SVD). The data imbalance between [Formula: see text] -Thalassemia carrier and non-carriers is handled by Synthetic Minority Oversampling Technique (SMOTE) and Adaptive Synthetic (ADASYN). Extensive experiments are performed using many state-of-the-art machine learning models and deep learning models. Experimental results indicate the superiority of the proposed approach over existing approaches with an accuracy score of 0.96.
format Online
Article
Text
id pubmed-9678892
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-96788922022-11-23 Prediction of [Formula: see text] -Thalassemia carriers using complete blood count features Rustam, Furqan Ashraf, Imran Jabbar, Shehbaz Tutusaus, Kilian Mazas, Cristina Barrera, Alina Eugenia Pascual de la Torre Diez, Isabel Sci Rep Article [Formula: see text] -Thalassemia is one of the dangerous causes of the high mortality rate in the Mediterranean countries. Substantial resources are required to save a [Formula: see text] -Thalassemia carriers’ life and early detection of thalassemia patients can help appropriate treatment to increase the carrier’s life expectancy. Being a genetic disease, it can not be prevented however the analysis of several indicators in parents’ blood can be used to detect disorders causing Thalassemia. Laboratory tests for Thalassemia are time-consuming and expensive like high-performance liquid chromatography, Complete Blood Count (CBC) with peripheral smear, genetic test, etc. Red blood indices from CBC can be used with machine learning models for the same task. Despite the available approaches for Thalassemia carriers from CBC data, gaps exist between the desired and achieved accuracy. Moreover, the data imbalance problem is studied well which makes the models less generalizable. This study proposes a highly accurate approach for [Formula: see text] -Thalassemia detection using red blood indices from CBC augmented by supervised machine learning. In view of the fact that all the features do not carry predictive information regarding the target variable, this study employs a unified framework of two features selection techniques including Principal Component Analysis (PCA) and Singular Vector Decomposition (SVD). The data imbalance between [Formula: see text] -Thalassemia carrier and non-carriers is handled by Synthetic Minority Oversampling Technique (SMOTE) and Adaptive Synthetic (ADASYN). Extensive experiments are performed using many state-of-the-art machine learning models and deep learning models. Experimental results indicate the superiority of the proposed approach over existing approaches with an accuracy score of 0.96. Nature Publishing Group UK 2022-11-21 /pmc/articles/PMC9678892/ /pubmed/36411295 http://dx.doi.org/10.1038/s41598-022-22011-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Rustam, Furqan
Ashraf, Imran
Jabbar, Shehbaz
Tutusaus, Kilian
Mazas, Cristina
Barrera, Alina Eugenia Pascual
de la Torre Diez, Isabel
Prediction of [Formula: see text] -Thalassemia carriers using complete blood count features
title Prediction of [Formula: see text] -Thalassemia carriers using complete blood count features
title_full Prediction of [Formula: see text] -Thalassemia carriers using complete blood count features
title_fullStr Prediction of [Formula: see text] -Thalassemia carriers using complete blood count features
title_full_unstemmed Prediction of [Formula: see text] -Thalassemia carriers using complete blood count features
title_short Prediction of [Formula: see text] -Thalassemia carriers using complete blood count features
title_sort prediction of [formula: see text] -thalassemia carriers using complete blood count features
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9678892/
https://www.ncbi.nlm.nih.gov/pubmed/36411295
http://dx.doi.org/10.1038/s41598-022-22011-8
work_keys_str_mv AT rustamfurqan predictionofformulaseetextthalassemiacarriersusingcompletebloodcountfeatures
AT ashrafimran predictionofformulaseetextthalassemiacarriersusingcompletebloodcountfeatures
AT jabbarshehbaz predictionofformulaseetextthalassemiacarriersusingcompletebloodcountfeatures
AT tutusauskilian predictionofformulaseetextthalassemiacarriersusingcompletebloodcountfeatures
AT mazascristina predictionofformulaseetextthalassemiacarriersusingcompletebloodcountfeatures
AT barreraalinaeugeniapascual predictionofformulaseetextthalassemiacarriersusingcompletebloodcountfeatures
AT delatorrediezisabel predictionofformulaseetextthalassemiacarriersusingcompletebloodcountfeatures