Detection and Classification of Immature Leukocytes for Diagnosis of Acute Myeloid Leukemia Using Random Forest Algorithm

Acute myeloid leukemia (AML) is a fatal blood cancer that progresses rapidly and hinders the function of blood cells and the immune system. The current AML diagnostic method, a manual examination of the peripheral blood smear, is time-consuming, labor-intensive, and suffers from considerable inter-o...

Bibliographic Details
Main authors: Dasariraju, Satvik; Huo, Marc; McCalla, Serena
Format: Online Article Text
Language: English
Published: MDPI 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7711527/
https://www.ncbi.nlm.nih.gov/pubmed/33019619
http://dx.doi.org/10.3390/bioengineering7040120
author Dasariraju, Satvik
Huo, Marc
McCalla, Serena
collection PubMed
description Acute myeloid leukemia (AML) is a fatal blood cancer that progresses rapidly and hinders the function of blood cells and the immune system. The current AML diagnostic method, a manual examination of the peripheral blood smear, is time-consuming, labor-intensive, and suffers from considerable inter-observer variation. Herein, a machine learning model to detect and classify immature leukocytes for efficient diagnosis of AML is presented. Images of leukocytes in AML patients and healthy controls were obtained from a publicly available dataset in The Cancer Imaging Archive. Image format conversion, multi-Otsu thresholding, and morphological operations were used for segmentation of the nucleus and cytoplasm. From each image, 16 features were extracted, two of which are new nucleus color features proposed in this study. A random forest algorithm was trained for the detection and classification of immature leukocytes. The model achieved 92.99% accuracy for detection and 93.45% accuracy for classification of immature leukocytes into four types. Precision values for each class were above 65%, which is an improvement on the current state of the art. Based on Gini importance, the nucleus-to-cytoplasm area ratio was a discriminative feature for both detection and classification, while the two proposed features were shown to be significant for classification. The proposed model can be used as a support tool for the diagnosis of AML, and the features calculated to be most important serve as a baseline for future research.
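The description mentions two quantities that are easy to illustrate: the nucleus-to-cytoplasm area ratio and Gini importance. The following is a minimal pure-Python sketch, not the authors' code. It assumes the ratio is nucleus area divided by cytoplasm area (cell area minus nucleus area), which the record itself does not spell out, and it shows only the Gini impurity decrease of a single threshold split, the quantity that a random forest accumulates per feature to obtain Gini importance. The function names, toy masks, ratio values, and labels are all hypothetical.

```python
from collections import Counter


def area(mask):
    """Count foreground pixels in a binary mask given as rows of 0/1."""
    return sum(sum(row) for row in mask)


def nc_area_ratio(nucleus_mask, cell_mask):
    """Nucleus area over cytoplasm area (assumed definition: cell minus nucleus)."""
    nucleus = area(nucleus_mask)
    cytoplasm = area(cell_mask) - nucleus
    if cytoplasm <= 0:
        raise ValueError("cell mask must cover more pixels than the nucleus mask")
    return nucleus / cytoplasm


def gini_impurity(labels):
    """Gini impurity 1 - sum(p_k^2) over the class proportions p_k."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def gini_decrease(values, labels, threshold):
    """Impurity decrease obtained by splitting one feature at `threshold`."""
    left = [y for x, y in zip(values, labels) if x <= threshold]
    right = [y for x, y in zip(values, labels) if x > threshold]
    n = len(labels)
    weighted = (len(left) / n) * gini_impurity(left) \
        + (len(right) / n) * gini_impurity(right)
    return gini_impurity(labels) - weighted


# Hypothetical 4x4 segmentation result: 16 cell pixels, 4 of them nucleus.
cell_mask = [[1, 1, 1, 1], [1, 1, 1, 1], [1, 1, 1, 1], [1, 1, 1, 1]]
nucleus_mask = [[0, 0, 0, 0], [0, 1, 1, 0], [0, 1, 1, 0], [0, 0, 0, 0]]
ratio = nc_area_ratio(nucleus_mask, cell_mask)

# Hypothetical ratios for four cells and their made-up labels.
ratios = [0.2, 0.3, 0.9, 1.1]
labels = ["mature", "mature", "immature", "immature"]
decrease = gini_decrease(ratios, labels, threshold=0.5)

print(ratio, decrease)
```

In this toy data the split at 0.5 separates the two classes completely, so the impurity decrease equals the parent impurity. A library implementation of a random forest would average such decreases, weighted by node size, over every split and tree that uses the feature.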
format Online
Article
Text
id pubmed-7711527
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-7711527 2020-12-04 Detection and Classification of Immature Leukocytes for Diagnosis of Acute Myeloid Leukemia Using Random Forest Algorithm. Dasariraju, Satvik; Huo, Marc; McCalla, Serena. Bioengineering (Basel). Article. MDPI 2020-10-01 /pmc/articles/PMC7711527/ /pubmed/33019619 http://dx.doi.org/10.3390/bioengineering7040120 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
title Detection and Classification of Immature Leukocytes for Diagnosis of Acute Myeloid Leukemia Using Random Forest Algorithm
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7711527/
https://www.ncbi.nlm.nih.gov/pubmed/33019619
http://dx.doi.org/10.3390/bioengineering7040120