Cargando…

UMAP Based Anomaly Detection for Minimal Residual Disease Quantification within Acute Myeloid Leukemia

SIMPLE SUMMARY: Acute myeloid leukemia (AML) is the second most frequent leukemia entity in children and adolescents, and definitely the most aggressive variant. Multiparameter flow-cytometry is one of the methodologies most useful to monitor the number of remaining leukemic cells in bone marrow (mi...

Descripción completa

Detalles Bibliográficos
Autores principales: Weijler, Lisa, Kowarsch, Florian, Wödlinger, Matthias, Reiter, Michael, Maurer-Granofszky, Margarita, Schumich, Angela, Dworzak, Michael N.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8870142/
https://www.ncbi.nlm.nih.gov/pubmed/35205645
http://dx.doi.org/10.3390/cancers14040898
_version_ 1784656666951680000
author Weijler, Lisa
Kowarsch, Florian
Wödlinger, Matthias
Reiter, Michael
Maurer-Granofszky, Margarita
Schumich, Angela
Dworzak, Michael N.
author_facet Weijler, Lisa
Kowarsch, Florian
Wödlinger, Matthias
Reiter, Michael
Maurer-Granofszky, Margarita
Schumich, Angela
Dworzak, Michael N.
author_sort Weijler, Lisa
collection PubMed
description SIMPLE SUMMARY: Acute myeloid leukemia (AML) is the second most frequent leukemia entity in children and adolescents, and definitely the most aggressive variant. Multiparameter flow-cytometry is one of the methodologies most useful to monitor the number of remaining leukemic cells in bone marrow (minimal residual disease, MRD) in AML patients, because it is widely available and applicable to most patients. However, AML flow cytometry data show very complex patterns and identifying leukemic cells in the data is subjective, time-consuming and requires experienced operators who are not available world-wide. In this paper, we approach automatic assessment of AML flow cytometry samples with a novel semi-supervised machine learning model, leveraging implicit expert knowledge stored in a collection of manually assessed samples. Because AML data exhibit a high degree of variability in the patterns of blast cell populations that is difficult to model, the model detects anomalies starting from the appearance of normal cell populations. ABSTRACT: Leukemia is the most frequent malignancy in children and adolescents, with acute lymphoblastic leukemia (ALL) and acute myeloid leukemia (AML) as the most common subtypes. Minimal residual disease (MRD) measured by flow cytometry (FCM) has proven to be a strong prognostic factor in ALL as well as in AML. Machine learning techniques have been emerging in the field of automated MRD quantification with the objective of superseding subjective and time-consuming manual analysis of FCM-MRD data. In contrast to ALL, where supervised multi-class classification methods have been successfully deployed for MRD detection, AML poses new challenges: AML is rarer (with fewer available training data) than ALL and much more heterogeneous in its immunophenotypic appearance, where one-class classification (anomaly detection) methods seem more suitable. In this work, a new semi-supervised approach based on the UMAP algorithm for MRD detection utilizing only labels of blast free FCM samples is presented. The method is tested on a newly gathered set of AML FCM samples and results are compared to state-of-the-art methods. We reach a median [Formula: see text]-score of 0.794, while providing a transparent classification pipeline with explainable results that facilitates inter-disciplinary work between medical and technical experts. This work shows that despite several issues yet to overcome, the merits of automated MRD quantification can be fully exploited also in AML.
format Online
Article
Text
id pubmed-8870142
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-88701422022-02-25 UMAP Based Anomaly Detection for Minimal Residual Disease Quantification within Acute Myeloid Leukemia Weijler, Lisa Kowarsch, Florian Wödlinger, Matthias Reiter, Michael Maurer-Granofszky, Margarita Schumich, Angela Dworzak, Michael N. Cancers (Basel) Article SIMPLE SUMMARY: Acute myeloid leukemia (AML) is the second most frequent leukemia entity in children and adolescents, and definitely the most aggressive variant. Multiparameter flow-cytometry is one of the methodologies most useful to monitor the number of remaining leukemic cells in bone marrow (minimal residual disease, MRD) in AML patients, because it is widely available and applicable to most patients. However, AML flow cytometry data show very complex patterns and identifying leukemic cells in the data is subjective, time-consuming and requires experienced operators who are not available world-wide. In this paper, we approach automatic assessment of AML flow cytometry samples with a novel semi-supervised machine learning model, leveraging implicit expert knowledge stored in a collection of manually assessed samples. Because AML data exhibit a high degree of variability in the patterns of blast cell populations that is difficult to model, the model detects anomalies starting from the appearance of normal cell populations. ABSTRACT: Leukemia is the most frequent malignancy in children and adolescents, with acute lymphoblastic leukemia (ALL) and acute myeloid leukemia (AML) as the most common subtypes. Minimal residual disease (MRD) measured by flow cytometry (FCM) has proven to be a strong prognostic factor in ALL as well as in AML. Machine learning techniques have been emerging in the field of automated MRD quantification with the objective of superseding subjective and time-consuming manual analysis of FCM-MRD data. In contrast to ALL, where supervised multi-class classification methods have been successfully deployed for MRD detection, AML poses new challenges: AML is rarer (with fewer available training data) than ALL and much more heterogeneous in its immunophenotypic appearance, where one-class classification (anomaly detection) methods seem more suitable. In this work, a new semi-supervised approach based on the UMAP algorithm for MRD detection utilizing only labels of blast free FCM samples is presented. The method is tested on a newly gathered set of AML FCM samples and results are compared to state-of-the-art methods. We reach a median [Formula: see text]-score of 0.794, while providing a transparent classification pipeline with explainable results that facilitates inter-disciplinary work between medical and technical experts. This work shows that despite several issues yet to overcome, the merits of automated MRD quantification can be fully exploited also in AML. MDPI 2022-02-11 /pmc/articles/PMC8870142/ /pubmed/35205645 http://dx.doi.org/10.3390/cancers14040898 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Weijler, Lisa
Kowarsch, Florian
Wödlinger, Matthias
Reiter, Michael
Maurer-Granofszky, Margarita
Schumich, Angela
Dworzak, Michael N.
UMAP Based Anomaly Detection for Minimal Residual Disease Quantification within Acute Myeloid Leukemia
title UMAP Based Anomaly Detection for Minimal Residual Disease Quantification within Acute Myeloid Leukemia
title_full UMAP Based Anomaly Detection for Minimal Residual Disease Quantification within Acute Myeloid Leukemia
title_fullStr UMAP Based Anomaly Detection for Minimal Residual Disease Quantification within Acute Myeloid Leukemia
title_full_unstemmed UMAP Based Anomaly Detection for Minimal Residual Disease Quantification within Acute Myeloid Leukemia
title_short UMAP Based Anomaly Detection for Minimal Residual Disease Quantification within Acute Myeloid Leukemia
title_sort umap based anomaly detection for minimal residual disease quantification within acute myeloid leukemia
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8870142/
https://www.ncbi.nlm.nih.gov/pubmed/35205645
http://dx.doi.org/10.3390/cancers14040898
work_keys_str_mv AT weijlerlisa umapbasedanomalydetectionforminimalresidualdiseasequantificationwithinacutemyeloidleukemia
AT kowarschflorian umapbasedanomalydetectionforminimalresidualdiseasequantificationwithinacutemyeloidleukemia
AT wodlingermatthias umapbasedanomalydetectionforminimalresidualdiseasequantificationwithinacutemyeloidleukemia
AT reitermichael umapbasedanomalydetectionforminimalresidualdiseasequantificationwithinacutemyeloidleukemia
AT maurergranofszkymargarita umapbasedanomalydetectionforminimalresidualdiseasequantificationwithinacutemyeloidleukemia
AT schumichangela umapbasedanomalydetectionforminimalresidualdiseasequantificationwithinacutemyeloidleukemia
AT dworzakmichaeln umapbasedanomalydetectionforminimalresidualdiseasequantificationwithinacutemyeloidleukemia