Cargando…

Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization

Flow cytometry is a widely used technique for the analysis of cell populations in the study and diagnosis of human diseases. It yields large amounts of high-dimensional data, the analysis of which would clearly benefit from efficient computational approaches aiming at automated diagnosis and decisio...

Descripción completa

Detalles Bibliográficos
Autores principales: Biehl, Michael, Bunte, Kerstin, Schneider, Petra
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3601077/
https://www.ncbi.nlm.nih.gov/pubmed/23527184
http://dx.doi.org/10.1371/journal.pone.0059401
_version_ 1782475712663388160
author Biehl, Michael
Bunte, Kerstin
Schneider, Petra
author_facet Biehl, Michael
Bunte, Kerstin
Schneider, Petra
author_sort Biehl, Michael
collection PubMed
description Flow cytometry is a widely used technique for the analysis of cell populations in the study and diagnosis of human diseases. It yields large amounts of high-dimensional data, the analysis of which would clearly benefit from efficient computational approaches aiming at automated diagnosis and decision support. This article presents our analysis of flow cytometry data in the framework of the DREAM6/FlowCAP2 Molecular Classification of Acute Myeloid Leukemia (AML) Challenge, 2011. In the challenge, example data was provided for a set of 179 subjects, comprising healthy donors and 23 cases of AML. The participants were asked to provide predictions with respect to the condition of 180 patients in a test set. We extracted feature vectors from the data in terms of single marker statistics, including characteristic moments, median and interquartile range of the observed values. Subsequently, we applied Generalized Matrix Relevance Learning Vector Quantization (GMLVQ), a machine learning technique which extends standard LVQ by an adaptive distance measure. Our method achieved the best possible performance with respect to the diagnoses of test set patients. The extraction of features from the flow cytometry data is outlined in detail, the machine learning approach is discussed and classification results are presented. In addition, we illustrate how GMLVQ can provide deeper insight into the problem by allowing to infer the relevance of specific markers and features for the diagnosis.
format Online
Article
Text
id pubmed-3601077
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-36010772013-03-22 Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization Biehl, Michael Bunte, Kerstin Schneider, Petra PLoS One Research Article Flow cytometry is a widely used technique for the analysis of cell populations in the study and diagnosis of human diseases. It yields large amounts of high-dimensional data, the analysis of which would clearly benefit from efficient computational approaches aiming at automated diagnosis and decision support. This article presents our analysis of flow cytometry data in the framework of the DREAM6/FlowCAP2 Molecular Classification of Acute Myeloid Leukemia (AML) Challenge, 2011. In the challenge, example data was provided for a set of 179 subjects, comprising healthy donors and 23 cases of AML. The participants were asked to provide predictions with respect to the condition of 180 patients in a test set. We extracted feature vectors from the data in terms of single marker statistics, including characteristic moments, median and interquartile range of the observed values. Subsequently, we applied Generalized Matrix Relevance Learning Vector Quantization (GMLVQ), a machine learning technique which extends standard LVQ by an adaptive distance measure. Our method achieved the best possible performance with respect to the diagnoses of test set patients. The extraction of features from the flow cytometry data is outlined in detail, the machine learning approach is discussed and classification results are presented. In addition, we illustrate how GMLVQ can provide deeper insight into the problem by allowing to infer the relevance of specific markers and features for the diagnosis. Public Library of Science 2013-03-18 /pmc/articles/PMC3601077/ /pubmed/23527184 http://dx.doi.org/10.1371/journal.pone.0059401 Text en © 2013 Biehl et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Biehl, Michael
Bunte, Kerstin
Schneider, Petra
Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization
title Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization
title_full Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization
title_fullStr Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization
title_full_unstemmed Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization
title_short Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization
title_sort analysis of flow cytometry data by matrix relevance learning vector quantization
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3601077/
https://www.ncbi.nlm.nih.gov/pubmed/23527184
http://dx.doi.org/10.1371/journal.pone.0059401
work_keys_str_mv AT biehlmichael analysisofflowcytometrydatabymatrixrelevancelearningvectorquantization
AT buntekerstin analysisofflowcytometrydatabymatrixrelevancelearningvectorquantization
AT schneiderpetra analysisofflowcytometrydatabymatrixrelevancelearningvectorquantization