Cargando…
Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization
Flow cytometry is a widely used technique for the analysis of cell populations in the study and diagnosis of human diseases. It yields large amounts of high-dimensional data, the analysis of which would clearly benefit from efficient computational approaches aiming at automated diagnosis and decisio...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3601077/ https://www.ncbi.nlm.nih.gov/pubmed/23527184 http://dx.doi.org/10.1371/journal.pone.0059401 |
_version_ | 1782475712663388160 |
---|---|
author | Biehl, Michael Bunte, Kerstin Schneider, Petra |
author_facet | Biehl, Michael Bunte, Kerstin Schneider, Petra |
author_sort | Biehl, Michael |
collection | PubMed |
description | Flow cytometry is a widely used technique for the analysis of cell populations in the study and diagnosis of human diseases. It yields large amounts of high-dimensional data, the analysis of which would clearly benefit from efficient computational approaches aiming at automated diagnosis and decision support. This article presents our analysis of flow cytometry data in the framework of the DREAM6/FlowCAP2 Molecular Classification of Acute Myeloid Leukemia (AML) Challenge, 2011. In the challenge, example data was provided for a set of 179 subjects, comprising healthy donors and 23 cases of AML. The participants were asked to provide predictions with respect to the condition of 180 patients in a test set. We extracted feature vectors from the data in terms of single marker statistics, including characteristic moments, median and interquartile range of the observed values. Subsequently, we applied Generalized Matrix Relevance Learning Vector Quantization (GMLVQ), a machine learning technique which extends standard LVQ by an adaptive distance measure. Our method achieved the best possible performance with respect to the diagnoses of test set patients. The extraction of features from the flow cytometry data is outlined in detail, the machine learning approach is discussed and classification results are presented. In addition, we illustrate how GMLVQ can provide deeper insight into the problem by allowing to infer the relevance of specific markers and features for the diagnosis. |
format | Online Article Text |
id | pubmed-3601077 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-36010772013-03-22 Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization Biehl, Michael Bunte, Kerstin Schneider, Petra PLoS One Research Article Flow cytometry is a widely used technique for the analysis of cell populations in the study and diagnosis of human diseases. It yields large amounts of high-dimensional data, the analysis of which would clearly benefit from efficient computational approaches aiming at automated diagnosis and decision support. This article presents our analysis of flow cytometry data in the framework of the DREAM6/FlowCAP2 Molecular Classification of Acute Myeloid Leukemia (AML) Challenge, 2011. In the challenge, example data was provided for a set of 179 subjects, comprising healthy donors and 23 cases of AML. The participants were asked to provide predictions with respect to the condition of 180 patients in a test set. We extracted feature vectors from the data in terms of single marker statistics, including characteristic moments, median and interquartile range of the observed values. Subsequently, we applied Generalized Matrix Relevance Learning Vector Quantization (GMLVQ), a machine learning technique which extends standard LVQ by an adaptive distance measure. Our method achieved the best possible performance with respect to the diagnoses of test set patients. The extraction of features from the flow cytometry data is outlined in detail, the machine learning approach is discussed and classification results are presented. In addition, we illustrate how GMLVQ can provide deeper insight into the problem by allowing to infer the relevance of specific markers and features for the diagnosis. Public Library of Science 2013-03-18 /pmc/articles/PMC3601077/ /pubmed/23527184 http://dx.doi.org/10.1371/journal.pone.0059401 Text en © 2013 Biehl et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Biehl, Michael Bunte, Kerstin Schneider, Petra Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization |
title | Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization |
title_full | Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization |
title_fullStr | Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization |
title_full_unstemmed | Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization |
title_short | Analysis of Flow Cytometry Data by Matrix Relevance Learning Vector Quantization |
title_sort | analysis of flow cytometry data by matrix relevance learning vector quantization |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3601077/ https://www.ncbi.nlm.nih.gov/pubmed/23527184 http://dx.doi.org/10.1371/journal.pone.0059401 |
work_keys_str_mv | AT biehlmichael analysisofflowcytometrydatabymatrixrelevancelearningvectorquantization AT buntekerstin analysisofflowcytometrydatabymatrixrelevancelearningvectorquantization AT schneiderpetra analysisofflowcytometrydatabymatrixrelevancelearningvectorquantization |