Cargando…

Multivariate Analysis of Flow Cytometric Data Using Decision Trees

Characterization of the response of the host immune system is important in understanding the bidirectional interactions between the host and microbial pathogens. For research on the host site, flow cytometry has become one of the major tools in immunology. Advances in technology and reagents allow n...

Descripción completa

Detalles Bibliográficos
Autores principales: Simon, Svenja, Guthke, Reinhard, Kamradt, Thomas, Frey, Oliver
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Research Foundation 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3316995/
https://www.ncbi.nlm.nih.gov/pubmed/22485112
http://dx.doi.org/10.3389/fmicb.2012.00114
_version_ 1782228484983095296
author Simon, Svenja
Guthke, Reinhard
Kamradt, Thomas
Frey, Oliver
author_facet Simon, Svenja
Guthke, Reinhard
Kamradt, Thomas
Frey, Oliver
author_sort Simon, Svenja
collection PubMed
description Characterization of the response of the host immune system is important in understanding the bidirectional interactions between the host and microbial pathogens. For research on the host site, flow cytometry has become one of the major tools in immunology. Advances in technology and reagents allow now the simultaneous assessment of multiple markers on a single cell level generating multidimensional data sets that require multivariate statistical analysis. We explored the explanatory power of the supervised machine learning method called “induction of decision trees” in flow cytometric data. In order to examine whether the production of a certain cytokine is depended on other cytokines, datasets from intracellular staining for six cytokines with complex patterns of co-expression were analyzed by induction of decision trees. After weighting the data according to their class probabilities, we created a total of 13,392 different decision trees for each given cytokine with different parameter settings. For a more realistic estimation of the decision trees’ quality, we used stratified fivefold cross validation and chose the “best” tree according to a combination of different quality criteria. While some of the decision trees reflected previously known co-expression patterns, we found that the expression of some cytokines was not only dependent on the co-expression of others per se, but was also dependent on the intensity of expression. Thus, for the first time we successfully used induction of decision trees for the analysis of high dimensional flow cytometric data and demonstrated the feasibility of this method to reveal structural patterns in such data sets.
format Online
Article
Text
id pubmed-3316995
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Frontiers Research Foundation
record_format MEDLINE/PubMed
spelling pubmed-33169952012-04-06 Multivariate Analysis of Flow Cytometric Data Using Decision Trees Simon, Svenja Guthke, Reinhard Kamradt, Thomas Frey, Oliver Front Microbiol Microbiology Characterization of the response of the host immune system is important in understanding the bidirectional interactions between the host and microbial pathogens. For research on the host site, flow cytometry has become one of the major tools in immunology. Advances in technology and reagents allow now the simultaneous assessment of multiple markers on a single cell level generating multidimensional data sets that require multivariate statistical analysis. We explored the explanatory power of the supervised machine learning method called “induction of decision trees” in flow cytometric data. In order to examine whether the production of a certain cytokine is depended on other cytokines, datasets from intracellular staining for six cytokines with complex patterns of co-expression were analyzed by induction of decision trees. After weighting the data according to their class probabilities, we created a total of 13,392 different decision trees for each given cytokine with different parameter settings. For a more realistic estimation of the decision trees’ quality, we used stratified fivefold cross validation and chose the “best” tree according to a combination of different quality criteria. While some of the decision trees reflected previously known co-expression patterns, we found that the expression of some cytokines was not only dependent on the co-expression of others per se, but was also dependent on the intensity of expression. Thus, for the first time we successfully used induction of decision trees for the analysis of high dimensional flow cytometric data and demonstrated the feasibility of this method to reveal structural patterns in such data sets. Frontiers Research Foundation 2012-04-02 /pmc/articles/PMC3316995/ /pubmed/22485112 http://dx.doi.org/10.3389/fmicb.2012.00114 Text en Copyright © 2012 Simon, Guthke, Kamradt and Frey. http://www.frontiersin.org/licenseagreement This is an open-access article distributed under the terms of the Creative Commons Attribution Non Commercial License, which permits non-commercial use, distribution, and reproduction in other forums, provided the original authors and source are credited.
spellingShingle Microbiology
Simon, Svenja
Guthke, Reinhard
Kamradt, Thomas
Frey, Oliver
Multivariate Analysis of Flow Cytometric Data Using Decision Trees
title Multivariate Analysis of Flow Cytometric Data Using Decision Trees
title_full Multivariate Analysis of Flow Cytometric Data Using Decision Trees
title_fullStr Multivariate Analysis of Flow Cytometric Data Using Decision Trees
title_full_unstemmed Multivariate Analysis of Flow Cytometric Data Using Decision Trees
title_short Multivariate Analysis of Flow Cytometric Data Using Decision Trees
title_sort multivariate analysis of flow cytometric data using decision trees
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3316995/
https://www.ncbi.nlm.nih.gov/pubmed/22485112
http://dx.doi.org/10.3389/fmicb.2012.00114
work_keys_str_mv AT simonsvenja multivariateanalysisofflowcytometricdatausingdecisiontrees
AT guthkereinhard multivariateanalysisofflowcytometricdatausingdecisiontrees
AT kamradtthomas multivariateanalysisofflowcytometricdatausingdecisiontrees
AT freyoliver multivariateanalysisofflowcytometricdatausingdecisiontrees