Cargando…
Multivariate Analysis of Flow Cytometric Data Using Decision Trees
Characterization of the response of the host immune system is important in understanding the bidirectional interactions between the host and microbial pathogens. For research on the host site, flow cytometry has become one of the major tools in immunology. Advances in technology and reagents allow n...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Research Foundation
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3316995/ https://www.ncbi.nlm.nih.gov/pubmed/22485112 http://dx.doi.org/10.3389/fmicb.2012.00114 |
_version_ | 1782228484983095296 |
---|---|
author | Simon, Svenja Guthke, Reinhard Kamradt, Thomas Frey, Oliver |
author_facet | Simon, Svenja Guthke, Reinhard Kamradt, Thomas Frey, Oliver |
author_sort | Simon, Svenja |
collection | PubMed |
description | Characterization of the response of the host immune system is important in understanding the bidirectional interactions between the host and microbial pathogens. For research on the host site, flow cytometry has become one of the major tools in immunology. Advances in technology and reagents allow now the simultaneous assessment of multiple markers on a single cell level generating multidimensional data sets that require multivariate statistical analysis. We explored the explanatory power of the supervised machine learning method called “induction of decision trees” in flow cytometric data. In order to examine whether the production of a certain cytokine is depended on other cytokines, datasets from intracellular staining for six cytokines with complex patterns of co-expression were analyzed by induction of decision trees. After weighting the data according to their class probabilities, we created a total of 13,392 different decision trees for each given cytokine with different parameter settings. For a more realistic estimation of the decision trees’ quality, we used stratified fivefold cross validation and chose the “best” tree according to a combination of different quality criteria. While some of the decision trees reflected previously known co-expression patterns, we found that the expression of some cytokines was not only dependent on the co-expression of others per se, but was also dependent on the intensity of expression. Thus, for the first time we successfully used induction of decision trees for the analysis of high dimensional flow cytometric data and demonstrated the feasibility of this method to reveal structural patterns in such data sets. |
format | Online Article Text |
id | pubmed-3316995 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Frontiers Research Foundation |
record_format | MEDLINE/PubMed |
spelling | pubmed-33169952012-04-06 Multivariate Analysis of Flow Cytometric Data Using Decision Trees Simon, Svenja Guthke, Reinhard Kamradt, Thomas Frey, Oliver Front Microbiol Microbiology Characterization of the response of the host immune system is important in understanding the bidirectional interactions between the host and microbial pathogens. For research on the host site, flow cytometry has become one of the major tools in immunology. Advances in technology and reagents allow now the simultaneous assessment of multiple markers on a single cell level generating multidimensional data sets that require multivariate statistical analysis. We explored the explanatory power of the supervised machine learning method called “induction of decision trees” in flow cytometric data. In order to examine whether the production of a certain cytokine is depended on other cytokines, datasets from intracellular staining for six cytokines with complex patterns of co-expression were analyzed by induction of decision trees. After weighting the data according to their class probabilities, we created a total of 13,392 different decision trees for each given cytokine with different parameter settings. For a more realistic estimation of the decision trees’ quality, we used stratified fivefold cross validation and chose the “best” tree according to a combination of different quality criteria. While some of the decision trees reflected previously known co-expression patterns, we found that the expression of some cytokines was not only dependent on the co-expression of others per se, but was also dependent on the intensity of expression. Thus, for the first time we successfully used induction of decision trees for the analysis of high dimensional flow cytometric data and demonstrated the feasibility of this method to reveal structural patterns in such data sets. Frontiers Research Foundation 2012-04-02 /pmc/articles/PMC3316995/ /pubmed/22485112 http://dx.doi.org/10.3389/fmicb.2012.00114 Text en Copyright © 2012 Simon, Guthke, Kamradt and Frey. http://www.frontiersin.org/licenseagreement This is an open-access article distributed under the terms of the Creative Commons Attribution Non Commercial License, which permits non-commercial use, distribution, and reproduction in other forums, provided the original authors and source are credited. |
spellingShingle | Microbiology Simon, Svenja Guthke, Reinhard Kamradt, Thomas Frey, Oliver Multivariate Analysis of Flow Cytometric Data Using Decision Trees |
title | Multivariate Analysis of Flow Cytometric Data Using Decision Trees |
title_full | Multivariate Analysis of Flow Cytometric Data Using Decision Trees |
title_fullStr | Multivariate Analysis of Flow Cytometric Data Using Decision Trees |
title_full_unstemmed | Multivariate Analysis of Flow Cytometric Data Using Decision Trees |
title_short | Multivariate Analysis of Flow Cytometric Data Using Decision Trees |
title_sort | multivariate analysis of flow cytometric data using decision trees |
topic | Microbiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3316995/ https://www.ncbi.nlm.nih.gov/pubmed/22485112 http://dx.doi.org/10.3389/fmicb.2012.00114 |
work_keys_str_mv | AT simonsvenja multivariateanalysisofflowcytometricdatausingdecisiontrees AT guthkereinhard multivariateanalysisofflowcytometricdatausingdecisiontrees AT kamradtthomas multivariateanalysisofflowcytometricdatausingdecisiontrees AT freyoliver multivariateanalysisofflowcytometricdatausingdecisiontrees |