Cargando…
Tree-Based Methods for Discovery of Association between Flow Cytometry Data and Clinical Endpoints
We demonstrate the application and comparative interpretations of three tree-based algorithms for the analysis of data arising from flow cytometry: classification and regression trees (CARTs), random forests (RFs), and logic regression (LR). Specifically, we consider the question of what best predic...
Autores principales: | , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2817388/ https://www.ncbi.nlm.nih.gov/pubmed/20145719 http://dx.doi.org/10.1155/2009/235320 |
_version_ | 1782177184241156096 |
---|---|
author | Eliot, M. Azzoni, L. Firnhaber, C. Stevens, W. Glencross, D. K. Sanne, I. Montaner, L. J. Foulkes, A. S. |
author_facet | Eliot, M. Azzoni, L. Firnhaber, C. Stevens, W. Glencross, D. K. Sanne, I. Montaner, L. J. Foulkes, A. S. |
author_sort | Eliot, M. |
collection | PubMed |
description | We demonstrate the application and comparative interpretations of three tree-based algorithms for the analysis of data arising from flow cytometry: classification and regression trees (CARTs), random forests (RFs), and logic regression (LR). Specifically, we consider the question of what best predicts CD4 T-cell recovery in HIV-1 infected persons starting antiretroviral therapy with CD4 count between 200 and 350 cell/μL. A comparison to a more standard contingency table analysis is provided. While contingency table analysis and RFs provide information on the importance of each potential predictor variable, CART and LR offer additional insight into the combinations of variables that together are predictive of the outcome. In all cases considered, baseline CD3-DR-CD56+CD16+ emerges as an important predictor variable, while the tree-based approaches identify additional variables as potentially informative. Application of tree-based methods to our data suggests that a combination of baseline immune activation states, with emphasis on CD8 T-cell activation, may be a better predictor than any single T-cell/innate cell subset analyzed. Taken together, we show that tree-based methods can be successfully applied to flow cytometry data to better inform and discover associations that may not emerge in the context of a univariate analysis. |
format | Text |
id | pubmed-2817388 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-28173882010-02-09 Tree-Based Methods for Discovery of Association between Flow Cytometry Data and Clinical Endpoints Eliot, M. Azzoni, L. Firnhaber, C. Stevens, W. Glencross, D. K. Sanne, I. Montaner, L. J. Foulkes, A. S. Adv Bioinformatics Research Article We demonstrate the application and comparative interpretations of three tree-based algorithms for the analysis of data arising from flow cytometry: classification and regression trees (CARTs), random forests (RFs), and logic regression (LR). Specifically, we consider the question of what best predicts CD4 T-cell recovery in HIV-1 infected persons starting antiretroviral therapy with CD4 count between 200 and 350 cell/μL. A comparison to a more standard contingency table analysis is provided. While contingency table analysis and RFs provide information on the importance of each potential predictor variable, CART and LR offer additional insight into the combinations of variables that together are predictive of the outcome. In all cases considered, baseline CD3-DR-CD56+CD16+ emerges as an important predictor variable, while the tree-based approaches identify additional variables as potentially informative. Application of tree-based methods to our data suggests that a combination of baseline immune activation states, with emphasis on CD8 T-cell activation, may be a better predictor than any single T-cell/innate cell subset analyzed. Taken together, we show that tree-based methods can be successfully applied to flow cytometry data to better inform and discover associations that may not emerge in the context of a univariate analysis. Hindawi Publishing Corporation 2009 2010-01-21 /pmc/articles/PMC2817388/ /pubmed/20145719 http://dx.doi.org/10.1155/2009/235320 Text en Copyright © 2009 M. Eliot et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Eliot, M. Azzoni, L. Firnhaber, C. Stevens, W. Glencross, D. K. Sanne, I. Montaner, L. J. Foulkes, A. S. Tree-Based Methods for Discovery of Association between Flow Cytometry Data and Clinical Endpoints |
title | Tree-Based Methods for Discovery of Association between Flow Cytometry Data and Clinical Endpoints |
title_full | Tree-Based Methods for Discovery of Association between Flow Cytometry Data and Clinical Endpoints |
title_fullStr | Tree-Based Methods for Discovery of Association between Flow Cytometry Data and Clinical Endpoints |
title_full_unstemmed | Tree-Based Methods for Discovery of Association between Flow Cytometry Data and Clinical Endpoints |
title_short | Tree-Based Methods for Discovery of Association between Flow Cytometry Data and Clinical Endpoints |
title_sort | tree-based methods for discovery of association between flow cytometry data and clinical endpoints |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2817388/ https://www.ncbi.nlm.nih.gov/pubmed/20145719 http://dx.doi.org/10.1155/2009/235320 |
work_keys_str_mv | AT eliotm treebasedmethodsfordiscoveryofassociationbetweenflowcytometrydataandclinicalendpoints AT azzonil treebasedmethodsfordiscoveryofassociationbetweenflowcytometrydataandclinicalendpoints AT firnhaberc treebasedmethodsfordiscoveryofassociationbetweenflowcytometrydataandclinicalendpoints AT stevensw treebasedmethodsfordiscoveryofassociationbetweenflowcytometrydataandclinicalendpoints AT glencrossdk treebasedmethodsfordiscoveryofassociationbetweenflowcytometrydataandclinicalendpoints AT sannei treebasedmethodsfordiscoveryofassociationbetweenflowcytometrydataandclinicalendpoints AT montanerlj treebasedmethodsfordiscoveryofassociationbetweenflowcytometrydataandclinicalendpoints AT foulkesas treebasedmethodsfordiscoveryofassociationbetweenflowcytometrydataandclinicalendpoints |