Cargando…

Tree-Based Methods for Discovery of Association between Flow Cytometry Data and Clinical Endpoints

We demonstrate the application and comparative interpretations of three tree-based algorithms for the analysis of data arising from flow cytometry: classification and regression trees (CARTs), random forests (RFs), and logic regression (LR). Specifically, we consider the question of what best predic...

Descripción completa

Detalles Bibliográficos
Autores principales: Eliot, M., Azzoni, L., Firnhaber, C., Stevens, W., Glencross, D. K., Sanne, I., Montaner, L. J., Foulkes, A. S.
Formato: Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2817388/
https://www.ncbi.nlm.nih.gov/pubmed/20145719
http://dx.doi.org/10.1155/2009/235320
_version_ 1782177184241156096
author Eliot, M.
Azzoni, L.
Firnhaber, C.
Stevens, W.
Glencross, D. K.
Sanne, I.
Montaner, L. J.
Foulkes, A. S.
author_facet Eliot, M.
Azzoni, L.
Firnhaber, C.
Stevens, W.
Glencross, D. K.
Sanne, I.
Montaner, L. J.
Foulkes, A. S.
author_sort Eliot, M.
collection PubMed
description We demonstrate the application and comparative interpretations of three tree-based algorithms for the analysis of data arising from flow cytometry: classification and regression trees (CARTs), random forests (RFs), and logic regression (LR). Specifically, we consider the question of what best predicts CD4 T-cell recovery in HIV-1 infected persons starting antiretroviral therapy with CD4 count between 200 and 350 cell/μL. A comparison to a more standard contingency table analysis is provided. While contingency table analysis and RFs provide information on the importance of each potential predictor variable, CART and LR offer additional insight into the combinations of variables that together are predictive of the outcome. In all cases considered, baseline CD3-DR-CD56+CD16+ emerges as an important predictor variable, while the tree-based approaches identify additional variables as potentially informative. Application of tree-based methods to our data suggests that a combination of baseline immune activation states, with emphasis on CD8 T-cell activation, may be a better predictor than any single T-cell/innate cell subset analyzed. Taken together, we show that tree-based methods can be successfully applied to flow cytometry data to better inform and discover associations that may not emerge in the context of a univariate analysis.
format Text
id pubmed-2817388
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-28173882010-02-09 Tree-Based Methods for Discovery of Association between Flow Cytometry Data and Clinical Endpoints Eliot, M. Azzoni, L. Firnhaber, C. Stevens, W. Glencross, D. K. Sanne, I. Montaner, L. J. Foulkes, A. S. Adv Bioinformatics Research Article We demonstrate the application and comparative interpretations of three tree-based algorithms for the analysis of data arising from flow cytometry: classification and regression trees (CARTs), random forests (RFs), and logic regression (LR). Specifically, we consider the question of what best predicts CD4 T-cell recovery in HIV-1 infected persons starting antiretroviral therapy with CD4 count between 200 and 350 cell/μL. A comparison to a more standard contingency table analysis is provided. While contingency table analysis and RFs provide information on the importance of each potential predictor variable, CART and LR offer additional insight into the combinations of variables that together are predictive of the outcome. In all cases considered, baseline CD3-DR-CD56+CD16+ emerges as an important predictor variable, while the tree-based approaches identify additional variables as potentially informative. Application of tree-based methods to our data suggests that a combination of baseline immune activation states, with emphasis on CD8 T-cell activation, may be a better predictor than any single T-cell/innate cell subset analyzed. Taken together, we show that tree-based methods can be successfully applied to flow cytometry data to better inform and discover associations that may not emerge in the context of a univariate analysis. Hindawi Publishing Corporation 2009 2010-01-21 /pmc/articles/PMC2817388/ /pubmed/20145719 http://dx.doi.org/10.1155/2009/235320 Text en Copyright © 2009 M. Eliot et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Eliot, M.
Azzoni, L.
Firnhaber, C.
Stevens, W.
Glencross, D. K.
Sanne, I.
Montaner, L. J.
Foulkes, A. S.
Tree-Based Methods for Discovery of Association between Flow Cytometry Data and Clinical Endpoints
title Tree-Based Methods for Discovery of Association between Flow Cytometry Data and Clinical Endpoints
title_full Tree-Based Methods for Discovery of Association between Flow Cytometry Data and Clinical Endpoints
title_fullStr Tree-Based Methods for Discovery of Association between Flow Cytometry Data and Clinical Endpoints
title_full_unstemmed Tree-Based Methods for Discovery of Association between Flow Cytometry Data and Clinical Endpoints
title_short Tree-Based Methods for Discovery of Association between Flow Cytometry Data and Clinical Endpoints
title_sort tree-based methods for discovery of association between flow cytometry data and clinical endpoints
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2817388/
https://www.ncbi.nlm.nih.gov/pubmed/20145719
http://dx.doi.org/10.1155/2009/235320
work_keys_str_mv AT eliotm treebasedmethodsfordiscoveryofassociationbetweenflowcytometrydataandclinicalendpoints
AT azzonil treebasedmethodsfordiscoveryofassociationbetweenflowcytometrydataandclinicalendpoints
AT firnhaberc treebasedmethodsfordiscoveryofassociationbetweenflowcytometrydataandclinicalendpoints
AT stevensw treebasedmethodsfordiscoveryofassociationbetweenflowcytometrydataandclinicalendpoints
AT glencrossdk treebasedmethodsfordiscoveryofassociationbetweenflowcytometrydataandclinicalendpoints
AT sannei treebasedmethodsfordiscoveryofassociationbetweenflowcytometrydataandclinicalendpoints
AT montanerlj treebasedmethodsfordiscoveryofassociationbetweenflowcytometrydataandclinicalendpoints
AT foulkesas treebasedmethodsfordiscoveryofassociationbetweenflowcytometrydataandclinicalendpoints