Cargando…
Bagging survival tree procedure for variable selection and prediction in the presence of nonsusceptible patients
BACKGROUND: For clinical genomic studies with high-dimensional datasets, tree-based ensemble methods offer a powerful solution for variable selection and prediction taking into account the complex interrelationships between explanatory variables. One of the key component of the tree-building process...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4895817/ https://www.ncbi.nlm.nih.gov/pubmed/27266372 http://dx.doi.org/10.1186/s12859-016-1090-x |
_version_ | 1782435930814021632 |
---|---|
author | Mbogning, Cyprien Broët, Philippe |
author_facet | Mbogning, Cyprien Broët, Philippe |
author_sort | Mbogning, Cyprien |
collection | PubMed |
description | BACKGROUND: For clinical genomic studies with high-dimensional datasets, tree-based ensemble methods offer a powerful solution for variable selection and prediction taking into account the complex interrelationships between explanatory variables. One of the key component of the tree-building process is the splitting criterion. For survival data, the classical splitting criterion is the Logrank statistic. However, the presence of a fraction of nonsusceptible patients in the studied population advocates for considering a criterion tailored to this peculiar situation. RESULTS: We propose a bagging survival tree procedure for variable selection and prediction where the survival tree-building process relies on a splitting criterion that explicitly focuses on time-to-event survival distribution among susceptible patients. A simulation study shows that our method achieves good performance for the variable selection and prediction. Different criteria for evaluating the importance of the explanatory variables and the prediction performance are reported. Our procedure is illustrated on a genomic dataset with gene expression measurements from early breast cancer patients. CONCLUSIONS: In the presence of nonsusceptible patients among the studied population, our procedure represents an efficient way to select event-related explanatory covariates with potential higher-order interaction and identify homogeneous groups of susceptible patients. |
format | Online Article Text |
id | pubmed-4895817 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-48958172016-06-10 Bagging survival tree procedure for variable selection and prediction in the presence of nonsusceptible patients Mbogning, Cyprien Broët, Philippe BMC Bioinformatics Methodology Article BACKGROUND: For clinical genomic studies with high-dimensional datasets, tree-based ensemble methods offer a powerful solution for variable selection and prediction taking into account the complex interrelationships between explanatory variables. One of the key component of the tree-building process is the splitting criterion. For survival data, the classical splitting criterion is the Logrank statistic. However, the presence of a fraction of nonsusceptible patients in the studied population advocates for considering a criterion tailored to this peculiar situation. RESULTS: We propose a bagging survival tree procedure for variable selection and prediction where the survival tree-building process relies on a splitting criterion that explicitly focuses on time-to-event survival distribution among susceptible patients. A simulation study shows that our method achieves good performance for the variable selection and prediction. Different criteria for evaluating the importance of the explanatory variables and the prediction performance are reported. Our procedure is illustrated on a genomic dataset with gene expression measurements from early breast cancer patients. CONCLUSIONS: In the presence of nonsusceptible patients among the studied population, our procedure represents an efficient way to select event-related explanatory covariates with potential higher-order interaction and identify homogeneous groups of susceptible patients. BioMed Central 2016-06-07 /pmc/articles/PMC4895817/ /pubmed/27266372 http://dx.doi.org/10.1186/s12859-016-1090-x Text en © Mbogning and Broët. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Mbogning, Cyprien Broët, Philippe Bagging survival tree procedure for variable selection and prediction in the presence of nonsusceptible patients |
title | Bagging survival tree procedure for variable selection and prediction in the presence of nonsusceptible patients |
title_full | Bagging survival tree procedure for variable selection and prediction in the presence of nonsusceptible patients |
title_fullStr | Bagging survival tree procedure for variable selection and prediction in the presence of nonsusceptible patients |
title_full_unstemmed | Bagging survival tree procedure for variable selection and prediction in the presence of nonsusceptible patients |
title_short | Bagging survival tree procedure for variable selection and prediction in the presence of nonsusceptible patients |
title_sort | bagging survival tree procedure for variable selection and prediction in the presence of nonsusceptible patients |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4895817/ https://www.ncbi.nlm.nih.gov/pubmed/27266372 http://dx.doi.org/10.1186/s12859-016-1090-x |
work_keys_str_mv | AT mbogningcyprien baggingsurvivaltreeprocedureforvariableselectionandpredictioninthepresenceofnonsusceptiblepatients AT broetphilippe baggingsurvivaltreeprocedureforvariableselectionandpredictioninthepresenceofnonsusceptiblepatients |