Cargando…

Bagging survival tree procedure for variable selection and prediction in the presence of nonsusceptible patients

BACKGROUND: For clinical genomic studies with high-dimensional datasets, tree-based ensemble methods offer a powerful solution for variable selection and prediction taking into account the complex interrelationships between explanatory variables. One of the key component of the tree-building process...

Descripción completa

Detalles Bibliográficos
Autores principales: Mbogning, Cyprien, Broët, Philippe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4895817/
https://www.ncbi.nlm.nih.gov/pubmed/27266372
http://dx.doi.org/10.1186/s12859-016-1090-x
_version_ 1782435930814021632
author Mbogning, Cyprien
Broët, Philippe
author_facet Mbogning, Cyprien
Broët, Philippe
author_sort Mbogning, Cyprien
collection PubMed
description BACKGROUND: For clinical genomic studies with high-dimensional datasets, tree-based ensemble methods offer a powerful solution for variable selection and prediction taking into account the complex interrelationships between explanatory variables. One of the key component of the tree-building process is the splitting criterion. For survival data, the classical splitting criterion is the Logrank statistic. However, the presence of a fraction of nonsusceptible patients in the studied population advocates for considering a criterion tailored to this peculiar situation. RESULTS: We propose a bagging survival tree procedure for variable selection and prediction where the survival tree-building process relies on a splitting criterion that explicitly focuses on time-to-event survival distribution among susceptible patients. A simulation study shows that our method achieves good performance for the variable selection and prediction. Different criteria for evaluating the importance of the explanatory variables and the prediction performance are reported. Our procedure is illustrated on a genomic dataset with gene expression measurements from early breast cancer patients. CONCLUSIONS: In the presence of nonsusceptible patients among the studied population, our procedure represents an efficient way to select event-related explanatory covariates with potential higher-order interaction and identify homogeneous groups of susceptible patients.
format Online
Article
Text
id pubmed-4895817
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-48958172016-06-10 Bagging survival tree procedure for variable selection and prediction in the presence of nonsusceptible patients Mbogning, Cyprien Broët, Philippe BMC Bioinformatics Methodology Article BACKGROUND: For clinical genomic studies with high-dimensional datasets, tree-based ensemble methods offer a powerful solution for variable selection and prediction taking into account the complex interrelationships between explanatory variables. One of the key component of the tree-building process is the splitting criterion. For survival data, the classical splitting criterion is the Logrank statistic. However, the presence of a fraction of nonsusceptible patients in the studied population advocates for considering a criterion tailored to this peculiar situation. RESULTS: We propose a bagging survival tree procedure for variable selection and prediction where the survival tree-building process relies on a splitting criterion that explicitly focuses on time-to-event survival distribution among susceptible patients. A simulation study shows that our method achieves good performance for the variable selection and prediction. Different criteria for evaluating the importance of the explanatory variables and the prediction performance are reported. Our procedure is illustrated on a genomic dataset with gene expression measurements from early breast cancer patients. CONCLUSIONS: In the presence of nonsusceptible patients among the studied population, our procedure represents an efficient way to select event-related explanatory covariates with potential higher-order interaction and identify homogeneous groups of susceptible patients. BioMed Central 2016-06-07 /pmc/articles/PMC4895817/ /pubmed/27266372 http://dx.doi.org/10.1186/s12859-016-1090-x Text en © Mbogning and Broët. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Mbogning, Cyprien
Broët, Philippe
Bagging survival tree procedure for variable selection and prediction in the presence of nonsusceptible patients
title Bagging survival tree procedure for variable selection and prediction in the presence of nonsusceptible patients
title_full Bagging survival tree procedure for variable selection and prediction in the presence of nonsusceptible patients
title_fullStr Bagging survival tree procedure for variable selection and prediction in the presence of nonsusceptible patients
title_full_unstemmed Bagging survival tree procedure for variable selection and prediction in the presence of nonsusceptible patients
title_short Bagging survival tree procedure for variable selection and prediction in the presence of nonsusceptible patients
title_sort bagging survival tree procedure for variable selection and prediction in the presence of nonsusceptible patients
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4895817/
https://www.ncbi.nlm.nih.gov/pubmed/27266372
http://dx.doi.org/10.1186/s12859-016-1090-x
work_keys_str_mv AT mbogningcyprien baggingsurvivaltreeprocedureforvariableselectionandpredictioninthepresenceofnonsusceptiblepatients
AT broetphilippe baggingsurvivaltreeprocedureforvariableselectionandpredictioninthepresenceofnonsusceptiblepatients