Cargando…

Competing Risks Data Analysis with High-dimensional Covariates: An Application in Bladder Cancer

Analysis of microarray data is associated with the methodological problems of high dimension and small sample size. Various methods have been used for variable selection in high-dimension and small sample size cases with a single survival endpoint. However, little effort has been directed toward add...

Descripción completa

Detalles Bibliográficos
Autores principales: Tapak, Leili, Saidijam, Massoud, Sadeghifar, Majid, Poorolajal, Jalal, Mahjub, Hossein
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4563215/
https://www.ncbi.nlm.nih.gov/pubmed/25907251
http://dx.doi.org/10.1016/j.gpb.2015.04.001
_version_ 1782389269522808832
author Tapak, Leili
Saidijam, Massoud
Sadeghifar, Majid
Poorolajal, Jalal
Mahjub, Hossein
author_facet Tapak, Leili
Saidijam, Massoud
Sadeghifar, Majid
Poorolajal, Jalal
Mahjub, Hossein
author_sort Tapak, Leili
collection PubMed
description Analysis of microarray data is associated with the methodological problems of high dimension and small sample size. Various methods have been used for variable selection in high-dimension and small sample size cases with a single survival endpoint. However, little effort has been directed toward addressing competing risks where there is more than one failure risks. This study compared three typical variable selection techniques including Lasso, elastic net, and likelihood-based boosting for high-dimensional time-to-event data with competing risks. The performance of these methods was evaluated via a simulation study by analyzing a real dataset related to bladder cancer patients using time-dependent receiver operator characteristic (ROC) curve and bootstrap .632+ prediction error curves. The elastic net penalization method was shown to outperform Lasso and boosting. Based on the elastic net, 33 genes out of 1381 genes related to bladder cancer were selected. By fitting to the Fine and Gray model, eight genes were highly significant (P < 0.001). Among them, expression of RTN4, SON, IGF1R, SNRPE, PTGR1, PLEK, and ETFDH was associated with a decrease in survival time, whereas SMARCAD1 expression was associated with an increase in survival time. This study indicates that the elastic net has a higher capacity than the Lasso and boosting for the prediction of survival time in bladder cancer patients. Moreover, genes selected by all methods improved the predictive power of the model based on only clinical variables, indicating the value of information contained in the microarray features.
format Online
Article
Text
id pubmed-4563215
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-45632152015-09-30 Competing Risks Data Analysis with High-dimensional Covariates: An Application in Bladder Cancer Tapak, Leili Saidijam, Massoud Sadeghifar, Majid Poorolajal, Jalal Mahjub, Hossein Genomics Proteomics Bioinformatics Original Research Analysis of microarray data is associated with the methodological problems of high dimension and small sample size. Various methods have been used for variable selection in high-dimension and small sample size cases with a single survival endpoint. However, little effort has been directed toward addressing competing risks where there is more than one failure risks. This study compared three typical variable selection techniques including Lasso, elastic net, and likelihood-based boosting for high-dimensional time-to-event data with competing risks. The performance of these methods was evaluated via a simulation study by analyzing a real dataset related to bladder cancer patients using time-dependent receiver operator characteristic (ROC) curve and bootstrap .632+ prediction error curves. The elastic net penalization method was shown to outperform Lasso and boosting. Based on the elastic net, 33 genes out of 1381 genes related to bladder cancer were selected. By fitting to the Fine and Gray model, eight genes were highly significant (P < 0.001). Among them, expression of RTN4, SON, IGF1R, SNRPE, PTGR1, PLEK, and ETFDH was associated with a decrease in survival time, whereas SMARCAD1 expression was associated with an increase in survival time. This study indicates that the elastic net has a higher capacity than the Lasso and boosting for the prediction of survival time in bladder cancer patients. Moreover, genes selected by all methods improved the predictive power of the model based on only clinical variables, indicating the value of information contained in the microarray features. Elsevier 2015-06 2015-04-20 /pmc/articles/PMC4563215/ /pubmed/25907251 http://dx.doi.org/10.1016/j.gpb.2015.04.001 Text en © 2015 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Original Research
Tapak, Leili
Saidijam, Massoud
Sadeghifar, Majid
Poorolajal, Jalal
Mahjub, Hossein
Competing Risks Data Analysis with High-dimensional Covariates: An Application in Bladder Cancer
title Competing Risks Data Analysis with High-dimensional Covariates: An Application in Bladder Cancer
title_full Competing Risks Data Analysis with High-dimensional Covariates: An Application in Bladder Cancer
title_fullStr Competing Risks Data Analysis with High-dimensional Covariates: An Application in Bladder Cancer
title_full_unstemmed Competing Risks Data Analysis with High-dimensional Covariates: An Application in Bladder Cancer
title_short Competing Risks Data Analysis with High-dimensional Covariates: An Application in Bladder Cancer
title_sort competing risks data analysis with high-dimensional covariates: an application in bladder cancer
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4563215/
https://www.ncbi.nlm.nih.gov/pubmed/25907251
http://dx.doi.org/10.1016/j.gpb.2015.04.001
work_keys_str_mv AT tapakleili competingrisksdataanalysiswithhighdimensionalcovariatesanapplicationinbladdercancer
AT saidijammassoud competingrisksdataanalysiswithhighdimensionalcovariatesanapplicationinbladdercancer
AT sadeghifarmajid competingrisksdataanalysiswithhighdimensionalcovariatesanapplicationinbladdercancer
AT poorolajaljalal competingrisksdataanalysiswithhighdimensionalcovariatesanapplicationinbladdercancer
AT mahjubhossein competingrisksdataanalysiswithhighdimensionalcovariatesanapplicationinbladdercancer