Cargando…

Discrete mixture modeling to address genetic heterogeneity in time-to-event regression

Motivation: Time-to-event regression models are a critical tool for associating survival time outcomes with molecular data. Despite mounting evidence that genetic subgroups of the same clinical disease exist, little attention has been given to exploring how this heterogeneity affects time-to-event m...

Descripción completa

Detalles Bibliográficos
Autores principales: Eng, Kevin H., Hanlon, Bret M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4058947/
https://www.ncbi.nlm.nih.gov/pubmed/24532723
http://dx.doi.org/10.1093/bioinformatics/btu065
_version_ 1782321191951794176
author Eng, Kevin H.
Hanlon, Bret M.
author_facet Eng, Kevin H.
Hanlon, Bret M.
author_sort Eng, Kevin H.
collection PubMed
description Motivation: Time-to-event regression models are a critical tool for associating survival time outcomes with molecular data. Despite mounting evidence that genetic subgroups of the same clinical disease exist, little attention has been given to exploring how this heterogeneity affects time-to-event model building and how to accommodate it. Methods able to diagnose and model heterogeneity should be valuable additions to the biomarker discovery toolset. Results: We propose a mixture of survival functions that classifies subjects with similar relationships to a time-to-event response. This model incorporates multivariate regression and model selection and can be fit with an expectation maximization algorithm, we call Cox-assisted clustering. We illustrate a likely manifestation of genetic heterogeneity and demonstrate how it may affect survival models with little warning. An application to gene expression in ovarian cancer DNA repair pathways illustrates how the model may be used to learn new genetic subsets for risk stratification. We explore the implications of this model for censored observations and the effect on genomic predictors and diagnostic analysis. Availability and implementation: R implementation of CAC using standard packages is available at https://gist.github.com/programeng/8620b85146b14b6edf8f Data used in the analysis are publicly available. Contact: kevin.eng@roswellpark.org Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4058947
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-40589472014-06-18 Discrete mixture modeling to address genetic heterogeneity in time-to-event regression Eng, Kevin H. Hanlon, Bret M. Bioinformatics Original Papers Motivation: Time-to-event regression models are a critical tool for associating survival time outcomes with molecular data. Despite mounting evidence that genetic subgroups of the same clinical disease exist, little attention has been given to exploring how this heterogeneity affects time-to-event model building and how to accommodate it. Methods able to diagnose and model heterogeneity should be valuable additions to the biomarker discovery toolset. Results: We propose a mixture of survival functions that classifies subjects with similar relationships to a time-to-event response. This model incorporates multivariate regression and model selection and can be fit with an expectation maximization algorithm, we call Cox-assisted clustering. We illustrate a likely manifestation of genetic heterogeneity and demonstrate how it may affect survival models with little warning. An application to gene expression in ovarian cancer DNA repair pathways illustrates how the model may be used to learn new genetic subsets for risk stratification. We explore the implications of this model for censored observations and the effect on genomic predictors and diagnostic analysis. Availability and implementation: R implementation of CAC using standard packages is available at https://gist.github.com/programeng/8620b85146b14b6edf8f Data used in the analysis are publicly available. Contact: kevin.eng@roswellpark.org Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2014-06-15 2014-02-14 /pmc/articles/PMC4058947/ /pubmed/24532723 http://dx.doi.org/10.1093/bioinformatics/btu065 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Eng, Kevin H.
Hanlon, Bret M.
Discrete mixture modeling to address genetic heterogeneity in time-to-event regression
title Discrete mixture modeling to address genetic heterogeneity in time-to-event regression
title_full Discrete mixture modeling to address genetic heterogeneity in time-to-event regression
title_fullStr Discrete mixture modeling to address genetic heterogeneity in time-to-event regression
title_full_unstemmed Discrete mixture modeling to address genetic heterogeneity in time-to-event regression
title_short Discrete mixture modeling to address genetic heterogeneity in time-to-event regression
title_sort discrete mixture modeling to address genetic heterogeneity in time-to-event regression
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4058947/
https://www.ncbi.nlm.nih.gov/pubmed/24532723
http://dx.doi.org/10.1093/bioinformatics/btu065
work_keys_str_mv AT engkevinh discretemixturemodelingtoaddressgeneticheterogeneityintimetoeventregression
AT hanlonbretm discretemixturemodelingtoaddressgeneticheterogeneityintimetoeventregression