Cargando…
Discrete mixture modeling to address genetic heterogeneity in time-to-event regression
Motivation: Time-to-event regression models are a critical tool for associating survival time outcomes with molecular data. Despite mounting evidence that genetic subgroups of the same clinical disease exist, little attention has been given to exploring how this heterogeneity affects time-to-event m...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4058947/ https://www.ncbi.nlm.nih.gov/pubmed/24532723 http://dx.doi.org/10.1093/bioinformatics/btu065 |
_version_ | 1782321191951794176 |
---|---|
author | Eng, Kevin H. Hanlon, Bret M. |
author_facet | Eng, Kevin H. Hanlon, Bret M. |
author_sort | Eng, Kevin H. |
collection | PubMed |
description | Motivation: Time-to-event regression models are a critical tool for associating survival time outcomes with molecular data. Despite mounting evidence that genetic subgroups of the same clinical disease exist, little attention has been given to exploring how this heterogeneity affects time-to-event model building and how to accommodate it. Methods able to diagnose and model heterogeneity should be valuable additions to the biomarker discovery toolset. Results: We propose a mixture of survival functions that classifies subjects with similar relationships to a time-to-event response. This model incorporates multivariate regression and model selection and can be fit with an expectation maximization algorithm, we call Cox-assisted clustering. We illustrate a likely manifestation of genetic heterogeneity and demonstrate how it may affect survival models with little warning. An application to gene expression in ovarian cancer DNA repair pathways illustrates how the model may be used to learn new genetic subsets for risk stratification. We explore the implications of this model for censored observations and the effect on genomic predictors and diagnostic analysis. Availability and implementation: R implementation of CAC using standard packages is available at https://gist.github.com/programeng/8620b85146b14b6edf8f Data used in the analysis are publicly available. Contact: kevin.eng@roswellpark.org Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-4058947 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-40589472014-06-18 Discrete mixture modeling to address genetic heterogeneity in time-to-event regression Eng, Kevin H. Hanlon, Bret M. Bioinformatics Original Papers Motivation: Time-to-event regression models are a critical tool for associating survival time outcomes with molecular data. Despite mounting evidence that genetic subgroups of the same clinical disease exist, little attention has been given to exploring how this heterogeneity affects time-to-event model building and how to accommodate it. Methods able to diagnose and model heterogeneity should be valuable additions to the biomarker discovery toolset. Results: We propose a mixture of survival functions that classifies subjects with similar relationships to a time-to-event response. This model incorporates multivariate regression and model selection and can be fit with an expectation maximization algorithm, we call Cox-assisted clustering. We illustrate a likely manifestation of genetic heterogeneity and demonstrate how it may affect survival models with little warning. An application to gene expression in ovarian cancer DNA repair pathways illustrates how the model may be used to learn new genetic subsets for risk stratification. We explore the implications of this model for censored observations and the effect on genomic predictors and diagnostic analysis. Availability and implementation: R implementation of CAC using standard packages is available at https://gist.github.com/programeng/8620b85146b14b6edf8f Data used in the analysis are publicly available. Contact: kevin.eng@roswellpark.org Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2014-06-15 2014-02-14 /pmc/articles/PMC4058947/ /pubmed/24532723 http://dx.doi.org/10.1093/bioinformatics/btu065 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Eng, Kevin H. Hanlon, Bret M. Discrete mixture modeling to address genetic heterogeneity in time-to-event regression |
title | Discrete mixture modeling to address genetic heterogeneity in time-to-event regression |
title_full | Discrete mixture modeling to address genetic heterogeneity in time-to-event regression |
title_fullStr | Discrete mixture modeling to address genetic heterogeneity in time-to-event regression |
title_full_unstemmed | Discrete mixture modeling to address genetic heterogeneity in time-to-event regression |
title_short | Discrete mixture modeling to address genetic heterogeneity in time-to-event regression |
title_sort | discrete mixture modeling to address genetic heterogeneity in time-to-event regression |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4058947/ https://www.ncbi.nlm.nih.gov/pubmed/24532723 http://dx.doi.org/10.1093/bioinformatics/btu065 |
work_keys_str_mv | AT engkevinh discretemixturemodelingtoaddressgeneticheterogeneityintimetoeventregression AT hanlonbretm discretemixturemodelingtoaddressgeneticheterogeneityintimetoeventregression |