Cargando…

Patient-Specific Data Fusion Defines Prognostic Cancer Subtypes

Different data types can offer complementary perspectives on the same biological phenomenon. In cancer studies, for example, data on copy number alterations indicate losses and amplifications of genomic regions in tumours, while transcriptomic data point to the impact of genomic and environmental ev...

Descripción completa

Detalles Bibliográficos
Autores principales: Yuan, Yinyin, Savage, Richard S., Markowetz, Florian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3197649/
https://www.ncbi.nlm.nih.gov/pubmed/22028636
http://dx.doi.org/10.1371/journal.pcbi.1002227
_version_ 1782214347693490176
author Yuan, Yinyin
Savage, Richard S.
Markowetz, Florian
author_facet Yuan, Yinyin
Savage, Richard S.
Markowetz, Florian
author_sort Yuan, Yinyin
collection PubMed
description Different data types can offer complementary perspectives on the same biological phenomenon. In cancer studies, for example, data on copy number alterations indicate losses and amplifications of genomic regions in tumours, while transcriptomic data point to the impact of genomic and environmental events on the internal wiring of the cell. Fusing different data provides a more comprehensive model of the cancer cell than that offered by any single type. However, biological signals in different patients exhibit diverse degrees of concordance due to cancer heterogeneity and inherent noise in the measurements. This is a particularly important issue in cancer subtype discovery, where personalised strategies to guide therapy are of vital importance. We present a nonparametric Bayesian model for discovering prognostic cancer subtypes by integrating gene expression and copy number variation data. Our model is constructed from a hierarchy of Dirichlet Processes and addresses three key challenges in data fusion: (i) To separate concordant from discordant signals, (ii) to select informative features, (iii) to estimate the number of disease subtypes. Concordance of signals is assessed individually for each patient, giving us an additional level of insight into the underlying disease structure. We exemplify the power of our model in prostate cancer and breast cancer and show that it outperforms competing methods. In the prostate cancer data, we identify an entirely new subtype with extremely poor survival outcome and show how other analyses fail to detect it. In the breast cancer data, we find subtypes with superior prognostic value by using the concordant results. These discoveries were crucially dependent on our model's ability to distinguish concordant and discordant signals within each patient sample, and would otherwise have been missed. We therefore demonstrate the importance of taking a patient-specific approach, using highly-flexible nonparametric Bayesian methods.
format Online
Article
Text
id pubmed-3197649
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31976492011-10-25 Patient-Specific Data Fusion Defines Prognostic Cancer Subtypes Yuan, Yinyin Savage, Richard S. Markowetz, Florian PLoS Comput Biol Research Article Different data types can offer complementary perspectives on the same biological phenomenon. In cancer studies, for example, data on copy number alterations indicate losses and amplifications of genomic regions in tumours, while transcriptomic data point to the impact of genomic and environmental events on the internal wiring of the cell. Fusing different data provides a more comprehensive model of the cancer cell than that offered by any single type. However, biological signals in different patients exhibit diverse degrees of concordance due to cancer heterogeneity and inherent noise in the measurements. This is a particularly important issue in cancer subtype discovery, where personalised strategies to guide therapy are of vital importance. We present a nonparametric Bayesian model for discovering prognostic cancer subtypes by integrating gene expression and copy number variation data. Our model is constructed from a hierarchy of Dirichlet Processes and addresses three key challenges in data fusion: (i) To separate concordant from discordant signals, (ii) to select informative features, (iii) to estimate the number of disease subtypes. Concordance of signals is assessed individually for each patient, giving us an additional level of insight into the underlying disease structure. We exemplify the power of our model in prostate cancer and breast cancer and show that it outperforms competing methods. In the prostate cancer data, we identify an entirely new subtype with extremely poor survival outcome and show how other analyses fail to detect it. In the breast cancer data, we find subtypes with superior prognostic value by using the concordant results. These discoveries were crucially dependent on our model's ability to distinguish concordant and discordant signals within each patient sample, and would otherwise have been missed. We therefore demonstrate the importance of taking a patient-specific approach, using highly-flexible nonparametric Bayesian methods. Public Library of Science 2011-10-20 /pmc/articles/PMC3197649/ /pubmed/22028636 http://dx.doi.org/10.1371/journal.pcbi.1002227 Text en Yuan et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Yuan, Yinyin
Savage, Richard S.
Markowetz, Florian
Patient-Specific Data Fusion Defines Prognostic Cancer Subtypes
title Patient-Specific Data Fusion Defines Prognostic Cancer Subtypes
title_full Patient-Specific Data Fusion Defines Prognostic Cancer Subtypes
title_fullStr Patient-Specific Data Fusion Defines Prognostic Cancer Subtypes
title_full_unstemmed Patient-Specific Data Fusion Defines Prognostic Cancer Subtypes
title_short Patient-Specific Data Fusion Defines Prognostic Cancer Subtypes
title_sort patient-specific data fusion defines prognostic cancer subtypes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3197649/
https://www.ncbi.nlm.nih.gov/pubmed/22028636
http://dx.doi.org/10.1371/journal.pcbi.1002227
work_keys_str_mv AT yuanyinyin patientspecificdatafusiondefinesprognosticcancersubtypes
AT savagerichards patientspecificdatafusiondefinesprognosticcancersubtypes
AT markowetzflorian patientspecificdatafusiondefinesprognosticcancersubtypes