Cargando…
Patient-Specific Data Fusion Defines Prognostic Cancer Subtypes
Different data types can offer complementary perspectives on the same biological phenomenon. In cancer studies, for example, data on copy number alterations indicate losses and amplifications of genomic regions in tumours, while transcriptomic data point to the impact of genomic and environmental ev...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3197649/ https://www.ncbi.nlm.nih.gov/pubmed/22028636 http://dx.doi.org/10.1371/journal.pcbi.1002227 |
_version_ | 1782214347693490176 |
---|---|
author | Yuan, Yinyin Savage, Richard S. Markowetz, Florian |
author_facet | Yuan, Yinyin Savage, Richard S. Markowetz, Florian |
author_sort | Yuan, Yinyin |
collection | PubMed |
description | Different data types can offer complementary perspectives on the same biological phenomenon. In cancer studies, for example, data on copy number alterations indicate losses and amplifications of genomic regions in tumours, while transcriptomic data point to the impact of genomic and environmental events on the internal wiring of the cell. Fusing different data provides a more comprehensive model of the cancer cell than that offered by any single type. However, biological signals in different patients exhibit diverse degrees of concordance due to cancer heterogeneity and inherent noise in the measurements. This is a particularly important issue in cancer subtype discovery, where personalised strategies to guide therapy are of vital importance. We present a nonparametric Bayesian model for discovering prognostic cancer subtypes by integrating gene expression and copy number variation data. Our model is constructed from a hierarchy of Dirichlet Processes and addresses three key challenges in data fusion: (i) To separate concordant from discordant signals, (ii) to select informative features, (iii) to estimate the number of disease subtypes. Concordance of signals is assessed individually for each patient, giving us an additional level of insight into the underlying disease structure. We exemplify the power of our model in prostate cancer and breast cancer and show that it outperforms competing methods. In the prostate cancer data, we identify an entirely new subtype with extremely poor survival outcome and show how other analyses fail to detect it. In the breast cancer data, we find subtypes with superior prognostic value by using the concordant results. These discoveries were crucially dependent on our model's ability to distinguish concordant and discordant signals within each patient sample, and would otherwise have been missed. We therefore demonstrate the importance of taking a patient-specific approach, using highly-flexible nonparametric Bayesian methods. |
format | Online Article Text |
id | pubmed-3197649 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-31976492011-10-25 Patient-Specific Data Fusion Defines Prognostic Cancer Subtypes Yuan, Yinyin Savage, Richard S. Markowetz, Florian PLoS Comput Biol Research Article Different data types can offer complementary perspectives on the same biological phenomenon. In cancer studies, for example, data on copy number alterations indicate losses and amplifications of genomic regions in tumours, while transcriptomic data point to the impact of genomic and environmental events on the internal wiring of the cell. Fusing different data provides a more comprehensive model of the cancer cell than that offered by any single type. However, biological signals in different patients exhibit diverse degrees of concordance due to cancer heterogeneity and inherent noise in the measurements. This is a particularly important issue in cancer subtype discovery, where personalised strategies to guide therapy are of vital importance. We present a nonparametric Bayesian model for discovering prognostic cancer subtypes by integrating gene expression and copy number variation data. Our model is constructed from a hierarchy of Dirichlet Processes and addresses three key challenges in data fusion: (i) To separate concordant from discordant signals, (ii) to select informative features, (iii) to estimate the number of disease subtypes. Concordance of signals is assessed individually for each patient, giving us an additional level of insight into the underlying disease structure. We exemplify the power of our model in prostate cancer and breast cancer and show that it outperforms competing methods. In the prostate cancer data, we identify an entirely new subtype with extremely poor survival outcome and show how other analyses fail to detect it. In the breast cancer data, we find subtypes with superior prognostic value by using the concordant results. These discoveries were crucially dependent on our model's ability to distinguish concordant and discordant signals within each patient sample, and would otherwise have been missed. We therefore demonstrate the importance of taking a patient-specific approach, using highly-flexible nonparametric Bayesian methods. Public Library of Science 2011-10-20 /pmc/articles/PMC3197649/ /pubmed/22028636 http://dx.doi.org/10.1371/journal.pcbi.1002227 Text en Yuan et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Yuan, Yinyin Savage, Richard S. Markowetz, Florian Patient-Specific Data Fusion Defines Prognostic Cancer Subtypes |
title | Patient-Specific Data Fusion Defines Prognostic Cancer Subtypes |
title_full | Patient-Specific Data Fusion Defines Prognostic Cancer Subtypes |
title_fullStr | Patient-Specific Data Fusion Defines Prognostic Cancer Subtypes |
title_full_unstemmed | Patient-Specific Data Fusion Defines Prognostic Cancer Subtypes |
title_short | Patient-Specific Data Fusion Defines Prognostic Cancer Subtypes |
title_sort | patient-specific data fusion defines prognostic cancer subtypes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3197649/ https://www.ncbi.nlm.nih.gov/pubmed/22028636 http://dx.doi.org/10.1371/journal.pcbi.1002227 |
work_keys_str_mv | AT yuanyinyin patientspecificdatafusiondefinesprognosticcancersubtypes AT savagerichards patientspecificdatafusiondefinesprognosticcancersubtypes AT markowetzflorian patientspecificdatafusiondefinesprognosticcancersubtypes |