Cargando…
A simplicial complex-based approach to unmixing tumor progression data
BACKGROUND: Tumorigenesis is an evolutionary process by which tumor cells acquire mutations through successive diversification and differentiation. There is much interest in reconstructing this process of evolution due to its relevance to identifying drivers of mutation and predicting future prognos...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4534068/ https://www.ncbi.nlm.nih.gov/pubmed/26264682 http://dx.doi.org/10.1186/s12859-015-0694-x |
_version_ | 1782385407694995456 |
---|---|
author | Roman, Theodore Nayyeri, Amir Fasy, Brittany Terese Schwartz, Russell |
author_facet | Roman, Theodore Nayyeri, Amir Fasy, Brittany Terese Schwartz, Russell |
author_sort | Roman, Theodore |
collection | PubMed |
description | BACKGROUND: Tumorigenesis is an evolutionary process by which tumor cells acquire mutations through successive diversification and differentiation. There is much interest in reconstructing this process of evolution due to its relevance to identifying drivers of mutation and predicting future prognosis and drug response. Efforts are challenged by high tumor heterogeneity, though, both within and among patients. In prior work, we showed that this heterogeneity could be turned into an advantage by computationally reconstructing models of cell populations mixed to different degrees in distinct tumors. Such mixed membership model approaches, however, are still limited in their ability to dissect more than a few well-conserved cell populations across a tumor data set. RESULTS: We present a method to improve on current mixed membership model approaches by better accounting for conserved progression pathways between subsets of cancers, which imply a structure to the data that has not previously been exploited. We extend our prior methods, which use an interpretation of the mixture problem as that of reconstructing simple geometric objects called simplices, to instead search for structured unions of simplices called simplicial complexes that one would expect to emerge from mixture processes describing branches along an evolutionary tree. We further improve on the prior work with a novel objective function to better identify mixtures corresponding to parsimonious evolutionary tree models. We demonstrate that this approach improves on our ability to accurately resolve mixtures on simulated data sets and demonstrate its practical applicability on a large RNASeq tumor data set. CONCLUSIONS: Better exploiting the expected geometric structure for mixed membership models produced from common evolutionary trees allows us to quickly and accurately reconstruct models of cell populations sampled from those trees. In the process, we hope to develop a better understanding of tumor evolution as well as other biological problems that involve interpreting genomic data gathered from heterogeneous populations of cells. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0694-x) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4534068 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-45340682015-08-13 A simplicial complex-based approach to unmixing tumor progression data Roman, Theodore Nayyeri, Amir Fasy, Brittany Terese Schwartz, Russell BMC Bioinformatics Methodology Article BACKGROUND: Tumorigenesis is an evolutionary process by which tumor cells acquire mutations through successive diversification and differentiation. There is much interest in reconstructing this process of evolution due to its relevance to identifying drivers of mutation and predicting future prognosis and drug response. Efforts are challenged by high tumor heterogeneity, though, both within and among patients. In prior work, we showed that this heterogeneity could be turned into an advantage by computationally reconstructing models of cell populations mixed to different degrees in distinct tumors. Such mixed membership model approaches, however, are still limited in their ability to dissect more than a few well-conserved cell populations across a tumor data set. RESULTS: We present a method to improve on current mixed membership model approaches by better accounting for conserved progression pathways between subsets of cancers, which imply a structure to the data that has not previously been exploited. We extend our prior methods, which use an interpretation of the mixture problem as that of reconstructing simple geometric objects called simplices, to instead search for structured unions of simplices called simplicial complexes that one would expect to emerge from mixture processes describing branches along an evolutionary tree. We further improve on the prior work with a novel objective function to better identify mixtures corresponding to parsimonious evolutionary tree models. We demonstrate that this approach improves on our ability to accurately resolve mixtures on simulated data sets and demonstrate its practical applicability on a large RNASeq tumor data set. CONCLUSIONS: Better exploiting the expected geometric structure for mixed membership models produced from common evolutionary trees allows us to quickly and accurately reconstruct models of cell populations sampled from those trees. In the process, we hope to develop a better understanding of tumor evolution as well as other biological problems that involve interpreting genomic data gathered from heterogeneous populations of cells. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0694-x) contains supplementary material, which is available to authorized users. BioMed Central 2015-08-12 /pmc/articles/PMC4534068/ /pubmed/26264682 http://dx.doi.org/10.1186/s12859-015-0694-x Text en © Roman et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Roman, Theodore Nayyeri, Amir Fasy, Brittany Terese Schwartz, Russell A simplicial complex-based approach to unmixing tumor progression data |
title | A simplicial complex-based approach to unmixing tumor progression data |
title_full | A simplicial complex-based approach to unmixing tumor progression data |
title_fullStr | A simplicial complex-based approach to unmixing tumor progression data |
title_full_unstemmed | A simplicial complex-based approach to unmixing tumor progression data |
title_short | A simplicial complex-based approach to unmixing tumor progression data |
title_sort | simplicial complex-based approach to unmixing tumor progression data |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4534068/ https://www.ncbi.nlm.nih.gov/pubmed/26264682 http://dx.doi.org/10.1186/s12859-015-0694-x |
work_keys_str_mv | AT romantheodore asimplicialcomplexbasedapproachtounmixingtumorprogressiondata AT nayyeriamir asimplicialcomplexbasedapproachtounmixingtumorprogressiondata AT fasybrittanyterese asimplicialcomplexbasedapproachtounmixingtumorprogressiondata AT schwartzrussell asimplicialcomplexbasedapproachtounmixingtumorprogressiondata AT romantheodore simplicialcomplexbasedapproachtounmixingtumorprogressiondata AT nayyeriamir simplicialcomplexbasedapproachtounmixingtumorprogressiondata AT fasybrittanyterese simplicialcomplexbasedapproachtounmixingtumorprogressiondata AT schwartzrussell simplicialcomplexbasedapproachtounmixingtumorprogressiondata |