
A simplicial complex-based approach to unmixing tumor progression data

BACKGROUND: Tumorigenesis is an evolutionary process by which tumor cells acquire mutations through successive diversification and differentiation. There is much interest in reconstructing this process of evolution due to its relevance to identifying drivers of mutation and predicting future prognos...


Bibliographic Details
Main authors: Roman, Theodore, Nayyeri, Amir, Fasy, Brittany Terese, Schwartz, Russell
Format: Online Article Text
Language: English
Published: BioMed Central 2015
Subjects: Methodology Article
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4534068/
https://www.ncbi.nlm.nih.gov/pubmed/26264682
http://dx.doi.org/10.1186/s12859-015-0694-x
_version_ 1782385407694995456
author Roman, Theodore
Nayyeri, Amir
Fasy, Brittany Terese
Schwartz, Russell
collection PubMed
description BACKGROUND: Tumorigenesis is an evolutionary process by which tumor cells acquire mutations through successive diversification and differentiation. There is much interest in reconstructing this process of evolution due to its relevance to identifying drivers of mutation and predicting future prognosis and drug response. Efforts are challenged by high tumor heterogeneity, though, both within and among patients. In prior work, we showed that this heterogeneity could be turned into an advantage by computationally reconstructing models of cell populations mixed to different degrees in distinct tumors. Such mixed membership model approaches, however, are still limited in their ability to dissect more than a few well-conserved cell populations across a tumor data set. RESULTS: We present a method to improve on current mixed membership model approaches by better accounting for conserved progression pathways between subsets of cancers, which imply a structure to the data that has not previously been exploited. We extend our prior methods, which use an interpretation of the mixture problem as that of reconstructing simple geometric objects called simplices, to instead search for structured unions of simplices called simplicial complexes that one would expect to emerge from mixture processes describing branches along an evolutionary tree. We further improve on the prior work with a novel objective function to better identify mixtures corresponding to parsimonious evolutionary tree models. We demonstrate that this approach improves on our ability to accurately resolve mixtures on simulated data sets and demonstrate its practical applicability on a large RNASeq tumor data set. CONCLUSIONS: Better exploiting the expected geometric structure for mixed membership models produced from common evolutionary trees allows us to quickly and accurately reconstruct models of cell populations sampled from those trees. 
In the process, we hope to develop a better understanding of tumor evolution as well as other biological problems that involve interpreting genomic data gathered from heterogeneous populations of cells. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0694-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4534068
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-4534068 2015-08-13 BMC Bioinformatics Methodology Article BioMed Central 2015-08-12 Text en © Roman et al. 2015
Open Access: This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title A simplicial complex-based approach to unmixing tumor progression data
topic Methodology Article