Cargando…

Medoidshift clustering applied to genomic bulk tumor data

Despite the enormous medical impact of cancers and intensive study of their biology, detailed characterization of tumor growth and development remains elusive. This difficulty occurs in large part because of enormous heterogeneity in the molecular mechanisms of cancer progression, both tumor-to-tumo...

Descripción completa

Detalles Bibliográficos
Autores principales: Roman, Theodore, Xie, Lu, Schwartz, Russell
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4895288/
https://www.ncbi.nlm.nih.gov/pubmed/26817708
http://dx.doi.org/10.1186/s12864-015-2302-x
_version_ 1782435819441618944
author Roman, Theodore
Xie, Lu
Schwartz, Russell
author_facet Roman, Theodore
Xie, Lu
Schwartz, Russell
author_sort Roman, Theodore
collection PubMed
description Despite the enormous medical impact of cancers and intensive study of their biology, detailed characterization of tumor growth and development remains elusive. This difficulty occurs in large part because of enormous heterogeneity in the molecular mechanisms of cancer progression, both tumor-to-tumor and cell-to-cell in single tumors. Advances in genomic technologies, especially at the single-cell level, are improving the situation, but these approaches are held back by limitations of the biotechnologies for gathering genomic data from heterogeneous cell populations and the computational methods for making sense of those data. One popular way to gain the advantages of whole-genome methods without the cost of single-cell genomics has been the use of computational deconvolution (unmixing) methods to reconstruct clonal heterogeneity from bulk genomic data. These methods, too, are limited by the difficulty of inferring genomic profiles of rare or subtly varying clonal subpopulations from bulk data, a problem that can be computationally reduced to that of reconstructing the geometry of point clouds of tumor samples in a genome space. Here, we present a new method to improve that reconstruction by better identifying subspaces corresponding to tumors produced from mixtures of distinct combinations of clonal subpopulations. We develop a nonparametric clustering method based on medoidshift clustering for identifying subgroups of tumors expected to correspond to distinct trajectories of evolutionary progression. We show on synthetic and real tumor copy-number data that this new method substantially improves our ability to resolve discrete tumor subgroups, a key step in the process of accurately deconvolving tumor genomic data and inferring clonal heterogeneity from bulk data.
format Online
Article
Text
id pubmed-4895288
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-48952882016-06-10 Medoidshift clustering applied to genomic bulk tumor data Roman, Theodore Xie, Lu Schwartz, Russell BMC Genomics Proceedings Despite the enormous medical impact of cancers and intensive study of their biology, detailed characterization of tumor growth and development remains elusive. This difficulty occurs in large part because of enormous heterogeneity in the molecular mechanisms of cancer progression, both tumor-to-tumor and cell-to-cell in single tumors. Advances in genomic technologies, especially at the single-cell level, are improving the situation, but these approaches are held back by limitations of the biotechnologies for gathering genomic data from heterogeneous cell populations and the computational methods for making sense of those data. One popular way to gain the advantages of whole-genome methods without the cost of single-cell genomics has been the use of computational deconvolution (unmixing) methods to reconstruct clonal heterogeneity from bulk genomic data. These methods, too, are limited by the difficulty of inferring genomic profiles of rare or subtly varying clonal subpopulations from bulk data, a problem that can be computationally reduced to that of reconstructing the geometry of point clouds of tumor samples in a genome space. Here, we present a new method to improve that reconstruction by better identifying subspaces corresponding to tumors produced from mixtures of distinct combinations of clonal subpopulations. We develop a nonparametric clustering method based on medoidshift clustering for identifying subgroups of tumors expected to correspond to distinct trajectories of evolutionary progression. We show on synthetic and real tumor copy-number data that this new method substantially improves our ability to resolve discrete tumor subgroups, a key step in the process of accurately deconvolving tumor genomic data and inferring clonal heterogeneity from bulk data. BioMed Central 2016-01-11 /pmc/articles/PMC4895288/ /pubmed/26817708 http://dx.doi.org/10.1186/s12864-015-2302-x Text en © Roman et al. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Roman, Theodore
Xie, Lu
Schwartz, Russell
Medoidshift clustering applied to genomic bulk tumor data
title Medoidshift clustering applied to genomic bulk tumor data
title_full Medoidshift clustering applied to genomic bulk tumor data
title_fullStr Medoidshift clustering applied to genomic bulk tumor data
title_full_unstemmed Medoidshift clustering applied to genomic bulk tumor data
title_short Medoidshift clustering applied to genomic bulk tumor data
title_sort medoidshift clustering applied to genomic bulk tumor data
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4895288/
https://www.ncbi.nlm.nih.gov/pubmed/26817708
http://dx.doi.org/10.1186/s12864-015-2302-x
work_keys_str_mv AT romantheodore medoidshiftclusteringappliedtogenomicbulktumordata
AT xielu medoidshiftclusteringappliedtogenomicbulktumordata
AT schwartzrussell medoidshiftclusteringappliedtogenomicbulktumordata