Cargando…

Reconstructing genome mixtures from partial adjacencies

Many cancer genome sequencing efforts are underway with the goal of identifying the somatic mutations that drive cancer progression. A major difficulty in these studies is that tumors are typically heterogeneous, with individual cells in a tumor having different complements of somatic mutations. How...

Descripción completa

Detalles Bibliográficos
Autores principales: Mahmoody, Ahmad, Kahn, Crystal L, Raphael, Benjamin J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3526445/
https://www.ncbi.nlm.nih.gov/pubmed/23282028
http://dx.doi.org/10.1186/1471-2105-13-S19-S9
_version_ 1782253562030456832
author Mahmoody, Ahmad
Kahn, Crystal L
Raphael, Benjamin J
author_facet Mahmoody, Ahmad
Kahn, Crystal L
Raphael, Benjamin J
author_sort Mahmoody, Ahmad
collection PubMed
description Many cancer genome sequencing efforts are underway with the goal of identifying the somatic mutations that drive cancer progression. A major difficulty in these studies is that tumors are typically heterogeneous, with individual cells in a tumor having different complements of somatic mutations. However, nearly all DNA sequencing technologies sequence DNA from multiple cells, thus resulting in measurement of mutations from a mixture of genomes. Genome rearrangements are a major class of somatic mutations in many tumors, and the novel adjacencies (i.e. breakpoints) resulting from these rearrangements are readily detected from DNA sequencing reads. However, the assignment of each rearrangement, or adjacency, to an individual cancer genome in the mixture is not known. Moreover, the quantity of DNA sequence reads may be insufficient to measure all rearrangements in all genomes in the tumor. Motivated by this application, we formulate the k-minimum completion problem (k-MCP). In this problem, we aim to reconstruct k genomes derived from a single reference genome, given partial information about the adjacencies present in the mixture of these genomes. We show that the 1-MCP is solvable in linear time in the cases where: (i) the measured, incomplete genome has a single circular or linear chromosome; (ii) there are no restrictions on the chromosomal content of the measured, incomplete genome. We also show that the k-MCP problem, for k ≥ 3 in general, and the 2-MCP problem with the double-cut-and-join (DCJ) distance are NP-complete, when there are no restriction on the chromosomal structure of the measured, incomplete genome. These results lay the foundation for future algorithmic studies of the k-MCP and the application of these algorithms to real cancer sequencing data.
format Online
Article
Text
id pubmed-3526445
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35264452013-01-10 Reconstructing genome mixtures from partial adjacencies Mahmoody, Ahmad Kahn, Crystal L Raphael, Benjamin J BMC Bioinformatics Proceedings Many cancer genome sequencing efforts are underway with the goal of identifying the somatic mutations that drive cancer progression. A major difficulty in these studies is that tumors are typically heterogeneous, with individual cells in a tumor having different complements of somatic mutations. However, nearly all DNA sequencing technologies sequence DNA from multiple cells, thus resulting in measurement of mutations from a mixture of genomes. Genome rearrangements are a major class of somatic mutations in many tumors, and the novel adjacencies (i.e. breakpoints) resulting from these rearrangements are readily detected from DNA sequencing reads. However, the assignment of each rearrangement, or adjacency, to an individual cancer genome in the mixture is not known. Moreover, the quantity of DNA sequence reads may be insufficient to measure all rearrangements in all genomes in the tumor. Motivated by this application, we formulate the k-minimum completion problem (k-MCP). In this problem, we aim to reconstruct k genomes derived from a single reference genome, given partial information about the adjacencies present in the mixture of these genomes. We show that the 1-MCP is solvable in linear time in the cases where: (i) the measured, incomplete genome has a single circular or linear chromosome; (ii) there are no restrictions on the chromosomal content of the measured, incomplete genome. We also show that the k-MCP problem, for k ≥ 3 in general, and the 2-MCP problem with the double-cut-and-join (DCJ) distance are NP-complete, when there are no restriction on the chromosomal structure of the measured, incomplete genome. These results lay the foundation for future algorithmic studies of the k-MCP and the application of these algorithms to real cancer sequencing data. BioMed Central 2012-12-19 /pmc/articles/PMC3526445/ /pubmed/23282028 http://dx.doi.org/10.1186/1471-2105-13-S19-S9 Text en Copyright ©2012 Mahmoody et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Mahmoody, Ahmad
Kahn, Crystal L
Raphael, Benjamin J
Reconstructing genome mixtures from partial adjacencies
title Reconstructing genome mixtures from partial adjacencies
title_full Reconstructing genome mixtures from partial adjacencies
title_fullStr Reconstructing genome mixtures from partial adjacencies
title_full_unstemmed Reconstructing genome mixtures from partial adjacencies
title_short Reconstructing genome mixtures from partial adjacencies
title_sort reconstructing genome mixtures from partial adjacencies
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3526445/
https://www.ncbi.nlm.nih.gov/pubmed/23282028
http://dx.doi.org/10.1186/1471-2105-13-S19-S9
work_keys_str_mv AT mahmoodyahmad reconstructinggenomemixturesfrompartialadjacencies
AT kahncrystall reconstructinggenomemixturesfrompartialadjacencies
AT raphaelbenjaminj reconstructinggenomemixturesfrompartialadjacencies