Cargando…
Reconstructing genome mixtures from partial adjacencies
Many cancer genome sequencing efforts are underway with the goal of identifying the somatic mutations that drive cancer progression. A major difficulty in these studies is that tumors are typically heterogeneous, with individual cells in a tumor having different complements of somatic mutations. How...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3526445/ https://www.ncbi.nlm.nih.gov/pubmed/23282028 http://dx.doi.org/10.1186/1471-2105-13-S19-S9 |
_version_ | 1782253562030456832 |
---|---|
author | Mahmoody, Ahmad Kahn, Crystal L Raphael, Benjamin J |
author_facet | Mahmoody, Ahmad Kahn, Crystal L Raphael, Benjamin J |
author_sort | Mahmoody, Ahmad |
collection | PubMed |
description | Many cancer genome sequencing efforts are underway with the goal of identifying the somatic mutations that drive cancer progression. A major difficulty in these studies is that tumors are typically heterogeneous, with individual cells in a tumor having different complements of somatic mutations. However, nearly all DNA sequencing technologies sequence DNA from multiple cells, thus resulting in measurement of mutations from a mixture of genomes. Genome rearrangements are a major class of somatic mutations in many tumors, and the novel adjacencies (i.e. breakpoints) resulting from these rearrangements are readily detected from DNA sequencing reads. However, the assignment of each rearrangement, or adjacency, to an individual cancer genome in the mixture is not known. Moreover, the quantity of DNA sequence reads may be insufficient to measure all rearrangements in all genomes in the tumor. Motivated by this application, we formulate the k-minimum completion problem (k-MCP). In this problem, we aim to reconstruct k genomes derived from a single reference genome, given partial information about the adjacencies present in the mixture of these genomes. We show that the 1-MCP is solvable in linear time in the cases where: (i) the measured, incomplete genome has a single circular or linear chromosome; (ii) there are no restrictions on the chromosomal content of the measured, incomplete genome. We also show that the k-MCP problem, for k ≥ 3 in general, and the 2-MCP problem with the double-cut-and-join (DCJ) distance are NP-complete, when there are no restriction on the chromosomal structure of the measured, incomplete genome. These results lay the foundation for future algorithmic studies of the k-MCP and the application of these algorithms to real cancer sequencing data. |
format | Online Article Text |
id | pubmed-3526445 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35264452013-01-10 Reconstructing genome mixtures from partial adjacencies Mahmoody, Ahmad Kahn, Crystal L Raphael, Benjamin J BMC Bioinformatics Proceedings Many cancer genome sequencing efforts are underway with the goal of identifying the somatic mutations that drive cancer progression. A major difficulty in these studies is that tumors are typically heterogeneous, with individual cells in a tumor having different complements of somatic mutations. However, nearly all DNA sequencing technologies sequence DNA from multiple cells, thus resulting in measurement of mutations from a mixture of genomes. Genome rearrangements are a major class of somatic mutations in many tumors, and the novel adjacencies (i.e. breakpoints) resulting from these rearrangements are readily detected from DNA sequencing reads. However, the assignment of each rearrangement, or adjacency, to an individual cancer genome in the mixture is not known. Moreover, the quantity of DNA sequence reads may be insufficient to measure all rearrangements in all genomes in the tumor. Motivated by this application, we formulate the k-minimum completion problem (k-MCP). In this problem, we aim to reconstruct k genomes derived from a single reference genome, given partial information about the adjacencies present in the mixture of these genomes. We show that the 1-MCP is solvable in linear time in the cases where: (i) the measured, incomplete genome has a single circular or linear chromosome; (ii) there are no restrictions on the chromosomal content of the measured, incomplete genome. We also show that the k-MCP problem, for k ≥ 3 in general, and the 2-MCP problem with the double-cut-and-join (DCJ) distance are NP-complete, when there are no restriction on the chromosomal structure of the measured, incomplete genome. These results lay the foundation for future algorithmic studies of the k-MCP and the application of these algorithms to real cancer sequencing data. BioMed Central 2012-12-19 /pmc/articles/PMC3526445/ /pubmed/23282028 http://dx.doi.org/10.1186/1471-2105-13-S19-S9 Text en Copyright ©2012 Mahmoody et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Mahmoody, Ahmad Kahn, Crystal L Raphael, Benjamin J Reconstructing genome mixtures from partial adjacencies |
title | Reconstructing genome mixtures from partial adjacencies |
title_full | Reconstructing genome mixtures from partial adjacencies |
title_fullStr | Reconstructing genome mixtures from partial adjacencies |
title_full_unstemmed | Reconstructing genome mixtures from partial adjacencies |
title_short | Reconstructing genome mixtures from partial adjacencies |
title_sort | reconstructing genome mixtures from partial adjacencies |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3526445/ https://www.ncbi.nlm.nih.gov/pubmed/23282028 http://dx.doi.org/10.1186/1471-2105-13-S19-S9 |
work_keys_str_mv | AT mahmoodyahmad reconstructinggenomemixturesfrompartialadjacencies AT kahncrystall reconstructinggenomemixturesfrompartialadjacencies AT raphaelbenjaminj reconstructinggenomemixturesfrompartialadjacencies |