Cargando…
Consistency-based detection of potential tumor-specific deletions in matched normal/tumor genomes
BACKGROUND: Structural variations in human genomes, such as insertions, deletion, or rearrangements, play an important role in cancer development. Next-Generation Sequencing technologies have been central in providing ways to detect such variations. Most existing methods however are limited to the a...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3283309/ https://www.ncbi.nlm.nih.gov/pubmed/22152084 http://dx.doi.org/10.1186/1471-2105-12-S9-S21 |
_version_ | 1782224182039281664 |
---|---|
author | Wittler, Roland Chauve, Cedric |
author_facet | Wittler, Roland Chauve, Cedric |
author_sort | Wittler, Roland |
collection | PubMed |
description | BACKGROUND: Structural variations in human genomes, such as insertions, deletion, or rearrangements, play an important role in cancer development. Next-Generation Sequencing technologies have been central in providing ways to detect such variations. Most existing methods however are limited to the analysis of a single genome, and it is only recently that the comparison of closely related genomes has been considered. In particular, a few recent works considered the analysis of data sets obtained by sequencing both tumor and healthy tissues of the same cancer patient. In that context, the goal is to detect variations that are specific to exactly one of the genomes, for example to differentiate between patient-specific and tumor-specific variations. This is a difficult task, especially when facing the additional challenge of the possible contamination of healthy tissues by tumor cells and conversely. RESULTS: In the current work, we analyzed a data set of paired-end short-reads, obtained by sequencing tumor tissues and healthy tissues, both from the same cancer patient. Based on a combinatorial notion of conflict between deletions, we show that in the tumor data, more deletions are predicted than there could actually be in a diploid genome. In contrast, the predictions for the data from normal tissues are almost conflict-free. We designed and applied a method, specific to the analysis of such pooled and contaminated data sets, to detect potential tumor-specific deletions. Our method takes the deletion calls from both data sets and assigns reads from the mixed tumor/normal data to the normal one with the goal to minimize the number of reads that need to be discarded to obtain a set of conflict-free deletion clusters. We observed that, on the specific data set we analyze, only a very small fraction of the reads needs to be discarded to obtain a set of consistent deletions. CONCLUSIONS: We present a framework based on a rigorous definition of consistency between deletions and the assumption that the tumor sample also contains normal cells. A combined analysis of both data sets based on this model allowed a consistent explanation of almost all data, providing a detailed picture of candidate patient- and tumor-specific deletions. |
format | Online Article Text |
id | pubmed-3283309 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32833092012-02-22 Consistency-based detection of potential tumor-specific deletions in matched normal/tumor genomes Wittler, Roland Chauve, Cedric BMC Bioinformatics Proceedings BACKGROUND: Structural variations in human genomes, such as insertions, deletion, or rearrangements, play an important role in cancer development. Next-Generation Sequencing technologies have been central in providing ways to detect such variations. Most existing methods however are limited to the analysis of a single genome, and it is only recently that the comparison of closely related genomes has been considered. In particular, a few recent works considered the analysis of data sets obtained by sequencing both tumor and healthy tissues of the same cancer patient. In that context, the goal is to detect variations that are specific to exactly one of the genomes, for example to differentiate between patient-specific and tumor-specific variations. This is a difficult task, especially when facing the additional challenge of the possible contamination of healthy tissues by tumor cells and conversely. RESULTS: In the current work, we analyzed a data set of paired-end short-reads, obtained by sequencing tumor tissues and healthy tissues, both from the same cancer patient. Based on a combinatorial notion of conflict between deletions, we show that in the tumor data, more deletions are predicted than there could actually be in a diploid genome. In contrast, the predictions for the data from normal tissues are almost conflict-free. We designed and applied a method, specific to the analysis of such pooled and contaminated data sets, to detect potential tumor-specific deletions. Our method takes the deletion calls from both data sets and assigns reads from the mixed tumor/normal data to the normal one with the goal to minimize the number of reads that need to be discarded to obtain a set of conflict-free deletion clusters. We observed that, on the specific data set we analyze, only a very small fraction of the reads needs to be discarded to obtain a set of consistent deletions. CONCLUSIONS: We present a framework based on a rigorous definition of consistency between deletions and the assumption that the tumor sample also contains normal cells. A combined analysis of both data sets based on this model allowed a consistent explanation of almost all data, providing a detailed picture of candidate patient- and tumor-specific deletions. BioMed Central 2011-10-05 /pmc/articles/PMC3283309/ /pubmed/22152084 http://dx.doi.org/10.1186/1471-2105-12-S9-S21 Text en Copyright ©2011 Wittler and Chauve; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Wittler, Roland Chauve, Cedric Consistency-based detection of potential tumor-specific deletions in matched normal/tumor genomes |
title | Consistency-based detection of potential tumor-specific deletions in matched normal/tumor genomes |
title_full | Consistency-based detection of potential tumor-specific deletions in matched normal/tumor genomes |
title_fullStr | Consistency-based detection of potential tumor-specific deletions in matched normal/tumor genomes |
title_full_unstemmed | Consistency-based detection of potential tumor-specific deletions in matched normal/tumor genomes |
title_short | Consistency-based detection of potential tumor-specific deletions in matched normal/tumor genomes |
title_sort | consistency-based detection of potential tumor-specific deletions in matched normal/tumor genomes |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3283309/ https://www.ncbi.nlm.nih.gov/pubmed/22152084 http://dx.doi.org/10.1186/1471-2105-12-S9-S21 |
work_keys_str_mv | AT wittlerroland consistencybaseddetectionofpotentialtumorspecificdeletionsinmatchednormaltumorgenomes AT chauvecedric consistencybaseddetectionofpotentialtumorspecificdeletionsinmatchednormaltumorgenomes |