Cargando…

Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data

BACKGROUND: Exome or whole-genome deep sequencing of tumor DNA along with paired normal DNA can potentially provide a detailed picture of the somatic mutations that characterize the tumor. However, analysis of such sequence data can be complicated by the presence of normal cells in the tumor specime...

Descripción completa

Detalles Bibliográficos
Autores principales: Favero, F., Joshi, T., Marquard, A. M., Birkbak, N. J., Krzystanek, M., Li, Q., Szallasi, Z., Eklund, A. C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4269342/
https://www.ncbi.nlm.nih.gov/pubmed/25319062
http://dx.doi.org/10.1093/annonc/mdu479
_version_ 1782349347250241536
author Favero, F.
Joshi, T.
Marquard, A. M.
Birkbak, N. J.
Krzystanek, M.
Li, Q.
Szallasi, Z.
Eklund, A. C.
author_facet Favero, F.
Joshi, T.
Marquard, A. M.
Birkbak, N. J.
Krzystanek, M.
Li, Q.
Szallasi, Z.
Eklund, A. C.
author_sort Favero, F.
collection PubMed
description BACKGROUND: Exome or whole-genome deep sequencing of tumor DNA along with paired normal DNA can potentially provide a detailed picture of the somatic mutations that characterize the tumor. However, analysis of such sequence data can be complicated by the presence of normal cells in the tumor specimen, by intratumor heterogeneity, and by the sheer size of the raw data. In particular, determination of copy number variations from exome sequencing data alone has proven difficult; thus, single nucleotide polymorphism (SNP) arrays have often been used for this task. Recently, algorithms to estimate absolute, but not allele-specific, copy number profiles from tumor sequencing data have been described. MATERIALS AND METHODS: We developed Sequenza, a software package that uses paired tumor-normal DNA sequencing data to estimate tumor cellularity and ploidy, and to calculate allele-specific copy number profiles and mutation profiles. We applied Sequenza, as well as two previously published algorithms, to exome sequence data from 30 tumors from The Cancer Genome Atlas. We assessed the performance of these algorithms by comparing their results with those generated using matched SNP arrays and processed by the allele-specific copy number analysis of tumors (ASCAT) algorithm. RESULTS: Comparison between Sequenza/exome and SNP/ASCAT revealed strong correlation in cellularity (Pearson's r = 0.90) and ploidy estimates (r = 0.42, or r = 0.94 after manual inspecting alternative solutions). This performance was noticeably superior to previously published algorithms. In addition, in artificial data simulating normal-tumor admixtures, Sequenza detected the correct ploidy in samples with tumor content as low as 30%. CONCLUSIONS: The agreement between Sequenza and SNP array-based copy number profiles suggests that exome sequencing alone is sufficient not only for identifying small scale mutations but also for estimating cellularity and inferring DNA copy number aberrations.
format Online
Article
Text
id pubmed-4269342
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-42693422015-03-24 Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data Favero, F. Joshi, T. Marquard, A. M. Birkbak, N. J. Krzystanek, M. Li, Q. Szallasi, Z. Eklund, A. C. Ann Oncol Original Articles BACKGROUND: Exome or whole-genome deep sequencing of tumor DNA along with paired normal DNA can potentially provide a detailed picture of the somatic mutations that characterize the tumor. However, analysis of such sequence data can be complicated by the presence of normal cells in the tumor specimen, by intratumor heterogeneity, and by the sheer size of the raw data. In particular, determination of copy number variations from exome sequencing data alone has proven difficult; thus, single nucleotide polymorphism (SNP) arrays have often been used for this task. Recently, algorithms to estimate absolute, but not allele-specific, copy number profiles from tumor sequencing data have been described. MATERIALS AND METHODS: We developed Sequenza, a software package that uses paired tumor-normal DNA sequencing data to estimate tumor cellularity and ploidy, and to calculate allele-specific copy number profiles and mutation profiles. We applied Sequenza, as well as two previously published algorithms, to exome sequence data from 30 tumors from The Cancer Genome Atlas. We assessed the performance of these algorithms by comparing their results with those generated using matched SNP arrays and processed by the allele-specific copy number analysis of tumors (ASCAT) algorithm. RESULTS: Comparison between Sequenza/exome and SNP/ASCAT revealed strong correlation in cellularity (Pearson's r = 0.90) and ploidy estimates (r = 0.42, or r = 0.94 after manual inspecting alternative solutions). This performance was noticeably superior to previously published algorithms. In addition, in artificial data simulating normal-tumor admixtures, Sequenza detected the correct ploidy in samples with tumor content as low as 30%. CONCLUSIONS: The agreement between Sequenza and SNP array-based copy number profiles suggests that exome sequencing alone is sufficient not only for identifying small scale mutations but also for estimating cellularity and inferring DNA copy number aberrations. Oxford University Press 2015-01 2014-10-15 /pmc/articles/PMC4269342/ /pubmed/25319062 http://dx.doi.org/10.1093/annonc/mdu479 Text en © The Author 2014. Published by Oxford University Press on behalf of the European Society for Medical Oncology. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Articles
Favero, F.
Joshi, T.
Marquard, A. M.
Birkbak, N. J.
Krzystanek, M.
Li, Q.
Szallasi, Z.
Eklund, A. C.
Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data
title Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data
title_full Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data
title_fullStr Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data
title_full_unstemmed Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data
title_short Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data
title_sort sequenza: allele-specific copy number and mutation profiles from tumor sequencing data
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4269342/
https://www.ncbi.nlm.nih.gov/pubmed/25319062
http://dx.doi.org/10.1093/annonc/mdu479
work_keys_str_mv AT faverof sequenzaallelespecificcopynumberandmutationprofilesfromtumorsequencingdata
AT joshit sequenzaallelespecificcopynumberandmutationprofilesfromtumorsequencingdata
AT marquardam sequenzaallelespecificcopynumberandmutationprofilesfromtumorsequencingdata
AT birkbaknj sequenzaallelespecificcopynumberandmutationprofilesfromtumorsequencingdata
AT krzystanekm sequenzaallelespecificcopynumberandmutationprofilesfromtumorsequencingdata
AT liq sequenzaallelespecificcopynumberandmutationprofilesfromtumorsequencingdata
AT szallasiz sequenzaallelespecificcopynumberandmutationprofilesfromtumorsequencingdata
AT eklundac sequenzaallelespecificcopynumberandmutationprofilesfromtumorsequencingdata