Cargando…
Accucopy: accurate and fast inference of allele-specific copy number alterations from low-coverage low-purity tumor sequencing data
BACKGROUND: Copy number alterations (CNAs), due to their large impact on the genome, have been an important contributing factor to oncogenesis and metastasis. Detecting genomic alterations from the shallow-sequencing data of a low-purity tumor sample remains a challenging task. RESULTS: We introduce...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7811225/ https://www.ncbi.nlm.nih.gov/pubmed/33451280 http://dx.doi.org/10.1186/s12859-020-03924-5 |
_version_ | 1783637455205826560 |
---|---|
author | Fan, Xinping Luo, Guanghao Huang, Yu S. |
author_facet | Fan, Xinping Luo, Guanghao Huang, Yu S. |
author_sort | Fan, Xinping |
collection | PubMed |
description | BACKGROUND: Copy number alterations (CNAs), due to their large impact on the genome, have been an important contributing factor to oncogenesis and metastasis. Detecting genomic alterations from the shallow-sequencing data of a low-purity tumor sample remains a challenging task. RESULTS: We introduce Accucopy, a method to infer total copy numbers (TCNs) and allele-specific copy numbers (ASCNs) from challenging low-purity and low-coverage tumor samples. Accucopy adopts many robust statistical techniques such as kernel smoothing of coverage differentiation information to discern signals from noise and combines ideas from time-series analysis and the signal-processing field to derive a range of estimates for the period in a histogram of coverage differentiation information. Statistical learning models such as the tiered Gaussian mixture model, the expectation–maximization algorithm, and sparse Bayesian learning were customized and built into the model. Accucopy is implemented in C++ /Rust, packaged in a docker image, and supports non-human samples, more at http://www.yfish.org/software/. CONCLUSIONS: We describe Accucopy, a method that can predict both TCNs and ASCNs from low-coverage low-purity tumor sequencing data. Through comparative analyses in both simulated and real-sequencing samples, we demonstrate that Accucopy is more accurate than Sclust, ABSOLUTE, and Sequenza. |
format | Online Article Text |
id | pubmed-7811225 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-78112252021-01-18 Accucopy: accurate and fast inference of allele-specific copy number alterations from low-coverage low-purity tumor sequencing data Fan, Xinping Luo, Guanghao Huang, Yu S. BMC Bioinformatics Methodology Article BACKGROUND: Copy number alterations (CNAs), due to their large impact on the genome, have been an important contributing factor to oncogenesis and metastasis. Detecting genomic alterations from the shallow-sequencing data of a low-purity tumor sample remains a challenging task. RESULTS: We introduce Accucopy, a method to infer total copy numbers (TCNs) and allele-specific copy numbers (ASCNs) from challenging low-purity and low-coverage tumor samples. Accucopy adopts many robust statistical techniques such as kernel smoothing of coverage differentiation information to discern signals from noise and combines ideas from time-series analysis and the signal-processing field to derive a range of estimates for the period in a histogram of coverage differentiation information. Statistical learning models such as the tiered Gaussian mixture model, the expectation–maximization algorithm, and sparse Bayesian learning were customized and built into the model. Accucopy is implemented in C++ /Rust, packaged in a docker image, and supports non-human samples, more at http://www.yfish.org/software/. CONCLUSIONS: We describe Accucopy, a method that can predict both TCNs and ASCNs from low-coverage low-purity tumor sequencing data. Through comparative analyses in both simulated and real-sequencing samples, we demonstrate that Accucopy is more accurate than Sclust, ABSOLUTE, and Sequenza. BioMed Central 2021-01-15 /pmc/articles/PMC7811225/ /pubmed/33451280 http://dx.doi.org/10.1186/s12859-020-03924-5 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Article Fan, Xinping Luo, Guanghao Huang, Yu S. Accucopy: accurate and fast inference of allele-specific copy number alterations from low-coverage low-purity tumor sequencing data |
title | Accucopy: accurate and fast inference of allele-specific copy number alterations from low-coverage low-purity tumor sequencing data |
title_full | Accucopy: accurate and fast inference of allele-specific copy number alterations from low-coverage low-purity tumor sequencing data |
title_fullStr | Accucopy: accurate and fast inference of allele-specific copy number alterations from low-coverage low-purity tumor sequencing data |
title_full_unstemmed | Accucopy: accurate and fast inference of allele-specific copy number alterations from low-coverage low-purity tumor sequencing data |
title_short | Accucopy: accurate and fast inference of allele-specific copy number alterations from low-coverage low-purity tumor sequencing data |
title_sort | accucopy: accurate and fast inference of allele-specific copy number alterations from low-coverage low-purity tumor sequencing data |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7811225/ https://www.ncbi.nlm.nih.gov/pubmed/33451280 http://dx.doi.org/10.1186/s12859-020-03924-5 |
work_keys_str_mv | AT fanxinping accucopyaccurateandfastinferenceofallelespecificcopynumberalterationsfromlowcoveragelowpuritytumorsequencingdata AT luoguanghao accucopyaccurateandfastinferenceofallelespecificcopynumberalterationsfromlowcoveragelowpuritytumorsequencingdata AT huangyus accucopyaccurateandfastinferenceofallelespecificcopynumberalterationsfromlowcoveragelowpuritytumorsequencingdata |