Cargando…
CNVcaller: highly efficient and widely applicable software for detecting copy number variations in large populations
BACKGROUND: The increasing amount of sequencing data available for a wide variety of species can be theoretically used for detecting copy number variations (CNVs) at the population level. However, the growing sample sizes and the divergent complexity of nonhuman genomes challenge the efficiency and...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5751039/ https://www.ncbi.nlm.nih.gov/pubmed/29220491 http://dx.doi.org/10.1093/gigascience/gix115 |
_version_ | 1783289858835349504 |
---|---|
author | Wang, Xihong Zheng, Zhuqing Cai, Yudong Chen, Ting Li, Chao Fu, Weiwei Jiang, Yu |
author_facet | Wang, Xihong Zheng, Zhuqing Cai, Yudong Chen, Ting Li, Chao Fu, Weiwei Jiang, Yu |
author_sort | Wang, Xihong |
collection | PubMed |
description | BACKGROUND: The increasing amount of sequencing data available for a wide variety of species can be theoretically used for detecting copy number variations (CNVs) at the population level. However, the growing sample sizes and the divergent complexity of nonhuman genomes challenge the efficiency and robustness of current human-oriented CNV detection methods. RESULTS: Here, we present CNVcaller, a read-depth method for discovering CNVs in population sequencing data. The computational speed of CNVcaller was 1–2 orders of magnitude faster than CNVnator and Genome STRiP for complex genomes with thousands of unmapped scaffolds. CNV detection of 232 goats required only 1.4 days on a single compute node. Additionally, the Mendelian consistency of sheep trios indicated that CNVcaller mitigated the influence of high proportions of gaps and misassembled duplications in the nonhuman reference genome assembly. Furthermore, multiple evaluations using real sheep and human data indicated that CNVcaller achieved the best accuracy and sensitivity for detecting duplications. CONCLUSIONS: The fast generalized detection algorithms included in CNVcaller overcome prior computational barriers for detecting CNVs in large-scale sequencing data with complex genomic structures. Therefore, CNVcaller promotes population genetic analyses of functional CNVs in more species. |
format | Online Article Text |
id | pubmed-5751039 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-57510392018-01-05 CNVcaller: highly efficient and widely applicable software for detecting copy number variations in large populations Wang, Xihong Zheng, Zhuqing Cai, Yudong Chen, Ting Li, Chao Fu, Weiwei Jiang, Yu Gigascience Technical Note BACKGROUND: The increasing amount of sequencing data available for a wide variety of species can be theoretically used for detecting copy number variations (CNVs) at the population level. However, the growing sample sizes and the divergent complexity of nonhuman genomes challenge the efficiency and robustness of current human-oriented CNV detection methods. RESULTS: Here, we present CNVcaller, a read-depth method for discovering CNVs in population sequencing data. The computational speed of CNVcaller was 1–2 orders of magnitude faster than CNVnator and Genome STRiP for complex genomes with thousands of unmapped scaffolds. CNV detection of 232 goats required only 1.4 days on a single compute node. Additionally, the Mendelian consistency of sheep trios indicated that CNVcaller mitigated the influence of high proportions of gaps and misassembled duplications in the nonhuman reference genome assembly. Furthermore, multiple evaluations using real sheep and human data indicated that CNVcaller achieved the best accuracy and sensitivity for detecting duplications. CONCLUSIONS: The fast generalized detection algorithms included in CNVcaller overcome prior computational barriers for detecting CNVs in large-scale sequencing data with complex genomic structures. Therefore, CNVcaller promotes population genetic analyses of functional CNVs in more species. Oxford University Press 2017-12-04 /pmc/articles/PMC5751039/ /pubmed/29220491 http://dx.doi.org/10.1093/gigascience/gix115 Text en © The Author(s) 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Technical Note Wang, Xihong Zheng, Zhuqing Cai, Yudong Chen, Ting Li, Chao Fu, Weiwei Jiang, Yu CNVcaller: highly efficient and widely applicable software for detecting copy number variations in large populations |
title | CNVcaller: highly efficient and widely applicable software for detecting copy number variations in large populations |
title_full | CNVcaller: highly efficient and widely applicable software for detecting copy number variations in large populations |
title_fullStr | CNVcaller: highly efficient and widely applicable software for detecting copy number variations in large populations |
title_full_unstemmed | CNVcaller: highly efficient and widely applicable software for detecting copy number variations in large populations |
title_short | CNVcaller: highly efficient and widely applicable software for detecting copy number variations in large populations |
title_sort | cnvcaller: highly efficient and widely applicable software for detecting copy number variations in large populations |
topic | Technical Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5751039/ https://www.ncbi.nlm.nih.gov/pubmed/29220491 http://dx.doi.org/10.1093/gigascience/gix115 |
work_keys_str_mv | AT wangxihong cnvcallerhighlyefficientandwidelyapplicablesoftwarefordetectingcopynumbervariationsinlargepopulations AT zhengzhuqing cnvcallerhighlyefficientandwidelyapplicablesoftwarefordetectingcopynumbervariationsinlargepopulations AT caiyudong cnvcallerhighlyefficientandwidelyapplicablesoftwarefordetectingcopynumbervariationsinlargepopulations AT chenting cnvcallerhighlyefficientandwidelyapplicablesoftwarefordetectingcopynumbervariationsinlargepopulations AT lichao cnvcallerhighlyefficientandwidelyapplicablesoftwarefordetectingcopynumbervariationsinlargepopulations AT fuweiwei cnvcallerhighlyefficientandwidelyapplicablesoftwarefordetectingcopynumbervariationsinlargepopulations AT jiangyu cnvcallerhighlyefficientandwidelyapplicablesoftwarefordetectingcopynumbervariationsinlargepopulations |