Cargando…

CNVcaller: highly efficient and widely applicable software for detecting copy number variations in large populations

BACKGROUND: The increasing amount of sequencing data available for a wide variety of species can be theoretically used for detecting copy number variations (CNVs) at the population level. However, the growing sample sizes and the divergent complexity of nonhuman genomes challenge the efficiency and...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Xihong, Zheng, Zhuqing, Cai, Yudong, Chen, Ting, Li, Chao, Fu, Weiwei, Jiang, Yu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5751039/
https://www.ncbi.nlm.nih.gov/pubmed/29220491
http://dx.doi.org/10.1093/gigascience/gix115
_version_ 1783289858835349504
author Wang, Xihong
Zheng, Zhuqing
Cai, Yudong
Chen, Ting
Li, Chao
Fu, Weiwei
Jiang, Yu
author_facet Wang, Xihong
Zheng, Zhuqing
Cai, Yudong
Chen, Ting
Li, Chao
Fu, Weiwei
Jiang, Yu
author_sort Wang, Xihong
collection PubMed
description BACKGROUND: The increasing amount of sequencing data available for a wide variety of species can be theoretically used for detecting copy number variations (CNVs) at the population level. However, the growing sample sizes and the divergent complexity of nonhuman genomes challenge the efficiency and robustness of current human-oriented CNV detection methods. RESULTS: Here, we present CNVcaller, a read-depth method for discovering CNVs in population sequencing data. The computational speed of CNVcaller was 1–2 orders of magnitude faster than CNVnator and Genome STRiP for complex genomes with thousands of unmapped scaffolds. CNV detection of 232 goats required only 1.4 days on a single compute node. Additionally, the Mendelian consistency of sheep trios indicated that CNVcaller mitigated the influence of high proportions of gaps and misassembled duplications in the nonhuman reference genome assembly. Furthermore, multiple evaluations using real sheep and human data indicated that CNVcaller achieved the best accuracy and sensitivity for detecting duplications. CONCLUSIONS: The fast generalized detection algorithms included in CNVcaller overcome prior computational barriers for detecting CNVs in large-scale sequencing data with complex genomic structures. Therefore, CNVcaller promotes population genetic analyses of functional CNVs in more species.
format Online
Article
Text
id pubmed-5751039
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-57510392018-01-05 CNVcaller: highly efficient and widely applicable software for detecting copy number variations in large populations Wang, Xihong Zheng, Zhuqing Cai, Yudong Chen, Ting Li, Chao Fu, Weiwei Jiang, Yu Gigascience Technical Note BACKGROUND: The increasing amount of sequencing data available for a wide variety of species can be theoretically used for detecting copy number variations (CNVs) at the population level. However, the growing sample sizes and the divergent complexity of nonhuman genomes challenge the efficiency and robustness of current human-oriented CNV detection methods. RESULTS: Here, we present CNVcaller, a read-depth method for discovering CNVs in population sequencing data. The computational speed of CNVcaller was 1–2 orders of magnitude faster than CNVnator and Genome STRiP for complex genomes with thousands of unmapped scaffolds. CNV detection of 232 goats required only 1.4 days on a single compute node. Additionally, the Mendelian consistency of sheep trios indicated that CNVcaller mitigated the influence of high proportions of gaps and misassembled duplications in the nonhuman reference genome assembly. Furthermore, multiple evaluations using real sheep and human data indicated that CNVcaller achieved the best accuracy and sensitivity for detecting duplications. CONCLUSIONS: The fast generalized detection algorithms included in CNVcaller overcome prior computational barriers for detecting CNVs in large-scale sequencing data with complex genomic structures. Therefore, CNVcaller promotes population genetic analyses of functional CNVs in more species. Oxford University Press 2017-12-04 /pmc/articles/PMC5751039/ /pubmed/29220491 http://dx.doi.org/10.1093/gigascience/gix115 Text en © The Author(s) 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Wang, Xihong
Zheng, Zhuqing
Cai, Yudong
Chen, Ting
Li, Chao
Fu, Weiwei
Jiang, Yu
CNVcaller: highly efficient and widely applicable software for detecting copy number variations in large populations
title CNVcaller: highly efficient and widely applicable software for detecting copy number variations in large populations
title_full CNVcaller: highly efficient and widely applicable software for detecting copy number variations in large populations
title_fullStr CNVcaller: highly efficient and widely applicable software for detecting copy number variations in large populations
title_full_unstemmed CNVcaller: highly efficient and widely applicable software for detecting copy number variations in large populations
title_short CNVcaller: highly efficient and widely applicable software for detecting copy number variations in large populations
title_sort cnvcaller: highly efficient and widely applicable software for detecting copy number variations in large populations
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5751039/
https://www.ncbi.nlm.nih.gov/pubmed/29220491
http://dx.doi.org/10.1093/gigascience/gix115
work_keys_str_mv AT wangxihong cnvcallerhighlyefficientandwidelyapplicablesoftwarefordetectingcopynumbervariationsinlargepopulations
AT zhengzhuqing cnvcallerhighlyefficientandwidelyapplicablesoftwarefordetectingcopynumbervariationsinlargepopulations
AT caiyudong cnvcallerhighlyefficientandwidelyapplicablesoftwarefordetectingcopynumbervariationsinlargepopulations
AT chenting cnvcallerhighlyefficientandwidelyapplicablesoftwarefordetectingcopynumbervariationsinlargepopulations
AT lichao cnvcallerhighlyefficientandwidelyapplicablesoftwarefordetectingcopynumbervariationsinlargepopulations
AT fuweiwei cnvcallerhighlyefficientandwidelyapplicablesoftwarefordetectingcopynumbervariationsinlargepopulations
AT jiangyu cnvcallerhighlyefficientandwidelyapplicablesoftwarefordetectingcopynumbervariationsinlargepopulations