Cargando…
ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data
BACKGROUND: A wide range of tools are available for the detection of copy number variants (CNVs) from whole-genome sequencing (WGS) data. However, none of them focus on clinically-relevant CNVs, such as those that are associated with known genetic syndromes. Such variants are often large in size, ty...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9930243/ https://www.ncbi.nlm.nih.gov/pubmed/36792982 http://dx.doi.org/10.1186/s12859-023-05154-x |
_version_ | 1784889010741575680 |
---|---|
author | Gudkov, Mikhail Thibaut, Loïc Khushi, Matloob Blue, Gillian M. Winlaw, David S. Dunwoodie, Sally L. Giannoulatou, Eleni |
author_facet | Gudkov, Mikhail Thibaut, Loïc Khushi, Matloob Blue, Gillian M. Winlaw, David S. Dunwoodie, Sally L. Giannoulatou, Eleni |
author_sort | Gudkov, Mikhail |
collection | PubMed |
description | BACKGROUND: A wide range of tools are available for the detection of copy number variants (CNVs) from whole-genome sequencing (WGS) data. However, none of them focus on clinically-relevant CNVs, such as those that are associated with known genetic syndromes. Such variants are often large in size, typically 1–5 Mb, but currently available CNV callers have been developed and benchmarked for the discovery of smaller variants. Thus, the ability of these programs to detect tens of real syndromic CNVs remains largely unknown. RESULTS: Here we present ConanVarvar, a tool which implements a complete workflow for the targeted analysis of large germline CNVs from WGS data. ConanVarvar comes with an intuitive R Shiny graphical user interface and annotates identified variants with information about 56 associated syndromic conditions. We benchmarked ConanVarvar and four other programs on a dataset containing real and simulated syndromic CNVs larger than 1 Mb. In comparison to other tools, ConanVarvar reports 10–30 times less false-positive variants without compromising sensitivity and is quicker to run, especially on large batches of samples. CONCLUSIONS: ConanVarvar is a useful instrument for primary analysis in disease sequencing studies, where large CNVs could be the cause of disease. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05154-x. |
format | Online Article Text |
id | pubmed-9930243 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-99302432023-02-16 ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data Gudkov, Mikhail Thibaut, Loïc Khushi, Matloob Blue, Gillian M. Winlaw, David S. Dunwoodie, Sally L. Giannoulatou, Eleni BMC Bioinformatics Software BACKGROUND: A wide range of tools are available for the detection of copy number variants (CNVs) from whole-genome sequencing (WGS) data. However, none of them focus on clinically-relevant CNVs, such as those that are associated with known genetic syndromes. Such variants are often large in size, typically 1–5 Mb, but currently available CNV callers have been developed and benchmarked for the discovery of smaller variants. Thus, the ability of these programs to detect tens of real syndromic CNVs remains largely unknown. RESULTS: Here we present ConanVarvar, a tool which implements a complete workflow for the targeted analysis of large germline CNVs from WGS data. ConanVarvar comes with an intuitive R Shiny graphical user interface and annotates identified variants with information about 56 associated syndromic conditions. We benchmarked ConanVarvar and four other programs on a dataset containing real and simulated syndromic CNVs larger than 1 Mb. In comparison to other tools, ConanVarvar reports 10–30 times less false-positive variants without compromising sensitivity and is quicker to run, especially on large batches of samples. CONCLUSIONS: ConanVarvar is a useful instrument for primary analysis in disease sequencing studies, where large CNVs could be the cause of disease. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05154-x. BioMed Central 2023-02-15 /pmc/articles/PMC9930243/ /pubmed/36792982 http://dx.doi.org/10.1186/s12859-023-05154-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Software Gudkov, Mikhail Thibaut, Loïc Khushi, Matloob Blue, Gillian M. Winlaw, David S. Dunwoodie, Sally L. Giannoulatou, Eleni ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data |
title | ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data |
title_full | ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data |
title_fullStr | ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data |
title_full_unstemmed | ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data |
title_short | ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data |
title_sort | conanvarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9930243/ https://www.ncbi.nlm.nih.gov/pubmed/36792982 http://dx.doi.org/10.1186/s12859-023-05154-x |
work_keys_str_mv | AT gudkovmikhail conanvarvaraversatiletoolforthedetectionoflargesyndromiccopynumbervariationfromwholegenomesequencingdata AT thibautloic conanvarvaraversatiletoolforthedetectionoflargesyndromiccopynumbervariationfromwholegenomesequencingdata AT khushimatloob conanvarvaraversatiletoolforthedetectionoflargesyndromiccopynumbervariationfromwholegenomesequencingdata AT bluegillianm conanvarvaraversatiletoolforthedetectionoflargesyndromiccopynumbervariationfromwholegenomesequencingdata AT winlawdavids conanvarvaraversatiletoolforthedetectionoflargesyndromiccopynumbervariationfromwholegenomesequencingdata AT dunwoodiesallyl conanvarvaraversatiletoolforthedetectionoflargesyndromiccopynumbervariationfromwholegenomesequencingdata AT giannoulatoueleni conanvarvaraversatiletoolforthedetectionoflargesyndromiccopynumbervariationfromwholegenomesequencingdata |