Cargando…

ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data

BACKGROUND: A wide range of tools are available for the detection of copy number variants (CNVs) from whole-genome sequencing (WGS) data. However, none of them focus on clinically-relevant CNVs, such as those that are associated with known genetic syndromes. Such variants are often large in size, ty...

Descripción completa

Detalles Bibliográficos
Autores principales: Gudkov, Mikhail, Thibaut, Loïc, Khushi, Matloob, Blue, Gillian M., Winlaw, David S., Dunwoodie, Sally L., Giannoulatou, Eleni
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9930243/
https://www.ncbi.nlm.nih.gov/pubmed/36792982
http://dx.doi.org/10.1186/s12859-023-05154-x
_version_ 1784889010741575680
author Gudkov, Mikhail
Thibaut, Loïc
Khushi, Matloob
Blue, Gillian M.
Winlaw, David S.
Dunwoodie, Sally L.
Giannoulatou, Eleni
author_facet Gudkov, Mikhail
Thibaut, Loïc
Khushi, Matloob
Blue, Gillian M.
Winlaw, David S.
Dunwoodie, Sally L.
Giannoulatou, Eleni
author_sort Gudkov, Mikhail
collection PubMed
description BACKGROUND: A wide range of tools are available for the detection of copy number variants (CNVs) from whole-genome sequencing (WGS) data. However, none of them focus on clinically-relevant CNVs, such as those that are associated with known genetic syndromes. Such variants are often large in size, typically 1–5 Mb, but currently available CNV callers have been developed and benchmarked for the discovery of smaller variants. Thus, the ability of these programs to detect tens of real syndromic CNVs remains largely unknown. RESULTS: Here we present ConanVarvar, a tool which implements a complete workflow for the targeted analysis of large germline CNVs from WGS data. ConanVarvar comes with an intuitive R Shiny graphical user interface and annotates identified variants with information about 56 associated syndromic conditions. We benchmarked ConanVarvar and four other programs on a dataset containing real and simulated syndromic CNVs larger than 1 Mb. In comparison to other tools, ConanVarvar reports 10–30 times less false-positive variants without compromising sensitivity and is quicker to run, especially on large batches of samples. CONCLUSIONS: ConanVarvar is a useful instrument for primary analysis in disease sequencing studies, where large CNVs could be the cause of disease. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05154-x.
format Online
Article
Text
id pubmed-9930243
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-99302432023-02-16 ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data Gudkov, Mikhail Thibaut, Loïc Khushi, Matloob Blue, Gillian M. Winlaw, David S. Dunwoodie, Sally L. Giannoulatou, Eleni BMC Bioinformatics Software BACKGROUND: A wide range of tools are available for the detection of copy number variants (CNVs) from whole-genome sequencing (WGS) data. However, none of them focus on clinically-relevant CNVs, such as those that are associated with known genetic syndromes. Such variants are often large in size, typically 1–5 Mb, but currently available CNV callers have been developed and benchmarked for the discovery of smaller variants. Thus, the ability of these programs to detect tens of real syndromic CNVs remains largely unknown. RESULTS: Here we present ConanVarvar, a tool which implements a complete workflow for the targeted analysis of large germline CNVs from WGS data. ConanVarvar comes with an intuitive R Shiny graphical user interface and annotates identified variants with information about 56 associated syndromic conditions. We benchmarked ConanVarvar and four other programs on a dataset containing real and simulated syndromic CNVs larger than 1 Mb. In comparison to other tools, ConanVarvar reports 10–30 times less false-positive variants without compromising sensitivity and is quicker to run, especially on large batches of samples. CONCLUSIONS: ConanVarvar is a useful instrument for primary analysis in disease sequencing studies, where large CNVs could be the cause of disease. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05154-x. BioMed Central 2023-02-15 /pmc/articles/PMC9930243/ /pubmed/36792982 http://dx.doi.org/10.1186/s12859-023-05154-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Gudkov, Mikhail
Thibaut, Loïc
Khushi, Matloob
Blue, Gillian M.
Winlaw, David S.
Dunwoodie, Sally L.
Giannoulatou, Eleni
ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data
title ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data
title_full ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data
title_fullStr ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data
title_full_unstemmed ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data
title_short ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data
title_sort conanvarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9930243/
https://www.ncbi.nlm.nih.gov/pubmed/36792982
http://dx.doi.org/10.1186/s12859-023-05154-x
work_keys_str_mv AT gudkovmikhail conanvarvaraversatiletoolforthedetectionoflargesyndromiccopynumbervariationfromwholegenomesequencingdata
AT thibautloic conanvarvaraversatiletoolforthedetectionoflargesyndromiccopynumbervariationfromwholegenomesequencingdata
AT khushimatloob conanvarvaraversatiletoolforthedetectionoflargesyndromiccopynumbervariationfromwholegenomesequencingdata
AT bluegillianm conanvarvaraversatiletoolforthedetectionoflargesyndromiccopynumbervariationfromwholegenomesequencingdata
AT winlawdavids conanvarvaraversatiletoolforthedetectionoflargesyndromiccopynumbervariationfromwholegenomesequencingdata
AT dunwoodiesallyl conanvarvaraversatiletoolforthedetectionoflargesyndromiccopynumbervariationfromwholegenomesequencingdata
AT giannoulatoueleni conanvarvaraversatiletoolforthedetectionoflargesyndromiccopynumbervariationfromwholegenomesequencingdata