Cargando…

Annotation of structural variants with reported allele frequencies and related metrics from multiple datasets using SVAFotate

BACKGROUND: Identification of deleterious genetic variants using DNA sequencing data relies on increasingly detailed filtering strategies to isolate the small subset of variants that are more likely to underlie a disease phenotype. Datasets reflecting population allele frequencies of different types...

Descripción completa

Detalles Bibliográficos
Autores principales: Nicholas, Thomas J., Cormier, Michael J., Quinlan, Aaron R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9670370/
https://www.ncbi.nlm.nih.gov/pubmed/36384437
http://dx.doi.org/10.1186/s12859-022-05008-y
_version_ 1784832321150517248
author Nicholas, Thomas J.
Cormier, Michael J.
Quinlan, Aaron R.
author_facet Nicholas, Thomas J.
Cormier, Michael J.
Quinlan, Aaron R.
author_sort Nicholas, Thomas J.
collection PubMed
description BACKGROUND: Identification of deleterious genetic variants using DNA sequencing data relies on increasingly detailed filtering strategies to isolate the small subset of variants that are more likely to underlie a disease phenotype. Datasets reflecting population allele frequencies of different types of variants serve as powerful filtering tools, especially in the context of rare disease analysis. While such population-scale allele frequency datasets now exist for structural variants (SVs), it remains a challenge to match SV calls between multiple datasets, thereby complicating estimates of a putative SV's population allele frequency. RESULTS: We introduce SVAFotate, a software tool that enables the annotation of SVs with variant allele frequency and related information from existing SV datasets. As a result, VCF files annotated by SVAFotate offer a variety of metrics to aid in the stratification of SVs as common or rare in the broader human population. CONCLUSIONS: Here we demonstrate the use of SVAFotate in the classification of SVs with regards to their population frequency and illustrate how SVAFotate's annotations can be used to filter and prioritize SVs. Lastly, we detail how best to utilize these SV annotations in the analysis of genetic variation in studies of rare disease. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-05008-y.
format Online
Article
Text
id pubmed-9670370
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-96703702022-11-18 Annotation of structural variants with reported allele frequencies and related metrics from multiple datasets using SVAFotate Nicholas, Thomas J. Cormier, Michael J. Quinlan, Aaron R. BMC Bioinformatics Software BACKGROUND: Identification of deleterious genetic variants using DNA sequencing data relies on increasingly detailed filtering strategies to isolate the small subset of variants that are more likely to underlie a disease phenotype. Datasets reflecting population allele frequencies of different types of variants serve as powerful filtering tools, especially in the context of rare disease analysis. While such population-scale allele frequency datasets now exist for structural variants (SVs), it remains a challenge to match SV calls between multiple datasets, thereby complicating estimates of a putative SV's population allele frequency. RESULTS: We introduce SVAFotate, a software tool that enables the annotation of SVs with variant allele frequency and related information from existing SV datasets. As a result, VCF files annotated by SVAFotate offer a variety of metrics to aid in the stratification of SVs as common or rare in the broader human population. CONCLUSIONS: Here we demonstrate the use of SVAFotate in the classification of SVs with regards to their population frequency and illustrate how SVAFotate's annotations can be used to filter and prioritize SVs. Lastly, we detail how best to utilize these SV annotations in the analysis of genetic variation in studies of rare disease. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-05008-y. BioMed Central 2022-11-16 /pmc/articles/PMC9670370/ /pubmed/36384437 http://dx.doi.org/10.1186/s12859-022-05008-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Nicholas, Thomas J.
Cormier, Michael J.
Quinlan, Aaron R.
Annotation of structural variants with reported allele frequencies and related metrics from multiple datasets using SVAFotate
title Annotation of structural variants with reported allele frequencies and related metrics from multiple datasets using SVAFotate
title_full Annotation of structural variants with reported allele frequencies and related metrics from multiple datasets using SVAFotate
title_fullStr Annotation of structural variants with reported allele frequencies and related metrics from multiple datasets using SVAFotate
title_full_unstemmed Annotation of structural variants with reported allele frequencies and related metrics from multiple datasets using SVAFotate
title_short Annotation of structural variants with reported allele frequencies and related metrics from multiple datasets using SVAFotate
title_sort annotation of structural variants with reported allele frequencies and related metrics from multiple datasets using svafotate
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9670370/
https://www.ncbi.nlm.nih.gov/pubmed/36384437
http://dx.doi.org/10.1186/s12859-022-05008-y
work_keys_str_mv AT nicholasthomasj annotationofstructuralvariantswithreportedallelefrequenciesandrelatedmetricsfrommultipledatasetsusingsvafotate
AT cormiermichaelj annotationofstructuralvariantswithreportedallelefrequenciesandrelatedmetricsfrommultipledatasetsusingsvafotate
AT quinlanaaronr annotationofstructuralvariantswithreportedallelefrequenciesandrelatedmetricsfrommultipledatasetsusingsvafotate