Cargando…

Loqusdb: added value of an observations database of local genomic variation

BACKGROUND: Exome and genome sequencing is becoming the method of choice for rare disease diagnostics. One of the key challenges remaining is distinguishing the disease causing variants from the benign background variation. After analysis and annotation of the sequencing data there are typically tho...

Descripción completa

Detalles Bibliográficos
Autores principales: Magnusson, Måns, Eisfeldt, Jesper, Nilsson, Daniel, Rosenbaum, Adam, Wirta, Valtteri, Lindstrand, Anna, Wedell, Anna, Stranneheim, Henrik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7329469/
https://www.ncbi.nlm.nih.gov/pubmed/32611382
http://dx.doi.org/10.1186/s12859-020-03609-z
_version_ 1783552910911602688
author Magnusson, Måns
Eisfeldt, Jesper
Nilsson, Daniel
Rosenbaum, Adam
Wirta, Valtteri
Lindstrand, Anna
Wedell, Anna
Stranneheim, Henrik
author_facet Magnusson, Måns
Eisfeldt, Jesper
Nilsson, Daniel
Rosenbaum, Adam
Wirta, Valtteri
Lindstrand, Anna
Wedell, Anna
Stranneheim, Henrik
author_sort Magnusson, Måns
collection PubMed
description BACKGROUND: Exome and genome sequencing is becoming the method of choice for rare disease diagnostics. One of the key challenges remaining is distinguishing the disease causing variants from the benign background variation. After analysis and annotation of the sequencing data there are typically thousands of candidate variants requiring further investigation. One of the most effective and least biased ways to reduce this number is to assess the rarity of a variant in any population. Currently, there are a number of reliable sources of information for major population frequencies when considering single nucleotide variants (SNVs) and small insertion and deletions (INDELs), with gnomAD as the most prominent public resource available. However, local variation or frequencies in sub-populations may be underrepresented in these public resources. In contrast, for structural variation (SV), the background frequency in the general population is more or less unknown mostly due to challenges in calling SVs in a consistent way. Keeping track of local variation is one way to overcome these problems and significantly reduce the number of potential disease causing variants retained for manual inspection, both for SNVs and SVs. RESULTS: Here, we present loqusdb, a tool to solve the challenge of keeping track of any type of variant observations from genome sequencing data. Loqusdb was designed to handle a large flow of samples and unlike other solutions, samples can be added continuously to the database without rebuilding it, facilitating improvements and additions. We assessed the added value of a local observations database using 98 samples annotated with information from a background of 888 unrelated individuals. CONCLUSIONS: We show both how powerful SV analysis can be when filtering for population frequencies and how the number of apparently rare SNVs/INDELs can be reduced by adding local population information even after annotating the data with other large frequency databases, such as gnomAD. In conclusion, we show that a local frequency database is an attractive, and a necessary addition to the publicly available databases that facilitate the analysis of exome and genome data in a clinical setting.
format Online
Article
Text
id pubmed-7329469
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-73294692020-07-02 Loqusdb: added value of an observations database of local genomic variation Magnusson, Måns Eisfeldt, Jesper Nilsson, Daniel Rosenbaum, Adam Wirta, Valtteri Lindstrand, Anna Wedell, Anna Stranneheim, Henrik BMC Bioinformatics Software BACKGROUND: Exome and genome sequencing is becoming the method of choice for rare disease diagnostics. One of the key challenges remaining is distinguishing the disease causing variants from the benign background variation. After analysis and annotation of the sequencing data there are typically thousands of candidate variants requiring further investigation. One of the most effective and least biased ways to reduce this number is to assess the rarity of a variant in any population. Currently, there are a number of reliable sources of information for major population frequencies when considering single nucleotide variants (SNVs) and small insertion and deletions (INDELs), with gnomAD as the most prominent public resource available. However, local variation or frequencies in sub-populations may be underrepresented in these public resources. In contrast, for structural variation (SV), the background frequency in the general population is more or less unknown mostly due to challenges in calling SVs in a consistent way. Keeping track of local variation is one way to overcome these problems and significantly reduce the number of potential disease causing variants retained for manual inspection, both for SNVs and SVs. RESULTS: Here, we present loqusdb, a tool to solve the challenge of keeping track of any type of variant observations from genome sequencing data. Loqusdb was designed to handle a large flow of samples and unlike other solutions, samples can be added continuously to the database without rebuilding it, facilitating improvements and additions. We assessed the added value of a local observations database using 98 samples annotated with information from a background of 888 unrelated individuals. CONCLUSIONS: We show both how powerful SV analysis can be when filtering for population frequencies and how the number of apparently rare SNVs/INDELs can be reduced by adding local population information even after annotating the data with other large frequency databases, such as gnomAD. In conclusion, we show that a local frequency database is an attractive, and a necessary addition to the publicly available databases that facilitate the analysis of exome and genome data in a clinical setting. BioMed Central 2020-07-01 /pmc/articles/PMC7329469/ /pubmed/32611382 http://dx.doi.org/10.1186/s12859-020-03609-z Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Magnusson, Måns
Eisfeldt, Jesper
Nilsson, Daniel
Rosenbaum, Adam
Wirta, Valtteri
Lindstrand, Anna
Wedell, Anna
Stranneheim, Henrik
Loqusdb: added value of an observations database of local genomic variation
title Loqusdb: added value of an observations database of local genomic variation
title_full Loqusdb: added value of an observations database of local genomic variation
title_fullStr Loqusdb: added value of an observations database of local genomic variation
title_full_unstemmed Loqusdb: added value of an observations database of local genomic variation
title_short Loqusdb: added value of an observations database of local genomic variation
title_sort loqusdb: added value of an observations database of local genomic variation
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7329469/
https://www.ncbi.nlm.nih.gov/pubmed/32611382
http://dx.doi.org/10.1186/s12859-020-03609-z
work_keys_str_mv AT magnussonmans loqusdbaddedvalueofanobservationsdatabaseoflocalgenomicvariation
AT eisfeldtjesper loqusdbaddedvalueofanobservationsdatabaseoflocalgenomicvariation
AT nilssondaniel loqusdbaddedvalueofanobservationsdatabaseoflocalgenomicvariation
AT rosenbaumadam loqusdbaddedvalueofanobservationsdatabaseoflocalgenomicvariation
AT wirtavaltteri loqusdbaddedvalueofanobservationsdatabaseoflocalgenomicvariation
AT lindstrandanna loqusdbaddedvalueofanobservationsdatabaseoflocalgenomicvariation
AT wedellanna loqusdbaddedvalueofanobservationsdatabaseoflocalgenomicvariation
AT stranneheimhenrik loqusdbaddedvalueofanobservationsdatabaseoflocalgenomicvariation