Cargando…
Loqusdb: added value of an observations database of local genomic variation
BACKGROUND: Exome and genome sequencing is becoming the method of choice for rare disease diagnostics. One of the key challenges remaining is distinguishing the disease causing variants from the benign background variation. After analysis and annotation of the sequencing data there are typically tho...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7329469/ https://www.ncbi.nlm.nih.gov/pubmed/32611382 http://dx.doi.org/10.1186/s12859-020-03609-z |
_version_ | 1783552910911602688 |
---|---|
author | Magnusson, Måns Eisfeldt, Jesper Nilsson, Daniel Rosenbaum, Adam Wirta, Valtteri Lindstrand, Anna Wedell, Anna Stranneheim, Henrik |
author_facet | Magnusson, Måns Eisfeldt, Jesper Nilsson, Daniel Rosenbaum, Adam Wirta, Valtteri Lindstrand, Anna Wedell, Anna Stranneheim, Henrik |
author_sort | Magnusson, Måns |
collection | PubMed |
description | BACKGROUND: Exome and genome sequencing is becoming the method of choice for rare disease diagnostics. One of the key challenges remaining is distinguishing the disease causing variants from the benign background variation. After analysis and annotation of the sequencing data there are typically thousands of candidate variants requiring further investigation. One of the most effective and least biased ways to reduce this number is to assess the rarity of a variant in any population. Currently, there are a number of reliable sources of information for major population frequencies when considering single nucleotide variants (SNVs) and small insertion and deletions (INDELs), with gnomAD as the most prominent public resource available. However, local variation or frequencies in sub-populations may be underrepresented in these public resources. In contrast, for structural variation (SV), the background frequency in the general population is more or less unknown mostly due to challenges in calling SVs in a consistent way. Keeping track of local variation is one way to overcome these problems and significantly reduce the number of potential disease causing variants retained for manual inspection, both for SNVs and SVs. RESULTS: Here, we present loqusdb, a tool to solve the challenge of keeping track of any type of variant observations from genome sequencing data. Loqusdb was designed to handle a large flow of samples and unlike other solutions, samples can be added continuously to the database without rebuilding it, facilitating improvements and additions. We assessed the added value of a local observations database using 98 samples annotated with information from a background of 888 unrelated individuals. CONCLUSIONS: We show both how powerful SV analysis can be when filtering for population frequencies and how the number of apparently rare SNVs/INDELs can be reduced by adding local population information even after annotating the data with other large frequency databases, such as gnomAD. In conclusion, we show that a local frequency database is an attractive, and a necessary addition to the publicly available databases that facilitate the analysis of exome and genome data in a clinical setting. |
format | Online Article Text |
id | pubmed-7329469 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-73294692020-07-02 Loqusdb: added value of an observations database of local genomic variation Magnusson, Måns Eisfeldt, Jesper Nilsson, Daniel Rosenbaum, Adam Wirta, Valtteri Lindstrand, Anna Wedell, Anna Stranneheim, Henrik BMC Bioinformatics Software BACKGROUND: Exome and genome sequencing is becoming the method of choice for rare disease diagnostics. One of the key challenges remaining is distinguishing the disease causing variants from the benign background variation. After analysis and annotation of the sequencing data there are typically thousands of candidate variants requiring further investigation. One of the most effective and least biased ways to reduce this number is to assess the rarity of a variant in any population. Currently, there are a number of reliable sources of information for major population frequencies when considering single nucleotide variants (SNVs) and small insertion and deletions (INDELs), with gnomAD as the most prominent public resource available. However, local variation or frequencies in sub-populations may be underrepresented in these public resources. In contrast, for structural variation (SV), the background frequency in the general population is more or less unknown mostly due to challenges in calling SVs in a consistent way. Keeping track of local variation is one way to overcome these problems and significantly reduce the number of potential disease causing variants retained for manual inspection, both for SNVs and SVs. RESULTS: Here, we present loqusdb, a tool to solve the challenge of keeping track of any type of variant observations from genome sequencing data. Loqusdb was designed to handle a large flow of samples and unlike other solutions, samples can be added continuously to the database without rebuilding it, facilitating improvements and additions. We assessed the added value of a local observations database using 98 samples annotated with information from a background of 888 unrelated individuals. CONCLUSIONS: We show both how powerful SV analysis can be when filtering for population frequencies and how the number of apparently rare SNVs/INDELs can be reduced by adding local population information even after annotating the data with other large frequency databases, such as gnomAD. In conclusion, we show that a local frequency database is an attractive, and a necessary addition to the publicly available databases that facilitate the analysis of exome and genome data in a clinical setting. BioMed Central 2020-07-01 /pmc/articles/PMC7329469/ /pubmed/32611382 http://dx.doi.org/10.1186/s12859-020-03609-z Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Software Magnusson, Måns Eisfeldt, Jesper Nilsson, Daniel Rosenbaum, Adam Wirta, Valtteri Lindstrand, Anna Wedell, Anna Stranneheim, Henrik Loqusdb: added value of an observations database of local genomic variation |
title | Loqusdb: added value of an observations database of local genomic variation |
title_full | Loqusdb: added value of an observations database of local genomic variation |
title_fullStr | Loqusdb: added value of an observations database of local genomic variation |
title_full_unstemmed | Loqusdb: added value of an observations database of local genomic variation |
title_short | Loqusdb: added value of an observations database of local genomic variation |
title_sort | loqusdb: added value of an observations database of local genomic variation |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7329469/ https://www.ncbi.nlm.nih.gov/pubmed/32611382 http://dx.doi.org/10.1186/s12859-020-03609-z |
work_keys_str_mv | AT magnussonmans loqusdbaddedvalueofanobservationsdatabaseoflocalgenomicvariation AT eisfeldtjesper loqusdbaddedvalueofanobservationsdatabaseoflocalgenomicvariation AT nilssondaniel loqusdbaddedvalueofanobservationsdatabaseoflocalgenomicvariation AT rosenbaumadam loqusdbaddedvalueofanobservationsdatabaseoflocalgenomicvariation AT wirtavaltteri loqusdbaddedvalueofanobservationsdatabaseoflocalgenomicvariation AT lindstrandanna loqusdbaddedvalueofanobservationsdatabaseoflocalgenomicvariation AT wedellanna loqusdbaddedvalueofanobservationsdatabaseoflocalgenomicvariation AT stranneheimhenrik loqusdbaddedvalueofanobservationsdatabaseoflocalgenomicvariation |