Cargando…
webGQT: A Shiny Server for Genotype Query Tools for Model-Based Variant Filtering
SUMMARY: Genotype Query Tools (GQT) were developed to discover disease-causing variations from billions of genotypes and millions of genomes, processes data at substantially higher speed over other existing methods. While GQT has been available to a wide audience as command-line software, the diffic...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7063093/ https://www.ncbi.nlm.nih.gov/pubmed/32194629 http://dx.doi.org/10.3389/fgene.2020.00152 |
_version_ | 1783504644060741632 |
---|---|
author | Arumilli, Meharji Layer, Ryan M. Hytönen, Marjo K. Lohi, Hannes |
author_facet | Arumilli, Meharji Layer, Ryan M. Hytönen, Marjo K. Lohi, Hannes |
author_sort | Arumilli, Meharji |
collection | PubMed |
description | SUMMARY: Genotype Query Tools (GQT) were developed to discover disease-causing variations from billions of genotypes and millions of genomes, processes data at substantially higher speed over other existing methods. While GQT has been available to a wide audience as command-line software, the difficulty of constructing queries among non-IT or non-bioinformatics researchers has limited its applicability. To overcome this limitation, we developed webGQT, an easy-to-use tool with a graphical user interface. With pre-built queries across three modules, webGQT allows for pedigree analysis, case-control studies, and population frequency studies. As a package, webGQT allows researchers with less or no applied bioinformatics/IT experience to mine potential disease-causing variants from billions. RESULTS: webGQT offers a flexible and easy-to-use interface for model-based candidate variant filtering for Mendelian diseases from thousands to millions of genomes at a reduced computation time. Additionally, webGQT provides adjustable parameters to reduce false positives and rescue missing genotypes across all modules. Using a case study, we demonstrate the applicability of webGQT to query non-human genomes. In addition, we demonstrate the scalability of webGQT on large data sets by implementing complex population-specific queries on the 1000 Genomes Project Phase 3 data set, which includes 8.4 billion variants from 2504 individuals across 26 different populations. Furthermore, webGQT supports filtering single-nucleotide variants, short insertions/deletions, copy number or any other variant genotypes supported by the VCF specification. Our results show that webGQT can be used as an online web service, or deployed on personal computers or local servers within research groups. AVAILABILITY: webGQT is made available to the users in three forms: 1) as a webserver available at https://vm1138.kaj.pouta.csc.fi/webgqt/, 2) as an R package to install on personal computers, and 3) as part of the same R package to configure on the user's own servers. The application is available for installation at https://github.com/arumds/webgqt. |
format | Online Article Text |
id | pubmed-7063093 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-70630932020-03-19 webGQT: A Shiny Server for Genotype Query Tools for Model-Based Variant Filtering Arumilli, Meharji Layer, Ryan M. Hytönen, Marjo K. Lohi, Hannes Front Genet Genetics SUMMARY: Genotype Query Tools (GQT) were developed to discover disease-causing variations from billions of genotypes and millions of genomes, processes data at substantially higher speed over other existing methods. While GQT has been available to a wide audience as command-line software, the difficulty of constructing queries among non-IT or non-bioinformatics researchers has limited its applicability. To overcome this limitation, we developed webGQT, an easy-to-use tool with a graphical user interface. With pre-built queries across three modules, webGQT allows for pedigree analysis, case-control studies, and population frequency studies. As a package, webGQT allows researchers with less or no applied bioinformatics/IT experience to mine potential disease-causing variants from billions. RESULTS: webGQT offers a flexible and easy-to-use interface for model-based candidate variant filtering for Mendelian diseases from thousands to millions of genomes at a reduced computation time. Additionally, webGQT provides adjustable parameters to reduce false positives and rescue missing genotypes across all modules. Using a case study, we demonstrate the applicability of webGQT to query non-human genomes. In addition, we demonstrate the scalability of webGQT on large data sets by implementing complex population-specific queries on the 1000 Genomes Project Phase 3 data set, which includes 8.4 billion variants from 2504 individuals across 26 different populations. Furthermore, webGQT supports filtering single-nucleotide variants, short insertions/deletions, copy number or any other variant genotypes supported by the VCF specification. Our results show that webGQT can be used as an online web service, or deployed on personal computers or local servers within research groups. AVAILABILITY: webGQT is made available to the users in three forms: 1) as a webserver available at https://vm1138.kaj.pouta.csc.fi/webgqt/, 2) as an R package to install on personal computers, and 3) as part of the same R package to configure on the user's own servers. The application is available for installation at https://github.com/arumds/webgqt. Frontiers Media S.A. 2020-03-03 /pmc/articles/PMC7063093/ /pubmed/32194629 http://dx.doi.org/10.3389/fgene.2020.00152 Text en Copyright © 2020 Arumilli, Layer, Hytönen and Lohi http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Arumilli, Meharji Layer, Ryan M. Hytönen, Marjo K. Lohi, Hannes webGQT: A Shiny Server for Genotype Query Tools for Model-Based Variant Filtering |
title | webGQT: A Shiny Server for Genotype Query Tools for Model-Based Variant Filtering |
title_full | webGQT: A Shiny Server for Genotype Query Tools for Model-Based Variant Filtering |
title_fullStr | webGQT: A Shiny Server for Genotype Query Tools for Model-Based Variant Filtering |
title_full_unstemmed | webGQT: A Shiny Server for Genotype Query Tools for Model-Based Variant Filtering |
title_short | webGQT: A Shiny Server for Genotype Query Tools for Model-Based Variant Filtering |
title_sort | webgqt: a shiny server for genotype query tools for model-based variant filtering |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7063093/ https://www.ncbi.nlm.nih.gov/pubmed/32194629 http://dx.doi.org/10.3389/fgene.2020.00152 |
work_keys_str_mv | AT arumillimeharji webgqtashinyserverforgenotypequerytoolsformodelbasedvariantfiltering AT layerryanm webgqtashinyserverforgenotypequerytoolsformodelbasedvariantfiltering AT hytonenmarjok webgqtashinyserverforgenotypequerytoolsformodelbasedvariantfiltering AT lohihannes webgqtashinyserverforgenotypequerytoolsformodelbasedvariantfiltering |