Cargando…

webGQT: A Shiny Server for Genotype Query Tools for Model-Based Variant Filtering

SUMMARY: Genotype Query Tools (GQT) were developed to discover disease-causing variations from billions of genotypes and millions of genomes, processes data at substantially higher speed over other existing methods. While GQT has been available to a wide audience as command-line software, the diffic...

Descripción completa

Detalles Bibliográficos
Autores principales:	Arumilli, Meharji, Layer, Ryan M., Hytönen, Marjo K., Lohi, Hannes
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2020
Materias:	Genetics
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7063093/ https://www.ncbi.nlm.nih.gov/pubmed/32194629 http://dx.doi.org/10.3389/fgene.2020.00152

_version_	1783504644060741632
author	Arumilli, Meharji Layer, Ryan M. Hytönen, Marjo K. Lohi, Hannes
author_facet	Arumilli, Meharji Layer, Ryan M. Hytönen, Marjo K. Lohi, Hannes
author_sort	Arumilli, Meharji
collection	PubMed
description	SUMMARY: Genotype Query Tools (GQT) were developed to discover disease-causing variations from billions of genotypes and millions of genomes, processes data at substantially higher speed over other existing methods. While GQT has been available to a wide audience as command-line software, the difficulty of constructing queries among non-IT or non-bioinformatics researchers has limited its applicability. To overcome this limitation, we developed webGQT, an easy-to-use tool with a graphical user interface. With pre-built queries across three modules, webGQT allows for pedigree analysis, case-control studies, and population frequency studies. As a package, webGQT allows researchers with less or no applied bioinformatics/IT experience to mine potential disease-causing variants from billions. RESULTS: webGQT offers a flexible and easy-to-use interface for model-based candidate variant filtering for Mendelian diseases from thousands to millions of genomes at a reduced computation time. Additionally, webGQT provides adjustable parameters to reduce false positives and rescue missing genotypes across all modules. Using a case study, we demonstrate the applicability of webGQT to query non-human genomes. In addition, we demonstrate the scalability of webGQT on large data sets by implementing complex population-specific queries on the 1000 Genomes Project Phase 3 data set, which includes 8.4 billion variants from 2504 individuals across 26 different populations. Furthermore, webGQT supports filtering single-nucleotide variants, short insertions/deletions, copy number or any other variant genotypes supported by the VCF specification. Our results show that webGQT can be used as an online web service, or deployed on personal computers or local servers within research groups. AVAILABILITY: webGQT is made available to the users in three forms: 1) as a webserver available at https://vm1138.kaj.pouta.csc.fi/webgqt/, 2) as an R package to install on personal computers, and 3) as part of the same R package to configure on the user's own servers. The application is available for installation at https://github.com/arumds/webgqt.
format	Online Article Text
id	pubmed-7063093
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-70630932020-03-19 webGQT: A Shiny Server for Genotype Query Tools for Model-Based Variant Filtering Arumilli, Meharji Layer, Ryan M. Hytönen, Marjo K. Lohi, Hannes Front Genet Genetics SUMMARY: Genotype Query Tools (GQT) were developed to discover disease-causing variations from billions of genotypes and millions of genomes, processes data at substantially higher speed over other existing methods. While GQT has been available to a wide audience as command-line software, the difficulty of constructing queries among non-IT or non-bioinformatics researchers has limited its applicability. To overcome this limitation, we developed webGQT, an easy-to-use tool with a graphical user interface. With pre-built queries across three modules, webGQT allows for pedigree analysis, case-control studies, and population frequency studies. As a package, webGQT allows researchers with less or no applied bioinformatics/IT experience to mine potential disease-causing variants from billions. RESULTS: webGQT offers a flexible and easy-to-use interface for model-based candidate variant filtering for Mendelian diseases from thousands to millions of genomes at a reduced computation time. Additionally, webGQT provides adjustable parameters to reduce false positives and rescue missing genotypes across all modules. Using a case study, we demonstrate the applicability of webGQT to query non-human genomes. In addition, we demonstrate the scalability of webGQT on large data sets by implementing complex population-specific queries on the 1000 Genomes Project Phase 3 data set, which includes 8.4 billion variants from 2504 individuals across 26 different populations. Furthermore, webGQT supports filtering single-nucleotide variants, short insertions/deletions, copy number or any other variant genotypes supported by the VCF specification. Our results show that webGQT can be used as an online web service, or deployed on personal computers or local servers within research groups. AVAILABILITY: webGQT is made available to the users in three forms: 1) as a webserver available at https://vm1138.kaj.pouta.csc.fi/webgqt/, 2) as an R package to install on personal computers, and 3) as part of the same R package to configure on the user's own servers. The application is available for installation at https://github.com/arumds/webgqt. Frontiers Media S.A. 2020-03-03 /pmc/articles/PMC7063093/ /pubmed/32194629 http://dx.doi.org/10.3389/fgene.2020.00152 Text en Copyright © 2020 Arumilli, Layer, Hytönen and Lohi http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Genetics Arumilli, Meharji Layer, Ryan M. Hytönen, Marjo K. Lohi, Hannes webGQT: A Shiny Server for Genotype Query Tools for Model-Based Variant Filtering
title	webGQT: A Shiny Server for Genotype Query Tools for Model-Based Variant Filtering
title_full	webGQT: A Shiny Server for Genotype Query Tools for Model-Based Variant Filtering
title_fullStr	webGQT: A Shiny Server for Genotype Query Tools for Model-Based Variant Filtering
title_full_unstemmed	webGQT: A Shiny Server for Genotype Query Tools for Model-Based Variant Filtering
title_short	webGQT: A Shiny Server for Genotype Query Tools for Model-Based Variant Filtering
title_sort	webgqt: a shiny server for genotype query tools for model-based variant filtering
topic	Genetics
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7063093/ https://www.ncbi.nlm.nih.gov/pubmed/32194629 http://dx.doi.org/10.3389/fgene.2020.00152
work_keys_str_mv	AT arumillimeharji webgqtashinyserverforgenotypequerytoolsformodelbasedvariantfiltering AT layerryanm webgqtashinyserverforgenotypequerytoolsformodelbasedvariantfiltering AT hytonenmarjok webgqtashinyserverforgenotypequerytoolsformodelbasedvariantfiltering AT lohihannes webgqtashinyserverforgenotypequerytoolsformodelbasedvariantfiltering

webGQT: A Shiny Server for Genotype Query Tools for Model-Based Variant Filtering

Ejemplares similares