Cargando…

VAS: a convenient web portal for efficient integration of genomic features with millions of genetic variants

BACKGROUND: High-throughput experimental methods have fostered the systematic detection of millions of genetic variants from any human genome. To help explore the potential biological implications of these genetic variants, software tools have been previously developed for integrating various types...

Descripción completa

Detalles Bibliográficos
Autores principales: Ho, Eric Dun, Cao, Qin, Lee, Sau Dan, Yip, Kevin Y
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4210471/
https://www.ncbi.nlm.nih.gov/pubmed/25306238
http://dx.doi.org/10.1186/1471-2164-15-886
_version_ 1782341376808058880
author Ho, Eric Dun
Cao, Qin
Lee, Sau Dan
Yip, Kevin Y
author_facet Ho, Eric Dun
Cao, Qin
Lee, Sau Dan
Yip, Kevin Y
author_sort Ho, Eric Dun
collection PubMed
description BACKGROUND: High-throughput experimental methods have fostered the systematic detection of millions of genetic variants from any human genome. To help explore the potential biological implications of these genetic variants, software tools have been previously developed for integrating various types of information about these genomic regions from multiple data sources. Most of these tools were designed either for studying a small number of variants at a time, or for local execution on powerful machines. RESULTS: To make exploration of whole lists of genetic variants simple and accessible, we have developed a new Web-based system called VAS (Variant Annotation System, available at https://yiplab.cse.cuhk.edu.hk/vas/). It provides a large variety of information useful for studying both coding and non-coding variants, including whole-genome transcription factor binding, open chromatin and transcription data from the ENCODE consortium. By means of data compression, millions of variants can be uploaded from a client machine to the server in less than 50 megabytes of data. On the server side, our customized data integration algorithms can efficiently link millions of variants with tens of whole-genome datasets. These two enabling technologies make VAS a practical tool for annotating genetic variants from large genomic studies. We demonstrate the use of VAS in annotating genetic variants obtained from a migraine meta-analysis study and multiple data sets from the Personal Genomes Project. We also compare the running time of annotating 6.4 million SNPs of the CEU trio by VAS and another tool, showing that VAS is efficient in handling new variant lists without requiring any pre-computations. CONCLUSIONS: VAS is specially designed to handle annotation tasks with long lists of genetic variants and large numbers of annotating features efficiently. It is complementary to other existing tools with more specific aims such as evaluating the potential impacts of genetic variants in terms of disease risk. We recommend using VAS for a quick first-pass identification of potentially interesting genetic variants, to minimize the time required for other more in-depth downstream analyses.
format Online
Article
Text
id pubmed-4210471
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42104712014-10-29 VAS: a convenient web portal for efficient integration of genomic features with millions of genetic variants Ho, Eric Dun Cao, Qin Lee, Sau Dan Yip, Kevin Y BMC Genomics Software BACKGROUND: High-throughput experimental methods have fostered the systematic detection of millions of genetic variants from any human genome. To help explore the potential biological implications of these genetic variants, software tools have been previously developed for integrating various types of information about these genomic regions from multiple data sources. Most of these tools were designed either for studying a small number of variants at a time, or for local execution on powerful machines. RESULTS: To make exploration of whole lists of genetic variants simple and accessible, we have developed a new Web-based system called VAS (Variant Annotation System, available at https://yiplab.cse.cuhk.edu.hk/vas/). It provides a large variety of information useful for studying both coding and non-coding variants, including whole-genome transcription factor binding, open chromatin and transcription data from the ENCODE consortium. By means of data compression, millions of variants can be uploaded from a client machine to the server in less than 50 megabytes of data. On the server side, our customized data integration algorithms can efficiently link millions of variants with tens of whole-genome datasets. These two enabling technologies make VAS a practical tool for annotating genetic variants from large genomic studies. We demonstrate the use of VAS in annotating genetic variants obtained from a migraine meta-analysis study and multiple data sets from the Personal Genomes Project. We also compare the running time of annotating 6.4 million SNPs of the CEU trio by VAS and another tool, showing that VAS is efficient in handling new variant lists without requiring any pre-computations. CONCLUSIONS: VAS is specially designed to handle annotation tasks with long lists of genetic variants and large numbers of annotating features efficiently. It is complementary to other existing tools with more specific aims such as evaluating the potential impacts of genetic variants in terms of disease risk. We recommend using VAS for a quick first-pass identification of potentially interesting genetic variants, to minimize the time required for other more in-depth downstream analyses. BioMed Central 2014-10-11 /pmc/articles/PMC4210471/ /pubmed/25306238 http://dx.doi.org/10.1186/1471-2164-15-886 Text en © Ho et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Ho, Eric Dun
Cao, Qin
Lee, Sau Dan
Yip, Kevin Y
VAS: a convenient web portal for efficient integration of genomic features with millions of genetic variants
title VAS: a convenient web portal for efficient integration of genomic features with millions of genetic variants
title_full VAS: a convenient web portal for efficient integration of genomic features with millions of genetic variants
title_fullStr VAS: a convenient web portal for efficient integration of genomic features with millions of genetic variants
title_full_unstemmed VAS: a convenient web portal for efficient integration of genomic features with millions of genetic variants
title_short VAS: a convenient web portal for efficient integration of genomic features with millions of genetic variants
title_sort vas: a convenient web portal for efficient integration of genomic features with millions of genetic variants
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4210471/
https://www.ncbi.nlm.nih.gov/pubmed/25306238
http://dx.doi.org/10.1186/1471-2164-15-886
work_keys_str_mv AT hoericdun vasaconvenientwebportalforefficientintegrationofgenomicfeatureswithmillionsofgeneticvariants
AT caoqin vasaconvenientwebportalforefficientintegrationofgenomicfeatureswithmillionsofgeneticvariants
AT leesaudan vasaconvenientwebportalforefficientintegrationofgenomicfeatureswithmillionsofgeneticvariants
AT yipkeviny vasaconvenientwebportalforefficientintegrationofgenomicfeatureswithmillionsofgeneticvariants