Cargando…

SEQMINER: An R‐Package to Facilitate the Functional Interpretation of Sequence‐Based Associations

Next‐generation sequencing has enabled the study of a comprehensive catalogue of genetic variants for their impact on various complex diseases. Numerous consortia studies of complex traits have publically released their summary association statistics, which have become an invaluable resource for lea...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhan, Xiaowei, Liu, Dajiang J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4794281/
https://www.ncbi.nlm.nih.gov/pubmed/26394715
http://dx.doi.org/10.1002/gepi.21918
_version_ 1782421464384798720
author Zhan, Xiaowei
Liu, Dajiang J.
author_facet Zhan, Xiaowei
Liu, Dajiang J.
author_sort Zhan, Xiaowei
collection PubMed
description Next‐generation sequencing has enabled the study of a comprehensive catalogue of genetic variants for their impact on various complex diseases. Numerous consortia studies of complex traits have publically released their summary association statistics, which have become an invaluable resource for learning the underlying biology, understanding the genetic architecture, and guiding clinical translations. There is great interest in the field in developing novel statistical methods for analyzing and interpreting results from these genotype‐phenotype association studies. One popular platform for method development and data analysis is R. In order to enable these analyses in R, it is necessary to develop packages that can efficiently query files of summary association statistics, explore the linkage disequilibrium structure between variants, and integrate various bioinformatics databases. The complexity and scale of sequence datasets and databases pose significant computational challenges for method developers. To address these challenges and facilitate method development, we developed the R package SEQMINER for annotating and querying files of sequence variants (e.g., VCF/BCF files) and summary association statistics (e.g., METAL/RAREMETAL files), and for integrating bioinformatics databases. SEQMINER provides an infrastructure where novel methods can be distributed and applied to analyzing sequence datasets in practice. We illustrate the performance of SEQMINER using datasets from the 1000 Genomes Project. We show that SEQMINER is highly efficient and easy to use. It will greatly accelerate the process of applying statistical innovations to analyze and interpret sequence‐based associations. The R package, its source code and documentations are available from http://cran.r‐project.org/web/packages/seqminer and http://seqminer.genomic.codes/.
format Online
Article
Text
id pubmed-4794281
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-47942812016-03-16 SEQMINER: An R‐Package to Facilitate the Functional Interpretation of Sequence‐Based Associations Zhan, Xiaowei Liu, Dajiang J. Genet Epidemiol Research Articles Next‐generation sequencing has enabled the study of a comprehensive catalogue of genetic variants for their impact on various complex diseases. Numerous consortia studies of complex traits have publically released their summary association statistics, which have become an invaluable resource for learning the underlying biology, understanding the genetic architecture, and guiding clinical translations. There is great interest in the field in developing novel statistical methods for analyzing and interpreting results from these genotype‐phenotype association studies. One popular platform for method development and data analysis is R. In order to enable these analyses in R, it is necessary to develop packages that can efficiently query files of summary association statistics, explore the linkage disequilibrium structure between variants, and integrate various bioinformatics databases. The complexity and scale of sequence datasets and databases pose significant computational challenges for method developers. To address these challenges and facilitate method development, we developed the R package SEQMINER for annotating and querying files of sequence variants (e.g., VCF/BCF files) and summary association statistics (e.g., METAL/RAREMETAL files), and for integrating bioinformatics databases. SEQMINER provides an infrastructure where novel methods can be distributed and applied to analyzing sequence datasets in practice. We illustrate the performance of SEQMINER using datasets from the 1000 Genomes Project. We show that SEQMINER is highly efficient and easy to use. It will greatly accelerate the process of applying statistical innovations to analyze and interpret sequence‐based associations. The R package, its source code and documentations are available from http://cran.r‐project.org/web/packages/seqminer and http://seqminer.genomic.codes/. John Wiley and Sons Inc. 2015-09-23 2015-12 /pmc/articles/PMC4794281/ /pubmed/26394715 http://dx.doi.org/10.1002/gepi.21918 Text en © 2015 The Authors. *Genetic Epidemiologypublished by Wiley Periodicals, Inc. This is an open access article under the terms of the Creative Commons Attribution‐NonCommercial (http://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Research Articles
Zhan, Xiaowei
Liu, Dajiang J.
SEQMINER: An R‐Package to Facilitate the Functional Interpretation of Sequence‐Based Associations
title SEQMINER: An R‐Package to Facilitate the Functional Interpretation of Sequence‐Based Associations
title_full SEQMINER: An R‐Package to Facilitate the Functional Interpretation of Sequence‐Based Associations
title_fullStr SEQMINER: An R‐Package to Facilitate the Functional Interpretation of Sequence‐Based Associations
title_full_unstemmed SEQMINER: An R‐Package to Facilitate the Functional Interpretation of Sequence‐Based Associations
title_short SEQMINER: An R‐Package to Facilitate the Functional Interpretation of Sequence‐Based Associations
title_sort seqminer: an r‐package to facilitate the functional interpretation of sequence‐based associations
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4794281/
https://www.ncbi.nlm.nih.gov/pubmed/26394715
http://dx.doi.org/10.1002/gepi.21918
work_keys_str_mv AT zhanxiaowei seqmineranrpackagetofacilitatethefunctionalinterpretationofsequencebasedassociations
AT liudajiangj seqmineranrpackagetofacilitatethefunctionalinterpretationofsequencebasedassociations