Cargando…
Vcfanno: fast, flexible annotation of genetic variants
The integration of genome annotations is critical to the identification of genetic variants that are relevant to studies of disease or other traits. However, comprehensive variant annotation with diverse file formats is difficult with existing methods. Here we describe vcfanno, which flexibly extrac...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4888505/ https://www.ncbi.nlm.nih.gov/pubmed/27250555 http://dx.doi.org/10.1186/s13059-016-0973-5 |
_version_ | 1782434862861385728 |
---|---|
author | Pedersen, Brent S. Layer, Ryan M. Quinlan, Aaron R. |
author_facet | Pedersen, Brent S. Layer, Ryan M. Quinlan, Aaron R. |
author_sort | Pedersen, Brent S. |
collection | PubMed |
description | The integration of genome annotations is critical to the identification of genetic variants that are relevant to studies of disease or other traits. However, comprehensive variant annotation with diverse file formats is difficult with existing methods. Here we describe vcfanno, which flexibly extracts and summarizes attributes from multiple annotation files and integrates the annotations within the INFO column of the original VCF file. By leveraging a parallel “chromosome sweeping” algorithm, we demonstrate substantial performance gains by annotating ~85,000 variants per second with 50 attributes from 17 commonly used genome annotation resources. Vcfanno is available at https://github.com/brentp/vcfanno under the MIT license. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-016-0973-5) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4888505 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-48885052016-06-02 Vcfanno: fast, flexible annotation of genetic variants Pedersen, Brent S. Layer, Ryan M. Quinlan, Aaron R. Genome Biol Software The integration of genome annotations is critical to the identification of genetic variants that are relevant to studies of disease or other traits. However, comprehensive variant annotation with diverse file formats is difficult with existing methods. Here we describe vcfanno, which flexibly extracts and summarizes attributes from multiple annotation files and integrates the annotations within the INFO column of the original VCF file. By leveraging a parallel “chromosome sweeping” algorithm, we demonstrate substantial performance gains by annotating ~85,000 variants per second with 50 attributes from 17 commonly used genome annotation resources. Vcfanno is available at https://github.com/brentp/vcfanno under the MIT license. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-016-0973-5) contains supplementary material, which is available to authorized users. BioMed Central 2016-06-01 /pmc/articles/PMC4888505/ /pubmed/27250555 http://dx.doi.org/10.1186/s13059-016-0973-5 Text en © Pedersen et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Pedersen, Brent S. Layer, Ryan M. Quinlan, Aaron R. Vcfanno: fast, flexible annotation of genetic variants |
title | Vcfanno: fast, flexible annotation of genetic variants |
title_full | Vcfanno: fast, flexible annotation of genetic variants |
title_fullStr | Vcfanno: fast, flexible annotation of genetic variants |
title_full_unstemmed | Vcfanno: fast, flexible annotation of genetic variants |
title_short | Vcfanno: fast, flexible annotation of genetic variants |
title_sort | vcfanno: fast, flexible annotation of genetic variants |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4888505/ https://www.ncbi.nlm.nih.gov/pubmed/27250555 http://dx.doi.org/10.1186/s13059-016-0973-5 |
work_keys_str_mv | AT pedersenbrents vcfannofastflexibleannotationofgeneticvariants AT layerryanm vcfannofastflexibleannotationofgeneticvariants AT quinlanaaronr vcfannofastflexibleannotationofgeneticvariants |