
A standard variation file format for human genome sequences

Here we describe the Genome Variation Format (GVF) and the 10Gen dataset. GVF, an extension of Generic Feature Format version 3 (GFF3), is a simple tab-delimited format for DNA variant files, which uses Sequence Ontology to describe genome variation data. The 10Gen dataset, ten human genomes in GVF...
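Because the abstract describes GVF as a tab-delimited extension of GFF3, a short sketch may help show what reading one such record could look like. It relies only on the general GFF3 layout of nine tab-separated columns, with the last column holding `key=value` pairs separated by semicolons. The `Feature` class, the `parse_line` function, and the sample line (its coordinates, source name, and ID) are hypothetical and are not taken from the paper or the 10Gen dataset. The `Variant_seq` and `Reference_seq` attribute names are assumed here for illustration and should be checked against the GVF specification.

```python
from dataclasses import dataclass
from urllib.parse import unquote

# GFF3 defines nine tab-separated columns; GVF is described as an extension of it.
GFF3_COLUMN_COUNT = 9


@dataclass
class Feature:
    """One record of a GFF3-style file (hypothetical helper class)."""
    seqid: str
    source: str
    type: str          # a Sequence Ontology term such as SNV
    start: int
    end: int
    score: str
    strand: str
    phase: str
    attributes: dict   # attribute name -> list of values


def parse_line(line: str) -> Feature:
    """Parse a single non-comment line into a Feature."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != GFF3_COLUMN_COUNT:
        raise ValueError(f"expected {GFF3_COLUMN_COUNT} columns, got {len(fields)}")

    attributes = {}
    for pair in fields[8].split(";"):
        if not pair:
            continue  # tolerate a trailing semicolon
        key, _, value = pair.partition("=")
        # Multiple values for one attribute are comma-separated; reserved
        # characters are percent-encoded in GFF3-style files.
        attributes[unquote(key)] = [unquote(v) for v in value.split(",")]

    return Feature(
        seqid=fields[0],
        source=fields[1],
        type=fields[2],
        start=int(fields[3]),
        end=int(fields[4]),
        score=fields[5],
        strand=fields[6],
        phase=fields[7],
        attributes=attributes,
    )


# Hypothetical sample record, made up for illustration only.
SAMPLE = "chr16\texample_source\tSNV\t49291141\t49291141\t.\t+\t.\tID=example_1;Variant_seq=A,G;Reference_seq=G"

feature = parse_line(SAMPLE)
print(feature.type, feature.start, feature.attributes["Variant_seq"])
```

Keeping attribute values as lists means that a site with more than one variant sequence needs no special handling.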


Bibliographic Details
Main Authors: Reese, Martin G; Moore, Barry; Batchelor, Colin; Salas, Fidel; Cunningham, Fiona; Marth, Gabor T; Stein, Lincoln; Flicek, Paul; Yandell, Mark; Eilbeck, Karen
Format: Text
Language: English
Published: BioMed Central 2010
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2945790/
https://www.ncbi.nlm.nih.gov/pubmed/20796305
http://dx.doi.org/10.1186/gb-2010-11-8-r88
_version_ 1782187249195024384
author Reese, Martin G
Moore, Barry
Batchelor, Colin
Salas, Fidel
Cunningham, Fiona
Marth, Gabor T
Stein, Lincoln
Flicek, Paul
Yandell, Mark
Eilbeck, Karen
author_sort Reese, Martin G
collection PubMed
description Here we describe the Genome Variation Format (GVF) and the 10Gen dataset. GVF, an extension of Generic Feature Format version 3 (GFF3), is a simple tab-delimited format for DNA variant files, which uses Sequence Ontology to describe genome variation data. The 10Gen dataset, ten human genomes in GVF format, is freely available for community analysis from the Sequence Ontology website and from an Amazon elastic block storage (EBS) snapshot for use in Amazon's EC2 cloud computing environment.
format Text
id pubmed-2945790
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-2945790 2010-09-28
A standard variation file format for human genome sequences
Reese, Martin G; Moore, Barry; Batchelor, Colin; Salas, Fidel; Cunningham, Fiona; Marth, Gabor T; Stein, Lincoln; Flicek, Paul; Yandell, Mark; Eilbeck, Karen
Genome Biol
Method
Here we describe the Genome Variation Format (GVF) and the 10Gen dataset. GVF, an extension of Generic Feature Format version 3 (GFF3), is a simple tab-delimited format for DNA variant files, which uses Sequence Ontology to describe genome variation data. The 10Gen dataset, ten human genomes in GVF format, is freely available for community analysis from the Sequence Ontology website and from an Amazon elastic block storage (EBS) snapshot for use in Amazon's EC2 cloud computing environment.
BioMed Central 2010
2010-08-26
/pmc/articles/PMC2945790/
/pubmed/20796305
http://dx.doi.org/10.1186/gb-2010-11-8-r88
Text
en
Copyright ©2010 Reese et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title A standard variation file format for human genome sequences
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2945790/
https://www.ncbi.nlm.nih.gov/pubmed/20796305
http://dx.doi.org/10.1186/gb-2010-11-8-r88