Cargando…

GFF-Ex: a genome feature extraction package

BACKGROUND: Genomic features of whole genome sequences emerging from various sequencing and annotation projects are represented and stored in several formats. Amongst these formats, the GFF (Generic/General Feature Format) has emerged as a widely accepted, portable and successfully used flat file fo...

Descripción completa

Detalles Bibliográficos
Autores principales: Rastogi, Achal, Gupta, Dinesh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4045924/
https://www.ncbi.nlm.nih.gov/pubmed/24885931
http://dx.doi.org/10.1186/1756-0500-7-315
_version_ 1782319411242205184
author Rastogi, Achal
Gupta, Dinesh
author_facet Rastogi, Achal
Gupta, Dinesh
author_sort Rastogi, Achal
collection PubMed
description BACKGROUND: Genomic features of whole genome sequences emerging from various sequencing and annotation projects are represented and stored in several formats. Amongst these formats, the GFF (Generic/General Feature Format) has emerged as a widely accepted, portable and successfully used flat file format for genome annotation storage. With an increasing interest in genome annotation projects and secondary and meta-analysis, there is a need for efficient tools to extract sequences of interests from GFF files. FINDINGS: We have developed GFF-Ex to automate feature-based extraction of sequences from a GFF file. In addition to automated sequence extraction of the features described within a feature file, GFF-Ex also assigns boundaries for the features (introns, intergenic, regions upstream to genes), which are not explicitly specified in the GFF format, and exports the corresponding primary sequence information into predefined feature specific output files. GFF-Ex package consists of several UNIX Shell and PERL scripts. CONCLUSIONS: Compared to other available GFF parsers, GFF-Ex is a simpler tool, which permits sequence retrieval based on additional inferred features. GFF-Ex can also be integrated with any genome annotation or analysis pipeline. GFF-Ex is freely available at http://bioinfo.icgeb.res.in/gff.
format Online
Article
Text
id pubmed-4045924
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40459242014-06-06 GFF-Ex: a genome feature extraction package Rastogi, Achal Gupta, Dinesh BMC Res Notes Technical Note BACKGROUND: Genomic features of whole genome sequences emerging from various sequencing and annotation projects are represented and stored in several formats. Amongst these formats, the GFF (Generic/General Feature Format) has emerged as a widely accepted, portable and successfully used flat file format for genome annotation storage. With an increasing interest in genome annotation projects and secondary and meta-analysis, there is a need for efficient tools to extract sequences of interests from GFF files. FINDINGS: We have developed GFF-Ex to automate feature-based extraction of sequences from a GFF file. In addition to automated sequence extraction of the features described within a feature file, GFF-Ex also assigns boundaries for the features (introns, intergenic, regions upstream to genes), which are not explicitly specified in the GFF format, and exports the corresponding primary sequence information into predefined feature specific output files. GFF-Ex package consists of several UNIX Shell and PERL scripts. CONCLUSIONS: Compared to other available GFF parsers, GFF-Ex is a simpler tool, which permits sequence retrieval based on additional inferred features. GFF-Ex can also be integrated with any genome annotation or analysis pipeline. GFF-Ex is freely available at http://bioinfo.icgeb.res.in/gff. BioMed Central 2014-05-24 /pmc/articles/PMC4045924/ /pubmed/24885931 http://dx.doi.org/10.1186/1756-0500-7-315 Text en Copyright © 2014 Rastogi and Gupta; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.
spellingShingle Technical Note
Rastogi, Achal
Gupta, Dinesh
GFF-Ex: a genome feature extraction package
title GFF-Ex: a genome feature extraction package
title_full GFF-Ex: a genome feature extraction package
title_fullStr GFF-Ex: a genome feature extraction package
title_full_unstemmed GFF-Ex: a genome feature extraction package
title_short GFF-Ex: a genome feature extraction package
title_sort gff-ex: a genome feature extraction package
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4045924/
https://www.ncbi.nlm.nih.gov/pubmed/24885931
http://dx.doi.org/10.1186/1756-0500-7-315
work_keys_str_mv AT rastogiachal gffexagenomefeatureextractionpackage
AT guptadinesh gffexagenomefeatureextractionpackage