Cargando…
GFF3sort: a novel tool to sort GFF3 files for tabix indexing
BACKGROUND: The traditional method of visualizing gene annotation data in JBrowse is converting GFF3 files to JSON format, which is time-consuming. The latest version of JBrowse supports rendering sorted GFF3 files indexed by tabix, a novel strategy that is more convenient than the original conversi...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5686826/ https://www.ncbi.nlm.nih.gov/pubmed/29137598 http://dx.doi.org/10.1186/s12859-017-1930-3 |
_version_ | 1783278843226750976 |
---|---|
author | Zhu, Tao Liang, Chengzhen Meng, Zhigang Guo, Sandui Zhang, Rui |
author_facet | Zhu, Tao Liang, Chengzhen Meng, Zhigang Guo, Sandui Zhang, Rui |
author_sort | Zhu, Tao |
collection | PubMed |
description | BACKGROUND: The traditional method of visualizing gene annotation data in JBrowse is converting GFF3 files to JSON format, which is time-consuming. The latest version of JBrowse supports rendering sorted GFF3 files indexed by tabix, a novel strategy that is more convenient than the original conversion process. However, current tools available for GFF3 file sorting have some limitations and their sorting results would lead to erroneous rendering in JBrowse. RESULTS: We developed GFF3sort, a script to sort GFF3 files for tabix indexing. Specifically designed for JBrowse rendering, GFF3sort can properly deal with the order of features that have the same chromosome and start position, either by remembering their original orders or by conducting parent-child topology sorting. Based on our test datasets from seven species, GFF3sort produced accurate sorting results with acceptable efficiency compared with currently available tools. CONCLUSIONS: GFF3sort is a novel tool to sort GFF3 files for tabix indexing. We anticipate that GFF3sort will be useful to help with genome annotation data processing and visualization. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-017-1930-3) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5686826 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-56868262017-11-21 GFF3sort: a novel tool to sort GFF3 files for tabix indexing Zhu, Tao Liang, Chengzhen Meng, Zhigang Guo, Sandui Zhang, Rui BMC Bioinformatics Software BACKGROUND: The traditional method of visualizing gene annotation data in JBrowse is converting GFF3 files to JSON format, which is time-consuming. The latest version of JBrowse supports rendering sorted GFF3 files indexed by tabix, a novel strategy that is more convenient than the original conversion process. However, current tools available for GFF3 file sorting have some limitations and their sorting results would lead to erroneous rendering in JBrowse. RESULTS: We developed GFF3sort, a script to sort GFF3 files for tabix indexing. Specifically designed for JBrowse rendering, GFF3sort can properly deal with the order of features that have the same chromosome and start position, either by remembering their original orders or by conducting parent-child topology sorting. Based on our test datasets from seven species, GFF3sort produced accurate sorting results with acceptable efficiency compared with currently available tools. CONCLUSIONS: GFF3sort is a novel tool to sort GFF3 files for tabix indexing. We anticipate that GFF3sort will be useful to help with genome annotation data processing and visualization. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-017-1930-3) contains supplementary material, which is available to authorized users. BioMed Central 2017-11-14 /pmc/articles/PMC5686826/ /pubmed/29137598 http://dx.doi.org/10.1186/s12859-017-1930-3 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Zhu, Tao Liang, Chengzhen Meng, Zhigang Guo, Sandui Zhang, Rui GFF3sort: a novel tool to sort GFF3 files for tabix indexing |
title | GFF3sort: a novel tool to sort GFF3 files for tabix indexing |
title_full | GFF3sort: a novel tool to sort GFF3 files for tabix indexing |
title_fullStr | GFF3sort: a novel tool to sort GFF3 files for tabix indexing |
title_full_unstemmed | GFF3sort: a novel tool to sort GFF3 files for tabix indexing |
title_short | GFF3sort: a novel tool to sort GFF3 files for tabix indexing |
title_sort | gff3sort: a novel tool to sort gff3 files for tabix indexing |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5686826/ https://www.ncbi.nlm.nih.gov/pubmed/29137598 http://dx.doi.org/10.1186/s12859-017-1930-3 |
work_keys_str_mv | AT zhutao gff3sortanoveltooltosortgff3filesfortabixindexing AT liangchengzhen gff3sortanoveltooltosortgff3filesfortabixindexing AT mengzhigang gff3sortanoveltooltosortgff3filesfortabixindexing AT guosandui gff3sortanoveltooltosortgff3filesfortabixindexing AT zhangrui gff3sortanoveltooltosortgff3filesfortabixindexing |