Cargando…

Tbl2KnownGene: A command-line program to convert NCBI.tbl to UCSC knownGene.txt data file

The schema for UCSC Known Genes (knownGene.txt) has been widely adopted for use in both standard and custom downstream analysis tools/scripts. For many popular model organisms (e.g. Arabidopsis), sequence and annotation data tables (including “knownGene.txt”) have not yet been made available to the...

Descripción completa

Detalles Bibliográficos
Autor principal: Bai, Yongsheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Biomedical Informatics 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4166776/
https://www.ncbi.nlm.nih.gov/pubmed/25258492
http://dx.doi.org/10.6026/97320630010544
_version_ 1782335306207330304
author Bai, Yongsheng
author_facet Bai, Yongsheng
author_sort Bai, Yongsheng
collection PubMed
description The schema for UCSC Known Genes (knownGene.txt) has been widely adopted for use in both standard and custom downstream analysis tools/scripts. For many popular model organisms (e.g. Arabidopsis), sequence and annotation data tables (including “knownGene.txt”) have not yet been made available to the public. Therefore, it is of interest to describe Tbl2KnownGene, a .tbl file parser that can process the contents of a NCBI .tbl file and produce a UCSC Known Genes annotation feature table. The algorithm is tested with chromosome datasets from Arabidopsis genome (TAIR10). The Tbl2KnownGene parser finds utility for data with other organisms having similar .tbl annotations. AVAILABILITY: Perl scripts and required input files are available on the web at http://thoth.indstate.edu/~ybai2/Tbl2KnownGene/ index.html
format Online
Article
Text
id pubmed-4166776
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Biomedical Informatics
record_format MEDLINE/PubMed
spelling pubmed-41667762014-09-25 Tbl2KnownGene: A command-line program to convert NCBI.tbl to UCSC knownGene.txt data file Bai, Yongsheng Bioinformation Software The schema for UCSC Known Genes (knownGene.txt) has been widely adopted for use in both standard and custom downstream analysis tools/scripts. For many popular model organisms (e.g. Arabidopsis), sequence and annotation data tables (including “knownGene.txt”) have not yet been made available to the public. Therefore, it is of interest to describe Tbl2KnownGene, a .tbl file parser that can process the contents of a NCBI .tbl file and produce a UCSC Known Genes annotation feature table. The algorithm is tested with chromosome datasets from Arabidopsis genome (TAIR10). The Tbl2KnownGene parser finds utility for data with other organisms having similar .tbl annotations. AVAILABILITY: Perl scripts and required input files are available on the web at http://thoth.indstate.edu/~ybai2/Tbl2KnownGene/ index.html Biomedical Informatics 2014-08-30 /pmc/articles/PMC4166776/ /pubmed/25258492 http://dx.doi.org/10.6026/97320630010544 Text en © 2014 Biomedical Informatics This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited.
spellingShingle Software
Bai, Yongsheng
Tbl2KnownGene: A command-line program to convert NCBI.tbl to UCSC knownGene.txt data file
title Tbl2KnownGene: A command-line program to convert NCBI.tbl to UCSC knownGene.txt data file
title_full Tbl2KnownGene: A command-line program to convert NCBI.tbl to UCSC knownGene.txt data file
title_fullStr Tbl2KnownGene: A command-line program to convert NCBI.tbl to UCSC knownGene.txt data file
title_full_unstemmed Tbl2KnownGene: A command-line program to convert NCBI.tbl to UCSC knownGene.txt data file
title_short Tbl2KnownGene: A command-line program to convert NCBI.tbl to UCSC knownGene.txt data file
title_sort tbl2knowngene: a command-line program to convert ncbi.tbl to ucsc knowngene.txt data file
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4166776/
https://www.ncbi.nlm.nih.gov/pubmed/25258492
http://dx.doi.org/10.6026/97320630010544
work_keys_str_mv AT baiyongsheng tbl2knowngeneacommandlineprogramtoconvertncbitbltoucscknowngenetxtdatafile