Cargando…

orfipy: a fast and flexible tool for extracting ORFs

SUMMARY: Searching for open reading frames is a routine task and a critical step prior to annotating protein coding regions in newly sequenced genomes or de novo transcriptome assemblies. With the tremendous increase in genomic and transcriptomic data, faster tools are needed to handle large input d...

Descripción completa

Detalles Bibliográficos
Autores principales: Singh, Urminder, Wurtele, Eve Syrkin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8479652/
https://www.ncbi.nlm.nih.gov/pubmed/33576786
http://dx.doi.org/10.1093/bioinformatics/btab090
Descripción
Sumario:SUMMARY: Searching for open reading frames is a routine task and a critical step prior to annotating protein coding regions in newly sequenced genomes or de novo transcriptome assemblies. With the tremendous increase in genomic and transcriptomic data, faster tools are needed to handle large input datasets. These tools should be versatile enough to fine-tune search criteria and allow efficient downstream analysis. Here we present a new python based tool, orfipy, which allows the user to flexibly search for open reading frames in genomic and transcriptomic sequences. The search is rapid and is fully customizable, with a choice of FASTA and BED output formats. AVAILABILITY AND IMPLEMENTATION: orfipy is implemented in python and is compatible with python v3.6 and higher. Source code: https://github.com/urmi-21/orfipy. Installation: from the source, or via PyPi (https://pypi.org/project/orfipy) or bioconda (https://anaconda.org/bioconda/orfipy). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.