Cargando…

Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA

RNA-seq is a new tool to measure RNA transcript counts, using high-throughput sequencing at an extraordinary accuracy. It provides quantitative means to explore the transcriptome of an organism of interest. However, interpreting this extremely large data into biological knowledge is a problem, and b...

Descripción completa

Detalles Bibliográficos
Autores principales: Tripathi, Kumar Parijat, Evangelista, Daniela, Zuccaro, Antonio, Guarracino, Mario Rosario
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4651556/
https://www.ncbi.nlm.nih.gov/pubmed/26581084
http://dx.doi.org/10.1371/journal.pone.0140268
_version_ 1782401650298716160
author Tripathi, Kumar Parijat
Evangelista, Daniela
Zuccaro, Antonio
Guarracino, Mario Rosario
author_facet Tripathi, Kumar Parijat
Evangelista, Daniela
Zuccaro, Antonio
Guarracino, Mario Rosario
author_sort Tripathi, Kumar Parijat
collection PubMed
description RNA-seq is a new tool to measure RNA transcript counts, using high-throughput sequencing at an extraordinary accuracy. It provides quantitative means to explore the transcriptome of an organism of interest. However, interpreting this extremely large data into biological knowledge is a problem, and biologist-friendly tools are lacking. In our lab, we developed Transcriptator, a web application based on a computational Python pipeline with a user-friendly Java interface. This pipeline uses the web services available for BLAST (Basis Local Search Alignment Tool), QuickGO and DAVID (Database for Annotation, Visualization and Integrated Discovery) tools. It offers a report on statistical analysis of functional and Gene Ontology (GO) annotation’s enrichment. It helps users to identify enriched biological themes, particularly GO terms, pathways, domains, gene/proteins features and protein—protein interactions related informations. It clusters the transcripts based on functional annotations and generates a tabular report for functional and gene ontology annotations for each submitted transcript to the web server. The implementation of QuickGo web-services in our pipeline enable the users to carry out GO-Slim analysis, whereas the integration of PORTRAIT (Prediction of transcriptomic non coding RNA (ncRNA) by ab initio methods) helps to identify the non coding RNAs and their regulatory role in transcriptome. In summary, Transcriptator is a useful software for both NGS and array data. It helps the users to characterize the de-novo assembled reads, obtained from NGS experiments for non-referenced organisms, while it also performs the functional enrichment analysis of differentially expressed transcripts/genes for both RNA-seq and micro-array experiments. It generates easy to read tables and interactive charts for better understanding of the data. The pipeline is modular in nature, and provides an opportunity to add new plugins in the future. Web application is freely available at: http://www-labgtp.na.icar.cnr.it/Transcriptator
format Online
Article
Text
id pubmed-4651556
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-46515562015-11-25 Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA Tripathi, Kumar Parijat Evangelista, Daniela Zuccaro, Antonio Guarracino, Mario Rosario PLoS One Research Article RNA-seq is a new tool to measure RNA transcript counts, using high-throughput sequencing at an extraordinary accuracy. It provides quantitative means to explore the transcriptome of an organism of interest. However, interpreting this extremely large data into biological knowledge is a problem, and biologist-friendly tools are lacking. In our lab, we developed Transcriptator, a web application based on a computational Python pipeline with a user-friendly Java interface. This pipeline uses the web services available for BLAST (Basis Local Search Alignment Tool), QuickGO and DAVID (Database for Annotation, Visualization and Integrated Discovery) tools. It offers a report on statistical analysis of functional and Gene Ontology (GO) annotation’s enrichment. It helps users to identify enriched biological themes, particularly GO terms, pathways, domains, gene/proteins features and protein—protein interactions related informations. It clusters the transcripts based on functional annotations and generates a tabular report for functional and gene ontology annotations for each submitted transcript to the web server. The implementation of QuickGo web-services in our pipeline enable the users to carry out GO-Slim analysis, whereas the integration of PORTRAIT (Prediction of transcriptomic non coding RNA (ncRNA) by ab initio methods) helps to identify the non coding RNAs and their regulatory role in transcriptome. In summary, Transcriptator is a useful software for both NGS and array data. It helps the users to characterize the de-novo assembled reads, obtained from NGS experiments for non-referenced organisms, while it also performs the functional enrichment analysis of differentially expressed transcripts/genes for both RNA-seq and micro-array experiments. It generates easy to read tables and interactive charts for better understanding of the data. The pipeline is modular in nature, and provides an opportunity to add new plugins in the future. Web application is freely available at: http://www-labgtp.na.icar.cnr.it/Transcriptator Public Library of Science 2015-11-18 /pmc/articles/PMC4651556/ /pubmed/26581084 http://dx.doi.org/10.1371/journal.pone.0140268 Text en © 2015 Tripathi et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Tripathi, Kumar Parijat
Evangelista, Daniela
Zuccaro, Antonio
Guarracino, Mario Rosario
Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA
title Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA
title_full Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA
title_fullStr Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA
title_full_unstemmed Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA
title_short Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA
title_sort transcriptator: an automated computational pipeline to annotate assembled reads and identify non coding rna
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4651556/
https://www.ncbi.nlm.nih.gov/pubmed/26581084
http://dx.doi.org/10.1371/journal.pone.0140268
work_keys_str_mv AT tripathikumarparijat transcriptatoranautomatedcomputationalpipelinetoannotateassembledreadsandidentifynoncodingrna
AT evangelistadaniela transcriptatoranautomatedcomputationalpipelinetoannotateassembledreadsandidentifynoncodingrna
AT zuccaroantonio transcriptatoranautomatedcomputationalpipelinetoannotateassembledreadsandidentifynoncodingrna
AT guarracinomariorosario transcriptatoranautomatedcomputationalpipelinetoannotateassembledreadsandidentifynoncodingrna