Cargando…

Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA

RNA-seq is a new tool to measure RNA transcript counts, using high-throughput sequencing at an extraordinary accuracy. It provides quantitative means to explore the transcriptome of an organism of interest. However, interpreting this extremely large data into biological knowledge is a problem, and b...

Descripción completa

Detalles Bibliográficos
Autores principales:	Tripathi, Kumar Parijat, Evangelista, Daniela, Zuccaro, Antonio, Guarracino, Mario Rosario
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2015
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4651556/ https://www.ncbi.nlm.nih.gov/pubmed/26581084 http://dx.doi.org/10.1371/journal.pone.0140268

_version_	1782401650298716160
author	Tripathi, Kumar Parijat Evangelista, Daniela Zuccaro, Antonio Guarracino, Mario Rosario
author_facet	Tripathi, Kumar Parijat Evangelista, Daniela Zuccaro, Antonio Guarracino, Mario Rosario
author_sort	Tripathi, Kumar Parijat
collection	PubMed
description	RNA-seq is a new tool to measure RNA transcript counts, using high-throughput sequencing at an extraordinary accuracy. It provides quantitative means to explore the transcriptome of an organism of interest. However, interpreting this extremely large data into biological knowledge is a problem, and biologist-friendly tools are lacking. In our lab, we developed Transcriptator, a web application based on a computational Python pipeline with a user-friendly Java interface. This pipeline uses the web services available for BLAST (Basis Local Search Alignment Tool), QuickGO and DAVID (Database for Annotation, Visualization and Integrated Discovery) tools. It offers a report on statistical analysis of functional and Gene Ontology (GO) annotation’s enrichment. It helps users to identify enriched biological themes, particularly GO terms, pathways, domains, gene/proteins features and protein—protein interactions related informations. It clusters the transcripts based on functional annotations and generates a tabular report for functional and gene ontology annotations for each submitted transcript to the web server. The implementation of QuickGo web-services in our pipeline enable the users to carry out GO-Slim analysis, whereas the integration of PORTRAIT (Prediction of transcriptomic non coding RNA (ncRNA) by ab initio methods) helps to identify the non coding RNAs and their regulatory role in transcriptome. In summary, Transcriptator is a useful software for both NGS and array data. It helps the users to characterize the de-novo assembled reads, obtained from NGS experiments for non-referenced organisms, while it also performs the functional enrichment analysis of differentially expressed transcripts/genes for both RNA-seq and micro-array experiments. It generates easy to read tables and interactive charts for better understanding of the data. The pipeline is modular in nature, and provides an opportunity to add new plugins in the future. Web application is freely available at: http://www-labgtp.na.icar.cnr.it/Transcriptator
format	Online Article Text
id	pubmed-4651556
institution	National Center for Biotechnology Information
language	English
publishDate	2015
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-46515562015-11-25 Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA Tripathi, Kumar Parijat Evangelista, Daniela Zuccaro, Antonio Guarracino, Mario Rosario PLoS One Research Article RNA-seq is a new tool to measure RNA transcript counts, using high-throughput sequencing at an extraordinary accuracy. It provides quantitative means to explore the transcriptome of an organism of interest. However, interpreting this extremely large data into biological knowledge is a problem, and biologist-friendly tools are lacking. In our lab, we developed Transcriptator, a web application based on a computational Python pipeline with a user-friendly Java interface. This pipeline uses the web services available for BLAST (Basis Local Search Alignment Tool), QuickGO and DAVID (Database for Annotation, Visualization and Integrated Discovery) tools. It offers a report on statistical analysis of functional and Gene Ontology (GO) annotation’s enrichment. It helps users to identify enriched biological themes, particularly GO terms, pathways, domains, gene/proteins features and protein—protein interactions related informations. It clusters the transcripts based on functional annotations and generates a tabular report for functional and gene ontology annotations for each submitted transcript to the web server. The implementation of QuickGo web-services in our pipeline enable the users to carry out GO-Slim analysis, whereas the integration of PORTRAIT (Prediction of transcriptomic non coding RNA (ncRNA) by ab initio methods) helps to identify the non coding RNAs and their regulatory role in transcriptome. In summary, Transcriptator is a useful software for both NGS and array data. It helps the users to characterize the de-novo assembled reads, obtained from NGS experiments for non-referenced organisms, while it also performs the functional enrichment analysis of differentially expressed transcripts/genes for both RNA-seq and micro-array experiments. It generates easy to read tables and interactive charts for better understanding of the data. The pipeline is modular in nature, and provides an opportunity to add new plugins in the future. Web application is freely available at: http://www-labgtp.na.icar.cnr.it/Transcriptator Public Library of Science 2015-11-18 /pmc/articles/PMC4651556/ /pubmed/26581084 http://dx.doi.org/10.1371/journal.pone.0140268 Text en © 2015 Tripathi et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle	Research Article Tripathi, Kumar Parijat Evangelista, Daniela Zuccaro, Antonio Guarracino, Mario Rosario Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA
title	Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA
title_full	Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA
title_fullStr	Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA
title_full_unstemmed	Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA
title_short	Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA
title_sort	transcriptator: an automated computational pipeline to annotate assembled reads and identify non coding rna
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4651556/ https://www.ncbi.nlm.nih.gov/pubmed/26581084 http://dx.doi.org/10.1371/journal.pone.0140268
work_keys_str_mv	AT tripathikumarparijat transcriptatoranautomatedcomputationalpipelinetoannotateassembledreadsandidentifynoncodingrna AT evangelistadaniela transcriptatoranautomatedcomputationalpipelinetoannotateassembledreadsandidentifynoncodingrna AT zuccaroantonio transcriptatoranautomatedcomputationalpipelinetoannotateassembledreadsandidentifynoncodingrna AT guarracinomariorosario transcriptatoranautomatedcomputationalpipelinetoannotateassembledreadsandidentifynoncodingrna

Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA

Ejemplares similares