Cargando…
Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA
RNA-seq is a new tool to measure RNA transcript counts, using high-throughput sequencing at an extraordinary accuracy. It provides quantitative means to explore the transcriptome of an organism of interest. However, interpreting this extremely large data into biological knowledge is a problem, and b...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4651556/ https://www.ncbi.nlm.nih.gov/pubmed/26581084 http://dx.doi.org/10.1371/journal.pone.0140268 |
_version_ | 1782401650298716160 |
---|---|
author | Tripathi, Kumar Parijat Evangelista, Daniela Zuccaro, Antonio Guarracino, Mario Rosario |
author_facet | Tripathi, Kumar Parijat Evangelista, Daniela Zuccaro, Antonio Guarracino, Mario Rosario |
author_sort | Tripathi, Kumar Parijat |
collection | PubMed |
description | RNA-seq is a new tool to measure RNA transcript counts, using high-throughput sequencing at an extraordinary accuracy. It provides quantitative means to explore the transcriptome of an organism of interest. However, interpreting this extremely large data into biological knowledge is a problem, and biologist-friendly tools are lacking. In our lab, we developed Transcriptator, a web application based on a computational Python pipeline with a user-friendly Java interface. This pipeline uses the web services available for BLAST (Basis Local Search Alignment Tool), QuickGO and DAVID (Database for Annotation, Visualization and Integrated Discovery) tools. It offers a report on statistical analysis of functional and Gene Ontology (GO) annotation’s enrichment. It helps users to identify enriched biological themes, particularly GO terms, pathways, domains, gene/proteins features and protein—protein interactions related informations. It clusters the transcripts based on functional annotations and generates a tabular report for functional and gene ontology annotations for each submitted transcript to the web server. The implementation of QuickGo web-services in our pipeline enable the users to carry out GO-Slim analysis, whereas the integration of PORTRAIT (Prediction of transcriptomic non coding RNA (ncRNA) by ab initio methods) helps to identify the non coding RNAs and their regulatory role in transcriptome. In summary, Transcriptator is a useful software for both NGS and array data. It helps the users to characterize the de-novo assembled reads, obtained from NGS experiments for non-referenced organisms, while it also performs the functional enrichment analysis of differentially expressed transcripts/genes for both RNA-seq and micro-array experiments. It generates easy to read tables and interactive charts for better understanding of the data. The pipeline is modular in nature, and provides an opportunity to add new plugins in the future. Web application is freely available at: http://www-labgtp.na.icar.cnr.it/Transcriptator |
format | Online Article Text |
id | pubmed-4651556 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-46515562015-11-25 Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA Tripathi, Kumar Parijat Evangelista, Daniela Zuccaro, Antonio Guarracino, Mario Rosario PLoS One Research Article RNA-seq is a new tool to measure RNA transcript counts, using high-throughput sequencing at an extraordinary accuracy. It provides quantitative means to explore the transcriptome of an organism of interest. However, interpreting this extremely large data into biological knowledge is a problem, and biologist-friendly tools are lacking. In our lab, we developed Transcriptator, a web application based on a computational Python pipeline with a user-friendly Java interface. This pipeline uses the web services available for BLAST (Basis Local Search Alignment Tool), QuickGO and DAVID (Database for Annotation, Visualization and Integrated Discovery) tools. It offers a report on statistical analysis of functional and Gene Ontology (GO) annotation’s enrichment. It helps users to identify enriched biological themes, particularly GO terms, pathways, domains, gene/proteins features and protein—protein interactions related informations. It clusters the transcripts based on functional annotations and generates a tabular report for functional and gene ontology annotations for each submitted transcript to the web server. The implementation of QuickGo web-services in our pipeline enable the users to carry out GO-Slim analysis, whereas the integration of PORTRAIT (Prediction of transcriptomic non coding RNA (ncRNA) by ab initio methods) helps to identify the non coding RNAs and their regulatory role in transcriptome. In summary, Transcriptator is a useful software for both NGS and array data. It helps the users to characterize the de-novo assembled reads, obtained from NGS experiments for non-referenced organisms, while it also performs the functional enrichment analysis of differentially expressed transcripts/genes for both RNA-seq and micro-array experiments. It generates easy to read tables and interactive charts for better understanding of the data. The pipeline is modular in nature, and provides an opportunity to add new plugins in the future. Web application is freely available at: http://www-labgtp.na.icar.cnr.it/Transcriptator Public Library of Science 2015-11-18 /pmc/articles/PMC4651556/ /pubmed/26581084 http://dx.doi.org/10.1371/journal.pone.0140268 Text en © 2015 Tripathi et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Tripathi, Kumar Parijat Evangelista, Daniela Zuccaro, Antonio Guarracino, Mario Rosario Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA |
title | Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA |
title_full | Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA |
title_fullStr | Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA |
title_full_unstemmed | Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA |
title_short | Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA |
title_sort | transcriptator: an automated computational pipeline to annotate assembled reads and identify non coding rna |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4651556/ https://www.ncbi.nlm.nih.gov/pubmed/26581084 http://dx.doi.org/10.1371/journal.pone.0140268 |
work_keys_str_mv | AT tripathikumarparijat transcriptatoranautomatedcomputationalpipelinetoannotateassembledreadsandidentifynoncodingrna AT evangelistadaniela transcriptatoranautomatedcomputationalpipelinetoannotateassembledreadsandidentifynoncodingrna AT zuccaroantonio transcriptatoranautomatedcomputationalpipelinetoannotateassembledreadsandidentifynoncodingrna AT guarracinomariorosario transcriptatoranautomatedcomputationalpipelinetoannotateassembledreadsandidentifynoncodingrna |