Cargando…

RAP: RNA-Seq Analysis Pipeline, a new cloud-based NGS web application

BACKGROUND: The study of RNA has been dramatically improved by the introduction of Next Generation Sequencing platforms allowing massive and cheap sequencing of selected RNA fractions, also providing information on strand orientation (RNA-Seq). The complexity of transcriptomes and of their regulativ...

Descripción completa

Detalles Bibliográficos
Autores principales: D'Antonio, Mattia, D'Onorio De Meo, Paolo, Pallocca, Matteo, Picardi, Ernesto, D'Erchia, Anna Maria, Calogero, Raffaele A, Castrignanò, Tiziana, Pesole, Graziano
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4461013/
https://www.ncbi.nlm.nih.gov/pubmed/26046471
http://dx.doi.org/10.1186/1471-2164-16-S6-S3
_version_ 1782375474592219136
author D'Antonio, Mattia
D'Onorio De Meo, Paolo
Pallocca, Matteo
Picardi, Ernesto
D'Erchia, Anna Maria
Calogero, Raffaele A
Castrignanò, Tiziana
Pesole, Graziano
author_facet D'Antonio, Mattia
D'Onorio De Meo, Paolo
Pallocca, Matteo
Picardi, Ernesto
D'Erchia, Anna Maria
Calogero, Raffaele A
Castrignanò, Tiziana
Pesole, Graziano
author_sort D'Antonio, Mattia
collection PubMed
description BACKGROUND: The study of RNA has been dramatically improved by the introduction of Next Generation Sequencing platforms allowing massive and cheap sequencing of selected RNA fractions, also providing information on strand orientation (RNA-Seq). The complexity of transcriptomes and of their regulative pathways make RNA-Seq one of most complex field of NGS applications, addressing several aspects of the expression process (e.g. identification and quantification of expressed genes and transcripts, alternative splicing and polyadenylation, fusion genes and trans-splicing, post-transcriptional events, etc.). Moreover, the huge volume of data generated by NGS platforms introduces unprecedented computational and technological challenges to efficiently analyze and store sequence data and results. METHODS: In order to provide researchers with an effective and friendly resource for analyzing RNA-Seq data, we present here RAP (RNA-Seq Analysis Pipeline), a cloud computing web application implementing a complete but modular analysis workflow. This pipeline integrates both state-of-the-art bioinformatics tools for RNA-Seq analysis and in-house developed scripts to offer to the user a comprehensive strategy for data analysis. RAP is able to perform quality checks (adopting FastQC and NGS QC Toolkit), identify and quantify expressed genes and transcripts (with Tophat, Cufflinks and HTSeq), detect alternative splicing events (using SpliceTrap) and chimeric transcripts (with ChimeraScan). This pipeline is also able to identify splicing junctions and constitutive or alternative polyadenylation sites (implementing custom analysis modules) and call for statistically significant differences in genes and transcripts expression, splicing pattern and polyadenylation site usage (using Cuffdiff2 and DESeq). RESULTS: Through a user friendly web interface, the RAP workflow can be suitably customized by the user and it is automatically executed on our cloud computing environment. This strategy allows to access to bioinformatics tools and computational resources without specific bioinformatics and IT skills. RAP provides a set of tabular and graphical results that can be helpful to browse, filter and export analyzed data, according to the user needs.
format Online
Article
Text
id pubmed-4461013
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44610132015-06-29 RAP: RNA-Seq Analysis Pipeline, a new cloud-based NGS web application D'Antonio, Mattia D'Onorio De Meo, Paolo Pallocca, Matteo Picardi, Ernesto D'Erchia, Anna Maria Calogero, Raffaele A Castrignanò, Tiziana Pesole, Graziano BMC Genomics Research BACKGROUND: The study of RNA has been dramatically improved by the introduction of Next Generation Sequencing platforms allowing massive and cheap sequencing of selected RNA fractions, also providing information on strand orientation (RNA-Seq). The complexity of transcriptomes and of their regulative pathways make RNA-Seq one of most complex field of NGS applications, addressing several aspects of the expression process (e.g. identification and quantification of expressed genes and transcripts, alternative splicing and polyadenylation, fusion genes and trans-splicing, post-transcriptional events, etc.). Moreover, the huge volume of data generated by NGS platforms introduces unprecedented computational and technological challenges to efficiently analyze and store sequence data and results. METHODS: In order to provide researchers with an effective and friendly resource for analyzing RNA-Seq data, we present here RAP (RNA-Seq Analysis Pipeline), a cloud computing web application implementing a complete but modular analysis workflow. This pipeline integrates both state-of-the-art bioinformatics tools for RNA-Seq analysis and in-house developed scripts to offer to the user a comprehensive strategy for data analysis. RAP is able to perform quality checks (adopting FastQC and NGS QC Toolkit), identify and quantify expressed genes and transcripts (with Tophat, Cufflinks and HTSeq), detect alternative splicing events (using SpliceTrap) and chimeric transcripts (with ChimeraScan). This pipeline is also able to identify splicing junctions and constitutive or alternative polyadenylation sites (implementing custom analysis modules) and call for statistically significant differences in genes and transcripts expression, splicing pattern and polyadenylation site usage (using Cuffdiff2 and DESeq). RESULTS: Through a user friendly web interface, the RAP workflow can be suitably customized by the user and it is automatically executed on our cloud computing environment. This strategy allows to access to bioinformatics tools and computational resources without specific bioinformatics and IT skills. RAP provides a set of tabular and graphical results that can be helpful to browse, filter and export analyzed data, according to the user needs. BioMed Central 2015-06-01 /pmc/articles/PMC4461013/ /pubmed/26046471 http://dx.doi.org/10.1186/1471-2164-16-S6-S3 Text en Copyright © 2015 D'Antonio et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
D'Antonio, Mattia
D'Onorio De Meo, Paolo
Pallocca, Matteo
Picardi, Ernesto
D'Erchia, Anna Maria
Calogero, Raffaele A
Castrignanò, Tiziana
Pesole, Graziano
RAP: RNA-Seq Analysis Pipeline, a new cloud-based NGS web application
title RAP: RNA-Seq Analysis Pipeline, a new cloud-based NGS web application
title_full RAP: RNA-Seq Analysis Pipeline, a new cloud-based NGS web application
title_fullStr RAP: RNA-Seq Analysis Pipeline, a new cloud-based NGS web application
title_full_unstemmed RAP: RNA-Seq Analysis Pipeline, a new cloud-based NGS web application
title_short RAP: RNA-Seq Analysis Pipeline, a new cloud-based NGS web application
title_sort rap: rna-seq analysis pipeline, a new cloud-based ngs web application
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4461013/
https://www.ncbi.nlm.nih.gov/pubmed/26046471
http://dx.doi.org/10.1186/1471-2164-16-S6-S3
work_keys_str_mv AT dantoniomattia raprnaseqanalysispipelineanewcloudbasedngswebapplication
AT donoriodemeopaolo raprnaseqanalysispipelineanewcloudbasedngswebapplication
AT palloccamatteo raprnaseqanalysispipelineanewcloudbasedngswebapplication
AT picardiernesto raprnaseqanalysispipelineanewcloudbasedngswebapplication
AT derchiaannamaria raprnaseqanalysispipelineanewcloudbasedngswebapplication
AT calogeroraffaelea raprnaseqanalysispipelineanewcloudbasedngswebapplication
AT castrignanotiziana raprnaseqanalysispipelineanewcloudbasedngswebapplication
AT pesolegraziano raprnaseqanalysispipelineanewcloudbasedngswebapplication