Cargando…

Leveraging transcript quantification for fast computation of alternative splicing profiles

Alternative splicing plays an essential role in many cellular processes and bears major relevance in the understanding of multiple diseases, including cancer. High-throughput RNA sequencing allows genome-wide analyses of splicing across multiple conditions. However, the increasing number of availabl...

Descripción completa

Detalles Bibliográficos
Autores principales: Alamancos, Gael P., Pagès, Amadís, Trincado, Juan L., Bellora, Nicolás, Eyras, Eduardo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4536314/
https://www.ncbi.nlm.nih.gov/pubmed/26179515
http://dx.doi.org/10.1261/rna.051557.115
_version_ 1782385727433080832
author Alamancos, Gael P.
Pagès, Amadís
Trincado, Juan L.
Bellora, Nicolás
Eyras, Eduardo
author_facet Alamancos, Gael P.
Pagès, Amadís
Trincado, Juan L.
Bellora, Nicolás
Eyras, Eduardo
author_sort Alamancos, Gael P.
collection PubMed
description Alternative splicing plays an essential role in many cellular processes and bears major relevance in the understanding of multiple diseases, including cancer. High-throughput RNA sequencing allows genome-wide analyses of splicing across multiple conditions. However, the increasing number of available data sets represents a major challenge in terms of computation time and storage requirements. We describe SUPPA, a computational tool to calculate relative inclusion values of alternative splicing events, exploiting fast transcript quantification. SUPPA accuracy is comparable and sometimes superior to standard methods using simulated as well as real RNA-sequencing data compared with experimentally validated events. We assess the variability in terms of the choice of annotation and provide evidence that using complete transcripts rather than more transcripts per gene provides better estimates. Moreover, SUPPA coupled with de novo transcript reconstruction methods does not achieve accuracies as high as using quantification of known transcripts, but remains comparable to existing methods. Finally, we show that SUPPA is more than 1000 times faster than standard methods. Coupled with fast transcript quantification, SUPPA provides inclusion values at a much higher speed than existing methods without compromising accuracy, thereby facilitating the systematic splicing analysis of large data sets with limited computational resources. The software is implemented in Python 2.7 and is available under the MIT license at https://bitbucket.org/regulatorygenomicsupf/suppa.
format Online
Article
Text
id pubmed-4536314
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-45363142015-09-01 Leveraging transcript quantification for fast computation of alternative splicing profiles Alamancos, Gael P. Pagès, Amadís Trincado, Juan L. Bellora, Nicolás Eyras, Eduardo RNA Bioinformatics Alternative splicing plays an essential role in many cellular processes and bears major relevance in the understanding of multiple diseases, including cancer. High-throughput RNA sequencing allows genome-wide analyses of splicing across multiple conditions. However, the increasing number of available data sets represents a major challenge in terms of computation time and storage requirements. We describe SUPPA, a computational tool to calculate relative inclusion values of alternative splicing events, exploiting fast transcript quantification. SUPPA accuracy is comparable and sometimes superior to standard methods using simulated as well as real RNA-sequencing data compared with experimentally validated events. We assess the variability in terms of the choice of annotation and provide evidence that using complete transcripts rather than more transcripts per gene provides better estimates. Moreover, SUPPA coupled with de novo transcript reconstruction methods does not achieve accuracies as high as using quantification of known transcripts, but remains comparable to existing methods. Finally, we show that SUPPA is more than 1000 times faster than standard methods. Coupled with fast transcript quantification, SUPPA provides inclusion values at a much higher speed than existing methods without compromising accuracy, thereby facilitating the systematic splicing analysis of large data sets with limited computational resources. The software is implemented in Python 2.7 and is available under the MIT license at https://bitbucket.org/regulatorygenomicsupf/suppa. Cold Spring Harbor Laboratory Press 2015-09 /pmc/articles/PMC4536314/ /pubmed/26179515 http://dx.doi.org/10.1261/rna.051557.115 Text en © 2015 Alamancos et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society http://creativecommons.org/licenses/by/4.0/ This article, published in RNA, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/.
spellingShingle Bioinformatics
Alamancos, Gael P.
Pagès, Amadís
Trincado, Juan L.
Bellora, Nicolás
Eyras, Eduardo
Leveraging transcript quantification for fast computation of alternative splicing profiles
title Leveraging transcript quantification for fast computation of alternative splicing profiles
title_full Leveraging transcript quantification for fast computation of alternative splicing profiles
title_fullStr Leveraging transcript quantification for fast computation of alternative splicing profiles
title_full_unstemmed Leveraging transcript quantification for fast computation of alternative splicing profiles
title_short Leveraging transcript quantification for fast computation of alternative splicing profiles
title_sort leveraging transcript quantification for fast computation of alternative splicing profiles
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4536314/
https://www.ncbi.nlm.nih.gov/pubmed/26179515
http://dx.doi.org/10.1261/rna.051557.115
work_keys_str_mv AT alamancosgaelp leveragingtranscriptquantificationforfastcomputationofalternativesplicingprofiles
AT pagesamadis leveragingtranscriptquantificationforfastcomputationofalternativesplicingprofiles
AT trincadojuanl leveragingtranscriptquantificationforfastcomputationofalternativesplicingprofiles
AT belloranicolas leveragingtranscriptquantificationforfastcomputationofalternativesplicingprofiles
AT eyraseduardo leveragingtranscriptquantificationforfastcomputationofalternativesplicingprofiles