Cargando…

FineSplice, enhanced splice junction detection and quantification: a novel pipeline based on the assessment of diverse RNA-Seq alignment solutions

Alternative splicing is the main mechanism governing protein diversity. The recent developments in RNA-Seq technology have enabled the study of the global impact and regulation of this biological process. However, the lack of standardized protocols constitutes a major bottleneck in the analysis of a...

Descripción completa

Detalles Bibliográficos
Autores principales: Gatto, Alberto, Torroja-Fungairiño, Carlos, Mazzarotto, Francesco, Cook, Stuart A., Barton, Paul J. R., Sánchez-Cabo, Fátima, Lara-Pezzi, Enrique
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4005686/
https://www.ncbi.nlm.nih.gov/pubmed/24574529
http://dx.doi.org/10.1093/nar/gku166
_version_ 1782314141769269248
author Gatto, Alberto
Torroja-Fungairiño, Carlos
Mazzarotto, Francesco
Cook, Stuart A.
Barton, Paul J. R.
Sánchez-Cabo, Fátima
Lara-Pezzi, Enrique
author_facet Gatto, Alberto
Torroja-Fungairiño, Carlos
Mazzarotto, Francesco
Cook, Stuart A.
Barton, Paul J. R.
Sánchez-Cabo, Fátima
Lara-Pezzi, Enrique
author_sort Gatto, Alberto
collection PubMed
description Alternative splicing is the main mechanism governing protein diversity. The recent developments in RNA-Seq technology have enabled the study of the global impact and regulation of this biological process. However, the lack of standardized protocols constitutes a major bottleneck in the analysis of alternative splicing. This is particularly important for the identification of exon–exon junctions, which is a critical step in any analysis workflow. Here we performed a systematic benchmarking of alignment tools to dissect the impact of design and method on the mapping, detection and quantification of splice junctions from multi-exon reads. Accordingly, we devised a novel pipeline based on TopHat2 combined with a splice junction detection algorithm, which we have named FineSplice. FineSplice allows effective elimination of spurious junction hits arising from artefactual alignments, achieving up to 99% precision in both real and simulated data sets and yielding superior F(1) scores under most tested conditions. The proposed strategy conjugates an efficient mapping solution with a semi-supervised anomaly detection scheme to filter out false positives and allows reliable estimation of expressed junctions from the alignment output. Ultimately this provides more accurate information to identify meaningful splicing patterns. FineSplice is freely available at https://sourceforge.net/p/finesplice/.
format Online
Article
Text
id pubmed-4005686
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-40056862014-05-01 FineSplice, enhanced splice junction detection and quantification: a novel pipeline based on the assessment of diverse RNA-Seq alignment solutions Gatto, Alberto Torroja-Fungairiño, Carlos Mazzarotto, Francesco Cook, Stuart A. Barton, Paul J. R. Sánchez-Cabo, Fátima Lara-Pezzi, Enrique Nucleic Acids Res Methods Online Alternative splicing is the main mechanism governing protein diversity. The recent developments in RNA-Seq technology have enabled the study of the global impact and regulation of this biological process. However, the lack of standardized protocols constitutes a major bottleneck in the analysis of alternative splicing. This is particularly important for the identification of exon–exon junctions, which is a critical step in any analysis workflow. Here we performed a systematic benchmarking of alignment tools to dissect the impact of design and method on the mapping, detection and quantification of splice junctions from multi-exon reads. Accordingly, we devised a novel pipeline based on TopHat2 combined with a splice junction detection algorithm, which we have named FineSplice. FineSplice allows effective elimination of spurious junction hits arising from artefactual alignments, achieving up to 99% precision in both real and simulated data sets and yielding superior F(1) scores under most tested conditions. The proposed strategy conjugates an efficient mapping solution with a semi-supervised anomaly detection scheme to filter out false positives and allows reliable estimation of expressed junctions from the alignment output. Ultimately this provides more accurate information to identify meaningful splicing patterns. FineSplice is freely available at https://sourceforge.net/p/finesplice/. Oxford University Press 2014-04 2014-02-25 /pmc/articles/PMC4005686/ /pubmed/24574529 http://dx.doi.org/10.1093/nar/gku166 Text en © The Author(s) 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Gatto, Alberto
Torroja-Fungairiño, Carlos
Mazzarotto, Francesco
Cook, Stuart A.
Barton, Paul J. R.
Sánchez-Cabo, Fátima
Lara-Pezzi, Enrique
FineSplice, enhanced splice junction detection and quantification: a novel pipeline based on the assessment of diverse RNA-Seq alignment solutions
title FineSplice, enhanced splice junction detection and quantification: a novel pipeline based on the assessment of diverse RNA-Seq alignment solutions
title_full FineSplice, enhanced splice junction detection and quantification: a novel pipeline based on the assessment of diverse RNA-Seq alignment solutions
title_fullStr FineSplice, enhanced splice junction detection and quantification: a novel pipeline based on the assessment of diverse RNA-Seq alignment solutions
title_full_unstemmed FineSplice, enhanced splice junction detection and quantification: a novel pipeline based on the assessment of diverse RNA-Seq alignment solutions
title_short FineSplice, enhanced splice junction detection and quantification: a novel pipeline based on the assessment of diverse RNA-Seq alignment solutions
title_sort finesplice, enhanced splice junction detection and quantification: a novel pipeline based on the assessment of diverse rna-seq alignment solutions
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4005686/
https://www.ncbi.nlm.nih.gov/pubmed/24574529
http://dx.doi.org/10.1093/nar/gku166
work_keys_str_mv AT gattoalberto finespliceenhancedsplicejunctiondetectionandquantificationanovelpipelinebasedontheassessmentofdiversernaseqalignmentsolutions
AT torrojafungairinocarlos finespliceenhancedsplicejunctiondetectionandquantificationanovelpipelinebasedontheassessmentofdiversernaseqalignmentsolutions
AT mazzarottofrancesco finespliceenhancedsplicejunctiondetectionandquantificationanovelpipelinebasedontheassessmentofdiversernaseqalignmentsolutions
AT cookstuarta finespliceenhancedsplicejunctiondetectionandquantificationanovelpipelinebasedontheassessmentofdiversernaseqalignmentsolutions
AT bartonpauljr finespliceenhancedsplicejunctiondetectionandquantificationanovelpipelinebasedontheassessmentofdiversernaseqalignmentsolutions
AT sanchezcabofatima finespliceenhancedsplicejunctiondetectionandquantificationanovelpipelinebasedontheassessmentofdiversernaseqalignmentsolutions
AT larapezzienrique finespliceenhancedsplicejunctiondetectionandquantificationanovelpipelinebasedontheassessmentofdiversernaseqalignmentsolutions