Cargando…

Alternative splicing detection workflow needs a careful combination of sample prep and bioinformatics analysis

BACKGROUND: RNA-Seq provides remarkable power in the area of biomarkers discovery and disease characterization. Two crucial steps that affect RNA-Seq experiment results are Library Sample Preparation (LSP) and Bioinformatics Analysis (BA). This work describes an evaluation of the combined effect of...

Descripción completa

Detalles Bibliográficos
Autores principales: Carrara, Matteo, Lum, Josephine, Cordero, Francesca, Beccuti, Marco, Poidinger, Michael, Donatelli, Susanna, Calogero, Raffaele Adolfo, Zolezzi, Francesca
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4464605/
https://www.ncbi.nlm.nih.gov/pubmed/26050971
http://dx.doi.org/10.1186/1471-2105-16-S9-S2
_version_ 1782376005253464064
author Carrara, Matteo
Lum, Josephine
Cordero, Francesca
Beccuti, Marco
Poidinger, Michael
Donatelli, Susanna
Calogero, Raffaele Adolfo
Zolezzi, Francesca
author_facet Carrara, Matteo
Lum, Josephine
Cordero, Francesca
Beccuti, Marco
Poidinger, Michael
Donatelli, Susanna
Calogero, Raffaele Adolfo
Zolezzi, Francesca
author_sort Carrara, Matteo
collection PubMed
description BACKGROUND: RNA-Seq provides remarkable power in the area of biomarkers discovery and disease characterization. Two crucial steps that affect RNA-Seq experiment results are Library Sample Preparation (LSP) and Bioinformatics Analysis (BA). This work describes an evaluation of the combined effect of LSP methods and BA tools in the detection of splice variants. RESULTS: Different LSPs (TruSeq unstranded/stranded, ScriptSeq, NuGEN) allowed the detection of a large common set of splice variants. However, each LSP also detected a small set of unique transcripts that are characterized by a low coverage and/or FPKM. This effect was particularly evident using the low input RNA NuGEN v2 protocol. A benchmark dataset, in which synthetic reads as well as reads generated from standard (Illumina TruSeq 100) and low input (NuGEN) LSPs were spiked-in was used to evaluate the effect of LSP on the statistical detection of alternative splicing events (AltDE). Statistical detection of AltDE was done using as prototypes for splice variant-quantification Cuffdiff2 and RSEM-EBSeq. As prototype for exon-level analysis DEXSeq was used. Exon-level analysis performed slightly better than splice variant-quantification approaches, although at most only 50% of the spiked-in transcripts was detected. The performances of both splice variant-quantification and exon-level analysis improved when raising the number of input reads. CONCLUSION: Data, derived from NuGEN v2, were not the ideal input for AltDE, especially when the exon-level approach was used. We observed that both splice variant-quantification and exon-level analysis performances were strongly dependent on the number of input reads. Moreover, the ribosomal RNA depletion protocol was less sensitive in detecting splicing variants, possibly due to the significant percentage of the reads mapping to non-coding transcripts.
format Online
Article
Text
id pubmed-4464605
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44646052015-06-29 Alternative splicing detection workflow needs a careful combination of sample prep and bioinformatics analysis Carrara, Matteo Lum, Josephine Cordero, Francesca Beccuti, Marco Poidinger, Michael Donatelli, Susanna Calogero, Raffaele Adolfo Zolezzi, Francesca BMC Bioinformatics Research BACKGROUND: RNA-Seq provides remarkable power in the area of biomarkers discovery and disease characterization. Two crucial steps that affect RNA-Seq experiment results are Library Sample Preparation (LSP) and Bioinformatics Analysis (BA). This work describes an evaluation of the combined effect of LSP methods and BA tools in the detection of splice variants. RESULTS: Different LSPs (TruSeq unstranded/stranded, ScriptSeq, NuGEN) allowed the detection of a large common set of splice variants. However, each LSP also detected a small set of unique transcripts that are characterized by a low coverage and/or FPKM. This effect was particularly evident using the low input RNA NuGEN v2 protocol. A benchmark dataset, in which synthetic reads as well as reads generated from standard (Illumina TruSeq 100) and low input (NuGEN) LSPs were spiked-in was used to evaluate the effect of LSP on the statistical detection of alternative splicing events (AltDE). Statistical detection of AltDE was done using as prototypes for splice variant-quantification Cuffdiff2 and RSEM-EBSeq. As prototype for exon-level analysis DEXSeq was used. Exon-level analysis performed slightly better than splice variant-quantification approaches, although at most only 50% of the spiked-in transcripts was detected. The performances of both splice variant-quantification and exon-level analysis improved when raising the number of input reads. CONCLUSION: Data, derived from NuGEN v2, were not the ideal input for AltDE, especially when the exon-level approach was used. We observed that both splice variant-quantification and exon-level analysis performances were strongly dependent on the number of input reads. Moreover, the ribosomal RNA depletion protocol was less sensitive in detecting splicing variants, possibly due to the significant percentage of the reads mapping to non-coding transcripts. BioMed Central 2015-06-01 /pmc/articles/PMC4464605/ /pubmed/26050971 http://dx.doi.org/10.1186/1471-2105-16-S9-S2 Text en Copyright © 2015 Carrara et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Carrara, Matteo
Lum, Josephine
Cordero, Francesca
Beccuti, Marco
Poidinger, Michael
Donatelli, Susanna
Calogero, Raffaele Adolfo
Zolezzi, Francesca
Alternative splicing detection workflow needs a careful combination of sample prep and bioinformatics analysis
title Alternative splicing detection workflow needs a careful combination of sample prep and bioinformatics analysis
title_full Alternative splicing detection workflow needs a careful combination of sample prep and bioinformatics analysis
title_fullStr Alternative splicing detection workflow needs a careful combination of sample prep and bioinformatics analysis
title_full_unstemmed Alternative splicing detection workflow needs a careful combination of sample prep and bioinformatics analysis
title_short Alternative splicing detection workflow needs a careful combination of sample prep and bioinformatics analysis
title_sort alternative splicing detection workflow needs a careful combination of sample prep and bioinformatics analysis
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4464605/
https://www.ncbi.nlm.nih.gov/pubmed/26050971
http://dx.doi.org/10.1186/1471-2105-16-S9-S2
work_keys_str_mv AT carraramatteo alternativesplicingdetectionworkflowneedsacarefulcombinationofsampleprepandbioinformaticsanalysis
AT lumjosephine alternativesplicingdetectionworkflowneedsacarefulcombinationofsampleprepandbioinformaticsanalysis
AT corderofrancesca alternativesplicingdetectionworkflowneedsacarefulcombinationofsampleprepandbioinformaticsanalysis
AT beccutimarco alternativesplicingdetectionworkflowneedsacarefulcombinationofsampleprepandbioinformaticsanalysis
AT poidingermichael alternativesplicingdetectionworkflowneedsacarefulcombinationofsampleprepandbioinformaticsanalysis
AT donatellisusanna alternativesplicingdetectionworkflowneedsacarefulcombinationofsampleprepandbioinformaticsanalysis
AT calogeroraffaeleadolfo alternativesplicingdetectionworkflowneedsacarefulcombinationofsampleprepandbioinformaticsanalysis
AT zolezzifrancesca alternativesplicingdetectionworkflowneedsacarefulcombinationofsampleprepandbioinformaticsanalysis