Cargando…
Alternative splicing detection workflow needs a careful combination of sample prep and bioinformatics analysis
BACKGROUND: RNA-Seq provides remarkable power in the area of biomarkers discovery and disease characterization. Two crucial steps that affect RNA-Seq experiment results are Library Sample Preparation (LSP) and Bioinformatics Analysis (BA). This work describes an evaluation of the combined effect of...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4464605/ https://www.ncbi.nlm.nih.gov/pubmed/26050971 http://dx.doi.org/10.1186/1471-2105-16-S9-S2 |
_version_ | 1782376005253464064 |
---|---|
author | Carrara, Matteo Lum, Josephine Cordero, Francesca Beccuti, Marco Poidinger, Michael Donatelli, Susanna Calogero, Raffaele Adolfo Zolezzi, Francesca |
author_facet | Carrara, Matteo Lum, Josephine Cordero, Francesca Beccuti, Marco Poidinger, Michael Donatelli, Susanna Calogero, Raffaele Adolfo Zolezzi, Francesca |
author_sort | Carrara, Matteo |
collection | PubMed |
description | BACKGROUND: RNA-Seq provides remarkable power in the area of biomarkers discovery and disease characterization. Two crucial steps that affect RNA-Seq experiment results are Library Sample Preparation (LSP) and Bioinformatics Analysis (BA). This work describes an evaluation of the combined effect of LSP methods and BA tools in the detection of splice variants. RESULTS: Different LSPs (TruSeq unstranded/stranded, ScriptSeq, NuGEN) allowed the detection of a large common set of splice variants. However, each LSP also detected a small set of unique transcripts that are characterized by a low coverage and/or FPKM. This effect was particularly evident using the low input RNA NuGEN v2 protocol. A benchmark dataset, in which synthetic reads as well as reads generated from standard (Illumina TruSeq 100) and low input (NuGEN) LSPs were spiked-in was used to evaluate the effect of LSP on the statistical detection of alternative splicing events (AltDE). Statistical detection of AltDE was done using as prototypes for splice variant-quantification Cuffdiff2 and RSEM-EBSeq. As prototype for exon-level analysis DEXSeq was used. Exon-level analysis performed slightly better than splice variant-quantification approaches, although at most only 50% of the spiked-in transcripts was detected. The performances of both splice variant-quantification and exon-level analysis improved when raising the number of input reads. CONCLUSION: Data, derived from NuGEN v2, were not the ideal input for AltDE, especially when the exon-level approach was used. We observed that both splice variant-quantification and exon-level analysis performances were strongly dependent on the number of input reads. Moreover, the ribosomal RNA depletion protocol was less sensitive in detecting splicing variants, possibly due to the significant percentage of the reads mapping to non-coding transcripts. |
format | Online Article Text |
id | pubmed-4464605 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-44646052015-06-29 Alternative splicing detection workflow needs a careful combination of sample prep and bioinformatics analysis Carrara, Matteo Lum, Josephine Cordero, Francesca Beccuti, Marco Poidinger, Michael Donatelli, Susanna Calogero, Raffaele Adolfo Zolezzi, Francesca BMC Bioinformatics Research BACKGROUND: RNA-Seq provides remarkable power in the area of biomarkers discovery and disease characterization. Two crucial steps that affect RNA-Seq experiment results are Library Sample Preparation (LSP) and Bioinformatics Analysis (BA). This work describes an evaluation of the combined effect of LSP methods and BA tools in the detection of splice variants. RESULTS: Different LSPs (TruSeq unstranded/stranded, ScriptSeq, NuGEN) allowed the detection of a large common set of splice variants. However, each LSP also detected a small set of unique transcripts that are characterized by a low coverage and/or FPKM. This effect was particularly evident using the low input RNA NuGEN v2 protocol. A benchmark dataset, in which synthetic reads as well as reads generated from standard (Illumina TruSeq 100) and low input (NuGEN) LSPs were spiked-in was used to evaluate the effect of LSP on the statistical detection of alternative splicing events (AltDE). Statistical detection of AltDE was done using as prototypes for splice variant-quantification Cuffdiff2 and RSEM-EBSeq. As prototype for exon-level analysis DEXSeq was used. Exon-level analysis performed slightly better than splice variant-quantification approaches, although at most only 50% of the spiked-in transcripts was detected. The performances of both splice variant-quantification and exon-level analysis improved when raising the number of input reads. CONCLUSION: Data, derived from NuGEN v2, were not the ideal input for AltDE, especially when the exon-level approach was used. We observed that both splice variant-quantification and exon-level analysis performances were strongly dependent on the number of input reads. Moreover, the ribosomal RNA depletion protocol was less sensitive in detecting splicing variants, possibly due to the significant percentage of the reads mapping to non-coding transcripts. BioMed Central 2015-06-01 /pmc/articles/PMC4464605/ /pubmed/26050971 http://dx.doi.org/10.1186/1471-2105-16-S9-S2 Text en Copyright © 2015 Carrara et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Carrara, Matteo Lum, Josephine Cordero, Francesca Beccuti, Marco Poidinger, Michael Donatelli, Susanna Calogero, Raffaele Adolfo Zolezzi, Francesca Alternative splicing detection workflow needs a careful combination of sample prep and bioinformatics analysis |
title | Alternative splicing detection workflow needs a careful combination of sample prep and bioinformatics analysis |
title_full | Alternative splicing detection workflow needs a careful combination of sample prep and bioinformatics analysis |
title_fullStr | Alternative splicing detection workflow needs a careful combination of sample prep and bioinformatics analysis |
title_full_unstemmed | Alternative splicing detection workflow needs a careful combination of sample prep and bioinformatics analysis |
title_short | Alternative splicing detection workflow needs a careful combination of sample prep and bioinformatics analysis |
title_sort | alternative splicing detection workflow needs a careful combination of sample prep and bioinformatics analysis |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4464605/ https://www.ncbi.nlm.nih.gov/pubmed/26050971 http://dx.doi.org/10.1186/1471-2105-16-S9-S2 |
work_keys_str_mv | AT carraramatteo alternativesplicingdetectionworkflowneedsacarefulcombinationofsampleprepandbioinformaticsanalysis AT lumjosephine alternativesplicingdetectionworkflowneedsacarefulcombinationofsampleprepandbioinformaticsanalysis AT corderofrancesca alternativesplicingdetectionworkflowneedsacarefulcombinationofsampleprepandbioinformaticsanalysis AT beccutimarco alternativesplicingdetectionworkflowneedsacarefulcombinationofsampleprepandbioinformaticsanalysis AT poidingermichael alternativesplicingdetectionworkflowneedsacarefulcombinationofsampleprepandbioinformaticsanalysis AT donatellisusanna alternativesplicingdetectionworkflowneedsacarefulcombinationofsampleprepandbioinformaticsanalysis AT calogeroraffaeleadolfo alternativesplicingdetectionworkflowneedsacarefulcombinationofsampleprepandbioinformaticsanalysis AT zolezzifrancesca alternativesplicingdetectionworkflowneedsacarefulcombinationofsampleprepandbioinformaticsanalysis |