Cargando…

SplicingFactory—splicing diversity analysis for transcriptome data

MOTIVATION: Alternative splicing contributes to the diversity of RNA found in biological samples. Current tools investigating patterns of alternative splicing check for coordinated changes in the expression or relative ratio of RNA isoforms where specific isoforms are up- or down-regulated in a cond...

Descripción completa

Detalles Bibliográficos
Autores principales: Dankó, Benedek, Szikora, Péter, Pór, Tamás, Szeifert, Alexa, Sebestyén, Endre
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8722757/
https://www.ncbi.nlm.nih.gov/pubmed/34499147
http://dx.doi.org/10.1093/bioinformatics/btab648
_version_ 1784625581933985792
author Dankó, Benedek
Szikora, Péter
Pór, Tamás
Szeifert, Alexa
Sebestyén, Endre
author_facet Dankó, Benedek
Szikora, Péter
Pór, Tamás
Szeifert, Alexa
Sebestyén, Endre
author_sort Dankó, Benedek
collection PubMed
description MOTIVATION: Alternative splicing contributes to the diversity of RNA found in biological samples. Current tools investigating patterns of alternative splicing check for coordinated changes in the expression or relative ratio of RNA isoforms where specific isoforms are up- or down-regulated in a condition. However, the molecular process of splicing is stochastic and changes in RNA isoform diversity for a gene might arise between samples or conditions. A specific condition can be dominated by a single isoform, while multiple isoforms with similar expression levels can be present in a different condition. These changes might be the result of mutations, drug treatments or differences in the cellular or tissue environment. Here, we present a tool for the characterization and analysis of RNA isoform diversity using isoform level expression measurements. RESULTS: We developed an R package called SplicingFactory, to calculate various RNA isoform diversity metrics, and compare them across conditions. Using the package, we tested the effect of RNA-seq quantification tools, quantification uncertainty, gene expression levels and isoform numbers on the isoform diversity calculation. We analyzed a set of CD34+ hematopoietic stem cells and myelodysplastic syndrome samples and found a set of genes whose isoform diversity change is associated with SF3B1 mutations. AVAILABILITY AND IMPLEMENTATION: The SplicingFactory package is freely available under the GPL-3.0 license from Bioconductor for the Windows, MacOS and Linux operating systems (https://www.bioconductor.org/packages/release/bioc/html/SplicingFactory.html). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8722757
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-87227572022-01-05 SplicingFactory—splicing diversity analysis for transcriptome data Dankó, Benedek Szikora, Péter Pór, Tamás Szeifert, Alexa Sebestyén, Endre Bioinformatics Original Paper MOTIVATION: Alternative splicing contributes to the diversity of RNA found in biological samples. Current tools investigating patterns of alternative splicing check for coordinated changes in the expression or relative ratio of RNA isoforms where specific isoforms are up- or down-regulated in a condition. However, the molecular process of splicing is stochastic and changes in RNA isoform diversity for a gene might arise between samples or conditions. A specific condition can be dominated by a single isoform, while multiple isoforms with similar expression levels can be present in a different condition. These changes might be the result of mutations, drug treatments or differences in the cellular or tissue environment. Here, we present a tool for the characterization and analysis of RNA isoform diversity using isoform level expression measurements. RESULTS: We developed an R package called SplicingFactory, to calculate various RNA isoform diversity metrics, and compare them across conditions. Using the package, we tested the effect of RNA-seq quantification tools, quantification uncertainty, gene expression levels and isoform numbers on the isoform diversity calculation. We analyzed a set of CD34+ hematopoietic stem cells and myelodysplastic syndrome samples and found a set of genes whose isoform diversity change is associated with SF3B1 mutations. AVAILABILITY AND IMPLEMENTATION: The SplicingFactory package is freely available under the GPL-3.0 license from Bioconductor for the Windows, MacOS and Linux operating systems (https://www.bioconductor.org/packages/release/bioc/html/SplicingFactory.html). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-09-09 /pmc/articles/PMC8722757/ /pubmed/34499147 http://dx.doi.org/10.1093/bioinformatics/btab648 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Dankó, Benedek
Szikora, Péter
Pór, Tamás
Szeifert, Alexa
Sebestyén, Endre
SplicingFactory—splicing diversity analysis for transcriptome data
title SplicingFactory—splicing diversity analysis for transcriptome data
title_full SplicingFactory—splicing diversity analysis for transcriptome data
title_fullStr SplicingFactory—splicing diversity analysis for transcriptome data
title_full_unstemmed SplicingFactory—splicing diversity analysis for transcriptome data
title_short SplicingFactory—splicing diversity analysis for transcriptome data
title_sort splicingfactory—splicing diversity analysis for transcriptome data
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8722757/
https://www.ncbi.nlm.nih.gov/pubmed/34499147
http://dx.doi.org/10.1093/bioinformatics/btab648
work_keys_str_mv AT dankobenedek splicingfactorysplicingdiversityanalysisfortranscriptomedata
AT szikorapeter splicingfactorysplicingdiversityanalysisfortranscriptomedata
AT portamas splicingfactorysplicingdiversityanalysisfortranscriptomedata
AT szeifertalexa splicingfactorysplicingdiversityanalysisfortranscriptomedata
AT sebestyenendre splicingfactorysplicingdiversityanalysisfortranscriptomedata