Cargando…

Design of RNA splicing analysis null models for post hoc filtering of Drosophila head RNA-Seq data with the splicing analysis kit (Spanki)

BACKGROUND: The production of multiple transcript isoforms from one gene is a major source of transcriptome complexity. RNA-Seq experiments, in which transcripts are converted to cDNA and sequenced, allow the resolution and quantification of alternative transcript isoforms. However, methods to analy...

Descripción completa

Detalles Bibliográficos
Autores principales: Sturgill, David, Malone, John H, Sun, Xia, Smith, Harold E, Rabinow, Leonard, Samson, Marie-Laure, Oliver, Brian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3827500/
https://www.ncbi.nlm.nih.gov/pubmed/24209455
http://dx.doi.org/10.1186/1471-2105-14-320
_version_ 1782478253531856896
author Sturgill, David
Malone, John H
Sun, Xia
Smith, Harold E
Rabinow, Leonard
Samson, Marie-Laure
Oliver, Brian
author_facet Sturgill, David
Malone, John H
Sun, Xia
Smith, Harold E
Rabinow, Leonard
Samson, Marie-Laure
Oliver, Brian
author_sort Sturgill, David
collection PubMed
description BACKGROUND: The production of multiple transcript isoforms from one gene is a major source of transcriptome complexity. RNA-Seq experiments, in which transcripts are converted to cDNA and sequenced, allow the resolution and quantification of alternative transcript isoforms. However, methods to analyze splicing are underdeveloped and errors resulting in incorrect splicing calls occur in every experiment. RESULTS: We used RNA-Seq data to develop sequencing and aligner error models. By applying these error models to known input from simulations, we found that errors result from false alignment to minor splice motifs and antisense stands, shifted junction positions, paralog joining, and repeat induced gaps. By using a series of quantitative and qualitative filters, we eliminated diagnosed errors in the simulation, and applied this to RNA-Seq data from Drosophila melanogaster heads. We used high-confidence junction detections to specifically interrogate local splicing differences between transcripts. This method out-performed commonly used RNA-seq methods to identify known alternative splicing events in the Drosophila sex determination pathway. We describe a flexible software package to perform these tasks called Splicing Analysis Kit (Spanki), available at http://www.cbcb.umd.edu/software/spanki. CONCLUSIONS: Splice-junction centric analysis of RNA-Seq data provides advantages in specificity for detection of alternative splicing. Our software provides tools to better understand error profiles in RNA-Seq data and improve inference from this new technology. The splice-junction centric approach that this software enables will provide more accurate estimates of differentially regulated splicing than current tools.
format Online
Article
Text
id pubmed-3827500
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38275002013-11-15 Design of RNA splicing analysis null models for post hoc filtering of Drosophila head RNA-Seq data with the splicing analysis kit (Spanki) Sturgill, David Malone, John H Sun, Xia Smith, Harold E Rabinow, Leonard Samson, Marie-Laure Oliver, Brian BMC Bioinformatics Methodology Article BACKGROUND: The production of multiple transcript isoforms from one gene is a major source of transcriptome complexity. RNA-Seq experiments, in which transcripts are converted to cDNA and sequenced, allow the resolution and quantification of alternative transcript isoforms. However, methods to analyze splicing are underdeveloped and errors resulting in incorrect splicing calls occur in every experiment. RESULTS: We used RNA-Seq data to develop sequencing and aligner error models. By applying these error models to known input from simulations, we found that errors result from false alignment to minor splice motifs and antisense stands, shifted junction positions, paralog joining, and repeat induced gaps. By using a series of quantitative and qualitative filters, we eliminated diagnosed errors in the simulation, and applied this to RNA-Seq data from Drosophila melanogaster heads. We used high-confidence junction detections to specifically interrogate local splicing differences between transcripts. This method out-performed commonly used RNA-seq methods to identify known alternative splicing events in the Drosophila sex determination pathway. We describe a flexible software package to perform these tasks called Splicing Analysis Kit (Spanki), available at http://www.cbcb.umd.edu/software/spanki. CONCLUSIONS: Splice-junction centric analysis of RNA-Seq data provides advantages in specificity for detection of alternative splicing. Our software provides tools to better understand error profiles in RNA-Seq data and improve inference from this new technology. The splice-junction centric approach that this software enables will provide more accurate estimates of differentially regulated splicing than current tools. BioMed Central 2013-11-09 /pmc/articles/PMC3827500/ /pubmed/24209455 http://dx.doi.org/10.1186/1471-2105-14-320 Text en Copyright © 2013 Sturgill et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Sturgill, David
Malone, John H
Sun, Xia
Smith, Harold E
Rabinow, Leonard
Samson, Marie-Laure
Oliver, Brian
Design of RNA splicing analysis null models for post hoc filtering of Drosophila head RNA-Seq data with the splicing analysis kit (Spanki)
title Design of RNA splicing analysis null models for post hoc filtering of Drosophila head RNA-Seq data with the splicing analysis kit (Spanki)
title_full Design of RNA splicing analysis null models for post hoc filtering of Drosophila head RNA-Seq data with the splicing analysis kit (Spanki)
title_fullStr Design of RNA splicing analysis null models for post hoc filtering of Drosophila head RNA-Seq data with the splicing analysis kit (Spanki)
title_full_unstemmed Design of RNA splicing analysis null models for post hoc filtering of Drosophila head RNA-Seq data with the splicing analysis kit (Spanki)
title_short Design of RNA splicing analysis null models for post hoc filtering of Drosophila head RNA-Seq data with the splicing analysis kit (Spanki)
title_sort design of rna splicing analysis null models for post hoc filtering of drosophila head rna-seq data with the splicing analysis kit (spanki)
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3827500/
https://www.ncbi.nlm.nih.gov/pubmed/24209455
http://dx.doi.org/10.1186/1471-2105-14-320
work_keys_str_mv AT sturgilldavid designofrnasplicinganalysisnullmodelsforposthocfilteringofdrosophilaheadrnaseqdatawiththesplicinganalysiskitspanki
AT malonejohnh designofrnasplicinganalysisnullmodelsforposthocfilteringofdrosophilaheadrnaseqdatawiththesplicinganalysiskitspanki
AT sunxia designofrnasplicinganalysisnullmodelsforposthocfilteringofdrosophilaheadrnaseqdatawiththesplicinganalysiskitspanki
AT smithharolde designofrnasplicinganalysisnullmodelsforposthocfilteringofdrosophilaheadrnaseqdatawiththesplicinganalysiskitspanki
AT rabinowleonard designofrnasplicinganalysisnullmodelsforposthocfilteringofdrosophilaheadrnaseqdatawiththesplicinganalysiskitspanki
AT samsonmarielaure designofrnasplicinganalysisnullmodelsforposthocfilteringofdrosophilaheadrnaseqdatawiththesplicinganalysiskitspanki
AT oliverbrian designofrnasplicinganalysisnullmodelsforposthocfilteringofdrosophilaheadrnaseqdatawiththesplicinganalysiskitspanki