Cargando…
SL-quant: a fast and flexible pipeline to quantify spliced leader trans-splicing events from RNA-seq data
BACKGROUND: The spliceosomal transfer of a short spliced leader (SL) RNA to an independent pre-mRNA molecule is called SL trans-splicing and is widespread in the nematode Caenorhabditis elegans. While RNA-sequencing (RNA-seq) data contain information on such events, properly documented methods to ex...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6055573/ https://www.ncbi.nlm.nih.gov/pubmed/30010768 http://dx.doi.org/10.1093/gigascience/giy084 |
_version_ | 1783341198852751360 |
---|---|
author | Yague-Sanz, Carlo Hermand, Damien |
author_facet | Yague-Sanz, Carlo Hermand, Damien |
author_sort | Yague-Sanz, Carlo |
collection | PubMed |
description | BACKGROUND: The spliceosomal transfer of a short spliced leader (SL) RNA to an independent pre-mRNA molecule is called SL trans-splicing and is widespread in the nematode Caenorhabditis elegans. While RNA-sequencing (RNA-seq) data contain information on such events, properly documented methods to extract them are lacking. FINDINGS: To address this, we developed SL-quant, a fast and flexible pipeline that adapts to paired-end and single-end RNA-seq data and accurately quantifies SL trans-splicing events. It is designed to work downstream of read mapping and uses the reads left unmapped as primary input. Briefly, the SL sequences are identified with high specificity and are trimmed from the input reads, which are then remapped on the reference genome and quantified at the nucleotide position level (SL trans-splice sites) or at the gene level. CONCLUSIONS: SL-quant completes within 10 minutes on a basic desktop computer for typical C. elegans RNA-seq datasets and can be applied to other species as well. Validating the method, the SL trans-splice sites identified display the expected consensus sequence, and the results of the gene-level quantification are predictive of the gene position within operons. We also compared SL-quant to a recently published SL-containing read identification strategy that was found to be more sensitive but less specific than SL-quant. Both methods are implemented as a bash script available under the MIT license [1]. Full instructions for its installation, usage, and adaptation to other organisms are provided. |
format | Online Article Text |
id | pubmed-6055573 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-60555732018-07-25 SL-quant: a fast and flexible pipeline to quantify spliced leader trans-splicing events from RNA-seq data Yague-Sanz, Carlo Hermand, Damien Gigascience Technical Note BACKGROUND: The spliceosomal transfer of a short spliced leader (SL) RNA to an independent pre-mRNA molecule is called SL trans-splicing and is widespread in the nematode Caenorhabditis elegans. While RNA-sequencing (RNA-seq) data contain information on such events, properly documented methods to extract them are lacking. FINDINGS: To address this, we developed SL-quant, a fast and flexible pipeline that adapts to paired-end and single-end RNA-seq data and accurately quantifies SL trans-splicing events. It is designed to work downstream of read mapping and uses the reads left unmapped as primary input. Briefly, the SL sequences are identified with high specificity and are trimmed from the input reads, which are then remapped on the reference genome and quantified at the nucleotide position level (SL trans-splice sites) or at the gene level. CONCLUSIONS: SL-quant completes within 10 minutes on a basic desktop computer for typical C. elegans RNA-seq datasets and can be applied to other species as well. Validating the method, the SL trans-splice sites identified display the expected consensus sequence, and the results of the gene-level quantification are predictive of the gene position within operons. We also compared SL-quant to a recently published SL-containing read identification strategy that was found to be more sensitive but less specific than SL-quant. Both methods are implemented as a bash script available under the MIT license [1]. Full instructions for its installation, usage, and adaptation to other organisms are provided. Oxford University Press 2018-07-11 /pmc/articles/PMC6055573/ /pubmed/30010768 http://dx.doi.org/10.1093/gigascience/giy084 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Technical Note Yague-Sanz, Carlo Hermand, Damien SL-quant: a fast and flexible pipeline to quantify spliced leader trans-splicing events from RNA-seq data |
title |
SL-quant: a fast and flexible pipeline to quantify spliced leader trans-splicing events from RNA-seq data |
title_full |
SL-quant: a fast and flexible pipeline to quantify spliced leader trans-splicing events from RNA-seq data |
title_fullStr |
SL-quant: a fast and flexible pipeline to quantify spliced leader trans-splicing events from RNA-seq data |
title_full_unstemmed |
SL-quant: a fast and flexible pipeline to quantify spliced leader trans-splicing events from RNA-seq data |
title_short |
SL-quant: a fast and flexible pipeline to quantify spliced leader trans-splicing events from RNA-seq data |
title_sort | sl-quant: a fast and flexible pipeline to quantify spliced leader trans-splicing events from rna-seq data |
topic | Technical Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6055573/ https://www.ncbi.nlm.nih.gov/pubmed/30010768 http://dx.doi.org/10.1093/gigascience/giy084 |
work_keys_str_mv | AT yaguesanzcarlo slquantafastandflexiblepipelinetoquantifysplicedleadertranssplicingeventsfromrnaseqdata AT hermanddamien slquantafastandflexiblepipelinetoquantifysplicedleadertranssplicingeventsfromrnaseqdata |