Cargando…

SL-quant: a fast and flexible pipeline to quantify spliced leader trans-splicing events from RNA-seq data

BACKGROUND: The spliceosomal transfer of a short spliced leader (SL) RNA to an independent pre-mRNA molecule is called SL trans-splicing and is widespread in the nematode Caenorhabditis elegans. While RNA-sequencing (RNA-seq) data contain information on such events, properly documented methods to ex...

Descripción completa

Detalles Bibliográficos
Autores principales: Yague-Sanz, Carlo, Hermand, Damien
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6055573/
https://www.ncbi.nlm.nih.gov/pubmed/30010768
http://dx.doi.org/10.1093/gigascience/giy084
_version_ 1783341198852751360
author Yague-Sanz, Carlo
Hermand, Damien
author_facet Yague-Sanz, Carlo
Hermand, Damien
author_sort Yague-Sanz, Carlo
collection PubMed
description BACKGROUND: The spliceosomal transfer of a short spliced leader (SL) RNA to an independent pre-mRNA molecule is called SL trans-splicing and is widespread in the nematode Caenorhabditis elegans. While RNA-sequencing (RNA-seq) data contain information on such events, properly documented methods to extract them are lacking. FINDINGS: To address this, we developed SL-quant, a fast and flexible pipeline that adapts to paired-end and single-end RNA-seq data and accurately quantifies SL trans-splicing events. It is designed to work downstream of read mapping and uses the reads left unmapped as primary input. Briefly, the SL sequences are identified with high specificity and are trimmed from the input reads, which are then remapped on the reference genome and quantified at the nucleotide position level (SL trans-splice sites) or at the gene level. CONCLUSIONS: SL-quant completes within 10 minutes on a basic desktop computer for typical C. elegans RNA-seq datasets and can be applied to other species as well. Validating the method, the SL trans-splice sites identified display the expected consensus sequence, and the results of the gene-level quantification are predictive of the gene position within operons. We also compared SL-quant to a recently published SL-containing read identification strategy that was found to be more sensitive but less specific than SL-quant. Both methods are implemented as a bash script available under the MIT license [1]. Full instructions for its installation, usage, and adaptation to other organisms are provided.
format Online
Article
Text
id pubmed-6055573
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-60555732018-07-25 SL-quant: a fast and flexible pipeline to quantify spliced leader trans-splicing events from RNA-seq data Yague-Sanz, Carlo Hermand, Damien Gigascience Technical Note BACKGROUND: The spliceosomal transfer of a short spliced leader (SL) RNA to an independent pre-mRNA molecule is called SL trans-splicing and is widespread in the nematode Caenorhabditis elegans. While RNA-sequencing (RNA-seq) data contain information on such events, properly documented methods to extract them are lacking. FINDINGS: To address this, we developed SL-quant, a fast and flexible pipeline that adapts to paired-end and single-end RNA-seq data and accurately quantifies SL trans-splicing events. It is designed to work downstream of read mapping and uses the reads left unmapped as primary input. Briefly, the SL sequences are identified with high specificity and are trimmed from the input reads, which are then remapped on the reference genome and quantified at the nucleotide position level (SL trans-splice sites) or at the gene level. CONCLUSIONS: SL-quant completes within 10 minutes on a basic desktop computer for typical C. elegans RNA-seq datasets and can be applied to other species as well. Validating the method, the SL trans-splice sites identified display the expected consensus sequence, and the results of the gene-level quantification are predictive of the gene position within operons. We also compared SL-quant to a recently published SL-containing read identification strategy that was found to be more sensitive but less specific than SL-quant. Both methods are implemented as a bash script available under the MIT license [1]. Full instructions for its installation, usage, and adaptation to other organisms are provided. Oxford University Press 2018-07-11 /pmc/articles/PMC6055573/ /pubmed/30010768 http://dx.doi.org/10.1093/gigascience/giy084 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Yague-Sanz, Carlo
Hermand, Damien
SL-quant: a fast and flexible pipeline to quantify spliced leader trans-splicing events from RNA-seq data
title SL-quant: a fast and flexible pipeline to quantify spliced leader trans-splicing events from RNA-seq data
title_full SL-quant: a fast and flexible pipeline to quantify spliced leader trans-splicing events from RNA-seq data
title_fullStr SL-quant: a fast and flexible pipeline to quantify spliced leader trans-splicing events from RNA-seq data
title_full_unstemmed SL-quant: a fast and flexible pipeline to quantify spliced leader trans-splicing events from RNA-seq data
title_short SL-quant: a fast and flexible pipeline to quantify spliced leader trans-splicing events from RNA-seq data
title_sort sl-quant: a fast and flexible pipeline to quantify spliced leader trans-splicing events from rna-seq data
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6055573/
https://www.ncbi.nlm.nih.gov/pubmed/30010768
http://dx.doi.org/10.1093/gigascience/giy084
work_keys_str_mv AT yaguesanzcarlo slquantafastandflexiblepipelinetoquantifysplicedleadertranssplicingeventsfromrnaseqdata
AT hermanddamien slquantafastandflexiblepipelinetoquantifysplicedleadertranssplicingeventsfromrnaseqdata