Cargando…

Analysis of stranded information using an automated procedure for strand specific RNA sequencing

BACKGROUND: Strand specific RNA sequencing is rapidly replacing conventional cDNA sequencing as an approach for assessing information about the transcriptome. Alongside improved laboratory protocols the development of bioinformatical tools is steadily progressing. In the current procedure the Illumi...

Descripción completa

Detalles Bibliográficos
Autores principales: Sigurgeirsson, Benjamín, Emanuelsson, Olof, Lundeberg, Joakim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4247151/
https://www.ncbi.nlm.nih.gov/pubmed/25070246
http://dx.doi.org/10.1186/1471-2164-15-631
_version_ 1782346597218123776
author Sigurgeirsson, Benjamín
Emanuelsson, Olof
Lundeberg, Joakim
author_facet Sigurgeirsson, Benjamín
Emanuelsson, Olof
Lundeberg, Joakim
author_sort Sigurgeirsson, Benjamín
collection PubMed
description BACKGROUND: Strand specific RNA sequencing is rapidly replacing conventional cDNA sequencing as an approach for assessing information about the transcriptome. Alongside improved laboratory protocols the development of bioinformatical tools is steadily progressing. In the current procedure the Illumina TruSeq library preparation kit is used, along with additional reagents, to make stranded libraries in an automated fashion which are then sequenced on Illumina HiSeq 2000. By the use of freely available bioinformatical tools we show, through quality metrics, that the protocol is robust and reproducible. We further highlight the practicality of strand specific libraries by comparing expression of strand specific libraries to non-stranded libraries, by looking at known antisense transcription of pseudogenes and by identifying novel transcription. Furthermore, two ribosomal depletion kits, RiboMinus and RiboZero, are compared and two sequence aligners, Tophat2 and STAR, are also compared. RESULTS: The, non-stranded, Illumina TruSeq kit can be adapted to generate strand specific libraries and can be used to access detailed information on the transcriptome. The RiboZero kit is very effective in removing ribosomal RNA from total RNA and the STAR aligner produces high mapping yield in a short time. Strand specific data gives more detailed and correct results than does non-stranded data as we show when estimating expression values and in assembling transcripts. Even well annotated genomes need improvements and corrections which can be achieved using strand specific data. CONCLUSIONS: Researchers in the field should strive to use strand specific data; it allows for more confidence in the data analysis and is less likely to lead to false conclusions. If faced with analysing non-stranded data, researchers should be well aware of the caveats of that approach. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-631) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4247151
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42471512014-11-29 Analysis of stranded information using an automated procedure for strand specific RNA sequencing Sigurgeirsson, Benjamín Emanuelsson, Olof Lundeberg, Joakim BMC Genomics Methodology Article BACKGROUND: Strand specific RNA sequencing is rapidly replacing conventional cDNA sequencing as an approach for assessing information about the transcriptome. Alongside improved laboratory protocols the development of bioinformatical tools is steadily progressing. In the current procedure the Illumina TruSeq library preparation kit is used, along with additional reagents, to make stranded libraries in an automated fashion which are then sequenced on Illumina HiSeq 2000. By the use of freely available bioinformatical tools we show, through quality metrics, that the protocol is robust and reproducible. We further highlight the practicality of strand specific libraries by comparing expression of strand specific libraries to non-stranded libraries, by looking at known antisense transcription of pseudogenes and by identifying novel transcription. Furthermore, two ribosomal depletion kits, RiboMinus and RiboZero, are compared and two sequence aligners, Tophat2 and STAR, are also compared. RESULTS: The, non-stranded, Illumina TruSeq kit can be adapted to generate strand specific libraries and can be used to access detailed information on the transcriptome. The RiboZero kit is very effective in removing ribosomal RNA from total RNA and the STAR aligner produces high mapping yield in a short time. Strand specific data gives more detailed and correct results than does non-stranded data as we show when estimating expression values and in assembling transcripts. Even well annotated genomes need improvements and corrections which can be achieved using strand specific data. CONCLUSIONS: Researchers in the field should strive to use strand specific data; it allows for more confidence in the data analysis and is less likely to lead to false conclusions. If faced with analysing non-stranded data, researchers should be well aware of the caveats of that approach. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-631) contains supplementary material, which is available to authorized users. BioMed Central 2014-07-28 /pmc/articles/PMC4247151/ /pubmed/25070246 http://dx.doi.org/10.1186/1471-2164-15-631 Text en © Sigurgeirsson et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Sigurgeirsson, Benjamín
Emanuelsson, Olof
Lundeberg, Joakim
Analysis of stranded information using an automated procedure for strand specific RNA sequencing
title Analysis of stranded information using an automated procedure for strand specific RNA sequencing
title_full Analysis of stranded information using an automated procedure for strand specific RNA sequencing
title_fullStr Analysis of stranded information using an automated procedure for strand specific RNA sequencing
title_full_unstemmed Analysis of stranded information using an automated procedure for strand specific RNA sequencing
title_short Analysis of stranded information using an automated procedure for strand specific RNA sequencing
title_sort analysis of stranded information using an automated procedure for strand specific rna sequencing
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4247151/
https://www.ncbi.nlm.nih.gov/pubmed/25070246
http://dx.doi.org/10.1186/1471-2164-15-631
work_keys_str_mv AT sigurgeirssonbenjamin analysisofstrandedinformationusinganautomatedprocedureforstrandspecificrnasequencing
AT emanuelssonolof analysisofstrandedinformationusinganautomatedprocedureforstrandspecificrnasequencing
AT lundebergjoakim analysisofstrandedinformationusinganautomatedprocedureforstrandspecificrnasequencing