Cargando…
Analysis of stranded information using an automated procedure for strand specific RNA sequencing
BACKGROUND: Strand specific RNA sequencing is rapidly replacing conventional cDNA sequencing as an approach for assessing information about the transcriptome. Alongside improved laboratory protocols the development of bioinformatical tools is steadily progressing. In the current procedure the Illumi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4247151/ https://www.ncbi.nlm.nih.gov/pubmed/25070246 http://dx.doi.org/10.1186/1471-2164-15-631 |
_version_ | 1782346597218123776 |
---|---|
author | Sigurgeirsson, Benjamín Emanuelsson, Olof Lundeberg, Joakim |
author_facet | Sigurgeirsson, Benjamín Emanuelsson, Olof Lundeberg, Joakim |
author_sort | Sigurgeirsson, Benjamín |
collection | PubMed |
description | BACKGROUND: Strand specific RNA sequencing is rapidly replacing conventional cDNA sequencing as an approach for assessing information about the transcriptome. Alongside improved laboratory protocols the development of bioinformatical tools is steadily progressing. In the current procedure the Illumina TruSeq library preparation kit is used, along with additional reagents, to make stranded libraries in an automated fashion which are then sequenced on Illumina HiSeq 2000. By the use of freely available bioinformatical tools we show, through quality metrics, that the protocol is robust and reproducible. We further highlight the practicality of strand specific libraries by comparing expression of strand specific libraries to non-stranded libraries, by looking at known antisense transcription of pseudogenes and by identifying novel transcription. Furthermore, two ribosomal depletion kits, RiboMinus and RiboZero, are compared and two sequence aligners, Tophat2 and STAR, are also compared. RESULTS: The, non-stranded, Illumina TruSeq kit can be adapted to generate strand specific libraries and can be used to access detailed information on the transcriptome. The RiboZero kit is very effective in removing ribosomal RNA from total RNA and the STAR aligner produces high mapping yield in a short time. Strand specific data gives more detailed and correct results than does non-stranded data as we show when estimating expression values and in assembling transcripts. Even well annotated genomes need improvements and corrections which can be achieved using strand specific data. CONCLUSIONS: Researchers in the field should strive to use strand specific data; it allows for more confidence in the data analysis and is less likely to lead to false conclusions. If faced with analysing non-stranded data, researchers should be well aware of the caveats of that approach. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-631) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4247151 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42471512014-11-29 Analysis of stranded information using an automated procedure for strand specific RNA sequencing Sigurgeirsson, Benjamín Emanuelsson, Olof Lundeberg, Joakim BMC Genomics Methodology Article BACKGROUND: Strand specific RNA sequencing is rapidly replacing conventional cDNA sequencing as an approach for assessing information about the transcriptome. Alongside improved laboratory protocols the development of bioinformatical tools is steadily progressing. In the current procedure the Illumina TruSeq library preparation kit is used, along with additional reagents, to make stranded libraries in an automated fashion which are then sequenced on Illumina HiSeq 2000. By the use of freely available bioinformatical tools we show, through quality metrics, that the protocol is robust and reproducible. We further highlight the practicality of strand specific libraries by comparing expression of strand specific libraries to non-stranded libraries, by looking at known antisense transcription of pseudogenes and by identifying novel transcription. Furthermore, two ribosomal depletion kits, RiboMinus and RiboZero, are compared and two sequence aligners, Tophat2 and STAR, are also compared. RESULTS: The, non-stranded, Illumina TruSeq kit can be adapted to generate strand specific libraries and can be used to access detailed information on the transcriptome. The RiboZero kit is very effective in removing ribosomal RNA from total RNA and the STAR aligner produces high mapping yield in a short time. Strand specific data gives more detailed and correct results than does non-stranded data as we show when estimating expression values and in assembling transcripts. Even well annotated genomes need improvements and corrections which can be achieved using strand specific data. CONCLUSIONS: Researchers in the field should strive to use strand specific data; it allows for more confidence in the data analysis and is less likely to lead to false conclusions. If faced with analysing non-stranded data, researchers should be well aware of the caveats of that approach. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-631) contains supplementary material, which is available to authorized users. BioMed Central 2014-07-28 /pmc/articles/PMC4247151/ /pubmed/25070246 http://dx.doi.org/10.1186/1471-2164-15-631 Text en © Sigurgeirsson et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Sigurgeirsson, Benjamín Emanuelsson, Olof Lundeberg, Joakim Analysis of stranded information using an automated procedure for strand specific RNA sequencing |
title | Analysis of stranded information using an automated procedure for strand specific RNA sequencing |
title_full | Analysis of stranded information using an automated procedure for strand specific RNA sequencing |
title_fullStr | Analysis of stranded information using an automated procedure for strand specific RNA sequencing |
title_full_unstemmed | Analysis of stranded information using an automated procedure for strand specific RNA sequencing |
title_short | Analysis of stranded information using an automated procedure for strand specific RNA sequencing |
title_sort | analysis of stranded information using an automated procedure for strand specific rna sequencing |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4247151/ https://www.ncbi.nlm.nih.gov/pubmed/25070246 http://dx.doi.org/10.1186/1471-2164-15-631 |
work_keys_str_mv | AT sigurgeirssonbenjamin analysisofstrandedinformationusinganautomatedprocedureforstrandspecificrnasequencing AT emanuelssonolof analysisofstrandedinformationusinganautomatedprocedureforstrandspecificrnasequencing AT lundebergjoakim analysisofstrandedinformationusinganautomatedprocedureforstrandspecificrnasequencing |