Cargando…

Computational analysis of bacterial RNA-Seq data

Recent advances in high-throughput RNA sequencing (RNA-seq) have enabled tremendous leaps forward in our understanding of bacterial transcriptomes. However, computational methods for analysis of bacterial transcriptome data have not kept pace with the large and growing data sets generated by RNA-seq...

Descripción completa

Detalles Bibliográficos
Autores principales: McClure, Ryan, Balasubramanian, Divya, Sun, Yan, Bobrovskyy, Maksym, Sumby, Paul, Genco, Caroline A., Vanderpool, Carin K., Tjaden, Brian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3737546/
https://www.ncbi.nlm.nih.gov/pubmed/23716638
http://dx.doi.org/10.1093/nar/gkt444
_version_ 1782279877269913600
author McClure, Ryan
Balasubramanian, Divya
Sun, Yan
Bobrovskyy, Maksym
Sumby, Paul
Genco, Caroline A.
Vanderpool, Carin K.
Tjaden, Brian
author_facet McClure, Ryan
Balasubramanian, Divya
Sun, Yan
Bobrovskyy, Maksym
Sumby, Paul
Genco, Caroline A.
Vanderpool, Carin K.
Tjaden, Brian
author_sort McClure, Ryan
collection PubMed
description Recent advances in high-throughput RNA sequencing (RNA-seq) have enabled tremendous leaps forward in our understanding of bacterial transcriptomes. However, computational methods for analysis of bacterial transcriptome data have not kept pace with the large and growing data sets generated by RNA-seq technology. Here, we present new algorithms, specific to bacterial gene structures and transcriptomes, for analysis of RNA-seq data. The algorithms are implemented in an open source software system called Rockhopper that supports various stages of bacterial RNA-seq data analysis, including aligning sequencing reads to a genome, constructing transcriptome maps, quantifying transcript abundance, testing for differential gene expression, determining operon structures and visualizing results. We demonstrate the performance of Rockhopper using 2.1 billion sequenced reads from 75 RNA-seq experiments conducted with Escherichia coli, Neisseria gonorrhoeae, Salmonella enterica, Streptococcus pyogenes and Xenorhabdus nematophila. We find that the transcriptome maps generated by our algorithms are highly accurate when compared with focused experimental data from E. coli and N. gonorrhoeae, and we validate our system’s ability to identify novel small RNAs, operons and transcription start sites. Our results suggest that Rockhopper can be used for efficient and accurate analysis of bacterial RNA-seq data, and that it can aid with elucidation of bacterial transcriptomes.
format Online
Article
Text
id pubmed-3737546
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-37375462013-08-08 Computational analysis of bacterial RNA-Seq data McClure, Ryan Balasubramanian, Divya Sun, Yan Bobrovskyy, Maksym Sumby, Paul Genco, Caroline A. Vanderpool, Carin K. Tjaden, Brian Nucleic Acids Res Methods Online Recent advances in high-throughput RNA sequencing (RNA-seq) have enabled tremendous leaps forward in our understanding of bacterial transcriptomes. However, computational methods for analysis of bacterial transcriptome data have not kept pace with the large and growing data sets generated by RNA-seq technology. Here, we present new algorithms, specific to bacterial gene structures and transcriptomes, for analysis of RNA-seq data. The algorithms are implemented in an open source software system called Rockhopper that supports various stages of bacterial RNA-seq data analysis, including aligning sequencing reads to a genome, constructing transcriptome maps, quantifying transcript abundance, testing for differential gene expression, determining operon structures and visualizing results. We demonstrate the performance of Rockhopper using 2.1 billion sequenced reads from 75 RNA-seq experiments conducted with Escherichia coli, Neisseria gonorrhoeae, Salmonella enterica, Streptococcus pyogenes and Xenorhabdus nematophila. We find that the transcriptome maps generated by our algorithms are highly accurate when compared with focused experimental data from E. coli and N. gonorrhoeae, and we validate our system’s ability to identify novel small RNAs, operons and transcription start sites. Our results suggest that Rockhopper can be used for efficient and accurate analysis of bacterial RNA-seq data, and that it can aid with elucidation of bacterial transcriptomes. Oxford University Press 2013-08 2013-05-28 /pmc/articles/PMC3737546/ /pubmed/23716638 http://dx.doi.org/10.1093/nar/gkt444 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
McClure, Ryan
Balasubramanian, Divya
Sun, Yan
Bobrovskyy, Maksym
Sumby, Paul
Genco, Caroline A.
Vanderpool, Carin K.
Tjaden, Brian
Computational analysis of bacterial RNA-Seq data
title Computational analysis of bacterial RNA-Seq data
title_full Computational analysis of bacterial RNA-Seq data
title_fullStr Computational analysis of bacterial RNA-Seq data
title_full_unstemmed Computational analysis of bacterial RNA-Seq data
title_short Computational analysis of bacterial RNA-Seq data
title_sort computational analysis of bacterial rna-seq data
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3737546/
https://www.ncbi.nlm.nih.gov/pubmed/23716638
http://dx.doi.org/10.1093/nar/gkt444
work_keys_str_mv AT mcclureryan computationalanalysisofbacterialrnaseqdata
AT balasubramaniandivya computationalanalysisofbacterialrnaseqdata
AT sunyan computationalanalysisofbacterialrnaseqdata
AT bobrovskyymaksym computationalanalysisofbacterialrnaseqdata
AT sumbypaul computationalanalysisofbacterialrnaseqdata
AT gencocarolinea computationalanalysisofbacterialrnaseqdata
AT vanderpoolcarink computationalanalysisofbacterialrnaseqdata
AT tjadenbrian computationalanalysisofbacterialrnaseqdata