Cargando…

Escherichia coli transcriptome assembly from a compendium of RNA-seq data sets

Owing to the complexities of bacterial RNA biology, the transcriptomes of even the best studied bacteria are not fully understood. To help elucidate the transcriptional landscape of E. coli, we compiled a compendium of 3,376 RNA-seq data sets composed of more than 7 trillion sequenced bases, which w...

Descripción completa

Detalles Bibliográficos
Autor principal: Tjaden, Brian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Taylor & Francis 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10392735/
https://www.ncbi.nlm.nih.gov/pubmed/36920168
http://dx.doi.org/10.1080/15476286.2023.2189331
_version_ 1785083020966887424
author Tjaden, Brian
author_facet Tjaden, Brian
author_sort Tjaden, Brian
collection PubMed
description Owing to the complexities of bacterial RNA biology, the transcriptomes of even the best studied bacteria are not fully understood. To help elucidate the transcriptional landscape of E. coli, we compiled a compendium of 3,376 RNA-seq data sets composed of more than 7 trillion sequenced bases, which we evaluate with a transcript assembly pipeline. We report expression profiles for all annotated E. coli genes as well as 5,071 other transcripts. Additionally, we observe hundreds of instances of co-transcribed genes that are novel with respect to existing operon databases. By integrating data from a large number of sequencing experiments corresponding to a wide range of conditions, we are able to obtain a comprehensive view of the E. coli transcriptome.
format Online
Article
Text
id pubmed-10392735
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Taylor & Francis
record_format MEDLINE/PubMed
spelling pubmed-103927352023-08-02 Escherichia coli transcriptome assembly from a compendium of RNA-seq data sets Tjaden, Brian RNA Biol Brief Communication Owing to the complexities of bacterial RNA biology, the transcriptomes of even the best studied bacteria are not fully understood. To help elucidate the transcriptional landscape of E. coli, we compiled a compendium of 3,376 RNA-seq data sets composed of more than 7 trillion sequenced bases, which we evaluate with a transcript assembly pipeline. We report expression profiles for all annotated E. coli genes as well as 5,071 other transcripts. Additionally, we observe hundreds of instances of co-transcribed genes that are novel with respect to existing operon databases. By integrating data from a large number of sequencing experiments corresponding to a wide range of conditions, we are able to obtain a comprehensive view of the E. coli transcriptome. Taylor & Francis 2023-03-15 /pmc/articles/PMC10392735/ /pubmed/36920168 http://dx.doi.org/10.1080/15476286.2023.2189331 Text en © 2023 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The terms on which this article has been published allow the posting of the Accepted Manuscript in a repository by the author(s) or with their consent.
spellingShingle Brief Communication
Tjaden, Brian
Escherichia coli transcriptome assembly from a compendium of RNA-seq data sets
title Escherichia coli transcriptome assembly from a compendium of RNA-seq data sets
title_full Escherichia coli transcriptome assembly from a compendium of RNA-seq data sets
title_fullStr Escherichia coli transcriptome assembly from a compendium of RNA-seq data sets
title_full_unstemmed Escherichia coli transcriptome assembly from a compendium of RNA-seq data sets
title_short Escherichia coli transcriptome assembly from a compendium of RNA-seq data sets
title_sort escherichia coli transcriptome assembly from a compendium of rna-seq data sets
topic Brief Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10392735/
https://www.ncbi.nlm.nih.gov/pubmed/36920168
http://dx.doi.org/10.1080/15476286.2023.2189331
work_keys_str_mv AT tjadenbrian escherichiacolitranscriptomeassemblyfromacompendiumofrnaseqdatasets