Cargando…

A multi-scale expression and regulation knowledge base for Escherichia coli

Transcriptomic data is accumulating rapidly; thus, scalable methods for extracting knowledge from this data are critical. Here, we assembled a top-down expression and regulation knowledge base for Escherichia coli. The expression component is a 1035-sample, high-quality RNA-seq compendium consisting...

Descripción completa

Detalles Bibliográficos
Autores principales: Lamoureux, Cameron R, Decker, Katherine T, Sastry, Anand V, Rychel, Kevin, Gao, Ye, McConn, John Luke, Zielinski, Daniel C, Palsson, Bernhard O
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10602906/
https://www.ncbi.nlm.nih.gov/pubmed/37713610
http://dx.doi.org/10.1093/nar/gkad750
_version_ 1785126486146023424
author Lamoureux, Cameron R
Decker, Katherine T
Sastry, Anand V
Rychel, Kevin
Gao, Ye
McConn, John Luke
Zielinski, Daniel C
Palsson, Bernhard O
author_facet Lamoureux, Cameron R
Decker, Katherine T
Sastry, Anand V
Rychel, Kevin
Gao, Ye
McConn, John Luke
Zielinski, Daniel C
Palsson, Bernhard O
author_sort Lamoureux, Cameron R
collection PubMed
description Transcriptomic data is accumulating rapidly; thus, scalable methods for extracting knowledge from this data are critical. Here, we assembled a top-down expression and regulation knowledge base for Escherichia coli. The expression component is a 1035-sample, high-quality RNA-seq compendium consisting of data generated in our lab using a single experimental protocol. The compendium contains diverse growth conditions, including: 9 media; 39 supplements, including antibiotics; 42 heterologous proteins; and 76 gene knockouts. Using this resource, we elucidated global expression patterns. We used machine learning to extract 201 modules that account for 86% of known regulatory interactions, creating the regulatory component. With these modules, we identified two novel regulons and quantified systems-level regulatory responses. We also integrated 1675 curated, publicly-available transcriptomes into the resource. We demonstrated workflows for analyzing new data against this knowledge base via deconstruction of regulation during aerobic transition. This resource illuminates the E. coli transcriptome at scale and provides a blueprint for top-down transcriptomic analysis of non-model organisms.
format Online
Article
Text
id pubmed-10602906
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-106029062023-10-28 A multi-scale expression and regulation knowledge base for Escherichia coli Lamoureux, Cameron R Decker, Katherine T Sastry, Anand V Rychel, Kevin Gao, Ye McConn, John Luke Zielinski, Daniel C Palsson, Bernhard O Nucleic Acids Res Data Resources and Analyses Transcriptomic data is accumulating rapidly; thus, scalable methods for extracting knowledge from this data are critical. Here, we assembled a top-down expression and regulation knowledge base for Escherichia coli. The expression component is a 1035-sample, high-quality RNA-seq compendium consisting of data generated in our lab using a single experimental protocol. The compendium contains diverse growth conditions, including: 9 media; 39 supplements, including antibiotics; 42 heterologous proteins; and 76 gene knockouts. Using this resource, we elucidated global expression patterns. We used machine learning to extract 201 modules that account for 86% of known regulatory interactions, creating the regulatory component. With these modules, we identified two novel regulons and quantified systems-level regulatory responses. We also integrated 1675 curated, publicly-available transcriptomes into the resource. We demonstrated workflows for analyzing new data against this knowledge base via deconstruction of regulation during aerobic transition. This resource illuminates the E. coli transcriptome at scale and provides a blueprint for top-down transcriptomic analysis of non-model organisms. Oxford University Press 2023-09-15 /pmc/articles/PMC10602906/ /pubmed/37713610 http://dx.doi.org/10.1093/nar/gkad750 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Data Resources and Analyses
Lamoureux, Cameron R
Decker, Katherine T
Sastry, Anand V
Rychel, Kevin
Gao, Ye
McConn, John Luke
Zielinski, Daniel C
Palsson, Bernhard O
A multi-scale expression and regulation knowledge base for Escherichia coli
title A multi-scale expression and regulation knowledge base for Escherichia coli
title_full A multi-scale expression and regulation knowledge base for Escherichia coli
title_fullStr A multi-scale expression and regulation knowledge base for Escherichia coli
title_full_unstemmed A multi-scale expression and regulation knowledge base for Escherichia coli
title_short A multi-scale expression and regulation knowledge base for Escherichia coli
title_sort multi-scale expression and regulation knowledge base for escherichia coli
topic Data Resources and Analyses
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10602906/
https://www.ncbi.nlm.nih.gov/pubmed/37713610
http://dx.doi.org/10.1093/nar/gkad750
work_keys_str_mv AT lamoureuxcameronr amultiscaleexpressionandregulationknowledgebaseforescherichiacoli
AT deckerkatherinet amultiscaleexpressionandregulationknowledgebaseforescherichiacoli
AT sastryanandv amultiscaleexpressionandregulationknowledgebaseforescherichiacoli
AT rychelkevin amultiscaleexpressionandregulationknowledgebaseforescherichiacoli
AT gaoye amultiscaleexpressionandregulationknowledgebaseforescherichiacoli
AT mcconnjohnluke amultiscaleexpressionandregulationknowledgebaseforescherichiacoli
AT zielinskidanielc amultiscaleexpressionandregulationknowledgebaseforescherichiacoli
AT palssonbernhardo amultiscaleexpressionandregulationknowledgebaseforescherichiacoli
AT lamoureuxcameronr multiscaleexpressionandregulationknowledgebaseforescherichiacoli
AT deckerkatherinet multiscaleexpressionandregulationknowledgebaseforescherichiacoli
AT sastryanandv multiscaleexpressionandregulationknowledgebaseforescherichiacoli
AT rychelkevin multiscaleexpressionandregulationknowledgebaseforescherichiacoli
AT gaoye multiscaleexpressionandregulationknowledgebaseforescherichiacoli
AT mcconnjohnluke multiscaleexpressionandregulationknowledgebaseforescherichiacoli
AT zielinskidanielc multiscaleexpressionandregulationknowledgebaseforescherichiacoli
AT palssonbernhardo multiscaleexpressionandregulationknowledgebaseforescherichiacoli