Cargando…

Multi-omics integration accurately predicts cellular state in unexplored conditions for Escherichia coli

A significant obstacle in training predictive cell models is the lack of integrated data sources. We develop semi-supervised normalization pipelines and perform experimental characterization (growth, transcriptional, proteome) to create Ecomics, a consistent, quality-controlled multi-omics compendiu...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Minseung, Rai, Navneet, Zorraquino, Violeta, Tagkopoulos, Ilias
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5059772/
https://www.ncbi.nlm.nih.gov/pubmed/27713404
http://dx.doi.org/10.1038/ncomms13090
_version_ 1782459475459833856
author Kim, Minseung
Rai, Navneet
Zorraquino, Violeta
Tagkopoulos, Ilias
author_facet Kim, Minseung
Rai, Navneet
Zorraquino, Violeta
Tagkopoulos, Ilias
author_sort Kim, Minseung
collection PubMed
description A significant obstacle in training predictive cell models is the lack of integrated data sources. We develop semi-supervised normalization pipelines and perform experimental characterization (growth, transcriptional, proteome) to create Ecomics, a consistent, quality-controlled multi-omics compendium for Escherichia coli with cohesive meta-data information. We then use this resource to train a multi-scale model that integrates four omics layers to predict genome-wide concentrations and growth dynamics. The genetic and environmental ontology reconstructed from the omics data is substantially different and complementary to the genetic and chemical ontologies. The integration of different layers confers an incremental increase in the prediction performance, as does the information about the known gene regulatory and protein-protein interactions. The predictive performance of the model ranges from 0.54 to 0.87 for the various omics layers, which far exceeds various baselines. This work provides an integrative framework of omics-driven predictive modelling that is broadly applicable to guide biological discovery.
format Online
Article
Text
id pubmed-5059772
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-50597722016-10-26 Multi-omics integration accurately predicts cellular state in unexplored conditions for Escherichia coli Kim, Minseung Rai, Navneet Zorraquino, Violeta Tagkopoulos, Ilias Nat Commun Article A significant obstacle in training predictive cell models is the lack of integrated data sources. We develop semi-supervised normalization pipelines and perform experimental characterization (growth, transcriptional, proteome) to create Ecomics, a consistent, quality-controlled multi-omics compendium for Escherichia coli with cohesive meta-data information. We then use this resource to train a multi-scale model that integrates four omics layers to predict genome-wide concentrations and growth dynamics. The genetic and environmental ontology reconstructed from the omics data is substantially different and complementary to the genetic and chemical ontologies. The integration of different layers confers an incremental increase in the prediction performance, as does the information about the known gene regulatory and protein-protein interactions. The predictive performance of the model ranges from 0.54 to 0.87 for the various omics layers, which far exceeds various baselines. This work provides an integrative framework of omics-driven predictive modelling that is broadly applicable to guide biological discovery. Nature Publishing Group 2016-10-07 /pmc/articles/PMC5059772/ /pubmed/27713404 http://dx.doi.org/10.1038/ncomms13090 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Kim, Minseung
Rai, Navneet
Zorraquino, Violeta
Tagkopoulos, Ilias
Multi-omics integration accurately predicts cellular state in unexplored conditions for Escherichia coli
title Multi-omics integration accurately predicts cellular state in unexplored conditions for Escherichia coli
title_full Multi-omics integration accurately predicts cellular state in unexplored conditions for Escherichia coli
title_fullStr Multi-omics integration accurately predicts cellular state in unexplored conditions for Escherichia coli
title_full_unstemmed Multi-omics integration accurately predicts cellular state in unexplored conditions for Escherichia coli
title_short Multi-omics integration accurately predicts cellular state in unexplored conditions for Escherichia coli
title_sort multi-omics integration accurately predicts cellular state in unexplored conditions for escherichia coli
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5059772/
https://www.ncbi.nlm.nih.gov/pubmed/27713404
http://dx.doi.org/10.1038/ncomms13090
work_keys_str_mv AT kimminseung multiomicsintegrationaccuratelypredictscellularstateinunexploredconditionsforescherichiacoli
AT rainavneet multiomicsintegrationaccuratelypredictscellularstateinunexploredconditionsforescherichiacoli
AT zorraquinovioleta multiomicsintegrationaccuratelypredictscellularstateinunexploredconditionsforescherichiacoli
AT tagkopoulosilias multiomicsintegrationaccuratelypredictscellularstateinunexploredconditionsforescherichiacoli