Cargando…
Multi-omics integration accurately predicts cellular state in unexplored conditions for Escherichia coli
A significant obstacle in training predictive cell models is the lack of integrated data sources. We develop semi-supervised normalization pipelines and perform experimental characterization (growth, transcriptional, proteome) to create Ecomics, a consistent, quality-controlled multi-omics compendiu...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5059772/ https://www.ncbi.nlm.nih.gov/pubmed/27713404 http://dx.doi.org/10.1038/ncomms13090 |
_version_ | 1782459475459833856 |
---|---|
author | Kim, Minseung Rai, Navneet Zorraquino, Violeta Tagkopoulos, Ilias |
author_facet | Kim, Minseung Rai, Navneet Zorraquino, Violeta Tagkopoulos, Ilias |
author_sort | Kim, Minseung |
collection | PubMed |
description | A significant obstacle in training predictive cell models is the lack of integrated data sources. We develop semi-supervised normalization pipelines and perform experimental characterization (growth, transcriptional, proteome) to create Ecomics, a consistent, quality-controlled multi-omics compendium for Escherichia coli with cohesive meta-data information. We then use this resource to train a multi-scale model that integrates four omics layers to predict genome-wide concentrations and growth dynamics. The genetic and environmental ontology reconstructed from the omics data is substantially different and complementary to the genetic and chemical ontologies. The integration of different layers confers an incremental increase in the prediction performance, as does the information about the known gene regulatory and protein-protein interactions. The predictive performance of the model ranges from 0.54 to 0.87 for the various omics layers, which far exceeds various baselines. This work provides an integrative framework of omics-driven predictive modelling that is broadly applicable to guide biological discovery. |
format | Online Article Text |
id | pubmed-5059772 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-50597722016-10-26 Multi-omics integration accurately predicts cellular state in unexplored conditions for Escherichia coli Kim, Minseung Rai, Navneet Zorraquino, Violeta Tagkopoulos, Ilias Nat Commun Article A significant obstacle in training predictive cell models is the lack of integrated data sources. We develop semi-supervised normalization pipelines and perform experimental characterization (growth, transcriptional, proteome) to create Ecomics, a consistent, quality-controlled multi-omics compendium for Escherichia coli with cohesive meta-data information. We then use this resource to train a multi-scale model that integrates four omics layers to predict genome-wide concentrations and growth dynamics. The genetic and environmental ontology reconstructed from the omics data is substantially different and complementary to the genetic and chemical ontologies. The integration of different layers confers an incremental increase in the prediction performance, as does the information about the known gene regulatory and protein-protein interactions. The predictive performance of the model ranges from 0.54 to 0.87 for the various omics layers, which far exceeds various baselines. This work provides an integrative framework of omics-driven predictive modelling that is broadly applicable to guide biological discovery. Nature Publishing Group 2016-10-07 /pmc/articles/PMC5059772/ /pubmed/27713404 http://dx.doi.org/10.1038/ncomms13090 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Kim, Minseung Rai, Navneet Zorraquino, Violeta Tagkopoulos, Ilias Multi-omics integration accurately predicts cellular state in unexplored conditions for Escherichia coli |
title | Multi-omics integration accurately predicts cellular state in unexplored conditions for Escherichia coli |
title_full | Multi-omics integration accurately predicts cellular state in unexplored conditions for Escherichia coli |
title_fullStr | Multi-omics integration accurately predicts cellular state in unexplored conditions for Escherichia coli |
title_full_unstemmed | Multi-omics integration accurately predicts cellular state in unexplored conditions for Escherichia coli |
title_short | Multi-omics integration accurately predicts cellular state in unexplored conditions for Escherichia coli |
title_sort | multi-omics integration accurately predicts cellular state in unexplored conditions for escherichia coli |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5059772/ https://www.ncbi.nlm.nih.gov/pubmed/27713404 http://dx.doi.org/10.1038/ncomms13090 |
work_keys_str_mv | AT kimminseung multiomicsintegrationaccuratelypredictscellularstateinunexploredconditionsforescherichiacoli AT rainavneet multiomicsintegrationaccuratelypredictscellularstateinunexploredconditionsforescherichiacoli AT zorraquinovioleta multiomicsintegrationaccuratelypredictscellularstateinunexploredconditionsforescherichiacoli AT tagkopoulosilias multiomicsintegrationaccuratelypredictscellularstateinunexploredconditionsforescherichiacoli |