A Mixed Quantum Chemistry/Machine Learning Approach for the Fast and Accurate Prediction of Biochemical Redox Potentials and Its Large-Scale Application to 315 000 Redox Reactions
A quantitative understanding of the thermodynamics of biochemical reactions is essential for accurately modeling metabolism. The group contribution method (GCM) is one of the most widely used approaches to estimate standard Gibbs energies and redox potentials of reactions for which...
Main Authors: | Jinich, Adrian; Sanchez-Lengeling, Benjamin; Ren, Haniu; Harman, Rebecca; Aspuru-Guzik, Alán |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | American Chemical Society 2019 |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6661861/ https://www.ncbi.nlm.nih.gov/pubmed/31404220 http://dx.doi.org/10.1021/acscentsci.9b00297 |
_version_ | 1783439543197761536 |
---|---|
author | Jinich, Adrian Sanchez-Lengeling, Benjamin Ren, Haniu Harman, Rebecca Aspuru-Guzik, Alán |
author_facet | Jinich, Adrian Sanchez-Lengeling, Benjamin Ren, Haniu Harman, Rebecca Aspuru-Guzik, Alán |
author_sort | Jinich, Adrian |
collection | PubMed |
description | A quantitative understanding of the thermodynamics of biochemical reactions is essential for accurately modeling metabolism. The group contribution method (GCM) is one of the most widely used approaches to estimate standard Gibbs energies and redox potentials of reactions for which no experimental measurements exist. Previous work has shown that quantum chemical predictions of biochemical thermodynamics are a promising approach to overcome the limitations of GCM. However, the quantum chemistry approach is significantly more expensive. Here, we use a combination of quantum chemistry and machine learning to obtain a fast and accurate method for predicting the thermodynamics of biochemical redox reactions. We focus on predicting the redox potentials of carbonyl functional group reductions to alcohols and amines, two of the most ubiquitous carbon redox transformations in biology. Our method relies on semiempirical quantum chemistry calculations calibrated with Gaussian process (GP) regression against available experimental data and results in higher predictive power than the GCM at low computational cost. Direct calibration of GCM and fingerprint-based predictions (without quantum chemistry) with GP regression also results in significant improvements in prediction accuracy, demonstrating the versatility of the approach. We design and implement a network expansion algorithm that iteratively reduces and oxidizes a set of natural seed metabolites and demonstrate the high-throughput applicability of our method by predicting the standard potentials of more than 315 000 redox reactions involving approximately 70 000 compounds. Additionally, we developed a novel fingerprint-based framework for detecting molecular environment motifs that are enriched or depleted across different regions of the redox potential landscape. We provide open access to all source code and data generated. |
format | Online Article Text |
id | pubmed-6661861 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-6661861 2019-08-09 A Mixed Quantum Chemistry/Machine Learning Approach for the Fast and Accurate Prediction of Biochemical Redox Potentials and Its Large-Scale Application to 315 000 Redox Reactions Jinich, Adrian Sanchez-Lengeling, Benjamin Ren, Haniu Harman, Rebecca Aspuru-Guzik, Alán ACS Cent Sci American Chemical Society 2019-06-07 2019-07-24 /pmc/articles/PMC6661861/ /pubmed/31404220 http://dx.doi.org/10.1021/acscentsci.9b00297 Text en Copyright © 2019 American Chemical Society This is an open access article published under an ACS AuthorChoice License (http://pubs.acs.org/page/policy/authorchoice_termsofuse.html), which permits copying and redistribution of the article or any adaptations for non-commercial purposes. |
spellingShingle | Jinich, Adrian Sanchez-Lengeling, Benjamin Ren, Haniu Harman, Rebecca Aspuru-Guzik, Alán A Mixed Quantum Chemistry/Machine Learning Approach for the Fast and Accurate Prediction of Biochemical Redox Potentials and Its Large-Scale Application to 315 000 Redox Reactions |
title | A Mixed Quantum Chemistry/Machine Learning Approach for the Fast and Accurate Prediction of Biochemical Redox Potentials and Its Large-Scale Application to 315 000 Redox Reactions |
title_sort | mixed quantum chemistry/machine learning approach for the fast and accurate prediction of biochemical redox potentials and its large-scale application to 315 000 redox reactions |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6661861/ https://www.ncbi.nlm.nih.gov/pubmed/31404220 http://dx.doi.org/10.1021/acscentsci.9b00297 |
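
The description field above mentions a network expansion algorithm that iteratively reduces and oxidizes a set of seed metabolites. The sketch below illustrates that general idea only; it is not the authors' implementation. The names `expand_network`, `toy_reduce`, and `toy_oxidize` are hypothetical, and the string rewriting of SMILES-like text is a deliberately crude stand-in for real reaction rules (a cheminformatics toolkit would normally be used instead).

```python
from typing import Callable, Dict, Iterable, Set, Tuple

# A rule maps one compound (here a SMILES-like string) to a set of products.
Rule = Callable[[str], Set[str]]
Reaction = Tuple[str, str, str]  # (substrate, rule name, product)


def toy_reduce(smiles: str) -> Set[str]:
    """Toy assumption: a terminal carbonyl 'C=O' is reduced to an alcohol 'CO'."""
    return {smiles[:-3] + "CO"} if smiles.endswith("C=O") else set()


def toy_oxidize(smiles: str) -> Set[str]:
    """Toy assumption: a terminal alcohol 'CO' is oxidized to a carbonyl 'C=O'."""
    return {smiles[:-2] + "C=O"} if smiles.endswith("CO") else set()


def expand_network(
    seeds: Iterable[str],
    rules: Dict[str, Rule],
    n_iterations: int,
) -> Tuple[Set[str], Set[Reaction]]:
    """Apply every rule to the newest compounds, up to n_iterations rounds."""
    compounds: Set[str] = set(seeds)
    reactions: Set[Reaction] = set()
    frontier: Set[str] = set(compounds)
    for _ in range(n_iterations):
        discovered: Set[str] = set()
        for substrate in frontier:
            for name, rule in rules.items():
                for product in rule(substrate):
                    reactions.add((substrate, name, product))
                    if product not in compounds:
                        discovered.add(product)
        if not discovered:  # nothing new was generated, so stop early
            break
        compounds |= discovered
        frontier = discovered
    return compounds, reactions


if __name__ == "__main__":
    rules = {"reduce": toy_reduce, "oxidize": toy_oxidize}
    # Seed with acetaldehyde written as a SMILES string.
    compounds, reactions = expand_network({"CC=O"}, rules, n_iterations=3)
    print(sorted(compounds))
    print(sorted(reactions))
```

Tracking a frontier of newly discovered compounds means each compound is expanded only once, which keeps the work per iteration proportional to what was found in the previous round.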