Cargando…

A Mixed Quantum Chemistry/Machine Learning Approach for the Fast and Accurate Prediction of Biochemical Redox Potentials and Its Large-Scale Application to 315 000 Redox Reactions

[Image: see text] A quantitative understanding of the thermodynamics of biochemical reactions is essential for accurately modeling metabolism. The group contribution method (GCM) is one of the most widely used approaches to estimate standard Gibbs energies and redox potentials of reactions for which...

Descripción completa

Detalles Bibliográficos
Autores principales: Jinich, Adrian, Sanchez-Lengeling, Benjamin, Ren, Haniu, Harman, Rebecca, Aspuru-Guzik, Alán
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2019
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6661861/
https://www.ncbi.nlm.nih.gov/pubmed/31404220
http://dx.doi.org/10.1021/acscentsci.9b00297
_version_ 1783439543197761536
author Jinich, Adrian
Sanchez-Lengeling, Benjamin
Ren, Haniu
Harman, Rebecca
Aspuru-Guzik, Alán
author_facet Jinich, Adrian
Sanchez-Lengeling, Benjamin
Ren, Haniu
Harman, Rebecca
Aspuru-Guzik, Alán
author_sort Jinich, Adrian
collection PubMed
description [Image: see text] A quantitative understanding of the thermodynamics of biochemical reactions is essential for accurately modeling metabolism. The group contribution method (GCM) is one of the most widely used approaches to estimate standard Gibbs energies and redox potentials of reactions for which no experimental measurements exist. Previous work has shown that quantum chemical predictions of biochemical thermodynamics are a promising approach to overcome the limitations of GCM. However, the quantum chemistry approach is significantly more expensive. Here, we use a combination of quantum chemistry and machine learning to obtain a fast and accurate method for predicting the thermodynamics of biochemical redox reactions. We focus on predicting the redox potentials of carbonyl functional group reductions to alcohols and amines, two of the most ubiquitous carbon redox transformations in biology. Our method relies on semiempirical quantum chemistry calculations calibrated with Gaussian process (GP) regression against available experimental data and results in higher predictive power than the GCM at low computational cost. Direct calibration of GCM and fingerprint-based predictions (without quantum chemistry) with GP regression also results in significant improvements in prediction accuracy, demonstrating the versatility of the approach. We design and implement a network expansion algorithm that iteratively reduces and oxidizes a set of natural seed metabolites and demonstrate the high-throughput applicability of our method by predicting the standard potentials of more than 315 000 redox reactions involving approximately 70 000 compounds. Additionally, we developed a novel fingerprint-based framework for detecting molecular environment motifs that are enriched or depleted across different regions of the redox potential landscape. We provide open access to all source code and data generated.
format Online
Article
Text
id pubmed-6661861
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-66618612019-08-09 A Mixed Quantum Chemistry/Machine Learning Approach for the Fast and Accurate Prediction of Biochemical Redox Potentials and Its Large-Scale Application to 315 000 Redox Reactions Jinich, Adrian Sanchez-Lengeling, Benjamin Ren, Haniu Harman, Rebecca Aspuru-Guzik, Alán ACS Cent Sci [Image: see text] A quantitative understanding of the thermodynamics of biochemical reactions is essential for accurately modeling metabolism. The group contribution method (GCM) is one of the most widely used approaches to estimate standard Gibbs energies and redox potentials of reactions for which no experimental measurements exist. Previous work has shown that quantum chemical predictions of biochemical thermodynamics are a promising approach to overcome the limitations of GCM. However, the quantum chemistry approach is significantly more expensive. Here, we use a combination of quantum chemistry and machine learning to obtain a fast and accurate method for predicting the thermodynamics of biochemical redox reactions. We focus on predicting the redox potentials of carbonyl functional group reductions to alcohols and amines, two of the most ubiquitous carbon redox transformations in biology. Our method relies on semiempirical quantum chemistry calculations calibrated with Gaussian process (GP) regression against available experimental data and results in higher predictive power than the GCM at low computational cost. Direct calibration of GCM and fingerprint-based predictions (without quantum chemistry) with GP regression also results in significant improvements in prediction accuracy, demonstrating the versatility of the approach. We design and implement a network expansion algorithm that iteratively reduces and oxidizes a set of natural seed metabolites and demonstrate the high-throughput applicability of our method by predicting the standard potentials of more than 315 000 redox reactions involving approximately 70 000 compounds. Additionally, we developed a novel fingerprint-based framework for detecting molecular environment motifs that are enriched or depleted across different regions of the redox potential landscape. We provide open access to all source code and data generated. American Chemical Society 2019-06-07 2019-07-24 /pmc/articles/PMC6661861/ /pubmed/31404220 http://dx.doi.org/10.1021/acscentsci.9b00297 Text en Copyright © 2019 American Chemical Society This is an open access article published under an ACS AuthorChoice License (http://pubs.acs.org/page/policy/authorchoice_termsofuse.html) , which permits copying and redistribution of the article or any adaptations for non-commercial purposes.
spellingShingle Jinich, Adrian
Sanchez-Lengeling, Benjamin
Ren, Haniu
Harman, Rebecca
Aspuru-Guzik, Alán
A Mixed Quantum Chemistry/Machine Learning Approach for the Fast and Accurate Prediction of Biochemical Redox Potentials and Its Large-Scale Application to 315 000 Redox Reactions
title A Mixed Quantum Chemistry/Machine Learning Approach for the Fast and Accurate Prediction of Biochemical Redox Potentials and Its Large-Scale Application to 315 000 Redox Reactions
title_full A Mixed Quantum Chemistry/Machine Learning Approach for the Fast and Accurate Prediction of Biochemical Redox Potentials and Its Large-Scale Application to 315 000 Redox Reactions
title_fullStr A Mixed Quantum Chemistry/Machine Learning Approach for the Fast and Accurate Prediction of Biochemical Redox Potentials and Its Large-Scale Application to 315 000 Redox Reactions
title_full_unstemmed A Mixed Quantum Chemistry/Machine Learning Approach for the Fast and Accurate Prediction of Biochemical Redox Potentials and Its Large-Scale Application to 315 000 Redox Reactions
title_short A Mixed Quantum Chemistry/Machine Learning Approach for the Fast and Accurate Prediction of Biochemical Redox Potentials and Its Large-Scale Application to 315 000 Redox Reactions
title_sort mixed quantum chemistry/machine learning approach for the fast and accurate prediction of biochemical redox potentials and its large-scale application to 315 000 redox reactions
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6661861/
https://www.ncbi.nlm.nih.gov/pubmed/31404220
http://dx.doi.org/10.1021/acscentsci.9b00297
work_keys_str_mv AT jinichadrian amixedquantumchemistrymachinelearningapproachforthefastandaccuratepredictionofbiochemicalredoxpotentialsanditslargescaleapplicationto315000redoxreactions
AT sanchezlengelingbenjamin amixedquantumchemistrymachinelearningapproachforthefastandaccuratepredictionofbiochemicalredoxpotentialsanditslargescaleapplicationto315000redoxreactions
AT renhaniu amixedquantumchemistrymachinelearningapproachforthefastandaccuratepredictionofbiochemicalredoxpotentialsanditslargescaleapplicationto315000redoxreactions
AT harmanrebecca amixedquantumchemistrymachinelearningapproachforthefastandaccuratepredictionofbiochemicalredoxpotentialsanditslargescaleapplicationto315000redoxreactions
AT aspuruguzikalan amixedquantumchemistrymachinelearningapproachforthefastandaccuratepredictionofbiochemicalredoxpotentialsanditslargescaleapplicationto315000redoxreactions
AT jinichadrian mixedquantumchemistrymachinelearningapproachforthefastandaccuratepredictionofbiochemicalredoxpotentialsanditslargescaleapplicationto315000redoxreactions
AT sanchezlengelingbenjamin mixedquantumchemistrymachinelearningapproachforthefastandaccuratepredictionofbiochemicalredoxpotentialsanditslargescaleapplicationto315000redoxreactions
AT renhaniu mixedquantumchemistrymachinelearningapproachforthefastandaccuratepredictionofbiochemicalredoxpotentialsanditslargescaleapplicationto315000redoxreactions
AT harmanrebecca mixedquantumchemistrymachinelearningapproachforthefastandaccuratepredictionofbiochemicalredoxpotentialsanditslargescaleapplicationto315000redoxreactions
AT aspuruguzikalan mixedquantumchemistrymachinelearningapproachforthefastandaccuratepredictionofbiochemicalredoxpotentialsanditslargescaleapplicationto315000redoxreactions