Cargando…

Accurate Machine Learning Prediction of Protein Circular Dichroism Spectra with Embedded Density Descriptors

[Image: see text] A data-driven approach to simulate circular dichroism (CD) spectra is appealing for fast protein secondary structure determination, yet the challenge of predicting electric and magnetic transition dipole moments poses a substantial barrier for the goal. To address this problem, we...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Luyuan, Zhang, Jinxiao, Zhang, Yaolong, Ye, Sheng, Zhang, Guozhen, Chen, Xin, Jiang, Bin, Jiang, Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2021
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8715543/
https://www.ncbi.nlm.nih.gov/pubmed/34977905
http://dx.doi.org/10.1021/jacsau.1c00449
_version_ 1784624147333120000
author Zhao, Luyuan
Zhang, Jinxiao
Zhang, Yaolong
Ye, Sheng
Zhang, Guozhen
Chen, Xin
Jiang, Bin
Jiang, Jun
author_facet Zhao, Luyuan
Zhang, Jinxiao
Zhang, Yaolong
Ye, Sheng
Zhang, Guozhen
Chen, Xin
Jiang, Bin
Jiang, Jun
author_sort Zhao, Luyuan
collection PubMed
description [Image: see text] A data-driven approach to simulate circular dichroism (CD) spectra is appealing for fast protein secondary structure determination, yet the challenge of predicting electric and magnetic transition dipole moments poses a substantial barrier for the goal. To address this problem, we designed a new machine learning (ML) protocol in which ordinary pure geometry-based descriptors are replaced with alternative embedded density descriptors and electric and magnetic transition dipole moments are successfully predicted with an accuracy comparable to first-principle calculation. The ML model is able to not only simulate protein CD spectra nearly 4 orders of magnitude faster than conventional first-principle simulation but also obtain CD spectra in good agreement with experiments. Finally, we predicted a series of CD spectra of the Trp-cage protein associated with continuous changes of protein configuration along its folding path, showing the potential of our ML model for supporting real-time CD spectroscopy study of protein dynamics.
format Online
Article
Text
id pubmed-8715543
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-87155432021-12-30 Accurate Machine Learning Prediction of Protein Circular Dichroism Spectra with Embedded Density Descriptors Zhao, Luyuan Zhang, Jinxiao Zhang, Yaolong Ye, Sheng Zhang, Guozhen Chen, Xin Jiang, Bin Jiang, Jun JACS Au [Image: see text] A data-driven approach to simulate circular dichroism (CD) spectra is appealing for fast protein secondary structure determination, yet the challenge of predicting electric and magnetic transition dipole moments poses a substantial barrier for the goal. To address this problem, we designed a new machine learning (ML) protocol in which ordinary pure geometry-based descriptors are replaced with alternative embedded density descriptors and electric and magnetic transition dipole moments are successfully predicted with an accuracy comparable to first-principle calculation. The ML model is able to not only simulate protein CD spectra nearly 4 orders of magnitude faster than conventional first-principle simulation but also obtain CD spectra in good agreement with experiments. Finally, we predicted a series of CD spectra of the Trp-cage protein associated with continuous changes of protein configuration along its folding path, showing the potential of our ML model for supporting real-time CD spectroscopy study of protein dynamics. American Chemical Society 2021-11-25 /pmc/articles/PMC8715543/ /pubmed/34977905 http://dx.doi.org/10.1021/jacsau.1c00449 Text en © 2021 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Zhao, Luyuan
Zhang, Jinxiao
Zhang, Yaolong
Ye, Sheng
Zhang, Guozhen
Chen, Xin
Jiang, Bin
Jiang, Jun
Accurate Machine Learning Prediction of Protein Circular Dichroism Spectra with Embedded Density Descriptors
title Accurate Machine Learning Prediction of Protein Circular Dichroism Spectra with Embedded Density Descriptors
title_full Accurate Machine Learning Prediction of Protein Circular Dichroism Spectra with Embedded Density Descriptors
title_fullStr Accurate Machine Learning Prediction of Protein Circular Dichroism Spectra with Embedded Density Descriptors
title_full_unstemmed Accurate Machine Learning Prediction of Protein Circular Dichroism Spectra with Embedded Density Descriptors
title_short Accurate Machine Learning Prediction of Protein Circular Dichroism Spectra with Embedded Density Descriptors
title_sort accurate machine learning prediction of protein circular dichroism spectra with embedded density descriptors
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8715543/
https://www.ncbi.nlm.nih.gov/pubmed/34977905
http://dx.doi.org/10.1021/jacsau.1c00449
work_keys_str_mv AT zhaoluyuan accuratemachinelearningpredictionofproteincirculardichroismspectrawithembeddeddensitydescriptors
AT zhangjinxiao accuratemachinelearningpredictionofproteincirculardichroismspectrawithembeddeddensitydescriptors
AT zhangyaolong accuratemachinelearningpredictionofproteincirculardichroismspectrawithembeddeddensitydescriptors
AT yesheng accuratemachinelearningpredictionofproteincirculardichroismspectrawithembeddeddensitydescriptors
AT zhangguozhen accuratemachinelearningpredictionofproteincirculardichroismspectrawithembeddeddensitydescriptors
AT chenxin accuratemachinelearningpredictionofproteincirculardichroismspectrawithembeddeddensitydescriptors
AT jiangbin accuratemachinelearningpredictionofproteincirculardichroismspectrawithembeddeddensitydescriptors
AT jiangjun accuratemachinelearningpredictionofproteincirculardichroismspectrawithembeddeddensitydescriptors