Cargando…

Machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors

Protein secondary structure discrimination is crucial for understanding their biological function. It is not generally possible to invert spectroscopic data to yield the structure. We present a machine learning protocol which uses two-dimensional UV (2DUV) spectra as pattern recognition descriptors,...

Descripción completa

Detalles Bibliográficos
Autores principales: Ren, Hao, Zhang, Qian, Wang, Zhengjie, Zhang, Guozhen, Liu, Hongzhang, Guo, Wenyue, Mukamel, Shaul, Jiang, Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9171355/
https://www.ncbi.nlm.nih.gov/pubmed/35476517
http://dx.doi.org/10.1073/pnas.2202713119
_version_ 1784721647930966016
author Ren, Hao
Zhang, Qian
Wang, Zhengjie
Zhang, Guozhen
Liu, Hongzhang
Guo, Wenyue
Mukamel, Shaul
Jiang, Jun
author_facet Ren, Hao
Zhang, Qian
Wang, Zhengjie
Zhang, Guozhen
Liu, Hongzhang
Guo, Wenyue
Mukamel, Shaul
Jiang, Jun
author_sort Ren, Hao
collection PubMed
description Protein secondary structure discrimination is crucial for understanding their biological function. It is not generally possible to invert spectroscopic data to yield the structure. We present a machine learning protocol which uses two-dimensional UV (2DUV) spectra as pattern recognition descriptors, aiming at automated protein secondary structure determination from spectroscopic features. Accurate secondary structure recognition is obtained for homologous (97%) and nonhomologous (91%) protein segments, randomly selected from simulated model datasets. The advantage of 2DUV descriptors over one-dimensional linear absorption and circular dichroism spectra lies in the cross-peak information that reflects interactions between local regions of the protein. Thanks to their ultrafast (∼200 fs) nature, 2DUV measurements can be used in the future to probe conformational variations in the course of protein dynamics.
format Online
Article
Text
id pubmed-9171355
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-91713552022-06-08 Machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors Ren, Hao Zhang, Qian Wang, Zhengjie Zhang, Guozhen Liu, Hongzhang Guo, Wenyue Mukamel, Shaul Jiang, Jun Proc Natl Acad Sci U S A Physical Sciences Protein secondary structure discrimination is crucial for understanding their biological function. It is not generally possible to invert spectroscopic data to yield the structure. We present a machine learning protocol which uses two-dimensional UV (2DUV) spectra as pattern recognition descriptors, aiming at automated protein secondary structure determination from spectroscopic features. Accurate secondary structure recognition is obtained for homologous (97%) and nonhomologous (91%) protein segments, randomly selected from simulated model datasets. The advantage of 2DUV descriptors over one-dimensional linear absorption and circular dichroism spectra lies in the cross-peak information that reflects interactions between local regions of the protein. Thanks to their ultrafast (∼200 fs) nature, 2DUV measurements can be used in the future to probe conformational variations in the course of protein dynamics. National Academy of Sciences 2022-04-27 2022-05-03 /pmc/articles/PMC9171355/ /pubmed/35476517 http://dx.doi.org/10.1073/pnas.2202713119 Text en Copyright © 2022 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Physical Sciences
Ren, Hao
Zhang, Qian
Wang, Zhengjie
Zhang, Guozhen
Liu, Hongzhang
Guo, Wenyue
Mukamel, Shaul
Jiang, Jun
Machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors
title Machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors
title_full Machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors
title_fullStr Machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors
title_full_unstemmed Machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors
title_short Machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors
title_sort machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors
topic Physical Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9171355/
https://www.ncbi.nlm.nih.gov/pubmed/35476517
http://dx.doi.org/10.1073/pnas.2202713119
work_keys_str_mv AT renhao machinelearningrecognitionofproteinsecondarystructuresbasedontwodimensionalspectroscopicdescriptors
AT zhangqian machinelearningrecognitionofproteinsecondarystructuresbasedontwodimensionalspectroscopicdescriptors
AT wangzhengjie machinelearningrecognitionofproteinsecondarystructuresbasedontwodimensionalspectroscopicdescriptors
AT zhangguozhen machinelearningrecognitionofproteinsecondarystructuresbasedontwodimensionalspectroscopicdescriptors
AT liuhongzhang machinelearningrecognitionofproteinsecondarystructuresbasedontwodimensionalspectroscopicdescriptors
AT guowenyue machinelearningrecognitionofproteinsecondarystructuresbasedontwodimensionalspectroscopicdescriptors
AT mukamelshaul machinelearningrecognitionofproteinsecondarystructuresbasedontwodimensionalspectroscopicdescriptors
AT jiangjun machinelearningrecognitionofproteinsecondarystructuresbasedontwodimensionalspectroscopicdescriptors