Cargando…
Machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors
Protein secondary structure discrimination is crucial for understanding their biological function. It is not generally possible to invert spectroscopic data to yield the structure. We present a machine learning protocol which uses two-dimensional UV (2DUV) spectra as pattern recognition descriptors,...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
National Academy of Sciences
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9171355/ https://www.ncbi.nlm.nih.gov/pubmed/35476517 http://dx.doi.org/10.1073/pnas.2202713119 |
_version_ | 1784721647930966016 |
---|---|
author | Ren, Hao Zhang, Qian Wang, Zhengjie Zhang, Guozhen Liu, Hongzhang Guo, Wenyue Mukamel, Shaul Jiang, Jun |
author_facet | Ren, Hao Zhang, Qian Wang, Zhengjie Zhang, Guozhen Liu, Hongzhang Guo, Wenyue Mukamel, Shaul Jiang, Jun |
author_sort | Ren, Hao |
collection | PubMed |
description | Protein secondary structure discrimination is crucial for understanding their biological function. It is not generally possible to invert spectroscopic data to yield the structure. We present a machine learning protocol which uses two-dimensional UV (2DUV) spectra as pattern recognition descriptors, aiming at automated protein secondary structure determination from spectroscopic features. Accurate secondary structure recognition is obtained for homologous (97%) and nonhomologous (91%) protein segments, randomly selected from simulated model datasets. The advantage of 2DUV descriptors over one-dimensional linear absorption and circular dichroism spectra lies in the cross-peak information that reflects interactions between local regions of the protein. Thanks to their ultrafast (∼200 fs) nature, 2DUV measurements can be used in the future to probe conformational variations in the course of protein dynamics. |
format | Online Article Text |
id | pubmed-9171355 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | National Academy of Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-91713552022-06-08 Machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors Ren, Hao Zhang, Qian Wang, Zhengjie Zhang, Guozhen Liu, Hongzhang Guo, Wenyue Mukamel, Shaul Jiang, Jun Proc Natl Acad Sci U S A Physical Sciences Protein secondary structure discrimination is crucial for understanding their biological function. It is not generally possible to invert spectroscopic data to yield the structure. We present a machine learning protocol which uses two-dimensional UV (2DUV) spectra as pattern recognition descriptors, aiming at automated protein secondary structure determination from spectroscopic features. Accurate secondary structure recognition is obtained for homologous (97%) and nonhomologous (91%) protein segments, randomly selected from simulated model datasets. The advantage of 2DUV descriptors over one-dimensional linear absorption and circular dichroism spectra lies in the cross-peak information that reflects interactions between local regions of the protein. Thanks to their ultrafast (∼200 fs) nature, 2DUV measurements can be used in the future to probe conformational variations in the course of protein dynamics. National Academy of Sciences 2022-04-27 2022-05-03 /pmc/articles/PMC9171355/ /pubmed/35476517 http://dx.doi.org/10.1073/pnas.2202713119 Text en Copyright © 2022 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Physical Sciences Ren, Hao Zhang, Qian Wang, Zhengjie Zhang, Guozhen Liu, Hongzhang Guo, Wenyue Mukamel, Shaul Jiang, Jun Machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors |
title | Machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors |
title_full | Machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors |
title_fullStr | Machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors |
title_full_unstemmed | Machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors |
title_short | Machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors |
title_sort | machine learning recognition of protein secondary structures based on two-dimensional spectroscopic descriptors |
topic | Physical Sciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9171355/ https://www.ncbi.nlm.nih.gov/pubmed/35476517 http://dx.doi.org/10.1073/pnas.2202713119 |
work_keys_str_mv | AT renhao machinelearningrecognitionofproteinsecondarystructuresbasedontwodimensionalspectroscopicdescriptors AT zhangqian machinelearningrecognitionofproteinsecondarystructuresbasedontwodimensionalspectroscopicdescriptors AT wangzhengjie machinelearningrecognitionofproteinsecondarystructuresbasedontwodimensionalspectroscopicdescriptors AT zhangguozhen machinelearningrecognitionofproteinsecondarystructuresbasedontwodimensionalspectroscopicdescriptors AT liuhongzhang machinelearningrecognitionofproteinsecondarystructuresbasedontwodimensionalspectroscopicdescriptors AT guowenyue machinelearningrecognitionofproteinsecondarystructuresbasedontwodimensionalspectroscopicdescriptors AT mukamelshaul machinelearningrecognitionofproteinsecondarystructuresbasedontwodimensionalspectroscopicdescriptors AT jiangjun machinelearningrecognitionofproteinsecondarystructuresbasedontwodimensionalspectroscopicdescriptors |