Cargando…

Machine learning/molecular dynamic protein structure prediction approach to investigate the protein conformational ensemble

Proteins exist in several different conformations. These structural changes are often associated with fluctuations at the residue level. Recent findings show that co-evolutionary analysis coupled with machine-learning techniques improves the precision by providing quantitative distance predictions b...

Descripción completa

Detalles Bibliográficos
Autores principales: Audagnotto, Martina, Czechtizky, Werngard, De Maria, Leonardo, Käck, Helena, Papoian, Garegin, Tornberg, Lars, Tyrchan, Christian, Ulander, Johan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9200820/
https://www.ncbi.nlm.nih.gov/pubmed/35705565
http://dx.doi.org/10.1038/s41598-022-13714-z
_version_ 1784728149803663360
author Audagnotto, Martina
Czechtizky, Werngard
De Maria, Leonardo
Käck, Helena
Papoian, Garegin
Tornberg, Lars
Tyrchan, Christian
Ulander, Johan
author_facet Audagnotto, Martina
Czechtizky, Werngard
De Maria, Leonardo
Käck, Helena
Papoian, Garegin
Tornberg, Lars
Tyrchan, Christian
Ulander, Johan
author_sort Audagnotto, Martina
collection PubMed
description Proteins exist in several different conformations. These structural changes are often associated with fluctuations at the residue level. Recent findings show that co-evolutionary analysis coupled with machine-learning techniques improves the precision by providing quantitative distance predictions between pairs of residues. The predicted statistical distance distribution from Multi Sequence Analysis reveals the presence of different local maxima suggesting the flexibility of key residue pairs. Here we investigate the ability of the residue-residue distance prediction to provide insights into the protein conformational ensemble. We combine deep learning approaches with mechanistic modeling to a set of proteins that experimentally showed conformational changes. The predicted protein models were filtered based on energy scores, RMSD clustering, and the centroids selected as the lowest energy structure per cluster. These models were compared to the experimental-Molecular Dynamics (MD) relaxed structure by analyzing the backbone residue torsional distribution and the sidechain orientations. Our pipeline allows to retrieve the experimental structural dynamics experimentally represented by different X-ray conformations for the same sequence as well the conformational space observed with the MD simulations. We show the potential correlation between the experimental structure dynamics and the predicted model ensemble demonstrating the susceptibility of the current state-of-the-art methods in protein folding and dynamics prediction and pointing out the areas of improvement.
format Online
Article
Text
id pubmed-9200820
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-92008202022-06-17 Machine learning/molecular dynamic protein structure prediction approach to investigate the protein conformational ensemble Audagnotto, Martina Czechtizky, Werngard De Maria, Leonardo Käck, Helena Papoian, Garegin Tornberg, Lars Tyrchan, Christian Ulander, Johan Sci Rep Article Proteins exist in several different conformations. These structural changes are often associated with fluctuations at the residue level. Recent findings show that co-evolutionary analysis coupled with machine-learning techniques improves the precision by providing quantitative distance predictions between pairs of residues. The predicted statistical distance distribution from Multi Sequence Analysis reveals the presence of different local maxima suggesting the flexibility of key residue pairs. Here we investigate the ability of the residue-residue distance prediction to provide insights into the protein conformational ensemble. We combine deep learning approaches with mechanistic modeling to a set of proteins that experimentally showed conformational changes. The predicted protein models were filtered based on energy scores, RMSD clustering, and the centroids selected as the lowest energy structure per cluster. These models were compared to the experimental-Molecular Dynamics (MD) relaxed structure by analyzing the backbone residue torsional distribution and the sidechain orientations. Our pipeline allows to retrieve the experimental structural dynamics experimentally represented by different X-ray conformations for the same sequence as well the conformational space observed with the MD simulations. We show the potential correlation between the experimental structure dynamics and the predicted model ensemble demonstrating the susceptibility of the current state-of-the-art methods in protein folding and dynamics prediction and pointing out the areas of improvement. Nature Publishing Group UK 2022-06-15 /pmc/articles/PMC9200820/ /pubmed/35705565 http://dx.doi.org/10.1038/s41598-022-13714-z Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Audagnotto, Martina
Czechtizky, Werngard
De Maria, Leonardo
Käck, Helena
Papoian, Garegin
Tornberg, Lars
Tyrchan, Christian
Ulander, Johan
Machine learning/molecular dynamic protein structure prediction approach to investigate the protein conformational ensemble
title Machine learning/molecular dynamic protein structure prediction approach to investigate the protein conformational ensemble
title_full Machine learning/molecular dynamic protein structure prediction approach to investigate the protein conformational ensemble
title_fullStr Machine learning/molecular dynamic protein structure prediction approach to investigate the protein conformational ensemble
title_full_unstemmed Machine learning/molecular dynamic protein structure prediction approach to investigate the protein conformational ensemble
title_short Machine learning/molecular dynamic protein structure prediction approach to investigate the protein conformational ensemble
title_sort machine learning/molecular dynamic protein structure prediction approach to investigate the protein conformational ensemble
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9200820/
https://www.ncbi.nlm.nih.gov/pubmed/35705565
http://dx.doi.org/10.1038/s41598-022-13714-z
work_keys_str_mv AT audagnottomartina machinelearningmoleculardynamicproteinstructurepredictionapproachtoinvestigatetheproteinconformationalensemble
AT czechtizkywerngard machinelearningmoleculardynamicproteinstructurepredictionapproachtoinvestigatetheproteinconformationalensemble
AT demarialeonardo machinelearningmoleculardynamicproteinstructurepredictionapproachtoinvestigatetheproteinconformationalensemble
AT kackhelena machinelearningmoleculardynamicproteinstructurepredictionapproachtoinvestigatetheproteinconformationalensemble
AT papoiangaregin machinelearningmoleculardynamicproteinstructurepredictionapproachtoinvestigatetheproteinconformationalensemble
AT tornberglars machinelearningmoleculardynamicproteinstructurepredictionapproachtoinvestigatetheproteinconformationalensemble
AT tyrchanchristian machinelearningmoleculardynamicproteinstructurepredictionapproachtoinvestigatetheproteinconformationalensemble
AT ulanderjohan machinelearningmoleculardynamicproteinstructurepredictionapproachtoinvestigatetheproteinconformationalensemble