Cargando…

EnGens: a computational framework for generation and analysis of representative protein conformational ensembles

Proteins are dynamic macromolecules that perform vital functions in cells. A protein structure determines its function, but this structure is not static, as proteins change their conformation to achieve various functions. Understanding the conformational landscapes of proteins is essential to unders...

Descripción completa

Detalles Bibliográficos
Autores principales: Conev, Anja, Rigo, Mauricio Menagatti, Devaurs, Didier, Fonseca, André Faustino, Kalavadwala, Hussain, de Freitas, Martiela Vaz, Clementi, Cecilia, Zanatta, Geancarlo, Antunes, Dinler Amaral, Kavraki, Lydia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10168271/
https://www.ncbi.nlm.nih.gov/pubmed/37163076
http://dx.doi.org/10.1101/2023.04.24.538094
_version_ 1785038827331518464
author Conev, Anja
Rigo, Mauricio Menagatti
Devaurs, Didier
Fonseca, André Faustino
Kalavadwala, Hussain
de Freitas, Martiela Vaz
Clementi, Cecilia
Zanatta, Geancarlo
Antunes, Dinler Amaral
Kavraki, Lydia
author_facet Conev, Anja
Rigo, Mauricio Menagatti
Devaurs, Didier
Fonseca, André Faustino
Kalavadwala, Hussain
de Freitas, Martiela Vaz
Clementi, Cecilia
Zanatta, Geancarlo
Antunes, Dinler Amaral
Kavraki, Lydia
author_sort Conev, Anja
collection PubMed
description Proteins are dynamic macromolecules that perform vital functions in cells. A protein structure determines its function, but this structure is not static, as proteins change their conformation to achieve various functions. Understanding the conformational landscapes of proteins is essential to understand their mechanism of action. Sets of carefully chosen conformations can summarize such complex landscapes and provide better insights into protein function than single conformations. We refer to these sets as representative conformational ensembles. Recent advances in computational methods have led to an increase in number of available structural datasets spanning conformational landscapes. However, extracting representative conformational ensembles from such datasets is not an easy task and many methods have been developed to tackle it. Our new approach, EnGens (short for ensemble generation), collects these methods into a unified framework for generating and analyzing protein conformational ensembles. In this work we: (1) provide an overview of existing methods and tools for protein structural ensemble generation and analysis; (2) unify existing approaches in an open-source Python package, and a portable Docker image, providing interactive visualizations within a Jupyter Notebook pipeline; (3) test our pipeline on a few canonical examples found in the literature. Representative ensembles produced by EnGens can be used for many downstream tasks such as protein-ligand ensemble docking, Markov state modeling of protein dynamics and analysis of the effect of single-point mutations.
format Online
Article
Text
id pubmed-10168271
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-101682712023-05-10 EnGens: a computational framework for generation and analysis of representative protein conformational ensembles Conev, Anja Rigo, Mauricio Menagatti Devaurs, Didier Fonseca, André Faustino Kalavadwala, Hussain de Freitas, Martiela Vaz Clementi, Cecilia Zanatta, Geancarlo Antunes, Dinler Amaral Kavraki, Lydia bioRxiv Article Proteins are dynamic macromolecules that perform vital functions in cells. A protein structure determines its function, but this structure is not static, as proteins change their conformation to achieve various functions. Understanding the conformational landscapes of proteins is essential to understand their mechanism of action. Sets of carefully chosen conformations can summarize such complex landscapes and provide better insights into protein function than single conformations. We refer to these sets as representative conformational ensembles. Recent advances in computational methods have led to an increase in number of available structural datasets spanning conformational landscapes. However, extracting representative conformational ensembles from such datasets is not an easy task and many methods have been developed to tackle it. Our new approach, EnGens (short for ensemble generation), collects these methods into a unified framework for generating and analyzing protein conformational ensembles. In this work we: (1) provide an overview of existing methods and tools for protein structural ensemble generation and analysis; (2) unify existing approaches in an open-source Python package, and a portable Docker image, providing interactive visualizations within a Jupyter Notebook pipeline; (3) test our pipeline on a few canonical examples found in the literature. Representative ensembles produced by EnGens can be used for many downstream tasks such as protein-ligand ensemble docking, Markov state modeling of protein dynamics and analysis of the effect of single-point mutations. Cold Spring Harbor Laboratory 2023-04-28 /pmc/articles/PMC10168271/ /pubmed/37163076 http://dx.doi.org/10.1101/2023.04.24.538094 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Conev, Anja
Rigo, Mauricio Menagatti
Devaurs, Didier
Fonseca, André Faustino
Kalavadwala, Hussain
de Freitas, Martiela Vaz
Clementi, Cecilia
Zanatta, Geancarlo
Antunes, Dinler Amaral
Kavraki, Lydia
EnGens: a computational framework for generation and analysis of representative protein conformational ensembles
title EnGens: a computational framework for generation and analysis of representative protein conformational ensembles
title_full EnGens: a computational framework for generation and analysis of representative protein conformational ensembles
title_fullStr EnGens: a computational framework for generation and analysis of representative protein conformational ensembles
title_full_unstemmed EnGens: a computational framework for generation and analysis of representative protein conformational ensembles
title_short EnGens: a computational framework for generation and analysis of representative protein conformational ensembles
title_sort engens: a computational framework for generation and analysis of representative protein conformational ensembles
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10168271/
https://www.ncbi.nlm.nih.gov/pubmed/37163076
http://dx.doi.org/10.1101/2023.04.24.538094
work_keys_str_mv AT conevanja engensacomputationalframeworkforgenerationandanalysisofrepresentativeproteinconformationalensembles
AT rigomauriciomenagatti engensacomputationalframeworkforgenerationandanalysisofrepresentativeproteinconformationalensembles
AT devaursdidier engensacomputationalframeworkforgenerationandanalysisofrepresentativeproteinconformationalensembles
AT fonsecaandrefaustino engensacomputationalframeworkforgenerationandanalysisofrepresentativeproteinconformationalensembles
AT kalavadwalahussain engensacomputationalframeworkforgenerationandanalysisofrepresentativeproteinconformationalensembles
AT defreitasmartielavaz engensacomputationalframeworkforgenerationandanalysisofrepresentativeproteinconformationalensembles
AT clementicecilia engensacomputationalframeworkforgenerationandanalysisofrepresentativeproteinconformationalensembles
AT zanattageancarlo engensacomputationalframeworkforgenerationandanalysisofrepresentativeproteinconformationalensembles
AT antunesdinleramaral engensacomputationalframeworkforgenerationandanalysisofrepresentativeproteinconformationalensembles
AT kavrakilydia engensacomputationalframeworkforgenerationandanalysisofrepresentativeproteinconformationalensembles