Cargando…

A Bayesian Framework for Cryo-EM Heterogeneity Analysis using Regularized Covariance Estimation

Proteins and the complexes they form are central to nearly all cellular processes. Their flexibility, expressed through a continuum of states, provides a window into their biological functions. Cryogenic-electron microscopy (cryo-EM) is an ideal tool to study these dynamic states as it captures spec...

Descripción completa

Detalles Bibliográficos
Autores principales: Gilles, Marc Aurèle, Singer, Amit
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10634927/
https://www.ncbi.nlm.nih.gov/pubmed/37961393
http://dx.doi.org/10.1101/2023.10.28.564422
_version_ 1785146263041212416
author Gilles, Marc Aurèle
Singer, Amit
author_facet Gilles, Marc Aurèle
Singer, Amit
author_sort Gilles, Marc Aurèle
collection PubMed
description Proteins and the complexes they form are central to nearly all cellular processes. Their flexibility, expressed through a continuum of states, provides a window into their biological functions. Cryogenic-electron microscopy (cryo-EM) is an ideal tool to study these dynamic states as it captures specimens in non-crystalline conditions and enables high-resolution reconstructions. However, analyzing the heterogeneous distribution of conformations from cryo-EM data is challenging. Current methods face issues such as a lack of explainability, overfitting caused by lack of regularization, and a large number of parameters to tune; problems exacerbated by the lack of proper metrics to evaluate or compare heterogeneous reconstructions. To address these challenges, we present RECOVAR, a white-box method based on principal component analysis (PCA) computed via regularized covariance estimation that can resolve intricate heterogeneity with similar expressive power to neural networks with significantly lower computational demands. We extend the ubiquitous Bayesian framework used in homogeneous reconstruction to automatically regularize principal components, overcoming overfitting concerns and removing the need for most parameters. We further exploit the conservation of density and distances endowed by the embedding in PCA space, opening the door to reliable free energy computation. We leverage the predictable uncertainty of image labels to generate high-resolution reconstructions and identify high-density trajectories in latent space. We make the code freely available at https://github.com/ma-gilles/recovar.
format Online
Article
Text
id pubmed-10634927
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-106349272023-11-13 A Bayesian Framework for Cryo-EM Heterogeneity Analysis using Regularized Covariance Estimation Gilles, Marc Aurèle Singer, Amit bioRxiv Article Proteins and the complexes they form are central to nearly all cellular processes. Their flexibility, expressed through a continuum of states, provides a window into their biological functions. Cryogenic-electron microscopy (cryo-EM) is an ideal tool to study these dynamic states as it captures specimens in non-crystalline conditions and enables high-resolution reconstructions. However, analyzing the heterogeneous distribution of conformations from cryo-EM data is challenging. Current methods face issues such as a lack of explainability, overfitting caused by lack of regularization, and a large number of parameters to tune; problems exacerbated by the lack of proper metrics to evaluate or compare heterogeneous reconstructions. To address these challenges, we present RECOVAR, a white-box method based on principal component analysis (PCA) computed via regularized covariance estimation that can resolve intricate heterogeneity with similar expressive power to neural networks with significantly lower computational demands. We extend the ubiquitous Bayesian framework used in homogeneous reconstruction to automatically regularize principal components, overcoming overfitting concerns and removing the need for most parameters. We further exploit the conservation of density and distances endowed by the embedding in PCA space, opening the door to reliable free energy computation. We leverage the predictable uncertainty of image labels to generate high-resolution reconstructions and identify high-density trajectories in latent space. We make the code freely available at https://github.com/ma-gilles/recovar. Cold Spring Harbor Laboratory 2023-11-01 /pmc/articles/PMC10634927/ /pubmed/37961393 http://dx.doi.org/10.1101/2023.10.28.564422 Text en https://creativecommons.org/licenses/by-nc/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (https://creativecommons.org/licenses/by-nc/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Gilles, Marc Aurèle
Singer, Amit
A Bayesian Framework for Cryo-EM Heterogeneity Analysis using Regularized Covariance Estimation
title A Bayesian Framework for Cryo-EM Heterogeneity Analysis using Regularized Covariance Estimation
title_full A Bayesian Framework for Cryo-EM Heterogeneity Analysis using Regularized Covariance Estimation
title_fullStr A Bayesian Framework for Cryo-EM Heterogeneity Analysis using Regularized Covariance Estimation
title_full_unstemmed A Bayesian Framework for Cryo-EM Heterogeneity Analysis using Regularized Covariance Estimation
title_short A Bayesian Framework for Cryo-EM Heterogeneity Analysis using Regularized Covariance Estimation
title_sort bayesian framework for cryo-em heterogeneity analysis using regularized covariance estimation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10634927/
https://www.ncbi.nlm.nih.gov/pubmed/37961393
http://dx.doi.org/10.1101/2023.10.28.564422
work_keys_str_mv AT gillesmarcaurele abayesianframeworkforcryoemheterogeneityanalysisusingregularizedcovarianceestimation
AT singeramit abayesianframeworkforcryoemheterogeneityanalysisusingregularizedcovarianceestimation
AT gillesmarcaurele bayesianframeworkforcryoemheterogeneityanalysisusingregularizedcovarianceestimation
AT singeramit bayesianframeworkforcryoemheterogeneityanalysisusingregularizedcovarianceestimation