Cargando…
A Bayesian Framework for Cryo-EM Heterogeneity Analysis using Regularized Covariance Estimation
Proteins and the complexes they form are central to nearly all cellular processes. Their flexibility, expressed through a continuum of states, provides a window into their biological functions. Cryogenic-electron microscopy (cryo-EM) is an ideal tool to study these dynamic states as it captures spec...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10634927/ https://www.ncbi.nlm.nih.gov/pubmed/37961393 http://dx.doi.org/10.1101/2023.10.28.564422 |
_version_ | 1785146263041212416 |
---|---|
author | Gilles, Marc Aurèle Singer, Amit |
author_facet | Gilles, Marc Aurèle Singer, Amit |
author_sort | Gilles, Marc Aurèle |
collection | PubMed |
description | Proteins and the complexes they form are central to nearly all cellular processes. Their flexibility, expressed through a continuum of states, provides a window into their biological functions. Cryogenic-electron microscopy (cryo-EM) is an ideal tool to study these dynamic states as it captures specimens in non-crystalline conditions and enables high-resolution reconstructions. However, analyzing the heterogeneous distribution of conformations from cryo-EM data is challenging. Current methods face issues such as a lack of explainability, overfitting caused by lack of regularization, and a large number of parameters to tune; problems exacerbated by the lack of proper metrics to evaluate or compare heterogeneous reconstructions. To address these challenges, we present RECOVAR, a white-box method based on principal component analysis (PCA) computed via regularized covariance estimation that can resolve intricate heterogeneity with similar expressive power to neural networks with significantly lower computational demands. We extend the ubiquitous Bayesian framework used in homogeneous reconstruction to automatically regularize principal components, overcoming overfitting concerns and removing the need for most parameters. We further exploit the conservation of density and distances endowed by the embedding in PCA space, opening the door to reliable free energy computation. We leverage the predictable uncertainty of image labels to generate high-resolution reconstructions and identify high-density trajectories in latent space. We make the code freely available at https://github.com/ma-gilles/recovar. |
format | Online Article Text |
id | pubmed-10634927 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-106349272023-11-13 A Bayesian Framework for Cryo-EM Heterogeneity Analysis using Regularized Covariance Estimation Gilles, Marc Aurèle Singer, Amit bioRxiv Article Proteins and the complexes they form are central to nearly all cellular processes. Their flexibility, expressed through a continuum of states, provides a window into their biological functions. Cryogenic-electron microscopy (cryo-EM) is an ideal tool to study these dynamic states as it captures specimens in non-crystalline conditions and enables high-resolution reconstructions. However, analyzing the heterogeneous distribution of conformations from cryo-EM data is challenging. Current methods face issues such as a lack of explainability, overfitting caused by lack of regularization, and a large number of parameters to tune; problems exacerbated by the lack of proper metrics to evaluate or compare heterogeneous reconstructions. To address these challenges, we present RECOVAR, a white-box method based on principal component analysis (PCA) computed via regularized covariance estimation that can resolve intricate heterogeneity with similar expressive power to neural networks with significantly lower computational demands. We extend the ubiquitous Bayesian framework used in homogeneous reconstruction to automatically regularize principal components, overcoming overfitting concerns and removing the need for most parameters. We further exploit the conservation of density and distances endowed by the embedding in PCA space, opening the door to reliable free energy computation. We leverage the predictable uncertainty of image labels to generate high-resolution reconstructions and identify high-density trajectories in latent space. We make the code freely available at https://github.com/ma-gilles/recovar. Cold Spring Harbor Laboratory 2023-11-01 /pmc/articles/PMC10634927/ /pubmed/37961393 http://dx.doi.org/10.1101/2023.10.28.564422 Text en https://creativecommons.org/licenses/by-nc/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (https://creativecommons.org/licenses/by-nc/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Gilles, Marc Aurèle Singer, Amit A Bayesian Framework for Cryo-EM Heterogeneity Analysis using Regularized Covariance Estimation |
title | A Bayesian Framework for Cryo-EM Heterogeneity Analysis using Regularized Covariance Estimation |
title_full | A Bayesian Framework for Cryo-EM Heterogeneity Analysis using Regularized Covariance Estimation |
title_fullStr | A Bayesian Framework for Cryo-EM Heterogeneity Analysis using Regularized Covariance Estimation |
title_full_unstemmed | A Bayesian Framework for Cryo-EM Heterogeneity Analysis using Regularized Covariance Estimation |
title_short | A Bayesian Framework for Cryo-EM Heterogeneity Analysis using Regularized Covariance Estimation |
title_sort | bayesian framework for cryo-em heterogeneity analysis using regularized covariance estimation |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10634927/ https://www.ncbi.nlm.nih.gov/pubmed/37961393 http://dx.doi.org/10.1101/2023.10.28.564422 |
work_keys_str_mv | AT gillesmarcaurele abayesianframeworkforcryoemheterogeneityanalysisusingregularizedcovarianceestimation AT singeramit abayesianframeworkforcryoemheterogeneityanalysisusingregularizedcovarianceestimation AT gillesmarcaurele bayesianframeworkforcryoemheterogeneityanalysisusingregularizedcovarianceestimation AT singeramit bayesianframeworkforcryoemheterogeneityanalysisusingregularizedcovarianceestimation |