Cargando…

Interpretable pairwise distillations for generative protein sequence models

Many different types of generative models for protein sequences have been proposed in literature. Their uses include the prediction of mutational effects, protein design and the prediction of structural properties. Neural network (NN) architectures have shown great performances, commonly attributed...

Descripción completa

Detalles Bibliográficos
Autores principales:	Feinauer, Christoph, Meynard-Piganeau, Barthelemy, Lucibello, Carlo
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2022
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9258900/ https://www.ncbi.nlm.nih.gov/pubmed/35737722 http://dx.doi.org/10.1371/journal.pcbi.1010219

_version_	1784741655818010624
author	Feinauer, Christoph Meynard-Piganeau, Barthelemy Lucibello, Carlo
author_facet	Feinauer, Christoph Meynard-Piganeau, Barthelemy Lucibello, Carlo
author_sort	Feinauer, Christoph
collection	PubMed
description	Many different types of generative models for protein sequences have been proposed in literature. Their uses include the prediction of mutational effects, protein design and the prediction of structural properties. Neural network (NN) architectures have shown great performances, commonly attributed to the capacity to extract non-trivial higher-order interactions from the data. In this work, we analyze two different NN models and assess how close they are to simple pairwise distributions, which have been used in the past for similar problems. We present an approach for extracting pairwise models from more complex ones using an energy-based modeling framework. We show that for the tested models the extracted pairwise models can replicate the energies of the original models and are also close in performance in tasks like mutational effect prediction. In addition, we show that even simpler, factorized models often come close in performance to the original models.
format	Online Article Text
id	pubmed-9258900
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-92589002022-07-07 Interpretable pairwise distillations for generative protein sequence models Feinauer, Christoph Meynard-Piganeau, Barthelemy Lucibello, Carlo PLoS Comput Biol Research Article Many different types of generative models for protein sequences have been proposed in literature. Their uses include the prediction of mutational effects, protein design and the prediction of structural properties. Neural network (NN) architectures have shown great performances, commonly attributed to the capacity to extract non-trivial higher-order interactions from the data. In this work, we analyze two different NN models and assess how close they are to simple pairwise distributions, which have been used in the past for similar problems. We present an approach for extracting pairwise models from more complex ones using an energy-based modeling framework. We show that for the tested models the extracted pairwise models can replicate the energies of the original models and are also close in performance in tasks like mutational effect prediction. In addition, we show that even simpler, factorized models often come close in performance to the original models. Public Library of Science 2022-06-23 /pmc/articles/PMC9258900/ /pubmed/35737722 http://dx.doi.org/10.1371/journal.pcbi.1010219 Text en © 2022 Feinauer et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Feinauer, Christoph Meynard-Piganeau, Barthelemy Lucibello, Carlo Interpretable pairwise distillations for generative protein sequence models
title	Interpretable pairwise distillations for generative protein sequence models
title_full	Interpretable pairwise distillations for generative protein sequence models
title_fullStr	Interpretable pairwise distillations for generative protein sequence models
title_full_unstemmed	Interpretable pairwise distillations for generative protein sequence models
title_short	Interpretable pairwise distillations for generative protein sequence models
title_sort	interpretable pairwise distillations for generative protein sequence models
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9258900/ https://www.ncbi.nlm.nih.gov/pubmed/35737722 http://dx.doi.org/10.1371/journal.pcbi.1010219
work_keys_str_mv	AT feinauerchristoph interpretablepairwisedistillationsforgenerativeproteinsequencemodels AT meynardpiganeaubarthelemy interpretablepairwisedistillationsforgenerativeproteinsequencemodels AT lucibellocarlo interpretablepairwisedistillationsforgenerativeproteinsequencemodels

Interpretable pairwise distillations for generative protein sequence models

Ejemplares similares