Cargando…

Interpretable modeling of genotype–phenotype landscapes with state-of-the-art predictive power

Large-scale measurements linking genetic background to biological function have driven a need for models that can incorporate these data for reliable predictions and insight into the underlying biophysical system. Recent modeling efforts, however, prioritize predictive accuracy at the expense of mod...

Descripción completa

Detalles Bibliográficos
Autores principales: Tonner, Peter D., Pressman, Abe, Ross, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9245639/
https://www.ncbi.nlm.nih.gov/pubmed/35733251
http://dx.doi.org/10.1073/pnas.2114021119
_version_ 1784738784866205696
author Tonner, Peter D.
Pressman, Abe
Ross, David
author_facet Tonner, Peter D.
Pressman, Abe
Ross, David
author_sort Tonner, Peter D.
collection PubMed
description Large-scale measurements linking genetic background to biological function have driven a need for models that can incorporate these data for reliable predictions and insight into the underlying biophysical system. Recent modeling efforts, however, prioritize predictive accuracy at the expense of model interpretability. Here, we present LANTERN (landscape interpretable nonparametric model, https://github.com/usnistgov/lantern), a hierarchical Bayesian model that distills genotype–phenotype landscape (GPL) measurements into a low-dimensional feature space that represents the fundamental biological mechanisms of the system while also enabling straightforward, explainable predictions. Across a benchmark of large-scale datasets, LANTERN equals or outperforms all alternative approaches, including deep neural networks. LANTERN furthermore extracts useful insights of the landscape, including its inherent dimensionality, a latent space of additive mutational effects, and metrics of landscape structure. LANTERN facilitates straightforward discovery of fundamental mechanisms in GPLs, while also reliably extrapolating to unexplored regions of genotypic space.
format Online
Article
Text
id pubmed-9245639
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-92456392022-07-01 Interpretable modeling of genotype–phenotype landscapes with state-of-the-art predictive power Tonner, Peter D. Pressman, Abe Ross, David Proc Natl Acad Sci U S A Biological Sciences Large-scale measurements linking genetic background to biological function have driven a need for models that can incorporate these data for reliable predictions and insight into the underlying biophysical system. Recent modeling efforts, however, prioritize predictive accuracy at the expense of model interpretability. Here, we present LANTERN (landscape interpretable nonparametric model, https://github.com/usnistgov/lantern), a hierarchical Bayesian model that distills genotype–phenotype landscape (GPL) measurements into a low-dimensional feature space that represents the fundamental biological mechanisms of the system while also enabling straightforward, explainable predictions. Across a benchmark of large-scale datasets, LANTERN equals or outperforms all alternative approaches, including deep neural networks. LANTERN furthermore extracts useful insights of the landscape, including its inherent dimensionality, a latent space of additive mutational effects, and metrics of landscape structure. LANTERN facilitates straightforward discovery of fundamental mechanisms in GPLs, while also reliably extrapolating to unexplored regions of genotypic space. National Academy of Sciences 2022-06-21 2022-06-28 /pmc/articles/PMC9245639/ /pubmed/35733251 http://dx.doi.org/10.1073/pnas.2114021119 Text en Copyright © 2022 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Biological Sciences
Tonner, Peter D.
Pressman, Abe
Ross, David
Interpretable modeling of genotype–phenotype landscapes with state-of-the-art predictive power
title Interpretable modeling of genotype–phenotype landscapes with state-of-the-art predictive power
title_full Interpretable modeling of genotype–phenotype landscapes with state-of-the-art predictive power
title_fullStr Interpretable modeling of genotype–phenotype landscapes with state-of-the-art predictive power
title_full_unstemmed Interpretable modeling of genotype–phenotype landscapes with state-of-the-art predictive power
title_short Interpretable modeling of genotype–phenotype landscapes with state-of-the-art predictive power
title_sort interpretable modeling of genotype–phenotype landscapes with state-of-the-art predictive power
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9245639/
https://www.ncbi.nlm.nih.gov/pubmed/35733251
http://dx.doi.org/10.1073/pnas.2114021119
work_keys_str_mv AT tonnerpeterd interpretablemodelingofgenotypephenotypelandscapeswithstateoftheartpredictivepower
AT pressmanabe interpretablemodelingofgenotypephenotypelandscapeswithstateoftheartpredictivepower
AT rossdavid interpretablemodelingofgenotypephenotypelandscapeswithstateoftheartpredictivepower