Cargando…
Interpretable modeling of genotype–phenotype landscapes with state-of-the-art predictive power
Large-scale measurements linking genetic background to biological function have driven a need for models that can incorporate these data for reliable predictions and insight into the underlying biophysical system. Recent modeling efforts, however, prioritize predictive accuracy at the expense of mod...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
National Academy of Sciences
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9245639/ https://www.ncbi.nlm.nih.gov/pubmed/35733251 http://dx.doi.org/10.1073/pnas.2114021119 |
_version_ | 1784738784866205696 |
---|---|
author | Tonner, Peter D. Pressman, Abe Ross, David |
author_facet | Tonner, Peter D. Pressman, Abe Ross, David |
author_sort | Tonner, Peter D. |
collection | PubMed |
description | Large-scale measurements linking genetic background to biological function have driven a need for models that can incorporate these data for reliable predictions and insight into the underlying biophysical system. Recent modeling efforts, however, prioritize predictive accuracy at the expense of model interpretability. Here, we present LANTERN (landscape interpretable nonparametric model, https://github.com/usnistgov/lantern), a hierarchical Bayesian model that distills genotype–phenotype landscape (GPL) measurements into a low-dimensional feature space that represents the fundamental biological mechanisms of the system while also enabling straightforward, explainable predictions. Across a benchmark of large-scale datasets, LANTERN equals or outperforms all alternative approaches, including deep neural networks. LANTERN furthermore extracts useful insights of the landscape, including its inherent dimensionality, a latent space of additive mutational effects, and metrics of landscape structure. LANTERN facilitates straightforward discovery of fundamental mechanisms in GPLs, while also reliably extrapolating to unexplored regions of genotypic space. |
format | Online Article Text |
id | pubmed-9245639 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | National Academy of Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-92456392022-07-01 Interpretable modeling of genotype–phenotype landscapes with state-of-the-art predictive power Tonner, Peter D. Pressman, Abe Ross, David Proc Natl Acad Sci U S A Biological Sciences Large-scale measurements linking genetic background to biological function have driven a need for models that can incorporate these data for reliable predictions and insight into the underlying biophysical system. Recent modeling efforts, however, prioritize predictive accuracy at the expense of model interpretability. Here, we present LANTERN (landscape interpretable nonparametric model, https://github.com/usnistgov/lantern), a hierarchical Bayesian model that distills genotype–phenotype landscape (GPL) measurements into a low-dimensional feature space that represents the fundamental biological mechanisms of the system while also enabling straightforward, explainable predictions. Across a benchmark of large-scale datasets, LANTERN equals or outperforms all alternative approaches, including deep neural networks. LANTERN furthermore extracts useful insights of the landscape, including its inherent dimensionality, a latent space of additive mutational effects, and metrics of landscape structure. LANTERN facilitates straightforward discovery of fundamental mechanisms in GPLs, while also reliably extrapolating to unexplored regions of genotypic space. National Academy of Sciences 2022-06-21 2022-06-28 /pmc/articles/PMC9245639/ /pubmed/35733251 http://dx.doi.org/10.1073/pnas.2114021119 Text en Copyright © 2022 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Biological Sciences Tonner, Peter D. Pressman, Abe Ross, David Interpretable modeling of genotype–phenotype landscapes with state-of-the-art predictive power |
title | Interpretable modeling of genotype–phenotype landscapes with state-of-the-art predictive power |
title_full | Interpretable modeling of genotype–phenotype landscapes with state-of-the-art predictive power |
title_fullStr | Interpretable modeling of genotype–phenotype landscapes with state-of-the-art predictive power |
title_full_unstemmed | Interpretable modeling of genotype–phenotype landscapes with state-of-the-art predictive power |
title_short | Interpretable modeling of genotype–phenotype landscapes with state-of-the-art predictive power |
title_sort | interpretable modeling of genotype–phenotype landscapes with state-of-the-art predictive power |
topic | Biological Sciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9245639/ https://www.ncbi.nlm.nih.gov/pubmed/35733251 http://dx.doi.org/10.1073/pnas.2114021119 |
work_keys_str_mv | AT tonnerpeterd interpretablemodelingofgenotypephenotypelandscapeswithstateoftheartpredictivepower AT pressmanabe interpretablemodelingofgenotypephenotypelandscapeswithstateoftheartpredictivepower AT rossdavid interpretablemodelingofgenotypephenotypelandscapeswithstateoftheartpredictivepower |