Cargando…

FLAN: feature-wise latent additive neural models for biological applications

MOTIVATION: Interpretability has become a necessary feature for machine learning models deployed in critical scenarios, e.g. legal system, healthcare. In these situations, algorithmic decisions may have (potentially negative) long-lasting effects on the end-user affected by the decision. While deep...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen, An-Phi, Vasilaki, Stefania, Martínez, María Rodríguez
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10199769/
https://www.ncbi.nlm.nih.gov/pubmed/37031956
http://dx.doi.org/10.1093/bib/bbad056
_version_ 1785045001253683200
author Nguyen, An-Phi
Vasilaki, Stefania
Martínez, María Rodríguez
author_facet Nguyen, An-Phi
Vasilaki, Stefania
Martínez, María Rodríguez
author_sort Nguyen, An-Phi
collection PubMed
description MOTIVATION: Interpretability has become a necessary feature for machine learning models deployed in critical scenarios, e.g. legal system, healthcare. In these situations, algorithmic decisions may have (potentially negative) long-lasting effects on the end-user affected by the decision. While deep learning models achieve impressive results, they often function as a black-box. Inspired by linear models, we propose a novel class of structurally constrained deep neural networks, which we call FLAN (Feature-wise Latent Additive Networks). Crucially, FLANs process each input feature separately, computing for each of them a representation in a common latent space. These feature-wise latent representations are then simply summed, and the aggregated representation is used for the prediction. These feature-wise representations allow a user to estimate the effect of each individual feature independently from the others, similarly to the way linear models are interpreted. RESULTS: We demonstrate FLAN on a series of benchmark datasets in different biological domains. Our experiments show that FLAN achieves good performances even in complex datasets (e.g. TCR-epitope binding prediction), despite the structural constraint we imposed. On the other hand, this constraint enables us to interpret FLAN by deciphering its decision process, as well as obtaining biological insights (e.g. by identifying the marker genes of different cell populations). In supplementary experiments, we show similar performances also on non-biological datasets. CODE AND DATA AVAILABILITY: Code and example data are available at https://github.com/phineasng/flan_bio.
format Online
Article
Text
id pubmed-10199769
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101997692023-05-21 FLAN: feature-wise latent additive neural models for biological applications Nguyen, An-Phi Vasilaki, Stefania Martínez, María Rodríguez Brief Bioinform Problem Solving Protocol MOTIVATION: Interpretability has become a necessary feature for machine learning models deployed in critical scenarios, e.g. legal system, healthcare. In these situations, algorithmic decisions may have (potentially negative) long-lasting effects on the end-user affected by the decision. While deep learning models achieve impressive results, they often function as a black-box. Inspired by linear models, we propose a novel class of structurally constrained deep neural networks, which we call FLAN (Feature-wise Latent Additive Networks). Crucially, FLANs process each input feature separately, computing for each of them a representation in a common latent space. These feature-wise latent representations are then simply summed, and the aggregated representation is used for the prediction. These feature-wise representations allow a user to estimate the effect of each individual feature independently from the others, similarly to the way linear models are interpreted. RESULTS: We demonstrate FLAN on a series of benchmark datasets in different biological domains. Our experiments show that FLAN achieves good performances even in complex datasets (e.g. TCR-epitope binding prediction), despite the structural constraint we imposed. On the other hand, this constraint enables us to interpret FLAN by deciphering its decision process, as well as obtaining biological insights (e.g. by identifying the marker genes of different cell populations). In supplementary experiments, we show similar performances also on non-biological datasets. CODE AND DATA AVAILABILITY: Code and example data are available at https://github.com/phineasng/flan_bio. Oxford University Press 2023-04-06 /pmc/articles/PMC10199769/ /pubmed/37031956 http://dx.doi.org/10.1093/bib/bbad056 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Problem Solving Protocol
Nguyen, An-Phi
Vasilaki, Stefania
Martínez, María Rodríguez
FLAN: feature-wise latent additive neural models for biological applications
title FLAN: feature-wise latent additive neural models for biological applications
title_full FLAN: feature-wise latent additive neural models for biological applications
title_fullStr FLAN: feature-wise latent additive neural models for biological applications
title_full_unstemmed FLAN: feature-wise latent additive neural models for biological applications
title_short FLAN: feature-wise latent additive neural models for biological applications
title_sort flan: feature-wise latent additive neural models for biological applications
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10199769/
https://www.ncbi.nlm.nih.gov/pubmed/37031956
http://dx.doi.org/10.1093/bib/bbad056
work_keys_str_mv AT nguyenanphi flanfeaturewiselatentadditiveneuralmodelsforbiologicalapplications
AT vasilakistefania flanfeaturewiselatentadditiveneuralmodelsforbiologicalapplications
AT martinezmariarodriguez flanfeaturewiselatentadditiveneuralmodelsforbiologicalapplications