Cargando…

Inherently interpretable position-aware convolutional motif kernel networks for biological sequencing data

Artificial neural networks show promising performance in detecting correlations within data that are associated with specific outcomes. However, the black-box nature of such models can hinder the knowledge advancement in research fields by obscuring the decision process and preventing scientist to f...

Descripción completa

Detalles Bibliográficos
Autores principales: Ditz, Jonas C., Reuter, Bernhard, Pfeifer, Nico
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10567796/
https://www.ncbi.nlm.nih.gov/pubmed/37821530
http://dx.doi.org/10.1038/s41598-023-44175-7
_version_ 1785119212478398464
author Ditz, Jonas C.
Reuter, Bernhard
Pfeifer, Nico
author_facet Ditz, Jonas C.
Reuter, Bernhard
Pfeifer, Nico
author_sort Ditz, Jonas C.
collection PubMed
description Artificial neural networks show promising performance in detecting correlations within data that are associated with specific outcomes. However, the black-box nature of such models can hinder the knowledge advancement in research fields by obscuring the decision process and preventing scientist to fully conceptualize predicted outcomes. Furthermore, domain experts like healthcare providers need explainable predictions to assess whether a predicted outcome can be trusted in high stakes scenarios and to help them integrating a model into their own routine. Therefore, interpretable models play a crucial role for the incorporation of machine learning into high stakes scenarios like healthcare. In this paper we introduce Convolutional Motif Kernel Networks, a neural network architecture that involves learning a feature representation within a subspace of the reproducing kernel Hilbert space of the position-aware motif kernel function. The resulting model enables to directly interpret and evaluate prediction outcomes by providing a biologically and medically meaningful explanation without the need for additional post-hoc analysis. We show that our model is able to robustly learn on small datasets and reaches state-of-the-art performance on relevant healthcare prediction tasks. Our proposed method can be utilized on DNA and protein sequences. Furthermore, we show that the proposed method learns biologically meaningful concepts directly from data using an end-to-end learning scheme.
format Online
Article
Text
id pubmed-10567796
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-105677962023-10-13 Inherently interpretable position-aware convolutional motif kernel networks for biological sequencing data Ditz, Jonas C. Reuter, Bernhard Pfeifer, Nico Sci Rep Article Artificial neural networks show promising performance in detecting correlations within data that are associated with specific outcomes. However, the black-box nature of such models can hinder the knowledge advancement in research fields by obscuring the decision process and preventing scientist to fully conceptualize predicted outcomes. Furthermore, domain experts like healthcare providers need explainable predictions to assess whether a predicted outcome can be trusted in high stakes scenarios and to help them integrating a model into their own routine. Therefore, interpretable models play a crucial role for the incorporation of machine learning into high stakes scenarios like healthcare. In this paper we introduce Convolutional Motif Kernel Networks, a neural network architecture that involves learning a feature representation within a subspace of the reproducing kernel Hilbert space of the position-aware motif kernel function. The resulting model enables to directly interpret and evaluate prediction outcomes by providing a biologically and medically meaningful explanation without the need for additional post-hoc analysis. We show that our model is able to robustly learn on small datasets and reaches state-of-the-art performance on relevant healthcare prediction tasks. Our proposed method can be utilized on DNA and protein sequences. Furthermore, we show that the proposed method learns biologically meaningful concepts directly from data using an end-to-end learning scheme. Nature Publishing Group UK 2023-10-11 /pmc/articles/PMC10567796/ /pubmed/37821530 http://dx.doi.org/10.1038/s41598-023-44175-7 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Ditz, Jonas C.
Reuter, Bernhard
Pfeifer, Nico
Inherently interpretable position-aware convolutional motif kernel networks for biological sequencing data
title Inherently interpretable position-aware convolutional motif kernel networks for biological sequencing data
title_full Inherently interpretable position-aware convolutional motif kernel networks for biological sequencing data
title_fullStr Inherently interpretable position-aware convolutional motif kernel networks for biological sequencing data
title_full_unstemmed Inherently interpretable position-aware convolutional motif kernel networks for biological sequencing data
title_short Inherently interpretable position-aware convolutional motif kernel networks for biological sequencing data
title_sort inherently interpretable position-aware convolutional motif kernel networks for biological sequencing data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10567796/
https://www.ncbi.nlm.nih.gov/pubmed/37821530
http://dx.doi.org/10.1038/s41598-023-44175-7
work_keys_str_mv AT ditzjonasc inherentlyinterpretablepositionawareconvolutionalmotifkernelnetworksforbiologicalsequencingdata
AT reuterbernhard inherentlyinterpretablepositionawareconvolutionalmotifkernelnetworksforbiologicalsequencingdata
AT pfeifernico inherentlyinterpretablepositionawareconvolutionalmotifkernelnetworksforbiologicalsequencingdata