Cargando…

Positional SHAP (PoSHAP) for Interpretation of machine learning models trained from biological sequences

Machine learning with multi-layered artificial neural networks, also known as “deep learning,” is effective for making biological predictions. However, model interpretation is challenging, especially for sequential input data used with recurrent neural network architectures. Here, we introduce a fra...

Descripción completa

Detalles Bibliográficos
Autores principales:	Dickinson, Quinn, Meyer, Jesse G.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2022
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8797255/ https://www.ncbi.nlm.nih.gov/pubmed/35089914 http://dx.doi.org/10.1371/journal.pcbi.1009736

_version_	1784641507055108096
author	Dickinson, Quinn Meyer, Jesse G.
author_facet	Dickinson, Quinn Meyer, Jesse G.
author_sort	Dickinson, Quinn
collection	PubMed
description	Machine learning with multi-layered artificial neural networks, also known as “deep learning,” is effective for making biological predictions. However, model interpretation is challenging, especially for sequential input data used with recurrent neural network architectures. Here, we introduce a framework called “Positional SHAP” (PoSHAP) to interpret models trained from biological sequences by utilizing SHapely Additive exPlanations (SHAP) to generate positional model interpretations. We demonstrate this using three long short-term memory (LSTM) regression models that predict peptide properties, including binding affinity to major histocompatibility complexes (MHC), and collisional cross section (CCS) measured by ion mobility spectrometry. Interpretation of these models with PoSHAP reproduced MHC class I (rhesus macaque Mamu-A1001 and human A11:01) peptide binding motifs, reflected known properties of peptide CCS, and provided new insights into interpositional dependencies of amino acid interactions. PoSHAP should have widespread utility for interpreting a variety of models trained from biological sequences.
format	Online Article Text
id	pubmed-8797255
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-87972552022-01-29 Positional SHAP (PoSHAP) for Interpretation of machine learning models trained from biological sequences Dickinson, Quinn Meyer, Jesse G. PLoS Comput Biol Research Article Machine learning with multi-layered artificial neural networks, also known as “deep learning,” is effective for making biological predictions. However, model interpretation is challenging, especially for sequential input data used with recurrent neural network architectures. Here, we introduce a framework called “Positional SHAP” (PoSHAP) to interpret models trained from biological sequences by utilizing SHapely Additive exPlanations (SHAP) to generate positional model interpretations. We demonstrate this using three long short-term memory (LSTM) regression models that predict peptide properties, including binding affinity to major histocompatibility complexes (MHC), and collisional cross section (CCS) measured by ion mobility spectrometry. Interpretation of these models with PoSHAP reproduced MHC class I (rhesus macaque Mamu-A1001 and human A11:01) peptide binding motifs, reflected known properties of peptide CCS, and provided new insights into interpositional dependencies of amino acid interactions. PoSHAP should have widespread utility for interpreting a variety of models trained from biological sequences. Public Library of Science 2022-01-28 /pmc/articles/PMC8797255/ /pubmed/35089914 http://dx.doi.org/10.1371/journal.pcbi.1009736 Text en © 2022 Dickinson, Meyer https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Dickinson, Quinn Meyer, Jesse G. Positional SHAP (PoSHAP) for Interpretation of machine learning models trained from biological sequences
title	Positional SHAP (PoSHAP) for Interpretation of machine learning models trained from biological sequences
title_full	Positional SHAP (PoSHAP) for Interpretation of machine learning models trained from biological sequences
title_fullStr	Positional SHAP (PoSHAP) for Interpretation of machine learning models trained from biological sequences
title_full_unstemmed	Positional SHAP (PoSHAP) for Interpretation of machine learning models trained from biological sequences
title_short	Positional SHAP (PoSHAP) for Interpretation of machine learning models trained from biological sequences
title_sort	positional shap (poshap) for interpretation of machine learning models trained from biological sequences
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8797255/ https://www.ncbi.nlm.nih.gov/pubmed/35089914 http://dx.doi.org/10.1371/journal.pcbi.1009736
work_keys_str_mv	AT dickinsonquinn positionalshapposhapforinterpretationofmachinelearningmodelstrainedfrombiologicalsequences AT meyerjesseg positionalshapposhapforinterpretationofmachinelearningmodelstrainedfrombiologicalsequences

Positional SHAP (PoSHAP) for Interpretation of machine learning models trained from biological sequences

Ejemplares similares