Cargando…

Assessment and Optimization of Explainable Machine Learning Models Applied to Transcriptomic Data

Explainable artificial intelligence aims to interpret how machine learning models make decisions, and many model explainers have been developed in the computer vision field. However, understanding of the applicability of these model explainers to biological data is still lacking. In this study, we c...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhao, Yongbing, Shao, Jinfeng, Asmann, Yan W.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Elsevier 2022
Materias:	Original Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10025763/ https://www.ncbi.nlm.nih.gov/pubmed/35931322 http://dx.doi.org/10.1016/j.gpb.2022.07.003

_version_	1784909406900584448
author	Zhao, Yongbing Shao, Jinfeng Asmann, Yan W.
author_facet	Zhao, Yongbing Shao, Jinfeng Asmann, Yan W.
author_sort	Zhao, Yongbing
collection	PubMed
description	Explainable artificial intelligence aims to interpret how machine learning models make decisions, and many model explainers have been developed in the computer vision field. However, understanding of the applicability of these model explainers to biological data is still lacking. In this study, we comprehensively evaluated multiple explainers by interpreting pre-trained models for predicting tissue types from transcriptomic data and by identifying the top contributing genes from each sample with the greatest impacts on model prediction. To improve the reproducibility and interpretability of results generated by model explainers, we proposed a series of optimization strategies for each explainer on two different model architectures of multilayer perceptron (MLP) and convolutional neural network (CNN). We observed three groups of explainer and model architecture combinations with high reproducibility. Group II, which contains three model explainers on aggregated MLP models, identified top contributing genes in different tissues that exhibited tissue-specific manifestation and were potential cancer biomarkers. In summary, our work provides novel insights and guidance for exploring biological mechanisms using explainable machine learning models.
format	Online Article Text
id	pubmed-10025763
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Elsevier
record_format	MEDLINE/PubMed
spelling	pubmed-100257632023-03-21 Assessment and Optimization of Explainable Machine Learning Models Applied to Transcriptomic Data Zhao, Yongbing Shao, Jinfeng Asmann, Yan W. Genomics Proteomics Bioinformatics Original Research Explainable artificial intelligence aims to interpret how machine learning models make decisions, and many model explainers have been developed in the computer vision field. However, understanding of the applicability of these model explainers to biological data is still lacking. In this study, we comprehensively evaluated multiple explainers by interpreting pre-trained models for predicting tissue types from transcriptomic data and by identifying the top contributing genes from each sample with the greatest impacts on model prediction. To improve the reproducibility and interpretability of results generated by model explainers, we proposed a series of optimization strategies for each explainer on two different model architectures of multilayer perceptron (MLP) and convolutional neural network (CNN). We observed three groups of explainer and model architecture combinations with high reproducibility. Group II, which contains three model explainers on aggregated MLP models, identified top contributing genes in different tissues that exhibited tissue-specific manifestation and were potential cancer biomarkers. In summary, our work provides novel insights and guidance for exploring biological mechanisms using explainable machine learning models. Elsevier 2022-10 2022-08-03 /pmc/articles/PMC10025763/ /pubmed/35931322 http://dx.doi.org/10.1016/j.gpb.2022.07.003 Text en © 2022 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Original Research Zhao, Yongbing Shao, Jinfeng Asmann, Yan W. Assessment and Optimization of Explainable Machine Learning Models Applied to Transcriptomic Data
title	Assessment and Optimization of Explainable Machine Learning Models Applied to Transcriptomic Data
title_full	Assessment and Optimization of Explainable Machine Learning Models Applied to Transcriptomic Data
title_fullStr	Assessment and Optimization of Explainable Machine Learning Models Applied to Transcriptomic Data
title_full_unstemmed	Assessment and Optimization of Explainable Machine Learning Models Applied to Transcriptomic Data
title_short	Assessment and Optimization of Explainable Machine Learning Models Applied to Transcriptomic Data
title_sort	assessment and optimization of explainable machine learning models applied to transcriptomic data
topic	Original Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10025763/ https://www.ncbi.nlm.nih.gov/pubmed/35931322 http://dx.doi.org/10.1016/j.gpb.2022.07.003
work_keys_str_mv	AT zhaoyongbing assessmentandoptimizationofexplainablemachinelearningmodelsappliedtotranscriptomicdata AT shaojinfeng assessmentandoptimizationofexplainablemachinelearningmodelsappliedtotranscriptomicdata AT asmannyanw assessmentandoptimizationofexplainablemachinelearningmodelsappliedtotranscriptomicdata

Assessment and Optimization of Explainable Machine Learning Models Applied to Transcriptomic Data

Ejemplares similares