Transformer for Gene Expression Modeling (T-GEM): An Interpretable Deep Learning Model for Gene Expression-Based Phenotype Predictions

SIMPLE SUMMARY: Cancer is the second leading cause of death worldwide. Predicting phenotypes and understanding the markers that define them are important tasks. We propose an interpretable deep learning model called T-GEM that can predict cancer-related phenotypes and reveal phenotype-r...

Bibliographic Details
Main authors:	Zhang, Ting-He, Hasib, Md Musaddaqul, Chiu, Yu-Chiao, Han, Zhi-Feng, Jin, Yu-Fang, Flores, Mario, Chen, Yidong, Huang, Yufei
Format:	Online Article Text
Language:	English
Published:	MDPI 2022
Subjects:	Article
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9562172/ https://www.ncbi.nlm.nih.gov/pubmed/36230685 http://dx.doi.org/10.3390/cancers14194763

_version_	1784808110189182976
author	Zhang, Ting-He Hasib, Md Musaddaqul Chiu, Yu-Chiao Han, Zhi-Feng Jin, Yu-Fang Flores, Mario Chen, Yidong Huang, Yufei
author_facet	Zhang, Ting-He Hasib, Md Musaddaqul Chiu, Yu-Chiao Han, Zhi-Feng Jin, Yu-Fang Flores, Mario Chen, Yidong Huang, Yufei
author_sort	Zhang, Ting-He
collection	PubMed
description	SIMPLE SUMMARY: Cancer is the second leading cause of death worldwide. Predicting phenotypes and understanding the markers that define them are important tasks. We propose an interpretable deep learning model called T-GEM that can predict cancer-related phenotypes and reveal phenotype-related biological functions and marker genes. We demonstrated the capability of T-GEM on cancer type prediction using TCGA data and immune cell type identification using scRNA-seq data. The code and detailed documents are provided to facilitate easy implementation of the model in other studies. ABSTRACT: Deep learning has been applied in precision oncology to address a variety of gene expression-based phenotype predictions. However, the unique characteristics of gene expression data challenge the computer vision-inspired design of popular deep learning (DL) models such as the convolutional neural network (CNN) and call for interpretable DL models tailored to transcriptomics studies. To address the current challenges in developing an interpretable DL model for gene expression data, we propose a novel interpretable deep learning architecture called T-GEM, or Transformer for Gene Expression Modeling. We describe the T-GEM model for modeling gene–gene interactions in detail and demonstrate its utility for gene expression-based predictions of cancer-related phenotypes, including cancer type prediction and immune cell type classification. We carefully analyzed the learning mechanism of T-GEM and showed that the first layer has broader attention while higher layers focus more on phenotype-related genes. We also showed that T-GEM’s self-attention could capture important biological functions associated with the predicted phenotypes. We further devised a method to extract the regulatory network that T-GEM learns by exploiting the attributions of self-attention weights for classifications and showed that the network hub genes were likely markers for the predicted phenotypes.
format	Online Article Text
id	pubmed-9562172
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-9562172 2022-10-15 Transformer for Gene Expression Modeling (T-GEM): An Interpretable Deep Learning Model for Gene Expression-Based Phenotype Predictions Zhang, Ting-He Hasib, Md Musaddaqul Chiu, Yu-Chiao Han, Zhi-Feng Jin, Yu-Fang Flores, Mario Chen, Yidong Huang, Yufei Cancers (Basel) Article SIMPLE SUMMARY: Cancer is the second leading cause of death worldwide. Predicting phenotypes and understanding the markers that define them are important tasks. We propose an interpretable deep learning model called T-GEM that can predict cancer-related phenotypes and reveal phenotype-related biological functions and marker genes. We demonstrated the capability of T-GEM on cancer type prediction using TCGA data and immune cell type identification using scRNA-seq data. The code and detailed documents are provided to facilitate easy implementation of the model in other studies. ABSTRACT: Deep learning has been applied in precision oncology to address a variety of gene expression-based phenotype predictions. However, the unique characteristics of gene expression data challenge the computer vision-inspired design of popular deep learning (DL) models such as the convolutional neural network (CNN) and call for interpretable DL models tailored to transcriptomics studies. To address the current challenges in developing an interpretable DL model for gene expression data, we propose a novel interpretable deep learning architecture called T-GEM, or Transformer for Gene Expression Modeling. We describe the T-GEM model for modeling gene–gene interactions in detail and demonstrate its utility for gene expression-based predictions of cancer-related phenotypes, including cancer type prediction and immune cell type classification. We carefully analyzed the learning mechanism of T-GEM and showed that the first layer has broader attention while higher layers focus more on phenotype-related genes. We also showed that T-GEM’s self-attention could capture important biological functions associated with the predicted phenotypes. We further devised a method to extract the regulatory network that T-GEM learns by exploiting the attributions of self-attention weights for classifications and showed that the network hub genes were likely markers for the predicted phenotypes. MDPI 2022-09-29 /pmc/articles/PMC9562172/ /pubmed/36230685 http://dx.doi.org/10.3390/cancers14194763 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Zhang, Ting-He Hasib, Md Musaddaqul Chiu, Yu-Chiao Han, Zhi-Feng Jin, Yu-Fang Flores, Mario Chen, Yidong Huang, Yufei Transformer for Gene Expression Modeling (T-GEM): An Interpretable Deep Learning Model for Gene Expression-Based Phenotype Predictions
title	Transformer for Gene Expression Modeling (T-GEM): An Interpretable Deep Learning Model for Gene Expression-Based Phenotype Predictions
title_full	Transformer for Gene Expression Modeling (T-GEM): An Interpretable Deep Learning Model for Gene Expression-Based Phenotype Predictions
title_fullStr	Transformer for Gene Expression Modeling (T-GEM): An Interpretable Deep Learning Model for Gene Expression-Based Phenotype Predictions
title_full_unstemmed	Transformer for Gene Expression Modeling (T-GEM): An Interpretable Deep Learning Model for Gene Expression-Based Phenotype Predictions
title_short	Transformer for Gene Expression Modeling (T-GEM): An Interpretable Deep Learning Model for Gene Expression-Based Phenotype Predictions
title_sort	transformer for gene expression modeling (t-gem): an interpretable deep learning model for gene expression-based phenotype predictions
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9562172/ https://www.ncbi.nlm.nih.gov/pubmed/36230685 http://dx.doi.org/10.3390/cancers14194763
work_keys_str_mv	AT zhangtinghe transformerforgeneexpressionmodelingtgemaninterpretabledeeplearningmodelforgeneexpressionbasedphenotypepredictions AT hasibmdmusaddaqul transformerforgeneexpressionmodelingtgemaninterpretabledeeplearningmodelforgeneexpressionbasedphenotypepredictions AT chiuyuchiao transformerforgeneexpressionmodelingtgemaninterpretabledeeplearningmodelforgeneexpressionbasedphenotypepredictions AT hanzhifeng transformerforgeneexpressionmodelingtgemaninterpretabledeeplearningmodelforgeneexpressionbasedphenotypepredictions AT jinyufang transformerforgeneexpressionmodelingtgemaninterpretabledeeplearningmodelforgeneexpressionbasedphenotypepredictions AT floresmario transformerforgeneexpressionmodelingtgemaninterpretabledeeplearningmodelforgeneexpressionbasedphenotypepredictions AT chenyidong transformerforgeneexpressionmodelingtgemaninterpretabledeeplearningmodelforgeneexpressionbasedphenotypepredictions AT huangyufei transformerforgeneexpressionmodelingtgemaninterpretabledeeplearningmodelforgeneexpressionbasedphenotypepredictions
