Cargando…
omicsGAT: Graph Attention Network for Cancer Subtype Analyses
The use of high-throughput omics technologies is becoming increasingly popular in all facets of biomedical science. The mRNA sequencing (RNA-seq) method reports quantitative measures of more than tens of thousands of biological features. It provides a more comprehensive molecular perspective of stud...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9499656/ https://www.ncbi.nlm.nih.gov/pubmed/36142140 http://dx.doi.org/10.3390/ijms231810220 |
_version_ | 1784795045771083776 |
---|---|
author | Baul, Sudipto Ahmed, Khandakar Tanvir Filipek, Joseph Zhang, Wei |
author_facet | Baul, Sudipto Ahmed, Khandakar Tanvir Filipek, Joseph Zhang, Wei |
author_sort | Baul, Sudipto |
collection | PubMed |
description | The use of high-throughput omics technologies is becoming increasingly popular in all facets of biomedical science. The mRNA sequencing (RNA-seq) method reports quantitative measures of more than tens of thousands of biological features. It provides a more comprehensive molecular perspective of studied cancer mechanisms compared to traditional approaches. Graph-based learning models have been proposed to learn important hidden representations from gene expression data and network structure to improve cancer outcome prediction, patient stratification, and cell clustering. However, these graph-based methods cannot rank the importance of the different neighbors for a particular sample in the downstream cancer subtype analyses. In this study, we introduce omicsGAT, a graph attention network (GAT) model to integrate graph-based learning with an attention mechanism for RNA-seq data analysis. The multi-head attention mechanism in omicsGAT can more effectively secure information of a particular sample by assigning different attention coefficients to its neighbors. Comprehensive experiments on The Cancer Genome Atlas (TCGA) breast cancer and bladder cancer bulk RNA-seq data and two single-cell RNA-seq datasets validate that (1) the proposed model can effectively integrate neighborhood information of a sample and learn an embedding vector to improve disease phenotype prediction, cancer patient stratification, and cell clustering of the sample and (2) the attention matrix generated from the multi-head attention coefficients provides more useful information compared to the sample correlation-based adjacency matrix. From the results, we can conclude that some neighbors play a more important role than others in cancer subtype analyses of a particular sample based on the attention coefficient. |
format | Online Article Text |
id | pubmed-9499656 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-94996562022-09-23 omicsGAT: Graph Attention Network for Cancer Subtype Analyses Baul, Sudipto Ahmed, Khandakar Tanvir Filipek, Joseph Zhang, Wei Int J Mol Sci Article The use of high-throughput omics technologies is becoming increasingly popular in all facets of biomedical science. The mRNA sequencing (RNA-seq) method reports quantitative measures of more than tens of thousands of biological features. It provides a more comprehensive molecular perspective of studied cancer mechanisms compared to traditional approaches. Graph-based learning models have been proposed to learn important hidden representations from gene expression data and network structure to improve cancer outcome prediction, patient stratification, and cell clustering. However, these graph-based methods cannot rank the importance of the different neighbors for a particular sample in the downstream cancer subtype analyses. In this study, we introduce omicsGAT, a graph attention network (GAT) model to integrate graph-based learning with an attention mechanism for RNA-seq data analysis. The multi-head attention mechanism in omicsGAT can more effectively secure information of a particular sample by assigning different attention coefficients to its neighbors. Comprehensive experiments on The Cancer Genome Atlas (TCGA) breast cancer and bladder cancer bulk RNA-seq data and two single-cell RNA-seq datasets validate that (1) the proposed model can effectively integrate neighborhood information of a sample and learn an embedding vector to improve disease phenotype prediction, cancer patient stratification, and cell clustering of the sample and (2) the attention matrix generated from the multi-head attention coefficients provides more useful information compared to the sample correlation-based adjacency matrix. From the results, we can conclude that some neighbors play a more important role than others in cancer subtype analyses of a particular sample based on the attention coefficient. MDPI 2022-09-06 /pmc/articles/PMC9499656/ /pubmed/36142140 http://dx.doi.org/10.3390/ijms231810220 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Baul, Sudipto Ahmed, Khandakar Tanvir Filipek, Joseph Zhang, Wei omicsGAT: Graph Attention Network for Cancer Subtype Analyses |
title | omicsGAT: Graph Attention Network for Cancer Subtype Analyses |
title_full | omicsGAT: Graph Attention Network for Cancer Subtype Analyses |
title_fullStr | omicsGAT: Graph Attention Network for Cancer Subtype Analyses |
title_full_unstemmed | omicsGAT: Graph Attention Network for Cancer Subtype Analyses |
title_short | omicsGAT: Graph Attention Network for Cancer Subtype Analyses |
title_sort | omicsgat: graph attention network for cancer subtype analyses |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9499656/ https://www.ncbi.nlm.nih.gov/pubmed/36142140 http://dx.doi.org/10.3390/ijms231810220 |
work_keys_str_mv | AT baulsudipto omicsgatgraphattentionnetworkforcancersubtypeanalyses AT ahmedkhandakartanvir omicsgatgraphattentionnetworkforcancersubtypeanalyses AT filipekjoseph omicsgatgraphattentionnetworkforcancersubtypeanalyses AT zhangwei omicsgatgraphattentionnetworkforcancersubtypeanalyses |