BERTwalk for integrating gene networks to predict gene- to pathway-level properties

Bibliographic Details
Main authors: Nasser, Rami; Sharan, Roded
Format: Online Article Text
Language: English
Published: Oxford University Press 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10336298/
https://www.ncbi.nlm.nih.gov/pubmed/37448813
http://dx.doi.org/10.1093/bioadv/vbad086
_version_ 1785071180337643520
author Nasser, Rami
Sharan, Roded
author_facet Nasser, Rami
Sharan, Roded
author_sort Nasser, Rami
collection PubMed
description MOTIVATION: Graph representation learning is a fundamental problem in the field of data science with applications to integrative analysis of biological networks. Previous work in this domain was mostly limited to shallow representation techniques. A recent deep representation technique, BIONIC, has achieved state-of-the-art results in a variety of tasks but used arbitrarily defined components. RESULTS: Here, we present BERTwalk, an unsupervised learning scheme that combines the BERT masked language model with a network propagation regularization for graph representation learning. The transformation from networks to texts allows our method to naturally integrate different networks and provide features that inform not only nodes or edges but also pathway-level properties. We show that our BERTwalk model outperforms BIONIC, as well as four other recent methods, on two comprehensive benchmarks in yeast and human. We further show that our model can be utilized to infer functional pathways and their effects. AVAILABILITY AND IMPLEMENTATION: Code and data are available at https://github.com/raminass/BERTwalk. CONTACT: roded@tauex.tau.ac.il
format Online
Article
Text
id pubmed-10336298
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-10336298 2023-07-13 BERTwalk for integrating gene networks to predict gene- to pathway-level properties Nasser, Rami Sharan, Roded Bioinform Adv Original Article MOTIVATION: Graph representation learning is a fundamental problem in the field of data science with applications to integrative analysis of biological networks. Previous work in this domain was mostly limited to shallow representation techniques. A recent deep representation technique, BIONIC, has achieved state-of-the-art results in a variety of tasks but used arbitrarily defined components. RESULTS: Here, we present BERTwalk, an unsupervised learning scheme that combines the BERT masked language model with a network propagation regularization for graph representation learning. The transformation from networks to texts allows our method to naturally integrate different networks and provide features that inform not only nodes or edges but also pathway-level properties. We show that our BERTwalk model outperforms BIONIC, as well as four other recent methods, on two comprehensive benchmarks in yeast and human. We further show that our model can be utilized to infer functional pathways and their effects. AVAILABILITY AND IMPLEMENTATION: Code and data are available at https://github.com/raminass/BERTwalk. CONTACT: roded@tauex.tau.ac.il Oxford University Press 2023-07-03 /pmc/articles/PMC10336298/ /pubmed/37448813 http://dx.doi.org/10.1093/bioadv/vbad086 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Nasser, Rami
Sharan, Roded
BERTwalk for integrating gene networks to predict gene- to pathway-level properties
title BERTwalk for integrating gene networks to predict gene- to pathway-level properties
title_full BERTwalk for integrating gene networks to predict gene- to pathway-level properties
title_fullStr BERTwalk for integrating gene networks to predict gene- to pathway-level properties
title_full_unstemmed BERTwalk for integrating gene networks to predict gene- to pathway-level properties
title_short BERTwalk for integrating gene networks to predict gene- to pathway-level properties
title_sort bertwalk for integrating gene networks to predict gene- to pathway-level properties
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10336298/
https://www.ncbi.nlm.nih.gov/pubmed/37448813
http://dx.doi.org/10.1093/bioadv/vbad086
work_keys_str_mv AT nasserrami bertwalkforintegratinggenenetworkstopredictgenetopathwaylevelproperties
AT sharanroded bertwalkforintegratinggenenetworkstopredictgenetopathwaylevelproperties