Cargando…

Metapaths: similarity search in heterogeneous knowledge graphs via meta-paths

SUMMARY: Heterogeneous knowledge graphs (KGs) have enabled the modeling of complex systems, from genetic interaction graphs and protein-protein interaction networks to networks representing drugs, diseases, proteins, and side effects. Analytical methods for KGs rely on quantifying similarities betwe...

Descripción completa

Detalles Bibliográficos
Autores principales: Noori, Ayush, Li, Michelle M, Tan, Amelia L M, Zitnik, Marinka
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10209523/
https://www.ncbi.nlm.nih.gov/pubmed/37140542
http://dx.doi.org/10.1093/bioinformatics/btad297
_version_ 1785046893630324736
author Noori, Ayush
Li, Michelle M
Tan, Amelia L M
Zitnik, Marinka
author_facet Noori, Ayush
Li, Michelle M
Tan, Amelia L M
Zitnik, Marinka
author_sort Noori, Ayush
collection PubMed
description SUMMARY: Heterogeneous knowledge graphs (KGs) have enabled the modeling of complex systems, from genetic interaction graphs and protein-protein interaction networks to networks representing drugs, diseases, proteins, and side effects. Analytical methods for KGs rely on quantifying similarities between entities, such as nodes, in the graph. However, such methods must consider the diversity of node and edge types contained within the KG via, for example, defined sequences of entity types known as meta-paths. We present metapaths, the first R software package to implement meta-paths and perform meta-path-based similarity search in heterogeneous KGs. The metapaths package offers various built-in similarity metrics for node pair comparison by querying KGs represented as either edge or adjacency lists, as well as auxiliary aggregation methods to measure set-level relationships. Indeed, evaluation of these methods on an open-source biomedical KG recovered meaningful drug and disease-associated relationships, including those in Alzheimer’s disease. The metapaths framework facilitates the scalable and flexible modeling of network similarities in KGs with applications across KG learning. AVAILABILITY AND IMPLEMENTATION: The metapaths R package is available via GitHub at https://github.com/ayushnoori/metapaths and is released under MPL 2.0 (Zenodo DOI: 10.5281/zenodo.7047209). Package documentation and usage examples are available at https://www.ayushnoori.com/metapaths.
format Online
Article
Text
id pubmed-10209523
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-102095232023-05-26 Metapaths: similarity search in heterogeneous knowledge graphs via meta-paths Noori, Ayush Li, Michelle M Tan, Amelia L M Zitnik, Marinka Bioinformatics Applications Note SUMMARY: Heterogeneous knowledge graphs (KGs) have enabled the modeling of complex systems, from genetic interaction graphs and protein-protein interaction networks to networks representing drugs, diseases, proteins, and side effects. Analytical methods for KGs rely on quantifying similarities between entities, such as nodes, in the graph. However, such methods must consider the diversity of node and edge types contained within the KG via, for example, defined sequences of entity types known as meta-paths. We present metapaths, the first R software package to implement meta-paths and perform meta-path-based similarity search in heterogeneous KGs. The metapaths package offers various built-in similarity metrics for node pair comparison by querying KGs represented as either edge or adjacency lists, as well as auxiliary aggregation methods to measure set-level relationships. Indeed, evaluation of these methods on an open-source biomedical KG recovered meaningful drug and disease-associated relationships, including those in Alzheimer’s disease. The metapaths framework facilitates the scalable and flexible modeling of network similarities in KGs with applications across KG learning. AVAILABILITY AND IMPLEMENTATION: The metapaths R package is available via GitHub at https://github.com/ayushnoori/metapaths and is released under MPL 2.0 (Zenodo DOI: 10.5281/zenodo.7047209). Package documentation and usage examples are available at https://www.ayushnoori.com/metapaths. Oxford University Press 2023-05-04 /pmc/articles/PMC10209523/ /pubmed/37140542 http://dx.doi.org/10.1093/bioinformatics/btad297 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Noori, Ayush
Li, Michelle M
Tan, Amelia L M
Zitnik, Marinka
Metapaths: similarity search in heterogeneous knowledge graphs via meta-paths
title Metapaths: similarity search in heterogeneous knowledge graphs via meta-paths
title_full Metapaths: similarity search in heterogeneous knowledge graphs via meta-paths
title_fullStr Metapaths: similarity search in heterogeneous knowledge graphs via meta-paths
title_full_unstemmed Metapaths: similarity search in heterogeneous knowledge graphs via meta-paths
title_short Metapaths: similarity search in heterogeneous knowledge graphs via meta-paths
title_sort metapaths: similarity search in heterogeneous knowledge graphs via meta-paths
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10209523/
https://www.ncbi.nlm.nih.gov/pubmed/37140542
http://dx.doi.org/10.1093/bioinformatics/btad297
work_keys_str_mv AT nooriayush metapathssimilaritysearchinheterogeneousknowledgegraphsviametapaths
AT limichellem metapathssimilaritysearchinheterogeneousknowledgegraphsviametapaths
AT tanamelialm metapathssimilaritysearchinheterogeneousknowledgegraphsviametapaths
AT zitnikmarinka metapathssimilaritysearchinheterogeneousknowledgegraphsviametapaths