Cargando…

Hetnet connectivity search provides rapid insights into how two biomedical entities are related

Hetnets, short for “heterogeneous networks”, contain multiple node and relationship types and offer a way to encode biomedical knowledge. One such example, Hetionet connects 11 types of nodes — including genes, diseases, drugs, pathways, and anatomical structures — with over 2 million edges of 24 ty...

Descripción completa

Detalles Bibliográficos
Autores principales: Himmelstein, Daniel S., Zietz, Michael, Rubinetti, Vincent, Kloster, Kyle, Heil, Benjamin J., Alquaddoomi, Faisal, Hu, Dongbo, Nicholson, David N., Hao, Yun, Sullivan, Blair D., Nagle, Michael W., Greene, Casey S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882000/
https://www.ncbi.nlm.nih.gov/pubmed/36711546
http://dx.doi.org/10.1101/2023.01.05.522941
_version_ 1784879222784786432
author Himmelstein, Daniel S.
Zietz, Michael
Rubinetti, Vincent
Kloster, Kyle
Heil, Benjamin J.
Alquaddoomi, Faisal
Hu, Dongbo
Nicholson, David N.
Hao, Yun
Sullivan, Blair D.
Nagle, Michael W.
Greene, Casey S.
author_facet Himmelstein, Daniel S.
Zietz, Michael
Rubinetti, Vincent
Kloster, Kyle
Heil, Benjamin J.
Alquaddoomi, Faisal
Hu, Dongbo
Nicholson, David N.
Hao, Yun
Sullivan, Blair D.
Nagle, Michael W.
Greene, Casey S.
author_sort Himmelstein, Daniel S.
collection PubMed
description Hetnets, short for “heterogeneous networks”, contain multiple node and relationship types and offer a way to encode biomedical knowledge. One such example, Hetionet connects 11 types of nodes — including genes, diseases, drugs, pathways, and anatomical structures — with over 2 million edges of 24 types. Previous work has demonstrated that supervised machine learning methods applied to such networks can identify drug repurposing opportunities. However, a training set of known relationships does not exist for many types of node pairs, even when it would be useful to examine how nodes of those types are meaningfully connected. For example, users may be curious not only how metformin is related to breast cancer, but also how the GJA1 gene might be involved in insomnia. We developed a new procedure, termed hetnet connectivity search, that proposes important paths between any two nodes without requiring a supervised gold standard. The algorithm behind connectivity search identifies types of paths that occur more frequently than would be expected by chance (based on node degree alone). We find that predictions are broadly similar to those from previously described supervised approaches for certain node type pairs. Scoring of individual paths is based on the most specific paths of a given type. Several optimizations were required to precompute significant instances of node connectivity at the scale of large knowledge graphs. We implemented the method on Hetionet and provide an online interface at https://het.io/search. We provide an open source implementation of these methods in our new Python package named hetmatpy.
format Online
Article
Text
id pubmed-9882000
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-98820002023-01-28 Hetnet connectivity search provides rapid insights into how two biomedical entities are related Himmelstein, Daniel S. Zietz, Michael Rubinetti, Vincent Kloster, Kyle Heil, Benjamin J. Alquaddoomi, Faisal Hu, Dongbo Nicholson, David N. Hao, Yun Sullivan, Blair D. Nagle, Michael W. Greene, Casey S. bioRxiv Article Hetnets, short for “heterogeneous networks”, contain multiple node and relationship types and offer a way to encode biomedical knowledge. One such example, Hetionet connects 11 types of nodes — including genes, diseases, drugs, pathways, and anatomical structures — with over 2 million edges of 24 types. Previous work has demonstrated that supervised machine learning methods applied to such networks can identify drug repurposing opportunities. However, a training set of known relationships does not exist for many types of node pairs, even when it would be useful to examine how nodes of those types are meaningfully connected. For example, users may be curious not only how metformin is related to breast cancer, but also how the GJA1 gene might be involved in insomnia. We developed a new procedure, termed hetnet connectivity search, that proposes important paths between any two nodes without requiring a supervised gold standard. The algorithm behind connectivity search identifies types of paths that occur more frequently than would be expected by chance (based on node degree alone). We find that predictions are broadly similar to those from previously described supervised approaches for certain node type pairs. Scoring of individual paths is based on the most specific paths of a given type. Several optimizations were required to precompute significant instances of node connectivity at the scale of large knowledge graphs. We implemented the method on Hetionet and provide an online interface at https://het.io/search. We provide an open source implementation of these methods in our new Python package named hetmatpy. Cold Spring Harbor Laboratory 2023-01-07 /pmc/articles/PMC9882000/ /pubmed/36711546 http://dx.doi.org/10.1101/2023.01.05.522941 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Himmelstein, Daniel S.
Zietz, Michael
Rubinetti, Vincent
Kloster, Kyle
Heil, Benjamin J.
Alquaddoomi, Faisal
Hu, Dongbo
Nicholson, David N.
Hao, Yun
Sullivan, Blair D.
Nagle, Michael W.
Greene, Casey S.
Hetnet connectivity search provides rapid insights into how two biomedical entities are related
title Hetnet connectivity search provides rapid insights into how two biomedical entities are related
title_full Hetnet connectivity search provides rapid insights into how two biomedical entities are related
title_fullStr Hetnet connectivity search provides rapid insights into how two biomedical entities are related
title_full_unstemmed Hetnet connectivity search provides rapid insights into how two biomedical entities are related
title_short Hetnet connectivity search provides rapid insights into how two biomedical entities are related
title_sort hetnet connectivity search provides rapid insights into how two biomedical entities are related
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882000/
https://www.ncbi.nlm.nih.gov/pubmed/36711546
http://dx.doi.org/10.1101/2023.01.05.522941
work_keys_str_mv AT himmelsteindaniels hetnetconnectivitysearchprovidesrapidinsightsintohowtwobiomedicalentitiesarerelated
AT zietzmichael hetnetconnectivitysearchprovidesrapidinsightsintohowtwobiomedicalentitiesarerelated
AT rubinettivincent hetnetconnectivitysearchprovidesrapidinsightsintohowtwobiomedicalentitiesarerelated
AT klosterkyle hetnetconnectivitysearchprovidesrapidinsightsintohowtwobiomedicalentitiesarerelated
AT heilbenjaminj hetnetconnectivitysearchprovidesrapidinsightsintohowtwobiomedicalentitiesarerelated
AT alquaddoomifaisal hetnetconnectivitysearchprovidesrapidinsightsintohowtwobiomedicalentitiesarerelated
AT hudongbo hetnetconnectivitysearchprovidesrapidinsightsintohowtwobiomedicalentitiesarerelated
AT nicholsondavidn hetnetconnectivitysearchprovidesrapidinsightsintohowtwobiomedicalentitiesarerelated
AT haoyun hetnetconnectivitysearchprovidesrapidinsightsintohowtwobiomedicalentitiesarerelated
AT sullivanblaird hetnetconnectivitysearchprovidesrapidinsightsintohowtwobiomedicalentitiesarerelated
AT naglemichaelw hetnetconnectivitysearchprovidesrapidinsightsintohowtwobiomedicalentitiesarerelated
AT greenecaseys hetnetconnectivitysearchprovidesrapidinsightsintohowtwobiomedicalentitiesarerelated