Cargando…

GLIDE: combining local methods and diffusion state embeddings to predict missing interactions in biological networks

MOTIVATION: One of the core problems in the analysis of biological networks is the link prediction problem. In particular, existing interactions networks are noisy and incomplete snapshots of the true network, with many true links missing because those interactions have not yet been experimentally o...

Descripción completa

Detalles Bibliográficos
Autores principales: Devkota, Kapil, Murphy, James M, Cowen, Lenore J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355260/
https://www.ncbi.nlm.nih.gov/pubmed/32657369
http://dx.doi.org/10.1093/bioinformatics/btaa459
_version_ 1783558239236915200
author Devkota, Kapil
Murphy, James M
Cowen, Lenore J
author_facet Devkota, Kapil
Murphy, James M
Cowen, Lenore J
author_sort Devkota, Kapil
collection PubMed
description MOTIVATION: One of the core problems in the analysis of biological networks is the link prediction problem. In particular, existing interactions networks are noisy and incomplete snapshots of the true network, with many true links missing because those interactions have not yet been experimentally observed. Methods to predict missing links have been more extensively studied for social than for biological networks; it was recently argued that there is some special structure in protein–protein interaction (PPI) network data that might mean that alternate methods may outperform the best methods for social networks. Based on a generalization of the diffusion state distance, we design a new embedding-based link prediction method called global and local integrated diffusion embedding (GLIDE). GLIDE is designed to effectively capture global network structure, combined with alternative network type-specific customized measures that capture local network structure. We test GLIDE on a collection of three recently curated human biological networks derived from the 2016 DREAM disease module identification challenge as well as a classical version of the yeast PPI network in rigorous cross validation experiments. RESULTS: We indeed find that different local network structure is dominant in different types of biological networks. We find that the simple local network measures are dominant in the highly connected network core between hub genes, but that GLIDE’s global embedding measure adds value in the rest of the network. For example, we make GLIDE-based link predictions from genes known to be involved in Crohn’s disease, to genes that are not known to have an association, and make some new predictions, finding support in other network data and the literature. AVAILABILITY AND IMPLEMENTATION: GLIDE can be downloaded at https://bitbucket.org/kap_devkota/glide. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-7355260
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-73552602020-07-16 GLIDE: combining local methods and diffusion state embeddings to predict missing interactions in biological networks Devkota, Kapil Murphy, James M Cowen, Lenore J Bioinformatics Systems Biology and Networks MOTIVATION: One of the core problems in the analysis of biological networks is the link prediction problem. In particular, existing interactions networks are noisy and incomplete snapshots of the true network, with many true links missing because those interactions have not yet been experimentally observed. Methods to predict missing links have been more extensively studied for social than for biological networks; it was recently argued that there is some special structure in protein–protein interaction (PPI) network data that might mean that alternate methods may outperform the best methods for social networks. Based on a generalization of the diffusion state distance, we design a new embedding-based link prediction method called global and local integrated diffusion embedding (GLIDE). GLIDE is designed to effectively capture global network structure, combined with alternative network type-specific customized measures that capture local network structure. We test GLIDE on a collection of three recently curated human biological networks derived from the 2016 DREAM disease module identification challenge as well as a classical version of the yeast PPI network in rigorous cross validation experiments. RESULTS: We indeed find that different local network structure is dominant in different types of biological networks. We find that the simple local network measures are dominant in the highly connected network core between hub genes, but that GLIDE’s global embedding measure adds value in the rest of the network. For example, we make GLIDE-based link predictions from genes known to be involved in Crohn’s disease, to genes that are not known to have an association, and make some new predictions, finding support in other network data and the literature. AVAILABILITY AND IMPLEMENTATION: GLIDE can be downloaded at https://bitbucket.org/kap_devkota/glide. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-07 2020-07-13 /pmc/articles/PMC7355260/ /pubmed/32657369 http://dx.doi.org/10.1093/bioinformatics/btaa459 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Systems Biology and Networks
Devkota, Kapil
Murphy, James M
Cowen, Lenore J
GLIDE: combining local methods and diffusion state embeddings to predict missing interactions in biological networks
title GLIDE: combining local methods and diffusion state embeddings to predict missing interactions in biological networks
title_full GLIDE: combining local methods and diffusion state embeddings to predict missing interactions in biological networks
title_fullStr GLIDE: combining local methods and diffusion state embeddings to predict missing interactions in biological networks
title_full_unstemmed GLIDE: combining local methods and diffusion state embeddings to predict missing interactions in biological networks
title_short GLIDE: combining local methods and diffusion state embeddings to predict missing interactions in biological networks
title_sort glide: combining local methods and diffusion state embeddings to predict missing interactions in biological networks
topic Systems Biology and Networks
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355260/
https://www.ncbi.nlm.nih.gov/pubmed/32657369
http://dx.doi.org/10.1093/bioinformatics/btaa459
work_keys_str_mv AT devkotakapil glidecombininglocalmethodsanddiffusionstateembeddingstopredictmissinginteractionsinbiologicalnetworks
AT murphyjamesm glidecombininglocalmethodsanddiffusionstateembeddingstopredictmissinginteractionsinbiologicalnetworks
AT cowenlenorej glidecombininglocalmethodsanddiffusionstateembeddingstopredictmissinginteractionsinbiologicalnetworks