Cargando…
GLIDE: combining local methods and diffusion state embeddings to predict missing interactions in biological networks
MOTIVATION: One of the core problems in the analysis of biological networks is the link prediction problem. In particular, existing interactions networks are noisy and incomplete snapshots of the true network, with many true links missing because those interactions have not yet been experimentally o...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355260/ https://www.ncbi.nlm.nih.gov/pubmed/32657369 http://dx.doi.org/10.1093/bioinformatics/btaa459 |
_version_ | 1783558239236915200 |
---|---|
author | Devkota, Kapil Murphy, James M Cowen, Lenore J |
author_facet | Devkota, Kapil Murphy, James M Cowen, Lenore J |
author_sort | Devkota, Kapil |
collection | PubMed |
description | MOTIVATION: One of the core problems in the analysis of biological networks is the link prediction problem. In particular, existing interactions networks are noisy and incomplete snapshots of the true network, with many true links missing because those interactions have not yet been experimentally observed. Methods to predict missing links have been more extensively studied for social than for biological networks; it was recently argued that there is some special structure in protein–protein interaction (PPI) network data that might mean that alternate methods may outperform the best methods for social networks. Based on a generalization of the diffusion state distance, we design a new embedding-based link prediction method called global and local integrated diffusion embedding (GLIDE). GLIDE is designed to effectively capture global network structure, combined with alternative network type-specific customized measures that capture local network structure. We test GLIDE on a collection of three recently curated human biological networks derived from the 2016 DREAM disease module identification challenge as well as a classical version of the yeast PPI network in rigorous cross validation experiments. RESULTS: We indeed find that different local network structure is dominant in different types of biological networks. We find that the simple local network measures are dominant in the highly connected network core between hub genes, but that GLIDE’s global embedding measure adds value in the rest of the network. For example, we make GLIDE-based link predictions from genes known to be involved in Crohn’s disease, to genes that are not known to have an association, and make some new predictions, finding support in other network data and the literature. AVAILABILITY AND IMPLEMENTATION: GLIDE can be downloaded at https://bitbucket.org/kap_devkota/glide. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-7355260 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-73552602020-07-16 GLIDE: combining local methods and diffusion state embeddings to predict missing interactions in biological networks Devkota, Kapil Murphy, James M Cowen, Lenore J Bioinformatics Systems Biology and Networks MOTIVATION: One of the core problems in the analysis of biological networks is the link prediction problem. In particular, existing interactions networks are noisy and incomplete snapshots of the true network, with many true links missing because those interactions have not yet been experimentally observed. Methods to predict missing links have been more extensively studied for social than for biological networks; it was recently argued that there is some special structure in protein–protein interaction (PPI) network data that might mean that alternate methods may outperform the best methods for social networks. Based on a generalization of the diffusion state distance, we design a new embedding-based link prediction method called global and local integrated diffusion embedding (GLIDE). GLIDE is designed to effectively capture global network structure, combined with alternative network type-specific customized measures that capture local network structure. We test GLIDE on a collection of three recently curated human biological networks derived from the 2016 DREAM disease module identification challenge as well as a classical version of the yeast PPI network in rigorous cross validation experiments. RESULTS: We indeed find that different local network structure is dominant in different types of biological networks. We find that the simple local network measures are dominant in the highly connected network core between hub genes, but that GLIDE’s global embedding measure adds value in the rest of the network. For example, we make GLIDE-based link predictions from genes known to be involved in Crohn’s disease, to genes that are not known to have an association, and make some new predictions, finding support in other network data and the literature. AVAILABILITY AND IMPLEMENTATION: GLIDE can be downloaded at https://bitbucket.org/kap_devkota/glide. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-07 2020-07-13 /pmc/articles/PMC7355260/ /pubmed/32657369 http://dx.doi.org/10.1093/bioinformatics/btaa459 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Systems Biology and Networks Devkota, Kapil Murphy, James M Cowen, Lenore J GLIDE: combining local methods and diffusion state embeddings to predict missing interactions in biological networks |
title | GLIDE: combining local methods and diffusion state embeddings to predict missing interactions in biological networks |
title_full | GLIDE: combining local methods and diffusion state embeddings to predict missing interactions in biological networks |
title_fullStr | GLIDE: combining local methods and diffusion state embeddings to predict missing interactions in biological networks |
title_full_unstemmed | GLIDE: combining local methods and diffusion state embeddings to predict missing interactions in biological networks |
title_short | GLIDE: combining local methods and diffusion state embeddings to predict missing interactions in biological networks |
title_sort | glide: combining local methods and diffusion state embeddings to predict missing interactions in biological networks |
topic | Systems Biology and Networks |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355260/ https://www.ncbi.nlm.nih.gov/pubmed/32657369 http://dx.doi.org/10.1093/bioinformatics/btaa459 |
work_keys_str_mv | AT devkotakapil glidecombininglocalmethodsanddiffusionstateembeddingstopredictmissinginteractionsinbiologicalnetworks AT murphyjamesm glidecombininglocalmethodsanddiffusionstateembeddingstopredictmissinginteractionsinbiologicalnetworks AT cowenlenorej glidecombininglocalmethodsanddiffusionstateembeddingstopredictmissinginteractionsinbiologicalnetworks |