Cargando…

GLIDER: function prediction from GLIDE-based neighborhoods

MOTIVATION: Protein function prediction, based on the patterns of connection in a protein–protein interaction (or association) network, is perhaps the most studied of the classical, fundamental inference problems for biological networks. A highly successful set of recent approaches use random walk-b...

Descripción completa

Detalles Bibliográficos
Autores principales: Devkota, Kapil, Schmidt, Henri, Werenski, Matt, Murphy, James M, Erden, Mert, Arsenescu, Victor, Cowen, Lenore J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9237677/
https://www.ncbi.nlm.nih.gov/pubmed/35575379
http://dx.doi.org/10.1093/bioinformatics/btac322
_version_ 1784736853872607232
author Devkota, Kapil
Schmidt, Henri
Werenski, Matt
Murphy, James M
Erden, Mert
Arsenescu, Victor
Cowen, Lenore J
author_facet Devkota, Kapil
Schmidt, Henri
Werenski, Matt
Murphy, James M
Erden, Mert
Arsenescu, Victor
Cowen, Lenore J
author_sort Devkota, Kapil
collection PubMed
description MOTIVATION: Protein function prediction, based on the patterns of connection in a protein–protein interaction (or association) network, is perhaps the most studied of the classical, fundamental inference problems for biological networks. A highly successful set of recent approaches use random walk-based low-dimensional embeddings that tend to place functionally similar proteins into coherent spatial regions. However, these approaches lose valuable local graph structure from the network when considering only the embedding. We introduce GLIDER, a method that replaces a protein–protein interaction or association network with a new graph-based similarity network. GLIDER is based on a variant of our previous GLIDE method, which was designed to predict missing links in protein–protein association networks, capturing implicit local and global (i.e. embedding-based) graph properties. RESULTS: GLIDER outperforms competing methods on the task of predicting GO functional labels in cross-validation on a heterogeneous collection of four human protein–protein association networks derived from the 2016 DREAM Disease Module Identification Challenge, and also on three different protein–protein association networks built from the STRING database. We show that this is due to the strong functional enrichment that is present in the local GLIDER neighborhood in multiple different types of protein–protein association networks. Furthermore, we introduce the GLIDER graph neighborhood as a way for biologists to visualize the local neighborhood of a disease gene. As an application, we look at the local GLIDER neighborhoods of a set of known Parkinson’s Disease GWAS genes, rediscover many genes which have known involvement in Parkinson’s disease pathways, plus suggest some new genes to study. AVAILABILITY AND IMPLEMENTATION: All code is publicly available and can be accessed here: https://github.com/kap-devkota/GLIDER. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9237677
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-92376772022-06-29 GLIDER: function prediction from GLIDE-based neighborhoods Devkota, Kapil Schmidt, Henri Werenski, Matt Murphy, James M Erden, Mert Arsenescu, Victor Cowen, Lenore J Bioinformatics Original Papers MOTIVATION: Protein function prediction, based on the patterns of connection in a protein–protein interaction (or association) network, is perhaps the most studied of the classical, fundamental inference problems for biological networks. A highly successful set of recent approaches use random walk-based low-dimensional embeddings that tend to place functionally similar proteins into coherent spatial regions. However, these approaches lose valuable local graph structure from the network when considering only the embedding. We introduce GLIDER, a method that replaces a protein–protein interaction or association network with a new graph-based similarity network. GLIDER is based on a variant of our previous GLIDE method, which was designed to predict missing links in protein–protein association networks, capturing implicit local and global (i.e. embedding-based) graph properties. RESULTS: GLIDER outperforms competing methods on the task of predicting GO functional labels in cross-validation on a heterogeneous collection of four human protein–protein association networks derived from the 2016 DREAM Disease Module Identification Challenge, and also on three different protein–protein association networks built from the STRING database. We show that this is due to the strong functional enrichment that is present in the local GLIDER neighborhood in multiple different types of protein–protein association networks. Furthermore, we introduce the GLIDER graph neighborhood as a way for biologists to visualize the local neighborhood of a disease gene. As an application, we look at the local GLIDER neighborhoods of a set of known Parkinson’s Disease GWAS genes, rediscover many genes which have known involvement in Parkinson’s disease pathways, plus suggest some new genes to study. AVAILABILITY AND IMPLEMENTATION: All code is publicly available and can be accessed here: https://github.com/kap-devkota/GLIDER. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-05-16 /pmc/articles/PMC9237677/ /pubmed/35575379 http://dx.doi.org/10.1093/bioinformatics/btac322 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Devkota, Kapil
Schmidt, Henri
Werenski, Matt
Murphy, James M
Erden, Mert
Arsenescu, Victor
Cowen, Lenore J
GLIDER: function prediction from GLIDE-based neighborhoods
title GLIDER: function prediction from GLIDE-based neighborhoods
title_full GLIDER: function prediction from GLIDE-based neighborhoods
title_fullStr GLIDER: function prediction from GLIDE-based neighborhoods
title_full_unstemmed GLIDER: function prediction from GLIDE-based neighborhoods
title_short GLIDER: function prediction from GLIDE-based neighborhoods
title_sort glider: function prediction from glide-based neighborhoods
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9237677/
https://www.ncbi.nlm.nih.gov/pubmed/35575379
http://dx.doi.org/10.1093/bioinformatics/btac322
work_keys_str_mv AT devkotakapil gliderfunctionpredictionfromglidebasedneighborhoods
AT schmidthenri gliderfunctionpredictionfromglidebasedneighborhoods
AT werenskimatt gliderfunctionpredictionfromglidebasedneighborhoods
AT murphyjamesm gliderfunctionpredictionfromglidebasedneighborhoods
AT erdenmert gliderfunctionpredictionfromglidebasedneighborhoods
AT arsenescuvictor gliderfunctionpredictionfromglidebasedneighborhoods
AT cowenlenorej gliderfunctionpredictionfromglidebasedneighborhoods