Cargando…
GLIDER: function prediction from GLIDE-based neighborhoods
MOTIVATION: Protein function prediction, based on the patterns of connection in a protein–protein interaction (or association) network, is perhaps the most studied of the classical, fundamental inference problems for biological networks. A highly successful set of recent approaches use random walk-b...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9237677/ https://www.ncbi.nlm.nih.gov/pubmed/35575379 http://dx.doi.org/10.1093/bioinformatics/btac322 |
_version_ | 1784736853872607232 |
---|---|
author | Devkota, Kapil Schmidt, Henri Werenski, Matt Murphy, James M Erden, Mert Arsenescu, Victor Cowen, Lenore J |
author_facet | Devkota, Kapil Schmidt, Henri Werenski, Matt Murphy, James M Erden, Mert Arsenescu, Victor Cowen, Lenore J |
author_sort | Devkota, Kapil |
collection | PubMed |
description | MOTIVATION: Protein function prediction, based on the patterns of connection in a protein–protein interaction (or association) network, is perhaps the most studied of the classical, fundamental inference problems for biological networks. A highly successful set of recent approaches use random walk-based low-dimensional embeddings that tend to place functionally similar proteins into coherent spatial regions. However, these approaches lose valuable local graph structure from the network when considering only the embedding. We introduce GLIDER, a method that replaces a protein–protein interaction or association network with a new graph-based similarity network. GLIDER is based on a variant of our previous GLIDE method, which was designed to predict missing links in protein–protein association networks, capturing implicit local and global (i.e. embedding-based) graph properties. RESULTS: GLIDER outperforms competing methods on the task of predicting GO functional labels in cross-validation on a heterogeneous collection of four human protein–protein association networks derived from the 2016 DREAM Disease Module Identification Challenge, and also on three different protein–protein association networks built from the STRING database. We show that this is due to the strong functional enrichment that is present in the local GLIDER neighborhood in multiple different types of protein–protein association networks. Furthermore, we introduce the GLIDER graph neighborhood as a way for biologists to visualize the local neighborhood of a disease gene. As an application, we look at the local GLIDER neighborhoods of a set of known Parkinson’s Disease GWAS genes, rediscover many genes which have known involvement in Parkinson’s disease pathways, plus suggest some new genes to study. AVAILABILITY AND IMPLEMENTATION: All code is publicly available and can be accessed here: https://github.com/kap-devkota/GLIDER. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-9237677 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-92376772022-06-29 GLIDER: function prediction from GLIDE-based neighborhoods Devkota, Kapil Schmidt, Henri Werenski, Matt Murphy, James M Erden, Mert Arsenescu, Victor Cowen, Lenore J Bioinformatics Original Papers MOTIVATION: Protein function prediction, based on the patterns of connection in a protein–protein interaction (or association) network, is perhaps the most studied of the classical, fundamental inference problems for biological networks. A highly successful set of recent approaches use random walk-based low-dimensional embeddings that tend to place functionally similar proteins into coherent spatial regions. However, these approaches lose valuable local graph structure from the network when considering only the embedding. We introduce GLIDER, a method that replaces a protein–protein interaction or association network with a new graph-based similarity network. GLIDER is based on a variant of our previous GLIDE method, which was designed to predict missing links in protein–protein association networks, capturing implicit local and global (i.e. embedding-based) graph properties. RESULTS: GLIDER outperforms competing methods on the task of predicting GO functional labels in cross-validation on a heterogeneous collection of four human protein–protein association networks derived from the 2016 DREAM Disease Module Identification Challenge, and also on three different protein–protein association networks built from the STRING database. We show that this is due to the strong functional enrichment that is present in the local GLIDER neighborhood in multiple different types of protein–protein association networks. Furthermore, we introduce the GLIDER graph neighborhood as a way for biologists to visualize the local neighborhood of a disease gene. As an application, we look at the local GLIDER neighborhoods of a set of known Parkinson’s Disease GWAS genes, rediscover many genes which have known involvement in Parkinson’s disease pathways, plus suggest some new genes to study. AVAILABILITY AND IMPLEMENTATION: All code is publicly available and can be accessed here: https://github.com/kap-devkota/GLIDER. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-05-16 /pmc/articles/PMC9237677/ /pubmed/35575379 http://dx.doi.org/10.1093/bioinformatics/btac322 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Devkota, Kapil Schmidt, Henri Werenski, Matt Murphy, James M Erden, Mert Arsenescu, Victor Cowen, Lenore J GLIDER: function prediction from GLIDE-based neighborhoods |
title | GLIDER: function prediction from GLIDE-based neighborhoods |
title_full | GLIDER: function prediction from GLIDE-based neighborhoods |
title_fullStr | GLIDER: function prediction from GLIDE-based neighborhoods |
title_full_unstemmed | GLIDER: function prediction from GLIDE-based neighborhoods |
title_short | GLIDER: function prediction from GLIDE-based neighborhoods |
title_sort | glider: function prediction from glide-based neighborhoods |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9237677/ https://www.ncbi.nlm.nih.gov/pubmed/35575379 http://dx.doi.org/10.1093/bioinformatics/btac322 |
work_keys_str_mv | AT devkotakapil gliderfunctionpredictionfromglidebasedneighborhoods AT schmidthenri gliderfunctionpredictionfromglidebasedneighborhoods AT werenskimatt gliderfunctionpredictionfromglidebasedneighborhoods AT murphyjamesm gliderfunctionpredictionfromglidebasedneighborhoods AT erdenmert gliderfunctionpredictionfromglidebasedneighborhoods AT arsenescuvictor gliderfunctionpredictionfromglidebasedneighborhoods AT cowenlenorej gliderfunctionpredictionfromglidebasedneighborhoods |