Cargando…

MUNDO: protein function prediction embedded in a multispecies world

MOTIVATION: Leveraging cross-species information in protein function prediction can add significant power to network-based protein function prediction methods, because so much functional information is conserved across at least close scales of evolution. We introduce MUNDO, a new cross-species co-em...

Descripción completa

Detalles Bibliográficos
Autores principales: Arsenescu, Victor, Devkota, Kapil, Erden, Mert, Shpilker, Polina, Werenski, Matthew, Cowen, Lenore J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710620/
https://www.ncbi.nlm.nih.gov/pubmed/36699351
http://dx.doi.org/10.1093/bioadv/vbab025
_version_ 1784841405808508928
author Arsenescu, Victor
Devkota, Kapil
Erden, Mert
Shpilker, Polina
Werenski, Matthew
Cowen, Lenore J
author_facet Arsenescu, Victor
Devkota, Kapil
Erden, Mert
Shpilker, Polina
Werenski, Matthew
Cowen, Lenore J
author_sort Arsenescu, Victor
collection PubMed
description MOTIVATION: Leveraging cross-species information in protein function prediction can add significant power to network-based protein function prediction methods, because so much functional information is conserved across at least close scales of evolution. We introduce MUNDO, a new cross-species co-embedding method that combines a single-network embedding method with a co-embedding method to predict functional annotations in a target species, leveraging also functional annotations in a model species network. RESULTS: Across a wide range of parameter choices, MUNDO performs best at predicting annotations in the mouse network, when trained on mouse and human protein–protein interaction (PPI) networks, in the human network, when trained on human and mouse PPIs, and in Baker’s yeast, when trained on Fission and Baker’s yeast, as compared to competitor methods. MUNDO also outperforms all the cross-species methods when predicting in Fission yeast when trained on Fission and Baker’s yeast; however, in this single case, discarding the information from the other species and using annotations from the Fission yeast network alone usually performs best. AVAILABILITY AND IMPLEMENTATION: All code is available and can be accessed here: github.com/v0rtex20k/MUNDO. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. Additional experimental results are on our github site.
format Online
Article
Text
id pubmed-9710620
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-97106202023-01-24 MUNDO: protein function prediction embedded in a multispecies world Arsenescu, Victor Devkota, Kapil Erden, Mert Shpilker, Polina Werenski, Matthew Cowen, Lenore J Bioinform Adv Original Article MOTIVATION: Leveraging cross-species information in protein function prediction can add significant power to network-based protein function prediction methods, because so much functional information is conserved across at least close scales of evolution. We introduce MUNDO, a new cross-species co-embedding method that combines a single-network embedding method with a co-embedding method to predict functional annotations in a target species, leveraging also functional annotations in a model species network. RESULTS: Across a wide range of parameter choices, MUNDO performs best at predicting annotations in the mouse network, when trained on mouse and human protein–protein interaction (PPI) networks, in the human network, when trained on human and mouse PPIs, and in Baker’s yeast, when trained on Fission and Baker’s yeast, as compared to competitor methods. MUNDO also outperforms all the cross-species methods when predicting in Fission yeast when trained on Fission and Baker’s yeast; however, in this single case, discarding the information from the other species and using annotations from the Fission yeast network alone usually performs best. AVAILABILITY AND IMPLEMENTATION: All code is available and can be accessed here: github.com/v0rtex20k/MUNDO. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. Additional experimental results are on our github site. Oxford University Press 2021-09-29 /pmc/articles/PMC9710620/ /pubmed/36699351 http://dx.doi.org/10.1093/bioadv/vbab025 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Arsenescu, Victor
Devkota, Kapil
Erden, Mert
Shpilker, Polina
Werenski, Matthew
Cowen, Lenore J
MUNDO: protein function prediction embedded in a multispecies world
title MUNDO: protein function prediction embedded in a multispecies world
title_full MUNDO: protein function prediction embedded in a multispecies world
title_fullStr MUNDO: protein function prediction embedded in a multispecies world
title_full_unstemmed MUNDO: protein function prediction embedded in a multispecies world
title_short MUNDO: protein function prediction embedded in a multispecies world
title_sort mundo: protein function prediction embedded in a multispecies world
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710620/
https://www.ncbi.nlm.nih.gov/pubmed/36699351
http://dx.doi.org/10.1093/bioadv/vbab025
work_keys_str_mv AT arsenescuvictor mundoproteinfunctionpredictionembeddedinamultispeciesworld
AT devkotakapil mundoproteinfunctionpredictionembeddedinamultispeciesworld
AT erdenmert mundoproteinfunctionpredictionembeddedinamultispeciesworld
AT shpilkerpolina mundoproteinfunctionpredictionembeddedinamultispeciesworld
AT werenskimatthew mundoproteinfunctionpredictionembeddedinamultispeciesworld
AT cowenlenorej mundoproteinfunctionpredictionembeddedinamultispeciesworld