Cargando…

MUNDO: protein function prediction embedded in a multispecies world

MOTIVATION: Leveraging cross-species information in protein function prediction can add significant power to network-based protein function prediction methods, because so much functional information is conserved across at least close scales of evolution. We introduce MUNDO, a new cross-species co-em...

Descripción completa

Detalles Bibliográficos
Autores principales: Arsenescu, Victor, Devkota, Kapil, Erden, Mert, Shpilker, Polina, Werenski, Matthew, Cowen, Lenore J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710620/
https://www.ncbi.nlm.nih.gov/pubmed/36699351
http://dx.doi.org/10.1093/bioadv/vbab025
Descripción
Sumario:MOTIVATION: Leveraging cross-species information in protein function prediction can add significant power to network-based protein function prediction methods, because so much functional information is conserved across at least close scales of evolution. We introduce MUNDO, a new cross-species co-embedding method that combines a single-network embedding method with a co-embedding method to predict functional annotations in a target species, leveraging also functional annotations in a model species network. RESULTS: Across a wide range of parameter choices, MUNDO performs best at predicting annotations in the mouse network, when trained on mouse and human protein–protein interaction (PPI) networks, in the human network, when trained on human and mouse PPIs, and in Baker’s yeast, when trained on Fission and Baker’s yeast, as compared to competitor methods. MUNDO also outperforms all the cross-species methods when predicting in Fission yeast when trained on Fission and Baker’s yeast; however, in this single case, discarding the information from the other species and using annotations from the Fission yeast network alone usually performs best. AVAILABILITY AND IMPLEMENTATION: All code is available and can be accessed here: github.com/v0rtex20k/MUNDO. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. Additional experimental results are on our github site.