Cargando…
MUNDO: protein function prediction embedded in a multispecies world
MOTIVATION: Leveraging cross-species information in protein function prediction can add significant power to network-based protein function prediction methods, because so much functional information is conserved across at least close scales of evolution. We introduce MUNDO, a new cross-species co-em...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710620/ https://www.ncbi.nlm.nih.gov/pubmed/36699351 http://dx.doi.org/10.1093/bioadv/vbab025 |
_version_ | 1784841405808508928 |
---|---|
author | Arsenescu, Victor Devkota, Kapil Erden, Mert Shpilker, Polina Werenski, Matthew Cowen, Lenore J |
author_facet | Arsenescu, Victor Devkota, Kapil Erden, Mert Shpilker, Polina Werenski, Matthew Cowen, Lenore J |
author_sort | Arsenescu, Victor |
collection | PubMed |
description | MOTIVATION: Leveraging cross-species information in protein function prediction can add significant power to network-based protein function prediction methods, because so much functional information is conserved across at least close scales of evolution. We introduce MUNDO, a new cross-species co-embedding method that combines a single-network embedding method with a co-embedding method to predict functional annotations in a target species, leveraging also functional annotations in a model species network. RESULTS: Across a wide range of parameter choices, MUNDO performs best at predicting annotations in the mouse network, when trained on mouse and human protein–protein interaction (PPI) networks, in the human network, when trained on human and mouse PPIs, and in Baker’s yeast, when trained on Fission and Baker’s yeast, as compared to competitor methods. MUNDO also outperforms all the cross-species methods when predicting in Fission yeast when trained on Fission and Baker’s yeast; however, in this single case, discarding the information from the other species and using annotations from the Fission yeast network alone usually performs best. AVAILABILITY AND IMPLEMENTATION: All code is available and can be accessed here: github.com/v0rtex20k/MUNDO. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. Additional experimental results are on our github site. |
format | Online Article Text |
id | pubmed-9710620 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-97106202023-01-24 MUNDO: protein function prediction embedded in a multispecies world Arsenescu, Victor Devkota, Kapil Erden, Mert Shpilker, Polina Werenski, Matthew Cowen, Lenore J Bioinform Adv Original Article MOTIVATION: Leveraging cross-species information in protein function prediction can add significant power to network-based protein function prediction methods, because so much functional information is conserved across at least close scales of evolution. We introduce MUNDO, a new cross-species co-embedding method that combines a single-network embedding method with a co-embedding method to predict functional annotations in a target species, leveraging also functional annotations in a model species network. RESULTS: Across a wide range of parameter choices, MUNDO performs best at predicting annotations in the mouse network, when trained on mouse and human protein–protein interaction (PPI) networks, in the human network, when trained on human and mouse PPIs, and in Baker’s yeast, when trained on Fission and Baker’s yeast, as compared to competitor methods. MUNDO also outperforms all the cross-species methods when predicting in Fission yeast when trained on Fission and Baker’s yeast; however, in this single case, discarding the information from the other species and using annotations from the Fission yeast network alone usually performs best. AVAILABILITY AND IMPLEMENTATION: All code is available and can be accessed here: github.com/v0rtex20k/MUNDO. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. Additional experimental results are on our github site. Oxford University Press 2021-09-29 /pmc/articles/PMC9710620/ /pubmed/36699351 http://dx.doi.org/10.1093/bioadv/vbab025 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Arsenescu, Victor Devkota, Kapil Erden, Mert Shpilker, Polina Werenski, Matthew Cowen, Lenore J MUNDO: protein function prediction embedded in a multispecies world |
title | MUNDO: protein function prediction embedded in a multispecies world |
title_full | MUNDO: protein function prediction embedded in a multispecies world |
title_fullStr | MUNDO: protein function prediction embedded in a multispecies world |
title_full_unstemmed | MUNDO: protein function prediction embedded in a multispecies world |
title_short | MUNDO: protein function prediction embedded in a multispecies world |
title_sort | mundo: protein function prediction embedded in a multispecies world |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710620/ https://www.ncbi.nlm.nih.gov/pubmed/36699351 http://dx.doi.org/10.1093/bioadv/vbab025 |
work_keys_str_mv | AT arsenescuvictor mundoproteinfunctionpredictionembeddedinamultispeciesworld AT devkotakapil mundoproteinfunctionpredictionembeddedinamultispeciesworld AT erdenmert mundoproteinfunctionpredictionembeddedinamultispeciesworld AT shpilkerpolina mundoproteinfunctionpredictionembeddedinamultispeciesworld AT werenskimatthew mundoproteinfunctionpredictionembeddedinamultispeciesworld AT cowenlenorej mundoproteinfunctionpredictionembeddedinamultispeciesworld |