Cargando…

Functional protein representations from biological networks enable diverse cross-species inference

Transferring knowledge between species is key for many biological applications, but is complicated by divergent and convergent evolution. Many current approaches for this problem leverage sequence and interaction network data to transfer knowledge across species, exemplified by network alignment met...

Descripción completa

Detalles Bibliográficos
Autores principales: Fan, Jason, Cannistra, Anthony, Fried, Inbar, Lim, Tim, Schaffner, Thomas, Crovella, Mark, Hescott, Benjamin, Leiserson, Mark D M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6511848/
https://www.ncbi.nlm.nih.gov/pubmed/30847485
http://dx.doi.org/10.1093/nar/gkz132
_version_ 1783417611484135424
author Fan, Jason
Cannistra, Anthony
Fried, Inbar
Lim, Tim
Schaffner, Thomas
Crovella, Mark
Hescott, Benjamin
Leiserson, Mark D M
author_facet Fan, Jason
Cannistra, Anthony
Fried, Inbar
Lim, Tim
Schaffner, Thomas
Crovella, Mark
Hescott, Benjamin
Leiserson, Mark D M
author_sort Fan, Jason
collection PubMed
description Transferring knowledge between species is key for many biological applications, but is complicated by divergent and convergent evolution. Many current approaches for this problem leverage sequence and interaction network data to transfer knowledge across species, exemplified by network alignment methods. While these techniques do well, they are limited in scope, creating metrics to address one specific problem or task. We take a different approach by creating an environment where multiple knowledge transfer tasks can be performed using the same protein representations. Specifically, our kernel-based method, MUNK, integrates sequence and network structure to create functional protein representations, embedding proteins from different species in the same vector space. First we show proteins in different species that are close in MUNK-space are functionally similar. Next, we use these representations to share knowledge of synthetic lethal interactions between species. Importantly, we find that the results using MUNK-representations are at least as accurate as existing algorithms for these tasks. Finally, we generalize the notion of a phenolog (‘orthologous phenotype’) to use functionally similar proteins (i.e. those with similar representations). We demonstrate the utility of this broadened notion by using it to identify known phenologs and novel non-obvious ones supported by current research.
format Online
Article
Text
id pubmed-6511848
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-65118482019-05-20 Functional protein representations from biological networks enable diverse cross-species inference Fan, Jason Cannistra, Anthony Fried, Inbar Lim, Tim Schaffner, Thomas Crovella, Mark Hescott, Benjamin Leiserson, Mark D M Nucleic Acids Res Methods Online Transferring knowledge between species is key for many biological applications, but is complicated by divergent and convergent evolution. Many current approaches for this problem leverage sequence and interaction network data to transfer knowledge across species, exemplified by network alignment methods. While these techniques do well, they are limited in scope, creating metrics to address one specific problem or task. We take a different approach by creating an environment where multiple knowledge transfer tasks can be performed using the same protein representations. Specifically, our kernel-based method, MUNK, integrates sequence and network structure to create functional protein representations, embedding proteins from different species in the same vector space. First we show proteins in different species that are close in MUNK-space are functionally similar. Next, we use these representations to share knowledge of synthetic lethal interactions between species. Importantly, we find that the results using MUNK-representations are at least as accurate as existing algorithms for these tasks. Finally, we generalize the notion of a phenolog (‘orthologous phenotype’) to use functionally similar proteins (i.e. those with similar representations). We demonstrate the utility of this broadened notion by using it to identify known phenologs and novel non-obvious ones supported by current research. Oxford University Press 2019-05-21 2019-03-08 /pmc/articles/PMC6511848/ /pubmed/30847485 http://dx.doi.org/10.1093/nar/gkz132 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Fan, Jason
Cannistra, Anthony
Fried, Inbar
Lim, Tim
Schaffner, Thomas
Crovella, Mark
Hescott, Benjamin
Leiserson, Mark D M
Functional protein representations from biological networks enable diverse cross-species inference
title Functional protein representations from biological networks enable diverse cross-species inference
title_full Functional protein representations from biological networks enable diverse cross-species inference
title_fullStr Functional protein representations from biological networks enable diverse cross-species inference
title_full_unstemmed Functional protein representations from biological networks enable diverse cross-species inference
title_short Functional protein representations from biological networks enable diverse cross-species inference
title_sort functional protein representations from biological networks enable diverse cross-species inference
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6511848/
https://www.ncbi.nlm.nih.gov/pubmed/30847485
http://dx.doi.org/10.1093/nar/gkz132
work_keys_str_mv AT fanjason functionalproteinrepresentationsfrombiologicalnetworksenablediversecrossspeciesinference
AT cannistraanthony functionalproteinrepresentationsfrombiologicalnetworksenablediversecrossspeciesinference
AT friedinbar functionalproteinrepresentationsfrombiologicalnetworksenablediversecrossspeciesinference
AT limtim functionalproteinrepresentationsfrombiologicalnetworksenablediversecrossspeciesinference
AT schaffnerthomas functionalproteinrepresentationsfrombiologicalnetworksenablediversecrossspeciesinference
AT crovellamark functionalproteinrepresentationsfrombiologicalnetworksenablediversecrossspeciesinference
AT hescottbenjamin functionalproteinrepresentationsfrombiologicalnetworksenablediversecrossspeciesinference
AT leisersonmarkdm functionalproteinrepresentationsfrombiologicalnetworksenablediversecrossspeciesinference