Cargando…

Data-driven network alignment

In this study, we deal with the problem of biological network alignment (NA), which aims to find a node mapping between species’ molecular networks that uncovers similar network regions, thus allowing for the transfer of functional knowledge between the aligned nodes. We provide evidence that curren...

Descripción completa

Detalles Bibliográficos
Autores principales: Gu, Shawn, Milenković, Tijana
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7331999/
https://www.ncbi.nlm.nih.gov/pubmed/32614833
http://dx.doi.org/10.1371/journal.pone.0234978
_version_ 1783553438584406016
author Gu, Shawn
Milenković, Tijana
author_facet Gu, Shawn
Milenković, Tijana
author_sort Gu, Shawn
collection PubMed
description In this study, we deal with the problem of biological network alignment (NA), which aims to find a node mapping between species’ molecular networks that uncovers similar network regions, thus allowing for the transfer of functional knowledge between the aligned nodes. We provide evidence that current NA methods, which assume that topologically similar nodes (i.e., nodes whose network neighborhoods are isomorphic-like) have high functional relatedness, do not actually end up aligning functionally related nodes. That is, we show that the current topological similarity assumption does not hold well. Consequently, we argue that a paradigm shift is needed with how the NA problem is approached. So, we redefine NA as a data-driven framework, called TARA (data-driven NA), which attempts to learn the relationship between topological relatedness and functional relatedness without assuming that topological relatedness corresponds to topological similarity. TARA makes no assumptions about what nodes should be aligned, distinguishing it from existing NA methods. Specifically, TARA trains a classifier to predict whether two nodes from different networks are functionally related based on their network topological patterns (features). We find that TARA is able to make accurate predictions. TARA then takes each pair of nodes that are predicted as related to be part of an alignment. Like traditional NA methods, TARA uses this alignment for the across-species transfer of functional knowledge. TARA as currently implemented uses topological but not protein sequence information for functional knowledge transfer. In this context, we find that TARA outperforms existing state-of-the-art NA methods that also use topological information, WAVE and SANA, and even outperforms or complements a state-of-the-art NA method that uses both topological and sequence information, PrimAlign. Hence, adding sequence information to TARA, which is our future work, is likely to further improve its performance. The software and data are available at http://www.nd.edu/~cone/TARA/.
format Online
Article
Text
id pubmed-7331999
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-73319992020-07-14 Data-driven network alignment Gu, Shawn Milenković, Tijana PLoS One Research Article In this study, we deal with the problem of biological network alignment (NA), which aims to find a node mapping between species’ molecular networks that uncovers similar network regions, thus allowing for the transfer of functional knowledge between the aligned nodes. We provide evidence that current NA methods, which assume that topologically similar nodes (i.e., nodes whose network neighborhoods are isomorphic-like) have high functional relatedness, do not actually end up aligning functionally related nodes. That is, we show that the current topological similarity assumption does not hold well. Consequently, we argue that a paradigm shift is needed with how the NA problem is approached. So, we redefine NA as a data-driven framework, called TARA (data-driven NA), which attempts to learn the relationship between topological relatedness and functional relatedness without assuming that topological relatedness corresponds to topological similarity. TARA makes no assumptions about what nodes should be aligned, distinguishing it from existing NA methods. Specifically, TARA trains a classifier to predict whether two nodes from different networks are functionally related based on their network topological patterns (features). We find that TARA is able to make accurate predictions. TARA then takes each pair of nodes that are predicted as related to be part of an alignment. Like traditional NA methods, TARA uses this alignment for the across-species transfer of functional knowledge. TARA as currently implemented uses topological but not protein sequence information for functional knowledge transfer. In this context, we find that TARA outperforms existing state-of-the-art NA methods that also use topological information, WAVE and SANA, and even outperforms or complements a state-of-the-art NA method that uses both topological and sequence information, PrimAlign. Hence, adding sequence information to TARA, which is our future work, is likely to further improve its performance. The software and data are available at http://www.nd.edu/~cone/TARA/. Public Library of Science 2020-07-02 /pmc/articles/PMC7331999/ /pubmed/32614833 http://dx.doi.org/10.1371/journal.pone.0234978 Text en © 2020 Gu, Milenković http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Gu, Shawn
Milenković, Tijana
Data-driven network alignment
title Data-driven network alignment
title_full Data-driven network alignment
title_fullStr Data-driven network alignment
title_full_unstemmed Data-driven network alignment
title_short Data-driven network alignment
title_sort data-driven network alignment
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7331999/
https://www.ncbi.nlm.nih.gov/pubmed/32614833
http://dx.doi.org/10.1371/journal.pone.0234978
work_keys_str_mv AT gushawn datadrivennetworkalignment
AT milenkovictijana datadrivennetworkalignment