Cargando…

Normalized L3-based link prediction in protein–protein interaction networks

BACKGROUND: Protein–protein interaction (PPI) data is an important type of data used in functional genomics. However, high-throughput experiments are often insufficient to complete the PPI interactome of different organisms. Computational techniques are thus used to infer missing data, with link pre...

Descripción completa

Detalles Bibliográficos
Autores principales: Yuen, Ho Yin, Jansson, Jesper
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9945744/
https://www.ncbi.nlm.nih.gov/pubmed/36814208
http://dx.doi.org/10.1186/s12859-023-05178-3
_version_ 1784892200097677312
author Yuen, Ho Yin
Jansson, Jesper
author_facet Yuen, Ho Yin
Jansson, Jesper
author_sort Yuen, Ho Yin
collection PubMed
description BACKGROUND: Protein–protein interaction (PPI) data is an important type of data used in functional genomics. However, high-throughput experiments are often insufficient to complete the PPI interactome of different organisms. Computational techniques are thus used to infer missing data, with link prediction being one such approach that uses the structure of the network of PPIs known so far to identify non-edges whose addition to the network would make it more sound, according to some underlying assumptions. Recently, a new idea called the L3 principle introduced biological motivation into PPI link predictions, yielding predictors that are superior to general-purpose link predictors for complex networks. Interestingly, the L3 principle can be interpreted in another way, so that other signatures of PPI networks can also be characterized for PPI predictions. This alternative interpretation uncovers candidate PPIs that the current L3-based link predictors may not be able to fully capture, underutilizing the L3 principle. RESULTS: In this article, we propose a formulation of link predictors that we call NormalizedL3 (L3N) which addresses certain missing elements within L3 predictors in the perspective of network modeling. Our computational validations show that the L3N predictors are able to find missing PPIs more accurately (in terms of true positives among the predicted PPIs) than the previously proposed methods on several datasets from the literature, including BioGRID, STRING, MINT, and HuRI, at the cost of using more computation time in some of the cases. In addition, we found that L3-based link predictors (including L3N) ranked a different pool of PPIs higher than the general-purpose link predictors did. This suggests that different types of PPIs can be predicted based on different topological assumptions, and that even better PPI link predictors may be obtained in the future by improved network modeling. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05178-3.
format Online
Article
Text
id pubmed-9945744
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-99457442023-02-23 Normalized L3-based link prediction in protein–protein interaction networks Yuen, Ho Yin Jansson, Jesper BMC Bioinformatics Research BACKGROUND: Protein–protein interaction (PPI) data is an important type of data used in functional genomics. However, high-throughput experiments are often insufficient to complete the PPI interactome of different organisms. Computational techniques are thus used to infer missing data, with link prediction being one such approach that uses the structure of the network of PPIs known so far to identify non-edges whose addition to the network would make it more sound, according to some underlying assumptions. Recently, a new idea called the L3 principle introduced biological motivation into PPI link predictions, yielding predictors that are superior to general-purpose link predictors for complex networks. Interestingly, the L3 principle can be interpreted in another way, so that other signatures of PPI networks can also be characterized for PPI predictions. This alternative interpretation uncovers candidate PPIs that the current L3-based link predictors may not be able to fully capture, underutilizing the L3 principle. RESULTS: In this article, we propose a formulation of link predictors that we call NormalizedL3 (L3N) which addresses certain missing elements within L3 predictors in the perspective of network modeling. Our computational validations show that the L3N predictors are able to find missing PPIs more accurately (in terms of true positives among the predicted PPIs) than the previously proposed methods on several datasets from the literature, including BioGRID, STRING, MINT, and HuRI, at the cost of using more computation time in some of the cases. In addition, we found that L3-based link predictors (including L3N) ranked a different pool of PPIs higher than the general-purpose link predictors did. This suggests that different types of PPIs can be predicted based on different topological assumptions, and that even better PPI link predictors may be obtained in the future by improved network modeling. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05178-3. BioMed Central 2023-02-22 /pmc/articles/PMC9945744/ /pubmed/36814208 http://dx.doi.org/10.1186/s12859-023-05178-3 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Yuen, Ho Yin
Jansson, Jesper
Normalized L3-based link prediction in protein–protein interaction networks
title Normalized L3-based link prediction in protein–protein interaction networks
title_full Normalized L3-based link prediction in protein–protein interaction networks
title_fullStr Normalized L3-based link prediction in protein–protein interaction networks
title_full_unstemmed Normalized L3-based link prediction in protein–protein interaction networks
title_short Normalized L3-based link prediction in protein–protein interaction networks
title_sort normalized l3-based link prediction in protein–protein interaction networks
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9945744/
https://www.ncbi.nlm.nih.gov/pubmed/36814208
http://dx.doi.org/10.1186/s12859-023-05178-3
work_keys_str_mv AT yuenhoyin normalizedl3basedlinkpredictioninproteinproteininteractionnetworks
AT janssonjesper normalizedl3basedlinkpredictioninproteinproteininteractionnetworks