Cargando…

WINNER: A network biology tool for biomolecular characterization and prioritization

BACKGROUND AND CONTRIBUTION: In network biology, molecular functions can be characterized by network-based inference, or “guilt-by-associations.” PageRank-like tools have been applied in the study of biomolecular interaction networks to obtain further the relative significance of all molecules in th...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen, Thanh, Yue, Zongliang, Slominski, Radomir, Welner, Robert, Zhang, Jianyi, Chen, Jake Y.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9672476/
https://www.ncbi.nlm.nih.gov/pubmed/36407327
http://dx.doi.org/10.3389/fdata.2022.1016606
_version_ 1784832743850377216
author Nguyen, Thanh
Yue, Zongliang
Slominski, Radomir
Welner, Robert
Zhang, Jianyi
Chen, Jake Y.
author_facet Nguyen, Thanh
Yue, Zongliang
Slominski, Radomir
Welner, Robert
Zhang, Jianyi
Chen, Jake Y.
author_sort Nguyen, Thanh
collection PubMed
description BACKGROUND AND CONTRIBUTION: In network biology, molecular functions can be characterized by network-based inference, or “guilt-by-associations.” PageRank-like tools have been applied in the study of biomolecular interaction networks to obtain further the relative significance of all molecules in the network. However, there is a great deal of inherent noise in widely accessible data sets for gene-to-gene associations or protein-protein interactions. How to develop robust tests to expand, filter, and rank molecular entities in disease-specific networks remains an ad hoc data analysis process. RESULTS: We describe a new biomolecular characterization and prioritization tool called Weighted In-Network Node Expansion and Ranking (WINNER). It takes the input of any molecular interaction network data and generates an optionally expanded network with all the nodes ranked according to their relevance to one another in the network. To help users assess the robustness of results, WINNER provides two different types of statistics. The first type is a node-expansion p-value, which helps evaluate the statistical significance of adding “non-seed” molecules to the original biomolecular interaction network consisting of “seed” molecules and molecular interactions. The second type is a node-ranking p-value, which helps evaluate the relative statistical significance of the contribution of each node to the overall network architecture. We validated the robustness of WINNER in ranking top molecules by spiking noises in several network permutation experiments. We have found that node degree–preservation randomization of the gene network produced normally distributed ranking scores, which outperform those made with other gene network randomization techniques. Furthermore, we validated that a more significant proportion of the WINNER-ranked genes was associated with disease biology than existing methods such as PageRank. We demonstrated the performance of WINNER with a few case studies, including Alzheimer's disease, breast cancer, myocardial infarctions, and Triple negative breast cancer (TNBC). In all these case studies, the expanded and top-ranked genes identified by WINNER reveal disease biology more significantly than those identified by other gene prioritizing software tools, including Ingenuity Pathway Analysis (IPA) and DiAMOND. CONCLUSION: WINNER ranking strongly correlates to other ranking methods when the network covers sufficient node and edge information, indicating a high network quality. WINNER users can use this new tool to robustly evaluate a list of candidate genes, proteins, or metabolites produced from high-throughput biology experiments, as long as there is available gene/protein/metabolic network information.
format Online
Article
Text
id pubmed-9672476
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-96724762022-11-19 WINNER: A network biology tool for biomolecular characterization and prioritization Nguyen, Thanh Yue, Zongliang Slominski, Radomir Welner, Robert Zhang, Jianyi Chen, Jake Y. Front Big Data Big Data BACKGROUND AND CONTRIBUTION: In network biology, molecular functions can be characterized by network-based inference, or “guilt-by-associations.” PageRank-like tools have been applied in the study of biomolecular interaction networks to obtain further the relative significance of all molecules in the network. However, there is a great deal of inherent noise in widely accessible data sets for gene-to-gene associations or protein-protein interactions. How to develop robust tests to expand, filter, and rank molecular entities in disease-specific networks remains an ad hoc data analysis process. RESULTS: We describe a new biomolecular characterization and prioritization tool called Weighted In-Network Node Expansion and Ranking (WINNER). It takes the input of any molecular interaction network data and generates an optionally expanded network with all the nodes ranked according to their relevance to one another in the network. To help users assess the robustness of results, WINNER provides two different types of statistics. The first type is a node-expansion p-value, which helps evaluate the statistical significance of adding “non-seed” molecules to the original biomolecular interaction network consisting of “seed” molecules and molecular interactions. The second type is a node-ranking p-value, which helps evaluate the relative statistical significance of the contribution of each node to the overall network architecture. We validated the robustness of WINNER in ranking top molecules by spiking noises in several network permutation experiments. We have found that node degree–preservation randomization of the gene network produced normally distributed ranking scores, which outperform those made with other gene network randomization techniques. Furthermore, we validated that a more significant proportion of the WINNER-ranked genes was associated with disease biology than existing methods such as PageRank. We demonstrated the performance of WINNER with a few case studies, including Alzheimer's disease, breast cancer, myocardial infarctions, and Triple negative breast cancer (TNBC). In all these case studies, the expanded and top-ranked genes identified by WINNER reveal disease biology more significantly than those identified by other gene prioritizing software tools, including Ingenuity Pathway Analysis (IPA) and DiAMOND. CONCLUSION: WINNER ranking strongly correlates to other ranking methods when the network covers sufficient node and edge information, indicating a high network quality. WINNER users can use this new tool to robustly evaluate a list of candidate genes, proteins, or metabolites produced from high-throughput biology experiments, as long as there is available gene/protein/metabolic network information. Frontiers Media S.A. 2022-11-04 /pmc/articles/PMC9672476/ /pubmed/36407327 http://dx.doi.org/10.3389/fdata.2022.1016606 Text en Copyright © 2022 Nguyen, Yue, Slominski, Welner, Zhang and Chen. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Big Data
Nguyen, Thanh
Yue, Zongliang
Slominski, Radomir
Welner, Robert
Zhang, Jianyi
Chen, Jake Y.
WINNER: A network biology tool for biomolecular characterization and prioritization
title WINNER: A network biology tool for biomolecular characterization and prioritization
title_full WINNER: A network biology tool for biomolecular characterization and prioritization
title_fullStr WINNER: A network biology tool for biomolecular characterization and prioritization
title_full_unstemmed WINNER: A network biology tool for biomolecular characterization and prioritization
title_short WINNER: A network biology tool for biomolecular characterization and prioritization
title_sort winner: a network biology tool for biomolecular characterization and prioritization
topic Big Data
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9672476/
https://www.ncbi.nlm.nih.gov/pubmed/36407327
http://dx.doi.org/10.3389/fdata.2022.1016606
work_keys_str_mv AT nguyenthanh winneranetworkbiologytoolforbiomolecularcharacterizationandprioritization
AT yuezongliang winneranetworkbiologytoolforbiomolecularcharacterizationandprioritization
AT slominskiradomir winneranetworkbiologytoolforbiomolecularcharacterizationandprioritization
AT welnerrobert winneranetworkbiologytoolforbiomolecularcharacterizationandprioritization
AT zhangjianyi winneranetworkbiologytoolforbiomolecularcharacterizationandprioritization
AT chenjakey winneranetworkbiologytoolforbiomolecularcharacterizationandprioritization