Cargando…
WINNER: A network biology tool for biomolecular characterization and prioritization
BACKGROUND AND CONTRIBUTION: In network biology, molecular functions can be characterized by network-based inference, or “guilt-by-associations.” PageRank-like tools have been applied in the study of biomolecular interaction networks to obtain further the relative significance of all molecules in th...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9672476/ https://www.ncbi.nlm.nih.gov/pubmed/36407327 http://dx.doi.org/10.3389/fdata.2022.1016606 |
_version_ | 1784832743850377216 |
---|---|
author | Nguyen, Thanh Yue, Zongliang Slominski, Radomir Welner, Robert Zhang, Jianyi Chen, Jake Y. |
author_facet | Nguyen, Thanh Yue, Zongliang Slominski, Radomir Welner, Robert Zhang, Jianyi Chen, Jake Y. |
author_sort | Nguyen, Thanh |
collection | PubMed |
description | BACKGROUND AND CONTRIBUTION: In network biology, molecular functions can be characterized by network-based inference, or “guilt-by-associations.” PageRank-like tools have been applied in the study of biomolecular interaction networks to obtain further the relative significance of all molecules in the network. However, there is a great deal of inherent noise in widely accessible data sets for gene-to-gene associations or protein-protein interactions. How to develop robust tests to expand, filter, and rank molecular entities in disease-specific networks remains an ad hoc data analysis process. RESULTS: We describe a new biomolecular characterization and prioritization tool called Weighted In-Network Node Expansion and Ranking (WINNER). It takes the input of any molecular interaction network data and generates an optionally expanded network with all the nodes ranked according to their relevance to one another in the network. To help users assess the robustness of results, WINNER provides two different types of statistics. The first type is a node-expansion p-value, which helps evaluate the statistical significance of adding “non-seed” molecules to the original biomolecular interaction network consisting of “seed” molecules and molecular interactions. The second type is a node-ranking p-value, which helps evaluate the relative statistical significance of the contribution of each node to the overall network architecture. We validated the robustness of WINNER in ranking top molecules by spiking noises in several network permutation experiments. We have found that node degree–preservation randomization of the gene network produced normally distributed ranking scores, which outperform those made with other gene network randomization techniques. Furthermore, we validated that a more significant proportion of the WINNER-ranked genes was associated with disease biology than existing methods such as PageRank. We demonstrated the performance of WINNER with a few case studies, including Alzheimer's disease, breast cancer, myocardial infarctions, and Triple negative breast cancer (TNBC). In all these case studies, the expanded and top-ranked genes identified by WINNER reveal disease biology more significantly than those identified by other gene prioritizing software tools, including Ingenuity Pathway Analysis (IPA) and DiAMOND. CONCLUSION: WINNER ranking strongly correlates to other ranking methods when the network covers sufficient node and edge information, indicating a high network quality. WINNER users can use this new tool to robustly evaluate a list of candidate genes, proteins, or metabolites produced from high-throughput biology experiments, as long as there is available gene/protein/metabolic network information. |
format | Online Article Text |
id | pubmed-9672476 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-96724762022-11-19 WINNER: A network biology tool for biomolecular characterization and prioritization Nguyen, Thanh Yue, Zongliang Slominski, Radomir Welner, Robert Zhang, Jianyi Chen, Jake Y. Front Big Data Big Data BACKGROUND AND CONTRIBUTION: In network biology, molecular functions can be characterized by network-based inference, or “guilt-by-associations.” PageRank-like tools have been applied in the study of biomolecular interaction networks to obtain further the relative significance of all molecules in the network. However, there is a great deal of inherent noise in widely accessible data sets for gene-to-gene associations or protein-protein interactions. How to develop robust tests to expand, filter, and rank molecular entities in disease-specific networks remains an ad hoc data analysis process. RESULTS: We describe a new biomolecular characterization and prioritization tool called Weighted In-Network Node Expansion and Ranking (WINNER). It takes the input of any molecular interaction network data and generates an optionally expanded network with all the nodes ranked according to their relevance to one another in the network. To help users assess the robustness of results, WINNER provides two different types of statistics. The first type is a node-expansion p-value, which helps evaluate the statistical significance of adding “non-seed” molecules to the original biomolecular interaction network consisting of “seed” molecules and molecular interactions. The second type is a node-ranking p-value, which helps evaluate the relative statistical significance of the contribution of each node to the overall network architecture. We validated the robustness of WINNER in ranking top molecules by spiking noises in several network permutation experiments. We have found that node degree–preservation randomization of the gene network produced normally distributed ranking scores, which outperform those made with other gene network randomization techniques. Furthermore, we validated that a more significant proportion of the WINNER-ranked genes was associated with disease biology than existing methods such as PageRank. We demonstrated the performance of WINNER with a few case studies, including Alzheimer's disease, breast cancer, myocardial infarctions, and Triple negative breast cancer (TNBC). In all these case studies, the expanded and top-ranked genes identified by WINNER reveal disease biology more significantly than those identified by other gene prioritizing software tools, including Ingenuity Pathway Analysis (IPA) and DiAMOND. CONCLUSION: WINNER ranking strongly correlates to other ranking methods when the network covers sufficient node and edge information, indicating a high network quality. WINNER users can use this new tool to robustly evaluate a list of candidate genes, proteins, or metabolites produced from high-throughput biology experiments, as long as there is available gene/protein/metabolic network information. Frontiers Media S.A. 2022-11-04 /pmc/articles/PMC9672476/ /pubmed/36407327 http://dx.doi.org/10.3389/fdata.2022.1016606 Text en Copyright © 2022 Nguyen, Yue, Slominski, Welner, Zhang and Chen. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Big Data Nguyen, Thanh Yue, Zongliang Slominski, Radomir Welner, Robert Zhang, Jianyi Chen, Jake Y. WINNER: A network biology tool for biomolecular characterization and prioritization |
title | WINNER: A network biology tool for biomolecular characterization and prioritization |
title_full | WINNER: A network biology tool for biomolecular characterization and prioritization |
title_fullStr | WINNER: A network biology tool for biomolecular characterization and prioritization |
title_full_unstemmed | WINNER: A network biology tool for biomolecular characterization and prioritization |
title_short | WINNER: A network biology tool for biomolecular characterization and prioritization |
title_sort | winner: a network biology tool for biomolecular characterization and prioritization |
topic | Big Data |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9672476/ https://www.ncbi.nlm.nih.gov/pubmed/36407327 http://dx.doi.org/10.3389/fdata.2022.1016606 |
work_keys_str_mv | AT nguyenthanh winneranetworkbiologytoolforbiomolecularcharacterizationandprioritization AT yuezongliang winneranetworkbiologytoolforbiomolecularcharacterizationandprioritization AT slominskiradomir winneranetworkbiologytoolforbiomolecularcharacterizationandprioritization AT welnerrobert winneranetworkbiologytoolforbiomolecularcharacterizationandprioritization AT zhangjianyi winneranetworkbiologytoolforbiomolecularcharacterizationandprioritization AT chenjakey winneranetworkbiologytoolforbiomolecularcharacterizationandprioritization |