Cargando…
Identifying large-scale interaction atlases using probabilistic graphs and external knowledge
INTRODUCTION: Reconstruction of gene interaction networks from experimental data provides a deep understanding of the underlying biological mechanisms. The noisy nature of the data and the large size of the network make this a very challenging task. Complex approaches handle the stochastic nature of...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cambridge University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8922291/ https://www.ncbi.nlm.nih.gov/pubmed/35321220 http://dx.doi.org/10.1017/cts.2022.18 |
_version_ | 1784669495465345024 |
---|---|
author | Chanumolu, Sree K. Otu, Hasan H. |
author_facet | Chanumolu, Sree K. Otu, Hasan H. |
author_sort | Chanumolu, Sree K. |
collection | PubMed |
description | INTRODUCTION: Reconstruction of gene interaction networks from experimental data provides a deep understanding of the underlying biological mechanisms. The noisy nature of the data and the large size of the network make this a very challenging task. Complex approaches handle the stochastic nature of the data but can only do this for small networks; simpler, linear models generate large networks but with less reliability. METHODS: We propose a divide-and-conquer approach using probabilistic graph representations and external knowledge. We cluster the experimental data and learn an interaction network for each cluster, which are merged using the interaction network for the representative genes selected for each cluster. RESULTS: We generated an interaction atlas for 337 human pathways yielding a network of 11,454 genes with 17,777 edges. Simulated gene expression data from this atlas formed the basis for reconstruction. Based on the area under the curve of the precision-recall curve, the proposed approach outperformed the baseline (random classifier) by ∼15-fold and conventional methods by ∼5–17-fold. The performance of the proposed workflow is significantly linked to the accuracy of the clustering step that tries to identify the modularity of the underlying biological mechanisms. CONCLUSIONS: We provide an interaction atlas generation workflow optimizing the algorithm/parameter selection. The proposed approach integrates external knowledge in the reconstruction of the interactome using probabilistic graphs. Network characterization and understanding long-range effects in interaction atlases provide means for comparative analysis with implications in biomarker discovery and therapeutic approaches. The proposed workflow is freely available at http://otulab.unl.edu/atlas. |
format | Online Article Text |
id | pubmed-8922291 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Cambridge University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-89222912022-03-22 Identifying large-scale interaction atlases using probabilistic graphs and external knowledge Chanumolu, Sree K. Otu, Hasan H. J Clin Transl Sci Research Article INTRODUCTION: Reconstruction of gene interaction networks from experimental data provides a deep understanding of the underlying biological mechanisms. The noisy nature of the data and the large size of the network make this a very challenging task. Complex approaches handle the stochastic nature of the data but can only do this for small networks; simpler, linear models generate large networks but with less reliability. METHODS: We propose a divide-and-conquer approach using probabilistic graph representations and external knowledge. We cluster the experimental data and learn an interaction network for each cluster, which are merged using the interaction network for the representative genes selected for each cluster. RESULTS: We generated an interaction atlas for 337 human pathways yielding a network of 11,454 genes with 17,777 edges. Simulated gene expression data from this atlas formed the basis for reconstruction. Based on the area under the curve of the precision-recall curve, the proposed approach outperformed the baseline (random classifier) by ∼15-fold and conventional methods by ∼5–17-fold. The performance of the proposed workflow is significantly linked to the accuracy of the clustering step that tries to identify the modularity of the underlying biological mechanisms. CONCLUSIONS: We provide an interaction atlas generation workflow optimizing the algorithm/parameter selection. The proposed approach integrates external knowledge in the reconstruction of the interactome using probabilistic graphs. Network characterization and understanding long-range effects in interaction atlases provide means for comparative analysis with implications in biomarker discovery and therapeutic approaches. The proposed workflow is freely available at http://otulab.unl.edu/atlas. Cambridge University Press 2022-02-11 /pmc/articles/PMC8922291/ /pubmed/35321220 http://dx.doi.org/10.1017/cts.2022.18 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Chanumolu, Sree K. Otu, Hasan H. Identifying large-scale interaction atlases using probabilistic graphs and external knowledge |
title | Identifying large-scale interaction atlases using probabilistic graphs and external knowledge |
title_full | Identifying large-scale interaction atlases using probabilistic graphs and external knowledge |
title_fullStr | Identifying large-scale interaction atlases using probabilistic graphs and external knowledge |
title_full_unstemmed | Identifying large-scale interaction atlases using probabilistic graphs and external knowledge |
title_short | Identifying large-scale interaction atlases using probabilistic graphs and external knowledge |
title_sort | identifying large-scale interaction atlases using probabilistic graphs and external knowledge |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8922291/ https://www.ncbi.nlm.nih.gov/pubmed/35321220 http://dx.doi.org/10.1017/cts.2022.18 |
work_keys_str_mv | AT chanumolusreek identifyinglargescaleinteractionatlasesusingprobabilisticgraphsandexternalknowledge AT otuhasanh identifyinglargescaleinteractionatlasesusingprobabilisticgraphsandexternalknowledge |