Cargando…

Identifying large-scale interaction atlases using probabilistic graphs and external knowledge

INTRODUCTION: Reconstruction of gene interaction networks from experimental data provides a deep understanding of the underlying biological mechanisms. The noisy nature of the data and the large size of the network make this a very challenging task. Complex approaches handle the stochastic nature of...

Descripción completa

Detalles Bibliográficos
Autores principales: Chanumolu, Sree K., Otu, Hasan H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cambridge University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8922291/
https://www.ncbi.nlm.nih.gov/pubmed/35321220
http://dx.doi.org/10.1017/cts.2022.18
_version_ 1784669495465345024
author Chanumolu, Sree K.
Otu, Hasan H.
author_facet Chanumolu, Sree K.
Otu, Hasan H.
author_sort Chanumolu, Sree K.
collection PubMed
description INTRODUCTION: Reconstruction of gene interaction networks from experimental data provides a deep understanding of the underlying biological mechanisms. The noisy nature of the data and the large size of the network make this a very challenging task. Complex approaches handle the stochastic nature of the data but can only do this for small networks; simpler, linear models generate large networks but with less reliability. METHODS: We propose a divide-and-conquer approach using probabilistic graph representations and external knowledge. We cluster the experimental data and learn an interaction network for each cluster, which are merged using the interaction network for the representative genes selected for each cluster. RESULTS: We generated an interaction atlas for 337 human pathways yielding a network of 11,454 genes with 17,777 edges. Simulated gene expression data from this atlas formed the basis for reconstruction. Based on the area under the curve of the precision-recall curve, the proposed approach outperformed the baseline (random classifier) by ∼15-fold and conventional methods by ∼5–17-fold. The performance of the proposed workflow is significantly linked to the accuracy of the clustering step that tries to identify the modularity of the underlying biological mechanisms. CONCLUSIONS: We provide an interaction atlas generation workflow optimizing the algorithm/parameter selection. The proposed approach integrates external knowledge in the reconstruction of the interactome using probabilistic graphs. Network characterization and understanding long-range effects in interaction atlases provide means for comparative analysis with implications in biomarker discovery and therapeutic approaches. The proposed workflow is freely available at http://otulab.unl.edu/atlas.
format Online
Article
Text
id pubmed-8922291
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Cambridge University Press
record_format MEDLINE/PubMed
spelling pubmed-89222912022-03-22 Identifying large-scale interaction atlases using probabilistic graphs and external knowledge Chanumolu, Sree K. Otu, Hasan H. J Clin Transl Sci Research Article INTRODUCTION: Reconstruction of gene interaction networks from experimental data provides a deep understanding of the underlying biological mechanisms. The noisy nature of the data and the large size of the network make this a very challenging task. Complex approaches handle the stochastic nature of the data but can only do this for small networks; simpler, linear models generate large networks but with less reliability. METHODS: We propose a divide-and-conquer approach using probabilistic graph representations and external knowledge. We cluster the experimental data and learn an interaction network for each cluster, which are merged using the interaction network for the representative genes selected for each cluster. RESULTS: We generated an interaction atlas for 337 human pathways yielding a network of 11,454 genes with 17,777 edges. Simulated gene expression data from this atlas formed the basis for reconstruction. Based on the area under the curve of the precision-recall curve, the proposed approach outperformed the baseline (random classifier) by ∼15-fold and conventional methods by ∼5–17-fold. The performance of the proposed workflow is significantly linked to the accuracy of the clustering step that tries to identify the modularity of the underlying biological mechanisms. CONCLUSIONS: We provide an interaction atlas generation workflow optimizing the algorithm/parameter selection. The proposed approach integrates external knowledge in the reconstruction of the interactome using probabilistic graphs. Network characterization and understanding long-range effects in interaction atlases provide means for comparative analysis with implications in biomarker discovery and therapeutic approaches. The proposed workflow is freely available at http://otulab.unl.edu/atlas. Cambridge University Press 2022-02-11 /pmc/articles/PMC8922291/ /pubmed/35321220 http://dx.doi.org/10.1017/cts.2022.18 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Chanumolu, Sree K.
Otu, Hasan H.
Identifying large-scale interaction atlases using probabilistic graphs and external knowledge
title Identifying large-scale interaction atlases using probabilistic graphs and external knowledge
title_full Identifying large-scale interaction atlases using probabilistic graphs and external knowledge
title_fullStr Identifying large-scale interaction atlases using probabilistic graphs and external knowledge
title_full_unstemmed Identifying large-scale interaction atlases using probabilistic graphs and external knowledge
title_short Identifying large-scale interaction atlases using probabilistic graphs and external knowledge
title_sort identifying large-scale interaction atlases using probabilistic graphs and external knowledge
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8922291/
https://www.ncbi.nlm.nih.gov/pubmed/35321220
http://dx.doi.org/10.1017/cts.2022.18
work_keys_str_mv AT chanumolusreek identifyinglargescaleinteractionatlasesusingprobabilisticgraphsandexternalknowledge
AT otuhasanh identifyinglargescaleinteractionatlasesusingprobabilisticgraphsandexternalknowledge