Cargando…

MCPNet: a parallel maximum capacity-based genome-scale gene network construction framework

MOTIVATION: Gene network reconstruction from gene expression profiles is a compute- and data-intensive problem. Numerous methods based on diverse approaches including mutual information, random forests, Bayesian networks, correlation measures, as well as their transforms and filters such as data pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Pan, Tony C, Chockalingam, Sriram P, Aluru, Maneesha, Aluru, Srinivas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10287961/
https://www.ncbi.nlm.nih.gov/pubmed/37289522
http://dx.doi.org/10.1093/bioinformatics/btad373
_version_ 1785061978761330688
author Pan, Tony C
Chockalingam, Sriram P
Aluru, Maneesha
Aluru, Srinivas
author_facet Pan, Tony C
Chockalingam, Sriram P
Aluru, Maneesha
Aluru, Srinivas
author_sort Pan, Tony C
collection PubMed
description MOTIVATION: Gene network reconstruction from gene expression profiles is a compute- and data-intensive problem. Numerous methods based on diverse approaches including mutual information, random forests, Bayesian networks, correlation measures, as well as their transforms and filters such as data processing inequality, have been proposed. However, an effective gene network reconstruction method that performs well in all three aspects of computational efficiency, data size scalability, and output quality remains elusive. Simple techniques such as Pearson correlation are fast to compute but ignore indirect interactions, while more robust methods such as Bayesian networks are prohibitively time consuming to apply to tens of thousands of genes. RESULTS: We developed maximum capacity path (MCP) score, a novel maximum-capacity-path-based metric to quantify the relative strengths of direct and indirect gene–gene interactions. We further present MCPNet, an efficient, parallelized gene network reconstruction software based on MCP score, to reverse engineer networks in unsupervised and ensemble manners. Using synthetic and real Saccharomyces cervisiae datasets as well as real Arabidopsis thaliana datasets, we demonstrate that MCPNet produces better quality networks as measured by AUPRC, is significantly faster than all other gene network reconstruction software, and also scales well to tens of thousands of genes and hundreds of CPU cores. Thus, MCPNet represents a new gene network reconstruction tool that simultaneously achieves quality, performance, and scalability requirements. AVAILABILITY AND IMPLEMENTATION: Source code freely available for download at https://doi.org/10.5281/zenodo.6499747 and https://github.com/AluruLab/MCPNet, implemented in C++ and supported on Linux.
format Online
Article
Text
id pubmed-10287961
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-102879612023-06-24 MCPNet: a parallel maximum capacity-based genome-scale gene network construction framework Pan, Tony C Chockalingam, Sriram P Aluru, Maneesha Aluru, Srinivas Bioinformatics Original Paper MOTIVATION: Gene network reconstruction from gene expression profiles is a compute- and data-intensive problem. Numerous methods based on diverse approaches including mutual information, random forests, Bayesian networks, correlation measures, as well as their transforms and filters such as data processing inequality, have been proposed. However, an effective gene network reconstruction method that performs well in all three aspects of computational efficiency, data size scalability, and output quality remains elusive. Simple techniques such as Pearson correlation are fast to compute but ignore indirect interactions, while more robust methods such as Bayesian networks are prohibitively time consuming to apply to tens of thousands of genes. RESULTS: We developed maximum capacity path (MCP) score, a novel maximum-capacity-path-based metric to quantify the relative strengths of direct and indirect gene–gene interactions. We further present MCPNet, an efficient, parallelized gene network reconstruction software based on MCP score, to reverse engineer networks in unsupervised and ensemble manners. Using synthetic and real Saccharomyces cervisiae datasets as well as real Arabidopsis thaliana datasets, we demonstrate that MCPNet produces better quality networks as measured by AUPRC, is significantly faster than all other gene network reconstruction software, and also scales well to tens of thousands of genes and hundreds of CPU cores. Thus, MCPNet represents a new gene network reconstruction tool that simultaneously achieves quality, performance, and scalability requirements. AVAILABILITY AND IMPLEMENTATION: Source code freely available for download at https://doi.org/10.5281/zenodo.6499747 and https://github.com/AluruLab/MCPNet, implemented in C++ and supported on Linux. Oxford University Press 2023-06-08 /pmc/articles/PMC10287961/ /pubmed/37289522 http://dx.doi.org/10.1093/bioinformatics/btad373 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Pan, Tony C
Chockalingam, Sriram P
Aluru, Maneesha
Aluru, Srinivas
MCPNet: a parallel maximum capacity-based genome-scale gene network construction framework
title MCPNet: a parallel maximum capacity-based genome-scale gene network construction framework
title_full MCPNet: a parallel maximum capacity-based genome-scale gene network construction framework
title_fullStr MCPNet: a parallel maximum capacity-based genome-scale gene network construction framework
title_full_unstemmed MCPNet: a parallel maximum capacity-based genome-scale gene network construction framework
title_short MCPNet: a parallel maximum capacity-based genome-scale gene network construction framework
title_sort mcpnet: a parallel maximum capacity-based genome-scale gene network construction framework
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10287961/
https://www.ncbi.nlm.nih.gov/pubmed/37289522
http://dx.doi.org/10.1093/bioinformatics/btad373
work_keys_str_mv AT pantonyc mcpnetaparallelmaximumcapacitybasedgenomescalegenenetworkconstructionframework
AT chockalingamsriramp mcpnetaparallelmaximumcapacitybasedgenomescalegenenetworkconstructionframework
AT alurumaneesha mcpnetaparallelmaximumcapacitybasedgenomescalegenenetworkconstructionframework
AT alurusrinivas mcpnetaparallelmaximumcapacitybasedgenomescalegenenetworkconstructionframework