Cargando…

Robust discovery of gene regulatory networks from single-cell gene expression data by Causal Inference Using Composition of Transactions

Gene regulatory networks (GRNs) drive organism structure and functions, so the discovery and characterization of GRNs is a major goal in biological research. However, accurate identification of causal regulatory connections and inference of GRNs using gene expression datasets, more recently from sin...

Descripción completa

Detalles Bibliográficos
Autores principales: Shojaee, Abbas, Huang, Shao-shan Carol
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10612495/
https://www.ncbi.nlm.nih.gov/pubmed/37897702
http://dx.doi.org/10.1093/bib/bbad370
_version_ 1785128713475588096
author Shojaee, Abbas
Huang, Shao-shan Carol
author_facet Shojaee, Abbas
Huang, Shao-shan Carol
author_sort Shojaee, Abbas
collection PubMed
description Gene regulatory networks (GRNs) drive organism structure and functions, so the discovery and characterization of GRNs is a major goal in biological research. However, accurate identification of causal regulatory connections and inference of GRNs using gene expression datasets, more recently from single-cell RNA-seq (scRNA-seq), has been challenging. Here we employ the innovative method of Causal Inference Using Composition of Transactions (CICT) to uncover GRNs from scRNA-seq data. The basis of CICT is that if all gene expressions were random, a non-random regulatory gene should induce its targets at levels different from the background random process, resulting in distinct patterns in the whole relevance network of gene–gene associations. CICT proposes novel network features derived from a relevance network, which enable any machine learning algorithm to predict causal regulatory edges and infer GRNs. We evaluated CICT using simulated and experimental scRNA-seq data in a well-established benchmarking pipeline and showed that CICT outperformed existing network inference methods representing diverse approaches with many-fold higher accuracy. Furthermore, we demonstrated that GRN inference with CICT was robust to different levels of sparsity in scRNA-seq data, the characteristics of data and ground truth, the choice of association measure and the complexity of the supervised machine learning algorithm. Our results suggest aiming at directly predicting causality to recover regulatory relationships in complex biological networks substantially improves accuracy in GRN inference.
format Online
Article
Text
id pubmed-10612495
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-106124952023-10-29 Robust discovery of gene regulatory networks from single-cell gene expression data by Causal Inference Using Composition of Transactions Shojaee, Abbas Huang, Shao-shan Carol Brief Bioinform Problem Solving Protocol Gene regulatory networks (GRNs) drive organism structure and functions, so the discovery and characterization of GRNs is a major goal in biological research. However, accurate identification of causal regulatory connections and inference of GRNs using gene expression datasets, more recently from single-cell RNA-seq (scRNA-seq), has been challenging. Here we employ the innovative method of Causal Inference Using Composition of Transactions (CICT) to uncover GRNs from scRNA-seq data. The basis of CICT is that if all gene expressions were random, a non-random regulatory gene should induce its targets at levels different from the background random process, resulting in distinct patterns in the whole relevance network of gene–gene associations. CICT proposes novel network features derived from a relevance network, which enable any machine learning algorithm to predict causal regulatory edges and infer GRNs. We evaluated CICT using simulated and experimental scRNA-seq data in a well-established benchmarking pipeline and showed that CICT outperformed existing network inference methods representing diverse approaches with many-fold higher accuracy. Furthermore, we demonstrated that GRN inference with CICT was robust to different levels of sparsity in scRNA-seq data, the characteristics of data and ground truth, the choice of association measure and the complexity of the supervised machine learning algorithm. Our results suggest aiming at directly predicting causality to recover regulatory relationships in complex biological networks substantially improves accuracy in GRN inference. Oxford University Press 2023-10-27 /pmc/articles/PMC10612495/ /pubmed/37897702 http://dx.doi.org/10.1093/bib/bbad370 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Problem Solving Protocol
Shojaee, Abbas
Huang, Shao-shan Carol
Robust discovery of gene regulatory networks from single-cell gene expression data by Causal Inference Using Composition of Transactions
title Robust discovery of gene regulatory networks from single-cell gene expression data by Causal Inference Using Composition of Transactions
title_full Robust discovery of gene regulatory networks from single-cell gene expression data by Causal Inference Using Composition of Transactions
title_fullStr Robust discovery of gene regulatory networks from single-cell gene expression data by Causal Inference Using Composition of Transactions
title_full_unstemmed Robust discovery of gene regulatory networks from single-cell gene expression data by Causal Inference Using Composition of Transactions
title_short Robust discovery of gene regulatory networks from single-cell gene expression data by Causal Inference Using Composition of Transactions
title_sort robust discovery of gene regulatory networks from single-cell gene expression data by causal inference using composition of transactions
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10612495/
https://www.ncbi.nlm.nih.gov/pubmed/37897702
http://dx.doi.org/10.1093/bib/bbad370
work_keys_str_mv AT shojaeeabbas robustdiscoveryofgeneregulatorynetworksfromsinglecellgeneexpressiondatabycausalinferenceusingcompositionoftransactions
AT huangshaoshancarol robustdiscoveryofgeneregulatorynetworksfromsinglecellgeneexpressiondatabycausalinferenceusingcompositionoftransactions