Cargando…
Robust discovery of gene regulatory networks from single-cell gene expression data by Causal Inference Using Composition of Transactions
Gene regulatory networks (GRNs) drive organism structure and functions, so the discovery and characterization of GRNs is a major goal in biological research. However, accurate identification of causal regulatory connections and inference of GRNs using gene expression datasets, more recently from sin...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10612495/ https://www.ncbi.nlm.nih.gov/pubmed/37897702 http://dx.doi.org/10.1093/bib/bbad370 |
_version_ | 1785128713475588096 |
---|---|
author | Shojaee, Abbas Huang, Shao-shan Carol |
author_facet | Shojaee, Abbas Huang, Shao-shan Carol |
author_sort | Shojaee, Abbas |
collection | PubMed |
description | Gene regulatory networks (GRNs) drive organism structure and functions, so the discovery and characterization of GRNs is a major goal in biological research. However, accurate identification of causal regulatory connections and inference of GRNs using gene expression datasets, more recently from single-cell RNA-seq (scRNA-seq), has been challenging. Here we employ the innovative method of Causal Inference Using Composition of Transactions (CICT) to uncover GRNs from scRNA-seq data. The basis of CICT is that if all gene expressions were random, a non-random regulatory gene should induce its targets at levels different from the background random process, resulting in distinct patterns in the whole relevance network of gene–gene associations. CICT proposes novel network features derived from a relevance network, which enable any machine learning algorithm to predict causal regulatory edges and infer GRNs. We evaluated CICT using simulated and experimental scRNA-seq data in a well-established benchmarking pipeline and showed that CICT outperformed existing network inference methods representing diverse approaches with many-fold higher accuracy. Furthermore, we demonstrated that GRN inference with CICT was robust to different levels of sparsity in scRNA-seq data, the characteristics of data and ground truth, the choice of association measure and the complexity of the supervised machine learning algorithm. Our results suggest aiming at directly predicting causality to recover regulatory relationships in complex biological networks substantially improves accuracy in GRN inference. |
format | Online Article Text |
id | pubmed-10612495 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-106124952023-10-29 Robust discovery of gene regulatory networks from single-cell gene expression data by Causal Inference Using Composition of Transactions Shojaee, Abbas Huang, Shao-shan Carol Brief Bioinform Problem Solving Protocol Gene regulatory networks (GRNs) drive organism structure and functions, so the discovery and characterization of GRNs is a major goal in biological research. However, accurate identification of causal regulatory connections and inference of GRNs using gene expression datasets, more recently from single-cell RNA-seq (scRNA-seq), has been challenging. Here we employ the innovative method of Causal Inference Using Composition of Transactions (CICT) to uncover GRNs from scRNA-seq data. The basis of CICT is that if all gene expressions were random, a non-random regulatory gene should induce its targets at levels different from the background random process, resulting in distinct patterns in the whole relevance network of gene–gene associations. CICT proposes novel network features derived from a relevance network, which enable any machine learning algorithm to predict causal regulatory edges and infer GRNs. We evaluated CICT using simulated and experimental scRNA-seq data in a well-established benchmarking pipeline and showed that CICT outperformed existing network inference methods representing diverse approaches with many-fold higher accuracy. Furthermore, we demonstrated that GRN inference with CICT was robust to different levels of sparsity in scRNA-seq data, the characteristics of data and ground truth, the choice of association measure and the complexity of the supervised machine learning algorithm. Our results suggest aiming at directly predicting causality to recover regulatory relationships in complex biological networks substantially improves accuracy in GRN inference. Oxford University Press 2023-10-27 /pmc/articles/PMC10612495/ /pubmed/37897702 http://dx.doi.org/10.1093/bib/bbad370 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Problem Solving Protocol Shojaee, Abbas Huang, Shao-shan Carol Robust discovery of gene regulatory networks from single-cell gene expression data by Causal Inference Using Composition of Transactions |
title | Robust discovery of gene regulatory networks from single-cell gene expression data by Causal Inference Using Composition of Transactions |
title_full | Robust discovery of gene regulatory networks from single-cell gene expression data by Causal Inference Using Composition of Transactions |
title_fullStr | Robust discovery of gene regulatory networks from single-cell gene expression data by Causal Inference Using Composition of Transactions |
title_full_unstemmed | Robust discovery of gene regulatory networks from single-cell gene expression data by Causal Inference Using Composition of Transactions |
title_short | Robust discovery of gene regulatory networks from single-cell gene expression data by Causal Inference Using Composition of Transactions |
title_sort | robust discovery of gene regulatory networks from single-cell gene expression data by causal inference using composition of transactions |
topic | Problem Solving Protocol |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10612495/ https://www.ncbi.nlm.nih.gov/pubmed/37897702 http://dx.doi.org/10.1093/bib/bbad370 |
work_keys_str_mv | AT shojaeeabbas robustdiscoveryofgeneregulatorynetworksfromsinglecellgeneexpressiondatabycausalinferenceusingcompositionoftransactions AT huangshaoshancarol robustdiscoveryofgeneregulatorynetworksfromsinglecellgeneexpressiondatabycausalinferenceusingcompositionoftransactions |