Cargando…
Benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis
BACKGROUND: Elucidating compound mechanism of action (MoA) is beneficial to drug discovery, but in practice often represents a significant challenge. Causal Reasoning approaches aim to address this situation by inferring dysregulated signalling proteins using transcriptomics data and biological netw...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10111792/ https://www.ncbi.nlm.nih.gov/pubmed/37072707 http://dx.doi.org/10.1186/s12859-023-05277-1 |
_version_ | 1785027518898634752 |
---|---|
author | Hosseini-Gerami, Layla Higgins, Ixavier Alonzo Collier, David A. Laing, Emma Evans, David Broughton, Howard Bender, Andreas |
author_facet | Hosseini-Gerami, Layla Higgins, Ixavier Alonzo Collier, David A. Laing, Emma Evans, David Broughton, Howard Bender, Andreas |
author_sort | Hosseini-Gerami, Layla |
collection | PubMed |
description | BACKGROUND: Elucidating compound mechanism of action (MoA) is beneficial to drug discovery, but in practice often represents a significant challenge. Causal Reasoning approaches aim to address this situation by inferring dysregulated signalling proteins using transcriptomics data and biological networks; however, a comprehensive benchmarking of such approaches has not yet been reported. Here we benchmarked four causal reasoning algorithms (SigNet, CausalR, CausalR ScanR and CARNIVAL) with four networks (the smaller Omnipath network vs. 3 larger MetaBase™ networks), using LINCS L1000 and CMap microarray data, and assessed to what extent each factor dictated the successful recovery of direct targets and compound-associated signalling pathways in a benchmark dataset comprising 269 compounds. We additionally examined impact on performance in terms of the functions and roles of protein targets and their connectivity bias in the prior knowledge networks. RESULTS: According to statistical analysis (negative binomial model), the combination of algorithm and network most significantly dictated the performance of causal reasoning algorithms, with the SigNet recovering the greatest number of direct targets. With respect to the recovery of signalling pathways, CARNIVAL with the Omnipath network was able to recover the most informative pathways containing compound targets, based on the Reactome pathway hierarchy. Additionally, CARNIVAL, SigNet and CausalR ScanR all outperformed baseline gene expression pathway enrichment results. We found no significant difference in performance between L1000 data or microarray data, even when limited to just 978 ‘landmark’ genes. Notably, all causal reasoning algorithms also outperformed pathway recovery based on input DEGs, despite these often being used for pathway enrichment. Causal reasoning methods performance was somewhat correlated with connectivity and biological role of the targets. CONCLUSIONS: Overall, we conclude that causal reasoning performs well at recovering signalling proteins related to compound MoA upstream from gene expression changes by leveraging prior knowledge networks, and that the choice of network and algorithm has a profound impact on the performance of causal reasoning algorithms. Based on the analyses presented here this is true for both microarray-based gene expression data as well as those based on the L1000 platform. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05277-1. |
format | Online Article Text |
id | pubmed-10111792 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-101117922023-04-19 Benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis Hosseini-Gerami, Layla Higgins, Ixavier Alonzo Collier, David A. Laing, Emma Evans, David Broughton, Howard Bender, Andreas BMC Bioinformatics Research BACKGROUND: Elucidating compound mechanism of action (MoA) is beneficial to drug discovery, but in practice often represents a significant challenge. Causal Reasoning approaches aim to address this situation by inferring dysregulated signalling proteins using transcriptomics data and biological networks; however, a comprehensive benchmarking of such approaches has not yet been reported. Here we benchmarked four causal reasoning algorithms (SigNet, CausalR, CausalR ScanR and CARNIVAL) with four networks (the smaller Omnipath network vs. 3 larger MetaBase™ networks), using LINCS L1000 and CMap microarray data, and assessed to what extent each factor dictated the successful recovery of direct targets and compound-associated signalling pathways in a benchmark dataset comprising 269 compounds. We additionally examined impact on performance in terms of the functions and roles of protein targets and their connectivity bias in the prior knowledge networks. RESULTS: According to statistical analysis (negative binomial model), the combination of algorithm and network most significantly dictated the performance of causal reasoning algorithms, with the SigNet recovering the greatest number of direct targets. With respect to the recovery of signalling pathways, CARNIVAL with the Omnipath network was able to recover the most informative pathways containing compound targets, based on the Reactome pathway hierarchy. Additionally, CARNIVAL, SigNet and CausalR ScanR all outperformed baseline gene expression pathway enrichment results. We found no significant difference in performance between L1000 data or microarray data, even when limited to just 978 ‘landmark’ genes. Notably, all causal reasoning algorithms also outperformed pathway recovery based on input DEGs, despite these often being used for pathway enrichment. Causal reasoning methods performance was somewhat correlated with connectivity and biological role of the targets. CONCLUSIONS: Overall, we conclude that causal reasoning performs well at recovering signalling proteins related to compound MoA upstream from gene expression changes by leveraging prior knowledge networks, and that the choice of network and algorithm has a profound impact on the performance of causal reasoning algorithms. Based on the analyses presented here this is true for both microarray-based gene expression data as well as those based on the L1000 platform. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05277-1. BioMed Central 2023-04-18 /pmc/articles/PMC10111792/ /pubmed/37072707 http://dx.doi.org/10.1186/s12859-023-05277-1 Text en © The Author(s) 2023, corrected publication 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Hosseini-Gerami, Layla Higgins, Ixavier Alonzo Collier, David A. Laing, Emma Evans, David Broughton, Howard Bender, Andreas Benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis |
title | Benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis |
title_full | Benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis |
title_fullStr | Benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis |
title_full_unstemmed | Benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis |
title_short | Benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis |
title_sort | benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10111792/ https://www.ncbi.nlm.nih.gov/pubmed/37072707 http://dx.doi.org/10.1186/s12859-023-05277-1 |
work_keys_str_mv | AT hosseinigeramilayla benchmarkingcausalreasoningalgorithmsforgeneexpressionbasedcompoundmechanismofactionanalysis AT higginsixavieralonzo benchmarkingcausalreasoningalgorithmsforgeneexpressionbasedcompoundmechanismofactionanalysis AT collierdavida benchmarkingcausalreasoningalgorithmsforgeneexpressionbasedcompoundmechanismofactionanalysis AT laingemma benchmarkingcausalreasoningalgorithmsforgeneexpressionbasedcompoundmechanismofactionanalysis AT evansdavid benchmarkingcausalreasoningalgorithmsforgeneexpressionbasedcompoundmechanismofactionanalysis AT broughtonhoward benchmarkingcausalreasoningalgorithmsforgeneexpressionbasedcompoundmechanismofactionanalysis AT benderandreas benchmarkingcausalreasoningalgorithmsforgeneexpressionbasedcompoundmechanismofactionanalysis |