Cargando…

Benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis

BACKGROUND: Elucidating compound mechanism of action (MoA) is beneficial to drug discovery, but in practice often represents a significant challenge. Causal Reasoning approaches aim to address this situation by inferring dysregulated signalling proteins using transcriptomics data and biological netw...

Descripción completa

Detalles Bibliográficos
Autores principales: Hosseini-Gerami, Layla, Higgins, Ixavier Alonzo, Collier, David A., Laing, Emma, Evans, David, Broughton, Howard, Bender, Andreas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10111792/
https://www.ncbi.nlm.nih.gov/pubmed/37072707
http://dx.doi.org/10.1186/s12859-023-05277-1
_version_ 1785027518898634752
author Hosseini-Gerami, Layla
Higgins, Ixavier Alonzo
Collier, David A.
Laing, Emma
Evans, David
Broughton, Howard
Bender, Andreas
author_facet Hosseini-Gerami, Layla
Higgins, Ixavier Alonzo
Collier, David A.
Laing, Emma
Evans, David
Broughton, Howard
Bender, Andreas
author_sort Hosseini-Gerami, Layla
collection PubMed
description BACKGROUND: Elucidating compound mechanism of action (MoA) is beneficial to drug discovery, but in practice often represents a significant challenge. Causal Reasoning approaches aim to address this situation by inferring dysregulated signalling proteins using transcriptomics data and biological networks; however, a comprehensive benchmarking of such approaches has not yet been reported. Here we benchmarked four causal reasoning algorithms (SigNet, CausalR, CausalR ScanR and CARNIVAL) with four networks (the smaller Omnipath network vs. 3 larger MetaBase™ networks), using LINCS L1000 and CMap microarray data, and assessed to what extent each factor dictated the successful recovery of direct targets and compound-associated signalling pathways in a benchmark dataset comprising 269 compounds. We additionally examined impact on performance in terms of the functions and roles of protein targets and their connectivity bias in the prior knowledge networks. RESULTS: According to statistical analysis (negative binomial model), the combination of algorithm and network most significantly dictated the performance of causal reasoning algorithms, with the SigNet recovering the greatest number of direct targets. With respect to the recovery of signalling pathways, CARNIVAL with the Omnipath network was able to recover the most informative pathways containing compound targets, based on the Reactome pathway hierarchy. Additionally, CARNIVAL, SigNet and CausalR ScanR all outperformed baseline gene expression pathway enrichment results. We found no significant difference in performance between L1000 data or microarray data, even when limited to just 978 ‘landmark’ genes. Notably, all causal reasoning algorithms also outperformed pathway recovery based on input DEGs, despite these often being used for pathway enrichment. Causal reasoning methods performance was somewhat correlated with connectivity and biological role of the targets. CONCLUSIONS: Overall, we conclude that causal reasoning performs well at recovering signalling proteins related to compound MoA upstream from gene expression changes by leveraging prior knowledge networks, and that the choice of network and algorithm has a profound impact on the performance of causal reasoning algorithms. Based on the analyses presented here this is true for both microarray-based gene expression data as well as those based on the L1000 platform. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05277-1.
format Online
Article
Text
id pubmed-10111792
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-101117922023-04-19 Benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis Hosseini-Gerami, Layla Higgins, Ixavier Alonzo Collier, David A. Laing, Emma Evans, David Broughton, Howard Bender, Andreas BMC Bioinformatics Research BACKGROUND: Elucidating compound mechanism of action (MoA) is beneficial to drug discovery, but in practice often represents a significant challenge. Causal Reasoning approaches aim to address this situation by inferring dysregulated signalling proteins using transcriptomics data and biological networks; however, a comprehensive benchmarking of such approaches has not yet been reported. Here we benchmarked four causal reasoning algorithms (SigNet, CausalR, CausalR ScanR and CARNIVAL) with four networks (the smaller Omnipath network vs. 3 larger MetaBase™ networks), using LINCS L1000 and CMap microarray data, and assessed to what extent each factor dictated the successful recovery of direct targets and compound-associated signalling pathways in a benchmark dataset comprising 269 compounds. We additionally examined impact on performance in terms of the functions and roles of protein targets and their connectivity bias in the prior knowledge networks. RESULTS: According to statistical analysis (negative binomial model), the combination of algorithm and network most significantly dictated the performance of causal reasoning algorithms, with the SigNet recovering the greatest number of direct targets. With respect to the recovery of signalling pathways, CARNIVAL with the Omnipath network was able to recover the most informative pathways containing compound targets, based on the Reactome pathway hierarchy. Additionally, CARNIVAL, SigNet and CausalR ScanR all outperformed baseline gene expression pathway enrichment results. We found no significant difference in performance between L1000 data or microarray data, even when limited to just 978 ‘landmark’ genes. Notably, all causal reasoning algorithms also outperformed pathway recovery based on input DEGs, despite these often being used for pathway enrichment. Causal reasoning methods performance was somewhat correlated with connectivity and biological role of the targets. CONCLUSIONS: Overall, we conclude that causal reasoning performs well at recovering signalling proteins related to compound MoA upstream from gene expression changes by leveraging prior knowledge networks, and that the choice of network and algorithm has a profound impact on the performance of causal reasoning algorithms. Based on the analyses presented here this is true for both microarray-based gene expression data as well as those based on the L1000 platform. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05277-1. BioMed Central 2023-04-18 /pmc/articles/PMC10111792/ /pubmed/37072707 http://dx.doi.org/10.1186/s12859-023-05277-1 Text en © The Author(s) 2023, corrected publication 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Hosseini-Gerami, Layla
Higgins, Ixavier Alonzo
Collier, David A.
Laing, Emma
Evans, David
Broughton, Howard
Bender, Andreas
Benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis
title Benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis
title_full Benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis
title_fullStr Benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis
title_full_unstemmed Benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis
title_short Benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis
title_sort benchmarking causal reasoning algorithms for gene expression-based compound mechanism of action analysis
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10111792/
https://www.ncbi.nlm.nih.gov/pubmed/37072707
http://dx.doi.org/10.1186/s12859-023-05277-1
work_keys_str_mv AT hosseinigeramilayla benchmarkingcausalreasoningalgorithmsforgeneexpressionbasedcompoundmechanismofactionanalysis
AT higginsixavieralonzo benchmarkingcausalreasoningalgorithmsforgeneexpressionbasedcompoundmechanismofactionanalysis
AT collierdavida benchmarkingcausalreasoningalgorithmsforgeneexpressionbasedcompoundmechanismofactionanalysis
AT laingemma benchmarkingcausalreasoningalgorithmsforgeneexpressionbasedcompoundmechanismofactionanalysis
AT evansdavid benchmarkingcausalreasoningalgorithmsforgeneexpressionbasedcompoundmechanismofactionanalysis
AT broughtonhoward benchmarkingcausalreasoningalgorithmsforgeneexpressionbasedcompoundmechanismofactionanalysis
AT benderandreas benchmarkingcausalreasoningalgorithmsforgeneexpressionbasedcompoundmechanismofactionanalysis