Cargando…

Explicit representation of protein activity states significantly improves causal discovery of protein phosphorylation networks

BACKGROUND: Protein phosphorylation networks play an important role in cell signaling. In these networks, phosphorylation of a protein kinase usually leads to its activation, which in turn will phosphorylate its downstream target proteins. A phosphorylation network is essentially a causal network, w...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Jinling, Ma, Xiaojun, Cooper, Gregory F., Lu, Xinghua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7496209/
https://www.ncbi.nlm.nih.gov/pubmed/32938361
http://dx.doi.org/10.1186/s12859-020-03676-2
_version_ 1783583047927463936
author Liu, Jinling
Ma, Xiaojun
Cooper, Gregory F.
Lu, Xinghua
author_facet Liu, Jinling
Ma, Xiaojun
Cooper, Gregory F.
Lu, Xinghua
author_sort Liu, Jinling
collection PubMed
description BACKGROUND: Protein phosphorylation networks play an important role in cell signaling. In these networks, phosphorylation of a protein kinase usually leads to its activation, which in turn will phosphorylate its downstream target proteins. A phosphorylation network is essentially a causal network, which can be learned by causal inference algorithms. Prior efforts have applied such algorithms to data measuring protein phosphorylation levels, assuming that the phosphorylation levels represent protein activity states. However, the phosphorylation status of a kinase does not always reflect its activity state, because interventions such as inhibitors or mutations can directly affect its activity state without changing its phosphorylation status. Thus, when cellular systems are subjected to extensive perturbations, the statistical relationships between phosphorylation states of proteins may be disrupted, making it difficult to reconstruct the true protein phosphorylation network. Here, we describe a novel framework to address this challenge. RESULTS: We have developed a causal discovery framework that explicitly represents the activity state of each protein kinase as an unmeasured variable and developed a novel algorithm called “InferA” to infer the protein activity states, which allows us to incorporate the protein phosphorylation level, pharmacological interventions and prior knowledge. We applied our framework to simulated datasets and to a real-world dataset. The simulation experiments demonstrated that explicit representation of activity states of protein kinases allows one to effectively represent the impact of interventions and thus enabled our framework to accurately recover the ground-truth causal network. Results from the real-world dataset showed that the explicit representation of protein activity states allowed an effective and data-driven integration of the prior knowledge by InferA, which further leads to the recovery of a phosphorylation network that is more consistent with experiment results. CONCLUSIONS: Explicit representation of the protein activity states by our novel framework significantly enhances causal discovery of protein phosphorylation networks.
format Online
Article
Text
id pubmed-7496209
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-74962092020-09-21 Explicit representation of protein activity states significantly improves causal discovery of protein phosphorylation networks Liu, Jinling Ma, Xiaojun Cooper, Gregory F. Lu, Xinghua BMC Bioinformatics Research BACKGROUND: Protein phosphorylation networks play an important role in cell signaling. In these networks, phosphorylation of a protein kinase usually leads to its activation, which in turn will phosphorylate its downstream target proteins. A phosphorylation network is essentially a causal network, which can be learned by causal inference algorithms. Prior efforts have applied such algorithms to data measuring protein phosphorylation levels, assuming that the phosphorylation levels represent protein activity states. However, the phosphorylation status of a kinase does not always reflect its activity state, because interventions such as inhibitors or mutations can directly affect its activity state without changing its phosphorylation status. Thus, when cellular systems are subjected to extensive perturbations, the statistical relationships between phosphorylation states of proteins may be disrupted, making it difficult to reconstruct the true protein phosphorylation network. Here, we describe a novel framework to address this challenge. RESULTS: We have developed a causal discovery framework that explicitly represents the activity state of each protein kinase as an unmeasured variable and developed a novel algorithm called “InferA” to infer the protein activity states, which allows us to incorporate the protein phosphorylation level, pharmacological interventions and prior knowledge. We applied our framework to simulated datasets and to a real-world dataset. The simulation experiments demonstrated that explicit representation of activity states of protein kinases allows one to effectively represent the impact of interventions and thus enabled our framework to accurately recover the ground-truth causal network. Results from the real-world dataset showed that the explicit representation of protein activity states allowed an effective and data-driven integration of the prior knowledge by InferA, which further leads to the recovery of a phosphorylation network that is more consistent with experiment results. CONCLUSIONS: Explicit representation of the protein activity states by our novel framework significantly enhances causal discovery of protein phosphorylation networks. BioMed Central 2020-09-17 /pmc/articles/PMC7496209/ /pubmed/32938361 http://dx.doi.org/10.1186/s12859-020-03676-2 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Liu, Jinling
Ma, Xiaojun
Cooper, Gregory F.
Lu, Xinghua
Explicit representation of protein activity states significantly improves causal discovery of protein phosphorylation networks
title Explicit representation of protein activity states significantly improves causal discovery of protein phosphorylation networks
title_full Explicit representation of protein activity states significantly improves causal discovery of protein phosphorylation networks
title_fullStr Explicit representation of protein activity states significantly improves causal discovery of protein phosphorylation networks
title_full_unstemmed Explicit representation of protein activity states significantly improves causal discovery of protein phosphorylation networks
title_short Explicit representation of protein activity states significantly improves causal discovery of protein phosphorylation networks
title_sort explicit representation of protein activity states significantly improves causal discovery of protein phosphorylation networks
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7496209/
https://www.ncbi.nlm.nih.gov/pubmed/32938361
http://dx.doi.org/10.1186/s12859-020-03676-2
work_keys_str_mv AT liujinling explicitrepresentationofproteinactivitystatessignificantlyimprovescausaldiscoveryofproteinphosphorylationnetworks
AT maxiaojun explicitrepresentationofproteinactivitystatessignificantlyimprovescausaldiscoveryofproteinphosphorylationnetworks
AT coopergregoryf explicitrepresentationofproteinactivitystatessignificantlyimprovescausaldiscoveryofproteinphosphorylationnetworks
AT luxinghua explicitrepresentationofproteinactivitystatessignificantlyimprovescausaldiscoveryofproteinphosphorylationnetworks