Cargando…

Learning a Markov Logic network for supervised gene regulatory network inference

BACKGROUND: Gene regulatory network inference remains a challenging problem in systems biology despite the numerous approaches that have been proposed. When substantial knowledge on a gene regulatory network is already available, supervised network inference is appropriate. Such a method builds a bi...

Descripción completa

Detalles Bibliográficos
Autores principales: Brouard, Céline, Vrain, Christel, Dubois, Julie, Castel, David, Debily, Marie-Anne, d’Alché-Buc, Florence
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3849013/
https://www.ncbi.nlm.nih.gov/pubmed/24028533
http://dx.doi.org/10.1186/1471-2105-14-273
_version_ 1782293863258390528
author Brouard, Céline
Vrain, Christel
Dubois, Julie
Castel, David
Debily, Marie-Anne
d’Alché-Buc, Florence
author_facet Brouard, Céline
Vrain, Christel
Dubois, Julie
Castel, David
Debily, Marie-Anne
d’Alché-Buc, Florence
author_sort Brouard, Céline
collection PubMed
description BACKGROUND: Gene regulatory network inference remains a challenging problem in systems biology despite the numerous approaches that have been proposed. When substantial knowledge on a gene regulatory network is already available, supervised network inference is appropriate. Such a method builds a binary classifier able to assign a class (Regulation/No regulation) to an ordered pair of genes. Once learnt, the pairwise classifier can be used to predict new regulations. In this work, we explore the framework of Markov Logic Networks (MLN) that combine features of probabilistic graphical models with the expressivity of first-order logic rules. RESULTS: We propose to learn a Markov Logic network, e.g. a set of weighted rules that conclude on the predicate “regulates”, starting from a known gene regulatory network involved in the switch proliferation/differentiation of keratinocyte cells, a set of experimental transcriptomic data and various descriptions of genes all encoded into first-order logic. As training data are unbalanced, we use asymmetric bagging to learn a set of MLNs. The prediction of a new regulation can then be obtained by averaging predictions of individual MLNs. As a side contribution, we propose three in silico tests to assess the performance of any pairwise classifier in various network inference tasks on real datasets. A first test consists of measuring the average performance on balanced edge prediction problem; a second one deals with the ability of the classifier, once enhanced by asymmetric bagging, to update a given network. Finally our main result concerns a third test that measures the ability of the method to predict regulations with a new set of genes. As expected, MLN, when provided with only numerical discretized gene expression data, does not perform as well as a pairwise SVM in terms of AUPR. However, when a more complete description of gene properties is provided by heterogeneous sources, MLN achieves the same performance as a black-box model such as a pairwise SVM while providing relevant insights on the predictions. CONCLUSIONS: The numerical studies show that MLN achieves very good predictive performance while opening the door to some interpretability of the decisions. Besides the ability to suggest new regulations, such an approach allows to cross-validate experimental data with existing knowledge.
format Online
Article
Text
id pubmed-3849013
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38490132013-12-07 Learning a Markov Logic network for supervised gene regulatory network inference Brouard, Céline Vrain, Christel Dubois, Julie Castel, David Debily, Marie-Anne d’Alché-Buc, Florence BMC Bioinformatics Research Article BACKGROUND: Gene regulatory network inference remains a challenging problem in systems biology despite the numerous approaches that have been proposed. When substantial knowledge on a gene regulatory network is already available, supervised network inference is appropriate. Such a method builds a binary classifier able to assign a class (Regulation/No regulation) to an ordered pair of genes. Once learnt, the pairwise classifier can be used to predict new regulations. In this work, we explore the framework of Markov Logic Networks (MLN) that combine features of probabilistic graphical models with the expressivity of first-order logic rules. RESULTS: We propose to learn a Markov Logic network, e.g. a set of weighted rules that conclude on the predicate “regulates”, starting from a known gene regulatory network involved in the switch proliferation/differentiation of keratinocyte cells, a set of experimental transcriptomic data and various descriptions of genes all encoded into first-order logic. As training data are unbalanced, we use asymmetric bagging to learn a set of MLNs. The prediction of a new regulation can then be obtained by averaging predictions of individual MLNs. As a side contribution, we propose three in silico tests to assess the performance of any pairwise classifier in various network inference tasks on real datasets. A first test consists of measuring the average performance on balanced edge prediction problem; a second one deals with the ability of the classifier, once enhanced by asymmetric bagging, to update a given network. Finally our main result concerns a third test that measures the ability of the method to predict regulations with a new set of genes. As expected, MLN, when provided with only numerical discretized gene expression data, does not perform as well as a pairwise SVM in terms of AUPR. However, when a more complete description of gene properties is provided by heterogeneous sources, MLN achieves the same performance as a black-box model such as a pairwise SVM while providing relevant insights on the predictions. CONCLUSIONS: The numerical studies show that MLN achieves very good predictive performance while opening the door to some interpretability of the decisions. Besides the ability to suggest new regulations, such an approach allows to cross-validate experimental data with existing knowledge. BioMed Central 2013-09-12 /pmc/articles/PMC3849013/ /pubmed/24028533 http://dx.doi.org/10.1186/1471-2105-14-273 Text en Copyright © 2013 Brouard et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Brouard, Céline
Vrain, Christel
Dubois, Julie
Castel, David
Debily, Marie-Anne
d’Alché-Buc, Florence
Learning a Markov Logic network for supervised gene regulatory network inference
title Learning a Markov Logic network for supervised gene regulatory network inference
title_full Learning a Markov Logic network for supervised gene regulatory network inference
title_fullStr Learning a Markov Logic network for supervised gene regulatory network inference
title_full_unstemmed Learning a Markov Logic network for supervised gene regulatory network inference
title_short Learning a Markov Logic network for supervised gene regulatory network inference
title_sort learning a markov logic network for supervised gene regulatory network inference
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3849013/
https://www.ncbi.nlm.nih.gov/pubmed/24028533
http://dx.doi.org/10.1186/1471-2105-14-273
work_keys_str_mv AT brouardceline learningamarkovlogicnetworkforsupervisedgeneregulatorynetworkinference
AT vrainchristel learningamarkovlogicnetworkforsupervisedgeneregulatorynetworkinference
AT duboisjulie learningamarkovlogicnetworkforsupervisedgeneregulatorynetworkinference
AT casteldavid learningamarkovlogicnetworkforsupervisedgeneregulatorynetworkinference
AT debilymarieanne learningamarkovlogicnetworkforsupervisedgeneregulatorynetworkinference
AT dalchebucflorence learningamarkovlogicnetworkforsupervisedgeneregulatorynetworkinference