Cargando…

Tunable structure priors for Bayesian rule learning for knowledge integrated biomarker discovery

AIM: To develop a framework to incorporate background domain knowledge into classification rule learning for knowledge discovery in biomedicine. METHODS: Bayesian rule learning (BRL) is a rule-based classifier that uses a greedy best-first search over a space of Bayesian belief-networks (BN) to find...

Descripción completa

Detalles Bibliográficos
Autores principales:	Balasubramanian, Jeya Balaji, Gopalakrishnan, Vanathi
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Baishideng Publishing Group Inc 2018
Materias:	Basic Study
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6153126/ https://www.ncbi.nlm.nih.gov/pubmed/30254965 http://dx.doi.org/10.5306/wjco.v9.i5.98

_version_	1783357463339204608
author	Balasubramanian, Jeya Balaji Gopalakrishnan, Vanathi
author_facet	Balasubramanian, Jeya Balaji Gopalakrishnan, Vanathi
author_sort	Balasubramanian, Jeya Balaji
collection	PubMed
description	AIM: To develop a framework to incorporate background domain knowledge into classification rule learning for knowledge discovery in biomedicine. METHODS: Bayesian rule learning (BRL) is a rule-based classifier that uses a greedy best-first search over a space of Bayesian belief-networks (BN) to find the optimal BN to explain the input dataset, and then infers classification rules from this BN. BRL uses a Bayesian score to evaluate the quality of BNs. In this paper, we extended the Bayesian score to include informative structure priors, which encodes our prior domain knowledge about the dataset. We call this extension of BRL as BRL(p). The structure prior has a λ hyperparameter that allows the user to tune the degree of incorporation of the prior knowledge in the model learning process. We studied the effect of λ on model learning using a simulated dataset and a real-world lung cancer prognostic biomarker dataset, by measuring the degree of incorporation of our specified prior knowledge. We also monitored its effect on the model predictive performance. Finally, we compared BRL(p) to other state-of-the-art classifiers commonly used in biomedicine. RESULTS: We evaluated the degree of incorporation of prior knowledge into BRL(p), with simulated data by measuring the Graph Edit Distance between the true data-generating model and the model learned by BRL(p). We specified the true model using informative structure priors. We observed that by increasing the value of λ we were able to increase the influence of the specified structure priors on model learning. A large value of λ of BRL(p) caused it to return the true model. This also led to a gain in predictive performance measured by area under the receiver operator characteristic curve (AUC). We then obtained a publicly available real-world lung cancer prognostic biomarker dataset and specified a known biomarker from literature [the epidermal growth factor receptor (EGFR) gene]. We again observed that larger values of λ led to an increased incorporation of EGFR into the final BRL(p) model. This relevant background knowledge also led to a gain in AUC. CONCLUSION: BRL(p) enables tunable structure priors to be incorporated during Bayesian classification rule learning that integrates data and knowledge as demonstrated using lung cancer biomarker data.
format	Online Article Text
id	pubmed-6153126
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Baishideng Publishing Group Inc
record_format	MEDLINE/PubMed
spelling	pubmed-61531262018-09-25 Tunable structure priors for Bayesian rule learning for knowledge integrated biomarker discovery Balasubramanian, Jeya Balaji Gopalakrishnan, Vanathi World J Clin Oncol Basic Study AIM: To develop a framework to incorporate background domain knowledge into classification rule learning for knowledge discovery in biomedicine. METHODS: Bayesian rule learning (BRL) is a rule-based classifier that uses a greedy best-first search over a space of Bayesian belief-networks (BN) to find the optimal BN to explain the input dataset, and then infers classification rules from this BN. BRL uses a Bayesian score to evaluate the quality of BNs. In this paper, we extended the Bayesian score to include informative structure priors, which encodes our prior domain knowledge about the dataset. We call this extension of BRL as BRL(p). The structure prior has a λ hyperparameter that allows the user to tune the degree of incorporation of the prior knowledge in the model learning process. We studied the effect of λ on model learning using a simulated dataset and a real-world lung cancer prognostic biomarker dataset, by measuring the degree of incorporation of our specified prior knowledge. We also monitored its effect on the model predictive performance. Finally, we compared BRL(p) to other state-of-the-art classifiers commonly used in biomedicine. RESULTS: We evaluated the degree of incorporation of prior knowledge into BRL(p), with simulated data by measuring the Graph Edit Distance between the true data-generating model and the model learned by BRL(p). We specified the true model using informative structure priors. We observed that by increasing the value of λ we were able to increase the influence of the specified structure priors on model learning. A large value of λ of BRL(p) caused it to return the true model. This also led to a gain in predictive performance measured by area under the receiver operator characteristic curve (AUC). We then obtained a publicly available real-world lung cancer prognostic biomarker dataset and specified a known biomarker from literature [the epidermal growth factor receptor (EGFR) gene]. We again observed that larger values of λ led to an increased incorporation of EGFR into the final BRL(p) model. This relevant background knowledge also led to a gain in AUC. CONCLUSION: BRL(p) enables tunable structure priors to be incorporated during Bayesian classification rule learning that integrates data and knowledge as demonstrated using lung cancer biomarker data. Baishideng Publishing Group Inc 2018-09-14 2018-09-14 /pmc/articles/PMC6153126/ /pubmed/30254965 http://dx.doi.org/10.5306/wjco.v9.i5.98 Text en ©The Author(s) 2018. Published by Baishideng Publishing Group Inc. All rights reserved. http://creativecommons.org/licenses/by-nc/4.0/ This article is an open-access article which was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial.
spellingShingle	Basic Study Balasubramanian, Jeya Balaji Gopalakrishnan, Vanathi Tunable structure priors for Bayesian rule learning for knowledge integrated biomarker discovery
title	Tunable structure priors for Bayesian rule learning for knowledge integrated biomarker discovery
title_full	Tunable structure priors for Bayesian rule learning for knowledge integrated biomarker discovery
title_fullStr	Tunable structure priors for Bayesian rule learning for knowledge integrated biomarker discovery
title_full_unstemmed	Tunable structure priors for Bayesian rule learning for knowledge integrated biomarker discovery
title_short	Tunable structure priors for Bayesian rule learning for knowledge integrated biomarker discovery
title_sort	tunable structure priors for bayesian rule learning for knowledge integrated biomarker discovery
topic	Basic Study
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6153126/ https://www.ncbi.nlm.nih.gov/pubmed/30254965 http://dx.doi.org/10.5306/wjco.v9.i5.98
work_keys_str_mv	AT balasubramanianjeyabalaji tunablestructurepriorsforbayesianrulelearningforknowledgeintegratedbiomarkerdiscovery AT gopalakrishnanvanathi tunablestructurepriorsforbayesianrulelearningforknowledgeintegratedbiomarkerdiscovery

Tunable structure priors for Bayesian rule learning for knowledge integrated biomarker discovery

Ejemplares similares