Cargando…

Approximate Learning of High Dimensional Bayesian Network Structures via Pruning of Candidate Parent Sets

Score-based algorithms that learn Bayesian Network (BN) structures provide solutions ranging from different levels of approximate learning to exact learning. Approximate solutions exist because exact learning is generally not applicable to networks of moderate or higher complexity. In general, appro...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Zhigao, Constantinou, Anthony C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7597292/
https://www.ncbi.nlm.nih.gov/pubmed/33286911
http://dx.doi.org/10.3390/e22101142
_version_ 1783602314057089024
author Guo, Zhigao
Constantinou, Anthony C.
author_facet Guo, Zhigao
Constantinou, Anthony C.
author_sort Guo, Zhigao
collection PubMed
description Score-based algorithms that learn Bayesian Network (BN) structures provide solutions ranging from different levels of approximate learning to exact learning. Approximate solutions exist because exact learning is generally not applicable to networks of moderate or higher complexity. In general, approximate solutions tend to sacrifice accuracy for speed, where the aim is to minimise the loss in accuracy and maximise the gain in speed. While some approximate algorithms are optimised to handle thousands of variables, these algorithms may still be unable to learn such high dimensional structures. Some of the most efficient score-based algorithms cast the structure learning problem as a combinatorial optimisation of candidate parent sets. This paper explores a strategy towards pruning the size of candidate parent sets, and which could form part of existing score-based algorithms as an additional pruning phase aimed at high dimensionality problems. The results illustrate how different levels of pruning affect the learning speed relative to the loss in accuracy in terms of model fitting, and show that aggressive pruning may be required to produce approximate solutions for high complexity problems.
format Online
Article
Text
id pubmed-7597292
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75972922020-11-09 Approximate Learning of High Dimensional Bayesian Network Structures via Pruning of Candidate Parent Sets Guo, Zhigao Constantinou, Anthony C. Entropy (Basel) Article Score-based algorithms that learn Bayesian Network (BN) structures provide solutions ranging from different levels of approximate learning to exact learning. Approximate solutions exist because exact learning is generally not applicable to networks of moderate or higher complexity. In general, approximate solutions tend to sacrifice accuracy for speed, where the aim is to minimise the loss in accuracy and maximise the gain in speed. While some approximate algorithms are optimised to handle thousands of variables, these algorithms may still be unable to learn such high dimensional structures. Some of the most efficient score-based algorithms cast the structure learning problem as a combinatorial optimisation of candidate parent sets. This paper explores a strategy towards pruning the size of candidate parent sets, and which could form part of existing score-based algorithms as an additional pruning phase aimed at high dimensionality problems. The results illustrate how different levels of pruning affect the learning speed relative to the loss in accuracy in terms of model fitting, and show that aggressive pruning may be required to produce approximate solutions for high complexity problems. MDPI 2020-10-10 /pmc/articles/PMC7597292/ /pubmed/33286911 http://dx.doi.org/10.3390/e22101142 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Guo, Zhigao
Constantinou, Anthony C.
Approximate Learning of High Dimensional Bayesian Network Structures via Pruning of Candidate Parent Sets
title Approximate Learning of High Dimensional Bayesian Network Structures via Pruning of Candidate Parent Sets
title_full Approximate Learning of High Dimensional Bayesian Network Structures via Pruning of Candidate Parent Sets
title_fullStr Approximate Learning of High Dimensional Bayesian Network Structures via Pruning of Candidate Parent Sets
title_full_unstemmed Approximate Learning of High Dimensional Bayesian Network Structures via Pruning of Candidate Parent Sets
title_short Approximate Learning of High Dimensional Bayesian Network Structures via Pruning of Candidate Parent Sets
title_sort approximate learning of high dimensional bayesian network structures via pruning of candidate parent sets
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7597292/
https://www.ncbi.nlm.nih.gov/pubmed/33286911
http://dx.doi.org/10.3390/e22101142
work_keys_str_mv AT guozhigao approximatelearningofhighdimensionalbayesiannetworkstructuresviapruningofcandidateparentsets
AT constantinouanthonyc approximatelearningofhighdimensionalbayesiannetworkstructuresviapruningofcandidateparentsets