Cargando…

PEnBayes: A Multi-Layered Ensemble Approach for Learning Bayesian Network Structure from Big Data

Discovering the Bayesian network (BN) structure from big datasets containing rich causal relationships is becoming increasingly valuable for modeling and reasoning under uncertainties in many areas with big data gathered from sensors due to high volume and fast veracity. Most of the current BN struc...

Descripción completa

Detalles Bibliográficos
Autores principales:	Tang, Yan, Wang, Jianwu, Nguyen, Mai, Altintas, Ilkay
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2019
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6832728/ https://www.ncbi.nlm.nih.gov/pubmed/31614544 http://dx.doi.org/10.3390/s19204400

_version_	1783466235987492864
author	Tang, Yan Wang, Jianwu Nguyen, Mai Altintas, Ilkay
author_facet	Tang, Yan Wang, Jianwu Nguyen, Mai Altintas, Ilkay
author_sort	Tang, Yan
collection	PubMed
description	Discovering the Bayesian network (BN) structure from big datasets containing rich causal relationships is becoming increasingly valuable for modeling and reasoning under uncertainties in many areas with big data gathered from sensors due to high volume and fast veracity. Most of the current BN structure learning algorithms have shortcomings facing big data. First, learning a BN structure from the entire big dataset is an expensive task which often ends in failure due to memory constraints. Second, it is quite difficult to select a learner from numerous BN structure learning algorithms to consistently achieve good learning accuracy. Lastly, there is a lack of an intelligent method that merges separately learned BN structures into a well structured BN network. To address these shortcomings, we introduce a novel parallel learning approach called PEnBayes (Parallel Ensemble-based Bayesian network learning). PEnBayes starts with an adaptive data preprocessing phase that calculates the Appropriate Learning Size and intelligently divides a big dataset for fast distributed local structure learning. Then, PEnBayes learns a collection of local BN Structures in parallel using a two-layered weighted adjacent matrix-based structure ensemble method. Lastly, PEnBayes merges the local BN Structures into a global network structure using the structure ensemble method at the global layer. For the experiment, we generate big data sets by simulating sensor data from patient monitoring, transportation, and disease diagnosis domains. The Experimental results show that PEnBayes achieves a significantly improved execution performance with more consistent and stable results compared with three baseline learning algorithms.
format	Online Article Text
id	pubmed-6832728
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-68327282019-11-25 PEnBayes: A Multi-Layered Ensemble Approach for Learning Bayesian Network Structure from Big Data Tang, Yan Wang, Jianwu Nguyen, Mai Altintas, Ilkay Sensors (Basel) Article Discovering the Bayesian network (BN) structure from big datasets containing rich causal relationships is becoming increasingly valuable for modeling and reasoning under uncertainties in many areas with big data gathered from sensors due to high volume and fast veracity. Most of the current BN structure learning algorithms have shortcomings facing big data. First, learning a BN structure from the entire big dataset is an expensive task which often ends in failure due to memory constraints. Second, it is quite difficult to select a learner from numerous BN structure learning algorithms to consistently achieve good learning accuracy. Lastly, there is a lack of an intelligent method that merges separately learned BN structures into a well structured BN network. To address these shortcomings, we introduce a novel parallel learning approach called PEnBayes (Parallel Ensemble-based Bayesian network learning). PEnBayes starts with an adaptive data preprocessing phase that calculates the Appropriate Learning Size and intelligently divides a big dataset for fast distributed local structure learning. Then, PEnBayes learns a collection of local BN Structures in parallel using a two-layered weighted adjacent matrix-based structure ensemble method. Lastly, PEnBayes merges the local BN Structures into a global network structure using the structure ensemble method at the global layer. For the experiment, we generate big data sets by simulating sensor data from patient monitoring, transportation, and disease diagnosis domains. The Experimental results show that PEnBayes achieves a significantly improved execution performance with more consistent and stable results compared with three baseline learning algorithms. MDPI 2019-10-11 /pmc/articles/PMC6832728/ /pubmed/31614544 http://dx.doi.org/10.3390/s19204400 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Tang, Yan Wang, Jianwu Nguyen, Mai Altintas, Ilkay PEnBayes: A Multi-Layered Ensemble Approach for Learning Bayesian Network Structure from Big Data
title	PEnBayes: A Multi-Layered Ensemble Approach for Learning Bayesian Network Structure from Big Data
title_full	PEnBayes: A Multi-Layered Ensemble Approach for Learning Bayesian Network Structure from Big Data
title_fullStr	PEnBayes: A Multi-Layered Ensemble Approach for Learning Bayesian Network Structure from Big Data
title_full_unstemmed	PEnBayes: A Multi-Layered Ensemble Approach for Learning Bayesian Network Structure from Big Data
title_short	PEnBayes: A Multi-Layered Ensemble Approach for Learning Bayesian Network Structure from Big Data
title_sort	penbayes: a multi-layered ensemble approach for learning bayesian network structure from big data
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6832728/ https://www.ncbi.nlm.nih.gov/pubmed/31614544 http://dx.doi.org/10.3390/s19204400
work_keys_str_mv	AT tangyan penbayesamultilayeredensembleapproachforlearningbayesiannetworkstructurefrombigdata AT wangjianwu penbayesamultilayeredensembleapproachforlearningbayesiannetworkstructurefrombigdata AT nguyenmai penbayesamultilayeredensembleapproachforlearningbayesiannetworkstructurefrombigdata AT altintasilkay penbayesamultilayeredensembleapproachforlearningbayesiannetworkstructurefrombigdata

PEnBayes: A Multi-Layered Ensemble Approach for Learning Bayesian Network Structure from Big Data

Ejemplares similares