Cargando…

A Pattern-Recognition-Based Ensemble Data Imputation Framework for Sensors from Building Energy Systems

Building operation data are important for monitoring, analysis, modeling, and control of building energy systems. However, missing data is one of the major data quality issues, making data imputation techniques become increasingly important. There are two key research gaps for missing sensor data im...

Descripción completa

Detalles Bibliográficos
Autor principal: Zhang, Liang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7590169/
https://www.ncbi.nlm.nih.gov/pubmed/33096719
http://dx.doi.org/10.3390/s20205947
_version_ 1783600746046947328
author Zhang, Liang
author_facet Zhang, Liang
author_sort Zhang, Liang
collection PubMed
description Building operation data are important for monitoring, analysis, modeling, and control of building energy systems. However, missing data is one of the major data quality issues, making data imputation techniques become increasingly important. There are two key research gaps for missing sensor data imputation in buildings: the lack of customized and automated imputation methodology, and the difficulty of the validation of data imputation methods. In this paper, a framework is developed to address these two gaps. First, a validation data generation module is developed based on pattern recognition to create a validation dataset to quantify the performance of data imputation methods. Second, a pool of data imputation methods is tested under the validation dataset to find an optimal single imputation method for each sensor, which is termed as an ensemble method. The method can reflect the specific mechanism and randomness of missing data from each sensor. The effectiveness of the framework is demonstrated by 18 sensors from a real campus building. The overall accuracy of data imputation for those sensors improves by 18.2% on average compared with the best single data imputation method.
format Online
Article
Text
id pubmed-7590169
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75901692020-10-29 A Pattern-Recognition-Based Ensemble Data Imputation Framework for Sensors from Building Energy Systems Zhang, Liang Sensors (Basel) Article Building operation data are important for monitoring, analysis, modeling, and control of building energy systems. However, missing data is one of the major data quality issues, making data imputation techniques become increasingly important. There are two key research gaps for missing sensor data imputation in buildings: the lack of customized and automated imputation methodology, and the difficulty of the validation of data imputation methods. In this paper, a framework is developed to address these two gaps. First, a validation data generation module is developed based on pattern recognition to create a validation dataset to quantify the performance of data imputation methods. Second, a pool of data imputation methods is tested under the validation dataset to find an optimal single imputation method for each sensor, which is termed as an ensemble method. The method can reflect the specific mechanism and randomness of missing data from each sensor. The effectiveness of the framework is demonstrated by 18 sensors from a real campus building. The overall accuracy of data imputation for those sensors improves by 18.2% on average compared with the best single data imputation method. MDPI 2020-10-21 /pmc/articles/PMC7590169/ /pubmed/33096719 http://dx.doi.org/10.3390/s20205947 Text en © 2020 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhang, Liang
A Pattern-Recognition-Based Ensemble Data Imputation Framework for Sensors from Building Energy Systems
title A Pattern-Recognition-Based Ensemble Data Imputation Framework for Sensors from Building Energy Systems
title_full A Pattern-Recognition-Based Ensemble Data Imputation Framework for Sensors from Building Energy Systems
title_fullStr A Pattern-Recognition-Based Ensemble Data Imputation Framework for Sensors from Building Energy Systems
title_full_unstemmed A Pattern-Recognition-Based Ensemble Data Imputation Framework for Sensors from Building Energy Systems
title_short A Pattern-Recognition-Based Ensemble Data Imputation Framework for Sensors from Building Energy Systems
title_sort pattern-recognition-based ensemble data imputation framework for sensors from building energy systems
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7590169/
https://www.ncbi.nlm.nih.gov/pubmed/33096719
http://dx.doi.org/10.3390/s20205947
work_keys_str_mv AT zhangliang apatternrecognitionbasedensembledataimputationframeworkforsensorsfrombuildingenergysystems
AT zhangliang patternrecognitionbasedensembledataimputationframeworkforsensorsfrombuildingenergysystems