Cargando…

Pseudo-Labeling Optimization Based Ensemble Semi-Supervised Soft Sensor in the Process Industry

Nowadays, soft sensor techniques have become promising solutions for enabling real-time estimation of difficult-to-measure quality variables in industrial processes. However, labeled data are often scarce in many real-world applications, which poses a significant challenge when building accurate sof...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Youwei, Jin, Huaiping, Dong, Shoulong, Yang, Biao, Chen, Xiangguang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8708742/
https://www.ncbi.nlm.nih.gov/pubmed/34960564
http://dx.doi.org/10.3390/s21248471
_version_ 1784622761713336320
author Li, Youwei
Jin, Huaiping
Dong, Shoulong
Yang, Biao
Chen, Xiangguang
author_facet Li, Youwei
Jin, Huaiping
Dong, Shoulong
Yang, Biao
Chen, Xiangguang
author_sort Li, Youwei
collection PubMed
description Nowadays, soft sensor techniques have become promising solutions for enabling real-time estimation of difficult-to-measure quality variables in industrial processes. However, labeled data are often scarce in many real-world applications, which poses a significant challenge when building accurate soft sensor models. Therefore, this paper proposes a novel semi-supervised soft sensor method, referred to as ensemble semi-supervised negative correlation learning extreme learning machine (EnSSNCLELM), for industrial processes with limited labeled data. First, an improved supervised regression algorithm called NCLELM is developed, by integrating the philosophy of negative correlation learning into extreme learning machine (ELM). Then, with NCLELM as the base learning technique, a multi-learner pseudo-labeling optimization approach is proposed, by converting the estimation of pseudo labels as an explicit optimization problem, in order to obtain high-confidence pseudo-labeled data. Furthermore, a set of diverse semi-supervised NCLELM models (SSNCLELM) are developed from different enlarged labeled sets, which are obtained by combining the labeled and pseudo-labeled training data. Finally, those SSNCLELM models whose prediction accuracies were not worse than their supervised counterparts were combined using a stacking strategy. The proposed method can not only exploit both labeled and unlabeled data, but also combine the merits of semi-supervised and ensemble learning paradigms, thereby providing superior predictions over traditional supervised and semi-supervised soft sensor methods. The effectiveness and superiority of the proposed method were demonstrated through two chemical applications.
format Online
Article
Text
id pubmed-8708742
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-87087422021-12-25 Pseudo-Labeling Optimization Based Ensemble Semi-Supervised Soft Sensor in the Process Industry Li, Youwei Jin, Huaiping Dong, Shoulong Yang, Biao Chen, Xiangguang Sensors (Basel) Article Nowadays, soft sensor techniques have become promising solutions for enabling real-time estimation of difficult-to-measure quality variables in industrial processes. However, labeled data are often scarce in many real-world applications, which poses a significant challenge when building accurate soft sensor models. Therefore, this paper proposes a novel semi-supervised soft sensor method, referred to as ensemble semi-supervised negative correlation learning extreme learning machine (EnSSNCLELM), for industrial processes with limited labeled data. First, an improved supervised regression algorithm called NCLELM is developed, by integrating the philosophy of negative correlation learning into extreme learning machine (ELM). Then, with NCLELM as the base learning technique, a multi-learner pseudo-labeling optimization approach is proposed, by converting the estimation of pseudo labels as an explicit optimization problem, in order to obtain high-confidence pseudo-labeled data. Furthermore, a set of diverse semi-supervised NCLELM models (SSNCLELM) are developed from different enlarged labeled sets, which are obtained by combining the labeled and pseudo-labeled training data. Finally, those SSNCLELM models whose prediction accuracies were not worse than their supervised counterparts were combined using a stacking strategy. The proposed method can not only exploit both labeled and unlabeled data, but also combine the merits of semi-supervised and ensemble learning paradigms, thereby providing superior predictions over traditional supervised and semi-supervised soft sensor methods. The effectiveness and superiority of the proposed method were demonstrated through two chemical applications. MDPI 2021-12-19 /pmc/articles/PMC8708742/ /pubmed/34960564 http://dx.doi.org/10.3390/s21248471 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Li, Youwei
Jin, Huaiping
Dong, Shoulong
Yang, Biao
Chen, Xiangguang
Pseudo-Labeling Optimization Based Ensemble Semi-Supervised Soft Sensor in the Process Industry
title Pseudo-Labeling Optimization Based Ensemble Semi-Supervised Soft Sensor in the Process Industry
title_full Pseudo-Labeling Optimization Based Ensemble Semi-Supervised Soft Sensor in the Process Industry
title_fullStr Pseudo-Labeling Optimization Based Ensemble Semi-Supervised Soft Sensor in the Process Industry
title_full_unstemmed Pseudo-Labeling Optimization Based Ensemble Semi-Supervised Soft Sensor in the Process Industry
title_short Pseudo-Labeling Optimization Based Ensemble Semi-Supervised Soft Sensor in the Process Industry
title_sort pseudo-labeling optimization based ensemble semi-supervised soft sensor in the process industry
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8708742/
https://www.ncbi.nlm.nih.gov/pubmed/34960564
http://dx.doi.org/10.3390/s21248471
work_keys_str_mv AT liyouwei pseudolabelingoptimizationbasedensemblesemisupervisedsoftsensorintheprocessindustry
AT jinhuaiping pseudolabelingoptimizationbasedensemblesemisupervisedsoftsensorintheprocessindustry
AT dongshoulong pseudolabelingoptimizationbasedensemblesemisupervisedsoftsensorintheprocessindustry
AT yangbiao pseudolabelingoptimizationbasedensemblesemisupervisedsoftsensorintheprocessindustry
AT chenxiangguang pseudolabelingoptimizationbasedensemblesemisupervisedsoftsensorintheprocessindustry