Cargando…

Online-Dynamic-Clustering-Based Soft Sensor for Industrial Semi-Supervised Data Streams

In the era of big data, industrial process data are often generated rapidly in the form of streams. Thus, how to process such sequential and high-speed stream data in real time and provide critical quality variable predictions has become a critical issue for facilitating efficient process control an...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yuechen, Jin, Huaiping, Chen, Xiangguang, Wang, Bin, Yang, Biao, Qian, Bin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9920279/
https://www.ncbi.nlm.nih.gov/pubmed/36772560
http://dx.doi.org/10.3390/s23031520
_version_ 1784887030875947008
author Wang, Yuechen
Jin, Huaiping
Chen, Xiangguang
Wang, Bin
Yang, Biao
Qian, Bin
author_facet Wang, Yuechen
Jin, Huaiping
Chen, Xiangguang
Wang, Bin
Yang, Biao
Qian, Bin
author_sort Wang, Yuechen
collection PubMed
description In the era of big data, industrial process data are often generated rapidly in the form of streams. Thus, how to process such sequential and high-speed stream data in real time and provide critical quality variable predictions has become a critical issue for facilitating efficient process control and monitoring in the process industry. Traditionally, soft sensor models are usually built through offline batch learning, which remain unchanged during the online implementation phase. Once the process state changes, soft sensors built from historical data cannot provide accurate predictions. In practice, industrial process data streams often exhibit characteristics such as nonlinearity, time-varying behavior, and label scarcity, which pose great challenges for building high-performance soft sensor models. To address this issue, an online-dynamic-clustering-based soft sensor (ODCSS) is proposed for industrial semi-supervised data streams. The method achieves automatic generation and update of clusters and samples deletion through online dynamic clustering, thus enabling online dynamic identification of process states. Meanwhile, selective ensemble learning and just-in-time learning (JITL) are employed through an adaptive switching prediction strategy, which enables dealing with gradual and abrupt changes in process characteristics and thus alleviates model performance degradation caused by concept drift. In addition, semi-supervised learning is introduced to exploit the information of unlabeled samples and obtain high-confidence pseudo-labeled samples to expand the labeled training set. The proposed method can effectively deal with nonlinearity, time-variability, and label scarcity issues in the process data stream environment and thus enable reliable target variable predictions. The application results from two case studies show that the proposed ODCSS soft sensor approach is superior to conventional soft sensors in a semi-supervised data stream environment.
format Online
Article
Text
id pubmed-9920279
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-99202792023-02-12 Online-Dynamic-Clustering-Based Soft Sensor for Industrial Semi-Supervised Data Streams Wang, Yuechen Jin, Huaiping Chen, Xiangguang Wang, Bin Yang, Biao Qian, Bin Sensors (Basel) Article In the era of big data, industrial process data are often generated rapidly in the form of streams. Thus, how to process such sequential and high-speed stream data in real time and provide critical quality variable predictions has become a critical issue for facilitating efficient process control and monitoring in the process industry. Traditionally, soft sensor models are usually built through offline batch learning, which remain unchanged during the online implementation phase. Once the process state changes, soft sensors built from historical data cannot provide accurate predictions. In practice, industrial process data streams often exhibit characteristics such as nonlinearity, time-varying behavior, and label scarcity, which pose great challenges for building high-performance soft sensor models. To address this issue, an online-dynamic-clustering-based soft sensor (ODCSS) is proposed for industrial semi-supervised data streams. The method achieves automatic generation and update of clusters and samples deletion through online dynamic clustering, thus enabling online dynamic identification of process states. Meanwhile, selective ensemble learning and just-in-time learning (JITL) are employed through an adaptive switching prediction strategy, which enables dealing with gradual and abrupt changes in process characteristics and thus alleviates model performance degradation caused by concept drift. In addition, semi-supervised learning is introduced to exploit the information of unlabeled samples and obtain high-confidence pseudo-labeled samples to expand the labeled training set. The proposed method can effectively deal with nonlinearity, time-variability, and label scarcity issues in the process data stream environment and thus enable reliable target variable predictions. The application results from two case studies show that the proposed ODCSS soft sensor approach is superior to conventional soft sensors in a semi-supervised data stream environment. MDPI 2023-01-30 /pmc/articles/PMC9920279/ /pubmed/36772560 http://dx.doi.org/10.3390/s23031520 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Yuechen
Jin, Huaiping
Chen, Xiangguang
Wang, Bin
Yang, Biao
Qian, Bin
Online-Dynamic-Clustering-Based Soft Sensor for Industrial Semi-Supervised Data Streams
title Online-Dynamic-Clustering-Based Soft Sensor for Industrial Semi-Supervised Data Streams
title_full Online-Dynamic-Clustering-Based Soft Sensor for Industrial Semi-Supervised Data Streams
title_fullStr Online-Dynamic-Clustering-Based Soft Sensor for Industrial Semi-Supervised Data Streams
title_full_unstemmed Online-Dynamic-Clustering-Based Soft Sensor for Industrial Semi-Supervised Data Streams
title_short Online-Dynamic-Clustering-Based Soft Sensor for Industrial Semi-Supervised Data Streams
title_sort online-dynamic-clustering-based soft sensor for industrial semi-supervised data streams
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9920279/
https://www.ncbi.nlm.nih.gov/pubmed/36772560
http://dx.doi.org/10.3390/s23031520
work_keys_str_mv AT wangyuechen onlinedynamicclusteringbasedsoftsensorforindustrialsemisuperviseddatastreams
AT jinhuaiping onlinedynamicclusteringbasedsoftsensorforindustrialsemisuperviseddatastreams
AT chenxiangguang onlinedynamicclusteringbasedsoftsensorforindustrialsemisuperviseddatastreams
AT wangbin onlinedynamicclusteringbasedsoftsensorforindustrialsemisuperviseddatastreams
AT yangbiao onlinedynamicclusteringbasedsoftsensorforindustrialsemisuperviseddatastreams
AT qianbin onlinedynamicclusteringbasedsoftsensorforindustrialsemisuperviseddatastreams