Cargando…

A Data-Driven Framework for Small Hydroelectric Plant Prognosis Using Tsfresh and Machine Learning Survival Models

Maintenance in small hydroelectric plants (SHPs) is essential for securing the expansion of clean energy sources and supplying the energy estimated to be required for the coming years. Identifying failures in SHPs before they happen is crucial for allowing better management of asset maintenance, low...

Descripción completa

Detalles Bibliográficos
Autores principales: de Santis, Rodrigo Barbosa, Gontijo, Tiago Silveira, Costa, Marcelo Azevedo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9824278/
https://www.ncbi.nlm.nih.gov/pubmed/36616612
http://dx.doi.org/10.3390/s23010012
_version_ 1784866370479980544
author de Santis, Rodrigo Barbosa
Gontijo, Tiago Silveira
Costa, Marcelo Azevedo
author_facet de Santis, Rodrigo Barbosa
Gontijo, Tiago Silveira
Costa, Marcelo Azevedo
author_sort de Santis, Rodrigo Barbosa
collection PubMed
description Maintenance in small hydroelectric plants (SHPs) is essential for securing the expansion of clean energy sources and supplying the energy estimated to be required for the coming years. Identifying failures in SHPs before they happen is crucial for allowing better management of asset maintenance, lowering operating costs, and enabling the expansion of renewable energy sources. Most fault prognosis models proposed thus far for hydroelectric generating units are based on signal decomposition and regression models. In the specific case of SHPs, there is a high occurrence of data being censored, since the operation is not consistently steady and can be repeatedly interrupted due to transmission problems or scarcity of water resources. To overcome this, we propose a two-step, data-driven framework for SHP prognosis based on time series feature engineering and survival modeling. We compared two different strategies for feature engineering: one using higher-order statistics and the other using the Tsfresh algorithm. We adjusted three machine learning survival models—CoxNet, survival random forests, and gradient boosting survival analysis—for estimating the concordance index of these approaches. The best model presented a significant concordance index of 77.44%. We further investigated and discussed the importance of the monitored sensors and the feature extraction aggregations. The kurtosis and variance were the most relevant aggregations in the higher-order statistics domain, while the fast Fourier transform and continuous wavelet transform were the most frequent transformations when using Tsfresh. The most important sensors were related to the temperature at several points, such as the bearing generator, oil hydraulic unit, and turbine radial bushing.
format Online
Article
Text
id pubmed-9824278
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-98242782023-01-08 A Data-Driven Framework for Small Hydroelectric Plant Prognosis Using Tsfresh and Machine Learning Survival Models de Santis, Rodrigo Barbosa Gontijo, Tiago Silveira Costa, Marcelo Azevedo Sensors (Basel) Article Maintenance in small hydroelectric plants (SHPs) is essential for securing the expansion of clean energy sources and supplying the energy estimated to be required for the coming years. Identifying failures in SHPs before they happen is crucial for allowing better management of asset maintenance, lowering operating costs, and enabling the expansion of renewable energy sources. Most fault prognosis models proposed thus far for hydroelectric generating units are based on signal decomposition and regression models. In the specific case of SHPs, there is a high occurrence of data being censored, since the operation is not consistently steady and can be repeatedly interrupted due to transmission problems or scarcity of water resources. To overcome this, we propose a two-step, data-driven framework for SHP prognosis based on time series feature engineering and survival modeling. We compared two different strategies for feature engineering: one using higher-order statistics and the other using the Tsfresh algorithm. We adjusted three machine learning survival models—CoxNet, survival random forests, and gradient boosting survival analysis—for estimating the concordance index of these approaches. The best model presented a significant concordance index of 77.44%. We further investigated and discussed the importance of the monitored sensors and the feature extraction aggregations. The kurtosis and variance were the most relevant aggregations in the higher-order statistics domain, while the fast Fourier transform and continuous wavelet transform were the most frequent transformations when using Tsfresh. The most important sensors were related to the temperature at several points, such as the bearing generator, oil hydraulic unit, and turbine radial bushing. MDPI 2022-12-20 /pmc/articles/PMC9824278/ /pubmed/36616612 http://dx.doi.org/10.3390/s23010012 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
de Santis, Rodrigo Barbosa
Gontijo, Tiago Silveira
Costa, Marcelo Azevedo
A Data-Driven Framework for Small Hydroelectric Plant Prognosis Using Tsfresh and Machine Learning Survival Models
title A Data-Driven Framework for Small Hydroelectric Plant Prognosis Using Tsfresh and Machine Learning Survival Models
title_full A Data-Driven Framework for Small Hydroelectric Plant Prognosis Using Tsfresh and Machine Learning Survival Models
title_fullStr A Data-Driven Framework for Small Hydroelectric Plant Prognosis Using Tsfresh and Machine Learning Survival Models
title_full_unstemmed A Data-Driven Framework for Small Hydroelectric Plant Prognosis Using Tsfresh and Machine Learning Survival Models
title_short A Data-Driven Framework for Small Hydroelectric Plant Prognosis Using Tsfresh and Machine Learning Survival Models
title_sort data-driven framework for small hydroelectric plant prognosis using tsfresh and machine learning survival models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9824278/
https://www.ncbi.nlm.nih.gov/pubmed/36616612
http://dx.doi.org/10.3390/s23010012
work_keys_str_mv AT desantisrodrigobarbosa adatadrivenframeworkforsmallhydroelectricplantprognosisusingtsfreshandmachinelearningsurvivalmodels
AT gontijotiagosilveira adatadrivenframeworkforsmallhydroelectricplantprognosisusingtsfreshandmachinelearningsurvivalmodels
AT costamarceloazevedo adatadrivenframeworkforsmallhydroelectricplantprognosisusingtsfreshandmachinelearningsurvivalmodels
AT desantisrodrigobarbosa datadrivenframeworkforsmallhydroelectricplantprognosisusingtsfreshandmachinelearningsurvivalmodels
AT gontijotiagosilveira datadrivenframeworkforsmallhydroelectricplantprognosisusingtsfreshandmachinelearningsurvivalmodels
AT costamarceloazevedo datadrivenframeworkforsmallhydroelectricplantprognosisusingtsfreshandmachinelearningsurvivalmodels