Cargando…

A multi-scale time-series dataset with benchmark for machine learning in decarbonized energy grids

The electric grid is a key enabling infrastructure for the ambitious transition towards carbon neutrality as we grapple with climate change. With deepening penetration of renewable resources, the reliable operation of the electric grid becomes increasingly challenging. In this paper, we present PSML...

Descripción completa

Detalles Bibliográficos
Autores principales: Zheng, Xiangtian, Xu, Nan, Trinh, Loc, Wu, Dongqi, Huang, Tong, Sivaranjani, S., Liu, Yan, Xie, Le
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9214688/
https://www.ncbi.nlm.nih.gov/pubmed/35732656
http://dx.doi.org/10.1038/s41597-022-01455-7
_version_ 1784731069781639168
author Zheng, Xiangtian
Xu, Nan
Trinh, Loc
Wu, Dongqi
Huang, Tong
Sivaranjani, S.
Liu, Yan
Xie, Le
author_facet Zheng, Xiangtian
Xu, Nan
Trinh, Loc
Wu, Dongqi
Huang, Tong
Sivaranjani, S.
Liu, Yan
Xie, Le
author_sort Zheng, Xiangtian
collection PubMed
description The electric grid is a key enabling infrastructure for the ambitious transition towards carbon neutrality as we grapple with climate change. With deepening penetration of renewable resources, the reliable operation of the electric grid becomes increasingly challenging. In this paper, we present PSML, a first-of-its-kind open-access multi-scale time-series dataset, to aid in the development of data-driven machine learning (ML)-based approaches towards reliable operation of future electric grids. The dataset is synthesized from a joint transmission and distribution electric grid to capture the increasingly important interactions and uncertainties of the grid dynamics, containing power, voltage and current measurements over multiple spatio-temporal scales. Using PSML, we provide state-of-the-art ML benchmarks on three challenging use cases of critical importance to achieve: (i) early detection, accurate classification and localization of dynamic disturbances; (ii) robust hierarchical forecasting of load and renewable energy; and (iii) realistic synthetic generation of physical-law-constrained measurements. We envision that this dataset will provide use-inspired ML research in safety-critical systems, while simultaneously enabling ML researchers to contribute towards decarbonization of energy sectors.
format Online
Article
Text
id pubmed-9214688
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-92146882022-06-22 A multi-scale time-series dataset with benchmark for machine learning in decarbonized energy grids Zheng, Xiangtian Xu, Nan Trinh, Loc Wu, Dongqi Huang, Tong Sivaranjani, S. Liu, Yan Xie, Le Sci Data Data Descriptor The electric grid is a key enabling infrastructure for the ambitious transition towards carbon neutrality as we grapple with climate change. With deepening penetration of renewable resources, the reliable operation of the electric grid becomes increasingly challenging. In this paper, we present PSML, a first-of-its-kind open-access multi-scale time-series dataset, to aid in the development of data-driven machine learning (ML)-based approaches towards reliable operation of future electric grids. The dataset is synthesized from a joint transmission and distribution electric grid to capture the increasingly important interactions and uncertainties of the grid dynamics, containing power, voltage and current measurements over multiple spatio-temporal scales. Using PSML, we provide state-of-the-art ML benchmarks on three challenging use cases of critical importance to achieve: (i) early detection, accurate classification and localization of dynamic disturbances; (ii) robust hierarchical forecasting of load and renewable energy; and (iii) realistic synthetic generation of physical-law-constrained measurements. We envision that this dataset will provide use-inspired ML research in safety-critical systems, while simultaneously enabling ML researchers to contribute towards decarbonization of energy sectors. Nature Publishing Group UK 2022-06-22 /pmc/articles/PMC9214688/ /pubmed/35732656 http://dx.doi.org/10.1038/s41597-022-01455-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Data Descriptor
Zheng, Xiangtian
Xu, Nan
Trinh, Loc
Wu, Dongqi
Huang, Tong
Sivaranjani, S.
Liu, Yan
Xie, Le
A multi-scale time-series dataset with benchmark for machine learning in decarbonized energy grids
title A multi-scale time-series dataset with benchmark for machine learning in decarbonized energy grids
title_full A multi-scale time-series dataset with benchmark for machine learning in decarbonized energy grids
title_fullStr A multi-scale time-series dataset with benchmark for machine learning in decarbonized energy grids
title_full_unstemmed A multi-scale time-series dataset with benchmark for machine learning in decarbonized energy grids
title_short A multi-scale time-series dataset with benchmark for machine learning in decarbonized energy grids
title_sort multi-scale time-series dataset with benchmark for machine learning in decarbonized energy grids
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9214688/
https://www.ncbi.nlm.nih.gov/pubmed/35732656
http://dx.doi.org/10.1038/s41597-022-01455-7
work_keys_str_mv AT zhengxiangtian amultiscaletimeseriesdatasetwithbenchmarkformachinelearningindecarbonizedenergygrids
AT xunan amultiscaletimeseriesdatasetwithbenchmarkformachinelearningindecarbonizedenergygrids
AT trinhloc amultiscaletimeseriesdatasetwithbenchmarkformachinelearningindecarbonizedenergygrids
AT wudongqi amultiscaletimeseriesdatasetwithbenchmarkformachinelearningindecarbonizedenergygrids
AT huangtong amultiscaletimeseriesdatasetwithbenchmarkformachinelearningindecarbonizedenergygrids
AT sivaranjanis amultiscaletimeseriesdatasetwithbenchmarkformachinelearningindecarbonizedenergygrids
AT liuyan amultiscaletimeseriesdatasetwithbenchmarkformachinelearningindecarbonizedenergygrids
AT xiele amultiscaletimeseriesdatasetwithbenchmarkformachinelearningindecarbonizedenergygrids
AT zhengxiangtian multiscaletimeseriesdatasetwithbenchmarkformachinelearningindecarbonizedenergygrids
AT xunan multiscaletimeseriesdatasetwithbenchmarkformachinelearningindecarbonizedenergygrids
AT trinhloc multiscaletimeseriesdatasetwithbenchmarkformachinelearningindecarbonizedenergygrids
AT wudongqi multiscaletimeseriesdatasetwithbenchmarkformachinelearningindecarbonizedenergygrids
AT huangtong multiscaletimeseriesdatasetwithbenchmarkformachinelearningindecarbonizedenergygrids
AT sivaranjanis multiscaletimeseriesdatasetwithbenchmarkformachinelearningindecarbonizedenergygrids
AT liuyan multiscaletimeseriesdatasetwithbenchmarkformachinelearningindecarbonizedenergygrids
AT xiele multiscaletimeseriesdatasetwithbenchmarkformachinelearningindecarbonizedenergygrids