Cargando…
Simulation of electricity consumption data using multiple artificial intelligence models and cross validation techniques
Worldwide, electricity production exceeds its consumption which leads to wasted financial and energy resources. Machine learning models can be utilized to predict the future consumption to avoid these significant losses. This paper presents the data for the monthly electricity consumption on the com...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10661651/ https://www.ncbi.nlm.nih.gov/pubmed/38020440 http://dx.doi.org/10.1016/j.dib.2023.109718 |
_version_ | 1785138024286257152 |
---|---|
author | Hosny, Mariam Waraga, Omnia Abu Talib, Manar Abu Abdallah, Mohamed |
author_facet | Hosny, Mariam Waraga, Omnia Abu Talib, Manar Abu Abdallah, Mohamed |
author_sort | Hosny, Mariam |
collection | PubMed |
description | Worldwide, electricity production exceeds its consumption which leads to wasted financial and energy resources. Machine learning models can be utilized to predict the future consumption to avoid these significant losses. This paper presents the data for the monthly electricity consumption on the community level during May 2017–December 2019 in Dubai, United Arab Emirates. It was acquired from Dubai Pulse, an online repository containing consumption data from Dubai Electricity and Water Authority which provides utility services to the Emirate. Multiple parameters, such as population and number of buildings, were acquired from Dubai Statistics Center in addition to temperature which was obtained from Dubai International Airport. Additional features, such as expatriate ratio, number of customers, and building occupancy, were computed from the available data and utilized to generate a dataset towards accurate prediction. Various linear regression variants, support vector machines, decision tree models, ensemble models, and neural networks were implemented to forecast electricity consumption. The models were trained on two different formats of the same dataset, which were generated by sorting the data with respect to time, named as temporally ordered dataset, and by randomly dividing the data, labelled as randomly split dataset. In addition, the dependence of the models on the amount of data was identified by varying the size of the testing data. Moreover, two cross-validation (CV) procedures, namely rolling CV method and moving CV method, were applied to assess the reliability of the models. All analyses were evaluated by utilizing several performance metrics, namely root mean squared error, coefficient of determination, i.e., R(2), 10-fold CV score, mean absolute error, median absolute error, and computational time. Furthermore, this data could be utilized to analyze the effect of coronavirus disease 2019 (COVID-19) prevention measures in Dubai on electricity usage as well as evaluate the consumption patterns at the consumer level. |
format | Online Article Text |
id | pubmed-10661651 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-106616512023-10-24 Simulation of electricity consumption data using multiple artificial intelligence models and cross validation techniques Hosny, Mariam Waraga, Omnia Abu Talib, Manar Abu Abdallah, Mohamed Data Brief Data Article Worldwide, electricity production exceeds its consumption which leads to wasted financial and energy resources. Machine learning models can be utilized to predict the future consumption to avoid these significant losses. This paper presents the data for the monthly electricity consumption on the community level during May 2017–December 2019 in Dubai, United Arab Emirates. It was acquired from Dubai Pulse, an online repository containing consumption data from Dubai Electricity and Water Authority which provides utility services to the Emirate. Multiple parameters, such as population and number of buildings, were acquired from Dubai Statistics Center in addition to temperature which was obtained from Dubai International Airport. Additional features, such as expatriate ratio, number of customers, and building occupancy, were computed from the available data and utilized to generate a dataset towards accurate prediction. Various linear regression variants, support vector machines, decision tree models, ensemble models, and neural networks were implemented to forecast electricity consumption. The models were trained on two different formats of the same dataset, which were generated by sorting the data with respect to time, named as temporally ordered dataset, and by randomly dividing the data, labelled as randomly split dataset. In addition, the dependence of the models on the amount of data was identified by varying the size of the testing data. Moreover, two cross-validation (CV) procedures, namely rolling CV method and moving CV method, were applied to assess the reliability of the models. All analyses were evaluated by utilizing several performance metrics, namely root mean squared error, coefficient of determination, i.e., R(2), 10-fold CV score, mean absolute error, median absolute error, and computational time. Furthermore, this data could be utilized to analyze the effect of coronavirus disease 2019 (COVID-19) prevention measures in Dubai on electricity usage as well as evaluate the consumption patterns at the consumer level. Elsevier 2023-10-24 /pmc/articles/PMC10661651/ /pubmed/38020440 http://dx.doi.org/10.1016/j.dib.2023.109718 Text en © 2023 The Authors. Published by Elsevier Inc. https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Data Article Hosny, Mariam Waraga, Omnia Abu Talib, Manar Abu Abdallah, Mohamed Simulation of electricity consumption data using multiple artificial intelligence models and cross validation techniques |
title | Simulation of electricity consumption data using multiple artificial intelligence models and cross validation techniques |
title_full | Simulation of electricity consumption data using multiple artificial intelligence models and cross validation techniques |
title_fullStr | Simulation of electricity consumption data using multiple artificial intelligence models and cross validation techniques |
title_full_unstemmed | Simulation of electricity consumption data using multiple artificial intelligence models and cross validation techniques |
title_short | Simulation of electricity consumption data using multiple artificial intelligence models and cross validation techniques |
title_sort | simulation of electricity consumption data using multiple artificial intelligence models and cross validation techniques |
topic | Data Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10661651/ https://www.ncbi.nlm.nih.gov/pubmed/38020440 http://dx.doi.org/10.1016/j.dib.2023.109718 |
work_keys_str_mv | AT hosnymariam simulationofelectricityconsumptiondatausingmultipleartificialintelligencemodelsandcrossvalidationtechniques AT waragaomniaabu simulationofelectricityconsumptiondatausingmultipleartificialintelligencemodelsandcrossvalidationtechniques AT talibmanarabu simulationofelectricityconsumptiondatausingmultipleartificialintelligencemodelsandcrossvalidationtechniques AT abdallahmohamed simulationofelectricityconsumptiondatausingmultipleartificialintelligencemodelsandcrossvalidationtechniques |