Cargando…

Artificial Neural Networks, Sequence-to-Sequence LSTMs, and Exogenous Variables as Analytical Tools for NO(2) (Air Pollution) Forecasting: A Case Study in the Bay of Algeciras (Spain)

This study aims to produce accurate predictions of the NO(2) concentrations at a specific station of a monitoring network located in the Bay of Algeciras (Spain). Artificial neural networks (ANNs) and sequence-to-sequence long short-term memory networks (LSTMs) were used to create the forecasting mo...

Descripción completa

Detalles Bibliográficos
Autores principales: González-Enrique, Javier, Ruiz-Aguilar, Juan Jesús, Moscoso-López, José Antonio, Urda, Daniel, Deka, Lipika, Turias, Ignacio J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7961900/
https://www.ncbi.nlm.nih.gov/pubmed/33806409
http://dx.doi.org/10.3390/s21051770
_version_ 1783665359509782528
author González-Enrique, Javier
Ruiz-Aguilar, Juan Jesús
Moscoso-López, José Antonio
Urda, Daniel
Deka, Lipika
Turias, Ignacio J.
author_facet González-Enrique, Javier
Ruiz-Aguilar, Juan Jesús
Moscoso-López, José Antonio
Urda, Daniel
Deka, Lipika
Turias, Ignacio J.
author_sort González-Enrique, Javier
collection PubMed
description This study aims to produce accurate predictions of the NO(2) concentrations at a specific station of a monitoring network located in the Bay of Algeciras (Spain). Artificial neural networks (ANNs) and sequence-to-sequence long short-term memory networks (LSTMs) were used to create the forecasting models. Additionally, a new prediction method was proposed combining LSTMs using a rolling window scheme with a cross-validation procedure for time series (LSTM-CVT). Two different strategies were followed regarding the input variables: using NO(2) from the station or employing NO(2) and other pollutants data from any station of the network plus meteorological variables. The ANN and LSTM-CVT exogenous models used lagged datasets of different window sizes. Several feature ranking methods were used to select the top lagged variables and include them in the final exogenous datasets. Prediction horizons of t + 1, t + 4 and t + 8 were employed. The exogenous variables inclusion enhanced the model’s performance, especially for t + 4 (ρ ≈ 0.68 to ρ ≈ 0.74) and t + 8 (ρ ≈ 0.59 to ρ ≈ 0.66). The proposed LSTM-CVT method delivered promising results as the best performing models per prediction horizon employed this new methodology. Additionally, per each parameter combination, it obtained lower error values than ANNs in 85% of the cases.
format Online
Article
Text
id pubmed-7961900
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-79619002021-03-17 Artificial Neural Networks, Sequence-to-Sequence LSTMs, and Exogenous Variables as Analytical Tools for NO(2) (Air Pollution) Forecasting: A Case Study in the Bay of Algeciras (Spain) González-Enrique, Javier Ruiz-Aguilar, Juan Jesús Moscoso-López, José Antonio Urda, Daniel Deka, Lipika Turias, Ignacio J. Sensors (Basel) Article This study aims to produce accurate predictions of the NO(2) concentrations at a specific station of a monitoring network located in the Bay of Algeciras (Spain). Artificial neural networks (ANNs) and sequence-to-sequence long short-term memory networks (LSTMs) were used to create the forecasting models. Additionally, a new prediction method was proposed combining LSTMs using a rolling window scheme with a cross-validation procedure for time series (LSTM-CVT). Two different strategies were followed regarding the input variables: using NO(2) from the station or employing NO(2) and other pollutants data from any station of the network plus meteorological variables. The ANN and LSTM-CVT exogenous models used lagged datasets of different window sizes. Several feature ranking methods were used to select the top lagged variables and include them in the final exogenous datasets. Prediction horizons of t + 1, t + 4 and t + 8 were employed. The exogenous variables inclusion enhanced the model’s performance, especially for t + 4 (ρ ≈ 0.68 to ρ ≈ 0.74) and t + 8 (ρ ≈ 0.59 to ρ ≈ 0.66). The proposed LSTM-CVT method delivered promising results as the best performing models per prediction horizon employed this new methodology. Additionally, per each parameter combination, it obtained lower error values than ANNs in 85% of the cases. MDPI 2021-03-04 /pmc/articles/PMC7961900/ /pubmed/33806409 http://dx.doi.org/10.3390/s21051770 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
González-Enrique, Javier
Ruiz-Aguilar, Juan Jesús
Moscoso-López, José Antonio
Urda, Daniel
Deka, Lipika
Turias, Ignacio J.
Artificial Neural Networks, Sequence-to-Sequence LSTMs, and Exogenous Variables as Analytical Tools for NO(2) (Air Pollution) Forecasting: A Case Study in the Bay of Algeciras (Spain)
title Artificial Neural Networks, Sequence-to-Sequence LSTMs, and Exogenous Variables as Analytical Tools for NO(2) (Air Pollution) Forecasting: A Case Study in the Bay of Algeciras (Spain)
title_full Artificial Neural Networks, Sequence-to-Sequence LSTMs, and Exogenous Variables as Analytical Tools for NO(2) (Air Pollution) Forecasting: A Case Study in the Bay of Algeciras (Spain)
title_fullStr Artificial Neural Networks, Sequence-to-Sequence LSTMs, and Exogenous Variables as Analytical Tools for NO(2) (Air Pollution) Forecasting: A Case Study in the Bay of Algeciras (Spain)
title_full_unstemmed Artificial Neural Networks, Sequence-to-Sequence LSTMs, and Exogenous Variables as Analytical Tools for NO(2) (Air Pollution) Forecasting: A Case Study in the Bay of Algeciras (Spain)
title_short Artificial Neural Networks, Sequence-to-Sequence LSTMs, and Exogenous Variables as Analytical Tools for NO(2) (Air Pollution) Forecasting: A Case Study in the Bay of Algeciras (Spain)
title_sort artificial neural networks, sequence-to-sequence lstms, and exogenous variables as analytical tools for no(2) (air pollution) forecasting: a case study in the bay of algeciras (spain)
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7961900/
https://www.ncbi.nlm.nih.gov/pubmed/33806409
http://dx.doi.org/10.3390/s21051770
work_keys_str_mv AT gonzalezenriquejavier artificialneuralnetworkssequencetosequencelstmsandexogenousvariablesasanalyticaltoolsforno2airpollutionforecastingacasestudyinthebayofalgecirasspain
AT ruizaguilarjuanjesus artificialneuralnetworkssequencetosequencelstmsandexogenousvariablesasanalyticaltoolsforno2airpollutionforecastingacasestudyinthebayofalgecirasspain
AT moscosolopezjoseantonio artificialneuralnetworkssequencetosequencelstmsandexogenousvariablesasanalyticaltoolsforno2airpollutionforecastingacasestudyinthebayofalgecirasspain
AT urdadaniel artificialneuralnetworkssequencetosequencelstmsandexogenousvariablesasanalyticaltoolsforno2airpollutionforecastingacasestudyinthebayofalgecirasspain
AT dekalipika artificialneuralnetworkssequencetosequencelstmsandexogenousvariablesasanalyticaltoolsforno2airpollutionforecastingacasestudyinthebayofalgecirasspain
AT turiasignacioj artificialneuralnetworkssequencetosequencelstmsandexogenousvariablesasanalyticaltoolsforno2airpollutionforecastingacasestudyinthebayofalgecirasspain