Cargando…

Estimation of missing data of monthly rainfall in southwestern Colombia using artificial neural networks

The success of many projects linked to the management and planning of water resources depends mainly on the quality of the climatic and hydrological data that is provided. Nevertheless, the missing data are frequently found in hydroclimatic variables due to measuring instrument failures, observation...

Descripción completa

Detalles Bibliográficos
Autores principales: Canchala-Nastar, Teresita, Carvajal-Escobar, Yesid, Alfonso-Morales, Wilfredo, Loaiza Cerón, Wilmar, Caicedo, Eduardo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6811873/
https://www.ncbi.nlm.nih.gov/pubmed/31667280
http://dx.doi.org/10.1016/j.dib.2019.104517
_version_ 1783462550411673600
author Canchala-Nastar, Teresita
Carvajal-Escobar, Yesid
Alfonso-Morales, Wilfredo
Loaiza Cerón, Wilmar
Caicedo, Eduardo
author_facet Canchala-Nastar, Teresita
Carvajal-Escobar, Yesid
Alfonso-Morales, Wilfredo
Loaiza Cerón, Wilmar
Caicedo, Eduardo
author_sort Canchala-Nastar, Teresita
collection PubMed
description The success of many projects linked to the management and planning of water resources depends mainly on the quality of the climatic and hydrological data that is provided. Nevertheless, the missing data are frequently found in hydroclimatic variables due to measuring instrument failures, observation recording errors, meteorological extremes, and the challenges associated with accessing measurement areas. Hence, it is necessary to apply an appropriate fill of missing data before any analysis. This paper is intended to present the filling of missing data of monthly rainfall of 45 gauge stations located in southwestern Colombia. The series analyzed covers 34 years of observations between 1983 and 2016, available from the Instituto de Hidrología, Meteorología y Estudios Ambientales (IDEAM). The estimation of missing data was done using Non-linear Principal Component Analysis (NLPCA); a non-linear generalization of the standard Principal Component Analysis Method via an Artificial Neural Networks (ANN) approach. The best result was obtained using a network with a [45−44−45] architecture. The estimated mean squared error in the imputation of missing data was approximately 9.8 mm. month(−1), showing that the NLPCA approach constitutes a powerful methodology in the imputation of missing rainfall data. The estimated rainfall dataset helps reduce uncertainty for further studies related to homogeneity analyses, conglomerates, trends, multivariate statistics and meteorological forecasts in regions with information deficits such as southwestern Colombia.
format Online
Article
Text
id pubmed-6811873
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-68118732019-10-30 Estimation of missing data of monthly rainfall in southwestern Colombia using artificial neural networks Canchala-Nastar, Teresita Carvajal-Escobar, Yesid Alfonso-Morales, Wilfredo Loaiza Cerón, Wilmar Caicedo, Eduardo Data Brief Computer Science The success of many projects linked to the management and planning of water resources depends mainly on the quality of the climatic and hydrological data that is provided. Nevertheless, the missing data are frequently found in hydroclimatic variables due to measuring instrument failures, observation recording errors, meteorological extremes, and the challenges associated with accessing measurement areas. Hence, it is necessary to apply an appropriate fill of missing data before any analysis. This paper is intended to present the filling of missing data of monthly rainfall of 45 gauge stations located in southwestern Colombia. The series analyzed covers 34 years of observations between 1983 and 2016, available from the Instituto de Hidrología, Meteorología y Estudios Ambientales (IDEAM). The estimation of missing data was done using Non-linear Principal Component Analysis (NLPCA); a non-linear generalization of the standard Principal Component Analysis Method via an Artificial Neural Networks (ANN) approach. The best result was obtained using a network with a [45−44−45] architecture. The estimated mean squared error in the imputation of missing data was approximately 9.8 mm. month(−1), showing that the NLPCA approach constitutes a powerful methodology in the imputation of missing rainfall data. The estimated rainfall dataset helps reduce uncertainty for further studies related to homogeneity analyses, conglomerates, trends, multivariate statistics and meteorological forecasts in regions with information deficits such as southwestern Colombia. Elsevier 2019-09-14 /pmc/articles/PMC6811873/ /pubmed/31667280 http://dx.doi.org/10.1016/j.dib.2019.104517 Text en © 2019 The Author(s) http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Computer Science
Canchala-Nastar, Teresita
Carvajal-Escobar, Yesid
Alfonso-Morales, Wilfredo
Loaiza Cerón, Wilmar
Caicedo, Eduardo
Estimation of missing data of monthly rainfall in southwestern Colombia using artificial neural networks
title Estimation of missing data of monthly rainfall in southwestern Colombia using artificial neural networks
title_full Estimation of missing data of monthly rainfall in southwestern Colombia using artificial neural networks
title_fullStr Estimation of missing data of monthly rainfall in southwestern Colombia using artificial neural networks
title_full_unstemmed Estimation of missing data of monthly rainfall in southwestern Colombia using artificial neural networks
title_short Estimation of missing data of monthly rainfall in southwestern Colombia using artificial neural networks
title_sort estimation of missing data of monthly rainfall in southwestern colombia using artificial neural networks
topic Computer Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6811873/
https://www.ncbi.nlm.nih.gov/pubmed/31667280
http://dx.doi.org/10.1016/j.dib.2019.104517
work_keys_str_mv AT canchalanastarteresita estimationofmissingdataofmonthlyrainfallinsouthwesterncolombiausingartificialneuralnetworks
AT carvajalescobaryesid estimationofmissingdataofmonthlyrainfallinsouthwesterncolombiausingartificialneuralnetworks
AT alfonsomoraleswilfredo estimationofmissingdataofmonthlyrainfallinsouthwesterncolombiausingartificialneuralnetworks
AT loaizaceronwilmar estimationofmissingdataofmonthlyrainfallinsouthwesterncolombiausingartificialneuralnetworks
AT caicedoeduardo estimationofmissingdataofmonthlyrainfallinsouthwesterncolombiausingartificialneuralnetworks