Cargando…

A Data Augmentation-Based Evaluation System for Regional Direct Economic Losses of Storm Surge Disasters

The accurate prediction of storm surge disasters’ direct economic losses plays a positive role in providing critical support for disaster prevention decision-making and management. Previous researches on storm surge disaster loss assessment did not pay much attention to the overfitting phenomenon ca...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Hai, Wang, Jin, Ye, Wentao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7999314/
https://www.ncbi.nlm.nih.gov/pubmed/33809216
http://dx.doi.org/10.3390/ijerph18062918
_version_ 1783670753856585728
author Sun, Hai
Wang, Jin
Ye, Wentao
author_facet Sun, Hai
Wang, Jin
Ye, Wentao
author_sort Sun, Hai
collection PubMed
description The accurate prediction of storm surge disasters’ direct economic losses plays a positive role in providing critical support for disaster prevention decision-making and management. Previous researches on storm surge disaster loss assessment did not pay much attention to the overfitting phenomenon caused by the data scarcity and the excessive model complexity. To solve these problems, this paper puts forward a new evaluation system for forecasting the regional direct economic loss of storm surge disasters, consisting of three parts. First of all, a comprehensive assessment index system was established by considering the storm surge disasters’ formation mechanism and the corresponding risk management theory. Secondly, a novel data augmentation technique, k-nearest neighbor-Gaussian noise (KNN-GN), was presented to overcome data scarcity. Thirdly, an ensemble learning algorithm XGBoost as a regression model was utilized to optimize the results and produce the final forecasting results. To verify the best-combined model, KNN-GN-based XGBoost, we conducted cross-contrast experiments with several data augmentation techniques and some widely-used ensemble learning models. Meanwhile, the traditional prediction models are used as baselines to the optimized forecasting system. The experimental results show that the KNN-GN-based XGBoost model provides more precise predictions than the traditional models, with a 64.1% average improvement in the mean absolute percentage error (MAPE) measurement. It could be noted that the proposed evaluation system can be extended and applied to the geography-related field as well.
format Online
Article
Text
id pubmed-7999314
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-79993142021-03-28 A Data Augmentation-Based Evaluation System for Regional Direct Economic Losses of Storm Surge Disasters Sun, Hai Wang, Jin Ye, Wentao Int J Environ Res Public Health Article The accurate prediction of storm surge disasters’ direct economic losses plays a positive role in providing critical support for disaster prevention decision-making and management. Previous researches on storm surge disaster loss assessment did not pay much attention to the overfitting phenomenon caused by the data scarcity and the excessive model complexity. To solve these problems, this paper puts forward a new evaluation system for forecasting the regional direct economic loss of storm surge disasters, consisting of three parts. First of all, a comprehensive assessment index system was established by considering the storm surge disasters’ formation mechanism and the corresponding risk management theory. Secondly, a novel data augmentation technique, k-nearest neighbor-Gaussian noise (KNN-GN), was presented to overcome data scarcity. Thirdly, an ensemble learning algorithm XGBoost as a regression model was utilized to optimize the results and produce the final forecasting results. To verify the best-combined model, KNN-GN-based XGBoost, we conducted cross-contrast experiments with several data augmentation techniques and some widely-used ensemble learning models. Meanwhile, the traditional prediction models are used as baselines to the optimized forecasting system. The experimental results show that the KNN-GN-based XGBoost model provides more precise predictions than the traditional models, with a 64.1% average improvement in the mean absolute percentage error (MAPE) measurement. It could be noted that the proposed evaluation system can be extended and applied to the geography-related field as well. MDPI 2021-03-12 /pmc/articles/PMC7999314/ /pubmed/33809216 http://dx.doi.org/10.3390/ijerph18062918 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Sun, Hai
Wang, Jin
Ye, Wentao
A Data Augmentation-Based Evaluation System for Regional Direct Economic Losses of Storm Surge Disasters
title A Data Augmentation-Based Evaluation System for Regional Direct Economic Losses of Storm Surge Disasters
title_full A Data Augmentation-Based Evaluation System for Regional Direct Economic Losses of Storm Surge Disasters
title_fullStr A Data Augmentation-Based Evaluation System for Regional Direct Economic Losses of Storm Surge Disasters
title_full_unstemmed A Data Augmentation-Based Evaluation System for Regional Direct Economic Losses of Storm Surge Disasters
title_short A Data Augmentation-Based Evaluation System for Regional Direct Economic Losses of Storm Surge Disasters
title_sort data augmentation-based evaluation system for regional direct economic losses of storm surge disasters
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7999314/
https://www.ncbi.nlm.nih.gov/pubmed/33809216
http://dx.doi.org/10.3390/ijerph18062918
work_keys_str_mv AT sunhai adataaugmentationbasedevaluationsystemforregionaldirecteconomiclossesofstormsurgedisasters
AT wangjin adataaugmentationbasedevaluationsystemforregionaldirecteconomiclossesofstormsurgedisasters
AT yewentao adataaugmentationbasedevaluationsystemforregionaldirecteconomiclossesofstormsurgedisasters
AT sunhai dataaugmentationbasedevaluationsystemforregionaldirecteconomiclossesofstormsurgedisasters
AT wangjin dataaugmentationbasedevaluationsystemforregionaldirecteconomiclossesofstormsurgedisasters
AT yewentao dataaugmentationbasedevaluationsystemforregionaldirecteconomiclossesofstormsurgedisasters