Cargando…

Improved Transferability of Data‐Driven Damage Models Through Sample Selection Bias Correction

Damage models for natural hazards are used for decision making on reducing and transferring risk. The damage estimates from these models depend on many variables and their complex sometimes nonlinear relationships with the damage. In recent years, data‐driven modeling techniques have been used to ca...

Descripción completa

Detalles Bibliográficos
Autores principales: Wagenaar, Dennis, Hermawan, Tiaravanni, van den Homberg, Marc J. C., Aerts, Jeroen C. J. H., Kreibich, Heidi, de Moel, Hans, Bouwer, Laurens M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7891600/
https://www.ncbi.nlm.nih.gov/pubmed/32830337
http://dx.doi.org/10.1111/risa.13575
_version_ 1783652732657205248
author Wagenaar, Dennis
Hermawan, Tiaravanni
van den Homberg, Marc J. C.
Aerts, Jeroen C. J. H.
Kreibich, Heidi
de Moel, Hans
Bouwer, Laurens M.
author_facet Wagenaar, Dennis
Hermawan, Tiaravanni
van den Homberg, Marc J. C.
Aerts, Jeroen C. J. H.
Kreibich, Heidi
de Moel, Hans
Bouwer, Laurens M.
author_sort Wagenaar, Dennis
collection PubMed
description Damage models for natural hazards are used for decision making on reducing and transferring risk. The damage estimates from these models depend on many variables and their complex sometimes nonlinear relationships with the damage. In recent years, data‐driven modeling techniques have been used to capture those relationships. The available data to build such models are often limited. Therefore, in practice it is usually necessary to transfer models to a different context. In this article, we show that this implies the samples used to build the model are often not fully representative for the situation where they need to be applied on, which leads to a “sample selection bias.” In this article, we enhance data‐driven damage models by applying methods, not previously applied to damage modeling, to correct for this bias before the machine learning (ML) models are trained. We demonstrate this with case studies on flooding in Europe, and typhoon wind damage in the Philippines. Two sample selection bias correction methods from the ML literature are applied and one of these methods is also adjusted to our problem. These three methods are combined with stochastic generation of synthetic damage data. We demonstrate that for both case studies, the sample selection bias correction techniques reduce model errors, especially for the mean bias error this reduction can be larger than 30%. The novel combination with stochastic data generation seems to enhance these techniques. This shows that sample selection bias correction methods are beneficial for damage model transfer.
format Online
Article
Text
id pubmed-7891600
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-78916002021-03-02 Improved Transferability of Data‐Driven Damage Models Through Sample Selection Bias Correction Wagenaar, Dennis Hermawan, Tiaravanni van den Homberg, Marc J. C. Aerts, Jeroen C. J. H. Kreibich, Heidi de Moel, Hans Bouwer, Laurens M. Risk Anal Original Research Articles Damage models for natural hazards are used for decision making on reducing and transferring risk. The damage estimates from these models depend on many variables and their complex sometimes nonlinear relationships with the damage. In recent years, data‐driven modeling techniques have been used to capture those relationships. The available data to build such models are often limited. Therefore, in practice it is usually necessary to transfer models to a different context. In this article, we show that this implies the samples used to build the model are often not fully representative for the situation where they need to be applied on, which leads to a “sample selection bias.” In this article, we enhance data‐driven damage models by applying methods, not previously applied to damage modeling, to correct for this bias before the machine learning (ML) models are trained. We demonstrate this with case studies on flooding in Europe, and typhoon wind damage in the Philippines. Two sample selection bias correction methods from the ML literature are applied and one of these methods is also adjusted to our problem. These three methods are combined with stochastic generation of synthetic damage data. We demonstrate that for both case studies, the sample selection bias correction techniques reduce model errors, especially for the mean bias error this reduction can be larger than 30%. The novel combination with stochastic data generation seems to enhance these techniques. This shows that sample selection bias correction methods are beneficial for damage model transfer. John Wiley and Sons Inc. 2020-08-24 2021-01 /pmc/articles/PMC7891600/ /pubmed/32830337 http://dx.doi.org/10.1111/risa.13575 Text en © 2020 Society for Risk Analysis This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Research Articles
Wagenaar, Dennis
Hermawan, Tiaravanni
van den Homberg, Marc J. C.
Aerts, Jeroen C. J. H.
Kreibich, Heidi
de Moel, Hans
Bouwer, Laurens M.
Improved Transferability of Data‐Driven Damage Models Through Sample Selection Bias Correction
title Improved Transferability of Data‐Driven Damage Models Through Sample Selection Bias Correction
title_full Improved Transferability of Data‐Driven Damage Models Through Sample Selection Bias Correction
title_fullStr Improved Transferability of Data‐Driven Damage Models Through Sample Selection Bias Correction
title_full_unstemmed Improved Transferability of Data‐Driven Damage Models Through Sample Selection Bias Correction
title_short Improved Transferability of Data‐Driven Damage Models Through Sample Selection Bias Correction
title_sort improved transferability of data‐driven damage models through sample selection bias correction
topic Original Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7891600/
https://www.ncbi.nlm.nih.gov/pubmed/32830337
http://dx.doi.org/10.1111/risa.13575
work_keys_str_mv AT wagenaardennis improvedtransferabilityofdatadrivendamagemodelsthroughsampleselectionbiascorrection
AT hermawantiaravanni improvedtransferabilityofdatadrivendamagemodelsthroughsampleselectionbiascorrection
AT vandenhombergmarcjc improvedtransferabilityofdatadrivendamagemodelsthroughsampleselectionbiascorrection
AT aertsjeroencjh improvedtransferabilityofdatadrivendamagemodelsthroughsampleselectionbiascorrection
AT kreibichheidi improvedtransferabilityofdatadrivendamagemodelsthroughsampleselectionbiascorrection
AT demoelhans improvedtransferabilityofdatadrivendamagemodelsthroughsampleselectionbiascorrection
AT bouwerlaurensm improvedtransferabilityofdatadrivendamagemodelsthroughsampleselectionbiascorrection