Cargando…

Machine Learning Methods and Synthetic Data Generation to Predict Large Wildfires

Wildfires are becoming more frequent in different parts of the globe, and the ability to predict when and where they will occur is a complex process. Identifying wildfire events with high probability of becoming a large wildfire is an important task for supporting initial attack planning. Different...

Descripción completa

Detalles Bibliográficos
Autores principales: Pérez-Porras, Fernando-Juan, Triviño-Tarradas, Paula, Cima-Rodríguez, Carmen, Meroño-de-Larriva, Jose-Emilio, García-Ferrer, Alfonso, Mesas-Carrascosa, Francisco-Javier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8198242/
https://www.ncbi.nlm.nih.gov/pubmed/34073312
http://dx.doi.org/10.3390/s21113694
_version_ 1783707091326730240
author Pérez-Porras, Fernando-Juan
Triviño-Tarradas, Paula
Cima-Rodríguez, Carmen
Meroño-de-Larriva, Jose-Emilio
García-Ferrer, Alfonso
Mesas-Carrascosa, Francisco-Javier
author_facet Pérez-Porras, Fernando-Juan
Triviño-Tarradas, Paula
Cima-Rodríguez, Carmen
Meroño-de-Larriva, Jose-Emilio
García-Ferrer, Alfonso
Mesas-Carrascosa, Francisco-Javier
author_sort Pérez-Porras, Fernando-Juan
collection PubMed
description Wildfires are becoming more frequent in different parts of the globe, and the ability to predict when and where they will occur is a complex process. Identifying wildfire events with high probability of becoming a large wildfire is an important task for supporting initial attack planning. Different methods, including those that are physics-based, statistical, and based on machine learning (ML) are used in wildfire analysis. Among the whole, those based on machine learning are relatively novel. In addition, because the number of wildfires is much greater than the number of large wildfires, the dataset to be used in a ML model is imbalanced, resulting in overfitting or underfitting the results. In this manuscript, we propose to generate synthetic data from variables of interest together with ML models for the prediction of large wildfires. Specifically, five synthetic data generation methods have been evaluated, and their results are analyzed with four ML methods. The results yield an improvement in the prediction power when synthetic data are used, offering a new method to be taken into account in Decision Support Systems (DSS) when managing wildfires.
format Online
Article
Text
id pubmed-8198242
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-81982422021-06-14 Machine Learning Methods and Synthetic Data Generation to Predict Large Wildfires Pérez-Porras, Fernando-Juan Triviño-Tarradas, Paula Cima-Rodríguez, Carmen Meroño-de-Larriva, Jose-Emilio García-Ferrer, Alfonso Mesas-Carrascosa, Francisco-Javier Sensors (Basel) Article Wildfires are becoming more frequent in different parts of the globe, and the ability to predict when and where they will occur is a complex process. Identifying wildfire events with high probability of becoming a large wildfire is an important task for supporting initial attack planning. Different methods, including those that are physics-based, statistical, and based on machine learning (ML) are used in wildfire analysis. Among the whole, those based on machine learning are relatively novel. In addition, because the number of wildfires is much greater than the number of large wildfires, the dataset to be used in a ML model is imbalanced, resulting in overfitting or underfitting the results. In this manuscript, we propose to generate synthetic data from variables of interest together with ML models for the prediction of large wildfires. Specifically, five synthetic data generation methods have been evaluated, and their results are analyzed with four ML methods. The results yield an improvement in the prediction power when synthetic data are used, offering a new method to be taken into account in Decision Support Systems (DSS) when managing wildfires. MDPI 2021-05-26 /pmc/articles/PMC8198242/ /pubmed/34073312 http://dx.doi.org/10.3390/s21113694 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Pérez-Porras, Fernando-Juan
Triviño-Tarradas, Paula
Cima-Rodríguez, Carmen
Meroño-de-Larriva, Jose-Emilio
García-Ferrer, Alfonso
Mesas-Carrascosa, Francisco-Javier
Machine Learning Methods and Synthetic Data Generation to Predict Large Wildfires
title Machine Learning Methods and Synthetic Data Generation to Predict Large Wildfires
title_full Machine Learning Methods and Synthetic Data Generation to Predict Large Wildfires
title_fullStr Machine Learning Methods and Synthetic Data Generation to Predict Large Wildfires
title_full_unstemmed Machine Learning Methods and Synthetic Data Generation to Predict Large Wildfires
title_short Machine Learning Methods and Synthetic Data Generation to Predict Large Wildfires
title_sort machine learning methods and synthetic data generation to predict large wildfires
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8198242/
https://www.ncbi.nlm.nih.gov/pubmed/34073312
http://dx.doi.org/10.3390/s21113694
work_keys_str_mv AT perezporrasfernandojuan machinelearningmethodsandsyntheticdatagenerationtopredictlargewildfires
AT trivinotarradaspaula machinelearningmethodsandsyntheticdatagenerationtopredictlargewildfires
AT cimarodriguezcarmen machinelearningmethodsandsyntheticdatagenerationtopredictlargewildfires
AT meronodelarrivajoseemilio machinelearningmethodsandsyntheticdatagenerationtopredictlargewildfires
AT garciaferreralfonso machinelearningmethodsandsyntheticdatagenerationtopredictlargewildfires
AT mesascarrascosafranciscojavier machinelearningmethodsandsyntheticdatagenerationtopredictlargewildfires