Cargando…
Machine Learning Methods and Synthetic Data Generation to Predict Large Wildfires
Wildfires are becoming more frequent in different parts of the globe, and the ability to predict when and where they will occur is a complex process. Identifying wildfire events with high probability of becoming a large wildfire is an important task for supporting initial attack planning. Different...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8198242/ https://www.ncbi.nlm.nih.gov/pubmed/34073312 http://dx.doi.org/10.3390/s21113694 |
_version_ | 1783707091326730240 |
---|---|
author | Pérez-Porras, Fernando-Juan Triviño-Tarradas, Paula Cima-Rodríguez, Carmen Meroño-de-Larriva, Jose-Emilio García-Ferrer, Alfonso Mesas-Carrascosa, Francisco-Javier |
author_facet | Pérez-Porras, Fernando-Juan Triviño-Tarradas, Paula Cima-Rodríguez, Carmen Meroño-de-Larriva, Jose-Emilio García-Ferrer, Alfonso Mesas-Carrascosa, Francisco-Javier |
author_sort | Pérez-Porras, Fernando-Juan |
collection | PubMed |
description | Wildfires are becoming more frequent in different parts of the globe, and the ability to predict when and where they will occur is a complex process. Identifying wildfire events with high probability of becoming a large wildfire is an important task for supporting initial attack planning. Different methods, including those that are physics-based, statistical, and based on machine learning (ML) are used in wildfire analysis. Among the whole, those based on machine learning are relatively novel. In addition, because the number of wildfires is much greater than the number of large wildfires, the dataset to be used in a ML model is imbalanced, resulting in overfitting or underfitting the results. In this manuscript, we propose to generate synthetic data from variables of interest together with ML models for the prediction of large wildfires. Specifically, five synthetic data generation methods have been evaluated, and their results are analyzed with four ML methods. The results yield an improvement in the prediction power when synthetic data are used, offering a new method to be taken into account in Decision Support Systems (DSS) when managing wildfires. |
format | Online Article Text |
id | pubmed-8198242 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-81982422021-06-14 Machine Learning Methods and Synthetic Data Generation to Predict Large Wildfires Pérez-Porras, Fernando-Juan Triviño-Tarradas, Paula Cima-Rodríguez, Carmen Meroño-de-Larriva, Jose-Emilio García-Ferrer, Alfonso Mesas-Carrascosa, Francisco-Javier Sensors (Basel) Article Wildfires are becoming more frequent in different parts of the globe, and the ability to predict when and where they will occur is a complex process. Identifying wildfire events with high probability of becoming a large wildfire is an important task for supporting initial attack planning. Different methods, including those that are physics-based, statistical, and based on machine learning (ML) are used in wildfire analysis. Among the whole, those based on machine learning are relatively novel. In addition, because the number of wildfires is much greater than the number of large wildfires, the dataset to be used in a ML model is imbalanced, resulting in overfitting or underfitting the results. In this manuscript, we propose to generate synthetic data from variables of interest together with ML models for the prediction of large wildfires. Specifically, five synthetic data generation methods have been evaluated, and their results are analyzed with four ML methods. The results yield an improvement in the prediction power when synthetic data are used, offering a new method to be taken into account in Decision Support Systems (DSS) when managing wildfires. MDPI 2021-05-26 /pmc/articles/PMC8198242/ /pubmed/34073312 http://dx.doi.org/10.3390/s21113694 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Pérez-Porras, Fernando-Juan Triviño-Tarradas, Paula Cima-Rodríguez, Carmen Meroño-de-Larriva, Jose-Emilio García-Ferrer, Alfonso Mesas-Carrascosa, Francisco-Javier Machine Learning Methods and Synthetic Data Generation to Predict Large Wildfires |
title | Machine Learning Methods and Synthetic Data Generation to Predict Large Wildfires |
title_full | Machine Learning Methods and Synthetic Data Generation to Predict Large Wildfires |
title_fullStr | Machine Learning Methods and Synthetic Data Generation to Predict Large Wildfires |
title_full_unstemmed | Machine Learning Methods and Synthetic Data Generation to Predict Large Wildfires |
title_short | Machine Learning Methods and Synthetic Data Generation to Predict Large Wildfires |
title_sort | machine learning methods and synthetic data generation to predict large wildfires |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8198242/ https://www.ncbi.nlm.nih.gov/pubmed/34073312 http://dx.doi.org/10.3390/s21113694 |
work_keys_str_mv | AT perezporrasfernandojuan machinelearningmethodsandsyntheticdatagenerationtopredictlargewildfires AT trivinotarradaspaula machinelearningmethodsandsyntheticdatagenerationtopredictlargewildfires AT cimarodriguezcarmen machinelearningmethodsandsyntheticdatagenerationtopredictlargewildfires AT meronodelarrivajoseemilio machinelearningmethodsandsyntheticdatagenerationtopredictlargewildfires AT garciaferreralfonso machinelearningmethodsandsyntheticdatagenerationtopredictlargewildfires AT mesascarrascosafranciscojavier machinelearningmethodsandsyntheticdatagenerationtopredictlargewildfires |