Cargando…
Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China
The main goal of this study was to use the synthetic minority oversampling technique (SMOTE) to expand the quantity of landslide samples for machine learning methods (i.e., support vector machine (SVM), logistic regression (LR), artificial neural network (ANN), and random forest (RF)) to produce hig...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6388203/ https://www.ncbi.nlm.nih.gov/pubmed/30696105 http://dx.doi.org/10.3390/ijerph16030368 |
_version_ | 1783397716868464640 |
---|---|
author | Wang, Yumiao Wu, Xueling Chen, Zhangjian Ren, Fu Feng, Luwei Du, Qingyun |
author_facet | Wang, Yumiao Wu, Xueling Chen, Zhangjian Ren, Fu Feng, Luwei Du, Qingyun |
author_sort | Wang, Yumiao |
collection | PubMed |
description | The main goal of this study was to use the synthetic minority oversampling technique (SMOTE) to expand the quantity of landslide samples for machine learning methods (i.e., support vector machine (SVM), logistic regression (LR), artificial neural network (ANN), and random forest (RF)) to produce high-quality landslide susceptibility maps for Lishui City in Zhejiang Province, China. Landslide-related factors were extracted from topographic maps, geological maps, and satellite images. Twelve factors were selected as independent variables using correlation coefficient analysis and the neighborhood rough set (NRS) method. In total, 288 soil landslides were mapped using field surveys, historical records, and satellite images. The landslides were randomly divided into two datasets: 70% of all landslides were selected as the original training dataset and 30% were used for validation. Then, SMOTE was employed to generate datasets with sizes ranging from two to thirty times that of the training dataset to establish and compare the four machine learning methods for landslide susceptibility mapping. In addition, we used slope units to subdivide the terrain to determine the landslide susceptibility. Finally, the landslide susceptibility maps were validated using statistical indexes and the area under the curve (AUC). The results indicated that the performances of the four machine learning methods showed different levels of improvement as the sample sizes increased. The RF model exhibited a more substantial improvement (AUC improved by 24.12%) than did the ANN (18.94%), SVM (17.77%), and LR (3.00%) models. Furthermore, the ANN model achieved the highest predictive ability (AUC = 0.98), followed by the RF (AUC = 0.96), SVM (AUC = 0.94), and LR (AUC = 0.79) models. This approach significantly improves the performance of machine learning techniques for landslide susceptibility mapping, thereby providing a better tool for reducing the impacts of landslide disasters. |
format | Online Article Text |
id | pubmed-6388203 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-63882032019-02-27 Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China Wang, Yumiao Wu, Xueling Chen, Zhangjian Ren, Fu Feng, Luwei Du, Qingyun Int J Environ Res Public Health Article The main goal of this study was to use the synthetic minority oversampling technique (SMOTE) to expand the quantity of landslide samples for machine learning methods (i.e., support vector machine (SVM), logistic regression (LR), artificial neural network (ANN), and random forest (RF)) to produce high-quality landslide susceptibility maps for Lishui City in Zhejiang Province, China. Landslide-related factors were extracted from topographic maps, geological maps, and satellite images. Twelve factors were selected as independent variables using correlation coefficient analysis and the neighborhood rough set (NRS) method. In total, 288 soil landslides were mapped using field surveys, historical records, and satellite images. The landslides were randomly divided into two datasets: 70% of all landslides were selected as the original training dataset and 30% were used for validation. Then, SMOTE was employed to generate datasets with sizes ranging from two to thirty times that of the training dataset to establish and compare the four machine learning methods for landslide susceptibility mapping. In addition, we used slope units to subdivide the terrain to determine the landslide susceptibility. Finally, the landslide susceptibility maps were validated using statistical indexes and the area under the curve (AUC). The results indicated that the performances of the four machine learning methods showed different levels of improvement as the sample sizes increased. The RF model exhibited a more substantial improvement (AUC improved by 24.12%) than did the ANN (18.94%), SVM (17.77%), and LR (3.00%) models. Furthermore, the ANN model achieved the highest predictive ability (AUC = 0.98), followed by the RF (AUC = 0.96), SVM (AUC = 0.94), and LR (AUC = 0.79) models. This approach significantly improves the performance of machine learning techniques for landslide susceptibility mapping, thereby providing a better tool for reducing the impacts of landslide disasters. MDPI 2019-01-28 2019-02 /pmc/articles/PMC6388203/ /pubmed/30696105 http://dx.doi.org/10.3390/ijerph16030368 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, Yumiao Wu, Xueling Chen, Zhangjian Ren, Fu Feng, Luwei Du, Qingyun Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China |
title | Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China |
title_full | Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China |
title_fullStr | Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China |
title_full_unstemmed | Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China |
title_short | Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China |
title_sort | optimizing the predictive ability of machine learning methods for landslide susceptibility mapping using smote for lishui city in zhejiang province, china |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6388203/ https://www.ncbi.nlm.nih.gov/pubmed/30696105 http://dx.doi.org/10.3390/ijerph16030368 |
work_keys_str_mv | AT wangyumiao optimizingthepredictiveabilityofmachinelearningmethodsforlandslidesusceptibilitymappingusingsmoteforlishuicityinzhejiangprovincechina AT wuxueling optimizingthepredictiveabilityofmachinelearningmethodsforlandslidesusceptibilitymappingusingsmoteforlishuicityinzhejiangprovincechina AT chenzhangjian optimizingthepredictiveabilityofmachinelearningmethodsforlandslidesusceptibilitymappingusingsmoteforlishuicityinzhejiangprovincechina AT renfu optimizingthepredictiveabilityofmachinelearningmethodsforlandslidesusceptibilitymappingusingsmoteforlishuicityinzhejiangprovincechina AT fengluwei optimizingthepredictiveabilityofmachinelearningmethodsforlandslidesusceptibilitymappingusingsmoteforlishuicityinzhejiangprovincechina AT duqingyun optimizingthepredictiveabilityofmachinelearningmethodsforlandslidesusceptibilitymappingusingsmoteforlishuicityinzhejiangprovincechina |