Cargando…

Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China

The main goal of this study was to use the synthetic minority oversampling technique (SMOTE) to expand the quantity of landslide samples for machine learning methods (i.e., support vector machine (SVM), logistic regression (LR), artificial neural network (ANN), and random forest (RF)) to produce hig...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yumiao, Wu, Xueling, Chen, Zhangjian, Ren, Fu, Feng, Luwei, Du, Qingyun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6388203/
https://www.ncbi.nlm.nih.gov/pubmed/30696105
http://dx.doi.org/10.3390/ijerph16030368
_version_ 1783397716868464640
author Wang, Yumiao
Wu, Xueling
Chen, Zhangjian
Ren, Fu
Feng, Luwei
Du, Qingyun
author_facet Wang, Yumiao
Wu, Xueling
Chen, Zhangjian
Ren, Fu
Feng, Luwei
Du, Qingyun
author_sort Wang, Yumiao
collection PubMed
description The main goal of this study was to use the synthetic minority oversampling technique (SMOTE) to expand the quantity of landslide samples for machine learning methods (i.e., support vector machine (SVM), logistic regression (LR), artificial neural network (ANN), and random forest (RF)) to produce high-quality landslide susceptibility maps for Lishui City in Zhejiang Province, China. Landslide-related factors were extracted from topographic maps, geological maps, and satellite images. Twelve factors were selected as independent variables using correlation coefficient analysis and the neighborhood rough set (NRS) method. In total, 288 soil landslides were mapped using field surveys, historical records, and satellite images. The landslides were randomly divided into two datasets: 70% of all landslides were selected as the original training dataset and 30% were used for validation. Then, SMOTE was employed to generate datasets with sizes ranging from two to thirty times that of the training dataset to establish and compare the four machine learning methods for landslide susceptibility mapping. In addition, we used slope units to subdivide the terrain to determine the landslide susceptibility. Finally, the landslide susceptibility maps were validated using statistical indexes and the area under the curve (AUC). The results indicated that the performances of the four machine learning methods showed different levels of improvement as the sample sizes increased. The RF model exhibited a more substantial improvement (AUC improved by 24.12%) than did the ANN (18.94%), SVM (17.77%), and LR (3.00%) models. Furthermore, the ANN model achieved the highest predictive ability (AUC = 0.98), followed by the RF (AUC = 0.96), SVM (AUC = 0.94), and LR (AUC = 0.79) models. This approach significantly improves the performance of machine learning techniques for landslide susceptibility mapping, thereby providing a better tool for reducing the impacts of landslide disasters.
format Online
Article
Text
id pubmed-6388203
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-63882032019-02-27 Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China Wang, Yumiao Wu, Xueling Chen, Zhangjian Ren, Fu Feng, Luwei Du, Qingyun Int J Environ Res Public Health Article The main goal of this study was to use the synthetic minority oversampling technique (SMOTE) to expand the quantity of landslide samples for machine learning methods (i.e., support vector machine (SVM), logistic regression (LR), artificial neural network (ANN), and random forest (RF)) to produce high-quality landslide susceptibility maps for Lishui City in Zhejiang Province, China. Landslide-related factors were extracted from topographic maps, geological maps, and satellite images. Twelve factors were selected as independent variables using correlation coefficient analysis and the neighborhood rough set (NRS) method. In total, 288 soil landslides were mapped using field surveys, historical records, and satellite images. The landslides were randomly divided into two datasets: 70% of all landslides were selected as the original training dataset and 30% were used for validation. Then, SMOTE was employed to generate datasets with sizes ranging from two to thirty times that of the training dataset to establish and compare the four machine learning methods for landslide susceptibility mapping. In addition, we used slope units to subdivide the terrain to determine the landslide susceptibility. Finally, the landslide susceptibility maps were validated using statistical indexes and the area under the curve (AUC). The results indicated that the performances of the four machine learning methods showed different levels of improvement as the sample sizes increased. The RF model exhibited a more substantial improvement (AUC improved by 24.12%) than did the ANN (18.94%), SVM (17.77%), and LR (3.00%) models. Furthermore, the ANN model achieved the highest predictive ability (AUC = 0.98), followed by the RF (AUC = 0.96), SVM (AUC = 0.94), and LR (AUC = 0.79) models. This approach significantly improves the performance of machine learning techniques for landslide susceptibility mapping, thereby providing a better tool for reducing the impacts of landslide disasters. MDPI 2019-01-28 2019-02 /pmc/articles/PMC6388203/ /pubmed/30696105 http://dx.doi.org/10.3390/ijerph16030368 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Yumiao
Wu, Xueling
Chen, Zhangjian
Ren, Fu
Feng, Luwei
Du, Qingyun
Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China
title Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China
title_full Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China
title_fullStr Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China
title_full_unstemmed Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China
title_short Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China
title_sort optimizing the predictive ability of machine learning methods for landslide susceptibility mapping using smote for lishui city in zhejiang province, china
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6388203/
https://www.ncbi.nlm.nih.gov/pubmed/30696105
http://dx.doi.org/10.3390/ijerph16030368
work_keys_str_mv AT wangyumiao optimizingthepredictiveabilityofmachinelearningmethodsforlandslidesusceptibilitymappingusingsmoteforlishuicityinzhejiangprovincechina
AT wuxueling optimizingthepredictiveabilityofmachinelearningmethodsforlandslidesusceptibilitymappingusingsmoteforlishuicityinzhejiangprovincechina
AT chenzhangjian optimizingthepredictiveabilityofmachinelearningmethodsforlandslidesusceptibilitymappingusingsmoteforlishuicityinzhejiangprovincechina
AT renfu optimizingthepredictiveabilityofmachinelearningmethodsforlandslidesusceptibilitymappingusingsmoteforlishuicityinzhejiangprovincechina
AT fengluwei optimizingthepredictiveabilityofmachinelearningmethodsforlandslidesusceptibilitymappingusingsmoteforlishuicityinzhejiangprovincechina
AT duqingyun optimizingthepredictiveabilityofmachinelearningmethodsforlandslidesusceptibilitymappingusingsmoteforlishuicityinzhejiangprovincechina