Cargando…

Study on landslide susceptibility mapping with different factor screening methods and random forest models

The number of input factors affects the prediction accuracy of a model. Factor screening plays an important role as the starting point for data input. The aim of this study is to explore the influence of different factor screening methods on the prediction results. Taking the 2014 landslide inventor...

Descripción completa

Detalles Bibliográficos
Autores principales: Gu, Tengfei, Li, Jia, Wang, Mingguo, Duan, Ping, Zhang, Yanke, Cheng, Libo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10569556/
https://www.ncbi.nlm.nih.gov/pubmed/37824559
http://dx.doi.org/10.1371/journal.pone.0292897
_version_ 1785119571232948224
author Gu, Tengfei
Li, Jia
Wang, Mingguo
Duan, Ping
Zhang, Yanke
Cheng, Libo
author_facet Gu, Tengfei
Li, Jia
Wang, Mingguo
Duan, Ping
Zhang, Yanke
Cheng, Libo
author_sort Gu, Tengfei
collection PubMed
description The number of input factors affects the prediction accuracy of a model. Factor screening plays an important role as the starting point for data input. The aim of this study is to explore the influence of different factor screening methods on the prediction results. Taking the 2014 landslide inventory of Jingdong County as an example, a landslide database was constructed based on 136 landslide events and 11 selected factors, which were randomly divided into a training dataset and a test dataset according to a ratio of 7:3. Four factor screening methods, namely, the information gain ratio (IGR), GeoDetector, Pearson correlation coefficient and multicollinearity test (MT), were selected to screen the factors. A random forest (RF) model was then used in combination with each factor set for landslide susceptibility mapping (LSM). Finally, accuracy validation was performed using confusion matrices and ROC curves. The results show that factor screening is beneficial in improving the accuracy of the resulting model compared to the original model. Second, the IGR_RF model had the highest AUC value (0.9334), which was higher than that of the MT_RF model without factor screening (0.9194), and the IGR_RF model predicted the most landslides in the very high susceptibility zone (51.22%), indicating the good prediction performance of the IGR_RF model. Finally, the factor weighting analysis revealed that NDVI, elevation and aspect had the greatest influence on landslides in Jingdong County and that curvature had the least influence on landslides. This study can provide a reference for factor screening in LSM.
format Online
Article
Text
id pubmed-10569556
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-105695562023-10-13 Study on landslide susceptibility mapping with different factor screening methods and random forest models Gu, Tengfei Li, Jia Wang, Mingguo Duan, Ping Zhang, Yanke Cheng, Libo PLoS One Research Article The number of input factors affects the prediction accuracy of a model. Factor screening plays an important role as the starting point for data input. The aim of this study is to explore the influence of different factor screening methods on the prediction results. Taking the 2014 landslide inventory of Jingdong County as an example, a landslide database was constructed based on 136 landslide events and 11 selected factors, which were randomly divided into a training dataset and a test dataset according to a ratio of 7:3. Four factor screening methods, namely, the information gain ratio (IGR), GeoDetector, Pearson correlation coefficient and multicollinearity test (MT), were selected to screen the factors. A random forest (RF) model was then used in combination with each factor set for landslide susceptibility mapping (LSM). Finally, accuracy validation was performed using confusion matrices and ROC curves. The results show that factor screening is beneficial in improving the accuracy of the resulting model compared to the original model. Second, the IGR_RF model had the highest AUC value (0.9334), which was higher than that of the MT_RF model without factor screening (0.9194), and the IGR_RF model predicted the most landslides in the very high susceptibility zone (51.22%), indicating the good prediction performance of the IGR_RF model. Finally, the factor weighting analysis revealed that NDVI, elevation and aspect had the greatest influence on landslides in Jingdong County and that curvature had the least influence on landslides. This study can provide a reference for factor screening in LSM. Public Library of Science 2023-10-12 /pmc/articles/PMC10569556/ /pubmed/37824559 http://dx.doi.org/10.1371/journal.pone.0292897 Text en © 2023 Gu et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Gu, Tengfei
Li, Jia
Wang, Mingguo
Duan, Ping
Zhang, Yanke
Cheng, Libo
Study on landslide susceptibility mapping with different factor screening methods and random forest models
title Study on landslide susceptibility mapping with different factor screening methods and random forest models
title_full Study on landslide susceptibility mapping with different factor screening methods and random forest models
title_fullStr Study on landslide susceptibility mapping with different factor screening methods and random forest models
title_full_unstemmed Study on landslide susceptibility mapping with different factor screening methods and random forest models
title_short Study on landslide susceptibility mapping with different factor screening methods and random forest models
title_sort study on landslide susceptibility mapping with different factor screening methods and random forest models
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10569556/
https://www.ncbi.nlm.nih.gov/pubmed/37824559
http://dx.doi.org/10.1371/journal.pone.0292897
work_keys_str_mv AT gutengfei studyonlandslidesusceptibilitymappingwithdifferentfactorscreeningmethodsandrandomforestmodels
AT lijia studyonlandslidesusceptibilitymappingwithdifferentfactorscreeningmethodsandrandomforestmodels
AT wangmingguo studyonlandslidesusceptibilitymappingwithdifferentfactorscreeningmethodsandrandomforestmodels
AT duanping studyonlandslidesusceptibilitymappingwithdifferentfactorscreeningmethodsandrandomforestmodels
AT zhangyanke studyonlandslidesusceptibilitymappingwithdifferentfactorscreeningmethodsandrandomforestmodels
AT chenglibo studyonlandslidesusceptibilitymappingwithdifferentfactorscreeningmethodsandrandomforestmodels