Cargando…
Study on landslide susceptibility mapping with different factor screening methods and random forest models
The number of input factors affects the prediction accuracy of a model. Factor screening plays an important role as the starting point for data input. The aim of this study is to explore the influence of different factor screening methods on the prediction results. Taking the 2014 landslide inventor...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10569556/ https://www.ncbi.nlm.nih.gov/pubmed/37824559 http://dx.doi.org/10.1371/journal.pone.0292897 |
_version_ | 1785119571232948224 |
---|---|
author | Gu, Tengfei Li, Jia Wang, Mingguo Duan, Ping Zhang, Yanke Cheng, Libo |
author_facet | Gu, Tengfei Li, Jia Wang, Mingguo Duan, Ping Zhang, Yanke Cheng, Libo |
author_sort | Gu, Tengfei |
collection | PubMed |
description | The number of input factors affects the prediction accuracy of a model. Factor screening plays an important role as the starting point for data input. The aim of this study is to explore the influence of different factor screening methods on the prediction results. Taking the 2014 landslide inventory of Jingdong County as an example, a landslide database was constructed based on 136 landslide events and 11 selected factors, which were randomly divided into a training dataset and a test dataset according to a ratio of 7:3. Four factor screening methods, namely, the information gain ratio (IGR), GeoDetector, Pearson correlation coefficient and multicollinearity test (MT), were selected to screen the factors. A random forest (RF) model was then used in combination with each factor set for landslide susceptibility mapping (LSM). Finally, accuracy validation was performed using confusion matrices and ROC curves. The results show that factor screening is beneficial in improving the accuracy of the resulting model compared to the original model. Second, the IGR_RF model had the highest AUC value (0.9334), which was higher than that of the MT_RF model without factor screening (0.9194), and the IGR_RF model predicted the most landslides in the very high susceptibility zone (51.22%), indicating the good prediction performance of the IGR_RF model. Finally, the factor weighting analysis revealed that NDVI, elevation and aspect had the greatest influence on landslides in Jingdong County and that curvature had the least influence on landslides. This study can provide a reference for factor screening in LSM. |
format | Online Article Text |
id | pubmed-10569556 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-105695562023-10-13 Study on landslide susceptibility mapping with different factor screening methods and random forest models Gu, Tengfei Li, Jia Wang, Mingguo Duan, Ping Zhang, Yanke Cheng, Libo PLoS One Research Article The number of input factors affects the prediction accuracy of a model. Factor screening plays an important role as the starting point for data input. The aim of this study is to explore the influence of different factor screening methods on the prediction results. Taking the 2014 landslide inventory of Jingdong County as an example, a landslide database was constructed based on 136 landslide events and 11 selected factors, which were randomly divided into a training dataset and a test dataset according to a ratio of 7:3. Four factor screening methods, namely, the information gain ratio (IGR), GeoDetector, Pearson correlation coefficient and multicollinearity test (MT), were selected to screen the factors. A random forest (RF) model was then used in combination with each factor set for landslide susceptibility mapping (LSM). Finally, accuracy validation was performed using confusion matrices and ROC curves. The results show that factor screening is beneficial in improving the accuracy of the resulting model compared to the original model. Second, the IGR_RF model had the highest AUC value (0.9334), which was higher than that of the MT_RF model without factor screening (0.9194), and the IGR_RF model predicted the most landslides in the very high susceptibility zone (51.22%), indicating the good prediction performance of the IGR_RF model. Finally, the factor weighting analysis revealed that NDVI, elevation and aspect had the greatest influence on landslides in Jingdong County and that curvature had the least influence on landslides. This study can provide a reference for factor screening in LSM. Public Library of Science 2023-10-12 /pmc/articles/PMC10569556/ /pubmed/37824559 http://dx.doi.org/10.1371/journal.pone.0292897 Text en © 2023 Gu et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Gu, Tengfei Li, Jia Wang, Mingguo Duan, Ping Zhang, Yanke Cheng, Libo Study on landslide susceptibility mapping with different factor screening methods and random forest models |
title | Study on landslide susceptibility mapping with different factor screening methods and random forest models |
title_full | Study on landslide susceptibility mapping with different factor screening methods and random forest models |
title_fullStr | Study on landslide susceptibility mapping with different factor screening methods and random forest models |
title_full_unstemmed | Study on landslide susceptibility mapping with different factor screening methods and random forest models |
title_short | Study on landslide susceptibility mapping with different factor screening methods and random forest models |
title_sort | study on landslide susceptibility mapping with different factor screening methods and random forest models |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10569556/ https://www.ncbi.nlm.nih.gov/pubmed/37824559 http://dx.doi.org/10.1371/journal.pone.0292897 |
work_keys_str_mv | AT gutengfei studyonlandslidesusceptibilitymappingwithdifferentfactorscreeningmethodsandrandomforestmodels AT lijia studyonlandslidesusceptibilitymappingwithdifferentfactorscreeningmethodsandrandomforestmodels AT wangmingguo studyonlandslidesusceptibilitymappingwithdifferentfactorscreeningmethodsandrandomforestmodels AT duanping studyonlandslidesusceptibilitymappingwithdifferentfactorscreeningmethodsandrandomforestmodels AT zhangyanke studyonlandslidesusceptibilitymappingwithdifferentfactorscreeningmethodsandrandomforestmodels AT chenglibo studyonlandslidesusceptibilitymappingwithdifferentfactorscreeningmethodsandrandomforestmodels |