Cargando…
Socioeconomic and environmental factors of poverty in China using geographically weighted random forest regression model
Correlations between socioeconomic factors and poverty in regression models do not reflect actual relationships, especially when data exhibit patterns of spatial heterogeneity. Spatial regression models can estimate the relationships between socioeconomic factors and poverty in defined geographical...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8754530/ https://www.ncbi.nlm.nih.gov/pubmed/35022975 http://dx.doi.org/10.1007/s11356-021-17513-3 |
_version_ | 1784632290020687872 |
---|---|
author | Luo, Yaowen Yan, Jianguo McClure, Stephen C. Li, Fei |
author_facet | Luo, Yaowen Yan, Jianguo McClure, Stephen C. Li, Fei |
author_sort | Luo, Yaowen |
collection | PubMed |
description | Correlations between socioeconomic factors and poverty in regression models do not reflect actual relationships, especially when data exhibit patterns of spatial heterogeneity. Spatial regression models can estimate the relationships between socioeconomic factors and poverty in defined geographical areas, explaining the imbalanced distribution of poverty, but the relationships between these factors and poverty are not always linear however, and conventional simple linear local regression models do not accurately capture these nonlinear relationships. To fill this gap, we used a local regression method, geographically weighted random forest regression (GW-RFR), that integrates a spatial weight matrix (SWM) and random forest (RF). The GW-RFR evaluates the spatial variations in the nonlinear relationships between variables. A county-level poverty data set of China was employed to estimate the performance of the GW-RFR against the random forest (RF). In this poverty application, the value of [Formula: see text] was 0.128 higher than that of the RF, the NRMSE value was 1.6% lower than the RF, and the MAE value was 0.295 lower than the RF. These results showed that the relationship between poverty factors and poverty varies with space at the county level in China, and the GW-RFR was suitable for dealing with nonlinear relationships in local regression analysis. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s11356-021-17513-3. |
format | Online Article Text |
id | pubmed-8754530 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-87545302022-01-13 Socioeconomic and environmental factors of poverty in China using geographically weighted random forest regression model Luo, Yaowen Yan, Jianguo McClure, Stephen C. Li, Fei Environ Sci Pollut Res Int Research Article Correlations between socioeconomic factors and poverty in regression models do not reflect actual relationships, especially when data exhibit patterns of spatial heterogeneity. Spatial regression models can estimate the relationships between socioeconomic factors and poverty in defined geographical areas, explaining the imbalanced distribution of poverty, but the relationships between these factors and poverty are not always linear however, and conventional simple linear local regression models do not accurately capture these nonlinear relationships. To fill this gap, we used a local regression method, geographically weighted random forest regression (GW-RFR), that integrates a spatial weight matrix (SWM) and random forest (RF). The GW-RFR evaluates the spatial variations in the nonlinear relationships between variables. A county-level poverty data set of China was employed to estimate the performance of the GW-RFR against the random forest (RF). In this poverty application, the value of [Formula: see text] was 0.128 higher than that of the RF, the NRMSE value was 1.6% lower than the RF, and the MAE value was 0.295 lower than the RF. These results showed that the relationship between poverty factors and poverty varies with space at the county level in China, and the GW-RFR was suitable for dealing with nonlinear relationships in local regression analysis. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s11356-021-17513-3. Springer Berlin Heidelberg 2022-01-13 2022 /pmc/articles/PMC8754530/ /pubmed/35022975 http://dx.doi.org/10.1007/s11356-021-17513-3 Text en © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Research Article Luo, Yaowen Yan, Jianguo McClure, Stephen C. Li, Fei Socioeconomic and environmental factors of poverty in China using geographically weighted random forest regression model |
title | Socioeconomic and environmental factors of poverty in China using geographically weighted random forest regression model |
title_full | Socioeconomic and environmental factors of poverty in China using geographically weighted random forest regression model |
title_fullStr | Socioeconomic and environmental factors of poverty in China using geographically weighted random forest regression model |
title_full_unstemmed | Socioeconomic and environmental factors of poverty in China using geographically weighted random forest regression model |
title_short | Socioeconomic and environmental factors of poverty in China using geographically weighted random forest regression model |
title_sort | socioeconomic and environmental factors of poverty in china using geographically weighted random forest regression model |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8754530/ https://www.ncbi.nlm.nih.gov/pubmed/35022975 http://dx.doi.org/10.1007/s11356-021-17513-3 |
work_keys_str_mv | AT luoyaowen socioeconomicandenvironmentalfactorsofpovertyinchinausinggeographicallyweightedrandomforestregressionmodel AT yanjianguo socioeconomicandenvironmentalfactorsofpovertyinchinausinggeographicallyweightedrandomforestregressionmodel AT mcclurestephenc socioeconomicandenvironmentalfactorsofpovertyinchinausinggeographicallyweightedrandomforestregressionmodel AT lifei socioeconomicandenvironmentalfactorsofpovertyinchinausinggeographicallyweightedrandomforestregressionmodel |