Cargando…

Socioeconomic and environmental factors of poverty in China using geographically weighted random forest regression model

Correlations between socioeconomic factors and poverty in regression models do not reflect actual relationships, especially when data exhibit patterns of spatial heterogeneity. Spatial regression models can estimate the relationships between socioeconomic factors and poverty in defined geographical...

Descripción completa

Detalles Bibliográficos
Autores principales: Luo, Yaowen, Yan, Jianguo, McClure, Stephen C., Li, Fei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8754530/
https://www.ncbi.nlm.nih.gov/pubmed/35022975
http://dx.doi.org/10.1007/s11356-021-17513-3
_version_ 1784632290020687872
author Luo, Yaowen
Yan, Jianguo
McClure, Stephen C.
Li, Fei
author_facet Luo, Yaowen
Yan, Jianguo
McClure, Stephen C.
Li, Fei
author_sort Luo, Yaowen
collection PubMed
description Correlations between socioeconomic factors and poverty in regression models do not reflect actual relationships, especially when data exhibit patterns of spatial heterogeneity. Spatial regression models can estimate the relationships between socioeconomic factors and poverty in defined geographical areas, explaining the imbalanced distribution of poverty, but the relationships between these factors and poverty are not always linear however, and conventional simple linear local regression models do not accurately capture these nonlinear relationships. To fill this gap, we used a local regression method, geographically weighted random forest regression (GW-RFR), that integrates a spatial weight matrix (SWM) and random forest (RF). The GW-RFR evaluates the spatial variations in the nonlinear relationships between variables. A county-level poverty data set of China was employed to estimate the performance of the GW-RFR against the random forest (RF). In this poverty application, the value of [Formula: see text] was 0.128 higher than that of the RF, the NRMSE value was 1.6% lower than the RF, and the MAE value was 0.295 lower than the RF. These results showed that the relationship between poverty factors and poverty varies with space at the county level in China, and the GW-RFR was suitable for dealing with nonlinear relationships in local regression analysis. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s11356-021-17513-3.
format Online
Article
Text
id pubmed-8754530
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-87545302022-01-13 Socioeconomic and environmental factors of poverty in China using geographically weighted random forest regression model Luo, Yaowen Yan, Jianguo McClure, Stephen C. Li, Fei Environ Sci Pollut Res Int Research Article Correlations between socioeconomic factors and poverty in regression models do not reflect actual relationships, especially when data exhibit patterns of spatial heterogeneity. Spatial regression models can estimate the relationships between socioeconomic factors and poverty in defined geographical areas, explaining the imbalanced distribution of poverty, but the relationships between these factors and poverty are not always linear however, and conventional simple linear local regression models do not accurately capture these nonlinear relationships. To fill this gap, we used a local regression method, geographically weighted random forest regression (GW-RFR), that integrates a spatial weight matrix (SWM) and random forest (RF). The GW-RFR evaluates the spatial variations in the nonlinear relationships between variables. A county-level poverty data set of China was employed to estimate the performance of the GW-RFR against the random forest (RF). In this poverty application, the value of [Formula: see text] was 0.128 higher than that of the RF, the NRMSE value was 1.6% lower than the RF, and the MAE value was 0.295 lower than the RF. These results showed that the relationship between poverty factors and poverty varies with space at the county level in China, and the GW-RFR was suitable for dealing with nonlinear relationships in local regression analysis. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s11356-021-17513-3. Springer Berlin Heidelberg 2022-01-13 2022 /pmc/articles/PMC8754530/ /pubmed/35022975 http://dx.doi.org/10.1007/s11356-021-17513-3 Text en © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Research Article
Luo, Yaowen
Yan, Jianguo
McClure, Stephen C.
Li, Fei
Socioeconomic and environmental factors of poverty in China using geographically weighted random forest regression model
title Socioeconomic and environmental factors of poverty in China using geographically weighted random forest regression model
title_full Socioeconomic and environmental factors of poverty in China using geographically weighted random forest regression model
title_fullStr Socioeconomic and environmental factors of poverty in China using geographically weighted random forest regression model
title_full_unstemmed Socioeconomic and environmental factors of poverty in China using geographically weighted random forest regression model
title_short Socioeconomic and environmental factors of poverty in China using geographically weighted random forest regression model
title_sort socioeconomic and environmental factors of poverty in china using geographically weighted random forest regression model
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8754530/
https://www.ncbi.nlm.nih.gov/pubmed/35022975
http://dx.doi.org/10.1007/s11356-021-17513-3
work_keys_str_mv AT luoyaowen socioeconomicandenvironmentalfactorsofpovertyinchinausinggeographicallyweightedrandomforestregressionmodel
AT yanjianguo socioeconomicandenvironmentalfactorsofpovertyinchinausinggeographicallyweightedrandomforestregressionmodel
AT mcclurestephenc socioeconomicandenvironmentalfactorsofpovertyinchinausinggeographicallyweightedrandomforestregressionmodel
AT lifei socioeconomicandenvironmentalfactorsofpovertyinchinausinggeographicallyweightedrandomforestregressionmodel