Cargando…

Remote sensing-based measurement of Living Environment Deprivation: Improving classical approaches with machine learning

This paper provides evidence on the usefulness of very high spatial resolution (VHR) imagery in gathering socioeconomic information in urban settlements. We use land cover, spectral, structure and texture features extracted from a Google Earth image of Liverpool (UK) to evaluate their potential to p...

Descripción completa

Detalles Bibliográficos
Autores principales: Arribas-Bel, Daniel, Patino, Jorge E., Duque, Juan C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5413026/
https://www.ncbi.nlm.nih.gov/pubmed/28464010
http://dx.doi.org/10.1371/journal.pone.0176684
_version_ 1783233121217413120
author Arribas-Bel, Daniel
Patino, Jorge E.
Duque, Juan C.
author_facet Arribas-Bel, Daniel
Patino, Jorge E.
Duque, Juan C.
author_sort Arribas-Bel, Daniel
collection PubMed
description This paper provides evidence on the usefulness of very high spatial resolution (VHR) imagery in gathering socioeconomic information in urban settlements. We use land cover, spectral, structure and texture features extracted from a Google Earth image of Liverpool (UK) to evaluate their potential to predict Living Environment Deprivation at a small statistical area level. We also contribute to the methodological literature on the estimation of socioeconomic indices with remote-sensing data by introducing elements from modern machine learning. In addition to classical approaches such as Ordinary Least Squares (OLS) regression and a spatial lag model, we explore the potential of the Gradient Boost Regressor and Random Forests to improve predictive performance and accuracy. In addition to novel predicting methods, we also introduce tools for model interpretation and evaluation such as feature importance and partial dependence plots, or cross-validation. Our results show that Random Forest proved to be the best model with an R(2) of around 0.54, followed by Gradient Boost Regressor with 0.5. Both the spatial lag model and the OLS fall behind with significantly lower performances of 0.43 and 0.3, respectively.
format Online
Article
Text
id pubmed-5413026
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-54130262017-05-14 Remote sensing-based measurement of Living Environment Deprivation: Improving classical approaches with machine learning Arribas-Bel, Daniel Patino, Jorge E. Duque, Juan C. PLoS One Research Article This paper provides evidence on the usefulness of very high spatial resolution (VHR) imagery in gathering socioeconomic information in urban settlements. We use land cover, spectral, structure and texture features extracted from a Google Earth image of Liverpool (UK) to evaluate their potential to predict Living Environment Deprivation at a small statistical area level. We also contribute to the methodological literature on the estimation of socioeconomic indices with remote-sensing data by introducing elements from modern machine learning. In addition to classical approaches such as Ordinary Least Squares (OLS) regression and a spatial lag model, we explore the potential of the Gradient Boost Regressor and Random Forests to improve predictive performance and accuracy. In addition to novel predicting methods, we also introduce tools for model interpretation and evaluation such as feature importance and partial dependence plots, or cross-validation. Our results show that Random Forest proved to be the best model with an R(2) of around 0.54, followed by Gradient Boost Regressor with 0.5. Both the spatial lag model and the OLS fall behind with significantly lower performances of 0.43 and 0.3, respectively. Public Library of Science 2017-05-02 /pmc/articles/PMC5413026/ /pubmed/28464010 http://dx.doi.org/10.1371/journal.pone.0176684 Text en © 2017 Arribas-Bel et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Arribas-Bel, Daniel
Patino, Jorge E.
Duque, Juan C.
Remote sensing-based measurement of Living Environment Deprivation: Improving classical approaches with machine learning
title Remote sensing-based measurement of Living Environment Deprivation: Improving classical approaches with machine learning
title_full Remote sensing-based measurement of Living Environment Deprivation: Improving classical approaches with machine learning
title_fullStr Remote sensing-based measurement of Living Environment Deprivation: Improving classical approaches with machine learning
title_full_unstemmed Remote sensing-based measurement of Living Environment Deprivation: Improving classical approaches with machine learning
title_short Remote sensing-based measurement of Living Environment Deprivation: Improving classical approaches with machine learning
title_sort remote sensing-based measurement of living environment deprivation: improving classical approaches with machine learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5413026/
https://www.ncbi.nlm.nih.gov/pubmed/28464010
http://dx.doi.org/10.1371/journal.pone.0176684
work_keys_str_mv AT arribasbeldaniel remotesensingbasedmeasurementoflivingenvironmentdeprivationimprovingclassicalapproacheswithmachinelearning
AT patinojorgee remotesensingbasedmeasurementoflivingenvironmentdeprivationimprovingclassicalapproacheswithmachinelearning
AT duquejuanc remotesensingbasedmeasurementoflivingenvironmentdeprivationimprovingclassicalapproacheswithmachinelearning