Cargando…

Population estimation beyond counts—Inferring demographic characteristics

Mapping population distribution at a fine spatial scale is essential for urban studies and planning. Numerous studies, mainly supported by geospatial and statistical methods, have focused primarily on predicting population counts. However, estimating their socio-economic characteristics beyond popul...

Descripción completa

Detalles Bibliográficos
Autores principales: Szarka, Noée, Biljecki, Filip
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982831/
https://www.ncbi.nlm.nih.gov/pubmed/35381028
http://dx.doi.org/10.1371/journal.pone.0266484
_version_ 1784681868127371264
author Szarka, Noée
Biljecki, Filip
author_facet Szarka, Noée
Biljecki, Filip
author_sort Szarka, Noée
collection PubMed
description Mapping population distribution at a fine spatial scale is essential for urban studies and planning. Numerous studies, mainly supported by geospatial and statistical methods, have focused primarily on predicting population counts. However, estimating their socio-economic characteristics beyond population counts, such as average age, income, and gender ratio, remains unattended. We enhance traditional population estimation by predicting not only the number of residents in an area, but also their demographic characteristics: average age and the proportion of seniors. By implementing and comparing different machine learning techniques (Random Forest, Support Vector Machines, and Linear Regression) in administrative areas in Singapore, we investigate the use of point of interest (POI) and real estate data for this purpose. The developed regression model predicts the average age of residents in a neighbourhood with a mean error of about 1.5 years (the range of average resident age across Singaporean districts spans approx. 14 years). The results reveal that age patterns of residents can be predicted using real estate information rather than with amenities, which is in contrast to estimating population counts. Another contribution of our work in population estimation is the use of previously unexploited POI and real estate datasets for it, such as property transactions, year of construction, and flat types (number of rooms). Advancing the domain of population estimation, this study reveals the prospects of a small set of detailed and strong predictors that might have the potential of estimating other demographic characteristics such as income.
format Online
Article
Text
id pubmed-8982831
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-89828312022-04-06 Population estimation beyond counts—Inferring demographic characteristics Szarka, Noée Biljecki, Filip PLoS One Research Article Mapping population distribution at a fine spatial scale is essential for urban studies and planning. Numerous studies, mainly supported by geospatial and statistical methods, have focused primarily on predicting population counts. However, estimating their socio-economic characteristics beyond population counts, such as average age, income, and gender ratio, remains unattended. We enhance traditional population estimation by predicting not only the number of residents in an area, but also their demographic characteristics: average age and the proportion of seniors. By implementing and comparing different machine learning techniques (Random Forest, Support Vector Machines, and Linear Regression) in administrative areas in Singapore, we investigate the use of point of interest (POI) and real estate data for this purpose. The developed regression model predicts the average age of residents in a neighbourhood with a mean error of about 1.5 years (the range of average resident age across Singaporean districts spans approx. 14 years). The results reveal that age patterns of residents can be predicted using real estate information rather than with amenities, which is in contrast to estimating population counts. Another contribution of our work in population estimation is the use of previously unexploited POI and real estate datasets for it, such as property transactions, year of construction, and flat types (number of rooms). Advancing the domain of population estimation, this study reveals the prospects of a small set of detailed and strong predictors that might have the potential of estimating other demographic characteristics such as income. Public Library of Science 2022-04-05 /pmc/articles/PMC8982831/ /pubmed/35381028 http://dx.doi.org/10.1371/journal.pone.0266484 Text en © 2022 Szarka, Biljecki https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Szarka, Noée
Biljecki, Filip
Population estimation beyond counts—Inferring demographic characteristics
title Population estimation beyond counts—Inferring demographic characteristics
title_full Population estimation beyond counts—Inferring demographic characteristics
title_fullStr Population estimation beyond counts—Inferring demographic characteristics
title_full_unstemmed Population estimation beyond counts—Inferring demographic characteristics
title_short Population estimation beyond counts—Inferring demographic characteristics
title_sort population estimation beyond counts—inferring demographic characteristics
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982831/
https://www.ncbi.nlm.nih.gov/pubmed/35381028
http://dx.doi.org/10.1371/journal.pone.0266484
work_keys_str_mv AT szarkanoee populationestimationbeyondcountsinferringdemographiccharacteristics
AT biljeckifilip populationestimationbeyondcountsinferringdemographiccharacteristics