Cargando…
Population estimation beyond counts—Inferring demographic characteristics
Mapping population distribution at a fine spatial scale is essential for urban studies and planning. Numerous studies, mainly supported by geospatial and statistical methods, have focused primarily on predicting population counts. However, estimating their socio-economic characteristics beyond popul...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982831/ https://www.ncbi.nlm.nih.gov/pubmed/35381028 http://dx.doi.org/10.1371/journal.pone.0266484 |
_version_ | 1784681868127371264 |
---|---|
author | Szarka, Noée Biljecki, Filip |
author_facet | Szarka, Noée Biljecki, Filip |
author_sort | Szarka, Noée |
collection | PubMed |
description | Mapping population distribution at a fine spatial scale is essential for urban studies and planning. Numerous studies, mainly supported by geospatial and statistical methods, have focused primarily on predicting population counts. However, estimating their socio-economic characteristics beyond population counts, such as average age, income, and gender ratio, remains unattended. We enhance traditional population estimation by predicting not only the number of residents in an area, but also their demographic characteristics: average age and the proportion of seniors. By implementing and comparing different machine learning techniques (Random Forest, Support Vector Machines, and Linear Regression) in administrative areas in Singapore, we investigate the use of point of interest (POI) and real estate data for this purpose. The developed regression model predicts the average age of residents in a neighbourhood with a mean error of about 1.5 years (the range of average resident age across Singaporean districts spans approx. 14 years). The results reveal that age patterns of residents can be predicted using real estate information rather than with amenities, which is in contrast to estimating population counts. Another contribution of our work in population estimation is the use of previously unexploited POI and real estate datasets for it, such as property transactions, year of construction, and flat types (number of rooms). Advancing the domain of population estimation, this study reveals the prospects of a small set of detailed and strong predictors that might have the potential of estimating other demographic characteristics such as income. |
format | Online Article Text |
id | pubmed-8982831 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-89828312022-04-06 Population estimation beyond counts—Inferring demographic characteristics Szarka, Noée Biljecki, Filip PLoS One Research Article Mapping population distribution at a fine spatial scale is essential for urban studies and planning. Numerous studies, mainly supported by geospatial and statistical methods, have focused primarily on predicting population counts. However, estimating their socio-economic characteristics beyond population counts, such as average age, income, and gender ratio, remains unattended. We enhance traditional population estimation by predicting not only the number of residents in an area, but also their demographic characteristics: average age and the proportion of seniors. By implementing and comparing different machine learning techniques (Random Forest, Support Vector Machines, and Linear Regression) in administrative areas in Singapore, we investigate the use of point of interest (POI) and real estate data for this purpose. The developed regression model predicts the average age of residents in a neighbourhood with a mean error of about 1.5 years (the range of average resident age across Singaporean districts spans approx. 14 years). The results reveal that age patterns of residents can be predicted using real estate information rather than with amenities, which is in contrast to estimating population counts. Another contribution of our work in population estimation is the use of previously unexploited POI and real estate datasets for it, such as property transactions, year of construction, and flat types (number of rooms). Advancing the domain of population estimation, this study reveals the prospects of a small set of detailed and strong predictors that might have the potential of estimating other demographic characteristics such as income. Public Library of Science 2022-04-05 /pmc/articles/PMC8982831/ /pubmed/35381028 http://dx.doi.org/10.1371/journal.pone.0266484 Text en © 2022 Szarka, Biljecki https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Szarka, Noée Biljecki, Filip Population estimation beyond counts—Inferring demographic characteristics |
title | Population estimation beyond counts—Inferring demographic characteristics |
title_full | Population estimation beyond counts—Inferring demographic characteristics |
title_fullStr | Population estimation beyond counts—Inferring demographic characteristics |
title_full_unstemmed | Population estimation beyond counts—Inferring demographic characteristics |
title_short | Population estimation beyond counts—Inferring demographic characteristics |
title_sort | population estimation beyond counts—inferring demographic characteristics |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982831/ https://www.ncbi.nlm.nih.gov/pubmed/35381028 http://dx.doi.org/10.1371/journal.pone.0266484 |
work_keys_str_mv | AT szarkanoee populationestimationbeyondcountsinferringdemographiccharacteristics AT biljeckifilip populationestimationbeyondcountsinferringdemographiccharacteristics |