Creating a surrogate commuter network from Australian Bureau of Statistics census data
Between the 2011 and 2016 national censuses, the Australian Bureau of Statistics changed its anonymity policy compliance system for the distribution of census data. The new method has resulted in dramatic inconsistencies when comparing low-resolution data to aggregated high-resolution data. Hence, a...
Main authors: | Fair, Kristopher M.; Zachreson, Cameron; Prokopenko, Mikhail |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Nature Publishing Group UK, 2019 |
Subjects: | Data Descriptor |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6697727/ https://www.ncbi.nlm.nih.gov/pubmed/31420560 http://dx.doi.org/10.1038/s41597-019-0137-z |
_version_ | 1783444422148489216 |
---|---|
author | Fair, Kristopher M. Zachreson, Cameron Prokopenko, Mikhail |
author_facet | Fair, Kristopher M. Zachreson, Cameron Prokopenko, Mikhail |
author_sort | Fair, Kristopher M. |
collection | PubMed |
description | Between the 2011 and 2016 national censuses, the Australian Bureau of Statistics changed its anonymity policy compliance system for the distribution of census data. The new method has resulted in dramatic inconsistencies when comparing low-resolution data to aggregated high-resolution data. Hence, aggregated totals do not match true totals, and the mismatch gets worse as the data resolution gets finer. Here, we address several aspects of this inconsistency with respect to the 2016 usual-residence to place-of-work travel data. We introduce a re-sampling system that rectifies many of the artifacts introduced by the new ABS protocol, ensuring a higher level of consistency across partition sizes. We offer a surrogate high-resolution 2016 commuter dataset that reduces the difference between the aggregated and true commuter totals from ~34% to only ~7%, which is on the order of the discrepancy across partition resolutions in data from earlier years. |
format | Online Article Text |
id | pubmed-6697727 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-66977272019-08-26 Creating a surrogate commuter network from Australian Bureau of Statistics census data Fair, Kristopher M. Zachreson, Cameron Prokopenko, Mikhail Sci Data Data Descriptor Between the 2011 and 2016 national censuses, the Australian Bureau of Statistics changed its anonymity policy compliance system for the distribution of census data. The new method has resulted in dramatic inconsistencies when comparing low-resolution data to aggregated high-resolution data. Hence, aggregated totals do not match true totals, and the mismatch gets worse as the data resolution gets finer. Here, we address several aspects of this inconsistency with respect to the 2016 usual-residence to place-of-work travel data. We introduce a re-sampling system that rectifies many of the artifacts introduced by the new ABS protocol, ensuring a higher level of consistency across partition sizes. We offer a surrogate high-resolution 2016 commuter dataset that reduces the difference between the aggregated and true commuter totals from ~34% to only ~7%, which is on the order of the discrepancy across partition resolutions in data from earlier years. Nature Publishing Group UK 2019-08-16 /pmc/articles/PMC6697727/ /pubmed/31420560 http://dx.doi.org/10.1038/s41597-019-0137-z Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article. |
spellingShingle | Data Descriptor Fair, Kristopher M. Zachreson, Cameron Prokopenko, Mikhail Creating a surrogate commuter network from Australian Bureau of Statistics census data |
title | Creating a surrogate commuter network from Australian Bureau of Statistics census data |
title_full | Creating a surrogate commuter network from Australian Bureau of Statistics census data |
title_fullStr | Creating a surrogate commuter network from Australian Bureau of Statistics census data |
title_full_unstemmed | Creating a surrogate commuter network from Australian Bureau of Statistics census data |
title_short | Creating a surrogate commuter network from Australian Bureau of Statistics census data |
title_sort | creating a surrogate commuter network from australian bureau of statistics census data |
topic | Data Descriptor |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6697727/ https://www.ncbi.nlm.nih.gov/pubmed/31420560 http://dx.doi.org/10.1038/s41597-019-0137-z |
work_keys_str_mv | AT fairkristopherm creatingasurrogatecommuternetworkfromaustralianbureauofstatisticscensusdata AT zachresoncameron creatingasurrogatecommuternetworkfromaustralianbureauofstatisticscensusdata AT prokopenkomikhail creatingasurrogatecommuternetworkfromaustralianbureauofstatisticscensusdata |