Cargando…

A rasterized building footprint dataset for the United States

Microsoft released a U.S.-wide vector building dataset in 2018. Although the vector building layers provide relatively accurate geometries, their use in large-extent geospatial analysis comes at a high computational cost. We used High-Performance Computing (HPC) to develop an algorithm that calculat...

Descripción completa

Detalles Bibliográficos
Autores principales: Heris, Mehdi P., Foks, Nathan Leon, Bagstad, Kenneth J., Troy, Austin, Ancona, Zachary H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324622/
https://www.ncbi.nlm.nih.gov/pubmed/32601298
http://dx.doi.org/10.1038/s41597-020-0542-3
_version_ 1783551976029552640
author Heris, Mehdi P.
Foks, Nathan Leon
Bagstad, Kenneth J.
Troy, Austin
Ancona, Zachary H.
author_facet Heris, Mehdi P.
Foks, Nathan Leon
Bagstad, Kenneth J.
Troy, Austin
Ancona, Zachary H.
author_sort Heris, Mehdi P.
collection PubMed
description Microsoft released a U.S.-wide vector building dataset in 2018. Although the vector building layers provide relatively accurate geometries, their use in large-extent geospatial analysis comes at a high computational cost. We used High-Performance Computing (HPC) to develop an algorithm that calculates six summary values for each cell in a raster representation of each U.S. state, excluding Alaska and Hawaii: (1) total footprint coverage, (2) number of unique buildings intersecting each cell, (3) number of building centroids falling inside each cell, and area of the (4) average, (5) smallest, and (6) largest area of buildings that intersect each cell. These values are represented as raster layers with 30 m cell size covering the 48 conterminous states. We also identify errors in the original building dataset. We evaluate precision and recall in the data for three large U.S. urban areas. Precision is high and comparable to results reported by Microsoft while recall is high for buildings with footprints larger than 200 m2 but lower for progressively smaller buildings.
format Online
Article
Text
id pubmed-7324622
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-73246222020-07-06 A rasterized building footprint dataset for the United States Heris, Mehdi P. Foks, Nathan Leon Bagstad, Kenneth J. Troy, Austin Ancona, Zachary H. Sci Data Data Descriptor Microsoft released a U.S.-wide vector building dataset in 2018. Although the vector building layers provide relatively accurate geometries, their use in large-extent geospatial analysis comes at a high computational cost. We used High-Performance Computing (HPC) to develop an algorithm that calculates six summary values for each cell in a raster representation of each U.S. state, excluding Alaska and Hawaii: (1) total footprint coverage, (2) number of unique buildings intersecting each cell, (3) number of building centroids falling inside each cell, and area of the (4) average, (5) smallest, and (6) largest area of buildings that intersect each cell. These values are represented as raster layers with 30 m cell size covering the 48 conterminous states. We also identify errors in the original building dataset. We evaluate precision and recall in the data for three large U.S. urban areas. Precision is high and comparable to results reported by Microsoft while recall is high for buildings with footprints larger than 200 m2 but lower for progressively smaller buildings. Nature Publishing Group UK 2020-06-29 /pmc/articles/PMC7324622/ /pubmed/32601298 http://dx.doi.org/10.1038/s41597-020-0542-3 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.
spellingShingle Data Descriptor
Heris, Mehdi P.
Foks, Nathan Leon
Bagstad, Kenneth J.
Troy, Austin
Ancona, Zachary H.
A rasterized building footprint dataset for the United States
title A rasterized building footprint dataset for the United States
title_full A rasterized building footprint dataset for the United States
title_fullStr A rasterized building footprint dataset for the United States
title_full_unstemmed A rasterized building footprint dataset for the United States
title_short A rasterized building footprint dataset for the United States
title_sort rasterized building footprint dataset for the united states
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324622/
https://www.ncbi.nlm.nih.gov/pubmed/32601298
http://dx.doi.org/10.1038/s41597-020-0542-3
work_keys_str_mv AT herismehdip arasterizedbuildingfootprintdatasetfortheunitedstates
AT foksnathanleon arasterizedbuildingfootprintdatasetfortheunitedstates
AT bagstadkennethj arasterizedbuildingfootprintdatasetfortheunitedstates
AT troyaustin arasterizedbuildingfootprintdatasetfortheunitedstates
AT anconazacharyh arasterizedbuildingfootprintdatasetfortheunitedstates
AT herismehdip rasterizedbuildingfootprintdatasetfortheunitedstates
AT foksnathanleon rasterizedbuildingfootprintdatasetfortheunitedstates
AT bagstadkennethj rasterizedbuildingfootprintdatasetfortheunitedstates
AT troyaustin rasterizedbuildingfootprintdatasetfortheunitedstates
AT anconazacharyh rasterizedbuildingfootprintdatasetfortheunitedstates