Cargando…

Predicting residential structures from open source remotely enumerated data using machine learning

Having accurate maps depicting the locations of residential buildings across a region benefits a range of sectors. This is particularly true for public health programs focused on delivering services at the household level, such as indoor residual spraying with insecticide to help prevent malaria. Wh...

Descripción completa

Detalles Bibliográficos
Autores principales:	Sturrock, Hugh J. W., Woolheater, Katelyn, Bennett, Adam F., Andrade-Pacheco, Ricardo, Midekisa, Alemayehu
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2018
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6150517/ https://www.ncbi.nlm.nih.gov/pubmed/30240429 http://dx.doi.org/10.1371/journal.pone.0204399

_version_	1783357002887462912
author	Sturrock, Hugh J. W. Woolheater, Katelyn Bennett, Adam F. Andrade-Pacheco, Ricardo Midekisa, Alemayehu
author_facet	Sturrock, Hugh J. W. Woolheater, Katelyn Bennett, Adam F. Andrade-Pacheco, Ricardo Midekisa, Alemayehu
author_sort	Sturrock, Hugh J. W.
collection	PubMed
description	Having accurate maps depicting the locations of residential buildings across a region benefits a range of sectors. This is particularly true for public health programs focused on delivering services at the household level, such as indoor residual spraying with insecticide to help prevent malaria. While open source data from OpenStreetMap (OSM) depicting the locations and shapes of buildings is rapidly improving in terms of quality and completeness globally, even in settings where all buildings have been mapped, information on whether these buildings are residential, commercial or another type is often only available for a small subset. Using OSM building data from Botswana and Swaziland, we identified buildings for which ‘type’ was indicated, generated via on the ground observations, and classified these into two classes, “sprayable” and “not-sprayable”. Ensemble machine learning, using building characteristics such as size, shape and proximity to neighbouring features, was then used to form a model to predict which of these 2 classes every building in these two countries fell into. Results show that an ensemble machine learning approach performed marginally, but statistically, better than the best individual model and that using this ensemble model we were able to correctly classify >86% (using independent test data) of structures correctly as sprayable and not-sprayable across both countries.
format	Online Article Text
id	pubmed-6150517
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-61505172018-10-08 Predicting residential structures from open source remotely enumerated data using machine learning Sturrock, Hugh J. W. Woolheater, Katelyn Bennett, Adam F. Andrade-Pacheco, Ricardo Midekisa, Alemayehu PLoS One Research Article Having accurate maps depicting the locations of residential buildings across a region benefits a range of sectors. This is particularly true for public health programs focused on delivering services at the household level, such as indoor residual spraying with insecticide to help prevent malaria. While open source data from OpenStreetMap (OSM) depicting the locations and shapes of buildings is rapidly improving in terms of quality and completeness globally, even in settings where all buildings have been mapped, information on whether these buildings are residential, commercial or another type is often only available for a small subset. Using OSM building data from Botswana and Swaziland, we identified buildings for which ‘type’ was indicated, generated via on the ground observations, and classified these into two classes, “sprayable” and “not-sprayable”. Ensemble machine learning, using building characteristics such as size, shape and proximity to neighbouring features, was then used to form a model to predict which of these 2 classes every building in these two countries fell into. Results show that an ensemble machine learning approach performed marginally, but statistically, better than the best individual model and that using this ensemble model we were able to correctly classify >86% (using independent test data) of structures correctly as sprayable and not-sprayable across both countries. Public Library of Science 2018-09-21 /pmc/articles/PMC6150517/ /pubmed/30240429 http://dx.doi.org/10.1371/journal.pone.0204399 Text en © 2018 Sturrock et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Sturrock, Hugh J. W. Woolheater, Katelyn Bennett, Adam F. Andrade-Pacheco, Ricardo Midekisa, Alemayehu Predicting residential structures from open source remotely enumerated data using machine learning
title	Predicting residential structures from open source remotely enumerated data using machine learning
title_full	Predicting residential structures from open source remotely enumerated data using machine learning
title_fullStr	Predicting residential structures from open source remotely enumerated data using machine learning
title_full_unstemmed	Predicting residential structures from open source remotely enumerated data using machine learning
title_short	Predicting residential structures from open source remotely enumerated data using machine learning
title_sort	predicting residential structures from open source remotely enumerated data using machine learning
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6150517/ https://www.ncbi.nlm.nih.gov/pubmed/30240429 http://dx.doi.org/10.1371/journal.pone.0204399
work_keys_str_mv	AT sturrockhughjw predictingresidentialstructuresfromopensourceremotelyenumerateddatausingmachinelearning AT woolheaterkatelyn predictingresidentialstructuresfromopensourceremotelyenumerateddatausingmachinelearning AT bennettadamf predictingresidentialstructuresfromopensourceremotelyenumerateddatausingmachinelearning AT andradepachecoricardo predictingresidentialstructuresfromopensourceremotelyenumerateddatausingmachinelearning AT midekisaalemayehu predictingresidentialstructuresfromopensourceremotelyenumerateddatausingmachinelearning

Predicting residential structures from open source remotely enumerated data using machine learning

Ejemplares similares