Cargando…

Using deep learning and Google Street View to estimate the demographic makeup of neighborhoods across the United States

The United States spends more than $250 million each year on the American Community Survey (ACS), a labor-intensive door-to-door study that measures statistics relating to race, gender, education, occupation, unemployment, and other demographic factors. Although a comprehensive source of data, the l...

Descripción completa

Detalles Bibliográficos
Autores principales: Gebru, Timnit, Krause, Jonathan, Wang, Yilun, Chen, Duyun, Deng, Jia, Aiden, Erez Lieberman, Fei-Fei, Li
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5740675/
https://www.ncbi.nlm.nih.gov/pubmed/29183967
http://dx.doi.org/10.1073/pnas.1700035114
_version_ 1783288067531997184
author Gebru, Timnit
Krause, Jonathan
Wang, Yilun
Chen, Duyun
Deng, Jia
Aiden, Erez Lieberman
Fei-Fei, Li
author_facet Gebru, Timnit
Krause, Jonathan
Wang, Yilun
Chen, Duyun
Deng, Jia
Aiden, Erez Lieberman
Fei-Fei, Li
author_sort Gebru, Timnit
collection PubMed
description The United States spends more than $250 million each year on the American Community Survey (ACS), a labor-intensive door-to-door study that measures statistics relating to race, gender, education, occupation, unemployment, and other demographic factors. Although a comprehensive source of data, the lag between demographic changes and their appearance in the ACS can exceed several years. As digital imagery becomes ubiquitous and machine vision techniques improve, automated data analysis may become an increasingly practical supplement to the ACS. Here, we present a method that estimates socioeconomic characteristics of regions spanning 200 US cities by using 50 million images of street scenes gathered with Google Street View cars. Using deep learning-based computer vision techniques, we determined the make, model, and year of all motor vehicles encountered in particular neighborhoods. Data from this census of motor vehicles, which enumerated 22 million automobiles in total (8% of all automobiles in the United States), were used to accurately estimate income, race, education, and voting patterns at the zip code and precinct level. (The average US precinct contains ∼1,000 people.) The resulting associations are surprisingly simple and powerful. For instance, if the number of sedans encountered during a drive through a city is higher than the number of pickup trucks, the city is likely to vote for a Democrat during the next presidential election (88% chance); otherwise, it is likely to vote Republican (82%). Our results suggest that automated systems for monitoring demographics may effectively complement labor-intensive approaches, with the potential to measure demographics with fine spatial resolution, in close to real time.
format Online
Article
Text
id pubmed-5740675
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-57406752018-01-22 Using deep learning and Google Street View to estimate the demographic makeup of neighborhoods across the United States Gebru, Timnit Krause, Jonathan Wang, Yilun Chen, Duyun Deng, Jia Aiden, Erez Lieberman Fei-Fei, Li Proc Natl Acad Sci U S A Physical Sciences The United States spends more than $250 million each year on the American Community Survey (ACS), a labor-intensive door-to-door study that measures statistics relating to race, gender, education, occupation, unemployment, and other demographic factors. Although a comprehensive source of data, the lag between demographic changes and their appearance in the ACS can exceed several years. As digital imagery becomes ubiquitous and machine vision techniques improve, automated data analysis may become an increasingly practical supplement to the ACS. Here, we present a method that estimates socioeconomic characteristics of regions spanning 200 US cities by using 50 million images of street scenes gathered with Google Street View cars. Using deep learning-based computer vision techniques, we determined the make, model, and year of all motor vehicles encountered in particular neighborhoods. Data from this census of motor vehicles, which enumerated 22 million automobiles in total (8% of all automobiles in the United States), were used to accurately estimate income, race, education, and voting patterns at the zip code and precinct level. (The average US precinct contains ∼1,000 people.) The resulting associations are surprisingly simple and powerful. For instance, if the number of sedans encountered during a drive through a city is higher than the number of pickup trucks, the city is likely to vote for a Democrat during the next presidential election (88% chance); otherwise, it is likely to vote Republican (82%). Our results suggest that automated systems for monitoring demographics may effectively complement labor-intensive approaches, with the potential to measure demographics with fine spatial resolution, in close to real time. National Academy of Sciences 2017-12-12 2017-11-28 /pmc/articles/PMC5740675/ /pubmed/29183967 http://dx.doi.org/10.1073/pnas.1700035114 Text en Copyright © 2017 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/ This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Physical Sciences
Gebru, Timnit
Krause, Jonathan
Wang, Yilun
Chen, Duyun
Deng, Jia
Aiden, Erez Lieberman
Fei-Fei, Li
Using deep learning and Google Street View to estimate the demographic makeup of neighborhoods across the United States
title Using deep learning and Google Street View to estimate the demographic makeup of neighborhoods across the United States
title_full Using deep learning and Google Street View to estimate the demographic makeup of neighborhoods across the United States
title_fullStr Using deep learning and Google Street View to estimate the demographic makeup of neighborhoods across the United States
title_full_unstemmed Using deep learning and Google Street View to estimate the demographic makeup of neighborhoods across the United States
title_short Using deep learning and Google Street View to estimate the demographic makeup of neighborhoods across the United States
title_sort using deep learning and google street view to estimate the demographic makeup of neighborhoods across the united states
topic Physical Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5740675/
https://www.ncbi.nlm.nih.gov/pubmed/29183967
http://dx.doi.org/10.1073/pnas.1700035114
work_keys_str_mv AT gebrutimnit usingdeeplearningandgooglestreetviewtoestimatethedemographicmakeupofneighborhoodsacrosstheunitedstates
AT krausejonathan usingdeeplearningandgooglestreetviewtoestimatethedemographicmakeupofneighborhoodsacrosstheunitedstates
AT wangyilun usingdeeplearningandgooglestreetviewtoestimatethedemographicmakeupofneighborhoodsacrosstheunitedstates
AT chenduyun usingdeeplearningandgooglestreetviewtoestimatethedemographicmakeupofneighborhoodsacrosstheunitedstates
AT dengjia usingdeeplearningandgooglestreetviewtoestimatethedemographicmakeupofneighborhoodsacrosstheunitedstates
AT aidenerezlieberman usingdeeplearningandgooglestreetviewtoestimatethedemographicmakeupofneighborhoodsacrosstheunitedstates
AT feifeili usingdeeplearningandgooglestreetviewtoestimatethedemographicmakeupofneighborhoodsacrosstheunitedstates