Cargando…

Who Tweets? Deriving the Demographic Characteristics of Age, Occupation and Social Class from Twitter User Meta-Data

This paper specifies, designs and critically evaluates two tools for the automated identification of demographic data (age, occupation and social class) from the profile descriptions of Twitter users in the United Kingdom (UK). Meta-data data routinely collected through the Collaborative Social Medi...

Descripción completa

Detalles Bibliográficos
Autores principales: Sloan, Luke, Morgan, Jeffrey, Burnap, Pete, Williams, Matthew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4346393/
https://www.ncbi.nlm.nih.gov/pubmed/25729900
http://dx.doi.org/10.1371/journal.pone.0115545
_version_ 1782359704676073472
author Sloan, Luke
Morgan, Jeffrey
Burnap, Pete
Williams, Matthew
author_facet Sloan, Luke
Morgan, Jeffrey
Burnap, Pete
Williams, Matthew
author_sort Sloan, Luke
collection PubMed
description This paper specifies, designs and critically evaluates two tools for the automated identification of demographic data (age, occupation and social class) from the profile descriptions of Twitter users in the United Kingdom (UK). Meta-data data routinely collected through the Collaborative Social Media Observatory (COSMOS: http://www.cosmosproject.net/) relating to UK Twitter users is matched with the occupational lookup tables between job and social class provided by the Office for National Statistics (ONS) using SOC2010. Using expert human validation, the validity and reliability of the automated matching process is critically assessed and a prospective class distribution of UK Twitter users is offered with 2011 Census baseline comparisons. The pattern matching rules for identifying age are explained and enacted following a discussion on how to minimise false positives. The age distribution of Twitter users, as identified using the tool, is presented alongside the age distribution of the UK population from the 2011 Census. The automated occupation detection tool reliably identifies certain occupational groups, such as professionals, for which job titles cannot be confused with hobbies or are used in common parlance within alternative contexts. An alternative explanation on the prevalence of hobbies is that the creative sector is overrepresented on Twitter compared to 2011 Census data. The age detection tool illustrates the youthfulness of Twitter users compared to the general UK population as of the 2011 Census according to proportions, but projections demonstrate that there is still potentially a large number of older platform users. It is possible to detect “signatures” of both occupation and age from Twitter meta-data with varying degrees of accuracy (particularly dependent on occupational groups) but further confirmatory work is needed.
format Online
Article
Text
id pubmed-4346393
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-43463932015-03-17 Who Tweets? Deriving the Demographic Characteristics of Age, Occupation and Social Class from Twitter User Meta-Data Sloan, Luke Morgan, Jeffrey Burnap, Pete Williams, Matthew PLoS One Research Article This paper specifies, designs and critically evaluates two tools for the automated identification of demographic data (age, occupation and social class) from the profile descriptions of Twitter users in the United Kingdom (UK). Meta-data data routinely collected through the Collaborative Social Media Observatory (COSMOS: http://www.cosmosproject.net/) relating to UK Twitter users is matched with the occupational lookup tables between job and social class provided by the Office for National Statistics (ONS) using SOC2010. Using expert human validation, the validity and reliability of the automated matching process is critically assessed and a prospective class distribution of UK Twitter users is offered with 2011 Census baseline comparisons. The pattern matching rules for identifying age are explained and enacted following a discussion on how to minimise false positives. The age distribution of Twitter users, as identified using the tool, is presented alongside the age distribution of the UK population from the 2011 Census. The automated occupation detection tool reliably identifies certain occupational groups, such as professionals, for which job titles cannot be confused with hobbies or are used in common parlance within alternative contexts. An alternative explanation on the prevalence of hobbies is that the creative sector is overrepresented on Twitter compared to 2011 Census data. The age detection tool illustrates the youthfulness of Twitter users compared to the general UK population as of the 2011 Census according to proportions, but projections demonstrate that there is still potentially a large number of older platform users. It is possible to detect “signatures” of both occupation and age from Twitter meta-data with varying degrees of accuracy (particularly dependent on occupational groups) but further confirmatory work is needed. Public Library of Science 2015-03-02 /pmc/articles/PMC4346393/ /pubmed/25729900 http://dx.doi.org/10.1371/journal.pone.0115545 Text en © 2015 Sloan et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Sloan, Luke
Morgan, Jeffrey
Burnap, Pete
Williams, Matthew
Who Tweets? Deriving the Demographic Characteristics of Age, Occupation and Social Class from Twitter User Meta-Data
title Who Tweets? Deriving the Demographic Characteristics of Age, Occupation and Social Class from Twitter User Meta-Data
title_full Who Tweets? Deriving the Demographic Characteristics of Age, Occupation and Social Class from Twitter User Meta-Data
title_fullStr Who Tweets? Deriving the Demographic Characteristics of Age, Occupation and Social Class from Twitter User Meta-Data
title_full_unstemmed Who Tweets? Deriving the Demographic Characteristics of Age, Occupation and Social Class from Twitter User Meta-Data
title_short Who Tweets? Deriving the Demographic Characteristics of Age, Occupation and Social Class from Twitter User Meta-Data
title_sort who tweets? deriving the demographic characteristics of age, occupation and social class from twitter user meta-data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4346393/
https://www.ncbi.nlm.nih.gov/pubmed/25729900
http://dx.doi.org/10.1371/journal.pone.0115545
work_keys_str_mv AT sloanluke whotweetsderivingthedemographiccharacteristicsofageoccupationandsocialclassfromtwitterusermetadata
AT morganjeffrey whotweetsderivingthedemographiccharacteristicsofageoccupationandsocialclassfromtwitterusermetadata
AT burnappete whotweetsderivingthedemographiccharacteristicsofageoccupationandsocialclassfromtwitterusermetadata
AT williamsmatthew whotweetsderivingthedemographiccharacteristicsofageoccupationandsocialclassfromtwitterusermetadata