Cargando…

Emo, love and god: making sense of Urban Dictionary, a crowd-sourced online dictionary

The Internet facilitates large-scale collaborative projects and the emergence of Web 2.0 platforms, where producers and consumers of content unify, has drastically changed the information market. On the one hand, the promise of the ‘wisdom of the crowd’ has inspired successful projects such as Wikip...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen, Dong, McGillivray, Barbara, Yasseri, Taha
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society Publishing 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5990761/
https://www.ncbi.nlm.nih.gov/pubmed/29892417
http://dx.doi.org/10.1098/rsos.172320
_version_ 1783329641005580288
author Nguyen, Dong
McGillivray, Barbara
Yasseri, Taha
author_facet Nguyen, Dong
McGillivray, Barbara
Yasseri, Taha
author_sort Nguyen, Dong
collection PubMed
description The Internet facilitates large-scale collaborative projects and the emergence of Web 2.0 platforms, where producers and consumers of content unify, has drastically changed the information market. On the one hand, the promise of the ‘wisdom of the crowd’ has inspired successful projects such as Wikipedia, which has become the primary source of crowd-based information in many languages. On the other hand, the decentralized and often unmonitored environment of such projects may make them susceptible to low-quality content. In this work, we focus on Urban Dictionary, a crowd-sourced online dictionary. We combine computational methods with qualitative annotation and shed light on the overall features of Urban Dictionary in terms of growth, coverage and types of content. We measure a high presence of opinion-focused entries, as opposed to the meaning-focused entries that we expect from traditional dictionaries. Furthermore, Urban Dictionary covers many informal, unfamiliar words as well as proper nouns. Urban Dictionary also contains offensive content, but highly offensive content tends to receive lower scores through the dictionary’s voting system. The low threshold to include new material in Urban Dictionary enables quick recording of new words and new meanings, but the resulting heterogeneous content can pose challenges in using Urban Dictionary as a source to study language innovation.
format Online
Article
Text
id pubmed-5990761
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher The Royal Society Publishing
record_format MEDLINE/PubMed
spelling pubmed-59907612018-06-11 Emo, love and god: making sense of Urban Dictionary, a crowd-sourced online dictionary Nguyen, Dong McGillivray, Barbara Yasseri, Taha R Soc Open Sci Computer Science The Internet facilitates large-scale collaborative projects and the emergence of Web 2.0 platforms, where producers and consumers of content unify, has drastically changed the information market. On the one hand, the promise of the ‘wisdom of the crowd’ has inspired successful projects such as Wikipedia, which has become the primary source of crowd-based information in many languages. On the other hand, the decentralized and often unmonitored environment of such projects may make them susceptible to low-quality content. In this work, we focus on Urban Dictionary, a crowd-sourced online dictionary. We combine computational methods with qualitative annotation and shed light on the overall features of Urban Dictionary in terms of growth, coverage and types of content. We measure a high presence of opinion-focused entries, as opposed to the meaning-focused entries that we expect from traditional dictionaries. Furthermore, Urban Dictionary covers many informal, unfamiliar words as well as proper nouns. Urban Dictionary also contains offensive content, but highly offensive content tends to receive lower scores through the dictionary’s voting system. The low threshold to include new material in Urban Dictionary enables quick recording of new words and new meanings, but the resulting heterogeneous content can pose challenges in using Urban Dictionary as a source to study language innovation. The Royal Society Publishing 2018-05-02 /pmc/articles/PMC5990761/ /pubmed/29892417 http://dx.doi.org/10.1098/rsos.172320 Text en © 2018 The Authors. http://creativecommons.org/licenses/by/4.0/ Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited.
spellingShingle Computer Science
Nguyen, Dong
McGillivray, Barbara
Yasseri, Taha
Emo, love and god: making sense of Urban Dictionary, a crowd-sourced online dictionary
title Emo, love and god: making sense of Urban Dictionary, a crowd-sourced online dictionary
title_full Emo, love and god: making sense of Urban Dictionary, a crowd-sourced online dictionary
title_fullStr Emo, love and god: making sense of Urban Dictionary, a crowd-sourced online dictionary
title_full_unstemmed Emo, love and god: making sense of Urban Dictionary, a crowd-sourced online dictionary
title_short Emo, love and god: making sense of Urban Dictionary, a crowd-sourced online dictionary
title_sort emo, love and god: making sense of urban dictionary, a crowd-sourced online dictionary
topic Computer Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5990761/
https://www.ncbi.nlm.nih.gov/pubmed/29892417
http://dx.doi.org/10.1098/rsos.172320
work_keys_str_mv AT nguyendong emoloveandgodmakingsenseofurbandictionaryacrowdsourcedonlinedictionary
AT mcgillivraybarbara emoloveandgodmakingsenseofurbandictionaryacrowdsourcedonlinedictionary
AT yasseritaha emoloveandgodmakingsenseofurbandictionaryacrowdsourcedonlinedictionary