Cargando…

Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries

Social data in digital form—including user-generated content, expressed or implicit relations between people, and behavioral traces—are at the core of popular applications and platforms, driving the research agenda of many researchers. The promises of social data are many, including understanding “w...

Descripción completa

Detalles Bibliográficos
Autores principales: Olteanu, Alexandra, Castillo, Carlos, Diaz, Fernando, Kıcıman, Emre
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931947/
https://www.ncbi.nlm.nih.gov/pubmed/33693336
http://dx.doi.org/10.3389/fdata.2019.00013
_version_ 1783660389264785408
author Olteanu, Alexandra
Castillo, Carlos
Diaz, Fernando
Kıcıman, Emre
author_facet Olteanu, Alexandra
Castillo, Carlos
Diaz, Fernando
Kıcıman, Emre
author_sort Olteanu, Alexandra
collection PubMed
description Social data in digital form—including user-generated content, expressed or implicit relations between people, and behavioral traces—are at the core of popular applications and platforms, driving the research agenda of many researchers. The promises of social data are many, including understanding “what the world thinks” about a social issue, brand, celebrity, or other entity, as well as enabling better decision-making in a variety of fields including public policy, healthcare, and economics. Many academics and practitioners have warned against the naïve usage of social data. There are biases and inaccuracies occurring at the source of the data, but also introduced during processing. There are methodological limitations and pitfalls, as well as ethical boundaries and unexpected consequences that are often overlooked. This paper recognizes the rigor with which these issues are addressed by different researchers varies across a wide range. We identify a variety of menaces in the practices around social data use, and organize them in a framework that helps to identify them. “For your own sanity, you have to remember that not all problems can be solved. Not all problems can be solved, but all problems can be illuminated.” –Ursula Franklin
format Online
Article
Text
id pubmed-7931947
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-79319472021-03-09 Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries Olteanu, Alexandra Castillo, Carlos Diaz, Fernando Kıcıman, Emre Front Big Data Big Data Social data in digital form—including user-generated content, expressed or implicit relations between people, and behavioral traces—are at the core of popular applications and platforms, driving the research agenda of many researchers. The promises of social data are many, including understanding “what the world thinks” about a social issue, brand, celebrity, or other entity, as well as enabling better decision-making in a variety of fields including public policy, healthcare, and economics. Many academics and practitioners have warned against the naïve usage of social data. There are biases and inaccuracies occurring at the source of the data, but also introduced during processing. There are methodological limitations and pitfalls, as well as ethical boundaries and unexpected consequences that are often overlooked. This paper recognizes the rigor with which these issues are addressed by different researchers varies across a wide range. We identify a variety of menaces in the practices around social data use, and organize them in a framework that helps to identify them. “For your own sanity, you have to remember that not all problems can be solved. Not all problems can be solved, but all problems can be illuminated.” –Ursula Franklin Frontiers Media S.A. 2019-07-11 /pmc/articles/PMC7931947/ /pubmed/33693336 http://dx.doi.org/10.3389/fdata.2019.00013 Text en Copyright © 2019 Olteanu, Castillo, Diaz and Kıcıman. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Big Data
Olteanu, Alexandra
Castillo, Carlos
Diaz, Fernando
Kıcıman, Emre
Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries
title Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries
title_full Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries
title_fullStr Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries
title_full_unstemmed Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries
title_short Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries
title_sort social data: biases, methodological pitfalls, and ethical boundaries
topic Big Data
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931947/
https://www.ncbi.nlm.nih.gov/pubmed/33693336
http://dx.doi.org/10.3389/fdata.2019.00013
work_keys_str_mv AT olteanualexandra socialdatabiasesmethodologicalpitfallsandethicalboundaries
AT castillocarlos socialdatabiasesmethodologicalpitfallsandethicalboundaries
AT diazfernando socialdatabiasesmethodologicalpitfallsandethicalboundaries
AT kıcımanemre socialdatabiasesmethodologicalpitfallsandethicalboundaries