Cargando…

To scrape or not to scrape, this is dilemma. The post-API scenario and implications on digital research

INTRODUCTION: This article aims to investigate the potential impact of restricted social data access on digital research practices. The 2018 Cambridge Analytica scandal exposed the exploitation of Facebook user data for speculative purposes and led to the end of the so-called “Data Golden Age,” char...

Descripción completa

Detalles Bibliográficos
Autor principal: Trezza, Domenico
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10060875/
https://www.ncbi.nlm.nih.gov/pubmed/37006635
http://dx.doi.org/10.3389/fsoc.2023.1145038
_version_ 1785017173798813696
author Trezza, Domenico
author_facet Trezza, Domenico
author_sort Trezza, Domenico
collection PubMed
description INTRODUCTION: This article aims to investigate the potential impact of restricted social data access on digital research practices. The 2018 Cambridge Analytica scandal exposed the exploitation of Facebook user data for speculative purposes and led to the end of the so-called “Data Golden Age,” characterized by free access to social media user data. As a result, many social platforms have limited or entirely banned data access. This policy shift, referred to as the “APIcalypse,” has revolutionized digital research methods. METHODS: To address the impact of this policy shift on digital research, a non-probabilistic sample of Italian researchers was surveyed and the responses were analyzed. The survey was designed to explore how constraints on digital data access have altered research practices, whether we are truly in a post-API era with a radical change in data scraping strategies, and what shared and sustainable solutions can be identified for the post-API scenario. RESULTS: The findings highlight how limits on social data access have not yet created a “post-Api” scenario as expected, but it is turning research practices upside down, positively and negatively. On the positive side, because researchers are experimenting with innovative forms of scraping. Negatively, because there could be a “mass migration” to the few platforms that freely grant their APIs, with critical consequences for the quality of research. DISCUSSION: The closure of many social media APIs has not opened up a post-API world, but has worsened the conditions of making research, which is increasingly oriented to “easy-data” environments such as Twitter. This should prompt digital researchers to make a self-reflexive effort to diversify research platforms and especially to act ethically with user data. It would also be important for the scientific world and large platforms to enter into understandings for open and conscious sharing of data in the name of scientific progress.
format Online
Article
Text
id pubmed-10060875
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-100608752023-03-31 To scrape or not to scrape, this is dilemma. The post-API scenario and implications on digital research Trezza, Domenico Front Sociol Sociology INTRODUCTION: This article aims to investigate the potential impact of restricted social data access on digital research practices. The 2018 Cambridge Analytica scandal exposed the exploitation of Facebook user data for speculative purposes and led to the end of the so-called “Data Golden Age,” characterized by free access to social media user data. As a result, many social platforms have limited or entirely banned data access. This policy shift, referred to as the “APIcalypse,” has revolutionized digital research methods. METHODS: To address the impact of this policy shift on digital research, a non-probabilistic sample of Italian researchers was surveyed and the responses were analyzed. The survey was designed to explore how constraints on digital data access have altered research practices, whether we are truly in a post-API era with a radical change in data scraping strategies, and what shared and sustainable solutions can be identified for the post-API scenario. RESULTS: The findings highlight how limits on social data access have not yet created a “post-Api” scenario as expected, but it is turning research practices upside down, positively and negatively. On the positive side, because researchers are experimenting with innovative forms of scraping. Negatively, because there could be a “mass migration” to the few platforms that freely grant their APIs, with critical consequences for the quality of research. DISCUSSION: The closure of many social media APIs has not opened up a post-API world, but has worsened the conditions of making research, which is increasingly oriented to “easy-data” environments such as Twitter. This should prompt digital researchers to make a self-reflexive effort to diversify research platforms and especially to act ethically with user data. It would also be important for the scientific world and large platforms to enter into understandings for open and conscious sharing of data in the name of scientific progress. Frontiers Media S.A. 2023-03-16 /pmc/articles/PMC10060875/ /pubmed/37006635 http://dx.doi.org/10.3389/fsoc.2023.1145038 Text en Copyright © 2023 Trezza. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Sociology
Trezza, Domenico
To scrape or not to scrape, this is dilemma. The post-API scenario and implications on digital research
title To scrape or not to scrape, this is dilemma. The post-API scenario and implications on digital research
title_full To scrape or not to scrape, this is dilemma. The post-API scenario and implications on digital research
title_fullStr To scrape or not to scrape, this is dilemma. The post-API scenario and implications on digital research
title_full_unstemmed To scrape or not to scrape, this is dilemma. The post-API scenario and implications on digital research
title_short To scrape or not to scrape, this is dilemma. The post-API scenario and implications on digital research
title_sort to scrape or not to scrape, this is dilemma. the post-api scenario and implications on digital research
topic Sociology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10060875/
https://www.ncbi.nlm.nih.gov/pubmed/37006635
http://dx.doi.org/10.3389/fsoc.2023.1145038
work_keys_str_mv AT trezzadomenico toscrapeornottoscrapethisisdilemmathepostapiscenarioandimplicationsondigitalresearch