Cargando…

Crowdbreaks: Tracking Health Trends Using Public Social Media Data and Crowdsourcing

In the past decade, tracking health trends using social media data has shown great promise, due to a powerful combination of massive adoption of social media around the world, and increasingly potent hardware and software that enables us to work with these new big data streams. At the same time, man...

Descripción completa

Detalles Bibliográficos
Autores principales: Müller, Martin M., Salathé, Marcel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6476276/
https://www.ncbi.nlm.nih.gov/pubmed/31037238
http://dx.doi.org/10.3389/fpubh.2019.00081
_version_ 1783412880205414400
author Müller, Martin M.
Salathé, Marcel
author_facet Müller, Martin M.
Salathé, Marcel
author_sort Müller, Martin M.
collection PubMed
description In the past decade, tracking health trends using social media data has shown great promise, due to a powerful combination of massive adoption of social media around the world, and increasingly potent hardware and software that enables us to work with these new big data streams. At the same time, many challenging problems have been identified. First, there is often a mismatch between how rapidly online data can change, and how rapidly algorithms are updated, which means that there is limited reusability for algorithms trained on past data as their performance decreases over time. Second, much of the work is focusing on specific issues during a specific past period in time, even though public health institutions would need flexible tools to assess multiple evolving situations in real time. Third, most tools providing such capabilities are proprietary systems with little algorithmic or data transparency, and thus little buy-in from the global public health and research community. Here, we introduce Crowdbreaks, an open platform which allows tracking of health trends by making use of continuous crowdsourced labeling of public social media content. The system is built in a way which automatizes the typical workflow from data collection, filtering, labeling and training of machine learning classifiers and therefore can greatly accelerate the research process in the public health domain. This work describes the technical aspects of the platform, thereby covering the functionalities at its current state and exploring its future use cases and extensions.
format Online
Article
Text
id pubmed-6476276
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-64762762019-04-29 Crowdbreaks: Tracking Health Trends Using Public Social Media Data and Crowdsourcing Müller, Martin M. Salathé, Marcel Front Public Health Public Health In the past decade, tracking health trends using social media data has shown great promise, due to a powerful combination of massive adoption of social media around the world, and increasingly potent hardware and software that enables us to work with these new big data streams. At the same time, many challenging problems have been identified. First, there is often a mismatch between how rapidly online data can change, and how rapidly algorithms are updated, which means that there is limited reusability for algorithms trained on past data as their performance decreases over time. Second, much of the work is focusing on specific issues during a specific past period in time, even though public health institutions would need flexible tools to assess multiple evolving situations in real time. Third, most tools providing such capabilities are proprietary systems with little algorithmic or data transparency, and thus little buy-in from the global public health and research community. Here, we introduce Crowdbreaks, an open platform which allows tracking of health trends by making use of continuous crowdsourced labeling of public social media content. The system is built in a way which automatizes the typical workflow from data collection, filtering, labeling and training of machine learning classifiers and therefore can greatly accelerate the research process in the public health domain. This work describes the technical aspects of the platform, thereby covering the functionalities at its current state and exploring its future use cases and extensions. Frontiers Media S.A. 2019-04-12 /pmc/articles/PMC6476276/ /pubmed/31037238 http://dx.doi.org/10.3389/fpubh.2019.00081 Text en Copyright © 2019 Müller and Salathé. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Public Health
Müller, Martin M.
Salathé, Marcel
Crowdbreaks: Tracking Health Trends Using Public Social Media Data and Crowdsourcing
title Crowdbreaks: Tracking Health Trends Using Public Social Media Data and Crowdsourcing
title_full Crowdbreaks: Tracking Health Trends Using Public Social Media Data and Crowdsourcing
title_fullStr Crowdbreaks: Tracking Health Trends Using Public Social Media Data and Crowdsourcing
title_full_unstemmed Crowdbreaks: Tracking Health Trends Using Public Social Media Data and Crowdsourcing
title_short Crowdbreaks: Tracking Health Trends Using Public Social Media Data and Crowdsourcing
title_sort crowdbreaks: tracking health trends using public social media data and crowdsourcing
topic Public Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6476276/
https://www.ncbi.nlm.nih.gov/pubmed/31037238
http://dx.doi.org/10.3389/fpubh.2019.00081
work_keys_str_mv AT mullermartinm crowdbreakstrackinghealthtrendsusingpublicsocialmediadataandcrowdsourcing
AT salathemarcel crowdbreakstrackinghealthtrendsusingpublicsocialmediadataandcrowdsourcing