Cargando…

Producing knowledge by admitting ignorance: Enhancing data quality through an “I don’t know” option in citizen science

The “noisy labeler problem” in crowdsourced data has attracted great attention in recent years, with important ramifications in citizen science, where non-experts must produce high-quality data. Particularly relevant to citizen science is dynamic task allocation, in which the level of agreement amon...

Descripción completa

Detalles Bibliográficos
Autores principales:	Torre, Marina, Nakayama, Shinnosuke, Tolbert, Tyrone J., Porfiri, Maurizio
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2019
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6392254/ https://www.ncbi.nlm.nih.gov/pubmed/30811452 http://dx.doi.org/10.1371/journal.pone.0211907

_version_	1783398440278949888
author	Torre, Marina Nakayama, Shinnosuke Tolbert, Tyrone J. Porfiri, Maurizio
author_facet	Torre, Marina Nakayama, Shinnosuke Tolbert, Tyrone J. Porfiri, Maurizio
author_sort	Torre, Marina
collection	PubMed
description	The “noisy labeler problem” in crowdsourced data has attracted great attention in recent years, with important ramifications in citizen science, where non-experts must produce high-quality data. Particularly relevant to citizen science is dynamic task allocation, in which the level of agreement among labelers can be progressively updated through the information-theoretic notion of entropy. Under dynamic task allocation, we hypothesized that providing volunteers with an “I don’t know” option would contribute to enhancing data quality, by introducing further, useful information about the level of agreement among volunteers. We investigated the influence of an “I don’t know” option on the data quality in a citizen science project that entailed classifying the image of a highly polluted canal into “threat” or “no threat” to the environment. Our results show that an “I don’t know” option can enhance accuracy, compared to the case without the option; such an improvement mostly affects the true negative rather than the true positive rate. In an information-theoretic sense, these seemingly meaningless blank votes constitute a meaningful piece of information to help enhance accuracy of data in citizen science.
format	Online Article Text
id	pubmed-6392254
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-63922542019-03-08 Producing knowledge by admitting ignorance: Enhancing data quality through an “I don’t know” option in citizen science Torre, Marina Nakayama, Shinnosuke Tolbert, Tyrone J. Porfiri, Maurizio PLoS One Research Article The “noisy labeler problem” in crowdsourced data has attracted great attention in recent years, with important ramifications in citizen science, where non-experts must produce high-quality data. Particularly relevant to citizen science is dynamic task allocation, in which the level of agreement among labelers can be progressively updated through the information-theoretic notion of entropy. Under dynamic task allocation, we hypothesized that providing volunteers with an “I don’t know” option would contribute to enhancing data quality, by introducing further, useful information about the level of agreement among volunteers. We investigated the influence of an “I don’t know” option on the data quality in a citizen science project that entailed classifying the image of a highly polluted canal into “threat” or “no threat” to the environment. Our results show that an “I don’t know” option can enhance accuracy, compared to the case without the option; such an improvement mostly affects the true negative rather than the true positive rate. In an information-theoretic sense, these seemingly meaningless blank votes constitute a meaningful piece of information to help enhance accuracy of data in citizen science. Public Library of Science 2019-02-27 /pmc/articles/PMC6392254/ /pubmed/30811452 http://dx.doi.org/10.1371/journal.pone.0211907 Text en © 2019 Torre et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Torre, Marina Nakayama, Shinnosuke Tolbert, Tyrone J. Porfiri, Maurizio Producing knowledge by admitting ignorance: Enhancing data quality through an “I don’t know” option in citizen science
title	Producing knowledge by admitting ignorance: Enhancing data quality through an “I don’t know” option in citizen science
title_full	Producing knowledge by admitting ignorance: Enhancing data quality through an “I don’t know” option in citizen science
title_fullStr	Producing knowledge by admitting ignorance: Enhancing data quality through an “I don’t know” option in citizen science
title_full_unstemmed	Producing knowledge by admitting ignorance: Enhancing data quality through an “I don’t know” option in citizen science
title_short	Producing knowledge by admitting ignorance: Enhancing data quality through an “I don’t know” option in citizen science
title_sort	producing knowledge by admitting ignorance: enhancing data quality through an “i don’t know” option in citizen science
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6392254/ https://www.ncbi.nlm.nih.gov/pubmed/30811452 http://dx.doi.org/10.1371/journal.pone.0211907
work_keys_str_mv	AT torremarina producingknowledgebyadmittingignoranceenhancingdataqualitythroughanidontknowoptionincitizenscience AT nakayamashinnosuke producingknowledgebyadmittingignoranceenhancingdataqualitythroughanidontknowoptionincitizenscience AT tolberttyronej producingknowledgebyadmittingignoranceenhancingdataqualitythroughanidontknowoptionincitizenscience AT porfirimaurizio producingknowledgebyadmittingignoranceenhancingdataqualitythroughanidontknowoptionincitizenscience

Producing knowledge by admitting ignorance: Enhancing data quality through an “I don’t know” option in citizen science

Ejemplares similares