Cargando…

Reliable Aggregation Method for Vector Regression Tasks in Crowdsourcing

Crowdsourcing platforms are widely used for collecting large amount of labeled data. Due to low-paid workers and inherent noise, the quality of acquired data could be easily degraded. To solve this, most previous studies have sought to infer the true answer from noisy labels in discrete multiple-cho...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Joonyoung, Lee, Donghyeon, Jung, Kyomin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206235/
http://dx.doi.org/10.1007/978-3-030-47436-2_20
_version_ 1783530374335299584
author Kim, Joonyoung
Lee, Donghyeon
Jung, Kyomin
author_facet Kim, Joonyoung
Lee, Donghyeon
Jung, Kyomin
author_sort Kim, Joonyoung
collection PubMed
description Crowdsourcing platforms are widely used for collecting large amount of labeled data. Due to low-paid workers and inherent noise, the quality of acquired data could be easily degraded. To solve this, most previous studies have sought to infer the true answer from noisy labels in discrete multiple-choice tasks that ask workers to select one of several answer candidates. However, recent crowdsourcing tasks have become more complicated and usually consist of real-valued vectors. In this paper, we propose a novel inference algorithm for vector regression tasks which ask workers to provide accurate vectors such as image object localization and human posture estimation. Our algorithm can estimate the true answer of each task and a reliability of each worker by updating two types of messages iteratively. We also prove its performance bound which depends on the number of queries per task and the average quality of workers. Under a certain condition, we prove that its average performance becomes close to an oracle estimator which knows the reliability of every worker. Through extensive experiments with both real-world and synthetic datasets, we verify that our algorithm are superior to other state-of-the-art algorithms.
format Online
Article
Text
id pubmed-7206235
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-72062352020-05-08 Reliable Aggregation Method for Vector Regression Tasks in Crowdsourcing Kim, Joonyoung Lee, Donghyeon Jung, Kyomin Advances in Knowledge Discovery and Data Mining Article Crowdsourcing platforms are widely used for collecting large amount of labeled data. Due to low-paid workers and inherent noise, the quality of acquired data could be easily degraded. To solve this, most previous studies have sought to infer the true answer from noisy labels in discrete multiple-choice tasks that ask workers to select one of several answer candidates. However, recent crowdsourcing tasks have become more complicated and usually consist of real-valued vectors. In this paper, we propose a novel inference algorithm for vector regression tasks which ask workers to provide accurate vectors such as image object localization and human posture estimation. Our algorithm can estimate the true answer of each task and a reliability of each worker by updating two types of messages iteratively. We also prove its performance bound which depends on the number of queries per task and the average quality of workers. Under a certain condition, we prove that its average performance becomes close to an oracle estimator which knows the reliability of every worker. Through extensive experiments with both real-world and synthetic datasets, we verify that our algorithm are superior to other state-of-the-art algorithms. 2020-04-17 /pmc/articles/PMC7206235/ http://dx.doi.org/10.1007/978-3-030-47436-2_20 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Kim, Joonyoung
Lee, Donghyeon
Jung, Kyomin
Reliable Aggregation Method for Vector Regression Tasks in Crowdsourcing
title Reliable Aggregation Method for Vector Regression Tasks in Crowdsourcing
title_full Reliable Aggregation Method for Vector Regression Tasks in Crowdsourcing
title_fullStr Reliable Aggregation Method for Vector Regression Tasks in Crowdsourcing
title_full_unstemmed Reliable Aggregation Method for Vector Regression Tasks in Crowdsourcing
title_short Reliable Aggregation Method for Vector Regression Tasks in Crowdsourcing
title_sort reliable aggregation method for vector regression tasks in crowdsourcing
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206235/
http://dx.doi.org/10.1007/978-3-030-47436-2_20
work_keys_str_mv AT kimjoonyoung reliableaggregationmethodforvectorregressiontasksincrowdsourcing
AT leedonghyeon reliableaggregationmethodforvectorregressiontasksincrowdsourcing
AT jungkyomin reliableaggregationmethodforvectorregressiontasksincrowdsourcing