Cargando…
Reliable Aggregation Method for Vector Regression Tasks in Crowdsourcing
Crowdsourcing platforms are widely used for collecting large amount of labeled data. Due to low-paid workers and inherent noise, the quality of acquired data could be easily degraded. To solve this, most previous studies have sought to infer the true answer from noisy labels in discrete multiple-cho...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206235/ http://dx.doi.org/10.1007/978-3-030-47436-2_20 |
_version_ | 1783530374335299584 |
---|---|
author | Kim, Joonyoung Lee, Donghyeon Jung, Kyomin |
author_facet | Kim, Joonyoung Lee, Donghyeon Jung, Kyomin |
author_sort | Kim, Joonyoung |
collection | PubMed |
description | Crowdsourcing platforms are widely used for collecting large amount of labeled data. Due to low-paid workers and inherent noise, the quality of acquired data could be easily degraded. To solve this, most previous studies have sought to infer the true answer from noisy labels in discrete multiple-choice tasks that ask workers to select one of several answer candidates. However, recent crowdsourcing tasks have become more complicated and usually consist of real-valued vectors. In this paper, we propose a novel inference algorithm for vector regression tasks which ask workers to provide accurate vectors such as image object localization and human posture estimation. Our algorithm can estimate the true answer of each task and a reliability of each worker by updating two types of messages iteratively. We also prove its performance bound which depends on the number of queries per task and the average quality of workers. Under a certain condition, we prove that its average performance becomes close to an oracle estimator which knows the reliability of every worker. Through extensive experiments with both real-world and synthetic datasets, we verify that our algorithm are superior to other state-of-the-art algorithms. |
format | Online Article Text |
id | pubmed-7206235 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-72062352020-05-08 Reliable Aggregation Method for Vector Regression Tasks in Crowdsourcing Kim, Joonyoung Lee, Donghyeon Jung, Kyomin Advances in Knowledge Discovery and Data Mining Article Crowdsourcing platforms are widely used for collecting large amount of labeled data. Due to low-paid workers and inherent noise, the quality of acquired data could be easily degraded. To solve this, most previous studies have sought to infer the true answer from noisy labels in discrete multiple-choice tasks that ask workers to select one of several answer candidates. However, recent crowdsourcing tasks have become more complicated and usually consist of real-valued vectors. In this paper, we propose a novel inference algorithm for vector regression tasks which ask workers to provide accurate vectors such as image object localization and human posture estimation. Our algorithm can estimate the true answer of each task and a reliability of each worker by updating two types of messages iteratively. We also prove its performance bound which depends on the number of queries per task and the average quality of workers. Under a certain condition, we prove that its average performance becomes close to an oracle estimator which knows the reliability of every worker. Through extensive experiments with both real-world and synthetic datasets, we verify that our algorithm are superior to other state-of-the-art algorithms. 2020-04-17 /pmc/articles/PMC7206235/ http://dx.doi.org/10.1007/978-3-030-47436-2_20 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Kim, Joonyoung Lee, Donghyeon Jung, Kyomin Reliable Aggregation Method for Vector Regression Tasks in Crowdsourcing |
title | Reliable Aggregation Method for Vector Regression Tasks in Crowdsourcing |
title_full | Reliable Aggregation Method for Vector Regression Tasks in Crowdsourcing |
title_fullStr | Reliable Aggregation Method for Vector Regression Tasks in Crowdsourcing |
title_full_unstemmed | Reliable Aggregation Method for Vector Regression Tasks in Crowdsourcing |
title_short | Reliable Aggregation Method for Vector Regression Tasks in Crowdsourcing |
title_sort | reliable aggregation method for vector regression tasks in crowdsourcing |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206235/ http://dx.doi.org/10.1007/978-3-030-47436-2_20 |
work_keys_str_mv | AT kimjoonyoung reliableaggregationmethodforvectorregressiontasksincrowdsourcing AT leedonghyeon reliableaggregationmethodforvectorregressiontasksincrowdsourcing AT jungkyomin reliableaggregationmethodforvectorregressiontasksincrowdsourcing |