Cargando…
Scaling and Disagreements: Bias, Noise, and Ambiguity
Crowdsourced data are often rife with disagreement, either because of genuine item ambiguity, overlapping labels, subjectivity, or annotator error. Hence, a variety of methods have been developed for learning from data containing disagreement. One of the observations emerging from this work is that...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9012579/ https://www.ncbi.nlm.nih.gov/pubmed/35434607 http://dx.doi.org/10.3389/frai.2022.818451 |
_version_ | 1784687826176049152 |
---|---|
author | Uma, Alexandra Almanea, Dina Poesio, Massimo |
author_facet | Uma, Alexandra Almanea, Dina Poesio, Massimo |
author_sort | Uma, Alexandra |
collection | PubMed |
description | Crowdsourced data are often rife with disagreement, either because of genuine item ambiguity, overlapping labels, subjectivity, or annotator error. Hence, a variety of methods have been developed for learning from data containing disagreement. One of the observations emerging from this work is that different methods appear to work best depending on characteristics of the dataset such as the level of noise. In this paper, we investigate the use of an approach developed to estimate noise, temperature scaling, in learning from data containing disagreements. We find that temperature scaling works with data in which the disagreements are the result of label overlap, but not with data in which the disagreements are due to annotator bias, as in, e.g., subjective tasks such as labeling an item as offensive or not. We also find that disagreements due to ambiguity do not fit perfectly either category. |
format | Online Article Text |
id | pubmed-9012579 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-90125792022-04-16 Scaling and Disagreements: Bias, Noise, and Ambiguity Uma, Alexandra Almanea, Dina Poesio, Massimo Front Artif Intell Artificial Intelligence Crowdsourced data are often rife with disagreement, either because of genuine item ambiguity, overlapping labels, subjectivity, or annotator error. Hence, a variety of methods have been developed for learning from data containing disagreement. One of the observations emerging from this work is that different methods appear to work best depending on characteristics of the dataset such as the level of noise. In this paper, we investigate the use of an approach developed to estimate noise, temperature scaling, in learning from data containing disagreements. We find that temperature scaling works with data in which the disagreements are the result of label overlap, but not with data in which the disagreements are due to annotator bias, as in, e.g., subjective tasks such as labeling an item as offensive or not. We also find that disagreements due to ambiguity do not fit perfectly either category. Frontiers Media S.A. 2022-04-01 /pmc/articles/PMC9012579/ /pubmed/35434607 http://dx.doi.org/10.3389/frai.2022.818451 Text en Copyright © 2022 Uma, Almanea and Poesio. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Artificial Intelligence Uma, Alexandra Almanea, Dina Poesio, Massimo Scaling and Disagreements: Bias, Noise, and Ambiguity |
title | Scaling and Disagreements: Bias, Noise, and Ambiguity |
title_full | Scaling and Disagreements: Bias, Noise, and Ambiguity |
title_fullStr | Scaling and Disagreements: Bias, Noise, and Ambiguity |
title_full_unstemmed | Scaling and Disagreements: Bias, Noise, and Ambiguity |
title_short | Scaling and Disagreements: Bias, Noise, and Ambiguity |
title_sort | scaling and disagreements: bias, noise, and ambiguity |
topic | Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9012579/ https://www.ncbi.nlm.nih.gov/pubmed/35434607 http://dx.doi.org/10.3389/frai.2022.818451 |
work_keys_str_mv | AT umaalexandra scalinganddisagreementsbiasnoiseandambiguity AT almaneadina scalinganddisagreementsbiasnoiseandambiguity AT poesiomassimo scalinganddisagreementsbiasnoiseandambiguity |