Cargando…

Judging facts, judging norms: Training machine learning models to judge humans requires a modified approach to labeling data

As governments and industry turn to increased use of automated decision systems, it becomes essential to consider how closely such systems can reproduce human judgment. We identify a core potential failure, finding that annotators label objects differently depending on whether they are being asked a...

Descripción completa

Detalles Bibliográficos
Autores principales: Balagopalan, Aparna, Madras, David, Yang, David H., Hadfield-Menell, Dylan, Hadfield, Gillian K., Ghassemi, Marzyeh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Association for the Advancement of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10171805/
https://www.ncbi.nlm.nih.gov/pubmed/37163590
http://dx.doi.org/10.1126/sciadv.abq0701
_version_ 1785039501491437568
author Balagopalan, Aparna
Madras, David
Yang, David H.
Hadfield-Menell, Dylan
Hadfield, Gillian K.
Ghassemi, Marzyeh
author_facet Balagopalan, Aparna
Madras, David
Yang, David H.
Hadfield-Menell, Dylan
Hadfield, Gillian K.
Ghassemi, Marzyeh
author_sort Balagopalan, Aparna
collection PubMed
description As governments and industry turn to increased use of automated decision systems, it becomes essential to consider how closely such systems can reproduce human judgment. We identify a core potential failure, finding that annotators label objects differently depending on whether they are being asked a factual question or a normative question. This challenges a natural assumption maintained in many standard machine-learning (ML) data acquisition procedures: that there is no difference between predicting the factual classification of an object and an exercise of judgment about whether an object violates a rule premised on those facts. We find that using factual labels to train models intended for normative judgments introduces a notable measurement error. We show that models trained using factual labels yield significantly different judgments than those trained using normative labels and that the impact of this effect on model performance can exceed that of other factors (e.g., dataset size) that routinely attract attention from ML researchers and practitioners.
format Online
Article
Text
id pubmed-10171805
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Association for the Advancement of Science
record_format MEDLINE/PubMed
spelling pubmed-101718052023-05-11 Judging facts, judging norms: Training machine learning models to judge humans requires a modified approach to labeling data Balagopalan, Aparna Madras, David Yang, David H. Hadfield-Menell, Dylan Hadfield, Gillian K. Ghassemi, Marzyeh Sci Adv Social and Interdisciplinary Sciences As governments and industry turn to increased use of automated decision systems, it becomes essential to consider how closely such systems can reproduce human judgment. We identify a core potential failure, finding that annotators label objects differently depending on whether they are being asked a factual question or a normative question. This challenges a natural assumption maintained in many standard machine-learning (ML) data acquisition procedures: that there is no difference between predicting the factual classification of an object and an exercise of judgment about whether an object violates a rule premised on those facts. We find that using factual labels to train models intended for normative judgments introduces a notable measurement error. We show that models trained using factual labels yield significantly different judgments than those trained using normative labels and that the impact of this effect on model performance can exceed that of other factors (e.g., dataset size) that routinely attract attention from ML researchers and practitioners. American Association for the Advancement of Science 2023-05-10 /pmc/articles/PMC10171805/ /pubmed/37163590 http://dx.doi.org/10.1126/sciadv.abq0701 Text en Copyright © 2023 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution License 4.0 (CC BY). https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution license (https://creativecommons.org/licenses/by/4.0/) , which permits which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Social and Interdisciplinary Sciences
Balagopalan, Aparna
Madras, David
Yang, David H.
Hadfield-Menell, Dylan
Hadfield, Gillian K.
Ghassemi, Marzyeh
Judging facts, judging norms: Training machine learning models to judge humans requires a modified approach to labeling data
title Judging facts, judging norms: Training machine learning models to judge humans requires a modified approach to labeling data
title_full Judging facts, judging norms: Training machine learning models to judge humans requires a modified approach to labeling data
title_fullStr Judging facts, judging norms: Training machine learning models to judge humans requires a modified approach to labeling data
title_full_unstemmed Judging facts, judging norms: Training machine learning models to judge humans requires a modified approach to labeling data
title_short Judging facts, judging norms: Training machine learning models to judge humans requires a modified approach to labeling data
title_sort judging facts, judging norms: training machine learning models to judge humans requires a modified approach to labeling data
topic Social and Interdisciplinary Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10171805/
https://www.ncbi.nlm.nih.gov/pubmed/37163590
http://dx.doi.org/10.1126/sciadv.abq0701
work_keys_str_mv AT balagopalanaparna judgingfactsjudgingnormstrainingmachinelearningmodelstojudgehumansrequiresamodifiedapproachtolabelingdata
AT madrasdavid judgingfactsjudgingnormstrainingmachinelearningmodelstojudgehumansrequiresamodifiedapproachtolabelingdata
AT yangdavidh judgingfactsjudgingnormstrainingmachinelearningmodelstojudgehumansrequiresamodifiedapproachtolabelingdata
AT hadfieldmenelldylan judgingfactsjudgingnormstrainingmachinelearningmodelstojudgehumansrequiresamodifiedapproachtolabelingdata
AT hadfieldgilliank judgingfactsjudgingnormstrainingmachinelearningmodelstojudgehumansrequiresamodifiedapproachtolabelingdata
AT ghassemimarzyeh judgingfactsjudgingnormstrainingmachinelearningmodelstojudgehumansrequiresamodifiedapproachtolabelingdata