Cargando…

Linking Human And Machine Behavior: A New Approach to Evaluate Training Data Quality for Beneficial Machine Learning

Machine behavior that is based on learning algorithms can be significantly influenced by the exposure to data of different qualities. Up to now, those qualities are solely measured in technical terms, but not in ethical ones, despite the significant role of training and annotation data in supervised...

Descripción completa

Detalles Bibliográficos
Autor principal: Hagendorff, Thilo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Netherlands 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8475847/
https://www.ncbi.nlm.nih.gov/pubmed/34602749
http://dx.doi.org/10.1007/s11023-021-09573-8
_version_ 1784575484869214208
author Hagendorff, Thilo
author_facet Hagendorff, Thilo
author_sort Hagendorff, Thilo
collection PubMed
description Machine behavior that is based on learning algorithms can be significantly influenced by the exposure to data of different qualities. Up to now, those qualities are solely measured in technical terms, but not in ethical ones, despite the significant role of training and annotation data in supervised machine learning. This is the first study to fill this gap by describing new dimensions of data quality for supervised machine learning applications. Based on the rationale that different social and psychological backgrounds of individuals correlate in practice with different modes of human–computer-interaction, the paper describes from an ethical perspective how varying qualities of behavioral data that individuals leave behind while using digital technologies have socially relevant ramification for the development of machine learning applications. The specific objective of this study is to describe how training data can be selected according to ethical assessments of the behavior it originates from, establishing an innovative filter regime to transition from the big data rationale n = all to a more selective way of processing data for training sets in machine learning. The overarching aim of this research is to promote methods for achieving beneficial machine learning applications that could be widely useful for industry as well as academia.
format Online
Article
Text
id pubmed-8475847
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer Netherlands
record_format MEDLINE/PubMed
spelling pubmed-84758472021-09-28 Linking Human And Machine Behavior: A New Approach to Evaluate Training Data Quality for Beneficial Machine Learning Hagendorff, Thilo Minds Mach (Dordr) General Article Machine behavior that is based on learning algorithms can be significantly influenced by the exposure to data of different qualities. Up to now, those qualities are solely measured in technical terms, but not in ethical ones, despite the significant role of training and annotation data in supervised machine learning. This is the first study to fill this gap by describing new dimensions of data quality for supervised machine learning applications. Based on the rationale that different social and psychological backgrounds of individuals correlate in practice with different modes of human–computer-interaction, the paper describes from an ethical perspective how varying qualities of behavioral data that individuals leave behind while using digital technologies have socially relevant ramification for the development of machine learning applications. The specific objective of this study is to describe how training data can be selected according to ethical assessments of the behavior it originates from, establishing an innovative filter regime to transition from the big data rationale n = all to a more selective way of processing data for training sets in machine learning. The overarching aim of this research is to promote methods for achieving beneficial machine learning applications that could be widely useful for industry as well as academia. Springer Netherlands 2021-09-26 2021 /pmc/articles/PMC8475847/ /pubmed/34602749 http://dx.doi.org/10.1007/s11023-021-09573-8 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle General Article
Hagendorff, Thilo
Linking Human And Machine Behavior: A New Approach to Evaluate Training Data Quality for Beneficial Machine Learning
title Linking Human And Machine Behavior: A New Approach to Evaluate Training Data Quality for Beneficial Machine Learning
title_full Linking Human And Machine Behavior: A New Approach to Evaluate Training Data Quality for Beneficial Machine Learning
title_fullStr Linking Human And Machine Behavior: A New Approach to Evaluate Training Data Quality for Beneficial Machine Learning
title_full_unstemmed Linking Human And Machine Behavior: A New Approach to Evaluate Training Data Quality for Beneficial Machine Learning
title_short Linking Human And Machine Behavior: A New Approach to Evaluate Training Data Quality for Beneficial Machine Learning
title_sort linking human and machine behavior: a new approach to evaluate training data quality for beneficial machine learning
topic General Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8475847/
https://www.ncbi.nlm.nih.gov/pubmed/34602749
http://dx.doi.org/10.1007/s11023-021-09573-8
work_keys_str_mv AT hagendorffthilo linkinghumanandmachinebehavioranewapproachtoevaluatetrainingdataqualityforbeneficialmachinelearning