Cargando…
Five sources of bias in natural language processing
Recently, there has been an increased interest in demographically grounded bias in natural language processing (NLP) applications. Much of the recent work has focused on describing bias and providing an overview of bias in a larger context. Here, we provide a simple, actionable summary of this recen...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9285808/ https://www.ncbi.nlm.nih.gov/pubmed/35864931 http://dx.doi.org/10.1111/lnc3.12432 |
_version_ | 1784747866571407360 |
---|---|
author | Hovy, Dirk Prabhumoye, Shrimai |
author_facet | Hovy, Dirk Prabhumoye, Shrimai |
author_sort | Hovy, Dirk |
collection | PubMed |
description | Recently, there has been an increased interest in demographically grounded bias in natural language processing (NLP) applications. Much of the recent work has focused on describing bias and providing an overview of bias in a larger context. Here, we provide a simple, actionable summary of this recent work. We outline five sources where bias can occur in NLP systems: (1) the data, (2) the annotation process, (3) the input representations, (4) the models, and finally (5) the research design (or how we conceptualize our research). We explore each of the bias sources in detail in this article, including examples and links to related work, as well as potential counter‐measures. |
format | Online Article Text |
id | pubmed-9285808 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-92858082022-07-19 Five sources of bias in natural language processing Hovy, Dirk Prabhumoye, Shrimai Lang Linguist Compass Computational & Mathematical Recently, there has been an increased interest in demographically grounded bias in natural language processing (NLP) applications. Much of the recent work has focused on describing bias and providing an overview of bias in a larger context. Here, we provide a simple, actionable summary of this recent work. We outline five sources where bias can occur in NLP systems: (1) the data, (2) the annotation process, (3) the input representations, (4) the models, and finally (5) the research design (or how we conceptualize our research). We explore each of the bias sources in detail in this article, including examples and links to related work, as well as potential counter‐measures. John Wiley and Sons Inc. 2021-08-20 2021-08 /pmc/articles/PMC9285808/ /pubmed/35864931 http://dx.doi.org/10.1111/lnc3.12432 Text en © 2021 The Authors. Language and Linguistics Compass published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes. |
spellingShingle | Computational & Mathematical Hovy, Dirk Prabhumoye, Shrimai Five sources of bias in natural language processing |
title | Five sources of bias in natural language processing |
title_full | Five sources of bias in natural language processing |
title_fullStr | Five sources of bias in natural language processing |
title_full_unstemmed | Five sources of bias in natural language processing |
title_short | Five sources of bias in natural language processing |
title_sort | five sources of bias in natural language processing |
topic | Computational & Mathematical |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9285808/ https://www.ncbi.nlm.nih.gov/pubmed/35864931 http://dx.doi.org/10.1111/lnc3.12432 |
work_keys_str_mv | AT hovydirk fivesourcesofbiasinnaturallanguageprocessing AT prabhumoyeshrimai fivesourcesofbiasinnaturallanguageprocessing |