Cargando…

Racial underrepresentation in dermatological datasets leads to biased machine learning models and inequitable healthcare

OBJECTIVE: Clinical applications of machine learning are promising as a tool to improve patient outcomes through assisting diagnoses, treatment, and analyzing risk factors for screening. Possible clinical applications are especially prominent in dermatology as many diseases and conditions present vi...

Descripción completa

Detalles Bibliográficos
Autores principales: Kleinberg, Giona, Diaz, Michael J, Batchu, Sai, Lucke-Wold, Brandon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9815490/
https://www.ncbi.nlm.nih.gov/pubmed/36619609
_version_ 1784864328995831808
author Kleinberg, Giona
Diaz, Michael J
Batchu, Sai
Lucke-Wold, Brandon
author_facet Kleinberg, Giona
Diaz, Michael J
Batchu, Sai
Lucke-Wold, Brandon
author_sort Kleinberg, Giona
collection PubMed
description OBJECTIVE: Clinical applications of machine learning are promising as a tool to improve patient outcomes through assisting diagnoses, treatment, and analyzing risk factors for screening. Possible clinical applications are especially prominent in dermatology as many diseases and conditions present visually. This allows a machine learning model to analyze and diagnose conditions using patient images and data from electronic health records (EHRs) after training on clinical datasets but could also introduce bias. Despite promising applications, artificial intelligence has the capacity to exacerbate existing demographic disparities in healthcare if models are trained on biased datasets. METHODS: Through systematic literature review of available literature, we highlight the extent of bias present in clinical datasets as well as the implications it could have on healthcare if not addressed. RESULTS: We find the implications are worsened in dermatological models. Despite the severity and complexity of melanoma and other dermatological diseases as well as differing disease presentations based on skin-color, many imaging datasets underrepresent certain demographic groups causing machine learning models to train on images of primarily fair-skinned individuals leaving minorities behind. CONCLUSION: In order to address this disparity, research first needs to be done investigating the extent of the bias present and the implications it may have on equitable healthcare.
format Online
Article
Text
id pubmed-9815490
institution National Center for Biotechnology Information
language English
publishDate 2022
record_format MEDLINE/PubMed
spelling pubmed-98154902023-01-05 Racial underrepresentation in dermatological datasets leads to biased machine learning models and inequitable healthcare Kleinberg, Giona Diaz, Michael J Batchu, Sai Lucke-Wold, Brandon J Biomed Res (Middlet) Article OBJECTIVE: Clinical applications of machine learning are promising as a tool to improve patient outcomes through assisting diagnoses, treatment, and analyzing risk factors for screening. Possible clinical applications are especially prominent in dermatology as many diseases and conditions present visually. This allows a machine learning model to analyze and diagnose conditions using patient images and data from electronic health records (EHRs) after training on clinical datasets but could also introduce bias. Despite promising applications, artificial intelligence has the capacity to exacerbate existing demographic disparities in healthcare if models are trained on biased datasets. METHODS: Through systematic literature review of available literature, we highlight the extent of bias present in clinical datasets as well as the implications it could have on healthcare if not addressed. RESULTS: We find the implications are worsened in dermatological models. Despite the severity and complexity of melanoma and other dermatological diseases as well as differing disease presentations based on skin-color, many imaging datasets underrepresent certain demographic groups causing machine learning models to train on images of primarily fair-skinned individuals leaving minorities behind. CONCLUSION: In order to address this disparity, research first needs to be done investigating the extent of the bias present and the implications it may have on equitable healthcare. 2022 /pmc/articles/PMC9815490/ /pubmed/36619609 Text en https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/)
spellingShingle Article
Kleinberg, Giona
Diaz, Michael J
Batchu, Sai
Lucke-Wold, Brandon
Racial underrepresentation in dermatological datasets leads to biased machine learning models and inequitable healthcare
title Racial underrepresentation in dermatological datasets leads to biased machine learning models and inequitable healthcare
title_full Racial underrepresentation in dermatological datasets leads to biased machine learning models and inequitable healthcare
title_fullStr Racial underrepresentation in dermatological datasets leads to biased machine learning models and inequitable healthcare
title_full_unstemmed Racial underrepresentation in dermatological datasets leads to biased machine learning models and inequitable healthcare
title_short Racial underrepresentation in dermatological datasets leads to biased machine learning models and inequitable healthcare
title_sort racial underrepresentation in dermatological datasets leads to biased machine learning models and inequitable healthcare
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9815490/
https://www.ncbi.nlm.nih.gov/pubmed/36619609
work_keys_str_mv AT kleinberggiona racialunderrepresentationindermatologicaldatasetsleadstobiasedmachinelearningmodelsandinequitablehealthcare
AT diazmichaelj racialunderrepresentationindermatologicaldatasetsleadstobiasedmachinelearningmodelsandinequitablehealthcare
AT batchusai racialunderrepresentationindermatologicaldatasetsleadstobiasedmachinelearningmodelsandinequitablehealthcare
AT luckewoldbrandon racialunderrepresentationindermatologicaldatasetsleadstobiasedmachinelearningmodelsandinequitablehealthcare