Cargando…
Racial underrepresentation in dermatological datasets leads to biased machine learning models and inequitable healthcare
OBJECTIVE: Clinical applications of machine learning are promising as a tool to improve patient outcomes through assisting diagnoses, treatment, and analyzing risk factors for screening. Possible clinical applications are especially prominent in dermatology as many diseases and conditions present vi...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9815490/ https://www.ncbi.nlm.nih.gov/pubmed/36619609 |
_version_ | 1784864328995831808 |
---|---|
author | Kleinberg, Giona Diaz, Michael J Batchu, Sai Lucke-Wold, Brandon |
author_facet | Kleinberg, Giona Diaz, Michael J Batchu, Sai Lucke-Wold, Brandon |
author_sort | Kleinberg, Giona |
collection | PubMed |
description | OBJECTIVE: Clinical applications of machine learning are promising as a tool to improve patient outcomes through assisting diagnoses, treatment, and analyzing risk factors for screening. Possible clinical applications are especially prominent in dermatology as many diseases and conditions present visually. This allows a machine learning model to analyze and diagnose conditions using patient images and data from electronic health records (EHRs) after training on clinical datasets but could also introduce bias. Despite promising applications, artificial intelligence has the capacity to exacerbate existing demographic disparities in healthcare if models are trained on biased datasets. METHODS: Through systematic literature review of available literature, we highlight the extent of bias present in clinical datasets as well as the implications it could have on healthcare if not addressed. RESULTS: We find the implications are worsened in dermatological models. Despite the severity and complexity of melanoma and other dermatological diseases as well as differing disease presentations based on skin-color, many imaging datasets underrepresent certain demographic groups causing machine learning models to train on images of primarily fair-skinned individuals leaving minorities behind. CONCLUSION: In order to address this disparity, research first needs to be done investigating the extent of the bias present and the implications it may have on equitable healthcare. |
format | Online Article Text |
id | pubmed-9815490 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
record_format | MEDLINE/PubMed |
spelling | pubmed-98154902023-01-05 Racial underrepresentation in dermatological datasets leads to biased machine learning models and inequitable healthcare Kleinberg, Giona Diaz, Michael J Batchu, Sai Lucke-Wold, Brandon J Biomed Res (Middlet) Article OBJECTIVE: Clinical applications of machine learning are promising as a tool to improve patient outcomes through assisting diagnoses, treatment, and analyzing risk factors for screening. Possible clinical applications are especially prominent in dermatology as many diseases and conditions present visually. This allows a machine learning model to analyze and diagnose conditions using patient images and data from electronic health records (EHRs) after training on clinical datasets but could also introduce bias. Despite promising applications, artificial intelligence has the capacity to exacerbate existing demographic disparities in healthcare if models are trained on biased datasets. METHODS: Through systematic literature review of available literature, we highlight the extent of bias present in clinical datasets as well as the implications it could have on healthcare if not addressed. RESULTS: We find the implications are worsened in dermatological models. Despite the severity and complexity of melanoma and other dermatological diseases as well as differing disease presentations based on skin-color, many imaging datasets underrepresent certain demographic groups causing machine learning models to train on images of primarily fair-skinned individuals leaving minorities behind. CONCLUSION: In order to address this disparity, research first needs to be done investigating the extent of the bias present and the implications it may have on equitable healthcare. 2022 /pmc/articles/PMC9815490/ /pubmed/36619609 Text en https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) |
spellingShingle | Article Kleinberg, Giona Diaz, Michael J Batchu, Sai Lucke-Wold, Brandon Racial underrepresentation in dermatological datasets leads to biased machine learning models and inequitable healthcare |
title | Racial underrepresentation in dermatological datasets leads to biased machine learning models and inequitable healthcare |
title_full | Racial underrepresentation in dermatological datasets leads to biased machine learning models and inequitable healthcare |
title_fullStr | Racial underrepresentation in dermatological datasets leads to biased machine learning models and inequitable healthcare |
title_full_unstemmed | Racial underrepresentation in dermatological datasets leads to biased machine learning models and inequitable healthcare |
title_short | Racial underrepresentation in dermatological datasets leads to biased machine learning models and inequitable healthcare |
title_sort | racial underrepresentation in dermatological datasets leads to biased machine learning models and inequitable healthcare |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9815490/ https://www.ncbi.nlm.nih.gov/pubmed/36619609 |
work_keys_str_mv | AT kleinberggiona racialunderrepresentationindermatologicaldatasetsleadstobiasedmachinelearningmodelsandinequitablehealthcare AT diazmichaelj racialunderrepresentationindermatologicaldatasetsleadstobiasedmachinelearningmodelsandinequitablehealthcare AT batchusai racialunderrepresentationindermatologicaldatasetsleadstobiasedmachinelearningmodelsandinequitablehealthcare AT luckewoldbrandon racialunderrepresentationindermatologicaldatasetsleadstobiasedmachinelearningmodelsandinequitablehealthcare |