Cargando…

The value of standards for health datasets in artificial intelligence-based applications

Artificial intelligence as a medical device is increasingly being applied to healthcare for diagnosis, risk stratification and resource allocation. However, a growing body of evidence has highlighted the risk of algorithmic bias, which may perpetuate existing health inequity. This problem arises in...

Descripción completa

Detalles Bibliográficos
Autores principales: Arora, Anmol, Alderman, Joseph E., Palmer, Joanne, Ganapathi, Shaswath, Laws, Elinor, McCradden, Melissa D., Oakden-Rayner, Lauren, Pfohl, Stephen R., Ghassemi, Marzyeh, McKay, Francis, Treanor, Darren, Rostamzadeh, Negar, Mateen, Bilal, Gath, Jacqui, Adebajo, Adewole O., Kuku, Stephanie, Matin, Rubeta, Heller, Katherine, Sapey, Elizabeth, Sebire, Neil J., Cole-Lewis, Heather, Calvert, Melanie, Denniston, Alastair, Liu, Xiaoxuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10667100/
https://www.ncbi.nlm.nih.gov/pubmed/37884627
http://dx.doi.org/10.1038/s41591-023-02608-w
_version_ 1785139177091760128
author Arora, Anmol
Alderman, Joseph E.
Palmer, Joanne
Ganapathi, Shaswath
Laws, Elinor
McCradden, Melissa D.
Oakden-Rayner, Lauren
Pfohl, Stephen R.
Ghassemi, Marzyeh
McKay, Francis
Treanor, Darren
Rostamzadeh, Negar
Mateen, Bilal
Gath, Jacqui
Adebajo, Adewole O.
Kuku, Stephanie
Matin, Rubeta
Heller, Katherine
Sapey, Elizabeth
Sebire, Neil J.
Cole-Lewis, Heather
Calvert, Melanie
Denniston, Alastair
Liu, Xiaoxuan
author_facet Arora, Anmol
Alderman, Joseph E.
Palmer, Joanne
Ganapathi, Shaswath
Laws, Elinor
McCradden, Melissa D.
Oakden-Rayner, Lauren
Pfohl, Stephen R.
Ghassemi, Marzyeh
McKay, Francis
Treanor, Darren
Rostamzadeh, Negar
Mateen, Bilal
Gath, Jacqui
Adebajo, Adewole O.
Kuku, Stephanie
Matin, Rubeta
Heller, Katherine
Sapey, Elizabeth
Sebire, Neil J.
Cole-Lewis, Heather
Calvert, Melanie
Denniston, Alastair
Liu, Xiaoxuan
author_sort Arora, Anmol
collection PubMed
description Artificial intelligence as a medical device is increasingly being applied to healthcare for diagnosis, risk stratification and resource allocation. However, a growing body of evidence has highlighted the risk of algorithmic bias, which may perpetuate existing health inequity. This problem arises in part because of systemic inequalities in dataset curation, unequal opportunity to participate in research and inequalities of access. This study aims to explore existing standards, frameworks and best practices for ensuring adequate data diversity in health datasets. Exploring the body of existing literature and expert views is an important step towards the development of consensus-based guidelines. The study comprises two parts: a systematic review of existing standards, frameworks and best practices for healthcare datasets; and a survey and thematic analysis of stakeholder views of bias, health equity and best practices for artificial intelligence as a medical device. We found that the need for dataset diversity was well described in literature, and experts generally favored the development of a robust set of guidelines, but there were mixed views about how these could be implemented practically. The outputs of this study will be used to inform the development of standards for transparency of data diversity in health datasets (the STANDING Together initiative).
format Online
Article
Text
id pubmed-10667100
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group US
record_format MEDLINE/PubMed
spelling pubmed-106671002023-10-26 The value of standards for health datasets in artificial intelligence-based applications Arora, Anmol Alderman, Joseph E. Palmer, Joanne Ganapathi, Shaswath Laws, Elinor McCradden, Melissa D. Oakden-Rayner, Lauren Pfohl, Stephen R. Ghassemi, Marzyeh McKay, Francis Treanor, Darren Rostamzadeh, Negar Mateen, Bilal Gath, Jacqui Adebajo, Adewole O. Kuku, Stephanie Matin, Rubeta Heller, Katherine Sapey, Elizabeth Sebire, Neil J. Cole-Lewis, Heather Calvert, Melanie Denniston, Alastair Liu, Xiaoxuan Nat Med Analysis Artificial intelligence as a medical device is increasingly being applied to healthcare for diagnosis, risk stratification and resource allocation. However, a growing body of evidence has highlighted the risk of algorithmic bias, which may perpetuate existing health inequity. This problem arises in part because of systemic inequalities in dataset curation, unequal opportunity to participate in research and inequalities of access. This study aims to explore existing standards, frameworks and best practices for ensuring adequate data diversity in health datasets. Exploring the body of existing literature and expert views is an important step towards the development of consensus-based guidelines. The study comprises two parts: a systematic review of existing standards, frameworks and best practices for healthcare datasets; and a survey and thematic analysis of stakeholder views of bias, health equity and best practices for artificial intelligence as a medical device. We found that the need for dataset diversity was well described in literature, and experts generally favored the development of a robust set of guidelines, but there were mixed views about how these could be implemented practically. The outputs of this study will be used to inform the development of standards for transparency of data diversity in health datasets (the STANDING Together initiative). Nature Publishing Group US 2023-10-26 2023 /pmc/articles/PMC10667100/ /pubmed/37884627 http://dx.doi.org/10.1038/s41591-023-02608-w Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Analysis
Arora, Anmol
Alderman, Joseph E.
Palmer, Joanne
Ganapathi, Shaswath
Laws, Elinor
McCradden, Melissa D.
Oakden-Rayner, Lauren
Pfohl, Stephen R.
Ghassemi, Marzyeh
McKay, Francis
Treanor, Darren
Rostamzadeh, Negar
Mateen, Bilal
Gath, Jacqui
Adebajo, Adewole O.
Kuku, Stephanie
Matin, Rubeta
Heller, Katherine
Sapey, Elizabeth
Sebire, Neil J.
Cole-Lewis, Heather
Calvert, Melanie
Denniston, Alastair
Liu, Xiaoxuan
The value of standards for health datasets in artificial intelligence-based applications
title The value of standards for health datasets in artificial intelligence-based applications
title_full The value of standards for health datasets in artificial intelligence-based applications
title_fullStr The value of standards for health datasets in artificial intelligence-based applications
title_full_unstemmed The value of standards for health datasets in artificial intelligence-based applications
title_short The value of standards for health datasets in artificial intelligence-based applications
title_sort value of standards for health datasets in artificial intelligence-based applications
topic Analysis
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10667100/
https://www.ncbi.nlm.nih.gov/pubmed/37884627
http://dx.doi.org/10.1038/s41591-023-02608-w
work_keys_str_mv AT aroraanmol thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT aldermanjosephe thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT palmerjoanne thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT ganapathishaswath thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT lawselinor thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT mccraddenmelissad thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT oakdenraynerlauren thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT pfohlstephenr thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT ghassemimarzyeh thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT mckayfrancis thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT treanordarren thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT rostamzadehnegar thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT mateenbilal thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT gathjacqui thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT adebajoadewoleo thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT kukustephanie thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT matinrubeta thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT hellerkatherine thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT sapeyelizabeth thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT sebireneilj thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT colelewisheather thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT calvertmelanie thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT dennistonalastair thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT liuxiaoxuan thevalueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT aroraanmol valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT aldermanjosephe valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT palmerjoanne valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT ganapathishaswath valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT lawselinor valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT mccraddenmelissad valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT oakdenraynerlauren valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT pfohlstephenr valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT ghassemimarzyeh valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT mckayfrancis valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT treanordarren valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT rostamzadehnegar valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT mateenbilal valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT gathjacqui valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT adebajoadewoleo valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT kukustephanie valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT matinrubeta valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT hellerkatherine valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT sapeyelizabeth valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT sebireneilj valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT colelewisheather valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT calvertmelanie valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT dennistonalastair valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications
AT liuxiaoxuan valueofstandardsforhealthdatasetsinartificialintelligencebasedapplications