Cargando…

Synthetic Medical Images for Robust, Privacy-Preserving Training of Artificial Intelligence: Application to Retinopathy of Prematurity Diagnosis

PURPOSE: Developing robust artificial intelligence (AI) models for medical image analysis requires large quantities of diverse, well-chosen data that can prove challenging to collect because of privacy concerns, disease rarity, or diagnostic label quality. Collecting image-based datasets for retinop...

Descripción completa

Detalles Bibliográficos
Autores principales:	Coyner, Aaron S., Chen, Jimmy S., Chang, Ken, Singh, Praveer, Ostmo, Susan, Chan, R. V. Paul, Chiang, Michael F., Kalpathy-Cramer, Jayashree, Campbell, J. Peter
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Elsevier 2022
Materias:	Artificial Intelligence and Big Data
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9560638/ https://www.ncbi.nlm.nih.gov/pubmed/36249693 http://dx.doi.org/10.1016/j.xops.2022.100126

_version_	1784807796019036160
author	Coyner, Aaron S. Chen, Jimmy S. Chang, Ken Singh, Praveer Ostmo, Susan Chan, R. V. Paul Chiang, Michael F. Kalpathy-Cramer, Jayashree Campbell, J. Peter
author_facet	Coyner, Aaron S. Chen, Jimmy S. Chang, Ken Singh, Praveer Ostmo, Susan Chan, R. V. Paul Chiang, Michael F. Kalpathy-Cramer, Jayashree Campbell, J. Peter
author_sort	Coyner, Aaron S.
collection	PubMed
description	PURPOSE: Developing robust artificial intelligence (AI) models for medical image analysis requires large quantities of diverse, well-chosen data that can prove challenging to collect because of privacy concerns, disease rarity, or diagnostic label quality. Collecting image-based datasets for retinopathy of prematurity (ROP), a potentially blinding disease, suffers from these challenges. Progressively growing generative adversarial networks (PGANs) may help, because they can synthesize highly realistic images that may increase both the size and diversity of medical datasets. DESIGN: Diagnostic validation study of convolutional neural networks (CNNs) for plus disease detection, a component of severe ROP, using synthetic data. PARTICIPANTS: Five thousand eight hundred forty-two retinal fundus images (RFIs) collected from 963 preterm infants. METHODS: Retinal vessel maps (RVMs) were segmented from RFIs. PGANs were trained to synthesize RVMs with normal, pre-plus, or plus disease vasculature. Convolutional neural networks were trained, using real or synthetic RVMs, to detect plus disease from 2 real RVM test datasets. MAIN OUTCOME MEASURES: Features of real and synthetic RVMs were evaluated using uniform manifold approximation and projection (UMAP). Similarities were evaluated at the dataset and feature level using Fréchet inception distance and Euclidean distance, respectively. CNN performance was assessed via area under the receiver operating characteristic curve (AUC); AUCs were compared via bootstrapping and Delong’s test for correlated receiver operating characteristic curves. Confusion matrices were compared using McNemar’s chi-square test and Cohen’s κ value. RESULTS: The CNN trained on synthetic RVMs showed a significantly higher AUC (0.971; P = 0.006 and P = 0.004) and classified plus disease more similarly to a set of 8 international experts (κ = 0.922) than the CNN trained on real RVMs (AUC = 0.934; κ = 0.701). Real and synthetic RVMs overlapped, by plus disease diagnosis, on the UMAP manifold, showing that synthetic images spanned the disease severity spectrum. Fréchet inception distance and Euclidean distances suggested that real and synthetic RVMs were more dissimilar to one another than real RVMs were to one another, further suggesting that synthetic RVMs were distinct from the training data with respect to privacy considerations. CONCLUSIONS: Synthetic datasets may be useful for training robust medical AI models. Furthermore, PGANs may be able to synthesize realistic data for use without protected health information concerns.
format	Online Article Text
id	pubmed-9560638
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Elsevier
record_format	MEDLINE/PubMed
spelling	pubmed-95606382022-10-14 Synthetic Medical Images for Robust, Privacy-Preserving Training of Artificial Intelligence: Application to Retinopathy of Prematurity Diagnosis Coyner, Aaron S. Chen, Jimmy S. Chang, Ken Singh, Praveer Ostmo, Susan Chan, R. V. Paul Chiang, Michael F. Kalpathy-Cramer, Jayashree Campbell, J. Peter Ophthalmol Sci Artificial Intelligence and Big Data PURPOSE: Developing robust artificial intelligence (AI) models for medical image analysis requires large quantities of diverse, well-chosen data that can prove challenging to collect because of privacy concerns, disease rarity, or diagnostic label quality. Collecting image-based datasets for retinopathy of prematurity (ROP), a potentially blinding disease, suffers from these challenges. Progressively growing generative adversarial networks (PGANs) may help, because they can synthesize highly realistic images that may increase both the size and diversity of medical datasets. DESIGN: Diagnostic validation study of convolutional neural networks (CNNs) for plus disease detection, a component of severe ROP, using synthetic data. PARTICIPANTS: Five thousand eight hundred forty-two retinal fundus images (RFIs) collected from 963 preterm infants. METHODS: Retinal vessel maps (RVMs) were segmented from RFIs. PGANs were trained to synthesize RVMs with normal, pre-plus, or plus disease vasculature. Convolutional neural networks were trained, using real or synthetic RVMs, to detect plus disease from 2 real RVM test datasets. MAIN OUTCOME MEASURES: Features of real and synthetic RVMs were evaluated using uniform manifold approximation and projection (UMAP). Similarities were evaluated at the dataset and feature level using Fréchet inception distance and Euclidean distance, respectively. CNN performance was assessed via area under the receiver operating characteristic curve (AUC); AUCs were compared via bootstrapping and Delong’s test for correlated receiver operating characteristic curves. Confusion matrices were compared using McNemar’s chi-square test and Cohen’s κ value. RESULTS: The CNN trained on synthetic RVMs showed a significantly higher AUC (0.971; P = 0.006 and P = 0.004) and classified plus disease more similarly to a set of 8 international experts (κ = 0.922) than the CNN trained on real RVMs (AUC = 0.934; κ = 0.701). Real and synthetic RVMs overlapped, by plus disease diagnosis, on the UMAP manifold, showing that synthetic images spanned the disease severity spectrum. Fréchet inception distance and Euclidean distances suggested that real and synthetic RVMs were more dissimilar to one another than real RVMs were to one another, further suggesting that synthetic RVMs were distinct from the training data with respect to privacy considerations. CONCLUSIONS: Synthetic datasets may be useful for training robust medical AI models. Furthermore, PGANs may be able to synthesize realistic data for use without protected health information concerns. Elsevier 2022-02-11 /pmc/articles/PMC9560638/ /pubmed/36249693 http://dx.doi.org/10.1016/j.xops.2022.100126 Text en © 2022 by the American Academy of Ophthalmology. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle	Artificial Intelligence and Big Data Coyner, Aaron S. Chen, Jimmy S. Chang, Ken Singh, Praveer Ostmo, Susan Chan, R. V. Paul Chiang, Michael F. Kalpathy-Cramer, Jayashree Campbell, J. Peter Synthetic Medical Images for Robust, Privacy-Preserving Training of Artificial Intelligence: Application to Retinopathy of Prematurity Diagnosis
title	Synthetic Medical Images for Robust, Privacy-Preserving Training of Artificial Intelligence: Application to Retinopathy of Prematurity Diagnosis
title_full	Synthetic Medical Images for Robust, Privacy-Preserving Training of Artificial Intelligence: Application to Retinopathy of Prematurity Diagnosis
title_fullStr	Synthetic Medical Images for Robust, Privacy-Preserving Training of Artificial Intelligence: Application to Retinopathy of Prematurity Diagnosis
title_full_unstemmed	Synthetic Medical Images for Robust, Privacy-Preserving Training of Artificial Intelligence: Application to Retinopathy of Prematurity Diagnosis
title_short	Synthetic Medical Images for Robust, Privacy-Preserving Training of Artificial Intelligence: Application to Retinopathy of Prematurity Diagnosis
title_sort	synthetic medical images for robust, privacy-preserving training of artificial intelligence: application to retinopathy of prematurity diagnosis
topic	Artificial Intelligence and Big Data
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9560638/ https://www.ncbi.nlm.nih.gov/pubmed/36249693 http://dx.doi.org/10.1016/j.xops.2022.100126
work_keys_str_mv	AT coyneraarons syntheticmedicalimagesforrobustprivacypreservingtrainingofartificialintelligenceapplicationtoretinopathyofprematuritydiagnosis AT chenjimmys syntheticmedicalimagesforrobustprivacypreservingtrainingofartificialintelligenceapplicationtoretinopathyofprematuritydiagnosis AT changken syntheticmedicalimagesforrobustprivacypreservingtrainingofartificialintelligenceapplicationtoretinopathyofprematuritydiagnosis AT singhpraveer syntheticmedicalimagesforrobustprivacypreservingtrainingofartificialintelligenceapplicationtoretinopathyofprematuritydiagnosis AT ostmosusan syntheticmedicalimagesforrobustprivacypreservingtrainingofartificialintelligenceapplicationtoretinopathyofprematuritydiagnosis AT chanrvpaul syntheticmedicalimagesforrobustprivacypreservingtrainingofartificialintelligenceapplicationtoretinopathyofprematuritydiagnosis AT chiangmichaelf syntheticmedicalimagesforrobustprivacypreservingtrainingofartificialintelligenceapplicationtoretinopathyofprematuritydiagnosis AT kalpathycramerjayashree syntheticmedicalimagesforrobustprivacypreservingtrainingofartificialintelligenceapplicationtoretinopathyofprematuritydiagnosis AT campbelljpeter syntheticmedicalimagesforrobustprivacypreservingtrainingofartificialintelligenceapplicationtoretinopathyofprematuritydiagnosis AT syntheticmedicalimagesforrobustprivacypreservingtrainingofartificialintelligenceapplicationtoretinopathyofprematuritydiagnosis

Synthetic Medical Images for Robust, Privacy-Preserving Training of Artificial Intelligence: Application to Retinopathy of Prematurity Diagnosis

Ejemplares similares