Cargando…

Domain-guided data augmentation for deep learning on medical imaging

While domain-specific data augmentation can be useful in training neural networks for medical imaging tasks, such techniques have not been widely used to date. Our objective was to test whether domain-specific data augmentation is useful for medical imaging using a well-benchmarked task: view classi...

Descripción completa

Detalles Bibliográficos
Autores principales: Athalye, Chinmayee, Arnaout, Rima
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10035842/
https://www.ncbi.nlm.nih.gov/pubmed/36952442
http://dx.doi.org/10.1371/journal.pone.0282532
_version_ 1784911503947726848
author Athalye, Chinmayee
Arnaout, Rima
author_facet Athalye, Chinmayee
Arnaout, Rima
author_sort Athalye, Chinmayee
collection PubMed
description While domain-specific data augmentation can be useful in training neural networks for medical imaging tasks, such techniques have not been widely used to date. Our objective was to test whether domain-specific data augmentation is useful for medical imaging using a well-benchmarked task: view classification on fetal ultrasound FETAL-125 and OB-125 datasets. We found that using a context-preserving cut-paste strategy, we could create valid training data as measured by performance of the resulting trained model on the benchmark test dataset. When used in an online fashion, models trained on this hybrid data performed similarly to those trained using traditional data augmentation (FETAL-125 F-score 85.33 ± 0.24 vs 86.89 ± 0.60, p-value 0.014; OB-125 F-score 74.60 ± 0.11 vs 72.43 ± 0.62, p-value 0.004). Furthermore, the ability to perform augmentations during training time, as well as the ability to apply chosen augmentations equally across data classes, are important considerations in designing a bespoke data augmentation. Finally, we provide open-source code to facilitate running bespoke data augmentations in an online fashion. Taken together, this work expands the ability to design and apply domain-guided data augmentations for medical imaging tasks.
format Online
Article
Text
id pubmed-10035842
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-100358422023-03-24 Domain-guided data augmentation for deep learning on medical imaging Athalye, Chinmayee Arnaout, Rima PLoS One Research Article While domain-specific data augmentation can be useful in training neural networks for medical imaging tasks, such techniques have not been widely used to date. Our objective was to test whether domain-specific data augmentation is useful for medical imaging using a well-benchmarked task: view classification on fetal ultrasound FETAL-125 and OB-125 datasets. We found that using a context-preserving cut-paste strategy, we could create valid training data as measured by performance of the resulting trained model on the benchmark test dataset. When used in an online fashion, models trained on this hybrid data performed similarly to those trained using traditional data augmentation (FETAL-125 F-score 85.33 ± 0.24 vs 86.89 ± 0.60, p-value 0.014; OB-125 F-score 74.60 ± 0.11 vs 72.43 ± 0.62, p-value 0.004). Furthermore, the ability to perform augmentations during training time, as well as the ability to apply chosen augmentations equally across data classes, are important considerations in designing a bespoke data augmentation. Finally, we provide open-source code to facilitate running bespoke data augmentations in an online fashion. Taken together, this work expands the ability to design and apply domain-guided data augmentations for medical imaging tasks. Public Library of Science 2023-03-23 /pmc/articles/PMC10035842/ /pubmed/36952442 http://dx.doi.org/10.1371/journal.pone.0282532 Text en © 2023 Athalye, Arnaout https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Athalye, Chinmayee
Arnaout, Rima
Domain-guided data augmentation for deep learning on medical imaging
title Domain-guided data augmentation for deep learning on medical imaging
title_full Domain-guided data augmentation for deep learning on medical imaging
title_fullStr Domain-guided data augmentation for deep learning on medical imaging
title_full_unstemmed Domain-guided data augmentation for deep learning on medical imaging
title_short Domain-guided data augmentation for deep learning on medical imaging
title_sort domain-guided data augmentation for deep learning on medical imaging
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10035842/
https://www.ncbi.nlm.nih.gov/pubmed/36952442
http://dx.doi.org/10.1371/journal.pone.0282532
work_keys_str_mv AT athalyechinmayee domainguideddataaugmentationfordeeplearningonmedicalimaging
AT arnaoutrima domainguideddataaugmentationfordeeplearningonmedicalimaging