Cargando…

Aligning Small Datasets Using Domain Adversarial Learning: Applications in Automated in Vivo Oral Cancer Diagnosis

Deep learning approaches for medical image analysis are limited by small data set size due to factors such as patient privacy and difficulties in obtaining expert labelling for each image. In medical imaging system development pipelines, phases for system development and classification algorithms of...

Descripción completa

Detalles Bibliográficos
Autores principales: Caughlin, Kayla, Duran-Sierra, Elvis, Cheng, Shuna, Cuenca, Rodrigo, Ahmed, Beena, Ji, Jim, Martinez, Mathias, Al-Khalil, Moustafa, Al-Enazi, Hussain, Cheng, Yi-Shing Lisa, Wright, John, Jo, Javier A., Busso, Carlos
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10079633/
https://www.ncbi.nlm.nih.gov/pubmed/36279347
http://dx.doi.org/10.1109/JBHI.2022.3217015
_version_ 1785020753425465344
author Caughlin, Kayla
Duran-Sierra, Elvis
Cheng, Shuna
Cuenca, Rodrigo
Ahmed, Beena
Ji, Jim
Martinez, Mathias
Al-Khalil, Moustafa
Al-Enazi, Hussain
Cheng, Yi-Shing Lisa
Wright, John
Jo, Javier A.
Busso, Carlos
author_facet Caughlin, Kayla
Duran-Sierra, Elvis
Cheng, Shuna
Cuenca, Rodrigo
Ahmed, Beena
Ji, Jim
Martinez, Mathias
Al-Khalil, Moustafa
Al-Enazi, Hussain
Cheng, Yi-Shing Lisa
Wright, John
Jo, Javier A.
Busso, Carlos
author_sort Caughlin, Kayla
collection PubMed
description Deep learning approaches for medical image analysis are limited by small data set size due to factors such as patient privacy and difficulties in obtaining expert labelling for each image. In medical imaging system development pipelines, phases for system development and classification algorithms often overlap with data collection, creating small disjoint data sets collected at numerous locations with differing protocols. In this setting, merging data from different data collection centers increases the amount of training data. However, a direct combination of datasets will likely fail due to domain shifts between imaging centers. In contrast to previous approaches that focus on a single data set, we add a domain adaptation module to a neural network and train using multiple data sets. Our approach encourages domain invariance between two multispectral autofluorescence imaging (maFLIM) data sets of in vivo oral lesions collected with an imaging system currently in development. The two data sets have differences in the sub-populations imaged and in the calibration procedures used during data collection. We mitigate these differences using a gradient reversal layer and domain classifier. Our final model trained with two data sets substantially increases performance, including a significant increase in specificity. We also achieve a significant increase in average performance over the best baseline model train with two domains (p = 0.0341). Our approach lays the foundation for faster development of computer-aided diagnostic systems and presents a feasible approach for creating a robust classifier that aligns images from multiple data centers in the presence of domain shifts.
format Online
Article
Text
id pubmed-10079633
institution National Center for Biotechnology Information
language English
publishDate 2023
record_format MEDLINE/PubMed
spelling pubmed-100796332023-04-07 Aligning Small Datasets Using Domain Adversarial Learning: Applications in Automated in Vivo Oral Cancer Diagnosis Caughlin, Kayla Duran-Sierra, Elvis Cheng, Shuna Cuenca, Rodrigo Ahmed, Beena Ji, Jim Martinez, Mathias Al-Khalil, Moustafa Al-Enazi, Hussain Cheng, Yi-Shing Lisa Wright, John Jo, Javier A. Busso, Carlos IEEE J Biomed Health Inform Article Deep learning approaches for medical image analysis are limited by small data set size due to factors such as patient privacy and difficulties in obtaining expert labelling for each image. In medical imaging system development pipelines, phases for system development and classification algorithms often overlap with data collection, creating small disjoint data sets collected at numerous locations with differing protocols. In this setting, merging data from different data collection centers increases the amount of training data. However, a direct combination of datasets will likely fail due to domain shifts between imaging centers. In contrast to previous approaches that focus on a single data set, we add a domain adaptation module to a neural network and train using multiple data sets. Our approach encourages domain invariance between two multispectral autofluorescence imaging (maFLIM) data sets of in vivo oral lesions collected with an imaging system currently in development. The two data sets have differences in the sub-populations imaged and in the calibration procedures used during data collection. We mitigate these differences using a gradient reversal layer and domain classifier. Our final model trained with two data sets substantially increases performance, including a significant increase in specificity. We also achieve a significant increase in average performance over the best baseline model train with two domains (p = 0.0341). Our approach lays the foundation for faster development of computer-aided diagnostic systems and presents a feasible approach for creating a robust classifier that aligns images from multiple data centers in the presence of domain shifts. 2023-01 2023-01-04 /pmc/articles/PMC10079633/ /pubmed/36279347 http://dx.doi.org/10.1109/JBHI.2022.3217015 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. For more information, see https://creativecommons.org/licenses/by-nc-nd/4.0/
spellingShingle Article
Caughlin, Kayla
Duran-Sierra, Elvis
Cheng, Shuna
Cuenca, Rodrigo
Ahmed, Beena
Ji, Jim
Martinez, Mathias
Al-Khalil, Moustafa
Al-Enazi, Hussain
Cheng, Yi-Shing Lisa
Wright, John
Jo, Javier A.
Busso, Carlos
Aligning Small Datasets Using Domain Adversarial Learning: Applications in Automated in Vivo Oral Cancer Diagnosis
title Aligning Small Datasets Using Domain Adversarial Learning: Applications in Automated in Vivo Oral Cancer Diagnosis
title_full Aligning Small Datasets Using Domain Adversarial Learning: Applications in Automated in Vivo Oral Cancer Diagnosis
title_fullStr Aligning Small Datasets Using Domain Adversarial Learning: Applications in Automated in Vivo Oral Cancer Diagnosis
title_full_unstemmed Aligning Small Datasets Using Domain Adversarial Learning: Applications in Automated in Vivo Oral Cancer Diagnosis
title_short Aligning Small Datasets Using Domain Adversarial Learning: Applications in Automated in Vivo Oral Cancer Diagnosis
title_sort aligning small datasets using domain adversarial learning: applications in automated in vivo oral cancer diagnosis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10079633/
https://www.ncbi.nlm.nih.gov/pubmed/36279347
http://dx.doi.org/10.1109/JBHI.2022.3217015
work_keys_str_mv AT caughlinkayla aligningsmalldatasetsusingdomainadversariallearningapplicationsinautomatedinvivooralcancerdiagnosis
AT duransierraelvis aligningsmalldatasetsusingdomainadversariallearningapplicationsinautomatedinvivooralcancerdiagnosis
AT chengshuna aligningsmalldatasetsusingdomainadversariallearningapplicationsinautomatedinvivooralcancerdiagnosis
AT cuencarodrigo aligningsmalldatasetsusingdomainadversariallearningapplicationsinautomatedinvivooralcancerdiagnosis
AT ahmedbeena aligningsmalldatasetsusingdomainadversariallearningapplicationsinautomatedinvivooralcancerdiagnosis
AT jijim aligningsmalldatasetsusingdomainadversariallearningapplicationsinautomatedinvivooralcancerdiagnosis
AT martinezmathias aligningsmalldatasetsusingdomainadversariallearningapplicationsinautomatedinvivooralcancerdiagnosis
AT alkhalilmoustafa aligningsmalldatasetsusingdomainadversariallearningapplicationsinautomatedinvivooralcancerdiagnosis
AT alenazihussain aligningsmalldatasetsusingdomainadversariallearningapplicationsinautomatedinvivooralcancerdiagnosis
AT chengyishinglisa aligningsmalldatasetsusingdomainadversariallearningapplicationsinautomatedinvivooralcancerdiagnosis
AT wrightjohn aligningsmalldatasetsusingdomainadversariallearningapplicationsinautomatedinvivooralcancerdiagnosis
AT jojaviera aligningsmalldatasetsusingdomainadversariallearningapplicationsinautomatedinvivooralcancerdiagnosis
AT bussocarlos aligningsmalldatasetsusingdomainadversariallearningapplicationsinautomatedinvivooralcancerdiagnosis