Cargando…

Generalizability assessment of COVID-19 3D CT data for deep learning-based disease detection

BACKGROUND: Artificial intelligence technologies in classification/detection of COVID-19 positive cases suffer from generalizability. Moreover, accessing and preparing another large dataset is not always feasible and time-consuming. Several studies have combined smaller COVID-19 CT datasets into “su...

Descripción completa

Detalles Bibliográficos
Autores principales:	Fallahpoor, Maryam, Chakraborty, Subrata, Heshejin, Mohammad Tavakoli, Chegeni, Hossein, Horry, Michael James, Pradhan, Biswajeet
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Elsevier Ltd. 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8971071/ https://www.ncbi.nlm.nih.gov/pubmed/35390746 http://dx.doi.org/10.1016/j.compbiomed.2022.105464

_version_	1784679567200354304
author	Fallahpoor, Maryam Chakraborty, Subrata Heshejin, Mohammad Tavakoli Chegeni, Hossein Horry, Michael James Pradhan, Biswajeet
author_facet	Fallahpoor, Maryam Chakraborty, Subrata Heshejin, Mohammad Tavakoli Chegeni, Hossein Horry, Michael James Pradhan, Biswajeet
author_sort	Fallahpoor, Maryam
collection	PubMed
description	BACKGROUND: Artificial intelligence technologies in classification/detection of COVID-19 positive cases suffer from generalizability. Moreover, accessing and preparing another large dataset is not always feasible and time-consuming. Several studies have combined smaller COVID-19 CT datasets into “supersets” to maximize the number of training samples. This study aims to assess generalizability by splitting datasets into different portions based on 3D CT images using deep learning. METHOD: Two large datasets, including 1110 3D CT images, were split into five segments of 20% each. Each dataset's first 20% segment was separated as a holdout test set. 3D-CNN training was performed with the remaining 80% from each dataset. Two small external datasets were also used to independently evaluate the trained models. RESULTS: The total combination of 80% of each dataset has an accuracy of 91% on Iranmehr and 83% on Moscow holdout test datasets. Results indicated that 80% of the primary datasets are adequate for fully training a model. The additional fine-tuning using 40% of a secondary dataset helps the model generalize to a third, unseen dataset. The highest accuracy achieved through transfer learning was 85% on LDCT dataset and 83% on Iranmehr holdout test sets when retrained on 80% of Iranmehr dataset. CONCLUSION: While the total combination of both datasets produced the best results, different combinations and transfer learning still produced generalizable results. Adopting the proposed methodology may help to obtain satisfactory results in the case of limited external datasets.
format	Online Article Text
id	pubmed-8971071
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Elsevier Ltd.
record_format	MEDLINE/PubMed
spelling	pubmed-89710712022-04-01 Generalizability assessment of COVID-19 3D CT data for deep learning-based disease detection Fallahpoor, Maryam Chakraborty, Subrata Heshejin, Mohammad Tavakoli Chegeni, Hossein Horry, Michael James Pradhan, Biswajeet Comput Biol Med Article BACKGROUND: Artificial intelligence technologies in classification/detection of COVID-19 positive cases suffer from generalizability. Moreover, accessing and preparing another large dataset is not always feasible and time-consuming. Several studies have combined smaller COVID-19 CT datasets into “supersets” to maximize the number of training samples. This study aims to assess generalizability by splitting datasets into different portions based on 3D CT images using deep learning. METHOD: Two large datasets, including 1110 3D CT images, were split into five segments of 20% each. Each dataset's first 20% segment was separated as a holdout test set. 3D-CNN training was performed with the remaining 80% from each dataset. Two small external datasets were also used to independently evaluate the trained models. RESULTS: The total combination of 80% of each dataset has an accuracy of 91% on Iranmehr and 83% on Moscow holdout test datasets. Results indicated that 80% of the primary datasets are adequate for fully training a model. The additional fine-tuning using 40% of a secondary dataset helps the model generalize to a third, unseen dataset. The highest accuracy achieved through transfer learning was 85% on LDCT dataset and 83% on Iranmehr holdout test sets when retrained on 80% of Iranmehr dataset. CONCLUSION: While the total combination of both datasets produced the best results, different combinations and transfer learning still produced generalizable results. Adopting the proposed methodology may help to obtain satisfactory results in the case of limited external datasets. Elsevier Ltd. 2022-06 2022-04-01 /pmc/articles/PMC8971071/ /pubmed/35390746 http://dx.doi.org/10.1016/j.compbiomed.2022.105464 Text en © 2022 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle	Article Fallahpoor, Maryam Chakraborty, Subrata Heshejin, Mohammad Tavakoli Chegeni, Hossein Horry, Michael James Pradhan, Biswajeet Generalizability assessment of COVID-19 3D CT data for deep learning-based disease detection
title	Generalizability assessment of COVID-19 3D CT data for deep learning-based disease detection
title_full	Generalizability assessment of COVID-19 3D CT data for deep learning-based disease detection
title_fullStr	Generalizability assessment of COVID-19 3D CT data for deep learning-based disease detection
title_full_unstemmed	Generalizability assessment of COVID-19 3D CT data for deep learning-based disease detection
title_short	Generalizability assessment of COVID-19 3D CT data for deep learning-based disease detection
title_sort	generalizability assessment of covid-19 3d ct data for deep learning-based disease detection
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8971071/ https://www.ncbi.nlm.nih.gov/pubmed/35390746 http://dx.doi.org/10.1016/j.compbiomed.2022.105464
work_keys_str_mv	AT fallahpoormaryam generalizabilityassessmentofcovid193dctdatafordeeplearningbaseddiseasedetection AT chakrabortysubrata generalizabilityassessmentofcovid193dctdatafordeeplearningbaseddiseasedetection AT heshejinmohammadtavakoli generalizabilityassessmentofcovid193dctdatafordeeplearningbaseddiseasedetection AT chegenihossein generalizabilityassessmentofcovid193dctdatafordeeplearningbaseddiseasedetection AT horrymichaeljames generalizabilityassessmentofcovid193dctdatafordeeplearningbaseddiseasedetection AT pradhanbiswajeet generalizabilityassessmentofcovid193dctdatafordeeplearningbaseddiseasedetection

Generalizability assessment of COVID-19 3D CT data for deep learning-based disease detection

Ejemplares similares