Cargando…

Performance change with the number of training data: A case study on the binary classification of COVID-19 chest X-ray by using convolutional neural networks

One of the features of artificial intelligence/machine learning-based medical devices resides in their ability to learn from real-world data. However, obtaining a large number of training data in the early phase is difficult, and the device performance may change after their first introduction into...

Descripción completa

Detalles Bibliográficos
Autores principales: Imagawa, Kuniki, Shiomoto, Kohei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier Ltd. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9749084/
https://www.ncbi.nlm.nih.gov/pubmed/35093727
http://dx.doi.org/10.1016/j.compbiomed.2022.105251
_version_ 1784849968234758144
author Imagawa, Kuniki
Shiomoto, Kohei
author_facet Imagawa, Kuniki
Shiomoto, Kohei
author_sort Imagawa, Kuniki
collection PubMed
description One of the features of artificial intelligence/machine learning-based medical devices resides in their ability to learn from real-world data. However, obtaining a large number of training data in the early phase is difficult, and the device performance may change after their first introduction into the market. To introduce the safety and effectiveness of these devices into the market in a timely manner, an appropriate post-market performance change plan must be established at the timing of the premarket approval. In this work, we evaluate the performance change with the variation of the number of training data. Two publicly available datasets are used: one consisting of 4000 images for COVID-19 and another comprising 4000 images for Normal. The dataset was split into 7000 images for training and validation, also 1000 images for test. Furthermore, the training and validation data were selected as different 16 datasets. Two different convolutional neural networks, namely AlexNet and ResNet34, with and without a fine-tuning method were used to classify two image types. The area under the curve, sensitivity, and specificity were evaluated for each dataset. Our result shows that all performances were rapidly improved as the number of training data was increased and reached an equilibrium state. AlexNet outperformed ResNet34 when the number of images was small. The difference tended to decrease as the number of training data increased, and the fine-tuning method improved all performances. In conclusion, the appropriate model and method should be selected considering the intended performance and available number of data.
format Online
Article
Text
id pubmed-9749084
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier Ltd.
record_format MEDLINE/PubMed
spelling pubmed-97490842022-12-14 Performance change with the number of training data: A case study on the binary classification of COVID-19 chest X-ray by using convolutional neural networks Imagawa, Kuniki Shiomoto, Kohei Comput Biol Med Article One of the features of artificial intelligence/machine learning-based medical devices resides in their ability to learn from real-world data. However, obtaining a large number of training data in the early phase is difficult, and the device performance may change after their first introduction into the market. To introduce the safety and effectiveness of these devices into the market in a timely manner, an appropriate post-market performance change plan must be established at the timing of the premarket approval. In this work, we evaluate the performance change with the variation of the number of training data. Two publicly available datasets are used: one consisting of 4000 images for COVID-19 and another comprising 4000 images for Normal. The dataset was split into 7000 images for training and validation, also 1000 images for test. Furthermore, the training and validation data were selected as different 16 datasets. Two different convolutional neural networks, namely AlexNet and ResNet34, with and without a fine-tuning method were used to classify two image types. The area under the curve, sensitivity, and specificity were evaluated for each dataset. Our result shows that all performances were rapidly improved as the number of training data was increased and reached an equilibrium state. AlexNet outperformed ResNet34 when the number of images was small. The difference tended to decrease as the number of training data increased, and the fine-tuning method improved all performances. In conclusion, the appropriate model and method should be selected considering the intended performance and available number of data. Elsevier Ltd. 2022-03 2022-01-23 /pmc/articles/PMC9749084/ /pubmed/35093727 http://dx.doi.org/10.1016/j.compbiomed.2022.105251 Text en © 2022 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Imagawa, Kuniki
Shiomoto, Kohei
Performance change with the number of training data: A case study on the binary classification of COVID-19 chest X-ray by using convolutional neural networks
title Performance change with the number of training data: A case study on the binary classification of COVID-19 chest X-ray by using convolutional neural networks
title_full Performance change with the number of training data: A case study on the binary classification of COVID-19 chest X-ray by using convolutional neural networks
title_fullStr Performance change with the number of training data: A case study on the binary classification of COVID-19 chest X-ray by using convolutional neural networks
title_full_unstemmed Performance change with the number of training data: A case study on the binary classification of COVID-19 chest X-ray by using convolutional neural networks
title_short Performance change with the number of training data: A case study on the binary classification of COVID-19 chest X-ray by using convolutional neural networks
title_sort performance change with the number of training data: a case study on the binary classification of covid-19 chest x-ray by using convolutional neural networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9749084/
https://www.ncbi.nlm.nih.gov/pubmed/35093727
http://dx.doi.org/10.1016/j.compbiomed.2022.105251
work_keys_str_mv AT imagawakuniki performancechangewiththenumberoftrainingdataacasestudyonthebinaryclassificationofcovid19chestxraybyusingconvolutionalneuralnetworks
AT shiomotokohei performancechangewiththenumberoftrainingdataacasestudyonthebinaryclassificationofcovid19chestxraybyusingconvolutionalneuralnetworks