Cargando…
Performance change with the number of training data: A case study on the binary classification of COVID-19 chest X-ray by using convolutional neural networks
One of the features of artificial intelligence/machine learning-based medical devices resides in their ability to learn from real-world data. However, obtaining a large number of training data in the early phase is difficult, and the device performance may change after their first introduction into...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier Ltd.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9749084/ https://www.ncbi.nlm.nih.gov/pubmed/35093727 http://dx.doi.org/10.1016/j.compbiomed.2022.105251 |
_version_ | 1784849968234758144 |
---|---|
author | Imagawa, Kuniki Shiomoto, Kohei |
author_facet | Imagawa, Kuniki Shiomoto, Kohei |
author_sort | Imagawa, Kuniki |
collection | PubMed |
description | One of the features of artificial intelligence/machine learning-based medical devices resides in their ability to learn from real-world data. However, obtaining a large number of training data in the early phase is difficult, and the device performance may change after their first introduction into the market. To introduce the safety and effectiveness of these devices into the market in a timely manner, an appropriate post-market performance change plan must be established at the timing of the premarket approval. In this work, we evaluate the performance change with the variation of the number of training data. Two publicly available datasets are used: one consisting of 4000 images for COVID-19 and another comprising 4000 images for Normal. The dataset was split into 7000 images for training and validation, also 1000 images for test. Furthermore, the training and validation data were selected as different 16 datasets. Two different convolutional neural networks, namely AlexNet and ResNet34, with and without a fine-tuning method were used to classify two image types. The area under the curve, sensitivity, and specificity were evaluated for each dataset. Our result shows that all performances were rapidly improved as the number of training data was increased and reached an equilibrium state. AlexNet outperformed ResNet34 when the number of images was small. The difference tended to decrease as the number of training data increased, and the fine-tuning method improved all performances. In conclusion, the appropriate model and method should be selected considering the intended performance and available number of data. |
format | Online Article Text |
id | pubmed-9749084 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier Ltd. |
record_format | MEDLINE/PubMed |
spelling | pubmed-97490842022-12-14 Performance change with the number of training data: A case study on the binary classification of COVID-19 chest X-ray by using convolutional neural networks Imagawa, Kuniki Shiomoto, Kohei Comput Biol Med Article One of the features of artificial intelligence/machine learning-based medical devices resides in their ability to learn from real-world data. However, obtaining a large number of training data in the early phase is difficult, and the device performance may change after their first introduction into the market. To introduce the safety and effectiveness of these devices into the market in a timely manner, an appropriate post-market performance change plan must be established at the timing of the premarket approval. In this work, we evaluate the performance change with the variation of the number of training data. Two publicly available datasets are used: one consisting of 4000 images for COVID-19 and another comprising 4000 images for Normal. The dataset was split into 7000 images for training and validation, also 1000 images for test. Furthermore, the training and validation data were selected as different 16 datasets. Two different convolutional neural networks, namely AlexNet and ResNet34, with and without a fine-tuning method were used to classify two image types. The area under the curve, sensitivity, and specificity were evaluated for each dataset. Our result shows that all performances were rapidly improved as the number of training data was increased and reached an equilibrium state. AlexNet outperformed ResNet34 when the number of images was small. The difference tended to decrease as the number of training data increased, and the fine-tuning method improved all performances. In conclusion, the appropriate model and method should be selected considering the intended performance and available number of data. Elsevier Ltd. 2022-03 2022-01-23 /pmc/articles/PMC9749084/ /pubmed/35093727 http://dx.doi.org/10.1016/j.compbiomed.2022.105251 Text en © 2022 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Imagawa, Kuniki Shiomoto, Kohei Performance change with the number of training data: A case study on the binary classification of COVID-19 chest X-ray by using convolutional neural networks |
title | Performance change with the number of training data: A case study on the binary classification of COVID-19 chest X-ray by using convolutional neural networks |
title_full | Performance change with the number of training data: A case study on the binary classification of COVID-19 chest X-ray by using convolutional neural networks |
title_fullStr | Performance change with the number of training data: A case study on the binary classification of COVID-19 chest X-ray by using convolutional neural networks |
title_full_unstemmed | Performance change with the number of training data: A case study on the binary classification of COVID-19 chest X-ray by using convolutional neural networks |
title_short | Performance change with the number of training data: A case study on the binary classification of COVID-19 chest X-ray by using convolutional neural networks |
title_sort | performance change with the number of training data: a case study on the binary classification of covid-19 chest x-ray by using convolutional neural networks |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9749084/ https://www.ncbi.nlm.nih.gov/pubmed/35093727 http://dx.doi.org/10.1016/j.compbiomed.2022.105251 |
work_keys_str_mv | AT imagawakuniki performancechangewiththenumberoftrainingdataacasestudyonthebinaryclassificationofcovid19chestxraybyusingconvolutionalneuralnetworks AT shiomotokohei performancechangewiththenumberoftrainingdataacasestudyonthebinaryclassificationofcovid19chestxraybyusingconvolutionalneuralnetworks |