Cargando…
Effect of Dataset Size and Medical Image Modality on Convolutional Neural Network Model Performance for Automated Segmentation: A CT and MR Renal Tumor Imaging Study
The aim of this study is to investigate the use of an exponential-plateau model to determine the required training dataset size that yields the maximum medical image segmentation performance. CT and MR images of patients with renal tumors acquired between 1997 and 2017 were retrospectively collected...
Autores principales: | , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10406754/ https://www.ncbi.nlm.nih.gov/pubmed/36932251 http://dx.doi.org/10.1007/s10278-023-00804-1 |
_version_ | 1785085811881934848 |
---|---|
author | Gottlich, Harrison C. Gregory, Adriana V. Sharma, Vidit Khanna, Abhinav Moustafa, Amr U. Lohse, Christine M. Potretzke, Theodora A. Korfiatis, Panagiotis Potretzke, Aaron M. Denic, Aleksandar Rule, Andrew D. Takahashi, Naoki Erickson, Bradley J. Leibovich, Bradley C. Kline, Timothy L. |
author_facet | Gottlich, Harrison C. Gregory, Adriana V. Sharma, Vidit Khanna, Abhinav Moustafa, Amr U. Lohse, Christine M. Potretzke, Theodora A. Korfiatis, Panagiotis Potretzke, Aaron M. Denic, Aleksandar Rule, Andrew D. Takahashi, Naoki Erickson, Bradley J. Leibovich, Bradley C. Kline, Timothy L. |
author_sort | Gottlich, Harrison C. |
collection | PubMed |
description | The aim of this study is to investigate the use of an exponential-plateau model to determine the required training dataset size that yields the maximum medical image segmentation performance. CT and MR images of patients with renal tumors acquired between 1997 and 2017 were retrospectively collected from our nephrectomy registry. Modality-based datasets of 50, 100, 150, 200, 250, and 300 images were assembled to train models with an 80–20 training-validation split evaluated against 50 randomly held out test set images. A third experiment using the KiTS21 dataset was also used to explore the effects of different model architectures. Exponential-plateau models were used to establish the relationship of dataset size to model generalizability performance. For segmenting non-neoplastic kidney regions on CT and MR imaging, our model yielded test Dice score plateaus of [Formula: see text] and [Formula: see text] with the number of training-validation images needed to reach the plateaus of 54 and 122, respectively. For segmenting CT and MR tumor regions, we modeled a test Dice score plateau of [Formula: see text] and [Formula: see text] , with 125 and 389 training-validation images needed to reach the plateaus. For the KiTS21 dataset, the best Dice score plateaus for nn-UNet 2D and 3D architectures were [Formula: see text] and [Formula: see text] with number to reach performance plateau of 177 and 440. Our research validates that differing imaging modalities, target structures, and model architectures all affect the amount of training images required to reach a performance plateau. The modeling approach we developed will help future researchers determine for their experiments when additional training-validation images will likely not further improve model performance. |
format | Online Article Text |
id | pubmed-10406754 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-104067542023-08-09 Effect of Dataset Size and Medical Image Modality on Convolutional Neural Network Model Performance for Automated Segmentation: A CT and MR Renal Tumor Imaging Study Gottlich, Harrison C. Gregory, Adriana V. Sharma, Vidit Khanna, Abhinav Moustafa, Amr U. Lohse, Christine M. Potretzke, Theodora A. Korfiatis, Panagiotis Potretzke, Aaron M. Denic, Aleksandar Rule, Andrew D. Takahashi, Naoki Erickson, Bradley J. Leibovich, Bradley C. Kline, Timothy L. J Digit Imaging Article The aim of this study is to investigate the use of an exponential-plateau model to determine the required training dataset size that yields the maximum medical image segmentation performance. CT and MR images of patients with renal tumors acquired between 1997 and 2017 were retrospectively collected from our nephrectomy registry. Modality-based datasets of 50, 100, 150, 200, 250, and 300 images were assembled to train models with an 80–20 training-validation split evaluated against 50 randomly held out test set images. A third experiment using the KiTS21 dataset was also used to explore the effects of different model architectures. Exponential-plateau models were used to establish the relationship of dataset size to model generalizability performance. For segmenting non-neoplastic kidney regions on CT and MR imaging, our model yielded test Dice score plateaus of [Formula: see text] and [Formula: see text] with the number of training-validation images needed to reach the plateaus of 54 and 122, respectively. For segmenting CT and MR tumor regions, we modeled a test Dice score plateau of [Formula: see text] and [Formula: see text] , with 125 and 389 training-validation images needed to reach the plateaus. For the KiTS21 dataset, the best Dice score plateaus for nn-UNet 2D and 3D architectures were [Formula: see text] and [Formula: see text] with number to reach performance plateau of 177 and 440. Our research validates that differing imaging modalities, target structures, and model architectures all affect the amount of training images required to reach a performance plateau. The modeling approach we developed will help future researchers determine for their experiments when additional training-validation images will likely not further improve model performance. Springer International Publishing 2023-03-17 2023-08 /pmc/articles/PMC10406754/ /pubmed/36932251 http://dx.doi.org/10.1007/s10278-023-00804-1 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Gottlich, Harrison C. Gregory, Adriana V. Sharma, Vidit Khanna, Abhinav Moustafa, Amr U. Lohse, Christine M. Potretzke, Theodora A. Korfiatis, Panagiotis Potretzke, Aaron M. Denic, Aleksandar Rule, Andrew D. Takahashi, Naoki Erickson, Bradley J. Leibovich, Bradley C. Kline, Timothy L. Effect of Dataset Size and Medical Image Modality on Convolutional Neural Network Model Performance for Automated Segmentation: A CT and MR Renal Tumor Imaging Study |
title | Effect of Dataset Size and Medical Image Modality on Convolutional Neural Network Model Performance for Automated Segmentation: A CT and MR Renal Tumor Imaging Study |
title_full | Effect of Dataset Size and Medical Image Modality on Convolutional Neural Network Model Performance for Automated Segmentation: A CT and MR Renal Tumor Imaging Study |
title_fullStr | Effect of Dataset Size and Medical Image Modality on Convolutional Neural Network Model Performance for Automated Segmentation: A CT and MR Renal Tumor Imaging Study |
title_full_unstemmed | Effect of Dataset Size and Medical Image Modality on Convolutional Neural Network Model Performance for Automated Segmentation: A CT and MR Renal Tumor Imaging Study |
title_short | Effect of Dataset Size and Medical Image Modality on Convolutional Neural Network Model Performance for Automated Segmentation: A CT and MR Renal Tumor Imaging Study |
title_sort | effect of dataset size and medical image modality on convolutional neural network model performance for automated segmentation: a ct and mr renal tumor imaging study |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10406754/ https://www.ncbi.nlm.nih.gov/pubmed/36932251 http://dx.doi.org/10.1007/s10278-023-00804-1 |
work_keys_str_mv | AT gottlichharrisonc effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy AT gregoryadrianav effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy AT sharmavidit effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy AT khannaabhinav effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy AT moustafaamru effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy AT lohsechristinem effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy AT potretzketheodoraa effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy AT korfiatispanagiotis effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy AT potretzkeaaronm effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy AT denicaleksandar effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy AT ruleandrewd effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy AT takahashinaoki effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy AT ericksonbradleyj effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy AT leibovichbradleyc effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy AT klinetimothyl effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy |