Cargando…

Effect of Dataset Size and Medical Image Modality on Convolutional Neural Network Model Performance for Automated Segmentation: A CT and MR Renal Tumor Imaging Study

The aim of this study is to investigate the use of an exponential-plateau model to determine the required training dataset size that yields the maximum medical image segmentation performance. CT and MR images of patients with renal tumors acquired between 1997 and 2017 were retrospectively collected...

Descripción completa

Detalles Bibliográficos
Autores principales: Gottlich, Harrison C., Gregory, Adriana V., Sharma, Vidit, Khanna, Abhinav, Moustafa, Amr U., Lohse, Christine M., Potretzke, Theodora A., Korfiatis, Panagiotis, Potretzke, Aaron M., Denic, Aleksandar, Rule, Andrew D., Takahashi, Naoki, Erickson, Bradley J., Leibovich, Bradley C., Kline, Timothy L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10406754/
https://www.ncbi.nlm.nih.gov/pubmed/36932251
http://dx.doi.org/10.1007/s10278-023-00804-1
_version_ 1785085811881934848
author Gottlich, Harrison C.
Gregory, Adriana V.
Sharma, Vidit
Khanna, Abhinav
Moustafa, Amr U.
Lohse, Christine M.
Potretzke, Theodora A.
Korfiatis, Panagiotis
Potretzke, Aaron M.
Denic, Aleksandar
Rule, Andrew D.
Takahashi, Naoki
Erickson, Bradley J.
Leibovich, Bradley C.
Kline, Timothy L.
author_facet Gottlich, Harrison C.
Gregory, Adriana V.
Sharma, Vidit
Khanna, Abhinav
Moustafa, Amr U.
Lohse, Christine M.
Potretzke, Theodora A.
Korfiatis, Panagiotis
Potretzke, Aaron M.
Denic, Aleksandar
Rule, Andrew D.
Takahashi, Naoki
Erickson, Bradley J.
Leibovich, Bradley C.
Kline, Timothy L.
author_sort Gottlich, Harrison C.
collection PubMed
description The aim of this study is to investigate the use of an exponential-plateau model to determine the required training dataset size that yields the maximum medical image segmentation performance. CT and MR images of patients with renal tumors acquired between 1997 and 2017 were retrospectively collected from our nephrectomy registry. Modality-based datasets of 50, 100, 150, 200, 250, and 300 images were assembled to train models with an 80–20 training-validation split evaluated against 50 randomly held out test set images. A third experiment using the KiTS21 dataset was also used to explore the effects of different model architectures. Exponential-plateau models were used to establish the relationship of dataset size to model generalizability performance. For segmenting non-neoplastic kidney regions on CT and MR imaging, our model yielded test Dice score plateaus of [Formula: see text] and [Formula: see text] with the number of training-validation images needed to reach the plateaus of 54 and 122, respectively. For segmenting CT and MR tumor regions, we modeled a test Dice score plateau of [Formula: see text] and [Formula: see text] , with 125 and 389 training-validation images needed to reach the plateaus. For the KiTS21 dataset, the best Dice score plateaus for nn-UNet 2D and 3D architectures were [Formula: see text] and [Formula: see text] with number to reach performance plateau of 177 and 440. Our research validates that differing imaging modalities, target structures, and model architectures all affect the amount of training images required to reach a performance plateau. The modeling approach we developed will help future researchers determine for their experiments when additional training-validation images will likely not further improve model performance.
format Online
Article
Text
id pubmed-10406754
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-104067542023-08-09 Effect of Dataset Size and Medical Image Modality on Convolutional Neural Network Model Performance for Automated Segmentation: A CT and MR Renal Tumor Imaging Study Gottlich, Harrison C. Gregory, Adriana V. Sharma, Vidit Khanna, Abhinav Moustafa, Amr U. Lohse, Christine M. Potretzke, Theodora A. Korfiatis, Panagiotis Potretzke, Aaron M. Denic, Aleksandar Rule, Andrew D. Takahashi, Naoki Erickson, Bradley J. Leibovich, Bradley C. Kline, Timothy L. J Digit Imaging Article The aim of this study is to investigate the use of an exponential-plateau model to determine the required training dataset size that yields the maximum medical image segmentation performance. CT and MR images of patients with renal tumors acquired between 1997 and 2017 were retrospectively collected from our nephrectomy registry. Modality-based datasets of 50, 100, 150, 200, 250, and 300 images were assembled to train models with an 80–20 training-validation split evaluated against 50 randomly held out test set images. A third experiment using the KiTS21 dataset was also used to explore the effects of different model architectures. Exponential-plateau models were used to establish the relationship of dataset size to model generalizability performance. For segmenting non-neoplastic kidney regions on CT and MR imaging, our model yielded test Dice score plateaus of [Formula: see text] and [Formula: see text] with the number of training-validation images needed to reach the plateaus of 54 and 122, respectively. For segmenting CT and MR tumor regions, we modeled a test Dice score plateau of [Formula: see text] and [Formula: see text] , with 125 and 389 training-validation images needed to reach the plateaus. For the KiTS21 dataset, the best Dice score plateaus for nn-UNet 2D and 3D architectures were [Formula: see text] and [Formula: see text] with number to reach performance plateau of 177 and 440. Our research validates that differing imaging modalities, target structures, and model architectures all affect the amount of training images required to reach a performance plateau. The modeling approach we developed will help future researchers determine for their experiments when additional training-validation images will likely not further improve model performance. Springer International Publishing 2023-03-17 2023-08 /pmc/articles/PMC10406754/ /pubmed/36932251 http://dx.doi.org/10.1007/s10278-023-00804-1 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Gottlich, Harrison C.
Gregory, Adriana V.
Sharma, Vidit
Khanna, Abhinav
Moustafa, Amr U.
Lohse, Christine M.
Potretzke, Theodora A.
Korfiatis, Panagiotis
Potretzke, Aaron M.
Denic, Aleksandar
Rule, Andrew D.
Takahashi, Naoki
Erickson, Bradley J.
Leibovich, Bradley C.
Kline, Timothy L.
Effect of Dataset Size and Medical Image Modality on Convolutional Neural Network Model Performance for Automated Segmentation: A CT and MR Renal Tumor Imaging Study
title Effect of Dataset Size and Medical Image Modality on Convolutional Neural Network Model Performance for Automated Segmentation: A CT and MR Renal Tumor Imaging Study
title_full Effect of Dataset Size and Medical Image Modality on Convolutional Neural Network Model Performance for Automated Segmentation: A CT and MR Renal Tumor Imaging Study
title_fullStr Effect of Dataset Size and Medical Image Modality on Convolutional Neural Network Model Performance for Automated Segmentation: A CT and MR Renal Tumor Imaging Study
title_full_unstemmed Effect of Dataset Size and Medical Image Modality on Convolutional Neural Network Model Performance for Automated Segmentation: A CT and MR Renal Tumor Imaging Study
title_short Effect of Dataset Size and Medical Image Modality on Convolutional Neural Network Model Performance for Automated Segmentation: A CT and MR Renal Tumor Imaging Study
title_sort effect of dataset size and medical image modality on convolutional neural network model performance for automated segmentation: a ct and mr renal tumor imaging study
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10406754/
https://www.ncbi.nlm.nih.gov/pubmed/36932251
http://dx.doi.org/10.1007/s10278-023-00804-1
work_keys_str_mv AT gottlichharrisonc effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy
AT gregoryadrianav effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy
AT sharmavidit effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy
AT khannaabhinav effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy
AT moustafaamru effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy
AT lohsechristinem effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy
AT potretzketheodoraa effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy
AT korfiatispanagiotis effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy
AT potretzkeaaronm effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy
AT denicaleksandar effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy
AT ruleandrewd effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy
AT takahashinaoki effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy
AT ericksonbradleyj effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy
AT leibovichbradleyc effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy
AT klinetimothyl effectofdatasetsizeandmedicalimagemodalityonconvolutionalneuralnetworkmodelperformanceforautomatedsegmentationactandmrrenaltumorimagingstudy