Cargando…

Harnessing clinical annotations to improve deep learning performance in prostate segmentation

PURPOSE: Developing large-scale datasets with research-quality annotations is challenging due to the high cost of refining clinically generated markup into high precision annotations. We evaluated the direct use of a large dataset with only clinically generated annotations in development of high-per...

Descripción completa

Detalles Bibliográficos
Autores principales:	Sarma, Karthik V., Raman, Alex G., Dhinagar, Nikhil J., Priester, Alan M., Harmon, Stephanie, Sanford, Thomas, Mehralivand, Sherif, Turkbey, Baris, Marks, Leonard S., Raman, Steven S., Speier, William, Arnold, Corey W.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2021
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8232529/ https://www.ncbi.nlm.nih.gov/pubmed/34170972 http://dx.doi.org/10.1371/journal.pone.0253829

_version_	1783713656803950592
author	Sarma, Karthik V. Raman, Alex G. Dhinagar, Nikhil J. Priester, Alan M. Harmon, Stephanie Sanford, Thomas Mehralivand, Sherif Turkbey, Baris Marks, Leonard S. Raman, Steven S. Speier, William Arnold, Corey W.
author_facet	Sarma, Karthik V. Raman, Alex G. Dhinagar, Nikhil J. Priester, Alan M. Harmon, Stephanie Sanford, Thomas Mehralivand, Sherif Turkbey, Baris Marks, Leonard S. Raman, Steven S. Speier, William Arnold, Corey W.
author_sort	Sarma, Karthik V.
collection	PubMed
description	PURPOSE: Developing large-scale datasets with research-quality annotations is challenging due to the high cost of refining clinically generated markup into high precision annotations. We evaluated the direct use of a large dataset with only clinically generated annotations in development of high-performance segmentation models for small research-quality challenge datasets. MATERIALS AND METHODS: We used a large retrospective dataset from our institution comprised of 1,620 clinically generated segmentations, and two challenge datasets (PROMISE12: 50 patients, ProstateX-2: 99 patients). We trained a 3D U-Net convolutional neural network (CNN) segmentation model using our entire dataset, and used that model as a template to train models on the challenge datasets. We also trained versions of the template model using ablated proportions of our dataset, and evaluated the relative benefit of those templates for the final models. Finally, we trained a version of the template model using an out-of-domain brain cancer dataset, and evaluated the relevant benefit of that template for the final models. We used five-fold cross-validation (CV) for all training and evaluation across our entire dataset. RESULTS: Our model achieves state-of-the-art performance on our large dataset (mean overall Dice 0.916, average Hausdorff distance 0.135 across CV folds). Using this model as a pre-trained template for refining on two external datasets significantly enhanced performance (30% and 49% enhancement in Dice scores respectively). Mean overall Dice and mean average Hausdorff distance were 0.912 and 0.15 for the ProstateX-2 dataset, and 0.852 and 0.581 for the PROMISE12 dataset. Using even small quantities of data to train the template enhanced performance, with significant improvements using 5% or more of the data. CONCLUSION: We trained a state-of-the-art model using unrefined clinical prostate annotations and found that its use as a template model significantly improved performance in other prostate segmentation tasks, even when trained with only 5% of the original dataset.
format	Online Article Text
id	pubmed-8232529
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-82325292021-07-07 Harnessing clinical annotations to improve deep learning performance in prostate segmentation Sarma, Karthik V. Raman, Alex G. Dhinagar, Nikhil J. Priester, Alan M. Harmon, Stephanie Sanford, Thomas Mehralivand, Sherif Turkbey, Baris Marks, Leonard S. Raman, Steven S. Speier, William Arnold, Corey W. PLoS One Research Article PURPOSE: Developing large-scale datasets with research-quality annotations is challenging due to the high cost of refining clinically generated markup into high precision annotations. We evaluated the direct use of a large dataset with only clinically generated annotations in development of high-performance segmentation models for small research-quality challenge datasets. MATERIALS AND METHODS: We used a large retrospective dataset from our institution comprised of 1,620 clinically generated segmentations, and two challenge datasets (PROMISE12: 50 patients, ProstateX-2: 99 patients). We trained a 3D U-Net convolutional neural network (CNN) segmentation model using our entire dataset, and used that model as a template to train models on the challenge datasets. We also trained versions of the template model using ablated proportions of our dataset, and evaluated the relative benefit of those templates for the final models. Finally, we trained a version of the template model using an out-of-domain brain cancer dataset, and evaluated the relevant benefit of that template for the final models. We used five-fold cross-validation (CV) for all training and evaluation across our entire dataset. RESULTS: Our model achieves state-of-the-art performance on our large dataset (mean overall Dice 0.916, average Hausdorff distance 0.135 across CV folds). Using this model as a pre-trained template for refining on two external datasets significantly enhanced performance (30% and 49% enhancement in Dice scores respectively). Mean overall Dice and mean average Hausdorff distance were 0.912 and 0.15 for the ProstateX-2 dataset, and 0.852 and 0.581 for the PROMISE12 dataset. Using even small quantities of data to train the template enhanced performance, with significant improvements using 5% or more of the data. CONCLUSION: We trained a state-of-the-art model using unrefined clinical prostate annotations and found that its use as a template model significantly improved performance in other prostate segmentation tasks, even when trained with only 5% of the original dataset. Public Library of Science 2021-06-25 /pmc/articles/PMC8232529/ /pubmed/34170972 http://dx.doi.org/10.1371/journal.pone.0253829 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle	Research Article Sarma, Karthik V. Raman, Alex G. Dhinagar, Nikhil J. Priester, Alan M. Harmon, Stephanie Sanford, Thomas Mehralivand, Sherif Turkbey, Baris Marks, Leonard S. Raman, Steven S. Speier, William Arnold, Corey W. Harnessing clinical annotations to improve deep learning performance in prostate segmentation
title	Harnessing clinical annotations to improve deep learning performance in prostate segmentation
title_full	Harnessing clinical annotations to improve deep learning performance in prostate segmentation
title_fullStr	Harnessing clinical annotations to improve deep learning performance in prostate segmentation
title_full_unstemmed	Harnessing clinical annotations to improve deep learning performance in prostate segmentation
title_short	Harnessing clinical annotations to improve deep learning performance in prostate segmentation
title_sort	harnessing clinical annotations to improve deep learning performance in prostate segmentation
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8232529/ https://www.ncbi.nlm.nih.gov/pubmed/34170972 http://dx.doi.org/10.1371/journal.pone.0253829
work_keys_str_mv	AT sarmakarthikv harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation AT ramanalexg harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation AT dhinagarnikhilj harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation AT priesteralanm harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation AT harmonstephanie harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation AT sanfordthomas harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation AT mehralivandsherif harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation AT turkbeybaris harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation AT marksleonards harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation AT ramanstevens harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation AT speierwilliam harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation AT arnoldcoreyw harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation

Harnessing clinical annotations to improve deep learning performance in prostate segmentation

Ejemplares similares