Cargando…
Development and Validation of a Deep Learning Algorithm for Gleason Grading of Prostate Cancer From Biopsy Specimens
IMPORTANCE: For prostate cancer, Gleason grading of the biopsy specimen plays a pivotal role in determining case management. However, Gleason grading is associated with substantial interobserver variability, resulting in a need for decision support tools to improve the reproducibility of Gleason gra...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Medical Association
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7378872/ https://www.ncbi.nlm.nih.gov/pubmed/32701148 http://dx.doi.org/10.1001/jamaoncol.2020.2485 |
_version_ | 1783562517158559744 |
---|---|
author | Nagpal, Kunal Foote, Davis Tan, Fraser Liu, Yun Chen, Po-Hsuan Cameron Steiner, David F. Manoj, Naren Olson, Niels Smith, Jenny L. Mohtashamian, Arash Peterson, Brandon Amin, Mahul B. Evans, Andrew J. Sweet, Joan W. Cheung, Carol van der Kwast, Theodorus Sangoi, Ankur R. Zhou, Ming Allan, Robert Humphrey, Peter A. Hipp, Jason D. Gadepalli, Krishna Corrado, Greg S. Peng, Lily H. Stumpe, Martin C. Mermel, Craig H. |
author_facet | Nagpal, Kunal Foote, Davis Tan, Fraser Liu, Yun Chen, Po-Hsuan Cameron Steiner, David F. Manoj, Naren Olson, Niels Smith, Jenny L. Mohtashamian, Arash Peterson, Brandon Amin, Mahul B. Evans, Andrew J. Sweet, Joan W. Cheung, Carol van der Kwast, Theodorus Sangoi, Ankur R. Zhou, Ming Allan, Robert Humphrey, Peter A. Hipp, Jason D. Gadepalli, Krishna Corrado, Greg S. Peng, Lily H. Stumpe, Martin C. Mermel, Craig H. |
author_sort | Nagpal, Kunal |
collection | PubMed |
description | IMPORTANCE: For prostate cancer, Gleason grading of the biopsy specimen plays a pivotal role in determining case management. However, Gleason grading is associated with substantial interobserver variability, resulting in a need for decision support tools to improve the reproducibility of Gleason grading in routine clinical practice. OBJECTIVE: To evaluate the ability of a deep learning system (DLS) to grade diagnostic prostate biopsy specimens. DESIGN, SETTING, AND PARTICIPANTS: The DLS was evaluated using 752 deidentified digitized images of formalin-fixed paraffin-embedded prostate needle core biopsy specimens obtained from 3 institutions in the United States, including 1 institution not used for DLS development. To obtain the Gleason grade group (GG), each specimen was first reviewed by 2 expert urologic subspecialists from a multi-institutional panel of 6 individuals (years of experience: mean, 25 years; range, 18-34 years). A third subspecialist reviewed discordant cases to arrive at a majority opinion. To reduce diagnostic uncertainty, all subspecialists had access to an immunohistochemical-stained section and 3 histologic sections for every biopsied specimen. Their review was conducted from December 2018 to June 2019. MAIN OUTCOMES AND MEASURES: The frequency of the exact agreement of the DLS with the majority opinion of the subspecialists in categorizing each tumor-containing specimen as 1 of 5 categories: nontumor, GG1, GG2, GG3, or GG4-5. For comparison, the rate of agreement of 19 general pathologists’ opinions with the subspecialists’ majority opinions was also evaluated. RESULTS: For grading tumor-containing biopsy specimens in the validation set (n = 498), the rate of agreement with subspecialists was significantly higher for the DLS (71.7%; 95% CI, 67.9%-75.3%) than for general pathologists (58.0%; 95% CI, 54.5%-61.4%) (P < .001). In subanalyses of biopsy specimens from an external validation set (n = 322), the Gleason grading performance of the DLS remained similar. For distinguishing nontumor from tumor-containing biopsy specimens (n = 752), the rate of agreement with subspecialists was 94.3% (95% CI, 92.4%-95.9%) for the DLS and similar at 94.7% (95% CI, 92.8%-96.3%) for general pathologists (P = .58). CONCLUSIONS AND RELEVANCE: In this study, the DLS showed higher proficiency than general pathologists at Gleason grading prostate needle core biopsy specimens and generalized to an independent institution. Future research is necessary to evaluate the potential utility of using the DLS as a decision support tool in clinical workflows and to improve the quality of prostate cancer grading for therapy decisions. |
format | Online Article Text |
id | pubmed-7378872 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | American Medical Association |
record_format | MEDLINE/PubMed |
spelling | pubmed-73788722020-07-27 Development and Validation of a Deep Learning Algorithm for Gleason Grading of Prostate Cancer From Biopsy Specimens Nagpal, Kunal Foote, Davis Tan, Fraser Liu, Yun Chen, Po-Hsuan Cameron Steiner, David F. Manoj, Naren Olson, Niels Smith, Jenny L. Mohtashamian, Arash Peterson, Brandon Amin, Mahul B. Evans, Andrew J. Sweet, Joan W. Cheung, Carol van der Kwast, Theodorus Sangoi, Ankur R. Zhou, Ming Allan, Robert Humphrey, Peter A. Hipp, Jason D. Gadepalli, Krishna Corrado, Greg S. Peng, Lily H. Stumpe, Martin C. Mermel, Craig H. JAMA Oncol Original Investigation IMPORTANCE: For prostate cancer, Gleason grading of the biopsy specimen plays a pivotal role in determining case management. However, Gleason grading is associated with substantial interobserver variability, resulting in a need for decision support tools to improve the reproducibility of Gleason grading in routine clinical practice. OBJECTIVE: To evaluate the ability of a deep learning system (DLS) to grade diagnostic prostate biopsy specimens. DESIGN, SETTING, AND PARTICIPANTS: The DLS was evaluated using 752 deidentified digitized images of formalin-fixed paraffin-embedded prostate needle core biopsy specimens obtained from 3 institutions in the United States, including 1 institution not used for DLS development. To obtain the Gleason grade group (GG), each specimen was first reviewed by 2 expert urologic subspecialists from a multi-institutional panel of 6 individuals (years of experience: mean, 25 years; range, 18-34 years). A third subspecialist reviewed discordant cases to arrive at a majority opinion. To reduce diagnostic uncertainty, all subspecialists had access to an immunohistochemical-stained section and 3 histologic sections for every biopsied specimen. Their review was conducted from December 2018 to June 2019. MAIN OUTCOMES AND MEASURES: The frequency of the exact agreement of the DLS with the majority opinion of the subspecialists in categorizing each tumor-containing specimen as 1 of 5 categories: nontumor, GG1, GG2, GG3, or GG4-5. For comparison, the rate of agreement of 19 general pathologists’ opinions with the subspecialists’ majority opinions was also evaluated. RESULTS: For grading tumor-containing biopsy specimens in the validation set (n = 498), the rate of agreement with subspecialists was significantly higher for the DLS (71.7%; 95% CI, 67.9%-75.3%) than for general pathologists (58.0%; 95% CI, 54.5%-61.4%) (P < .001). In subanalyses of biopsy specimens from an external validation set (n = 322), the Gleason grading performance of the DLS remained similar. For distinguishing nontumor from tumor-containing biopsy specimens (n = 752), the rate of agreement with subspecialists was 94.3% (95% CI, 92.4%-95.9%) for the DLS and similar at 94.7% (95% CI, 92.8%-96.3%) for general pathologists (P = .58). CONCLUSIONS AND RELEVANCE: In this study, the DLS showed higher proficiency than general pathologists at Gleason grading prostate needle core biopsy specimens and generalized to an independent institution. Future research is necessary to evaluate the potential utility of using the DLS as a decision support tool in clinical workflows and to improve the quality of prostate cancer grading for therapy decisions. American Medical Association 2020-09 2020-07-23 /pmc/articles/PMC7378872/ /pubmed/32701148 http://dx.doi.org/10.1001/jamaoncol.2020.2485 Text en Copyright 2020 Nagpal K et al. JAMA Oncology. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article distributed under the terms of the CC-BY-NC-ND License. |
spellingShingle | Original Investigation Nagpal, Kunal Foote, Davis Tan, Fraser Liu, Yun Chen, Po-Hsuan Cameron Steiner, David F. Manoj, Naren Olson, Niels Smith, Jenny L. Mohtashamian, Arash Peterson, Brandon Amin, Mahul B. Evans, Andrew J. Sweet, Joan W. Cheung, Carol van der Kwast, Theodorus Sangoi, Ankur R. Zhou, Ming Allan, Robert Humphrey, Peter A. Hipp, Jason D. Gadepalli, Krishna Corrado, Greg S. Peng, Lily H. Stumpe, Martin C. Mermel, Craig H. Development and Validation of a Deep Learning Algorithm for Gleason Grading of Prostate Cancer From Biopsy Specimens |
title | Development and Validation of a Deep Learning Algorithm for Gleason Grading of Prostate Cancer From Biopsy Specimens |
title_full | Development and Validation of a Deep Learning Algorithm for Gleason Grading of Prostate Cancer From Biopsy Specimens |
title_fullStr | Development and Validation of a Deep Learning Algorithm for Gleason Grading of Prostate Cancer From Biopsy Specimens |
title_full_unstemmed | Development and Validation of a Deep Learning Algorithm for Gleason Grading of Prostate Cancer From Biopsy Specimens |
title_short | Development and Validation of a Deep Learning Algorithm for Gleason Grading of Prostate Cancer From Biopsy Specimens |
title_sort | development and validation of a deep learning algorithm for gleason grading of prostate cancer from biopsy specimens |
topic | Original Investigation |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7378872/ https://www.ncbi.nlm.nih.gov/pubmed/32701148 http://dx.doi.org/10.1001/jamaoncol.2020.2485 |
work_keys_str_mv | AT nagpalkunal developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT footedavis developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT tanfraser developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT liuyun developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT chenpohsuancameron developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT steinerdavidf developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT manojnaren developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT olsonniels developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT smithjennyl developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT mohtashamianarash developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT petersonbrandon developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT aminmahulb developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT evansandrewj developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT sweetjoanw developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT cheungcarol developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT vanderkwasttheodorus developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT sangoiankurr developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT zhouming developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT allanrobert developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT humphreypetera developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT hippjasond developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT gadepallikrishna developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT corradogregs developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT penglilyh developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT stumpemartinc developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens AT mermelcraigh developmentandvalidationofadeeplearningalgorithmforgleasongradingofprostatecancerfrombiopsyspecimens |