Cargando…

Deep learning to automate the labelling of head MRI datasets for computer vision applications

OBJECTIVES: The purpose of this study was to build a deep learning model to derive labels from neuroradiology reports and assign these to the corresponding examinations, overcoming a bottleneck to computer vision model development. METHODS: Reference-standard labels were generated by a team of neuro...

Descripción completa

Detalles Bibliográficos
Autores principales: Wood, David A., Kafiabadi, Sina, Al Busaidi, Aisha, Guilhem, Emily L., Lynch, Jeremy, Townend, Matthew K., Montvila, Antanas, Kiik, Martin, Siddiqui, Juveria, Gadapa, Naveen, Benger, Matthew D., Mazumder, Asif, Barker, Gareth, Ourselin, Sebastian, Cole, James H., Booth, Thomas C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8660736/
https://www.ncbi.nlm.nih.gov/pubmed/34286375
http://dx.doi.org/10.1007/s00330-021-08132-0
_version_ 1784613251132162048
author Wood, David A.
Kafiabadi, Sina
Al Busaidi, Aisha
Guilhem, Emily L.
Lynch, Jeremy
Townend, Matthew K.
Montvila, Antanas
Kiik, Martin
Siddiqui, Juveria
Gadapa, Naveen
Benger, Matthew D.
Mazumder, Asif
Barker, Gareth
Ourselin, Sebastian
Cole, James H.
Booth, Thomas C.
author_facet Wood, David A.
Kafiabadi, Sina
Al Busaidi, Aisha
Guilhem, Emily L.
Lynch, Jeremy
Townend, Matthew K.
Montvila, Antanas
Kiik, Martin
Siddiqui, Juveria
Gadapa, Naveen
Benger, Matthew D.
Mazumder, Asif
Barker, Gareth
Ourselin, Sebastian
Cole, James H.
Booth, Thomas C.
author_sort Wood, David A.
collection PubMed
description OBJECTIVES: The purpose of this study was to build a deep learning model to derive labels from neuroradiology reports and assign these to the corresponding examinations, overcoming a bottleneck to computer vision model development. METHODS: Reference-standard labels were generated by a team of neuroradiologists for model training and evaluation. Three thousand examinations were labelled for the presence or absence of any abnormality by manually scrutinising the corresponding radiology reports (‘reference-standard report labels’); a subset of these examinations (n = 250) were assigned ‘reference-standard image labels’ by interrogating the actual images. Separately, 2000 reports were labelled for the presence or absence of 7 specialised categories of abnormality (acute stroke, mass, atrophy, vascular abnormality, small vessel disease, white matter inflammation, encephalomalacia), with a subset of these examinations (n = 700) also assigned reference-standard image labels. A deep learning model was trained using labelled reports and validated in two ways: comparing predicted labels to (i) reference-standard report labels and (ii) reference-standard image labels. The area under the receiver operating characteristic curve (AUC-ROC) was used to quantify model performance. Accuracy, sensitivity, specificity, and F1 score were also calculated. RESULTS: Accurate classification (AUC-ROC > 0.95) was achieved for all categories when tested against reference-standard report labels. A drop in performance (ΔAUC-ROC > 0.02) was seen for three categories (atrophy, encephalomalacia, vascular) when tested against reference-standard image labels, highlighting discrepancies in the original reports. Once trained, the model assigned labels to 121,556 examinations in under 30 min. CONCLUSIONS: Our model accurately classifies head MRI examinations, enabling automated dataset labelling for downstream computer vision applications. KEY POINTS: • Deep learning is poised to revolutionise image recognition tasks in radiology; however, a barrier to clinical adoption is the difficulty of obtaining large labelled datasets for model training. • We demonstrate a deep learning model which can derive labels from neuroradiology reports and assign these to the corresponding examinations at scale, facilitating the development of downstream computer vision models. • We rigorously tested our model by comparing labels predicted on the basis of neuroradiology reports with two sets of reference-standard labels: (1) labels derived by manually scrutinising each radiology report and (2) labels derived by interrogating the actual images. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s00330-021-08132-0.
format Online
Article
Text
id pubmed-8660736
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-86607362021-12-27 Deep learning to automate the labelling of head MRI datasets for computer vision applications Wood, David A. Kafiabadi, Sina Al Busaidi, Aisha Guilhem, Emily L. Lynch, Jeremy Townend, Matthew K. Montvila, Antanas Kiik, Martin Siddiqui, Juveria Gadapa, Naveen Benger, Matthew D. Mazumder, Asif Barker, Gareth Ourselin, Sebastian Cole, James H. Booth, Thomas C. Eur Radiol Imaging Informatics and Artificial Intelligence OBJECTIVES: The purpose of this study was to build a deep learning model to derive labels from neuroradiology reports and assign these to the corresponding examinations, overcoming a bottleneck to computer vision model development. METHODS: Reference-standard labels were generated by a team of neuroradiologists for model training and evaluation. Three thousand examinations were labelled for the presence or absence of any abnormality by manually scrutinising the corresponding radiology reports (‘reference-standard report labels’); a subset of these examinations (n = 250) were assigned ‘reference-standard image labels’ by interrogating the actual images. Separately, 2000 reports were labelled for the presence or absence of 7 specialised categories of abnormality (acute stroke, mass, atrophy, vascular abnormality, small vessel disease, white matter inflammation, encephalomalacia), with a subset of these examinations (n = 700) also assigned reference-standard image labels. A deep learning model was trained using labelled reports and validated in two ways: comparing predicted labels to (i) reference-standard report labels and (ii) reference-standard image labels. The area under the receiver operating characteristic curve (AUC-ROC) was used to quantify model performance. Accuracy, sensitivity, specificity, and F1 score were also calculated. RESULTS: Accurate classification (AUC-ROC > 0.95) was achieved for all categories when tested against reference-standard report labels. A drop in performance (ΔAUC-ROC > 0.02) was seen for three categories (atrophy, encephalomalacia, vascular) when tested against reference-standard image labels, highlighting discrepancies in the original reports. Once trained, the model assigned labels to 121,556 examinations in under 30 min. CONCLUSIONS: Our model accurately classifies head MRI examinations, enabling automated dataset labelling for downstream computer vision applications. KEY POINTS: • Deep learning is poised to revolutionise image recognition tasks in radiology; however, a barrier to clinical adoption is the difficulty of obtaining large labelled datasets for model training. • We demonstrate a deep learning model which can derive labels from neuroradiology reports and assign these to the corresponding examinations at scale, facilitating the development of downstream computer vision models. • We rigorously tested our model by comparing labels predicted on the basis of neuroradiology reports with two sets of reference-standard labels: (1) labels derived by manually scrutinising each radiology report and (2) labels derived by interrogating the actual images. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s00330-021-08132-0. Springer Berlin Heidelberg 2021-07-20 2022 /pmc/articles/PMC8660736/ /pubmed/34286375 http://dx.doi.org/10.1007/s00330-021-08132-0 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Imaging Informatics and Artificial Intelligence
Wood, David A.
Kafiabadi, Sina
Al Busaidi, Aisha
Guilhem, Emily L.
Lynch, Jeremy
Townend, Matthew K.
Montvila, Antanas
Kiik, Martin
Siddiqui, Juveria
Gadapa, Naveen
Benger, Matthew D.
Mazumder, Asif
Barker, Gareth
Ourselin, Sebastian
Cole, James H.
Booth, Thomas C.
Deep learning to automate the labelling of head MRI datasets for computer vision applications
title Deep learning to automate the labelling of head MRI datasets for computer vision applications
title_full Deep learning to automate the labelling of head MRI datasets for computer vision applications
title_fullStr Deep learning to automate the labelling of head MRI datasets for computer vision applications
title_full_unstemmed Deep learning to automate the labelling of head MRI datasets for computer vision applications
title_short Deep learning to automate the labelling of head MRI datasets for computer vision applications
title_sort deep learning to automate the labelling of head mri datasets for computer vision applications
topic Imaging Informatics and Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8660736/
https://www.ncbi.nlm.nih.gov/pubmed/34286375
http://dx.doi.org/10.1007/s00330-021-08132-0
work_keys_str_mv AT wooddavida deeplearningtoautomatethelabellingofheadmridatasetsforcomputervisionapplications
AT kafiabadisina deeplearningtoautomatethelabellingofheadmridatasetsforcomputervisionapplications
AT albusaidiaisha deeplearningtoautomatethelabellingofheadmridatasetsforcomputervisionapplications
AT guilhememilyl deeplearningtoautomatethelabellingofheadmridatasetsforcomputervisionapplications
AT lynchjeremy deeplearningtoautomatethelabellingofheadmridatasetsforcomputervisionapplications
AT townendmatthewk deeplearningtoautomatethelabellingofheadmridatasetsforcomputervisionapplications
AT montvilaantanas deeplearningtoautomatethelabellingofheadmridatasetsforcomputervisionapplications
AT kiikmartin deeplearningtoautomatethelabellingofheadmridatasetsforcomputervisionapplications
AT siddiquijuveria deeplearningtoautomatethelabellingofheadmridatasetsforcomputervisionapplications
AT gadapanaveen deeplearningtoautomatethelabellingofheadmridatasetsforcomputervisionapplications
AT bengermatthewd deeplearningtoautomatethelabellingofheadmridatasetsforcomputervisionapplications
AT mazumderasif deeplearningtoautomatethelabellingofheadmridatasetsforcomputervisionapplications
AT barkergareth deeplearningtoautomatethelabellingofheadmridatasetsforcomputervisionapplications
AT ourselinsebastian deeplearningtoautomatethelabellingofheadmridatasetsforcomputervisionapplications
AT colejamesh deeplearningtoautomatethelabellingofheadmridatasetsforcomputervisionapplications
AT booththomasc deeplearningtoautomatethelabellingofheadmridatasetsforcomputervisionapplications