Cargando…

An overview of the National COVID-19 Chest Imaging Database: data quality and cohort analysis

BACKGROUND: The National COVID-19 Chest Imaging Database (NCCID) is a centralized database containing mainly chest X-rays and computed tomography scans from patients across the UK. The objective of the initiative is to support a better understanding of the coronavirus SARS-CoV-2 disease (COVID-19) a...

Descripción completa

Detalles Bibliográficos
Autores principales: Cushnan, Dominic, Bennett, Oscar, Berka, Rosalind, Bertolli, Ottavia, Chopra, Ashwin, Dorgham, Samie, Favaro, Alberto, Ganepola, Tara, Halling-Brown, Mark, Imreh, Gergely, Jacob, Joseph, Jefferson, Emily, Lemarchand, François, Schofield, Daniel, Wyatt, Jeremy C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8633457/
https://www.ncbi.nlm.nih.gov/pubmed/34849869
http://dx.doi.org/10.1093/gigascience/giab076
_version_ 1784607931173437440
author Cushnan, Dominic
Bennett, Oscar
Berka, Rosalind
Bertolli, Ottavia
Chopra, Ashwin
Dorgham, Samie
Favaro, Alberto
Ganepola, Tara
Halling-Brown, Mark
Imreh, Gergely
Jacob, Joseph
Jefferson, Emily
Lemarchand, François
Schofield, Daniel
Wyatt, Jeremy C
author_facet Cushnan, Dominic
Bennett, Oscar
Berka, Rosalind
Bertolli, Ottavia
Chopra, Ashwin
Dorgham, Samie
Favaro, Alberto
Ganepola, Tara
Halling-Brown, Mark
Imreh, Gergely
Jacob, Joseph
Jefferson, Emily
Lemarchand, François
Schofield, Daniel
Wyatt, Jeremy C
author_sort Cushnan, Dominic
collection PubMed
description BACKGROUND: The National COVID-19 Chest Imaging Database (NCCID) is a centralized database containing mainly chest X-rays and computed tomography scans from patients across the UK. The objective of the initiative is to support a better understanding of the coronavirus SARS-CoV-2 disease (COVID-19) and the development of machine learning technologies that will improve care for patients hospitalized with a severe COVID-19 infection. This article introduces the training dataset, including a snapshot analysis covering the completeness of clinical data, and availability of image data for the various use-cases (diagnosis, prognosis, longitudinal risk). An additional cohort analysis measures how well the NCCID represents the wider COVID-19–affected UK population in terms of geographic, demographic, and temporal coverage. FINDINGS: The NCCID offers high-quality DICOM images acquired across a variety of imaging machinery; multiple time points including historical images are available for a subset of patients. This volume and variety make the database well suited to development of diagnostic/prognostic models for COVID-associated respiratory conditions. Historical images and clinical data may aid long-term risk stratification, particularly as availability of comorbidity data increases through linkage to other resources. The cohort analysis revealed good alignment to general UK COVID-19 statistics for some categories, e.g., sex, whilst identifying areas for improvements to data collection methods, particularly geographic coverage. CONCLUSION: The NCCID is a growing resource that provides researchers with a large, high-quality database that can be leveraged both to support the response to the COVID-19 pandemic and as a test bed for building clinically viable medical imaging models.
format Online
Article
Text
id pubmed-8633457
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-86334572021-12-02 An overview of the National COVID-19 Chest Imaging Database: data quality and cohort analysis Cushnan, Dominic Bennett, Oscar Berka, Rosalind Bertolli, Ottavia Chopra, Ashwin Dorgham, Samie Favaro, Alberto Ganepola, Tara Halling-Brown, Mark Imreh, Gergely Jacob, Joseph Jefferson, Emily Lemarchand, François Schofield, Daniel Wyatt, Jeremy C Gigascience Data Note BACKGROUND: The National COVID-19 Chest Imaging Database (NCCID) is a centralized database containing mainly chest X-rays and computed tomography scans from patients across the UK. The objective of the initiative is to support a better understanding of the coronavirus SARS-CoV-2 disease (COVID-19) and the development of machine learning technologies that will improve care for patients hospitalized with a severe COVID-19 infection. This article introduces the training dataset, including a snapshot analysis covering the completeness of clinical data, and availability of image data for the various use-cases (diagnosis, prognosis, longitudinal risk). An additional cohort analysis measures how well the NCCID represents the wider COVID-19–affected UK population in terms of geographic, demographic, and temporal coverage. FINDINGS: The NCCID offers high-quality DICOM images acquired across a variety of imaging machinery; multiple time points including historical images are available for a subset of patients. This volume and variety make the database well suited to development of diagnostic/prognostic models for COVID-associated respiratory conditions. Historical images and clinical data may aid long-term risk stratification, particularly as availability of comorbidity data increases through linkage to other resources. The cohort analysis revealed good alignment to general UK COVID-19 statistics for some categories, e.g., sex, whilst identifying areas for improvements to data collection methods, particularly geographic coverage. CONCLUSION: The NCCID is a growing resource that provides researchers with a large, high-quality database that can be leveraged both to support the response to the COVID-19 pandemic and as a test bed for building clinically viable medical imaging models. Oxford University Press 2021-11-25 /pmc/articles/PMC8633457/ /pubmed/34849869 http://dx.doi.org/10.1093/gigascience/giab076 Text en © The Author(s) 2021. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Data Note
Cushnan, Dominic
Bennett, Oscar
Berka, Rosalind
Bertolli, Ottavia
Chopra, Ashwin
Dorgham, Samie
Favaro, Alberto
Ganepola, Tara
Halling-Brown, Mark
Imreh, Gergely
Jacob, Joseph
Jefferson, Emily
Lemarchand, François
Schofield, Daniel
Wyatt, Jeremy C
An overview of the National COVID-19 Chest Imaging Database: data quality and cohort analysis
title An overview of the National COVID-19 Chest Imaging Database: data quality and cohort analysis
title_full An overview of the National COVID-19 Chest Imaging Database: data quality and cohort analysis
title_fullStr An overview of the National COVID-19 Chest Imaging Database: data quality and cohort analysis
title_full_unstemmed An overview of the National COVID-19 Chest Imaging Database: data quality and cohort analysis
title_short An overview of the National COVID-19 Chest Imaging Database: data quality and cohort analysis
title_sort overview of the national covid-19 chest imaging database: data quality and cohort analysis
topic Data Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8633457/
https://www.ncbi.nlm.nih.gov/pubmed/34849869
http://dx.doi.org/10.1093/gigascience/giab076
work_keys_str_mv AT cushnandominic anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT bennettoscar anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT berkarosalind anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT bertolliottavia anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT chopraashwin anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT dorghamsamie anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT favaroalberto anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT ganepolatara anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT hallingbrownmark anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT imrehgergely anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT jacobjoseph anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT jeffersonemily anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT lemarchandfrancois anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT schofielddaniel anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT wyattjeremyc anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT cushnandominic overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT bennettoscar overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT berkarosalind overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT bertolliottavia overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT chopraashwin overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT dorghamsamie overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT favaroalberto overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT ganepolatara overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT hallingbrownmark overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT imrehgergely overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT jacobjoseph overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT jeffersonemily overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT lemarchandfrancois overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT schofielddaniel overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT wyattjeremyc overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis
AT overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis