Cargando…
An overview of the National COVID-19 Chest Imaging Database: data quality and cohort analysis
BACKGROUND: The National COVID-19 Chest Imaging Database (NCCID) is a centralized database containing mainly chest X-rays and computed tomography scans from patients across the UK. The objective of the initiative is to support a better understanding of the coronavirus SARS-CoV-2 disease (COVID-19) a...
Autores principales: | , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8633457/ https://www.ncbi.nlm.nih.gov/pubmed/34849869 http://dx.doi.org/10.1093/gigascience/giab076 |
_version_ | 1784607931173437440 |
---|---|
author | Cushnan, Dominic Bennett, Oscar Berka, Rosalind Bertolli, Ottavia Chopra, Ashwin Dorgham, Samie Favaro, Alberto Ganepola, Tara Halling-Brown, Mark Imreh, Gergely Jacob, Joseph Jefferson, Emily Lemarchand, François Schofield, Daniel Wyatt, Jeremy C |
author_facet | Cushnan, Dominic Bennett, Oscar Berka, Rosalind Bertolli, Ottavia Chopra, Ashwin Dorgham, Samie Favaro, Alberto Ganepola, Tara Halling-Brown, Mark Imreh, Gergely Jacob, Joseph Jefferson, Emily Lemarchand, François Schofield, Daniel Wyatt, Jeremy C |
author_sort | Cushnan, Dominic |
collection | PubMed |
description | BACKGROUND: The National COVID-19 Chest Imaging Database (NCCID) is a centralized database containing mainly chest X-rays and computed tomography scans from patients across the UK. The objective of the initiative is to support a better understanding of the coronavirus SARS-CoV-2 disease (COVID-19) and the development of machine learning technologies that will improve care for patients hospitalized with a severe COVID-19 infection. This article introduces the training dataset, including a snapshot analysis covering the completeness of clinical data, and availability of image data for the various use-cases (diagnosis, prognosis, longitudinal risk). An additional cohort analysis measures how well the NCCID represents the wider COVID-19–affected UK population in terms of geographic, demographic, and temporal coverage. FINDINGS: The NCCID offers high-quality DICOM images acquired across a variety of imaging machinery; multiple time points including historical images are available for a subset of patients. This volume and variety make the database well suited to development of diagnostic/prognostic models for COVID-associated respiratory conditions. Historical images and clinical data may aid long-term risk stratification, particularly as availability of comorbidity data increases through linkage to other resources. The cohort analysis revealed good alignment to general UK COVID-19 statistics for some categories, e.g., sex, whilst identifying areas for improvements to data collection methods, particularly geographic coverage. CONCLUSION: The NCCID is a growing resource that provides researchers with a large, high-quality database that can be leveraged both to support the response to the COVID-19 pandemic and as a test bed for building clinically viable medical imaging models. |
format | Online Article Text |
id | pubmed-8633457 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-86334572021-12-02 An overview of the National COVID-19 Chest Imaging Database: data quality and cohort analysis Cushnan, Dominic Bennett, Oscar Berka, Rosalind Bertolli, Ottavia Chopra, Ashwin Dorgham, Samie Favaro, Alberto Ganepola, Tara Halling-Brown, Mark Imreh, Gergely Jacob, Joseph Jefferson, Emily Lemarchand, François Schofield, Daniel Wyatt, Jeremy C Gigascience Data Note BACKGROUND: The National COVID-19 Chest Imaging Database (NCCID) is a centralized database containing mainly chest X-rays and computed tomography scans from patients across the UK. The objective of the initiative is to support a better understanding of the coronavirus SARS-CoV-2 disease (COVID-19) and the development of machine learning technologies that will improve care for patients hospitalized with a severe COVID-19 infection. This article introduces the training dataset, including a snapshot analysis covering the completeness of clinical data, and availability of image data for the various use-cases (diagnosis, prognosis, longitudinal risk). An additional cohort analysis measures how well the NCCID represents the wider COVID-19–affected UK population in terms of geographic, demographic, and temporal coverage. FINDINGS: The NCCID offers high-quality DICOM images acquired across a variety of imaging machinery; multiple time points including historical images are available for a subset of patients. This volume and variety make the database well suited to development of diagnostic/prognostic models for COVID-associated respiratory conditions. Historical images and clinical data may aid long-term risk stratification, particularly as availability of comorbidity data increases through linkage to other resources. The cohort analysis revealed good alignment to general UK COVID-19 statistics for some categories, e.g., sex, whilst identifying areas for improvements to data collection methods, particularly geographic coverage. CONCLUSION: The NCCID is a growing resource that provides researchers with a large, high-quality database that can be leveraged both to support the response to the COVID-19 pandemic and as a test bed for building clinically viable medical imaging models. Oxford University Press 2021-11-25 /pmc/articles/PMC8633457/ /pubmed/34849869 http://dx.doi.org/10.1093/gigascience/giab076 Text en © The Author(s) 2021. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Data Note Cushnan, Dominic Bennett, Oscar Berka, Rosalind Bertolli, Ottavia Chopra, Ashwin Dorgham, Samie Favaro, Alberto Ganepola, Tara Halling-Brown, Mark Imreh, Gergely Jacob, Joseph Jefferson, Emily Lemarchand, François Schofield, Daniel Wyatt, Jeremy C An overview of the National COVID-19 Chest Imaging Database: data quality and cohort analysis |
title | An overview of the National COVID-19 Chest Imaging Database: data quality and
cohort analysis |
title_full | An overview of the National COVID-19 Chest Imaging Database: data quality and
cohort analysis |
title_fullStr | An overview of the National COVID-19 Chest Imaging Database: data quality and
cohort analysis |
title_full_unstemmed | An overview of the National COVID-19 Chest Imaging Database: data quality and
cohort analysis |
title_short | An overview of the National COVID-19 Chest Imaging Database: data quality and
cohort analysis |
title_sort | overview of the national covid-19 chest imaging database: data quality and
cohort analysis |
topic | Data Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8633457/ https://www.ncbi.nlm.nih.gov/pubmed/34849869 http://dx.doi.org/10.1093/gigascience/giab076 |
work_keys_str_mv | AT cushnandominic anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT bennettoscar anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT berkarosalind anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT bertolliottavia anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT chopraashwin anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT dorghamsamie anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT favaroalberto anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT ganepolatara anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT hallingbrownmark anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT imrehgergely anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT jacobjoseph anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT jeffersonemily anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT lemarchandfrancois anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT schofielddaniel anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT wyattjeremyc anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT anoverviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT cushnandominic overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT bennettoscar overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT berkarosalind overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT bertolliottavia overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT chopraashwin overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT dorghamsamie overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT favaroalberto overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT ganepolatara overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT hallingbrownmark overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT imrehgergely overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT jacobjoseph overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT jeffersonemily overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT lemarchandfrancois overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT schofielddaniel overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT wyattjeremyc overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis AT overviewofthenationalcovid19chestimagingdatabasedataqualityandcohortanalysis |