Cargando…
VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography
Mammography, or breast X-ray imaging, is the most widely used imaging modality to detect cancer and other breast diseases. Recent studies have shown that deep learning-based computer-assisted detection and diagnosis (CADe/x) tools have been developed to support physicians and improve the accuracy of...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10182079/ https://www.ncbi.nlm.nih.gov/pubmed/37173336 http://dx.doi.org/10.1038/s41597-023-02100-7 |
_version_ | 1785041713670127616 |
---|---|
author | Nguyen, Hieu T. Nguyen, Ha Q. Pham, Hieu H. Lam, Khanh Le, Linh T. Dao, Minh Vu, Van |
author_facet | Nguyen, Hieu T. Nguyen, Ha Q. Pham, Hieu H. Lam, Khanh Le, Linh T. Dao, Minh Vu, Van |
author_sort | Nguyen, Hieu T. |
collection | PubMed |
description | Mammography, or breast X-ray imaging, is the most widely used imaging modality to detect cancer and other breast diseases. Recent studies have shown that deep learning-based computer-assisted detection and diagnosis (CADe/x) tools have been developed to support physicians and improve the accuracy of interpreting mammography. A number of large-scale mammography datasets from different populations with various associated annotations and clinical data have been introduced to study the potential of learning-based methods in the field of breast radiology. With the aim to develop more robust and more interpretable support systems in breast imaging, we introduce VinDr-Mammo, a Vietnamese dataset of digital mammography with breast-level assessment and extensive lesion-level annotations, enhancing the diversity of the publicly available mammography data. The dataset consists of 5,000 mammography exams, each of which has four standard views and is double read with disagreement (if any) being resolved by arbitration. The purpose of this dataset is to assess Breast Imaging Reporting and Data System (BI-RADS) and breast density at the individual breast level. In addition, the dataset also provides the category, location, and BI-RADS assessment of non-benign findings. We make VinDr-Mammo publicly available as a new imaging resource to promote advances in developing CADe/x tools for mammography interpretation. |
format | Online Article Text |
id | pubmed-10182079 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-101820792023-05-14 VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography Nguyen, Hieu T. Nguyen, Ha Q. Pham, Hieu H. Lam, Khanh Le, Linh T. Dao, Minh Vu, Van Sci Data Data Descriptor Mammography, or breast X-ray imaging, is the most widely used imaging modality to detect cancer and other breast diseases. Recent studies have shown that deep learning-based computer-assisted detection and diagnosis (CADe/x) tools have been developed to support physicians and improve the accuracy of interpreting mammography. A number of large-scale mammography datasets from different populations with various associated annotations and clinical data have been introduced to study the potential of learning-based methods in the field of breast radiology. With the aim to develop more robust and more interpretable support systems in breast imaging, we introduce VinDr-Mammo, a Vietnamese dataset of digital mammography with breast-level assessment and extensive lesion-level annotations, enhancing the diversity of the publicly available mammography data. The dataset consists of 5,000 mammography exams, each of which has four standard views and is double read with disagreement (if any) being resolved by arbitration. The purpose of this dataset is to assess Breast Imaging Reporting and Data System (BI-RADS) and breast density at the individual breast level. In addition, the dataset also provides the category, location, and BI-RADS assessment of non-benign findings. We make VinDr-Mammo publicly available as a new imaging resource to promote advances in developing CADe/x tools for mammography interpretation. Nature Publishing Group UK 2023-05-12 /pmc/articles/PMC10182079/ /pubmed/37173336 http://dx.doi.org/10.1038/s41597-023-02100-7 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Data Descriptor Nguyen, Hieu T. Nguyen, Ha Q. Pham, Hieu H. Lam, Khanh Le, Linh T. Dao, Minh Vu, Van VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography |
title | VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography |
title_full | VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography |
title_fullStr | VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography |
title_full_unstemmed | VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography |
title_short | VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography |
title_sort | vindr-mammo: a large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography |
topic | Data Descriptor |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10182079/ https://www.ncbi.nlm.nih.gov/pubmed/37173336 http://dx.doi.org/10.1038/s41597-023-02100-7 |
work_keys_str_mv | AT nguyenhieut vindrmammoalargescalebenchmarkdatasetforcomputeraideddiagnosisinfullfielddigitalmammography AT nguyenhaq vindrmammoalargescalebenchmarkdatasetforcomputeraideddiagnosisinfullfielddigitalmammography AT phamhieuh vindrmammoalargescalebenchmarkdatasetforcomputeraideddiagnosisinfullfielddigitalmammography AT lamkhanh vindrmammoalargescalebenchmarkdatasetforcomputeraideddiagnosisinfullfielddigitalmammography AT lelinht vindrmammoalargescalebenchmarkdatasetforcomputeraideddiagnosisinfullfielddigitalmammography AT daominh vindrmammoalargescalebenchmarkdatasetforcomputeraideddiagnosisinfullfielddigitalmammography AT vuvan vindrmammoalargescalebenchmarkdatasetforcomputeraideddiagnosisinfullfielddigitalmammography |