Cargando…

BRACS: A Dataset for BReAst Carcinoma Subtyping in H&E Histology Images

Breast cancer is the most commonly diagnosed cancer and registers the highest number of deaths for women. Advances in diagnostic activities combined with large-scale screening policies have significantly lowered the mortality rates for breast cancer patients. However, the manual inspection of tissue...

Descripción completa

Detalles Bibliográficos
Autores principales: Brancati, Nadia, Anniciello, Anna Maria, Pati, Pushpak, Riccio, Daniel, Scognamiglio, Giosuè, Jaume, Guillaume, De Pietro, Giuseppe, Di Bonito, Maurizio, Foncubierta, Antonio, Botti, Gerardo, Gabrani, Maria, Feroce, Florinda, Frucci, Maria
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9575967/
https://www.ncbi.nlm.nih.gov/pubmed/36251776
http://dx.doi.org/10.1093/database/baac093
_version_ 1784811430674956288
author Brancati, Nadia
Anniciello, Anna Maria
Pati, Pushpak
Riccio, Daniel
Scognamiglio, Giosuè
Jaume, Guillaume
De Pietro, Giuseppe
Di Bonito, Maurizio
Foncubierta, Antonio
Botti, Gerardo
Gabrani, Maria
Feroce, Florinda
Frucci, Maria
author_facet Brancati, Nadia
Anniciello, Anna Maria
Pati, Pushpak
Riccio, Daniel
Scognamiglio, Giosuè
Jaume, Guillaume
De Pietro, Giuseppe
Di Bonito, Maurizio
Foncubierta, Antonio
Botti, Gerardo
Gabrani, Maria
Feroce, Florinda
Frucci, Maria
author_sort Brancati, Nadia
collection PubMed
description Breast cancer is the most commonly diagnosed cancer and registers the highest number of deaths for women. Advances in diagnostic activities combined with large-scale screening policies have significantly lowered the mortality rates for breast cancer patients. However, the manual inspection of tissue slides by pathologists is cumbersome, time-consuming and is subject to significant inter- and intra-observer variability. Recently, the advent of whole-slide scanning systems has empowered the rapid digitization of pathology slides and enabled the development of Artificial Intelligence (AI)-assisted digital workflows. However, AI techniques, especially Deep Learning, require a large amount of high-quality annotated data to learn from. Constructing such task-specific datasets poses several challenges, such as data-acquisition level constraints, time-consuming and expensive annotations and anonymization of patient information. In this paper, we introduce the BReAst Carcinoma Subtyping (BRACS) dataset, a large cohort of annotated Hematoxylin and Eosin (H&E)-stained images to advance AI development in the automatic characterization of breast lesions. BRACS contains 547 Whole-Slide Images (WSIs) and 4539 Regions Of Interest (ROIs) extracted from the WSIs. Each WSI and respective ROIs are annotated by the consensus of three board-certified pathologists into different lesion categories. Specifically, BRACS includes three lesion types, i.e., benign, malignant and atypical, which are further subtyped into seven categories. It is, to the best of our knowledge, the largest annotated dataset for breast cancer subtyping both at WSI and ROI levels. Furthermore, by including the understudied atypical lesions, BRACS offers a unique opportunity for leveraging AI to better understand their characteristics. We encourage AI practitioners to develop and evaluate novel algorithms on the BRACS dataset to further breast cancer diagnosis and patient care. Database URL: https://www.bracs.icar.cnr.it/
format Online
Article
Text
id pubmed-9575967
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-95759672022-10-19 BRACS: A Dataset for BReAst Carcinoma Subtyping in H&E Histology Images Brancati, Nadia Anniciello, Anna Maria Pati, Pushpak Riccio, Daniel Scognamiglio, Giosuè Jaume, Guillaume De Pietro, Giuseppe Di Bonito, Maurizio Foncubierta, Antonio Botti, Gerardo Gabrani, Maria Feroce, Florinda Frucci, Maria Database (Oxford) Database Update Breast cancer is the most commonly diagnosed cancer and registers the highest number of deaths for women. Advances in diagnostic activities combined with large-scale screening policies have significantly lowered the mortality rates for breast cancer patients. However, the manual inspection of tissue slides by pathologists is cumbersome, time-consuming and is subject to significant inter- and intra-observer variability. Recently, the advent of whole-slide scanning systems has empowered the rapid digitization of pathology slides and enabled the development of Artificial Intelligence (AI)-assisted digital workflows. However, AI techniques, especially Deep Learning, require a large amount of high-quality annotated data to learn from. Constructing such task-specific datasets poses several challenges, such as data-acquisition level constraints, time-consuming and expensive annotations and anonymization of patient information. In this paper, we introduce the BReAst Carcinoma Subtyping (BRACS) dataset, a large cohort of annotated Hematoxylin and Eosin (H&E)-stained images to advance AI development in the automatic characterization of breast lesions. BRACS contains 547 Whole-Slide Images (WSIs) and 4539 Regions Of Interest (ROIs) extracted from the WSIs. Each WSI and respective ROIs are annotated by the consensus of three board-certified pathologists into different lesion categories. Specifically, BRACS includes three lesion types, i.e., benign, malignant and atypical, which are further subtyped into seven categories. It is, to the best of our knowledge, the largest annotated dataset for breast cancer subtyping both at WSI and ROI levels. Furthermore, by including the understudied atypical lesions, BRACS offers a unique opportunity for leveraging AI to better understand their characteristics. We encourage AI practitioners to develop and evaluate novel algorithms on the BRACS dataset to further breast cancer diagnosis and patient care. Database URL: https://www.bracs.icar.cnr.it/ Oxford University Press 2022-10-17 /pmc/articles/PMC9575967/ /pubmed/36251776 http://dx.doi.org/10.1093/database/baac093 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Database Update
Brancati, Nadia
Anniciello, Anna Maria
Pati, Pushpak
Riccio, Daniel
Scognamiglio, Giosuè
Jaume, Guillaume
De Pietro, Giuseppe
Di Bonito, Maurizio
Foncubierta, Antonio
Botti, Gerardo
Gabrani, Maria
Feroce, Florinda
Frucci, Maria
BRACS: A Dataset for BReAst Carcinoma Subtyping in H&E Histology Images
title BRACS: A Dataset for BReAst Carcinoma Subtyping in H&E Histology Images
title_full BRACS: A Dataset for BReAst Carcinoma Subtyping in H&E Histology Images
title_fullStr BRACS: A Dataset for BReAst Carcinoma Subtyping in H&E Histology Images
title_full_unstemmed BRACS: A Dataset for BReAst Carcinoma Subtyping in H&E Histology Images
title_short BRACS: A Dataset for BReAst Carcinoma Subtyping in H&E Histology Images
title_sort bracs: a dataset for breast carcinoma subtyping in h&e histology images
topic Database Update
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9575967/
https://www.ncbi.nlm.nih.gov/pubmed/36251776
http://dx.doi.org/10.1093/database/baac093
work_keys_str_mv AT brancatinadia bracsadatasetforbreastcarcinomasubtypinginhehistologyimages
AT annicielloannamaria bracsadatasetforbreastcarcinomasubtypinginhehistologyimages
AT patipushpak bracsadatasetforbreastcarcinomasubtypinginhehistologyimages
AT ricciodaniel bracsadatasetforbreastcarcinomasubtypinginhehistologyimages
AT scognamigliogiosue bracsadatasetforbreastcarcinomasubtypinginhehistologyimages
AT jaumeguillaume bracsadatasetforbreastcarcinomasubtypinginhehistologyimages
AT depietrogiuseppe bracsadatasetforbreastcarcinomasubtypinginhehistologyimages
AT dibonitomaurizio bracsadatasetforbreastcarcinomasubtypinginhehistologyimages
AT foncubiertaantonio bracsadatasetforbreastcarcinomasubtypinginhehistologyimages
AT bottigerardo bracsadatasetforbreastcarcinomasubtypinginhehistologyimages
AT gabranimaria bracsadatasetforbreastcarcinomasubtypinginhehistologyimages
AT feroceflorinda bracsadatasetforbreastcarcinomasubtypinginhehistologyimages
AT fruccimaria bracsadatasetforbreastcarcinomasubtypinginhehistologyimages