The BRACS dataset is specifically designed for the classification of breast cancer subtypes and contains a large number of pathological images stained with H&E. This dataset encompasses 547 whole slide images (WSIs) and 4,539 regions of interest (ROIs) extracted from these slides. Each whole slide image and its corresponding ROIs are annotated by three professional pathologists and classified into different lesion types. Specifically, BRACS includes three main lesion types—benign, malignant, and atypical, which are further divided into seven subtypes.
Breast cancer is one of the most common types of cancer among women, and clear subclassification of subtypes is crucial for breast cancer research as it allows researchers and clinicians to develop personalized treatment plans based on the specific characteristics of the cancer. Advanced AI analysis using the BRACS dataset can help reveal subtle differences between different subtypes, which has potential value in improving diagnostic accuracy and treatment response. In this way, the BRACS dataset not only promotes the development of precise diagnostic technologies but also supports more effective patient management and treatment strategies, thereby improving the success rate of breast cancer treatment globally.
Dimensions | Modality | Task Type | Anatomical Structures | Anatomical Area | Number of Categories | Data Volume | File Format |
---|---|---|---|---|---|---|---|
2D | Pathology | Subtype Classification | Breast tissue | Breast tissue | 7 | 4539 ROIs/547 WSIs | .png/.svs |
Number of BRACS data labels according to lesion type.
Category | Subcategory | Count | Percentage |
---|---|---|---|
WSI | |||
Benign | Normal | 44 | 8% |
Pathological Benign | 147 | 27% | |
Usual Ductal Hyperplasia | 74 | 14% | |
Atypical | Flat Epithelial Atypia | 41 | 7% |
Atypical Ductal Hyperplasia | 48 | 9% | |
Malignant | Ductal Carcinoma in Situ | 61 | 11% |
Invasive Carcinoma | 132 | 24% | |
ROI | |||
Benign | Normal | 484 | 11% |
Pathological Benign | 836 | 18% | |
Usual Ductal Hyperplasia | 517 | 11% | |
Atypical | Flat Epithelial Atypia | 756 | 17% |
Atypical Ductal Hyperplasia | 507 | 11% | |
Malignant | Ductal Carcinoma in Situ | 790 | 17% |
Invasive Carcinoma | 649 | 14% |
This section is statistics results of label information.
Metric | Tumor |
---|---|
Case Count | 501 |
Coverage | 49.4% |
Min Volume (cm³) | 0.1 |
Median Volume (cm³) | 99 |
Max Volume (cm³) | 2480 |
(a) Normal (N), (b) Pathological Benign (PB), (c) Usual Ductal Hyperplasia(UDH), (d) Flat Epithelial Atypia (FEA), (e) Atypical Ductal Hyperplasia (ADH), (f) Carcinoma in Situ (DCIS) and (g)Invasive Carcinoma (IC).
The BRACS dataset is publicly accessible and downloadable via the BRACS website. The data is organized as follows:
BRACS
│
├── Whole Slide Image Set
│ ├── Test
│ │ ├── Group_AT
│ │ ├── ...
│ └── Train
│ │ ├── Group_AT
│ │ ├── ...
│ └── Val
│ │ ├── Group_AT
│ │ ├── ...
├── Region of Interest Set
│ ├── Test
│ │ ├── 0_N
│ │ ├── ...
│ ├── Train
│ │ ├── 0_N
│ │ ├── ...
│ ├── Val
│ │ ├── 0_N
│ │ ├── ...
├── Whole Slide Image Annotations
│ ├── Test
│ │ ├── Group_AT
│ │ ├── ...
│ └── Train
│ │ ├── Group_AT
│ │ ├── ...
│ └── Val
│ │ ├── Group_AT
│ │ ├── ...
├── BRACS Summary File
│ ├── BRACS.xlsx
Nadia Brancati (Italian National Research Council of High Performance Computing and Networking)
Anna Maria Anniciello (Pascale Cancer Research Institute, Naples)
Pushpak Pati (IBM Research, Zurich)
Daniel Riccio (University of Naples Federico II)
Giosuè Scomagniglio (ETH Zurich)
Guillaume Jaume (IBM Research, Zurich; EPFL Lausanne)
Giuseppe De Pietro (Italian National Research Council of High Performance Computing and Networking)
Maurizio Di Bonito (Pascale Cancer Research Institute, Naples)
Antonio Foncubierta (IBM Research, Zurich)
Gerardo Botti (Pascale Cancer Research Institute, Naples)
Maria Gabrani (IBM Research, Zurich)
Florinda Feroce (Pascale Cancer Research Institute, Naples)
Maria Frucci (Italian National Research Council of High Performance Computing and Networking)
Official Website: https://www.bracs.icar.cnr.it/
Download Link: https://www.bracs.icar.cnr.it/download/
Article Address: https://academic.oup.com/database/article/doi/10.1093/database/baac093/6762252
Publication Date: 2022-10
@article{brancati2022bracs,
title={Bracs: A dataset for breast carcinoma subtyping in h\&e histology images},
author={Brancati, Nadia and Anniciello, Anna Maria and Pati, Pushpak and Riccio, Daniel and Scognamiglio, Giosu{\`e} and Jaume, Guillaume and De Pietro, Giuseppe and Di Bonito, Maurizio and Foncubierta, Antonio and Botti, Gerardo and others},
journal={Database},
volume={2022},
pages={baac093},
year={2022},
publisher={Oxford University Press UK}
}
Original introduction article is here.