Datasets
eva provides native support for several public datasets. When possible, the corresponding dataset classes facilitate automatic download to disk, if not possible, this documentation provides download instructions.
Vision Datasets Overview
Whole Slide (WSI) and microscopy image datasets
Patch-level
Dataset | #Patches | Patch Size | Magnification (μm/px) | Task | Tissue Type |
---|---|---|---|---|---|
BACH | 400 | 2048x1536 | 20x (0.5) | Classification (4 classes) | Breast |
CRC | 107,180 | 224x224 | 20x (0.5) | Classification (9 classes) | Colorectal |
PatchCamelyon | 327,680 | 96x96 | 10x (1.0) * | Classification (2 classes) | Breast |
MHIST | 3,152 | 224x224 | 5x (2.0) * | Classification (2 classes) | Colorectal Polyp |
MoNuSAC | 294 | 113x81 - 1398x1956 | 40x (0.25) | Segmentation (4 classes) | Multi-Organ Cell Type (Breast, Kidney, Lung and Prostate) |
CoNSeP | 41 | 1000x1000 | 40x (0.25) * | Segmentation (8 classes) | Colorectal Nuclear |
* Downsampled from 40x (0.25 μm/px) to increase the field of view.
Slide-level
Dataset | #Slides | Slide Size | Magnification (μm/px) | Task | Cancer Type |
---|---|---|---|---|---|
Camelyon16 | 400 | ~100-250k x ~100-250k x 3 | 40x (0.25) | Classification (2 classes) | Breast |
PANDA | 10,616 | ~20k x 20k x 3 | 20x (0.5) | Classification (6 classes) | Prostate |
Radiology datasets
Dataset | #Images | Image Size | Task | Download provided |
---|---|---|---|---|
TotalSegmentator | 1228 | ~300 x ~300 x ~350 * | Semantic Segmentation (117 classes) | Yes |
LiTS | 131 (58638) | ~300 x ~300 x ~350 * | Semantic Segmentation (2 classes) | No |
* 3D images of varying sizes