Data from medical imaging challenges
Challenges typically provide the most current and easiest to use data...
This is the preferred medical imaging challenge portal! I wish everyone used it instead of fracturing the field and making search harder, but...
Portal for 100's of grand challenges in medical imaging:
Most run by academia and featured at international conferences
CAUSE07: Segment the caudate nucleus from brain MRI.
BIOCHANGE 2008 PILOT: Measure changes.
MS lesion segmentation challenge 08 Segment brain lesions from MRI.
Liver Tumor Segmentation 08 Segment liver lesions from contrast enhanced CT.
EXACT09: Extract airways from CT data.
ANODE09: Detect lung lesions from CT.
VOLCANO09: Quantify changes in pulmonary nodules.
Coronary Artery Algorithm Evaluation Framework: Extract coronary artery centerlines from CTA data.
ROC-Retinopathy Online Challenge: Detect microaneurysms for diabetic retinopathy screening.
60 challenges from a variety of biomedical areas
A growing number of their challenges include a medical imaging component: https://openchallenges.io/challenge?searchTerms=image
Portal for grand challenges in machine learning from Microsoft
2017: Pediatric bone age
2019: Intracranial hemorrhage
2022: RSNA Cervical Spine Fracture AI Challenge
2023: RSNA Screening Mammography Breast Cancer Detection AI Challenge
Commercial grand challenges
THE database that started the concept of medical image challenges
Credited for demonstrating that mutual information typically outperforms landmark-based registration (now a generally accepted notion)
Focused on quantifying medical image rigid and affine registration accuracy
Sites that list and/or host multiple collections of data
These data are typically well formatted and well documented
HuggingFace Datasets: https://huggingface.co/datasets
A collection of multiple, annotated datasets for training and evaluating AI models
Also includes links to pre-trained models that are running on HuggingFace servers - you can upload your data and apply their pre-trained models to it.
192 datasets with the term "medical" in their name or description
Lymph node, artery, and vein segmentation from thoracic CT: https://huggingface.co/datasets/andreped/LyNoS
Retinal OCT images: https://huggingface.co/datasets/marcelhuber/downprojection_images
A collection of multiple datasets - providing a way to cite hosted data in your publications
Over 1000 datasets with the filter "Open Access" "Zip File" "Dataset" and the search term "Medical"
AeroPath: Thoracic CT with segmented airways: https://zenodo.org/records/10069289
EEG Time series data: https://zenodo.org/records/4540350
COVID-19 CT Lung: https://zenodo.org/records/3757476
100,000 hisotological images of colorectal cancer: https://zenodo.org/records/1214456
Segmentations of 117 important anatomical structures in 1228 CT images - used to train TotalSegmentator V2: https://zenodo.org/records/10047292
A multi-speaker dataset of real-time two-dimensional speech magnetic resonance images with articulator ground-truth segmentations: https://zenodo.org/records/10046815
Formerly the National Biomedical Imaging Archive (NBIA):
Lung Image Database Consortium (LIDC)
Reference Image Database to Evaluate Response (RIDER)
Osteoarthritis Initiative (MIA)
PET/CT phantom scan collection
Contains COVID CT
Data from phantoms, simulated data
Misc. clinical data
Includes links to data de-identification tools
Liver tumors with segmentations
Age and gender balanced collection
MRA, T1, T2, and some DTI
Intracranial vessels extracted from select patients.
National Institute for Mental Health's (NIMH's) OpenNeuro.org
BIDS compliant MRI, PET, MEG, EEG, and iEEG data
Includes data from healthy volunteers (See release notes)
Provides a list of available databases, many of which are also listed here.
Post mortem CT of 50 subjects
CT, microCT, segmentation, and models of Cochlea
Copies of select challenge data (e.g., BRATS2015)
A joint effort between the American College of Radiology (ACR), the Radiological Society of North America (RSNA), and the American Association of Physicists in Medicine (AAPM)
A non-profit initiative that works closely with health systems around the world to create and curate de-identified datasets of medical images
Includes imaging, wave-forms (ECG), and other high-dimensional data
The father of internet data archives for all forms of machine learning.
Mix of X-ray, CT, and MRI of chest, hands, etc.
Large listing of multiple databases in computer vision and biomedical imaging
Medical Image Databases & Libraries
Data for specific topics or anatomy
Images, associated clinical data, annotations, and diagnoses
Cross-sectional MRI Data in Young, Middle Aged, Nondemented and Demented Older Adults
Longitudinal MRI Data in Nondemented and Demented Older Adults
Alzheimer’s Disease Neuroimaging Initiative (ADNI) unites researchers with study data as they work to define the progression of Alzheimer’s disease. ADNI researchers collect, validate and utilize data such as MRI and PET images, genetics, cognitive tests, CSF and blood biomarkers as predictors for the disease.
The Federal Interagency Traumatic Brain Injury Research (FITBIR) informatics system: MRI, PET, Contrast, and other data on a range of TBI conditions
Structured Analysis of the Retina: This research concerns a system to automatically diagnose diseases of the human eye.
Digital images and expert segmentations of retinal vessels.
Whole-slide images from The Cancer Genome Atlas's (TCGA) glioblastoma multiforme (GBM) samples
DTI Atlases: adults, children, ...
Small animal MRI, CT, ...
60,000 deidentified health data records
Large collection with normal and abnormal findings and ground truth.
Digital Chest X-ray images with lung nodule locations, ground truth, and controls.
Digital Chest X-ray images with segmentations of lung fields, heart, and clavicles.
Well documented chest CT images.
Mammographic images and markup.
Digital retinal images for detecting and quantifying diabetic retinopathy.
SpineWeb is an online collaborative platform for everyone interested in research on spinal imaging and image analysis.
MR data of Hips, knees and other sites affected by osteoarthritis
3D craniofacial surface measurements
Collection of files intended for 3D printing, but includes volumetric medical scans (i.e., CT and MRI in NRRD format) for a variety of anatomic structures (bones, muscles, vessels).
For example, see
Data geared towards education
A free online Medical Image Database with over 59,000 indexed and curated images, from over 12,000 patients
Image Based Medical Reference: "Find Algorithms, Decision Aids, Checklists, Guidelines, Differentials, Point of Care Ultrasound (POCUS), Physical Exam clips and more"
Simulated and phantom data
Simulated brain MR database.
Free AI-generated photos for academic research
Free Human Anatomy Images and Pictures
"a database of tracked sequences of US images from medical phantoms acquired with a methodology that ensures the spatial and force control of the US probe along prescribed trajectories by using a robotic arm and an optical tracking system." (Abdominal and baby phantoms)