Data release notes
Data hosted by IDC is ingested from several sources, including The Cancer Imaging Archive (TCIA), Genomics Data Commons (GDC), Clinical Proteomic Tumor Analysis Consortium (CPTAC) and Human Tumor Atlas Network (HTAN).
Please refer to the license and terms of use, which are defined in the license_url and source_doi or source_url of the IDC BigQuery dicom_all table. You can filter the data by license type in the IDC Portal, or programmatically, as demonstrated in this tutorial.
IDC releases summary view

V24 - May 2026
New Collections
CDDP-EAGLE-1
NIH
Lung Adenocarcinoma
SM
49
49
GDC
CGCI-BLGSP
CGCI
Burkitt Lymphoma
SM
388
1,933
GDC
CGCI-HTMCP-CC
CGCI
Cervical Squamous Cell Carcinoma
SM
211
525
GDC
CGCI-HTMCP-DLBCL
CGCI
Diffuse Large B-Cell Lymphoma
SM
43
496
GDC
CGCI-HTMCP-LC
CGCI
Non-Small Cell Carcinoma
SM
27
84
GDC
HCMI-CMDC
HCMI
Various (pan-cancer)
SM
382
810
GDC
PDXNet
NIH
Various (pan-cancer PDX)
SM
919
919
PDXNet Consortium
UW-CIRP-Mouse-PET-CT-NSCLC
NIH
Lung Squamous Cell Carcinoma
PT, CT, RTSTRUCT
14
75
UW Co-CIRP
HTAN-TNP-SARDANA
HTAN
Colon Mucinous Adenocarcinoma
SM, PR
1
3
HTAN/Synapse
CPTAC-STAD
CPTAC
Stomach Adenocarcinoma
CT, US
20
99
TCIA
CATCH
Community
Canine cutaneous tumors (7 subtypes)
SM
282
350
TCIA
LDCT-and-Projection-data
Community
Lung Cancer
CT
200
698
TCIA
PSMA-PET-CT-Lesions
Community
Prostate Cancer
CT, PT, SEG
378
1,791
TCIA
Spinal-Multiple-Myeloma-SEG
Community
Multiple Myeloma
CT, SEG
67
720
TCIA
EAY131
NCTN
Pan-cancer (46 types)
CT, MR, PT, RTSTRUCT, SEG, NM, XA
2,813
30,293
TCIA
CDDP-EAGLE-1 (IDC Portal | DOI 10.5281/zenodo.17372206) — 49 H&E-stained frozen section whole slide images of primary lung adenocarcinoma from the EAGLE population-based case-control study (Lombardy, Italy). Converted from GDC Aperio SVS to DICOM SM.
CGCI-BLGSP (IDC Portal | DOI 10.5281/zenodo.17381396) — 1,933 whole slide images from 388 subjects in the Burkitt Lymphoma Genome Sequencing Project. Includes H&E and immunohistochemistry stains (BCL2, BCL6, CD10, Ki-67, CD20, CD3, CD5, CD79a), EBER in situ hybridization, and Wright-Giemsa. Both FFPE and frozen sections. Converted from GDC Aperio SVS to DICOM SM.
CGCI-HTMCP-CC (IDC Portal | DOI 10.5281/zenodo.17381404) — 525 whole slide images from 211 subjects in the HIV+ Tumor Molecular Characterization Project - Cervical Cancer, part of CGCI. Includes H&E and p16 stains for most subjects, plus additional IHC markers (BER-EP4, MOC31, P40, P63, CEA, ER, PR, VIM, TP53, CD56, CHR, SYN) for subsets. Converted from GDC Aperio SVS to DICOM SM.
CGCI-HTMCP-DLBCL (IDC Portal | DOI 10.5281/zenodo.17381412) — 496 whole slide images from 43 subjects in the HIV+ Tumor Molecular Characterization Project - Diffuse Large B-Cell Lymphoma, part of CGCI. Includes H&E and IHC/ISH stains (BCL2, BCL6, CD10, CD20, CD3, CD79a, EBER, Ki-67, MUM1, TP53) for most subjects, plus INT and CD138 for subsets. Converted from GDC Aperio SVS to DICOM SM.
CGCI-HTMCP-LC (IDC Portal | DOI 10.5281/zenodo.17381428) — 84 whole slide images from 27 subjects in the HIV+ Tumor Molecular Characterization Project - Lung Cancer, part of CGCI. Includes H&E, P40, and TTF1 stains for most subjects, plus CHR, SYN, and P16 for subsets. Converted from GDC Aperio SVS to DICOM SM.
HCMI-CMDC (IDC Portal | DOI 10.5281/zenodo.17381441) — 810 H&E-stained whole slide images from 382 subjects in the Human Cancer Models Initiative Cancer Model Development Center. Pan-cancer collection spanning 20+ cancer types (colorectal, pancreatic, esophageal, breast, brain, melanoma, lung, and others) with both FFPE and frozen tumor and normal sections. Converted from GDC Aperio SVS to DICOM SM.
PDXNet (IDC Portal | DOI 10.5281/zenodo.16967600) — A pan-cancer repository of >1,000 patient-derived xenograft (PDX) and paired parental tumor H&E whole slide images from the NCI PDXNet Consortium. Covers 29+ cancer types across 22 anatomic sites, with associated genomic, clinical, and pathologic annotation data. Contributed by BCM, Huntsman, MDACC, Wistar, WUSTL, and JAX. Converted from TIFF/SVS to DICOM SM.
UW-CIRP-Mouse-PET-CT-NSCLC (IDC Portal | DOI 10.5281/zenodo.17257735) — 18F-FDG PET/CT imaging of 14 genetically-engineered mouse models of lung squamous cell carcinoma from the NCI Co-CIRP program. 25 imaging sessions with 75 series (25 PT, 25 CT, 25 RTSTRUCT segmentations) on Siemens Inveon PET/CT. 11 animals imaged pre- and post-therapy (anti-PD-L1 + CXCR2 antagonist), 3 baseline-only.
HTAN-TNP-SARDANA (IDC Portal | DOI 10.5281/zenodo.18488943) — CyCIF and H&E whole slide images of a single colorectal cancer specimen from the HTAN SARDANA Trans-Network Project. 265 DICOM instances across 3 series: 240 CyCIF SM, 6 H&E SM, and 19 presentation state (PR) instances. ~20 CyCIF markers targeting immune, epithelial, stromal, and proliferation cell populations.
CPTAC-STAD (IDC Portal | DOI 10.7937/jw9a-8k71) — 99 radiology series (98 CT, 1 US) from 20 patients in the Clinical Proteomic Tumor Analysis Consortium Stomach Adenocarcinoma cohort. Images acquired as standard-of-care radiology (predominantly abdominal/abdominopelvic CT) prior to pathological diagnosis, made publicly available by TCIA. Sourced as-is in DICOM from TCIA.
CATCH (IDC Portal | DOI 10.5281/zenodo.18526942) — 350 H&E-stained whole slide images of seven canine cutaneous tumor subtypes (melanoma, mast cell tumor, squamous cell carcinoma, peripheral nerve sheath tumor, trichoblastoma, histiocytoma, plasmacytoma) from 282 patients. Includes 12,424 polygon annotations for 13 histologic classes. Original Aperio SVS images from TCIA converted to DICOM SM.
LDCT-and-Projection-data (IDC Portal | DOI 10.7937/9npb-2637) — Low-dose CT images and projection data from 200 patients at Mayo Clinic, including non-contrast head CT, low-dose chest CT for pulmonary nodule screening, and contrast-enhanced abdominal CT. CT projection data provided in the open DICOM-CT-PD format. Sourced as-is in DICOM from TCIA.
PSMA-PET-CT-Lesions (IDC Portal | DOI 10.7937/r7ep-3x37) — 597 whole-body PSMA-PET/CT studies from 378 male patients with suspected or diagnosed prostate carcinoma, acquired at LMU University Hospital Munich (2014-2022). All PSMA-avid tumor lesions manually segmented on PET images in 3D. Includes CT, PET, and DICOM SEG segmentation masks. Used in the autoPET III and IV Grand Challenges. Sourced as-is in DICOM from TCIA.
Spinal-Multiple-Myeloma-SEG (IDC Portal | DOI 10.7937/k4qv-hh78) — Dual-energy low-dose CT scans from 67 patients with multiple myeloma acquired at University Hospital Brno (2020-2023). Includes conventional CT, virtual monoenergetic images (40/80/120 keV), calcium-suppressed images, and DICOM SEG segmentation masks of vertebrae (with type classification) and myeloma lesions. 576 CT series plus 144 SEG series. Supporting clinical and demographic data provided as TSV. Sourced as-is in DICOM from TCIA. (Note: 67 unique PatientIDs with 72 studies — 5 patients have 2 scans each.)
EAY131 (IDC Portal | DOI 10.7937/c5ke-yx42) — Imaging and clinical data for 2,813 "unmatched" patients from the NCI MATCH Screening Trial (NCT02465060), performed by the ECOG-ACRIN Cancer Research Group. 30,293 series across CT (13,166), RTSTRUCT (14,395), SEG (1,404), MR (1,100), PT (222), NM (5), and XA (1) modalities. Covers 46 cancer types. Includes accompanying clinical/demographic data. Sourced as-is in DICOM from TCIA.
New Analysis Results
EAY131-Tumor-Annotations
NCTN
Pan-cancer (46 types)
RTSTRUCT, SEG
2,487
15,799
TCIA
EAY131-Tumor-Annotations (IDC Portal | DOI 10.7937/q9rn-m510) — Tumor segmentations, seed points, and negative findings assessments for 2,487 subjects from the EAY131 collection. Annotations follow RECIST 1.1 (CT/MR/USG) and PERCIST (PET) criteria, created by an international team of radiologists and reviewed by US board-certified radiologists. Includes longitudinal lesion tracking via SNOMED-CT codes and Tracking UID tags. Also includes a CSV metadata report with lesion volumes. Created by Petr Jordan and Michael Rozenfeld.
Revised Collections
BoneMarrowWSI-PediatricLeukemia
Replaced all 1,033 ANN series with 1,027 new ANN; removed 1 SM series from 1 patient
-1
-7
BMDeep/Fraunhofer MEVIS
ACRIN-NSCLC-FDG-PET
modality was changed from SC to OT in 4 series
0
0
https://www.cancerimagingarchive.net/collection/acrin-nsclc-fdg-pet/
Anti-PD-1_Lung
modality was changed from SC to OT in 3 series
0
0
https://www.cancerimagingarchive.net/collection/anti-pd-1_lung/
Phantom FDA
Phantom FDA DICOM instance data moved to gs://idc-open-data and s3://idc-open-data buckets from gs://idc-open-cr and s3://idc-open-data-cr buckets respectively. The Phantom FDA data in gs://idc-open-cr and s3://idc-open-data-cr will be available through IDC versions v24 and v25.
0
0
New metadata BQ Tables (non-clinical)
program_metadata
Metadata about each IDC program
Revised BQ Tables (non-clinical)
analysis_results_metadata
added columns:
analysis_result_id
column id changes:
ID --> analysis_result_name
Title --> analysis_result_title
CancerTypes --> cancer_types
TumorLocations --> tumor_locations
Subjects -->s ubjects
Collections --> collections
Modalities --> modalities
Updated --> updated
Description --> description
deprecated columns
ID
Title
CancerTypes
TumorLocations
auxiliary_metadata
column id changes
submitter_case_id --> Patient_id
deprecated columns
submitter_case_id
Access
dicom_metadata_curated
added columns cancer_types
original_collection_metadata
added columns
sources.source_id
sources.source_name
sources.source_type
column id changes
CancerTypes --> cancer_types
TumorLocations --> tumor_locations
Subject --> subjects
Species --> species
Sources --> sources
Sources.ImageTypes --> sources.modalities
Sources.Citation --> source.citation
SupportingData --> supporting_data
Program --> program_id
Status --> status
Updated --> updated
Description --> description
Deprecated columns
Sources.Access
Sources.ImageTypes
CancerTypes
TumorLocations
SupportingData
Program
New clinical metadata BQ tables
ldct_and_projection_data_abdomen_v9
ldct_and_projection_data_chest_v9
V23 - Nov 2025
There are two rows, not one, for every instance in the DICOM converted Slide Microscopy images for the TCGA-BRCA collection in the dicom_all and auxiliary_metadata BigQuery tables in the bigquery-public-data.idc_current and bigquery-public-data.idc_v23 datasets. The collection's DOI is doi.org/10.5281/zenodo.12689962 .
The rows in each pair are identical except that one row has the Creative Commons Attribution 3.0 Unported License (CC BY 3.0) and the other row the Creative Commons Attribution 4.0 International License (CC BY 4.0). The correct license for this collection is CC BY 3.0. The rows having the CC BY 4.0 license can be ignored.
This error will be corrected in the next IDC release.
Release counts
Files: 46,870,903 (+175,736)
Series: 994,073 (+28,666)
Studies: 160,199 (+606)
Cases: 79,889 (+355)
Collections: 161 (no change)
Analysis results collections: 23 (+6)
Disk size: 95.33 TB (+2.22 TB)
New radiology collections
New pathology collections
New analysis results
Lung-PET-CT-Dx-Annotations Collections analyzed:
Lung-PET-CT-Dx
NLST-Sybil Collections analyzed:
NLST
NLSTSeg Collections analyzed
NLST
PROSTATEx-Targets Collections analyzed:
ProstateX
TCGA-GBM360 Collections analyzed:
TCGA-GBM
Revised radiology collections
Revised pathology collections
Revised analysis results
New clinical metadata tables
Revised clinical metadata tables
varepop_apollo_clinical
V22 - Sept 2025
Release counts
Files: 46,695,167 (+910,713)
Series: 965,407 (+14,519)
Studies: 159,593 (+10,016)
Cases: 79,214 (+8,132)
Collections: 161 (+11)
Analysis results collections: 17 (no change)
Disk size: 93.11 TB (+5.62 TB)
New radiology collections
New pathology collections
Revised radiology collections
Revised pathology collections
New clinical metadata tables
bonemarrowwsi_pediatricleukemia_clinical
cbis_ddsm_calc_case_description_test_set
cbis_ddsm_calc_case_description_train_set
cbis_ddsm_mass_case_description_test_set
cbis_ddsm_mass_case_description_train_set
cc_radiomics_phantom_3_chest_settings
cc_radiomics_phantom_3_head_settings
cc_radiomics_phantom_3_manufacturer
Revised clinical metadata tables
varepop_apollo_clinical
V21 - May 2025
Release counts
Files: 45,784,454 (+174,244)
Series: 950,888 (+3,308)
Studies: 149,577 (+2,070)
Cases: 71,082 (+1,893)
Collections: 150 (+1)
Analysis results collections: 17 (no change)
Disk size: 87.49 TB (+1.94 TB)
New radiology collections
Revised radiology collections
Revised pathology collections
Revised analysis results
New clinical metadata tables
bamf_aimi_annotations_brain_mr_qa_results
bamf_aimi_annotations_breast_fdg_pet_ct_qa_results
bamf_aimi_annotations_breast_mr_qa_results
bamf_aimi_annotations_kidney_ct_qa_results
bamf_aimi_annotations_liver2_ct_qa_results
bamf_aimi_annotations_liver_ct_qa_results
bamf_aimi_annotations_liver_mr_qa_results
bamf_aimi_annotations_lung2_ct_qa_results
bamf_aimi_annotations_lung_ct_qa_results
bamf_aimi_annotations_lung_fdg_pet_ct_qa_results
bamf_aimi_annotations_prostate_mr_qa_results
cptac_aml_demographic_classification
varepop_apollo_clinical
Renamed clinical metadata tables
nlst_canc
Previously nlst_clinical
Retired clinical metadata tables
acrin_nsclc_fdg_pet_bamf_lung_pet_ct_segmentation
Subsumed by bamf_aimi_annotations_lung_fdg_pet_ct_qa_results
anti_pd_1_lung_bamf_lung_ct_segmentation
Subsumed by bamf_aimi_annotations_lung_ct_qa_results
anti_pd_1_lung_bamf_lung_fdg_pet_ct_segmenation
Subsumed by bamf_aimi_annotations_lung_fdg_pet_ct_qa_results
lung_pet_ct_dx_bamf_lung_ct_segmentation
Subsumed by bamf_aimi_annotations_lung_ct_qa_results
lung_pet_ct_dx_bamf_lung_fdg_pet_ct_segmenation
Subsumed by bamf_aimi_annotations_lung_fdg_pet_ct_qa_results
nsclc_radiogenomics_bamf_lung_ct_segmentation
Subsumed by bamf_aimi_annotations_lung_ct_qa_results
nsclc_radiogenomics_bamf_lung_fdg_pet_ct_segmenation
Subsumed by bamf_aimi_annotations_lung_fdg_pet_ct_qa_results
prostatex_bamf_segmentations
Subsumed by bamf_aimi_annotations_prostate_mr_qa_results
qin_breast_bamf_breast_segmentation
Subsumed by bamf_aimi_annotations_breast_fdg_pet_ct_qa_results
rider_lung_pet_ct_bamf_lung_ct_segmentation
Subsumed by bamf_aimi_annotations_lung_ct_qa_results
rider_lung_pet_ct_bamf_lung_fdg_pet_ct_segmenation
Subsumed by bamf_aimi_annotations_lung_fdg_pet_ct_qa_results
tcga_kirc_bamf_kidney_segmentation
Subsumed by bamf_aimi_annotations_kidney_ct_qa_results
tcga_lihc_bamf_liver_ct_segmentation
Subsumed by bamf_aimi_annotations_liver_ct_qa_results
tcga_lihc_bamf_liver_mr_segmentation
Subsumed by amf_aimi_annotations_liver_mr_qa_results
tcga_luad_bamf_lung_ct_segmentation
Subsumed by bamf_aimi_annotations_lung_ct_qa_results
tcga_luad_bamf_lung_mr_segmentation
Subsumed by bamf_aimi_annotations_lung_fdg_pet_ct_qa_results
tcga_lusc_lung_ct_segmentation
Subsumed by bamf_aimi_annotations_lung_ct_qa_results
tcga_lusc_lung_mr_segmentation
Subsumed by bamf_aimi_annotations_lung_fdg_pet_ct_qa_results
V20 - November 2024
New radiology collections
New pathology collections
Revised radiology collections
Revised pathology collections
Revised analysis results
BAMF-AIMI-Annotations Collections analyzed:
Pan-Cancer-Nuclei-Seg-DICOM Collections analyzed:
The segmentation of an instance in each of the following series was excluded due to having a DICOM PixelData size greater than or equal to 2GB:
1.2.826.0.1.3680043.10.511.3.10544506665348704312902213950958190
1.2.826.0.1.3680043.10.511.3.11183783347037364699862133130586654
1.2.826.0.1.3680043.10.511.3.11834745481756047014039855874680259
1.2.826.0.1.3680043.10.511.3.11901667084519361717338400810055642
1.2.826.0.1.3680043.10.511.3.12041600048156613329793822566495651
1.2.826.0.1.3680043.10.511.3.12718116375608495830041119776887887
1.2.826.0.1.3680043.10.511.3.13386724401829265460622415500801368
1.2.826.0.1.3680043.10.511.3.14042734131864468280344737986870899
1.2.826.0.1.3680043.10.511.3.17374765903080083648409690755539184
1.2.826.0.1.3680043.10.511.3.17429002643681869326389465422353495
1.2.826.0.1.3680043.10.511.3.20359930476040698387716730891020638
1.2.826.0.1.3680043.10.511.3.28397033639127902823368316410884210
1.2.826.0.1.3680043.10.511.3.28425539132321749931109935391487352
1.2.826.0.1.3680043.10.511.3.34574227972763695321794092913087775
1.2.826.0.1.3680043.10.511.3.36216094237641867532902805456135029
1.2.826.0.1.3680043.10.511.3.39533936694797964318706337783276378
1.2.826.0.1.3680043.10.511.3.39900930856460689132625586523683939
1.2.826.0.1.3680043.10.511.3.41633795217567037218184715094985555
1.2.826.0.1.3680043.10.511.3.42218106649761752724553401155203874
1.2.826.0.1.3680043.10.511.3.49098870621170235412220976183110770
1.2.826.0.1.3680043.10.511.3.50064322235999800062455171235601125
1.2.826.0.1.3680043.10.511.3.50905421517530127976832505410705816
1.2.826.0.1.3680043.10.511.3.62935684444056080516153739948364303
1.2.826.0.1.3680043.10.511.3.73572792121235596011940904319511291
1.2.826.0.1.3680043.10.511.3.74494366757564543824303304482444570
1.2.826.0.1.3680043.10.511.3.79988146996803179892075404247166692
1.2.826.0.1.3680043.10.511.3.80004293150506819482091023564947091
1.2.826.0.1.3680043.10.511.3.82774274518897141254234567300292686
1.2.826.0.1.3680043.10.511.3.84202416467561501610598853920808906
1.2.826.0.1.3680043.10.511.3.86214492184712627544696209982376598
1.2.826.0.1.3680043.10.511.3.90193069664920622990317347485104073
1.2.826.0.1.3680043.10.511.3.95666157880521064637011880609274546
1.2.826.0.1.3680043.10.511.3.96676982370873257329281821215166082
1.2.826.0.1.3680043.10.511.3.98258035017480972315346136181769675
RMS-Mutation-Prediction-Expert-Annotations
WARNING: After the release of v20, it was discovered that a mistake had been made during data conversion that affected the newly-released segmentations accompanying the "RMS-Mutation-Prediction" collection. Segmentations released in v20 for this collection have the segment labels for alveolar rhabdomyosarcoma (ARMS) and embryonal rhabdomyosarcoma (ERMS) switched in the metadata relative to the correct labels. Thus segment 3 in the released files is labelled in the metadata (the SegmentSequence) as ARMS but should correctly be interpreted as ERMS, and conversely segment 4 in the released files is labelled as ERMS but should be correctly interpreted as ARMS. We apologize for the mistake and any confusion that it has caused, and will be releasing a corrected version of the files in the next release as soon as possible. Collections analyzed:
New Clinical Metadata Tables
v19 - September 2024
New pathology collections
New analysis results
Pancreas-CT-SEG Collections analyzed:
Revised radiology collections
Cancer Moonshot Biobank (CMB) radiology images were updated to fix incorrect values assigned to PatientID (see details on the collection pages linked above). The updated images have different DICOM Study/Series/SOPInstanceUIDs.
Revised analysis results
BAMF-AIMI-Annotations Collections analyzed:
New clinical metadata tables
v18 - April 2024
New radiology collections
New analysis results
RMS-Mutation-Prediction-Expert-Annotations* Collections analyzed:
TotalSegmentator-CT-Segmentations** Collections analyzed:
Revised radiology collections
(starred collections are revised due to new or revised analysis results)
Breast-Cancer-Screening-DBT (revisions only to clinical data)
NLST**
Revised pathology collections
(starred collections are revised due to new or revised analysis results)
CPTAC-BRCA (fix PatientAges > 090Y)
CPTAC-COAD (fix PatientAges > 090Y)
Also added missing instance SOPInstanceUID: 1.3.6.1.4.1.5962.99.1.3459553143.523311062.1687086765943.9.0
Removed corrupted instances
SOPInstanceUID: 1.3.6.1.4.1.5962.99.1.2164023716.1899467316.1685791236516.37.0
SOPInstanceUID: 1.3.6.1.4.1.5962.99.1.2411736851.773458418.1686038949651.37.0
SOPInstanceUID: 1.3.6.1.4.1.5962.99.1.2411736851.773458418.16860389
TCGA-BLCA (All TCGA revisions are to correct multiple manufacturer values within same series)
TCGA-DLBC (No description page)
New clinical metadata tables
Notes
The deprecated columns tcia_api_collection_id and idc_webapp_collection_id have been removed from the auxiliary_metadata table in the idc_v18 BQ dataset. These columns were duplicates of columns collection_name and collection_id respectively.
v17 - December 2023
New radiology collections
New analysis results
Prostate-MRI-US-Biopsy-DICOM-Annotations Collections analyzed:
Revised radiology collections
New clinical metadata tables
v16 - September 2023
New radiology collections
New pathology collections
Revised radiology collections
Breast-MRI-NACT-Pilot (TCIA description: (Repair of DICOM tag(0008,0005) to value "ISO_IR 100" in 79 series)
CPTAC-CRCC (Revised because results from CPTAC-CRCC-Tumor-Annotations were added)
CPTAC-UCEC (Revised because results from CPTAC-UCEC-Tumor-Annotations were added)
CPTAC-PDA (Revised because results from CPTAC-PDA-Tumor-Annotations were added)
New analysis results
New clinical metadata tables
v15 - July 2023
New radiology collections
New pathology collections
ICDC-Glioma (ICDC-Glioma radiology added in a previous version)
Revised radiology collections
CPTAC-CCRCC (TCIA description: “Radiology modality data cleanup to remove extraneous scans.”)
CPTAC-CM (“TCIA description: Radiology modality data cleanup to remove extraneous scans.”)
CPTAC-LSCC (TCIA description: “Radiology modality data cleanup to remove extraneous scans.”)
CPTAC-LUAD (TCIA description: “Radiology modality data cleanup to remove extraneous scans.”)
CPTAC-PDA (TCIA description: TCIA description: “Radiology modality data cleanup to remove extraneous scans.”)
CPTAC-SAR (TCIA description: “Radiology modality data cleanup to remove extraneous scans.”)
CPTAC-UCEC (TCIA description: “Radiology modality data cleanup to remove extraneous scans.”)
CT Lymph Nodes (TCIA description: “Added DICOM version of MED_ABD_LYMPH_MASKS.zip segmentations that were previously available”)
RIDER Lung CT (Revised because QIBA-VolCT-1B analysis results were added)
NLST (Revised because analysis results from nnU-Net-BPR-Annotations were revised)
NSCLC-Radiomics (Revised because analysis results from nnU-Net-BPR-Annotations were revised)
Revised pathology collections
CPTAC-GBM (11 pathology-only patients removed at request of data owner)
CPTAC-SAR (1 pathology-only patient removed at request of data owner)
New analysis results
QIBA-VolCT-1B (Analysis of NLST and NSCLC-Radiomics)
Revised analysis results
nnU-Net-BPR-Annotations (Annotations of NLST and NSCLC-Radiomics radiology)
New clinical metadata tables
v14 - May 2023
This release does not introduce any new data, but changes the bucket organization and introduces replication of IDC files in Amazon AWS storage buckets, as described in this section.
v13 - Mar 2023
New analysis results collection:
New clinical data collections:
v12 - Nov 2022
New collections:
Updated collections:
Other:
Metadata corresponding to "limited" access collections are removed.
New clinical data collections:
Other clinical data updates:
Limited access collections are removed. Clinical metadata for the COVID-19-NY-SUB and ACRIN 6698/I-SPY2 Breast DWI collections now includes information ingested from data dictionaries associated with these collections. In v11 the string value 'NA' was being changed to null during the ETL process for some columns/collections. This is now fixed in v12 and the value 'NA' is preserved.
v11 - Sept 2022
This release introduces clinical data ingested for a subset of collections, and now available via a dedicated BigQuery dataset.
New collections:
v10 - Aug 2022
In this release we introduce a new HTAN program including currently three collections release by the Human Tumor Atlas Network.
New collections:
Updated collections:
CPTAC, TCGA and NLST collections have been reconverted due to a technical issue identified with a subset of images included in v9.
TCGA-DLBC
Note that the TCGA-KIRP and TCGA-BRCA collections (marked with the asterisk in the list above) are currently missing SM high resolution layer files/instances due to a known limitation of Google Healthcare that makes it not possible to ingest datasets that exceed some internal limits. Specifically, the following patient/studies are affected:
TCGA-KIRP:
PatientIDTCGA-5P-A9KA,StudyInstanceUID2.25.191236165605958868867890945341011875563TCGA-BRCA:
PatientIDTCGA-OL-A66H,StudyInstanceUID2.25.82800314486527687800038836287574075736 The affected files will be included in IDC when the infrastructure limitation is addressed.
Collection access level change:
Vestibular-Schwannoma-SEG is now available as public access collection
v9 - May 2022
This data release introduces the concept of differential license to IDC: some of the collections maintained by IDC contain items that have different licenses. As an example, radiology component of the TCGA-GBM collection is covered by the TCIA limited access license, and is not available in IDC, while the digital pathology component is covered by CC-BY. With this release, we complete sharing in full of the digital pathology component of the datasets released by the CPTAC and TCGA programs.
New collections:
Updated collections:
v8 - April 2022
The main highlight of this release is the addition of the NLST and TCGA Slide Microscopy imaging data. New TCGA content includes introduction of new (to IDC) TCGA collections that have only slide microscopy component, and addition of the slide microscopy component to those IDC collections that were available earlier and included only the radiology component.
New collections
TCGA-DLBC (TCGA-DLBC collection does not have a description page)
Updated collections
v7 - February 2022
The main highlight of this release is the addition of the Slide Microscopy imaging component to the remaining CPTAC collections.
New collections
Updated collections
v6 - January 2022
The following collections became limited access due to the change in policy by TCIA, which is the original source of those collections.
Original collections:
Analysis results collections:
v5 - December 2021
New collections:
New analysis results collections:
Outcome Prediction in Patients with Glioblastoma by Using Imaging, Clinical, and Genomic Biomarkers: Focus on the Nonenhancing Component of the Tumor (GBM-MR-NER-Outcomes)
DICOM-SEG Conversions for TCGA-LGG and TCGA-GBM Segmentation Datasets (DICOM-Glioma-SEG)
Updated collections:
v4 - September 2021
National Lung Screening Trial (NLST) collection is added. The data included consists of the following components:
1) CT images available as any other imaging collection (via IDC Portal, BigQuery metadata tables, and storage buckets);
2) a subset of clinical data available in the BigQuery tables starting with nlst_ under the idc_v4 dataset, as documented in the Collection-specific BigQuery Tables section.
3) One instance is missing from patient/study/series:
126153/1.2.840.113654.2.55.319335498043274792486636919135185299851/1.2.840.113654.2.55.262421043240525317038356381369289737801
4) Three instances are missing from patient/study/series:
215303/1.3.6.1.4.1.14519.5.2.1.7009.9004.337968382369511017896638591276/1.3.6.1.4.1.14519.5.2.1.7009.9004.180224303090109944523368212991
v3 - August 2021
The following radiology collections were updated to include DICOM Slide Microscopy (SM) images converted from the original vendor-specific representation into dual personality DICOM-TIFF format.
The DICOM Slide Microscopy (SM) images included in the collections above in IDC are not available in TCIA. TCIA only includes images in the vendor-specific SVS format!
v2 - June 2021
Listed below are all of the original and analysis results collections of The Cancer Imaging Archive currently hosted by IDC, with the links to the Digital Object Identifiers (DOIs) of those collections.
New original collections:
New analysis results collections:
v1 - October 2020
Listed below are all of the original and analysis results collections of The Cancer Imaging Archive currently hosted by IDC, with the links to the Digital Object Identifiers (DOIs) of those collections.
Original collections included:
Analysis collections included:
QIN multi-site collection of Lung CT data with Nodule Segmentations (only items corresponding to the LIDC-IDRI original collection are included)
DICOM SR of clinical data and measurement for breast cancer collections to TCIA (only items corresponding to the ISPY1 original collection are included)
Last updated
Was this helpful?