The core functions of IDC support our overall purpose, which is to support cancer imaging research on the cloud. While some of the capabilities that are needed for this are outside the scope of IDC, and may necessitate interaction with the other components of the Cancer Research Data Commons, we welcome IDC users to bring up any use cases that are related to using cloud computing for cancer imaging research. We will work with you to identify relevant component of CRDC that can help you.
What is the status of IDC?
IDC pilot release took place in Fall 2020, followed by the production release in September 2021. You can learn about the planned milestones for the IDC development in these slides presented at RSNA 2019.
For a dataset to become part of the IDC offering, it has to be de-identified and curated by TCIA, and released as a public TCIA collection. Once this is done, it will (eventually!) be replicated in IDC.
How to acknowledge IDC?
Please cite the paper below:
Fedorov, A., Longabaugh, W. J. R., Pot, D., Clunie, D. A., Pieper, S., Aerts, H. J. W. L., Homeyer, A., Lewis, R., Akbarzadeh, A., Bontempi, D., Clifford, W., Herrmann, M. D., Höfener, H., Octaviano, I., Osborne, C., Paquette, S., Petts, J., Punzo, D., Reyes, M., Schacherer, D. P., Tian, M., White, G., Ziegler, E., Shmulevich, I., Pihl, T., Wagner, U., Farahani, K. & Kikinis, R. NCI Imaging Data Commons. Cancer Res. 81, 4188–4193 (2021). http://dx.doi.org/10.1158/0008-5472.CAN-21-0950
What is the difference between IDC and TCIA?
IDC and TCIA are partners in providing FAIR data for cancer imaging researchers. While some of the functions between the two resources are similar, there are also key differences. The table below provides a summary of similarities and differences.
Source of FAIR data for cancer imaging research
Curation of cancer imaging collections
De-identification (note: IDC hosts data de-identified by TCIA)
Cloud-based data co-located with compute resources
Recommended mechanism for downloading data to a local resource
Conversion of pathology images and image-derived data into DICOM format
What about non-imaging data that accompanies many TCIA collections?
At the moment, non-imaging data, such as the spreadsheets with clinical information, is not replicated on IDC and it is not possible to search this data using IDC Portal. You will need to access this data from TCIA.
Our short term plan is to selectively bring such spreadsheets as collection-specific BigQuery tables available within the release dataset (as an example, such tables are available for the NLST collection). We may expose some of those tables/attributes in the IDC portal.
Our longer-term plan is to work with the CRDC Center for Cancer Data Harmonization (CCDH) to harmonize the data in these spreadsheets, and identify the appropriate format and location for the resulting harmonized data.
Is it free to use IDC?
We provide a summary of what you can do with IDC, and what you will need to access specific capabilities. If you want to explore the capabilities of the cloud that require a billing account, and would like to develop a better understanding of the costs before committing your credit card, you can apply for a free Google cloud credit allocation using this form.
I want to search IDC content using an attribute not available in the portal