gs:// URLs

The dicom_all BigQuery table discussed in this documentation article can be used to select the files you need based on their DICOM metadata attributes, using the SQL query interface. The gcs_url column contains Google Storage gs:// URLs that can be used to retrieve the files.

Save the query in a file named query.txt. The specific values for PatientID, SeriesInstanceUID, and StudyInstanceUID are chosen to serve as examples.

Use the bq query command (from the command line) to run the query and save the result into a manifest file, which will be the list of gs:// URLs that can be used to download the data.

Set the --max_rows parameter of the bq query command to equal or exceed the number of rows in the result of the query, otherwise your list will be truncated! To confirm that the --max_rows parameter is sufficiently large, check the number of rows in the query result (use the BigQuery console to run such counting queries).

On Windows, use the type command in place of cat in the download command, since the latter is not available in the Windows command prompt:

type manifest.txt | gsutil -m cp -I .
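
The workflow described above can be sketched as a short shell script for a Unix-like shell. This is a minimal sketch, not the article's own commands: the table path `bigquery-public-data.idc_current.dicom_all`, the PatientID value, the MAX_ROWS value, and the function names make_manifest, manifest_may_be_truncated, and download_manifest are all assumptions made for illustration. It also assumes that bq and gsutil are installed and authenticated.

```shell
#!/bin/sh
# Illustrative sketch of the manifest workflow. The table path, the PatientID
# value, MAX_ROWS, and all function names are assumptions, not taken from
# the article.

MAX_ROWS=2000000

# 1. Store the query in query.txt. Replace the WHERE clause with your own
#    PatientID, StudyInstanceUID, or SeriesInstanceUID filter.
cat > query.txt <<'EOF'
SELECT gcs_url
FROM `bigquery-public-data.idc_current.dicom_all`
WHERE PatientID = "LUNG1-001"
EOF

# 2. Run the query and write one gs:// URL per line to manifest.txt.
#    CSV output starts with a header row (the column name); tail drops it.
make_manifest() {
  bq query --use_legacy_sql=false --format=csv --max_rows="$MAX_ROWS" \
    < query.txt | tail -n +2 > manifest.txt
}

# 3. Return success (exit status 0) when the manifest has at least as many
#    non-empty lines as the row limit, which suggests possible truncation.
#    Arguments: manifest file, row limit.
manifest_may_be_truncated() {
  line_count=$(grep -c . "$1")
  [ "$line_count" -ge "$2" ]
}

# 4. Download every file listed in the manifest into the current directory.
download_manifest() {
  gsutil -m cp -I . < manifest.txt
}
```

A possible way to chain the steps is to run make_manifest, then call manifest_may_be_truncated manifest.txt "$MAX_ROWS" and raise MAX_ROWS if it succeeds, and only then run download_manifest. Keeping the truncation check separate from the download avoids silently fetching an incomplete file list.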