Snakemake workflow: FAIR CRCC - send data
A Snakemake workflow for securely sharing Crypt4GH-encrypted sensitive data from the CRC Cohort to a destination approved through a successful access request.
The recommendation is to create a directory for the request that has been approved; it will be used as the working directory for the run. Copy there the recipient's crypt4gh key and prepare the run configuration. The configuration will specify the repository, the destination of the data, and the list of files/directories to transfer.
What's the CRC Cohort?
The CRC Cohort is a collection of clinical data and digital high-resolution digital pathology images pertaining to tumor cases. The collection has been assembled from a number of participating biobanks and other partners through the ADOPT BBMRI-ERIC project.
Researchers interested in using the data for science can file an application for access. If approved, the part of the dataset required for the planned and approved work can be copied to the requester's selected secure storage location (using this workflow).
Usage
Example
mkdir request_1234 && cd request_1234
# Now write the configuration, specifying crypt4gh keys, destination and files to send.
# Finally, execute workflow.
snakemake --snakefile ../fair-crcc-send-data/workflow/Snakefile --profile ../profile/ --configfile config.yml --use-singularity --cores
Run configuration example
recipient_key: ./recipient_key
repository:
path: "/mnt/rbd/data/sftp/fair-crcc/"
private_key: bbmri-key
public_key: bbmri-key.pub
sources:
glob_extension: ".tiff.c4gh"
items:
- some/directory/to/glob
- another/individual/file.tiff.c4gh
destination:
type: "S3"
root_path: "my-bucket/prefix/"
connection: # all elements will be passed to the selected snakemake remote provider
access_key_id: "MYACCESSKEY"
secret_access_key: "MYSECRET"
host: http://localhost:9000
verify: false # don't verify ssl certificates
TODO
The usage of this workflow is described in the Snakemake Workflow Catalog.
If you use this workflow in a paper, don't forget to give credits to the authors by citing the URL of this (original) fair-crcc-send-datasitory and its DOI (see above).
Click and drag the diagram to pan, double click or use the controls to zoom.
Version History
Version 1 (earliest) Created 27th Jan 2022 at 16:12 by Luca Pireddu
Frozen
Version-1
9c89c00
Creator
Submitter
Views: 1888 Downloads: 201
Created: 27th Jan 2022 at 16:12
This item has not yet been tagged.
None