You must have run the Nucleo workflow first before running any of the MSK-ACCESS QC workflows. Depending on your use case, there are two main sets of workflows you can choose to run: (1) `qc_generator
Note: To see help for the workflow inputs, run `toil-cwl-runner nucleo.cwl --help`.
Once the requirements are installed, we can run the workflow using cwltool or toil.
Step 5: Run the workflow
To generate the QC files for one sample:
cwltool-execution
cwltool nucleo.cwl inputs.yaml
To aggregate the QC files across one or more samples and visualize with MultiQC:
cwltool-execution
cwltool nucleo.cwl inputs.yaml
toil-local-execution
toil-cwl-runner nucleo.cwl inputs.yaml
Note the use of `--singularity` to convert Docker containers into Singularity containers, the `TMPDIR` environment variable to avoid writing temporary files to shared disk space, the `_JAVA_OPTIONS` environment variable to set the Java temporary directory to `/scratch`, the `SINGULARITY_BINDPATH` environment variable to bind `/scratch` when running Singularity containers, and `TOIL_LSF_ARGS` to specify any additional arguments that the jobs' `bsub` commands should have (in this case, a maximum wall-time of 6 hours).
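The settings described in the note can be sketched as a small wrapper function. This is an illustration under stated assumptions, not the project's own submission script: the function name `run_nucleo_on_lsf`, the `DRY_RUN` switch, and the `./example_jobStore` and `./example_outdir` paths are hypothetical, and the `/scratch` locations should be adapted to your cluster. Only widely used `toil-cwl-runner` options are shown.

```shell
# Sketch only: every path and value below is an illustrative assumption.
run_nucleo_on_lsf() {
  (
    # Subshell: the exports apply to this run only, not to the calling shell.
    export TMPDIR="/scratch"                           # keep temporary files off shared disk
    export _JAVA_OPTIONS="-Djava.io.tmpdir=/scratch/"  # Java temporary directory
    export SINGULARITY_BINDPATH="/scratch:/scratch"    # make /scratch visible inside containers
    export TOIL_LSF_ARGS="-W 6:00"                     # extra bsub arguments: 6-hour wall-time limit

    # With DRY_RUN set, "echo" is prepended so the command is printed instead of run.
    ${DRY_RUN:+echo} toil-cwl-runner \
      --singularity \
      --batchSystem lsf \
      --jobStore ./example_jobStore \
      --outdir ./example_outdir \
      nucleo.cwl inputs.yaml
  )
}

# Print the command without submitting anything.
DRY_RUN=1 run_nucleo_on_lsf
```

Calling `run_nucleo_on_lsf` without `DRY_RUN` would start the run. Keeping the exports inside a subshell is a design choice that avoids leaving a modified `TMPDIR` behind in an interactive session.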
Next, you must generate a proper input file in either JSON or YAML format.
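CWL runners accept the input object as a JSON or YAML file. A small pre-flight check, sketched below, can catch a missing or misnamed file before a long run starts. The helper name `check_inputs_file` is hypothetical, and the check covers only the file extension and that the file is non-empty, not whether its contents match the workflow's inputs.

```shell
# Hypothetical helper: confirm an inputs file exists and is named like YAML or JSON.
check_inputs_file() {
  inputs="$1"
  case "$inputs" in
    *.yaml|*.yml|*.json) ;;   # accepted extensions
    *)
      echo "unsupported extension: $inputs" >&2
      return 2
      ;;
  esac
  # The file must exist and be non-empty.
  if [ ! -s "$inputs" ]; then
    echo "missing or empty file: $inputs" >&2
    return 1
  fi
  echo "ok: $inputs"
}

# Usage: start the workflow only if the check passes.
# check_inputs_file inputs.yaml && cwltool nucleo.cwl inputs.yaml
```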
Here we show how to use cwltool to run the workflow on a single machine, such as a laptop.
Run the workflow with a given set of inputs on a single machine
Here we show how to run the workflow using toil-cwl-runner with its single-machine interface.
Once the requirements are installed, we can run the workflow using cwltool, provided a proper input file has been generated in either JSON or YAML format. Please see the inputs documentation for more details.
Run the workflow with a given set of inputs on a single machine
Here we show how to run the workflow using toil-cwl-runner on JUNO, the MSKCC internal compute cluster, which uses LSF as its scheduler.
Run the workflow with a given set of inputs on JUNO (MSKCC Research Cluster)
Your workflow should now be running on the specified batch system. See the outputs documentation for a description of the resulting files once it has completed.