1.6.21
Version of tools in docker image (/container/Dockerfile)
Tool
Version
Location
clearlinux (base image)
-
-
vcf2maf
1.6.21
https://github.com/mskcc/vcf2maf/archive/v1.6.21.zip
VEP
105
-
MINICONDA_VERSION
py37_4.9.2
https://repo.anaconda.com/miniconda/Miniconda3-py37_4.9.2-Linux-x86_64.sh
BCFTOOLS_VERSION
1.10.2
-
SAMTOOLS_VERSION
1.10
-
VCF2MAF_VERSION
1.6.21
-
HTSLIB_VERSION
1.10.2
-
CWL
CWL specification 1.0
Use example_inputs.yaml to see the inputs to the cwl
Example Command using toil:
> toil-cwl-runner vcf2maf_1.6.21.cwl example_inputs.yaml
If at MSK, using the JUNO cluster having installed toil version 3.19 and manually modifying lsf.py by removing type==X86_64 &&
you can use the following command
#Using CWLTOOL
> cwltool --singularity --non-strict /path/to/vcf2maf_1.6.21.cwl /path/to/inputs.yaml
#Using toil-cwl-runner
> mkdir vcf2maf_toil_log
> toil-cwl-runner --singularity --logFile /path/to/vcf2maf_toil_log/cwltoil.log --jobStore /path/to/vcf2maf_jobStore --batchSystem lsf --workDir /path/to/vcf2maf_toil_log --outdir . --writeLogs /path/to/vcf2maf_toil_log --logLevel DEBUG --stats --retryCount 2 --disableCaching --maxLogFileSize 20000000000 /path/to/vcf2maf_1.6.17.cwl /path/to/inputs.yaml > vcf2maf_toil.stdout 2> vcf2maf_toil.stderr &
Usage
Usage:
perl vcf2maf.pl --help
perl vcf2maf.pl --input-vcf input.vcf --output-maf output.maf --tumor-id TUMOR_ID --normal-id NORMAL_ID
--input-vcf Path to input file in VCF format
--output-maf Path to output MAF file
--tmp-dir Folder to retain intermediate VCFs after runtime [Default: Folder containing input VCF]
--tumor-id Tumor_Sample_Barcode to report in the MAF [TUMOR]
--normal-id Matched_Norm_Sample_Barcode to report in the MAF [NORMAL]
--vcf-tumor-id Tumor sample ID used in VCF's genotype columns [--tumor-id]
--vcf-normal-id Matched normal ID used in VCF's genotype columns [--normal-id]
--custom-enst List of custom ENST IDs that override canonical selection
--vep-path Folder containing the vep script [~/miniconda3/bin]
--vep-data VEP's base cache/plugin directory [~/.vep]
--vep-forks Number of forked processes to use when running VEP [4]
--vep-custom String to pass into VEP's --custom option []
--vep-config Config file to pass into VEP's --config option []
--vep-overwrite Allow VEP to overwrite output VCF if it exists
--buffer-size Number of variants VEP loads at a time; Reduce this for low memory systems [5000]
--any-allele When reporting co-located variants, allow mismatched variant alleles too
--inhibit-vep Skip running VEP, but extract VEP annotation in VCF if found
--online Use useastdb.ensembl.org instead of local cache (supports only GRCh38 VCFs listing <100 events)
--ref-fasta Reference FASTA file [~/.vep/homo_sapiens/102_GRCh37/Homo_sapiens.GRCh37.dna.toplevel.fa.gz]
--max-subpop-af Add FILTER tag common_variant if gnomAD reports any subpopulation AFs greater than this [0.0004]
--species Ensembl-friendly name of species (e.g. mus_musculus for mouse) [homo_sapiens]
--ncbi-build NCBI reference assembly of variants MAF (e.g. GRCm38 for mouse) [GRCh37]
--cache-version Version of offline cache to use with VEP (e.g. 75, 91, 102) [Default: Installed version]
--maf-center Variant calling center to report in MAF [.]
--retain-info Comma-delimited names of INFO fields to retain as extra columns in MAF []
--retain-fmt Comma-delimited names of FORMAT fields to retain as extra columns in MAF []
--retain-ann Comma-delimited names of annotations (within the VEP CSQ/ANN) to retain as extra columns in MAF []
--min-hom-vaf If GT undefined in VCF, minimum allele fraction to call a variant homozygous [0.7]
--remap-chain Chain file to remap variants to a different assembly before running VEP
--verbose Print more things to log progress
--help Print a brief help message and quit
--man Print the detailed manual
Last updated