🔏
Marianas
  • Introduction
  • Quick Usage
  • Detailed Usage
  • Consensus-Calling Algorithm
  • Read Name Information
  • FAQ
  • Unanswered Q's
Powered by GitBook
On this page
  • Bam Collapsing
  • Polishing

Was this helpful?

Export as PDF

Detailed Usage

Marianas Sub-commands and Parameters

Note: All parameters are positional, and are listed in the following tables in the order they should be supplied. Also note that the defaults listed here are the recommended values from our testing, however they are not internal to the tool, and still must be supplied on the command line.

Bam Collapsing

ProcessLoopUMIFastq

Parameter

Description

Default

read_1

Path to read 1 of paired fastqs

read_2

Path to read 2 of paired fastqs

umi_length

Number of bases that constitute the UMI for each read

3

DuplexUMIBamToCollapsedFastqFirstPass & DuplexUMIBamToCollapsedFastqSecondPass

Parameter

Description

Default

bam_file

Path to .bam file after it has been run through ProcessLoopUMIFastq and the standard Picard and GATK pipeline

pileup_file

(optional) Path to a pileup file generated by Waltz for the current sample being processed. If provided, the genotype at each position from this file will be used to decide the consensus base to use if min_consensus_percent is not reached.

min_mapq

Minimum mapping quality

1

min_basq

Minimum base quality

20

umi_mismatches

UMI families with less than or equal to this many mismatches that map at the same position (within umi_wobble distance) will be merged and collapsed as one family

0

umi_wobble

UMI families that map to within this many bases as each other will be merged and collapsed as one family (if they also satisfy requirement for umi_mismatches)

2

min_consensus_percent

Positions with a single base that is represented in greater than or equal to this many reads will be collapsed with the corresponding consensus base. If no consensus can be reached this position will be output as an N.

90

reference_fasta

Path to reference fasta which will be used as a backup to decide a consensus base if no pileup file is provided.

HG19

minInsertionConsensusPercent [internal]

Similar to min_consensus_percent, this threshold will be used to decide on whether to include insertions in the collapsed consensus sequence.

70

minDeletionConsensusPercent [internal]

Similar to min_consensus_percent, this threshold will be used to decide on whether to include deletions in the collapsed consensus sequence.

50

basesToTrim [internal]

Number of bases to trim from either end of the consensus sequences after collapsing

3

SeparateBams

Parameter

Description

bam_file

Collapsed bam file that should be split into two bams consisting of only Simplex and Duplex reads

Polishing

NoiseFrequencyBuilder

Parameter

Description

pileup_directory

Directory that contains Waltz pileups for a set of normal bam files

Polisher

Parameter

Description

maf_file

These columns are required: Chromosome, Start_Position, Variant_Type, Reference_Allele, Tumor_Seq_Allele2

depth_column_name

Column name for MAF field that represents sequencing depth

alt_count_column_name

Column name for MAF field that represents alt allele count

af_frequencies_file

af-frequencies.txt from NoiseFrequencyBuilder

count_frequencies_file

count-frequencies.txt from NoiseFrequencyBuilder

PreviousQuick UsageNextConsensus-Calling Algorithm

Last updated 5 years ago

Was this helpful?