Distribution of ACCESS panel A coverage values

Ensure consistent coverage across ACCESS bait (or “probe”) regions

Theoretical Method

Coverage of each genomic region in the ACCESS panel is grouped on a per-sample basis, and a distribution of these values is plotted. Each sample is normalized by the median coverage value of that sample to align all peaks with one another and correct for sample-level differences.

Technical Methods

Tool Used:
- Waltz CountReads
- aggregate_bam_metrics.sh
- tables_module.py
- plots_module.r
Input
- Collapsed, unfiltered bam
- ACCESS pool A bed file
Output
- intervals-coverage-sum.txt (one per bam type / pool combination)
- coverage_per_interval.txt (one per sample / bam type / pool combination)
- coverage_per_interval_A_targets_All_Unique.txt (this is used for graph above)
  - ~~(DMP specific format?)~~

Interpretations Each distribution should be unimodal, apart from a second peak on the low end due to X chromosome mapping from male samples. Narrow peaks are indicative of evenly distributed coverage across all bait regions. Wider distributions indicate uneven read distribution, and may be correlated with a large GC bias. Note that the provided bed file lists start and stop coordinates of ACCESS design probes, not the actual genomic target regions.

PreviousInsert Size Distribution NextAverage Coverage, Sample Level, Pool A Targets

Last updated 5 years ago

Was this helpful?