Insert Size Distribution

Confirmation of fragment length information for cfDNA and buffy coat DNA fragments

Theoretical Method

Insert size is calculated from the start and stop positions of the reads after mapping to the reference genome.

Technical Methods

  • Tool Used:

    • Waltz CountReads

    • aggregate_bam_metics.sh

    • tables_module.py

    • plots_module.r

  • Input

    • Collapsed all unique bam

    • ACCESS pool A bed file

  • Output

    • sample_id.bam.fragment-sizes

    • fragment_sizes.txt (aggregated across samples from a single bam type / pool combination)

    • fragment_sizes_unfiltered_A_targets.txt (used for graph above)

Interpretations Cell free DNA has distinctive features due to the natural processes behind its fragmentation. One such feature is the set of 10-11 bp fluctuations that indicate the preferential splicing of fragments due to the number of bases per turn of the DNA helix, which causes a unique pattern of binding to the surface of histones.

The more pronounced peak at 166 bp indicate complete wrapping of the DNA around the histones’ circumference, and similarly the second more pronounced peak indicates two complete wraps.

Buffy coat samples are mechanically sheared and thus do not exhibit these distinctive features, hence the different shape for their distribution.

Note: All values are shifted 6 bp lower, due to clipping of 3 bp from each end of the reads during the collapsing process

Last updated