arrow-left

All pages
gitbookPowered by GitBook
1 of 1

Loading...

Fraction of reads mapping to the human genome

Ensure there is adequate mapping of sequenced reads to the human genome

hashtag
Theoretical Method

This metric is obtained by iterating through the bam file, and looking at the sam flag which indicates whether each read has an adequate mapping to the HG19 reference.

hashtag
Technical Methods

Waltz uses a method from the of the HTSJDK library:

circle-info

Note: This method is distinct from getProperPairFlag(),which will only consider reads which are mapped in a proper pair.

  • Tool Used

    • waltz.jar CountReads

    • Aggregate_bam_metrics.sh

hashtag
Interpretations

Mapping fraction to the human genome should be above 97%, in most cases if it is below that, there is a chance that there is contamination from another species.

circle-info

Note: this metric come from the standard bam, and is calculated across the entire bam file (as opposed to pool A or pool B on their own)

tables_module.py (TotalMapped / TotalReads)

  • plots_module.r

  • Input

    • Standard Bam (tables also produced for U / S / D bams)

  • Output

    • Text file with read count information: “sample_id.bam.read-counts.txt”

  • SAMRecord Classarrow-up-right
    SAMRecord.getReadUnmappedFlag()