# Noise by Substitution Type

![](https://2763969089-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M52gq1rRSDQOKMQGEuR%2F-M9Z39Nd7c6cJRknLQBm%2F-M9Z75cSaiMQSeopyRT4%2FScreen%20Shot%202020-06-11%20at%2012.04.36%20PM.png?alt=media\&token=c06466ec-e0e0-4d12-be9b-261b016fe995)

**Theoretical Method**

For each position that crosses the noise threshold (usually set at 2%), base changes are counted for each of the 6 possible substitution types.&#x20;

{% hint style="info" %}
Note: Duplex bams are used for this calculation
{% endhint %}

**Technical Methods**

* Tool Used:
  * Marianas
  * Waltz PileupMetrics
  * calculate\_noise.sh
* Input
  * sample\_id-duplex-pileup.txt (for duplex noise calculation)
  * MSK-ACCESS-v1\_0-A-good-positions.txt (Pool A bed file with MSI regions removed)
* Output
  * noise-by-substitution.txt

**Interpretations**

ACCESS cfDNA samples usually exhibit larger noise values for C>T transitions, possibly due to cytosine deamination. However, differences between samples are not unexpected. Our threshold for ACCESS samples is 0.001 (past which we would fail a sample).&#x20;
