count-reads-fw¶

NOTE: This function (formerly called comBBo) uses the legacy fixed-width binning described in the HATCHet paper. We recommend using count-reads and combine-counts which apply an adaptive binning scheme to ensure that each genomic bin has comparable BAF signal.

This step of HATCHet splits the human reference genome into fixed-width bins (i.e., small genomic regions), and computes the number of sequencing reads aligned to each bin from every given tumor samples and from the matched normal sample.

Input¶

count-reads-fw takes in input sorted and indexed BAM files for multiple tumor samples from the same patient, a sorted and index BAM file from a matched-normal sample, and a indexed human reference genome.

Name	Description	Usage
`-T`, `--tumors`	A white-space separated list of sorted-indexed BAM files for tumor samples	The tumor samples from the same patient that are jointly analyzed by HATCHet
`-N`, `--normal`	A sorted-indexed BAM file for matched-normal sample	The matched normal sample for the same patient
`-r`, `--reference`	A FASTA file	The human reference genome used for germline variant calling

Output¶

count-reads-fw produces three tab-separated files: the first contains the read counts for every genomic bin in every tumor sample (raw counts not normalized), the second contains the read counts for every genomic bin the matched-normal sample (raw counts not normalized), and the third contains a list of the genomic positions that have been identified as germline heterozygous SNPs in the matched-normal sample.

Name	Description
`-O`, `--outputnormal`	The output file for the read counts from matched-normal sample
`-o`, `--outputtumors`	The output file for the read counts from the tumor samples

Main parameters¶

count-reads-fw

count-reads-fw¶

Input¶

Output¶

Main parameters¶

Optional parameters¶