YaqingLiu/variant-calling: Germline & Somatic short variant discovery (SNVs + Indels) for WGS & WES.

Germline & Somatic short variant discovery (SNVs + Indels) for WGS & WES.

YaqingLiu 33b1c80f3e test bam input		4年前
.vscode	first commit	4年前
tasks	Update: add VEP	4年前
.DS_Store	first commit	4年前
LICENSE.md	first commit	4年前
README.md	Update: README	4年前
defaults	test bam input	4年前
inputs	test add bam	4年前
workflow.wdl	test bam input	4年前

README.md

Variant Calling

This APP developed for germline and somatic short variant discovery (SNVs + Indels).

Accepted data

TN matched WES for somatic variant calling
TN matched WGS for somatic variant calling
Normal-only WES for germline variant calling
Normal-only WGS for germline variant calling

The datatype is judged by whether the bed file is set (i.e. the regions in inputs).

Supported variant callers and annotation tools

Variant calling: haplotyper (germline); tnseq, tnscope, varscan (somatic).
Annotation: annovar, vep.
The above tools are not activated by default, which means the default setting is false. You need to manually set the caller to true in the submitted sample.csv.

New Releases

Two annotation tools have been added.
TNhaplotyper, named as TNseq in v0.1.0, has beed substituted by TNhaplotyper2.
The corealigner step has been removed.
Some parameters’ details have been changed, such as the interval_list has turned into interval.

Getting Started

We recommend using choppy system and Aliyun OSS service. The command will look like this:

# Activate the choppy environment
$ open-choppy-env

# Install the APP
$ choppy install YaqingLiu/variant-calling [-f]

# List the parameters
$ choppy samples YaqingLiu/variant-calling-latest [--no-default]

# Submit you task with the `samples.csv file` and `project name`
$ choppy batch YaqingLiu/variant-calling-latest samples.csv -p Project [-l project:Label]

# Query the status of all tasks in the project
$ choppy query -L project:Label | grep "status"

Please note: The defaults can be forcibly replaced by the settings in samples.csv. Therefore, there is no need to contact me over this issue.

The parameters that must need contains: sample_id,normal_fastq_1,normal_fastq_2,tumor_fastq_1,tumor_fastq_2

Please carefully check

the reference genome you want to use is hg38 or hg19.
bed file.
the caller you want to use.
PoN VCFs for TNseq and TNscope is supported, but are need to be generated in advance.
interval padding is default 0, and you can change it.
usually only one annotation tool is sufficient.