Germline & Somatic short variant discovery (SNVs + Indels) for WGS & WES.
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
YaqingLiu fa7bbf3d1d Update: README 4 年之前
.vscode first commit 4 年之前
tasks Update: add VEP 4 年之前
.DS_Store first commit 4 年之前
LICENSE.md first commit 4 年之前
README.md Update: README 4 年之前
defaults Update: add VEP 4 年之前
inputs Update: add VEP 4 年之前
workflow.wdl Update: add VEP 4 年之前

README.md

Variant Calling

This APP developed for germline and somatic short variant discovery (SNVs + Indels).

Accepted data

  • TN matched WES for somatic variant calling
  • TN matched WGS for somatic variant calling
  • Normal-only WES for germline variant calling
  • Normal-only WGS for germline variant calling

The datatype is judged by whether the bed file is set (i.e. the regions in inputs).

Supported variant callers and annotation tools

  • Variant calling: haplotyper (germline); tnseq, tnscope, varscan (somatic).
  • Annotation: annovar, vep.
  • The above tools are not activated by default, which means the default setting is false. You need to manually set the caller to true in the submitted sample.csv.

New Releases

  • Two annotation tools have been added.
  • TNhaplotyper, named as TNseq in v0.1.0, has beed substituted by TNhaplotyper2.
  • The corealigner step has been removed.
  • Some parameters’ details have been changed, such as the interval_list has turned into interval.

Getting Started

We recommend using choppy system and Aliyun OSS service. The command will look like this:

# Activate the choppy environment
$ open-choppy-env

# Install the APP
$ choppy install YaqingLiu/variant-calling [-f]

# List the parameters
$ choppy samples YaqingLiu/variant-calling-latest [--no-default]

# Submit you task with the `samples.csv file` and `project name`
$ choppy batch YaqingLiu/variant-calling-latest samples.csv -p Project [-l project:Label]

# Query the status of all tasks in the project
$ choppy query -L project:Label | grep "status"

Please note: The defaults can be forcibly replaced by the settings in samples.csv. Therefore, there is no need to contact me over this issue.

The parameters that must need contains: sample_id,normal_fastq_1,normal_fastq_2,tumor_fastq_1,tumor_fastq_2

Please carefully check

  • the reference genome you want to use is hg38 or hg19.
  • bed file.
  • the caller you want to use.
  • PoN VCFs for TNseq and TNscope is supported, but are need to be generated in advance.
  • interval padding is default 0, and you can change it.
  • usually only one annotation tool is sufficient.