Germline & Somatic short variant discovery (SNVs + Indels) for WGS & WES.
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
YaqingLiu eb14b9b267 Update: inputs 4 jaren geleden
.vscode first commit 4 jaren geleden
tasks Update: inputs 4 jaren geleden
.DS_Store first commit 4 jaren geleden
LICENSE.md first commit 4 jaren geleden
README.md Update: README.md 4 jaren geleden
defaults Update: inputs 4 jaren geleden
inputs Update: inputs 4 jaren geleden
workflow.wdl Update: inputs 4 jaren geleden

README.md

Variant Calling

This APP developed for germline and somatic short variant discovery (SNVs + Indels).

Supported Callers

  • All callers are not activated by default, which means the default setting is false.
  • You need to manually set the caller to true in the submitted sample.csv.
  • The fields corresponding to these callers include haplotyper, tnseq, tnscope, varscan.

Germline

  • Haplotyper

Somatic

  • TNseq (TNhaplotyper2)
  • TNscope
  • VarScan
  • TNhaplotyper (This caller is only available in v0.1.0 as it is too outdated)

Accepted Data

  • TN matched WES for somatic variant calling
  • TN matched WGS for somatic variant calling
  • Normal-only WES for germline variant calling
  • Normal-only WGS for germline variant calling

The datatype is judged by whether the bed file is set (i.e. the regions in inputs).

New Releases

  • The TNhaplotyper, named as TNseq in v0.1.0, has beed substituted by TNhaplotyper2.
  • The corealigner step has been removed.
  • The vcf2maf step has been removed. Thus, the final output is the annotated VCF.

Getting Started

We recommend using choppy system and Aliyun OSS service. The command will look like this:

# Activate the choppy environment
$ open-choppy-env

# Install the APP
$ choppy install YaqingLiu/variant-calling-latest [-f]

# List the parameters
$ choppy samples YaqingLiu/variant-calling-latest [--no-default]

# Submit you task with the `samples.csv file` and `project name`
$ choppy batch YaqingLiu/variant-calling-latest samples.csv -p Project [-l project:Label]

# Query the status of all tasks in the project
$ choppy query -L Label | grep "status"

Please note: The defaults can be forcibly replaced by the settings in samples.csv. Therefore, there is no need to contact me over this issue.

The parameters that must need contains: sample_id,normal_fastq_1,normal_fastq_2,tumor_fastq_1,tumor_fastq_2

Please carefully check

  • the reference genome you want to use is hg38.
  • bed file.
  • the caller you want to use.
  • PoN VCFs for TNseq and TNscope is supported, but are need to be generated in advance.
  • interval padding is default 0, and you can change it.