Germline & Somatic short variant discovery (SNVs + Indels) for WGS & WES.
Você não pode selecionar mais de 25 tópicos Os tópicos devem começar com uma letra ou um número, podem incluir traços ('-') e podem ter até 35 caracteres.
YaqingLiu 784f19e2c9 Fix bug: remove vcf2maf 4 anos atrás
.vscode first commit 4 anos atrás
tasks Fix bug: Haplotyper 4 anos atrás
.DS_Store first commit 4 anos atrás
LICENSE.md first commit 4 anos atrás
README.md Update: README.md 4 anos atrás
defaults Update: inputs 4 anos atrás
inputs Update: inputs 4 anos atrás
workflow.wdl Fix bug: remove vcf2maf 4 anos atrás

README.md

Variant Calling

This APP developed for germline and somatic short variant discovery (SNVs + Indels).

Supported Callers

  • All callers are not activated by default, which means the default setting is false.
  • You need to manually set the caller to true in the submitted sample.csv.
  • The fields corresponding to these callers include haplotyper, tnseq, tnscope, varscan.

Germline

  • Haplotyper

Somatic

  • TNseq (TNhaplotyper2)
  • TNscope
  • VarScan
  • TNhaplotyper (This caller is only available in v0.1.0 as it is too outdated)

Accepted Data

  • TN matched WES for somatic variant calling
  • TN matched WGS for somatic variant calling
  • Normal-only WES for germline variant calling
  • Normal-only WGS for germline variant calling

The datatype is judged by whether the bed file is set (i.e. the regions in inputs).

New Releases

  • The TNhaplotyper, named as TNseq in v0.1.0, has beed substituted by TNhaplotyper2.
  • The corealigner step has been removed.
  • The vcf2maf step has been removed. Thus, the final output is the annotated VCF.

Getting Started

We recommend using choppy system and Aliyun OSS service. The command will look like this:

# Activate the choppy environment
$ open-choppy-env

# Install the APP
$ choppy install YaqingLiu/variant-calling-latest [-f]

# List the parameters
$ choppy samples YaqingLiu/variant-calling-latest [--no-default]

# Submit you task with the `samples.csv file` and `project name`
$ choppy batch YaqingLiu/variant-calling-latest samples.csv -p Project [-l project:Label]

# Query the status of all tasks in the project
$ choppy query -L Label | grep "status"

Please note: The defaults can be forcibly replaced by the settings in samples.csv. Therefore, there is no need to contact me over this issue.

The parameters that must need contains: sample_id,normal_fastq_1,normal_fastq_2,tumor_fastq_1,tumor_fastq_2

Please carefully check

  • the reference genome you want to use is hg38.
  • bed file.
  • the caller you want to use.
  • PoN VCFs for TNseq and TNscope is supported, but are need to be generated in advance.
  • interval padding is default 0, and you can change it.