You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

94 satır
4.0KB

  1. # This is an example configuration file for FastQ Screen
  2. ############################
  3. ## Bowtie, Bowtie 2 or BWA #
  4. ############################
  5. ## If the Bowtie, Bowtie 2 or BWA binary is not in your PATH, you can set
  6. ## this value to tell the program where to find your chosen aligner. Uncomment
  7. ## the relevant line below and set the appropriate location. Please note,
  8. ## this path should INCLUDE the executable filename.
  9. #BOWTIE /usr/local/bin/bowtie/bowtie
  10. #BOWTIE2 /usr/local/bowtie2/bowtie2
  11. #BWA /usr/local/bwa/bwa
  12. ############################################
  13. ## Bismark (for bisulfite sequencing only) #
  14. ############################################
  15. ## If the Bismark binary is not in your PATH then you can set this value to
  16. ## tell the program where to find it. Uncomment the line below and set the
  17. ## appropriate location. Please note, this path should INCLUDE the executable
  18. ## filename.
  19. #BISMARK /usr/local/bin/bismark/bismark
  20. ############
  21. ## Threads #
  22. ############
  23. ## Genome aligners can be made to run across multiple CPU cores to speed up
  24. ## searches. Set this value to the number of cores you want for mapping reads.
  25. THREADS 32
  26. ##############
  27. ## DATABASES #
  28. ##############
  29. ## This section enables you to configure multiple genomes databases (aligner index
  30. ## files) to search against in your screen. For each genome you need to provide a
  31. ## database name (which can't contain spaces) and the location of the aligner index
  32. ## files.
  33. ##
  34. ## The path to the index files SHOULD INCLUDE THE BASENAME of the index, e.g:
  35. ## /data/public/Genomes/Human_Bowtie/GRCh37/Homo_sapiens.GRCh37
  36. ## Thus, the index files (Homo_sapiens.GRCh37.1.bt2, Homo_sapiens.GRCh37.2.bt2, etc.)
  37. ## are found in a folder named 'GRCh37'.
  38. ##
  39. ## If, for example, the Bowtie, Bowtie2 and BWA indices of a given genome reside in
  40. ## the SAME FOLDER, a SINLGE path may be provided to ALL the of indices. The index
  41. ## used will be the one compatible with the chosen aligner (as specified using the
  42. ## --aligner flag).
  43. ##
  44. ## The entries shown below are only suggested examples, you can add as many DATABASE
  45. ## sections as required, and you can comment out or remove as many of the existing
  46. ## entries as desired. We suggest including genomes and sequences that may be sources
  47. ## of contamination either because they where run on your sequencer previously, or may
  48. ## have contaminated your sample during the library preparation step.
  49. ##
  50. ## Human - sequences available from
  51. ## ftp://ftp.ensembl.org/pub/current/fasta/homo_sapiens/dna/
  52. #DATABASE Human /data/public/Genomes/Human_Bowtie/GRCh37/Homo_sapiens.GRCh37
  53. ##
  54. ## Mouse - sequence available from
  55. ## ftp://ftp.ensembl.org/pub/current/fasta/mus_musculus/dna/
  56. #DATABASE Mouse /data/public/Genomes/Mouse/NCBIM37/Mus_musculus.NCBIM37
  57. ##
  58. ## Ecoli- sequence available from EMBL accession U00096.2
  59. #DATABASE Ecoli /data/public/Genomes/Ecoli/Ecoli
  60. ##
  61. ## PhiX - sequence available from Refseq accession NC_001422.1
  62. #DATABASE PhiX /data/public/Genomes/PhiX/phi_plus_SNPs
  63. ##
  64. ## Adapters - sequence derived from the FastQC contaminats file found at: www.bioinformatics.babraham.ac.uk/projects/fastqc
  65. #DATABASE Adapters /data/public/Genomes/Contaminants/Contaminants
  66. ##
  67. ## Vector - Sequence taken from the UniVec database
  68. ## http://www.ncbi.nlm.nih.gov/VecScreen/UniVec.html
  69. #DATABASE Vectors /data/public/Genomes/Vectors/Vectors
  70. DATABASE Human /cromwell_root/tmp/fastq_screen_reference/genome
  71. DATABASE Mouse /cromwell_root/tmp/fastq_screen_reference/mouse
  72. DATABASE ERCC /cromwell_root/tmp/fastq_screen_reference/ERCC
  73. DATABASE EColi /cromwell_root/tmp/fastq_screen_reference/ecoli
  74. DATABASE Adapter /cromwell_root/tmp/fastq_screen_reference/adapters
  75. DATABASE Vector /cromwell_root/tmp/fastq_screen_reference/vector
  76. DATABASE rRNA /cromwell_root/tmp/fastq_screen_reference/rRNARef
  77. DATABASE Virus /cromwell_root/tmp/fastq_screen_reference/viral
  78. DATABASE Yeast /cromwell_root/tmp/fastq_screen_reference/GCF_000146045.2_R64_genomic_modify
  79. DATABASE Mitoch /cromwell_root/tmp/fastq_screen_reference/Human_mitoch
  80. DATABASE Phix /cromwell_root/tmp/fastq_screen_reference/phix