from fastq to bam files
Vous ne pouvez pas sélectionner plus de 25 sujets Les noms de sujets doivent commencer par une lettre ou un nombre, peuvent contenir des tirets ('-') et peuvent comporter jusqu'à 35 caractères.

fastq_screen.conf 4.0KB

il y a 3 ans
12345678910111213141516171819202122232425262728293031323334353637383940414243444546474849505152535455565758596061626364656667686970717273747576777879808182838485868788899091929394
  1. # This is an example configuration file for FastQ Screen
  2. ############################
  3. ## Bowtie, Bowtie 2 or BWA #
  4. ############################
  5. ## If the Bowtie, Bowtie 2 or BWA binary is not in your PATH, you can set
  6. ## this value to tell the program where to find your chosen aligner. Uncomment
  7. ## the relevant line below and set the appropriate location. Please note,
  8. ## this path should INCLUDE the executable filename.
  9. #BOWTIE /usr/local/bin/bowtie/bowtie
  10. #BOWTIE2 /usr/local/bowtie2/bowtie2
  11. #BWA /usr/local/bwa/bwa
  12. ############################################
  13. ## Bismark (for bisulfite sequencing only) #
  14. ############################################
  15. ## If the Bismark binary is not in your PATH then you can set this value to
  16. ## tell the program where to find it. Uncomment the line below and set the
  17. ## appropriate location. Please note, this path should INCLUDE the executable
  18. ## filename.
  19. #BISMARK /usr/local/bin/bismark/bismark
  20. ############
  21. ## Threads #
  22. ############
  23. ## Genome aligners can be made to run across multiple CPU cores to speed up
  24. ## searches. Set this value to the number of cores you want for mapping reads.
  25. THREADS 32
  26. ##############
  27. ## DATABASES #
  28. ##############
  29. ## This section enables you to configure multiple genomes databases (aligner index
  30. ## files) to search against in your screen. For each genome you need to provide a
  31. ## database name (which can't contain spaces) and the location of the aligner index
  32. ## files.
  33. ##
  34. ## The path to the index files SHOULD INCLUDE THE BASENAME of the index, e.g:
  35. ## /data/public/Genomes/Human_Bowtie/GRCh37/Homo_sapiens.GRCh37
  36. ## Thus, the index files (Homo_sapiens.GRCh37.1.bt2, Homo_sapiens.GRCh37.2.bt2, etc.)
  37. ## are found in a folder named 'GRCh37'.
  38. ##
  39. ## If, for example, the Bowtie, Bowtie2 and BWA indices of a given genome reside in
  40. ## the SAME FOLDER, a SINLGE path may be provided to ALL the of indices. The index
  41. ## used will be the one compatible with the chosen aligner (as specified using the
  42. ## --aligner flag).
  43. ##
  44. ## The entries shown below are only suggested examples, you can add as many DATABASE
  45. ## sections as required, and you can comment out or remove as many of the existing
  46. ## entries as desired. We suggest including genomes and sequences that may be sources
  47. ## of contamination either because they where run on your sequencer previously, or may
  48. ## have contaminated your sample during the library preparation step.
  49. ##
  50. ## Human - sequences available from
  51. ## ftp://ftp.ensembl.org/pub/current/fasta/homo_sapiens/dna/
  52. #DATABASE Human /data/public/Genomes/Human_Bowtie/GRCh37/Homo_sapiens.GRCh37
  53. ##
  54. ## Mouse - sequence available from
  55. ## ftp://ftp.ensembl.org/pub/current/fasta/mus_musculus/dna/
  56. #DATABASE Mouse /data/public/Genomes/Mouse/NCBIM37/Mus_musculus.NCBIM37
  57. ##
  58. ## Ecoli- sequence available from EMBL accession U00096.2
  59. #DATABASE Ecoli /data/public/Genomes/Ecoli/Ecoli
  60. ##
  61. ## PhiX - sequence available from Refseq accession NC_001422.1
  62. #DATABASE PhiX /data/public/Genomes/PhiX/phi_plus_SNPs
  63. ##
  64. ## Adapters - sequence derived from the FastQC contaminats file found at: www.bioinformatics.babraham.ac.uk/projects/fastqc
  65. #DATABASE Adapters /data/public/Genomes/Contaminants/Contaminants
  66. ##
  67. ## Vector - Sequence taken from the UniVec database
  68. ## http://www.ncbi.nlm.nih.gov/VecScreen/UniVec.html
  69. #DATABASE Vectors /data/public/Genomes/Vectors/Vectors
  70. DATABASE Human /cromwell_root/tmp/fastq_screen_reference/genome
  71. DATABASE Mouse /cromwell_root/tmp/fastq_screen_reference/mouse
  72. DATABASE ERCC /cromwell_root/tmp/fastq_screen_reference/ERCC
  73. DATABASE EColi /cromwell_root/tmp/fastq_screen_reference/ecoli
  74. DATABASE Adapter /cromwell_root/tmp/fastq_screen_reference/adapters
  75. DATABASE Vector /cromwell_root/tmp/fastq_screen_reference/vector
  76. DATABASE rRNA /cromwell_root/tmp/fastq_screen_reference/rRNARef
  77. DATABASE Virus /cromwell_root/tmp/fastq_screen_reference/viral
  78. DATABASE Yeast /cromwell_root/tmp/fastq_screen_reference/GCF_000146045.2_R64_genomic_modify
  79. DATABASE Mitoch /cromwell_root/tmp/fastq_screen_reference/Human_mitoch
  80. DATABASE Phix /cromwell_root/tmp/fastq_screen_reference/phix