Newer
Older
**peak_calling** workflow , which agree to FAIR principles , was built in Nexflow dsl2 language, with singularity container for used softwares, optimized in terms of computing resources (cpu, memory), and its use on a informatic farm with a slurm scheduler.
- Peaks calling (broad/narrow) was achieved using MACS3 [1]
- normalised bigWig files scaled to 1 million mapped reads with BEDTools [2] and bedGraphToBigWig [3]
- genome-wide enrichment with deepTools [4].
[genome_aln](https://forgemia.inra.fr/lpgp/peak_calling) workflow is available for first step.
## Install flow and build singularity image
Clone peak_calling git and build local singularity image (with system admin rights) based on the provided singularity definition file.
```bash
git clone https://forgemia.inra.fr/lpgp/peak_calling.git
sudo singularity build ./peak_calling/singularity/peak_calling.sif ./genome_aln/singularity/peak_calling.def
```
## Usage example ATAC seq
design.csv file must have *ID* and *target* header and write with comma separator.
|ID|target|
|:-|:-|
|A|/path/to/targetA.bam|
|B|/path/to/targetB.bam|
|C|/path/to/targetC.bam|
### narrow peaks
#SBATCH -J peak_calling
#SBATCH --mem=10GB
#SBATCH -p unlimitq
nextflow run /work/project/lpgp/Nextflow/peak_calling/ \
-profile slurm \
--gsize 2.34167e+09 \
--qvalue 0.05 \
--out_dir "${PWD}/results/"
#SBATCH -J peak_calling
#SBATCH --mem=10GB
#SBATCH -p unlimitq
nextflow run /work/project/lpgp/Nextflow/peak_calling/ \
-profile slurm \
--ATAC \
--gsize 2.34167e+09 \
--qvalue 0.05 \
--broad \
--broad_cutoff 0.1 \
--out_dir "${PWD}/results"
## Usage example CHIP seq
design.csv file must have *ID*, *target* and *input* header and write with comma separator.
|ID|target|input|
|:-|:-|:-|
|A|/path/to/targetA.bam|/path/to/inputA.bam|
|B|/path/to/targetB.bam|/path/to/inputB.bam|
|C|/path/to/targetC.bam|/path/to/inputC.bam|
### narrow peaks
#SBATCH -J peak_calling
#SBATCH --mem=10GB
#SBATCH -p unlimitq
nextflow run /work/project/lpgp/Nextflow/peak_calling/ \
-profile slurm \
--gsize 2.34167e+09 \
--qvalue 0.05 \
--out_dir "${PWD}/results/"
```
### broad peaks
```bash
#!/bin/bash
#SBATCH -J peak_calling
#SBATCH --mem=10GB
#SBATCH -p unlimitq
nextflow run /work/project/lpgp/Nextflow/peak_calling/ \
-profile slurm \
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
--gsize 2.34167e+09 \
--qvalue 0.05 \
--broad \
--broad_cutoff 0.1 \
--out_dir "${PWD}/results"
```
## Defaults parameters
Please refer to [macs2](https://github.com/FelixKrueger/TrimGalore), and [deeptools](https://deeptools.readthedocs.io/en/develop/) for complete arguments explanation.
```bash
# design file
input = false
# bam coverage
skip_coverage = false
# bam depth
skip_depth = false
# plotfingerprint
numberOfSamples = 500000
skip_plotfingerprint = false
# MACS2 parameters
gsize = false
qvalue = 0.05
pvalue = false
broad = false
broad_cutoff = 0.1
# count
min_reps_consensus = 1
# ATAC seq
ATAC = false
# save directory
out_dir = "${PWD}/results"
## References
1. MACS: Model-based analysis for ChIP-seq [Internet]. Available from: https://github.com/macs3-project/MACS
2. Bedtools: A powerful toolset for genome arithmetic [Internet]. Available from: https://bedtools.readthedocs.io/en/latest/
3. bedGraphToBigWig [Internet]. Available from: http://hgdownload.soe.ucsc.edu/admin/exe/linux.x86_64.v385/bedGraphToBigWig
4. Ramírez F, Ryan DP, Grüning B, Bhardwaj V, Kilpert F, Richter AS, et al. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 2016;44:W160–165.