Data analysis for ChIC/CUT&RUN
Learn how to analyze ChIC/CUT&RUN data, from read QC and alignment to peak calling, control normalization, fragment‑size profiling, and downstream interpretation.
CUT&RUN generates highly enriched, target‑specific DNA fragments with low background, enabling streamlined and efficient analysis compared with traditional ChIP‑seq. Although many of the downstream steps are conceptually similar to ChIP‑seq analysis, several features of CUT&RUN data—such as shorter fragment sizes, sharper peak shapes, and lower noise—require tailored handling for optimal results.
A typical data‑analysis workflow includes the following stages:
1. Read processing and quality assessment
Raw FASTQ files should first be assessed with standard quality‑control tools (such as FastQC or MultiQC). Because CUT&RUN produces short fragments, high read quality and accurate trimming are essential. Adapter trimming is often required due to the prevalence of sub‑100 bp inserts generated during targeted MNase digestion.
2. Alignment
Trimmed reads are aligned to the reference genome using a short‑read aligner such as Bowtie2, typically with settings optimized for very short fragments. CUT&RUN data align efficiently and with high specificity, and duplicate reads should be interpreted carefully—true biological enrichment can lead to naturally high redundancy at strong binding sites.
3. Fragment size selection (optional)
CUT&RUN yields a mixture of mono‑nucleosomal and sub‑nucleosomal fragments. Size‑selective analyses can provide biological insight:
- <120 bp fragments often correspond to transcription factor footprints or nucleosome‑depleted regions.
- ~150–200 bp fragments typically reflect intact nucleosomes.
Subsetting by fragment size can help resolve binding profiles or improve visualization of transcription factor peaks.
4. Peak calling
Peak callers that support sharp, high‑resolution features work well with CUT&RUN. Tools such as MACS2 (with --nomodel and narrow‑peak parameters) or CUT&RUN‑specific utilities like SEACR perform effectively. The low background characteristic of CUT&RUN typically results in clear, high‑confidence peak sets, even with modest sequencing depth.
5. Control normalization
IgG controls or no‑antibody controls are used to define the background and help distinguish low‑level enrichment from noise. Including internal positive controls (such as H3K27me3 or H3K4me3) can also serve as a benchmark for assessing dataset quality and antibody performance.
6. Visualization
Genome‑browser tracks can be generated using tools like deepTools or IGV. CUT&RUN produces sharp, well‑defined peaks, so smaller bin sizes and high‑resolution settings are recommended. Tracks often appear cleaner and more interpretable than corresponding ChIP‑seq data, even with fewer reads.
7. Downstream interpretation
Final analyses can include motif discovery, differential binding analysis, genomic annotation, nucleosome mapping, or integration with transcriptomic data. Because CUT&RUN offers near‑native mapping conditions, results often align more directly with gene‑regulatory activity and chromatin state.
Troubleshooting
See our detailed troubleshooting guides from experts to common issues in ChIP to get your experiment back on track.
button-primary
Guides
Find out about the different epigenetic factors which require analysis by ChIP and help determine which ChIP method is right for you in our guide to ChIP.
button-primary
Webinars
Watch our on-demand webinar to learn what ChIP-seq datasets should look like and the types of results you can extract.
button-primary