ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads...

27
ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

Transcript of ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads...

Page 1: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

ATAC-seq Analysis

● Process the reads● Align the reads● Call peaks● Investigate the location of peaks

Page 2: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

ATAC-seq Analysis

● Process the reads● Align the reads● Call peaks● Investigate the location of peaks

Page 3: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

Ca. 10M to 20M read pairs per library

Page 4: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

bowtie2 --threads 12 \ --very-sensitive \ --maxins 2000 \ --no-discordant \ -x $genomedir \ -1 "$left" \ -2 "$right" | samtools view -bS - -o $bam;

samtools:- sort by name- fixmate (Fill in mate coordinates, ISIZE and mate related flags)- rmdup (Remove potential PCR duplicates)- sort by position- index

Not in effect unless --no-mixed is also set

Align reads using bowtie2, process using samtools

Page 5: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

High alignment rate of around 92%

Page 6: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

But, many low-quality alignments

Alignedconcordantly

Page 7: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

But, many low-quality alignments

Alignedconcordantly

$%#@*! chloroplasts!

Page 8: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

Ca. 4.6% of alignments have q > 30al

ignm

ent

s

Page 9: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

cont

rol

heat

Consistent read coverage across libraries

Page 10: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

Consistent read coverage across libraries

Page 11: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

Noise in low-coverage regions

Page 12: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

ATAC-seq Analysis

● Process the reads● Align the reads● Call peaks● Investigate the location of peaks

Page 13: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

Many options for peak calling

● PeaKDEck● Hotspot● F-Seq● MACS● ZINBA● HOMER

Page 14: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

Many options for peak calling: PeaKDEck

Page 15: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

Many options for peak calling: Comparison paper

Page 16: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

Define cut site as first base of aligned fragment and firstbase after aligned fragment

Count number of cuts in every (overlapping) 72bp window

Transform aligned reads to cut sites

Read coverage

Cut sites

Page 17: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

Pairwise comparison of libraries

Pearson correlation of number of cuts (log10) in allwindows that had more than 1 cut in both libraries

Replicates not more similar than non-replicates

Page 18: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

Global distribution of ATAC cut sites similar to DHS

Page 19: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

Ignore treatment label, detect open regions based on all 18libraries

Count number of cut sites that fall into 72 bp windowcentered on each base

Consider base open if its window contained at least one cutsite in more than half of the libraries

If two open bases are less than 72 bp apart, call allintermediate bases open

Tools used: R & Rsamtools library

Custom peak caller based on all libraries

Page 20: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

Peaks (open regions) seem to make sense

Page 21: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

Peaks (open regions) seem to make sense

Page 22: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

Peaks (open regions) seem to make sense

Page 23: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

ATAC-seq Analysis

● Process the reads● Align the reads● Call peaks● Investigate the location of peaks

Page 24: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

● Ca. 30k open regions● Covering 8M base pairs● Average length: 268 bp● Median length: 206 bp

We call 2% of the genome open

Page 25: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

Open regions highly enriched upstream and in 5' UTR

Page 26: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

Open regions density highest just before TSS

Page 27: ATAC-seq Analysis - GenCore...Oct 15, 2015  · ATAC-seq Analysis Process the reads Align the reads Call peaks Investigate the location of peaks

Number of ATAC cut sites in promoter is correlated withexpression