Statistical Methods for Genome-Wide Regional Analysis with Next-Generation Sequencing Data
Hao Wu, Emory University, and
Rafael A Irizarry, DFCI/Harvard
Outline
• Introduction to Next Generation Sequencing (NGS)
• Motivation for region finding
• ChIP-Seq
• Whole genome bisulfite sequencing (WGBS)
• Computer Lab
D. melanogaster: Science, 2000. H. sapiens: Nature, 2000 and Science, 2000. M. musculus: Nature, 2002.
• Back then: millions of clones (thousands of bp each) in 9 months for billions of dollars
• Today: billions of short reads (35-100 bp) in a week for thousands of dollars
• Claim: Assemble a genome in weeks for less than $100,000
Sequence first 35-400 bp: call them “reads”
GTTGAGGCTTGCGTTTTTGGTACGCTGGACTTTGTGTACTCGTCGCTGCGTTGAGGCTTGCGTTTTTGGTATGGTACGCTGGACTTTGTAGGATACCCTCGCTTTTTGCGTTTATGGTACGCTGGACTTTGTAGGATACCCTTGCGTTTATGGTACGCTGGACTTTGTAGGATACTTGCGTTTATGGTACGCTGGACTTTGTAGGATACCGCGTTTATGGTACGCTGGACTTTGTAGGATACCCTGAGGCTTGCGTTTATGGTACGCTGGACTTTGTAGGGCGTTGAGGCTTGCGTTTATGGTACGCTGGATTTTCGTTTATGGTACGCTGGACTTTGTAGGATACCCTCATGGTACGCTGGACTTTGTAGGATACCCTCGCTTT GTTTATGGTACGCTGGACTTTGTAGGATACCCTCGTCTCGTGCTCGTCGCTGCGTTGAGGCTTGCGTTTA TGCTCGTCGCTGCGTTGAGGCTTGCGTTTATGGTAGCTCGTCGCTGCGTTGAGGCTTGCGTTTATGGTAC TATGGTACGCTGGACTTTGTAGGATACCCTCGCTTTCGTGCTCGTCGCTGCGTTGAGGCTTGCGTTTTTGCGTCGCTGCGTTGAGGCTTGCGTTTATGGTACGCTGTTGAGGCTTGCGTTTATGGTACGCTGGGCTTTTT TTGCGTTTATGGTACGCTGGACTTTGTAGGATACC
Available platforms
• Major players:
– Illumina: HiSeq, MiSeq.
– LifeTech: SOLiD, IonTorrent.
– Roche 454.
• Others:
– Complete Genomics
– Pacific Biosciences
– Helicos
Illumina “flow cell”
• Eight lanes
• ~160M short reads (~50-70 bp) per lane
• Size: ~7 cm × ~2 cm
Image: Illumina logo, http://www.illumina.com/
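As a rough back-of-the-envelope check of the flow cell figures, the sketch below multiplies lanes by reads per lane by read length. The lane count, reads per lane, and read lengths come from the slide; the 3.1 Gb genome size used for the fold-coverage estimate is an assumption (an approximate human genome size), not a number from the slides.

```python
# Figures from the slide: 8 lanes, ~160M reads per lane, ~50-70 bp reads.
LANES = 8
READS_PER_LANE = 160_000_000
READ_LENGTHS_BP = (50, 70)

# Assumption (not from the slides): approximate size of a human genome.
GENOME_SIZE_BP = 3_100_000_000


def flow_cell_yield(read_length_bp):
    """Total number of bases sequenced on one flow cell for a given read length."""
    return LANES * READS_PER_LANE * read_length_bp


for length in READ_LENGTHS_BP:
    bases = flow_cell_yield(length)
    fold = bases / GENOME_SIZE_BP
    print(f"{length} bp reads: {bases / 1e9:.1f} Gb total, about {fold:.0f}x coverage")
```

The fold-coverage number is only an average; actual coverage varies along the genome, which is what the later pileup slides illustrate.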
Bridge amplification
Sequencing all bases at once
Images from sequencing machine
Each read: name, sequence, quality scores (× 100s of millions)
Base calls per cycle: base 1 (A, C, G, or T), base 2 (A, C, G, or T), base 3 (A, C, G, or T), …
Raw sequence reads from NGS
• Large text file (millions of lines) with simple format.
– Most frequently used: fasta/fa format for storing the sequences, or fastq format for storing both the sequences and the corresponding quality scores.
• fasta format (each record: read name, then read sequence):
>5_143_428_832
GATATTGTAGCATAACGCAACTTGGGAGGTGAGCTT
>5_143_984_487
GTTTTCATGCCTCCAAATCTTGGAGGCTTTTTTATG
>5_143_963_690
GGTATATGCACAAAATGAGATGCTTGCTTATCAACA
>5_143_957_461
GGAGGGTGTCAATCCTGACGGTTATTTCCTAGACAA
>5_143_808_403
GATAACCGCATCAAGCTCTTGGAAGAGATTCTGTCT
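To make the fasta layout concrete, here is a minimal reader sketch. The function name `parse_fasta` is hypothetical, and the two records are copied from the slide; real pipelines typically rely on an established library rather than hand-written parsing.

```python
def parse_fasta(lines):
    """Yield (read_name, sequence) pairs from an iterable of fasta lines."""
    name, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if name is not None:
                yield name, "".join(chunks)
            name, chunks = line[1:], []
        else:
            # Sequences may span several lines; collect the pieces.
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


example = """>5_143_428_832
GATATTGTAGCATAACGCAACTTGGGAGGTGAGCTT
>5_143_984_487
GTTTTCATGCCTCCAAATCTTGGAGGCTTTTTTATG
""".splitlines()

reads = list(parse_fasta(example))
for name, seq in reads:
    print(name, len(seq), seq)
```

Because the function is a generator, it can also be fed an open file handle and will process a file of millions of lines without loading it all into memory.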
Second-generation sequencing data: fastq format
Each record has four lines: read name, read sequence, separator, quality scores.
@HWI-EAS165:1:1:50:908:1
CTGCGGTCTCTAAAGTGCCATCTCATTGTGCTTTGTATCAGTCAGTGCTGGA
+
BCCBCB8ABBBBBBB:BC=8@BBA:@BB@BBBCBB<9BBAC;A<C?BAAB<#
@HWI-EAS165:1:1:50:0:1
NCAACCCCCACAGTAATATGTAAAACAAAAACTAAAACCAGGAGCTGAAGGG
+
#BABABBBBBB@08<@?A@7:A@CCBCCCCBBBCCBB=?BBBB@7@B=A>:2
@HWI-EAS165:1:1:50:708:1
GGTCAGCATGTCTTCTGTTAAGTGCTTGCACAAGCTAGCCTCTGCCTATGGG
+
BB@A;B>@A@@=BB=BB?A>@@>B?ABBA=A?@@>@@A:=?>?A@=B8@@AB
@HWI-EAS165:1:1:50:1494:1
CTGGTGTCACACAAGCAGGTCTCCTGTGTTGACTTCACCAGACACTGTCATT
+
BCBB@AB@1ABBBBBBAAB?BBBBAB<A?AA>BB@?1ABBA@BBBA@;B>>:
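The four-line fastq record can be read and its quality string decoded as in the sketch below. The function names are hypothetical, and the ASCII offset of 33 is an assumption: the slides do not state the encoding, although characters such as `8` and `;` in the example (ASCII codes below 64) are consistent with it. The record is the third one shown on the slide.

```python
PHRED_OFFSET = 33  # assumption: quality score = ASCII code - 33


def parse_fastq(lines):
    """Yield (read_name, sequence, quality_string) from 4-line fastq records.

    Sketch for small inputs: it collects all non-empty lines in memory first.
    """
    rows = [ln.strip() for ln in lines if ln.strip()]
    if len(rows) % 4 != 0:
        raise ValueError("fastq input must contain 4 lines per record")
    for i in range(0, len(rows), 4):
        header, seq, sep, qual = rows[i:i + 4]
        if not header.startswith("@") or not sep.startswith("+"):
            raise ValueError(f"malformed record number {i // 4 + 1}")
        if len(seq) != len(qual):
            raise ValueError(f"sequence/quality length mismatch in {header}")
        yield header[1:], seq, qual


def phred_scores(qual, offset=PHRED_OFFSET):
    """Convert a quality string to a list of integer Phred scores."""
    return [ord(ch) - offset for ch in qual]


def error_probability(q):
    """Base-call error probability implied by Phred score q: 10^(-q/10)."""
    return 10 ** (-q / 10)


record = [
    "@HWI-EAS165:1:1:50:708:1",
    "GGTCAGCATGTCTTCTGTTAAGTGCTTGCACAAGCTAGCCTCTGCCTATGGG",
    "+",
    "BB@A;B>@A@@=BB=BB?A>@@>B?ABBA=A?@@>@@A:=?>?A@=B8@@AB",
]
name, seq, qual = next(parse_fastq(record))
scores = phred_scores(qual)
print(name, len(seq), min(scores), max(scores))
```

If the data used a different offset, only `PHRED_OFFSET` would need to change.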
Single-end vs. paired-end sequencing
• Sequence one or both ends of the DNA segments.
• Single-end sequencing: sequence one end of the DNA segment.
• Paired-end sequencing: sequence both ends of a DNA segment.
– Resulting reads are “paired”, separated by a certain length (the length of the DNA segment, usually a few hundred bp).
– Paired-end data can be used as single-end data, but contain extra information that is useful in some cases, e.g., detecting structural variations in the genome.
– Modeling is more complicated.
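One way to see the extra information in paired-end data is to compute the implied segment length from the mapped positions of the two mates. The sketch below is illustrative only: the `Mate` class, the coordinates, and the expected length and tolerance are all hypothetical, and it ignores strand and orientation.

```python
from dataclasses import dataclass


@dataclass
class Mate:
    start: int   # 0-based leftmost mapped position on the reference
    length: int  # read length in bp


def fragment_length(mate1, mate2):
    """Distance spanned on the reference by a read pair (outer coordinates)."""
    left = min(mate1.start, mate2.start)
    right = max(mate1.start + mate1.length, mate2.start + mate2.length)
    return right - left


def is_discordant(frag_len, expected=300, tolerance=100):
    """Flag pairs whose implied segment length is far from the expected one.

    The expected length and tolerance are hypothetical values.
    """
    return abs(frag_len - expected) > tolerance


pair_ok = (Mate(1000, 50), Mate(1250, 50))
pair_far = (Mate(1000, 50), Mate(5000, 50))

for pair in (pair_ok, pair_far):
    frag = fragment_length(*pair)
    print(frag, "discordant" if is_discordant(frag) else "concordant")
```

A pair that maps much farther apart than the library's segment length is the kind of signal that can point to a structural variation, such as a deletion in the sample relative to the reference.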
Segment 2 – Applications of NGS in Genomics
Not just Assembly
• Resequencing
• SNP discovery and genotyping
• Variant discovery and quantification
• TF binding sites: ChIP-Seq
• Gene expression: RNA-Seq
• Measuring methylation
Image: DNA methylation 18.02.2006 Christoph Bock, http://commons.wikimedia.org/wiki/File:DNA_methylation.jpg CCBY
What to do with all these sequences?
GTTGAGGCTTGCGTTTTTGGTACGCTGGACTTTGTGTACTCGTCGCTGCGTTGAGGCTTGCGTTTTTGGTATGGTACGCTGGACTTTGTAGGATACCCTCGCTTTTTGCGTTTATGGTACGCTGGACTTTGTAGGATACCCTTGCGTTTATGGTACGCTGGACTTTGTAGGATACTTGCGTTTATGGTACGCTGGACTTTGTAGGATACCGCGTTTATGGTACGCTGGACTTTGTAGGATACCCTGAGGCTTGCGTTTATGGTACGCTGGACTTTGTAGGGCGTTGAGGCTTGCGTTTATGGTACGCTGGATTTTCGTTTATGGTACGCTGGACTTTGTAGGATACCCTCATGGTACGCTGGACTTTGTAGGATACCCTCGCTTT GTTTATGGTACGCTGGACTTTGTAGGATACCCTCGTCTCGTGCTCGTCGCTGCGTTGAGGCTTGCGTTTA TGCTCGTCGCTGCGTTGAGGCTTGCGTTTATGGTAGCTCGTCGCTGCGTTGAGGCTTGCGTTTATGGTAC TATGGTACGCTGGACTTTGTAGGATACCCTCGCTTTCGTGCTCGTCGCTGCGTTGAGGCTTGCGTTTTTGCGTCGCTGCGTTGAGGCTTGCGTTTATGGTACGCTGTTGAGGCTTGCGTTTATGGTACGCTGGGCTTTTT TTGCGTTTATGGTACGCTGGACTTTGTAGGATACC
Most apps: Start by matching to reference
GTTGAGGCTTGCGTTTTTGGTACGCTGGACTTTGT GTACTCGTCGCTGCGTTGAGGCTTGCGTTTTTGGT
ATGGTACGCTGGACTTTGTAGGATACCCTCGCTTT TTGCGTTTATGGTACGCTGGACTTTGTAGGATACC
CTTGCGTTTATGGTACGCTGGACTTTGTAGGATAC TTGCGTTTATGGTACGCTGGACTTTGTAGGATACC GCGTTTATGGTACGCTGGACTTTGTAGGATACCCT
GAGGCTTGCGTTTATGGTACGCTGGACTTTGTAGG GCGTTGAGGCTTGCGTTTATGGTACGCTGGATTTT
CGTTTATGGTACGCTGGACTTTGTAGGATACCCTC ATGGTACGCTGGACTTTGTAGGATACCCTCGCTTT
GTTTATGGTACGCTGGACTTTGTAGGATACCCTCG TCTCGTGCTCGTCGCTGCGTTGAGGCTTGCGTTTA
TGCTCGTCGCTGCGTTGAGGCTTGCGTTTATGGTA GCTCGTCGCTGCGTTGAGGCTTGCGTTTATGGTAC
TATGGTACGCTGGACTTTGTAGGATACCCTCGCTT TCGTGCTCGTCGCTGCGTTGAGGCTTGCGTTTTTG
CGTCGCTGCGTTGAGGCTTGCGTTTATGGTACGCT GTTGAGGCTTGCGTTTATGGTACGCTGGGCTTTTT
TTGCGTTTATGGTACGCTGGACTTTGTAGGATACCCTCTCGTGCTCGTCGCTGCGTTGAGGCTTGCGTTTATGGTACGCTGGACTTTGTAGGATACCCTCGCTTTC
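The idea of matching reads to a reference can be sketched with a brute-force scan that allows a few mismatches. This is not how production aligners work (they use indexed data structures and handle gaps); `map_read`, the mismatch limit, and the short reference fragment below are illustrative assumptions, with the fragment and reads taken from the variant-detection example.

```python
def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def map_read(read, reference, max_mismatches=1):
    """Return (position, mismatches) of the best ungapped placement, or None.

    Every possible start position is tried, so the cost grows with
    len(reference) * len(read); this is only practical for toy examples.
    """
    best = None
    for pos in range(len(reference) - len(read) + 1):
        d = hamming(read, reference[pos:pos + len(read)])
        if d <= max_mismatches and (best is None or d < best[1]):
            best = (pos, d)
    return best


reference = "CCCTATGTCGCAGTATCTGTCTTTGATTCC"
reads = ["GTCGCAGTATCTGTCT", "GTCGCAGTANCTGTCT", "AAAAAAAAAAAAAAAA"]

for read in reads:
    print(read, map_read(read, reference))
```

The read containing an `N` still maps because one mismatch is tolerated, while the all-`A` read is reported as unmapped.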
Variant detection
Align
GTCGCAGTANCTGTCT
||||||||| ||||||
GTCGCAGTATCTGTCT

GGATCTGCGATATACC
|||||| |||||||||
GGATCT-CGATATACC

AATCTGATCTTATTTT
||||||||||||||||
AATCTGATCTTATTTT

ATATATATATATATAT
||||||||||||||||
ATATATATATATATAT

TCTCTCCCANNAGAGC
|||||||||  |||||
TCTCTCCCAGGAGAGC

Call: HET A, G
p-value: 0.0023
Reference
GATCACAGGTCTATCACCCTATTAACCACTCACGGGAGCTCTCCATGCATTTGGTATTTTCGTCTGGGGGGTATGCACGCGATAGCATTGCGAGACGCTGGAGCCGGAGCACCCTATGTCGCAGTATCTGTCTTTGATTCCTGCCTCATCCTATTATTTATCGCACCTACGTTCAATATT
GTCGCAGTATCTGTCTGTCGCAGTATCTGTNNTGTCGCAGTATCTGTC
TATGTCGCAGTATCTGTATATCGCAGTATCTTTATATCGCAGTATCTGNATATCGCAGTATNTG
CCCTATATCGCAGTATACACCCTATGTCGCAACACCCTATCTCGCAACACCCTATGTCGCA
GA-CACCCTATGTCGCCCGGA-CACCCTATATCCGGA-CACCCTATATGCCGGA-CACCCTATG
“Coverage”
“Pileup” or “Coverage plot”
“Depth of coverage” = 14
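A pileup and its depth of coverage can be computed from ungapped alignments as in the sketch below. The start positions and the shortened reference are assumptions chosen so that three reads from the slide overlap one site; the code only counts bases and does not reproduce the statistical test behind the slide's p-value.

```python
from collections import Counter


def pileup(alignments, ref_len):
    """Count bases per reference position.

    alignments: list of (start, read_sequence) pairs, ungapped, 0-based start.
    Returns one Counter per reference position. Every character is counted,
    including N.
    """
    columns = [Counter() for _ in range(ref_len)]
    for start, seq in alignments:
        for offset, base in enumerate(seq):
            pos = start + offset
            if 0 <= pos < ref_len:
                columns[pos][base] += 1
    return columns


def depth(columns):
    """Depth of coverage: number of reads covering each position."""
    return [sum(col.values()) for col in columns]


reference = "CCCTATGTCGCAGTATCTGTCT"
alignments = [
    (6, "GTCGCAGTATCTGTCT"),
    (3, "TATGTCGCAGTATCTG"),
    (3, "TATATCGCAGTATCTT"),
]

columns = pileup(alignments, len(reference))
cov = depth(columns)
print(cov)
print(dict(columns[6]))  # base counts at the site where the reads disagree
```

At a heterozygous site the column contains two bases in comparable numbers, whereas a sequencing error usually shows up as a single discordant read.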
RNA-seq differential expression
Align
GATCACAGGTCTATCACCCTATTAACCACTCACGGGAGCTCTCCATGCATTTGGTATTTTCGTCTGGGGGGTATGCACGCGATAGCATTGCGAGACGCTGGAGCCGGAGCACCCTATGTCGCAGTATCTGTCTTTGATTCCTGCCTCATCCTATTATTTATCGCACCTACGTTCAATATT
GTCGCAGTATCTGTCTGTCGCAGTATCTGTCTGTCGCAGTATCTGTCTGTCGCAGTATCTGTCTGTCGCAGTATCTGTCTTGTCGCAGTATCTGTC
TATGTCGCAGTATCTGTATATCGCAGTATCTGTATATCGCAGTATCTGTATATCGCAGTATCTG
CCCTATATCGCAGTATAGCACCCTATGTCGCAAGCACCCTATATCGCAAGCACCCTATGTCGCAGAGCACCCTATGTCGC
CCGGAGCACCCTATATCCGGAGCACCCTATATGCCGGAGCACCCTATG
TGTCGCAGTATCTGTCAGCACCCTATGTCGCA
GCCGGAGCACCCTATG
GTCGCAGTANCTGTCT
||||||||| ||||||
GTCGCAGTATCTGTCT

GGATCTGCGATATACC
|||||| |||||||||
GGATCT-CGATATACC

AATCTGATCTTATTTT
||||||||||||||||
AATCTGATCTTATTTT

Align
Sample B
Sample A
GTCGCAGTANCTGTCT
||||||||| ||||||
GTCGCAGTATCTGTCT

GGATCTGCGATATACC
|||||| |||||||||
GGATCT-CGATATACC

AATCTGATCTTATTTT
||||||||||||||||
AATCTGATCTTATTTT
Gene 1
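Comparing a gene between two samples starts from the number of reads aligned to it in each sample, scaled by how many reads each sample produced overall. The sketch below shows that scaling and a log2 fold change; all counts, library sizes, and the pseudocount are hypothetical, and it performs no statistical test (which would need replicates and a count model).

```python
import math


def reads_per_million(gene_count, library_size):
    """Scale a gene's read count by the total number of mapped reads."""
    return gene_count * 1_000_000 / library_size


def log2_fold_change(count_a, lib_a, count_b, lib_b, pseudocount=0.5):
    """log2 of (sample B / sample A) after library-size normalization.

    The pseudocount (a hypothetical choice) avoids division by zero
    when a gene has no reads in one sample.
    """
    rpm_a = reads_per_million(count_a + pseudocount, lib_a)
    rpm_b = reads_per_million(count_b + pseudocount, lib_b)
    return math.log2(rpm_b / rpm_a)


# Hypothetical numbers for "Gene 1" in the two samples.
gene1_a, library_a = 200, 10_000_000
gene1_b, library_b = 800, 20_000_000

lfc = log2_fold_change(gene1_a, library_a, gene1_b, library_b)
print(reads_per_million(gene1_a, library_a),
      reads_per_million(gene1_b, library_b),
      round(lfc, 3))
```

Sample B has four times as many reads on the gene but was sequenced twice as deeply, so the normalized difference is about two-fold rather than four-fold.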
ChIP-seq
Reference
GATCACAGGTCTATCACCCTATTAACCACTCACGGGAGCTCTCCATGCATTTGGTATTTTCGTCTGGGGGGTATGCACGCGATAGCATTGCGAGACGCTGGAGCCGGAGCACCCTATGTCGCAGTATCTGTCTTTGATTCCTGCCTCGGGAGCTCTCCAGGGAGCTCTCCA
GTCGCAGTANCTGTCT
||||||||| ||||||
GTCGCAGTATCTGTCT

GGATCTGCGATATACC
|||||| |||||||||
GGATCT-CGATATACC

AATCTGATCTTATTTT
||||||||||||||||
AATCTGATCTTATTTT

ATATATATATATATAT
||||||||||||||||
ATATATATATATATAT

TCTCTCCCANNAGAGC
|||||||||  |||||
TCTCTCCCAGGAGAGC
Align
ChIP-seq
GTCGCAGTANCTGTCT
||||||||| ||||||
GTCGCAGTATCTGTCT

GGATCTGCGATATACC
|||||| |||||||||
GGATCT-CGATATACC

AATCTGATCTTATTTT
||||||||||||||||
AATCTGATCTTATTTT

ATATATATATATATAT
||||||||||||||||
ATATATATATATATAT

TCTCTCCCANNAGAGC
|||||||||  |||||
TCTCTCCCAGGAGAGC

Align
Binding occurs here
p-value: 0.0023
Reference
GATTCCTGCCTCGATCACAGGTCTATCACCCTATTAACCACTCACGGGAGCTCTCCATGCATTTGGTATTTTCGTCTGGGGGGTATGCACGCGATAGCATTGCGAGACGCTGGAGCCGGAGCACCCTATGTCGCAGTATCTGTCTTTGATTCCTGCCTCGGGAGCTCTCCAGGGAGCTCTCCA
GTCGCAGTATCTGTCTGTCGCAGTATCTGTCTTGTCGCAGTATCTGTC
TATGTCGCAGTATCTGTATATCGCAGTATCTGTATATCGCAGTATCTGTATATCGCAGTATCTG
CCCTATATCGCAGTATCCCTATATCGCAGTATCCCTATATCGCAGTATCCCTATATCGCAGTATCCCTATATCGCAGTATCCCTATATCGCAGTATCCCTATATCGCAGTAT
AGCACCCTATGTCGCAAGCACCCTATATCGCAAGCACCCTATGTCGCAGAGCACCCTATGTCGC
CCGGAGCACCCTATATCCGGAGCACCCTATATGCCGGAGCACCCTATG
TATGCACGCGATAGCAGATAGCATTGCGAGAC
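Finding where binding occurs amounts to asking which regions hold more aligned reads than expected by chance. The sketch below bins read start positions into fixed windows and scores each window against a uniform Poisson background. The Poisson background, window size, and read positions are assumptions made for illustration; the slides do not say how the p-value shown was computed.

```python
import math


def window_counts(starts, genome_len, window):
    """Count read start positions in consecutive non-overlapping windows.

    Assumes every start satisfies 0 <= start < genome_len.
    """
    counts = [0] * ((genome_len + window - 1) // window)
    for s in starts:
        counts[s // window] += 1
    return counts


def poisson_sf(k, lam):
    """P(X >= k) for X ~ Poisson(lam), by summing the lower tail."""
    term = math.exp(-lam)  # P(X = 0)
    cdf = 0.0
    for i in range(k):
        cdf += term
        term *= lam / (i + 1)  # P(X = i + 1) from P(X = i)
    return max(0.0, 1.0 - cdf)


# Hypothetical data: one read per 100 bp window, plus 9 extra reads in window 5.
genome_len, window = 1000, 100
starts = [w * 100 + 10 for w in range(10)] + [500 + 5 * i for i in range(1, 10)]

counts = window_counts(starts, genome_len, window)
background = sum(counts) / len(counts)  # mean reads per window
p_values = [poisson_sf(c, background) for c in counts]

best = min(range(len(counts)), key=lambda w: p_values[w])
print(counts)
print(best, p_values[best])
```

In practice the background is rarely uniform, so the expected count is usually estimated locally or from a control sample instead of from a genome-wide mean.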