ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason...

55
ChromHMM Tutorial Jason Ernst Assistant Professor University of California, Los Angeles

Transcript of ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason...

Page 1: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

ChromHMM Tutorial

Jason Ernst Assistant Professor

University of California Los Angeles

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

     

Chroma=n Marks for Genome Annota=on100+ histone modifications Specificity in bull Histone protein bull Amino acid residue bull Chemical modification (eg

methyl acetylation) bull Number of occurrence of

the modifications

Examples H3K4me1 ndash Enhancers H3K4me3 ndash Promoters H3K27me3 ndash Repressive H3K9me3 ndash Repressive H3K36me3 ndash Transcribed

Histone Modifications can be Mapped Genome-wide with ChIP-seq

Image source httpnihroadmapnihgovepigenomics

From lsquochromaMn marksrsquo to lsquochromaMn statesrsquo

bull Learn de novosignificant13

Promoter states

Transcribed states combinaMonal andspaMal paOerns ofchromaMn marks

Active Intergenic

Repressed bull Reveal funcMonalelements evenwithout13 looking atsequence

bull Use13 for13 genome13 annotaMon

Ernst and Kellis Nat Biotech 2010

Our approach Mul=variate Hidden Markov Model(ChromHMM)13

Enhancer Gene Starts Gene -shy‐ Transcribed Region DNA

BinarizedchromaMnmarks Calledbased on apoissondistribuMon

Most13 likely HiddenState 2

H3K4me3 H3K36me3 H3K36me3 H3K36me3 H3K36me3 H3K4me1 H3K4me3 H3K4me1

H3K27ac H3K4me1

1 3 4 6 6 6 6 6 5 5 5

Unobserved

High Probability ChromaMn0713 Marks in State

0813 0813 200 base pair interval 1 4K27ac

3

2 0913

0913

0813 K4me1 H3K4me3

H3K4me3

H3K4me1 H3K4me1 All probabiliMesEmission distribution is a product of independent are learned from5Bernoulli random variables

0913 the data13

6H3K36me3

Binarization leads to explicit modeling of mark combinations and interpretable parameters 6

Ernst and Kellis Nat Biotech 2010 Ernst and Kellis Nature Methods 2012

ENCODE Study nine marks in nine human cell lines 9 marks 9 human13 cell13 types13 81 Chroma=n Mark13 TracksH3K4me1

H3K4me2

H3K4me3

H3K27ac

H3K9ac

H3K27me3

H4K20me1

H3K36me3

CTCF

+WCE

+RNA

HUVEC Umbilical vein endothelial

NHEK Keratinocytes

GM12878 Lymphoblastoid

K562 Myelogenous leukemia

HepG2 Liver carcinoma

NHLF Normal human lung fibroblast

HMEC Mammary epithelial cell

HSMM Skeletal muscle myoblasts

H1 Embryonic

x

Ernst et al Nature 2011

bull Learned jointly across cell types (virtual concatenation)

bull State definitions are common

bull State locations are dynamic

Brad Bernstein ENCODE Group HepG2 NHLF HMEC HSMM HUVEC NHEK GM12878 K562 H1

     

Chroma=n states13 dynamics13 across13 nine13 ENCODE cell types

bull Single annotation track for each cell type

bull Summarize cell-type activity at a glance bull Can study 9-cell activity pattern across Ernst et al Nature 2011

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Chroma=n States Defined Across 127 CellTissues Types

Roadmap Epigenomics Consortium et al Nature 2015 16 epigenomes from ENCODE 2

Chroma=n States13 Defined on Imputed DataMarks

ChromImpute methodErnst13 and Kellis13 Nature Biotech 2015

ChromHMMModels across Many Roadmap ENCODE13 Cell13 and Tissue Types13

127 CellTissue13 Types13

98 CellTissue13 Types13

H3K4me1 H3K4me1H3K4me3 H3K4me3H3K27me3 H3K27me3H3K9me3 H3K9me3H3K36me3 H3K36me3

H3K27ac

127 CellTissue13 Types13 H3K4me1 H3K9acH3K4me3 H4K20me1H3K27me3 H3K79me2H3K9me3 H3K4me2H3K36me3 H2AZH3K27ac DNase

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Note Different13 than track hubRoadmap Epigenomics Data13 Complete CollecMon at Wash U VizHub

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 2: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

     

Chroma=n Marks for Genome Annota=on100+ histone modifications Specificity in bull Histone protein bull Amino acid residue bull Chemical modification (eg

methyl acetylation) bull Number of occurrence of

the modifications

Examples H3K4me1 ndash Enhancers H3K4me3 ndash Promoters H3K27me3 ndash Repressive H3K9me3 ndash Repressive H3K36me3 ndash Transcribed

Histone Modifications can be Mapped Genome-wide with ChIP-seq

Image source httpnihroadmapnihgovepigenomics

From lsquochromaMn marksrsquo to lsquochromaMn statesrsquo

bull Learn de novosignificant13

Promoter states

Transcribed states combinaMonal andspaMal paOerns ofchromaMn marks

Active Intergenic

Repressed bull Reveal funcMonalelements evenwithout13 looking atsequence

bull Use13 for13 genome13 annotaMon

Ernst and Kellis Nat Biotech 2010

Our approach Mul=variate Hidden Markov Model(ChromHMM)13

Enhancer Gene Starts Gene -shy‐ Transcribed Region DNA

BinarizedchromaMnmarks Calledbased on apoissondistribuMon

Most13 likely HiddenState 2

H3K4me3 H3K36me3 H3K36me3 H3K36me3 H3K36me3 H3K4me1 H3K4me3 H3K4me1

H3K27ac H3K4me1

1 3 4 6 6 6 6 6 5 5 5

Unobserved

High Probability ChromaMn0713 Marks in State

0813 0813 200 base pair interval 1 4K27ac

3

2 0913

0913

0813 K4me1 H3K4me3

H3K4me3

H3K4me1 H3K4me1 All probabiliMesEmission distribution is a product of independent are learned from5Bernoulli random variables

0913 the data13

6H3K36me3

Binarization leads to explicit modeling of mark combinations and interpretable parameters 6

Ernst and Kellis Nat Biotech 2010 Ernst and Kellis Nature Methods 2012

ENCODE Study nine marks in nine human cell lines 9 marks 9 human13 cell13 types13 81 Chroma=n Mark13 TracksH3K4me1

H3K4me2

H3K4me3

H3K27ac

H3K9ac

H3K27me3

H4K20me1

H3K36me3

CTCF

+WCE

+RNA

HUVEC Umbilical vein endothelial

NHEK Keratinocytes

GM12878 Lymphoblastoid

K562 Myelogenous leukemia

HepG2 Liver carcinoma

NHLF Normal human lung fibroblast

HMEC Mammary epithelial cell

HSMM Skeletal muscle myoblasts

H1 Embryonic

x

Ernst et al Nature 2011

bull Learned jointly across cell types (virtual concatenation)

bull State definitions are common

bull State locations are dynamic

Brad Bernstein ENCODE Group HepG2 NHLF HMEC HSMM HUVEC NHEK GM12878 K562 H1

     

Chroma=n states13 dynamics13 across13 nine13 ENCODE cell types

bull Single annotation track for each cell type

bull Summarize cell-type activity at a glance bull Can study 9-cell activity pattern across Ernst et al Nature 2011

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Chroma=n States Defined Across 127 CellTissues Types

Roadmap Epigenomics Consortium et al Nature 2015 16 epigenomes from ENCODE 2

Chroma=n States13 Defined on Imputed DataMarks

ChromImpute methodErnst13 and Kellis13 Nature Biotech 2015

ChromHMMModels across Many Roadmap ENCODE13 Cell13 and Tissue Types13

127 CellTissue13 Types13

98 CellTissue13 Types13

H3K4me1 H3K4me1H3K4me3 H3K4me3H3K27me3 H3K27me3H3K9me3 H3K9me3H3K36me3 H3K36me3

H3K27ac

127 CellTissue13 Types13 H3K4me1 H3K9acH3K4me3 H4K20me1H3K27me3 H3K79me2H3K9me3 H3K4me2H3K36me3 H2AZH3K27ac DNase

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Note Different13 than track hubRoadmap Epigenomics Data13 Complete CollecMon at Wash U VizHub

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 3: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

     

Chroma=n Marks for Genome Annota=on100+ histone modifications Specificity in bull Histone protein bull Amino acid residue bull Chemical modification (eg

methyl acetylation) bull Number of occurrence of

the modifications

Examples H3K4me1 ndash Enhancers H3K4me3 ndash Promoters H3K27me3 ndash Repressive H3K9me3 ndash Repressive H3K36me3 ndash Transcribed

Histone Modifications can be Mapped Genome-wide with ChIP-seq

Image source httpnihroadmapnihgovepigenomics

From lsquochromaMn marksrsquo to lsquochromaMn statesrsquo

bull Learn de novosignificant13

Promoter states

Transcribed states combinaMonal andspaMal paOerns ofchromaMn marks

Active Intergenic

Repressed bull Reveal funcMonalelements evenwithout13 looking atsequence

bull Use13 for13 genome13 annotaMon

Ernst and Kellis Nat Biotech 2010

Our approach Mul=variate Hidden Markov Model(ChromHMM)13

Enhancer Gene Starts Gene -shy‐ Transcribed Region DNA

BinarizedchromaMnmarks Calledbased on apoissondistribuMon

Most13 likely HiddenState 2

H3K4me3 H3K36me3 H3K36me3 H3K36me3 H3K36me3 H3K4me1 H3K4me3 H3K4me1

H3K27ac H3K4me1

1 3 4 6 6 6 6 6 5 5 5

Unobserved

High Probability ChromaMn0713 Marks in State

0813 0813 200 base pair interval 1 4K27ac

3

2 0913

0913

0813 K4me1 H3K4me3

H3K4me3

H3K4me1 H3K4me1 All probabiliMesEmission distribution is a product of independent are learned from5Bernoulli random variables

0913 the data13

6H3K36me3

Binarization leads to explicit modeling of mark combinations and interpretable parameters 6

Ernst and Kellis Nat Biotech 2010 Ernst and Kellis Nature Methods 2012

ENCODE Study nine marks in nine human cell lines 9 marks 9 human13 cell13 types13 81 Chroma=n Mark13 TracksH3K4me1

H3K4me2

H3K4me3

H3K27ac

H3K9ac

H3K27me3

H4K20me1

H3K36me3

CTCF

+WCE

+RNA

HUVEC Umbilical vein endothelial

NHEK Keratinocytes

GM12878 Lymphoblastoid

K562 Myelogenous leukemia

HepG2 Liver carcinoma

NHLF Normal human lung fibroblast

HMEC Mammary epithelial cell

HSMM Skeletal muscle myoblasts

H1 Embryonic

x

Ernst et al Nature 2011

bull Learned jointly across cell types (virtual concatenation)

bull State definitions are common

bull State locations are dynamic

Brad Bernstein ENCODE Group HepG2 NHLF HMEC HSMM HUVEC NHEK GM12878 K562 H1

     

Chroma=n states13 dynamics13 across13 nine13 ENCODE cell types

bull Single annotation track for each cell type

bull Summarize cell-type activity at a glance bull Can study 9-cell activity pattern across Ernst et al Nature 2011

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Chroma=n States Defined Across 127 CellTissues Types

Roadmap Epigenomics Consortium et al Nature 2015 16 epigenomes from ENCODE 2

Chroma=n States13 Defined on Imputed DataMarks

ChromImpute methodErnst13 and Kellis13 Nature Biotech 2015

ChromHMMModels across Many Roadmap ENCODE13 Cell13 and Tissue Types13

127 CellTissue13 Types13

98 CellTissue13 Types13

H3K4me1 H3K4me1H3K4me3 H3K4me3H3K27me3 H3K27me3H3K9me3 H3K9me3H3K36me3 H3K36me3

H3K27ac

127 CellTissue13 Types13 H3K4me1 H3K9acH3K4me3 H4K20me1H3K27me3 H3K79me2H3K9me3 H3K4me2H3K36me3 H2AZH3K27ac DNase

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Note Different13 than track hubRoadmap Epigenomics Data13 Complete CollecMon at Wash U VizHub

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 4: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

     

Chroma=n Marks for Genome Annota=on100+ histone modifications Specificity in bull Histone protein bull Amino acid residue bull Chemical modification (eg

methyl acetylation) bull Number of occurrence of

the modifications

Examples H3K4me1 ndash Enhancers H3K4me3 ndash Promoters H3K27me3 ndash Repressive H3K9me3 ndash Repressive H3K36me3 ndash Transcribed

Histone Modifications can be Mapped Genome-wide with ChIP-seq

Image source httpnihroadmapnihgovepigenomics

From lsquochromaMn marksrsquo to lsquochromaMn statesrsquo

bull Learn de novosignificant13

Promoter states

Transcribed states combinaMonal andspaMal paOerns ofchromaMn marks

Active Intergenic

Repressed bull Reveal funcMonalelements evenwithout13 looking atsequence

bull Use13 for13 genome13 annotaMon

Ernst and Kellis Nat Biotech 2010

Our approach Mul=variate Hidden Markov Model(ChromHMM)13

Enhancer Gene Starts Gene -shy‐ Transcribed Region DNA

BinarizedchromaMnmarks Calledbased on apoissondistribuMon

Most13 likely HiddenState 2

H3K4me3 H3K36me3 H3K36me3 H3K36me3 H3K36me3 H3K4me1 H3K4me3 H3K4me1

H3K27ac H3K4me1

1 3 4 6 6 6 6 6 5 5 5

Unobserved

High Probability ChromaMn0713 Marks in State

0813 0813 200 base pair interval 1 4K27ac

3

2 0913

0913

0813 K4me1 H3K4me3

H3K4me3

H3K4me1 H3K4me1 All probabiliMesEmission distribution is a product of independent are learned from5Bernoulli random variables

0913 the data13

6H3K36me3

Binarization leads to explicit modeling of mark combinations and interpretable parameters 6

Ernst and Kellis Nat Biotech 2010 Ernst and Kellis Nature Methods 2012

ENCODE Study nine marks in nine human cell lines 9 marks 9 human13 cell13 types13 81 Chroma=n Mark13 TracksH3K4me1

H3K4me2

H3K4me3

H3K27ac

H3K9ac

H3K27me3

H4K20me1

H3K36me3

CTCF

+WCE

+RNA

HUVEC Umbilical vein endothelial

NHEK Keratinocytes

GM12878 Lymphoblastoid

K562 Myelogenous leukemia

HepG2 Liver carcinoma

NHLF Normal human lung fibroblast

HMEC Mammary epithelial cell

HSMM Skeletal muscle myoblasts

H1 Embryonic

x

Ernst et al Nature 2011

bull Learned jointly across cell types (virtual concatenation)

bull State definitions are common

bull State locations are dynamic

Brad Bernstein ENCODE Group HepG2 NHLF HMEC HSMM HUVEC NHEK GM12878 K562 H1

     

Chroma=n states13 dynamics13 across13 nine13 ENCODE cell types

bull Single annotation track for each cell type

bull Summarize cell-type activity at a glance bull Can study 9-cell activity pattern across Ernst et al Nature 2011

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Chroma=n States Defined Across 127 CellTissues Types

Roadmap Epigenomics Consortium et al Nature 2015 16 epigenomes from ENCODE 2

Chroma=n States13 Defined on Imputed DataMarks

ChromImpute methodErnst13 and Kellis13 Nature Biotech 2015

ChromHMMModels across Many Roadmap ENCODE13 Cell13 and Tissue Types13

127 CellTissue13 Types13

98 CellTissue13 Types13

H3K4me1 H3K4me1H3K4me3 H3K4me3H3K27me3 H3K27me3H3K9me3 H3K9me3H3K36me3 H3K36me3

H3K27ac

127 CellTissue13 Types13 H3K4me1 H3K9acH3K4me3 H4K20me1H3K27me3 H3K79me2H3K9me3 H3K4me2H3K36me3 H2AZH3K27ac DNase

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Note Different13 than track hubRoadmap Epigenomics Data13 Complete CollecMon at Wash U VizHub

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 5: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

From lsquochromaMn marksrsquo to lsquochromaMn statesrsquo

bull Learn de novosignificant13

Promoter states

Transcribed states combinaMonal andspaMal paOerns ofchromaMn marks

Active Intergenic

Repressed bull Reveal funcMonalelements evenwithout13 looking atsequence

bull Use13 for13 genome13 annotaMon

Ernst and Kellis Nat Biotech 2010

Our approach Mul=variate Hidden Markov Model(ChromHMM)13

Enhancer Gene Starts Gene -shy‐ Transcribed Region DNA

BinarizedchromaMnmarks Calledbased on apoissondistribuMon

Most13 likely HiddenState 2

H3K4me3 H3K36me3 H3K36me3 H3K36me3 H3K36me3 H3K4me1 H3K4me3 H3K4me1

H3K27ac H3K4me1

1 3 4 6 6 6 6 6 5 5 5

Unobserved

High Probability ChromaMn0713 Marks in State

0813 0813 200 base pair interval 1 4K27ac

3

2 0913

0913

0813 K4me1 H3K4me3

H3K4me3

H3K4me1 H3K4me1 All probabiliMesEmission distribution is a product of independent are learned from5Bernoulli random variables

0913 the data13

6H3K36me3

Binarization leads to explicit modeling of mark combinations and interpretable parameters 6

Ernst and Kellis Nat Biotech 2010 Ernst and Kellis Nature Methods 2012

ENCODE Study nine marks in nine human cell lines 9 marks 9 human13 cell13 types13 81 Chroma=n Mark13 TracksH3K4me1

H3K4me2

H3K4me3

H3K27ac

H3K9ac

H3K27me3

H4K20me1

H3K36me3

CTCF

+WCE

+RNA

HUVEC Umbilical vein endothelial

NHEK Keratinocytes

GM12878 Lymphoblastoid

K562 Myelogenous leukemia

HepG2 Liver carcinoma

NHLF Normal human lung fibroblast

HMEC Mammary epithelial cell

HSMM Skeletal muscle myoblasts

H1 Embryonic

x

Ernst et al Nature 2011

bull Learned jointly across cell types (virtual concatenation)

bull State definitions are common

bull State locations are dynamic

Brad Bernstein ENCODE Group HepG2 NHLF HMEC HSMM HUVEC NHEK GM12878 K562 H1

     

Chroma=n states13 dynamics13 across13 nine13 ENCODE cell types

bull Single annotation track for each cell type

bull Summarize cell-type activity at a glance bull Can study 9-cell activity pattern across Ernst et al Nature 2011

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Chroma=n States Defined Across 127 CellTissues Types

Roadmap Epigenomics Consortium et al Nature 2015 16 epigenomes from ENCODE 2

Chroma=n States13 Defined on Imputed DataMarks

ChromImpute methodErnst13 and Kellis13 Nature Biotech 2015

ChromHMMModels across Many Roadmap ENCODE13 Cell13 and Tissue Types13

127 CellTissue13 Types13

98 CellTissue13 Types13

H3K4me1 H3K4me1H3K4me3 H3K4me3H3K27me3 H3K27me3H3K9me3 H3K9me3H3K36me3 H3K36me3

H3K27ac

127 CellTissue13 Types13 H3K4me1 H3K9acH3K4me3 H4K20me1H3K27me3 H3K79me2H3K9me3 H3K4me2H3K36me3 H2AZH3K27ac DNase

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Note Different13 than track hubRoadmap Epigenomics Data13 Complete CollecMon at Wash U VizHub

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 6: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Our approach Mul=variate Hidden Markov Model(ChromHMM)13

Enhancer Gene Starts Gene -shy‐ Transcribed Region DNA

BinarizedchromaMnmarks Calledbased on apoissondistribuMon

Most13 likely HiddenState 2

H3K4me3 H3K36me3 H3K36me3 H3K36me3 H3K36me3 H3K4me1 H3K4me3 H3K4me1

H3K27ac H3K4me1

1 3 4 6 6 6 6 6 5 5 5

Unobserved

High Probability ChromaMn0713 Marks in State

0813 0813 200 base pair interval 1 4K27ac

3

2 0913

0913

0813 K4me1 H3K4me3

H3K4me3

H3K4me1 H3K4me1 All probabiliMesEmission distribution is a product of independent are learned from5Bernoulli random variables

0913 the data13

6H3K36me3

Binarization leads to explicit modeling of mark combinations and interpretable parameters 6

Ernst and Kellis Nat Biotech 2010 Ernst and Kellis Nature Methods 2012

ENCODE Study nine marks in nine human cell lines 9 marks 9 human13 cell13 types13 81 Chroma=n Mark13 TracksH3K4me1

H3K4me2

H3K4me3

H3K27ac

H3K9ac

H3K27me3

H4K20me1

H3K36me3

CTCF

+WCE

+RNA

HUVEC Umbilical vein endothelial

NHEK Keratinocytes

GM12878 Lymphoblastoid

K562 Myelogenous leukemia

HepG2 Liver carcinoma

NHLF Normal human lung fibroblast

HMEC Mammary epithelial cell

HSMM Skeletal muscle myoblasts

H1 Embryonic

x

Ernst et al Nature 2011

bull Learned jointly across cell types (virtual concatenation)

bull State definitions are common

bull State locations are dynamic

Brad Bernstein ENCODE Group HepG2 NHLF HMEC HSMM HUVEC NHEK GM12878 K562 H1

     

Chroma=n states13 dynamics13 across13 nine13 ENCODE cell types

bull Single annotation track for each cell type

bull Summarize cell-type activity at a glance bull Can study 9-cell activity pattern across Ernst et al Nature 2011

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Chroma=n States Defined Across 127 CellTissues Types

Roadmap Epigenomics Consortium et al Nature 2015 16 epigenomes from ENCODE 2

Chroma=n States13 Defined on Imputed DataMarks

ChromImpute methodErnst13 and Kellis13 Nature Biotech 2015

ChromHMMModels across Many Roadmap ENCODE13 Cell13 and Tissue Types13

127 CellTissue13 Types13

98 CellTissue13 Types13

H3K4me1 H3K4me1H3K4me3 H3K4me3H3K27me3 H3K27me3H3K9me3 H3K9me3H3K36me3 H3K36me3

H3K27ac

127 CellTissue13 Types13 H3K4me1 H3K9acH3K4me3 H4K20me1H3K27me3 H3K79me2H3K9me3 H3K4me2H3K36me3 H2AZH3K27ac DNase

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Note Different13 than track hubRoadmap Epigenomics Data13 Complete CollecMon at Wash U VizHub

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 7: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

ENCODE Study nine marks in nine human cell lines 9 marks 9 human13 cell13 types13 81 Chroma=n Mark13 TracksH3K4me1

H3K4me2

H3K4me3

H3K27ac

H3K9ac

H3K27me3

H4K20me1

H3K36me3

CTCF

+WCE

+RNA

HUVEC Umbilical vein endothelial

NHEK Keratinocytes

GM12878 Lymphoblastoid

K562 Myelogenous leukemia

HepG2 Liver carcinoma

NHLF Normal human lung fibroblast

HMEC Mammary epithelial cell

HSMM Skeletal muscle myoblasts

H1 Embryonic

x

Ernst et al Nature 2011

bull Learned jointly across cell types (virtual concatenation)

bull State definitions are common

bull State locations are dynamic

Brad Bernstein ENCODE Group HepG2 NHLF HMEC HSMM HUVEC NHEK GM12878 K562 H1

     

Chroma=n states13 dynamics13 across13 nine13 ENCODE cell types

bull Single annotation track for each cell type

bull Summarize cell-type activity at a glance bull Can study 9-cell activity pattern across Ernst et al Nature 2011

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Chroma=n States Defined Across 127 CellTissues Types

Roadmap Epigenomics Consortium et al Nature 2015 16 epigenomes from ENCODE 2

Chroma=n States13 Defined on Imputed DataMarks

ChromImpute methodErnst13 and Kellis13 Nature Biotech 2015

ChromHMMModels across Many Roadmap ENCODE13 Cell13 and Tissue Types13

127 CellTissue13 Types13

98 CellTissue13 Types13

H3K4me1 H3K4me1H3K4me3 H3K4me3H3K27me3 H3K27me3H3K9me3 H3K9me3H3K36me3 H3K36me3

H3K27ac

127 CellTissue13 Types13 H3K4me1 H3K9acH3K4me3 H4K20me1H3K27me3 H3K79me2H3K9me3 H3K4me2H3K36me3 H2AZH3K27ac DNase

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Note Different13 than track hubRoadmap Epigenomics Data13 Complete CollecMon at Wash U VizHub

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 8: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

     

Chroma=n states13 dynamics13 across13 nine13 ENCODE cell types

bull Single annotation track for each cell type

bull Summarize cell-type activity at a glance bull Can study 9-cell activity pattern across Ernst et al Nature 2011

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Chroma=n States Defined Across 127 CellTissues Types

Roadmap Epigenomics Consortium et al Nature 2015 16 epigenomes from ENCODE 2

Chroma=n States13 Defined on Imputed DataMarks

ChromImpute methodErnst13 and Kellis13 Nature Biotech 2015

ChromHMMModels across Many Roadmap ENCODE13 Cell13 and Tissue Types13

127 CellTissue13 Types13

98 CellTissue13 Types13

H3K4me1 H3K4me1H3K4me3 H3K4me3H3K27me3 H3K27me3H3K9me3 H3K9me3H3K36me3 H3K36me3

H3K27ac

127 CellTissue13 Types13 H3K4me1 H3K9acH3K4me3 H4K20me1H3K27me3 H3K79me2H3K9me3 H3K4me2H3K36me3 H2AZH3K27ac DNase

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Note Different13 than track hubRoadmap Epigenomics Data13 Complete CollecMon at Wash U VizHub

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 9: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Chroma=n States Defined Across 127 CellTissues Types

Roadmap Epigenomics Consortium et al Nature 2015 16 epigenomes from ENCODE 2

Chroma=n States13 Defined on Imputed DataMarks

ChromImpute methodErnst13 and Kellis13 Nature Biotech 2015

ChromHMMModels across Many Roadmap ENCODE13 Cell13 and Tissue Types13

127 CellTissue13 Types13

98 CellTissue13 Types13

H3K4me1 H3K4me1H3K4me3 H3K4me3H3K27me3 H3K27me3H3K9me3 H3K9me3H3K36me3 H3K36me3

H3K27ac

127 CellTissue13 Types13 H3K4me1 H3K9acH3K4me3 H4K20me1H3K27me3 H3K79me2H3K9me3 H3K4me2H3K36me3 H2AZH3K27ac DNase

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Note Different13 than track hubRoadmap Epigenomics Data13 Complete CollecMon at Wash U VizHub

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 10: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Chroma=n States Defined Across 127 CellTissues Types

Roadmap Epigenomics Consortium et al Nature 2015 16 epigenomes from ENCODE 2

Chroma=n States13 Defined on Imputed DataMarks

ChromImpute methodErnst13 and Kellis13 Nature Biotech 2015

ChromHMMModels across Many Roadmap ENCODE13 Cell13 and Tissue Types13

127 CellTissue13 Types13

98 CellTissue13 Types13

H3K4me1 H3K4me1H3K4me3 H3K4me3H3K27me3 H3K27me3H3K9me3 H3K9me3H3K36me3 H3K36me3

H3K27ac

127 CellTissue13 Types13 H3K4me1 H3K9acH3K4me3 H4K20me1H3K27me3 H3K79me2H3K9me3 H3K4me2H3K36me3 H2AZH3K27ac DNase

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Note Different13 than track hubRoadmap Epigenomics Data13 Complete CollecMon at Wash U VizHub

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 11: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Chroma=n States13 Defined on Imputed DataMarks

ChromImpute methodErnst13 and Kellis13 Nature Biotech 2015

ChromHMMModels across Many Roadmap ENCODE13 Cell13 and Tissue Types13

127 CellTissue13 Types13

98 CellTissue13 Types13

H3K4me1 H3K4me1H3K4me3 H3K4me3H3K27me3 H3K27me3H3K9me3 H3K9me3H3K36me3 H3K36me3

H3K27ac

127 CellTissue13 Types13 H3K4me1 H3K9acH3K4me3 H4K20me1H3K27me3 H3K79me2H3K9me3 H3K4me2H3K36me3 H2AZH3K27ac DNase

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Note Different13 than track hubRoadmap Epigenomics Data13 Complete CollecMon at Wash U VizHub

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 12: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

ChromHMMModels across Many Roadmap ENCODE13 Cell13 and Tissue Types13

127 CellTissue13 Types13

98 CellTissue13 Types13

H3K4me1 H3K4me1H3K4me3 H3K4me3H3K27me3 H3K27me3H3K9me3 H3K9me3H3K36me3 H3K36me3

H3K27ac

127 CellTissue13 Types13 H3K4me1 H3K9acH3K4me3 H4K20me1H3K27me3 H3K79me2H3K9me3 H3K4me2H3K36me3 H2AZH3K27ac DNase

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Note Different13 than track hubRoadmap Epigenomics Data13 Complete CollecMon at Wash U VizHub

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 13: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Note Different13 than track hubRoadmap Epigenomics Data13 Complete CollecMon at Wash U VizHub

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 14: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Roadmap Epigenomics IntegraMve Analysis Portal

hOpcompbiomiteduroadmap

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Note Different13 than track hubRoadmap Epigenomics Data13 Complete CollecMon at Wash U VizHub

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 15: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Note Different13 than track hubRoadmap Epigenomics Data13 Complete CollecMon at Wash U VizHub

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 16: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

hOpgenomeucscedu13

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Note Different13 than track hubRoadmap Epigenomics Data13 Complete CollecMon at Wash U VizHub

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 17: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Note Different13 than track hubRoadmap Epigenomics Data13 Complete CollecMon at Wash U VizHub

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 18: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 19: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 20: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 21: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Accessing Roadmap ChromHMM throughthe UCSC Genome Browser Track Hubs

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 22: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Human Epigenome Browser atWashington University

hOpepigenomegatewaywustledu

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 23: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

  

Talk Outline

bull ChromaMn states analysis and ChromHMM13 bull Accessing chromaMn state annotaMons forENCODE2 and Roadmap Epigenomics

bull Running the ChromHMM soware

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 24: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Soware download

ChromHMMWebsitehOpcompbiomiteduChromHMM13

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 25: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Soware manual

ChromHMMWebsitehOpcompbiomiteduChromHMM13

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 26: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

    

Try to Run ChromHMM on SampleData13 on Your Computer

(Java13 needs to already be installed)1 DownloadhOpcompbiomiteduChromHMMChromHMMzip2 Unzip ChromHMMzip3 Open a command line4 Change into the ChromHMM directory5 Enter the commandjava -mx1600M -jar ChromHMMjar LearnModel -p 0 SAMPLEDATA_HG18 OUTPUTSAMPLE 10 hg18

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 27: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

  

Input13 to ChromHMM

bull ChromHMMmodels are learned frombinarized data13 using its LearnModel command

bull Binarized data13 is typically obtained starMngfrom aligned readsndash Apply BinarizeBed if reads are in BED format13 ndash Apply BinarizeBam if reads are in BAM13 format

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 28: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

BinarizeBed

Java13 command lsquo-shy‐mx1600Mrsquo specifies memory to Java13

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 29: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

BinarizeBed

ChromHMM command

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 30: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

BinarizeBed

File with the chromosome lengths for the assembly

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 31: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

BinarizeBed

DIRECTORY of BED files

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 32: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

BinarizeBed

Cell-shy‐mark ndashfile table

cell mark cell-shy‐mark

Control data13 ndash is opMonal and canalso be treated as a mark

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 33: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

BinarizeBed

Output13 directory

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 34: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

LearnModel

lsquo-shy‐p 0rsquo Use as many processors as availablelsquo-shy‐p Nrsquo Use up to N processors (default13 N=1)

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 35: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

LearnModel

Directory with the Binarized Input13

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 36: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

LearnModel

Directory where the output13 goes

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 37: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

LearnModel

Number of states

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 38: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

LearnModel

Genome assembly

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 39: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

ChromHMM Report13

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 40: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

ChromHMM Report13

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 41: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Emission Parameters

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 42: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

TransiMon Parameters

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 43: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Model Parameter File

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 44: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

SegmentaMon File

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 45: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Browser Files13

hOpswwwbroadinsMtuteorgigv

Can load into browser UCSC Genome IGV

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 46: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Enrichments

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 47: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

PosiMonal Plots

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 48: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Enrichments for AddiMonal Cell Types

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 49: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Chroma=n states13 to interpret disease13 variants

bull Specific chromatin states enriched in GWAS catalog

Ernst and Kellis Nature Biotech 2010

bull Imputation based chromatin state used in dissection FTO loci

Claussnitzer et al NEJM 2015

bull Enhancers from different cell types enriched in different traits

Ernst et al Nature 2011

bull Interpreting epigenetic disease associated variation in Alzheimerrsquos disease

De Jager et al Nature Neuroscience 2014

bull Many other examples in the literature

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 50: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

Collaborators and Acknowledgements

bull Manolis Kellis

ENCODE consortium ndash Brad Bernstein production group

Roadmap Epigenomics consortium

Funding bull NHGRI NIH NSF HHMI Sloan Foundation

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 51: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

AddiMonal Commands

bull CompareModels ndash the command allows thecomparison of the emission parameters of a selectedmodel to a set13 of models in terms of correlaMon

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 52: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

AddiMonal Commands

bull MakeBrowserFiles ndash (re)generates browserfiles from segmentaMon files and allowsspecifying the coloring

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 53: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

AddiMonal Commands

bull OverlapEnrichment ndash (re)computesenrichments of a segmentaMon for a set13 ofannotaMons

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 54: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

AddiMonal Commands

bull NeighborhoodEnrichment ndash (re)computesenrichments of a segmentaMon around a set13 of anchor posiMons

AddiMonal Commands

bull Reorder ndash reorders the states of the model

Page 55: ChromHMM Tutorial - National Human Genome … Marks in State 200 ... ChromHMM Tutorial Author: Jason Ernst Subject: ENCODE/Roadmap Epigenomics Tutorial Keywords: ENCODE/Roadmap Epigenomics

AddiMonal Commands

bull Reorder ndash reorders the states of the model