The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the...

53
The Chromatin-Looping Factor ZNF143 is Genetically Altered and Promotes the Oestrogen Response in Breast Cancer by Aislinn Treloar A thesis submitted in conformity with the requirements for the degree of Master of Science Department of Medical Biophysics University of Toronto © Copyright by Aislinn Treloar 2016

Transcript of The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the...

Page 1: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

The Chromatin-Looping Factor ZNF143 is Genetically Altered and Promotes the Oestrogen Response in Breast

Cancer

by

Aislinn Treloar

A thesis submitted in conformity with the requirements for the degree of Master of Science Department of Medical Biophysics

University of Toronto

© Copyright by Aislinn Treloar 2016

Page 2: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

ii

The Chromatin-Looping Factor ZNF143 is Genetically Altered

and Promoters the Oestrogen Response in Breast Cancer

Aislinn Treloar

Master of Science

Department of Medical Biophysics University of Toronto

2016

Abstract

Oestrogen signalling in breast cancer (BrCa) cells relies on chromatin-loops that connect distal

regulatory elements bound by the oestrogen receptor 1 (ESR1) to target gene promoters. We

show that chromatin-looping factor genes, including CTCF, ZNF143 and RAD21, are

genetically altered in BrCa. Expanding on the function of CTCF and cohesin in BrCa, we

demonstrate that ZNF143 binds promoters of most early-response oestrogen target genes

connected to distal regulatory elements in ESR1-positive BrCa cells. Its chromatin occupancy is

unaffected by oestrogen-stimulation, supporting a stable three-dimensional genomic architecture

within the oestrogen response. Its loss abrogates the oestrogen-induced transcriptional response

and growth of BrCa cells. Furthermore, we show that the overexpression of looping-factors

within ESR-1 positive BrCa patients associates with a worse clinical outcome. Overall, our

results suggest that ZNF143 is a new critical effector of the oestrogen response and highlights

the contribution of the chromatin looping machinery to ESR1-positive BrCa development.

Page 3: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

iii

Acknowledgments

During the process of writing this thesis, I received help and support from many people. I would

like to express my gratitude to them all in this acknowledgment.

Foremost, I would like to thank my supervisor Dr. Mathieu Lupien for giving me the

opportunity to work on this project and who has been so generous with his time, knowledge and

encouragement throughout the completion of this thesis.

My sincerest gratitude goes to the current and past members of the Lupien lab who contributed

to my project. I thank Dr. Xue Wu and Dr. Xiaoyang Zhang for performing the ZNF143 ChIP-

sequencing experiment, Dr. Nadia Penrod for performing RNA-sequencing analysis using the

Cufflinks suite and both Dr. Swneke Bailey and Parisa Mazoorei for their guidance in learning

several forms of data-analysis. Thanks to everyone in the lab for their encouragement, assistance

and friendship during the pursuit of this degree.

I would like to thank the members of my supervisory committee, Dr. Brad Wouters and Dr.

Senthil Muthuswamy for their helpful suggestions on this project.

Finally, I would like to acknowledge my friends and family for their encouragement and support

along the way.

Page 4: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

iv

Table of Contents

ABSTRACT...........................................................................................................................................II

ACKNOWLEDGMENTS....................................................................................................................................III

TABLEOFCONTENTS.....................................................................................................................................IV

INTRODUCTION........................................................................................................................................11

METHODS.................................................................................................................................................52

RESULTS...................................................................................................................................................93

3.1 ZNF143BINDINGOCCURSPRINCIPALLYATPROMOTERS,INCLUDINGATTHEMAJORITYOFTHOSEOCCUPIEDBY

ESR1 9

3.2 ZNF143DIRECTLYREGULATESOESTROGEN-TARGETGENETRANSCRIPTIONINESR1-POSITIVEBREASTCANCER

CELLS 12

3.3 GENETICALTERATIONSINCHROMATINLOOPINGFACTORSAREFREQUENTLYOCCURRINGANDTENDTOWARDS

MUTUALEXCLUSIVITY.............................................................................................................................................15

3.4 DIFFERENTIALGENEEXPRESSIONOFTHECHROMATINLOOPINGMACHINERYISCLINICALLYRELEVANTANDRELATES

TOGENETICALTERATIONSINBREASTCANCER.............................................................................................................17

DISCUSSION&FUTUREDIRECTIONS.......................................................................................................204

5SUPPLEMENTARYTABLES&FIGURES........................................................................................................24

5.1SUPPLEMENTARYTABLES..................................................................................................................................24

SupplementaryTable1:E2-regulatedgenesinMCF-7cells........................................................................24

SupplementaryTable2:GSEAanalysisofgenesdown-regulateduponZNF143depletionunderE2

stimulation...................................................................................................................................................26

SupplementaryTable3:Geneticalterationstendtowardsmutualexclusivity(TCGA2012)......................27

SupplementaryTable4:Geneticalterationstendtowardsmutualexclusivity(TCGAProvisional

DataSet).......................................................................................................................................................28

SupplementaryTable5:FoldChangemRNAExpressionuponGeneticAlterationofChromatin

LoopingFactors(TCGAProvisionalDataSet)...............................................................................................29

SupplementaryTable6:Geneticalterationstendtowardsmutualexclusivityinlungsquamouscell

carcinoma(TCGA,Provisional).....................................................................................................................30

Page 5: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

v

SupplementaryTable7:Geneticalterationstendtowardsmutualexclusivityinbladder

urothelialcarcinoma(TCGA,2014)..............................................................................................................31

SupplementaryTable8:Geneticalterationstendtowardsmutualexclusivityinliverhepatocellular

carcinoma(TCGAProvisionalDataSet).......................................................................................................32

SupplementaryTable9:Primersusedinthisstudy.....................................................................................33

5.2SUPPLEMENTARYFIGURES................................................................................................................................34

REFERENCES...........................................................................................................................................405

Page 6: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

1

Introduction 1

Chromatin looping is involved in transcriptional regulation

Gene expression is tightly controlled by spatially and temporally organized transcriptional programs to

determine individual cell identity and function (Spitz and Eileen 2012). This is achieved through the

coordinated interplay between transcription factor-bound promoters and distal regulatory elements,

known as enhancers via long-range chromatin interactions that physically connect enhancers to their

target promoter regions (Lieberman-Aiden 2009; Fullwood et al. 2009; Sanyal et al. 2012; Jin et al.

2013; Heidari et al. 2014; Rao et al. 2014; Plank & Dean 2014; Kron et al. 2014). The vast majority of

long-range chromatin interactions are established during cellular differentiation and participate in the

coordinated activation of cell-type specific transcriptional programs; however, a small subset can form

upon response to external stimuli (Fraser et al. 2009; Sanyal et al. 2012; Jin et al. 2013; Rao et al.

2014).

Enhancer-promoter loops are mediated by chromatin-looping factors

The machinery regulating the formation of chromatin interactions consists of the DNA binding proteins

CCCTC-binding factor (CTCF) and Zinc Finger Protein 143 (ZNF143), and the non-DNA binding

cohesin complex, which must be recruited to the chromatin (Cubeñas-Potts & Corces 2015).

CTCF: CTCF is a ubiquitous and essential 11-zinc finger protein whose sequence is highly conserved

(Klenova et al. 1993; Filippova et al. 1996; Burcin et al. 1997; Fedoriw et al. 2004). It binds tens of

thousands of sites in the human genome (Chen et al. 2012), of which a small proportion are ulta-

conserved between mammalian species (Schmidt et al. 2012) while 50-60% show cell-type specificity

(Barski et al. 2007; Kim et al. 2007; Chen et al. 2008; Cuddapah et al. 2009). Although vertebrate

CTCF has traditionally been implicated as an insulator protein, recent reports support CTCF’s role as a

chromatin-looping factor. It regulates genome topology as a looping factor through two main

mechanisms. First, CTCF contributes to the partitioning the genome into regulatory blocks, or

topologically associated domains (TADs) (Nora et al. 2012; Phillips-Cremins et al. 2013; Rao et al.

2014; Lupianez et al. 2015; Guo et al. 2015; Barutcu et al. 2015; Ji et al. 2015). Chromatin interactions

Page 7: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

2

enrich within these TADs but are relatively rare across TAD boundaries (Dixon et al. 2012; Hou et al.

2012; Sofueva et al. 2013). These sites may thus restrict enhancer-promoter interactions and establish

functional domains of gene expression. In addition to this role in TAD boundaries, CTCF is also

implicated in the cell-type specific interactions that occur within TADs. Enhancer elements are

enriched for CTCF binding (Song et al. 2011; DeMare et al. 2013), indicating that a subset of CTCF

sites may be important in shaping cell-type specific transcriptional programs. A significant overlap

observed between cell-type specific CTCF binding sites and enhancer elements in the human genome

supports this (Barski et al. 2007). Interaction analysis in 3 cell lines indicates that distal fragments that

loop to promoter fragments are enriched for CTCF and active enhancer histone tail modifications

(Sanyal et al. 2012). Furthermore, interaction analyses focused on CTCF binding sites in several cell

types indicate that CTCF-bound fragments interact with gene promoters (Heidari et al. 2015; Guo et al.

2012) and that enhancer-promoter looping may require CTCF (Hirayama et al. 2012). These results

indicate that CTCF function may be context-dependent, either defining TAD boundaries, or targeting

cell-specific enhancers for looping events.

The cohesin complex: The cohesin complex is made up of several subunits, Rad21, SMC1A, SMC3

and either STAG1 or STAG2. Together, they form a ring-like structure approximately 40nm in

diameter that encircles DNA fibers (Losada 2014). The complex contributes to DNA replication, as it

promotes restart of replication forks that stall at regions that are difficult to replicate, has a role in

facilitating doubles-strand break repair by homologous recombination (Remeseiro et al. 2012;

Carretero et al. 2013) and is also involved in sister chromatid cohesion during mitosis (Uhlmann 2004).

As the cohesin complex lacks a DNA binding domain, 60-80% of cohesin binding sites are dependent

on CTCF for the complex’s recruitment to the chromatin (Wendt et al. 2008; Schmidt et al. 2010)

where it is lost upon siRNA mediated depletion of CTCF expression (Parelho et al. 2008; Wendt et al.

2008; Hou et al. 2010). CTCF and cohesin occupancy strongly correlate with long-range interactions

(Sanyal et al. 2012; Phillips-Cremins et al. 2013; Heidari et al. 2014) and the depletion of either of

these factors destabilizes chromatin interactions (Splinter et al. 2006; Hadjur et al. 2009; Hou et al.

2010; Kagey et al. 2010; Sofueva et al. 2013; Li et al. 2013; Phillips-Cremins et al. 2013; Zuin et al.

2014). Cohesin may also have a CTCF-independent role in chromatin looping as it occupies a subset of

active enhancer-promoter interactions that lack CTCF (Schmidt et al. 2010; Kagey et al. 2010; Phillips-

Cremins et al. 2013). These sites often harbour the Mediator complex, a transcription co-activator, and

Page 8: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

3

Nipb1, cohesin’s loading factor, in embryonic cells (Kagey et al. 2010); however the role for either of

these factors in loop formation has neither been fully characterized in these cells, nor suggested in other

cell types.

ZNF143: ZNF143 is a ubiquitously expressed 7 zinc-finger protein that has been described as a

vertebrate transcriptional activator involved in RNA pol II-dependent transcription (Myslinski et al.

1998; Faresse et al. 2012). Although ZNF143’s requirement for growth and development is not well

known in mammals, it is essential for normal development in zebrafish (Halbig et al. 2012). Analysis

of ZNF143 binding across 4 mammalian genomes suggests that ZNF143 constitutes one of the most

widespread transcription factor binding-sites in mammalian promoters (Myslinski et al. 2006), with

binding in approximately 2000 mammalian protein-coding genes. Its enrichment in chromatin

interaction anchors along with the cohesin complex and CTCF indicates that it may be involved in

chromatin looping (Rao et al.; Bailey et al. 2015; Heidari et al. 2015). Indeed, ZNF143 binding at gene

promoters relates directly to cell-specific interactions and the disruption of ZNF143 binding to the

chromatin through genetic variation decreases chromatin interaction frequency and diminishes target-

gene expression (Bailey et al. 2015).

Oestrogen signalling relies on chromatin-looping

Chromatin looping is a key mechanism in oestrogen receptor 1 (ESR1)-mediated transcriptional

regulation in breast tumours that express oestrogen (classified as “luminal”) (Pan et al. 2008; Fullwood

et al. 2009; Li et al. 2012; Zhang et al. 2010; Li et al. 2013). Accordingly, CTCF and the cohesin

complex have been implicated in the ESR1-mediated response of breast cancer cells to oestrogen in a

genome-wide manner. CTCF-bound sites involved in chromatin interactions demarcate the oestrogen

response of the TFF1 and TFF3 genes, with CTCF silencing preventing E2-upregulation of gene

expression. In addition, depletion of the cohesin complex subunit Rad21 abrogates the oestrogen

response by interfering with the formation of chromatin interactions at oestrogen-regulated genes (Li et

al. 2013) and by blocking growth of breast cancer cells (Schmidt et al. 2010). Both CTCF and the

cohesin complex occupy genomic regions where ESR1 is recruited following oestrogen stimulation

(Schmidt et al. 2010; Ross-Innes et al. 2011; Hah et al. 2013). These sites are primarily distal to

promoters, and the factors that target promoters for ESR1-mediated long-range chromatin interactions

Page 9: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

4

have yet to be elucidated. The characterization of ZNF143 in several cell types as a promoter-bound

looping factor suggests that it may provide for this integral component of ESR1 signalling.

In the present study we identify ZNF143 as a key regulator of ESR1-signalling that enriches at

oestrogen-responsive gene promoters involved in chromatin interactions. We show that it is required

for oestrogen-induced gene transcription and for the luminal breast cancer growth response to

oestrogen stimulation. We show that the chromatin interaction machinery, including CTCF, ZNF143

and the cohesin complex, is genetically altered in breast tumours and that the expression of these

factors is clinically relevant in luminal breast cancer patients.

Page 10: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

5

Methods 2

Cell Culture:

MCF-7 cells were cultured as previously described (Magnani et al. 2011). Briefly, cells were

maintained in DMEM (Life Technologies) supplemented with 10%FBS and 1% Penicillin

Streptomycin.

siRNA Transfection of MCF-7 breast cancer cells:

MCF-7 cells were maintained in phenol red free DMEM medium (Life Technologies) supplemented

with 10% heat-inactivated CDT-FBS, 1% Penicillin Streptomycin and 1mM Sodium Pyruvate prior to

transfection, as described previously (Lupien et al 2008). Following two days of oestrogen starvation,

cells were transfected with siZNF143 #1 (Ambion siRNA ID: s15192), siZNF143 #2 (Ambion siRNA

ID: s15194), or a negative scrambled control siRNA (Ambion cat: 4404020). Transfection was

performed using the RNAiMAX reagent (Invitrogen) according to the manufacturer’s instructions. For

cell proliferation assays, cell number or O.D. (562nm) was determined every 24hrs post E2-stimulation

(10µM 17β oestradiol). For expression and protein assays, RNA was extracted 3hrs following 10µM,

17β oestadiol stimulation.

RNA preparation/collection and real-time PCR:

Total RNA extraction was performed using the RNeasy Plus Mini Kit (Qiagen) according to the

manufacturer’s instructions. cDNA was synthesized using the iScript cDNA synthesis kit (Bio-Rad)

and real-time PCR was performed using the SensiFAST SYBR No-ROX master mix (BioLine) and

primers listed in Supplementary Table 9.

RNA-seq:

RNA-seq reads were mapped to the human reference genome (hg19) using TopHat2 with default

parameters (Kim et al. 2013). The Cufflinks suite was used to assemble transcripts, estimate transcript

abundance, and test for differential expression (Trapnell et al. 2010). Specifically, differential

expression was called using CuffDiff2 (Trapnell et al. 2013) with the following parameters: -u to

Page 11: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

6

correct for reads mapping to more than one location and –b to correct for fragment bias (Roberts et al.

2011).

TCGA and BCCRC Xenograft data for breast cancer genetic alterations:

The DNA copy number, RNA-sequencing and genetic alteration datasets of 482 TCGA published

(TCGA 2012), 962 TCGA provisional (2015) and 29 BCCRC Xenograft (Eirew et al. Nature 2014)

breast cancer samples used in this work were obtained from the CBio Cancer Genomics Portal (Cerami

et al. 2012; Gao et al 2013). Copy number of each gene was generated from the algorithm GISTIC

(Genomic Identification of Significant Targets in Cancer) (Beroukhim et al. 2007). Somatic mutation

data were obtained from whole-genome shotgun sequencing for the BCCRC Xenograft dataset (Eirew

et al. 2014) and exome-sequencing for the TCGA Published and Provisional datasets (TCGA 2012).

Figures were recreated using GraphPad Prism Version 5.00 for Mac OS X, (GraphPad Software, San

Diego California USA, www.graphpad.com). Datasets are available through CBioPortal

(http://www.cbioportal.org).

ChromHMM/Genomic distribution of binding:

The ChromHMM algorithm (Ernst et al. 2011) was used to train a multivariate Hidden Markov Model.

ChIP-Seq reads for eight histone modifications in E2-stimulated MCF-7 cells (H3K27ac, H3K9ac,

H3K4me3, H3K14ac, H3K4me1, H3K27me3, H3K9me3 and H3K36me3)

(http://genome.ucsc.edu/ENCODE/downloads.html) were transformed into binary values using a

default 200-bp bin size, and the LearnModel function was used to learn models and to assign each

genomic position to one of twelve chromatin states. Biological relevance for each state was assigned

based on previous studies (Ernst et al. 2011).

Page 12: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

7

ChIP-seq

ChIP assays were performed as previously described (Lupien et al. 2010). Briefly, five million cells

were crosslinked with 1% Formaldehyde and lysed. Chromatin was sonicated by biorupter and

immunoprecipitated with 4µg anti-ZNF143 (Novus Biologicals H00007702-M01), followed by reverse

crosslinking. DNA was extracted using MinElute PCR purification kit (Qiagen 28004), processed with

library construction using NEB ChIP-seq library prep reagent set (E6200S), and then sequenced by Hi-

Seq 2000. Sequencing reads were aligned to hg19 by Bowtie and ZNF143 binding peaks were called

by MACS2.0 (Zhang et al. 2008). All assays were conducted in duplicates. These were pooled to

improve peak detection as duplicates for each condition were highly correlated with one another (R=

0.844 vehicle; R=0.928 E2-stimulation) (Supplementary Figure 1).

Statistical Analysis:

P-value was determined using Student’s t-test with two-tail distribution. Kaplan-Meier analyses

(http://www.kmplot.com) were used to assay differences in overall survival (Györffy 2010). Prism and

Omnigraffle were used for figures.

Overlap Analysis and genome structure correction (GSC):

The intersections between the binding sites for ZNF143, ESR1 and ChIA-PET regions were performed

using the BEDTools software package (Quinlan et al. 2010). Overlapping binding sites were defined by

having at least one base pair in common. Genome structure correction (GSC) statistic (Birney et al.

2007; Bickel et al. 2011) was run to establish the significance of the overlap of ZNF143 with E2-

regulated promoters involved in loops. The software was run using region fraction = 0.2, sub-region

fraction –S = 0.4 and –bm as statistic test. All DNAse I Hypersensitive sites in E2-stimulated (10nM

E2, 45min) MCF-7 cells were used as the null list (He et al. 2012).

Gene Set Enrichment Analysis

Biological pathways disrupted by ZNF143 depletion in MCF-7 cells were investigated by performing a

gene set enrichment analysis (GSEA), software that is available from the Broad Institute (Subramanian

Page 13: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

8

et al. 2005; Mootha et al. 2003). The hallmark gene sets were used for enrichment analysis

(www.broadinstitute.org/gsea).

Page 14: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

9

Results 3

3.1 ZNF143 binding occurs principally at promoters, including at the majority of those occupied by ESR1

The role for CTCF and the cohesin complex in ESR1 signalling has been established in breast cancer

cells (Liu and Cheung 2014). However, ZNF143’s contribution to the progression of this cancer type is

unknown. We therefore determined the genome-wide chromatin binding profile of ZNF143 in ESR1-

positive breast cancer cells through chromatin immunoprecipitation (ChIP) followed by massively

parallel sequencing (ChIP-seq) in MCF-7 cells. This was performed before (vehicle control) and after

stimulation with 17β-oestradiol (E2), a potent form of oestrogen (Supplementary Figure 1). We

identified a total of 76,802 and 73,798 high-confidence (p≤1.0x10-5) ZNF143 chromatin-binding sites

in vehicle and E2-stimulated conditions, respectively. We observe a strong correlation (R = 0.809) in

ZNF143’s binding affinity for the chromatin under these two conditions (Figure 1A, upper panel),

suggesting that ZNF143 is bound to the chromatin prior to E2 stimulation in ESR1-positive breast

cancer cells and that ZNF143 remains stably bound at these sites following hormonal stimulation. The

ZNF143 binding regions in MCF-7 breast cancer cells employed for all subsequent analysis were those

identified using the merged sequencing files.

To determine the genomic distribution of ZNF143 binding events, we first defined chromatin states in

MCF-7 cells based on the ChromHMM algorithm (Ernst and Kellis 2012) using the ChIP-Seq data for

Histone 3 Lysine 27 acetylation (H3K27ac), H3K9ac, H3K14ac, Histone 3 Lysine 4 trimethylation

(H3K4me3), H3K4me1, H3K27me3, H3K9me3 and H3K36me3 generated in E2-stimulated MCF-7

cells (Joseph et al. 2010; Magnani et al. 2013; Li et al. 2013). We applied a 12-state chromatin model

to segment the genome and then grouped predicted functional elements as promoters, enhancers,

transcribed, repressed, CTCF or no signal regions (Supplementary Figure 2). In agreement with reports

revealing a strong enrichment for ZNF143 binding at promoters in other cell types (Bailey et al. 2015),

we find 38,320 (43%) ZNF143 bound sites at promoters (Figure 1A, lower panel). An additional

16,232 sites (18%) map to enhancers, 3,173 sites (3.6%) to transcribed regions, 2,073 sites (2.3%) to

Page 15: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

10

repressed regions, 7,410 (8.3%) to CTCF regions, and the remaining 22,486 sites (26%) fall in regions

with no signal in our ChromHMM segmentation model (Figure 1A, lower panel). The bias of ZNF143

binding at promoters is further highlighted by its significantly increased binding intensity at promoters

over other genomic regions (Figure 1B).

ESR1 recruitment to the chromatin following E2-stimulation in MCF-7 cells has previously been

reported to occur primarily away from promoters (Carroll et al. 2006; Lin et al. 2007; Welboren et al.

2009). These assessments were performed based on the position of annotated genes across the

reference human genome as opposed to chromatin states. Indeed, using this approach, we map less than

5% (1,168) ESR1 binding sites to promoters (Figure 1C, upper panel). However, the proportion of

ESR1-bound promoters increases to 27.4% (6,406 sites) when using chromatin state to define genomic

elements in MCF-7 cells (Figure 1C, lower panel). ESR1 binding is still predominantly (33.8%; 7,907

sites) found at enhancers (Figure 1C) and further assessment of ESR1 called peaks indicates that only

10% of binding sites are within ±2.5kb from the TSS of coding genes; this is in line with reports of a

subset of ESR1 binding sites that are called in proximity of promoters when co-localized with CTCF

(Ross-Innes et al. 2011). The majority of ESR1 sites that are called in promoter regions are so far un-

annotated and do not fall within ±2.5kb of TSS of lncRNAs or coding genes.

Comparing ZNF143 and ESR1 binding profiles reveals over 8,671 shared binding sites (Supplementary

Figure 3). This translates into 72% of ESR1-bound promoter regions occupied by ZNF143 prior to and

after E2-stimulation (4,782 of 6,647 ESR1 promoter bound regions)(Figure 1D). Over 36% of ESR1-

bound enhancers are also occupied by ZNF143 binding in MCF-7 cells but transcribed, CTCF and

repressed ESR1-bound regions show minimal overlap with ZNF143 binding sites (15%, 4% and 18%

of ESR1 sites, respectively) (Figure 1D). We find that ESR1 binding is significantly stronger at

enhancer regions than promoters occupied by ZNF143 (p=1x10-3). In contrast, ZNF143 binds the

ESR1–bound promoters with greater affinity (p<1x10-3) compared to enhancers (Figure 1E). These

results are in line with primary ESR1 binding occurring at enhancers, while primary ZNF143 binding

occurs at promoters.

Page 16: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

11

Figure 1

Figure 1: Genome-Wide Binding of ZNF143 Suggests a Role in E2-Induced ESR1 Recruitment to Promoter Elements

Scatterplot comparing read count for ZNF143 binding in MCF-7 breast cancer cells treated with vehicle or E2 (10µM) for 45 minutes. Scatterplot was generated with SeqMonk (http:/www.bioinformatics.babraham.ac.uk/projects/seqmonk/) from ZNF143 ChIP-sequencing data. R, Pearson’s correlation, was calculated from all data points (Upper Panel). Genomic distribution of ZNF143 binding events in MCF-7 cells. Chromatin states were annotated by the ChromHMM algorithm (Supplemental Figure S2) (Ernst et al. 2012) (Lower Panel) (A) . Boxplots representing the signal intensity (MACS score) for ZNF143 at promoter, enhancer, transcribed, CTCF ad no-signal regions of the genome. P-value was calculated using the Mann-Whitney test (B) . Global genomic distribution of ESR1 binding events called in MCF-7 cells under E2 stimulation. Genomic elements were annotated by the Cis-Regulatory Element Annotation System (CEAS) web application (cistrome.org/ap) or by the ChromHMM algorithm (Supplemental Figure S2 (C) . Histogram illustrating the proportion of ESR1-bound sites that are occupied by ZNF143 at promoters, enhaners, transcribed, repressed and CTCF regions(D) . Boxplots showing the signal intensity of ZNF143 and ESR1 at shared binding sites called in promoters or enhancers, as defined by ChromHMM. P-values were calculated using the Mann-Whitney test (E). (*p<0.05; ** p<0.01; *** p<0.001)

Page 17: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

12

3.2 ZNF143 directly regulates oestrogen-target gene transcription in ESR1-positive breast cancer cells

To determine how ZNF143 occupancy relates to chromatin interactions at E2-regulated promoters, we

first defined E2-responsive genes as genes whose expression, measured by RNA-sequencing (RNA-

seq), is significantly altered following E2-stimulation for three hours (FC > 1.5; p=0.05). This

identified 194 E2-regulated genes (Supplementary Table 1). We then mined the RNA Polymerase II

(RNA Pol II) Chromatin Interaction Analysis by Paired-End Tag Sequencing (ChIA-PET) dataset (Li

et al. 2012) to identify E2-upregulated gene promoters that form chromatin interactions. A total of 223

promoters ascribed to the 194 E2-regulated genes were assessed in this analysis. We identified 137

(61%) E2-upregulated genes with at least one chromatin interaction anchored at the promoter (±2.5

kilobases (kb) from the Transcription Start Site (TSS)) (Figure 2A, upper panel). ZNF143 binds to 110

(80%) of the 137 E2-regulated gene promoters, a significantly higher proportion (P<1.0x10-4) than

expected by chance (Figure 2A, lower panel). These results indicate ZNF143 binding is enriched at E2-

regulated gene promoters involved in chromatin interactions, supporting a direct contribution of

ZNF143 to chromatin interactions associated with the E2-response in breast cancer.

We then further delineated the requirement for ZNF143 in the E2 response, specifically in the

regulation of E2 target gene expression. We performed RNA-seq following E2-stimulation in MCF-7

cells depleted of ZNF143 using siRNAs. We found that 91(47%) of the 194 E2-regulated genes were

no longer responsive to E2-stimulation following ZNF143 depletion (Figure 2B-C). Among the genes

whose expression remained responsive to E2-stimulation, 32 genes (16%) showed significantly reduced

expression (p<0.05) following ZNF143 depletion compared to control cells (Figure 2B-C). The impact

of ZNF143 depletion on E2-target gene regulation relates to ZNF143 binding intensity at promoters;

we found that genes unaffected by the loss of ZNF143 (No Change gene category) showed weak

ZNF143 binding at their promoters whereas genes that showed reduced expression or loss of induction

to E2-stimulation following ZNF143 depletion harboured significantly stronger ZNF143 binding at

their promoters in control conditions (p= 4.0x10-3; p=7.1x10-3) (Figure 2D). These findings suggest that

the genes that are unaffected by the loss of ZNF143 do not require ZNF143 at their promoters to

Page 18: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

13

regulate expression whereas the genes that exhibit either loss of induction or reduced expression when

ZNF143 is lost do rely on ZNF143 occupancy at their promoters. We further interrogated patterns of

mRNA expression in MCF-7 cells depleted of ZNF143 by performing Gene Set Enrichment Analysis

(GSEA) for genes whose expression was significantly down-regulated upon ZNF143 depletion (FC

>1.5, p <0.05) under E2-stimulated conditions. This revealed enrichment for early and late oestrogen

stimulation gene sets (p= 7.21x10-15 & p= 3.92x10-9) (Supplementary Table 2).

MCF-7 cells rely on E2-stimulation to activate the ESR1-regulated gene expression program, which

mediates their re-entry into cell cycle and drives proliferation. We therefore assessed the requirement

for ZNF143 in the E2-induced growth of MCF-7 luminal breast cancer cells following its siRNA-based

depletion (Figure 2E). ZNF143 depletion significantly impaired growth of MCF-7 cells under E2-

stimulation (Figure 2F), supporting its central role in the oestrogen response. Overall, these results

show that ZNF143 is required for the ESR1-mediated transcriptional response and for the E2-induced

growth response of luminal breast cancer cells.

Page 19: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

14

Figure 2

Figure 2: ZNF143 is Required for the E2-Induced Transcriptional Response

Proportion of E2-responsive gene promoters (±2.5kb from TSS) that are involved in chromatin interactions associated with RNA polymerase II in E2-stimulated MCF-7 cells (upper panel). Scatterplot illustrating the proportion of these promoters that exhibit ZNF143 binding (lower panel) (A). Volcano plot showing changes in gene expression of E2-responsive genes upon E2 stimulation in cells depleted of ZNF143 (pink) or transfected with scrambled siRNA (green).Each circle represents one gene. The log fold change in gene expression in E2-stimulated versus vehicle treated conditions is represented on the x-axis. The y-axis shows the -log10 of the pvalue. A p-value of 0.05 and a fold change of 1.5 are indicated by grey lines. Quadrants 1-6 are indicated (B). Case examples of genes that fall within quadrants 1 and 5 that illustrate gene expression changes under vehicle and E2-stimulated conditions. (RPKM: reads per kilobase per million). P-value calculated using student’s t-test. (C). Boxplots illustrating signal intensity of ZNF143 at promoters of genes that show no change, reduced expression or loss of E2-induction upon ZNF143 depletion. P-value calculated using Kruskal-Wallis test (D). ZNF143 depletion via siRNA significantly reduces its mRNA levels 48hrs post-transfection. P-value calculated using student’s unpaired t-test (E). MCF-7 breast cancer cells depleted of ZNF143 fail to grow in response to E2 stimulation compared to control cells. P-value calculated using student’s unpaired t-test (F) (*p<0.05; ** p<0.01; *** p<0.001; NS = Not Significant)

-2 0 2 4 60

1

2

3

4

5si-ScrsiZNF143

mRNA Fold Change: E2 / Vehicle Treated (Log2 transformed)

-Log

10 (p

-Val

)

1 2 3

4 5 6

43% genes lose E2-induction

67% genes lose E2-repression

siScr

(Veh

)

siScr

(E2)

siZNF14

3-1 (

Veh)

siZNF14

3-1 (

E2)

siZNF14

3-2 (

Veh)

siZNF14

3-2 (

E2)0.0

0.5

1.0

1.5

NS

««

«« ««

ZNF1

43 m

RN

A L

evel

(Nor

mal

ized

to A

ctin

)

0 1 2 3 40

2

4

6

si-Scr (Veh)si-Scr (E2)siZNF143-1 (Veh)siZNF143-1 (E2)siZNF143-2 (Veh)siZNF143-2 (E2)

«NS

«« ««

Time (Days)

Gro

wth

(5

20nm

abs

orba

nce

rela

tive

to D

ay 0

)

si-Scr

siZNF14

30

5

10

15

FC = 1.3NS

FC = 1.5!!

TET2

FC = 1.2NS

FC = 1.4!!!

VehicleE2

FPK

M

A B

si-Scr

siZNF14

30

20

40

60

MYC

FC = 2.5!!!

FC = 2.6!!!

FC = 1.0NS

FC = 1.1NS

VehicleE2

FPK

M

C

si-Scr

siZNF14

30

200

400

600

TFF1

FC = 2.2!!!

FC = 2.0!!!

FC = 1.8!!

FC = 1.7!!!

VehicleE2

FPK

M

No Cha

nge

Reduc

ed E

xpre

ssion

Loss

of In

ducti

on

0

10

20

100200300400 !!! NS

!!!

ZNF1

43 S

igna

l Int

ensi

ty (M

AC

S s

core

)

DNo Change

(37% of genes)Reduced Expression

(16% of genes)

Loss of Induction (47% of genes)

Quadrants 1 & 3 Quadrants 2,4,5 & 6

E F

E2-Regulated Gene Promoters

61%$

Pol2$ChIA-PET$

ZNF143 Null List0

20

40

60

80

100

«««

Pro

porti

on o

f Bou

nd,

Loop

ing

E2-

regu

late

d P

rom

oter

s

ZNf143 binding

Figure 2: ZNF143 is Required for the E2-Induced Transcriptional Response

Proportion of E2-responsive gene promoters (±2.5kb from TSS) that are involved in chromatininteractions associated with RNA polymerase II in E2-stimulated MCF-7 cells (upper panel).Scatterplot illustrating the proportion of these promoters that exhibit ZNF143 binding (lowerpanel) (A). Volcano plot showing changes in gene expression of E2-responsive genes upon E2stimulation in cells depleted of ZNF143 (pink) or transfected with scrambled siRNA (green).Each circle represents one gene. The log fold change in gene expression in E2-stimulated versusvehicle treated conditions is represented on the x-axis. The y-axis shows the -log10 of the pvalue.A p-value of 0.05 and a fold change of 1.5 are indicated by grey lines. Quadrants 1-6 areindicated (B). Case examples of genes that fall within quadrants 1 and 5 that illustrate geneexpression changes under vehicle and E2-stimulated conditions. (RPKM: reads per kilobase permillion). P-value calculated using student’s t-test. (C). Boxplots illustrating signal intensity of ZNF143 at promoters of genes that show no change, reduced expression or loss of E2-induction upon ZNF143 depletion. P-value calculated using Kruskal-Wallis test (D). ZNF143 depletion via siRNA significantly reduces its mRNA levels 48hrs post-transfection. P-value calculated using student’s unpaired t-test (E). MCF-7 breast cancer cells depleted of ZNF143 fail to grow in response to E2 stimulation compared to control cells. P-value calculated using student’s unpaired t-test (F) (*p<0.05; ** p<0.01; *** p<0.001; NS = Not Significant)

Page 20: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

15

3.3 Genetic alterations in chromatin looping factors are frequently occurring and tend towards mutual exclusivity

Pan-cancer analyses led by The Cancer Genome Atlas (TCGA) have revealed that CTCF is

significantly mutated in several cancers, including breast (Lawrence et al. 2014). Having established a

role for ZNF143 in ESR1 signalling in luminal breast cancer cells, these reports prompted us to assess

the extent of genetic alterations to the known chromatin looping factors in breast cancer. We

investigated the frequency of genetic alterations in CTCF, ZNF143 and the cohesin complex subunits

(Rad21, STAG1, STAG2, SMCA1 and SMC3) in the published and provisional breast cancer datasets

from TCGA as well as in the Primary Derived Xenograft (PDX) populations from the British Columbia

Cancer Research Centre (BCCRC) (TCGA 2012; Eirew et al. 2014). Collectively, the chromatin-

looping factors harbour genetic alterations (amplification, deletion or somatic mutation) in at least 22%

of primary breast tumours and up to 65% of PDXs derived from primary and metastatic breast tumours

(Figure 3A-B). Although we observe a trend towards mutual exclusivity between genetic alterations in

the chromatin-looping factors across samples from the TCGA provisional and published datasets, 3

gene pairs do show significant co-occurrence in the TCGA provisional dataset (Figure 1C,

Supplementary Table 3-4).

Finally, we assessed the relevance for these genetic alterations to breast cancer subtype as defined by

the PAM50 gene expression classifier (Parker et al. 2009) in the TCGA published and provisional data

sets. Although genetic alterations to ZNF143 and CTCF seem to enrich in luminal cancers (3/3 and

15/18, respectively) in the published data set, this observation is not reproduced in the provisional

dataset; genetic alterations in each of the chromatin looping factors are found in the other breast cancer

subtypes. These results suggest that in addition to the chromatin looping machinery being altered in

luminal cancers, where it is contributes to ESR1 signalling, the dysregulation of these factors may also

contribute to disease in the other breast cancer subtypes.

Page 21: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

16

Figure 3

Figure 3: Chromatin-Looping Factors are Genetically Altered in Breast Cancer

Copy number alterations and somatic mutations in CTCF, ZNF143 and five members of the cohesin complex were analyzed in three separate breast cancer datasets. DNA amplification, deletion, mutation or multiple alterations are indicated (A). Overall frequency of genetic alterations in the chromatin looping factors specified in A, in three breast cancer data sets (B). For each breast cancer data set, the proportion of samples with genetic alterations in a single factor versus in multiple factors is indicated by the strength of colour (C). Breast cancer subtype of tumours harbouring a genetic alteration in a chromatin looping factor; subtype determined by gene expression of 50 genes (PAM50) (D). (*p<0.05; ** p<0.01; *** p<0.001)

ZNF143CTCF

Rad21

STAG1

STAG2

SMC1ASMC3

0

20

40

60

AmplificationMutationDeletionMultiple Alterations

Alte

ratio

n Fr

eque

ncy

(%)

TCGA Pub

TCGA Prov

BCCRC Xen

ograf

t0

20

40

60

80

n = 29

n = 962

n = 482

Ove

rall A

ltera

tion

Freq

uenc

y (%

)

A

BCCRC Xenograft

B

ZNF143CTCF

Rad21

STAG1

STAG2

SMC1ASMC3

0

5

10

15

20

25

AmplificationMutationDeletionMultiple Alterations

Alte

ratio

n Fr

eque

ncy

(%)

ZNF143CTCF

Rad21

STAG1

STAG2

SMC1ASMC3

0

5

10

15

20

AmplificationMutationDeletionMultiple Alterations

Alte

ratio

n Fr

eque

ncy

(%)

TCGA Prov

TCGA PubD

C

★★

★★★

★★★

★★

Co-occurence

ZNF143 (

n=3)

CTCF (n=1

8)

RAD21 (n

=67)

STAG1 (n=

8)

STAG2 (n=

6)

SMC1A (n

=4)

SMC3 (n=

1)

All Sam

ples (

n=48

2)0

20

40

60

80

100

LuminalHER2-EnrichedBasal-LikeNormal-Like

PAM50 (TCGA Published Data Set)

Tum

ours

(%)

ZNF143 (

n=13

)

CTCF (n=2

9)

RAD21 (n

=195

)

STAG1 (n=

64)

STAG2 (n=

18)

SMC1A (n

=21)

SMC3 (n=

12)

All Sam

ples (

n=96

3)0

20

40

60

80

100

LuminalHER2-EnrichedBasal-LikeNormal-LikeNA

PAM50 (TCGA Provisional Data Set)

Tum

ours

(%)

LuminalHER2-EnrichedBasal-LikeNormal-LikeNA

Copy number alterations and somatic mutations in CTCF, ZNF143 and five members of the

cohesin complex were analyzed in three separate breast cancer datasets. DNA amplification,

deletion, mutation or multiple alterations are indicated (A). Overall frequency of genetic

alterations in the chromatin looping factors specified in A, in three breast cancer data sets (B). For

each breast cancer data set, the proportion of samples with genetic alterations in a single factor

versus in multiple factors is indicated by the strength of colour (C). Breast cancer subtype of

tumours harbouring a genetic alteration in a chromatin looping factor; subtype determined by

gene expression of 50 genes (PAM50) (D). (*p<0.05; ** p<0.01; *** p<0.001)

Figure 3: Chromatin-Looping Factors are Genetically Altered in Breast Cancer

Page 22: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

17

3.4 Differential gene expression of the chromatin looping machinery is clinically relevant and relates to genetic alterations in breast cancer

We determined the relevance of genetic alterations in chromatin looping factors to their expression by

segregating breast tumour samples based on the GISTIC-based copy-number alteration score (Mermel

et al. 2011). We observe a significant decrease in ZNF143 expression in tumours harbouring

heterozygous and homozygous deletions compared to diploid cases (p=2.0x10-3 and p<1.0x10-3

respectively) (Figure 4A), while copy number gains correlate with significantly increased ZNF143

expression (p<1.0x10-3). Similar results were obtained for all other chromatin-looping factors with the

exception of STAG2 (Figure 4A; Supplementary Table 5). These results suggest that that copy number

variations in the chromatin looping factors directly impacts their expression.

To address the clinical relevance of differential expression of looping factors in luminal breast cancer

we performed Kaplan-Meier analysis using the METABRIC dataset consisting of close to 2,000

expression profiles from independent, clinically annotated breast cancer samples (Curtis et al. 2012).

Using the KMplot tool (http://kmplot.com/private/) (Györffy et al. 2010) we segregated samples based

on low versus high levels for each chromatin looping factor. Kaplan-Meier curves focused on overall

survival reveal that ESR1-positive breast cancer patients with elevated ZNF143 expression do worse

than those with low expression levels (p=1.1x10-3) (Figure 4B). This observation is valid across all

ESR1-positive breast cancer patients as well as within luminal A or B subtypes (p=4.3x10-2 &

p=7.0x10-4, respectively)(Figure 4B). This suggests that ZNF143 expression does not simply

discriminate luminal A from Luminal B cancers, but that high expression correlates with more

aggressive disease within each subtype. The association of elevated gene expression with more

aggressive breast cancers is also observed for other chromatin-looping factors. For instance, the overall

survival of breast cancer patients whose tumours expressed high levels of CTCF or the cohesin subunit

Rad21 is worse than for patients whose tumours express these factors at lower levels (p=6.3x10-6 &

p=1.3x10-3, respectively across all ESR1-positive breast cancer patients). CTCF expression

discriminates poor outcome within both Luminal A and Luminal B subtypes (p=4.3x10-4 & p=7.7x10-4,

respectively) (Figure 4B), while RAD21’s expression is only predictive in the more aggressive Luminal

B subtype (p=0.027) (Supplementary Figure 5). These results suggest that the expression levels of

Page 23: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

18

these chromatin-looping factors are relevant to the clinical outcome of ESR1-positive breast cancers

patients.

Page 24: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

19

Figure 4

Figure 4: Chromatin Looping Factor Expression is affected by Genetic Alterations and is Clinically Relevant in Luminal Breast Cancer

Box-and-whisker plots showing mRNA expression for the chromatin looping factors assessed in A-C that have altered copy number status, as determined from GISTIC. Gene mutation status and mRNA expression were analyzed using publically available data obtained through the cBioPortal for Cancer Genomics. P-value is calculated using Mann-Whitney test (*p<0.05; ** p <0.01; *** p <0.001) (A). . Kaplan-Meier plots derived from the METABRIC data set (Curtis et al. 2012) evaluating overall survival in ESR1-positive breast cancer patients (n=1486), Luminal A patients (n=825) and Luminal B (n=668) patients, stratified by ZNF143 or CTCF expression. Data were obtained from the Kaplan-Meier plotter breast cancer survival analysis database (Györffy 2010). Hazard ratios (HR) and logrank P-values are displayed (B).

ZNF143

0.0

0.2

0.4

0.6

0.8

1.0

Expressionlowhigh

HR = 1.4 (1.1 − 1.7)logrank P = 0.0011

Prob

abilit

y0.

00.

20.

40.

60.

81.

0

Time (years)0 5 10 15 20 25

HR = 1.4P-val = 0.0011

Expressionlowhigh

0.0

0.2

0.4

0.6

0.8

1.0

Expressionlowhigh

HR = 1.4 (1.0 − 1.9)logrank P = 0.043

Prob

abilit

y0.

00.

20.

40.

60.

81.

0

Time (years)0 5 10 15 20 25

HR = 1.4P-val = 0.043

Expressionlowhigh

0.0

0.2

0.4

0.6

0.8

1.0

Expressionlowhigh

HR = 1.5 (1.2 − 1.9)logrank P = 0.00077

Prob

abilit

y0.

00.

20.

40.

60.

81.

0

Time (years)0 5 10 15 20 25

HR = 1.5P-val = 0.00077

Expressionlowhigh

0.0

0.2

0.4

0.6

0.8

1.0

Expressionlowhigh

HR = 1.6 (1.3 − 2.0)logrank P = 6.3e−06

Prob

abilit

y0.

00.

20.

40.

60.

81.

0

Time (years)0 5 10 15 20 25

HR = 1.6P-val = 6.3e-6

Expressionlowhigh

0.0

0.2

0.4

0.6

0.8

1.0

Expressionlowhigh

HR = 1.6 (1.2 − 2.2)logrank P = 0.0043

Prob

abilit

y0.

00.

20.

40.

60.

81.

0

Time (years)0 5 10 15 20 25

HR = 1.6P-val = 0.0043

Expressionlowhigh

0.0

0.2

0.4

0.6

0.8

1.0

Expressionlowhigh

HR = 1.5 (1.2 − 1.9)logrank P = 0.0013

Prob

abilit

y0.

00.

20.

40.

60.

81.

0

Time (years)0 5 10 15 20 25

HR = 1.5P-val = 0.013

Expressionlowhigh

CTCF

A

B Luminal BLuminal AERα Positive

Box-and-whisker plots showing mRNA expression for the chromatin looping factors assessed in

A-C that have altered copy number status, as determined from GISTIC. Gene mutation status and

mRNA expression were analyzed using publically available data obtained through the cBioPortal

for Cancer Genomics. P-value is calculated using Mann-Whitney test (*p<0.05; ** p <0.01; *** p

<0.001) (A). . Kaplan-Meier plots derived from the METABRIC data set (Curtis et al. 2012)

evaluating overall survival in ESR1-positive breast cancer patients (n=1486), Luminal A patients

(n=825) and Luminal B (n=668) patients, stratified by ZNF143 or CTCF expression. Data were

obtained from the Kaplan-Meier plotter breast cancer survival analysis database (Györffy 2010).

Hazard ratios (HR) and logrank P-values are displayed (B).

Figure 4: Chromatin Looping Factor Expression is Affected by Genetic Alterations and is Clinically Relevant in Luminal Breast Cancer

HomDel

(n=1)

HetLos

s (n=

165)

Diploid

(n=6

55)

Gain (n

=134

)

Amp (n=

4)6

8

10

12

14

STAG2

««

NS NS

mR

NA

Expr

essi

on (l

og2)

HomDel

(n=3)

HetLos

s (n=

151)

Diploid

(n=6

39)

Gain (n

=155

)

Amp (n=

10)

10

12

14

SMC1A

« «««

««««

mR

NA

Expr

essi

on (l

og2)

HomDel

(n=4)

HetLos

s (n=

292)

Diploid

(n=5

77)

Gain (n

=85)

Amp (n=

1)7

8

9

10

11

12

13

SMC3

««« «««

««

mR

NA

Expr

essi

on (l

og2)

HomDel

(n=6)

HetLos

s (n=

263)

Diploid

(n=5

61)

Gain (n

=124

)

Amp (n=

5)

7

8

9

10

ZNF143

««« «««

««

mR

NA

Expr

essi

on (l

og2)

NS

HomDel

(n=10

)

HetLos

s (n=

598)

Diploid

(n=2

70)

Gain (n

=79)

Amp (n=

3)8

9

10

11

12

CTCF

«««

««

mR

NA

Expr

essi

on (l

og2)

««

«

HomDel

(n=0)

HetLos

s (n=

23)

Diploid

(n=3

38)

Gain (n

=407

)

Amp (n=

190)

9

10

11

12

13

14

15

16

17

Rad21

«««

«««

mR

NA

Expr

essi

on (l

og2)

«««

HomDel

(n=1)

HetLos

s (n=

89)

Diploid

(n=6

31)

Gain (n

=227

)

Amp (n=

11)

5

6

7

8

9

10

11

12

13

STAG1

««« «««

NS

mR

NA

Expr

essi

on (l

og2)

Page 25: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

20

Discussion & Future Directions 4

Chromatin interactions regulate transcriptional networks that drive differentiation and cell-specific

responses to stimuli (Fraser et al. 2009; Sanyal et al. 2012; Jin et al. 2013; Rao et al. 2014).

Dysregulation of transcriptional regulation mechanisms and consequent changes to gene expression

networks are central to tumourigenesis and disease progression (Kolch et al. 2015). Using three

different studies characterizing genetic alterations in breast tumours, we show that the chromatin

interaction factors that are known to regulate chromatin loops (ZNF143, CTCF and the subunits of the

cohesin complex) are frequently genetically altered. Furthermore, using an independent dataset, we

show that elevated expression of these factors typifies aggressive ESR1-positive breast tumours. These

results expand on the reported significant mutational load in CTCF and the subunits of the cohesin

complex in some solid and haematological cancers (Lawrence et al. 2014). We find that genetic

alterations in chromatin-looping factors tend towards mutual exclusivity in breast cancer, an

observation shared with lung squamous cell carcinoma (TCGA, Provisional), bladder urothelial

carcinoma (TCGA 2014) and liver hepatocellular carcinoma (TCGA, Provisional) (Supplementary

Tables 6-8) The tendency towards mutual exclusivity in mutations targeting chromatin-looping factors

supports previous reports showing that distinct mutations identified in tumours can converge on

proteins involved in a central oncogenic pathway (Leiserson et al. 2015). In addition to mutations that

directly target genes, genetic variance in the sequences of regulatory elements can alter looping

machinery binding to the chromatin and predispose the cell to improper gene expression. For instance,

CTCF/cohesin binding sites in or adjacent to the CTCF motif are frequently mutated in colorectal

cancers (Katainen et al. 2015) and deletion of CTCF/cohesin co-bound sites in ESCs alter interaction

frequencies of enhancers with proximal genes and can dysregulate their expression (Dowen et al.

2014). These findings indicate that looking beyond the genetic alterations in looping factors to changes

that occur in regulatory elements harbouring ZNF143 and CTCF binding motifs in breast cancer may

further elucidate the extent to which chromatin looping factor activity is affected in tumours.

Recent studies suggest that looping structures are heterogeneous and are mediated by different

combinations of chromatin interaction proteins (De Laat and Duboule 2013). Chromatin interactions

Page 26: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

21

between two CTCF-bound regions are associated with long (>1Mb) stable interactions conserved

across cell types that partition the genome into regulatory blocks (Zhang et al. 2012; Phillips-Cremins

et al. 2013; Heidari et al. 2014, Rao et al. 2014; Dowen et al. 2014). Interactions that connect enhancer

and promoter elements are associated with shorter, more cell-type specific interactions and generally

occur within CTCF-CTCF interaction boundaries (Schmidt et al. 2010; Kagey et al. 2010; DeMare et

al. 2013; Heidari et al. 2014; Dowen et al. 2014; Bailey et al. 2015). They are believed to be mediated

by CTCF at distal sites and ZNF143 at promoters (Cubeñas-Potts and Corces 2015). The dynamics of

these enhancer-promoter interactions is subject to debate, particularly those that mediate the

transcriptional response to external stimuli. Chromatin Conformation Capture (3C) assays in MCF-7

cells depleted of ESR1 indicates that this receptor is required at specific chromatin interaction anchors

to activate transcription of certain E2-regulated genes (Pan et al. 2008; Fullwood et al. 2009). HiC

analysis in stimulated MCF-7 cells also indicates that E2-stimulation increases the frequency of

interactions (Mourad et al. 2014), with the largest changes in interaction frequency occurring at ESR1

binding sites. Finally, locus-specific studies have indicated that E2-stimulation can increase the

frequency of specific promoter-enhancer interactions (Li et al. 2013). Taken together, these results

would suggest that chromatin interactions involving the ESR1 are transient and are induced by E2-

stimulation. These studies however, stand in contrast to the reports that indicate enhancer-promoter

interactions are established during cellular differentiation, do not require transcription to remain stable

and are mostly pre-formed prior to external stimulation (Fraser et al. 2009; Sanyal et al. 2012; Jin et al.

2013; Rao et al. 2014). Our work shows that ZNF143 is present at promoters interacting with distal

ESR1-bound sites and is required for the ESR1-mediated response to E2-stimulation. ZNF143 is

present at E2-regulated gene promoters prior to stimulation and remains bound upon hormone

treatment. A similar observation was reported for CTCF in MCF-7 breast cancer cells (Ross-Innes et al.

2011). Together, these results are consistent with chromatin looping factors stably bookmarking the

chromatin interaction anchors to predispose the cells to a transcriptional response guided by a three-

dimensional chromatin configuration. Interestingly, the ZNF143 motif is found at the majority of

mammalian promoters (Myslinski et al. 2006) and CTCF binding is highly conserved across tissues

(Schmidt et al. 2012). Furthermore, the chromatin looping machinery is genetically altered in each

breast cancer subtype (Figure 3C) as well as in many different cancers-types (Supplementary Figure 6),

indicating that the looping machinery contributes to cell-specific transcriptional regulation. Therefore,

how the looping machinery establishes cell-type specific interactions in a normal cell and how the

Page 27: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

22

alterations to these interactions affects gene expression regulation in the context of distinct cancers

remain important questions to be addressed.

The ESR1-mediated transcriptional response is a major driver of growth and proliferation in a majority

of diagnosed breast cancer cases (Tyson et al. 2011) and is accordingly an important focus for targeted

therapy (Renoir et al. 2013). Although the treatment of breast cancer has benefitted tremendously from

the generation of therapies against ESR1 activity, disease relapse continues to pose a challenge due to

intrinsic or acquired drug resistance. Better understanding of the mechanism of ESR1 signalling may

provide alternative treatment avenues. Our work identifies the chromatin-looping machinery, inclusive

of ZNF143, as a central player in ESR1 signalling in breast cancer cells. As targeting zinc-finger

proteins, either ZNF143 or CTCF, is technically challenging, a comprehensive identification of the

chromatin-looping machinery is warranted. In Drosophila, accessory proteins (CP190, Rad21, Mdg4,

CapH2, condensin factor, Fs1h-L and L3mbt) that interact with and assist DNA binding chromatin-

looping factors have been identified (Moshkovich et al. 2011; Gurudatta et al, 2013; Kellner et al.

2013; Van Bortle et al. 2014; Vogelmann et al. 2014). However, the characterization of chromatin-

looping accessory proteins has been less extensive in mammals. In humans, the cohesin complex

proteins serve as accessory protein to the chromatin-looping factors (Cubeñas-Potts and Corces 2015)

and recent reports suggest that the Mediator complex may assist in the formation of chromatin

interactions. The Mediator enriches at enhancer-promoter contact sites and MED1 or MED12 subunit

depletion alters chromatin interactions in embryonic stem cells (Kagey et al. 2010; Phillips-Cremins et

al. 2013). Additional factors such as Nipbl, which can load cohesin on the chromatin, and CTCF-

interacting proteins YYA, Kaiso, CHD8, PARP1, MAZ, JUND, nuceophosmin, PRDM5 and TF-II

might also prove critical in targeting the chromatin-looping machinery in cancer (Kagey et al. 2010;

Cubeñas-Potts & Corces 2015). Finally, enhancer RNAs (eRNAs) have been suggested to participate in

the formation and stability of chromatin interactions (Li et al. 2013; Hah et al. 2013). However,

chromatin loops can still exist in the absence of eRNA production (Hah et al. 2013). Although our

work focuses on enhancer-promoter loops, the delineation of the full complement of factors involved in

the different types of interactions may provide us with a better understanding of the mechanism

through which chromatin interactions regulate gene transcription.

Page 28: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

23

Overall, we establish the chromatin-looping factor ZNF143 as a key regulator of ESR1 signalling in

luminal breast cancer cells. We show that the chromatin-looping machinery, inclusive of ZNF143, is

altered in over 20% of breast tumours and that the expression of these looping factors is of clinical

relevance to breast cancer patients. This work expands our understanding of the mechanism that drives

two-thirds of breast cancer cases. The prevalence of alterations to the chromatin-interaction machinery

across multiple cancer types is indicative that these factors likely contribute to transcriptional

regulation beyond ESR1 signalling. Thus, in addition to established genetic and epigenetic

contributions, the three-dimensional architecture of the chromatin may prove to be another important

regulator of the hallmark altered gene transcription regulation seen in cancer.

Page 29: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

24

5 Supplementary Tables & Figures

5.1 Supplementary Tables

Supplementary Table 1: E2-regulated genes in MCF-7 cells

E2 Up-Regulated Genes E2 Down-Regulated Genes

EN2 FAT4 PFKFB3 DPM2,FAM102A FUT4 KBTBD8 TMEM201 P2RY2 EPS15L1 GEMIN5 AMER1 ADAT2 LAMA3 TET2 ITPK1 FOSL2 GADD45B SOX9 CA12 HPDL WDR35 RTN4RL1 C10orf2 PRSS23 RAB3A NCOR2 PLD6 TBKBP1

KAZN NAT8L SYNE3 CELSR2 CBFA2T3 SYBU MYEOV FOS SIAH2 GAB2 ZNF703 PITX1 SCAND1 ASB13 FAM178B PADI3 CALCR C5orf4 KCNK6 ARID5A STX1B HSPB8 ARTN PPM1K SLC22A5 XBP1 SGK223 HS3ST3B1 RASGRP1

RARA ANKRD2 RET COLEC12 IGFBP4 A4GALT TNS1 DLC1 RAB37 KCNV1 CYP26A1 BIRC3 SEC14L2 SCNN1G MYC MYBL1 CISH HEY2 RAPGEFL1 AMZ1 CD22 NPY1R MYB C5AR1 RBM24 PDZK1 C1orf226,NOS1AP SGK1

TPPP3 TNFSF10 NUPR1 SLC22A1 PCDH9 DRD1 DEPDC7 NOV BPIFA4P HCAR2,HCAR3 TM4SF18 C8orf4 KCNJ8 HMOX1 BTN3A3 TNFRSF11B ANKRD33B PROCA1 SOX2 TMEM156 CYP1B1-AS1 HEY1 ARHGEF37 ADARB2 TGFB2 EPHA4

Page 30: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

25

POLR1B ONECUT2 ADAMTSL5 C1QTNF6 COL12A1 CAMKK1 SLC7A5 MED13L SLC9A3R1 TMED8 FOXC1 AMOTL1 TGM2 KDM4B SLC2A1 NRIP1 USP35 WIPF1 TFAP4 ANKRD1 OTUB2 CARD10,MFNG ARHGEF26 PPARGC1B RERG JAK2 SNORD15A

MPPED2 RNF223 PKIB SEMA6A ZNF488 PEX11A TSKU ISM1 STC2 NAV2 MBOAT1 TFF1 PABPN1L AGR3 CCDC88C SLC2A4 SIGLEC15 STC1 SMOX KCNQ4 TIFA FAM86B3P TRIM31 HEG1 C8orf44,C8orf44-SGK3,SGK3

GPR68 KCNF1 GPR132 NR5A2 SNORA71B FMN1 PGR C5AR2 FOXN1 ALX4 KIAA0226L KCNK5 EGR3 CXCL12 C8orf46 SLITRK4 PEBP4 TMPRSS3 GREB1 FMN1 TMPRSS3 LOC100507584 SNORA75 RPS17L

SALL4 DIO2 BMF FAM46B EPHB3 CCNG2 EDARADD

Page 31: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

26

Supplementary Table 2: GSEA analysis of genes down-regulated upon ZNF143 depletion under E2 stimulation

Gene Set Name P-Val

Early Oestrogen Response 7.21e-15

Hypoxia 7.21e-15

Late Oestrogen Response 3.92e-9

MTORC1 signalling 3.92e-9

Fatty Acid Metabolism 9.99e-9

Page 32: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

27

Supplementary Table 3: Genetic alterations tend towards mutual exclusivity (TCGA 2012)

Factor 1 Factor 2 P-value Log Odds Ratio

Association

CTCF RAD21 0.243 -1.071 Tendency towards mutual exclusivity

RAD21 STAG2 0.394 <-3 Tendency towards mutual exclusivity

RAD21 SMC1A 0.538 <-3 Tendency towards mutual exclusivity

ZNF143 RAD21 0.628 <-3 Tendency towards mutual exclusivity

CTCF STAG2 0.795 <-3 Tendency towards mutual exclusivity

CTCF SMC1A 0.858 <-3 Tendency towards mutual exclusivity

ZNF143 CTCF 0.892 <-3 Tendency towards mutual exclusivity

STAG1 STAG2 0.904 <-3 Tendency towards mutual exclusivity

CTCF SMC3 0.927 <-3 Tendency towards mutual exclusivity

STAG1 SMC1A 0.935 <-3 Tendency towards mutual exclusivity

ZNF143 STAG1 0.951 <-3 Tendency towards mutual exclusivity

STAG2 SMC1A 0.951 <-3 Tendency towards mutual exclusivity

ZNF143 STAG2 0.963 <-3 Tendency towards mutual exclusivity

STAG1 SMC3 0.967 <-3 Tendency towards mutual exclusivity

ZNF143 SMC1A 0.975 <-3 Tendency towards mutual exclusivity

STAG2 SMC3 0.975 <-3 Tendency towards mutual exclusivity

SMC1A SMC3 0.983 <-3 Tendency towards mutual exclusivity

ZNF143 SMC3 0.988 <-3 Tendency towards mutual exclusivity

Page 33: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

28

Supplementary Table 4: Genetic alterations tend towards mutual exclusivity (TCGA Provisional Data Set)

Factor 1 Factor 2 P-value Log Odds Ratio

Association

ZNF143 RAD21 0.222 -1.131 Tendency towards mutual exclusivity

CTCF RAD21 0.443 -0.210 Tendency towards mutual exclusivity

RAD21 SMC3 0.545 -0.248 Tendency towards mutual exclusivity

CTCF SMC3 0.691 <-3 Tendency towards mutual exclusivity

STAG1 SMC3 0.737 <-3 Tendency towards mutual exclusivity

ZNF143 STAG2 0.781 <-3 Tendency towards mutual exclusivity

Page 34: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

29

Supplementary Table 5: Fold Change mRNA Expression upon Genetic Alteration of Chromatin Looping Factors (TCGA Provisional Data Set)

Factor Homozygous Deletion

Heterozygous Loss

Gain Amplification

ZNF143 FC = 0.69

P-val = 0.002

FC = 0.91

P-val < 0.0001

FC = 1.2

P-val < 0.0001

FC = 1.2

P-val= NS

CTCF FC = 0.69

P-val = 0.004

FC = 0.91

P-val <0.0001

FC = 1.2

P-val = 0.0004

FC = 1.2

P-val = 0.0038

Rad21

NA FC = 0.73

P-val = 0.002

FC = 1.6

P-val < 0.0001

FC = 2.7

P-val < 0.0001

STAG1

NA FC = 0.82

P-val = 0.0001

FC = 1.1

P-val = 0.0004

FC = 1.4

P-val = NS

STAG2 FC = 1.3

P-val = NS

FC = 1.0

P-val = NS

FC = 1.0

P-val = NS

FC = 1.4

P-val = 0.02

SMC1A FC = 0.62

P-val = 0.01

FC = 1.1

P-val = 0.015

FC = 1.2

P-val <0.0001

FC = 2.4

P-val <0.0001

SMC3 FC = 0.44

P-val < 0.0001

FC = 0.8

P-val <0.0001

FC = 1.2

P-val <0.0001

FC = 0.7

P-val = NA

Page 35: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

30

Supplementary Table 6: Genetic alterations tend towards mutual exclusivity in lung squamous cell carcinoma (TCGA, Provisional)

Factor 1 Factor 2 P-value Log Odds Ratio

Association

SMC3 STAG1 0.268 <-3 Tendency towards mutual exclusivity

STAG1 STAG2 0.325 <-3 Tendency towards mutual exclusivity

CTCF STAG1 0.691 <-3 Tendency towards mutual exclusivity

RAD21 SMC1A 0.691 <-3 Tendency towards mutual exclusivity

SMC1A STAG2 0.729 <-3 Tendency towards mutual exclusivity

RAD21 SMC3 0.751 <-3 Tendency towards mutual exclusivity

ZNF143 RAD21 0.783 <-3 Tendency towards mutual exclusivity

RAD21 STAG2 0.783 <-3 Tendency towards mutual exclusivity

SMC3 STAG2 0.783 <-3 Tendency towards mutual exclusivity

ZNF143 STAG2 0.812 <-3 Tendency towards mutual exclusivity

CTCF RAD21 0.923 <-3 Tendency towards mutual exclusivity

ZNF143 CTCF 0.934 <-3 Tendency towards mutual exclusivity

CTCF STAG2 0.934 <-3 Tendency towards mutual exclusivity

ZNF143 STAG1 0.733 -0.014 Tendency towards mutual exclusivity

RAD21 STAG1 0.664 -0.203 Tendency towards mutual exclusivity

Page 36: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

31

Supplementary Table 7: Genetic alterations tend towards mutual exclusivity in bladder urothelial carcinoma (TCGA, 2014)

Factor 1 Factor 2 P-value Log Odds Ratio Association

RAD21 STAG2 0.297 -0.998 Tendency towards mutual exclusivity

CTCF RAD21 0.460 <-3 Tendency towards mutual exclusivity

CTCF STAG2 0.504 <-3 Tendency towards mutual exclusivity

SMC1A STAG2 0.580 <-3 Tendency towards mutual exclusivity

STAG1 STAG2 0.665 <-3 Tendency towards mutual exclusivity

RAD21 SMC3 0.736 <-3 Tendency towards mutual exclusivity

SMC3 STAG2 0.763 <-3 Tendency towards mutual exclusivity

ZNF143 RAD21 0.858 <-3 Tendency towards mutual exclusivity

ZNF143 STAG2 0.874 <-3 Tendency towards mutual exclusivity

CTCF STAG1 0.886 <-3 Tendency towards mutual exclusivity

SMC1A STAG1 0.908 <-3 Tendency towards mutual exclusivity

CTCF SMC3 0.923 <-3 Tendency towards mutual exclusivity

SMC1A SMC3 0.938 <-3 Tendency towards mutual exclusivity

SMC3 STAG1 0.953 <-3 Tendency towards mutual exclusivity

ZNF143 CTCF 0.961 <-3 Tendency towards mutual exclusivity

ZNF143 SMC1A 0.969 <-3 Tendency towards mutual exclusivity

ZNF143 STAG1 0.976 <-3 Tendency towards mutual exclusivity

ZNF143 SMC3 0.984 <-3 Tendency towards mutual exclusivity

Page 37: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

32

Supplementary Table 8: Genetic alterations tend towards mutual exclusivity in liver hepatocellular carcinoma (TCGA Provisional Data Set)

Factor 1 Factor 2 P-value Log Odds Ratio Association

RAD21 STAG1 0.319 <-3 Tendency towards mutual exclusivity

RAD21 SMC3 0.506 <-3 Tendency towards mutual exclusivity

CTCF RAD21 0.569 -0.432 Tendency towards mutual exclusivity

CTCF SMC1A 0.830 <-3 Tendency towards mutual exclusivity

CTCF STAG1 0.830 <-3 Tendency towards mutual exclusivity

SMC1A STAG1 0.876 <-3 Tendency towards mutual exclusivity

ZNF143 CTCF 0.895 <-3 Tendency towards mutual exclusivity

CTCF SMC3 0.895 <-3 Tendency towards mutual exclusivity

ZNF143 STAG1 0.924 <-3 Tendency towards mutual exclusivity

SMC1A SMC3 0.924 <-3 Tendency towards mutual exclusivity

SMC3 STAG1 0.924 <-3 Tendency towards mutual exclusivity

CTCF STAG2 0.929 <-3 Tendency towards mutual exclusivity

SMCA1 STAG2 0.949 <-3 Tendency towards mutual exclusivity

STAG1 STAG2 0.949 <-3 Tendency towards mutual exclusivity

ZNF143 SMC3 0.954 <-3 Tendency towards mutual exclusivity

ZNF143 STAG2 0.969 <-3 Tendency towards mutual exclusivity

SMC3 STAG2 0.969 <-3 Tendency towards mutual exclusivity

Page 38: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

33

Supplementary Table 9: Primers used in this study

Gene Forward Primer Reverse Primer

Actin 5’ GGACTTCGAGCAAGAGATGG 3’ 5’ AGCACTGTCTTGGCGTACAG 3’

ZNF143 5’ CGCAGTCTGACACCATCTTG 3’ 5’ CCAATCATTCCAGTACCTGCT 3’

RARA 5’ AGGCTACCACTATGGGGTCA 3’ 5’ CGGGTCACCTTGTTGATGAT 3’

XBP1 5’ CTGAGCCCCGAGGAGAAG 3’ 5’ TGTTCCAGCTCACTCATTCG 3’

CA12 5’ GTGGTGTCCATTTGGCTTTT 3’ 5’ CAAGGTCCTTCCTGGATGTG 3’

ERBB3 5’ TCCTTCCTGCAGTGGATTCG 3’ 5’ CATCTCGGTCCCTCACGATG 3’

CCND1? 5’ TTGTTCAAGCAGCGAGTCCC 3’ 5’ CTGTTCCTCGCAGACCGAAG 3’

Page 39: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

34

5.2 Supplementary Figures

Supplementary Figure 1

ZNF143 Binding (Vehicle) ZNF143 Binding (E2)

Replicate 1 Replicate 1

Repl

icate

2

Repl

icate

2

Supplementary Figure 1: ZNF143 ChIP-sequencing replicates show a high degree of reproducibility

Scatterplots depicting read count for ZNF143 binding in MCF-7 breast cancer cells treated with vehicle or E2 (10µM) for 45 minutes in two separate experiments. Plot was generated with SeqMonk from ZNF143 ChIP-sequencing data. R, Pearson’s correlations, were calculated from all data points.

R = 0.884 R = 0.928

Page 40: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

35

Supplementary Figure 2

Page 41: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

36

Supplementary Figure 3

Page 42: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

37

Supplementary Figure 4

siScr

siZNF14

30

2

4

6

8

FC = 1.1NS

FC = 1.0NS

FC = 3.6!!!

FC = 3.1!!!

VehicleE2

FPK

M

siScr

siZNF14

30

10

20

30

40

50 FC = 1NS

FC = 1NS

FC = 1.1NS

FC = 1.1NS

VehicleE2

FPK

M

siScr

siZNF14

30

20

40

60

80

100

VehicleE2

FC = 1.1NS

FC = 1NS

FC = 1.2NS

FC = 1.2NS

FPK

M

Supplementary Figure 4: Depletion of ZNF143 mRNA does not affect ESR1 or FOXA1 expression under vehicle or E2-stimulated conditions

Gene expression changes in cells transfected with siZNF143 or scrambled siren under vehicle and E2-stimulated conditions (RPKM: reads per kilobase per million). (*p<0.05; ** p<0.01; *** p<0.001; NS = Not Significant)

ZNF143 ESR1 FOXA1

Page 43: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

38

Supplementary Figure 5

0.0

0.2

0.4

0.6

0.8

1.0

Expressionlowhigh

HR = 1.4 (1.1 − 1.6)logrank P = 0.0013

Prob

abilit

y0.

00.

20.

40.

60.

81.

0

Time (years)0 5 10 15 20 25

HR = 1.4P-val = 0.0013

Expressionlowhigh

0.0

0.2

0.4

0.6

0.8

1.0

Expressionlowhigh

HR = 1.23 (0.91 − 1.67)logrank P = 0.18

Prob

abilit

y0.

00.

20.

40.

60.

81.

0

Time (years)0 5 10 15 20 25

HR = 1.23P-val = 0.18

Expressionlowhigh

0.0

0.2

0.4

0.6

0.8

1.0

Expressionlowhigh

HR = 1.3 (1.0 − 1.7)logrank P = 0.027

Prob

abilit

y0.

00.

20.

40.

60.

81.

0

Time (years)0 5 10 15 20 25

HR = 1.3P-val = 0.027

Expressionlowhigh

Rad21

Luminal BLuminal AERα Positive

Supplemementary Figure 5: Rad21 expression is relevant to clinical outcome of luminal breast cancer patients

Kaplan-Meier plots derived from the METABRIC data set (Curtis et al. 2012) evaluating overall survival in ESR1-positive breast cancer patients (n=1486), Luminal A patients (n=825) and Luminal B (n=668) patients, stratified by RAD21 expression. Data were obtained from the Kaplan-Meier plotter breast cancer survival analysis database (Györffy 2010). Hazard ratios (HR) and logrank P-values are displayed

Page 44: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

39

Supplementary Figure 6

Bre

ast (

BC

CR

C X

enog

raft)

Ova

rian

(TC

GA

) Ova

rian

Ute

rine

(TC

GA

pub

)U

terin

e (T

CG

A) U

terin

eB

ladd

er (T

CG

A) B

ladd

erU

terin

e C

S (T

CG

A)

Bla

dder

(TC

GA

pub

)N

CI-

60 N

CI-

60 C

ell

Lung

squ

(TC

GA

)Pr

osta

te (M

ICH

) Pro

stat

ePr

osta

te (S

U2C

) Met

asta

ticSt

omac

h (T

CG

A) S

tom

ach

Live

r (TC

GA

) Liv

erSt

omac

h (T

CG

A p

ub)

Hea

d &

nec

kB

reas

t (TC

GA

) Bre

ast

Esop

hagu

s (T

CG

A) E

soph

agea

lH

ead

& n

eck

Uve

al m

elan

oma

(TC

GA

)Pa

ncre

as (U

TSW

) Pan

crea

ticO

varia

n (T

CG

A p

ub)

Cer

vica

l (TC

GA

) Cer

vica

lLu

ng a

deno

(TC

GA

)B

ladd

er (B

GI)

Bla

dder

Lung

squ

(TC

GA

Col

orec

tal (

TCG

A) C

olor

ecta

lB

reas

t (TC

GA

pub

)Lu

ng a

deno

(TC

GA

CC

LE (N

ovar

tis/B

road

201

2)C

olor

ecta

l (TC

GA

pub

)M

elan

oma

(TC

GA

) Ski

nPa

ncre

as (T

CG

A) P

ancr

eatic

AC

yC (M

SKC

C) A

deno

iduc

s (J

ohns

Hop

kins

Mel

anom

a (B

road

) Ski

nC

olor

ecta

l (G

enen

tech

) Col

orec

tal

Ewin

g Sa

rcom

a (I

nstit

utA

ML

(TCG

A) A

cute

AM

L (T

CGA

pub

)M

elan

oma

(Bro

ad/D

FCI)

Mel

anom

aG

liom

a (T

CG

A) B

rain

Lung

ade

no (B

road

)B

ladd

er (M

SKC

C 2

014)

Ewin

g Sa

rcom

a (D

FCI)

Sarc

oma

(TC

GA

) Sar

com

aPr

osta

te (T

CG

A) P

rost

ate

AC

C (T

CG

A) A

dren

ocor

tical

Pros

tate

(TC

GA

201

5)G

BC

(Sha

ngha

i) G

allb

ladd

erLu

ng S

C (U

COLO

GN

E)G

BM

(TC

GA

) Glio

blas

tom

aD

LBC

(TC

GA

) Lym

phoi

dLu

ng S

C (C

LCG

P)G

BM

(TC

GA

201

3)M

elan

oma

(Yal

e) S

kin

pRC

C (T

CG

A) K

idne

yEs

opha

gus

(Bro

ad) E

soph

agea

lPr

osta

te (B

road

/Cor

nell

2013

)Lu

ng S

C (J

HU

)cc

RC

C (T

CG

A p

ub)

Cho

lang

ioca

rcin

oma

(NC

CS)

Cho

lang

ioca

rcin

oma

MPN

ST (M

SKC

C) M

alig

nant

ccR

CC

(TC

GA

) Kid

ney

Live

r (A

MC

) Liv

erB

ladd

er (M

SKC

C 2

012)

Hea

d &

nec

kSa

rcom

a (M

SKC

C) S

arco

ma

PCPG

(TC

GA

) Phe

ochr

omoc

ytom

a

0

20

40

60

80

Alte

ratio

n F

requ

ency

(%)

Supplementary Figure 3: Chromatin Looping Factors are Genetically Altered in Many CancersOverall frequency of genetic alterations in the chromatin looping factors (ZNF143, CTCF and the cohesion

subunits) in datasets available through the CBioPortal database (http://www.cbioportal.org).

Page 45: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

40

References 5

Bailey SD, Zhang X, Desai K, Aid M, Corradin O, Cowper-Sal Lari R, Akhtar-Zaidi B, Scacheri PC, Haibe-Kains B, Lupien M. 2015. ZNF143 provides sequence specificity to secure chromatin interactions at gene promoters. Nat Commun 2: 6186. Doi: 10.1038/ncomms7186.

Barski A, Cuddapah S, Cui K, Roh TY, Schones DE, Wang Z, Wei G, Chepelev I, Zhao K. 2007. High-resolution profiling of histone methylations in the human genome. Cell 129(4): 823-837.

Barutcu AR, Lajoie BR, McCord RP, Tye CE, Hong D, Messier TL, Browne G, van Wijnen AJ, Lian JB, Stein JL. 2015. Chromatin interaction analysis reveals changes in small chromosome and telomere clustering between epithelial and breast cancer cells. Genome Biol 16: 214.

Bickel PJ, Boley N, Brown JB, Huang H, Zhang NR. 2010. Subsampling methods for genomic inference. Annals of Applied Statistics 4(4): 1660-1697.

Birney E, Stamatoyannopoulos JA, Dutta A, Guigo R, Gingeras TR, Margulies EH, Weng Z, Snyder M, Dermitzakis ET, Thurman RE. 2007. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447(7146): 799-816.

Beroukhim R, Getz G, Nghiemphu L, Barretina J, Hsueh T, Linhart D, Vivanco I, Lee JC, Huang JH, Alexander S. 2007. Assessing the significance of chromosomal aberrations in cancer: Methodology and application to glioma. PNAS 104(50): 20007-20012.

Burcin M, Arnold R, Lutz M, Kaier B, Runge D, Lottspeich F, Filippova GN, Lobanenkov VV, Renkawitz R. 1997. Negative protein 1, which is required for function of the chicken lysozyme gene silencer in conjunction with hormone receptors, is identical to the multivalent zinc finger repressor CTCF. Mol Cell Biol 17: 1281-1288.

Carretero M, Ruiz-Torres M, Rodriguez-Corsino M, Barthelemy I, Losada A. 2013. Pds5B is required for cohesion establishment and Aurora B accumulation at centromeres. EMBO J 32: 2938-2949.

Carroll JS, Meyer CA, Song J, Li W, Geistlinger TR, Eeckhoute J, Brodsky AS, Keeton EK, Fertuck KC, Hall GF et al. 2006. Genome-wide analysis of estrogen receptor binding sites. Nature Genet 38(11): 1289-1297.

Cerami E, Gao J, Dogrusoz U, Gross BE, Sumer SO, Aksoy BA, Jacobsen A, Byrne CJ, Heuer ML, Larsson E. 2012. The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer Discov 2(5): 401-404.

Chen X, Xu H, Yuan P, Fang F, Huss M, Vega VB, Wong E, Orlov YL, Zhang W, Jiang J et al. 2008. Integration of External Signaling Pathways with the Core Transcriptional Network in Embryonic Stem Cells. Cell 133(6): 1106-1117.

Page 46: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

41

Cubeñas-Potts C, Corces VG. 2015. Architectural proteins, transcription, and the three-dimensional organization of the genome. FEBS Lett . Epub ahead of print.

Cuddapah S, Jothi R, Schones DE, Roh TY, Cui K, Zhao K. 2009. Global analysis of the insulator binding protein CTCF in chromatin barrier regions reveals demarcation of active and repressive domains. Genome Res 19: 24-32.

Curtis C, Shah SP, Chin SF, Turashvili G, Rueda OM, Dunning MJ, Speed D, Lynch AG, Samarajiwa S, Yuan Y et al. 2012. The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups. Nature 486: 346-352.

DeMare LE, Leng J, Cotney J, Reilly SK, Yin J, Sarro R, Noonan JP. 2013. The genomic landscape of cohesin-associated chromatin interactions. Genome Res 23(8): 1224-1234.

Dixon JR, Selvaraj S, Yue F, Kim A, Li Y, Shen Y, Hu M, Liu JS, Ren B. 2012. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485(7398): 376-380.

Dowen JM, Fan ZP, Hnisz D, Ren G, Abraham BJ, Zhang LN, Weintraub AS, Schuijers J, Lee TI, Zhao et al. 2014. Control of cell identity genes occurs in insulated neighborhoods in mammalian chromosomes. Cell 159(2): 374-387.

Eirew P, Steif A, Khattra J, Ha G, Yap D, Farahani H, Gelmon K, Chia S, Mar C, Wan A. 2015. Dynamics of genomic clones in breast cancer patient xenografts at single-cell resolution. Nature 518(7539): 422-426.

Engel N, West AG, Felsenfeld G, Bartolomei MS. 2004. Antagonism between DNA hypermethylation and enhancer-blocking activity at the H19 DMD is uncovered by CpG mutations. Nature 36: 883-888.

Ernst J, Kellis M. 2012. ChromHMM: automating chromatin-state discovery and characterization. Nat Methods 9: 215-216.

Faresse NJ, Canella D, Praz V, Michaud J, Romanascano D, Hernandez N. 2012. Genomic Study of RNA Polymerase II and III SNAPc-Bound Promoters Reveals a Gene Transcribed by Both Enzymes and a Broad Use of Common Activators. PLoS Genetics. DOI: 10.1371/journal.pgen.1003028.

Fraser J, Rousseau M, Shenker S, Ferraiuolo MA, Hayashizaki Y, Blanchette M, Dostie J. 2009. Chromatin conformation signatures of cellular differentiation. Genome Biol 10(4): R37. Doi: 10.1186/gb-2009-10-4-r37.

Fedoriw AM, Stein P, Svoboda P, Schultz RM, Bartolomei MS. 2004. Transgenic RNAi reveals essential function for CTCF in H19 gene imprinting. Science 303: 238-240.

Filippova GN, Fagerlie S, Klenova EM, Myers C, Dehner Y, Goodwin G, Neiman PE, Collins SJ, Lobanenkov VV. 1996. An exceptionally conserved transcriptional repressor, CTCF, employs different combinations of zinc fingers to bind diverged promoter sequences of avian and mammalian c-myc oncogenes. Mol Cell Biol 16: 2802-2813.

Page 47: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

42

Fullwood MJ, Liu MH, Pan YF, Liu J, Xu H, Mohamed YB, Orlov YL, Velkov S, Ho A, Mei PH et al. 2009. An oestrogen-receptor-alpha-bound human chromatin interactome. Nature 462(4729): 58-64.

Gao J, Aksoy BA, Dogrusoz U, Dresdner G, Gross B, Sumer SO, Sun Y, Jacobsen A, Sinha R, Larsson E. 2013. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci Signal 6(269): pl1. Doi: 10.1126/scisignal.2004088.

Guo Y, Xu Q, Canzio D, Shou J, Li J, Gorkin DU, Jung I, Wu H,Zhai Y, Tang Y et al. 2015. CRISPR Inversion of CTCF Sites Alters Genome Topology and Enhancer/Promoter Function. Cell 162(4): 900-910.

Gurudatta BV, Yang J, Van Bortle K, Donlin-Asp P.G., Corces V.G. 2013. Dynamic changes in the genomic localization of DNA replication-related element binding factor during the cell cycle. Cell Cycle 12:1605-1615.

Györffy B, Lanczky A, Eklund, AC, Denkert C, Budczies J, Li Q, Szallasi Z. 2010. An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients. Breast Cancer Res Treat 123(3): 725-731.

Hadjur S, Williams LM, Ryan NK, Cobb BS, Sexton T, Fraser P, Fisher AG, Merkenschlager M. 2009. Cohesins form chromosomal cis-interactions at the developmentally regulated IFNG locus. Nature 460(7253): 410-413.

Hah N, Murakami S, Nagari A, Danko CG, Kraus WL. 2013. Enhancer transcripts mark active estrogen receptor binding sites. Genome Res 23(8): 1210-1223.

Halbig KM, Levken AC, Kunkel GR. 2012. The transcriptional activator ZNF143 is essential for normal development in zebrafish. BMC Mol Biol 13(3). DOI: 10.1186/1471-2199-13-3.

He HH, Meyer CA, Chen MW, Jordan VC, Brown M, Liu XS. 2012. Differential DNase I hypersensitivity reveals factor-dependent chromatin dynamics. Genome Res 22(6): 1015-1025.

Heidari N, Phanstiel DH, He C, Grubert F, Jahanbani F, Kasowski M, Zhang MQ, Snyder M. 2014. Genome-wide map of regulatory interactions in the human genome. Genome Res 24: 1905-1917.

Hirayama T, Tarusawa E, Yoshimura Y, Galjart N, Yagi T. 2012. CTCF is required for neural development and stochastic expression of clustered Pcdh genes in neurons. Cell Rep 2(2): 345-357.

Hou C, Dale R, Dean A. 2010. Cell type specificity of chromatin organization mediated by CTCF and cohesin. PNAS 107(8): 3651-3656.

Hou C, Li L, Qin ZS, Corces VG. 2012. Gene density, transcription, and insulators contribute to the partitioning of the Drosophila genome into physical domains. Mol Cell 48(3): 471-484.

Ji X, Dadon DB, Powell BE, Fan ZP, Borges-Rivera D, Shachar S, Weintraub AS, Hnisz D, Pegoraro G, Lee TI et al. 2015. 3D Chromosome Regulatory Landscape of Human Pluripotent Cells. Cell Stem Cell. DOI: 10.1016/j.stem.2015.11.007.

Page 48: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

43

Jin F, Li Y, Dixon JR, Selvaraj S, Ye Z, Lee AY, Yen CA, Schmitt D, Espinoza CA, Ren B. 2013. A high-resolution map of the three-dimensional chromatin interactome in human cells. Nature 503: 290-294.

Joseph R, Orlov YL, Huss M, Sun W, Kong SL, Ukil L, Pan YF, Li G, Lim M, Thomsen JS. 2010. Integrative model of genomic factors for determining binding site selection by estrogen receptor- α. Mol Syst Biol 6(456). Doi: 10.1038/msb.2010.109.

Kagey MH, Newmann JJ, Bilodeau S, Zhan Y, Orlando DA, van Berkum NL, Ebmeier CC, Goossens J, Rahl PB, Levine SS. 2010. Mediator and cohesin connect gene expression and chromatin architecture. Nature 467: 430:-435.

Katainen, Dave K, Pitkänen E, Palin K, Kivioja T, Välimäki N, Gylfe AE, Ristolainen H, Hänninen UA, Cajuso T et al. 2015. CTCF/cohesin-binding sites are frequently mutated in cancer. Nature Gen 47: 818-821.

Kellner WA, Van Bortle K, Li L, Ramos E, Takenaka N, Corces VG. 2013. Distinct isoforms of the Drosophila Brd4 homologue are present at enhancers, promoters and insulator sites. Nucleic Acids Res 41: 9274-9283.

Kim TH, Abdullaev ZK, Smith AD, Ching KA, Loukinov DI, Green RD, Zhang MQ, Lobanenkov VV, Ren B. 2007. Analysis of the vertebrate insulator protein CTCF-binding sites in the human genome. Cell 128(6): 1231-1245.

Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. 2013. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol 14: R36. Doi: 10.1186/gb-2013-14-4-r36.

Klenova EM, Nicolas RH, Paterson HF, Carne AF, Heath CM, Goodwin GH, Neiman PE, Lobanenkov VV. 1993. CTCF, a conserved nuclear factor required for optimal transcriptional activity of the chicken c-myc gene, is an 11-Zn-finger protein differentially expressed in multiple forms. Mol Cell Biol 13: 7612-7624.

Kolch W, Halasz M, Granovskaya M, Kholodenko BN. 2015. The dynamic control of signal transduction networks in cancer cells. Nature Rev Genet 15: 515-527.

Kron KJ, Bailey SD, Lupien M. 2014. Enhancer alterations in cancer: a source for a cell identity crisis. Genome Med 6(9): 77-89.

Lawrence MS, Stojanov P, Mermel CH, Robinson JT, Garraway LA, Golub TR, Meyerson M, Gabriel SB, Lander ES, Getz G. 2014. Discovery and saturation analysis of cancer genes across 21 tumour types. Nature 505: 495-501.

Leiserson MDM, Vandin F, Wu HT, Dobson JR, Eldridge JV, Thoman JT, Papoutsaki A, Kim Y, Niu B, McLellan M et al. 2015. Pan-cancer network analysis identifies combinations of rare somatic mutations across pathways and protein complexes. Nature Gen 47: 106-114.

Page 49: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

44

Li G, Ruan X, Auerbach RK, Sandhu KS, Zheng M, Wang P, Poh HM, Goh Y, Lim J, Zhang J et al. 2012. Extensive promoter-centered chromatin interactions provide a topological basis for transcription regulation. Cell 148(1-2): 84-89.

Li W, Notani D, Ma Q, Tanasa B, Nunez E, Chen AY, Merkurjev D, Zhang J, Ohgi K, Song X et al. 2013. Functional roles of enhancer RNAs for oestrogen-dependent transcriptional activation. Nature 498: 516-523.

Lieberman-Aiden E, van Berkum NL, Williams L, Imakaev M, Ragoczy T, Telling A, Amit I, Lajoie BR, Sabo PJ, Dorschner MO. 2009. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326(5950): 289-293.

Lin CY, Vega VB, Thomsen JS, Zhang T, Kong SL, Xie M, Chiu KP, Lipovich L, Barnett DH, Stossi F et al. 2007. Whole-genome cartography of estrogen receptor alpha binding sites. PLoS Genet 3(6): e87. Doi: 10.1371/journal.pgen.0030087.

Liu MH and Cheung E. 2014. Estrogen receptor-mediated long-range chromatin interactions and transcription in breast cancer. Mol Cell Endocrinol 382(1): 624-632.

Lupien M, Keeckhoute J, Meyer CA, Wang Q, Zhang Y, W Li, Carroll JS, Liu XS, Brown M. 2008. FoxA1 translates epigenetic signatures into enhancer-driven lineage-specific transcription. Cell 132(6): 958-970.

Losada A. 2014. Cohesin in cancer: chromosome segregation and beyone. Nat Rev Cancer 14: 389-393.

Lupien M, Meyer CA, Bailey ST, Eeckhoute J, Cook J, Westerling T, Zhang X, Carroll JS, Rhodes DR, Liu XS. 2010. Growth factor stimulation induces a distinct ERα cistrome underlying breast cancer endocrine resistance. Genes Dev 24: 2219-2227.

Lupianez DG, Kraft K, Heinrich V, Krawitz P, Brancati F, Klopocki E, Horn D, Kayserili H, Opitz JM, Laxova R et al. 2015. Disruptions of topological domains cause pathogenic rewiring of gene-enhancer interactions. Cell 161(5): 1012-1025.

Magnani L, Ballantyne EB, Zhang X, Lupien M. 2011. PBX1 genomic pioneer function drives ERα signaling underlying progression in breast cancer. PLoS Genet 7(11): e1002368. Doi: 10.1371/journal.pgen.1002368.

Magnani L, Stoeck A, Zhang X, Lánczky A, Mirabella AC, Wang TL, Gyorffy B, Lupien M. 2013. Genome-wide reprogramming of the chromatin landscape underlies endocrine therapy resistance in breast cancer. Proc Natl Acad Sci USA 110(16): E1490-9.

Mootha VM, Lindgren CM, Eriksson KF, Subramanian A, Sihag S, Lehar J, Pulgserver P, Carlsson E, Ridderstråle M, Laurila E et al. 2003. PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nature Genet 34: 267-273.

Page 50: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

45

Mourad R, Hsu PY, Liran J, Shen C, Koneru P, Lin H, Liu Y, Nephew K, Huang TH, Li L. 2014. Estrogen Induces Global Reorganization of Chromatin Structure in Human Breast Cancer Cells. PLOSone 10(3): e0118237. Doi:10.1371/journal.pone.0118237.

Mermel CH, Schumacher SE, Hill B, Meyerson ML, Beroukhim R, Getz G. 2011. GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers. Genome Biol 12: R41. Doi: 10.1186/gb-2011-12-4-r41.

Moshkovich N, Nisha P, Boyle PJ, Thompson BA, Dale RK, Lei EP. 2011. RNAi-independent role for Argonaute2 in CTCF/CP190 chromatin insulator function. Genes Dev 25: 1686-1701.

Myslinski E, Krol A, Carbon P. 1998. ZNF76 and ZNF143 are two human homologs of the transcriptional activator Staf. J Biol Chem 273(34): 21998-2006.

Myslinski E, Gérard MA, Krol A, Carbon P. 2006. A genome scale location analysis of human Staf/ZNF143-binding sites suggests a widespread role for human Staf/ZNF143 in mammalian promoters. J Biol Chem 281(52): 39953-39962.

Nora EP, Lajoie BR, Schulz EG, Giorgetti L, Okamoto I, Servant N, Piolot T, van Berkum NL, Meisig J, Sedat J et al. 2012. Spatial partitioning of the regulatory landscape of the X-inactivation centre. Nature 485(7398): 381-385.

Pan YF, Senali Abayratna Wansa KD, Liu MH, Zhao B, Hong SZ, Tan PY, Lim KS, Bourque G, Liu ET, Cheung E. 2008. Regulation of Estrogen Receptor-mediated Long Range Transcription via Evolutionarily Conserved Distal Response Elements. J Biol Chem 283: 32977-32988.

Parelho V, Hadjur S, Spivakov M, Leleu M, Sauer S, Gregson HC, Jarmuz A, Canzonetta C, Webster Z, Nesterova T. 2008. Cohesins Functionally Associate with CTCF on Mammalian Chromosome Arms. Cell 132: 422-433.

Parker JS, Mullins M, Cheang MCU, Leung S, Voduc D, Vickery T, Davies S, Fauron C, He X, Hu Z et al. 2009. Supervised risk predictor of breast cancer based on intrinsic subtypes. J Clin Oncol 27: 1160-1167.

Phillips-Cremins JE, Sauria ME, Sanyal A, Gerasimova TI, Lajoie BR, Bell JS, Ong CT, Hookway TA, Guo C, Sun Y et al. 2013. Architectural protein subclasses shape 3D organization of genomes during lineage commitment. Cell 153(6): 1281-1295.

Plank JL & Dean A. 2014. Enhancer Function: Mechanistic and Genome-Wide Insights come Together. Molecular Cell 55(1): 5-14.

Quinlan AR and Hall IM. 2010. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26(6): 841-842.

Rao SSP, Huntley MH, Durand NC, Stamenova EK, Bochkov ID, Robinson JT, Sanborn AL, Machol I, Omer AD, Lander ES et al. 2014. A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping. Cell 7: 1665-1680.

Page 51: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

46

Remeseiro S, Cuadrado A, Gomez-Lopez G, Pisano DG, Losada A. 2012. A unique role of cohesin-SA1 in gene regulation and development. EMBO J 9: 2090-2102.

Renoir JM, Marsaud V, Lazennac G. 2013. Estrogen receptor signaling as a target for novel breast cancer therapeutics. Biochem Pharmacol 85(4): 449-465.

Roberts A, Trapnell C, Donaghey J, Rinn JL, Pachter L. 2011. Improving RNA-Seq expression estimates by correcting for fragment bias. Genome Biol 12(3): R22. Doi: 10.1186/gb-2011-12-3-r22.

Ross-Innes CS, Brown GD, Carroll JS. 2011. A co-ordinated interaction between CTCF and ER in breast cancer cells. BMC Genomics 12: 593-603.

Rubio ED, Reiss DJ, Welcsh PL, Distechem Fillippova GN, Baliga NS, Aabersold, Ranish JA, Krumm A. 2008. CTCF physically links cohesin to chromatin. PNAS 105(24): 8309-8314.

Sanyal A, Lajoie BR, Jain G, Dekker J. 2012. The long-range interaction landscape of gene promoters. Nature 489 (7414): 109-113.

Schmidt D, Schwalie PC, Ross-Innes CS, Hurtado A, Brown GD, Carroll JS, Flicek P, Odom DT. 2010. A CTCF-independent role for cohesin in tissue-specific transcription. Genome Res 20(5): 578-588.

Schmidt D, Schwalie PC, Wilson MD, Ballester B, Gonçalves A, Kutter C, Brown GD, Marshall A, Flicek P, Odom DT. 2012. Waves of retrotransposon expansion remodel genome organization and CTCF binding in multiple mammalian lineages. Cell 148(1-2): 335-348.

Shen Y, Yue F, McCleary DF, Ye Z, Edsall L, Kuan S, Wagner U, Dixon J, Lee L, Lobanenkov VV et al. 2012. A map of the cis-regulatory sequences in the mouse genome. Nature 488: 116-120.

Sofueva S, Yaffe E, Chan WC, Georgopoulou D, Vietri Rudan M, Mira-Bontenbal H, Pollard SM, Schroth GP, Tanay A, Hadjur S. 2013. Cohesin-mediated interactions organize chromosomal domain architecture. EMBO J 32(24): 3119-3129.

Splinter E, Heath H, Kooren J, Palstra RJ, Klous P Grosveld F, Galjart N, de Laat W. 2006. CTCF mediates long-range chromatin looping and local histone modification in the beta-globin locus. Genes Dev 20(17): 2349-54.

Song L, Zhang Z, Gradfeder LL, Boyle AP, Giresi PG, Lee BK, Sheffield NC, Graf S, Huss M, Keefe D et al. 2011. Open chromatin defined by DNaseI and FAIRE identifies regulatory elements that shape cell-type identity. Genome Res 21(10): 1757-1767.

Spitz F, Eileen E. 2012. Transcription factors: from enhancer binding to transcriptional control. Nature Rev Genet 13: 613-626.

Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES. 2005. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. PNAS 102(43): 15545-15550.

Page 52: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

47

The Cancer Genome Atlas Research Network. 2012. Comprehensive molecular portraits of human breast tumors. Nature 490(7418): 61-70.

The Cancer Genome Atlas Research Network. 2014. Comprehensive molecular characterization of urothelial bladder carcinoma. Nature 507: 315-322.

Tyson JJ, Baumann WT, Chen C, Verdugo A, Tavassoly I, Wang Y, Weiner LM, Clarke R. 2011. Dynamic modeling of oestrogen signaliing and cell fate in breast cancer cells. Nat Rev Cancer 11: 523-532.

Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, van Baren MJ, Salzberg SL, Wold BJ, Pachter L. 2010. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isofom switching during cell differentiation. Nat Biotechnol 28(5): 511-515.

Trapnell C, Hendrickson DG, Sauvageau M, Goff L, Rinn JL, Pachter L. 2013. Differential analysis of gene regulation at transcript resolution with RNA-seq. Nat Biotechnol 31(1): 46-53.

Uhlmann F. 2004. The mechanism of sister chromatid cohesion. Exp Cell Res 296(1): 80-85.

Van Bortle K, Nichols MH, Li L, Ong CT, Takenaka N, Qin ZS, Corces VG. 2014. Insulator function and topological domain border strength scale with architectural protein occupancy. Genome Biol 15: R82.

Vogelmann J, Le Gall A, Dejardin S, Allemand F, Gamot A, Labesse G, Cuvier O, Negre N, Cohen-Gonsaud M, Margeat E et al. 2014. Chromatin insulator factors involved in long-range DNA interactions and their role in the folding of the Drosophila genome. PLoS Genet 10: e1004544.

Welboren WJ, van Driel MA, Janssen-Megens EM, van Heeringen SJ, Sweep FCGJ, Span PN, Stunnenberg HG. 2009. ChIP-Seq of ERα and RNA polymerase II defines genes differentially responding to ligands. EMBO J 28(10): 1418-1428.

Wendt KS, Yoshida K, Itoh T, Bando M, Koch B, Schirghuber E, Tsutsumi S, Nagae G, Ishihara K, Mishiro T. 2008. Cohesin mediates transcriptional insulation by CCCTC-binding factor. Nature 451(7180): 796-801.

Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nusbaum C, Myers RM, Brown M, Li W et al. 2008. Model-based analysis of ChIP-Seq (MACS). Genome Biol 9(9): R137. Doi: 10.1186/gb-2008-9-9-r137.

Zhang Y, Liang J, Li Y, Xuan C, Wang F, Wang D, Shi L, Zhang D, Shang Y. 2010. CCCTC-binding factor acts upstream of FOXA1 and demarcates the genomic response to estrogen. J Biol Chem 285(37): 28604-28613.

Zuin J, Dixon JR, van der Reijden MI, Ye Z, Kolovos P, Brouwer RW, van de Corput MP, van de Werken HJ, Knoch TA, van IJcken WF et al. 2014. Cohesin and CTCF differentially affect chromatin architecture and gene expression in human cells. Proc Natl Acad Sci USA 111(3): 996-1001.

Page 53: The Chromatin-Looping Factor ZNF143 is Genetically Altered ... · Furthermore, we show that the overexpression of looping-factors within ESR-1 positive BrCa patients associates with

48