THE TYPE I ANTIFREEZE PROTEIN GENE FAMILY IN PLEURONECTIDAE

160
THE TYPE I ANTIFREEZE PROTEIN GENE FAMILY IN PLEURONECTIDAE by Kyra Keiko Nabeta A thesis submitted to the Department of Biochemistry in conformity with the requirements for the degree of Master of Science Queen’s University Kingston, Ontario, Canada (January, 2009) Copyright © Kyra Keiko Nabeta, 2009

Transcript of THE TYPE I ANTIFREEZE PROTEIN GENE FAMILY IN PLEURONECTIDAE

THE TYPE I ANTIFREEZE PROTEIN GENE FAMILY IN PLEURONECTIDAE

by

Kyra Keiko Nabeta

A thesis submitted to the Department of Biochemistry

in conformity with the requirements for

the degree of Master of Science

Queen’s University

Kingston, Ontario, Canada

(January, 2009)

Copyright © Kyra Keiko Nabeta, 2009

ii

Abstract

Antifreeze proteins (AFPs) protect marine teleosts from freezing in icy seawater by

binding to nascent ice crystals and preventing their growth. It has been suggested that the gene

dosage for AFPs in fish reflects the degree of exposure to harsh winter climates. The starry

flounder, Platichthys stellatus, has been chosen to examine this relationship because it inhabits a

range of the Pacific coast from California to the Arctic. This flatfish is presumed to produce type

I AFP, which is an alanine-rich, amphipathic alpha-helix.

Genomic DNA from four starry flounder was Southern blotted and probed with a cDNA

of a winter flounder liver AFP. The hybridization signal was consistent with a gene family of

approximately 40 copies. Blots of DNA from other starry flounder indicate that California fish

have far fewer gene copies whereas Alaska fish have far more. This analysis is complicated by

the fact that there are three different type I AFP isoforms. The first is expressed in the liver and

secreted into circulation, the second is a larger hyperactive dimer also thought to be expressed in

the liver, and the third is expressed in peripheral tissues. To evaluate the contribution of these

latter two isoforms to the overall gene signal on Southern blots, hybridization probes for the three

isoforms were isolated from starry flounder DNA by genomic cloning. Two clones revealed

linkage of genes for different isoforms, and this was confirmed by genomic Southern blotting,

where hybridization patterns indicated that the majority of genes were present in tandem repeats.

The sequence and diversity of all three isoforms was sampled in the starry flounder

genome by PCR. All coding sequences derived for the skin and liver isoforms were consistent

with the proposed structure-function relationships for this AFP, where the flat hydrophobic side

of the helix is conserved for ice binding. There was greater sequence diversity in the skin and

hyperactive isoforms than in the liver isoform, suggesting that the latter evolved recently from

one of the other two. The genomic PCR primers are currently being used to sample isoform

diversity in related right-eyed flounders to test this hypothesis.

iii

Acknowledgements

I'd like to thank my supervisor Dr. Peter Davies

for the opportunity to gain insight into the true nature of research.

Special thanks to Dr. Laurie Graham

for teaching me how to write in Science

and for her patience and support and guidance through all that sequence.

I would also like to thank Sherry Gauthier and the rest of the Davies lab

for their technical assistance and moral support.

Thanks to my family and friends

for their proofreading skills, encouragement and general positivity.

Here's to new future directions!

iv

Table of Contents

Abstract ........................................................................................................................................... ii

Acknowledgements ........................................................................................................................ iii

Table of Contents ............................................................................................................................ iv

List of Figures ................................................................................................................................. vi

List of Tables ................................................................................................................................. vii

List of Abbreviations .................................................................................................................... viii

Chapter 1 Introduction ..................................................................................................................... 1

1.1 Flounder type I AFPs ............................................................................................................. 3

1.1.1 Liver isoform ................................................................................................................... 3

1.1.2 Skin isoform .................................................................................................................... 7

1.1.3 Hyperactive isoform ...................................................................................................... 10

1.2 Type I AFP in the Pleuronectidae family ............................................................................. 12

1.2.1 Yellowtail flounder ....................................................................................................... 15

1.2.2 American plaice ............................................................................................................. 16

1.2.3 Other Pleuronectidae ..................................................................................................... 17

1.3 Selective pressure: Shaping evolution .................................................................................. 18

1.4 Goals and Objectives ............................................................................................................ 23

Chapter 2 Materials and Methods .................................................................................................. 24

2.1 Isolation of genomic DNA ................................................................................................... 24

2.2 Southern blotting .................................................................................................................. 25

2.3 Library preparation and amplification .................................................................................. 26

2.4 Library screening.................................................................................................................. 26

2.5 Phage isolation ..................................................................................................................... 27

2.6 DNA sequencing .................................................................................................................. 28

2.7 PCR analysis ........................................................................................................................ 29

2.8 Bioinformatic analysis .......................................................................................................... 32

Chapter 3 Results ........................................................................................................................... 34

3.1 Preface .................................................................................................................................. 34

3.2 Genomic DNA samples were of variable quality ................................................................. 34

3.3 Genomic Southern blot showed strong liver AFP gene signal in starry flounder ................ 35

3.4 Genomic Southern blot shows strong hyperactive AFP gene signal .................................... 38

v

3.5 Genes for starry flounder liver and skin AFPs are closely linked ........................................ 39

3.6 Starry flounder liver and skin AFPs are homologous to their winter flounder counterparts 43

3.6.1 Liver isoform ................................................................................................................. 43

3.6.2 Skin isoform .................................................................................................................. 48

3.6.3 Regulatory elements ...................................................................................................... 51

3.7 The starry flounder has multiple variants of the liver and skin AFPs .................................. 52

3.7.1 Liver variants ................................................................................................................. 54

3.7.2 Skin variants .................................................................................................................. 57

3.8 The American plaice has multiple variants of the skin AFPs .............................................. 60

3.9 The starry flounder has multiple variants of the hyperactive AFPs ..................................... 62

3.10 Genes for starry flounder hyperactive and skin AFPs are closely linked ........................... 65

3.11 Regulatory elements are conserved in stfs-AFP8 ............................................................... 69

3.12 Starry flounder hyperactive AFP is homologous to its winter flounder counterpart .......... 70

3.13 Starry flounder-specific probes do not alter Southern blot banding patterns ..................... 71

Chapter 4 Discussion ...................................................................................................................... 73

4.1 Conclusions .......................................................................................................................... 80

References ...................................................................................................................................... 82

Appendix A DNA alignment of winter and starry flounder liver AFPs ......................................... 91

Appendix B DNA alignment of winter and starry flounder skin AFPs.......................................... 96

Appendix C DNA alignment of the 3' regions from all three type I AFP isoforms of starry

flounder and winter flounder and the liver isoform of yellowtail flounder .................................. 100

Appendix D DNA alignment of the upstream region and exon 1 in the liver and hyperactive AFPs

of winter, starry and yellowtail flounders .................................................................................... 102

Appendix E DNA alignment of four unique starry flounder liver AFP gene sequences ............. 104

Appendix F DNA alignment of fourteen unique genes encoding starry flounder skin AFPs ...... 107

Appendix G DNA alignment the American plaice skin AFPs ..................................................... 114

Appendix H DNA alignment of three unique gene sequences encoding starry flounder

hyperactive AFPs ......................................................................................................................... 121

Appendix I DNA alignment of winter and starry flounder hyperactive AFPs ............................. 123

Appendix J DNA alignment of winter and starry flounder skin AFPs ......................................... 126

Appendix K Genomic Southern blot probed with hyperactive AFP cDNAs from the winter

flounder and starry flounder ......................................................................................................... 130

vi

List of Figures

Figure 1. Comparison of the three type I AFP isoforms. ................................................................ 4

Figure 2. Morphological phylogeny of right-eyed flounders ........................................................ 13

Figure 3. Genomic Southern blot of selected members of the Pleuronectidae family probed with

the winter flounder liver AFP gene sequence ........................................................................ 14

Figure 4. Locations from which starry flounder samples were obtained ...................................... 21

Figure 5. Genomic Southern blot of starry flounders from different latitudes probed with the

winter flounder liver AFP gene sequence .............................................................................. 22

Figure 6. Schematic diagram showing the relative positions of the PCR primers on the liver, skin

and hyperactive AFP isoforms ............................................................................................... 31

Figure 7. Quality assessment of genomic DNA from an Alaskan starry flounder ........................ 36

Figure 8. Genomic Southern blots of winter flounder and QCI starry flounder probed with

various AFP gene sequences .................................................................................................. 37

Figure 9. Schematic diagram of starry flounder genomic DNA insert #1 ..................................... 41

Figure 10. Schematic diagram of the AFP gene organization in genomic DNA insert #1............ 44

Figure 11. Protein alignment of liver AFP variants from starry flounder and winter flounder ..... 46

Figure 12. Helical wheel diagrams of liver AFPs from starry flounder and winter flounder........ 47

Figure 13. Protein alignment of skin AFP variants from starry flounder, winter flounder and

American plaice ...................................................................................................................... 50

Figure 14. Characterization by PCR of the AFP isoforms present in phage stocks isolated from

the primary library screen....................................................................................................... 56

Figure 15. Helical wheel diagrams of skin AFPs from starry flounder, winter flounder and

American plaice ...................................................................................................................... 61

Figure 16. Protein alignment of hyperactive AFP variants from starry flounder and winter

flounder .................................................................................................................................. 64

Figure 17. Schematic diagram of starry flounder genomic DNA insert #2 ................................... 67

Figure 18. Schematic diagram of the AFP gene organization in genomic DNA insert #2............ 68

vii

List of Tables

Table 1. Names and sequences of primers used in PCR experiments ........................................... 30

Table 2. Relative codon usage in starry flounder and winter flounder AFP genes ....................... 49

Table 3. Type I AFP variants isolated from the starry flounder genome ...................................... 55

viii

List of Abbreviations

A260 absorbance at 260 nm AFP antifreeze protein AFGP antifreeze glycoprotein AK Alaska aps American plaice skin bp base pair BC British Columbia C carboxyl CA California cDNA complementary deoxyribonucleic acid CsCl cesium chloride DMSO dimethyl sulfoxide dNTP deoxyribonucleotide triphosphate DPPIV dipeptidyl aminopeptidase IV HPLC high performance liquid chromatography kb kilobases KCl potassium chloride kDa kiloDalton MgCl2 magnesium chloride MHC major histocompatibility complex mRNA messenger RNA mya million years ago N amino (NH4)2SO4 ammonium sulfate ORF open reading frame PCR polymerase chain reactions pfu plaque forming units QCI Queen Charlotte Islands rRNA ribosomal RNA stfh starry flounder hyperactive stfl starry flounder liver stfs starry flounder skin TFIID transcription factor IID TH thermal hysteresis Tm melting temperature UTR untranslated region wfh winter flounder hyperactive wfl winter flounder liver wfs winter flounder skin

Note regarding temperature units:

Absolute temperature is measured in degrees Celsius (°C) while a change in temperature, such as

thermal hysteresis is measured in Celsius degrees (C°).

1

Chapter 1

Introduction

Over evolutionary time, fish have radiated to occupy almost every aquatic habitat there

is. Teleosts, or bony ray-finned fishes, have been particularly successful, comprising more than

95% of all extant fish species and almost half of all living vertebrates [1]. However, marine

teleosts face a major problem in icy water because the solute concentration of their body fluids is

lower than that of seawater [2]. This means that they have the potential to freeze at temperatures

approximately 1 C° above the freezing point of seawater (-1.9 °C). Fish are able to live in a

supercooled state for limited periods, but internal or external contact with ice crystals and other

nucleators during these times is lethal, as it seeds rapid ice growth into the body. Ice crystals are

prevalent in surface waters but can be driven to greater depths during storms, putting supercooled

fish at risk of death by freezing [3]. Despite this danger, fish are nevertheless found in shallow

waters at high latitudes.

Plasma freezing temperatures below the then-accepted value (-0.7 °C) for most teleosts

were observed as early as 1957 [2]. However, it was not until much later that agents responsible

for this freezing point depression were isolated [4] and termed antifreeze (glyco)proteins, or

AF(G)Ps. Several classes of AFPs have since been described [5], but they are all thought to work

by the same adsorption-inhibition mechanism [6]. In this model, AF(G)Ps irreversibly bind

nascent ice crystals and effectively stop crystal growth at concentrations much lower than those

that could affect the bulk properties of the solution. Because the levels required to exert an effect

are so small, AFPs and AFGPs are considered to be non-colligative in their mode of action, even

though antifreeze activity is concentration-dependent. Freezing point depression occurs via the

2

Kelvin effect [7]. When AF(G)Ps adsorb onto ice crystals, they restrict the addition of water

molecules to gaps between the bound proteins, resulting in local surface curvatures. Because it is

energetically more difficult for water to join a curved ice surface, the freezing point decreases

without significantly affecting the melting point. Once the water cools to a temperature that can

overcome the local energy barrier, the AF(G)Ps are overgrown and uncontrolled ice crystal

growth occurs. The difference between the melting and freezing temperatures is termed the

thermal hysteresis (TH) gap and its value, in C°, is used as a measure of antifreeze activity.

After the initial characterization of AFGPs in Antarctic species, other polar fishes were

sampled to determine the distribution of these proteins, and four more unrelated types of AFPs

were discovered: type I in the winter flounder (Pseudopleuronectes americanus) [8], type II in

the sea raven (Hemitripterus americanus) [9], type III in the ocean pout (Macrozoarces

americanus) [10] and type IV in the longhorn sculpin (Myoxocephalus octodecimspinosis) [11].

Briefly, the AFGPs consist of tandem repeats of an Ala-Ala-Thr tripeptide repeat with a

disaccharide moiety attached to each threonyl hydroxyl group. Of the AFPs, type I denotes an

Ala-rich amphipathic alpha-helix. Type II AFPs are globular proteins with mixed secondary

structure. Type III AFPs are also globular but contain short beta-strands, resulting in a flat face

along one side of the protein. Type IV AFPs are the least well-characterized and are predicted to

have a helix-bundle structure. The type I AFPs, specifically those of the right-eyed flounders, are

central to this thesis and are described in detail below.

3

1.1 Flounder type I AFPs There are three different sub-classes of the type I AFPs, all of which have been

characterized in a single species of right-eyed flounder: the skin, liver and hyperactive isoforms

(Figure 1). Within each isoform type, there are several variants with the same overall structure

and regulation, but with slight differences in sequence. Although the three isoforms differ in

some respects, such as their size, tissue distribution and mode of induction, they also share many

features. For example, all are alpha-helical near 0 °C, but with a slight variation in the helix, such

that each amino acid corresponds to a 98.2° turn rather than the typical 100° rotation. All

isoforms are also Ala-rich and effective at stopping crystal growth in a concentration-dependent

manner. Each isoform will be reviewed in the order of its discovery and features will be

summarized as they pertain to the winter flounder, the species in which this type of AFP is best

characterized.

1.1.1 Liver isoform Initial observations of freezing point depression in the serum or plasma of various fishes

led to the discovery and characterization of type I AFPs in winter flounder blood [8, 12, 13]. This

isoform is produced in the liver [14, 15] as a preproprotein [13]. The pre-sequence is a 23-amino

acid signal peptide that is cleaved after directing co-translational secretion of the protein into the

bloodstream [16]. The pro-sequence is an activation peptide of variable length in which every

other residue is Ala or Pro [17], and it is cleaved stepwise within 24 h of entering circulation [18].

It has been suggested that dipeptidyl aminopeptidase IV (DPPIV) is the enzyme responsible for

activation [19], but this has never been confirmed. The mature protein is 37 amino acids long,

4

Figure 1. Comparison of the three type I AFP isoforms.

Three-dimensional structures of the liver, skin and hyperactive type I AFPs (wfl-AFP6, wfs-F2

and wfh-AFP1, respectively) were constructed by F.-H. Lin in space-filling mode using PyMOL.

Only the structure for wfl-AFP6 has been solved to date, and wfh-AFP1 is modelled here in its

soluble homodimeric form. Size ranges for the different variants within each isoform are noted

next to each image, and the size difference between the hyperactive isoform and the other two is

clearly evident.

4

Skin 3-5 kDa

Liver 3-5 kDa

Hyperactive 32 kDa

Skin 3-5 kDa

Skin 3-5 kDa

Liver 3-5 kDa

Liver 3-5 kDa

Hyperactive 32 kDa

Hyperactive 32 kDa

5

after post-translational cleavage of the C-terminal Gly and the retention of its amino group as an

amide on the penultimate Arg residue [13]. It was predicted by circular dichroism and viscosity

studies to be an alpha-helical rod-shaped molecule [12], and these predictions were confirmed

when the structure was solved by X-ray crystallography [20]. Stability of this long helix is

supported by internal salt bridges [20, 21] and elaborate capping structures composed of

hydrogen bonding networks at both termini [22]. The mature peptide consists of approximately

60% Ala, with Thr and Asx being the next most common residues [23], and because codon usage

for Ala is heavily biased towards GCC, genes are GC-rich and highly repetitive [13]. Analysis of

the mature peptide identified an 11-amino acid repeat motif, Thr-X2-Asx-X7 where X is usually

Ala [24], and models regarding the relevance of these repeats to antifreeze function were put

forward. The first mechanistic model proposed that the Thr/Asx residues, which are regularly

spaced 4.5 Å apart, form hydrogen bonds with similarly-spaced oxygen atoms in the primary

prism plane of the ice lattice [6, 24]. However, this model was revised [25] after ice etching

studies established that AFPs bound a specific pyramidal plane of ice [26] and again after

mutagenesis studies determined that the key ice-binding residues were on the conserved

hydrophobic face [27]. Such adsorption also explained the characteristic hexagonal bipyramidal

ice crystals [28] that form in AFP solutions and grow rapidly, or burst, along the c-axis in activity

assays when the limits of the TH gap are breached [29]. As similar AFPs were discovered in

other flounders, alignments across species brought about modification of the accepted repeat

motif to TaaXAXXAAXX, where lowercase a is almost always Ala, X is usually Ala and

uppercase T/A are conserved Thr and Ala residues [30].

Genomic Southern blotting and restriction analysis established that this isoform is

encoded by a gene family consisting of 30 – 40 members, most of which are arranged in direct

tandem repeats [31, 32]. The AFPs expressed are quite similar, aside from minor amino acid

6

substitutions and slight variations in the number of 11-amino acid repeats. Using a nomenclature

system proposed by Low et al. [33], the two main variants of the liver isoform in the winter

flounder are wfl-AFP6 (winter flounder AFP produced in the liver, renamed from HPLC-6) and

wfl-AFP8 (renamed from HPLC-8), which respectively make up close to 60% and 40% of total

plasma AFP content in winter [13]. They both contain three 11-amino acid repeats and have a

molecular mass of 3.3 kDa. As well, both behave similarly on ion exchange and gel filtration

columns, but can be finely separated by reversed-phase high performance liquid chromatography

(HPLC) due to slight differences in amino acid composition and location [13]. Another well-

characterized variant of this isoform is wfl-AFP9 (renamed from AFP9), which contains an

additional full repeat motif and has a molecular mass of 4.3 kDa [17, 34, 35]. Functionally, it is

more active than the other two components because of its larger ice-binding face, but is not

considered to be a major variant due to its lower circulating concentration [35, 36]. There is also

DNA evidence of two five-repeat liver AFPs, but no corresponding protein has been isolated to

date [17, 21].

Putative cis-regulatory sequences have been identified for the three well-characterized

variants mentioned above via sequencing and bioinformatic analysis [31, 34]. There is a CAAT

box, a cis-acting promoter-proximal sequence common to many eukaryotic genes, beginning 84

nucleotides upstream of the conserved transcriptional start site (position -84). There is also a

TATA box, a core promoter sequence that usually lies between the CAAT box and the

transcriptional start site [37], at position -32 [31, 38]. In the 3'-flanking DNA, there are

polyadenylation sites, or sequences that indicate the end of a gene to transcriptional proteins,

located at positions +1429, +1769 and +2087, as numbered in [38]. The first is used most

commonly, but the latter two are also used in particular environmental conditions [38, 39]. Each

7

gene consists of two exons separated by an intron approximately 500 bp in length. Exon 1

encodes the majority of the signal peptide, while exon 2 encodes the last four residues of the

signal peptide and the rest of the proprotein [31]. Within the intron lies an enhancer, designated

Element B, which is bound by liver-enriched transcriptional activators, such as CCAAT/enhancer

binding protein α and a novel protein designated the antifreeze enhancer protein. Element B is

conserved in the genes for wfl-AFP6 and wfl-AFP8 but not in the wfl-AFP9 gene [40].

Laboratory studies on wild-caught fish found that the key environmental cues for AFP

gene expression are a combination of changes in photoperiod, temperature and hormone levels.

Decreasing day lengths in autumn spur AFP production in anticipation of winter ice, via the

reduction of growth hormone secreted from the pituitary [41-43]. Cold-specific stability of the

AFP mRNA allows circulating concentrations of the liver isoform to reach 10 – 15 mg/mL at the

height of winter [44, 45]. Come spring, higher water temperatures coupled with a rise in

circulating concentrations of growth hormone are responsible for the disappearance of AFPs from

the bloodstream via negative feedback at the transcriptional level [41, 42] and accelerated

degradation of the protein, resulting in a several hundred-fold difference from winter levels [46].

1.1.2 Skin isoform Alanine-rich peptides that caused non-colligative freezing point depression were isolated

from the skin of the shorthorn sculpin (Myoxocephalus scorpius) [47]. However, the significance

of this discovery was overlooked. Although later studies on the skin of the cunner

(Tautogolabrus adspersus) confirmed the presence of TH activity in this tissue as well as its

usefulness as a physical barrier to ice propagation [48, 49], it was a study in 1992 that revealed

8

the production of AFP mRNAs in many non-liver tissues in the winter flounder [50]. Sequencing

of cDNAs made from pooled extracts of skin, dorsal fin and scales revealed nine Ala-rich

peptides that were more similar to each other than to the already-characterized liver AFPs [51],

and these were classified as skin-type AFPs.

Around the same time, two clones with high sequence identity and similar gene

organization to the known liver AFP genes were isolated from a winter flounder genomic library

[31]. However, due to the presence of in-frame stop codons in the putative pro-sequence and the

lack of a classical TATA box in the presumed promoter region, these sequences, wfs-F2 and wfs-

11-3 (winter flounder skin AFP, renamed from F2 and 11-3, respectively), were classified as

pseudogenes [52]. Later alignments with the skin isoform cDNAs confirmed that they were

genomic sequences of skin AFPs and further analysis uncovered a putative TFIID binding motif

upstream of the transcriptional start site [51].

Many parallels can be drawn between the liver- and skin-type AFPs, although the latter

are less well-characterized. Like the liver AFPs, the skin-type AFPs consist of approximately

60% Ala, which are preferentially coded by GCC. They are thought to be encoded by a gene

family of 30 – 40 members [51], though this remains to be confirmed, as the skin probe used in

these experiments contained a 91 bp portion of the 3' untranslated region (UTR) that is conserved

among all isoforms (36.8% of the total probe length). For comparison, the liver probe used

contained a 24 bp portion of the 3' UTR, which was 7.5% of the total length [16, 51]. This,

coupled with the low stringency of washing [50], likely resulted in some degree of cross-

hybridization of the skin probe to the liver genes, but not vice versa. In any case, the 11-amino

acid repeat motif is conserved among the skin variants, but they contain only two full repeats, in

9

contrast to the 3 – 4 full repeats in the liver isoforms. The nature of crystal shaping induced by

the winter flounder skin AFPs and their ice hemisphere etch patterns have not been reported. The

skin AFPs have not been recombinantly expressed to date and no crystal structure is available, but

circular dichroism and modeling suggest that they are also entirely alpha-helical [53]. Many of

the residues involved in ice-binding in the liver isoform are also conserved, indicating a similar

mechanism of action and possibly a similar binding plane [27, 51].

Despite the many similarities between the skin and liver isoforms, significant differences

exist as well. Although the skin AFP genes also consist of two exons separated by an intron, the

first exon contains only untranslated sequence [51]. These proteins are produced with no signal

peptide or pro-sequence, and relative to the liver isoform, they possess both a unique N-terminal

sequence and a variable C-terminus [51]. Mass spectrometry indicates that the N-terminal

sequence, which is common among the flounder skin AFP variants, is acetylated [51], but the role

of this moiety is unknown, as it appears to have no effect on antifreeze activity [54]. Studies

investigating the significance of the different N and C termini between the two isoforms revealed

that those of the skin isoforms negatively affect the thermal stability and helical content of the

proteins, and contribute to the 50% decrease in relative antifreeze activity [51, 55].

As with the liver isoforms, regulation of skin AFP expression has not been completely

characterized. The lack of a signal peptide indicates an intracellular role, but identification of

AFP in the interstitial space of gill tissue [56] suggests that secretion may occur by a non-

classical pathway [57, 58]. A region similar to Element B, the liver AFP enhancer, has been

found in the intron of the skin variants; however, it contains a dinucleotide insertion that allows

transactivation activity but abolishes liver-specificity [59]. This enhancer has been named

10

Element S and likely contributes to the broad expression of the skin AFPs, which is especially

high in the skin, scales, fins and gills [50]. Transcripts in the gills have been localized to the

pavement cells and the surrounding interstitial space, and appear to be temporally associated with

the thickening of the epidermis during metamorphosis of juvenile flounder [56, 60]. This occurs

in early summer in the northern range of the winter flounder [61]. Unlike the liver isoform, the

skin isoforms undergo only a 5- to 10-fold difference between winter and summer expression

levels, and this does not appear to be affected by growth hormone [46].

1.1.3 Hyperactive isoform Given the gene dosage of the liver isoform in the winter flounder genome, mid-winter

circulating AFP concentrations can reach 10 – 15 mg/mL [44]. This translates to 0.7 C° of TH

activity. Other blood solutes colligatively add another 0.8 C° for a total freezing point depression

of 1.5 C° [62]. Although icy seawater can reach -1.9 °C, winter flounder thrive. To resolve this

discrepancy, winter flounder plasma was re-examined, and more than 2 C° of TH activity was

detected [63]. The plasma also produced spindle-shaped crystals rather than the hexagonal

bipyramidal crystals typically associated with type I AFPs. Careful purification uncovered a

16.7-kDa protein that produced 1.1 C° of TH activity at a concentration of 0.1 mg/mL, a value far

beyond that of other known fish AFPs. Dubbed the hyperactive AFP, it is present at only 0.2

mg/mL in the blood and is irreversibly inactivated under the conditions traditionally used to

isolate type I AFPs, i.e., room temperature and low pH [63]. This is likely why it had escaped

detection for over 30 years [8].

This isoform is similar to the liver and skin isoforms only in its high Ala content and

alpha-helicity [64]. Alanine again makes up approximately 60% of the mature protein, which is

11

entirely alpha-helical, in spite of the much larger size and an axial ratio of 18:1 [64]; a to-scale

size comparison of the three isoforms is shown in Figure 1. The structure of this isoform has yet

to be solved, but suitable residues are available at both termini for cap structure formation [65].

In addition, it exists in solution as a homodimer, which may provide additional structural support

[64]. It has been suggested that the two straight helices form an anti-parallel dimer, such that

both polypeptide chains can present their ice-binding faces to the ice [65]. This seems plausible,

as it has been shown that AFPs with a larger ice-binding face are more active [35], but there are

likely other contributing factors, given the differences in crystal shaping and potency. Activity is

irreversibly lost at 18 – 20 °C, even though much of the secondary structure is retained,

suggesting that activity may depend on quaternary structure [64]. The ice-binding plane has not

been determined, but the spindle-shaped crystals burst along their a-axes unlike the hexagonal

bipyramidal crystals shaped by the liver AFPs, which burst along the c-axis, implying different or

multiple binding planes [64].

Previously, a DNA sequence containing a long Ala-rich open reading frame was isolated

and named 5a (referred to here as wfh-5a). The corresponding protein or mRNA were never

detected, and it was classified as a pseudogene [52]. Since the discovery and characterization of

the hyperactive isoform, comparisons with the predicted wfh-5a gene product showed that the N-

terminal sequence, amino acid composition and size of the two proteins are quite similar, but

different enough that the detected hyperactive isoform was termed “5a-like” [64]. It appears that

the wfh-5a gene is a variant of those that express the hyperactive AFP but may be silent.

Analysis of the hyperactive AFP cDNA revealed that it is produced with a signal

sequence but no propeptide. Again, codon usage for Ala is heavily biased towards GCC, as in the

12

other type I isoforms [65]. DNA alignments of the hyperactive isoform with the liver and short

isoforms show > 90% sequence identity throughout the 5' UTR, signal peptide and 3' UTR, but

alignment over the mature peptide is problematic due to the repetitiveness of the sequence and

their vastly different lengths [65]. There appears to be no post-translational modification to the

protein after secretion, and the 11-amino acid repeat motif is less well-defined than in the liver

and skin isoforms. Rather than the continuous hydrophobic ice-binding surface conserved in the

smaller isoforms, four disparate regions that resemble the smaller isoforms with respect to their

ice-binding sites were identified upon close scrutiny of the hyperactive sequence [65]. The

physiological source, tissue distribution, and regulation of this isoform are still under

investigation.

1.2 Type I AFP in the Pleuronectidae family The winter flounder belongs to the Pleuronectidae family of right-eyed flatfishes, or

pleuronectid fish. The evolutionary history of these fish is unclear. In an attempt to clarify the

interrelationships within the family, Sakamoto examined 77 species using 78 internal and external

morphological traits, and a phylogeny was established [66]. Eighteen species were selected from

this tree and assessed for the presence of AFP genes on a genomic Southern blot (Figure 2).

Arranged in order of relatedness, the genomes were probed with a representative AFP cDNA, the

main winter flounder liver variant, wfl-AFP6 (S. Gauthier, personal communication). Strong

hybridization signals were observed in several fishes, centered on the winter flounder (Figure 3),

and some were investigated further, specifically those of the yellowtail flounder (Limanda

ferruginea) and the American plaice (Hippogloissoides platessoides).

13

Figure 2. Morphological phylogeny of right-eyed flounders

Interrelationships between 77 species of Pleuronectidae were determined by numerical phenetics

and the application of Gower's general similarity coefficient, based on 78 internal and external

morphological characters. For the sake of simplicity, only 54 species are shown here.

Letter/number combinations denote various species, as listed below. Adapted from [66] with

assistance from J. O'Donnell.

A1 - Arrowtooth flounder C1 - Butter sole E1 - Far Eastern smooth flounder A2 - Kamchatka flounder C2 - English sole E2 - Flounder A3 - Greenland halibut C3 - Rock sole E3 - Starry flounder A4 - Atlantic halibut C4 - Dusky sole E4 - Diamond turbot A5 - Pacific halibut C5 - Dab E5 - Curlfin sole A6 - Slender sole C6 - Yellowfin sole E6 - Ridge-eyed flounder A7 - Petrale sole C7 - Sakhalin sole E7 - Hornyhead turbot A8 - Shotted halibut C8 - Yellowtail flounder E8 - C-O sole A9 - Spotted halibut C9 - Sand flounder E9 - Spotted turbot B1 - Barfin flounder D1 - Longhead dab F1 - Ocellated turbot B2 - Pacific sand sole D2 - Littlemouth flounder F2 - Witch flounder B3 - American plaice D3 - Marbled flounder F3 - Blackfin flounder B4 - Flathead sole D4 - Cresthead flounder F4 - Rex sole B5 - Flathead flounder D5 - Winter flounder F5 - Stone flounder B6 - Bering flounder D6 - Alaska plaice F6 - Lemon sole B7 - Pointhead flounder D7 - European plaice F7 - Slime flounder B8 - Scale-head plaice D8 - Pleuronectes obscurus F8 - Dover sole B9 - Rizuken flounder D9 - Arctic flounder F9 - Willowy flounder

13

C5, C6, C7, C8, C9, D1, D2, D3, D4, D5, D6, D7, D8, D9, E1

A3

E2, E3

A1A2

A4A5

A6A7, A8A9B1B3, B4, B5, B6B7

B8B9

B2C1, C2, C3, C4

E4E5E6, E7, E8, E9, F1F2, F3F4F5F6, F7, F8

F9

C5, C6, C7, C8, C9, D1, D2, D3, D4, D5, D6, D7, D8, D9, E1

A3

E2, E3

A1A2

A4A5

A6A7, A8A9B1B3, B4, B5, B6B7

B8B9

B2C1, C2, C3, C4

E4E5E6, E7, E8, E9, F1F2, F3F4F5F6, F7, F8

F9

14

Figure 3. Genomic Southern blot of selected members of the Pleuronectidae family probed with the winter flounder liver AFP gene sequence

DNA from a variety of species was digested with SacI and probed with a winter flounder cDNA

of the liver isoform (pKEN C17) under highly stringent conditions: 0.1x SSC, 1% SDS, 60 °C,

30 minutes (S. Gauthier, personal communication). Size markers (kb) on the left are from a

HindIII digest of lambda DNA. The letter/number combination above each lane denotes the

species from which the DNA was isolated (see below), and samples are arranged on the blot

according to Sakamoto's morphological phylogeny, shown in Figure 2 [66].

A3 - Greenland halibut A4 - Atlantic halibut

C2 - English sole C3 - Rock sole

B3 - American plaice C5 - Dab

C6 - Yellowfin sole C8 - Yellowtail flounder

D5 - Winter flounder D6 - Alaska plaice

D7 - European plaice E2 - Flounder

E3 - Starry flounder E5 - Curlfin sole

E7 - Hornyhead turbot E8 - C-O sole

F2 - Witch flounder F6 - Lemon sole

14

23.1

2.0

6.69.4

4.4

2.3

0.5

A4 C2 C3 B3 C5 C6 C8 D5 D6 D7 E2 E3 E6 E7 E8 F2 F6A3 A4 C2 C3 B3 C5 C6 C8 D5 D6 D7 E2 E3 E6 E7 E8 F2 F6A3

15

1.2.1 Yellowtail flounder Freezing point depression was observed early on in the serum of the yellowtail flounder

[67], a close relative of the winter flounder [66, 68, 69]. The protein responsible was found to be

seasonally-produced, Ala-rich and homologous to the winter flounder liver AFPs [21, 62]. It was

also predicted to be an amphipathic alpha helix, but with four 11-amino acid repeats rather than

three [21]. Protein sequence alignments indicate that the ice-binding residues are conserved

between the two species. While a larger ice-binding surface has been correlated with increased

efficacy [35], TH activity of the purified yellowtail AFP is 20% less than that of the winter

flounder liver AFPs on a mass basis, though they are similar on a molar basis [21]. The

yellowtail also has a lower circulating AFP concentration (4 mg/mL versus 10 mg/mL) and a

lower AFP gene dosage (~ 10 copies versus 30) compared to the winter flounder [62].

Southern blotting and restriction analysis of genomic clones indicate that the yellowtail

AFP genes are not tandemly arrayed, suggesting that the amplification of the winter flounder

genes occurred after the two species diverged [62]. Both fish live in the same geographical range

along the Atlantic coast of North America, but the winter flounder winters inshore at depths

shallower than 25 m [61] while the yellowtail lives farther offshore in winter, at depths of 60 –

100 m [70]. To explain the reduced antifreeze activity, it was suggested that deeper waters are

warmer and more resistant to the fluctuations in air temperature that cause ice crystal formation at

the surface, and that the differences in activity and gene dosage are due to decreased selective

pressure for AFP [62]. The yellowtail flounder has not been assessed for the presence of skin

AFPs to date. However, after the discovery of the hyperactive AFP in winter flounder,

meticulous re-evaluation of yellowtail serum found evidence of a 16.2 kDa Ala-rich protein that

16

had an N-terminal sequence similar to the hyperactive winter flounder AFP, produced spindle-

shaped ice crystals, and was by itself fully capable of protecting the fish from freezing [71].

Thus, this fish also produces both the small and large plasma AFPs.

1.2.2 American plaice The American plaice lives in the north Atlantic at average depths of 50 – 150 m [72],

similar to the depth range of the yellowtail flounder. It was considered to be relatively distinct

from the yellowtail and winter flounders morphologically [66, 69], but a more recent molecular

phylogeny using ribosomal mtDNA markers suggested that the three species were closely related

[68]. Various assessments of gene dosage using a cDNA probe for the winter flounder liver AFP

have produced disparate results. Genomic Southern blots have shown a similar degree of

hybridization in American plaice and yellowtail flounder DNA [62], while another showed

relatively few AFP signals in the American plaice genome (Figure 3). The Southern blot in

Figure 3 is considered to be more reliable because several exposures were done during a series of

increasingly stringent washes (P. Davies, personal communication). One interpretation of this

latter blot is that the strong signals in species closely related to the winter flounder (lanes C8 –

E3) correspond to the liver isoforms, which underwent amplification in their common ancestor.

Supporting this idea, small AFPs have never been isolated from American plaice plasma.

However, as with gene dosage assessment, results from antifreeze activity assays of American

plaice plasma have varied. In one study, mid-winter analysis for TH showed approximately

1.1 C° of activity, which is on par with that of the yellowtail flounder [73]. Another TH assay

found 2 C° of freezing point depression at 0.4 mg/mL (P. Davies, personal communication), but

the significance of this was not realized until the discovery of the hyperactive thermolabile AFP

17

in winter flounder [63]. Subsequent re-analysis of American plaice plasma under carefully

controlled conditions replicated a TH activity of 2 C° [71].

Full gene and protein sequences are not available for the American plaice hyperactive

AFP, but it was deemed to be homologous to the corresponding winter flounder AFP based on

similarities in N-terminal sequence, size, ice crystal shaping, activity, elution time, amino acid

content and secondary structure [71]. It is slightly more thermolabile, irreversibly losing activity

at 9 °C rather than 20 °C [71], and although its quaternary structure has not been determined, it is

also assumed to form a homodimer in solution (P. Davies, personal communication). The fact

that the American plaice produces the hyperactive AFP without a smaller liver isoform is

noteworthy, and has implications for determining the evolutionary path of the type I AFPs.

1.2.3 Other Pleuronectidae The AFPs of other right-eyed flounders have not been as fully characterized as for the

yellowtail and American plaice. A genomic Southern blot of the smooth flounder (Liopsetta

putnami) DNA produced a strong signal, indicating a large gene family [62], but the signal was

not characterized further. The Southern blot arranged by phylogenetic proximity also showed

evidence of multiple AFP genes in the genomes of the flounder (Platichthys flesus), European

plaice (Pleuronectes platessa), and starry flounder (Platichthys stellatus) (Figure 3). These genes

are thought to be homologous to those of the winter flounder due to the strong hybridization

signal observed with a winter flounder probe, but no other studies have been published on these

species. The same blot also showed relatively few hybridization signals for the Alaska plaice

(Pleuronectes quadrituberculatus), but only limited information is available on the expressed

AFPs; the sequence of one variant from this fish has been published without substantiation [74].

18

The available data, summarized above, indicate that the flounder AFPs fit within the

accepted definition of a type I AFP and that differential amplification of AFP genes has occurred

among pleuronectids. However, a comprehensive evaluation is not possible at this time because

adequate sequence and structural data are not available for all fish. Similarly, the phylogenetic

distribution of these proteins among Pleuronectidae does not indicate a clear ancestor gene or

species and thus, the evolution of these proteins also requires further study.

1.3 Selective pressure: Shaping evolution “Survival of the fittest” is one of the best-known tenets of Darwin’s theory of natural

selection, whereby fitness is determined by the environment. The ability to accommodate

environmental pressures, such as low nutrient availability, non-ideal temperature or toxicant

exposure, determines reproductive success, and successful individuals are more likely to pass

beneficial genes on to future generations. For animals, the simplest method of avoiding these

stressors is to migrate to more amenable surroundings, but if the stress occurs over a large

temporal or spatial scale, migration may not be an option.

Another key facet of Darwin’s theory is that there exists natural variation between

individuals that allows some to better cope with stress. For example, Mouches et al. showed that

mosquitoes chronically exposed to organophosphate insecticides produced more of a particular

esterase, and closer examination revealed an esterase gene dosage 250 times higher in resistant

strains than in sensitive strains [75]. The mosquitoes that can metabolize and thus detoxify

insecticides will survive longer and have more offspring. One way genetic amplification can

occur is via gene duplication. Replicates can arise during meiotic recombination due to

19

homologous but unequal crossing over, and repetitive regions provide substrate for further

duplications and deletions [76]. Such adaptation was also observed in mouse cell lines by Alt et

al., where stepwise methotrexate treatment incurs amplification of dihydrofolate reductase genes

[77]. Scott et al. postulated that an acute selective pressure, namely exposure to ice, induced the

tandem amplification of the liver AFP genes in the winter flounder genome [78]. This claim is

best evaluated in the context of flounder evolution.

The earliest fossil evidence of teleost fish is dated to the middle of the Triassic period 235

million years ago (mya) [79], though molecular data place teleost origin earlier, in the Paleozoic

era [80]. In either case, teleost morphology had attained approximately modern form by the end

of the Eocene epoch, 40 mya [81]. Otolith fossils place true Pleuronectiformes in early Eocene,

53 – 57 mya, while the oldest full skeleton is dated to 45 mya [82]. Recently, a fossil

representing an evolutionary intermediate between flatfish and their symmetrical ancestors has

also been dated to the early Eocene, 50 mya [83, 84]. Thus, the majority of flounder evolution

occurred in warm climates where polar temperatures were at least 6 °C warmer than present and

peaked at 24 °C in the Arctic Ocean at the Paleocene-Eocene thermal maximum, 55 mya [85, 86].

Sedimentary evidence indicates that the earth began transitioning to a bipolar icehouse, or an

environment in which ice sheets are present at both poles, in the middle Eocene, 45 mya [87],

thus presenting modern fishes with their first major cold challenge. Based on the observations

that not all close relatives of the winter flounder possess equivalent AFP gene dosage [62], and

that eukaryotes are under little pressure to purge non-functional DNA, the tandem array of AFP

genes in the winter flounder is thought to be a recent evolutionary novelty. Among the various

pleuronectid species, both AFP gene amplification and serum AFP activity appear to correlate

with habitat depth and the degree of exposure to ice, implying that the genes were amplified

20

according to need [62, 73]. Similarly, differential amplification of AFP genes has been observed

not only between species, but also in geographically distinct populations of the same species [88,

89]. As part of a search for other, better examples of intraspecific variation in gene dosage, and

in order to test the theory that AFP gene dosage amplified according to need, we have examined

the starry flounder.

The starry flounder was chosen because it is a close relative of the winter flounder [66,

68] and because it showed a strong signal on a preliminary Southern blot probed with flounder

liver AFP (Figure 3). It spawns in depths of less than 30 m in December and early January, and is

common at low salinities [90], thus exposing itself to both icy conditions and higher freezing

temperatures. The starry flounder is non-migratory and its habitat encompasses a continuous

stretch of coast on both sides of the north Pacific Ocean between 33° and 73° N latitude (Figure

4) [90, 91]. We hypothesize that if AFPs evolved in response to sudden chronic ice exposure,

more northerly flounder populations should have a higher AFP gene dosage, because they would

be exposed to glaciers descending from the Arctic more severely and over a longer time. When

the genomic DNA of three starry flounder from different latitudes (Alaska 61° N, British

Columbia 49° N and California 37° N) was probed with the cDNA from a winter flounder liver

AFP, a graded signal was observed in which intensity correlated positively with latitude (Figure

5). However, because these signals were not further characterized and the blot contained DNA

from only three fish, limited conclusions could be drawn at the population or species levels.

21

Figure 4. Locations from which starry flounder samples were obtained

The black line along the coastline denotes the geographical range of the starry flounder [90, 91]

with the sampling sites (below) indicated by a code within a circle.

(1) Bering Strait (AK, liver)

(2a) Port Moller, Aleutian Peninsula (AK, muscle)

(2b) Bering Sea, Aleutian Peninsula (AK, muscle)

(3a) Belkofski Bay, Deer Island (AK, muscle)

(3b) Sitkalidak Strait, Kodiak Island (AK, muscle)

(4) Hecate Strait, Queen Charlotte Islands (BC, liver)

(5) English Bay, Vancouver (BC, whole fish)

(6) San Francisco Bay, San Francisco (CA, liver)

21

Pacific Ocean

Bering

Sea

Arctic Ocean

Beaufort Sea

11

2b2b2a2a

3a3a

3b3b

44

55 66

22

Figure 5. Genomic Southern blot of starry flounders from different latitudes probed with the winter flounder liver AFP gene sequence

Genomic DNA was isolated from individual starry flounders from San Francisco, California

(CA), Vancouver, British Columbia (BC) and the Bering Strait, off the northwestern coast of

Alaska (AK), sites 1, 5 and 6, respectively, in Figure 4. DNA was digested with EcoRI (E) or

SacI (S) and probed with the wfl-AFP6 cDNA pKEN C17 [19]. Size markers are not available

for this blot; unpublished data courtesy of Dr. Gary K. Scott.

22

E S E S E S

CA BC AK

23

1.4 Goals and Objectives Type I AFPs are key proteins involved in protecting pleuronectid fish from ice crystal

growth into the body. Evidence suggests that they are produced by several closely-related species

of flatfish, but characterization of these proteins in species other than the winter flounder is

limited or lacking, especially at the genomic level. In addition, it is currently accepted that the

differential amplification of AFP genes is a function of selective pressure, but this has not been

demonstrated definitively within a single population of AFP-producing fish. With the aim of

characterizing the AFP genes in the starry flounder, the following strategies were applied:

1. To replicate and extend the hybridization signals observed in preliminary Southern blots

and to correlate signal intensity with latitude, high molecular weight DNA was isolated

from starry flounder tissues sampled at various locations and latitudes (Figure 4) and

Southern blotted.

2. To characterize the starry flounder AFP gene family and its organization, the

aforementioned Southern blots and a genomic lambda library of starry flounder DNA was

probed with the cDNAs of the different type I AFP isoforms. The banding patterns

generated on the Southern blots and the sequencing and restriction analysis of positive

phage were used to infer the organization of type I AFP gene family.

3. To investigate evolutionary relationships among type I AFP isoforms, PCR was used to

sample the sequence variation of each AFP isoform in the starry flounder genome, and

the starry flounder AFP genes will be compared to those of other pleuronectids.

24

Chapter 2

Materials and Methods

2.1 Isolation of genomic DNA Starry flounder tissue was collected by others at various sites on the north and east

Pacific coasts (Figure 4). Tissue was stored at -80 °C and cooled in liquid nitrogen prior to DNA

extraction. DNA was prepared from 1.5 – 3 g tissue according to the method of Blin and Stafford

[92], as adapted by Scott et al. [32] with other minor modifications. Briefly, tissue and frozen

DNA extraction buffer containing 100 µg/mL proteinase K were ground together to a fine powder

under liquid nitrogen with a pestle and mortar then incubated at 63 °C overnight with additional

proteinase K. This crude digest was deproteinized by stepwise extractions with equal volumes of

buffered phenol, 1:1 v/v phenol:chloroform, and chloroform, prior to at least 12 h of dialysis

against TE. Following a 3 h incubation at 37 °C for 3 h with DNAse-free RNAse A prepared

according to Sambrook [93], a second proteinase K treatment was performed (100 µg/mL with

0.5% SDS) at 50 °C overnight. Phenol-chloroform extractions and dialysis were repeated as

described above. DNA quality was assessed by agarose gel electrophoresis and quantity was

measured by either A260 or densitometric comparison with a sample of known concentration

following agarose gel electrophoresis on a Bio Rad Gel Doc 2000 imaging system coupled to

Quantity One software (version 4.1.0). Preparations that were too dilute were concentrated by

sec-butanol extraction.

25

2.2 Southern blotting Genomic DNA (30 µg) was digested to completion with each of the following enzymes:

EcoRI, SacI or BamHI (New England Biolabs). Digestions contained 10 times the amount of

enzyme required to cut 1 µg DNA in 1 h and the appropriate buffer, both as specified by the

manufacturer, and 100 µg/mL bovine serum albumin. Reactions were incubated at 37 °C without

shaking for 48 h. Small aliquots were taken from each reaction at 20 h and 32 h, prior to boosting

the digestions with the same amount of enzyme. Electrophoresis of these aliquots against the

starting and final products was used to assess the time course of the digestion. Digested DNA

was recovered into a smaller volume by ethanol precipitation and resuspension of the pellet in 60

µL 1x TE. DNA samples (10 µg/lane) were electrophoresed on a 0.8% agarose gel at 20 V or

less and Southern blotted onto nylon (Zeta-probe GT, Bio-Rad Laboratories) according to the

method of Sambrook [93]. Blots were pre-hybridized and washed as described by Davies et al.

[31]. Hybridization probes were labelled with α-32P dCTP (Perkin Elmer) using the Random

Primers DNA Labelling System (Invitrogen). The 287 bp cDNA for the liver isoform contained

the signal peptide, the pro-sequence and mature peptide in their entirety, along with small

portions of the 5' UTR (conserved among liver and hyperactive isoforms) and the 3' UTR

(conserved among all isoforms). The probe consisted of nucleotides 27 – 313 of component A, a

previously published winter flounder cDNA, as numbered by Pickett et al. [16]. The probe for

the winter flounder hyperactive isoform comprised nucleotides 103 – 705, as numbered according

to Graham et al. [65], and coded for a portion of the mature peptide only. The starry flounder-

specific probes were PCR products isolated from genomic DNA. The hyperactive probe was 261

bp long and encoded a portion of the mature peptide. The starry flounder skin probe was 275 bp

26

long and encoded the entire mature peptide as well as the last 24 bp of the intron. All final

washes were done in 0.1 x SSC with 0.5 – 1% SDS at 64 °C for 30 min. Blots were exposed to

Kodak XAR-5 film at -80 °C for periods ranging from 16 – 28 h.

2.3 Library preparation and amplification A genomic library in lambda phage was prepared from frozen starry flounder (Queen

Charlotte Islands, QCI) liver tissue by Bio S&T (Montreal, QC). Briefly, partially digested

Sau3AI fragments (20 kb average length) were ligated into the BamHI site of a Lambda DASH®

II vector (Stratagene). The DNA was packaged as a primary library containing 4.5 x 106 plaque

forming units (pfu). Twenty percent of the library was amplified according to the manufacturer’s

instructions over twenty plates and each stock was stored separately in SM buffer over 0.3%

chloroform at 4 °C. All amplifications and screens were done in E. coli XL1-Blue MRA (P2

lysogen) cells. Frozen permanent stocks were made in 7% DMSO and stored at -80 °C; the

amplified portions were frozen in two 2 mL aliquots per plate and the remainder of the primary

library was frozen in seven 50 µL aliquots.

2.4 Library screening Approximately 5 x 104 pfu were plated onto 150 mm NZY plates during the primary

screen; plaques picked from the primary screen were stored in 1 mL SM buffer with 4% v/v

chloroform without further analysis. All other screens were performed on 100 mm plates. Plaque

lifts were carried out in duplicate onto nylon Colony/Plaque Screen Hybridization Transfer

Membranes (Perkin Elmer) and fixed according to the manufacturer’s recommendations.

27

Membranes were hybridized to the same winter flounder probes as used on the Southern blots.

All washes were done at 60 – 63 °C with 1% SDS present in solution, and the final washes varied

as follows. Filters probed with DNA for the liver isoform were washed with 1x SSC for 30 min

(primary screen), 4x SSC for 15 min (secondary screen), 4x SSC for 30 min (tertiary screen), 1x

SSC for 20 min (quaternary screen). Filters probed with DNA for the hyperactive isoform were

washed with 0.2x SSC for 60 min (primary screen), 1x SSC for 45 min (secondary screen), 4x

SSC for 30 min (tertiary screen), and 1x SSC for 20 min (quaternary screen).

2.5 Phage isolation Plaque-purified phage were amplified by the plate lysate method [93], in which 107, 108

and 109 pfu were incubated with separate 500 mL E. coli cultures. Cultures that appeared to be

sufficiently lysed, as determined by their optical properties and the presence of visible cell debris,

were spun 6175 x g at 4 °C for 10 mins to remove debris and unlysed cells. Following addition

of PEG 8000 (50 g) to precipitate the phage, aggregates were pelleted by centrifugation at the

same relative centrifugal force and temperature for 30 min. The pellet was resuspended in 10 mL

of SM buffer, and the PEG was extracted using an equal volume of chloroform. Following

centrifugation at 3000 x g for 15 min at 4 °C, CsCl was added to the aqueous phase to a final

concentration of 0.3 g/mL.

Phage were isolated from a CsCl step gradient made up of four different densities (1.1,

1.4, 1.5, 1.7 g/mL) layered to form distinct strata. The phage were centrifuged at 76220 x g for 3

h at 16 °C, then extracted via needle puncture of the centrifuge tube just below the phage band,

located between the 1.5 and 1.4 g/mL layers. The band was slowly drawn into the syringe using a

28

wide-bore needle (18-gauge), so as to minimize shearing forces. A second purification was

performed in a CsCl equilibrium gradient (1.5 g/mL) in order to separate intact phage from empty

phage heads and contaminating nucleic acids. The gradient was established at 171500 x g for

18+ h at 16 °C and the phage band was harvested as described above. DNA was extracted by

treatment with formamide at 22 °C for 2 h and ethanol precipitation as described by Sambrook

[93]. Pellets were collected by centrifugation, resuspended in 20 µL TE and stored at 4 °C.

2.6 DNA sequencing Two phage DNAs isolated as described above were sequenced. The first phage DNA

(insert #1) was sequenced by shotgun cloning followed by the 454 method and the second (insert

#2) was sequenced by the 454 method (Genome Quebec) followed by PCR amplification and re-

sequencing of selected portions. For shotgun sequencing, high molecular weight genomic DNA

was randomly sheared and fragments approximately 2 kb in length were size-selected, blunt-

ended with Klenow, and ligated into SmaI-blunted pUC19 vector. Approximately 72

transformants were randomly selected for plasmid DNA isolation followed by sequencing in a

thermocycler with fluorescently-tagged dideoxynucleotides. Each insert was primed with the

M13 forward or reverse primers; 48 were sequenced in both directions while 24 were done using

only the forward primer. The resulting ~600 bp sequence reads were assembled using DNAMAN

from the Lynnon Corporation, version 4.15.

In the 454 method, genomic DNA was randomly sheared via hydrostatic pressure into

300 – 800 bp fragments. These were then blunt-ended and ligated to two different double-

stranded adaptor sequences for selection purposes. The double-stranded fragments were melted,

29

and single-stranded fragments containing both adaptors were selected and fixed onto proprietary

“DNA Capture Beads” in an optimized molar ratio, such that each bead bound no more than one

fragment. Each fragment was amplified on its respective bead via oil-emulsion PCR and

denatured, leaving single stranded templates attached to the beads. Sequencing reactions took

place in a PicoTitrePlate device™, where each bead was incubated with sulfurylase, luciferase

and all reagents required for polymerization, except the dNTPs. The plate was sequentially

bathed in buffer solutions containing one of dCTP, dGTP, dATP or dTTP, which were

incorporated when the appropriate template nucleotide was available. The addition of a single

dNTP generated a light signal that was detected by a charged coupled device camera in the

sequencer; the number of nucleotides added at a given time to the nascent chain was directly

proportional to the strength of the light signal. Reads were short, roughly 100 – 300 bases long,

but the 400000 parallel reactions sequenced per plate resulted in both high coverage and accuracy.

TA clones of PCR products were sequenced using M13R or T7 primers at the Robarts

Research Institute (London, ON) by cycle sequencing with fluorescently-tagged

dideoxynucleotides.

2.7 PCR analysis For the isolation of novel AFP variants from starry flounder genomic DNA and library

phage plaques, primers were designed based on sequence alignments between AFP isoforms and

were synthesized by Cortec DNA Service Laboratories, Inc. (Kingston, ON). Primer sequences

and designations are listed in Table 1, and their placement relative to one another within the

genes is shown in Figure 6. Standard PCR reaction mixtures contained 1.5 mM magnesium

30

Table 1. Names and sequences of primers used in PCR experiments

All sequences are written 5' à 3' and their relative positions are shown in Figure 6; numbers used

for clarity in Figure 6 are noted in brackets next to the primer name. The length of each primer is

listed (bp), as is its GC content.

30

NAME SEQUENCE LENGTH %GC

5'up GTCCAGAGAGGGGAAAGAATACA 23 48 3'up ACGCCTCGACTGAATCCTTTTGT 23 48 5'int GGAAGGAAGGATATCTGCATTAT 23 39 3'int TAATAATACCATTAATTTCTGCAG 24 23 stfsk AGACACTACTGCGGGAAACATAC 23 48 allsk GGCCTAAACCTGAAAAAATCTGAGC 25 44 3'univ ACATGATCCCACATCAAGACGAC 23 48 m5' (1) ATAATACCATTAATCTCTGCAGC 23 35 m5'mid (2) CGCATCCATAGCAACCATCAA 21 48 m5'midstf (2a) AGCAACCATCAAAGCCAATGC 21 48 m5'stf (2b) CAGCAATAGCAGCCGAGGAA 20 55 m3'stf (3b) TTTGTCAAAGATGGCCGCCT 20 50 m3'midstf (3a2) TTGTTTTGGCTGCGGCTGCG 20 60 m3'midstf (3a1) CGATGGTTGTTTTGGCTGCG 20 55 m3'mid (3) TTGTCGATGGTTCTTTTGGCT 21 43 m3' (4) ACGACCACGATCCTTATGGG 20 55 Contig1#1 AAACAGGGTAGAGAACAAGAAC 22 41 Right arm ATACGACTCACTATAGGGCGAAG 23 48 maxisfstop TTAAGGATCGTGGTCGTCTTG 21 48 con2end2#1 GAGTCTCTTCATGTGATACTCT 22 41

31

Figure 6. Schematic diagram showing the relative positions of the PCR primers on the liver, skin and hyperactive AFP isoforms

Boxed portions denote exonic sequences, shaded areas represent translated sequence and primers

are indicated by arrowheads. (A) Starry flounder liver isoform 1 (stfl-AFP1). (B) Starry flounder

skin isoform 1 (stfs-AFP1). The allsk primer was designed to a region conserved among all

starry and winter flounder skin isoforms, whereas the stfsk primer was designed to a region

unique to stfs-AFP1. (C) Winter flounder hyperactive AFP gene (wfh-AFP1). The size of the

intron is unknown and is indicated by the broken line. Primers 2 and 3 were designed in GC-poor

regions of the winter flounder sequences whereas primers 1 and 4 were similar to the 3'int and

3'univ primers, respectively, but shifted a few nucleotides into sequence unique to the hyperactive

sequences. Primers 2a, 2b, 3a1, 3a2 and 3b were designed to exactly match the starry flounder

hyperactive AFP sequence. The following three primers were designed to well-conserved regions

found in all three isoforms and are marked on all three diagrams: 5'int near the 5' end of the

intron, 3'int at the 3' end of the intron and 3'univ in the proximal 3' UTR. The primer sequences

are listed in Table 1.

31

300

5’in

tst

fsk

alls

k3’

int

3’u

niv

300

300

5’in

t3’

int

3’u

niv

(A)

(B)

(C)

3’u

niv

12

43

2a2b

3a1

3a2

3b

3’in

t5’

int

32

chloride (MgCl2), 1x Qiagen buffer, 1x Q solution, 0.2 mM dNTPs, and 0.025 U/µL Taq.

Standard thermocycling programs started with 3 min at 98 °C and a hot start at 80 °C. Reactions

were cycled 25 times through 1 min at 95 °C, 1 min at 55 °C, and 2 min at 72 °C, and finished

with 10 min at 72 °C. MgCl2 concentration, buffers and salts, annealing temperature and number

of cycles were optimized for each primer pair using 0.33 µg starry flounder genomic DNA as the

template. With these optimal conditions, PCR was performed on phage plaques picked from the

primary library screen using 4 µL of the buffer stock as a template. PCR products of interest

were cloned using the TOPO TA Cloning Kit (pCR® 2.1-TOPO® Vector, Invitrogen) and

sequenced. Upon sequence alignment, differences were confirmed as necessary by sequencing

the opposite strand.

For closure of the gaps remaining in phage insert #2 following assembly of the 454-

generated sequence reads, additional primers were designed near the ends of available sequence

(Table 1). PCR amplifications were performed with the same standard reaction conditions listed

above without optimization, and various products were cloned and sequenced.

2.8 Bioinformatic analysis Nucleotide BLAST searches were performed in all nucleotide sequence databases,

including GenBank, RefSeq Nucleotides, EMBL, DDBJ, and PDB sequences (excluding

HTGS0,1,2, EST, GSS, STS, PAT, WGS). Searches were performed using default parameters

and scoring matrix, except for an expect value threshold of 1 and unmasking regions of low

complexity. Open reading frames located by the NCBI ORF Finder within the phage insert were

used to query all non-redundant GenBank CDS translations, as well as the PDB, SwissProt, PIR

33

and PRF databases via BLASTP under default parameters. Signal peptide prediction was

performed using both neural networks and hidden Markov models on the SignalP 3.0 server [94].

Promoter analysis was performed using the Neural Network Promoter Prediction tool, version

2.2, courtesy of the Berkeley Drosophila Genome Project [95].

34

Chapter 3

Results

3.1 Preface Genomic DNA was prepared from starry flounder tissue samples by Kyra Nabeta, Gary

Scott, Pliny Hayes and Laurie Graham. Kyra Nabeta prepared the Southern blot and probed it

with the winter flounder liver and hyperactive isoforms, and Sherry Gauthier applied the starry

flounder hyperactive and skin isoform probes. The phage library was prepared by Bio S&T, and

inserts were sequenced by Genome Quebec, both in Montreal, QC. The first phage insert was

purified and its sequence was assembled by Kyra Nabeta; the second insert was purified by

Sherry Gauthier, who resolved the gaps in the sequence with Laurie Graham. All sequence

analysis, as well as all PCR and TA cloning experiments were performed by Kyra Nabeta, with

the exception of those involving the American plaice, which were done by Sherry Gauthier. The

manuscript was written by Kyra Nabeta with assistance from Peter Davies and Laurie Graham.

3.2 Genomic DNA samples were of variable quality To determine whether gene dosage of the starry flounder type I AFPs is correlated with

increasing latitude, we first attempted to supplement the results from a preliminary three-location

Southern blot that showed evidence of a positive association (Figure 5). Genomic DNA was

prepared from a variety of tissues collected from the south Bering Sea (Port Moller, location 2A),

the Pacific coast of the eastern Aleutian Islands (Sitkalidak Strait, location 3B), the Queen

35

Charlotte Islands (Hecate Strait, location 4), and Vancouver (English Bay, location 5), as

indicated Figure 4. High molecular weight DNA of good quality was extracted from the Queen

Charlotte Islands (QCI) and English Bay samples. However, the DNA from tissues collected

from Port Moller and the Sitkalidak Strait was either of poor quality, as indicated by the presence

of smearing (Figure 7), or of insufficient quantity. Consequently, it was not possible to

significantly extend the biogeographical study correlating latitude with gene dosage at this time.

3.3 Genomic Southern blot showed strong liver AFP gene signal in starry flounder The genomic DNAs extracted from four QCI starry flounders (location 3) were of good

quality, comparable to that of the control winter flounder DNA in Figure 7. They were digested

with EcoRI and SacI and blotted alongside a control DNA (digested with BamHI and SacI) from

a single winter flounder. The blot was probed with the winter flounder liver AFP cDNA pKEN

C17 [19], which was selected as a representative of liver type I flounder AFP genes. The starry

flounder banding pattern differed from that of the winter flounder (Figure 8A). All four starry

flounder SacI digests revealed one strong band at 4.7 kb, whereas the winter flounder SacI digest

contained three intense bands in a narrow size range (2.7 – 3.2 kb), as previously observed [31,

32, 51, 62, 78]. Both species also showed a few faint bands of various sizes. EcoRI generated

hybridization bands in starry flounder DNA were significantly larger than the 7.8 – 8.0 kb BamHI

bands observed in the winter flounder [31, 32]. The total band intensity appeared similar between

the two species, suggesting that the starry flounder AFPs are also encoded by a similarly-sized

multigene family of 30 – 40 members.

36

Figure 7. Quality assessment of genomic DNA from an Alaskan starry flounder

The quality of genomic DNA extracted from starry flounder tissues was assessed by gel

electrophoresis. In this representative gel, high-quality winter flounder genomic DNA was

loaded in lanes 1 to 3 (0.08, 0.16 and 0.42 µg) for comparison. Lanes 4 to 7 contain 1, 2, 5 and

10 µL of a genomic DNA preparation from a starry flounder collected at location 3B, and was

typical of the Alaska samples. No molecular weight markers were loaded, as a visual comparison

with the winter flounder DNA was sufficient to establish sample quality.

36

1 2 3 4 5 6 7

37

Figure 8. Genomic Southern blots of winter flounder and QCI starry flounder probed with various AFP gene sequences

Genomic DNA (10 µg) from individual fish was digested with SacI or BamHI for winter

flounder, or with SacI or EcoRI for starry flounder. The four starry flounders (1, 4, 6, 9) were

obtained from location 4 (Figure 4). Digested DNA was electrophoresed on a 0.8% agarose gel,

blotted onto nylon and probed with the winter flounder liver isoform (A), the winter flounder

hyperactive isoform (B), or the starry flounder skin isoform (C). The positions of DNA size

markers (kb) are indicated on the left with corresponding markers on the right.

37 BamHI

SacI

Win

ter

flo

un

der

Sta

rry

flo

un

der

Sac

I

14

69

14

69

14

69

14

69

14

69

14

69

Eco

RI

14

69

14

69

14

69

14

69

14

69

14

69

Win

ter

flo

un

der

Sta

rry

flo

un

der

Sac

IEco

RI

14

69

14

69

14

69

14

69

14

69

14

69

Win

ter

flo

un

der

Sta

rry

flo

un

der

BamHI

Sac

IEco

RI

10 0.75233.5456810 0.75233.5456810 0.75233.54568

AB

C

SacI

BamHI

SacI

38

The starry flounder banding patterns for the EcoRI and SacI digests were qualitatively

and quantitatively similar to those previously observed in starry flounder specimens from the

Vancouver area of BC, Canada (G. Scott, unpublished data) (Figure 5). In this blot, the signal

observed in the BC sample was very different from that of the California sample, which was

missing the main 4.7 kb SacI band, and from that of the Alaska sample, where the SacI band was

2- to 3-fold stronger and located in a significantly larger fragment. In the Alaska sample the

hybridization signal in the EcoRI digest was also 2- to 3-fold stronger, and was located in a

completely different set of bands that were smaller than those in the BC sample.

3.4 Genomic Southern blot shows strong hyperactive AFP gene signal

Given the considerable sequence differences between the liver and hyperactive isoforms

in the coding region [65], the latter was not expected to cross-hybridize to the liver or skin genes,

despite their similar GC-richness. The blot from Figure 8A was stripped and re-probed with the

winter flounder hyperactive probe, which hybridized to different fragments in the winter flounder

lanes than observed with the liver isoform (Figure 8B vs. Figure 8A). The intense bands on this

blot were smaller in the SacI digest, 2.0 and 2.6 kb compared to 2.7 – 3.2 kb with the liver

isoform, while the principal signals in the BamHI digest were located in much larger DNA

fragments, >>10 kb in size compared to 7.8 kb. This is the first time this newly-discovered

isoform has been used to probe Southern blots.

The starry flounder banding pattern observed with the hyperactive probe was quite

similar among all four fish and to the pattern observed with the liver AFP probe (Figure 8B). The

main difference was in the relative intensity of the bands. As signal strength is a function of

many factors, patterns of relative intensity within lanes can be compared between blots, but no

39

conclusions can be drawn regarding the absolute intensity of individual bands without a single-

copy control. With the hyperactive probe, the SacI band centred at 3.5 kb became much more

intense relative to the 4.7 kb band. The hybridization pattern in the SacI and EcoRI digests

suggests the presence of several copies of the hyperactive AFP gene in the starry flounder

genome.

Upon comparison with the winter flounder, the most noticeable difference between

Figure 8A and Figure 8B was that the starry flounder bands merely displayed different relative

intensities whereas the winter flounder banding pattern changed completely. The absence of

residual signal from the liver probe in the winter flounder lanes (e.g., the intense band at 3.2 kb is

no longer present) implied that the blot was properly stripped; however the blot was not exposed

after stripping to confirm complete removal of the liver probe. The fact that both the liver and

hyperactive probes hybridized strongly to the same fragments of starry flounder DNA suggests

tight linkage of the two isoforms.

3.5 Genes for starry flounder liver and skin AFPs are closely linked Based on the genomic Southern blots, it was clear that the genomic DNA extracted from

QCI fish was of high quality and that it contained multiple AFP gene copies. However, the

identity of these genes was based only on hybridization signals, and confirmation at the DNA

sequence level was essential. Therefore, a genomic lambda library was constructed to

characterize the genes at the sequence level. It provided 131-fold coverage of the genome,

assuming a genome size of 0.7 pg, which was based on values obtained for other closely related

flounder species [96]. A primary screen using both the hyperactive and liver probes was

40

performed on two different amplified phage stocks. Given that 5 x 104 plaques were cultured per

plate, a single copy gene was expected to produce a positive signal once or twice per plate.

Twelve positive plaques in total were identified on the two plates probed with a cDNA of the

liver isoform and ten of these were selected for further screening. Twenty-one signals in total

were detected on the two plates probed with a cDNA of the hyperactive isoform and fifteen of

these were selected. Two plaques from each primary screen were re-screened three more times to

achieve plaque purity. In the end, one plaque identified with each of the liver and hyperactive

probes was purified to homogeneity by four rounds of screening in total.

The clone identified using the hyperactive cDNA probe was sequenced in its entirety by

Genome Quebec. Initially, randomly-sheared fragments were shotgun cloned, and seventy-two

clones were selected randomly and sequenced. Three gaps remained following an assembly using

DNAMAN. The clone was sequenced a second time by Genome Quebec as a test to optimize

their newly acquired 454 system, and a complete assembly was achieved, including an additional

3.2 kb of sequence that spanned the three gaps. The sequences of the regions analyzed by both

methods were identical.

Insert #1, was 18707 bp in length, and it contained two type I AFP genes oriented in the

same direction and separated by 1 kb of intergenic sequence (Figure 9). BLAST searches

revealed that one gene encoded a liver isoform and the other a skin isoform, based on their

similarity to known sequences from other right-eyed flounders. These genes were both located

within a 4 kb region at one end of the insert. The rest of the insert was used to query Genbank

nucleotide databases using BLASTN, and was found to lack other genes. However, there were

some small stretches that showed identity to non-coding regions of flounder type I AFPs and

41

Figure 9. Schematic diagram of starry flounder genomic DNA insert #1

The 18707 bp insert is denoted by the solid line and lambda arms by the dotted lines. The two

predicted exons of each of the two starry flounder AFP genes are indicated using thicker grey

boxes. Lines with diamonds indicate EcoRI cut sites, and the round dumbbell indicates the sole

SacI cut site. The three underlined sections indicate regions outside the AFP genes with

similarity to sequences from the Genbank database; (A) 275 bp segment, beginning at base 9211,

that is 92% identical to a portion of the 3' region of wfl-AFP6, (B) 148 bp segment, beginning at

base 10060, with 83% identity to a non-coding sequence section in the 3' flank of a pleurocidin-

like gene from winter flounder, (C) 196 bp segment, beginning at base 12104, with 78% identity

to a sequence 5' of the wfl-AFP6 gene.

41

1 kb

118

707

Liv

erS

kin

1441

9

1242

0

1293

1A

B

C

λ λλλRλ λλλL

42

pleurocidin, an antimicrobial peptide [97]. For example, a 275 bp region aligned with 92%

identity and an E-value of 3 x 10-105 to a sequence in the 3' flank of wfl-AFP6 (match A in Figure

9). An E-value, or expect value, describes the probability that a sequence of a given size will

match another in the database by random chance. Thus, a low value suggests that two sequences

are significantly similar. Another stretch of 196 bp aligned with 78% identity and an E-value of 2

x 10-31 to a sequence in the 5' flank of the same gene (match C in Figure 9). A 148 bp section

aligned with 83% identity (E-value of 2 x 10-32) to a non-coding sequence in the 3' flank of a

pleurocidin-like gene from winter flounder (match B in Figure 9). In the insert, all three of these

matches occur within 5 kb upstream of the AFP genes. The rest of the matches detected aligned

with highly repetitive dinucleotide repeats, such as (GT)n, and were not considered further; such

simple sequence repeats are common in eukaryotic genomes and are formed by slippage during

replication or recombination [37]. Using the NCBI ORFfinder1 to scan the 14 kb upstream of the

AFPs, 74 open reading frames (ORFs) of 102 – 384 nucleotides were identified. To demonstrate

the likelihood of this happening by chance, five 14 kb sequences were randomly generated in

DNAMAN and were found to contain an average of 71 ORFs with maximum sizes of 363 – 582

nucleotides. These potential ORFs were subjected to protein:protein BLAST searches to identify

potential proteins, but no significant matches were found.

Previous restriction analysis of lambda clones showed that many of the type I AFP genes

of the winter flounder are arranged in direct tandem repeats [32]. The starry flounder lambda

insert was searched for cut sites corresponding to the restriction enzymes used in the Southern

blots to determine whether the two genes are part of a tandem array. Two EcoRI restriction sites

were found within a 1 kb stretch upstream of the genes but no other sites were present in the rest

of the insert (Figure 9). There was only one SacI site, positioned within the intron of the skin

1 http://www.ncbi.nlm.nih.gov/projects/gorf/

43

isoform. Because the genes were located at the very end of the insert and no other EcoRI or SacI

sites were present, it was not possible to determine whether the fragments produced in this region

of the genome correspond to the band sizes observed on the current Southern blots (Figure 8).

3.6 Starry flounder liver and skin AFPs are homologous to their winter flounder counterparts One of the main objectives of this project was to sample the variety of AFP genes in the

starry flounder genome and to assess their relationship to those of the winter flounder. The two

AFP sequences obtained from insert #1 were aligned with winter flounder AFP sequences, and it

became immediately apparent that one was an orthologue to the liver isoform whereas the other

was a skin isoform orthologue. Alignment of the starry flounder AFP genes with the

corresponding winter flounder genes (i.e., stfl-AFP1 with wfl-AFP6, Appendix A, and stfs-AFP1

with wfs-11-3, Appendix B) showed that they were identical in terms of intron/exon structure, in

that both isoforms consisted of two exons separated by a single intron (Figure 10).

3.6.1 Liver isoform

In stfl-AFP1, the first exon was 91% identical to that of wfl-AFP6 and encoded a portion

of the signal peptide (Appendix A). The rest of the signal peptide, the pro-sequence and the

mature protein were encoded by the second exon, which was 92% identical to the corresponding

exon of wfl-AFP6 (81% including gaps). When each portion was taken individually, there was

93% identity in the signal peptide, 75% identity (52% including gaps) in the pro-sequence, and

80% in the mature sequence. The deduced proteins showed 87% identity in the signal peptides,

44

Figure 10. Schematic diagram of the AFP gene organization in genomic DNA insert #1

The portion of the insert from Figure 9 containing the two AFP genes is shown with intronic and

intergenic sequences represented by black lines. Exons are indicated by boxes in which the

translated portions are shaded. Promoters are marked by grey vertical arrows and a putative

enhancer sequence by an asterisk. The SacI cut site in the intron of the skin AFP is marked with

a dumbbell. Arrows indicate the position and directionality of the primers designed for PCR

amplification. Vertical lines denote single nucleotide mismatches and exact sequences are listed

in Table 1.

44

Pro

mo

ter

Un

-tra

nsla

ted

Tra

nsl

ated

En

han

cer

500

bp

1.2

kb1.

0 kb

0.5

kb

Ski

nL

iver

*

*

5’in

tst

fsk

alls

k3’

int

3’u

niv

5’in

t3’

int

3’u

niv

45

81% in the pro-sequence (53% including gaps) and 89% in the mature protein. The introns of these genes were 513 bp and 497 bp long for the starry and winter flounders, respectively, and

showed 68% identity (64% including gaps).

The predicted product of stfl-AFP1 was quite similar to the three-repeat liver isoforms

from winter flounder. The 23-residue secretory signal peptide predicted by SignalP 3.0 [94] was

equal in length to those in the winter flounder AFPs, but the starry flounder sequence contained

three substitutions: L11F, M18I and T21S (Figure 11). Assuming the N terminus of the mature

peptide is identical to that of the winter flounder, the starry flounder pro-sequence would be 30

residues long. The pro-sequence would contain some substitutions as well as insertions of 9 or 15

residues relative to wfl-AFP6 or wfl-AFP9, respectively. However, if this putative pro-sequence

is removed by DPPIV, as is thought to occur in the winter flounder liver AFPs, the twelfth and

thirteenth dipeptides would have Asp and Phe residues in the penultimate position rather than the

preferred Ala or Pro. Current data do not support Asp or Phe as cleavage substrates for DPPIV

[98, 99]. If the N terminus is identical to that in the winter flounder, the mature starry flounder

liver AFP would be 37 amino acids long, after cleavage of the final Gly residue during C-terminal

amidation, and would contain three full 11-amino acid repeats. Overall, stfl-AFP1 contained a

total of 23 nucleotide substitutions relative to wfl-AFP6, eleven of which were silent (Appendix

A). The amphiphilic character of the helix was determined by helical wheel diagrams, which are

oriented such that the reader is looking down the length of the helix. The frequency with which

amino acids appear at various positions around the helix is represented by the proportional

colouring in boxes that correspond to each of the 11 potential positions, indicated by the internal

star (Figure 12). The DNA coding for the hydrophobic ice-binding face was completely

conserved (Figure 12) and contained four silent substitutions. Away from the binding face, seven

46

Figure 11. Protein alignment of liver AFP variants from starry flounder and winter

flounder

An alignment of the three starry flounder liver (stfl) AFP variants, along with three representative

winter flounder liver (wfl) sequences, is shown with polymorphisms highlighted in grey. Stfl-

AFP1 was obtained from lambda insert #1, whereas stfl-AFP 2 and 3 were obtained by PCR and

are incomplete at the N terminus. The signal peptide, pro-sequence and a C-terminal Gly residue

(not shown) are removed during post-translational processing to generate the mature peptide. The

number of residues within each portion of the encoded sequence is noted at the end of each row,

and the total number of residues in the preproprotein indicated in brackets. Placement of the

intron is indicated by an arrow and the ice-binding residues, which are completely conserved, are

denoted by asterisks. Genbank accession numbers for the three winter flounder sequences are

CAA37754 (wfl-AFP9), AAA49469 (wfl-AFP6), AAA49468 (wfl-AFP8).

46

Signal peptide stfl-AFP1 MALSLFTVGQFIFLFWTIR ISEA 23 stfl-AFP2 INEA 4 stfl-AFP3 INEA 4 wfl-AFP9 MALSLFTVGQLIFLFWTMR ITEA 23 wfl-AFP6 MALSLFTVGQLIFLFWTMR ITEA 23 wfl-AFP8 MALSLFTVGQLIFLFWTMR ITEA 23 Prosequence stfl-AFP1 NPDPAAKAAAVADPAAAAVAPAADAFSAAA 30 (53) stfl-AFP2 NPDPAAKAAAVADPAAAAVAPAADAFSAAA 30 (34) stfl-AFP3 NPDPAAKAAAVADPAAAAVAPAADAFSAAA 30 (34) wfl-AFP9 NPDPAAKAV---------------PAAAAP 15 (38) wfl-AFP6 RPDPAAKAA----PAAAAA-----PAAAAP 21 (44) wfl-AFP8 RPDPAAKAA----PAAAAV-----PAAAAP 21 (44) Mature peptide * * * * * * * * * * * * * * stfl-AFP1 DTASDAAAAAAATAAAAKAAAE-----------KTARDAAAAAAAT----AR 37 (90) stfl-AFP2 DTASDAAAAAAATAAAAKAAAE-----------KTARDAAAAAAAT----AR 37 (71) stfl-AFP3 DTASDAAAAAAATAAAAKAVAE-----------KTARDAAAAAAAT----AR 37 (71) wfl-AFP9 DTASDAAAAAAATAATAAAAAAATAVTAAKAAALTAANAAAAAAATAAAAAR 52 (90) wfl-AFP6 DTASDAAAAAALTAANAKAAAE-----------LTAANAAAAAAAT----AR 37 (81) wfl-AFP8 DTASDAAAAAALTAANAAAAAK-----------LTADNAAAAAAAT----AR 37 (81)

47

Figure 12. Helical wheel diagrams of liver AFPs from starry flounder and winter flounder

The coloured areas indicate the frequency of naturally-occurring amino acid substitutions and the

star indicates the 11 potential positions for residues around the helix. The black arcs indicate the

residues that make up the ice-binding face. Residues represented include the entire mature

peptide from all variants of the starry flounder (A) and winter flounder (B) liver AFPs as

indicated in Figure 11, except for the terminal Gly, which is cleaved during post-translational

modification, and the terminal Arg, which is thought to curl away from the ice-binding surface

(L. Graham, personal communication).

47

A Starry flounder

B Winter flounder

ThrThr

AlaAla

ArgArg

GluGlu

LysLys

ValVal

AsnAsn

SerSer

LeuLeu

AspAsp

48

nucleotide substitutions were silent, and the remaining twelve resulted in five substitutions at the

amino acid level, indicating that more plasticity is tolerated here.

3.6.2 Skin isoform

The similarity between the skin AFP genes of the two species was even higher than

between the liver AFP genes. The first exon of stfs-AFP1, which contained only untranslated

sequence, showed 96% identity to wfs-11-3 and had no gaps, whereas the second exon, which

contained the coding sequence, showed 97% identity (94% including gaps) (Appendix B). Taken

alone, the portion of DNA encoding only the expressed peptide showed 97% identity (93%

including gaps) between the two fish. The deduced protein was 97% identical to wfs-11-3 (92%

including gaps). The stfs-AFP1 intron contained one significant insertion of 439 bp in length,

and was 1212 bp in total, compared to the 794 bp winter flounder intron. The introns were 94%

identical (59% including gaps). The starry flounder genes showed a similar codon bias to the

winter flounder AFP sequences, heavily favouring GCC (~70%) for Ala (Table 2).

The predicted starry flounder skin AFP was very similar to the winter flounder skin

AFPs, in that there was no signal peptide or pro-sequence. There was one conservative

substitution near the N terminus (K6R) and a two-residue deletion near the C terminus (Figure

13). There were also two silent single nucleotide substitutions (Appendix B). The peptide

contained two and a half 11-amino acid repeats, and as with the liver AFPs, the residues involved

in the putative ice-binding face were conserved, reiterating the significance of these residues.

49

Table 2. Relative codon usage in starry flounder and winter flounder AFP genes

49

% Usage

ISOFORM VARIANT SPECIES GCC GCT GCA GCG

Liver wfl-AFP6 Winter flounder 71.1 10.5 18.4 0.0

stfl-AFP1 Starry flounder 67.4 14.0 18.6 0.0

Skin wfs-11-3 Winter flounder 63.0 3.7 33.3 0.0

stfs-AFP1 Starry flounder 72.0 4.0 24.0 0.0

Hyperactive wfh-AFP1 Winter flounder 63.9 13.9 19.7 2.5

stfh-AFP2 Starry flounder 64.7 9.8 23.3 2.3

50

Figure 13. Protein alignment of skin AFP variants from starry flounder, winter flounder and American plaice

The starry flounder skin (stfs) and American plaice skin (aps) AFP variants are shown, along with

the 10 known winter flounder AFP skin variants. The number of residues is noted at the end of

each row. Intraspecific polymorphisms are highlighted in grey, and ice-binding residues [27] are

denoted with asterisks. The winter flounder sequences are from Gong et al. [51] and although the

variant numbers have been retained, the acronym has been changed from sAFP to wfs-AFP

(winter flounder skin). Genbank accession numbers for wfs-11-3 and wfs-F2 are M63478 and

M63479, respectively.

50

Starry flounder * * * * * * * * stfs-AFP1 MDAPARAAAA TAAAAKAAAEA TAAAAAKAAAA TK--AAR 37 stfs-AFP2 MDAPARAAAA TAAAAKAAAEA TKAAAAKAAAA TK--AAR 37 stfs-AFP3 MDAPARAAAA TAAAAKAAAEA TAAAAAKAAAD TK--AAAAAAAAL 43 stfs-AFP4 MDAPAKAAAA TAAAAKAAAEA TAAAAAKAAAA TK--AGR 37 stfs-AFP5 MDAPARAAAA TAAAARATAEA TEAAAAKAAAA TK--AAR 37 stfs-AFP6 MDAPARAAAA TAAAAKAATEA TKAAAAKAAAA TK--AAR 37 stfs-AFP7 MGAPARAAAA TAAAAKAAAEA TKAAAAKAAAA TK--AAR 37 stfs-AFP8 MDAPAAAAAA TAAAAKAAAEA TAAAAAKAAAA TKAAAAR 39 Winter flounder * * * * * * * * * * * * wfs-F2 MDAPAKAAAA TAAAAKAAAEA TAAAAAKAAAA TKAGAAR 39 wfs-11-3 MDAPAKAAAA TAAAAKAAAEA TAAAAAKAAAA TKAAAAR 39 wfs-AFP1 MDAPARAAAA TAAAAKAAAEA TKAAAAKAAAA TKA-AAH 38 wfs-AFP2 MDAPAKAAAA TAAAAKAAAEA TAAAAAKAAAA TKAGAAR 39 wfs-AFP3 MDAPAKAAAA TAAAAKAAAEA TAAAAAKAAAD TKAKAAR 39 wfs-AFP4 MDAPAKAAAA TAAAAKAAAEA TAAAAAKAAAA TKAGAAH 39 wfs-AFP5 MDAPAKAAAA TAAAAKAAAEA TKAAAAKAAAA TKA-AAH 38 wfs-AFP6 MDAPAKAAAA TAAAAKAAAEA TAAAAAKAAA- TKAGAAR 38 wfs-AFP7 MDAPAAAAAA TAAAAKAAAEA TAAAAAKAAAA TKAAAAR 39 wfs-AFP8 MDAPAAAAAA TAAAAKAAAEA TAAAAAAAAAA TAEAAAKAAAA TKAAAAAAAAR 54 American plaice * * * * * * * * * * aps-AFP1 MDPAKAAAA TAAKAKADAEK TAAAAAKAAAD TAAAAAKAAKA AAH 45 aps-AFP2 MDPAKAAAA TAAKAKADAEK TAAAAAKAAAD TAA---KAAKA AAH 42 aps-AFP3 MDPAKAAAA TAAKAKADAEK TAAAAAKAAAD TAA------KA AAH 39 aps-AFP4 MDPAKAAAA TAAKAKADAEK TAAAAAKAAAD TAAAAA---KA AAP 42 Interspecies comparison * * * * * * * * * * stfs-AFP1 MDAPARAAAA TAAAAKAAAEA TAAAAAKAAAA TK------AAR 37 wfs-F2 MDAPAKAAAA TAAAAKAAAEA TAAAAAKAAAA TKAG----AAR 39 aps-AFP2 MD-PAKAAAA TAAKAKADAEK TAAAAAKAAAD TAAKAAKAAAH 42

51

3.6.3 Regulatory elements Analysis of the starry flounder AFP genes indicated that most of the known regulatory

elements were also conserved. There are two promoter elements that have been identified for

flounder AFPs. The first is the CAAT box, a cis-acting promoter-proximal sequence common to

many eukaryotic genes [37]. It is perfectly conserved in the winter flounder liver isoforms, and

was easily identified in stfl-AFP1 by alignment (Appendix A). This sequence has not been

identified in the skin AFP genes [51]. The second promoter element is the TATA box, a core

promoter sequence that usually lies between the CAAT box and the transcriptional start site [37].

There is a 1 bp substitution of this sequence in stfl-AFP1, changing the canonical TATAAAA to

a moderately less efficient TACAAAA [100]. Gong et al. performed primer extension studies on

skin AFP transcripts from winter flounder and established the location of two transcriptional start

sites [51], but core promoter sequences for these genes have not been precisely defined. The

same group tentatively identified a TFIID binding site 30 bases upstream of the transcriptional

start site, which is perfectly conserved in stfs-AFP1 (Appendix B). To check for other potential

promoter sites, 2.4 kb of sequence upstream of the stfs-AFP1 start codon was analysed using the

Neural Network Promoter Prediction tool, version 2.2 [95]. The sites identified by Gong et al.

scored highest (0.96 out of 1.0). Two other transcriptional start sites were also predicted, 112 bp

downstream and 153 bp upstream of the correct sites, scoring 0.92 and 0.90, respectively. A third

regulatory element is an enhancer, Element B, which was identified in the intron of wfl-AFP6. It

is bound by the liver-specific transcription factor CCAAT/enhancer binding protein α and a novel

protein designated the antifreeze enhancer protein, and appears to be responsible for the

integration of hormonal regulation and seasonal AFP expression [40]. It is present in the intron

of wfl-AFP8 with a trinucleotide TAT insertion at position 16 of the usually 30-nucleotide

52

element, but the entire sequence has been deleted from wfl-AFP9 (data not shown). This

enhancer also appears to be wholly absent in stfl-AFP1 (Appendix A). In the skin isoforms, a

ubiquitous enhancer, termed Element S, was identified in the intron wfs-F2, and is thought to play

a role in the broad expression of the skin-type AFPs [59]. A dinucleotide TA insertion in

Element S abolishes liver specificity, but it is otherwise identical to Element B [59]. We

identified an Element S-like sequence in stfs-AFP1, but there are two single nucleotide

substitutions at positions 4 and 21 of the 32-base element (Appendix B); the effects of these

changes are uncertain.

The fourth conserved regulatory element is the polyadenylation signal. There is high

sequence identity in the 3' UTR between corresponding genes (100% between the liver genes;

98% between skin genes), and all polyadenylation signals are conserved. Two other alternative

polyadenylation signals have been identified in the 3' flank of the wfl-AFP6 [38, 39], and both are

found in stfl-AFP1; the second contains a 1 bp substitution, but the third is exactly conserved

(Appendix A).

3.7 The starry flounder has multiple variants of the liver and skin AFPs Alignment of all of the liver and skin AFP genes from the starry and winter flounders

indicated that the AFPs were homologous. The 3' UTR was highly conserved among all variants

of all isoforms (Appendix C), as were the 5' UTR and signal peptide sequences among the liver

isoforms in both fish (Appendix D). Thus, PCR was employed to sample the variation of AFP

genes in the starry flounder genome and lambda library. PCR primers were designed to

conserved regions identified from alignments of the starry flounder AFP genes with all available

53

winter flounder AFP sequences, and are summarized in Table 1. The 5'int and 3'int primers are

located at the 5' and 3' ends of the intron, respectively (Figure 6, Figure 10). They were chosen

because their sequences were largely conserved across the liver, skin and hyperactive isoforms in

both species, except for a few nucleotide substitutions. The 3'univ primer was designed to a

portion of the 3' UTR conserved among all three AFP isoforms. When the 3'int and 3'univ

primers were paired, the expected sizes of the fragments amplified from skin, liver, and

hyperactive genes would be 185 bp, 275 bp and ~ 610 bp, respectively, as the lengths of the

coding sequences are quite different. When the 5'int and 3'univ primers were used, a large

deletion in the starry flounder liver gene was expected to allow for differentiation between the

PCR products expected from the skin (1312 bp) and liver (703 bp) isoforms; sequence was not

available for this portion of wfh-5a. Two 5' primers specific for the skin isoforms were also

designed: allsk is conserved across winter flounder and starry flounder skin genes, whereas the

stfsk sequence is only seen in the starry flounder. Band sizes expected from stfsk and allsk when

paired with the 3'univ primer were 784 bp and 734 bp, respectively.

Initial PCR reactions using the primers described above included some failed positive

controls. Therefore, conditions were optimized for each primer pair using starry flounder

genomic DNA as the template. Three different buffers were assessed (Qiagen KCl+(NH4)2SO4,

Fermentas KCl, Fermentas (NH4)2SO4) as well as four different magnesium concentrations (1.5,

2.0, 2.5 and 3.0 mM MgCl2) and the effects of Q solution (Qiagen), a PCR additive designed to

help amplify GC-rich templates. Calculated melting temperatures for all primers (Tm = 2(A+T) +

4(G+C)) were similar and reaction annealing temperature was set to ~ 54 °C to minimize the

effects of single mismatches between the primers and potential priming sites. Optimized

conditions were applied to the 25 phage stocks from the primary library screen; representative

54

PCR results are shown in Figure 14. PCR products of both expected and anomalous sizes from

the genomic optimization and phage screening experiments were cloned into plasmid vectors and

sequenced. The template DNA for all these experiments, regardless of whether it was from the

phage library or directly from genomic DNA, came from the same starry flounder liver (QCI #1).

Altogether, three liver variants were isolated, along with eight skin and two hyperactive variants

(Table 3). For the purposes of this thesis, a variant is defined as an AFP that can be clearly

categorized as one of the three isoforms, but contains one or more unique substitutions. Thus,

"three liver variants" denotes "three type I AFPs of the liver isoform with minor changes in the

amino acid sequence". Where necessary, clones were sequenced on both strands to ensure

changes were not sequencing artifacts. Since the primers were designed to highly conserved

regions and since both genomic DNA and primary phage stocks were sampled, we expect that we

have obtained a fairly good representation of the sequence variety in this gene family.

As previously mentioned, two lambda clones had been plaque purified, one from

screening with the liver probe and one with the hyperactive probe, but only the latter was

sequenced. To determine whether the two inserts were highly similar and/or overlapping, two

additional primers (5'up and 3'up, Table 1) were designed to a unique 1050 bp region of the

lambda insert 1 kb upstream of the AFP genes. PCR with this primer pair as well as the primers

described above produced bands of the expected sizes in the second unsequenced insert, and it

was not considered further.

3.7.1 Liver variants

Variability was low among the liver variants. Fourteen clones were obtained from

various sources and produced four unique DNA sequences that were greater than 99% identical.

55

Table 3. Type I AFP variants isolated from the starry flounder genome

All type I AFP amino acid sequences inferred from genomic cloning and PCR experiments are

shown, with polymorphisms highlighted in grey. The sources listed for each variant denote

unique DNA sequences obtained from either genomic clones (g#, see Appendices E, F, and H for

full sequences) or primary phage stocks (code provided corresponds to Figure 14). Unknown

sequence is marked with dashes (-).

55

ISO

FO

RM

S

OU

RC

E

SE

QU

EN

CE

L

iver

1 10 20 30 40 50 60 70

Stfl

-AFP

1 g1

, g2,

H1,

H7,

L1,

L2,

L4

ISEANPDPAAKAAAVADPAAAAVAPAADAFSAAADTASDAAAAAAATAAAAKAAAEKTARDAAAAAAATARG

Stfl

-AFP

2 g3

, L7,

L8,

H9,

H13

, H14

INEANPDPAAKAAAVADPAAAAVAPAADAFSAAADTASDAAAAAAATAAAAKAAAEKTARDAAAAAAATARG

Stfl

-AFP

3 L5

INEANPDPAAKAAAVADPAAAAVAPAADAFSAAADTASDAAAAAAATAAAAKAVAEKTARDAAAAAAATARG

Ski

n

1 10 20 30 40

Stfs

-AFP

1 g4

, H1,

H7

MDAPARAAAATAAAAKAAAEATAAAAAKAAAATKAAR

Stfs

-AFP

2 g5

, g6,

g7,

L5,

L7

MDAPARAAAATAAAAKAAAEATKAAAAKAAAATKAAR

Stfs

-AFP

3 g8

MDAPARAAAATAAAAKAAAEATAAAAAKAAADTKAAAAAAAAL

Stfs

-AFP

4 g9

MDAPAKAAAATAAAAKAAAEATAAAAAKAAAATKAGR

Stfs

-AFP

5 H

5 MDAPARAAAATAAAARATAEATEAAAAKAAAATKAAR

Stfs

-AFP

6 g1

0 MDAPARAAAATAAAAKAATEATKAAAAKAAAATKAAR

Stfs

-AFP

7 H

5 MGAPARAAAATAAAAKAAAEATKAAAAKAAAATKAAR

Stfs

-AFP

8 H

11

MDAPAAAAAATAAAAKAAAEATAAAAAKAAAATKAAAAR

Hyp

erac

tive

1 10 20 30 40 50 60 70

Stfh

-AFP1

g11

----------------------------------------------------------ANAAAAAATAAA

AAIAAEEAATAAATAAAAAAATAATAQAAIFDKAAAAASTTATTAATAAATIATTAAAAA----------

-----------------------------------------------------------

Stfh

-AFP2

g12,

g13

, H11

ITEAIDPAAQAAAAAAAAAAVVTAADAAAAAANAAANAAAVAAATAADVATASIATIKANAAAAAATAAA

AAIAAEEAATAAATAAAAAAATAATAQAAIFDKAAAAASTTATTAATAAAATATTAAAAAAATETIDKAA

AAAAAAAATAVATAAAAAATAAATAAATLGAAAAKAAATAVAAAAAAAIAAAAAAAAPP

56

Figure 14. Characterization by PCR of the AFP isoforms present in phage stocks isolated from the primary library screen

Both genomic DNA (g) and 25 phage plaques picked during the primary library screen were

subjected to PCR, using either the 5'int or 3’int primer with the 3'univ primer (Figure 10), or in a

nested reaction using the 2a and 3b primers followed by the 2b and 3b primers (Figure 6). The

gel (top panel) shows the products obtained using the 5'int and 3'univ primers. The table beneath

the gel indicates the presence (+) or absence (-) of bands of the expected size corresponding to

liver (L), skin (S) and hyperactive (H) isoforms for the three primer combinations. N indicates

that a plaque was not assessed and circles indicate that the product was subcloned and sequenced

(Appendices E, F, and H, Table 3). Plaques are denoted H1 to H15 or L1 to L10 and were

obtained using the Hyperactive AFP and Liver AFP probes from winter flounder respectively.

The negative (-) and positive (+) controls used no template DNA and the primary plaque from

which insert #1 was isolated, respectively. The asterisk denotes a phage that produced no bands

under these PCR conditions but produced several with the allsk and 3’univ primers (data not

shown).

PCR products were electrophoresed on a 1.5% agarose gel and size markers (M, in kb) are noted

on the right. Expected sizes for the skin and liver isoforms are 1312 bp and 703 bp, respectively,

based the sequence of insert #1. The leftmost lane contains fragments amplified from starry

flounder genomic DNA, several of which (g2, g3, g6, g8 and g9) were subcloned and sequenced

(Table 3). Reactions were cycled under standard conditions, except for the magnesium

concentration (2 mM) and 10x buffer (KCl only, Fermentas).

56

1.5

1.2

1.0

0.5

M

–+ 1

23

45

67

89

1011

1213

1415

H

12

34

56

78

910

1112

1314

151

23

45

67

89

1011

1213

1415

H

12

34

56

78

910

L

12

34

56

78

910

12

34

56

78

910

L

nes

t

*

NN

-N

NN

--

NN

NN

-N

NN

--

NN

N-

NN

NN

NN

--

+-

NN

NN

-N

NN

NN

N-

-+

-N

NN

5’in

t+

++

++

--

++

++

++

+-

++

-+

--

-+

+-

--

+-

++

+-

+-

++

++

-+

+-

--

+-

--

-L S

5’in

t+

++

++

--

++

++

++

+-

++

-+

--

-+

+-

--

+-

++

+-

+-

++

++

-+

+-

--

+-

--

-L S

++

++

+-

-+

++

++

++

+-

-+

++

++

++

-+

+-

+-

--

++

-+

++

+-

++

-+

--

-+

+-

--

+-

++

+-

+-

--

+-

++

+-

+-

++

++

-+

+-

--

+-

--

-+

++

+-

++

--

-+

--

--

L S

3’in

tL S

++

++

++

-+

++

++

++

++

-+

++

++

++

-+

+-

+-

--

++

-+

++

+-

++

-+

--

-+

+-

--

+-

++

+-

++

--

+-

++

+-

++

++

++

-+

+-

--

+-

--

-+

++

+-

++

--

-+

--

--

gg

g9g6

g2,

3g

8

H

57

Three of the fourteen sequences were isolated from genomic DNA and eleven from different

primary phage stocks. All were isolated using 3'univ as the 3' primer; 5'int was used to amplify

four clones whereas 3'int was used for the other ten. There were a total of five nucleotide

substitutions among all sequences (Appendix E). Two of these were each located in seven

sequences, suggesting that they are real polymorphisms rather than sequencing artefacts; one was

silent, while the other caused an S21N substitution in the signal peptide. This S21N substitution

will likely have no effect on protein function or cleavage, as it is neither a drastic change in terms

of size and hydrophobic character, nor in a critical position. The other three substitutions were

observed in only one clone each: two occurred at different locations in the intron while the last

resulted in an A73V substitution in the mature protein (Figure 11). These may be the result of

mispairing during plasmid replication or base misincorporation during PCR, as Taq polymerase

does not have 3' à 5' exonuclease activity. Because the A73V substitution in the coding region

of variant 3 could be the result of such a mistake, it may be artifactual. Thus, it appears that there

are at least two variants of the liver isoform. A helical wheel diagram combining the mature

peptides of all three variants indicates that the ice-binding face of the helix is conserved (Figure

12A), thus supporting the argument that these proteins are functional.

3.7.2 Skin variants A total of fourteen unique clones containing skin isoforms were obtained by PCR using

assorted 5' primers coupled to the 3'univ primer. Seven sequences were isolated with 5'int, four

with allsk and three with 3'int (Figure 6B). Of these, seven were isolated from genomic DNA

whereas seven were found in the primary phage stocks. There was much more variation observed

among the skin variants than among the liver AFPs, but identity remained high over the aligned

regions (Appendix F). Of the eleven clones containing intronic sequence, four had large deletions

58

of 502 – 550 bp in the middle of the intron, and therefore did not contain sequence to which the

stfsk and allsk primers could bind. Two of these four sequences were also missing a poly(G) tract

located approximately 100 – 120 bp from the end of the intron. All eleven sequences contained

an Element S-like sequence that was identical aside from a T à A substitution at position 4. All

but one sequence contained a C à G substitution at position 21 of the 32-base element

(Appendix B, Appendix F).

Analysis of the translated sequences showed that there are eight skin AFP variants; the

total length of the proteins was generally 37 residues, unless otherwise noted. Variant 1 was

isolated from lambda insert #1 (Figure 9), and is described above relative to the winter flounder

skin AFP. Variant 2 was identical to variant 1, except for an A23K substitution (Figure 13). The

C-terminal sequence of variant 3 was much different from that of the other starry flounder skin

AFPs, as it contained A32D and R37A substitutions, along with six extra residues at the C

terminus (AAAAAL, 43 residues in total) not present in any other starry flounder skin AFP

variant. C-terminal extensions may arise due to a mutation in the stop codon, leading to the

translation of the 3' UTR. However, none of the possible reading frames following the stop

codon could generate this sequence. Moreover, since the 3' UTR, including the complementary

sequence for the 3'univ primer, was conserved and aligned well with all other starry flounder skin

variants (page 5 of Appendix F), these extra C-terminal residues appear to be the result of an

insertion. Variant 4 contained two substitutions, R6K and A36G. Variant 5 contained K16R,

A18T and A23E substitutions. Variants 6 and 7 were identical to variant 2 but contained A19T

and D2G substitutions, respectively. Finally, variant 8 was 39 residues long and included a two-

residue Ala-Ala insertion towards the C-terminal end, along with a R6A substitution. Helical

wheel diagrams indicated the frequency with which the various substitutions occurred around the

59

helix (Figure 12); all variants shown in Figure 13 were included. The Thr residue at position 2 is

from variant 5, and was observed in only one 11-amino acid repeat motif out of eight three-repeat

variants.

The frequency with which each variant was encoded by a unique gene was also assessed.

Variants 1 and 2 were isolated three and five times, respectively. Variant 1 was clearly encoded

by more than one gene, as two sequences isolated using 5'int were of significantly different sizes

(g4 and H1, Appendix F). Variant 2 is likely also encoded by at least two genes, as one sequence

(g7) contains two silent single nucleotide substitutions in the coding region. Aside from these

two silent substitutions, the five sequences coding for variant 2 contained six unique single

substitutions scattered in the intron and one in a non-coding portion of exon 2. Variants 3

through 8 were each isolated only once, but the number of changes at the nucleotide level coupled

with the fact that many of these changes were shared with other sequences suggested that these

variants are real. The only exception was variant 6, where the DNA sequence (g10) was identical

to one encoding variant 2 (g5) over 686 bp, except for a unique C à T transition near a poly(G)

tract and a unique G à A transition (Appendix F) that resulted in an A19T substitution. Thus,

there are at least seven variants of the skin isoform in the starry flounder genome.

A comparison of the starry flounder skin AFPs to those of the winter flounder revealed

that the same residues tend to show variability in both species. For example, Lys, Ala or Arg are

seen in both species at position 6 (Figure 13), but at different frequencies. Similarly, both Ala

and Lys are found at position 23 for both species, with an additional substitution (Glu) in stfs-

AFP5. There were also differences in terms of protein length, with all insertions and deletions

confined to the C-terminal portion of the proteins. Most of the winter flounder skin AFPs were

60

39 residues long, whereas most of the starry flounder skin AFPs were 37 residues long. All of the

ice-binding residues and the 11-amino acid repeat motifs were conserved between all isoforms of

both species, except in variant 6 as noted above. Helical wheel diagrams including all variants

from Figure 13 show clear conservation of the ice-binding face in both the starry and winter

flounders (Figure 15), suggesting that the starry flounder skin AFPs are functional.

3.8 The American plaice has multiple variants of the skin AFPs The American plaice was found early on to have variable levels of plasma TH activity

but did not show evidence of a large AFP gene dosage when its genomic DNA was probed with a

cDNA of the flounder liver AFP sequences [62, 73]. Subsequently, it was realized that the lone

source of its plasma TH activity was a large thermolabile hyperactive isoform [71].

Chronologically, this isoform was actually first discovered in the American plaice, but to

maximize the impact of the discovery it was initially reported in the winter flounder [63].

Having clarified the source of the antifreeze activity in the plasma, we wanted to know if

the American plaice had skin AFPs. Given the success of the PCR experiments on starry

flounder DNA, the skin-specific primer allsk was paired with the 3'univ primer and applied to

American plaice genomic DNA. Standard conditions were used, with the following

modifications: (NH4)2SO4 buffer (Fermentas), 1.5 mM MgCl2, annealing temperature of 62 °C,

30 cycles. PCR products were TA cloned, and nineteen of the twenty clones selected were

sequenced. These results were obtained only weeks prior to submission of this thesis and thus,

exhaustive analysis cannot be reported here. However, all products were quite similar, with a

total of 37 nucleotide substitutions in the ~760 bp long sequences (Appendix G). Only 10 of

61

Figure 15. Helical wheel diagrams of skin AFPs from starry flounder, winter flounder and American plaice

The shaded areas indicate the frequency of naturally-occurring amino acid substitutions in all

known starry flounder (A), winter flounder (B), and American plaice (C) skin AFPs, and the

black arcs indicate the residues that make up the ice-binding face. The diagrams omit residues

prior to and including the first Pro (Figure 13) because Pro is a known helix breaker and because

these residues are thought to form an N-terminal cap structure that arches away from the ice-

binding face. Also omitted is the terminal residue, which is thought to curl away from the ice-

binding surface (L. Graham, personal communication).

61

ThrThr

AlaAla

ArgArg

GluGlu

Asp

A

Starry flounder

B

Winter flounder

C

American plaice

LysLys

GlyGly

62

these substitutions (37%) were unique whereas the rest were present in multiple clones, and there

were no major insertions or deletions in any of the clones relative to the rest. A sequence similar

to the Element S enhancer [59] was present, with one T à C substitution present at position 16 in

all clones and a C à T substitution at position 17 in fifteen out of nineteen clones. The effects of

these substitutions in this context remain to be determined. Use of the 3'univ primer precluded

conservation analysis of the 3' UTR, but the eight nucleotides present between the end of the

coding region and the start of the primer were identical in all clones.

Overall, we sequenced seventeen unique genes encoding four different proteins. The

proteins are identical aside from small insertions in the C-terminal region and a terminal Pro in

aps-AFP4 in place of the His seen in the other variants (Figure 13). Residues 35 – 37 have been

deleted from aps-AFP2, relative to aps-AFP1, but this does not affect the ice-binding face.

Similarly, residues 38 – 40 have been deleted from aps-AFP4 and this also does not affect the

binding face. The last variant, aps-AFP3, contains a six-residue deletion (positions 35 – 40 in

aps-AFP1). A helical wheel diagram combining the sequences of all four American plaice skin

variants was constructed (Figure 15C), and there is clear conservation of the ice-binding face,

suggesting that these proteins are functional. Comparison of a representative skin variant from

each of the winter flounder, starry flounder and American plaice indicate that the ice-binding face

is conserved in spite of a trend of higher C-terminal variation across species (Figure 13).

3.9 The starry flounder has multiple variants of the hyperactive AFPs Alignments used to design the 5'int, 3'int, and 3'univ PCR primers included the intron of

the wfh-5a “pseudogene” and the 3' UTRs of both wfh-5a and wfh-AFP1. However, no bands

63

large enough to contain a hyperactive isoform were observed after optimization of PCR reaction

conditions on starry flounder genomic DNA. To improve the chances of cloning sections that

code for the hyperactive AFP, four new primers were developed. Primers 2 and 3 were designed

in unique and relatively Ala-poor regions of wfh-AFP1 (Table 1), while primers 1 and 4 were

based on 3'int and 3'univ, respectively, but shifted inwards to include unique sequence at the ends

of the second wfh-AFP1 exon (Figure 6C). PCR conditions for these primers were optimized on

starry flounder genomic DNA. Four magnesium concentration were assessed (1.5, 2.0, 2.5, and

3.0 mM MgCl2) and an annealing temperature of 53 °C was used, a temperature 7 – 9 °C below

the calculated Tm, in order to encourage priming. Two successive rounds of amplification

generated a large number of bands, but only two conditions produced bands of the expected size

(261 bp). Sequencing of three TA clones made from these PCR products confirmed that they

were of the hyperactive type (Appendix H). These clones were highly similar, with only four

single nucleotide substitutions. One clone contained two silent nucleotide substitutions while

another contained two nucleotide substitutions that resulted in two amino acid substitutions

(Table 3). Codon usage for Ala reflected typical type I AFP frequencies (Table 2). Alignment of

one sequence, stfh-AFP2, with wfh-AFP1 and wfh-5a showed moderately high identity (87% and

83%, respectively); there was 82% identity between the two winter flounder genes (Appendix I).

Many of these changes led to alterations in the protein sequence, relative to the expressed winter

flounder AFP (Figure 16). In the winter flounder hyperactive isoform, the 11-amino acid repeat

motif is not as well-conserved as it is in the liver and skin isoforms, and the same trend was

observed for the starry flounder sequence, indicated in Figure 16. However, the ice-binding face

has only been modelled to date [65] and it is not possible at this time to say if these amino acid

substitutions in the starry flounder AFP would affect antifreeze activity.

64

Figure 16. Protein alignment of hyperactive AFP variants from starry flounder and winter flounder

The starry flounder hyperactive (stfh) AFPs are shown, along with the winter flounder

hyperactive AFP and the putative pseudogene wfh-5a. The starry flounder sequence stfh-AFP2

was obtained from insert #2 (Figure 17). The end of the signal peptide is italicized, but the rest of

the signal peptide sequence was not present in the insert and is unknown. The sequence of stfh-

AFP1 is a partial hyperactive variant isolated from genomic DNA via PCR. Putative ice-binding

residues are marked with an asterisk [65] and the number of residues is noted at the end of each

row. Genbank accession numbers for wfh-AFP1 and wfh-5a are EU188795 and M63477,

respectively.

64

* * * * wfh-AFP1 ITEANIDPAARAAAAAAASKAAVTAADAAAAAATIAASAASVAAATAADD 50 wfh-5a ITEA-IDPAAKAAAAAAAATAVVTAAAAAAAAAAIAATAAAVAGATAADA 49 stfh-AFP2 ITEA-IDPAAQAAAAAAAAAAVVTAADAAAAAANAAANAAAVAAATAADV 49 * * * * * wfh-AFP1 AAASIATINAASAAAKSIAAAAAMAAKDTAAAAASAAAAAVASAAKALET 100 wfh-5a AAASIASINANTAAAAAIAAAAAKAAEEAAATAAAAAATTAATAATAQAT 99 stfh-AFP2 ATASIATIKANAAAAAATAAAAAIAAEEAATAAATAAAAAAATAATAQAA 99 stfh-AFP1 ANAAAAAATAAAAAIAAEEAATAAATAAAAAAATAATAQAA 42 * * * * wfh-AFP1 INVKAAYAAATTANTAAAAAAATATTAAAAAAAKATIDNAAAAKAAAVAT 150 wfh-5a IKDKAAAAAASTATNAAAAAAATATTAAAAAVAKTTIDKAAAAVAVAAAT 149 stfh-AFP2 IFDKAAAAASTTATTAATAAAATATTAAAAAAATETIDKAAAAAAAAAAT 149 stfh-AFP1 IFDKAAAAASTTATTAATAAATIATTAAAAA 72 * * * * wfh-AFP1 AVSDAAATAATAAAVAAATLEAAAAKAAATAVSAA-AAAAAAAIAFAAAP 199 wfh-5a AVAAAAATAATAAATAAATLGAATVKAAATAVNAAAAAAAATAAAAAAPP 199 stfh-AFP2 AVATAAAAAATAAATAAATLGAAAAKAAATAVAAAAAAAIAAAAAAAAPP 199

65

The small portion of the hyperactive isoform that was isolated indicated that the gene was

quite divergent from the winter flounder sequence, and this was likely the cause behind the

difficulty in isolating it using winter flounder-based primers. Using this new sequence, five new

primers specific to the starry flounder hyperactive isoform were designed. Primer 2a was based

on primer 2, but was shifted downstream by nine bases, so that the 3' end of the primer would be

specific to the starry flounder sequence. Primers 3a1 and 3a2 were based on primer 3, but were

shifted downstream by three and nine nucleotides, respectively, for the same reason. Primers 2b

and 3b were designed to regions that were unique to the starry flounder hyperactive gene. PCR

conditions were again optimized for these primers, using three magnesium concentrations (1.5,

2.0, and 2.5 mM MgCl2) as well as six annealing temperatures (52.2, 54.0, 56.0, 58.0, 59.9, 61.6

°C), so as to maximize specific binding. Using optimal conditions (2.0 mM MgCl2 and 58.0 °C),

two successive rounds of PCR were performed using first the 2a and 3b primers, then nesting the

reaction products with the 2b and 3b primers. A single band of the expected size was isolated

from plaque H11. TA cloning and sequencing of this 148 bp product confirmed that it was

identical to positions 59 – 116 of the three genomic clones previously examined (Appendix H).

3.10 Genes for starry flounder hyperactive and skin AFPs are closely linked Having confirmed the presence of a hyperactive isoform in the H11 primary plaque via

PCR analysis, this phage was plaque purified as described above and sequenced using the 454

method (Genome Quebec). Three gaps (~ 500 bp) remained following assembly using

DNAMAN, and preliminary analysis indicated that the unresolved regions were highly repetitive.

Repetitive and highly similar sequence data are not amenable to the 454 method due to the short

length of the 454-generated sequence reads (average 218 bp), and Genome Quebec attempted

66

unsuccessfully to sequence through these gaps. Therefore, primers were designed to the ends of

available sequence in order to PCR-amplify the regions and resolve the gaps via conventional

methods. No optimization was performed, and reactions were carried out on the lambda DNA

under standard conditions, with the following modifications: (NH4)2SO4 buffer (Fermentas), 1.5

mM MgCl2, annealing temperature of 55 °C, 3 minute extension step, 30 cycles. One 414 bp gap

was resolved with the primers Right arm and 3'univ (Table 1), whereas the other two gaps (21 bp

and 63 bp) were resolved using 3'int with Contig1#1 and maxisfstop with con2end2#1,

respectively.

This phage insert (insert #2) was 18024 bp in length and contained a complete gene for a

skin variant and exon 2 of a hyperactive variant (Figure 17). The 414 bp gap in the GC-rich

interior of the hyperactive exon was presumably not amplified at all using the 454 method, as it

was not once represented in the approximately 12000 reads (~ 2.6 Mb) sequenced. As well, the

short sequence reads could not be used to reliably assemble the sequence immediately

downstream of the two genes, as these regions contained (GT)n repeats and were highly similar,

with 98% identity over 661 bp, after exclusion of a 44 bp gap (data not shown). The two genes in

this insert were oriented in the same direction and separated by 5.6 kb of intergenic sequence

(Figure 18). Based on the PCR experiments, we were expecting one of the hyperactive variants

and stfs-AFP8; we obtained stfh-AFP2 and stfs-AFP8, and no other AFP genes were present in

the insert. These genes were located within a 7.6 kb stretch at one end of the insert.

As this sequence was received only a few weeks prior to the submission of this thesis,

exhaustive analysis has not yet been performed. The sequence between the two genes as well as

that downstream of the skin gene were used to query Genbank nucleotide databases in search of

67

Figure 17. Schematic diagram of starry flounder genomic DNA insert #2

The 18024 bp insert is indicated by the solid line and the lambda arms by the dotted lines.

Predicted exons of the two starry flounder AFP genes are shown by the thicker grey boxes. Note

that the short (23 bp) non-coding 5' exon of the skin AFP appears as a solid line at this scale and

that the 5' region of the hyperactive gene is missing as the insert starts within the intron. The

diamond-topped lines indicate EcoRI cut sites. The four underlined regions indicate regions

outside the AFP genes that are similar to known sequences in the Genbank database: (A) 100 bp

segment, beginning at base 2838, shows 77 to 92% identity to the D8 domain of the 28S rRNA

gene in several fish, (B) and (C) 217 bp and 178 bp segments at positions 3998 and 5073

respectively, with 79% identity to the same non-coding sequence in the 5' flank of wfl-AFP6, (D)

393 bp segment beginning at base 8265 with 82% identity to a non-coding sequence in a cluster

of MHC class II antigen genes in the three-spined stickleback Gasterosteus aculeatus.

67

118

024

Ski

n

Hyp

erac

tive

1 kb

1 kb

1732

453

8859

10

A

B

C

D

λ λλλRλ λλλL

68

Figure 18. Schematic diagram of the AFP gene organization in genomic DNA insert #2

The portion of the insert from Figure 17 containing the two AFP genes is shown with intronic and

intergenic sequences represented by black lines and exons by boxes in which the translated

portions are indicated with shading. The 5.6 kb of intergenic sequence between the AFP genes

was shortened for clarity with the break marked by diagonal lines. A putative TFIID binding site

is marked by a grey vertical arrow and a putative enhancer sequence is denoted with an asterisk.

68

Pro

mo

ter

Un

-tra

nsl

ated

Tra

nsl

ated

En

han

cer

250

bp

5.6

kb0.

8 kb

Hyp

erac

tive

Ski

n

..

*

*3’u

niv

5’in

t3’

int

3’u

niv

3’in

t

69

other genes, and preliminary results showed four regions with significant identity to other genes

or gene segments. Two matches in the intergenic region were 178 bp (match C in Figure 17) and

217 bp (match B in Figure 17) in length, and both showed 79% identity to a non-coding region

upstream of the main the winter flounder liver isoform, wfl-AFP6 (respective E-values 5 x 10-31

and 3 x 10-40). Another stretch of approximately 100 bp in the intergenic region showed 77 –

92% identity to domain 8 of the 28S rRNA gene in several fish; the minimum E-value was 1 x 10-

14 (match A in Figure 17). Finally, a 393 bp region approximately 650 bp downstream of the skin

gene aligned with 82% identity to a non-coding sequence in a cluster of MHC class II antigen

genes (E-value 3 x 10-104) (match D in Figure 17). Other matches aligned with highly repetitive

(GT)n and (CA)n repeats and were not considered further. Restriction analysis revealed three

EcoRI sites starting at positions 5388, 5910 and 17324 (Figure 17). The latter two sites produce

an 11.4 kb fragment, which presents as a faint band on all Southern blots presented here (Figure

8). No SacI sites were present in the insert, indicating that these two genes are likely not included

in the predicted tandem array.

3.11 Regulatory elements are conserved in stfs-AFP8 Like the stfs-AFP1 gene, stfs-AFP8 is also quite similar to the winter flounder skin genes.

Alignment with wfs-11-3 showed 96% identity in the first exon and 96% in the second, with 95%

identity in the intron (68% including gaps) (Appendix J). The stfs-AFP8 intron was 835 bp in

length, compared to the 794 wfs-11-3 intron and the 1212 bp stfs-AFP1 intron, and contained

both a 96 bp deletion and a 149 bp insertion relative to wfs-11-3. Target sequence for the stfsk

and allsk primers was not present within the intron of stfs-AFP8, but the Element S enhancer was

conserved except for two substitutions at positions 4 and 21. The coding regions were 94%

70

identical, and of the seven substitutions present, five were silent. The 5' and 3' UTRs showed a

comparable degree of identity, 97% and 99% respectively, and the polyadenylation signal and

putative TFIID binding site were conserved perfectly.

3.12 Starry flounder hyperactive AFP is homologous to its winter flounder counterpart Isolation of a substantial portion of the starry flounder hyperactive gene and its alignment

with the corresponding portion of the winter flounder gene marked the first interspecific

comparison for this isoform. The choice of the maxi5'mid primer (Table 1) initially used to

search for this isoform was fortunate, as the sequence is conserved perfectly in the starry flounder

gene. Some of the initial difficulty in isolating this gene may have arisen from the choice of the

maxi3'mid primer, which contains three mismatches in the 3' half of the 21-bp primer (Appendix

I). Over the length of exon 2, there was 83% identity (82% including gaps), a noticeable drop

from the same comparison in the other isoforms (92% for liver and 96 – 97% for skin). Within

the coding region, the sequences were 81% identical (80% including gaps). There are no major

insertions or deletions, but it is interesting to note that a third of the substitutions in the coding

region are silent. There was high conservation of the 3' UTR, with 92% identity and complete

conservation of the polyadenylation signal. The codon bias for Ala is typical of type I AFPs, with

GCC being most common, followed by GCA and GCT (Table 2). Because sequence from both

the wfh-AFP1 intron and the first exon of stfh-AFP2 are currently unavailable, no other analysis

could be performed for regulatory elements or gene structure. However, it appears that the end of

a signal peptide has been conserved in stfh-AFP2, and the gene structure is expected to be

conserved as well. The predicted product of the stfh-AFP2 exon contained many substitutions

71

compared to its winter flounder homologue; there was 75% identity between the proteins. As

with the winter flounder hyperactive isoform, the 11-amino acid repeat motif is less well-defined

(Figure 16) than in the smaller isoforms, but the implications of this have yet to be determined.

3.13 Starry flounder-specific probes do not alter Southern blot banding patterns Because the PCR-isolated starry flounder hyperactive gene fragment was significantly

different from that of the winter flounder, it was used to reprobe the same blot. The banding

pattern in the winter flounder lanes was similar to that seen with the winter flounder hyperactive

probe, except for an increase in the intensity of some of the larger bands in the SacI digest (3.7,

4.8 and >10 kb) (Appendix K). The relative intensities in the BamHI digest remained the same as

before. In the starry flounder lanes, the banding pattern did not change significantly; the 3.5 kb

and 4.7 kb bands in the SacI digest still hybridized strongly, as did the 7.8 kb band in the EcoRI

digest. Differences between fish were the same as described above – there was a relatively weak

signal from fish 4 (EcoRI digest), possibly caused by partial degradation or variable signal

strength due to intrapopulational polymorphism. It was not possible to draw any conclusions

based on relative band intensities due to multiple rounds of stripping and probing of the blot,

differences in the washing conditions, and lack of densitometric analysis, all of which led to

increased background hybridization.

Few Southern blots have been probed with the type I skin AFPs. With the ready

availability of a starry flounder skin probe from the abovementioned PCR experiments, the

Southern blot was stripped for a fourth time and re-probed. The signal in the winter flounder

lanes was much weaker than previously observed with any probe (Figure 8C) despite a strong

72

signal in the starry flounder lanes and high identity between the skin isoforms of the two species

(Appendix B). This may have been due to heavy use of the blot, and these signals must be

confirmed on a fresh blot. The band intensities in the starry flounder lanes were significantly

stronger than observed previously, and the 4.7 kb SacI and 7.8 kb EcoRI bands were saturated on

the film (Figure 8C). In the SacI digest, weak bands are still visible at 3.5 kb, but bands that were

barely visible in the previous blots now hybridize clearly at 1.5 kb; similarly, previously weak

bands are still visible at 4.2 kb in the EcoRI digest, but others at 2.5 kb intensify significantly.

The increased intensity in the previously observed weak bands is likely due to different wash

conditions, but the bands appearing at 1.5 kb (SacI) and 2.5 kb (EcoRI) may contain a cluster of

skin genes. In spite of the poor quality of the blot, the fact that some of the same bands hybridize

strongly to skin probes as well as liver and hyperactive probes provides further evidence of close

linkage of the skin AFP genes with the other two isoforms.

73

Chapter 4

Discussion

The first correlation between AFP gene dosage and the degree of exposure to icy

seawater was made with the ocean pout [88]. Fish sampled from two discrete populations off the

New Brunswick and Newfoundland coasts had very different mid-winter plasma AFP levels, with

the former being 2 – 3 mg/mL and the latter 20 – 25 mg/mL [101]. Despite this order of

magnitude difference in AFP concentrations, the two populations showed similar AFP isoform

complexity in their plasma, with approximately twelve different types resolved by reverse-phase

HPLC. These differences in plasma AFP levels correlated with type III AFP gene dosage: the

more southerly New Brunswick population had 30 – 40 copies of the AFP gene and the

Newfoundland population had ~150 copies [88]. However, as only two locations were sampled,

the correlation between AFP gene amplification and selection by a harsh environment was

tenuous. Similar but less spectacular results were seen between shallow- and deep-water

populations of winter flounder in the Canadian Maritimes [89], but results were confounded by

contributing factors besides habitat depth, such as latitude and exposure to different currents, so

again, no solid conclusions could be drawn.

Therefore, a better system was needed to investigate the influence of environment on

AFP gene amplification. The starry flounder was chosen because it inhabits a long continuous

stretch of the Pacific coast spanning a wide range of latitudes from California to the Arctic [90].

Starry flounder are one of the few species of Pleuronectidae in which fish can be right-eyed or

left-eyed, and there is a clinal variation in the proportion of left-eyed to right-eyed individuals

74

that varies from approximately 50% in California to almost 100% in Japan [102]. Such graded

variation suggests that populations experience limited mixing over the range. Initial samples

were collected in the late 1980’s and genomic Southern blots were probed with wfl-AFP6 (Figure

5). The results looked promising, as there were large increases in signal strength between fish

from California, BC and Alaska. However, with a limited sample size (only two Alaskan fish),

more individuals and additional sample sites were required to establish a trend. A collection of

starry flounder tissues was assembled in the years since the initial blot, but preparation of

genomic DNAs from those tissues in the experiments described here revealed that the new Alaska

samples were too degraded to be used for genomic Southern blots. DNA prepared from the QCI

fish was of high quality, and Southern blots showed that their hybridization patterns were very

similar to those of fish from Vancouver (Figure 8A vs. Figure 5).

When the original starry flounder Southern blot was probed with the winter flounder liver

isoform cDNA, it was assumed that the observed signal measured the one and only set of AFP

genes present in the genome. The analysis of gene dosage in this manner has become more

complex since this blot was done, due to the subsequent discoveries of the skin [50, 51] and

hyperactive isoforms [63, 65]. Knowing the extent to which these AFP probes cross-hybridize to

other isoforms is vital for the accuracy of gene dosage assessment. Thus, we chose to study the

contribution of each isoform to the AFP gene signals in starry flounder before making further

attempts to broaden the collection of samples.

The signal intensity observed in each of the four QCI starry flounder DNAs with the

winter flounder liver isoform probe was comparable to that seen in the winter flounder controls.

The winter flounder signal is known to represent 30 – 40 copies of this isoform, suggesting that

75

the QCI fish have at least this many AFP genes. This is consistent with the 1990 blot, whereas in

the BC sample, what appears to be single copy gene signals are seen together with intense bands

of hybridization that could easily contain 20 – 30 gene copies. The fact that the QCI AFP gene

signals are largely concentrated in a 4.7 kb SacI fragment and in much larger EcoRI fragments

strongly suggests that the genes are present in tandem repeats. In the winter flounder, a single

liver AFP gene is present in a tandemly repeated 7.8 kb BamHI fragment that lacks an EcoRI cut

site [32]. Therefore, EcoRI and other restriction enzymes that do not cut within the amplicon

produce uncharacteristically long tracts of genomic DNA containing numerous tandem repeats.

Such repeats might also occur in the starry flounder genome. Although it is difficult to estimate

the length of the large EcoRI fragments near the top of the gel, they appear large enough to

contain several of the smaller SacI fragments. Migration distances in this area of the gel are

considerably compressed, such that 1 mm of migration may translate to variation in a fragment

length of 1 x 104 or 1 x 105 bp. As well, it is difficult to avoid shearing DNA of this size during

extraction and these DNA fragments may be much shorter than they really are in the genome.

The stripped and re-probed Southern blot of the QCI DNAs strongly suggests that genes

for the hyperactive isoform are present in the same 4.7 kb SacI fragment observed with the liver

probe, as well as a smaller 3.5 kb SacI fragment to which the liver probe hybridized weakly.

Consistent with this suggestion is the shift in the hybridization signal to larger fragments in the

EcoRI digests, some of which are clearly the same as those detected with the liver probe.

Another interpretation of these results could be that the probes cross-hybridize. We suggest this

is unlikely because, while both the liver and hyperactive probes are GC-rich (63% and 72%,

respectively), dot matrix comparisons (2 mismatches per 20 bp) show that only a few small

segments between them are well matched. As well, the hyperactive probe contains only mature

76

sequence and a 15 bp stretch of 3' UTR that is known to be conserved in the liver probe. A more

obvious argument against cross-hybridization is that the two probes detect completely different

restriction fragments in the winter flounder control DNAs. This result is consistent with what is

known about the organization of AFP genes in winter flounder. The 7.8 kb BamHI repeats,

which give rise to the 3 kb SacI fragments, contain only the liver isoform [32]. This is the first

time winter flounder DNA has been probed with the hyperactive AFP, and the hybridization

signal was surprising in the intensity of several different-sized bands. As well, the sequencing of

one winter flounder control showed two single nucleotide substitutions compared to wfh-AFP1.

Together, these results indicate that this isoform is also encoded by multiple genes.

The signal obtained from probing the blot a third time with the skin isoform gave a

hybridization pattern that appeared to be a combination of the two preceding blots. Here, the

possibility of cross-hybridization between skin and liver AFP probes is more likely because the

genes are similarly sized and because larger sections of the probes are conserved between

isoforms. For example, the liver probe ends with a portion of the 3' UTR that is highly conserved

across all three isoforms (positions 1 – 18, identical to wfl-AFP6 in Appendix C), while the skin

probe ends with a similar segment that is twice as long (positions 1 – 34, identical to stfs-AFP8 in

Appendix C). As well, the skin probe begins with a 31 bp sequence corresponding to the end of

the intron, a region that is also highly conserved across all three isoforms (data not shown).

Together, these two regions make up almost 34% of the total length of the skin probe.

Nevertheless, the intensity of the signal and the persistence of certain bands in all blots strongly

suggests that the skin, liver and hyperactive isoforms are closely linked in the same restriction

fragments in the starry flounder genome.

77

Our attempt to follow up on these results via genomic cloning was only partially

successful. The first lambda clone, which was selected with the hyperactive probe, produced

insert #1 and was sequenced in its entirety. It contained both a liver and a skin isoform in close

proximity, which is consistent with the idea that AFP genes are closely linked in the starry

flounder. However, the 4.7 kb SacI fragment was not present in the insert. The second lambda

clone that was sequenced, insert #2, was similar in this respect. It was isolated using a starry

flounder hyperactive probe and contained part of a hyperactive AFP gene and a different skin

variant, but was lacking a 4.7 kb SacI amplicon. These results led to suspicions that these repeats

might be incompatible with phage amplification and growth. This has been observed before,

particularly with the AFP complement in the wolffish (Anarhichas lupus) [103], and at that time,

extraordinary measures were required to obtain genomic clones of the repeated sequences. To

check on this possibility for the starry flounder, another member of the lab prepared several crude

phage cultures from phage plaques picked during the primary library screen. Three of the plaques

had been isolated using the winter flounder hyperactive probe and three had been isolated using

the liver probe. Southern blots revealed that none of these clones included the 4.7 kb SacI

fragment, but instead appeared to contain the ends of tandem arrays or linked but irregularly

spaced AFP genes (S. Gauthier, personal communication).

Previous work by the Davies research group has found that it is relatively easy to obtain

clones of linked but irregularly spaced AFP genes from species that have a tandemly repeated

gene cluster without ever isolating clones that contain the entire tandem array. This has occurred

while characterizing AFP genes from the wolffish [103] and the winter flounder [31]. In the

latter case, the AFP genes were missed from an initial winter flounder genomic library because it

was made with EcoRI-digested genomic DNA. Since most of the liver-specific AFP genes in

78

winter flounder are in tandem repeats that do not contain an EcoRI restriction site, the fragments

generated were far too big to be cloned into lambda phage. Lambda clones that were successfully

purified from this winter flounder library showed that the three main isoforms were indeed

linked, and two of the gene pairings were coincidentally a hyperactive isoform next to a skin

isoform, and a skin isoform adjacent to a liver isoform [31]. The results of the present

experiments suggest two possibilities: 1) that the tandem AFP array in starry flounder genome

contains all three isoforms in the same repeat unit, or 2) the genes for the three different isoforms

amplified separately to generate repeats of identical size and restriction site distribution. The

former is much more likely, and it may be worthwhile to isolate and sequence one of the 4.7 kb

SacI fragments to prove the point.

Genomic PCR followed by cloning and sequencing has been a useful way to sample

sequence variation and overcome the bias of missing the 4.7 kb SacI fragments from the lambda

library. Because these sequences were obtained directly from the starry flounder genome, the

genes missing from the lambda clones should be well represented. The cloning and sequencing

of the three isoforms has produced a great deal of structural information that can be used to test

the current model of structure-function relationships for the type I AFPs. However, it must be

stressed that some of these genomic sequences might not be functional. Random mutations that

prevent a gene from being expressed may lead to the accumulation of mutations in the coding

region, which would in turn complicate the structure-function interpretations. Similarly, an

expressed gene may acquire mutations that render it less functional, with similar effects on

structure-function studies. Despite that disclaimer, all of the starry flounder liver variants fit with

the current view that the type I AFP is an amphipathic helix with an Ala-rich ice-binding surface

immediately adjacent to a line of Thr residues. This hydrophobic surface is conserved, whereas

79

the opposite, hydrophilic side is more plastic and contains some features that help reinforce the

helix and increase its solubility. The compilation of numerous skin variants from the winter

flounder, starry flounder and American plaice resulted in similar conclusions. The N-terminal

amino acids including the Pro at position 4 were excluded from the helical wheel projections, as

these initial residues are likely involved in a helix-capping structure. Potentially, the more

informative sequences will be those of the hyperactive isoform. There is currently limited

sequence information for this isoform and only a very rough model of the dimer [65]. Here

sequence conservation between isoforms and orthologues will potentially reveal the ice-binding

site(s) and the dimerization interface, both of which are thought to run the entire length of this

approximately 200-residue alpha-helix.

The discovery of the skin and hyperactive isoforms has spurred a re-evaluation of the

phylogenetic Southern blot (Figure 3), which was also probed with the cDNA of wfl-AFP6. The

initial interpretation of this blot was that only a subset of the right-eye flounders – those in lanes

C8 to E3 showing strong hybridization signals – had AFP genes, and that the faint signals in other

fishes were non-specific due to the GC-richness of the probe (P. Davies, personal

communication). This suggested that the type I AFP gene was very recent in its origin.

However, with the discovery of the new isoforms, the fainter bands in A3 to C6 and E6 to F6

could now be interpreted as skin or hyperactive AFP genes in species that lack the liver isoform.

The liver isoform may well have arisen recently from the splicing of a liver gene-specific signal

peptide exon onto a 3’ exon that is a derivative of a skin or hyperactive AFP gene.

The extent to which the skin AFPs alone can protect a fish is unknown, as is whether the

hyperactive isoform can replace the smaller liver AFPs. Studies on American plaice showed that

80

the hyperactive AFPs are the only type present in the plasma [71]. We therefore decided to use

universal PCR primers for the skin isoform on the American plaice genome to determine if these

AFPs were present. This strategy was extremely effective and produced many unique sequences.

To further support the hypothesis that the liver isoform evolved recently from the older skin or

hyperactive isoforms, one could use PCR to verify that the presence of liver AFP isoforms is

limited to that subset of closely related flounders that hybridize strongly to the liver probe. As

well, PCR could be used to look for skin and hyperactive isoforms in the other flounders on the

phylogenetic blot that show signals akin to those of the American plaice (B3). If present, this

would support the hypothesis that the liver AFP developed from one of the other isoforms. If this

fusion of the 3' coding region to a 5' signal sequence driven by a liver-specific promoter/enhancer

occurred then it might have happened in the common ancestor to C8 to E3.

4.1 Conclusions

It has been nearly 40 years since the discovery of AFPs in flounder, yet their evolutionary

origin remains a mystery. In an attempt to test the hypothesis that these proteins arose due to the

acute selection pressure of icy seawater, I examined a close relative of the well-studied winter

flounder, the starry flounder. I have shown that its genome contains a large family of genes for

the type I AFPs, including multiple variants of each of the liver, skin and hyperactive isoforms.

Many regulatory elements in the starry flounder proteins are highly similar to other well-

characterized type I AFPs, although their expression and functionality require characterization

through transcriptional analysis and antifreeze activity assays. Using Southern blotting and

genomic cloning, I established that the three AFP isoforms in the starry flounder are interspersed

81

and are likely closely linked in a tandem array. Future screening of an unamplified genomic

library will be required to determine the nature of the gene clusters. The refinement here of a

PCR-based gene-sampling approach has allowed rapid detection of hyperactive and skin AFP

sequences in two other closely related flounders, and further screening of other pleuronectid fish

will no doubt provide insight as to the impetus and direction behind this evolutionary innovation.

82

References

1. Volff, J.-N. (2005). Genome evolution and biodiversity in teleost fish. Heredity. 94(3):280-294.

2. Scholander, P.F., van Dam, L., Kanwisher, J.W., Hammel, H.T., and Gordon, M.S. (1957). Supercooling and osmoregulation in arctic fish. Journal of Cellular and Comparative Physiology. 49(1):5-24.

3. Green, J.M. (1974). A localized mass winter kill of cunners in Newfoundland. Canadian Field-Naturalist. 88:1.

4. DeVries, A.L. and Wohlschlag, D.E. (1969). Freezing resistance in some Antarctic fishes. Science. 163(3871):1073-1075.

5. Fletcher, G.L., Hew, C.L., and Davies, P.L. (2001). Antifreeze proteins of teleost fishes. Annual Review of Physiology. 63:359-390.

6. Raymond, J.A. and DeVries, A.L. (1977). Adsorption inhibition as a mechanism of freezing resistance in polar fishes. Proceedings of the National Academy of Sciences of the USA. 74(6):2589-2593.

7. Wilson, P.W. (1993). Explaining thermal hysteresis by the Kelvin effect. Cryo-Letters. 14:31-36.

8. Duman, J.G. and DeVries, A.L. (1974). Freezing resistance in winter flounder Pseudopleuronectes americanus. Nature. 247(5438):237-238.

9. Slaughter, D., Fletcher, G.L., Ananthanarayanan, V.S., and Hew, C.L. (1981). Antifreeze proteins from the sea raven, Hemitripterus americanus: Further evidence for diversity among fish polypeptide antifreezes. Journal of Biological Chemistry. 256(4):2022-2026.

10. Hew, C.L., Slaughter, D., Joshi, S., Fletcher, G.L., and Ananthanarayanan, V.S. (1984). Antifreeze polypeptides from the Newfoundland ocean pout, Macrozoarces americanus: Presence of multiple and compositionally diverse components. Journal of Comparative Physiology B. 155(1):81-88.

11. Deng, G., Andrews, D.W., and Lauersen, R.A. (1997). Amino acid sequence of a new type of antifreeze protein, from the longhorn sculpin Myoxocephalus octodecimspinosis. FEBS Letters. 402(1):17-20.

12. Ananthanarayanan, V.S. and Hew, C.L. (1977). Structural studies on the freezing-point-depressing protein of the winter flounder Pseudopleuronectes americanus. Biochemical and Biophysical Research Communications. 74(2):685-689.

83

13. Davies, P.L., Roach, A.H., and Hew, C.L. (1982). DNA sequence coding for an antifreeze protein precursor from winter flounder. Proceedings of the National Academy of Sciences of the USA. 79(2):335-339.

14. Hew, C.L. and Yip, C. (1976). The synthesis of freezing-point-depressing protein of the winter flounder Pseudopleuronectes americanus in Xenopus laevis oocytes. Biochemical and Biophysical Research Communications. 71(3):845-850.

15. Davies, P.L. and Hew, C.L. (1980). Isolation and characterization of the antifreeze protein messenger RNA from the winter flounder. Journal of Biological Chemistry. 255(18):8729-8734.

16. Pickett, M.H., Scott, G.K., Davies, P.L., Wang, N.-C., Joshi, S., and Hew, C.L. (1984). Sequence of an antifreeze protein precursor. European Journal of Biochemistry. 143(1):35-38.

17. Gourlie, B., Lin, Y., Price, J., DeVries, A.L., Powers, D., and Huang, R.C.C. (1984). Winter flounder antifreeze proteins: a multigene family. Journal of Biological Chemistry. 259(23):14960-14965.

18. Hew, C.L., Liunardo, N., and Fletcher, G.L. (1978). In vivo biosynthesis of the antifreeze protein in the winter flounder - evidence for a larger precursor. Biochemical and Biophysical Research Communications. 85(1):421-427.

19. Kenward, K.D., Altschuler, M., Hildebrand, D., and Davies, P.L. (1993). Accumulation of type I fish antifreeze protein in transgenic tobacco is cold-specific. Plant Molecular Biology. 23(2):377-385.

20. Yang, D.S.C., Sax, M., Chakrabartty, A., and Hew, C.L. (1988). Crystal structure of an antifreeze polypeptide and its mechanistic implications. Nature. 333(6170):232-237.

21. Scott, G.K., Davies, P.L., Shears, M.A., and Fletcher, G.L. (1987). Structural variations in the alanine-rich antifreeze proteins of the Pleuronectinae. European Journal of Biochemistry. 168(3):629-633.

22. Sicheri, F. and Yang, D.S.C. (1995). Ice-binding structure and mechanism of an antifreeze protein from winter flounder. Nature. 375:427-431.

23. Duman, J.G. and DeVries, A.L. (1976). Isolation, characterization, and physical properties of protein antifreezes from the winter flounder, Pseudopleuronectes americanus. Comparative Biochemistry and Physiology Part B: Biochemistry and Molecular Biology. 54(3):375-380.

24. DeVries, A.L. and Lin, Y. (1977). Structure of a peptide antifreeze and mechanism of adsorption to ice. Biochimica et Biophysica Acta. 495(2):388-392.

25. Wen, D. and Laursen, R.A. (1992). A model for binding of an antifreeze polypeptide to ice. Biophysical Journal. 63:1659-1662.

84

26. Knight, C.A., Cheng, C.-H.C., and DeVries, A.L. (1991). Adsorption of alpha-helical antifreeze peptides on specific ice crystal surface planes. Biophysical Journal. 59:409-418.

27. Baardsnes, J., Kondejewski, L.H., Hodges, R.S., Chao, H., Kay, C., and Davies, P.L. (1999). New ice-binding face for type I antifreeze protein. FEBS Letters. 463(1-2):87-91.

28. Davies, P.L. and Hew, C.L. (1990). Biochemistry of fish antifreeze proteins. FASEB Journal. 4(8):2460-2468.

29. Knight, C.A., DeVries, A.L., and Oolman, L.D. (1984). Fish antifreeze protein and the freezing and recrystallization of ice. Nature. 308(5956):295-296.

30. Low, W.K., Lin, Q., Stathakis, C., Miao, M., Fletcher, G.L., and Hew, C.L. (2001). Isolation and characterization of skin-type, type I antifreeze polypeptides from the longhorn sculpin, Myoxocephalus octodecemspinosus. Journal of Biological Chemistry. 276(15):11582-11589.

31. Davies, P.L., Hough, C., Scott, G.K., Ng, N., White, B.N., and Hew, C.L. (1984). Antifreeze protein genes of the winter flounder. Journal of Biological Chemistry. 259(14):9241-9247.

32. Scott, G.K., Hew, C.L., and Davies, P.L. (1985). Antifreeze protein genes are tandemly liked and clustered in the genome of the winter flounder. Proceedings of the National Academy of Sciences of the USA. 82(9):2613-2617.

33. Low, W.K., Miao, M., Ewart, K.V., Yang, D.S.C., Fletcher, G.L., and Hew, C.L. (1998). Skin-type antifreeze protein from the shorthorn sculpin, Myoxocephalus scorpius: Expression and characterization of a Mr 9, 700 recombinant protein. Journal of Biological Chemistry. 273(36):23098-23103.

34. Gauthier, S.Y., Wu, Y., and Davies, P.L. (1990). Nucleotide sequence of a variant antifreeze protein gene. Nucleic Acids Research. 18(17):5303.

35. Chao, H., Hodges, R.S., Kay, C.M., Gauthier, S.Y., and Davies, P.L. (1996). A natural variant of Type I antifreeze protein with four ice-binding repeats is a particularly potent antifreeze. Protein Science. 5(6):1150-1156.

36. Fourney, R.M., Joshi, S., Kao, M.H., and Hew, C.L. (1984). Heterogeneity of antifreeze polypeptides from the Newfoundland winter flounder, Pseudopleuronectes americanus. Canadian Journal of Zoology. 62(1):28-33.

37. Griffiths, A.J.F., Miller, J.H., Suzuki, D.T., Lewontin, R.C., and Gelbart, W.M. (2000) An Introduction to Genetic Analysis, 7 ed.W.H. Freeman and Company: New York. p. 860.

38. Davies, P.L. (1992). Conservation of antifreeze protein-encoding genes in tandem repeats. Gene. 112(2):163-170.

85

39. Rancourt, D.E., Walker, V.K., and Davies, P.L. (1987). Flounder antifreeze protein synthesis under heat shock control in transgenic Drosophila melanogaster. Molecular and Cellular Biology. 7(6):2188-2195.

40. Chan, S.-L., Miao, M., Fletcher, G.L., and Hew, C.L. (1997). The role of CCAAT/enhancer-binding protein alpha and a protein that binds to the activator-protein-1 site in the regulation of liver-specific expression of the winter flounder antifreeze protein gene. European Journal of Biochemistry. 247(1):44-51.

41. Fletcher, G.L. (1981). Effects of temperature and photoperiod on the plasma freezing point depression, Cl− concentration, and protein "antifreeze" in winter flounder. Canadian Journal of Zoology. 59(2):193-201.

42. Fourney, R.M., Fletcher, G.L., and Hew, C.L. (1984). Accumulation of winter flounder messenger RNA after hypophysectomy. General and Comparative Endocrinology. 54(3):392-401.

43. Fletcher, G.L., Idler, D.R., Vaisius, A., and Hew, C.L. (1989). Hormonal regulation of antifreeze protein gene expression in winter flounder. Fish Physiology and Biochemistry. 7:387-393.

44. Slaughter, D. and Hew, C.L. (1982). Radioimmunoassay for the antifreeze polypeptides of the winter flounder: seasonal profile and immunological cross-reactivity with other fish antifreezes. Canadian Journal of Biochemistry. 60(8):824-829.

45. Duncker, B.P., Koops, M.D., Walker, V.K., and Davies, P.L. (1995). Low temperature persistence of type I antifreeze protein is mediated by cold-specific mRNA stability. FEBS Letters. 377(2):185-188.

46. Gong, Z., King, M.J., Fletcher, G.L., and Hew, C.L. (1995). The antifreeze protein genes of the winter flounder, Pleuronectus americanus, are differentially regulated in liver and non-liver tissues. Biochemical and Biophysical Research Communications. 206(1):387-392.

47. Schneppenheim, R. and Theede, H. (1982). Freezing-point depressing peptides and glycoproteins from arctic-boreal and antarctic fish. Polar Biology. 1(2):115-123.

48. Valerio, P.F., Kao, M.H., and Fletcher, G.L. (1990). Thermal hysteresis activity in the skin of the cunner, Tautogolabrus adspersus. Canadian Journal of Zoology. 68(5):1065–1067.

49. Valerio, P.F., Kao, M.H., and Fletcher, G.L. (1992). Fish skin: An effective barrier to ice crystal propagation. Journal of Experimental Biology. 164:135-151.

50. Gong, Z., Fletcher, G.L., and Hew, C.L. (1992). Tissue distribution of fish antifreeze protein mRNAs. Canadian Journal of Zoology. 1992(4):810-814.

86

51. Gong, Z., Ewart, K.V., Hu, Z., Fletcher, G.L., and Hew, C.L. (1996). Skin antifreeze protein genes of the winter flounder, Pleuronectes americanus, encode distinct and active polypeptides without the secretory signal and prosequences. Journal of Biological Chemistry. 271(8):4106-4112.

52. Davies, P.L. and Gauthier, S.Y. (1992). Antifreeze protein pseudogenes. Gene. 112:171-178.

53. Low, W.K., Lin, Q., Ewart, K.V., Fletcher, G.L., and Hew, C.L. (2002). The skin-type antifreeze polypeptides: A new class of type I AFPs, in Fish Antifreeze Proteins, K.V. Ewart and C.L. Hew, eds. World Scientific Publishing Co. Pte. Ltd.: River Edge, NJ. p.161-186.

54. Lin, Q., Ewart, K.V., Yan, Q., Wong, W.K., Yang, D.S.C., and Hew, C.L. (1999). Secretory expression and site-directed mutagenesis studies of the winter flounder skin-type antifreeze polypeptides. European Journal of Biochemistry. 264:49-54.

55. Low, W.K., Lin, Q., and Hew, C.L. (2003). The role of N and C termini in the antifreeze activity of winter flounder (Pleuronectes americanus) antifreeze proteins. Journal of Biological Chemistry. 278(12):10334-10343.

56. Murray, H.M., Hew, C.L., and Fletcher, G.L. (2003). Spatial expression patterns of skin-type antifreeze protein in winter flounder (Pseudopleuronectes americanus) epidermis following metamorphosis. Journal of Morphology. 257(1):78-86.

57. Mignatti, P., Morimoto, T., and Rifkin, D.B. (1992). Basic fibroblast growth factor, a protein devoid of secretory signal sequence, is released from cells via a pathway independent of the ER-Golgi complex. Journal of Cellular Physiology. 151(1):81-93.

58. Nickel, W. (2003). The mystery of nonclassical protein secretion. European Journal of Biochemistry. 270(10):2109-2119.

59. Miao, M., Chan, S.-L., Hew, C.L., and Gong, Z. (1998). The skin-type antifreeze protein gene intron of the winter flounder is a ubiquitous enhancer lacking a functional C/EBP binding motif. FEBS Letters. 426(1):121-125.

60. Murray, H.M., Hew, C.L., and Fletcher, G.L. (2002). Skin-type antifreeze protein expression in integumental cells of larval winter flounder. Journal of Fish Biology. 60(6):1391-1406.

61. Pereira, J.J., Goldberg, R., Ziskowski, J.J., Berrien, P.L., Morse, W.W., and Johnson, D.L., Essential fish habitat source document: Winter flounder, Pseudopleuronectes americanus, life history and habitat characteristics, U.D.o. Commerce, Editor. 1999, US Department of Commerce: Woods Hole, MA. p. 48.

62. Scott, G.K., Davies, P.L., Kao, M.H., and Fletcher, G.L. (1988). Differential amplification of antifreeze protein genes in the Pleuronectinae. Journal of Molecular Evolution. 27(1):29-35.

87

63. Marshall, C.B., Fletcher, G.L., and Davies, P.L. (2004). Hyperactive antifreeze protein in a fish. Nature. 429(6988):153.

64. Marshall, C.B., Chakrabartty, A., and Davies, P.L. (2005). Hyperactive antifreeze protein from winter flounder is a very long rod-like dimer of alpha-helices. Journal of Biological Chemistry. 280(18):17920-17929.

65. Graham, L.A., Marshall, C.B., Lin, F.-H., Campbell, R.L., and Davies, P.L. (2008). Hyperactive antifreeze protein from fish contains multiple ice-binding sites. Biochemistry. 47(7):2051-2063.

66. Sakamoto, K. (1984). Interrelationships of the family Pleuronectidae. PhD thesis from the Department of Fisheries Science, Hokkaido University, Hokkaido, Japan. p. 95-215.

67. Duman, J.G. and DeVries, A.L. (1975). The role of macromolecular antifreezes in cold water fishes. Comparative Biochemistry and Physiology Part A: Physiology. 52(1):193-199.

68. Berendzen, P.B. and Dimmick, W.W. (2002). Phylogenetic relationships of pleuronectiformes based on molecular evidence. Copeia. 3(3):642–652.

69. Cooper, J.A. and Chapleau, F. (1998). Monophyly and intrarelationships of the family Pleuronectidae (Pleuronectiformes), with a revised classification. Fishery Bulletin. 96(4):686-726.

70. Johnson, D.L., Morse, W.W., Berrien, P.L., and Vitaliano, J.J., Essential fish habitat source document: Yellowtail flounder, Limanda ferruginea, life history and habitat characteristics. 1999, US Department of Commerce: Woods Hole, MS. p. 38.

71. Gauthier, S.Y., Marshall, C.B., Fletcher, G.L., and Davies, P.L. (2005). Hyperactive antifreeze protein in flounder species: The sole freeze protectant in American plaice. FEBS Journal. 272:4439-4449.

72. Johnson, D.L., Morse, W.W., Berrien, P.L., and Vitaliano, J.J., Essential fish habitat source document: American plaice, Hippoglossoides platessoides, life history and habitat characteristics, 1st ed. 1999, US Department of Commerce: Woods Hole, MS. p. 40.

73. Goddard, S.V. and Fletcher, G.L. (2002). Physiological ecology of antifreeze proteins - a northern perspective, in Fish Antifreeze Proteins, K.V. Ewart and C.L. Hew, eds. World Scientific: River Edge, NJ. p.17-60.

74. DeVries, A.L. (1983). Antifreeze peptides and glycopeptides in cold-water fishes. Annual Review of Physiology. 45:245-260.

75. Mouches, C., Pasteur, N., Berge, J.B., Hyrien, O., Raymond, M., de Saint Vincent, B.R., de Silvestri, M., and Georghiou, G.P. (1986). Amplification of an esterase gene is responsible for insecticide resistance in a California Culex mosquito. Science. 233(4765):778-780.

88

76. Ohno, S. (1970) Evolution by gene duplication.Springer-Verlag: New York. p. 160.

77. Alt, F.W., Kellems, R.E., Bertino, J.R., and Schimke, R.T. (1978). Selective multiplication of dihydrofolate reductase genes in methotrexate-resistant variants of cultured murine cells. Journal of Biological Chemistry. 253(5):1357-1370.

78. Scott, G.K., Fletcher, G.L., and Davies, P.L. (1986). Fish antifreeze proteins: recent gene evolution. Canadian Journal of Fisheries and Aquatic Sciences. 43(5):1028-1034.

79. Maisey, J.G. (1996) Discovering Fossil Fishes, 1 ed.Henry Holt & Company. p. 223.

80. Hurley, I.A., Mueller, R.L., Dunn, K.A., Schmidt, E.J., Friedman, M., Ho, R.K., Prince, V.E., Yang, Z., Thomas, M.G., and Coates, M.I. (2007). A new time-scale for ray-finned fish evolution. Proceedings of the Royal Society B: Biological Sciences. 274(1609):489-498.

81. Greenwood, P.H., Rosen, D.E., Weitzman, S.H., and Myers, G.S. (1966). Phyletic studies of teleostean fishes, with a provisional classification of living forms. Bulletin of the American Museum of Natural History. 141:341-455.

82. Munroe, T.A. (2005). Systematic diversity of the Pleuronectiformes, in Flatfishes: Biology and Exploitation, R.N. Gibson, ed Blackwell Publishing: Ames, IA. p.10-41.

83. Friedman, M. (2008). The evolutionary origin of flatfish asymmetry. Nature. 454(7201):209-212.

84. Trevisani, E., Papazzoni, C.A., Ragazzi, E., and Roghi, G. (2005). Early Eocene amber from the “Pesciara di Bolca” (Lessini Mountains, Northern Italy). Palaeogeography, Palaeoclimatology, Palaeoecology. 223(3-4):260-274.

85. Sellwood, B.W. and Valdes, P.J. (2006). Mesozoic climates: General circulation models and the rock record. Sedimentary Geology. 190(1-4):269-287.

86. Sluijs, A., Schouten, S., Pagani, M., Woltering, M., Brinkhuis, H., Sinninghe Damste, J.S., Dickens, G.R., Huber, M., Reichart, G.-J., Stein, R., Matthiessen, J., Lourens, L.J., Pedentchouk, N., Backman, J., and Morimoto, T. (2006). Subtropical Arctic Ocean temperatures during the Palaeocene/Eocene thermal maximum. Nature. 441(7093):610-613.

87. Moran, K., Backman, J., Brinkhuis, H., Clemens, S.C., Cronin, T., Dickens, G.R., Eynaud, F., Gattacceca, J., Jakobsson, M., Jordan, R.W., Kaminski, M., King, J., Koc, N., Krylov, A., Martinez, N., Matthiessen, J., McInroy, D., Moore, T.C., Onodera, J., O'Regan, M., Pälike, H., Rea, B., Rio, D., Sakamoto, T., Smith, D.C., Stein, R., St John, K., Suto, I., Suzuki, N., Takahashi, K., Watanabe, M., Yamamoto, M., Farrell, J., Frank, M., Kubik, P., Jokat, W., and Kristoffersen, Y. (2006). The Cenozoic palaeoenvironment of the Arctic Ocean. Nature. 441(7093):601-605.

89

88. Hew, C.L., Wang, N.-C., Joshi, S., Fletcher, G.L., Scott, G.K., Hayes, P.H., Buettner, B., and Davies, P.L. (1988). Multiple genes provide the basis for antifreeze protein diversity and dosage in the ocean pout, Macrozoarces americanus. Journal of Biological Chemistry. 263(24):12049-12055.

89. Hayes, P.H., Davies, P.L., and Fletcher, G.L. (1991). Population differences in antifreeze protein gene copy number and arrangement in winter flounder. Genome. 34:174-177.

90. Orcutt, H.G., The life history of the starry flounder, Platichthys stellatus (Pallas). 1950, State of California, Department of Natural Resources, Division of Fish and Game Bureau of Marine Fisheries. p. 64 pp.

91. Eschmeyer, W.N. (1983) A field guide to Pacific Coast fishes of North America: from the Gulf of Alaska to Baja, California. The Peterson field guide series.Houghton Mifflin: Boston.

92. Blin, N. and Stafford, D.W. (1976). A general method for isolation of high molecular weight DNA from eukaryotes. Nucleic Acids Research. 3(9):2303-2308.

93. Sambrook, J. and Russell, D.W. (2001) Molecular Cloning: A Laboratory Manual, 3rd ed.Cold Spring Harbor Laboratory Press: Cold Spring Harbor, NY.

94. Bendtsen, J.D., Nielsen, H., von Heijne, G., and Brunak, S. (2004). Improved prediction of signal peptides: SignalP 3.0. Journal of Molecular Biology. 340(4):783-795.

95. Reese, M.G. Neural Network Promoter Prediction Tool, v.2.2. 1999 9 May 2008 [cited 6 November 2008; Available from: http://www.fruitfly.org/seq_tools/promoter.html.

96. Hinegardner, R. and Rosen, D.E. (1972). Cellular DNA content and the evolution of teleostean fishes. American Naturalist. 106(951):621-644.

97. Douglas, S.E., Patrzykat, A., Pytyck, J., and Gallant, J.W. (2003). Identification, structure and differential expression of novel pleurocidins clustered on the genome of the winter flounder, Pseudopleuronectes americanus (Walbaum). European Journal of Biochemistry. 270(18):3720-3730.

98. Cunningham, D.F. and O'Connor, B. (1997). Proline specific peptidases. Biochimica et Biophysica Acta. 1343(2):160-186.

99. Lambeir, A.-M., Durinx, C., Scharpé, S., and De Meester, I. (2003). Dipeptidyl-peptidase IV from bench to bedside: An update on structural properties, functions, and clinical aspects of the enzyme DPP IV. Critical Reviews in Clinical Laboratory Sciences. 40(3):209-295.

100. Breathnach, R. and Chambon, P. (1981). Organization and expression of eucaryotic split genes coding for proteins. Annual Review of Biochemistry. 50:349-383.

90

101. Fletcher, G.L., Hew, C.L., Li, X.M., Haya, K., and Kao, M.H. (1985). Year-round presence of high levels of plasma antifreeze peptides in a temperate fish, ocean pout (Macrozoarces americanus). Canadian Journal of Zoology. 63(3):488-493.

102. Bergstrom, C.A. (2007). Morphological evidence of correlational selection and ecological segregation between dextral and sinistral forms in a polymorphic flatfish, Platichthys stellatus. Journal of Evolutionary Biology. 20(3):1104–1114.

103. Scott, G.K., Hayes, P.H., Fletcher, G.L., and Davies, P.L. (1988). Wolffish antifreeze protein genes are primarily organized as tandem repeats that each contain two genes in inverted orientation. Molecular and Cellular Biology. 8(9):3670-3675.

91

Appendix A

DNA alignment of winter and starry flounder liver AFPs

An alignment of the starry flounder liver AFP from lambda insert #1 (stfl-AFP1, Figure

10) and the winter flounder liver variant wfl-AFP6 is shown. Identical bases are marked with a

dot (.) and gaps with a hyphen (-). Intronic and flanking sequences are in lower case, while

exonic sequence is in upper case. The coding region is bolded with the translated sequences of

stfl-AFP1 or wfl-AF6 above or below the respective gene sequence, and silent substitutions are

underlined. The number of nucleotides is noted at the end of each row. The core promoter

sequences ( | ), the transcriptional start site (+), and polyadenylation signals (#) are marked based

on previous characterization of wfl-AFP6. The intronic enhancer, Element B, is in white text and

highlighted in black [40]. The Genbank accession number for wfl-AFP6 is M62415.

92

||||| stfl-AFP1 aacaaaactgggggagtgttgtaccaatctgctcagattggtcgacagtc 50 wfl-AFP6 c....c............................................ 50 ||||||| stfl-AFP1 aagcgatgactcaggctccatttactacaaaacagactcacattcgcctg 100 wfl-AFP6 ..........c.........g......t........t.......ga.... 100 + stfl-AFP1 tatctttACCACATCTTCCTTTTGTAGTGAAGCAGTGCTCCCTAAAAACT 150 wfl-AFP6 g..a..c...........A............C............C..GT. 150 M A L S L F T V G Q F I F L F stfl-AFP1 CTCAAAATGGCTCTCTCACTCTTCACCGTCGGACAATTCATTTTCTTATT 200 wfl-AFP6 ....................T.....T...........G........... 200 M A L S L F T V G Q L I F L F W T I R stfl-AFP1 TTGGACAATAAGgtacgtgaacactcactttgtttcttctataaatctgg 250 wfl-AFP6 .........G................................g....... 250 W T M R stfl-AFP1 ttttactgtaaatatcttgggaaggaaggaatgatatctgcattatccca 300 wfl-AFP6 ...............................g.................c 300 stfl-AFP1 gaggggcaggcacgtgcacagatattttggggggcaagtgctaaaacccc 350 wfl-AFP6 .......catttgt.tt....cc.gcggt.-------------...gatg 337 stfl-AFP1 caaaaaagggcacccatcgccaaaatgtaaagctgacaacacagtacaca 400 wfl-AFP6 a.g.tcttcatc.gtg.tcatctgt.tg.ccctgatt......ag.tggt 387 stfl-AFP1 caca------atcttttttttactgttttctgacagagttgaagtattag 444 wfl-AFP6 ....tggacc......a...aca.aa.g.t.c.tcagcacttcc.g..tt 437 stfl-AFP1 cagcattacactaataatatgccacaatatacctcaccagactggcatat 494 wfl-AFP6 ....ccg.a...t.a.gagg-----------.....tgga.actt.c.ga 476 stfl-AFP1 agctcaacttaagacctaccgttcatttcaataattacttgattatttga 544 wfl-AFP6 t.a..tgg.g.c-....g.t.g.tgaagg..ac.gagt....gaggcg.c 525 stfl-AFP1 attttaaatgaaacaaaactagagaacttgtctgatttgtagaacagtaa 594 wfl-AFP6 .gaaa....t.tttt.gtt.ga.tg.agaag...tca.t.gatttca.gt 575 stfl-AFP1 aactgcctttaatttctatcacacacagatattgaacactgtcatcactg 644 wfl-AFP6 tggg.ggggggggggtc..................ta............. 625 stfl-AFP1 ggttcggtgaaagtgacggaccagtaaatgttgtgatatataatattatc 694 wfl-AFP6 a...t............................................. 675

93

I S E A N stfl-AFP1 atc---atttcaataataccattaatctctgcagAATCAGTGAAGCCAAC 741 wfl-AFP6 ..aata...at............................C........GA 725 I T E A R P D P A A K A A A V A D P A A A A stfl-AFP1 CCCGACCCCGCAGCCAAAGCTGCCGCAGTCGCCGACCCTGCCGCAGCCGC 791 wfl-AFP6 ....................C...------------..A..A.....T.. 763 P D P A A K A A P A A A A * V A P A A D A F S A A A D T A S stfl-AFP1 TGTAGCCCCCGCCGCTGACGCCTTCTCAGCAGCCGCCGACACTGCCTCTG 841 wfl-AFP6 C.CC---------------C..GC.G....C...C.A.....C....... 798 A P A A A A P D T A S * * * * * D A A A A A A A T A A A A K A A A stfl-AFP1 ACGCCGCCGCCGCCGCCGCCGCCACCGCCGCTGCCGCCAAAGCCGCCGCA 891 wfl-AFP6 ..........T..A......CTT........CAA............T..C 848 D A A A A A A L T A A N A K A A A * * * * E K T A R D A A A A A A A T A R G stfl-AFP1 GAAAAAACCGCCCGGGACGCCGCTGCAGCAGCCGCAGCCACCGCCAGAGG 941 wfl-AFP6 ...CTC..T...GCCA.......C..C..C..A................. 898 E L T A A N A A A A A A A T A R G stfl-AFP1 TTAAGGATCGTGGTCGTCTTGATGTGGGATCATGTGAACATCTGAGCAGC 991 wfl-AFP6 .................................................. 948 ###### stfl-AFP1 GAGATGTTACCAATCTGCTGAATAAACCTGAGAAGCTGTTTGTTGAaaac 1041 wfl-AFP6 ......................................A.....A..... 998 stfl-AFP1 caagtgtccagttcatttcatctctgaaactccttcacagtttctgtaga 1091 wfl-AFP6 .............................t...................g 1048 stfl-AFP1 tcatgtttttaacacataaacctccagaaatcatgatgcgtcacgtttgg 1141 wfl-AFP6 ...g..-------------------------------------------- 1054 stfl-AFP1 actttgggttagaataaaatgacggactgcagctacataagatatgatat 1191 wfl-AFP6 -------------------------------------------------- 1054 stfl-AFP1 gttagtgatcttaaagaggttcttgtttccattatgctaagctaacagtt 1241 wfl-AFP6 -------------------------------------------------- 1054

94

stfl-AFP1 catatttacacgtagactccaggaagtgatgccattgtgctgcctgaaac 1297 wfl-AFP6 -------------.............................tt--g... 1089 stfl-AFP1 ctgcaggtctacaaggtttcataactgatttagattttaaaatactgact 1347 wfl-AFP6 ......................................t........... 1139 stfl-AFP1 aattattcacattttcgttctcaccagctctatgagtatttctccttcaa 1397 wfl-AFP6 t...............a................................. 1189 stfl-AFP1 gtacagatgtggacagtgttggaggaa-tcctgaagtttagtacttaagt 1446 wfl-AFP6 ..................g........g.a.................... 1239 stfl-AFP1 aaaagtacaagtacccaggaaaatatatacttaagtaaaagtaaaagtac 1496 wfl-AFP6 ...........a....t................................. 1289 ###### stfl-AFP1 tacatcaacaatcctacttatttaaaagtaaaaagtacttacttttaaat 1546 wfl-AFP6 ....................a..............g.............. 1339 stfl-AFP1 ttactataagtattataagtaaaagtattgacgcaatgggttgcctctca 1596 wfl-AFP6 ...............a...........c.c.................... 1389 stfl-AFP1 atgtctaggctgtgccattttgataaagaatgcatatatagctactggta 1646 wfl-AFP6 ..................................g............... 1439 stfl-AFP1 atactcatgcctctacagatgtcactactagtaataattataagcaacaa 1696 wfl-AFP6 ..................................c............... 1489 stfl-AFP1 catttgtttattggaaaggttggtgtacttattgtgcttaccctctgtaa 1746 wfl-AFP6 ......................t........................... 1539 stfl-AFP1 cactgttcacactctatcttacaattctgcggatgacaagttatctacca 1796 wfl-AFP6 t............................g......g............. 1589 ###### stfl-AFP1 gggattatctgcaaagttaaaaccattaagacaaatcaataaagacaaca 1846 wfl-AFP6 ..........................c..ct....c.............. 1639 stfl-AFP1 agttatatctttaaat--cttatatttaattgtaagtgtgtaaaaaatgg 1894 wfl-AFP6 t....a..........gtt..................a.t.......... 1689 stfl-AFP1 aaacatggaacatgaaaaacaac--taaaactggtcagaacaaggcagat 1942 wfl-AFP6 .......................tg.....................c... 1739

95

stfl-AFP1 cttagaatgagaaaatttaaa-tgaaggaccttgaaatgaaaatttgagc 1991 wfl-AFP6 ............c-.......a..........................c. 1788 stfl-AFP1 attggtggtttagactcaggcagtcaaaatattcatcttctgaatatttt 2041 wfl-AFP6 .....................................c............ 1838 stfl-AFP1 tgaaaggaaggagaaatgttatacttttattttgaaaaggtagttcctga 2091 wfl-AFP6 -................................................. 1887 stfl-AFP1 aaaacgaaaaaggtcgctaaatggcaggtgttcctcgttgttgcatgaaa 2141 wfl-AFP6 ............a..............---.g..a........------- 1927 stfl-AFP1 aagcattacgcttagtt--gtttagcacctggctgataaaggcacaagca 2189 wfl-AFP6 ---..............gtt.......ggtg.a.acg.g...atgctgca 1974

96

Appendix B

DNA alignment of winter and starry flounder skin AFPs

An alignment of the starry flounder skin AFP from lambda insert #1 (stfs-AFP1, Figure 10) and

the winter flounder skin variant wfs-11-3 is shown. Identical bases are marked with a dot (.) and

gaps with a hyphen (-). Intronic and flanking sequences are in lower case, while exonic sequence

is in upper case. The coding region is bolded with the translated sequences of stfs-AFP1 or wfs-

11-3 above or below the respective gene sequence, and silent substitutions are underlined. The

number of nucleotides is noted at the end of each row. The putative TFIID binding site ( | ), the

transcriptional start sites (+), and the polyadenylation signal (#) are marked based on previous

characterization of the winter flounder skin AFPs. The intronic enhancer, Element S, is in white

text and highlighted in black [59]. The Genbank accession number for wfs-11-3 is M63478.

97

stfs-AFP1 ttacaaaacaagttcatactggccaggatgttcgccacaccttccttttg 50 wfs-11-3 ......................-----...g.t.............g... 45 | stfs-AFP1 ttgtgaaccagtcggagccgacaacctgctgcgtcgcaaacttgaagtga 100 wfs-11-3 a.....................gc...........a.g..a.ca...... 95 |||||| ++ stfs-AFP1 ataaataagagctgctccctaaaagttttcatcaggactcacacACTTTT 150 wfs-11-3 .......gag...............................A........ 145 stfs-AFP1 CACTGTCGAACACTCAGgtacgtgaacactcactttgtttctcctacaaa 200 wfs-11-3 .........C.................................a...... 195 stfs-AFP1 tctggttt-actgtaaatatcttgggaaggaaggaaggatatctgcatta 249 wfs-11-3 ........t......................................... 245 stfs-AFP1 tcccagaggggccatttgttttacagccagtggtaaaagttgaagatctt 299 wfs-11-3 ...t-.........................c...g....a.......... 294 stfs-AFP1 catctgtgttcgtcggatggaaagtttgttctgaaaccttcagtggaagt 349 wfs-11-3 ....ca........t..................................- 343 stfs-AFP1 gtagtatattccccttagcaaatatccatagccttgaatcttaagttcaa 399 wfs-11-3 -------------------------------------------------- 343 stfs-AFP1 acctttaagtattatctccagatgtgttcagtgtgtgtctccttgtctga 449 wfs-11-3 -------------------------------------------------- 343 stfs-AFP1 actatccttgaactgcctatggaataatgagaggagagatggtttccagc 499 wfs-11-3 -------------------------------------------------- 343 stfs-AFP1 gggtccttaaatcttaaggtacgacacattcccaaatttaggcagaaggc 549 wfs-11-3 -------------------------------------------------- 343 stfs-AFP1 cgggttgtgtgacgtcattatatctctaggtttgtggtaaacaacccctc 599 wfs-11-3 -------------------------------------------------- 343 stfs-AFP1 ctatttaacgccttaccttgcagagtcaaggcggattttcactattcggc 649 wfs-11-3 -------------------------------------------------- 343 stfs-AFP1 ttgtgtgttatctccgagttttctagaaactcgtcctgacctataatact 699 wfs-11-3 -------------------------------------------------- 343

98

stfs-AFP1 cattatacttgtaagtactgggtccgcgtctcctctcttcgaacaccgac 749 wfs-11-3 -------------------------------------------------- 343 stfs-AFP1 ttctacaagacactactgcgggaaacatacgatagaagaaagagattcat 799 wfs-11-3 --------------------------------------...c........ 355 stfs-AFP1 gtgttcaggcctaaacctgaaaaaatctgagctctgttcaatcatgggaa 849 wfs-11-3 ..c.......t........c..................a........... 405 stfs-AFP1 acaactttttaattgagtcatggctgcaaaactcttttatatgaacagaa 899 wfs-11-3 ..............c.....g.....g......a.........c...... 455 stfs-AFP1 gaagaagaagtgatctttagttcatcactgtggaaacatcagcagcagtt 949 wfs-11-3 ............................ca...........t........ 505 stfs-AFP1 aaattctgtctgcttcagtatcaccggccagttccagtgctcatgtttct 999 wfs-11-3 ...g.............................................. 555 stfs-AFP1 gatcagcttggtttgaatgatatgaaa-cggatggagtccctgtttgacc 1048 wfs-11-3 ...........................a...................... 605 stfs-AFP1 ctgtttaacacaagat-ggccaagtggaccatctttattaacataatgtt 1097 wfs-11-3 ................t..a.gca...............t.......... 655 stfs-AFP1 ttacatgagcacttcctgttttcagccctaaacctaaagaggcctcatgg 1147 wfs-11-3 ......c..........................t................ 705 stfs-AFP1 aaacttcctgatgatctggtgacacctgctggttgaaggaaacagagttc 1197 wfs-11-3 .................................................t 755 stfs-AFP1 gagaggcagctgaacaaattattttagtttgaaagaagaagctgtcattt 1247 wfs-11-3 ..........a........g.............t................ 805 stfs-AFP1 gagattatgtt-gtagggggggggggg-----------------atactg 1279 wfs-11-3 t.tt....a..t.g.............gggatcaccacacacag...t.. 855 stfs-AFP1 aactctgtcatcaccgggttcggtgaaagtgacggacaagtacatgttgt 1329 wfs-11-3 ...a..........t.................a.a..c............ 905 stfs-AFP1 gataaataattatatcataataattataat–aataccattaatttctgca 1378 wfs-11-3 ....t.....at..................t............c...... 955

99

M D A P A R A A A A T stfs-AFP1 gAATCACTAAAACGAACATGGACGCCCCAGCCAGAGCCGCCGCAGCCACC 1428 wfs-11-3 ........G.C.TC...........A.......A................ 1005 M D A P A K A A A A T A A A A K A A A E A T A A A A A K stfs-AFP1 GCCGCCGCCGCCAAAGCCGCCGCAGAAGCCACCGCCGCCGCAGCTGCCAA 1478 wfs-11-3 ..............G................................... 1055 A A A A K A A A E A T A A A A A K A A A A T K A A R Ter stfs-AFP1 AGCAGCAGCCGCCACCAA------AGCAGCCCGTTAATGATCGTGGTCGT 1522 wfs-11-3 ..................AGCCGC.......................... 1105 A A A A T K A A A A R Ter stfs-AFP1 CTTGATGTGGGATCATGTGAACATCTGAGCAGCGAGATGTTACCAATATG 1572 wfs-11-3 ...............................................C.. 1155 ###### stfs-AFP1 CTGAATAAACCTGAGAAGCTGTTTGTTGAaaaccaagtgtcctgttcatt 1622 wfs-11-3 ...........................T...................... 1205 stfs-AFP1 tcatctctggaactccttcacactttctgtagatcatgtttttattttgt 1622 wfs-11-3 .........a.....a.................................. 1205 stfs-AFP1 ccagacgatgttgaactggagcagaatccagaaacgatcc 1642 wfs-11-3 t...................t...............c... 1245

100

Appendix C

DNA alignment of the 3' regions from all three type I AFP isoforms of starry flounder and

winter flounder and the liver isoform of yellowtail flounder

The sequences downstream of winter flounder (wfl-AFP6, -AFP8, AFP9, wfs-11-3, -F2, wfh-

AFP1, -5a) and yellowtail flounder (ytl-AFP1) AFP genes were obtained from the non-redundant

Genbank database, whereas the starry flounder sequences correspond to the liver (stfl-AFP1) and

skin (stfs-AFP1) variants found in lambda insert #1 (Figure 9) as well as the skin (stfs-AFP8) and

hyperactive (stfh-AFP1) variants isolated from lambda insert #2 (Figure 17). The alignment

begins with the stop codon and positions that are identical in all sequences are marked with an

asterisk (*). Exonic sequence is capitalized, differences are highlighted in grey, and

polyadenylation signals are marked with pound signs (#). The number of nucleotides is noted at

the end of each row. Genbank accession numbers for deposited sequences are M62415 (wfl-

AFP6), X53718 (wfl-AFP9), M63478 (wfs-11-3), M63479 (wfs-F2), X06356 (ytl-AFP1),

M63477 (wfh-5a), EU188795 (wfh-AFP1).

101

Ter stfl-AFP1 TAAGGATCGTGGTCGTCTTGATGTGGGATCATGTGAACATCTGAGCAGCG 50 wfl-AFP6 TAAGGATCGTGGTCGTCTTGATGTGGGATCATGTGAACATCTGAGCAGCG 50 wfl-AFP9 TAAGGATCGTCGTCGTCTTGATGTGGGATCATGTGAACATCTGAGCAGCG 50 ytl-AFP1 TAAGGATCGTGGTCGTCTTGATGTGGGATCATGTGAACATCTGAGCAGTG 50 stfs-AFP1 TAATGATCGTGGTCGTCTTGATGTGGGATCATGTGAACATCTGAGCAGCG 50 wfs-11-3 TAATGATCGTGGTCGTCTTGATGTGGGATCATGTGAACATCTGAGCAGCG 50 wfs-F2 TAATGATCGTGGTCGTCTTGATGTGGGATCATGTGAACATCTGAGCAGCG 50 stfs-AFP8 TAATGATCATGGTCGTCTTGATGTGGGATCATGTGAACATCTGAGCAGCG 50 5a TAAGGATCGTGGTCGTCTTGATGTGGGATCATGTGAACATCTGAGCAGCG 50 wfh-AFP1 TAAGGATCGTGGTCGTCTTGATGTGGGATCATGTGAACATCTGAGCAATG 50 stfh-AFP1 TAAGGATCGTGGTCGTCTTGATGTGGGATCATGTGAACATCTGAGCAGCG 50 consensus *** **** * ************************************ * ###### stfl-AFP1 AGATGTTACCAATCTGCTGAATAAACCTGAGAAGCTGTTTGTTGAaaacc 100 wfl-AFP6 AGATGTTACCAATCTGCTGAATAAACCTGAGAAGCTGATTGTTAAaaacc 100 wfl-AFP9 AGATGTTACCAATCTGTTGAATAAAGCTGAGAAGCTGTTTGTTTAaaacc 100 ytl-AFP1 AGATGTTATTAATCTGATGAATAAACCTGAGAAGCTGTTTGTTGA 95 stfs-AFP1 AGATGTTACCAATATGCTGAATAAACCTGAGAAGCTGTTTGTTGAaaacc 100 wfs-11-3 AGATGTTACCAATCTGCTGAATAAACCTGAGAAGCTGTTTGTTTAaaacc 100 wfs-F2 AGATGTTACCAATCTGCTGAATAAACCTGAGAAGCTGATTGTTAAaaacc 100 stfs-AFP8 AGATGTTACCAATCTGCTGAATAAACCTGAGAAGCTGTTTGTTGAaaacc 100 5a AGATGTTACCAATCTGCTGAATAAAC 76 wfh-AFP1 AGATATCACCAATCTGTTGAATAAAGCTGAGAAGCTGTTTGTT 93 stfh-AFP1 AGATGTTACCAATCTGCTGAATAAACCTGAGAAGCTGTTTGTTGAaaacc 100 consensus **** * * *** ** ******** *********** ***** ****** stfl-AFP1 aagtgtccagttcatttcatctctgaaactccttcacagtttctgtagat 150 wfl-AFP6 aagtgtcctgttcatttcatctctgaaagtccgtcacagtttctgtagat 150 wfl-AFP9 aagtgtcctgttcatttcatctctgaaactccgtcacagtttctttagat 150 stfs-AFP1 aagtgtcctgttcatttcatctctggaactccttcacactttctgtagat 150 wfs-11-3 aagtgtcctgttcatttcatctctgaaactcattcacagtttctgtagat 150 wfs-F2 aagtgtcctgttcatttcatctctgaaagtccgtcacagtttctgtagat 150 stfs-AFP8 aagtgtccagttcatttcatctctgaaactccttcacactttctgtagat 150 stfh-AFP1 aagtgtcctgttcatttcatcactgaaactccttcacactttctgtagat 150 consensus ******** **************** ** ** ***** ***** ***** stfl-AFP1 catgtttttaacacataaacctccagaaatcatgatgcgtcacgtttgga 200 wfl-AFP6 catgtagactccaggaagtgatgccattgtgctgttgaacctgcaggtct 200 wfl-AFP9 catgtttttcttaacacataaacctccagaaatcatgatgcgtcacgttt 200 stfs-AFP1 catgtttttattttgtccagacgatgttgaactggagcagaatccagaaa 200 wfs-11-3 catgtttttattttgttcagacgatgttgaactggatcagaatccagaaa 200 wfs-F2 catgtagactccaggaagtgatgccattgtgctgttgaacctgcaggg 198 stfs-AFP8 catgtttttaacacataaacctccagaaatcatgatgtgtcacatttgga 200 stfh-AFP1 catgtttttaacacataaacctccagaaatcatgatgtgtcacatttgga 200 consensus ***** *

102

Appendix D

DNA alignment of the upstream region and exon 1 in the liver and hyperactive AFPs of

winter, starry and yellowtail flounders

Sequences spanning exon 1 and flanking 5' sequences of liver and hyperactive AFP genes were

obtained from the non-redundant Genbank database for the winter flounder and yellowtail

flounder. The starry flounder liver isoform found in lambda insert #1 (stfl-AFP1, Figure 10) was

also included. The alignment terminates at the 3' end of the first exon, which is marked by

uppercase letters with the coding region in bold. Positions that are identical in all sequences are

marked with an asterisk (*) and differences are highlighted in grey. The number of nucleotides is

noted at the end of each row. Genbank accession numbers for sequences used are M62415 (wfl-

AFP6), X53718 (wfl-AFP9), X06356 (ytl-AFP1), M63476 (wfh-5a), EU188795 (wfh-AF1).

103

wfh-5a ggattgttgtaccaatctgctctgattggtcgacagtcaagcgatgactc 50 wfl-AFP6 ggagtgttgtaccaatctgctcagattggtcgacagtcaagcgatgaccc 50 wfl-AFP9 ggagtgttgtaccaatctgctcagattggtcgacagtcaagcgatgaccc 50 wfl-AFP8 gtcgacagtcaagcgatgaccc 22 stfl-AFP1 ggagtgttgtaccaatctgctcagattggtcgacagtcaagcgatgactc 50 consensus *** ****************** ************************* * wfh-5a aggctcaaattactacaaaacagattcacactcacctggatattcACCAC 100 wfl-AFP6 aggctccagttactataaaacagattcacattgacctggatattcACCAC 100 wfl-AFP9 aggctccagttactataaaacagattcacattgacctggatattcACCAC 100 wfl-AFP8 aggctccagttactataaaacagattcacattcacctggatattcACCAC 72 stfl-AFP1 aggctccatttactacaaaacagactcacattcgcctgtatctttACCAC 100 consensus ****** * ****** ******** ***** * **** ** ** ***** wfh-AFP1 AAGTTCTCAAAATGGCT 17 wfh-5a ATCTTCATTTTCTAGTGAACCACTGCTCCCTAAAAGTTCTCAAAATGGCT 150 wfl-AFP6 ATCTTCATTTTGTAGTGAACCAGTGCTCCCTACAAGTTCTCAAAATGGCT 150 wfl-AFP9 ATCTTCATTTTGTAGTAAACCAGTGCTCCCTACAAGTTCTCAAAATGGCT 150 wfl-AFP8 ATCTTCATTTTGTAGTGAACCAGTGCTCCCTACAAGTTCTCAAAATGGCT 122 stfl-AFP1 ATCTTCCTTTTGTAGTGAAGCAGTGCTCCCTAAAAACTCTCAAAATGGCT 150 ytl-AFP1 GTGAAGCAGTGCTCCCTAAAAGTTCTCAAAATGGCT 36 consensus ****** **** **** ** ** ********* ** ************* wfh-AFP1 CTCTCACTTTTCACTGTCGGACAATTCATTTTCTTATTTTGGACAATCAG 67 wfh-5a CTCTCACTCTTCACTGTCGGACAATTCATTTTCTTATTTTGGACAATCAG 200 wfl-AFP6 CTCTCACTTTTCACTGTCGGACAATTGATTTTCTTATTTTGGACAATGAG 200 wfl-AFP9 CTCTCACTTTTCACTGTCGGACAATTGATTTTCTTATTTTGGACAATGAG 200 wfl-AFP8 CTCTCACTTTTCACTGTCGGACAATTGATTTTCTTATTTTGGACAATGAG 172 stfl-AFP1 CTCTCACTCTTCACCGTCGGACAATTCATTTTCTTATTTTGGACAATAAG 200 ytl-AFP1 CTCTCACTCTTCACTGTTGGACAATTAATTTTCTTATTTTGGACACTCAG 86 consensus ******** ***** ** ******** ****************** * **

104

Appendix E

DNA alignment of four unique starry flounder liver AFP gene sequences

DNA sequences were obtained from genomic DNA (g1 and g3) or phage insert DNA (L5) by

PCR (Figure 14) or from lambda insert #1 (H1, Figure 10, Appendix A). Each of the ten

additional sequence sources listed in Table 3 was identical to one of these four sequences.

Differences are highlighted in grey and the number of nucleotides is noted at the end of each row.

Intronic sequence is in lower case, while exonic sequence is in upper case with bolding used to

denote coding sequence. The silent substitution is underlined. The primers used for PCR

amplification to isolate these sequences are labelled as per Figure 6 and are marked with

asterisks. Notes detailing the frequency with which each sequence was isolated are below.

105

*********5’int********* H1 ggaaggaatgatatctgcattatcccagaggggcaggcacgtgcacagatattttggggg 60 g1 cccagaggggcaggcacgtgcacagatattttggggg 37 g3 cccagaggggcaggcacgtgcacagatattttggggg 37 H1 gcaagtgctaaaacccccaaaaaagggcacccatcgccaaaatgtaaagctgacaacaca 120 g1 gcaagtgctaaaacccccaaaaaagggcacccatcgccaaaacgtaaagctgacaacaca 97 g3 gcaagtgctaaaacccccaaaaaagggcacccatcgccaaaatgtaaagctgacaacaca 97 H1 gtacacacacaatcttttttttactgttttctgacagagttgaagtattagcagcattac 180 g1 gtacacacacaatcttttttttactgttttctgacagagttgaagtattagcagcattac 157 g3 gtacacacacaatcttttttttactgttttctgacagagttgaagtattagcagcattac 157 H1 actaataatatgccacaatatacctcaccagactggcatatagctcaacttaagacctac 240 g1 actaataatatgccacaatatacctcaccagactggcatatagctcaacttaagacctac 217 g3 actaataatatgccacaacatacctcaccagactggcatatagctcaacttaagacctac 217 H1 cgttcatttcaataattacttgattatttgaattttaaatgaaacaaaactagagaactt 300 g1 cgttcatttcaataattacttgattatttgaattttaaatgaaacaaaactagagaactt 277 g3 cgttcatttcaataattacttgattatttgaattttaaatgaaacaaaactagagaactt 277 H1 gtctgatttgtagaacagtaaaactgcctttaatttctatcacacacagatattgaacac 360 g1 gtctgatttgtagaacagtaaaactgcctttaatttctatcacacacagatattgaacac 337 g3 gtctgatttgtagaacagtaaaactgcctttaatttctatcacacacagatattgaacac 337 H1 tgtcatcactgggttcggtgaaagtgacggaccagtaaatgttgtgatatataatattat 420 g1 tgtcatcactgggttcggtgaaagtgacggaccagtaaatgttgtgatatataatattat 397 g3 tgtcatcactgggttcggtgaaagtgacggaccagtaaatgttgtgatatataatattat 397 **********3'int********* H1 catcatttcaataataccattaatctctgcagAATCAGTGAAGCCAACCCCGACCCCGCA 480 g1 catcatttcaataataccattaatctctgcagAATCAGTGAAGCCAACCCCGACCCCGCA 457 g3 catcatttcaataataccattaatctctgcagAATCAATGAAGCCAACCCCGACCCCGCA 457 L5-2 AATCAATGAAGCCAACCCCGACCCCGCA 28 H1 GCCAAAGCTGCCGCAGTCGCCGACCCTGCCGCAGCCGCTGTAGCCCCCGCCGCTGACGCC 540 g1 GCCAAAGCTGCCGCAGTCGCCGACCCTGCCGCAGCCGCTGTAGCCCCCGCCGCTGACGCC 517 g3 GCCAAAGCTGCCGCAGTCGCCGACCCTGCCGCAGCCGCTGTAGCCCCCGCCGCTGACGCC 517 L5-2 GCCAAAGCTGCCGCAGTCGCCGACCCTGCCGCAGCCGCTGTAGCCCCCGCCGCTGACGCC 88 H1 TTCTCAGCAGCCGCCGACACTGCCTCTGACGCCGCCGCCGCCGCCGCCGCCACCGCCGCT 600 g1 TTCTCAGCAGCCGCCGACACTGCCTCTGACGCCGCCGCCGCCGCCGCCGCCACCGCCGCT 577 g3 TTCTCAGCAGCCGCCGACACTGCCTCTGACGCCGCCGCCGCCGCCGCCGCCACCGCCGCT 577 L5-2 TTCTCAGCAGCCGCCGACACTGCCTCTGACGCCGCCGCCGCCGCCGCCGCCACCGCCGCT 148

106

H1 GCCGCCAAAGCCGCCGCAGAAAAAACCGCCCGGGACGCCGCTGCAGCAGCCGCAGCCACC 660 g1 GCCGCCAAAGCCGCCGCAGAAAAAACCGCCCGGGACGCCGCTGCAGCAGCCGCAGCCACC 637 g3 GCCGCCAAAGCCGCCGCAGAAAAAACCGCCCGAGACGCCGCTGCAGCAGCCGCAGCCACC 637 L5-2 GCCGCCAAAGCCGTCGCAGAAAAAACCGCCCGAGACGCCGCTGCAGCAGCCGCAGCCACC 208 *********3’univ******** H1 GCCAGAGGTTAAGGATCGTGGTCGTCTTGATGTGGGATCATGT 703 g1 GCCAGAGGTTAAGGATCGTG 657 g3 GCCAGAGGTTAAGGATCGTG 657 L5-2 GCCAGAGGTTAAGGATCGTG 228 Stfl-AFP1 was isolated 7x in total: g1, g2, H1, H7-3, L1-4, L2-1, L4-1; the last 4 were isolated from the 3’int primer only. The sequence of g1 is identical to that of H1, except for a single nucleotide in the intron. Stfl-AFP2 was isolated 6x in total: g3, H9-2, H13-4, H14-2, L7-2, L8-1; the last 5 were isolated from the 3’int primer only. Stfl-AFP3 was isolated once, from L5-2. It is identical to a genomic sequence (g3) coding for Stfl-AFP2, except for a single nucleotide in the cds.

107

Appendix F

DNA alignment of fourteen unique genes encoding starry flounder skin AFPs

DNA sequences were obtained by PCR from either genomic DNA (g4 to g10, as labeled in

Figure 14) or phage insert DNA (H5-4 and H5-2), and from the lambda library (H1, in Figure 10

and Appendix B, and H11, in Figure 18 and Appendix J). Together, they encode the skin AFP

variants listed in Table 3. Polymorphisms are highlighted in grey and the number of nucleotides

is noted at the end of each row. Intronic sequence is in lower case, while exonic sequence is in

upper case with bolding used to indicate coding sequence. Silent substitutions are underlined,

gaps are marked with hyphens (-), and the Element S enhancer [59] is in white text that is

highlighted in black. The PCR primers used are labelled as in Figure 6 and are marked with

asterisks. Notes detailing the frequency with which each sequence was isolated are below.

108

*********5’int********* H1 ggaaggaaggatatctgcattatcccagaggggccatttgttttacagccagtggtaaaa 60 g4 cccagaggggccatttgttttacagccagtggtaaaa 37 g5 cccagaggggccatttgttttacagccagtggtaaaa 37 g6 cccagaggggccatttgttttacagccagtggtaaaa 37 g8 cccagaggggccatttgttttacagccagtggtaaaa 37 g9 ccccgaggggccattcgttttacagccagtggtaaaa 37 H11 ggaaggaaggatatctgcattatcccagaggggccatttgttttacagccagcggtaaaa 60 H1 gttgaagatcttcatctgtgttcgtcggatggaaagtttgttctgaaaccttcagtggaa 120 g4 gatgaagatcttcatccttattcgtctgatggaaagtttgttctgaaaca---------- 87 g5 gttgaagatcttcatctgtgttcgtcggatggaaagtttgttctgaaaccttcagtggaa 97 g6 gttgaagatcttcatctgtgttcgtcggatggaaagtttgttctgaaaccttcagtggaa 97 g8 gatgaagatcttcatccttattcgtctgatggaaagtttgttctgaaaca---------- 87 g9 gatgaagatcttcatccgtgttcgtctgatggaaagtttgttc----------------- 80 H11 gatgaagatcttcaaccgtgttcgtctgatggaaagtttgttctgaaaca---------- 87 H1 gtgtagtatattccccttagcaaatatccatagccttgaatcttaagttcaaacctttaa 180 g4 ------------------------------------------------------------ 87 g5 gtgtagtatattccccttagcaaatatccatagccttgaatcttaagttcaaacctttaa 157 g6 gtgtagtatattccccttagcaaatatccatagccttgaatcttaagttcaaacctttaa 157 g8 ------------------------------------------------------------ 87 g9 ------------------------------------------------------------ 80 H11 ------------------------------------------------------------ 87 H1 gtattatctccagatgtgttcagtgtgtgtctccttgtctgaactatccttgaactgcct 240 g5 gtattatctccagatgtgttcagtgtgtgtctccttgtctgaactatcgttgaactgcct 217 g6 gtattatctccagatgtgttcagtgtgtgtctccttgtctgaactatccttgaactgcct 217 g8 ------------------------------------------------------------ 87 g9 ------------------------------------------------------------ 80 H11 ------------------------------------------------------------ 87 H1 atggaataatgagaggagagatggtttccagcgggtccttaaatcttaaggtacgacaca 300 g4 ------------------------------------------------------------ 87 g5 atggaataatgagaggagagatggtttccagcgggtccttaaatcttaaggtacgacaca 277 g6 atggaataatgagaggagagatggtttccagcgggtccttaaatcttaaggtacgacaca 277 g8 ------------------------------------------------------------ 87 g9 ------------------------------------------------------------ 80 H11 ------------------------------------------------------------ 87 H1 ttcccaaatttaggcagaaggccgggttgtgtgacgtcattatatctctaggtttgtggt 360 g4 ------------------------------------------------------------ 87 g5 ttcccaaatttaggcagaaggccgggttgtgtgacgtcattatatctctaggtttgtggt 337 g6 ctcccaaatttaggcagaaggccgggttgtgtgacgtcattatatctctaggtttgtggt 337 g8 ------------------------------------------------------------ 87 g9 ------------------------------------------------------------ 80 H11 ------------------------------------------------------------ 87

109

H1 aaacaacccctcctatttaacgccttaccttgcagagtcaaggcggattttcactattcg 420 g4 ------------------------------------------------------------ 87 g5 aaacaacccctcctatttaacgccttaccttgcagagtcaaggcggattttcactattcg 397 g6 aaacaacccctcctatttaacgccttaccttgcagagtcaaggcggattttcactattcg 397 g8 ------------------------------------------------------------ 87 g9 ------------------------------------------------------------ 80 H11 ------------------------------------------------------------ 87 H1 gcttgtgtgttatctccgagttttctagaaactcgtcctgacctataatactc------- 473 g4 ------------------------------------------------------------ 87 g5 gcttgtgtgttatctccgagttttctagaaactcgtcctgacctataatactcttgtgta 457 g6 gcttgtgtgttatctccgagttttctagaaactcgtcctgacctataatactcttgtgta 457 g8 ------------------------------------------------------------ 87 g9 ------------------------------------------------------------ 80 H11 ------------------------------------------------------------ 87 H1 ------attatacttgtaagtactgggtccgcgtctcctctcttcgaacaccgacttcta 527 g4 ------------------------------------------------------------ 87 g5 ataaacattatacttgtaagtactgggtccgcgtctcctctcttcgaacaccgacttcaa 517 g6 ataaacattatacttgtaagtactgggtccgcgtctcctctcttcgaacaccgacttcaa 517 g8 ------------------------------------------------------------ 87 g9 ------------------------------------------------------------ 80 H11 ------------------------------------------------------------ 87 *********stfsk********* ******** H1 caagacactactgcgggaaacatacgatagaagaaagagattcatgtgttcaggcctaaa 587 g4 ------------------------------------------------------------ 87 g5 caagacactactgcgggaaacatacgatagaagaaagagattcatgtgttcaggcctaaa 577 g6 caagacaatactgcgggaaacatacgatagaagaaagagattcatgtgttcaggcctaaa 577 g8 ------------------------------------------------------------ 87 g9 ------------------------------------------------------------ 80 H11 ------------------------------------------------------------ 87 **allsk********** H1 cctgaaaaaatctgagctctgttcaatcatgggaaacaactttttaattgagtcatggct 647 g4 ------------------------------------------------------------ 87 g5 cctgaaaaaatctgagctctgttcaatcatgggaaacaactttttaattgagtcatggct 637 g6 cctgaaaaaatctgagctctgttcaatcatggggaacaactttttaattgagtcatggct 637 g7 tctgttcaatcatgggaaacaactttttaattgagtcatggct 43 g8 ------------------------------------------------------------ 87 g9 -----aaaaatctgagctctgttca----tcggaaacaaattt-gaattaagtcagggct 130 H5-4 tctgttcaatcatgggaaacaactttttaattaagtcatggct 43 g10 tctgttcaatcatgggaaacaactttttaattgagtcatggct 43 H5-2 tctgttcaatcatgggaaacaaccttttaattaagtcatggct 43 H11 ------------------------------------------------------------ 87

110

H1 gcaaaactcttttatatgaacagaagaagaagaagtgatctttagttcatcactgtggaa 707 g4 ggaaaactatttgatatgcacagaagaagaagaagtgatctttagttcatcactgtggaa 147 g5 gcaaaactcttttatatgaacagaagaagaagaagtgatctttagttcatcactgtggaa 697 g6 gcaaaactcttttatatgaacagaagaagaagaagtgatctttagttcatcactgtggaa 697 g7 gcaaaactcttttatatgaacagaagaagaagaagtgatctttagttcatcactgtggaa 103 g8 ggaaaactatttgatatgcacagaagaagaagaagtgatctttagttcatcactgtggaa 147 g9 ggaaaact---ttatatgaacagaagaagaagaagtgatctttagttcatcactgtggaa 187 H5-4 gca--------------------------------------------------------- 46 g10 gcaaaactcttttatatgaacagaagaagaagaagtgatctttagttcatcactgtggaa 103 H5-2 gca--------------------------------------------------------- 46 H11 gtaaaactcttttatatgaacagaagaagaagaagtgatctttagttcatcactgtggaa 147 H1 acatcagcagcagttaaattctgtctgcttcagtatcaccggccagttccagtgctcatg 767 g4 acatcagcagcagttaaagtctgtctgcttcagtatcaccggccagttccagtgctcatg 207 g5 acatcagcagcagttaaattctgtctgcttcagtatcaccggccagttccagtgctcatg 757 g6 acatcagcagcagttaaattctgtctgcttcagtatcaccggccagttccagtgctcatg 757 g7 acatcagcagcagttaaattctgtctgcttcagtatcaccggccagttccagtgctcatg 163 g8 acatcagcagcagttaaagtctgtctgcttcagtatcaccggccagttccagtgctcatg 207 g9 acatcagcagcagttaaagtctgtctgcttcagtatcactggccagttccagtgctcatg 247 H5-4 -----------------------------tcagtatcaccggccagttccagtgctcatg 77 g10 acatcagcagcagttaaattctgtctgcttcagtatcaccggccagttccagtgctcatg 163 H5-2 -----------------------------tcagtatcaccggccagttccagtgctcatg 77 H11 acatcagcagcagttaaagtctgtctgcttcagtatcactggccagttccagtgctcatg 207 H1 tttctgatcagcttggtttgaatgatatgaaacggatggagtccctgtttgaccctgttt 827 g4 -ttctgatcagcttgttttgaatgatataaaacggatggagtccctgtttgaccctgttt 267 g5 tttctgatcagcttggtttgaatgatatgaaacggatggagtccctgtttgaccctgttt 817 g6 tttctgatcagcttggtttgaatgatatgaaacggatggagtccctgtttgaccctgttt 817 g7 tttctgatcagcttggtttgaatgatatgaaacggatggagtccctgtttgaccctgttt 223 g8 -ttctgatcagcttgttttgaatgatataaaacggatggagtccctgtttgaccctgttt 267 g9 tttctgatcagcttggtttgaatgatatgaaacggatggagtccctgtttgaccctgttt 307 H5-4 tttctgatcagcttggtttgaatgatatgaaacggatggagtccctgtttgaccctgttt 137 g10 tttctgatcagcttggtttgaatgatatgaaacggatggagtccctgtttgaccctgttt 223 H5-2 tttctgatcagcttggtttgaatgatatgaaacggatggagtccctgtttgaccctgttt 137 H11 tttctgatcagcttggtttgaatgatatgaaacggatggagtccctgtttgaccctgttt 267 H1 aacacaagatggccaagtggaccatctttattaacataatgttttacatgagcacttcct 887 g4 aacacaagatggcaacgtggaccatctttattaacataatgttttacatgagcacttcct 327 g5 aacacaagatggccaagtggaccatctttattaacataatgttttacatgagcacttcct 877 g6 aacacaagatggccacgtggatcatctttattaacataatgttttacatgagcacttcct 877 g7 aacacaagatggccaagtggaccatctttattaacataatgttttacatgagcacttcct 283 g8 aacacaagatggcaacgtggaccatctttattaacataatgttttacatgagcacttcct 327 g9 aacacaagatgg--------accatctttattaacataatgttttacatcagcacttcct 359 H5-4 aacacaagatggccacgtggatcatctttattaacataatgttttacatgagcacttcct 197 g10 aacacaagatggccaagtggaccatctttattaacataatgttttacatgagcacttcct 283 H5-2 aacacaagatggccacgtggaccatctttattaacataatgttttacatgagcacttcct 197 H11 aacacaagatgg--------accatctttattaacataatgttttacatgagcacttcct 319

111

H1 gttttcagccctaaacctaaagaggcctcatggaaacttcctgatgatctggtgacacct 947 g4 gttttcagccctaaacttaaagaa-cctcatggaaacttcctgatgatctggtgacacct 386 g5 gttttcagccctaaacctaaagaggcctcatggaaacttcctgatgatctggtgacacct 937 g6 gttttcagccctaaacctaaagaggcctcatggaaacttcctgatgatctggtgacacct 937 g7 gttttcagccctaaacctaaagaggcctcatggaaacttcctgatgatctggtgacacct 343 g8 gttttcagccctaaacttaaagaa-cctcatggaaacttcctgatgatctggtgacacct 386 g9 gttttcagcccgaaacttaaagaa-cctcatggaatcttcctgatgatctggtgacacct 418 H5-4 gttttcagccctaaacctaaagaggcctcatggaaacttcctgatgatctggtgacacct 257 g10 gttttcagccctaaacctaaagaggcctcatggaaacttcctgatgatctggtgacacct 343 H5-2 gttttcagccctaaacctaaagaggcctcatggaaacttcctgatgatctggtgacacct 257 H11 gttttcagccctaaacctaaagaggcctcatagaaacttcctgatgatctggtgacacct 379 H1 gctggttgaaggaaacagagttcgagaggcagctgaacaaattattttagtttgaaagaa 1007 g4 gctggttgaaggaaacagagtttgagagtcagaagaacaaatgattttagtttgaatgaa 446 g5 gctggttgaaggaaacagagttcgagaggcagctgaacaaattattttagtttgaaagaa 997 g6 gctggttgaaggaaacagagttcgagaggcagctgaacaaattattttagtttgaaagaa 997 g7 gctggttgaaggaaacagagttcgagaggcagctgaacaaattattttagtttgaaagaa 403 g8 gctggttgaaggaaacagagtttgagagtcagaagaacaaatgattttagtttgaatgaa 446 g9 gctggttgaaggaaacagagttttagaggcagccgaacaaatgattttagtttgaatgaa 478 H5-4 gctggttgaaggaaacagagttcgagaggcagctgaacaaattattttagtttgaaagaa 317 g10 gctggttgaaggaaacagagttcgagaggcagctgaacaaattattttagtttgaaagaa 403 H5-2 gctggttggaggaaacagagttcgagaggcagctgaacaaattattttagtttgaaagaa 317 H11 gctggttgaaggaaacagagtttgagagtcagaagaacaaatgattttagtttgaaacaa 439 H1 gaagctgtcatttgagattatgttgtagggggggggggg--------------------a 1047 g4 gaagctgtcatttga--ttttgtt------------------------------------ 468 g5 gaagctgtcatttgagattatgttgtaggggggggggg---------------------a 1036 g6 gaagctgtcatttgagattatgttgtagggggggg------------------------a 1033 g7 gaagctgtcatttgagattatgttgtaggggggggggggg-------------------a 444 g8 gaagctgtcatttga--ttttgtt------------------------------------ 468 g9 gaagctgtcattttattttatgttgggggggggggggggg-------------------- 518 H5-4 gaagctgtcatttgagattatgttgtagggggggg------------------------a 353 g10 gaagctgtcatttgagattacgttgtagggggggggggg--------------------a 443 H5-2 gaagctgtcatttgagattatgttgtagggggggg------------------------a 353 H11 gaagctgtcatttgatattatgttgtgggggggggggcgggggtggtcatcacacacaga 499 H1 tactgaactctgtcatcaccgggttcggtgaaagtgacggacaagtacatgttgtgataa 1107 g4 ------------tcatcactgggttcggtgaaagtgacggaccagtacatgttgtgataa 516 g5 tactgaactctgtcatcaccgggttcggtgaaagtgacggacaagtacatgttgtgataa 1096 g6 tactgaactctgtcatcaccgggttcggtgaaagtgacggacaagtacatgttgtgataa 1093 g7 tactgaactctgtcatcaccgggttcggtgaaagtgacggacaagtacatgttgtgataa 504 g8 ------------tcatcactgggttcggtgaaagtgacggaccagtacatgttgtgatat 516 g9 tactgaacactgtcatcactgggttcggtgaaagtgacggaccagtacatgttgtgatat 578 H5-4 tactgaactctgtcatcaccgggttcggtgaaagtgacggagcagtacatgttgtgataa 413 g10 tactgaactctgtcatcaccgggttcggtgaaagtgacggacaagtacatgttgtgataa 503 H5-2 tactgaactctgtcatcaccgggttcggtgaaagtgacggacaagtacatgttgtgataa 413 H11 tattgaacactgtcatcactgggttcagtgaaagtgacggaccagtacatgttgtgataa 559

112

**********3’int********* H1 ataattatatcataataattataataataccattaatttctgcagAATCACTAAAACGAA 1167 g4 ataattatatcataataattataataataccattaatttctgcagAATCACTAAAACGAA 576 H7-2 AATCACTAAAACGAA 15 g5 ataattatatcataataattataataataccattaatttctgcagAATCACTAAAACGAA 1156 g6 ataattatatcataataattataataataccattaatttctgcagAATCACTAAAACGAA 1153 g7 ataattatatcataataattataataataccattaatttctgcagAATCACTAAAACGAA 564 L5 AATCACTAAAACGAA 15 L7-3 AATCACCAAAACGAA 15 g8 ataacattatcataataattataataataccattaatctctgcagAATCACTCATACTAA 576 g9 ataatattttcaaaataattttaataataccattaatttctgcagAATCACTGACATCAA 638 H5-4 ataattatatcataataattataataataccattaatttctgcagAATCACTAAAACGAA 473 g10 ataattatatcataataattataataataccattaatttctgcagAATCACTAAAACGAA 563 H5-2 ataattatatcataataattataataataccattaatttctgcagAATCACTAAAACGAA 473 H11 ataa---tatcataataattataataataccattaatctctgcagAATCACTGACATCAA 617 H1 CATGGACGCCCCAGCCAGAGCCGCCGCAGCCACCGCCGCCGCCGCCAAAGCCGCCGCAGA 1227 g4 CATGGACGCCCCAGCCAGAGCCGCCGCAGCCACCGCCGCCGCCGCCAAAGCCGCCGCAGA 636 H7-2 CATGGACGCCCCAGCCAGAGCCGCCGCAGCCACCGCCGCCGCCGCCAAAGCCGCCGCAGA 75 g5 CATGGACGCCCCAGCCAGAGCCGCCGCAGCCACCGCCGCCGCCGCCAAAGCCGCCGCAGA 1216 g6 CATGGACGCCCCAGCCAGAGCCGCCGCAGCCACCGCCGCCGCCGCCAAAGCCGCCGCAGA 1213 g7 CATGGACGCCCCAGCCAGAGCCGCGGCAGCCACGGCCGCCGCCGCCAAAGCCGCCGCAGA 624 L5 CATGGACGCCCCAGCCAGAGCCGCCGCAGCCACCGCCGCCGCCGCCAAAGCCGCCGCAGA 75 L7-3 CATGGACGCCCCAGCCAGAGCCGCCGCAGCCACCGCCGCCGCCGCCAAAGCCGCCGCAGA 75 g8 CATGGACGCCCCAGCCAGAGCCGCCGCAGCCACTGCCGCTGCAGCCAAAGCCGCAGCCGA 636 g9 CATGGACGCCCCAGCCAAAGCCGCCGCAGCCACCGCAGCCGCCGCCAAAGCCGCCGCAGA 698 H5-4 CATGGACGCCCCGGCCAGAGCCGCCGCTGCCACCGCCGCCGCCGCCAGAGCCACCGCAGA 533 g10 CATGGACGCCCCAGCCAGAGCCGCCGCAGCCACCGCCGCCGCCGCCAAAGCCGCCACAGA 623 H5-2 CATGGGCGCCCCAGCCAGAGCCGCCGCAGCCACCGCCGCCGCCGCCAAGGCCGCCGCAGA 533 H11 CATGGACGCCCCAGCCGCCGCCGCCGCAGCCACCGCCGCCGCCGCCAAAGCCGCCGCAGA 677 H1 AGCCACCGCCGCCGCAGCTGCCAAAGCAGCAGCCGCCACCAAAGCAGCC----------- 1276 g4 AGCCACCGCCGCCGCAGCTGCCAAAGCAGCAGCCGCCACCAAAGCAGCC----------- 685 H7-2 AGCCACCGCCGCCGCAGCTGCCAAAGCAGCAGCCGCCACCAAAGCAGCC----------- 124 g5 AGCCACCAAAGCCGCAGCTGCCAAAGCAGCAGCCGCCACCAAAGCCGCC----------- 1265 g6 AGCCACCAAAGCCGCAGCTGCCAAAGCAGCAGCCGCCACCAAAGCCGCC----------- 1262 g7 AGCCACCAAAGCCGCAGCTGCCAAAGCAGCAGCCGCCACCAAAGCCGCC----------- 673 L5 AGCCACCAAAGCCGCAGCTGCCAAAGCAGCAGCCGCCACCAAAGCCGCC----------- 124 L7-3 AGCCACCAAAGCCGCAGCTGCCAAAGCAGCAGCCGCCACCAAAGCCGCC----------- 124 g8 AGCCACCGCCGCAGCAGCTGCCAAAGCAGCAGCCGACACCAAAGCTGCCGCAGCCGCCGC 696 g9 AGCCACCGCCGCCGCAGCTGCCAAAGCAGCAGCCGCCACCAAAGCCGGC----------- 747 H5-4 AGCCACCGAAGCCGCAGCTGCCAAAGCAGCAGCCGCCACCAAAGCCGCC----------- 582 g10 AGCCACCAAAGCCGCAGCTGCCAAAGCAGCAGCCGCCACCAAAGCCGCC----------- 672 H5-2 AGCCACCAAAGCCGCAGCTGCCAAAGCAGCGGCCGCCACCAAAGCCGCC----------- 582 H11 AGCCACCGCCGCCGCAGCTGCCAAAGCAGCAGCCGCAACCAAAGCTGCCGCAGCC----- 732

113

*********3’univ******** H1 -------CGTTAATGATCGTGGTCGTCTTGATGTGGGATCATGT 1313 g4 -------CGTTAATGATCGTG 699 H7-2 -------CGTTAATGATCGTG 138 g5 -------CGTTAATGATCATG 1279 g6 -------CGTTAATGATCATG 1276 g7 -------CGTTAATGATCATG 687 L5 -------CGTTAATGATCATG 138 L7-3 -------CGTTAATGATCATG 138 g8 CGCCGCCCTTTGAGGATCGTG 717 g9 -------CGTTAATGATCATG 761 H5-4 -------CGTTAATGATCATG 596 g10 -------CGTTAATGATCATG 686 H5-2 -------CGTTAATGATCATG 596 H11 -------CGTTAATGATCATGGTCGTCTTGATGTGGGATCATGT 746 Stfs-AFP1 was isolated 3x in total: g4, H1, H7-2. H7-2 is from the 3’int primer only. Stfs-AFP2 was isolated 5x in total: g5, g6, g7, L5, L7. The last two were isolated from the 3’int primer only. Stfs-AFP3 was isolated once, from g8. Stfs-AFP4 was isolated once, from g9. Stfs-AFP5 was isolated once, from H5. Stfs-AFP6 was isolated once, from g10. Stfs-AFP7 was isolated once, from H5. Stfs-AFP8 was isolated once, from H11.

114

Appendix G

DNA alignment the American plaice skin AFPs

An alignment of seventeen unique sequences isolated from American plaice genomic DNA is

shown. The sequences were obtained using the allsk and 3'univ primers (Table 1) and are

grouped by the encoded protein variant. Intronic sequence is in lower case, exonic sequence is in

upper case, and the coding region is bolded. Differences are highlighted in grey, silent

substitutions are underlined, and the number of nucleotides is noted at the end of each row.

115

aps-AFP10 tctgttaaaaccatggacctgttttattaggtttgatttaatgttaatca 50 aps-AFP18 tctgttaaaaccatggacctgttttattaggtttgatttaatgttaatca 50 aps-AFP09 tctgttaaaaccatggacctgttttattaggtttgatttaatgttaatca 50 aps-AFP11 tctgttaaaaccatggacctgttttattaggtttgatttaatgttaatca 50 aps-AFP07 tctgttaaaaccatggacctgttttattaggtttgatttaatgttaatca 50 aps-AFP12 tctgttaaaaccatggacctgttttattaggtttgatttaatgttaatca 50 aps-AFP05 tctgttaaaaccatggacctgttttattaggtttgatttaatgttaatca 50 aps-AFP13 tctgttaaaaccatggacctgttttattaggtttgatttaatgttaatca 50 aps-AFP17 tctgttaaaaccatggacctgttttattaggtttgatttaatgttaatca 50 aps-AFP15 tctgttaaaaccatggacctgctttattaggtttgatttaatgttaatca 50 aps-AFP08 tctgttaaaaccatggacctgttttattaggtttgatttaatgttaatca 50 aps-AFP04 tctgttaaaaccatggacctgttttattaggtttgatttaatgttaatca 50 aps-AFP03 tctgttaaaaccatggacctgttttattaggtttgatttaatgttaatca 50 aps-AFP06 tctgttaaaaccatggacctgttttattaggtttgatttaatgttaatca 50 aps-AFP01 tctgttaaaaccatggacctgttttattaggtttgatttaatgttaatca 50 aps-AFP14 tctgttaaaaccatggacctgttttattaggtttgatttaatgttaatca 50 aps-AFP16 tctgttaaaaccatggacctgttttattaggtttgatttaatgttaatca 50

aps-AFP10 tttcaaatcatgggaaacaactttttaattaagtccgggctggaaaactc 100 aps-AFP18 tttcaaatcatgggaaacagctttttaattaagtccgggctggaaaactc 100 aps-AFP09 tttcaaatcatgggaaacaactttttaattaagtccgggctggaaaactc 100 aps-AFP11 tttcaaatcatgggaaacaactttttaattaagtccgggctggaaaactc 100 aps-AFP07 tttcaaatcatgggaaacaactttttaattaagtccgggctggaaaactc 100 aps-AFP12 tttcaaatcatgggaaacaactttttaattaagtccgggctggaaaactc 100 aps-AFP05 tttcaaatcatgggaaacaactttttaattaagtccgggctggaaaactc 100 aps-AFP13 tttcaaatcatgggaaacaactttttaattaagtccgggctggaaaactc 100 aps-AFP17 tttcaaatcatgggaaacaactttttaattaagtccgggctggaaaactc 100 aps-AFP15 tttcaaatcatgggaaacaactttttaattaagtccgggctggaaaactc 100 aps-AFP08 tttcaaatcatgggaaacagctttttaattaagtccgggctggaaaactc 100 aps-AFP04 tttcaaatcatgggaaacaactttttaattaagtccgggctggaaaactc 100 aps-AFP03 ttgcaaatcatgggaaacaactttttaattaagtccgggctggaaaactc 100 aps-AFP06 ttgcaaatcatgggaaacaactttttaattaagtccgggctggaaaactc 100 aps-AFP01 ttgcaaatcatgggaaacaactttttaattaagtccgggctggaaaactc 100 aps-AFP14 tttcaaatcatgggaaacaactttttaattaagtccgggctggaaaacta 100 aps-AFP16 tttcaaatcatgggaaacaactttttaattaagtccgggctggaaaacta 100 aps-AFP10 ttttatatgcacagaagaagaagaagtgatctttagttcatcactgtgga 150 aps-AFP18 ttttatatgcacagaagaagaagaagtgatctttagttcatcactgtgga 150 aps-AFP09 ttttatatgcacagaagaagaagaagtgatctttagttcatcactgtgga 150 aps-AFP11 ttttatatgcacagaagaagaagaagtgatctttagttcatcactgtgga 150 aps-AFP07 ttttatatgcacagaagaagaagaagtgatctttagttcatcactgtgga 150 aps-AFP12 ttttatatgcacagaagaagaagaagtgatctttagttcatcactgtgga 150 aps-AFP05 ttttatatgcacagaagaagaagaagtgatctttagttcatcactgtgga 150 aps-AFP13 ttttatatgcacagaagaagaagaagtgatctttagttcatcactgtgga 150 aps-AFP17 ttttatatgcacagaagaagaagaagtgatctttagttcatcactgtgga 150 aps-AFP15 ttttatatgcacagaagaagaagaagtgatctttagttcatcactgtgga 150 aps-AFP08 ttttatatgcacagaagaagaagaagtgatctttagttcatcactgtgga 150 aps-AFP04 ttttatatgcacagaagaagaagaagtgatctttagttcatcactgtgga 150 aps-AFP03 ttttatatgcacagaagaagaagaagtgatctttagttcatcactgtgga 150 aps-AFP06 ttttatatgcacagaagaagaagaagtgatctttagttcatcactgtgga 150 aps-AFP01 ttttatatgcacagaagaagaagaagtgatctttagtccatcactgtgga 150 aps-AFP14 ttttatatgcacagaaggagaagaagtgatctttagttcatcactgtgga 150 aps-AFP16 ttttatatgcacagaaggagaagaagtgatctttagttcatcactgtgga 150

116

aps-AFP10 aacatcagcagcagttaaagtctgtctgcttcagtgtcaccggccagttc 200 aps-AFP18 aacatcagcagcagttaaagtctgtctgcttcagtgtcaccggccagttc 200 aps-AFP09 aacatcagcagcagttaaagtctgtctgcttcagtgtcaccggccagttc 200 aps-AFP11 aacatcagcagcagttaaagtctgtctgcctcagtgtcaccggccagttc 200 aps-AFP07 aacatcagcagcagttaaagtctgtctgcttcagtgtcaccggccagttc 200 aps-AFP12 aacatcagcagcagttaaagtctgtctgcttcagtgtcaccggccagttc 200 aps-AFP05 aacatcagcagcagttaaagtctgtctgcttcagtgtcaccggccagttc 200 aps-AFP13 aacatcagcagcagttaaagtctgtctgcttcagtgtcaccggccagttc 200 aps-AFP17 aacatcagcagcagttaaagtctgtctgcttcagtgtcaccggccagttc 200 aps-AFP15 aacatcagcagcagttaaagtctgtctgcttcagtgtcaccggccagttc 200 aps-AFP08 aacatcagcagcagttaaagtctgtctgcttcagtgtcaccggccagttc 200 aps-AFP04 aacatcagcagcagttaaagtctgtctgcttcagtgtcaccggccagttc 200 aps-AFP03 aacatcagcagaagttaaagtctgtctgcttcagtgtcaccggccagttc 200 aps-AFP06 aacatcagcagaagttaaagtctgtctgcttcagtgtcaccggccagttc 200 aps-AFP01 aacatcagcagcagttaaagtctgtctgcttcagtgtcaccggccagttc 200 aps-AFP14 aacatcagcagcagttaaagtctgtctgcttcagtatcagcggccagttc 200 aps-AFP16 aacatcagcagcagttaaagtctgtctgcttcagtatcagcggccagttc 200 aps-AFP10 cagtgctcatgtttctgatcagcttggtttgaatgatataaaacggatcg 250 aps-AFP18 cagtgctcatgtttctgatcagcttggtttgaatgatataaaacggatcg 250 aps-AFP09 cagtgctcatgtttctgatcagcctggtttgaatgatataaaacggatcg 250 aps-AFP11 cagtgctcatgtttctgatcagcttggtttgaatgatataaaacggatcg 250 aps-AFP07 cagtgctcatgtttctgatcagcttggtttgaatgatataaaacggatcg 250 aps-AFP12 cagtgctcatgtttctgatcagcttggtttgaatgatataaaacggatcg 250 aps-AFP05 cagtgctcatgtttctgatcagcttggtttgaatgatataaaacggatcg 250 aps-AFP13 cagtgctcatgtttctgatcagcttggtttgaatgatataaaacggatcg 250 aps-AFP17 cagtgctcatgtttctgatcagcttggtttgaatgatataaaacggatcg 250 aps-AFP15 cagtgctcatgtttctggtcagcttggtttgaatgatataaaacggatcg 250 aps-AFP08 cagtgctcatgtttctgatcagcttggtttgaatgatataaaacggatcg 250 aps-AFP04 cagtgctcatgtttctgatcagcttggtttgaatgatataaaacggatcg 250 aps-AFP03 cagtgctcatgtttctgatcagcttggtttgaatgatataaaacggatcg 250 aps-AFP06 cagtgctcatgtttctgatcagcttggtttgaatgatataaaacggatcg 250 aps-AFP01 cagtgctcatgtttctgatcagcttggtttgaatgatataaaacggatcg 250 aps-AFP14 cagtgctcatgtttctggtcagcttggtttgaatgatataaaacggatcg 250 aps-AFP16 cagtgctcatgtttctgatcagcttggtttgaatgatataaaacggatcg 250 aps-AFP10 agtgcctgtttgaccctgtttaacacaagatggccacgtggaccatcttt 300 aps-AFP18 agtgcctgtttgaccctgtttaacacaagatggccacgtggaccatcttt 300 aps-AFP09 agtgcctgtgtgaccctgtttaacacaagatggccacgtggaccatcttt 300 aps-AFP11 agtgcctgtgtgaccctgtttaacacaagatggccacgtggaccatcttt 300 aps-AFP07 agtgcctgtgtgaccctgtttaacacaagatggccacgtggaccatcttt 300 aps-AFP12 agtgcctgtttgaccctgtttaacacaagatggccacgtggaccatcttt 300 aps-AFP05 agtgcctgtgtgaccctgtttaacacaagatggccacgtggaccatcttt 300 aps-AFP13 agtgcctgtgtgaccctgtttaacacaagatggccacgtggaccatcttt 300 aps-AFP17 agtgcctgtgtgaccctgtttaatacaagatggccacgtggaccatcttt 300 aps-AFP15 agtgcctgtgtgaccctgtttaacacaagatggccacgtggaccatcttt 300 aps-AFP08 agtgcctgtgtgaccctgtttaacacaagatggccacgtggaccatcttt 300 aps-AFP04 agtgcctgtttgaccctgtttaacacaagatggccacgtggaccatcttt 300 aps-AFP03 agtgcctgtttgaccctgtttaacacaagatggccacgtggaccatcttt 300 aps-AFP06 agtgcctgtttgaccctgtttaacacaagatggccacgtggaccatcttt 300 aps-AFP01 agtgcctgtttgaccctgtttaacacaagatggccacgtggaccatcttt 300 aps-AFP14 agtgcctgtttgaccctgtttaacacaagatggccacgtggaccatcttt 300 aps-AFP16 agtgcctgtttgaccctgtttaacacaagatggccacgtggaccatcttt 300

117

aps-AFP10 atttacataatgtttcatatcagcacttcctgttttcagccctaaaccta 350 aps-AFP18 atttacataatgtttcatatcagcacttcctgttttcagccctaaaccta 350 aps-AFP09 atttacataatgtttcatatcagcacttcctgtttttagccctaaaccta 350 aps-AFP11 atttacataatgtttcatatcagcacttcctgtttttagccctaaaccta 350 aps-AFP07 atttacataatgtttcatatcagcacttcctgtttttagccctaaaccta 350 aps-AFP12 atttacataatgtttcatatcagcacttcctgttttcagccctaaaccta 350 aps-AFP05 atttacataatgtttcatatcagcacttcctgttttcagccctaaaccta 350 aps-AFP13 atttacataatgtttcatatcagcacttcctgttttcagccctaaaccta 350 aps-AFP17 atttacataatgtttcatatcagcacttcctgttttcagccctaaaccta 350 aps-AFP15 atttacataatgtttcatatcagcacttcctgttttcagccctaaaccta 350 aps-AFP08 atttacataatgtttcatatcagcacttcctgttttcagccctaaaccta 350 aps-AFP04 atttacataatgtttcatatcagcacttcctgttttcagccctaaaccta 350 aps-AFP03 atttacataatgtttcatatcagcacttcctgttttcagccctaaaccta 350 aps-AFP06 atttacataatgtttcatatcagcacttcctgttttcagccctaaaccta 350 aps-AFP01 atttacataatgtttcatatcagcacttcctgttttcagccctaaaccta 350 aps-AFP14 atttacataatgtttcacatcagcacttcctgttttcagccctaaaccta 350 aps-AFP16 atttacataatgtttcacatcagcacttcctgttttcagccctaaaccta 350 aps-AFP10 aagaggcctcatggaaacttcctgatgatctggtgacaactgctggttga 400 aps-AFP18 aagaggcctcatggaaacttcctgatgatctggtgacaactgctggttga 400 aps-AFP09 aagaggcctcatggaaacttcctgatgatctggtgacaactgctggttga 400 aps-AFP11 aagaggcctcatggaaacttcctgatgatctggtgacaactgctggttga 400 aps-AFP07 aagaggcctcatggaaacttcctgatgatctggtgacaactgctggttga 400 aps-AFP12 aagaggcctcatggaaacttcctgatgatccggtgacaactgctggttga 400 aps-AFP05 aagaggcctcatggaaacttcctgatgatctggtgacaactgctggttga 400 aps-AFP13 aagaggcctcatggaaacttcctgatgatctggtgacaactgctggttga 400 aps-AFP17 aagaggcctcatggaaacttcctgatgatctggtgacaactgctggttga 400 aps-AFP15 aagaggcctcatggaaacttcctgatgatctggtgacaactgctggttga 400 aps-AFP08 aagaggcctcatggaaacttcctgatgatctggtgacaactgctggttga 400 aps-AFP04 aagaggcctcatggaaacttcctgatgatctggtgacaactgctggttga 400 aps-AFP03 aagaggcctcatgaaaacttcctgatgatctggtgacacctgctggttga 400 aps-AFP06 aagaggcctcatgaaaacttcctgatgatctggtgacacctgctggttga 400 aps-AFP01 aagaggcctcatgaaaacttcctgatgatctggtgacacctgctggttga 400 aps-AFP14 aaaaggcctcatggaaacttcctgatgatctggtgacacctgctggttga 400 aps-AFP16 aaaaggcctcatggaaacttcctgatgatctggtgacacctgctggttga 400 aps-AFP10 aggaaacaatgtttgagaggcagcagaacaaatgattttagttttaatga 450 aps-AFP18 aggaaacaatgtttgagaggcagcagaacaaatgattttagttttaatga 450 aps-AFP09 aggaaacaatgtttgagaggcagcagaacaaatgattttagttttaatga 450 aps-AFP11 aggaaacaatgtttgagaggcagcagaacaaatgattttagttttaatga 450 aps-AFP07 aggaaacaatgtttgagaggcagcagaacaaatgattttagttttaatga 450 aps-AFP12 aggaaacaatgtttgagaggcagcagaacaaatgattttagttttaatga 450 aps-AFP05 aggaaacaatgtttgagaggcagcagaacaaatgattttagttttaatga 450 aps-AFP13 aggaaacaatgtttgagaggcagcagaacaaatgattttagttttaatga 450 aps-AFP17 aggaaacaatgtttgagaggcagcagaacaaatgattttagttttaatga 450 aps-AFP15 aggaaacaatgtttgagaggcagcagaacaaatgattttagttttaatga 450 aps-AFP08 aggaaacaatgtttgagaggcagcagaacaaatgattttagttttaatga 450 aps-AFP04 aggaaacaatgtttgagaggcagcagaacaaatgattttagttttaatga 450 aps-AFP03 aggaaacagagtttgagaggcagcagaacaaatgattttagtttgaatga 450 aps-AFP06 aggaaacagagtttgagaggcagcagaacaaatgattttagtttgaatga 450 aps-AFP01 aggaaacagagtttgagaggcagcagaacaaatgattttagtttgaatga 450 aps-AFP14 aggaaacagagtttgagaggcagcagaacaaatgattttagtttgaatga 450 aps-AFP16 aggaaacagagtttgagaggcagcagaacaaatgattttagtttgaatga 450

118

aps-AFP10 agaagctgtcatttaattttatgttgtggggga-----gggggtcatcac 495 aps-AFP18 agaagctgtcatttaattttatgttgtggggga-----gggggtcatcac 495 aps-AFP09 agaagctgtcatttaattttatgttgtggggga-----gggggtcatcac 495 aps-AFP11 agaagctgtcatttaattttatgttgtggggga-----gggggtcatcac 495 aps-AFP07 agaagctgtcatttaattttatgttgtggggga-----gggggtcatcac 495 aps-AFP12 agaagctgtcatttaattttatgttgtggggga-----gggggtcatcac 495 aps-AFP05 agaagctgtcatttaattttatgttgtggggga-----gggggtcatcac 495 aps-AFP13 agaagctgtcatttaattttatgttgtggggg-----ggggggtcatcac 495 aps-AFP17 agaagctgtcatttaattttatgttgtggggg-----ggggggtcatcac 495 aps-AFP15 agaagctgtcatttaattttatgttgtggggg-----ggggggtcatcac 495 aps-AFP08 agaagctgtcatttaattttatgttgtggggga-----gggggtcatcac 495 aps-AFP04 agaagctgtcatttaattttatgttgtggggga-----gggggtcatcac 495 aps-AFP03 agaagctgtcctttgattttatgttctgggg-aggggggggggtcatcac 499 aps-AFP06 agaagctgtcctttgattttatgttctgggg-aggggggggggtcatcac 499 aps-AFP01 agaagctgtcctttgattttatgttctgggggaggggggggggtcatcac 500 aps-AFP14 agaagctgtcatttgattttatgttgtgggg---gggggggggtcatcac 497 aps-AFP16 agaagctgtcatttgattttatgttgtgggg----ggggggggtcatcac 496 aps-AFP10 acgagaatattgaacactgtcatcactgggttctgtgaaagtgacggacc 545 aps-AFP18 acgagaatattgaacactgtcatcactgggttctgtgaaagtgacggacc 545 aps-AFP09 acgagaatattgaacactgtcatcactgggttctgtgaaagtgacggacc 545 aps-AFP11 acgagaatattgaacactgtcatcactgggttctgtgaaagtgacggacc 545 aps-AFP07 acgagaatattgaacactgtcatcactgggttctgtgaaagtgacggacc 545 aps-AFP12 acgagaatattgaatactgtcatcactgggttctgtgaaagtgacggacc 545 aps-AFP05 acgagaatattgaacactgtcatcactgggttctgtgaaagtgacggacc 545 aps-AFP13 acgagaatattgaacactgtcatcactgggttctgtgaaagtgacggacc 545 aps-AFP17 acgagaatattgaacactgtcatcactgggttctgtgaaagtgacggacc 545 aps-AFP15 acgagaatattgaacactgtcatcactgggttctgtgaaagtgacggacc 545 aps-AFP08 acgagaatattgaacactgtcatcactgggttctgtgaaagtgacggacc 545 aps-AFP04 acgagaatattgaacactgtcatcactgggttctgtgaaagtgacggacc 545 aps-AFP03 acgaggatattgaacactgtcatcactgggttctgtgaaagtgacggacc 549 aps-AFP06 acgaggatattgaacactgtcatcactgggttctgtgaaagtgacggacc 549 aps-AFP01 acgaggatattgaacactgtcatcactgggttctgtgaaagtgacggacc 550 aps-AFP14 acgaggatattgaacactgtcatcactgggttctgtgaaagtgacggacc 547 aps-AFP16 acgaggatattgaacactgtcatcactgggttctgtgaaagtgacggacc 546 aps-AFP10 cgtacatgttgtgatatataatattatcataataattataata---ccat 592 aps-AFP18 cgtacatgttgtgatatataatattatcataataattataata---ccat 592 aps-AFP09 cgtacatgttgtgatatataatattatcataataattataataatgccat 595 aps-AFP11 cgtacatgttgtgatatataatattatcataataattataataatgccat 595 aps-AFP07 cgtacatgttgtgatatataatattatcataataattataataatgccat 595 aps-AFP12 cgtacatgttgtgatatataatattatcataataattataataatgccat 595 aps-AFP05 agtacatgttgtgatatataatattatcataataattataata---ccat 592 aps-AFP13 cgtacatgttgtgatatataatattatcataataattataata---ccat 592 aps-AFP17 cgtacatgttgtgatatataatattatcataataattataata---ccat 592 aps-AFP15 cgtacatgttgtgatatataatattatcataataattataata---ccat 592 aps-AFP08 agtacatgttgtgatatataatattatcataataattataata---ccat 592 aps-AFP04 cgtacatgttgtgatatataatattatcataataattataata---ccat 592 aps-AFP03 cgtacatgttgtgatatataatattatcataataattataata---ccat 596 aps-AFP06 cgtacatgttgtgatatataatattatcataataattataata---ccat 596 aps-AFP01 cgtacatgttgtgatatataatattatcataataattataata---ccat 597 aps-AFP14 cgtacatgttgtgaaatataatattatcataataattataata---ccat 594 aps-AFP16 cgtacatgttgtgaaatataatattatcataataattataata---ccat 593

119

aps-AFP10 taatctctgcagAATCACTGACATCAACATGGACCCAGCAAAAGCCGCCG 642 aps-AFP18 taatctctgcagAATCACTGGCATCAACATGGACCCAGCAAAAGCCGCCG 642 aps-AFP09 tcatctctgcagAATCACTGACATCAACATGGACCCAGCAAAAGCCGCCG 645 aps-AFP11 tcatctctgcagAATCACTGACATCAACATGGACCCAGCAAAAGCCGCCG 645 aps-AFP07 tcatctctgcagAATCACTGACATCAACATGGACCCAGCAAAAGCCGCCG 645 aps-AFP12 tcatctctgcagAATCACTGACATCAACATGGACCCAGCAAAAGCCGCCG 645 aps-AFP05 taatctctgcagAATCACTGACATCAACATGGACCCAGCAAAAGCCGCCG 642 aps-AFP13 taaaat--gcagAATCACTGACATCAACATGGACCCAGCAAAAGCCGCCG 640 aps-AFP17 taaaat--gcagAATCACTGACATCAACATGGACCCAGCAAAAGCCGCCG 640 aps-AFP15 taaaat--gcagAATCACTGACATCAACATGGACCCAGCAAAAGCCGCCG 640 aps-AFP08 taaaat--gcagAATCACTGACATCAACATGGACCCAGCAAAAGCCGCCG 640 aps-AFP04 taatctctgcagAATCACTGACATCAACATGGACCCAGCAAAAGCCGCCG 642 aps-AFP03 taatctctgcagAATCACTGACATCAACATGGACCCAGCAAAAGCCGTTG 646 aps-AFP06 taatctctgcagAATCACTGACATCAACATGGACCCAGCAAAAGCCGCCG 646 aps-AFP01 taatctctgcagAATCACTGACATCAACATGGACCCAGCAAAAGCCGCCG 647 aps-AFP14 taatctctgcagAATCACTGACATCAACATGGACCCAGCAAAAGCCGCCG 644 aps-AFP16 taatctctgcagAATCACTGACATCAACATGGACCCAGCAAAAGCCGCCG 643 aps-AFP10 CAGCCACCGCCGCCAAAGCCAAAGCCGACGCCGAAAAGACTGCAGCCGCC 692 aps-AFP18 CAGCCACCGCCGCCAAAGCCAAAGCCGACGCCGAAAAGACTGCAGCCGCC 692 aps-AFP09 CAGCCACCGCCGCCAAAGCCAAAGCCGACGCCGAAAAGACTGCAGCCGCC 695 aps-AFP11 CAGCCACCGCCGCCAAAGCCAAAGCCGACGCCGAAAAGACTGCAGCCGCC 695 aps-AFP07 CAGCCACCGCCGCCAAAGCCAAAGCCGACGCCGAAAAGACTGCAGCCGCC 695 aps-AFP12 CAGCCACCGCCGCCAAAGCCAAAGCCGACGCCGAAAAGACTGCAGCCGCC 695 aps-AFP05 CAGCCACCGCCGCCAAAGCCAAAGCCGACGCCGAAAAGACTGCAGCCGCC 692 aps-AFP13 CAGCCACCGCCGCCAAAGCCAAAGCCGACGCCGAAAAGACTGCAGCCGCC 690 aps-AFP17 CAGCCACCGCCGCCAAAGCCAAAGCCGACGCCGAAAAGACTGCAGCCGCC 690 aps-AFP15 CAGCCACCGCCGCCAAAGCCAAAGCCGACGCCGAAAAGACTGCAGCCGCC 690 aps-AFP08 CAGCCACCGCCGCCAAAGCCAAAGCCGACGCCGAAAAGACTGCAGCCGCC 690 aps-AFP04 CAGCCACCGCCGCCAAAGCCAAAGCCGACGCCGAAAAGACTGCAGCCGCC 692 aps-AFP03 CAGCCACCGCCGCCAAAGCCAAAGCCGACGCCGAAAAGACTGCAGCCGCC 696 aps-AFP06 CAGCCACCGCCGCCAAAGCCAAAGCCGACGCCGAAAAGACTGCAGCCGCC 696 aps-AFP01 CAGCCACCGCCGCCAAAGCCAAAGCCGACGCCGAAAAGACTGCAGCCGCC 697 aps-AFP14 CAGCCACCGCCGCCAAAGCCAAAGCCGACGCCGAAAAGACTGCAGCCGCC 694 aps-AFP16 CAGCCACCGCCGCCAAAGCCAAAGCCGACGCCGAAAAGACTGCAGCCGCC 693 aps-AFP10 GCCGCCAAGGCCGCTGCCGACACCGCCGCTGCCGCCGCCAAAGCCGCCAA 742 aps-AFP18 GCCGCCAAGGCCGCTGCCGACACCGCCGCTGCCGCCGCCAAAGCCGCCAA 742 aps-AFP09 GCCGCCAAGGCCGCCGCCGACACCGCCGC---------CAAAGCCGCCAA 736 aps-AFP11 GCCGCCAAGGCCGCCGCCGACACCGCCGC---------CAAAGCCGCCAA 736 aps-AFP07 GCCGCCAAGGCCGCCGCCGACACCGCCGC---------CAAAGCCGCCAA 736 aps-AFP12 GCCGCCAAGGCCGCCGCCGACACCGCCGC---------CAAAGCCGCCAA 736 aps-AFP05 GCCGCCAAGGCCGCCGCCGACACCGCCGC---------CAAAGCCGCCAA 733 aps-AFP13 GCCGCCAAGGCCGCCGCCGACACCGCCGC------------------CAA 722 aps-AFP17 GCCGCCAAGGCCGCCGCCGACACCGCCGC------------------CAA 722 aps-AFP15 GCCGCCAAGGCCGCCGCCGACACCGCCGC------------------CAA 722 aps-AFP08 GCCGCCAAGGCCGCCGCCGACACCGCCGC------------------CAA 722 aps-AFP04 GCCGCCAAGGCCGCCGCCGACACCGCCGC---------TGCCGCCGCCAA 733 aps-AFP03 GCCGCCAAGGCCGCCGCCGACACCGCCGC---------TGCCGCCGCCAA 737 aps-AFP06 GCCGCCAAGGCCGCCGCCGACACCGCCGC---------TGCCGCCGCCAA 737 aps-AFP01 GCCGCCAAGGCCGCCGCCGACACCGCCGC---------TGCCGCCGCCAA 738 aps-AFP14 GCCGCCAAGGCCGCCGCCGACACCGCCGC---------TGCCGCCGCCAA 735 aps-AFP16 GCCGCCAAGGCCGCCGCCGACACCGCCGC---------TGCCGCCGCCAA 734

120

aps-AFP10 AGCCGCCGCCCATTAAGGATCGTG 766 aps-AFP18 AGCCGCCGCCCATTAAGGATCGTG 766 aps-AFP09 AGCCGCCGCCCATTAAGGATCGTG 760 aps-AFP11 AGCCGCCGCCCATTAAGGATCGTG 760 aps-AFP07 AGCCGCCGCCCATTAAGGATCGTG 760 aps-AFP12 AGCCGCCGCCCATTAAGGATCGTG 760 aps-AFP05 AGCCGCCGCCCATTAAGGATCGTG 757 aps-AFP13 AGCCGCCGCCCATTAAGGATCGTG 746 aps-AFP17 AGCCGCCGCCCATTAAGGATCGTG 746 aps-AFP15 AGCCGCCGCCCATTAAGGATCGTG 746 aps-AFP08 AGCCGCCGCCCATTAAGGATCGTG 746 aps-AFP04 AGCCGCCGCCCCTTAAGGATCGTG 757 aps-AFP03 AGCCGCCGCCCCTTAAGGATCGTG 761 aps-AFP06 AGCCGCCGCCCCTTAAGGATCGTG 761 aps-AFP01 AGCCGCCGCCCCTTAAGGATCGTG 762 aps-AFP14 AGCCGCCGCCCCTTAAGGATCGTG 759 aps-AFP16 AGCCGCCGCCCCTTAAGGATCGTG 758

121

Appendix H

DNA alignment of three unique gene sequences encoding starry flounder hyperactive AFPs

An alignment of three unique sequences obtained from PCR on genomic DNA is shown.

Sequences are labeled by codes assigned to genomic clones after sequencing (g#, Table 3).

Different nucleotides are highlighted in grey and the number of nucleotides is noted at the end of

each row. Notes detailing the result of a nested PCR on primary phage plaque H11 (Figure 14)

are at the bottom.

122

g11 AGCCAATGCCGCCGCCGCAGCAGCCACCGCCGCTGCCGCAGCAATAGCAGCCGAGGAAGC 60 g12 AGCCAATGCCGCCGCCGCAGCAGCCACCGCCGCTGCCGCAGCAATAGCAGCCGAGGAAGC 60 g13 AGCCAATGCCGCCGCCGCAGCAGCCACCGCCGCCGCCGCAGCAATAGCAGCCGAGGAAGC 60 g11 CGCAACCGCGGCTGCTACCGCCGCAGCTGCTGCCGCCGCCACCGCCGCCACAGCCCAGGC 120 g12 CGCAACCGCGGCTGCTACCGCCGCAGCTGCTGCCGCCGCCACCGCCGCCACAGCCCAGGC 120 g13 CGCAACCGCGGCTGCTACCGCCGCAGCTGCTGCCGCCGCCACCGCCGCCACAGCCCAGGC 120 g11 AGCCATCTTTGACAAAGCCGCAGCCGCCGCATCCACAACCGCCACCACCGCCGCCACGGC 180 g12 AGCCATCTTTGACAAAGCCGCAGCCGCCGCATCCACAACCGCCACCACCGCCGCCACGGC 180 g13 GGCCATCTTTGACAAAGCCGCAGCCGCCGCATCCACAACCGCCACCACCGCCGCCACGGC 180 g11 GGCCGCCACCATAGCCACCACCGCCGCAGCCGCAGCCGC 219 g12 GGCCGCCGCCACAGCCACCACCGCCGCAGCCGCAGCCGC 219 g13 GGCCGCCGCCACAGCCACCACCGCCGCAGCCGCAGCCGC 219 The product of the nested PCR on phage H11 prior to insert isolation and sequencing was identical to bp 59–116 of the genomic clones. The portion of the H11 insert that corresponds to this portion of the gene is identical to that of g12.

123

Appendix I

DNA alignment of winter and starry flounder hyperactive AFPs

An alignment of the starry flounder hyperactive AFP from lambda insert #2 (stfh-AFP2, Figure

17) and the winter flounder hyperactive variant wfh-AFP1 is shown. Identical bases are marked

with a dot (.), the polyadenylation site is marked with pound signs (#), and the number of

nucleotides is noted at the end of each row. The translated sequences of stfh-AFP2 and wfh-

AFP1 are above or below the respective gene sequence. Substitutions that are silent in all three

translated products are underlined, substitutions that are silent between wfh-5a and stfh-AFP2 are

italicized, and those that are silent between wfh-5a and wfh-AFP1 are bolded. The primers used

to isolate hyperactive sequences from starry flounder DNA (Table Z) are marked with asterisks

and labelled. The Genbank accession numbers for wfh-AFP1 and wfh-5a are EU188795 and

M63477, respectively.

124

I T E A I D P A A Q A A A A A stfh-AFP2 AATCACTGAAGCCA---TCGACCCCGCAGCCCAAGCCGCTGCAGCCGCAG 47 wfh-5a C..T..........---..............A.......C.......... 47 wfh-AFP1 C.............ACA..............AG......A.......... 50 I T E A N I D P A A R A A A A A A A A A A V V T A A D A A A A A A stfh-AFP2 CCGCCGCCGCCGCCGTTGTCACCGCCGCCGACGCTGCCGCAGCCGCCGCC 97 wfh-5a ....A...A....A........T.....A.C................... 97 wfh-AFP1 .A...T..AAA....CA.............................T... 100 A A S K A A V T A A D A A A A A A N A A A N A A A V A A A T A A D V stfh-AFP2 AACGCCGCCGCCAACGCCGCCGCAGTCGCTGCGGCCACCGCCGCCGACGT 147 wfh-5a GC.AT........C.................G...............T.C 147 wfh-AFP1 .C.AT......ATC.......T................T..T..T..T.A 150 T I A A S A A S V A A A T A A D D ******maxi5'mid****** A T A S I A T I K A N A A A A A stfh-AFP2 CGCCACCGCATCCATAGCAACCATCAAAGCCAATGCCGCCGCCGCAGCAG 197 wfh-5a ....G...............G......C.....CA............... 197 wfh-AFP1 ....G......................C...GCAT....T.....GAA.T 200 A A A S I A T I N A A S A A A K A T A A A A A I A A E E A A T A A stfh-AFP2 CCACCGCCGCTGCCGCAGCAATAGCAGCCGAGGAAGCCGCAACCGCGGCT 247 wfh-5a ...T......C..........A.........A........CG..A.C..C 247 wfh-AFP1 ...T......C...........G......A....CA.T..CG....C..G 250 S I A A A A A M A A K D T A A A A A T A A A A A A A T A A T A Q A A stfh-AFP2 GCTACCGCCGCAGCTGCTGCCGCCGCCACCGCCGCCACAGCCCAGGCAGC 297 wfh-5a ..C.....T..C..CA.AA..........A..............A...A. 297 wfh-AFP1 ..C.G......C..C..C....TT...T.........A.....TA.A.A. 300 A S A A A A A V A S A A K A L E T I F D K A A A A A S T T A T T A stfh-AFP2 CATCTTTGACAAAGCCGCAGCCGCCGCATCCACAACCGCCACCACCGCCG 347 wfh-5a ....AAG..T.....G............G..T.C..........A..... 347 wfh-AFP1 ....AAC.T..........TA......TG....C.....T.AT.....T. 350 I N V K A A Y A A A T T A N T A

125

**** A T A A A A T A T T A A A A A A A stfh-AFP2 CCACGGCGGCCGCCGCCACAGCCACCACCGCCGCAGCCGCAGCCGCAGCC 397 wfh-5a ..G....C..G........C....................C....T.... 397 wfh-AFP1 .TG.C..C...........C..............T.....C......... 400 A A A A A A T A T T A A A A A A A **maxi3'mid****** T E T I D K A A A A A A A A A A T stfh-AFP2 ACAGAAACCATCGACAAAGCCGCCGCAGCCGCCGCAGCCGCAGCCGCCAC 447 wfh-5a .A.AC............G........C....TG....TT..G.C...... 447 wfh-AFP1 .A..C............C........T...AAG..........T...... 450 K A T I D N A A A A K A A A V A T A V A T A A A A A A T A A A T A stfh-AFP2 CGCCGTGGCAACTGCCGCAGCCGCCGCAGCCACCGCCGCCGCCACCGCCG 497 wfh-5a ..........G...........A....C.....T..............T. 497 wfh-AFP1 ......TT..GA..........A....C.....T.........GTT.... 500 A V S D A A A T A A T A A A V A A A T L G A A A A K A A A T A V A stfh-AFP2 CCGCAACCCTCGGGGCTGCCGCCGCAAAAGCCGCAGCCACCGCAGTCGCT 547 wfh-5a ....................A.A.T..................C...AA. 547 wfh-AFP1 .T..........AA..........................T......T.. 550 A A T L E A A A A K A A A T A V S A A A A A A I A A A A A A A A P P stfh-AFP2 GCCGCCGCAGCCGCTGCCATCGCTGCTGCCGCCGCCGCCGCCGCCCCCCC 597 wfh-5a ........C........AGCA..CA......................... 597 wfh-AFP1 ......---........TGC...C..C...AT....TT......TG.... 597 A A A A A A A A A I A F A A A P * stfh-AFP2 TTAAGGATCGTGGTCGTCTTGATGTGGGATCATGTGAACATCTGAGCAGC 647 wfh-5a A................................................. 647 wfh-AFP1 A...............................................AT 647 * ###### stfh-AFP2 GAGATGTTACCAATCTGCTGAATAAACCTGAGAAGCTGTTTGTTG 692 wfh-5a ........................... 674 wfh-AFP1 .....A.C.........T........G.................T 692

126

Appendix J

DNA alignment of winter and starry flounder skin AFPs

An alignment of the starry flounder skin AFP from lambda insert #2 (stfs-AFP8, Figure 17) and

the winter flounder skin variant wfs-11-3 is shown. Identical bases are marked with a dot (.) and

gaps with a hyphen (-). Intronic and flanking sequences are in lower case, while exonic sequence

is in upper case. The coding region is bolded with the translated sequences of stfs-AFP8 or wfs-

11-3 above or below the respective gene sequence, and silent substitutions are underlined. The

number of nucleotides is noted at the end of each row. The putative TFIID binding site ( | ), the

transcriptional start sites (+), and the polyadenylation signal (#) are marked based on previous

characterization of the winter flounder skin AFPs. The intronic enhancer, Element S, is in white

text and highlighted in black [59]. The Genbank accession number for wfs-11-3 is M63478.

127

stfs-AFP8 gttacaaaacaagttcatactggcctggatgttcgccacaccttcctttt 50 wfs-11-3 .......................-----...g.t.............g.. 45 stfs-AFP8 gtggtgaaccagtcggagccgacaacatgctgcgtcacaaactcgaagtg 100 wfs-11-3 .at....................gc.c...........g..a..a..... 95 ||||||| ++ stfs-AFP8 aataaataagggatgctccctaaaggttttcatcaggactcaaccACTTT 150 wfs-11-3 ........ga..c...........a..................a...... 145 stfs-AFP8 TCACTGTCGAACACTCAGgtaagtgaacactcactttatttagcaccgca 200 wfs-11-3 ..........C..........c...............g...--------- 186 stfs-AFP8 cgtgcccataactatgttactgtatatgttttgttctatattgtttttat 250 wfs-11-3 -------------------------------------------------- 186 stfs-AFP8 attgttgttttatgtgcagtgcaccaacgacaccaaggcaatttcctgta 300 wfs-11-3 -------------------------------------------------- 186 stfs-AFP8 tgtgcaaacatacttggcaaataaaataattctgattctgattctacaaa 350 wfs-11-3 ----------------------------------------c.ca...... 196 stfs-AFP8 tctggttt-actgtaaatatcttgggaaggaaggaaggatatctgcatta 399 wfs-11-3 ........t......................................... 246 stfs-AFP8 tcccagaggggccatttgttttacagccagcggtaaaagatgaagatctt 449 wfs-11-3 ...t-.............................g............... 295 stfs-AFP8 caaccgtgttcgtctgatggaaagtttgttctgaaac------------- 486 wfs-11-3 ..t..a...............................cttcagtggaaga 345 stfs-AFP8 -------------------------------------------------- 486 wfs-11-3 aacagattcatgtcttcaggcttaaacctgcaaaaatctgagctctgtta 395 stfs-AFP8 ---------------------------------actgtaaaactctttta 503 wfs-11-3 aatcatgggaaacaactttttaattcagtcaggg...g......a..... 445 stfs-AFP8 tatgaacagaagaagaagaagtgatctttagttcatcactgtggaaacat 553 wfs-11-3 ....c..................................ca......... 495 stfs-AFP8 cagcagcagttaaagtctgtctgcttcagtatcactggccagttccagtg 603 wfs-11-3 ..t................................c.............. 545

128

stfs-AFP8 ctcatgtttctgatcagcttggtttgaatgatatgaaa-cggatggagtc 652 wfs-11-3 ......................................a........... 595 stfs-AFP8 cctgtttgaccctgtttaacacaagat---------ggaccatctttatt 693 wfs-11-3 ...........................tggacgcat.............. 645 stfs-AFP8 aacataatgttttacatgagcacttcctgttttcagccctaaacctaaag 743 wfs-11-3 t................c..........................t..... 695 stfs-AFP8 aggcctcatagaaacttcctgatgatctggtgacacctgctggttgaagg 793 wfs-11-3 .........g........................................ 745 stfs-AFP8 aaacagagtttgagagtcagaagaacaaatgattttagtttgaaacaaga 843 wfs-11-3 ................g...c.......................tg.... 795 stfs-AFP8 agctgtcatttgatattatgtt-gtgggggggggggcgggggtggtcatc 892 wfs-11-3 ...........t..t....a..t.ga..........g....---a...c. 842 stfs-AFP8 acacacagatattgaacactgtcatcactgggttcagtgaaagtgacgga 942 wfs-11-3 ...................................g..........a.a. 892 stfs-AFP8 ccagtacatgttgtgataaataat---atcataataattataat-aatac 988 wfs-11-3 ..................t.....att.................t..... 942 M D A P A A stfs-AFP8 cattaatctctgcagAATCACTGACATCAACATGGACGCCCCAGCCGCCG 1038 wfs-11-3 .......................................A......AA.. 992 M D A P A K A A A A T A A A A K A A A E A T A stfs-AFP8 CCGCCGCAGCCACCGCCGCCGCCGCCAAAGCCGCCGCAGAAGCCACCGCC 1088 wfs-11-3 ............................G..................... 1042 A A A A T A A A A K A A A E A T A A A A A K A A A A T K A A A A R * stfs-AFP8 GCCGCAGCTGCCAAAGCAGCAGCCGCAACCAAAGCTGCCGCAGCCCGTTA 1138 wfs-11-3 ..........................C........C..A........... 1092 A A A A K A A A A T K A A A A R * stfs-AFP8 ATGATCATGGTCGTCTTGATGTGGGATCATGTGAACATCTGAGCAGCGAG 1188 wfs-11-3 ......G........................................... 1142

129

###### stfs-AFP8 ATGTTACCAATCTGCTGAATAAACCTGAGAAGCTGTTTGTTGAaaaccaa 1238 wfs-11-3 .........................................T........ 1192 stfs-AFP8 gtgtccagttcatttcatctctgaaactccttcacactttctgtagatca 1288 wfs-11-3 ......t......................a.................... 1242

130

Appendix K

Genomic Southern blot probed with hyperactive AFP cDNAs from the winter flounder and

starry flounder

Genomic DNA (10 µg) from individual fish was digested with SacI or BamHI for winter

flounder, or with SacI or EcoRI for starry flounder. The four starry flounders (1, 4, 6, 9) were

obtained from location 4 (Figure 4). Digested DNA was electrophoresed on a 0.8% agarose gel,

blotted onto nylon and probed with the winter flounder hyperactive isoform (A) or the starry

flounder hyperactive isoform (B). The positions of DNA size markers (kb) are indicated on the

left.

131

Win

ter

flo

un

der

Sta

rry

flo

un

der

Sac

I

EcoR

I1

46

91

46

9

Win

ter

flo

un

der

Sta

rry

flo

un

der

Sac

I

EcoR

I

AB

10 0.75233.5456810 0.752333.54568

14

69

14

69

Bam

HI

SacI

Bam

HI

SacI