Mitochondrial DNA variation in Koryaks and Itel'men ... · Mitochondrial DNAVariation in Koryaks...

39
Mitochondrial DNA Variation in Koryaks and Itel’men: Population Replacement in the Okhotsk Sea–Bering Sea Region During the Neolithic THEODORE G. SCHURR, 1,2 REM I. SUKERNIK, 3 YELENA B. STARIKOVSKAYA, 3 AND DOUGLAS C. WALLACE 1,2 * 1 Department of Anthropology, Emory University, Atlanta, Georgia 30322 2 Center for Molecular Medicine, Emory University, Atlanta, Georgia 30322 3 Laboratory of Human Molecular Genetics, Institute of Cytology and Genetics, Siberian Division, Russian Academy of Sciences, Novosibirsk 630090, Russia KEY WORDS haplotype; haplogroup; control region; sublineage; Kamchatka ABSTRACT In this study, we analyzed the mitochondrial DNA (mtDNA) variation in 202 individuals representing one Itel’men and three Koryak populations from different parts of the Kamchatka peninsula. All mtDNAs were subjected to high resolution restriction (RFLP) analysis and control region (CR) sequencing, and the resulting data were combined with those available for other Siberian and east Asian populations and subjected to statistical and phylogenetic analysis. Together, the Koryaks and Itel’men were found to have mtDNAs belonging to three (A, C, and D) of the four major haplotype groups (haplogroups) observed in Siberian and Native American populations (A–D). In addition, they exhibited mtDNAs belonging to hap- logroups G, Y, and Z, which were formerly called ‘‘Other’’ mtDNAs. While Kamchatka harbored the highest frequencies of haplogroup G mtDNAs, which were widely distributed in eastern Siberian and adjacent east Asian populations, the distribution of haplogroup Y was restricted within a rela- tively small area and pointed to the lower Amur River–Sakhalin Island region as its place of origin. In contrast, the pattern of distribution and the origin of haplogroup Z mtDNAs remained unclear. Furthermore, phylogenetic and statistical analyses showed that Koryaks and Itel’men had stronger genetic affinities with eastern Siberian/east Asian populations than to those of the north Pacific Rim. These results were consistent with colonization events associated with the relatively recent immigration to Kamchatka of new tribes from the Siberian mainland region, although remnants of ancient Beringian populations were still evident in the Koryak and Itel’men gene pools. Am J Phys Anthropol 108:1–39, 1999. r 1999 Wiley-Liss, Inc. Grant sponsor: Wenner-Gren Foundation for Anthropological Research; Grant numbers: GR 5745, GR 5680, GR 5680-S, GR 6077; Grant sponsor: The National Science Foundation; Grant number: 9414900; Grant sponsor: The Leakey Foundation; Grant sponsor: Sigma Xi; Grant sponsor: INTAS; Grant number: 96– 1766; Grant sponsor: Russian Fund for Basic Research; Grant number: 97–04–49816; Grant sponsor: J. Worley Brown Fellow- ship Fund; Grant sponsor: National Institutes of Health; Grant numbers: GM49615, NS21328, HL45572, AG10130, AG13154; Grant sponsor: Emory Clinical Research Center; Grant number: BR00039. *Correspondence to: Douglas C. Wallace, Center for Molecu- lar Medicine, Emory University School of Medicine, 1462 Clifton Road N.E., Atlanta, GA 30322. E-mail: [email protected] Received 20 May 1997; accepted 3 October 1998. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 108:1–39 (1999) r 1999 WILEY-LISS, INC.

Transcript of Mitochondrial DNA variation in Koryaks and Itel'men ... · Mitochondrial DNAVariation in Koryaks...

Mitochondrial DNA Variation in Koryaks and Itel’men: PopulationReplacement in the Okhotsk Sea–Bering Sea RegionDuring the Neolithic

THEODORE G. SCHURR,1,2 REM I. SUKERNIK,3YELENA B. STARIKOVSKAYA,3 AND DOUGLAS C. WALLACE1,2*1Department of Anthropology, Emory University, Atlanta, Georgia 303222Center for Molecular Medicine, Emory University, Atlanta, Georgia 303223Laboratory of Human Molecular Genetics, Institute of Cytology andGenetics, Siberian Division, Russian Academy of Sciences,Novosibirsk 630090, Russia

KEY WORDS haplotype; haplogroup; control region; sublineage;Kamchatka

ABSTRACT In this study, we analyzed the mitochondrial DNA (mtDNA)variation in 202 individuals representing one Itel’men and three Koryakpopulations from different parts of the Kamchatka peninsula. All mtDNAswere subjected to high resolution restriction (RFLP) analysis and controlregion (CR) sequencing, and the resulting data were combined with thoseavailable for other Siberian and east Asian populations and subjected tostatistical and phylogenetic analysis. Together, the Koryaks and Itel’menwere found to have mtDNAs belonging to three (A, C, and D) of the four majorhaplotype groups (haplogroups) observed in Siberian and Native Americanpopulations (A–D). In addition, they exhibited mtDNAs belonging to hap-logroups G, Y, and Z, which were formerly called ‘‘Other’’ mtDNAs. WhileKamchatka harbored the highest frequencies of haplogroup G mtDNAs,which were widely distributed in eastern Siberian and adjacent east Asianpopulations, the distribution of haplogroup Y was restricted within a rela-tively small area and pointed to the lower Amur River–Sakhalin Island regionas its place of origin. In contrast, the pattern of distribution and the origin ofhaplogroup Z mtDNAs remained unclear. Furthermore, phylogenetic andstatistical analyses showed that Koryaks and Itel’men had stronger geneticaffinities with eastern Siberian/east Asian populations than to those of thenorth Pacific Rim. These results were consistent with colonization eventsassociated with the relatively recent immigration to Kamchatka of new tribesfrom the Siberian mainland region, although remnants of ancient Beringianpopulations were still evident in the Koryak and Itel’men gene pools. Am JPhys Anthropol 108:1–39, 1999. r 1999 Wiley-Liss, Inc.

Grant sponsor: Wenner-Gren Foundation for AnthropologicalResearch; Grant numbers: GR 5745, GR 5680, GR 5680-S, GR6077; Grant sponsor: The National Science Foundation; Grantnumber: 9414900; Grant sponsor: The Leakey Foundation; Grantsponsor: Sigma Xi; Grant sponsor: INTAS; Grant number: 96–1766; Grant sponsor: Russian Fund for Basic Research; Grantnumber: 97–04–49816; Grant sponsor: J. Worley Brown Fellow-ship Fund; Grant sponsor: National Institutes of Health; Grantnumbers: GM49615, NS21328, HL45572, AG10130, AG13154;

Grant sponsor: Emory Clinical Research Center; Grant number:BR00039.

*Correspondence to: Douglas C. Wallace, Center for Molecu-lar Medicine, Emory University School of Medicine, 1462 CliftonRoad N.E., Atlanta, GA 30322.E-mail: [email protected]

Received 20 May 1997; accepted 3 October 1998.

AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 108:1–39 (1999)

r 1999 WILEY-LISS, INC.

In the past 10 years, we have conducted anumber of studies in which the mitochon-drial DNA (mtDNA) variation of aboriginalpopulations in Siberia and the Americas wasused to trace their origins and affinities(Wallace et al., 1985; Schurr et al., 1990;Torroni et al., 1992, 1993a,b, 1994a,b;Sukernik et al., 1996; Starikovskaya et al.,1998). These analyses showed that themtDNAs observed in contemporary NewWorld populations were a subset of Asianhaplotypes which consisted of primarily fourmtDNA lineages, or haplogroups, desig-nated A, B, C, and D. Within these mtDNAlineages, only a small number of haplotypeswere found to be shared between Asian andNative American populations, suggestingthat a limited number of founders gave riseto ancestral Native American populations.The sequence divergence values for the hap-logroups present in Native Americans fur-ther indicated that ancestral populationsbringing at least haplogroups A, C, and Darrived early in the New World, around35,000–25,000 years before present (YBP),and that haplogroup B might represent asecond, later migration which contributedmtDNAs to the genetic stock of Amerindians(Torroni et al., 1992, 1993a, 1994a).

Other studies of mtDNAvariation inAsianand Native American populations are atvariance with some of these interpretations.Ward et al. (1991) and Horai et al. (1993)argued that extensive bottlenecks had notcaused limited mtDNA variation among Na-tive American groups since they observedcontrol region (CR) sequence diversity withinNative American tribes that was similar tolevels found in Asian populations. They alsodetected four major clusters of CR sequencesequivalent to haplogroups A–D, althoughHorai et al. (1993) claimed that each hap-logroup represented a separate migrationinto the New World which came between21,000 and 14,000 YBP. On the other hand,Shields et al. (1993) proposed a late entrytime (16,000–14,000 YBP) of ancestral Amer-indians into the New World, along with thelater expansion of northern populations inthe circumarctic region. Moreover, since hap-logroups A–D are present in most Native

American groups, it was argued that all ofthese mtDNA lineages were brought to theAmericas in a single migratory wave (Bail-liet et al., 1994; Merriwether et al., 1994,1995; Kolman et al., 1996).

When the mtDNA variation in native Sibe-rian populations was initially surveyed, onlythree of the four Asian mtDNA haplogroups(A, C, and D) were detected in these groups(Torroni et al., 1993a). With only a fewexceptions (Petrishchev et al., 1993;Sukernik et al., 1996; Derenko and Shields,1998), Siberian groups lacked haplogroup BmtDNAs. Central-east Asian populations,by contrast, had haplogroup B mtDNAs atpolymorphic frequencies (Horai and Matsu-naga, 1986; Ballinger et al., 1992; Hariharaet al., 1992; Kolman et al., 1996; Merri-wether et al., 1996), suggesting that thismtDNA lineage arose in this area of Asia.The mtDNA data also revealed the presenceof additional haplogroups in northern Asiain the form of ‘‘Other’’ (non–haplogroup A–D)haplotypes, which appeared in all nativeSiberian groups except for the Siberian Eski-mos (Torroni et al., 1993b; Sukernik et al.,1996). Some of these ‘‘Other’’ haplogroupsappeared to be related to previously ob-served east Asian mtDNAs (Horai et al.,1984; Horai and Matsunaga, 1986; Ballingeret al., 1992; Torroni et al., 1993b), whereasthe lineal associations of the remainder wereunclear. Similarly, ‘‘Other’’ mtDNAs havebeen observed in Native American popula-tions (Bailliet et al., 1994; Merriwether etal., 1995; Easton et al., 1996; Lorenz andSmith, 1996). However, it is unclear whetheror not the Native American and Siberian‘‘Other’’ mtDNAs are related to each other.

Because of the Kamchatka peninsula’sgeographic proximity to the Bering Straitregion, the Aleutian Islands, and the KurileIslands and northern Japan, the aboriginalpopulations inhabiting that peninsula maybe important for clarifying northern Asianprehistory and the colonization of the NewWorld. It was also possible that these popu-lations were genetically influenced by thehypothesized migration which brought hap-logroup B mtDNAs to the Americas (Torroniet al., 1992, 1993a,b). In addition, an analy-sis of Koryak and Itel’men mtDNA variationmight also delineate the origins and affini-

2 T.G. SCHURR ET AL.

ties of the ‘‘Other’’ haplotypes which hadpreviously been observed in eastern Sibe-rian and Native American populations (Tor-roni et al., 1993b).

However, aside from the classical anthro-pological surveys of aboriginal populationsinhabiting eastern Siberia (Debets, 1951;Levin, 1958), few studies have attempted todelineate the biological variation of Kam-chatkan groups. While some analyses ofmtDNA variation in northeast Siberiangroups have provided little conclusive evi-dence for population affinities in this region(Malyarchuk et al., 1994; Derenko andShields, 1998), our initial analysis of Sibe-rian populations revealed the presence ofhaplogroups A, C, and D in the Chukchi and‘‘Koryaks’’ (Torroni et al., 1993b), implying aclose linkage between them. However, noadditional associations could be made be-cause the data were obtained through par-tial haplotype analysis. In addition, few ifany genetic studies of the Itel’men havebeen conducted, possibly due to their havingsignificant levels of nonnative admixturewith Russians.

To further elucidate the genetic affinitiesof eastern Siberian populations and theirrole in the peopling of the New World, weconducted a detailed molecular analysis ofthe mtDNA variation in Koryaks andItel’men groups from the Kamchatka penin-sula. The data obtained through high resolu-tion restriction fragment length polymor-phism (RFLP) analyses and CR sequencingwere combined with similar data sets fromthe Chukchi and Siberian Eskimos ofChukotka, the Nivkhs and Udegeys of thelower Amur–northern Sakhalin region, andthe Evenks of interior Siberia and subjectedto statistical and phylogenetic analyses. Ourresults genetically reflect the recent immi-grations to Kamchatka of ancestral Paleoasi-atic populations from the Siberian mainlandand their nearly complete replacement ofthe ancient Beringian populations whichformerly inhabited this region.

POPULATIONSThe Koryaks

As noted by Stepan Krasheninnkov (1972:193–195) in Opisanie Zemli Kamchatki (De-scription of the Land of Kamchatka [St.

Petersburg, 1754]), the Koryaks were ‘‘di-vided into two nations, one called the rein-deer Koryak, the other the settled Koryaks.’’The sedentary (Maritime) Koryaks estab-lished permanent settlements along riversflowing into the Sea of Okhotsk and theBering Sea and subsisted to varying degreeson hunting small sea mammals, fishing, andgathering plant and animal species from thelittoral zone (Jochelson, 1908; Antropova,1964a; Krashenninnkov, 1972). Their popu-lation was traditionally subdivided into eightterritorial and dialectical subgroups whosemembers spoke different dialects of Paleoasi-atic, or Chukotko-Kamchatkan, languages(Skorik 1965; Krauss, 1988).

At the turn of the seventeenth century,Maritime Koryaks occupied northern Kam-chatka and the northeastern Okhotsk Searegion (Jochelson, 1908; Antropova, 1964a).However, the expansion of the Evens alongthe Okhotsk Sea coast in the seventeenthand eighteenth centuries gradually reducedthe territory occupied by Maritime Koryaks(Levin and Vasiliev, 1964) and forced thoseinhabiting this region to shift to reindeerbreeding—hence a nomadic way of life (Joch-elson, 1908; Levin and Vasiliev, 1964; Aruti-unov, 1988). By the turn of the twentiethcentury, these Reindeer Koryaks inhabitedthe forest tundra zone of northwestern Kam-chatka and the Penzhina River basin andthe mountain tundras in the northeasternpart of the Kamchatka mainland (Jochelson,1908;Antropova, 1964a). Interestingly, Rein-deer Koryaks spoke a language that wasnearly unintelligible to Maritime Koryaks,while Reindeer Koryak was close enough toChukchi in vocabulary and morphology thatit was mutually intelligible to native Chuk-chi speakers (Antropova, 1964a; Krashenin-nikov, 1972; Vdovin, 1973; Arutiunov, 1988;Krauss, 1988).

Today, the majority of Koryaks residewithin the borders of the Koryak Autono-mous Region, which lies between 56° and65°N and 158° and 174°E in northeasternSiberia (Fig. 1). According to the 1989 All-Union census, the total number of Koryaksin the KoryakAutonomous Region was 6,572,with approximately half of these being Rein-deer Koryaks, whereas a century ago theirpopulation numbered 7,284, of whom 2,913

3MTDNA VARIATION IN KORYAKS AND ITEL’MEN

were Reindeer Koryaks (Krushanov, 1993).These figures indicate that the Koryakshave largely maintained their tribal integ-rity since the Russian entry into northeastSiberia.

The Itel’men

During the initial period of Russian coloni-zation, the Itel’men inhabited much of theKamchatka peninsula. Their populations ex-tended from its southern tip, where Ainupopulations resided, to the north around theUka and Tigil’ Rivers, where they inter-spersed with Koryaks, with their main popu-lation being concentrated in the KamchatkaRiver basin (Antropova, 1964b; Krasheninni-kov, 1972) (Fig. 1). Those groups living in theinterior subsisted primarily through fishingalong the rivers running through the penin-sula, although those inhabiting the eastcoast also hunted sea mammals and those inthe southern tip of the peninsula huntedwhales (Antropova, 1964b; Krasheninnikov,1972). Itel’men populations were also di-vided by language into territorial subgroupswhich resided in the western, central/eastcoast, and southern parts of the peninsula(Antropova, 1964b; Arutiunov, 1988) (Fig. 1).Although sometimes considered part of thePaleoasiatic linguistic group, the Itel’menlanguage is distinct from the Koryak lan-guage and may have become similar to itthrough population contact and linguisticborrowing (Krasheninnikov, 1972; Vdovin,1973; Arutiunov, 1988; Krauss, 1988; Krus-hanov, 1990).

Since contact with Russians, Itel’menpopulations have declined precipitously, fall-ing from approximately 12,000 individualsin the early eighteenth century to only 814persons reported in the 1926–27 census(Antropova, 1964b). During this time, theybecame increasingly culturally and geneti-cally assimilated with Russians such thatonly a small subdivision of their originalpopulation has survived into this century.Its members live primarily in the southwestcorner of the Koryak Autonomous Region,where they have retained their native lan-guage and ethnic identity (Antropova, 1964b;Arutiunov, 1988).

In this context, it should be noted that thesedentary natives of southern Kamchatka

were referred to as Kamchadals throughoutthe colonial period. Due to extensive admix-ture with ethnic Russians, the term Kam-chadal was applied to both Itel’men andmixed Itel’men-Russian individuals as wellas the descendants of Russian Cossacks andpeasants who settled in Kamchatka in theeighteenth and nineteenth centuries (Antro-pova, 1964b; Vdovin 1973; Arutiunov, 1988;Murashko, 1994) and thus did not clearlydiscriminate between the indigenous andimmigrant populations of this region. Dur-ing the early Soviet period, national re-gionalization created new administrativeborders which officially divided these Kam-chadal descendants into two groups: theItel’men, who lived within the boundaries ofthe Koryak Autonomous Region, and theKamchadals, who lived south of the bordersof the Koryak Autonomous Region, withmembers of the latter group being classifiedas ‘‘russified’’ and deprived of their ‘‘smallindigenous people’’ status (Arutiunov, 1988;Murashko, 1994).

Sample collection

In July–August 1993, blood samples werecollected from 104 Koryaks residing in threegeographically proximate villages, Karaga,Ossora, and Tymlat, located in the Karagin-skiy District of the Koryak AutonomousRegion. Almost all of the Koryaks living inOssora and Tymlat (57) represented theAluitor subgroup. Of this total, 27 and 12individuals were born in the villages ofRekinniki and Anapka, respectively, both ofwhich were closed in the early 1960s. Onceconsidered to speak a distinct language(Skorik, 1965), the Aluitor were the largestterritorial group of Koryaks and occupiedthe whole Kamchatka isthmus and adjacentBering Sea coast, combining small-scalereindeer herding with sea mammal huntingand fishing (Arutiunov, 1988).

Whereas approximately half of the Kara-ginskiy District sample consisted of AluitorKoryaks, the remaining half representedthe Karagin Koryaks. Speaking a slightlymodified dialect of the Aluitor language(Vdovin, 1973), the Karagin Koryaks tradi-tionally occupied the territory south of Tym-lat, including Karaginskiy Island and settle-ments scattered along the Bering Sea coast

4 T.G. SCHURR ET AL.

Fig. 1. A map of Kamchatka showing the locations of the villages in which fieldwork was conducted in1993 and 1996 (with arrows) and other traditional settlements of Koryak and Itel’men, the majority ofwhich no longer exist. Inset: The traditional territories of the Koryaks (grey) and Itel’men (white) aroundthe beginning of the eighteenth century, with the geographic locations of the dialectic subgroups for eachpopulation indicated on the Kamchatka peninsula.

5MTDNA VARIATION IN KORYAKS AND ITEL’MEN

down to the village of Uka on the UkinskayaInlet. However, they now live in the villagesof Ossora and Karaga. Although no longer inexistence, some of these traditional villages(including Ivashka and Uka) were once occu-pied by Itel’men, who apparently mergedwith the Karagin Koryaks by the middle ofthe nineteenth century (Antropova, 1964a;Vdovin, 1973; Krushanov, 1993).

In June 1996, blood samples were ob-tained from 51 Koryak and 47 Itel’menresiding in the villages of Voyampolka andKovran, which are located in the Tigil’skiyDistrict of the Koryak Autonomous Region.The Voyampolka sample was comprised ofpersons having mixed Maritime and Rein-deer Koryak origins but who could be consid-ered to belong to the Palan subgroup (Krash-enninnkov, 1972; Vdovin, 1973; Krushanov,1993). The Palan Koryaks traditionally livedin several settlements scattered along theOkhotsk Sea coast of the peninsula betweenVoyampolka in the south and Lesnaya in thenorth but now reside mostly in these twovillages.

The Itel’men samples were obtained fromindividuals living in Kovran. All of thesepersons were born in or derived from one of anumber of traditional settlements scatteredalong the Okhotsk Sea coast between Sopoch-noye in the south and Tigil’ in the north,including the villages of Kovran, Napana,Utkholok, Moroshechnoye, Belogolovoye, andVerkhneye Kharyuzovo. With the exceptionof Kovran and Verkhneye Kharyuzovo, noneof these former Itel’men villages exist today(Fig. 1).

Blood samples were collected from eachparticipant with informed consent in twosets of 10 ml ACD anti-coagulant tubes andkept refrigerated in the local hospitals untilshipped or brought back to Atlanta for mo-lecular genetic analysis. All individuals whoparticipated in these studies were inter-viewed about their family histories, which inturn were verified by senior members of thecommunity for accuracy. Only those personswho lacked maternal and paternal Russianor nonrelated ancestry through three genera-tions were selected for the collection of bloodsamples, although samples were also ob-tained from four Evens who were the mari-tal partners of Koryak participants. Based

on these genealogical data, we estimatedthat approximately half of the Koryaks andmost of the Itel’men are of mixed Russian-Koryak or Russian-Itel’men ancestry, respec-tively, and consider themselves Koryak orItel’men by nationality primarily because oftheir maternal ancestry.

METHODS

Blood sample processing

All blood samples were processed at EmoryUniversity at the Clinical Research Center.Two 10 ml tubes of blood from each indi-vidual were separated into their constituentcellular fractions through low-speed centrifu-gation. Lymphocytes were separated fromthe cellular fraction, and the residual plate-let-rich plasmas from the centrifuged speci-mens were subsequently centrifuged in 15ml Corning (Corning, NY) tubes at 5,000rpm and 10°C for 20 min to precipitate theplatelets, which were subsequently extractedfor mtDNAs (Torroni et al., 1992).

High resolution restriction analysis

All Koryak and Itel’men DNA sampleswere subjected to high resolution restrictionanalysis. The entire mtDNAs of thesesamples were polymerase chain reaction(PCR) amplified in nine partially overlap-ping segments using standard oligonucleo-tide primers and PCR conditions (Torroni etal., 1992, 1993a). Each PCR segment wassubsequently digested with 14 restrictionenzymes (AluI, AvaII, BamHI, DdeI, HaeII,HaeIII, HhaI, HincII, HinfI, HpaI, HpaII,MboI, RsaI, and TaqI) to screen approxi-mately 20% of the mtDNA sequence perindividual. The resulting restriction frag-ments were resolved by electrophoresis in1–2.5% NuSieve plus 1.0% SeaKem agarose(FMC BioProducts, Rockland, ME) gels andvisualized by ethidium bromide staining.The restriction fragment length polymor-phisms (RFLPs) detected by this analysiswere mapped by the sequence comparisonmethod (Johnson et al., 1983; Cann et al.,1984), with the combination of all RFLPsidentified in a mtDNA defining its completehaplotype.

6 T.G. SCHURR ET AL.

Control region sequencing

A total of 525 bp (np 16000–16525) encom-passing CR hypervariable segment I (CR)was dideoxy-sequenced in 155 Koryak and47 Itel’men mtDNAs. For each mtDNA, theCR was read in both directions by double-stranded sequencing of PCR products withan ABI 377A Automated DNA Sequencer(Applied Biosystems, Foster City, CA).Double-stranded PCR (ds-PCR) segmentsencompassing the entire control region (1,121bp) were amplified with primers complemen-tary to the light (np 15978–15997, 58-CTACGCCAATCACTTTATTG-38) and heavy(np 429–408, 58-CTGTTAAAAGTGCATACC-GCC-38) strands of the mtDNA, and thenpurified with Centricon-100 microconcentra-tors (Amicon, Inc., Danvers, MA) to removeexcess primers and dNTPs. The purifiedds-PCR fragments were sequenced with theTaq DyeDeoxy Termination Cycle SequenceKitTM (Perkin Elmer, Oak Brook, IL) usingprimers complementary to the light (np15978–16000, 58-ACCATTAGCACCCAAAG-CTA-38; np 16225–16244, 58-CAACTATCA-CACATCAACTG-38) and heavy (np 16421–16402, 58-TGATTTCACGGAGGATGGT-38)strands. The excess of DyeDeoxyTM termina-tors was removed from the completed se-quencing reactions by using Centri-SepTM

columns (Princeton Separations, Adelphi,NJ), which were then run on 4% polyacryl-amide/6 M urea/13 Tris-Borate-EDTA(TBE)gels. The resulting sequence data were col-lected and analyzed using the SEQED soft-ware included in the ABI 377A DNA Se-quencer. Alignments and comparisons of theCR sequences were performed using theSequencher 3.0 software tool (Gene CodesCorporation, Ann Arbor, MI).

Statistical analysesGene diversity. To quantify the amountof mtDNA diversity within each populationirrespective of the phylogenetic relation-ships between different haplotypes, we esti-mated the gene diversity, or heterozygosity(h), of these groups from both RFLP haplo-type and CR sequence data using equation8.5 from Nei (1987),

h 5(1 2 S xi

2) · n

n 2 1

where xi is the frequency of each mtDNAhaplotype and n is the number of haplotypespresent in the population. The variance[V(h)] of each gene diversity estimate wasalso calculated using the formula

V(h) 52~ 3S xi

32 (S xi2)24

n

Probability of identity. Similarly, therelative genetic similarity of individualswithin native Siberian populations was as-sessed by estimating the probability of iden-tity (p) of any two mtDNAs, using the for-mula

p 5 on

x1 ? x2

where n is the total number of haplotypesand x1 and x2 are the frequencies of any twounique haplotype within a population. Inaddition, the relative genetic similarity ofSiberian populations was assessed by esti-mating the probability of identity (p) be-tween them using the formula

p 5 oi , j

n

x i ? xj

where xi and xj are the frequencies of ashared mtDNA haplotype in populations iand j summed over the n haplotypes ob-served in the two populations (Nei, 1987).

Maximum likelihood estimates. Themean intra- and intergroup sequence diver-gence of the Koryaks and Itel’men as well asthe other native Siberian (Torroni et al.,1993b; Starikovskaya et al., 1998) popula-tions characterized by high resolution restric-tion analysis were estimated from RFLPhaplotype data with the maximum likeli-hood (ML) procedure of Nei and Tajima(1983). Similar sequence divergence esti-mates were calculated for all of the majorhaplogroups present in eastern Siberians(A, C, D, G, Y, Z) and Native Americans(A–D) using the RFLP haplotype data fromthis and published studies (Torroni et al.,1992, 1993a,b, 1994a,b; Huoponen et al.,1997; Starikovskaya et al., 1998). Whencalculating the divergence times for intra-and intergroup variation as well as for indi-

7MTDNA VARIATION IN KORYAKS AND ITEL’MEN

vidual haplogroup variation, we used amtDNA evolutionary rate of 2.2–2.9% permillion years (MYR) (Torroni et al., 1994a).

Nucleotide diversity. The average nucle-otide diversity within and between popula-tions was estimated with the Sendbs pro-gram (N. Takezaki; http://iubio.bio.indiana.edu), which uses the method of Nei and Jin(1989) to estimate pairwise diversity values.In this analysis, diversity estimates werecalculated from the CR sequences from Sibe-rians (Torroni et al., 1993b; Starikovskayaet al., 1998; this study), east Asians (Horaiet al., 1996), and Native Americans (Ward etal., 1991, 1993; Shields et al., 1993). Severaldifferent DNAdistances were calculated fromthe diversity estimates, and standard errors(S.E.) of each value were obtained by boot-strapping over all sites using 500–1,000replications, with the 95% confidence inter-val (C.I.) for the diversity and divergenceestimates being calculated by using 62 S.E.In addition, neighbor-joining (NJ) (Saitouand Nei, 1987) trees were generated fromthe different genetic distances estimatedfrom the nucleotide diversity values. Sinceall distances gave NJ trees which showedthe same relationships among the popula-tions being studied, the one generated fromthe Kimura two-parameter (1980) distancemethod is presented here.

Phylogenetic analyses

The evolutionary relationships betweenthe complete haplotypes of the Koryaks,Itel’men, and other native Siberians (Tor-roni et al., 1993b) were inferred by parsi-mony analysis. Maximum parsimony (MP)trees were generated from the completehaplotype data through heuristic searchesusing the tree bisection and reconnection(TBR) branch-swapping algorithm with therandom addition of taxa in PAUP (version3.1.1) (Swofford, 1994). All MP trees wererooted from three African haplotypes, AF71(Chen et al., 1995), TYPE-5, and HYPANC(Cann et al., 1987), which had the HpaI np3592 site gain defining African macrohap-logroup L (Chen et al., 1995). After all runs,strict and 50% majority rule consensus treeswere generated from the saved MP trees to

determine the consistency of the branchingarrangements.

Parsimony trees were also generated fromthe Siberian CR sequence data with theDNAPARS program in PHYLIP 3.572 (Fel-senstein, 1994). To test the results of thisanalysis, all CR sequences were bootstrappedover 500 replicates using SEQBOOT and theresulting data files run in DNAPARS togenerate unrooted parsimony trees. The par-simony trees were then used to generatestrict and 50% majority rule consensus treeswith CONSENSE, with the 50% majorityrule consensus tree providing approximatebootstrap values for each branch of the tree(Felsenstein, 1994). Although DNAPARSproduces a single tree from any particulardata set, outgroup sequences can be used toroot the tree, and the AF62 CR sequence(Chen et al., submitted) was used for thispurpose.

Phylogenies of Siberian CR sequenceswere also inferred with the NJ method. Thismethod was used because it is known toreconstruct correct phylogenetic trees with ahigh probability when analyzing closely re-lated samples (Saitou and Imanishi, 1989).Rooted and unrooted NJ trees were gener-ated from genetic distances estimated withmodels available in DNADIST in PHYLIP,including the Kimura two-parameter(Kimura, 1980), Jukes and Cantor (1969),and DNAML (Felsenstein, 1994) methods.The data sets used to generate these NJtrees included CR sequences from only Sibe-rian populations (Torroni et al., 1993b; Stari-kovskaya et al., 1998; this study) and thosefrom both Siberian and east Asian (Torroniet al., 1993b; Horai et al., 1996) populations,with the robustness of all trees being checkedby bootstrapping using the algorithms(SEQBOOT and CONSENSE) available inPHYLIP (Felsenstein, 1994).

For population comparisons, NJ trees wereconstructed from ML values estimated withthe Nei and Tajima (1983) method from highresolution RFLP haplotypes in Siberianpopulations (Fig.2). Similarly, NJ trees wereconstructed from genetic distances esti-mated from haplogroup frequencies in na-tive Siberian and Asian populations ana-lyzed by high resolution RFLP methods usingthe methods (Cavalli-Sforza and Edwards,

8 T.G. SCHURR ET AL.

Fig. 2. A map of northeast Asia with the geographic locations of the Siberian and east Asianpopulations analyzed or compared in this study.

9MTDNA VARIATION IN KORYAKS AND ITEL’MEN

1967; Nei, 1972; Reynolds et al., 1983) avail-able in GENDIST (Felsenstein, 1994). Eachhaplogroup was considered a distinctive ‘‘al-lele’’ since all of the RFLPs defining thesehaplogroups were essentially linked and dis-tances calculated from the resulting ‘‘allele’’frequencies. The alleles used to estimategenetic distances between populations in-cluded haplogroups A, B, C, D, F, G, Y, Z and‘‘Other,’’ with ‘‘Other’’ haplotypes being fur-ther classified into three subgroups, the first(I) being defined by -DdeI np 10394, -AluI np10397, 6HaeIII np 16517, the second (II) by1DdeI np 10394, 6 HaeIII np 16517, andthe third (III) by 1DdeI np 10394, 1AluI np10397, 6 HaeIII np 16517.

RESULTSmtDNA haplogroups in Kamchatkan

populations

The high resolution RFLP analysis of 202Koryak and Itel’men mtDNAs revealed atotal of 22 distinct haplotypes defined by 48polymorphic sites (Table 1), some of which

had already been detected in Siberian popu-lations. Three of the four haplogroups (A, C,and D) observed in Native American popula-tions (Schurr et al., 1990; Torroni et al.,1992, 1993a) occurred in Kamchatkangroups, and these encompassed ,43% of allKoryak mtDNAs and 21% of Itel’menmtDNAs, with the majority of these belong-ing to haplogroup C. Consistent with previ-ous studies of northeast Asian populations,Kamchatkan groups also lacked haplogroupB mtDNAs, suggesting these mtDNAs werenever present in Paleoasiatic-speakinggroups. In addition, none of the Koryak orItel’men individuals exhibited mtDNAs fromhaplogroups typically seen in Europeanpopulations (Torroni et al., 1994d, 1996),indicating that they had not experiencednon-native gene flow through their mater-nal lineages.

Despite a third of their gene pool consist-ing of haplogroup A, C, and D mtDNAs, theKoryaks and Itel’men were not closely geneti-cally related to Native American groups.

TABLE 1. RFLP haplotypes in Kamchatkan populations1

Haplogroup Haplotype Polymorphic restriction sites

A SIB41 1663eSIB42 1663e, 25823a

C SIB26 110394c, 110397a, 213259o/113262a, 116517eSIB29 21715c, 110394c, 110397a, 213259o/113262a, 115606a, 116517eSIB45 110394c, 110397a, 213259o/113262aSIB64 13391e, 110394c, 110397a, 213259o/113262a, 116517eSIB65 18484a, 110394c, 110397a, 213259o/113262a, 116517eSIB66 11004h, 110394c, 110397a, 213259o/113262a, 116517e

D SIB40 25176a, 210180l, 110394c, 110397a, 113717a, 114923c, 115437eG SIB08 14830n/14831f, 18198a, 110394c, 110397a, 116517e

SIB35 11588f, 14830n/14831f, 18198a, 110394c, 110397a, 115494c, 116517eSIB36 14830n/14831f, 14923e, 18198a, 110394c, 110397a, 115494c, 116517eSIB37 14830n/14831f, 18198a, 110394c, 110397a, 115494c, 116517eSIB38 14830n/14831f, 18198a, 110394c, 110397a, 115494cSIB39 23315e, 14830n/14831f, 18198a, 110394c, 110397a, 115494c, 116517eSIB60 14830n/14831f, 18198a, 110394c, 110397a, 115494c, 216065g, 116517eSIB61 11588f, 14830n/14831f, 18198a, 110394c, 110397a, 115494c

Y SIB01 17933j, 28391e, 110394c, 116517eZ SIB43 110394c, 110397a, 116517e

SIB44 110394c, 110397a, 111074c, 116517eSIB62 16319i, 110394c, 110397a, 111074c, 116517eSIB63 274171, 110394c, 110397a, 111074c, 116517e

1 Haplotypes SIB01–SIB34 were previously defined in Siberians (Torroni et al., 1993b), and SIB46–SIB59 were defined in the Chukchiand Siberian Eskimos (Starikovskaya et al., 1998) but are not presented here. Polymorphic restriction sites are numbered from thefirst nucleotide of the recognition sequence according to the published sequence (Anderson et al., 1981). The restriction enzymes whichdetected variation are designated by the following single-letter code: a, AluI; c, DdeI; e, HaeIII; f, HhaI; g, HinfI; h, HpaI; i, HpaII; j,MboI; l, TaqI; n, HaeII; o, HincII. Sites separated by a diagonal line indicate either simultaneous site gains or site losses for twodifferent enzymes or a site gain for one enzyme and a site loss for another because of a single common nucleotide substitution. Thesesites are considered to be only one restriction site polymorphism in the statistical analysis. All samples differ from the publishedsequence (Anderson et al., 1981) by the presence or absence of the following sites: 24769a, 17025a, 18858f, 213702e, 214199o,114268g, and 214368g. In addition, the mtDNAs of four persons with maternal Even ancestry and paternal Koryak ancestry wereanalyzed and were found to have SIB08 (1), SIB37 (2), and SIB43 (1) haplotypes. Similarly, the mtDNAs of two persons with maternaland paternal Chukchi ancestry were analyzed and were found to have SIB26 (1) and SIB45 (2) haplotypes. None of these samples wereincluded in the data for the Koryaks.

10 T.G. SCHURR ET AL.

The only mtDNAs held in common betweenKamchatkan and Native American popula-tions were SIB41 (AM01) from haplogroup Aand SIB26 (AM43) and SIB45 (AM32) fromhaplogroup C. All other haplotypes fromthese haplogroups were confined to eitherChukotkan/Kamchatkan or Amerindiangroups, indicating that they must haveevolved in those respective geographicregions. In addition, the few haplogroupD mtDNAs in Koryaks (SIB40) were dis-tantly related to those inAmerindian groups.Since the same haplotype appeared in theChukchi and Siberian Eskimos (Stari-kovskaya et al., 1998) and related typeshave been detected at very low frequenciesin the Koreans (Ballinger et al., 1992) andJapanese (Horai and Matsunaga, 1986),these mtDNAs apparently have a northeastAsian origin and distribution.

The majority of the Koryak (58%) andItel’men haplotypes (80%) did not belong tohaplogroups A, C, and D, and as such theycould technically be defined as ‘‘Other’’mtDNAs. However, we were able to furtherclassify these putative ‘‘Other’’ mtDNAs intothree additional clusters of related haplo-types through high resolution RFLP analy-sis (Table 1). In fact, this analysis clearlyshows that most if not all Asian mtDNAs canbe assigned to a haplogroup based on com-bined RFLP and CR sequence data, henceobviating the need for the category of the‘‘Other’’ haplotype altogether.

The first cluster of ‘‘Other’’ haplotypes wasdefined by the combined HaeII np 4830 andHhaI np 4831 site gains and the linked DdeInp 10394 and AluI np 10397 site gains(hereafter called the DdeI/AluI sites). Thiscluster had previously been observed in Ko-reans and designated Asian haplogroup K(Ballinger et al., 1992) but was subsequentlyobserved in Tibetans and renamed hap-logroup G (Torroni et al., 1994c).

The second cluster of ‘‘Other’’ haplotypeswas defined by the HaeIII np 8391 site lossand the MboI np 7933, DdeI np 10394, andHaeIII np 16517 site gains. This cluster hadnot previously been considered a haplogroupin eastern Siberians even though it encom-passed haplotypes SIB01–SIB07 (Torroni etal., 1993b) and had also been detected inKoreans as AS105 from macrohaplogroup M

in east Asians (Ballinger et al., 1992; Torroniet al., 1994c). It has now been renamedhaplogroup Y, following the revised nomen-clature begun in Torroni et al. (1993a,b) andcontinued in subsequent studies of mtDNAvariation in different world populations (Tor-roni et al., 1994c,d, 1996; Chen et al., 1995).

The remaining Koryak and Itel’men haplo-types (SIB43, SIB44, SIB62, SIB63) be-longed to a third cluster of mtDNAs. Ofthese haplotypes, SIB43 did not initiallyappear to belong to a well-defined hap-logroup. In having only the DdeI/AluI sitesand the HaeIII np 16517 site gain, SIB43appeared to be identical with Asian haplo-type AS118 (Ballinger et al., 1992) fromAsian macrohaplogroup M (Ballinger et al.,1992; Torroni et al., 1994c). HaplotypesSIB44, SIB62, and SIB63 had the threeRFLPs present in SIB43 but also showed anadditional variant, the DdeI np 11074 sitegain. This site gain is created by an A-to-Gtransition at np 11078 in the ND4 gene andconverts an isoleucine (ATT) to valine (GTT).Consequently, these haplotypes constituteda new mtDNA lineage, designated hap-logroup Z, with the relationship of SIB43 tothese haplotypes being somewhat ambigu-ous based on RFLP data alone.

With respect to the DdeI np 11074 sitegain itself, this polymorphism had not previ-ously been detected in any haplotypes fromother world populations except for AM83,which occurred in the South AmericanMakiritare (Torroni et al., 1993a). AM83 hadthe four RFLPs present in SIB44 but dif-fered from the latter by four additionalpolymorphisms, two of which (HinfI np 717site gain and linked HaeII np 1622 and HhaInp 1623 site gains) were unique to AM83,and two (RsaI np 16049 site loss and HaeIIInp 16517 site gain) being mutations whichhave arisen multiple times in different hap-lotypes from various world populations (Ball-inger et al., 1992; Torroni et al., 1993a,1994c, 1996; Chen et al., 1995; Stari-kovskaya et al., 1998). BecauseAM83 sharedthe RsaI np 16049 site loss with severalhaplogroup C mtDNAs in other AmazonianIndian tribes, it was suggested to be ahaplogroup C mtDNA that had lost thecharacteristic markers of haplogroup Cthrough a reversion mutation (Torroni et al.,

11MTDNA VARIATION IN KORYAKS AND ITEL’MEN

1993a). Subsequent phylogenetic analysisconfirmed this interpretation, as the CRsequence for AM83 clustered with otherhaplogroup C mtDNAs from Native Ameri-can populations (Schurr and Brown, unpub-lished). This result implied that SIB44 wasnot an ancestral form of AM83 which wasbrought across the Bering Strait by theancient colonizers of the New World butinstead that AM83 was a Native Americanhaplogroup C mtDNA which had lost thecharacteristic markers of this mtDNA lin-eage in situ.

Haplotype distribution in Kamchatkanpopulations

An analysis of the distribution of RFLPhaplotypes in the Koryaks and Itel’men pro-vided a number of insights into the geneticrelationships of the different Kamchatkanpopulations (Table 2). To begin with, thehaplotype composition among the three ter-ritorial subgroups of Koryaks was markedly

different. The Aluitor differed from the othertwo Koryak subgroups as well as from theItel’men in having haplotype SIB40 fromhaplogroup D. In addition, the Aluitor sub-group lacked haplogroup Z haplotypes andhad the highest frequency of haplogroup GmtDNAs. By contrast, SIB26 from hap-logroup C was more common in the Palansubgroup, and novel haplotypes which de-rived from it (SIB64–66) were observed solelyin this subgroup. A statistical assessment ofthis distribution1 showed that it was signifi-cantly different, whether considering hap-logroup or haplotype frequencies.

Interestingly, there were no significantdifferences in haplotypic composition be-tween persons who were self-identified aseither Maritime or Reindeer Koryak by an-cestry irrespective of their village of origin(Table 3). Based on these results, ReindeerKoryaks appeared to be genetically synony-mous with Maritime Koryaks, not a sepa-rate subgroup of this population, despitespeaking a different dialect and practicing adifferent subsistence strategy. While thesenonsignificant differences may reflect thefact that a certain proportion of the Koryakindividuals sampled were not completelycertain of their maternal ancestry in termsof Maritime or Reindeer Koryak ethnicity, itis more probable that they reveal the degreeto which sedentary and nomadic groups

1All chi-square analyses were carried out with various meth-ods available in StatXact 3 (CTEL Corporation, Cambridge, MA).Since most comparisons included three populations, chi-squareanalyses using Fisher’s exact test were conducted using MonteCarlo estimates of P values, with a 99% confidence interval.

TABLE 2. mtDNA haplotype distribution inKamchatkan populations1

Haplo-group

Haplo-type

Koryaks

Itel’menAluitor Karagin Palan Totals

A SIB41 2 0 2 4 0SIB42 1 1 2 4 3

C SIB26 5 7 24 36 7SIB29 0 1 0 1 0SIB45 4 9 0 13 0SIB64 0 0 1 1 0SIB65 0 0 4 4 0SIB66 0 0 1 1 0

D SIB40 2 0 0 2 0G SIB08 1 0 0 1 0

SIB35 2 1 0 3 3SIB36 0 1 0 1 0SIB37 34 4 16 54 26SIB38 1 1 0 2 0SIB39 0 3 0 3 0SIB60 0 0 0 0 3SIB61 0 1 0 1 0

Y SIB01 4 8 3 15 2Z SIB43 0 3 0 3 0

SIB44 0 4 2 6 1SIB62 0 0 0 0 1SIB63 0 0 0 0 1

Totals 56 44 55 155 471 The distribution of complete haplotypes among the threeKoryak subgroups was significantly different, whether con-sidering haplogroup (Fisher’s exact test: x2 5 42.61, P 5 0.0000,d.f. 5 10) or haplotype (Fisher’s exact test: x2 5 90.59,P 5 0.0000, d.f. 5 36) frequencies. In addition, the distributionof complete haplotypes between the Koryaks and Itel’men wasstatistically significant using both haplogroup (Fisher’s exacttest: x2 5 12.15, P 5 0.0328, d.f. 5 5) and haplotype (Fisher’sexact test: x2 5 46.62, P 5 0.0035, d.f. 5 21) frequencies.

TABLE 3. Complete haplotype distribution in Maritimeand Reindeer Koryaks1

Subgroup N

Haplogroup

A C D G Y Z

Maritime 89 5.6 31.5 1.1 43.8 7.9 10.1Reindeer 54 5.6 40.7 1.9 37.0 14.8 0.0‘‘Koryak’’ 12 0.0 50.0 0.0 50.0 0.0 0.0

Totals 155 5.2 36.1 1.3 41.9 9.7 5.81 Ethnicity was assigned to individuals based on self-identification asrecorded from interview data. The category ‘‘Koryak’’ representspersons who were not certain of their ethnicity in terms ofMaritime vs. Reindeer Koryak. This group was not comparedagainst the Maritime and Reindeer Koryak subgroups in thesechi-square tests. The assessment of subgroup differences did notshow statistically significant differences in haplogroup distributionsbetween the Maritime and Reindeers based on haplogroupfrequencies (Fisher’s exact test: x2 5 9.233, P 5 0.1001, d.f. 5 5).

12 T.G. SCHURR ET AL.

have become mixed in the past several centu-ries.

There were also noteworthy differences inthe haplogroup distribution in the Koryaksof the Kamchatka peninsula relative to thatof the Koryaks from northeastern Kam-chatka, who had haplogroup A and D fre-quencies comparable to Reindeer Chukchipopulations (Torroni et al., 1993b; Stari-kovskaya et al., 1998). The reason for thisdiscrepancy lies in the source of samples forthese populations. The previously analyzed‘‘Koryak’’ population consisted of individualsfrom Middle Pakhachi and Achayvayam vil-lages who were sampled as part of a study ofconventional genetic markers in Chukchipopulations (Sukernik et al., 1981, 1986).Extensive Chukchi admixture in these vil-lages, if not a total replacement of the resi-dent Koryaks resulting from prolongedChukchi-Koryak wars in the nineteenth cen-tury (Bogoras, 1910), probably accounts forthe difference in haplogroup composition ofthe Koryak subgroups. Consequently, theMiddle Pakhachi-Achayvayam subgroup,originally classified as Reindeer Koryaks byGurvich (1966), should instead be more prop-erly considered Reindeer Chukchi, as sug-gested by Bogoras (1910).

When we consider both Kamchatkangroups, the mtDNA distribution in the Kory-aks and Itel’men was also quite different(Table 2). While a number of haplotypeswere shared between these two Paleoasiatic-speaking groups (SIB01, SIB26, SIB35,SIB37, SIB42, SIB44), the majority of thesewere the founding, or nodal, haplotypes forhaplogroups C, G, Y, and Z (see Fig. 3).Otherwise, the Itel’men also lacked anyunique haplotypes (SIB64–66) from hap-logroup C and almost none from haplogroupG, whereas two unique haplogroup Z haplo-types (SIB62–63) occurred in this group.The extent of these genetic differences be-tween the Koryaks and Itel’men was statisti-cally significant whether considering hap-logroup or haplotype frequencies.

The same extent of divergence was ob-served when the mtDNA variation in allthree Paleoasiatic-speaking populations wasassessed (Table 4). Chi-square analysis ofthe Chukchi, Koryaks, and Itel’men mtDNA

distributions revealed statistically signifi-cant differences among them using eitherhaplogroup (Fisher’s exact test: x2 5 145.10,P 5 0.0000, d.f. 5 10) or haplotype (Fisher’sexact test: x2 5 228.2, P 5 0.0000, d.f. 5 58)frequencies, while similar values were ob-tained for pairwise comparisons of thesegroups (data not shown). If we assume thattheir languages are closely related, theseresults could indicate that Paleoasiatic-speaking groups have undergone significantgenetic differentiation since sharing a com-mon origin in northeastern Siberia. Alterna-tively, these differences could suggest theseparate origin and expansion of these popu-lations in this region, with their linguisticaffiliations reflecting the considerable lan-guage sharing which has taken place overthe past several millennia. In the case of theChukchi and the Koryaks, whose linguisticconnection is more strongly supported, itappears that the Chukchi have become ge-netically distinctive from the Koryaksthrough considerable gene flow with Sibe-rian Eskimos and perhaps other ethnic popu-lations this region, such as the Yukagirs andEvens. Additional data from these lattergroups will be necessary to clarify theseinterpretations.

Haplotype distribution in easternSiberians

The recent studies of RFLP variation inKamchatkan populations permitted abroader comparison of haplogroup distribu-tions in eastern Siberian groups (Table 5).Aside from the Koryaks and Itel’men, hap-logroup G mtDNAs were observed in theChukchi but were absent in the SiberianEskimos (Starikovskaya et al., in press).South of Kamchatka, haplogroup G mtD-NAs were detected in 5.3% of the Nivkhs(Torroni et al., 1993b), 23.1% of the Koreans(Ballinger et al., 1992), and 7.5% of theJapanese (Horai et al., 1984; Horai andMatsunaga 1986). Unfortunately, it was notpossible to accurately determine the fre-quency of haplogroup G mtDNAs in the Ainufrom published data, as the previous RFLPanalysis of Ainu mtDNAs (Harihara et al.,1988) did not use the enzymes which detectthe characteristic RFLPs of haplogroup G

13MTDNA VARIATION IN KORYAKS AND ITEL’MEN

(HaeII np 4830 and HhaI np 4831 sitegains).

By contrast, haplogroup Y mtDNAs wereabsent in most Siberian populations, includ-ing the Chukchi and Eskimos. However,these haplotypes represented the third mostfrequent haplogroup in the Koryaks (9.7%)and were present at polymorphic frequen-cies in the Itel’men (4.3%). In the lowerAmur River region, this haplogroup is com-mon in the Udegeys (8.9%) (Torroni et al.,1993b) and reaches its highest frequency inthe Nivkhs of northern Sakhalin (64.9%).Although similar mtDNAs have also been

observed at polymorphic frequencies in theKoreans (7.7%) (Ballinger et al., 1992), theirpresence in the Japanese is uncertain, asprevious RFLP analyses of Japanese mtD-NAs did not use enzymes which would de-tect all characteristic RFLP markers for thishaplogroup (Horai et al., 1984; Horai andMatsunaga, 1986). SIB01 was the most com-mon haplogroup Y haplotype in Siberianpopulations (Torroni et al., 1993b; this study),suggesting that it was the founding haplo-type for this mtDNA lineage. Since deriva-tive haplotypes have been seen only amongthe Nivkhs and Udegeys (Torroni et al.,

TABLE 4. MtDNA haplogroup distribution in Siberian and East Asian populations1

Languagegroup

Population n

Haplogroup frequencies (%)

ReferenceA B C D E F G Y Z

Other

I II III

Siberian YupikSiberian

Eskimos 79 77.2 — 2.5 20.3 — — — — — — — — 1Paleoasiatic

Chukchi 66 68.2 — 10.6 12.1 — — 9.1 — — — — — 1Koryaks 155 5.2 — 36.1 1.3 — — 41.9 9.7 5.8 — — — 2Itel’men 47 6.4 — 14.9 — — — 68.1 4.3 6.4 — — — 2

Isolated languageNivkhs 57 — — — 28.1 — — 5.3 64.9 — 1.8 — — 3

TungusicUdegeys 45 — — 17.8 — — — — 8.9 — 28.9 — 44.4 3Evenks 51 3.9 — 84.3 9.8 — 2.0 — — — — — — 3

Koreans 13 7.7 7.7 — 23.1 7.7 15.4 15.4 7.7 — — 7.7 7.7 4Taiwanese Han 20 10.0 20.0 5.0 5.0 — 5.0 — — — 15.0 15.0 15.0 4Tibetans 54 11.1 5.6 3.7 16.7 7.4 14.8 5.6 — — 1.9 — 33.3 51 ‘‘Other’’ haplotypes are those which do not belong to the haplogroups identified in this table but which may have different haplogroupaffiliations. The mutational composition of these ‘‘Other’’ haplotypes was as follows: I, DdeI np 10394, 2AluI np 10397, 6HaeIII np16517; II, 1DdeI np 10394, 6HaeIII np 16517; III, 1DdeI np 10394, 1AluI np 10397, 6HaeIII np 16517. The references cited in thetable are as follows: 1, Starikovskaya et al. (1998); 2, this study; 3, Torroni et al. (1993b); 4, Ballinger et al. (1992); 5, Torroni et al.(1994c). For the three East Asian populations, there are some discrepancies between this table and the haplogroup frequenciespublished in Torroni et al. (1994c). In Torroni et al. (1994c), haplogroups A, B, C, D, and F corresponded to haplotype groupings H, D* 1C, R, L, and A, respectively, from Ballinger et al. (1992), whereas haplogroups E and G were newly designated haplogroups. In beingdefined by the 2HhaI np 7598, 1DdeI np 10394, and 1AluI np 10397 polymorphisms, haplogroup E may be equivalent to haplotypegrouping G of Ballinger et al. (1992), while haplogroup G replaces haplotype grouping K from the same paper. In this table, haplotypegrouping C from Ballinger et al. (1992) is removed from haplogroup B, with the haplotypes belonging to this mtDNA lineage beingtallied in the Other II category, as a specific label has not yet been given to it. The reason for this separation is that the haplotypes fromthese two haplogroups are mutationally distinctive from each (Ballinger et al., 1992; Passarino et al., 1993) and clearly segregate intodistinct clusters in MP trees of Asian mtDNAs (Fig. 2 [Ballinger et al., 1992]). Since these Other II haplotypes occur in both theKoreans and Taiwanese Han, they have been excluded from the haplogroup B column for these populations. In addition, one of theKorean mtDNAs (AS105 [Ballinger et al., 1992]) placed in the ‘‘Other’’ category by Torroni et al. (1994c) was reclassified as belonging tohaplogroup Y since it was identical to SIB01; this change reduced the overall frequency of ‘‘Other’’ haplotypes in Koreans.Furthermore, the Korean haplotype (AS104 [Ballinger et al., 1992]) placed in haplogroup E by Torroni et al. (1994c) has both the HhaInp 7598 site loss from haplogroup E as well as the linked HaeII np 4830/1HhaI np 4831 site gains from haplogroup G. Hence, itsphylogenetic status is ambiguous, although suggesting some sort of association between these two haplogroups. Similarly, one of theTaiwanese Han haplotypes (AS61 [Ballinger et al., 1992]) that was placed in haplogroup B by Torroni et al. (1994c) had the linkedHincII np 12406/HpaI np 12406 site losses defining haplogroup F and the region V 9 bp deletion defining haplogroup B but lacked theHaeIII np 16517 site gain that is almost always present in haplotypes from the latter mtDNA lineage. Thus, its phylogenetic status isalso uncertain. Consequently, AS104 and AS105 will require further sequence analysis to determine whether or not they have beenplaced in the correct haplogroup category. As for the remaining haplotype groupings of Ballinger et al. (1992), most of these (B, G, I, J,O, P, Q, S, T) appear to represent legitimate haplogroups which are present in East and Southeast Asian populations and which arenow being reclassified due to these letter designations having been given to additional haplogroups that are present in other worldpopulations (e.g., Torroni et al., 1994d, 1996). In the case of haplotype groupings E and F from Ballinger et al. (1992), these did notoriginally represent a single haplogroup but instead those mtDNAs whose general mutational characteristics included the 1DdeI np10394 and 1AluI np 10397 polymorphisms. As such, they can be considered equivalent to Asian macrohaplogroup M, which is definedin the same way (Torroni et al., 1994c; Chen et al., 1995). Likewise, haplotype grouping D of Ballinger et al. (1992) represents a numberof mtDNAs which probably belong to different haplogroups, including those from haplogroup B.

14 T.G. SCHURR ET AL.

1993b), haplogroup Y appears to haveevolved in the lower Amur River region ofsoutheastern Siberia.

As for haplogroup Z, these haplotypesseemed to have a geographic focus in Kam-chatka. No comparable mtDNAs were seenin the Chukchi or Siberian Eskimos (Starik-ovskaya et al., in press), Siberian popula-tions south of Kamchatka (Torroni etal., 1993b), or east Asian populations (Ball-inger et al., 1992). The possible exceptionswere the Tibetans (Torroni et al., 1994c)or Bornean aborigines (Ballinger et al.,1992), amongst whom AS118 appeared tobe identical to SIB43, although additionalCR sequencing will be necessary to confirmtheir lineal association with haplogroup ZmtDNAs. However, haplogroup Z mtDNAswere detected in the Evens (Table 1 foot-note), suggesting that these mtDNAs mayhave originated in Tungusic-speaking popu-lations.

Genetic diversity of Siberian populations

The extensive mtDNA diversity amongeastern Siberian groups was further demon-

strated with various statistical analyses ofthe RFLP haplotype data. The unbiasedestimate of diversity (h) indicated that na-tive Siberians were genetically heteroge-neous while relatively similar in their over-all level of diversity, with the Evenks beingthe most diverse (0.888) and the Nivkhs(0.732) being the least diverse (Table 5, toppanel). These values are also generally re-flected in both the number of distinct haplo-types and the frequency of the most commonhaplotype (MCH) in each population, withthose having high frequencies of their MCHalso showing the lower diversity values.

These findings were further confirmedwith probability of identity (p) estimates.The p values for mtDNAs within popula-tions were five to 40 times higher (31.5–47.2%) than those of mtDNAs between popu-lations (0.76–7.75%), indicating a substantialdegree of population differentiation of theseaboriginal groups (Table 5, bottom panel).The level of differentiation was seen morestarkly when the probability of identity be-tween groups was estimated from only

TABLE 5. Genetic diversity and probability of identity estimates for Siberian populations1

Population N n

Mostcommon

haplotype

Genediversity(h 6 S.E.)

Probabilityof identity

withinpopulation

(%)

Averageprobabilityof identitybetween

population(%) Ratio

Evenks 51 16 21.6 0.888 6 0.001 47.2 3.62 13.04Udegeys 45 10 28.9 0.843 6 0.002 39.5 0.76 51.97Nivkhs 57 11 45.6 0.732 6 0.006 38.5 1.71 22.51Koryaks 155 19 34.8 0.807 6 0.000 41.8 7.75 4.86Itel’men 47 9 55.3 0.739 6 0.012 31.5 7.74 4.07Chukchi 66 11 39.4 0.781 6 0.004 40.6 4.46 9.10Eskimos 79 12 36.7 0.805 6 0.003 40.1 3.22 12.45

Evenks Udegeys Nivkhs Koryaks Itel’men Chukchi Eskimos

Evenks — 2 0 2 1 1 1Udegeys 2.36 — 1 1 1 0 0Nivkhs 0.00 2.01 — 2 1 0 0Koryaks 5.11 0.43 4.44 — 6 3 3Itel’men 3.22 0.19 1.96 23.50 — 1 1Chukchi 2.29 0.00 0.00 3.52 1.58 — 6Eskimos 0.28 0.00 0.00 1.00 0.19 19.22 —1 In the top panel, unbiased estimates of diversity and probability of identity estimates for eight native Siberian populations analyzedwith the high resolution RFLP method are shown. N, the total number of individuals analyzed per population; n, the number ofhaplotypes observed in each population. The average probability of identity between populations is estimated as the averageprobability of identity between each population and the other seven populations. The ratio is the probability of identity withinpopulations/probability of identity between populations. The data for the Evenks, Udegeys, and Nivkhs were taken from Torroni et al.(1993b), and those for the Chukchi and Siberian Eskimos were taken from Starikovskaya et al. (1998). In the bottom panel, theprobability of identity estimates based on shared haplotypes between populations is shown. The numbers above the diagonal indicatethe number of mtDNA haplotypes shared between populations, while those below the diagonal indicate the percent similarity of thepopulations. The population abbreviations are the same as those used in the Appendix.

15MTDNA VARIATION IN KORYAKS AND ITEL’MEN

shared haplotypes. These estimates showedvery small levels of between-group similar-ity (0.00–5.11%), with at most only threemtDNA haplotypes being shared betweenany two populations, except the Chukchiand Siberian Eskimos and the Koryaks andItel’men, who both shared six different hap-lotypes.

Interestingly, both the Koryak popula-tions and the Itel’men shared one to threehaplotypes with the five other Siberian eth-nic groups (Chukchi, Eskimos, Udegeys,Nivkhs, Evenks) analyzed. These apparentgenetic affinities with other Siberian groupsare also shown in the ratio of within-groupto between-group identity values, which arelowest for the Koryaks and Itel’men. How-ever, the majority of the mtDNAs sharedbetween these populations were the puta-tive founding haplotypes for haplogroups A,C, G, Y, and Z rather than more recentlyderived ones, which are usually population-or region-specific. Hence, while showing ge-netic affinities with both the Evenks andAmur River populations, the Koryaks andItel’men may be somewhat more geneticallydistant from the other groups than impliedby these estimates.

The maximum likelihood (ML) method ofNei and Tajima (1983) showed similar levelsof diversity for eastern Siberian popula-tions. Both Koryaks (0.113%) and Itel’men(0.108%) had intrapopulational divergencevalues which were comparable to those ob-tained for the other native Siberian popula-tions analyzed by similar methods (Table 6).The lowest value for these groups was seenin the Nivkhs (0.104%), who had a predomi-nance of two haplotypes (SIB01, SIB10),while the highest occurred in the Udegeys

(0.153%), who exhibited a set of uniquehaplotypes (SIB21–SIB25) relative to thoseof the other groups (Torroni et al., 1993b).Furthermore, the close genetic affinities ofthe Koryaks and Itel’men were shown bytheir extremely small corrected interpopula-tional value, one approached only by theChukchi-Siberian Eskimo value.

The ML values for Siberian populationwere also found to be higher than compa-rable estimates for most Native Americanpopulations (Torroni et al., 1994a). Thisdifference could imply a greater antiquity ofeastern Siberian populations relative to theirNew World counterparts. However, they mayalso reflect the recent hybridization of bothinterior and Pacific Siberian populationsand the concomitant acquisition of mtDNAlineages from more southerly ethnic groups.These genetic influences can be seen by thepresence in the Evenks of SIB20, a mtDNAhaplotype belonging to haplogroup F (Tor-roni et al., 1993b), which commonly occursin Tibetan (Torroni et al., 1994c) and south-east Asian (Ballinger et al., 1992) popula-tions.

Phylogenetic analysis of Siberianhaplotypes

The results of the MP analysis of Koryak,Itel’men, and other native Siberian RFLPhaplotypes recapitulate the trends seen inthe statistical analyses of these data (Fig. 3).With few exceptions, the shared haplotypesamongst these groups were the nodal, orputative founding, haplotypes for each hap-logroup. The remaining shared haplotypeswere usually common to geographicallyproximate populations, such as the Koryaksand Itel’men (e.g., SIB35 from haplogroup

TABLE 6. Genetic divergence estimates for native Siberian populations1

Eskimos Chukchi Koryaks Itel’men Nivkhs Udegeys Evenks

Eskimos 0.1105 0.1133 0.1571 0.1626 0.1493 0.1559 0.1721Chukchi 0.0063 0.1128 0.1456 0.1534 0.1449 0.1534 0.1678Koryaks 0.0453 0.0326 0.1132 0.1139 0.1342 0.1417 0.1506Itel’men 0.0530 0.0434 0.0030 0.1087 0.1444 0.1589 0.1635Nivkhs 0.0419 0.0363 0.0254 0.0379 0.1044 0.1457 0.1576Udegeys 0.0244 0.0208 0.0089 0.0283 0.0173 0.1525 0.1606Evenks 0.0555 0.0500 0.0328 0.0478 0.0440 0.0230 0.1228

1 Genetic distance estimates for native Siberian populations analyzed with the high resolution RFLP method using the maximumlikelihood procedure (Nei and Tajima, 1983). The numbers (in italics) along the diagonal indicate intrapopulational divergences, thenumbers above the diagonal indicate interpopulational divergences and the numbers below the diagonal indicates interpopulationaldivergences corrected for intrapopulational divergences.

16 T.G. SCHURR ET AL.

Fig. 3. An MP tree of Koryak, Itel’men, and othereastern Siberian RFLP haplotypes. The TBR tree is 119steps in length, has a Consensus index (C.I.) of 0.765and an retention index (R.I.) of 0.852, and representsone of 120 MP trees that were generated by the heuristicsearch irrespective of the number of MAXTREES speci-fied. The mtDNA haplogroups observed in native Sibe-rian populations are indicated by the black capitalletters in shaded boxes, while haplotypes appearing ineach population are identified by ellipses specified in the

symbol key. For haplotypes shared between two popula-tions, the appropriate ellipses for each population havebeen positioned behind the haplotype number; thoseappearing in more than two populations are indicated asshared (see Table 2). the African haplotypes used asoutgroups to root this tree included AF71 (Chen et al.,1995) and HYPANC and Type-5 (Cann et al., 1987). Thenumbers located under the major branches of the MPtree represent the percent support for each branchobserved in the 50% majority rule consensus tree.

17MTDNA VARIATION IN KORYAKS AND ITEL’MEN

G), or those with known linguistic affilia-tions, such as the Evenks and Udegeys (e.g.,SIB27). In addition, all haplogroup brancheswere very highly supported in the 50% major-ity rule consensus tree, the weakest beinghaplogroup C. The same pattern was ob-served when Siberian haplotypes were ana-lyzed with those from other Asian popula-tions (Koreans, Taiwanese Han, Tibetans)(Ballinger et al., 1992; Torroni et al., 1994c)(results not shown), with the most divergenthaplotypes from each haplogroup beinglargely population-specific and located atthe terminal positions of the branches, andpopulation-specific clusters occurring withinsome of these haplogroups (e.g., Koryaksand haplogroup G, Tibetans and haplogroupsD and G).

When the ML estimates were used toconstruct NJ trees, the Paleoasiatic-speak-ing groups were split into two separatebranches, one representing Chukotka andthe other Kamchatka (Fig. 4). In addition,the Nivkhs branched off close to the Kam-chatkan groups, while the Udegeys andEvenks formed a separate branch betweenthe Nivkhs and Chukotkan groups. Nearlyidentical populational associations were seenin the NJ tree based on genetic distancesestimated from haplogroup frequencies inSiberian and Asian populations (Fig. 5),with Koreans and Nivkhs clustering to-gether between the Kamchatkan andChukotkan populations. These associationsamong Siberian groups were concordant withthe trends seen in the gene diversity andprobability by identity estimates for thesame groups (Tables 5, 6).

The centrality of the Koreans, Tibetans,and Taiwanese Han in Figure 5 results fromeach of these populations having at leastfour of the main haplogroups (A, C, D, F, G,Y, Z) which collectively appear among thepopulations shown in this tree. As such, thisdistribution reveals the genetic influences ofboth northern and southern Asian groups onthese populations. On the other hand, thelocation of the Udegeys in this tree is prob-ably due to their having a high frequency of‘‘Other’’ haplotypes bearing the DdeI/AluIsites, which the Tibetans and TaiwaneseHan also possess (Table 4), although theUdegey haplotypes are distinctive from those

present in the other two populations. Like-wise, the position of the Evenks likely re-flects their sharing haplogroup C mtDNAswith both Siberian and Asian groups andhaving haplogroup F mtDNAs in commonwith central-east Asian groups.

CR sequence variation in Kamchatkanpopulations

The sequencing of the CR of Koryak andItel’men mtDNAs provided a much moredetailed picture of genetic variation inPaleoasiatic-speaking populations. Overall,this analysis revealed a total of 53 differentCR sequences, as defined by 54 variablenucleotide positions, in Kamchatkan popula-tions, most of which had not previously beenobserved (Table 7). The greatest sequencediversity was observed among haplogroupC and G mtDNAs, while a number of uniqueCR sequences were associated with hap-logroups A, D, Y, and Z. In addition, aconsiderable degree of substructure withinindividual haplogroups was detected, allow-ing a finer discrimination of lineal associa-tions between populations occupying north-east Siberia and the New World.

Haplogroup A. The CR sequences fromKoryak haplogroup A mtDNAs which be-longed to SIB41 had the np 16223T, np16290T, np 16319A, and np 16362C muta-tions which define the sequence motif forthis haplogroup in Asia and the Americas. Inaddition, these CR sequences had the np16111T mutation. This polymorphism hasbeen observed in nearly all haplogroup AmtDNAs of the Chukchi, Eskimos, Na-DeneIndians, and Amerindians (Torroni et al.,1993a; Forster et al., 1996; Starikovskaya etal., 1998) but is absent in those from Asianpopulations (Torroni et al., 1993b; Kolmanet al., 1996). This pattern indicated that thenp 16111T mutation arose in the earliestinhabitants of Beringia, who later gave riseto the ancestral Native American popula-tion(s) in which haplogroup A evolved andthat the Koryaks have retained some hap-logroup A mtDNAs which are related toother Chukotkan and New World popula-tions.

The np 16192T transition was also ob-served in the Koryak haplogroup A CR se-

18 T.G. SCHURR ET AL.

quences which exhibited the 16111T muta-tion. This mutation had previously beenobserved in similar mtDNAs from the Chuk-chi, Siberian and North American Eskimos,Canadian Dogrib, and Navajo (Shields et al.,1993; Torroni et al., 1993b; Starikovskaya etal., 1998) and represents a major sublineagethat is largely confined to the north PacificRim. The distribution of the 16192T muta-tion implies that it arose among the mostrecent common ancestors of the Chukchi,Eskimo-Aleuts, and Na-Dene Indians exclu-sive of the progenitors ofAmerindian popula-tions. Thus, the presence in the Koryaks ofmtDNAs with this mutation likely reflectsthe preservation of ancient Beringian haplo-types in Kamchatkan groups.

In addition, both the Itel’men and Kory-aks had haplogroup A mtDNAs which lackedboth the 16111T and the 16192T polymor-phisms as well as the 16362C mutationtypically seen in haplogroup A mtDNAs inthe New World. All of these were associatedwith haplotype SIB42, the only haplogroupA haplotypes appearing in the Itel’men.While it is possible that one or both of theBeringian mutations were lost from theseCR sequences, they probably representmtDNAs from haplogroup A which neverexperienced these nucleotide substitutions,as is the case for all similar Asian haplo-types (Torroni et al., 1993b; Kolman et al.,1996). The latter explanation is supported

by the presence in these mtDNAs of the np16242T mutation, which is not seen in com-parable haplotypes from other circumpolaror New World populations (Ward et al.,1991, 1993; Shields et al., 1993; Torroni etal., 1993a).

Haplogroup C. Two distinct clusters ofhaplogroup C CR sequences were observedin both Kamchatkan populations (Table 8).The first encompassed CR sequences #05–12and exhibited all of the polymorphisms whichdefine haplogroup C mtDNAs in both Sibe-rian and Native American populations, in-cluding the np 16223T, np 16298C, and np16327T transitions (Torroni et al., 1993a,b).In addition, the majority of the CR se-quences belonging to this cluster (#07–09)also exhibited the np 16124C transition, andmany also had the np 16318T transversion,both of which do not appear in haplogroup C

Fig. 4. An NJ tree based on ML estimates for Sibe-rian populations analyzed by high resolution analysis(see Table 6).

Fig. 5. An NJ tree based on genetic distances esti-mated with the Reynolds et al. (1983) method inPHYLIP (Felsenstein, 1994); the frequencies of thehaplogroup ‘‘alleles’’ used to estimate the genetic dis-tances for these populations are shown in Table 4.

19MTDNA VARIATION IN KORYAKS AND ITEL’MEN

TABLE 7. HVS-I sequences in Koryaks and Itel’men

Haplo-group

Haplo-type

HVS-Isequencenumber

Polymorphic nucleotides

000001111111111122222222222222222222233333333333335555113691122277889901223446666788999999901111122456661122788931946923592778341230156378012346840168978222673957

CAM TTACTCATTGTCCTCCACCTTCTCCACGCTCCCACCTTGTAAGCCTTTCATTACA SIB41 01 -----T--------T---T-----------T-----------A----C------

SIB41 02 -----T--------T---T-----------T--------C--A---CC------SIB42 03 ------------------T--T--------T-----------A-----------SIB42 04 ----C-------------T--T--------T-----------A-----------

C SIB29 05 ---------A--------T-----------------C------T-------C--SIB26 06 ------------------T-----------------C------T-------C--SIB26 07 -------C----------T-----------------C----T-T-------C--SIB64 07 -------C----------T-----------------C----T-T-------C--SIB45 08 -------C----------T-----------------C----T-T----------SIB26 09 -------C----------T-----------------C----T-T------CC--SIB26 10 -------C----------------------------C------T-------C--SIB26 11 -------C----------------------------C------T-----G-C--SIB26 12 -------C----------T-----------------C------T-------C--SIB26 13 ----C--------C----T-----T----C------C--------------C--SIB65 13 ----C--------C----T-----T----C------C--------------C--SIB66 13 ----C--------C----T-----T----C------C--------------C--SIB26 14 -C--C--------C----T-----T----C------C--------------C--SIB26 15 ----C--------C----T-----T-T--C------C--------------C--SIB26 16 ----C--------C----------------------C--------------C--SIB26 17 ----C--------C----T----TT----C------C--------------C--SIB26 18 ----C--------C-T--T-----T----C------C--------------C--SIB26 19 -------------C----T-----T----C---------C-----------C--SIB26 20 ------------------T-----T----C------C--------------C--SIB26 21 ----C--------C----T----------C------C--------------C--SIB26 22 ----C--------C----T-----T----C---------------------C--SIB26 23 -------------C----T-----T-T--C---------------------C--

D SIB40 24 ----C------T------T-----------------------A----C------G SIB08 25 C--------A--------T--------------------------------C--

SIB37 25 C--------A--------T--------------------------------C--SIB37 26 C---C----A--------T--------------------------------C--SIB39 26 C---C----A--------T--------------------------------C--SIB37 27 C---C----AC-------T------G-------------------------C--SIB37 28 C--------AC-------T------G-------------------------C--SIB37 29 C---C----A--------T------------T-------------------C--SIB37 30 C---C----A--------T--------------------------------C--SIB35 31 C---C----A--------T--------------------C-----------C--SIB61 32 C---C----A--------T--------------------C--------------SIB35 33 C--------A--------T--------------------C-----------C--SIB38 34 C--------A--------T--------------------C--------------SIB60 35 C--TC----A--------T--------------------------------C--SIB37 36 C---C-------------T--------------------------C-----C--SIB37 37 C--------A--------T--------------C-----------------C--SIB37 38 C---C----A--------T--------------C-----------------C--SIB37 39 C--------A--------T--------------------------------C-TSIB37 40 C-----G--A--------T--------------------------------C--SIB37 41 C--------A--------T--------A-----------------------C--SIB37 42 C---C----A--------T--------------------------------CG-SIB37 43 C---C----A------G-T--------------------------------C--SIB36 44 C---C-----------G-T--------------------------------C--SIB37 44 C---C-----------G-T--------------------------------C--SIB37 45 C---C-----------G-T-------------T------------------C--SIB37 46 C--------A--------T----------------------------C---C--

Y SIB01 47 --------C----C------C-----T------------------------C--SIB01 48 --------C----C----T-C-----T------------------------C--SIB01 49 --------C----C------C-----T-T-----------G---T------C--SIB01 50 --------C----C------C-----T-T---------C-G---T------C--SIB01 51 --------C----C------C---T-T-----------------T------C--

Z SIB43 52 ---------A--T-----TC---T------------C--------------C--SIB44 52 ---------A--T-----TC---T------------C--------------C--SIB62 53 ---------A--TC----TC---T------------C--------------C--SIB63 53 ---------A--TC----TC---T------------C--------------C--

20 T.G. SCHURR ET AL.

mtDNAs from other Siberian or NativeAmerican tribes (Ward et al., 1991, 1993;Shields et al., 1993; Torroni et al., 1993a,b).Similar CR sequences were seen in somehaplogroup C mtDNAs from the Chukchi

and Siberian Eskimos (Starikovskaya et al.,1998), implying that these mtDNAs arecommon to all Paleoasiatic-speaking groupsand perhaps also Eskimoan populations, inwhich haplogroup C occurs at low frequen-

TABLE 8. Distribution of HVS-I sequences in Kamchatkan populations1

HaplogroupCR

sequence

Koryaks

Itel’menAluitor Karagin Palan Total

A 01 0 0 2 2 002 2 0 0 2 003 1 1 2 4 004 0 0 0 0 3

C 05 0 1 0 1 006 0 0 6 6 007 4 1 4 9 008 4 8 0 12 009 0 1 0 1 010 0 0 2 2 011 1 0 0 1 012 0 0 1 1 013 0 0 12 12 314 0 1 0 1 015 0 0 0 0 216 0 0 1 1 017 0 1 0 1 018 0 0 1 1 019 0 0 0 0 120 0 1 0 1 021 0 1 0 1 0

C 22 0 0 1 1 023 0 0 1 1 0

D 24 2 0 0 2 0G 25 21 0 2 23 2

26 4 2 2 8 327 0 0 2 2 028 0 0 4 4 029 0 0 0 0 430 0 0 0 0 331 2 1 0 3 132 0 1 0 1 033 0 0 0 0 234 1 0 0 1 035 0 0 0 0 336 1 0 0 1 037 0 1 0 1 038 0 1 0 1 039 1 2 0 3 040 2 0 0 2 041 1 0 0 1 042 0 0 2 2 043 0 0 0 0 3

G 44 4 1 4 9 1045 0 0 0 0 146 1 0 0 1 0

Y 47 4 3 3 10 148 0 0 0 0 149 0 1 0 1 050 0 1 0 1 051 0 1 0 1 052 0 6 2 8 153 0 0 0 0 2

Total 56 37 54 147 461 Certain samples did not sequence well and thus were not included in the totals per population and/or ethnic subgroup; these includedseven from the Karagin Koryaks and one from the Palan Koryaks as well as one from the Itel’men. These omissions are the source ofthe discrepancy in the sample sizes between this table and Table 2.

21MTDNA VARIATION IN KORYAKS AND ITEL’MEN

cies (Merriwether et al. 1994; Starikovskayaet al., in press).

The first cluster was further associatedwith several different RFLP haplotypes, in-cluding SIB26, SIB29, SIB45, and SIB64.Because these SIB45 haplotypes differedfrom equivalent haplotypes in Native Ameri-can populations (AM32) (Torroni et al., 1992,1993a,b) by having the 16124C and 16318Tmutations, they appear to have arisen inKamchatkan groups independent of puta-tively identical haplotypes in New Worldgroups, with both sets of SIB45 mtDNAsbecoming differentiated from SIB26 throughthe loss of the HaeIII np 16517 site, a knownhypermutable restriction site (Ballinger etal., 1992; Torroni et al., 1994c, 1996; Chen etal., 1995). Consequently, Siberian SIB45mtDNAs cannot be considered founding hap-logroup C haplotypes for Native Americanpopulations.

The second cluster encompassed CR se-quences #13–23 but differed from the first byhaving the np 16093C, np 16189C, np16261T, and np 16288C transitions and bylacking the 16318T and 16327T mutations.As seen in other Asian and Native AmericanmtDNAs, the 16189C transition creates ahomopolymeric stretch of Cs within a 14 bphypervariable domain (np 16180–16193),which typically results in the insertion of anadditional one or more Cs (Horai and Hay-asaka, 1990; Horai et al., 1993). CR se-quences from this cluster also occurred inthe Chukchi but not in the Siberian Eskimos(Starikovskaya et al., 1998) or distantlyrelated Asian and Native American groupson both sides of the Bering Strait (Shields etal., 1993; Ward et al., 1993; Torroni et al.,1993a,b), and thus appeared to be unique forPaleoasiatic-speaking populations. This clus-ter was also associated with the foundinghaplotype, SIB26, as well as the uniquehaplotypes SIB65 and SIB66. The presenceof at least two distinct CR clusters withinhaplogroup C and the association of both ofthem with the founder haplotype, SIB26,implied a considerable degree of divergenceof this mtDNA lineage in Asia. Furthermore,since CR sequences from the first and thesecond clusters differed on average by five orsix mutations, they could represent multipleexpansions of haplotypes from the same

mtDNA lineage in northeastern Siberiangroups at different times.

Haplogroup D. Only one type of CRsequence from haplogroup D was observedin the Koryaks, and it was linked withhaplotype SIB40. This CR sequence (#24)was defined by five different sequence poly-morphisms, most of which appeared in oneor more of the other haplogroups presentin the Kanchatkan populations (16093C,16223T, 16319A, and 16362C transitions),and had one distinguishing polymorphism,the np 16173T transition. The same CRsequence also appeared in four SiberianEskimos and Chukchi (Starikovskaya et al.,1998) and had previously been observed inAlaskan Inupik Eskimos (CR lineage #58)(Shields et al., 1993). However, it has notbeen found in other Siberian and NativeAmerican populations (Shields et al., 1993;Torroni et al., 1993a).

Haplogroup G. Haplogroup G showedthe greatest diversity of CR sequences rela-tive to the other haplogroups. In general, theCR sequences for haplogroup G were definedby a set of four different sequence polymor-phisms, including the np 16017C, 16093C,np 16129A, and 16223T transitions. Alongwith the linked HaeII np 4830 and HhaI np4831 site gains, the 16017C transition clearlydistinguished haplotypes from this hap-logroup from those of other Siberian orAsian mtDNA lineages. Similar CR se-quences were also seen in the Chukchi, whoexhibited a limited number of haplogroup GmtDNAs (Starikovskaya et al., 1998). Thesedata indicated that haplogroup G mtDNAswere part of the genetic makeup of ancientPaleoasiatic-speaking populations and thatthis mtDNA lineage has undergone a consid-erable degree of genetic diversification sincebeing brought to northeastern Siberia.

Haplogroup Y. The CR sequences for thishaplogroup exhibited a set of nucleotidepolymorphisms which were unique to thismtDNA lineage, including the np 16126C,16189C, np 16231C, np 16266T, and 16519Ctransitions. One cluster within this hap-logroup was defined by only these muta-tions, while another also had the np 16287T,16316G, and 16328T mutations which do

22 T.G. SCHURR ET AL.

not appear in the other Siberian hap-logroups. Since both of these sublineagesappeared in SIB01 haplotypes from theKoryaks and Itel’men, the founding haplotypefor this haplogroup, they were probably part ofthe ancestral pool of Kamchatkan groups.

Haplogroup Z. The CR sequences forhaplogroup Z also exhibited a set of nucleo-tide polymorphisms unique to this mtDNAlineage, including the 16129A, 16185T,16223T, 16224C, 16260T, 16298C, and16519C mutations. In addition, SIB62 andSIB63 had the 16189C polymorphism whichin this case did not create a homopolymericstretch of Cs. More importantly, CR se-quences with these mutations were seen inboth haplotypes SIB43 and SIB44. Thisfinding confirmed that SIB43 belonged tohaplogroup Z and suggested that it had lostthe DdeI np 11074 site gain seen in the otherhaplotypes from this mtDNA lineage. Be-cause haplogroup Z mtDNAs appeared inboth the Koryaks and Itel’men, they toowere likely part of the ancestral pool ofKamchatkan groups.

Distribution of CR sequences in Koryaksand Itel’men

As can be seen in Table 8, a number of CRsequences from haplogroups A, C, G, Y, andZ were shared amongst all Koryak sub-groups (#03, #07, #26, #44, #47). Their ubiq-uity amongst the Koryaks and high fre-quency relative to other CR sequencessuggested that they represented the found-ing mtDNAs for this Paleoasiatic-speakinggroup. The high frequency of CR sequence#25 in the Aluitor Koryaks (37.5%) sug-gested its origin in this subgroup and itsspread to the Palan Koryaks and Itel’menthrough gene flow, with the Karagin Kory-aks possibly missing this CR sequence dueto drift effects. In addition, the Aluitor andKaragin Koryaks were found to share CRsequences #08, #31, and #39, a result whichsupported their known close linguistic asso-ciation. On the other hand, the Karagin andPalan Koryaks shared CR sequence #52,suggesting some differences between themand the Aluitor Koryaks. Aside from thesecommon or shared types, the majority of CRsequences occurred in one Koryak subgroup

or another, with many showing village speci-ficity (e.g., within the Palan subgroup, #06and #13 were detected only in persons bornin Voyampolka).

A more restricted number of CR sequencesfrom haplogroups G, Y, and Z were sharedbetween the Koryaks and Itel’men. BothPaleoasiatic groups had CR sequences #25,#26, #31, and #44 from haplogroup G, aswell as #47 from haplogroup Y and #52 fromhaplogroup Z. Because of their prevalence ineach population, these sequences are likelythe founding types for their respective hap-logroups. In addition, the np 16207G muta-tion defined a set of haplogroup G mtDNAs(#43–45) present in both the Koryaks andItel’men. Its high frequency in the Itel’men(30.4%) relative to the Koryaks (6.1%) sug-gested that this set may have originated inthis population and spread to the Koryaksthrough gene flow. The only other shared CRsequence (#13) occurred in the Palan Kory-aks and Itel’men, a distribution which mightreflect the southward expansion of ReindeerKoryak groups into traditional Itel’men ter-ritory (Jochelson, 1908). Otherwise, all re-maining CR sequences were population-specific for either of the two Kamchatkangroups.

Another intriguing finding was the nonuni-form distribution of the CR clusters fromhaplogroup C in the Koryaks and Itel’men.The cluster defined by the 16124C-16223T-16298C-16318T-16327T motif (#05–12) wasdistributed across all three Koryak sub-groups at more or less the same frequencybut was completely absent from the Itel’men.In contrast, the cluster defined by the16093C-16189C-16261T-16288C motif (#13–23) was present in only the Karagin andPalan Koryaks as well as amongst theItel’men. Such a pattern suggested thatmtDNAs from the first cluster were broughtto the Kamchatka peninsula by ancestralKoryak groups, whereas those from the sec-ond may already have been present in theItel’men and subsequently acquired by thesouthernmost Koryak groups through inter-marriage with Itel’men populations.

When CR sequence diversity in all Paleoa-siatic-speaking groups was examined, strik-ing differences between Chukotkan andKamchatkan populations were observed

23MTDNA VARIATION IN KORYAKS AND ITEL’MEN

(Table 9).As a whole, the Koryaks and Itel’menshowed a greater frequency of unique typesthan did the Chukchi and Siberian Eskimos.The latter population had a dramaticallylower frequency of these unique types due tosharing ten different CR sequences with theChukchi, including ones identical to #01,#07 and #24 from this study (Starikovskayaet al., 1998). Interestingly, nearly all of theseCR sequences were the same set sharedbetween the two Kamchatkan populations,with only #01 from haplogroup A and #24from haplogroup D being more highly fre-quent in Chukotkan groups. Although theirstrong genetic similarity to Siberian Eski-mos suggested that the Chukchi might haveacquired haplogroup C and G mtDNAs fromKoryak subgroups through recent gene flow,the fact that all Chukchi subdivisions ana-lyzed for mtDNA variation possessed haplo-types from both haplogroups (Torroni et al.,1993b; Starikovskaya et al., 1998; this study;Schurr et al., unpublished data) argues thatthese mtDNAs were part of the ancestralgene pool for all Paleoasiatic-speakinggroups.

Phylogenetic analysis of Siberian CRsequences

When these CR sequences were subjectedto phylogenetic analysis, all six of the majorhaplogroups present in northeastern Siberi-ans formed distinct branches in the result-ing NJ tree (Fig. 6) irrespective of whetheror not African mtDNAs were used as out-groups. The same overall pattern was ob-tained through parsimony analysis withDNAPARS (results not shown). In addition,the substructure within haplogroups A, C,

and D noted in the CR sequence data wasclearly revealed in this phylogeny. Withinhaplogroup A, three distinct clusters or sub-lineages were observed. The first two sublin-eages (I and II) possessed the np 16111Tmutation, with the first (I) also having thenp 16192T mutation and the second (II)having the np 16265G mutation which arosein Eskimoan populations, whereas the thirdsublineage (III) lacked the 16111T mutationaltogether. Sublineage III sequences ap-peared in haplogroup A mtDNAs from eastAsian populations and the Evenks and prob-ably represent the ancestral state for thismtDNA lineage. Among Paleoasiatic speak-ers, the Koryaks and Itel’men had hap-logroup A CR sequences from both sublin-eages I and III, whereas those of the Chukchibelonged to sublineages I and II.

The previously identified CR sequencesublineages in haplogroup C were also ob-

Fig. 6. An NJ tree of CR sequences from northeastSiberian and east Asian populations constructed fromgenetic distances estimated with the Kimura two-parameter model in DNADIST (Felsenstein, 1994). TheCR sequences occurring in the Koryaks and Itel’menwere indicated as Kamchatkan (KAM) samples, andthose occurring in the Chukchi and Siberian Eskimos(Starikovskaya et al., submitted) were indicated asChukotkan (CHU) haplotypes, with the numbers of eachcorresponding to the CR sequence numbers in Table 8from this study and those described in Starikovskaya etal. (1998). Otherwise, the CR sequences occurring innative Siberians or east Asians were indicated as the CRsequences enumerated in Table 5 of Torroni et al.(1993b). AF62 is the African CR sequence used as anoutgroup in this tree, although the same overall branch-ing structure was maintained when no outgroups wereused. The Roman numerals specify distinct clusters orsublineages of the particular haplogroup in which theyoccur, while all haplogroups are indicated by boxedcapital letters.

TABLE 9. CR sequence diversity in northeast Siberian populations1

Population N n

Numberof uniquetypes (%

individuals)

Genediversity(h 6 S.E.)

Probabilityof identity

withinpopulations

(%)

HVS-I sequences sharedbetween populations

A#01

C#07

C#13

D#24

G#25

G#26

Eskimos 77 12 3 (3.9) 0.819 6 0.008 22.6 37 1 — 4 — —Chukchi 65 19 23 (35.4) 0.883 6 0.001 42.8 19 1 4 1 2 2Koryaks 147 41 70 (47.6) 0.945 6 0.000 36.6 2 9 12 2 23 8Itel’men 46 19 26 (56.5) 0.931 6 0.000 43.1 — — 3 — 2 31 N, number of mtDNAs subjected to CR sequencing; n, number of distinct sequences observed in each population. The KamchatkanHVS-1 sequences enumerated in the table are equivalent to those present in Chukotkan populations in the following way: KAM01 5CHU01, KAM07 5 CHU17, KAM13 5 CHU19, KAM24 5 CHU13, KAM25 5 CHU20, and KAM26 5 CHU21 (KAM, Kamchatka;CHU, Chukotka) (Starikovskaya et al., 1998; this study). The number of mtDNAs with each CR sequence is specified by population.The letter above each CR sequence number indicates the haplogroup to which each one belongs.

24 T.G. SCHURR ET AL.

Fig. 6.

25MTDNA VARIATION IN KORYAKS AND ITEL’MEN

served in this NJ tree. Two of them moreclosely resembled the haplogroup C mtDNAspresent in Native American populations byhaving the 16223T-16298C-16327T motifcharacteristic of this mtDNA lineage. Theone defined only by these mutations (I)occurred predominantly in east Asian andAmur River populations as well as theEvenks, and the other (II), having the16124C and 16318T mutations, occurredonly in Paleoasiatic groups. The third sublin-eage (III) had the 16093C-16189C-16261T-16288C-16298C motif and appeared only innorthern Paleoasiatic-speaking groups.

Similarly, haplogroup D had three sublin-eages within it. The first (I), representingSIB40, occurred at very low frequenciesamong Paleoasiatic groups and Siberian Es-kimos and was the only haplogroup DmtDNA in the Koryaks. The second sublin-eage (II) occurred exclusively among Chu-kotkan populations, as no similar types wereseen among Native American groups withhigh frequencies of haplogroup D (Ward etal., 1991, 1993; Shields et al., 1993). Theremaining mtDNAs from east Asian popula-tions, the Evenks, and the Nivkhs formed asublineage (III) which had a sequence motifmost similar to haplogroup D haplotypes inNative American populations (16223T–16362C) and thus probably represented theancestral state for this mtDNA lineage inAsia and the Americas.

Comprised of only Koryak, Itel’men, andChukchi mtDNAs, haplogroup G also showedsome degree of substructure. However, itsbranches were not quite as clearly defined asthe sublineages in haplogroups A, C, and D.Instead, it exhibited small clusters of re-lated CR sequences which appeared mostlyin either the Chukchi, Koryaks, or Itel’men,which may reflect its relatively more recentorigin in eastern Siberia relative to the otherthree haplogroups. By contrast, haplogroupsB, Y, and Z formed small unbranched clus-ters, probably due to the limited number ofmtDNAs analyzed for each mtDNA lineagein east Asian and Siberian populations. No-tably, only east Asian mtDNAs were foundwithin haplogroup B, while Koryak andItel’men mtDNAs constituted all of the CRsequences present in the other two hap-logroups.

Genetic links between eastern Siberianand east Asian populations

Because the distribution of haplogroupsG and Y in northeast Asia pointed to the Seaof Okhotsk region as a possible source areafor these mtDNA lineages, we examined themtDNA variation in populations from thisgeographic region to determine their geneticaffinities with Paleoasiatic groups. In par-ticular, we were interested in assessing therelatedness of the Ainu, Japanese, and Kore-ans to the Koryaks and Itel’men since theseeast Asian groups had also been shown tohave haplogroup G and/or Y mtDNAs (Horaiet al., 1984; Ballinger et al., 1992; Hariharaet al., 1992). To do this, we compared the CRsequence data from the Ainu, Japanese, andKoreans (Torroni et al., 1993b; Horai et al.,1996) with those from Siberian Eskimos,Paleoasiatic-speaking groups, Amur Riverpopulations (Nivkhs and Udegeys), and theEvenks. Although the sequences of Horai etal. (1996) lacked sequence information forthe region between np 16000 and 16048 inwhich the np 16017C mutation from hap-logroup G occurs, all of the other phylogeneti-cally important nucleotide polymorphismsfor haplogroups A–D, G, Y, and Z werecontained within the region encompassed bytheir sequences (np 16048–16530). Hence,inferences about the relationships of themtDNAs from these populations with Sibe-rian groups was possible.

The resulting NJ tree revealed a numberof interesting associations between thesepopulations (Fig. 7). The first notable find-ing was that most of the haplogroups de-fined in this NJ tree occurred in both Paleoa-siatic and east Asian populations. Althoughnot previously classified as belonging tohaplogroup A, due to the HaeIII np 663 sitegain not being clearly identified in the ear-lier RFLP study of these samples (Horai andMatsunaga, 1986), a small number of Ainu,Japanese, and Korean sequences from Horaiet al. (1996) clearly fell into this mtDNAlineage (cluster C6 [Horai et al., 1996]). Allof these sequences lacked the np 16111Tmutation and hence were part of the Asian/Siberian sublineage III of this haplogroup(Fig. 6). In addition, the Koryaks and theAinu shared CR sequence #03 from this

26 T.G. SCHURR ET AL.

study, and the Itel’men had the closelyrelated CR sequence #04 from this hap-logroup. By contrast, the Japanese andKoreans exhibited a set of related CR se-quences from haplogroup A which weredistinctive from those in Kamchatkangroups by having the np 16187T mutationand also lacking the 16362C transition.

Second, unlike Siberian populations, allthree of the east Asian groups analyzedby Horai et al. (1996) had haplogroup BmtDNAs (cluster C2 [Horai et al., 1996]).The Ainu had a very low frequency of thesehaplotypes, as seen in an earlier RFLPstudy (Harihara et al., 1988), and thosepresent in this population were very similarto types detected in the Japanese and Kore-ans, who were also previously noted topossess deletion haplotypes (Horai and Mat-sunaga et al., 1986; Ballinger et al., 1992).Based on this distribution, it appears thatthe Ainu acquired deletion haplotypesthrough gene flow with Japanese popula-tions rather than having them as part oftheir ancestral gene pool. In addition, hap-logroup B mtDNAs separated into two dis-tinct subbranches rather than remain in asingle cluster, as seen in Horai et al. (1996).This result is in concordance with findingsin other studies of Asian mtDNA variation(Ballinger et al., 1992; Schurr, Starikov-skaya et al., unpublished) which argueagainst this mtDNA lineage being a mono-phyletic group.

Third, all three east Asian groups (Ainu,Japanese, Koreans) had haplogroup CmtDNAs at low frequencies (cluster C14[Horai et al., 1996]). This mtDNA lineagewas previously observed at low frequenciesin the Japanese (morph-9 [Horai et al.,1984]) but had not been detected in theKoreans (Ballinger et al., 1992) or the Ainu(Harihara et al., 1988). The majority ofthese CR sequences were located in sublin-eage I defined by the 16223T-16298C-16327T motif (Fig. 6), with all of thembelonging to the Japanese and Koreans. Incontrast, a minority of these CR sequencesshowed affinities with sublineage III inPaleoasiatic groups, including those pre-sent in the Ainu, although only the Koreanshad closely related mtDNAs. This distribu-tion again suggested an eastern Siberian/

east Asian source for sublineage I mtDNAswithin this haplogroup and perhaps an eastAsian source for sublineage III mtDNAs.

Another novel finding was that hap-logroup D mtDNAs appeared to be verycommon in the Ainu, Japanese, and Kore-ans. In fact, a number of different clusters ofCR sequences in these populations (clustersC5, C7, C8, C9, C10, and C11 [Horai et al.,1996]) had the 16223T–16362C motif whichcharacterizes this haplogroup. In general,the east Asian CR sequences from theseclusters were interspersed among similartypes from eastern Siberian groups, includ-ing the neighboring Nivkhs (sublineage II;Fig. 6), with the only exceptions being thecluster C11 mtDNAs, which, in having thenp 16189C mutation, formed a separatesubbranch. Similarly, the Chukchi and Es-kimo CR sequences from SIB48–53 alsoformed a separate branch (sublineage III;Fig. 6), as did that of SIB40 (sublineage I;Fig. 6), which was positioned closest tohaplogroupA, due to having several polymor-phisms in common with this mtDNA lin-eage. This branching pattern further illus-trated the significant diversity of CRsequences within the haplogroups presentin east Asian and Siberian populations andindicated that many of the CR clusters seenin Figure 7 are not monophyletic groupingsequivalent to haplogroups, as suggested byHorai et al. (1996).

Similarly, CR sequences from haplogroupG were apparently present in all three eastAsian populations, a result which was consis-tent with previous RFLP analyses of Korean(Ballinger et al., 1992) and Japanese (Horaiet al., 1984; Harihara et al., 1992) popula-tions. The Ainu showed the highest fre-quency of these types (cluster C16 [Horai etal., 1996]), and nearly all of them clusteredamong putatively similar mtDNAs fromPaleoasiatic populations, implying a com-mon genetic origin. On the other hand, all ofthe Japanese and Korean CR sequences inthis haplogroup cluster together separatefrom the Siberian/Ainu branch, implyingthat they might represent a divergent sublin-eage of this mtDNA lineage which arose intheir common ancestral population.

Concerning haplogroup Y, previous RFLPstudies of mtDNA variation (Horai et al.,

27MTDNA VARIATION IN KORYAKS AND ITEL’MEN

Fig. 7. An NJ tree of Siberian and east Asian CRsequences based on genetic distances estimated with theKimura two-parameter model in DNADIST (Felsen-stein, 1994). All Siberian individuals analyzed in Starik-ovskaya et al. (submitted) and this study are indicatedby branches with black triangles at their terminal tips,while those from Ainu, Korean, and Japanese individu-als analyzed in Horai et al. (1996) are indicated by

ellipses, with the population affiliation specified in thekey. The Korean and Taiwanese Han samples analyzedby Torroni et al. (1993b) are also indicated by ellipsesbut are denoted with asterisks to distinguish them fromthose analyzed in Horai et al. (1996). The haplogroups towhich the sequences belonged or were assigned based ontheir CR sequence motif and position in this NJ tree areindicated by the capital letters. The capital letters in

28 T.G. SCHURR ET AL.

Fig. 7. (Continued) shaded boxes correspond to thehaplogroups of Torroni et al. (1993a,b, 1994c) and thisstudy, whereas the groupings specified with a capital Cfollowed by a number correspond to the CR sequenceclusters identified in Horai et al. (1996). When there wasan exact correlation between these cluster designations,both were positioned by the bracket which encloses therelated CR sequences. No Siberian population showed

CR sequences which belonged to clusters C7, C9, C11,C13, C15, and C17. Cluster C3 did not appear in thistree because its sequences did not appear in the Ainu,Japanese, or Koreans, and C18 is absent because onlyone Korean from Horai et al. (1996) had a sequencebelonging to it. The circled a in both panels of the figureindicates where the two portions of the NJ tree areconnected.

29MTDNA VARIATION IN KORYAKS AND ITEL’MEN

1984; Horai and Matsunaga, 1986) were notable to show the presence of these haplo-types in the Japanese. However, CR se-quence analysis confirmed the presence ofhaplogroup Y mtDNAs in the Japanese andKoreans at very low frequencies as well as in19.6% of the Ainu from Hokkaido (cluster C1[Horai et al., 1996]). These findings sug-gested that Paleoasiatic and Ainu popula-tions had haplogroup Y mtDNAs as part oftheir ancestral genetic makeup or possiblythat the Itel’men and Koryaks acquired somehaplogroup Y mtDNAs through contactwith Ainu populations (Near Kurilers) whooccupied the southern tip of the Kam-chatkan population in prehistoric times. Fur-thermore, the occurrence of haplogroup YmtDNAs in the Japanese and Koreans im-plied that these populations obtained themthrough gene flow with the Ainu or otherSiberian groups having these mtDNAs.

Very few if any east Asian CR sequencesclustered with haplogroup Z mtDNAs fromKamchatkan populations (cluster C14 [Horaiet al., 1996]). All of the Japanese and KoreanCR sequences which clustered near thesetypes had at least two of the defining muta-tions of this mtDNA lineage (16185T and16260T and/or 16298C) but lacked two oth-ers (16129A, 16224C) which were present inall comparable Kamchatkan mtDNAs. Thisfinding suggested that, if part of haplogroupZ, these mtDNAs were distantly related tothose appearing in the Koryaks and Itel’men.These differences, along with the presencein Mongolians (Kolman et al., 1996) andEvens (Table 1) of haplogroup Z CR se-quences identical to those in Kamchatkanpopulations, probably means that thismtDNA lineage did not evolve in the Sea ofOkhotsk/Amur River region.

There were also several other clusters ofCR sequences from the east Asian popula-tions which did not appear in Siberian popu-lations. At least one of these was haplogroupF mtDNAs, since this mtDNA lineage wasknown to be present in the Japanese (Horaiet al., 1984; Harihara et al., 1992), Koreans(Ballinger et al., 1992), and Ainu (Hariharaet al., 1992). Based on preliminary CR se-quence data for Southeast Asians (Schurr etal., unpublished data), this branch can tenta-tively be identified as cluster C4 in Horai

et al. (1996). However, in Figure 7, thesemtDNAs were split into two clusters, withmost CR sequences belonging to the largecluster located between cluster C17 of Horaiet al. (1996) and haplogroup D.

Due to the lack of RFLP data for theremaining east Asian CR sequences, theexact lineal affiliations of the other two mainclusters (C15 and C17 [Horai et al., 1996])remains unknown, although both have verydistinctive CR sequence motifs. C15 andC17 mtDNAs appeared in the Japanese andKoreans but occurred at the highest fre-quency in the Ainu and the Ryukyuans ofOkinawa, the two aboriginal populations ofthe Japanese archipelago. Because theseclusters were not present in eastern Sibe-rian populations and occurred at low fre-quencies in Koreans and Japanese, theymust have evolved in ancestral populationsof the Ainu and Ryukyuans which expandedinto these islands before the ancestors of themodern Japanese and Koreans arrived. Thisinterpretation is generally consistent withthe hybridization hypothesis for Japaneseorigins (Hanihara, 1991) which proposesthat Jomon peoples originated in SoutheastAsia and came to the Japanese islands.12,000 YBP, where they gave rise to theAinu and Ryukyuans, while the progenitorsof the Japanese and Koreans, the Yayoipeople, emigrated from the Korean penin-sula some 2,300 YBP and replaced or ab-sorbed these aboriginal groups.

The relationships shown in Figure 7 werecompletely consistent with the nucleotidediversity estimates calculated for Paleoasi-atic and east Asian populations (data notshown). These estimates, along with theassociated NJ tree (Fig. 8), clearly showedthe close genetic affinities of contemporaryKorean and Japanese populations as well asthe genetic similarity of the Ainu to bothgroups, with the latter association probablybeing attributable to recent admixture be-tween the Ainu and Japanese. Moreover,there was the large split between Paleoasi-atic-speaking populations, with the Koryaksand Itel’men showing much closer geneticties to the Ainu and the Chukchi being moreclosely linked with the Siberian Eskimosand Northwest Coast Amerindian popula-tions. These associations were consistent

30 T.G. SCHURR ET AL.

with previous phylogenetic evidence andstatistical estimates of genetic identity bydescent which implied the considerable di-vergence of Paleoasiatic-speaking popula-tions (Tables 6, 7) as well as with similarcomparisons of Native American and eastAsian CR sequence diversity (Shields et al.,1993; Horai et al., 1996).

DISCUSSIONGenetic history of Kamchatkan

populations

One of the most significant findings of thisstudy was the genetic discontinuity betweenPaleoasiatic-speaking populations of Chukotkaand Kamchatka. Several different measuresof haplotypic diversity showed that the Kory-ak and Itel’men populations were geneti-cally very similar to one another but quitedistinct from the Chukchi, who are linguisti-

cally related to the Koryaks. The Kamchat-kan groups were also quite divergent fromthose which evolved from the ancient Berin-gian gene pool, such as the Eskimo-Aleutsand Na-Dene Indians, suggesting that theirancestral populations replaced the survivorsof the Bering land bridge in this regionduring the Neolithic period. The only linksto these earlier populations were the pres-ence in the Koryaks of haplotype SIB41mtDNAs from haplogroup A having both the16111T and np 16192T mutations and ofhaplotype SIB40 from haplogroup D. How-ever, the recent acquisition of these haplo-types through gene flow with the neighbor-ing Chukchi, in whom these types of mtDNAsare more common, cannot be excluded.

While more closely related to each otherthan to any other Siberian population, theKoryaks and Itel’men showed significantdifferences between them. The CR sequencedata did show that Koryaks and Itel’menshared the putative founding mtDNAs ofhaplogroups C, G, Y, and Z, suggesting theymight have originated from a common ances-tral population in the Okhotsk Sea region.However, they also exhibited significant dif-ferences in haplogroup frequencies and hap-lotype distributions, with nearly all uniqueCR sequences occurring in one population orthe other. These results support other lin-guistic and culture evidence that the Itel’menand Koryak populations arose from tempo-rally distinct expansions into the Kam-chatka peninsula, with the ancestral Itel’menbeing the first to enter this region during theSiberian Neolithic (Arutiunov, 1988; Dikov1990, 1994). These results further reveal aninconsistency between the linguistic affilia-tions and genetic profiles of Paleoasiaticspeakers, a phenomenon which appears tobe common for most indigenous populationsof Siberia (Szathmary, 1981; Sukernik, 1992).

Moreover, both RFLP and CR sequencedata further revealed the considerable differ-entiation of Koryak subgroups living inKamchatka. Although sharing several puta-tive founding mtDNAs, the Aluitor, Karagin,and Palan Koryaks exhibited significant dif-ferences in haplogroup frequencies and hap-lotype distribution. These results were con-sistent with both ethnographic and linguisticevidence for dialectic subdivisions of the

Fig. 8. An NJ tree based on genetic distances gener-ated from pairwise nucleotide diversity estimates forSiberian and east Asian populations. Branch lengths areproportionate to the relative distances between popula-tions. The sources and designations for these popula-tions are specified in Methods.

31MTDNA VARIATION IN KORYAKS AND ITEL’MEN

Koryak population, as summarized byVdovin (1973). In addition, the Aluitor andKaragin Koryaks appeared to be more geneti-cally similar to each other than either was tothe Palan Koryaks, as expected from theircloser linguistic association, although differ-ences in the distribution of the two major CRsublineages from haplogroup C (I and II)separated theAluitor Koryaks from the othertwo subgroups. This apparent discrepancy isprobably attributable to considerable admix-ture between Itel’men populations and theKaragin and Palan Koryaks, as marital ex-changes between the Itel’men and Koryaksliving along the Tigil’ River have been docu-mented since the seventeenth century(Krasheninnikov, 1972) and records of theRussian Orthodox Church from the villageof Ivashka dating between ,1850 and 1930(Sukernik and Schurr, unpublished data)indicate that the remnants of the easternItel’men fused with the adjacent KaraginKoryaks during the second half of the nine-teenth century.

The high statistical significance of thedifferences in haplotype distribution in Kam-chatkan populations and between theKoryak subgroups was also notable for otherreasons. The closing of traditional settle-ments throughout the peninsula during theSoviet period led to the consolidation ofwidely separate settlements, which in turnprobably caused the mixing of differentKoryak subgroups. The most direct evidenceof the effects of population amalgamationmay be that Reindeer and Maritime Kory-aks are largely genetically indistinguishablefrom one another despite the fact that Rein-deer Koryaks developed out of Okhotsk Ko-ryak populations. In addition, warfare, epi-demics, and intermarriage with nonnativesin the last 300 years have had a profoundeffect on their population size and composi-tion. Nevertheless, our results suggest thatremnants of the former dialectical and terri-torial subdivisions of Koryaks have per-sisted into modern times despite the enor-mous demographic impact of Russiancolonization.

The mtDNA data were also consistentwith the archeological evidence from north-eastern Siberia. Since 14,000 YBP, there hasbeen a series of population expansions along

the Asiatic coastline of the Bering Sea bycultures having different subsistence strate-gies and lithic technologies. The Early Ushkisite in the Kamchatka peninsula (14,300YBP) was occupied by a culture based on biggame hunting and salmon fishing whichemployed bifacial projectile points similar tothe Late Paleolithic Paleoindian stemmedpoint industry of western North America(Arutiunov and Sergeev, 1990a,b; Dikov,1990, 1994). However, by 10,860 YBP, theLate Ushki culture differed markedly fromthe earlier one in terms of the size and formof its dwellings, the use of bifacially re-touched leaf-shaped microblades, and thefirst appearance of stone lip labret ornamen-tation. These innovations in the earlyHolocene, along with the development ofnontoggling, multibarbed harpoon technolo-gies for hunting sea mammals which werewidely employed in the Sea of Okhotsk andBering Sea regions, demarcate the begin-nings of the proto-Eskimo-Aleut culturaltradition in northeast Siberia (Arutiunovand Sergeev, 1990a,b; Dikov, 1990, 1994).

During the mid-Holocene climatic opti-mum, at approximately 6,000–4,000 YBP,there was substantial population growth inthe littoral area of the Okhotsk Sea regionassociated with the spread of continentalcultures of reindeer hunters from the Lenaand Kolyma River basins (Mochanov, 1962;Fedoseeva, 1968; Simchenko, 1976). As sug-gested by Vasilievskiy (1971), the expansionof these continental tribes into the northernOkhotsk Sea region apparently gave rise tothe ancestral Koryak and Itel’men popula-tions, whereas movement from the lowerAmur River–Sakhalin region appeared tohave played a supplementary role in theirorigins. Furthermore, the expansion of Neo-lithic ‘‘Southern Okhotsk’’ cultures intonorthern Japan from the lower Amur Riverregion might have substantially contributedto the origin of the Ainu, who, until veryrecently, occupied Sakhalin Island, theKurile Islands, and the southern tip of theKamchatka peninsula (Vasilievskiy, 1971;Krasheninnikov, 1972;Arutiunov, 1988). Thisscenario, based primarily on archeologicaldata, suggests that the genetic profiles of theKoryaks and Itel’men should be distinctivefrom those of the Nivkhs and Ainu, an

32 T.G. SCHURR ET AL.

interpretation which is supported by themtDNA data.

On a broader scale, the expansion ofPaleoasiatic-speaking peoples into north-east Asia led to the near total replacement ofthe ancient Bering Sea cultures in Kam-chatka, with different varieties of the an-cient Koryak culture diffusing extensivelyalong the Okhotsk Sea and coastline of thenorthwestern Pacific (Vasilievskiy, 1971;Arutiunov and Sergeev, 1990a,b; Dikov,1994). However, the mtDNA data indicatethat, while absorbing elements of the Es-kimo-Aleut culture during their expansion,ancestral Koryak and Itel’men groups didnot extensively incorporate members of thesemaritime tribes (Torroni et al., 1993b; Shieldset al., 1993; Starikovskaya et al., 1998; thisstudy). This pattern was also seen in theY-chromosome data for the same popula-tions (Lell et al., 1997a,b), which showedstrong links between Native American andChukotkan populations and their distinctive-ness from other northeast Asian groups.Thus, both the genetic and archeologicaldata indicate that multiple population and/orcultural expansions have taken place in theOkhotsk Sea and Bering Sea region over thelast 10,000 years, with more recently evolvedgenotypes and cultural traditions from north-east Asia overlapping and/or replacing moreancient ones.

Genetic discontinuity at the northPacific Rim

The analysis of mtDNA variation withinand among Paleoasiatic speakers (Chukchi,Koryaks, and Itel’men), Eskimos, Na-Dene

Indians, and Amerindian tribes of the Pa-cific Northwest have shown that these groupsare quite divergent from one another (Shieldset al., 1993; Torroni et al., 1993a,b; Stari-kovskaya et al., 1998; this study). Althoughadditional mtDNA lineages are present inall Siberian populations except for the Sibe-rian Eskimos, the common ancestry of Sibe-rian and Native American groups is evi-denced by the ubiquitous presence ofhaplogroups A, C, and D in these popula-tions (Table 4). However, aside from theputative founding haplotypes for hap-logroups A (SIB41/AM09), C (SIB26/AM43),and D (SIB13/AM88), these populationsshare no other haplotypes, with the remain-ing mtDNAs from these haplogroups beinglargely population- or region-specific. Alongwith the ML estimates for haplogroups A, C,and D in both regions (Table 10), thesefindings imply the considerable antiquity ofthe primary mtDNA lineages occurring inboth Siberia and the New World as well astheir extensive divergence from each othersince being isolated in each continental re-gion over 20,000 years ago.

While evidence for the antiquity of theinitial colonization of the New World israpidly accumulating (e.g., Bonatto and Sal-zano, 1997a), the population dynamics innortheast Asia subsequent to the last glacialmaximum (,18,000 YBP) are of more impor-tance for determining the origins of Paleoasi-atic speakers of Chukotka and Kamchatkaand their affinities with other Siberian popu-lations. In this regard, certain of the majorpopulational events occurring during thisperiod can be associated with specific mtDNA

TABLE 10. Sequence divergence of mtDNA haplogroups in Siberia and the Americas1

HaplogroupGeographic

region n NSequence

divergence (%)Divergencetime (YBP)

A Siberia 10 119 0.0280 12,714–9,645America 46 189 0.0789 35,550–26,969

B America 30 99 0.0391 17,773–13,483C Siberia 14 123 0.0433 19,686–14,934

America 31 77 0.1223 54,009–40,972D Siberia 13 47 0.1115 50,664–38,434

America 16 62 0.0565 25,682–19,483G Siberia 11 106 0.0239 10,855–8,234Y Siberia 7 58 0.0138 6,273–4,759Z Siberia 4 12 0.0209 9,495–7,2031 n, number of haplotypes for each haplogroup; N, number of individual mtDNAs for each haplogroup. The sequence divergenceestimates were weighted by the number of individuals within each haplogroup, and divergence times were calculated using a mtDNAevolutionary rate of 2.2–2.9% per MYR (Torroni et al., 1994a).

33MTDNA VARIATION IN KORYAKS AND ITEL’MEN

polymorphisms present in haplogroup AmtDNAs. To begin with, the 16111T muta-tion in haplogroup A mtDNAs delineates theemergence of ancestral Paleoindian popula-tions and their dispersal in the New World.Later, after the initial occupation of the NewWorld, ancient Beringian populations appar-ently became isolated from ancestral Paleo-indian groups, during which time the largenorth Pacific Rim sublineage defined by the16192T mutation arose among the ancestralpopulations for the Chukchi, Siberian andAlaskan Eskimos, and Na-Dene Indians. Inaddition, a number of population- or region-specific haplotypes in each of these twosublineages of haplogroup A arose in theBeringian groups independent of those occur-ring in Paleoindian populations (Ward et al.,1991, 1993; Torroni et al., 1992, 1993b;Shields et al., 1993; Starikovskaya et al.,1998). A similar pattern of diversity was alsoobserved for haplogroup D mtDNAs inChukotkan populations, which differed fromcomparable Native American mtDNAs byseveral unique mutations (Starikovskaya etal., in press). These population-specificmtDNA sublineages probably reflect the iso-lation and reemergence of remnant popula-tions occupying biogeographic refugia in Ber-ingia and southern Alaska which existeduntil the end of the last glacial maximum(Rogers et al., 1991).

The pattern and timing of the expansionsout of Beringia are also mirrored by thedifferent divergence values for haplogroup Ain Siberia and the Americas (Table 10). Theestimated sequence divergence for this hap-logroup in Siberia was 0.028%, a value con-siderably less than that for the Americas,0.079%. These values give correspondinglydifferent divergence times for Siberia(13,000–10,000 YBP) and the Americas(36,000–27,000 YBP). This apparent discrep-ancy is largely attributable to almost exclu-sively Chukotkan haplotypes being presentin the haplogroup A estimate for Siberia. Infact, our estimates of the genetic divergenceof haplogroup A in Siberian and NativeAmerican populations, one for Chukotkangroups (0.029%, 12,727–9,655 YBP), an-other for Na-Dene Indians (0.021%, 9,545–7,241 YBP [Torroni et al., 1992]), and a thirdfor Amerindians (0.079%, 35,909–27,241

YBP), clearly show the extent of diversitywhich has developed in them, not just withinthe haplogroup itself. This interpretation issupported by the fact that only the foundinghaplotype (SIB41/AM01) of this haplogroupis shared amongst them, and all other haplo-types are unique to each set of populations(Torroni et al. 1992, 1993a,b, 1994a,b; Stari-kovskaya et al., 1998). Thus, while thesedivergence estimates do not give exact timesfor the ages of specific ethnic groups, they doprovide a temporal framework in which toview the emergence of the ancestral popula-tions for the three major Native Americanlinguistic divisions.

Given this pattern of genetic divergence innortheast Asia and the New World, it wasnot surprising that there was a strikingdiscontinuity in haplotypic diversity be-tween Kamchatkan and Native Americanpopulations. Although having a number ofhaplogroup A, C, and D haplotypes, theKoryaks and Itel’men were not closely geneti-cally related to Native American groups andactually shared only SIB41 (AM01) fromhaplogroup A and SIB26 (AM43) and SIB45(AM32) from haplogroup C. The NJ tree ofCR sequences (Fig. 6) strongly confirmed thepattern seen in the RFLP haplotype data,with the Kamchatkan CR sequences havingthe strongest affinities with those in NativeAmericans belonging only to haplogroups Aand C. However, the Koryak haplogroup AmtDNAs (SIB41) having the 16111T muta-tion also possessed the 16192T mutation,indicating that they were not directly linkedto those in Amerindian groups. The otherhaplogroup A CR sequences in the Koryaksand Itel’men lacked the 16111T mutationand in doing so more closely resembledmtDNAs present in east Asian and easternSiberian groups, in whom they probablyoriginated. Therefore, while Kamchatkangroups have haplogroup A mtDNAs andeven the putative founding haplotype, SIB41(AM01), none of them are closely related tocomparable mtDNAs fromAmerindian popu-lations.

The same trend was observed in hap-logroup C mtDNAs from Kamchatkan popu-lations. The Siberian mtDNAs with CR se-quences most akin to those in NativeAmericans (sublineage I) represented a mi-

34 T.G. SCHURR ET AL.

nority of those present in the Koryaks (11.9%)and did not occur in the Itel’men at all,whereas they occurred at much greaterfrequency in other eastern Siberian (Evenksand Udegeys) and east Asian (Han Chi-nese and Koreans) populations (Torroniet al., 1993b). The remaining haplogroup CmtDNAs in Kamchatkan populations be-longed to sublineage II, which occurred onlyin the Koryaks, and sublineage III, whichoccurred in both Koryaks and Itel’men, withsublineage II being related to haplotypespresent in east Asian and eastern Siberianpopulations. Thus, while SIB26 haplo-types comprised 23.2% of all KamchatkanmtDNAs from haplogroup C, very few ofthem resembled comparable mtDNAs in Na-tive American groups at the CR sequencelevel. Therefore, the majority of the Koryakand Itel’men mtDNAs are not the same asthe founding haplotype in New World popu-lations and instead must have arisen afterthe colonization of the New World.

Our analyses have also shown that hap-logroup B mtDNAs are absent in the Kory-aks and Itel’men as well as the Chukchi andSiberian Eskimos (Torroni et al., 1993b;Starikovskaya et al., in press; this study).Their absence in the Koryaks and Itel’menwas an important finding because the Kam-chatka peninsula was contiguous with therest of Beringia during the last glacial maxi-mum (Fladmark, 1979; Hopkins, 1979; Hoef-fecker et al., 1993) and could have been partof the route that the immigrant popula-tion(s) carrying haplogroup B took whilemoving across the Bering Strait from Asiainto the Americas. Thus, it appears thathaplogroup B was never part of the ances-tral gene pool for Paleoasiatic-speaking popu-lations and that these populations played norole in the dispersal of this mtDNA lineageinto the New World.

In contrast, the Ainu, with whom theItel’men reportedly had considerable con-tact in historic times (Krasheninnikov, 1972),exhibited haplogroup B mtDNAs, althoughat low frequencies (2.0%) (Harihara et al.,1992). Along with the higher frequencies ofhaplogroup B in the modern Japanese andKoreans (,10–16%) (Horai et al., 1984; Ball-inger et al., 1992; Horai et al., 1996) thisfinding may reflect the relatively recent

expansion of this mtDNA lineage into theSea of Okhotsk region. These data alsosuggest that populations bearing haplogroupB mtDNAs could have originated in eastAsia and moved across Beringia via a coastalroute.

Several lines of evidence support the hy-pothesis of a separate migration of peoplescarrying haplogroup B mtDNAs throughBeringia to the New World. First, the ab-sence of haplogroup B mtDNAs in centraland eastern Siberian populations whichshare founding haplotypes from haplogroupsA, C, and D with Native American groupsmay imply that these haplotypes were notpresent in the original progenitors of NewWorld populations. Second, the virtual ab-sence of haplogroup B mtDNAs in modernEskimo, Aleut, and northern Na-Dene In-dian populations, which represent more re-cent demic expansions into North America,implies that haplogroup B mtDNAs werenot present in the Beringian region after10,000 YBP, when these populations werefounded. Third, the ML divergence estimatefor haplogroup B of 17,000–13,000 YBP isconsiderably smaller than that of hap-logroups A, C, and D in the New World(Table 10). These data imply that hap-logroup B mtDNAs arrived in the Americasafter the initial Paleoindian migrationbrought haplogroups A, C, and D to the NewWorld but before the Beringian expansion(s)which gave rise to the ancestral Eskimo-Aleut and Na-Dene Indian populations intothe arctic and subarctic regions. This laterexpansion of Beringian groups into the NewWorld might also explain the low frequen-cies of haplogroup B mtDNAs in most north-ern North American Indian populations(Ward et al., 1991, 1993; Torroni et al., 1992,1993a; Lorenz and Smith, 1994) relative tothose inhabiting regions farther south.

Alternatively, haplogroups A–D may havebeen brought together during the initialcolonization of the New World. All four hap-logroups are observed in most modern andancient Native American populations(Schurr et al., 1990; Ward et al., 1991, 1993;Torroni et al., 1992, 1993a, 1994a,b; Gintheret al., 1993; Horai et al., 1993; Santos et al.,1994; Stone and Stoneking, 1994; Batista etal., 1995; Kolman et al., 1996; Lorenz and

35MTDNA VARIATION IN KORYAKS AND ITEL’MEN

Smith, 1994, 1996; Merriwether et al., 1995).The distribution and age of haplogroup B inAsia also suggest that this mtDNA lineageevolved in and spread from the region encom-passing Mongolia, Tibet, the northern Hima-layas, and southern Siberia by at least30,000–24,000 YBP (Ballinger et al., 1992;Lum et al., 1994; Horai et al., 1996), imply-ing it could have been present in the ances-tral Siberian groups which first moved intothe New World. Such dates are consistentwith recent estimates of CR sequence diver-sity within haplogroup B in Native Ameri-cans, which indicate that this mtDNAlineage arrived in the New World by 30,000–25,000 YBP (Bonatto and Salzano, 1997b).These competing interpretations clearly in-dicate that further research is required todelineate the origin and dispersal of hap-logroup B in the Americas.

On the other hand, haplogroups G, Y, andZ have not been observed among NativeAmerican groups analyzed by high resolu-tion RFLP analysis (Schurr et al., 1990;Torroni et al., 1992, 1993a, 1994b). Although‘‘Other’’ haplotypes have also been detectedin both ancient and modern Native Ameri-can populations by partial haplotype analy-sis and CR sequencing (Bailliet et al., 1994;Hauswirth et al., 1994; Stone and Stone-king, 1994; Merriwether et al. 1995; Lorenzand Smith, 1996; Ribiero-Dos-Santos et al.,1996), the limited data for these samplesindicate that they also do not belong tohaplogroups G, Y, and Z. Therefore, popula-tions bearing these haplotypes must havespread in northeast Asia after the populat-ing of the New World. This interpretation isconsistent with their younger divergencetimes relative to those of haplogroups A–Din Siberia and the Americas (Table 10) andwith haplogroup G being the oldest andmost widespread of these mtDNA lineagesin the eastern Siberia/eastAsia region (Horaiet al., 1984; Ballinger et al., 1992; Torroni etal., 1993b, 1994c; Starikovskaya et al., 1998;this study).

ACKNOWLEDGMENTS

The authors thank Lorri Griffin and theClinical Research Center of the Emory Uni-versity School of Medicine, supported byNIH M01-RR-00039, for their assistance in

the processing of blood samples, and Drs.Andy Kogelnik and Sandro L. Bonatto fortheir assistance with the statistical analysesof the mtDNA data. We also thank thehospital staff and doctors in the villages ofOssora, Karaga, and Tymlat for their assis-tance with this project and the Koryak peoplefrom these villages for their participation inthis research and hospitality during ourfield work in July–August 1993. Similarly,we extend our gratitude to the hospital staffin the villages of Voyampolka and Kovranfor assistance with this project and to theKoryak and Itel’men people from these vil-lages for their participation in this researchduring our field work in June 1996. Thisresearch was supported by grants from TheWenner-Gren Foundation for Anthropologi-cal Research (GR 5745), The National Sci-ence Foundation (NSF 9414900), The LeakeyFoundation, and Sigma Xi to T.G.S.; fromThe Wenner-Gren Foundation for Anthropo-logical Research (GR 5680, GR 5680-S, andGR 6077), INTAS (96–1766), and the Rus-sian Fund for Basic Research (97–04–49816)to R.I.S.; from the J. Worley Brown Fel-lowship Fund and the National Institutesof Health (GM49615, NS21328, HL45572,AG10130, and AG13154) to D.C.W; and fromtheEmoryClinicalResearchCenter (BR00039).

LITERATURE CITED

Anderson S, Bankier AT, Barrell BG, DeBruijn MHL,Coulson AR, Drouin J, Eperon IC, Nierlich DP, RoseBA, Sanger F, Schreier PH, Smith AJH, Staden R,Young IG. 1981. Sequence and organization of thehuman mitochondrial genome. Nature 290:457–465.

Antropova VV. 1964a. The Koryaks. In: Levin MG,Potapov LP, editors. The peoples of Siberia. Chicago:University of Chicago Press. p 851–875.

Antropova VV. 1964b. The Itel’mens. In: Levin MG,Potapov LP, editors. The peoples of Siberia. Chicago:University of Chicago Press. p 876–883.

Arutiunov SA. 1988. Koryaks and Itel’men: dwellers ofthe smoking coast. In: Fitzhugh WW, Crowell A,editors. Crossroads of continents. Baltimore: Smithso-nian Institution Press. p 31–35.

Arutiunov SA, Sergeev DA. 1990a. Issues of the ethnichistory of the Bering Sea. Chapter 1. Problems in thestudy of the ethnic history of the northwestern coast-line of the Pacific coast. Soviet Anthropol Archeol28:50–61.

Arutiunov SA, Sergeev DA. 1990b. Issues of the ethnichistory of the Bering Sea. Chapter 8. The role ofancient Eskimo culture in the ethnic history of thenorthern Pacific basin. Soviet Anthropol Archeol 28:62–77.

Bailliet G, Rothhammer F, Carnese FR, Bravi CM,Bianchi NO. 1994. Founder mitochondrial haplotypesin Amerindian populations. Am J Hum Genet 55:27–33.

36 T.G. SCHURR ET AL.

Ballinger SW, Schurr TG, Torroni A, Gan YY, Hodge JH,Hassan K, Chen K-H, Wallace DC. 1992. SoutheastAsian mitochondrial DNA analysis reveals geneticcontinuity of ancient mongoloid migrations. Genetics130:139–152.

Batista O, Kolman CJ, Bermingham E. 1995. Mitochon-drial DNA diversity in the Kuna Amerinds of Panama.Hum Mol Genet 4:921–929.

Bogoras W. 1910. The Chukchee. Publication of theJesup North Pacific Expedition, vol. 8, part 1. Leiden,NY: American Museum of Natural History. ReprintedNew York: AMS Press, 1975.

Bonatto SL, Salzano FM. 1997a. A single and earlyorigin for the peopling of the Americas supported bymitochondrial DNA sequence data. Proc Natl Acad SciU S A 94:1866–1871.

Bonatto SL, Salzano FM. 1997b. Diversity and age of thefour major mtDNA haplogroups, and their implica-tions for the peopling of the New World. Am J HumGenet 61:1413–1423.

Cann RL, Brown WM, Wilson AC. 1984. Polymorphicsites and the mechanism of evolution in humanmitochondrial DNA. Genetics 106:479–499.

Cann RL, Stoneking M, Wilson AC. 1987. MitochondrialDNA and human evolution. Nature 325:31–36.

Cavalli-Sforza LL, Edwards AWF. 1967. Phylogeneticanalysis: models and estimation procedures. Evolu-tion 21:550–570.

Chen Y-S, Torroni A, Excoffier L, Santachiara-Benere-cetti AS, Wallace DC. 1995. Analysis of mtDNA varia-tion in African populations reveals the most ancient ofall human continent-specific haplogroups. Am J HumGenet 57:133–149.

Debets GF. 1951. Anthropological investigations in theKamchatka Region. In: Transactions of the Instituteof Ethnography, Russian Academy of Sciences, vol. 17.Moscow. p 3–263 (in Russian).

Derenko MV, Shields GF. 1998. Polymorphism in RegionV of mitochondrial DNA in indigenous populations ofnorthern Asia. Genetika 34:411–415.

Dikov NN. 1990. The origin of the aboriginal popula-tions of America. Soviet Anthropol Archeol 28:12–29.

Dikov NN. 1994. The Paleolithic of Kamchatka andChukotka and the problem of the peopling of America.In: Fitzhugh WW, Chausonnet V, editors. Anthropol-ogy in the Pacific Rim. Washington DC: SmithsonianInstitution Press. p 87–95.

Easton RD, Merriwether DA, Crews DE, Ferrell RE.1996. mtDNA variation in Yanomami: evidence foradditional New World founding lineages. Am J HumGenet 59:213–225.

Fedoseeva SA. 1968. Ancient cultures of the upper ViluiRiver. Moscow: Nauka (in Russian).

Felsenstein J. 1994. PHYLIP (Phylogeny Inference Pack-age), version 3.572c. Seattle: University of Washing-ton.

Fladmark KR. 1979. Routes: alternative migration corri-dors for early man in North America. Am Antiquity44:55–69.

Forster P, Harding R, Torroni A, Bandelt HJ. 1996.Origin and evolution of Native American mtDNAvariation: a reappraisal. Am J Hum Genet 59:935–945.

Ginther C, Corach D, Penacino GA, Rey JA, Carnese FR,Hutz MH, Anderson A, Just J, Salzano FM, King MC.1993. Genetic variation among the Mapuche Indiansfrom the Patagonian region of Argentina: mitochon-drial DNA sequence variation and allele frequenciesof several nuclear genes. In: Pena SDJ, ChakrabortyR, Epplan JT, Jefferies AJ, eds. DNA fingerprintingstate of the science. Basel: Berkhauser Verlag, Switzer-land. p 211–219.

Gurvich IS. 1966. Ethnic history of north-east Siberia.Moscow (in Russian).

Hanihara K. 1991. Dual structure model for the popula-tion history of the Japanese. Japan Review 2:1–33.

Harihara S, Saitou N, Hirai M, Gojobori T, Park KS,Misawa S, Ellepola SB. 1988. Mitochondrial DNApolymorphism among five Asian populations. Am JHum Genet 43:134–143.

Harihara S, Hirai M, Suutou Y, Shimizu K, Omoto K.1992. Frequency of a 9-bp deletion in the mitochon-drial DNAamongAsian populations. Hum Biol 64:161–166.

Hauswirth WW, Dickel CD, Rowold DJ, Hauswirth MA.1994. Inter- and intrapopulation studies of ancienthumans. Experientia 50:585–591.

Hoffecker JF, Powers WR, Goebel T. 1993. The coloniza-tion of Beringia and the peopling of the New World.Science 259:46–53.

Hopkins DM. 1979. Landscape and climate of Beringiaduring late Pleistocene and Holocene time. In: Laugh-lin WS, Harper AB, editors. The first Americans:origins, affinities, and adaptations. New York: GustavFischer. p 15–42.

Horai S, Hayasaka K. 1990. Intraspecific nucleotidesequence differences in the major non-coding region ofthe human mitochondrial DNA. Am J Hum Genet46:828–842.

Horai S, Matsunaga E. 1986. Mitochondrial DNA poly-morphism in Japanese. I. Analysis with restrictionendonucleases of six base-pair recognition. Hum Genet68:324–332.

Horai S, Gojobori T, Matsunaga E. 1984. MitochondrialDNApolymorphism. II.Analysis with restriction endo-nucleases of four and five base-pair recognition. HumGenet 72:105–117.

Horai S, Kondo R, Nakasawa-Hattori Y, Hayasaki S,Sonoda S, Tajima K. 1993. Peopling of the Americas,founded by four major lineages of mitochondrial DNA.Mol Evol Biol 10:23–47.

Horai S, Murayama K, Hayasaka K, Matsubayashi S,Hattori Y, Fucharoen G, Harihara S, Park KS, OmotoK, Pan I-H. 1996. mtDNA polymorphism in east Asianpopulations, with special reference to the peopling ofJapan. Am J Hum Genet 59:579–590.

Jochelson WR. 1908. The Koryak. The Jesup NorthPacific Expedition, vol. 6. Leiden, NY: American Mu-seum of Natural History. Reprinted New York: AMSPress, 1975.

Johnson MJ, Wallace DC, Ferris SD, Rattazzi MC,Cavalli-Sforza LL. 1983. Radiation of human mito-chondrial DNA types analysed by restriction endo-nuclease cleavage patterns. J Mol Evol 19:255–271.

Jukes TH, Cantor CR. 1969. Evolution of protein mol-ecules. In: Munro HN, editor. Mammalian proteinmetabolism. New York: Academic Press. p 21–132.

Kimura M. 1980. A simple method for estimating evolu-tionary rates of base substitutions through compara-tive studies of nucleotide sequences. J Mol Evol16:111–120.

Kolman CJ, Sambuughin N, Bermingham E. 1996.Mitochondrial DNA analysis of Mongolian popula-tions and implications for the origins of New Worldfounders. Genetics 142:1321–1334.

Krasheninnikov S. 1972 (1755). A description of theKamchatka region, 1735–1741. Crownhart-VaughnEAP, editor/translator. Portland, OR: Oregon Histori-cal Society.

Krauss ME. 1988. Many tongues—ancient tales. In:Fitzhugh WW, Crowell A, editors. Crossroads of conti-nents. Baltimore: Smithsonian Institution Press. p145–150.

Krushanov AI. 1990. Itel’mens: history and culture.Leningrad: Nauka (in Russian).

37MTDNA VARIATION IN KORYAKS AND ITEL’MEN

Krushanov AI, editor. 1993. The history and culture ofKoryaks. St. Petersburg: Nauka (in Russian).

Lell JT, Brown MD, Schurr TG, Sukernik RI, Starik-ovskaya YB, Torroni A, Moore LG, Troup GM, WallaceDC. 1997a. Y chromosome polymorphisms in NativeAmerican and Siberian populations: identification offounding Native American Y chromosome haplotypes.Hum Genet 100:536–543.

Lell JT, Schurr TG, Brown MD, Sukernik RI, Starik-ovskaya YB, Scozzari R, Cruciani F, Wallace DC.1997b. Y-chromosome compound haplotype analysis:Implications for the peopling of the Americas andelucidation of intra-Siberian relationships. Am J HumGenet 61(Suppl):A204.

Levin MG. 1958. Ethnic anthropology and problems ofthe ethnology of the peoples of the Far East. Moscow/Leningrad (in Russian).

Levin MG, Vasiliev BA. 1964. The Evens. In: Levin MG,Potapov LP, editors. The peoples of Siberia. Chicago:University of Chicago Press. p 670–684.

Lorenz JG, Smith DG. 1994. Distribution of the 9-bpmitochondrial DNA region V deletion among NorthAmerican Indians. Hum Biol 66:777–788.

Lorenz JG, Smith DG. 1996. Distribution of four found-ing mtDNA haplogroups among Native North Ameri-cans. Am J Phys Anthropol 101:307–323.

Lum JK, Rickards O, Ching C, Cann RL. 1994. Polyne-sian mitochondrial DNAs reveal three deep maternallineage clusters. Hum Biol 66:567–590.

Malyarchuk BA, Derenko MV, Solovenchuk LL. 1994.Restriction pattern types of the major noncodingregion of mitochondrial DNA in native and migrantinhabitants of northeastAsia. Russian J Genet 30:750–754.

Merriwether DA, Rothhammer F, Ferrell RE. 1994.Genetic variation in the New World: ancient teeth,bone, and tissue as sources of DNA. Experientia50:592–601.

Merriwether DA, Rothhammer F, Ferrell RE. 1995.Distribution of the four founding lineage haplotypesin Native Americans suggests a single wave of migra-tion for the New World. Am J Phys Anthropol 98:411–430.

Merriwether DA, Hall WW, Vahlne A, Ferrell RE. 1996.mtDNA variation indicates Mongolia may have beenthe source for the founding population for the NewWorld. Am J Hum Genet 59:204–212.

Mochanov YA. 1962. Early Neolithic of the Aldan Riverbasin. Moscow: Nauka (in Russian).

Murashko O. 1994. A demographic history of the Kam-chadal/Itel’men of the Kamchatka Peninsula: Model-ing the precontact numbers and postcontact depopula-tion. Arctic Anthropol 31:16–30.

Nei M. 1972. Genetic distances between populations.Am Naturalist 106:283–292.

Nei M. 1987. Molecular evolutionary genetics. NewYork: Columbia University Press.

Nei M, Jin L. 1989. Variances of the average number ofnucleotide substitutions within and between popula-tions. Mol Evol Biol 6:290–300.

Nei M, Tajima F. 1983. Maximum likelihood estimationof the number of nucleotide substitutions from restric-tion site data. Genetics 105:207–217.

Passarino G, Semino O, Modiano G, Santachiara-Benerecetti AS. 1993. COII/tRNALys intergenic 9-bpdeletion and other mtDNA markers clearly revealthat the Tharus (Southern Nepal) have oriental affini-ties. Am J Hum Genet 53:609–618.

Petrishchev VN, Kutueva AB, Rychkov YG. 1993. Dele-tion-insertion polymorphisms in ten mongoloid popu-lations of Siberia: frequency of deletion correlateswith the geographical coordinates of the area. Gene-tika 29:1196–1204 (in Russian).

Reynolds JB, Weir BS, Cockerham CC. 1983. Estima-tion of the coancestry coefficient: Basis for a short-term genetic distance. Genetics 105:767–779.

Ribiero-Dos-Santos AKC, Santos SEB, Machado AL,Guapindaia V, Zago MA. 1996. Heterogeneity of mito-chondrial DNA haplotypes in pre-Columbian nativesof the Amazon region. Am J Phys Anthropol 101:29–37.

Rogers RA, Rogers LA, Hoffmann RS, Martin LD. 1991.Native American biological diversity and the biogeo-graphic influence of Ice Age refugia. J Biogeography18:623–630.

Saitou N, Imanishi T. 1989. Relative efficiencies of theFitch-Margoliash, maximum-parsimony, maximum-likelihood, minimum-evolution, and neighbor-joiningmethods of phylogenetic tree construction in obtain-ing the correct tree. Mol Biol Evol 6:514–525.

Saitou N, Nei M. 1987. The neighbor-joining method: anew method for reconstructing phylogenetic trees.Mol Biol Evol 4:406–425.

Santos MR, Ward RH, Barrantes R. 1994. mtDNAvariation in the Chibcha Amerindian Huetar fromCosta Rica. Hum Biol 66:963–977.

Schurr TG, Ballinger SW, Gan YY, Hodge JA, Merri-wether DA, Lawrence DN, Knowler WC, Weiss KM,Wallace DC. 1990. Amerindian mitochondrial DNAshave rare Asian variants at high frequencies, suggest-ing they derived from four primary maternal lineages.Am J Hum Genet 46:613–623.

Shields GF, Schmiechen AM, Frazier BL, Redd A, Vo-evoda MI, Reed JK, Ward RH. 1993. mtDNA se-quences suggest a recent evolutionary divergence forBeringian and northern North American populations.Am J Hum Genet 53:549–562.

Simchenko YB. 1976. Culture of reindeer herders innorthern Eurasia. Moscow: Nauka (in Russian).

Skorik PY. 1965. Chukotko-Kamchatkan languages. Lan-guages of northern USSR: Mongolian, Tungusic-Manchuric and Paleoasiatic languages. Leningrad:Nauka (in Russian).

Starikovskaya YB, Sukernik RI, Schurr TG, KogelnikAM, Wallace DC. 1998. Mitochondrial DNA diversityin Chuchi and Siberian Eskimos: Implications for thegenetic history of ancient Beringa and the peopling ofthe New World. Am J Hum Genet 63:1473–1491.

Stone AC, Stoneking M. 1994. Ancient DNA from apre-Columbian Amerindian population. Am J PhysAnthropol 92:1158–1162.

Sukernik RI. 1992. Non-random distribution of GMhaplotypes in northern Siberia. In: Roberts DF, FujikiN, Torizuka K, editors. Isolation, migration and health.Cambridge: Cambridge University Press. p 107–111.

Sukernik RI, Lemza SV, Karaphet TM, Osipova LP.1981. Reindeer Chukchi and Siberian Eskimos: stud-ies on blood groups, serum proteins, and red cellenzymes with regard to genetic heterogeneity. Am JPhys Anthropol 55:121–128.

Sukernik RI, Osipova LP, Karaphet TM, KirpichnikovGA. 1986. GM haplotypes and their frequencies in tenChukchi populations. Genetic structure of ReindeerChukchi. Genetika 22:2361–2368 (in Russian).

Sukernik RI, Schurr TG, Starikovskaya EB, WallaceDC. 1996. Mitochondrial DNA variation in nativeSiberians, with special reference to the evolutionaryhistory of Americans Indians. Studies on restrictionpolymorphism. Genetika 32:432–439 (in Russian).

Swofford D. 1994. Phylogenetic Analysis Using Parsi-mony (PAUP), version 3.1.1. Champaign, IL: IllinoisNatural History Survey.

Szathmary EJE. 1981. Genetic markers in Siberia andnorthern North American populations. Yrbk PhysAnthropol 24:37–73.

38 T.G. SCHURR ET AL.

Torroni A, Schurr TG, Yang C-C, Szathmary EJE, Wil-liams RC, Schanfield MS, Troup GA, Knowler WC,Lawrence DN, Weiss KM, Wallace DC. 1992. NativeAmerican mitochondrial DNA analysis indicates thatthe Amerind and the Na-Dene populations werefounded by two independent migrations. Genetics130:153–162.

Torroni A, Schurr TG, Cabell MF, Brown MD, Neel JV,Larsen M, Smith DG, Vullo CM, Wallace DC. 1993a.Asian affinities and the continental radiation of thefour founding Native American mtDNAs. Am J HumGenet 53:563–590.

Torroni A, Sukernik RI, Schurr TG, Starikovskaya YB,Cabell MA, Crawford MH, Comuzzie AG, Wallace DC.1993b. mtDNA variation of aboriginal Siberians re-veals distinct genetic affinities with Native Ameri-cans. Am J Hum Genet 53:591–608.

Torroni A, Neel JV, Barrantes R, Schurr TG, WallaceDC. 1994a. A mitochondrial DNA ‘‘clock’’ for theAmerinds and its implications for timing their entryinto North America. Proc Natl Acad Sci U S A 91:1158–1162.

Torroni A, Chen Y-S, Semino O, Santachiara-BenerecettiAS, Scott CR, Lott MT, Winter M, Wallace DC. 1994b.mtDNA and Y-chromosome polymorphisms in fourNative American populations from southern Mexico.Am J Hum Genet 54:303–318.

Torroni A, Miller JA, Moore LG, Zamudio S, Zhuang JG,Droma T, Wallace DC. 1994c. Mitochondrial DNA

analysis in Tibet: implications for the origin of theTibetan population and its adaptation to high alti-tude. Am J Phys Anthropol 93:189–199.

Torroni A, Lott MT, Cabell MF, Chen Y-S, Lavergne L,Wallace DC. 1994d. mtDNA and the origin of Cauca-sians: Identification of ancient Caucasian-specific hap-logroups, one of which is prone to a recurrent somaticduplication in the D-loop region. Am J Hum Gen55:760–776.

Torroni A., Huoponen K, Francalacci P, Petrozzi M,Morelli L, Scozzari R, Obinu d, Savontaus M-L, Wal-lace DC. 1996. Classification of European mtDNAsfrom an analysis of three European populations.Genetics 144:1835–1850.

Vasilievskiy RS. 1971. The origins and ancient culture ofthe Koryak. Novosibirsk: Nauka (in Russian).

Vdovin IS. 1973. Essays on the ethnohistory of theKoryaks. Moscow: Nauka (in Russian).

Wallace DC, Garrison K, Knowler WC. 1985. Dramaticfounder effects in Amerindian mitochondrial DNAs.Am J Phys Anthropol 68:149–155.

Ward RH, Frazier BL, Dew-Jager K, Paabo S. 1991.Extensive mitochondrial diversity within a singleAmerindian tribe. Proc Natl Acad Sci U S A 88:8720–8724.

Ward RH, Redd A, Valencia D, Frazier B, Paabo S. 1993.Genetic and linguistic differentiation in the Americas.Proc Natl Acad Sci U S A 90:10663–10667.

39MTDNA VARIATION IN KORYAKS AND ITEL’MEN