Emerging Diversity within Chrysophytes, Choanoflagellates and Bicosoecids Based on Molecular Surveys

14
Protist, Vol. 162, 435–448, July 2011 http://www.elsevier.de/protis Published online date 15 January 2011 ORIGINAL PAPER Emerging Diversity within Chrysophytes, Choanoflagellates and Bicosoecids Based on Molecular Surveys Javier del Campo 1 , and Ramon Massana Departament de Biologia Marina i Oceanografia, Institut de Ciències del Mar, CSIC. Passeig Marítim de la Barceloneta, 37-49, 08003 Barcelona, Catalonia, Spain Submitted June 30, 2010; Accepted October 30, 2010 Monitoring Editor: Michael Melkonian In recent years, a substantial amount of data on aquatic protists has been obtained from culture- independent molecular approaches, unveiling a large diversity and the existence of new lineages. However, sequences affiliated with minor groups (in terms of clonal abundance) have often been under-analyzed, and this hides a potentially relevant source of phylogenetic information. Here we have searched public databases for 18S rDNA sequences of chrysophytes, choanoflagellates and bicosoe- cids retrieved from molecular surveys of protists. These three groups are often considered to account for most of the heterotrophic flagellates, an important functional component in microbial food webs. They represented a significant fraction of clones in freshwater studies, whereas their relative clonal abundance was low in marine studies. The novelty displayed by this dataset was notable. Most envi- ronmental sequences were distant to sequences of cultured organisms, indicating a significant bias in the representation of taxa in culture. Moreover, they were often distant to sequences from other molecular surveys, suggesting an insufficient sequencing effort to characterize the in situ diversity of these groups. Phylogenetic trees with complete sequences present the most accurate representation of the diversity of these groups, with the emergence of several new clades formed exclusively by envi- ronmental sequences. Exhaustive data mining in sequence databases allowed the identification of new diversity hidden inside chrysophytes, choanoflagellates and bicosoecids. © 2010 Elsevier GmbH. All rights reserved. Key words: 18S rDNA; bicosoecids; choanoflagellates; chrysophytes; emerging diversity; heterotrophic flag- ellates; maximum likelihood phylogeny; molecular surveys. Introduction Heterotrophic Flagellates (HF) are distributed in planktonic environments at concentrations between 10 2 and 10 5 cells ml -1 , representing 10-30% of protist cells in upper marine waters (Jürgens and Massana 2008). HF cells are often 1 Corresponding author; fax 93-2309555 e-mail [email protected] (J. del Campo). phagotrophs that graze and control the abundance of prokaryotes and picoeukaryotes (Pernthaler 2005), but also may include dispersal stages of parasites of other marine organisms (Guillou et al. 2008). Consequently, HF are important actors in microbial food webs and play key roles in global biogeochemical cycles (Chambouvet et al. 2008; Sherr and Sherr 2002;). Traditionally, the diversity of HF assemblages has been studied by microscopy and culturing, yielding the impression that most © 2010 Elsevier GmbH. All rights reserved. doi:10.1016/j.protis.2010.10.003

Transcript of Emerging Diversity within Chrysophytes, Choanoflagellates and Bicosoecids Based on Molecular Surveys

PhP

O

ECB

J

DP

SM

IiHuscfTarimtord©

Ke

I

Hib1(

1

e

©

rotist, Vol. 162, 435–448, July 2011ttp://www.elsevier.de/protisublished online date 15 January 2011

RIGINAL PAPER

merging Diversity within Chrysophytes,hoanoflagellates and Bicosoecidsased on Molecular Surveys

avier del Campo1, and Ramon Massana

epartament de Biologia Marina i Oceanografia, Institut de Ciències del Mar, CSIC.asseig Marítim de la Barceloneta, 37-49, 08003 Barcelona, Catalonia, Spain

ubmitted June 30, 2010; Accepted October 30, 2010onitoring Editor: Michael Melkonian

n recent years, a substantial amount of data on aquatic protists has been obtained from culture-ndependent molecular approaches, unveiling a large diversity and the existence of new lineages.owever, sequences affiliated with minor groups (in terms of clonal abundance) have often beennder-analyzed, and this hides a potentially relevant source of phylogenetic information. Here we haveearched public databases for 18S rDNA sequences of chrysophytes, choanoflagellates and bicosoe-ids retrieved from molecular surveys of protists. These three groups are often considered to accountor most of the heterotrophic flagellates, an important functional component in microbial food webs.hey represented a significant fraction of clones in freshwater studies, whereas their relative clonalbundance was low in marine studies. The novelty displayed by this dataset was notable. Most envi-onmental sequences were distant to sequences of cultured organisms, indicating a significant biasn the representation of taxa in culture. Moreover, they were often distant to sequences from other

olecular surveys, suggesting an insufficient sequencing effort to characterize the in situ diversity ofhese groups. Phylogenetic trees with complete sequences present the most accurate representationf the diversity of these groups, with the emergence of several new clades formed exclusively by envi-

onmental sequences. Exhaustive data mining in sequence databases allowed the identification of newiversity hidden inside chrysophytes, choanoflagellates and bicosoecids.2010 Elsevier GmbH. All rights reserved.

ey words: 18S rDNA; bicosoecids; choanoflagellates; chrysophytes; emerging diversity; heterotrophic flag-llates; maximum likelihood phylogeny; molecular surveys.

ntroduction

eterotrophic Flagellates (HF) are distributedn planktonic environments at concentrationsetween 102 and 105 cells ml-1, representing0-30% of protist cells in upper marine watersJürgens and Massana 2008). HF cells are often

Corresponding author; fax 93-2309555-mail [email protected] (J. del Campo).

phagotrophs that graze and control the abundanceof prokaryotes and picoeukaryotes (Pernthaler2005), but also may include dispersal stages ofparasites of other marine organisms (Guillou et al.2008). Consequently, HF are important actors inmicrobial food webs and play key roles in globalbiogeochemical cycles (Chambouvet et al. 2008;Sherr and Sherr 2002;). Traditionally, the diversity ofHF assemblages has been studied by microscopyand culturing, yielding the impression that most

2010 Elsevier GmbH. All rights reserved.doi:10.1016/j.protis.2010.10.003

436 J. del Campo and R. Massana

cells belong to chrysophytes, choanoflagellates orbicosoecids (Arndt et al. 2000; Fenchel 1982). How-ever, the in situ diversity and ecological relevanceof these taxonomic groups remain poorly investi-gated.

The chrysophytes is a large group of stra-menopiles with about 100 described genera(Lee et al. 2000). They include colorless cells(heterotrophs) and chloroplast-containing cells(phototrophs or mixotrophs) with one or two flagella(Preisig et al. 1991). The majority lives in fresh-water but there are also some well-known marinespecies, such as Paraphysomonas imperforata.The phylogeny of chrysophytes using 18S rDNAwas presented by Andersen et al. (1999), and cur-rently there are 30 genera represented in GenBank.The choanoflagellates are colorless ovoid cells withabout 50 genera described from marine, brack-ish and freshwater systems (Leadbeater 1991; Leeet al. 2000). They have a collar surrounding aunique flagellum, and some are covered by an intri-cate lorica. They belong to Opisthokonta and arethe closest metazoan relatives, thus attracting theinterest of evolutionary biologists (King et al. 2008).Their phylogeny using the 18S rDNA was pre-sented in Carr et al. (2008) and currently there are16 genera in GenBank’s Taxonomy. Bicosoecidsare colorless flagellates that belong to the stra-menopiles and include 11 genera (Cavalier-Smithand Chao 2006; Lee et al. 2000;), all representedin GenBank’s Taxonomy with their 18S rDNA. Cellshave typically two flagella. Both marine and fresh-water species are known, including the well-knownmarine species Cafeteria roenbergensis (Fencheland Patterson 1988).

Cultured strains have been essential for delin-eating the physiology and phylogeny of the threegroups (Andersen et al. 1999; Cavalier-Smith andChao 2006; Leipe et al. 1994), but it is not clearif these cultured strains are ecologically relevant.For instance, a very low abundance of Para-physomonas imperforata (Lim et al. 1999) andCafeteria roenbergensis (Massana et al. 2007)was recorded in samples from which these twospecies were easily enriched. In situ diversitycan be better addressed by culture-independentmolecular techniques (Caron et al. 2004). Envi-ronmental 18S rDNA libraries targeting microbialeukaryotes highlighted new lineages that appearedin most studies in high clonal abundance, suchas MAST (Marine Stramenopiles) (Massana et al.2006) and MALV (Marine Alveolates) (Guillou et al.2008), whereas chrysophytes, choanoflagellatesor bicosoecids were generally represented byfew sequences in marine (Massana and Pedrós-

Alió 2008) and freshwater (Lefranc et al. 2005;Richards et al. 2005; Slapeta et al. 2005) individ-ual studies. These later groups have been underanalyzed due to their low clonal abundance, andwe hypothesize that new diversity would emergeonce we put together sequences from independentstudies.

Here, we searched public databases (nucleotidecollection nr/nt in GenBank) for chrysophyte,choanoflagellate and bicosoecid 18S rDNAsequences obtained in molecular surveys. Weused this sequence dataset to pursue three goals:First, to determine the clonal contribution ofthese groups in marine and freshwater systems.Second, to analyze the sequence novelty withineach group, i.e. the difference between targetsequences and those deposited in GenBank (bothfrom cultured strains and from other molecularsurveys). This novelty can then be interpreted interms of sequencing effort and representation oftaxa in culture. Third, to present a robust phylogenyof each group combining all available sequencesto better describe their diversity and identify newclades formed by environmental sequences only.These phylogenetic trees can serve as a backbonewhere to map tag sequences that begin to appearby Next Generation Sequencing technologies(Amaral-Zettler et al. 2009; Stoeck et al. 2009).For each of the three taxonomic groups, majordifferences are found in clonal abundance, noveltypattern and new diversity in marine and freshwatersystems.

0%

20%

40%

60%

80%

100%

Other HF

MALV

MAST

Chrysophytes

Choanoflagellates

Bicosoecids

Marine Freshwater

% o

f clo

nes

Figure 1. Relative clonal abundance of different tax-onomic groups putatively forming the heterotrophicflagellate assemblages in marine and freshwater sys-tems (data from 82 clone libraries of 18S rDNA genes;see Supplementary Table S3).

Emerging Diversity Within Three Protist Groups 437

Table 1. Novelty degree represented by environmental sequences of chrysophytes, choanoflagellates andbicosoecids. In this integrated analysis we show the average similarity (standard error in brackets) with closestenvironmental match (CEM) and closest cultured match (CCM) for all sequences separated by environmentsand together. The second column shows to the number of sequences analyzed and the last column the statisticaltests (***: p< 0.0001, ns: not significant).

Environment n % CEM (SE) % CCM (SE) t-student

Chrysophytes Marine 144 97.6 (0.2) 94.2 (0.3) ***Freshwater 86 95.3 (0.3) 95.8 (0.3) nsAll 230 96.8 (0.2) 94.8 (0.2) ***

Choanoflagellates Marine 69 95.3 (0.3) 94.7 (0.4) nsFreshwater 20 90.8 (0.5) 91.6 (0.7) nsAll 89 94.3 (0.3) 94.0 (0.3) ns

Bicosoecids Marine 45 98.1 (0.4) 98.3 (0.5) nsFreshwater 31 90.9 (0.4) 90.6 (0.6) nsAll 76 95.1 (0.3) 95.0 (0.4) ns

Results

To obtain an exhaustive description of the phyloge-netic diversity of chrysophytes, choanoflagellatesand bicosoecids, we screened GenBank and ourunpublished libraries to retrieve all sequences fromthese groups obtained in marine and freshwatermolecular surveys. The dataset inspected included292 environmental clone libraries of 18S rDNAgenes (representing more than 13000 sequences)that have been published in 58 scientific papersand targeted a large variety of systems, depths inthe water column, and physical-chemical settings(Supplementary Table S1). Some studies focusedon the smallest eukaryotic microbes (<3-5 �m) andothers to the whole water community. Overall, weobtained 230 chrysophyte, 89 choanoflagellate and76 bicosoecid environmental sequences (listed inthe Supplementary Table S2). Sequences weregrouped into two categories (marine and freshwa-ter) before further abundance, novelty and diversityanalyses.

Relative Clonal Abundance inEnvironmental Surveys

The representation of chrysophyte, choanoflag-ellate and bicosoecid sequences in 18S rDNAlibraries was addressed considering only the stud-ies that reported the clonal abundance of distincttaxonomic groups (82 libraries published in 14papers, Supplementary Table S3). In each library,clones were assigned to putative heterotrophic flag-ellate (HF) groups, to putative phototrophic (PP)protist groups (prasinophytes, dinoflagellates, hap-tophytes and others) and to other heterotrophicprotists (OHP) (ciliates and fungi). Then, the

proportion of clones within different HF groupswas displayed (Fig. 1). Chrysophyte sequencesappeared in most environmental surveys, aver-aging 3.3% of HF clones in marine and 11.8%in freshwater studies (Fig. 1). The relative clonalabundance of choanoflagellates averaged 1.3% inmarine and 3.7% in freshwater systems. Bicosoe-cids were rarely found in marine surveys (0.6%relative clonal abundance on average) and wererather abundant in freshwater systems (21.6% onaverage, in some cases up to 50%). The bulk ofsequences from putative HF in marine systemsaffiliated with MALV and MAST. In freshwater sys-tems, other alveolates and cercozoans accountedfor a significant number of clones.

Novelty of Environmental Sequences

Figure 2 plots together two values obtained for eachenvironmental sequence after a GenBank search:the similarity against the closest environmentalmatch (CEM) and the similarity against the clos-est cultured match (CCM). Sequences appearedwidely distributed in the graph with each taxo-nomic group displaying a distinct novelty pattern.Most chrysophyte sequences from marine sam-ples accumulated in two plot regions: those withhigh CEM-CCM similarity values (above 98%), thussimilar to sequences from cultures and molecularsurveys, and those with high CEM (above 98%)and low CCM values (below 94%), thus similar onlyto sequences from molecular surveys (Fig. 2A).Choanoflagellates sequences showed a more uni-form dispersion in the graph, with a tendencyof freshwater sequences to have lower valuesin both axis (Fig. 2B). Interestingly, we detectedsome sequences that were very close to culturedspecies but had not been retrieved in other molec-

438 J. del Campo and R. Massana

86%

88%

90%

92%

94%

96%

98%

100%

86% 88% 90% 92% 94% 96% 98% 100%

86%

88%

90%

92%

94%

96%

98%

100%

86% 88% 90% 92% 94% 96% 98% 100%

86%

88%

90%

92%

94%

96%

98%

100%

86% 88% 90% 92% 94% 96% 98% 100%

Closest Environmental Match Similarity

Clo

sest

Cul

ture

d M

atch

Sim

ilarit

y

Marine Freshwater

chrysophytes

choanoflagellates

bicosoecids

A

B

C

ular surveys (this did not occur in chrysophytes).The novelty pattern for bicosoecids also showeda uniform dispersion of dots in the graph, as theprevious example, but here the difference betweensystems was very marked, with sequences frommarine environments being above 98% in both axis(Fig. 2C).

Averaging the similarity values against CEMand CCM for all sequences yielded the noveltydegree of a given dataset (Table 1). The differencebetween CCM similarity and 100% representedthe bias in representation of cultures, whereas thedifference between CEM similarity and 100% rep-resented the bias in environmental sequencing.Considering all sequences together yielded aver-age similarities of 94-95% in all cases (exceptchrysophytes against CEM). This general overviewobscured clear differences between systems, withchoanoflagellates and bicosoecids being signifi-cantly more novel in freshwater (91% similarity)than in marine systems (95% and 98%, respec-tively). The difference between CEM and CCMsimilarity in each row represented the increase ofknowledge gained by environmental sequencing.Surprisingly, in most cases both values were verysimilar. The only exception was the marine chryso-phytes, that showed significant differences betweenboth values (t-student test, p<0.0001). Altogether,the novelty degree was larger in freshwater than inmarine systems.

Phylogenetic Trees and New Clades

Using complete 18S rDNA sequences, we con-structed Maximum Likelihood phylogenetic treesfor chrysophytes (Fig. 3), choanoflagellates (Fig. 4)and bicosoecids (Fig. 5). Environmental sequencesappeared in the trees in different color dependingon their origin (blue: marine; green: freshwater),whereas reference sequences from cultured organ-isms appeared in black. Trees were divided intoseparate clades, some of them already definedin published trees and others being new, derivedfrom the present analysis. Clades always contained

Figure 2. Novelty pattern derived from chrysophyte(A), choanoflagellate (B) and bicosoecid (C) environ-mental sequences. Dots represent the % similaritywith the closest environmental match (CEM) and theclosest cultured match (CCM) for each sequencewithin the three taxa (229, 88, and 76 sequences,respectively) and are colored depending the envi-ronment where they originate (dark: marine; light:freshwater).

Emerging Diversity Within Three Protist Groups 439

0.1

AY919725 LG12-10

FJ537340 Biosope T60.030

AB022864 Paraphysomonas foraminifera

AY919698 LG06-01

AY642726 PG5.3

EF172948 SSRPD64

AY919772 LG25-07

AY919724 LG12-01

EF172998 SSRPE02

AY919818 LG48-10

EF172972 N10E01

AY919806 LG35-09

M87332 Chromulina chionophila

EF165146 Lagynion ampullaceum

EF165106 Chromophyton rosanoffii

AY919812 LG44-07

AY180010EF165134 Chrysophyceae sp.

OA3.6

AF123299 Chrysosphaera parvula

FJ537347 Biosope T65.123

EU025002 Ochromonas sp.

AY642717 P1.35

AY180017 CCW27

AY919817 LG48-06

AY919791 LG32-01

EF185316 Chrysosphaerella sp.

AY919829 LG92-06

AY919743 LG18-09

AY665995 Paraphysomonas sp.

DQ310258 FV23 1C3

AY651090 Spumella JBC29

AY919798 LG33-07

AY651096 Spumella JBM06

AF109324 Paraphysomonas imperforata

AY642709 P34.48AF123293 Ochromonas tuberculata

AY642705 P34.45

AY919815 LG46-06

AF109323 Paraphysomonas imperforata

AY426840 BL000921.17

Z38025 Paraphysomonas foraminifera

AY821968 CV1.B1.34

AY129063 UEPAC48p3

EU247834 Chrysophyceae sp.

Z28335 Paraphrysomonas vestita

AY919757 LG21-07

FJ537338 Biosope T60.011

EF165133 Ochromonas sp.

AY651098 Spumella JBM08

FJ537315 Biosope T39.013

AY919744 LG18-10

AB168053 Monas sp.

AY919811 LG43-07

DQ103874 M3 18A12

AF123285 Chromulina nebulosa

DQ103789 M1 18H01

AY919759 LG22-01

DQ103782 M4 18B07

AY520451 Oikomonas sp.

FJ537351 Biosope T65.151

EU561701 IND31.28

AY821972 CV1.B1.76

AF123300 Chrysosaccus sp.

FJ537348 Biosope T65.136

FJ537322 Biosope T39.120

FJ537319 Biosope T39.098

AY179989 CCI40

AY919766 LG23-10

AY919804 LG34-12AY919813 LG44-09

AY129065 UEPAC37p4

AY651092 Spumella JBM18

EF165121 Chrysosaccus sp.

FJ537339 Biosope T60.024

DQ103808 M4 18F06

FJ537317 Biosope T39.040

AY919816 LG47-07

U42454 Oikomonas mutabilis

AY919699 LG06-07

DQ310307 FV36 CilF11

AY651091 Spumella JBNZ40

AY919684 LG02-12

EF165120 Chrysosaccus sp.

AY919765 LG23-07

EU561718 IND31.45

AY919702 LG07-07

FJ537343 Biosope T65.104

EF165102 Chrysamoeba tenera

AY919691 LG04-04

AB275089 CYSGM-6

AY651093 Spumella JBC27

AF123296 Phaeoplaca thallosa

AF109326 Paraphysomonas butcheri

DQ647511 CD8.06

OA3.9

AY919778 LG26-11

DQ647519 CD8.18

DQ103873 M3 18G02

AF109322 Paraphysomonas bandaiensis

EF165107 Chromophyton rosanoffii

AF044845 Chrysosaccus sp.

AY919802 LG34-04

AF123292 Cyclonexis annularis

AY642746 A1

AF123288 Lagynion scherfelii

AY651071 Spumella JBAF35

AY919747 LG19-10

AF123286 Chrysamoeba pyrenoidifera

FJ537356 Biosope T84.071

AF174376 Paraphysomonas foraminifera

AY919742 LG18-01

AJ236863 Paraphysomonas sp.

DQ310247 FV18 3B4

DQ647516 CD8.15

AF109325 Paraphysomonas vestita

AY919800 LG34-01

EF172974 Q2B03N10

AY919688 LG03-12

EF165101 Chromulina nebulosa

FJ537350 Biosope T65.146

AY919789 LG31-01

AY642697 P34.28

DQ310257 FV23 1B7

AF123287 Chrysamoeba mikrokonta

DQ310204 FV18 3A1

98

100

98

50

89

65

60

Clade E (-/9)

Clade B2 (-/9)

Clade H (28/9)

Clade D (3/2)

Clade I (40/-)

Clade F1 (22/8)

Clade F2 (4/3)

92

51

100

100

52

97

94

100

Clade G (28/10)

.

.

.

.

.

.

.

72

100

62

100

..

Figure 3.

440 J. del Campo and R. Massana

0.1

EF165119 Tessellaria volvocina

AY919807 LG35-11

AY651084 Spumella JBM19

DQ487199 Synura sp.

DQ388551 Spumella 194f

EU024983 Uroglena sp.

AY919796 LG33-02

AY642741 A43

AF123290 Uroglena americana

AY082987 RT5iin35

EF165142 Ochromonas sp.

M55285 Mallomonas papillosa

AY699607 Poterioochromonas sp.

AY082999 RT5in36

EU025019 Dinobryon divergens

U73231 Mallomonas rasilis

AF123301 Epipyxis aurea

EF165103 Chromulina sp.

U73223 Synura petersenii

DQ388543 Spumella 8b3

AB052273 Nannochloropsis ocean

EF165116 Synura petersenii

AY919762 LG22-12

AB275091 CYSGM-8

DQ388560 Spumella 1020

M87333 Mallomonas striata

AF123284 Chrysochaete britannica

AY919717 LG10-03

U73220 Synura mammillosa

EF165137 Ochromonas sp.

EF165145 Chrysocapsa paludosa

EF165139 Ochromonas sp.

EF165108 Ochromonas danica

AY520450 Oikomonas sp.

U73227 Mallomonas matvienkoae

DQ388554 Spumella 45b3hm

EF165128 Synura curtispina

AY082982 RT5in4

U73226 Mallomonas splendens

EF165105 Chrysocapsa vernalis

AY520447 Ochromonas sp.

EU024970 Mallomonas tonsurataEF165118 Mallomonas insignis

EF165132 Uroglena sp.

U42382 Ochromonas sp.

AF123291 Dinobryon sociale

U73221 Synur sphagnicola

DQ310261 FV23 1B1

AB275090 CYSGM-7

AY919828 LG81-06

AB023070 Poterioochromonas malhamensis

EF165127 Mallomonas annulata

EF165124 Ochromonas aestuar

EF027354 Spumella sp. GOT220

AF123298 Epipyxis pulchra

AB425951 Spumella Mbc3C

AJ236858 Spumella 37G

AF123282 Chromophyton rosanoffii

U73224 Synura glabra

AJ236860 Spumella obliqua

EF165110 Ochromonas sp.

EF165123 Ochromonas sphaerocystis

EF165117 Synura petersenii

U73229 Mallomonas akrokomos

AF123289 Dinobryon sertularia

AY919777 LG26-10

AY919756 LG21-05

EU025006 Synura sphagnicola

AY651078 Spumella JBC2

DQ388540 Spumella JBC21

AY651086 Spumella JBL14

AY919752 LG20-09

EF633325 Chrysophyta JZH200700

EF165143 Ochromonas perlata

U73219 Tessellaria volvocina

EF165131 Uroglena americana

AY651097 Spumella JBC07

EU024980 Dinobryon crenulatum

EF165135 Ochromonas sp.

AF123283 Chrysocapsa vernalis

AJ236862 Spumella SpiG

EU024973 Dinobryon bavaricum

DQ388559 Spumella 1013

AY651089 Spumella JBM28

EF165129 Synura petersenii

AF123297 Chrysolepidomonas dendrolepidota

DQ388568 Spumella 1305

EF165111 Ochromonas vasocystis

EU024976 Dinobryon divergens

EF023675 Amb.18S.936

AY651077 Spumella JBAF33

EF165112 Ochromonas gloeopara

AY651088 Spumella JBNZ39

EF165115 Ochromonas sp.

AY651083 Spumella JBM/S11

DQ388542 Spumella JBNA46

M87336 Synura spinosa

EU076736 Dinobryon divergens

EF165140 Dinobryon cylindricum

EU247838 Ochromonadaceae sp.

EF023552 Amb.18S.772

EF165136 Ochromonas distigma

EU024975 Dinobryon sociale

EF024085 Amb.18S.6261

AJ236857 Spumella 15G

AJ236861 Spumella danic6a

U73225 Mallomonas adamas

DQ310291 FV36 CilC7

AF123295 Poterioochromonas stipitata

U73222 Synura uvella

EF165130 Chrysocapsa sp.

EU076735 Dinobryon bavaricum

AY651080 Spumella JBC13

M87331 Hibberdia magna

AY651079 Spumella JBAS36

EF165114 Poterioochromonas malhamensis

AY919719 LG10-11

DQ388562 Spumella 1027

EU076737 Dinobryon divergens

EF023425 Amb.18S.766

U42381 Ochromonas sp.

U73230 Mallomonas annulata

AY642745 A34

EF165141 Dinobryon sociale

U73228 Mallomonas caudata

AF123302 Chrysoxys sp.

EF165138 Ochromonas marina

DQ310336 FV233A12

AY919824 LG73-06

DQ388561 Spumella 1026

U73232 Mallomonas striata

DQ388557 Spumella 391f

EF165126 Ochromonas sp.

AY651081 Spumella JBC/S23

M32704 Ochromonas danica

AJ236859 Spumella elongata

AF123294 Ochromonas sphaerocystis

DQ388565 Spumella 1036

U71196 Chrysonephele palustris

EU024993 Dinobryon pediforme

67

74

100

87

Clade C (17/31)

Clade A

Clade J (1/1)

Clade B1 (-/6)

.

.

.

AY651085 Spumella JBM/512

AY082970 RT5in9

82

96

97

.

Figure 3. ( Continued ).

Emerging Diversity Within Three Protist Groups 441

sequences from different studies and were gener-ally well supported by high Maximum Likelihoodbootstrap values. In addition, Neighbor Joiningphylogenetic trees were done to assign partialsequences to the clades delineated by completesequences (trees not shown). The total numberof environmental sequences (complete and par-tial) within each clade was shown in brackets afterthe clade name (in blue for marine and greenfor freshwater sequences). Most clades containedenvironmental sequences.

The chrysophyte tree obtained here showedgood agreement with the topology described inAndersen et al. (1999), displaying the same cladesA to F defined there (although clade F was subdi-vided into two lineages in our tree) plus 4 additionalnew clades (Fig. 3). In general these clades pre-sented ML bootstrap values above 60%. Exceptclade A (Synurophyceae), the other eleven cladesincorporated environmental sequences. CladesB1, B2 and E contained only freshwater represen-tatives, whereas Clades C, D, F1 and F2 containedsequences from both freshwater and marine sys-tems. New chrysophyte clades described for LakeGeorge (Richards et al. 2005) belonged to cladeC (LG-G and LG-H) and clade F1 (LG-I). Manyof the environmental sequences affiliated with thefour new chrysophyte clades. Clade G containedthe Marine A group from Shi et al. (2009), clonesfrom different marine systems and also freshwatersequences from Lake George. Clade I containedonly marine sequences, including the ones belong-ing to Shi’s Marine B group. Clade H contained amonophyletic subclade of sequences from marinesamples, corresponding to Shi’s Marine C group,together with sequences from freshwater origin.Finally, clade J was formed by only few sequences.Since clades G, H and I included sequencesfrom both pigmented cells (Shi et al. 2009) andputative heterotrophic cells growing in unamendeddark incubations (Massana et al. 2006), theypreferentially included heterotrophic or mixotrophiccells.

The emerging diversity observed in thechoanoflagellate tree was also notable, withtwo new clades (E and F) unveiled by environmen-tal sequences (Fig. 4). All nine defined clades werewell supported by high ML bootstrap values (above

85%) and included environmental sequences.Clade C (corresponding to clade 2 of Carr et al.2008), contained sequences from freshwaterorigin only, whereas the rest of the clades includedonly marine representatives. Carr’s clade 1 wasseparated into clades A and B, which are distantlyrelated phylogenetically, and the remaining cladeswould form Carr’s clade 3.

The bicosoecid tree showed a clear separa-tion between a large freshwater clade and severalmarine clades, all supported by high ML boot-strap values (Fig. 5). Most sequences retrievedfrom marine systems affiliated with the generaCaecitellus and Cafeteria. The Bicosoeca clusterincluded sequences previously named as MAST-13 (Zuendorf et al. 2006) that clearly belonged tobicosoecids in our stramenopile tree (not shown)and in recent studies (Park and Simpson 2010).On the other hand, most freshwater sequencesappeared in two clades that were already describedfrom Lake George, one of them (LG HeterokontaI) contained exclusively environmental sequences.Several cultured strains formed long brancheswithout a clear position and no environmental rep-resentation.

The phylogenetic and novelty analyses could becombined to display the novelty of each clade as itsposition in the CEM/CCM plot based on the aver-aged values for all environmental sequences, andthe relevance of the clade by sizing the dot pro-portionally to the number of sequences (Fig. 6).It is interesting to note the distinct placement ofeach clade within the plotted area. For instance thefour new chrysophyte clades (G to J) and the twonew choanoflagellate clades (D and E) all appearedbelow the diagonal revealing higher similarity withCEM than with CCM, confirming the environmen-tal origin of its sequences. Another interesting casewas the bicosoecid clades, all distributing along thediagonal, with extreme novelty displayed by the LGHeterokonta I clade.

Discussion

This study is an effort to analyze the data existingin environmental molecular surveys for three pro-tist groups, chrysophytes, choanoflagellates andbicosoecids, which are often observed in aquatic

Figure 3. Maximum Likelihood phylogenetic tree of chrysophytes constructed with 270 complete 18S rDNAsequences (1648 informative positions). Sequences from cultured taxa appear in black and environmentalsequences appear in blue (marine) or green (freshwater). ML bootstrap values are shown for the named clades.The number of complete and partial environmental sequences assigned to each clade appear after the cladename. The scale bar indicates 0.1 substitutions per position.

442 J. del Campo and R. Massana

0,1

EF024012 Amb.18S.1490

EU446411 cLA14H07

EU011923 Diplotheca costata

EU011925 Salpingoeca amphoridium

EU011926 Stephanoeca diplocostata

EF023936 Amb.18S.1397

AF271999 Monosiga ovata

Y16260 Sphaeroforma arctica

EF023385 Amb.18S.720

DQ310312 FV36.CilG10

DQ310214 FV23.1A4

AB275066 DSGM-66

EU446410 cLA14G03

EU446388 cLA12G11

AF084230 Monosiga ovata

AY642707 P1.39

FJ153672 GoC3.C08

AY642728 PG5.16

EU011927 Stephanoeca diplocostata

EU446321 UI12G07

AF084235 Stephanoeca diplocostata

EU154974 DB25.BASS

AY426868 BL001221.16

DQ310302 FV36.CilA12

EU446354 UI13H07

EU446385 cLA12E05

AY821949 CV1.B2.17

AY149898 Choanoeca perplexa

L10824 Diaphanoeca grandis

AJ402325 OLI11041

DQ310285 FV36.CilB9

DQ310311 FV36 CilF8

EU446341 UI13C07

DQ310290 FV36.CilA8

DQ310248 FV36.2A12

AY426848 BL000921.30

EU011922 Acanthoeca spectabilis

DQ059032 Salpingoeca amphoridium

DQ310239 FV36.2B09

EU011924 Proterospongia sp.

DQ310306 FV36.CilF10

DQ310339 FV36.CilD8D9D12

DQ120005 NOR46.34

EF024015 Amb.18S.1493

EF023856 Amb.18S.1307

DQ310309 FV36.CilH9

EU446378 cLA12B02

AY426842 BL000921.20

EU446337 UI13A05

EU011928 Savillea micropora

AY426933 BL010625.36

DQ310287 FV36.CilC11

AY149899 Stephanoeca diplocostata

DQ995807 Lagenoeca antarctica

DQ310313 FV36.CilE11

L10823 Acanthocoepsis unguigulata

EU446377 cLA12A08

EU011929 Salpingoeca napiformis

EU011931 Salpingoeca urceolata

AF084618 Monosiga brevicollis

DQ103820 M1.18E10EU371175 NPK2.136

AJ402331 OLI11013

EF023626 Amb.18S.870

AY348876 Chondrosia reniformis

AF084231 Desmarella moniliformis

DQ103821 M1.18A02

AY426845 BL000921.24

DQ310289 FV36.CilD7

AY149897 Codonosiga gracilis

AF10094 Salpingoeca infusionum

EU011930 Salpingoeca pyxidium

EU446305 UI11E03

DQ310315 FV36.CilH12

DQ310286 FV36.CilC10

AY149896 Proterospongia choanojuncta

AF272000 Calliacantha sp.

AY821948 CV1.B1.36

DQ310249 FV36.2D08

Clade G ( 2 / - )

Clade E ( 6 / - )

Clade H ( 30 / - )

Clade D ( 14 / - )

Clade F ( 5 / - )

Clade C ( - / 20 )

Clade A ( 3 / - )

Clade B ( 6 / - )

Clade I ( 3 / - )

85

98

100

100

100

97

97

98

99

..

.

.

.

.

.

.

.

.

100

54

99

95

96

96

Figure 4. Maximum Likelihood phylogenetic tree of choanoflagellates constructed with 79 complete 18S rDNAsequences (1428 informative positions). Legend as in Figure 3.

samples and thought to account for a significantfraction of heterotrophic flagellates (Arndt et al.2000; Patterson and Lee 2000). There is littledoubt that sequencing of environmental clonesoffers an enhanced view of in situ diversity forvery small protists (Caron et al. 2004; Jürgensand Massana 2008). Environmental sequenceshighlight the dominant members of natural assem-blages and may reveal new and unexpectedlineages. We do not assume that the data ana-lyzed here do not face methodological limitations.

PCR-based clone libraries suffer a variety of draw-backs that have been discussed in detail (vonWintzingerode et al. 1997). Also, different micro-bial size fractions were analyzed in each study(see Supplementary Table S1), potentially bias-ing against protists from certain size classes. Inaddition, intrinsic differences may occur betweenmarine and freshwater environments, with freshwa-ter systems being generally less homogeneous andundersampled as compared with marine systems.Nevertheless, our analysis clearly identified new

Emerging Diversity Within Three Protist Groups 443

0.1

AF174366 Cafeteria sp.

AF072883 Siluania monomastiga

AF174364 Cafeteria roenbergensis

AF243501 Adriamonas peritocrescens

AY520448 Anoeca atlantica

AF174367 Caecitellus parvulus

AY827849 Cafeteria roenbergensis

EF620528 IND58.32

AF315604 Boroka karpovii

AY642126 Caecitellus paraparvulus

AY827848 Caecitellus paraparvulus

AY827850 Cafeterias roenbergensis

AY520457 Caecitellus pseudoparvulus

AF185052 Symbiomonas_scintilla

EF050072 He001005.33

AY827851 Cafeteria roenbergensis

AY919782 LG28.12

AY919774 LG25.12

AY520453 Nerada mexicana

AY520446 Caecitellus paraparvulus

AY919718 LG10.05

AY919737 LG15.12

AY919748 LG19.12

FJ537321 Biosope.T39.110

AY821966 CH1.5A.8

AF174368 Caecitellus parvulus

DQ103795 M2.18B03

AY919697 LG05.12

AY919758 LG21.12

EU162647 PSH9SP2005

EF620526 IND33.38

AY919753 LG20.12

DQ102392 Cafeteria mylnikovii

AY919808 LG36.05

EU162645 PSE8SP2005

AY520445 Bicosoeca vacillans

EF023971 Amb.18S.1440

EF620527 IND58.06

DQ269470 Halocafeteria seosinensis

AY827847 Caecitellus paraparvulus

EU162646 PSA11SP2005

AY919714 LG09.12

AY821965 CH1.2B.3

AF174365 Cafeteria sp.

EF620524 OC4.14

DQ310274 FV18.2D1

AY520455 Caecitellus pseudoparvulus

AB032606 Wobblia lunata

EF620521 OC4.1

AY919822 LG60.06

AY919726 LG12.12

AY520456 Caecitellus pseudoparvulus

AY520452 Paramonas globosa

AY821964 CH1.2A.3

L27633 Cafeteria sp.

AY520444 Bicosoeca petiolataEF023669 Amb.18S.929

AF185053 Symbiomonas_scintilla

AY919785 LG30.01

EF620523 OC4.7

AY919683 LG02.05

DQ103774 M1.18B12

EU446304 UI11D07

EF620522 OC4.2

AY919797 LG33.04

DQ269469 Halocafeteria sp.

DQ103786 M1.18G05

EF620525 OC4.19

AY520449 Anoeca atlantica

Caecitellus ( 25 / - )

Halocafeteria

Boroka ( 2 / - )

Bicosoeca ( 5 / 5 )

Cafeteria ( 10 / - )

LG Heterokonta 2 ( - / 11 )

LG Heterokonta 1 ( - / 15 )

74

73

92

100

100

100

100

.

.

.

.

.

.

.

46.

Figure 5. Maximum Likelihood phylogenetic tree of bicosoecids constructed with 66 complete 18S rDNAsequences (1485 informative positions). Legend as in Figure 3.

diversity and reduced the knowledge gaps withinthese groups. We provide a snapshot of the nov-elty of the groups that will change in the futuredepending on the effort of their study.

We first estimated the relative clonal abundanceof chrysophytes, choanoflagellates and bicosoe-cids with respect to other groups of putativeheterotrophic flagellates. This exercise should notbe translated into absolute abundances, but insteadused for a relative comparison among groups.In marine systems, only 5% of clones belongedto chrysophytes, choanoflagellates and bicosoe-cids, a low number given that these groups wereproposed to account for most of the marine het-erotrophic flagellates (Arndt et al. 2000; Brandt

and Sleigh 2000; Patterson and Lee 2000), andin contrast with the large clonal abundance ofthe marine uncultured MAST or MALV (Massanaand Pedrós-Alió 2008). This contribution couldstill be lower, since a fraction of environmen-tal chrysophyte sequences could derive fromchlorophyll-containing cells (Fuller et al. 2006).Also, half of the studies analyze small protists(Supplementary Table S1) and in these samplesthe contribution of choanoflagellates could havebeen underestimated, since these cells are usu-ally larger than 3-5 �m and some are covered bya mineral lorica. However, choanoflagellates arethought to be less abundant than stramenopile flag-ellates (Arndt et al. 2000; Brandt and Sleigh 2000),

444 J. del Campo and R. Massana

88%

90%

92%

94%

96%

98%

100%

88% 90% 92% 94% 96% 98% 100%

bicosoecida

CaecitellusCafeteria

Boroka

LG Heterokonta 1

LG Heterokonta 2

Bicosoeca

Clo

sest

Cul

ture

d M

atch

Sim

ilarit

y

Closest Environmental Match Similarity

88%

90%

92%

94%

96%

98%

100%

88% 90% 92% 94% 96% 98% 100%

chrysophytes

E

B1

F1

GB2

C F2

I

H

D

J

A

88%

90%

92%

94%

96%

98%

100%

88% 90% 92% 94% 96% 98% 100%

choanoflagellates

C

B

D

A

E

F

IG

H

B

C

Figure 6. Novelty pattern derived from eachdescribed clade within chrysophytes (A), choanoflag-ellates (B) and bicosoecids (C). Dots representingthe novelty of the clades (average similarity against

although they may reach up to 20% of the het-erotrophic flagellates in polar systems (Leakey et al.2002). A very different situation occurs in freshwa-ter systems, where bicosoecids represent 22% andchrysophytes 12% of clonal abundance, matchingthe importance given to these organisms in fresh-water systems (Arndt et al. 2000; Carrias et al.1998).

The estimates of relative clonal abundance sug-gested that chrysophytes, choanoflagellates andbicosoecids might be less important than expectedin marine systems. The presence of these threegroups was independently assessed by the anal-ysis of GOS metagenomes (Rusch et al. 2007),which were built by sequencing the environmen-tal DNA directly, and so were free of PCR biases.From the 115 sequences of eukaryotic 18S rDNAretrieved from all samples (Not et al. 2009),only one affiliated with choanoflagellates and twoto chrysophytes. As comparison, other groupssuch as MAST or MALV were much more rep-resented in the GOS metagenomes (15 and 36sequences, respectively). This PCR-independentapproach does not give a definitive answer, either,since it could be strongly affected by the variablecopy number of the rDNA operon in different taxa(Zhu et al. 2005). To validate the cell abundanceof chrysophytes, choanoflagellates and bicosoe-cids in the marine plankton, quantitative methodssuch as FISH (or quantitative-PCR with the propercontrols) are needed.

We propose a new approach (Massanaet al. 2010) to address the novelty of a givendataset based on the similarity against GenBanksequences. Overall, the novelty displayed by theenvironmental sequences of each group was ratherlarge, and this was interpreted in terms of effortsin culturing and environmental sequencing. Inour context the correspondence of environmentalsequences with sequences derived from culturesmeans that ecologically relevant protists have beencultured. It combines the culturing effort with theability of a given taxa to grow in the laboratory. Inour dataset, such correspondence was apparentonly in a few cases, like in marine bicosoecids.A low correspondence between environmentalsequences and sequences obtained from cultureswas the more common situation, being extremefor freshwater bicosoecids and choanoflagellates

CEM and CCM for all environmental sequences withinthe clade) have a size proportional to the number ofsequences. Different grey tones are used for conve-nience.

Emerging Diversity Within Three Protist Groups 445

whose environmental sequences only shared91% similarity with CCM. Enhanced efforts andnovel culturing strategies will be needed to bringmore ecologically relevant (i.e. abundant) protistsinto culture, in a similar manner that has beenso successful with dominant marine prokaryotes(Könneke et al. 2005; Rappé et al. 2002).

On the other hand, sequencing environmentalDNA is relatively straightforward and there are lit-tle chances to miss quantitatively important majorphylogenetic groups. An insufficient sequencingeffort was generally found in our study, with lowaveraged similarity values of our target sequencesagainst those from other molecular surveys. In addi-tion, similarities against CCM and CEM for differentsequence sets were rather similar (Table 1), withthe exception of marine chrysophytes for whichsequencing was decreasing the novelty. This sug-gests that there is plenty of room to discoveradditional diversity for these groups using envi-ronmental molecular surveys, which should alsotake advantage of new high-throughput sequenc-ing technologies (Amaral-Zettler et al. 2009; Stoecket al. 2009) or use group-specific primers (Bass andCavalier-Smith 2004). Alternatively, another expla-nation for low similarity with CEM would be a largeendemism of the studied sequences, which mightappear only in the studied site. At any rate, our nov-elty analysis showed that the three protists groupsstudied here (except marine bicosoecids) need fur-ther sequencing effort to reach a full understandingof the in situ diversity.

Our use of environmental sequences from publicdatabases improved the chrysophyte, choanoflag-ellate and bicosoecid phylogeny and identi-fied emergent new diversity. Thus, four novelclades appeared within chrysophytes, two withinchoanoflagellates and two within bicosoecids. Thetree topologies and clade divisions promise to bevery useful as a backbone reference for future stud-ies. An interesting observation from the bicosoecidand choanoflagellate trees was the appearanceof a single monophyletic freshwater clade nestedwithin several marine clades. This could be a signof a single and perhaps ancient transition eventfrom marine to freshwater systems in both protistgroups (Logares et al. 2007). In marine systems,chrysophytes harbored an important new diversity,suggesting that uncultured chrysophytes, unlike theeasily cultured Spumella or Paraphysomonas, maybe ecologically more relevant (Lim et al. 1999).The same applied for marine choanoflagellates,which showed a great discrepancy between theirrepresentation in culture and their abundance inclone libraries. In contrast, marine bicosoecids

were highly similar to cultured organisms. Finally,the three groups contained a significant hiddendiversity in freshwater systems, specially bicosoe-cids and choanoflagellates.

In summary, our culture-independent analy-sis highlighted a large diversity of chrysophytes,choanoflagellates and bicosoecids in aquatic envi-ronments that was accompanied with a high noveltydegree. This indicated a bias in the representationof cultures and an incomplete sequencing effort forthese groups. This analysis should be extended toother protist groups in order to fully benefit fromenvironmental molecular surveys (e.g. Marin andMelkonian 2010). Increasing the effort of environ-mental sequencing of aquatic protists is alreadyon the research agenda of several laboratoriesworldwide (Amaral-Zettler et al. 2009; Stoeck et al.2009). On the other hand, it is equally important toincrease the culturing efforts, to match the diversityof protist cultures with the in situ diversity of eco-logically relevant protists. Besides culturing efforts,other techniques such as FISH should be appliedto assess the abundance and ecological role ofnew taxa (Chambouvet et al. 2008; Massana et al.2006). The extent of environmental diversity andnovelty is striking even for protist groups that wereconsidered well characterized.

Methods

Sequence dataset retrieval: Environmental 18S rDNAsequences of chrysophytes, choanoflagellates and bicosoe-cids were obtained from GenBank in a two-step screening.First, sequences found by the NCBI Taxonomy Applicationwere retrieved and checked by BLAST (Altschul et al. 1997)to confirm their placement. Second, we used these and otherpublished sequences from cultures or environmental surveysthat belong to the target groups (but are not labeled assuch in GenBank) to retrieve additional sequences by BLAST.Putative chimeric sequences were checked by KeyDNATools(www.keydnatools.com) as described before (Guillou et al.2008). Neighbor Joining phylogenetic trees (see later) wereconstructed with a wide taxon coverage to find out whetheror not ambiguous divergent sequences belong to a givengroup. Related sequences from cultured organisms were alsoretrieved from GenBank and pruned to keep only a few repre-sentatives for phylogeny.

Two 18S rDNA clone libraries were constructed from darkunamended incubations done in March 2006 and October 2007with Blanes Bay (Mediterranean Sea) seawater prefiltered bya 3 �m filter. These incubations are known to promote thegrowth of uncultured HF (Massana et al. 2006). Picoplank-tonic biomass was collected on filters, and community DNAwas extracted. Complete 18S rDNA genes were PCR-amplifiedwith eukaryote-specific primers, and the PCR products werecloned. Details of the filtering setup, DNA extraction proto-col, and PCR and cloning conditions are described elsewhere(Massana et al. 2004, 2006). Twenty-five and 44 clones werepartially sequenced with the primer 528f by the MACROGEN

446 J. del Campo and R. Massana

Genomics Sequencing Services. Sequences were identifiedand inspected for chimeras by BLAST and KeyDNATools, yield-ing 18 target sequences (accession numbers HQ437173 –HQ437184 and HQ437193 – HQ437196). Ten clones fromthese libraries and from published libraries (BL in Massana et al.2004; IND in Not et al. 2008) were completely sequenced withfive internal primers by the same service. The final sequencedataset consisted in 395 complete or partial environmentalsequences from the three target groups.

Novelty analysis: To infer the novelty of an environmentalsequence dataset, we noted for each sequence its similarity ina BLAST search with the closest environmental match (CEM)and the closest cultured match (CCM). The CEM is the firstsequence in the output that derives from a molecular survey(excluding those from the same library), and the CCM is thefirst sequence in the output that belongs to a known organism(often cultured). Both similarity values for all sequences areplotted in a 2D dispersion graph, giving the “novelty pattern” ofthe dataset. Dots with high CCM similarity (i.e. above 98%) rep-resent environmental sequences close to cultured organisms,whereas dots with low CCM similarity (i.e. below 94%) highlightenvironmental sequences with no cultured counterpart. Con-versely, sequences with high CEM similarity indicate an optimalsequencing effort (they have been found in other environmen-tal surveys), and those with low CEM similarity highlight aninsufficient sequencing effort. Finally, the “novelty degree” ofthe dataset is obtained by averaging the similarity values for allsequences.

Phylogenetic analyses: 18S rDNA sequences were alignedusing MAFFT (Katoh et al. 2002) using a close relative asoutgroup. Alignments were checked with Seaview 3.2 (Galtieret al. 1996) and highly variable regions of the alignmentwere removed using Gblocks (Castresana 2000). NeighborJoining trees were first done with PAUP 4.0b10 (Swofford2002) with all partial sequences in order to define all possi-ble diversity, and to assure that each clade has at least oneclone with the complete sequence. Then, Maximum likelihood(ML) phylogenetic trees with complete sequences were con-structed with the fast ML method RAxML (Stamatakis 2006)using the evolutionary model GTRMIXI. Phylogenetic analyseswere done in the freely available University of Oslo Bioportal(www.bioportal.uio.no). Repeated runs on distinct starting treeswere carried out to select the tree with the best topology (theone having the best Likelihood of 1000 alternative trees). Boot-strap ML analysis was done with 1000 pseudo-replicates andthe consensus tree was computed with MrBayes (Huelsenbeckand Ronquist 2001). Trees were edited with FigTree v1.3.1(http://tree.bio.ed.ac.uk/software/figtree/).

Acknowledgements

This study was supported by projects GEMMA(CTM2007-63753-C02-01/MAR, MEC) andMICROVIS (CTM2007-62140/MAR, MEC) and theEuropean Funding Agencies from the ERA-netprogram BiodivERsA under the BioMarKs project.Javier del Campo was funded by I3P program(I3PPRE-06-00676, CSIC). We thank RaquelRodríguez-Martínez for her help in the unamendedincubations, Vanessa Balagué for her laboratoryassistance and Joseph Jr. Campo for his help inthe English.

Appendix A. Supplementary data

Supplementary data associated with this arti-cle can be found, in the online version, atdoi:10.1016/j.protis.2010.10.003.

References

Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z,Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST:a new generation of protein database search programs. NucleicAcids Res 25:3389–3402

Amaral-Zettler LA, McCliment EA, Ducklow HW, Huse SM(2009) A method for studying protistan diversity using massivelyparrallel sequencing of V9 hypervariable regions of small-subunit ribosomal RNA genes. PLoS ONE 4:e6372

Andersen RA, van de Peer Y, Potter D, Sexton JP, KawachiM, LaJeunesse T (1999) Phylogenetic analysis of the SSUrRNA from members of the Chrysophyceae. Protist 150:71–84

Arndt H, Dietrich D, Auer B, Cleven EJ, Gräfenhan T, WeitereM, Mylnikov AP (2000) Functional Diversity of HeterotrophicFlagellates in Aquatic Ecosystems. In Leadbeater BSC, GreenJC (eds) The Flagellates: Unity, Diversity and Evolution. Taylor& Francis Press, London, pp 240–268

Bass D, Cavalier-Smith C (2004) Phylum-specific environmen-tal DNA analysis reveals remarkably high global biodiversity ofCercozoa (Protozoa). Int J Syst Evol Microbiol 54:2393–2404

Brandt SM, Sleigh MA (2000) The quantitative occurrence ofdifferent taxa of heterotrophic flagellates in Southampton water,U.K. Estuar Coast Shelf Sci 51:91–102

Caron DA, Countway PD, Brown MV (2004) The growingcontributions of molecular biology and immunology to protistanecology: Molecular signatures as ecological tools. J EukaryotMicrobiol 51:38–48

Carr M, Leadbeater BSC, Hassan R, Nelson M, BaldaufSL (2008) Molecular phylogeny of choanoflagellates, the sistergroup to Metazoa. Proc Natl Acad Sci USA 105:16641–16646

Carrias JF, Amblard C, Quiblier-Lloberas C, Bourdier G(1998) Seasonal dynamics of free and attached heterotrophicnanoflagellates in an oligomesotrophic lake. Freshwat Biol39:91–101

Castresana J (2000) Selection of conserved blocks from mul-tiple alignments for their use in phylogenetic analysis. Mol BiolEvol 17:540–552

Cavalier-Smith T, Chao EE (2006) Phylogeny and megasys-tematics of phagotrophic heterokonts (Kingdom Chromista). JMol Evol 62:388–420

Chambouvet A, Morin P, Marie D, Guillou L (2008) Controlof toxic marine dinoflagellate blooms by serial parasitic killers.Science 322:1254–1257

Fenchel T (1982) Ecology of heterotrophic microflagellates.IV. Quantitative occurrence and importance as bacterial con-sumers. Mar Ecol Prog Ser 9:35–42

Fenchel T, Patterson DJ (1988) Cafeteria roenbergensis nov.gen., nov. sp., a heterotrophic microflagellate from marineplankton. Mar Microb Food Webs 3:9–19

Emerging Diversity Within Three Protist Groups 447

Fuller NJ, Campbell C, Allen DJ, Pitt FD, Zwirglmaier K,LeGall F, Vaulot D, Scanlan DJ (2006) Analysis of photosyn-thetic picoeukaryotic diversity at open ocean sites in the ArabianSea using a PCR biased towards marine algal plastids. AquatMicrob Ecol 43:79–93

Galtier N, Gouy M, Gautier C (1996) SeaView and Phylo_Win,two graphic tools for sequence alignment and molecular phy-logeny. Comput Applic Biosci 12:543–548

Guillou L, Viprey M, Chambouvet A, Welsh RM, KirkhamAR, Massana R, Scanlan DJ, Worden AZ (2008) Widespreadoccurrence and genetic diversity of marine parasitoids belong-ing to Syndiniales (Alveolata). Environ Microbiol 10:397–408

Huelsenbeck JP, Ronquist F (2001) MrBayes: Bayesian infer-ence of phylogenetic trees. Bioinfomatics 17:754–755

Jürgens K, Massana R (2008) Protistan Grazing on MarineBacterioplankton. In Kirchman, DL (ed) Microbial Ecology of theOceans, Second Edition. John Wiley & Sons, Inc., New York,USA, pp 383–441

Katoh K, Misawa K, Kuma L, Miyata Y (2002) MAFFT: a novelmethod for rapid multiple sequence alignment based on fastFourier transform. Nucleic Acids Res 30:3059–3066

King N, Westbrook MJ, Young SL, Kuo A, Abedin M, Chap-man J, Fairclough S, Hellsten U, Isogai Y, Letunic I, MMarr M, et al. (2008) The genome of the choanoflagellateMonosiga brevicollis and the origin of metazoans. Nature 451:783–788

Könneke M, Bernhard AE, de la Torre JR, et al. (2005) Iso-lation of an autotrophic ammonia-oxidizing marine archaeon.Nature 437:543–546

Leadbeater BSC (1991) Choanoflagellate Organization withSpecial Reference to the Loricate Taxa. In Patterson DJ, LarsonJ (eds) The Biology of the Free-living Heterotrophic Flagellates.Clarendon Press, Oxford, pp 241–258

Leakey RJG, Leadbeater BSC, Mitchell E, MccreadySMM, Murray AWA (2002) The abundance and biomass ofchoanoflagellates and other nanoflagellates in waters of con-trasting temperature to the north-west of South Georgia in theSouthern Ocean. Eur J Protistol 38:333–350

Lee JJ, Leedale GF, Bradbury P (2000) The Illustrated Guideto the Protozoa, 2nd ed. Lawrence: Society of Protozoologists

Lefranc M, Thénot A, Lepère C, Debroas D (2005) Geneticdiversity of small eukaryotes in lakes differing by their trophicstatus. Appl Environ Microbiol 71:5935–5942

Leipe DD, Winright PO, Gunderson JH, Porter D, Pat-terson DJ, Valois F (1994) 16S-like rRNA sequences fromLabyrinthuloides minuta and Cafeteria roenbergensis. Phycolo-gia 33:369–377

Lim EL, Dennet MR, Caron DA (1999) The ecology ofParaphysomonas imperforata based on studies employingoligonucleotide probe identification in coastal water samplesand enrichment cultures. Limnol Oceanogr 44:37–51

Logares R, Shalchian-Tabrizi K, Boltovskoy A, RengeforsK (2007) Extensive dinoflagellate phylogenies indicate infre-quent marine–freshwater transitions. Mol Phylogen Evol 45:887–903

Marin B, Melkonian M (2010) Molecular phylogeny and classi-fication of the Mamiellophyceae class. nov. (Chlorophyta) based

on sequence comparisons of the nuclear- and plastid-encodedrRNA operons. Protist 161:304–336

Massana R, Pedrós-Alió C (2008) Unveiling new micro-bial eukaryotes in the surface ocean. Curr Opin Microbiol11:213–218

Massana R, Balagué V, Guillou L, Pedrós-Alió C (2004)Picoeukaryotic diversity in an oligotrophic coastal site studiedby molecular and culturing approaches. FEMS Microbiol Ecol50:231–243

Massana R, del Campo J, Dinter C, Sommaruga R (2007)Crash of a population of the marine heterotrophic flagellateCafeteria roenbergensis by viral infection. Environ Microbiol9:2660–2669

Massana R, Pernice M, Bunge J, del Campo J (2010)Sequence diversity and novelty of natural assem-blages of picoeukaryotes from the Indian Ocean. ISMEJ doi:10.1038/ismej.2010.104

Massana R, Guillou L, Terrado R, Forn I, Pedrós-Alió C(2006) Growth of uncultured heterotrophic flagellates in una-mended seawater incubations. Aquat Microb Ecol 45:171–180

Not F, Latasa M, Scharek R, Viprey M, Karleskind P, BalaguéV, Ontoria-Oviedo I, Cumino A, Goetze E, Vaulot D, MassanaR (2008) Protistan assemblages across the Indian Ocean, witha specific emphasis on the picoeukaryotes. Deep Sea Res PartI 55:1456–1473

Not F, del Campo J, Balagué V, de Vargas C, Massana R(2009) New insights into the diversity of marine picoeukaryotes.PLoS ONE 4:e7143

Patterson DJ, Lee WJ (2000) Geographic Distribution andDiversity of Free-Living Heterotrophic Flagellates. In Lead-beater BSC, Green JC (eds) The Flagellates: Unity, Diversityand Evolution. Taylor & Francis Press, London, pp 269–287

Pernthaler J (2005) Predation on prokaryotes in the watercolumn and its ecological implications. Nature Rev Microbiol3:537–546

Preisig HR, Vørs N, Hällfors G (1991) Diversity of Het-erotrophic Heterokont Flagellates. In Patterson DJ, Larson J(eds) The Biology of the Free-living Heterotrophic Flagellates.Clarendon Press, Oxford, pp 361–399

Rappé MS, Connon SA, Vergin KL, Giovannoni SJ (2002)Cultivation of the ubiquitous SAR11 marine bacterioplanktonclade. Nature 418:630–633

Richards TA, Vepritskiy AA, Gouliamova DE, Nierzwicki-Bauer SA (2005) The molecular diversity of freshwaterpicoeukaryotes from oligotrophic lake reveals diverse, dis-tinctive and globally dispersed lineages. Environ Microbiol7:1413–1425

Rusch DB, Halpern AL, Sutton G, Heidelberg KB,Williamson S, Yooseph S, Wu D, Eisen JA, Hoffman JM,Remington K, Beeson K, et al. (2007) The Sorcerer II GlobalOcean Sampling expedition: Northwest Atlantic through East-ern Tropical Pacific. PLoS Biology 5:398–431

Sherr EB, Sherr BF (2002) Significance of predation by pro-tists in aquatic microbial food webs. Antonie van Leeuwenhoek81:293–308

Shi XL, Marie D, Jardillier L, Scanlan DJ, Vaulot D (2009)Groups without cultured representatives dominate eukaryotic

448 J. del Campo and R. Massana

picophytoplankton in the oligotrophic South East Pacific Ocean.PLoS ONE 4:e7657

Slapeta J, Moreira D, López-García P (2005) Theextent of protist diversity: insights from molecular ecol-ogy of freshwater eukaryotes. Proc Biol Sci 272:2073–2083

Stamatakis A (2006) RAxML-VI-HPC: Maximum likelihood-based phylogenetic analyses with thousands of taxa and mixedmodels. Bioinformatics 22:2688–2690

Stoeck T, Behnke A, Christen R, Amaral-Zettler L,Rodriguez-Mora MJ, Chistoserdov A, Orsiand W, Edg-comb VP (2009) Massively parallel tag sequencing reveals thecomplexity of anaerobic marine protistan communities. BMCBiology 7:72

Swofford DL (2002) PAUP*. Phylogenetic Analysis Using Parsi-mony (*and Other Methods). Sinauer Associates, Sunderland,Mass

von Wintzingerode F, Göbel UB, Stackebrandt E (1997)Determination of microbial diversity in environmental samples:pitfalls of PCR-based rRNA analysis. FEMS Microbiol Rev21:213–229

Zhu F, Massana R, Not F, Marie D, Vaulot D (2005) Mappingof picoeucaryotes in marine ecosystems with quantitative PCRof the 18S rRNA gene. FEMS Microbiol Ecol 52:79–92

Zuendorf A, Bunge J, Behnke A, Barger KJA, StoeckT (2006) Diversity estimates of microeukaryotes below thechemocline of the anoxic Mariager Fjord, Denmark. FEMSMicrobiol Ecol 58:476–491