Académique Documents
Professionnel Documents
Culture Documents
doi:10.1111/j.1462-2920.2009.02023.x
Eukaryotic diversity and phylogeny using smalland large-subunit ribosomal RNA genes from
environmental samples
emi_2023
1..10
Introduction
The use of molecular methods has played a major role in
our understanding of microbial diversity. During the last
two decades, PCR amplification and sequencing of the
small-subunit ribosomal RNA gene (SSU rDNA) has been
extensively applied to survey the huge bacterial and
archaeal diversity found in many common and extreme
habitats (Barns et al., 1996; Hugenholtz et al., 1998).
Recently, this technique was also adopted for the analysis
of eukaryotic diversity and it revealed a variety of new
groups, such as the marine Group I alveolates (LpezGarca et al., 2001; Moon-van der Staay et al., 2001) or
the putative new algal clade picobyliphytes (Not et al.,
2007), as well as a large diversity within many well-known
groups such as the prasinophyte algae (Guillou et al.,
2004; Viprey et al., 2008). In addition, several SSU rDNA
sequences that branched deeply in the eukaryotic phylogeny were impossible to be specifically related to any
known group, even at the kingdom level, suggesting the
controversial existence of several novel eukaryotic
kingdom-level groups (Dawson and Pace, 2002;
Edgcomb et al., 2002).
A key component of all the SSU rDNA-based diversity
studies is the taxonomic classification of the organisms
whose sequences are retrieved, which is commonly
inferred by phylogenetic analysis of the novel environmental phylotypes. In addition to artefacts derived from
undetected chimeric sequences, single-marker phylogenies have a series of inherent methodological problems,
such as the lack of enough informative signal for the
resolution of deep nodes, which can engender artefactual misplacement in phylogenetic trees (Philippe and
Adoutte, 1998; Stiller and Hall, 1999; Philippe et al.,
2000). A typical phylogenetic reconstruction problem, the
long-branch attraction (LBA) artefact, often causes highly
divergent SSU rDNA sequences to be placed at the base
of phylogenetic trees, leading them to be considered as
potential novel eukaryotic lineages. Recently, it was demonstrated that from 28 published phylotypes representing
putative novel high-level eukaryotic taxa, only 11 were
actual potential new lineages (Berney et al., 2004;
Cavalier-Smith, 2004). For example, the phylotypes
DH145-EKD11 and CCW75, putative members of a new
Sample
Station
Coordinates
Depth (m)
Volume
filtered (l)
DH18
DHARMA 5
South Atlantic
DHARMA 32
South Atlantic
DHARMA 18b
South Atlantic
Marmara Sea
622211 533556
25
19
46/36
545944 582217
25
14
47/48
592245 554627
25
14
138/114
4050.295 N 281.397E
25
42/37
DH114
DH141
Ma131
Clones sequenced
(SSU/SSU + LSU)
2009 Society for Applied Microbiology and Blackwell Publishing Ltd, Environmental Microbiology
i
ng
Fu
va
tes
ia
ca
lar
Ex
dio
Ra
im
a
ls
s
An
yte
ts
ph
on
pto
Ha
He
ter
ok
ola
tes
tes
ve
op
hy
tes
Ch
top
yp
Cr
lor
yte
ph
pto
Ma131
hy
Clo n e freq u en cy (% )
90
80
70
60
50
40
30
20
10
0
s
yte
ph
Ha
on
ts
re
pto
ok
St
He
ter
ola
tes
tes
ve
hy
Al
op
lor
Ch
yp
top
hy
tes
Clo n e freq u en cy (% )
DH18
DH114
Al
DH141
90
80
70
60
50
40
30
20
10
0
Cr
Cr
Clo n e freq u en cy (% )
yp
to
Ch phyt
lor es
op
hy
Al tes
ve
o
He lates
ter
ok
on
t
An s
im
Gl
a
au
Ch co ls
oa ph
no yte
s
fla
ge
Ha llate
pto s
ph
yte
s
Cr
Clo n e freq u en cy (% )
yp
top
Ch hy
lor tes
op
hy
Al tes
ve
o
He late
ter s
ok
Ha ont
pto s
ph
Te ytes
lon
em
i
Ce ds
rco
zo
a
90
80
70
60
50
40
30
20
10
0
Fig. 1. Eukaryotic diversity identified using SSU and SSU + LSU rDNA libraries. Histograms represent the percentage of clones belonging to
each eukaryotic group for a given environmental sample. For the number of clones analysed see Table 1.
of library contained very few sequences (24), corresponding very likely to stochastic differences due to the incomplete exploration of the libraries. Our analysis showed that
for each of the most abundant OTUs, identical or very
closely related clones were retrieved with both markers
(Table 2). This was in agreement with the results already
reported for the SSU + LSU rDNA-based analysis of bacterial diversity in marine plankton (Suzuki et al., 2001). All
this suggested that the SSU + LSU rDNA approach is able
to retrieve a quite similar diversity than the SSU rDNA. This
opened the possibility to apply that approach to improve
the phylogenetic analysis of the environmental sequences
difficult to place on the basis of the SSU rDNA alone.
Phylogenetic reconstruction using SSU + LSU
rDNA concatenations
After confirming the feasibility of the construction of
SSU + LSU rDNA libraries for eukaryotic diversity description and their overall similarity when compared with the
classical SSU rDNA-based approach, we compared the
resolving power of these two data sets in phylogenetic
analysis. Given that the concatenation of SSU + LSU rDNA
2009 Society for Applied Microbiology and Blackwell Publishing Ltd, Environmental Microbiology
Table 2. OTUs retrieved in the eight marine surface SSU and SSU + LSU rDNA libraries.
Clones per library (SSU/SSU + LSU)
OTU
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
Singletons
DH141
DH18
0/2
6/4
1/3
2/0
1/2
1/1
6/1
2/0
1/0
2/0
9/2
1/0
1/1
1/0
2/0
9/4
2/0
DH114
2/0
1/0
2/3
1/0
0/1
5/1
3/1
1/0
1/0
12/7
2/0
3/0
4/4
9/17
1/1
1/0
7/2
Ma131
Phylum
Alveolates
Dinoflagellates
Dinoflagellates, Karlodinium
Dinoflagellates
Dinoflagellates
Dinoflagellates, Lepidodinium
Dinoflagellates, Gyrodinium
Dinoflagellates, Pentapharsodinium
Dinoflagellates, Gymnodinium
Dinoflagellates, Karlodinium
Dinoflagellates, Karlodinium
Dinoflagellates
Dinoflagellates, Gyrodinium rubrum
Ciliates, Strombidium
Ciliates, Strombidium
Ciliates, Myrionecta
Ciliates, Strobilidium
Ciliates, Strombidium
Ciliates, Strombidium
Ciliates
Alveolates, Group I
Alveolates, Group I
Alveolates, Group I
Alveolates, Group I
Diatoms
Diatoms, Fragilariopsis
Dictyochophytes, Dictyocha
Mast1
Mast1
Mast1
Prasinophytes, Pyramimonas
Prasinophytes, Bathycoccus
Prasinophytes, Micromonas
Pyrenomonadales, Geminigera
Phaeocystales, Phaeocystis
Unclassified
Thaumatomonads, Protapsis
Picobiliphytes
Ascomycetes
Crustacea
Crustacea
Crustacea
1/0
1/1
1/0
3/7
0/3
3/9
2/3
3/0
0/2
1/0
2/5
1/2
0/2
0/1
14/11
3/0
67/63
8/0
0/2
0/2
0/2
0/2
Heterokonts
0/1
0/2
1/0
3/1
2/0
0/2
Chlorophytes
0/3
2/5
0/1
2/0
0/1
2/0
2/3
0/2
2/3
0/2
7/7
3/2
0/3
6/9
Cryptophytes
Haptophytes
Streptophytes
Cercozoa
Picobiliphytes
Fungi
Animals
28/14
a. OTUs are affiliated to a genus only if > 97% identical with a genus representative.
removed four groups composed exclusively of fastevolving species for which we did not retrieve representatives in our samples: Microsporidia, Diplomonadida,
Parabasalida and Amoebozoa. This allowed us including
~500 additional aligned sites, giving a total of 3247 conserved positions for phylogenetic reconstruction.
Maximum likelihood (ML) and Bayesian inference phylogenetic trees reconstructed from the SSU + LSU rDNA
alignment (Fig. 2) recovered a number of well-known
eukaryotic groups, in agreement with global eukaryotic
phylogenies published recently (Medina et al., 2003;
Rodrguez-Ezpeleta et al., 2005; 2007; Burki et al., 2007).
We found strong support for the monophyly of the
Opisthokonta, uniting Metazoa, Fungi and their unicellular
close relatives, the Choanozoa, Ichthyosporea and the
2009 Society for Applied Microbiology and Blackwell Publishing Ltd, Environmental Microbiology
100/1
0.06
2009 Society for Applied Microbiology and Blackwell Publishing Ltd, Environmental Microbiology
topology of the trees based on our data set was in agreement with most of the accepted intra- and inter-group
relationships (see above). To test the hypothesis that the
SSU + LSU rDNA may be useful to infer the taxonomic
affiliation of fast-evolving, conflictive environmental
sequences, we analysed two phylotypes closely related to
the red-tide forming ciliate species M. rubra obtained
during our environmental diversity surveys. The very fastevolving SSU rDNA of this ciliate was sequenced only a
few years ago (Johnson et al., 2004), and it was shown
to affiliate with a number of divergent environmental
sequences that were originally thought to be members of
a potential novel eukaryotic phylum (Lpez-Garca et al.,
2001; Edgcomb et al., 2002; Savin et al., 2004). It was
shown that the extremely rapid evolutionary rate of those
SSU rDNA sequences made it impossible to retrieve their
correct ciliate affinity in SSU rDNA analyses because of a
LBA artefact.
Thanks to taxon-rich SSU rDNA phylogenetic analyses
including the SSU rDNA portion of our SSU + LSU phylotypes, we recognized that two of them (DH114-1A22
and DH18-4A88) were very closely related to M. rubra
(99% sequence identity). We incorporated the two
Myrionecta-like phylotypes to our data set and reconstructed phylogenetic trees with only the SSU rDNA
part (1221 positions) and the concatenated SSU + LSU
rDNA sequences (3247 positions). In the SSU rDNA
tree (Fig. 3), the Myrionecta-like phylotypes branched
together with the other long branches, represented by
the excavate species (Euglenozoa and Reclinomonas
americana), with moderate support (69% ML BP and PP
of 0.52), which testified for a LBA artefact. Interestingly,
in the tree reconstructed with the two concatenated
markers, the Myrionecta-like sequences correctly
branched within the Alveolata with maximal support
(100% ML BP and PP of 1), close to the ciliates (57%
ML BP, not retrieved with the Bayesian method). In addition, the overall topology of the SSU + LSU rDNA tree
was not affected by the addition of these two highly
divergent sequences (compare Figs 2 and 3) demonstrating the robustness of the SSU + LSU rDNA
approach. Therefore, even if the statistical support
remained moderate, the concatenation of the two rDNA
markers allowed identifying these highly divergent phylotypes as ciliates (and certainly as alveolates with
strong support), which was impossible when using the
SSU rDNA information alone.
Conclusion
For several years, the SSU rDNA has been used as the
preferred marker to explore the diversity of microbial
eukaryotic communities in a variety of environments,
leading to the discovery of a huge hidden diversity. Con-
2009 Society for Applied Microbiology and Blackwell Publishing Ltd, Environmental Microbiology
SSU rDNA
69/.85
57/1
Dinoflagellata
100/1
Dinoflagellata
-/.76
97/1
Perkinsida
84/51/-
100/1
74/.94
Chromera velia
70/.84
98//.99
100/1
Apicomplexa
Perkinsida
Chromera velia
87/-
Ma131_1A49
100/1
Apicomplexa
100/1
Ma131_1A57
100/1
57/-
Ciliophora
99/1
Ciliophora
57*/-
100/.5
DH114_1A22
Heterokonta
(1/4)
89/1
97/1*
96/1
53/-
100/1
75/100/1
60/-
DH18_4A88
Rhizaria
Rhodophyta
100/1
50/1
100/1
77/-
Centrohelida
100/1
57/-
Cryptophyta
79/.92
Viridiplantae
100/1
100/1
93/.78
Opisthokonta
100/1
71/90/1
100/1
Opisthokonta
Ancyromonas sigmoides
80/1
Euglenozoa
Apusomonas sp.
Myrionecta rubra
100/1
62/.97
69/.52
Cryptophyta
72/.98
Ancyromonas sigmoides
(1/4)
Plantae
57/.92
72/.96
Apusomonas sp.
(1/4)
Haptophyta
Glaucophyta
67/.57
Rhizaria
100/1
(1/4)
Haptophyta
94/.98
100/1
81/-
Heterokonta
90/1
100/1
95/-
100/1
Centrohelida
DH114_1A22
100/1
(1/4)
DH18_4A88
Reclinomonas americana
Euglenozoa
100/1
89/1
Reclinomonas americana
0.06
0.06
Fig. 3. ML phylogenetic trees with Myrionecta-like phylotypes (grey boxes). The taxonomic sampling is the same as in Fig. 2, but large groups
have been collapsed. On the left, the tree based on the SSU rDNA alone (1200 positions); on the right, the tree based on the SSU + LSU
rDNA concatenation (3247 positions).
2009 Society for Applied Microbiology and Blackwell Publishing Ltd, Environmental Microbiology
Experimental procedures
DNA samples and library construction
We amplified rRNA genes from four DNA samples of smallsize plankton (0.25 mm size fraction) collected in various sea
surface (25 m) locations in the South Atlantic (DH18, DH114
and DH141) and in the Marmara Sea (Ma131) that were
available in the laboratory from previous studies (LpezGarca et al., 2001; Lara et al., 2009). For each sample, SSU
and SSU + LSU rDNA libraries were constructed by PCR
amplification and cloning. The SSU rDNA was amplified with
eukaryote specific primers, EK-42F (CTCAARGAYTAAGC
CATGCA) and EK-1498R (CACCTACGGAAACCTTGTTA),
and the PCR products were cloned with the Topo TA cloning
kit (Invitrogen) following the manufacturers instructions. The
protocol was slightly modified for the construction of the
SSU + LSU rDNA libraries because of the much larger size of
the fragments to be amplified and cloned. The size of a
complete eukaryotic ribosomal RNA gene cluster, i.e. the
SSU rDNA, the internal transcribed spacers (ITS1 and ITS2),
the 5.8S ribosomal RNA genes and the LSU rDNA, is of
around 5.5 kbp, so we used a long-run enzyme, the Ex
TaKaRa (TaKaRa) to avoid premature elongation end during
the PCR cycles. PCR reactions were carried out under the
following conditions: 35 cycles (denaturation at 94C for 15 s,
annealing at 55C for 30 s, extension at 72C for 6 min)
preceded by 2 min denaturation at 94C, and followed by
10 min extension at 72C using the SSU rDNA-specific
primer Euk-42F and the LSU rDNA primer 28S-4R (TTCTGACTTAGAGGCGTTCAG). The library construction was
made with the Topo TA XL kit (Invitrogen), designed for high
cloning efficiency of long PCR products, following the manufacturers instruction. Two additional libraries were constructed for the SSU + LSU rDNA marker only, from samples
DH22 (5 m depth South Atlantic) and Ma121 (500 m depth
Marmara Sea).
Diversity analysis
For each SSU and SSU + LSU rDNA library, we selected 48
positive clones and sequenced their 5 region of the SSU
rRNA gene using the eukaryotic primer Euk-42F. For preliminary phylogenetic affiliation, the sequences were blasted
against the nr GenBank database to identify their closest
relatives. Saturation analysis of our libraries was performed
with those partial sequences using the software DOTUR
(Schloss and Handelsman, 2005). The OTU identification
was carried out with Codon Code Aligner (Licor), by
Phylogenetic analysis
All new sequences were incorporated to the available data
set of concatenated SSU + LSU rDNA sequences and
aligned using the ED program of the package MUST (Philippe,
1993). Ambiguously aligned regions and gaps were excluded
in phylogenetic analyses. The ML phylogenetic analyses
were carried out with the program Treefinder (Jobb et al.,
2004) and bootstrap values were calculated from 1000 replicates. Bayesian trees were inferred using MrBAYES 3.1.
Markov chain Monte Carlo searches were run with four
chains for 1000 000 generations, with trees being sampled
every 100 generations. The first 5000 trees were discarded
as burnin, keeping only trees generated well after chain
parameters stabilized. For both the ML and the Bayesian
methods, we applied a GTR + G + I model of nucleotide
substitution, taking into account a proportion of invariable
sites, and a G-shaped distribution of the rates of substitution
among sites with four rate categories. Fast-evolving groups
(Microsporidia, Parabasalida, Diplomonadida and Amoebozoa), for which we did not retrieve closely related phylotypes
in our samples, were removed from our analyses. Sequence
alignments are available upon request.
2009 Society for Applied Microbiology and Blackwell Publishing Ltd, Environmental Microbiology
References
Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J.,
Zhang, Z., Miller, W., and Lipman, D.J. (1997) Gapped
BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 33893402.
Barns, S.M., Delwiche, C.F., Palmer, J.D., and Pace, N.R.
(1996) Perspectives on archaeal diversity, thermophily and
monophyly from environmental rRNA sequences. Proc Natl
Acad Sci USA 93: 91889193.
Berney, C., Fahrni, J., and Pawlowski, J. (2004) How many
novel eukaryotic kingdoms? Pitfalls and limitations of
environmental DNA surveys. BMC Biol 2: 13.
Burki, F., Shalchian-Tabrizi, K., Minge, M., Skjaeveland, A.,
Nikolaev, S.I., Jakobsen, K.S., and Pawlowski, J. (2007)
Phylogenomics reshuffles the eukaryotic supergroups.
PLoS ONE 2: e790.
Cavalier-Smith, T. (1999) Principles of protein and lipid targeting in secondary symbiogenesis: euglenoid, dinoflagellate, and sporozoan plastid origin and the eukaryote family
tree. J Eukaryot Microbiol 46: 347366.
Cavalier-Smith, T. (2002) The phagotrophic origin of eukaryotes and phylogenetic classification of Protozoa. Int J Syst
Evol Microbiol 52: 297354.
Cavalier-Smith, T. (2004) Only six kingdoms of life. Proc R
Soc Lond B Biol Sci 271: 12511262.
Dawson, S.C., and Pace, N.R. (2002) Novel kingdom-level
eukaryotic diversity in anoxic environments. Proc Natl Acad
Sci USA 99: 83248329.
Dez, B., Pedrs-Ali, C., and Massana, R. (2001) Study of
genetic diversity of eukaryotic picoplankton in different
oceanic regions by small-subunit rRNA gene cloning and
sequencing. Appl Environ Microbiol 67: 29322941.
Edgcomb, V.P., Kysela, D.T., Teske, A., De Vera Gomez, A.,
and Sogin, M.L. (2002) Benthic eukaryotic diversity in the
Guaymas Basin hydrothermal vent environment. Proc Natl
Acad Sci USA 99: 76587662.
Guillou, L., Eikrem, W., Chretiennot-Dinet, M.J., Le Gall, F.,
Massana, R., Romari, K., et al. (2004) Diversity of picoplanktonic prasinophytes assessed by direct nuclear SSU
rDNA sequencing of environmental samples and novel
isolates retrieved from oceanic and coastal marine
ecosystems. Protist 155: 193214.
Habura, A., Pawlowski, J., Hanes, S.D., and Bowser, S.S.
(2004) Unexpected foraminiferal diversity revealed by
small-subunit rDNA analysis of Antarctic sediment.
J Eukaryot Microbiol 51: 173179.
Hampl, V., Horner, D.S., Dyal, P., Kulda, J., Flegr, J., Foster,
P.G., and Embley, T.M. (2005) Inference of the phylogenetic position of oxymonads based on nine genes: support
for Metamonada and Excavata. Mol Biol Evol 22: 2508
2518.
2009 Society for Applied Microbiology and Blackwell Publishing Ltd, Environmental Microbiology
2009 Society for Applied Microbiology and Blackwell Publishing Ltd, Environmental Microbiology