Académique Documents
Professionnel Documents
Culture Documents
ABSTRACT Sexual size dimorphism in the postca- cies. Using these samples, we also evaluated molar dimor-
nine dentition of the late Miocene hominoid Lufengpithe- phism and taxonomic composition in two other Miocene
cus lufengensis exceeds that in Pongo pygmaeus, demon- ape samples—Ouranopithecus macedoniensis from
strating that the maximum degree of molar size dimor- Greece, specimens of which can be sexed based on associ-
phism in apes is not represented among the extant ated canines and P3s, and the Sivapithecus sample from
Hominoidea. It has not been established, however, that Haritalyangar, India. Ouranopithecus is more dimorphic
the molars of Pongo are more dimorphic than those of than the extant taxa but is similar to Lufengpithecus,
any other living primate. In this study, we used resam- demonstrating that the level of molar dimorphism
pling-based methods to compare molar dimorphism in required for the Greek fossil sample under the single-spe-
Gorilla, Pongo, and Lufengpithecus to that in the papio- cies taxonomy is not unprecedented when the compara-
nin Mandrillus leucophaeus to test two hypotheses: tive framework is expanded to include extinct primates.
(1) Pongo possesses the most size-dimorphic molars In contrast, the Haritalyangar Sivapithecus sample, if
among living primates and (2) molar size dimorphism in it represents a single species, exhibits substantially
Lufengpithecus is greater than that in the most dimorphic greater molar dimorphism than does Lufengpithecus.
living primates. Our results show that M. leucophaeus Given these results, the taxonomic status of this sample
exceeds great apes in its overall level of dimorphism and remains equivocal. Am J Phys Anthropol 140:253–264,
that L. lufengensis is more dimorphic than the extant spe- 2009. V 2009 Wiley-Liss, Inc.
C
A frequently encountered problem in hominoid paleon- However, adopting a multiple-species taxonomy for a fos-
tology is identification of the source of high levels of size sil sample solely on the basis of excessive size variation
variation in a fossil sample (e.g., Kay 1982a,b; Lieber- relative to Gorilla and Pongo is problematic for two rea-
man et al., 1988; Cope and Lacy, 1992; Albrecht and sons. First, it is not clear that the upper limit of intra-
Miller, 1993; Kramer, 1993; Martin and Andrews, 1993; specific variation in extant primates is represented by
Richmond and Jungers, 1995; Lockwood et al., 1996, these taxa. Among living primates, Gorilla and Pongo
2000; Plavcan and Cope, 2001; Silverman et al., 2001; are exceeded in body-mass dimorphism (and presumably
Scott and Lockwood, 2004; Villmoare, 2005). While some intraspecific variation in body mass) by the African
sources are relatively easily identified and controlled papionin Mandrillus sphinx (Jungers and Smith, 1997;
(e.g., variation due to ontogeny or pathology), others Setchell et al., 2001). Although it has not been estab-
present greater difficulty. For example, high variation in lished whether this difference is reflected in aspects of
a single fossil sample can be interpreted as evidence of skeletal or dental size variation and dimorphism, data
the presence of multiple species, changes in size over from other papionins, particularly Papio, indicate that at
time, or marked sexual dimorphism, or some combina- least some of the members of this clade may be more
tion of these factors. Determining which of these alterna- skeletally and dentally dimorphic than the great apes
tives is responsible for the pattern of variation in a given (e.g., Wood, 1976; Uchida, 1996a,b; Plavcan, 2002, 2003).
fossil assemblage is important because each has different Second, the upper limit of intraspecific variation may
implications regarding species diversity, modes of evolu- not be represented by any extant primate. Among fossil
tionary change (i.e., anagenesis vs. cladogenesis), and primates, the hominoid sample from the late Miocene
social behavior.
One perspective on fossil hominoid taxonomy specifies
that the degree of variation in extinct species should not *Correspondence to: Jeremiah E. Scott, School of Human Evolution
be greater than that in Gorilla and Pongo, the most sex- and Social Change, Institute of Human Origins, Arizona State Univer-
ually dimorphic extant hominoids, which logically sity, Tempe, AZ 85287-4101, USA. E-mail: jeremiah.scott@asu.edu
requires rejection of a single-species hypothesis in cases
where a temporally and geographically restricted fossil Received 3 July 2008; accepted 28 January 2009
sample is more variable than these great apes (e.g., Kay,
1982a,b; Lieberman et al., 1988; Martin and Andrews DOI 10.1002/ajpa.21059
et al., 1993; Teaford et al., 1993; Walker et al., 1993; see Published online 8 April 2009 in Wiley InterScience
also Cope and Lacy, 1992; Cope, 1993; Plavcan, 1993). (www.interscience.wiley.com).
C 2009
V WILEY-LISS, INC.
254 J.E. SCOTT ET AL.
site of Lufeng, China, represents a single species— time represented by the hominoid-bearing deposits at
Lufengpithecus lufengensis—that exceeds Gorilla and Lufeng is unknown, temporal variation is unlikely to be
Pongo in its degree of postcanine sexual dimorphism a major component of the high level of size variation in
(Kelley and Xu, 1991; Kelley, 1993; Kelley and Plavcan, L. lufengensis, given that intrasexual variation in the
1998). Establishing that the Lufeng sample represents a sample is within the range of modern species (Kelley
single highly dimorphic species was made possible by and Plavcan, 1998). The fact that L. lufengensis exceeds
two key characteristics of the assemblage: the sample is Pongo in its level of molar dimorphism means that it is
large, comprising hundreds of teeth (e.g., Kelley and potentially more dimorphic in the molar dentition than
Etler, 1989; Wood and Xu, 1991), and a number of post- any extant primate, as Pongo is commonly thought to
canine dentitions have been confidently sexed using possess the greatest level of molar dimorphism among
associated canines and P3s (e.g., Kelley and Xu, 1991; living primates (e.g., Mahler, 1973; Kelley and Xu, 1991;
Kelley, 1993; Kelley, 1995a,b). Using the sexed specimens Kelley and Plavcan, 1998). If true, then including the
(n 16 for each molar position), Kelley and colleagues Lufeng sample as part of the comparative framework for
(Kelley and Xu, 1991; Kelley, 1993; Kelley and Plavcan, assessing variation in fossil primate samples becomes
1998) demonstrated that molar dimorphism in L. lufen- even more critical.
gensis is so high that there is no overlap between male In fact, it has not been quantitatively verified that
and female individuals in bivariate plots of mesiodistal Pongo expresses the greatest degree of molar dimor-
and buccolingual dimensions. Several researchers have phism among living primates, and therefore the claim
argued that the Lufeng sample contains multiple species that the degree of molar dimorphism documented in L.
(e.g., Wu and Oxnard, 1983a,b; Martin, 1991; Cope and lufengensis falls outside the range observed in living pri-
Lacy, 1992; Plavcan, 1993), but a mixture of two or more mate species has not been adequately tested. Thus, in
species is unlikely to have produced the pattern of varia- this study, we test two hypotheses regarding molar size
tion observed in the sample, unless one appeals to highly dimorphism in primates: (1) Pongo represents the upper-
improbable sampling events (Kelley and Plavcan, 1998). most extreme of molar dimorphism among living pri-
Thus, L. lufengensis extends the known range of intra- mates, and (2) molar dimorphism in L. lufengensis is
specific size variation and sexual dimorphism in greater than that in the most dimorphic living primate
the Hominoidea, at least with respect to the postcanine species. We then apply the results of these analyses to
dentition. other potential instances of extreme dimorphism in the
Despite initial objections based on both ontological and late Miocene hominoid fossil record—the Sivapithecus
epistemological grounds (e.g., Ruff et al., 1989; Martin, material from Haritalyangar, India, and the Ouranopi-
1991; Cope and Lacy, 1992; Martin and Andrews, 1993; thecus macedoniensis material from Greece (Kelley,
Plavcan, 1993; Teaford et al., 1993; Walker et al., 1993), 2005; Schrein, 2006). Specifically, we evaluate whether
the idea that some fossil hominoid species were more levels of apparent sexual dimorphism (i.e., the level of
dimorphic than living great apes has gained wider accep- dimorphism required if the distinct large and small size
tance, and many researchers now acknowledge extreme clusters evident in the Sivapithecus and O. macedonien-
dimorphism as a potential source of high measures of sis molar samples represent conspecific males and
variation that must be considered when evaluating fossil females, respectively) in these fossil samples fall within
samples (e.g., Plavcan, 2001; Plavcan and Cope, 2001; the limits of dimorphism established for living primates
Scott and Lockwood, 2004; Schrein, 2006; Skinner et al., and L. lufengensis.
2006; Simons et al., 2007; Humphrey and Andrews,
2008).1 This is not to say that extreme dimorphism
should be regarded as the null hypothesis for Miocene
hominoids; rather, we are suggesting that extreme MATERIALS AND METHODS
dimorphism is a viable alternative to the hypothesis that Three extant species were included in the analysis:
high levels of size variation in a fossil sample indicate the western lowland gorilla (Gorilla gorilla), the Bor-
the presence of multiple species. Acceptance of L. lufen- nean orangutan (Pongo pygmaeus), and the drill (Man-
gensis as a single highly dimorphic species highlights drillus leucophaeus) (Table 1). The drill was chosen to
the need to incorporate other comparative species—in represent papionins because Plavcan’s (1990) data set
addition to the living great apes—when evaluating fossil indicates that Mandrillus probably has the most dimor-
samples. One option is to use the highly dimorphic phic postcanine teeth of any extant papionin. Mandrillus
papionins as analogues, which some studies have done leucophaeus is smaller in body size than M. sphinx and
(e.g., Ruff et al., 1989; Teaford et al., 1993; Uchida may not be as sexually dimorphic in body mass (Jungers
1996b; Harvati et al., 2004; Baab, 2008). Another option and Smith, 1997), but the two species have similar
is to use L. lufengensis as an analogue (Kelley, 2005). degrees of postcanine dimorphism. This assessment is
The use of fossil species to model intraspecific varia- based on a comparison of Plavcan’s (1990) M. leuco-
tion in other fossil assemblages was suggested by Wood phaeus data set to unpublished data for M. sphinx col-
(1991), who used Australopithecus boisei to determine lected by S. Frost, R. Nuger, and M. Singleton. The
whether variation in A. africanus and A. robustus indi- M. leucophaeus data were used in order to avoid the
cated the presence of multiple species in each of these potential for interobserver error in the M. sphinx data.
hypodigms (for other examples of the use of extinct spe- For each of the extant species, maximum length (mesio-
cies to evaluate variation in fossil samples, see Kelley, distal, MD) and width (buccolingual, BL) dimensions of
2005; Skinner et al., 2006; Baab, 2008). Although Wood’s the mandibular molars were taken from the literature
(1991) intent in taking this approach was to control for (for G. gorilla: Mahler, 1973; for P. pygmaeus and M. leu-
temporal variation, the purpose of using L. lufengensis cophaeus: Plavcan, 1990). Maxillary molars were not
as an analogue would be to include a reference sample included in the analysis because sample sizes for these
that possesses a level of intraspecific variation not repre- teeth are not as large as those for the mandibular
sented among extant hominoids. Although the amount of molars in the fossil samples.
are identical to those that would have been obtained had no difference was considered falsified at the a 5 0.05
we simply used the mean method [i.e., dividing the sam- level. For these tests, two-tailed P-values were obtained
ple into ‘‘males’’ and ‘‘females’’ about the mean (Plavcan, by counting the total number of bootstrap ISD values
1994; Gordon et al., 2008)]. that were as extreme as or more extreme than the Siva-
The Sivapithecus and O. macedoniensis samples were pithecus and O. macedoniensis values (including the val-
evaluated using resampling methods, but because these ues for the two fossil samples) and dividing that number
samples are small (n 10 for all molar positions), they by the total number of ISDs (i.e., 1001). In this case,
were treated as point estimates for the purpose of com- ‘‘extreme’’ refers to values that when subtracted from
paring them to the extant taxa and L. lufengensis. Fol- the comparative sample’s ISD produce a difference
lowing previous studies (e.g., Lockwood et al., 1996, (regardless of sign) as large as or larger than the differ-
2000; Lockwood, 1999; Silverman et al., 2001; Reno ence produced by subtracting the fossil sample’s ISDA
et al., 2003; Villmoare, 2005; Harmon, 2006; Schrein, from the comparative sample’s ISD.
2006), we bootstrapped the comparative species Because our division of the Sivapithecus sample into sexes
(including L. lufengensis) to obtain samples that were was based solely on size, we also analyzed this sample using
identical to the Sivapithecus and O. macedoniensis sam- a sex-blind statistic—the CV—to estimate dimorphism. For
ples in size and sex ratio, creating distributions for each molar position, we bootstrapped the comparative sam-
determining the probability of obtaining a sample from ples 1000 times each at a sample size equal to the Sivapithe-
G. gorilla, P. pygmaeus, M. leucophaeus, and L. lufengen- cus sample but without regard to sex (i.e., the sex ratios of
sis with the same level of molar dimorphism observed in the bootstrapped samples did not necessarily match the
the Sivapithecus and O. macedoniensis samples. hypothesized sex ratio of the Sivapithecus sample) and cal-
In the procedure describe above, resampling without culated the CV for each. For this part of the analysis, the
replacement can be used instead of bootstrapping (e.g., comparative samples (including the Lufeng sample) were
Gordon et al., 2008). In fact, resampling without replace- modified so that the sex ratios for each tooth were balanced
ment is more likely to produce lower P-values than prior to being bootstrapped. We then compared the CVs for
resampling with replacement given that, at small sample the Sivapithecus molars to the resulting distributions and
sizes (e.g., n 5 5–7 in the case of the Sivapithecus analy- determined the statistical significance of the sample differ-
sis), the latter can, in principle, produce samples com- ences as described above. For this analysis, we only exam-
posed only of multiple entries of the largest male and ined the individual molar positions, as there are currently
smallest female, or samples composed only of multiple no methods for dealing with missing data in the calculation
entries of the smallest male and largest female. Clearly, of the CV. The results of these analyses did not differ sub-
such samples would produce wider bootstrap distribu- stantively from the analyses in which the specimens were
tions (i.e., with very high and very low ISDs) that will sexed, and thus only the ISD-based results are reported.
be more likely to encompass the fossil value. However, Finally, it is important to note that, because our com-
Cope and Lacy (1992, p. 361), in their study of the use of parative samples are not composed entirely of complete
the coefficient of variation (CV) for evaluating variation molar rows, the P-values reported for the analyses of
in fossil samples, noted that a comparative sample ‘‘of multivariate molar dimorphism should be considered ap-
hundreds or thousands is needed to properly simulate proximate. For example, consider a case in which the
CV sample distributions.’’ This problem motivated them molars of a species are identical in their degree of dimor-
to develop a method in which a very large (n 5 10,000) phism. If a sample of 40 males and 40 females is col-
simulated ‘‘population’’ is generated using descriptive lected in which the ten largest males are missing their
statistics from samples of extant species. This simulated M3s, then the estimate of dimorphism for the M3 will be
population is then resampled without replacement at a lower than in the other teeth. When the teeth are com-
sample size equal to the fossil assemblage in order to bined and multivariate molar dimorphism is estimated,
determine the probability of sampling the CV observed the estimate will be biased due to the missing M3 data.
in the fossil sample from the simulated population. As Such a sample will produce a bootstrap distribution that
an alternative for overcoming the intractable problem of is shifted to the left (i.e., toward monomorphism), result-
limited comparative material, Lockwood et al. (1996) ing in an artificially low P-value—and a potential type II
used the bootstrap to generate samples equal in size to error—in a pairwise comparison with a fossil sample.
fossil samples directly from the comparative samples. In However, this problem is unlikely to be an issue in our
this study, we preferred the bootstrap over resampling analysis because the missing teeth in our samples are
without replacement because it is not clear that our com- not size-biased. Thus, the effects of missing data on the
parative samples are sufficiently large. P-values for the analysis of multivariate molar dimor-
For each molar position, two sets of 1000 bootstrap phism are likely to be minimal. Therefore, in order to
samples were drawn from each of the extant species use samples that are as large as possible, we have cho-
samples and from the Lufengpithecus sample, with one sen not to limit the comparative samples to only those
set matching the composition of the Sivapithecus sample individuals that preserve complete molar rows.
and the other matching the composition of the O. mace-
doniensis sample.3 For example, in the Sivapithecus RESULTS
analysis, each bootstrap sample contained seven M1s
(four males, three females), six M2s (three males, three The ISDs for the extant taxa and L. lufengensis are
females), and five M3s (two males, three females). For presented in Table 3. For multivariate molar size, sam-
each bootstrap sample, log-transformed (base e) ISDs ple dimorphism is greatest in L. lufengensis, followed
were calculated as described above. The Sivapithecus in rank order by M. leucophaeus, P. pygmaeus, and
and O. macedoniensis ISDAs (log-transformed) were then G. gorilla. This pattern is repeated at each individual
compared to the bootstrap distributions; if the values for molar position with one exception: M3 sample dimor-
these two fossil samples fell outside the middle 95% of phism is greater in G. gorilla than in P. pygmaeus. The
the bootstrap distributions, then the null hypothesis of bootstrap tests for multivariate molar size dimorphism
Mandrillus
in parentheses (probabilities are two-tailed). Abbreviations: G, Gorilla; P, Pongo; M, Mandrillus; L, Lufengpithecus. An asterisk (*) indicates that the comparison is not significant
Nonsignificant differences are indicated by an equality symbol; greater-than symbols indicate significance and the direction of difference. P-values for each comparison are given
tan are more dimorphic than those of the gorilla, Uchida
5 (0.0885)
(1996a) documented intrageneric variation in molar
dimorphism in both Pongo and Gorilla. Thus, a more
complete analysis that includes material from eastern
lowland gorillas, mountain gorillas, and Sumatran
orangutans could reveal differences between these two
M [ P* (0.0305)
genera.
L [ P (0.001)
It is also important to point out that our ability to
Pongo
detect differences among the living taxa is hampered in
M3
for each species are about half the total sample sizes.
Thus, given that the differences in molar sample ISDs
among G. gorilla, P. pygmaeus, and M. leucophaeus are
relatively small, especially when compared to differences
L [ M* (0.0195)
Nonsignificant differences are indicated by an equality symbol; greater-than symbols indicate significance and the direction of
difference. P-values for each comparison are given in parentheses (probabilities are two-tailed). Abbreviations: G, Gorilla; P, Pongo;
M, Mandrillus; O, Ouranopithecus. An asterisk (*) indicates that the comparison is not significant after sequential Bonferroni
adjustment (applied within each variable).
Nonsignificant differences are indicated by an equality symbol; greater-than symbols indicate significance and the direction of dif-
ference. P-values for each comparison are given in parentheses (probabilities are two-tailed). Abbreviations: G, Gorilla; P, Pongo; M,
Mandrillus; L, Lufengpithecus; S, Sivapithecus.
form a fairly tight cluster, precluding the use of size as a unknown. Thus, based on the current evidence, the tax-
criterion for sex assignment; see Fig. 8.3 in Kelley, onomy of the Haritalyangar Sivapithecus material
2005). remains ambiguous (see also Kelley, 2005).
If we accept that fossil species are not constrained by
currently known limits of intraspecific variation and sex- CONCLUSIONS
ual dimorphism (Kelley, 1993; Kelley and Plavcan, 1998;
Plavcan and Cope, 2001; Schrein, 2006), then the single- Our analysis of sexual dimorphism in molar size in
species hypothesis cannot be definitively ruled out for great apes and the drill demonstrates that the latter spe-
the Haritalyangar sample, despite the fact that such a cies exceeds the extant hominoids in some aspects of
species would have to be even more dimorphic than molar dimorphism. Thus, Pongo does not possess the
L. lufengensis. While the high levels of size variation most dimorphic molars among living primates, though
and apparent sexual dimorphism in the Haritalyangar they are among the most dimorphic. These results indi-
second and third mandibular molars can be used to cate that Mandrillus leucophaeus (and perhaps other
argue for the presence of two species (Kelley, 2005), such papionins) should be included in extant comparative
evidence cannot be used as the sole basis for rejecting samples when evaluating variation in fossil hominoid
the single-species hypothesis (Kelley and Plavcan, 1998; assemblages, particularly when variation appears to
Plavcan and Cope, 2001; Schrein, 2006). Presumably, exceed that in gorillas and orangutans. We confirmed
there is a limit to the amount of molar dimorphism that the extreme degree of molar dimorphism in the Lufeng-
can be expressed in primates, but whether or not pithecus lufengensis sample relative to extant species,
L. lufengensis (and O. macedoniensis) represents that thereby establishing the importance of using this species
limit is currently unknown. Other recently reported cases as a comparative analogue to represent levels of intra-
of extreme dimorphism, including the middle Miocene specific variation and sexual dimorphism not sampled
hominoid Griphopithecus alpani from Pas alar, Turkey among living primates (e.g., Kelley, 2005).
[apparent even though a second species has been identi- Using the drill and L. lufengensis samples as part of
fied in the Pas alar assemblage (Humphrey and Andrews, our comparative framework, we analyzed apparent size
2008; Kelley et al., 2008)], and the Oligocene early catar- dimorphism in the molars of two other late Miocene
rhine Aegyptopithecus zeuxis from the Fayum, Egypt hominoid assemblages that have been identified as possi-
(Simons et al., 2007), could provide insight into this issue. bly comprising single highly dimorphic species. Our
On the other hand, given that the Haritalyangar sec- results for the Haritalyangar Sivapithecus sample show
ond and third mandibular molars do not fit any of our that, if only one species is present at this site, then it is
comparative models of sexual dimorphism, additional more dimorphic than L. lufengensis for at least some
lines of evidence are required before accepting a single- teeth. Given the uncertainties concerning the sex of the
species taxonomy for this sample, as the two-species tax- individuals in this sample, and because the sample as a
onomy cannot be rejected based on current evidence whole does not fit any of our comparative models, we are
either. In this context, demonstration that the size clus- unable to choose with confidence between the hypotheses
ters evident in the M2 and M3 samples are truly homoge- that the sample contains (1) a single, extremely size-
neous with respect to sex using canine and/or P3 size dimorphic species or (2) two species, differing primarily
and morphology, as was done for the L. lufengensis and in size. On the other hand, in the case of Ouranopithecus
O. macedoniensis samples (Kelley and Xu, 1991; Kelley, macedoniensis, our analysis shows that the level of
1993; Schrein, 2006; see Fig. 2), would provide support molar dimorphism required under a single-species taxon-
for the single-species hypothesis. Conversely, a two-spe- omy is fully compatible with the degree of sexual dimor-
cies hypothesis—with each size cluster representing a phism in the molars of L. lufengensis, thus demonstrat-
different species—would be supported if it is shown that ing that, while a single-species taxonomy for the O. mac-
males and females are present in both clusters. Unfortu- edoniensis sample requires a degree of molar
nately, the current sex assignments are based on size dimorphism that is extreme compared to that in living
because there are no associated canines or P3s, which anthropoids, it is not extreme in comparison to at least
precludes unequivocally linking the high levels of varia- one other hominoid species from the late Miocene of
tion in the sample to sex differences. Eurasia.
Finally, it is important to note that the temporal span
for Sivapithecus at Haritalyangar, at approximately
400,000 years (Pillans et al., 2005), is certainly greater ACKNOWLEDGMENTS
than for either the Lufengpithecus or Ouranopithecus
samples, but how much of this range is represented in We thank Xu Qinghua of the Institute of Vertebrate
the portion of the Haritalyangar sample analyzed here is Paleontology and Paleoanthropology, Beijing, China, for