Weaver and Roseman (2005) review the case for Neandertal extinction based on ancient mtDNA. They present simulations that demonstrate that a Neandertal contribution to the modern human mtDNA pool is very unlikely. These simulations are simpler than those carried out by Currat and Excoffier (2004), but considering the publication schedule of Current Anthropology, they were probably completed earlier. Their conclusion is the same: if mtDNA has not undergone a selective sweep, then Neandertals almost certainly became extinct without issue.
But that's not especially news: indeed, it was strongly suspected even before ancient Neandertal mtDNA sequences were discovered (e.g. Manderscheid and Rogers 1996). The question has been, and remains, is human mtDNA actually neutral? Or is its recent pattern of variation in living humans the result of recent selection within the human lineage? If there was a selective sweep in humans, then the mtDNA of Neandertals shouldn't look like modern human mtDNA for that reason. This wouldn't prove that Neandertals contributed other genes to later people, but it would make their mtDNA variation irrelevant to their ultimate fate.
Unlike most papers on the topic, Weaver and Roseman (2005) review this issue. On the basis of their review, they conclude that mtDNA is almost certainly neutral. Here's how they put it:
For the selective-sweep alternative to be correct there would have to have been virtually simultaneous selective sweeps in hundreds of unlinked genetic regions, which seems unlikely (Weaver and Roseman 2005:682).
That does indeed seem unlikely, so it might seem that they have made a solid case.
But -- unsurprisingly to those who read the weblog often -- I think they have left out many important aspects of the story. There is a strong case for mtDNA selection, but Weaver and Roseman (2005) omit much of the data that point to that conclusion. And some of the data that they include actually indicates quite the opposite of what they claim.
They make a very common assumption -- one that is widespread in the human genetics literature -- but one that is nonetheless wrong: every expansion is the same expansion. A review of the sources that Weaver and Roseman (2005) cite shows quite the opposite: expansions are not all alike, and expansions estimated for nuclear DNA may actually prove that mtDNA was selected.
This is a very long post, and I have hidden most of it beneath the fold. Click on in if you want my take on selection on mtDNA....
Is it reasonable to think that mtDNA was selected?
Before embarking on a review of Weaver and Roseman's argument, it is important to tackle one central question: Is it even reasonable to think that mtDNA was selected?
If this were an unreasonable idea, there would be no point to arguing for it. So what reason might one have to think that a selective sweep of mtDNA might have happened?
Within the past 30,000 to 1 million years, human populations have changed radically in longevity (Caspari and Lee 2004), brain size (Lee and Wolpoff 2003; Ruff et al. 1997), diet, and energetics (Leonard and Robertson 1997; Sorensen and Leonard 2001). Human mtDNA variants have been found to be associated with chronic diseases of aging , brain disorders (Zhu et al. 2004), performance in athletes (Niemi and Majamaa 2005), and longevity itself (Niemi et al. 2005). The present pattern of variation also appears to be correlated with climate (Ruiz-Pesini et al. 2004), and may affect the dietary energetics and insulin metabolism (Lowell and Schulman 2005).
Simply put, variation in mtDNA is a strong target for further research into the effects of aging, metabolism, and disorders of the brain for a reason: it impacts all these areas strongly.
Together, these facts strongly suggest that human mtDNA may have undergone multiple adaptive substitutions within the past million years. They don't prove that such selection happened, but they give abundant reason to suppose that it might have. Indeed, Ruiz-Pesini et al. (2004) suggest that adaptive selection has happened on mtDNA in some regions of the world in recent times. This suggestion is fully consistent with -- and even foreshadows -- the idea that mtDNA underwent many adaptive substitutions during human evolution.
In fact, this is exactly the same logic by which many nuclear genes have been asserted to have been positively selected recently in human evolution. Consider the case of FoxP2. Enard et al. (2002) proposed that this gene had undergone a selective sweep within the past 200,000 years in humans, and Klein (2002) made it the centerpiece of his argument that language had evolved recently at the origin of modern humans. Like mtDNA, FoxP2 is strongly out of mutation-drift equilibrium. But human FoxP2 shows many fewer amino acid substitutions compared to chimpanzees than does human mtDNA (2 for FoxP2, 50-60 for mtDNA), meaning that the possibility of selection on human mtDNA should be greater, not less. And FoxP2 is only one functional gene; mtDNA contains 13 peptide-coding regions (Arnason et al. 1996), selection on any one of which would affect the entire molecule.
So what exactly is the difference that leads the same people to say that FoxP2 is selected and mtDNA is not? There is no statistical test of selection that shows FoxP2 to be selected and mtDNA not. Human mtDNA completely fails standard tests of neutrality such as Tajima's D (Merriwether et al. 1991), the ratio of synonymous to nonsynonymous substitutions (Wise et al. 1998), and comparison of within-species to between-species diversity (Wise et al. 1997).
In fact, the only difference I can find is that there is a literature that assumes mtDNA neutrality and attempts to use its variation to determine what kind of neutral event could explain its variation without selection. The only kind of event that suffices is a massive expansion of the human population -- estimated to be a hundred-fold or greater -- from an initial effective population size of fewer than 10,000 individuals to many millions (Harpending et al. 1998; 1993; Sherry et al. 1994). This event is proposed to have occurred anytime from 40,000 years ago to as much as 150,000 years ago or longer -- although the data indicates that it must have occurred earlier in Africa and later in Europe and East Asia.
There is no history of such an assumption for FoxP2 (although it might equally be suggested to represent such an event), therefore its variation is logically assumed to represent a selective sweep.
Thus, the issue of mtDNA selection cannot be separated from the issue of population expansion. It should be noted that an expansion of the human population does not disprove the hypothesis that a selective sweep occurred, but it does provide an alternative hypothesis that might explain the pattern of variation (although not the involvement of mtDNA in energy metabolism, diseases of aging, brain disorders, etc.).
Happily, we can test this alternative. Other genes, in particular, all neutral regions of the nuclear genome, must reflect the same demographic history as mtDNA. Thus, these genes should also show evidence of a massive population expansion.
What's an order of magnitude between friends?
Here's what Weaver and Roseman (2005:681-682) say about nuclear genomic evidence for population expansions:
We believe that the close correspondence between studies of mtDNA and microsatellites and recent studies of autosomal and X-chromosome SNPs that controlled for ascertainment bias and population subdivision makes it reasonable to assume a model of rapid human population growth from a small size, starting 200,000-40,000 years ago.
The microsatellite study they cite is the review by Zhivotovsky et al. (2003). Here's what Weaver and Roseman (2005:681) say:
Most populations show strong signatures of population growth with estimated start times that are consistent with those for mtDNA.
Here are the actual figures from Zhivotovsky et al. (2003:1179):
|Estimated expansion time (kya)||4.3||35.3||25.3||17.6|
|Effective population size before growth||2609||1883||1760||1688|
In short, these populations are all estimated to have expanded from an effective size of less than 2000 at a time between 17,000 and 35,000 years ago (except hunter-gatherers, who are estimated to have expanded 4300 years ago).
Weaver and Roseman (2005:681) cite a wide range of estimates for mtDNA expansion times: from 200,000 to 40,000 years ago. This range of estimates is wide mainly because of uncertainty about the mtDNA mutation rate. In their simulations, Weaver and Roseman (2005:679) assume a growth starting at 40,000 years ago; this is at or near the latest possible date of expansion for Europe only from other studies (e.g. Harpending et al. 1993; Sherry et al. 1994). Weaver and Roseman (2005:679) assume a preexpansion female effective size of 1000.
Compared to these mtDNA estimates, the microsatellite estimates are not so bad. They are more recent than the most recent possible for mtDNA by a factor of two, but their preexpansion population size is consistent.
What about the SNPs? The autosomal and X chromosomal SNP study they refer to is Marth et al. (2004). Weaver and Roseman (2005:681) say this:
They concluded that all samples showed signatures of population growth consistent with the results for mtDNA. The East Asian and European-American diversity fit a model of a bottleneck followed by growth while the African-American sample fit a model of growth alone. Marth et al. estimated that post-bottleneck growth began 84,000-60,000 years ago for the East Asian sample and 86,000-54,000 years ago for the European-American sample, which overlaps the 200,000-40,000 years ago range for mtDNA. They estimated that growth in the African-American sample started earlier.
Here are the actual figures from Marth et al. (2004:360), for the best-fit models (bottleneck for European and Asian, expansion for Africa), with generations converted to years (assuming 20-year generations):
|Original population size||10,000||10,000||10,000|
|Final population size||20,000||25,000||18,000|
The sharp-eyed will notice a couple of things about these tables. First, the initial population size between the microsatellite estimates and the SNP estimates is different by an order of magnitude. Now, that has an immense effect on the coalescence times expected for autosomal genes. The initial effective size of 10,000 in the SNP study recognizes that autosomal genes have coalescence times ranging from as little as 200,000 years (or less) to as ancient as 3 million years (or older). This range of dates is simply inconsistent with a long-term effective size of 1000-2000. This inconsistency alone means that the microsatellite estimates must be wrong.
This is a more severe problem than it might appear, because the signature for population expansion from the microsatellites requires a very, very small initial population size. This is because the estimates are based on the variance of allele size taken among sites -- meaning that different loci must have nearly exactly the same coalescent time to show a sign of population expansion (Kimmel et al. 1998; Zhivotovsky et al. 2000). The methods used to substantiate an expansion on microsatellite data simply lack the power to detect an expansion from a initial size as great as 10,000.
In other words, expansions are not all alike: the "expansions" estimated for the SNPs lie outside the statistical power of microsatellites entirely; the expansions estimated to explain microsatellite data are absolutely inconsistent with the large initial population size estimated for SNP data. Both these kinds of loci are autosomal: their evolution -- if neutral -- must follow precisely the same constraints.
Of course, the dates are also different, by a factor of three or more. But the other key difference is that these SNPs indicate not a simple expansion, but a bottleneck.
Can this bottleneck be consistent with the diversity of mtDNA? On the surface, it might seem that a post-bottleneck expansion and an expansion are the same thing. And Fay and Wu (1999) showed that certain kinds of bottlenecks might be consistent with mtDNA disequilibrium and nuclear DNA equilibrium. But the bottleneck estimated by Marth et al. (2004) is much less severe than the simulations of Fay and Wu (1999): [CORRECTION 9/5/05: these bottlenecks simulated by Fay and Wu (1999) are] three times longer (30,000 years) and involves one-half to one-third the population size during the bottleneck. In contrast, the simulations that are most consistent with the estimates of Marth et al. (2004) show that no large effect is expected upon mtDNA variation.
The same conclusion may be drawn from the simulations presented by Ambrose (1998), who tested whether a bottleneck associated with the Toba volcanic event 71,000 years ago might be consistent with human mtDNA variation. These simulations found that such a bottleneck could not be excluded by mtDNA variation. But they also found that the bottleneck could not by itself explain the pattern of mtDNA variation. Instead, a more ancient reduction in population size must have occurred, if mtDNA is neutral. Such a reduction has not been found for nuclear genomic data.
So, the evidence for population expansion from nuclear DNA is not consistent with mtDNA variation. Nor are estimates taken from microsatellites consistent with those taken from nuclear SNPs. All bottlenecks and expansions are simply not alike. Weaver and Roseman (2005) imply that these different sources of evidence are converging on a single answer. In fact, they are diverging from each other.
Why isn't an expansion just an expansion?
It is important to keep in mind the parameters of the possible models, to understand why evidence for expansions is not all alike. The simplest model of ancient demography is a one-parameter model: a single population size, unchanging over time back to infinity. This is the hypothesis of "no expansion", and it is in fact an assertion: the assertion that a model with more parameters does not explain the data better than the one-parameter model.
The next simplest model of demography is a three-parameter model. The parameters are the population size before a change, the population size after the change (or alternately, the magnitude of the change), and the time that the change happened. (A two-parameter model would include only time and magnitude of change; as long as the actual size of the population is important to us we are stuck with the third parameter.) This kind of model is often called a "two-epoch" model, meaning that the population was one size for some period of time, and another size for a second period of time. The only two-epoch demographic models are simple expansions and crashes.
A population crash causes genetic drift to increase -- and genetic drift tends to eliminate rare alleles from the population. So populations that have undergone a crash are expected to show a deficit of rare (low-frequency) alleles, or a surplus of high-frequency alleles. (This, by the way, is also the prediction of balancing selection.)
In contrast, a population expansion reduces the strength of genetic drift, meaning that rare (new) alleles should be more common than expected if population size had been constant. It takes a while for these new alleles to appear, so the strength of evidence depends on the time of the expansion -- that third parameter.
Now, let's stop to notice a couple of things. First of all, microsatellites are different from SNPs in that SNPs are often unique mutations, whereas microsatellite alleles are length polymorphisms that can be arrived at by lots of different mutations. This means that "rare" microsatellite alleles do not provide the same kind of evidence for expansion that rare SNPs do. In practice, estimates of ancient demography from microsatellite data do not depend on rare alleles at all; instead, they depend on the pattern of allele size variation among different microsatellite loci. That's one reason why the results of the microsatellite and SNP studies look so different: they are using different, apparently incommensurable, observations of variation.
Second, the consideration of the two-epoch model shows two options: a population crash or a population expansion. But one of these -- the crash -- is extraordinarily unlikely to be true for humans.
For one thing, the human population really did expand in size recently. Not only was there an incredible explosion of populations after the advent of agricultural subsistence, but also there is good archaeological evidence for substantial population expansions in the Late Pleistocene (e.g. Stiner et al. 2000).
For another, the human population is subdivided into many populations -- a structure that itself increases the proportion of rare alleles in the global population (Ptak and Przeworski 2002). So a population crash is essentially not an option. If the data significantly refute the one-parameter model (i.e. constant size), then expansion is the only kind of three-parameter model at play.
In other words, human genetic data are biased. They ought to show evidence of strong population expansion. They ought to be inconsistent with constant population size back to infinity.
So why, in so many cases, aren't they?
No expansion? Are you kidding?
As Weaver and Roseman (2005:681) note, Ptak and Przeworski (2002) reviewed more than 400 genomic regions and found no substantial evidence for expansion. And the "expansion" found by Marth et al. (2004; likewise Marth et al. 2003) is not the manyfold population growth that actually happened: it is an expansion from 10,000 people to 18,000 (at a minimum) or 25,000 (at a maximum) (!). As we have seen, only microsatellite data look remotely like the magnitude of human population growth, but they are completely wrong about the initial size and time of expansion.
There is a simple answer: the proportion of rare alleles is affected by things besides population size. As Ptak and Przeworski (2002), population subdivision is one of these. As Polanski and Kimmel (2003) note, ascertainment bias may be another. And natural selection is very likely to be another influence on the proportion of rare alleles -- even at "neutral" sites, considering the effects of linkage to selected sites (Gillespie 2000).
And as discussed by Marth et al. (2004), nuclear SNP data actually do not fit the three-parameter model. For non-African populations, they fit a five-parameter model: a bottleneck. This model reflects a mix of observations at different sites: there is an excess of low-frequency (rare) alleles, at the same time there is an excess of high-frequency alleles (Sherry 1996), as compared to both the three-parameter and one-parameter models. This excess of common alleles does not greatly reduce the appearance of a slight expansion in the three-parameter model, so it cannot account for the mismatch between genomic variation and archaeological data. But it may also result from the effects of selection, as certain SNPs may have been driven to high frequencies by positive or balancing selection.
Eswaran et al. (2005) suggest another explanation for the signature of a bottleneck in nuclear SNPs. They find that such a pattern is the expected result of the assimilation of archaic human lineages into an expanding modern human population. This pattern contrasts strongly with the expected signature of population expansion under a replacement scenario of modern human origins. They conclude that archaic assimilation is more consistent with the pattern of genomic SNP data than replacement.
As we can see, the explanation of nuclear genomic variation requires the consideration of many complexities that may affect the outcome. These complexities, taken together, mean that nuclear DNA variability bears no simple causal relationship with ancient population sizes. In particular, it cannot replicate the pattern of Upper Paleolithic and Holocene expansions that are reconstructed from archaeological data. Instead, estimates based on genetics (under assumptions of neutrality in up to five-parameter models) are generally more than an order of magnitude different.
It is therefore incorrect to say that nuclear genomic evidence is consistent with mtDNA variation. In fact, it currently appears to be inconsistent, although it is fairer to say that we do not know the relevance (if any) of genomic variation to demographic reconstruction. Strikingly, one of the few ways to make these different sources of data consistent with each other may be to require the survival and proliferation of nuclear gene lineages from archaic humans (Eswaran et al. 2005). Populations did grow, and and this growth may have affected the pattern of mtDNA variation. But the inconsistencies are great enough that they cannot currently be explained by demography alone.
Ancient genes and geography: the data Weaver and Roseman omit
Weaver and Roseman (2005) substantiated their assertion that mtDNA is neutral by using four primary references: Marth et al. (2004), Zhivotovsky et al. (2003), Ptak and Przeworski (2002), and Pritchard et al. (1999). As reviewed above, one of these (Ptak and Przeworski 2002) contradicts their argument, as do two others mentioned more briefly (Pereira et al. 2001; Hammer et al. 2003).
But there are other relevant sources of information not included in the paper. Three of them are absolutely critical, since they bear directly on the hypothesis that genetic material from archaic humans survives in present human populations.
The first category of "missing information" are the many studies of gene-geography relationships by Alan Templeton and colleagues (e.g., Templeton 2002). Based on the examination of the geographic variability of a dozen nuclear genes, these studies have concluded that an Out of Africa replacement of archaic humans cannot explain the pattern of human genetic diversity. Instead, the studies find significant evidence for ancient genetic structure including Europeans and East Asians. Templeton has argued (2002) that the pattern of mtDNA (and Y chromosomal) variation may represent one recent migration among many out of Africa; but the data are also consistent with a recent selective sweep. These data conclusively refute the hypothesis that no archaic gene lineages survived into living human populations.
Templeton's studies have been critiqued on the grounds that they may not actually distinguish survival of archaic gene lineages outside Africa from survival of archaic lineages inside Africa. In other words, it has been claimed (Pearson 2003; Eswaran et al. 2005) that ancient population structure within Africa might mimic the survival of gene lineages from outside Africa. Eswaran et al. (2005) show that this scenario is likely not the case, as nuclear genomic data apparently reflect the survival of archaic non-African lineages. But this equivocation bears little importance to the explanation of mtDNA: even the survival of archaic lineages from within Africa challenges the idea that mtDNA variation reflects the expansion of one small African population and the displacement of others. Instead, survival of archaic African lineages suggests that the ancient population size of Africans was effectively much larger (perhaps many orders of magnitude larger) than a neutral mtDNA hypothesis would admit. Together Templeton's work and the analysis of Eswaran et al. (2005) indicate that the majority of genomic loci preserve allelic variation that originally characterized archaic human populations.
The second category of evidence missing from Weaver and Roseman (2005) also bears on this issue of archaic survival. In addition to the genomewide analyses and Templeton's phylogeographic studies, both of which suggest the survival of a large proportion of archaic gene lineages, recent work has uncovered several genes that cannot be accommodated within the framework of a recent mtDNA expansion and replacement of archaic humans. These genes include (but are not limited to) the region around Xp21.1 (Garrigan et al. 2005), the Xp/Yp and 12q telomeric regions (Baird et al. 2000), and an inversion on 17q21.31 (Stefansson et al. 2005). Several other loci were discussed at the 2005 AAPA meetings, and I know of a few more that are in press.
In short, genomic data are not consistent with mtDNA neutrality, and a growing number of detailed studies have documented loci that represent the survival (and proliferation) of archaic human gene lineages. Much of this literature on specific loci has emerged in the last year; it is no surprise that it is missing from Weaver and Roseman (2005).
But Templeton's analyses have been well known for years, and they bear directly on the issue. They are mentioned by Weaver and Roseman (2005:677) only as evidence that anthropological geneticists "favor a predominantly extra-European origin for the earliest modern populations in Europe" (!!!).
The third category of missing information is the fossil and archaeological record. According to Weaver and Roseman (2005:682):
Our results stress the importance of fully integrating archaeological, fossil, and genetic evidence in investigations of modern human origins.
It is therefore striking that the paper includes no acknowledgement of the fossil and archaeological evidence that supports Neandertal-modern reproductive continuity. This evidence includes (but is not limited to):
- The persistence of Neandertal traits in post-Neandertal populations (Frayer 1993; Duarte et al. 1999; Wolpoff et al. 2001; Trinkaus et al. 2003). Trinkaus (2005:218) concludes that the model of no interbreeding between Neandertals and modern humans is "intellectually dead."
- The intergradation of Neandertal and contemporary populations, illustrated by their substantial morphological overlap in West Asia (Kramer et al. 2001; McCown and Keith 1939).
- Shared directionality of morphological evolution in Neandertals and contemporary populations (Hawks and Wolpoff 2001).
- The cognitive continuity of Neandertals and succeeding populations, evidenced by their substantially shared technical and symbolic ability (d'Errico 2003).
It seems clear that no genetic model that excludes a role for Neandertal genetic persistence can be considered to be "integrated" with these archaeological and fossil observations. Certainly considerable disagreement exists about the level of Neandertal contribution to later Europeans (as well as the level of Upper Paleolithic contribution to the more recent European gene pool). But evidence of intermixture is clear and recognized even by those who suspect that the actual level of such intermixture was low (Bräuer et al. 2004). Together with the full pattern of genomic evidence, it seems clear that such intermixture is a potent explanation for the evolutionary pattern of the early Upper Paleolithic in Europe, as well as other regions during the Late Pleistocene.
The bottom line
The issue here is not whether a population expansion occurred in the Late Pleistocene and Holocene. It certainly did. But does this expansion by itself explain the distinctive pattern of human mtDNA variation? And if so, does the fate of the Neandertals hinge on this demographic hypothesis?
A consideration of a fuller set of genomic data indicates that the answer to both these questions is no.
Human mtDNA has very likely been under positive selection. The evidence for this selection is as strong as for nearly any other selected locus. Although the specific target of the most recent selective sweep has not yet been identified, the same is true of other genes that are believed to have been under selection, such as FoxP2. The pattern of variation cannot be explained by population expansion, because other genomic regions are either inconsistent with mtDNA, inconsistent with each other, or inconsistent with any expansion at all.
Determining whether Neandertals particularly contributed genetic material to the living human population is a challenge. Even if clear evidence of archaic lineages is found, it is difficult to substantiate that these lineages were found in a particular region of Europe over 40,000 years ago.
Yet, substantial evidence of archaic lineages has been found. There is no question that some -- perhaps most -- human genes preserve allelic variation from archaic human populations.
The morphological and archaeological evidence suggest strongly that Neandertal genetic lineages survived into later Upper Paleolithic populations. Ultimately, the genetic test of Neandertal survival may be carried out by finding nuclear DNA sequences from Neandertal fossils themselves. Until that time, we can say only that some Neandertal contribution to the modern human nuclear gene pool is consistent with the known evidence.
Baird DM, Coleman J, Rosser ZH, Royle NJ. 2000. High levels of sequence polymorphism and linkage disequilibrium at the telomere of 12q: Implications for telomere biology and human evolution. Am J Hum Genet 66:235-250.
Duarte C, Maurcio J, Pettitt PB, Souto P, Trinkaus E, van der Plicht H, Zilhao J. 1999. The early Upper Paleolithic human skeleton from the Abrigo do Lagar Velho (Portugal) and modern human emergence in Iberia. Proc Natl Acad Sci U S A 96:7604-7609.
Marth G, Schuler G, Yeh R, Davenport R, Agarwala R, Church D, Wheelan S, Baker J, Ward M, Kholodov M, Phan L, Czabarka E, Murvai J, Cutler D, Wooding S, Rogers A, Chakravarti A, Harpending HC, Kwok PY, Sherry ST. 2003. Sequence variations in the public human genome data reflect a bottlenecked population history. Proc Natl Acad Sci U S A 100:376-381.
Marth GT, Czabarka E, Murvai J, Sherry ST. 2004. The allele frequency spectrum in genome-wide human variation data reveals signals of differential demographic history in three large world populations. Genetics 166:351-372.
Niemi AK, Moilanen JS, Tanaka M, Hervonen A, Hurme M, Lehtimki T, Arai Y, Hirose N, Majamaa K. 2005. A combination of three common inherited mitochondrial DNA polymorphisms promotes longevity in Finnish and Japanese subjects 13:166-170.
Stefansson H, Helgason A, Steinthorsdottir GTV, Masson G, Bernard J, Baker A, Jonasdottir A, Ingason A, Gudnadottir VG, Desnica N, Hicks A, Gylfason A, Gudbjartsson DF, Jonsdittir GM, Sainz J, Agnarsson K, Birgisdottir B, Ghosh S, Olafsdottir A, Cazier JB, Kristjansson K, Frigge ML, Thorgeirsson TE, Gulcher JR, Kong A, Stefansson K. 2005. A common inversion under selection in Europeans. Nature Genet 37:129-137.