john hawks weblog

paleoanthropology, genetics and evolution

mtDNA

  • Mitochondrial DNA selection review

    Wed, 2009-05-27 00:05 -- John Hawks

    I was reading through an excellent review of the recent literature about mtDNA and selection, from Damian Dowling and colleagues (2008). The review focuses on the patterning of evidence for selection in ecological and phylogenetic terms, and to some extent upon the function of mtDNA or the mito-nuclear complex of proteins involved in oxidative metabolism. It includes a long passage covering the significant mismatch between mtDNA variation and effective population sizes across animals (but not mammals). A short section discusses the possibility of adaptive polymorphism maintained by mito-nuclear interactions:

    Knowing that deleterious mutations in mtDNA can accumulate within populations because of genetic drift [21], there certainly seems to be scope for mito-nuclear co-evolution to proceed via a ‘compensatory’ model. Under this model, deleterious mutations accumulate in the mitochondrial genome, with selection then favouring an adaptive response in the nuclear genome to restore any compromised metabolic function [24]. In effect, mtDNA mutations will act as the drivers of adaptive evolution in nuclear genes. This scenario is not unlikely, given that more than 1000 nuclear-encoded proteins, which are essential for metabolism, are transported into the mitochondrion [25].

    Additionally, given that at least some mtDNA polymorphism might have been shaped via positive selection [7] and [8], scope might also exist for mito-nuclear co-evolution to proceed via a model in which adaptive mutations in one genome select for a response in the other.

    There has been recent interest in the coinheritance of sex chromosomes and mtDNA. Because the sex-determining chromosome is opposite in birds from mammals, a number of natural experiments may be available to examine the role of coevolution for the mtDNA and co-inherited sex chromosomes. Further, a number of studies have identified a substantial cytoplasmic contribution to fitness and lifespan variance in Drosophila, suggesting that adaptive variation in mtDNA may be segregating within populations.

    The review discusses the possible importance of the adaptive perspective for aspects of biology ranging from life history and aging to speciation (where fast-evolving mtDNA genes may induce hybrid incompatibilities). And sperm are a surprising focus of research -- mtDNA mutations affect motility, fertility, and the outcome of sperm competition. On that topic, more later.

    References:

    Dowling DK, Friberg U, Lindell J. 2008. Evolutionary implications of non-neutral mitochondrial genetic variation. Trends Ecol Evol 23:546-554. doi:10.1016/j.tree.2008.05.011

  • mtDNA selection in Iceland?

    Wed, 2009-01-21 17:11 -- John Hawks

    Leave it to me to have readers unwilling to ignore selection in recent populations! Here's an e-mail:

    Why couldn't the Icelandic genetic changes have been the result of selection that favored some mtDNA lineages rather than others? We know the population of Iceland derived from settlers that were transplanted into a relatively alien climate and ecology, and had to adjust agriculture and subsistence activity to survive there. We know that there were dramatic environmental insults to the population: disease, starvation, eruptions. At least some of these insults would have likely been more severe than the ancestral populations would have encountered, whether they were Scandinavian or "Celtic".

    So why isn't there at least a token mention of selection, either by you or by the authors? Is "genetic drift" that much more likely than selection? Is selection a more academically risky proposition than the comforting mathematics of "drift"?

    This refers to my post earlier this week about the loss of mtDNA haplotypes from medieval Icelanders. Iceland has around half the number of unique haplotypes that other European samples have, despite similar sampling intensity. I went through some quick simulations of genetic drift in a relatively small, medieval Iceland-sized population to show that the loss of these alleles by drift was quite likely. I also pointed out that very large changes in frequency of common alleles were not a likely outcome of drift, even in the small population. This went along with another post, ("Could genetic drift really break your heart?"), which claimed that a relatively new heart-disease risk mutation was not likely to have become common by genetic drift. In that case, I thought that some selection was probably necessary -- both because the South Asian population has been very large for a long time, and because the mutation has a deleterious effect that would impede it from drifting upward.

    But showing that genetic drift could explain the Iceland results is not the same as rejecting the hypothesis of selection.

    In other ancient European samples, it is pretty clear that selection must explain differences in mtDNA haplotype frequencies over time. Neolithic Germans were dominated by a haplotype that is vanishingly rare in Europe today ("Early European mtDNA: only mysterious if you want it to be"). Similar but less extreme results are also true of medieval British and medieval Danes. All over Europe, some mtDNA haplotypes have been proliferating in the last few thousand years, and others have been declining. That is in accordance with other evidence that mtDNA has been selected in recent populations (Mitochondrial DNA adaptations in living human populations"), as well as the associations of various mtDNA haplotypes with human chronic diseases.

    Humans are not exceptional here, either -- for example medieval skeletons of Scandinavian dogs show mtDNA selection has happened to them, too.

    In the case of Iceland, we have a lot of mtDNA haplotypes that have been lost. It is possible that purifying selection took out some number of these. Probably more likely, positive selection may have increased the frequency of one or more common haplotypes, inevitably reducing the collective frequency of the others. Under that hypothesis, the loss of any particular mtDNA haplotype is a random event, but the fact that an unexpectedly large number were lost would be attributable to selection on common haplotypes.

    We could test this hypothesis by considering the frequencies of the more common haplotypes. That would also allow us to examine whether the same mtDNA haplotypes have been favored in other European populations.

    In the meantime, the loss of mtDNA haplotypes in Iceland does appear to be consistent with drift, given what we know about Iceland historical demography. That's the most important part: Genetic drift is not a null hypothesis, it involves many assumptions about demography. When we know information about the demography, we shouldn't pretend that we don't.

    One final point: Is selection an "academically risky proposition"?

    There's no question that I'm the leading proponent of the idea that natural selection has been widespread, powerful and important in human evolution. I didn't exactly elbow my way into this position. Very few people were willing to work with the idea that natural selection materially affected human genetic variation. This has given me some success, partly because I've been able to test easy and obvious hypotheses that R. A. Fisher might have thought up.

    I find all of this to be very puzzling. Why is it, for example, that Rush Limbaugh and the Drudge Report could pick up on the importance of natural selection, but so many human geneticists can't?

    Whatever the reasons, I've been talking about mtDNA selection for ten years. I reviewed the issue here in 2005 ("Selection, nuclear genetic variation, and mtDNA"):

    Within the past 30,000 to 1 million years, human populations have changed radically in longevity (Caspari and Lee 2004), brain size (Lee and Wolpoff 2003; Ruff et al. 1997), diet, and energetics (Leonard and Robertson 1997; Sorensen and Leonard 2001). Human mtDNA variants have been found to be associated with chronic diseases of aging , brain disorders (Zhu et al. 2004), performance in athletes (Niemi and Majamaa 2005), and longevity itself (Niemi et al. 2005). The present pattern of variation also appears to be correlated with climate (Ruiz-Pesini et al. 2004), and may affect the dietary energetics and insulin metabolism (Lowell and Schulman 2005).

    Simply put, variation in mtDNA is a strong target for further research into the effects of aging, metabolism, and disorders of the brain for a reason: it impacts all these areas strongly.

    I've revisited the issue many times in the last four years. At one point, it got to the stage where I wondered if I was writing about mtDNA selection too much. I wrote:

    I'm sure some readers are beginning to think this is mtDNA Selection Central. Believe it or not, I've gotten a lot of requests to cover this topic, which of course is one of the central issues in the Neandertal problem as well as the unraveling of human origins.

    And it's an exciting developing story: it shows how medical genetics is steamrolling the human genetics of the past thirty years. Finding mutations that actually do things has great medical interest, and the search is accelerating. This work is being undertaken by people who have no investment in the idea that variation among humans should be completely neutral.

    After all, what's more important: that a neutral mtDNA lets us trace human migrations, or that understanding mtDNA selection helps us find treatments for Alzheimer's disease?

    Has it hurt my career? Well, I have to admit that some of my colleagues think I'm a total wackaloon. Ten years ago, when I started talking about mtDNA selection, I heard snickers from high-ranking scientists when I ascended the podium. Among other things, I fought peer reviews that said that positive selection in humans must be "rare, if it happens at all."

    All I can say is, if the idea of natural selection has damaged my career, I hope for much more damage like this in the future!

  • Genetic drift eliminated rare mtDNA haplotypes from Iceland

    Tue, 2009-01-20 12:33 -- John Hawks

    How powerful has genetic drift been in recent human evolution? That's the question I raised the other day with reference to the claim that a heart disease risk-inducing allele had become common by drift in India during the last 30,000 years.

    Another paper released earlier this week in PLoS Genetics claims that mtDNA haplotypes have been recently lost from the Icelandic population by strong genetic drift. The evidence for such changes in haplotypes comes from sequencing the mtDNA of thousand-year-old skeletons unearthed in Iceland during the last 150 years. These ancient remains have haplotypes that are found elsewhere in Europe today, but not in Iceland. The conclusion is that the modern-day descendants of these early Iceland settlers have experienced genetic drift within the last 1000 years, relieving them of of a load of rare mtDNA haplotypes.

    Could genetic drift have accomplished this loss of haplotypes? Although the paper does not present any analysis of this question, a quick consideration of some theory will show that genetic drift could easily have caused the observed results. It also shows a contrast between this case and others where genetic drift has been described as "strong". Even in this case, on an island with a limited human population, genetic drift is only "strong" in the sense of eliminating alleles that are already quite rare in the population.

    Today, Iceland is a large population of more than 300,000 individuals. As in many countries, the current size is a result of twentieth-century population growth. From Wikipedia:

    The population of the island is believed to have varied from 40,000 to 60,000 in the period from initial settlement until the mid-19th century. During that time, cold winters, ashfall from volcanic eruptions, and bubonic plagues adversely affected the population several times. The first census was carried out in 1703 and revealed that the population was then 50,358. After the destructive volcanic eruptions of the Laki volcano during 1783–1784 the population reached a low of about 40,000. Improving living conditions have triggered a rapid increase in population since the mid-19th century - from about 60,000 in 1850 to 320,000 in 2008.

    Forty thousand is a large population, in evolutionary terms. But not all these 40,000 people would have counted toward the variance in reproduction of gene lineages (See my discussion of the Wright-Fisher model). If we took a census of the medieval Iceland population, we would find that a large fraction of individuals, maybe half, were children. A rather small fraction would be postreproductive adults. So at most half the population, and probably less, were of reproductive age at any given time. At a first approximation, this population of 40,000 individuals would reproduce like a Wright-Fisher population of less than 20,000. Around half that number will be females -- although probably a bit more than half, since reproductive variance is usually higher among men. So less than 10,000 mating females per generation. Other factors, such as within-family fitness correlations, may limit the effective number even further. I'll assume a value of 5000 effective women, which is certainly an underestimate for the last 300 years, but may not be too much of an underestimate for earlier times.

    We should keep in the back of our minds that the population did not grow instantaneously to this value -- it would have taken a couple hundred years to reach its ultimate medieval size. So genetic drift might have been substantially stronger when these skeletons were being buried than it would have been 300 or 400 years later.

    Helgason et al. (2009) applied an unusual test to demonstrate the genetic difference between early and current Icelanders. They don't consider the frequencies of haplotypes. Nor do they use a measure that would ultimately be influenced by frequencies, like FST. Instead, they consider the number of shared haplotypes between the two samples as a measure of similarity.

    The statistical question is whether the modern and ancient samples share significantly fewer haplotypes than expected if the ancient and modern samples had been randomly drawn from a single population. The authors tested this hypothesis with a randomization test: they repeatedly drew random samples of the same sizes as the ancient and modern samples, from a distribution including both samples. It's a clever technique.

    It doesn't yield an easy answer to the question of how strong genetic drift needs to be to explain the data. But we can address this question by considering the data and nature of the comparison.

    Here, we're talking about the presence or absence of haplotypes that each have a very low frequency. For example, the modern Scandinavian sample in the paper includes 337 distinct haplotypes. The modern Icelandic sample, with a slightly larger same sample size (947 versus 898 individuals), has many fewer haplotypes -- only 172. That's a substantial deficit in the Icelandic sample.

    Now, let's consider the ancient sample, with 73 individuals and 58 haplotypes. Obviously, nearly all the haplotypes were present in only a single sampled individual. This implies that many haplotypes existed in that population that were not sampled in these 73 individuals. It also implies that the average frequency of a unique mtDNA haplotype in that population was less than 1.4 percent (less, in other words, than 1/73).

    We don't know that these alleles were lost in Iceland; what we know is that they weren't sampled today. So a real answer to this question would include some considerations of sampling theory.

    We can do a quick calculation to figure out the chances that an allele of 1.5 percent would be eliminated by drift in less than 1000 years (around 40 generations). Here's a small piece of Mathematica code that begins with an allele at 0.015 frequency, a population of 5000 effective individuals, runs it under drift for 40 generations:

    drift := Module[{popSize, a, result, count, b},
    count = 0;
    For[i = 1, i popSize = 5000;
    a = N[RandomInteger[BinomialDistribution[popSize, 0.015]]];
    For[j = 1, j a = N[RandomInteger[BinomialDistribution[popSize, a/popSize]]]];
    If[a == 0, count++]];
    count]

    The variable count returns the number of trials out of 10,000 that the allele disappeared. For an initial frequency of 1.5 percent, roughly 3 percent of trials (308 out of 10000) end in loss of the allele. That's not a very high proportion of loss by drift.

    Still, we're not interested in loss from the population; we're interested in why haplotypes might not show up in the present-day sample. That might happen for two reasons:

    1. The alleles weren't so common to begin with -- many haplotypes were present in the early population that weren't in the skeletal sample. We can use the simulation to see what happens to rarer haplotypes. For example, if the initial frequency was only half a percent (0.005), then they are lost from the population after 40 generations nearly a third of the time (3142 out of 10000).

    2. The alleles are still in the Icelandic population, but rare enough they didn't get sampled. A sample of 900 people will miss roughly half of haplotypes with frequencies of 0.005 or less. Altering the simulation again, we find that drift achieves this result in roughly 5 percent (526 out of 10000) of cases, beginning at an initial frequency of 0.015.

    So altogether, the observed result is not at all unlikely. Genetic drift in Iceland had the power to eliminate rare alleles, even after the population was founded and grew to its medieval size.

    Still, this "strength" of genetic drift is manifested in its ability to eliminate initially rare alleles. If we ask what effect it would have on a common allele, we see a somewhat different dynamic. For mtDNA, the variance of allele frequency after one generation of drift is p(1-p)/(N), where N is the female effective population size (for autosomal genes with two copies, this expression would include 2N instead of N). After 40 generations, we expect a variance of 40p(1-p)/(N). What that means is that in 95 percent of cases, an allele that begins at 20 percent frequency will be between 13 and 27 percent after 40 generations.

    In other words, even in the small population of medieval Iceland, genetic drift is not strong enough to cause large increases or losses of common mtDNA haplotypes. And the effect of drift is four times greater for mtDNA than for autosomal genes.

    As a result of the loss of rare haplotypes, the medieval Iceland sample shares more alleles with Eastern Europe, Russia, and Spain and Portugal than with today's Icelanders. But if we look at a measure like FST, which is dominated by common alleles, the differences between these populations are very slight.

    I've gone through the example to point out that some kinds of observations are explained well by drift, and others aren't. Genetic drift has been an important cause of some evolutionary changes, even in recent human populations. But in any given case it is a hypothesis. We test this hypothesis by applying our knowledge of demographic history, mathematical models, and other genes. The hypothesis of drift might well be useful to explain one kind of observation (loss of rare alleles), while it is useless to explain others (elevation of one rare allele to a high frequency).

    References:

    Helgason A, Nicholson G, Stefánsson K, Donnelly P. 2003. A reassessment of genetic diversity in Icelanders: Strong evidence from multiple loci for relative homozygosity caused by genetic drift. Ann Hum Genet 67:281-297. doi:10.1046/j.1469-1809.2003.00046.x

    Helgason A, Lalueza-Fox C, Ghosh S, Sigurðardóttir S, Sampietro ML, Gigli E, Baker A, Bertranpetit J, Arnadottir L, Þorsteinsdotter U, Stefánsson K. 2009. Sequences From First Settlers Reveal Rapid Evolution in Icelandic mtDNA Pool. PLoS Genet 5(1): e1000343. doi:10.1371/journal.pgen.1000343

  • Double the migrations, double the fun

    Mon, 2009-01-12 00:25 -- John Hawks

    Several news stories have reported on an article by Ugo Perego and colleagues, titled "Distinctive Paleo-Indian Migration Routes from Beringia Marked by Two Rare mtDNA Haplogroups." The Discover blog, 80beats, has a good two-paragraph summary of the results:

    In the study, published in Current Biology [subscription required], a team led by geneticist Antonio Torroni analyzed entire genomic sequences of mitochondrial DNA, the genetic material in cells’ energy-generating units that gets passed from mothers to children…. The researchers focused on the disparate geographic distributions of two rare mitochondrial DNA haplogroups — which are characterized by a distinctive DNA sequence derived from a common maternal ancestor — that still appear in Native Americans [Science News]. Both haplogroups appear to have arisen about 16,000 years ago.

    The researchers found that all the people with the D4h3 haplogroup presently live in South America, while those with the X2a haplogroup live in Canada and the United States, which suggests that the two genetically distinct bands of early humans struck off in different directions around 16,000 years ago.

    I don't have a lot to say about this. Tracking the frequencies and geographic distribution of rare haplotypes poses different issues than doing so for common alleles. Two closely related populations might nevertheless differ in the presence or absence of rare alleles.

    I really just wanted to post with reference to a broader point. If the data don't distinguish between a single migration at one time and multiple migrations at different times, then it's pretty much certain that they won't distinguish between a single migration and multiple migrations at one time.

    The two-simultaneous-migrations model might solve problems so far unaddressed by other models. But it's not obvious that it solves any -- there's no test here, just a discussion of the plausibility of the scenario. Each of these scenarios for New World habitation involves the dispersal of many populations across thousands of years. That means lots of free parameters, even in the simplest of the models. Given that necessary complexity, it seems pretty likely that there's a way for the simplest model to account for the frequencies of two rare alleles. It will take a whole lot more genetic comparisons to really test hypotheses about the founding population.

    References:

    Perego UA and 15 others. Distinctive Paleo-Indian Migration Routes from Beringia Marked by Two Rare mtDNA Haplogroups. Curr Biol 19:1-8. doi:10.1016/j.cub.2008.11.058

  • Glucose metabolism and memory

    Tue, 2009-01-06 08:56 -- John Hawks

    Roni Caryn Rabin reports on a study linking blood glucose spikes to age-related memory decline:

    Researchers said the effects can be seen even when levels of blood sugar, or glucose, are only moderately elevated, a finding that may help explain normal age-related cognitive decline, since glucose regulation worsens with age.

    I wonder if this physiological link may also underlie the association between some mtDNA haplogroups and Alzheimer's. In this case, the paper is able to analyze functional consequences of glucose levels because the authors were able to manipulate glucose levels in experimental animals. Remember this:

    Previous observational studies have shown that physical activity reduces the risk of cognitive decline, and studies have also found that diabetes increases the risk of dementia. Earlier studies had also found a link between Type 2 diabetes and dysfunction in the dentate gyrus.

    Here the causality is not necessarily clear. Maybe people who have healthy metabolic profiles are more likely to be active and less likely to exhibit cognitive declines. In that scenario, you wouldn't necessarily benefit from changing your activity pattern.

  • The unbearable hotness of Neandertals

    Wed, 2008-12-24 01:35 -- John Hawks

    According to the Telegraph (UK), Neandertals became extinct because their mitochondria leaked excess heat:

    Professor Patrick Chinnery, a neurogeneticist at Newcastle University, believes the differences in this mitochondrial DNA could have caused Neanderthals to be inefficient at producing energy, meaning their cells leaked heat.

    He said: "The question is why did Neanderthals disappear? There are lots of explanations to do with changes in climate and the food supply.

    "Differences in these mitochondrial DNA sequences might explain why modern humans were able to survive while Neanderthals were not.

    So, is it true? Did Neandertals go panting into that long good night?

    Well, Siberians may have mtDNA alleles that leak extra heat, and they're not extinct. It seems like a good idea if you don't live in the tropics and have enough food. Because it's not like Europeans lack opportunities to take off a layer to deal with the heat.

    Plus, as the Neandertal morphology waned in Europe, the climate was getting colder, not warmer.

    The real mistake here is assuming that the mtDNA necessarily shared the same fate as the rest of the genome. Sure, there aren't any living members of the Neandertal mtDNA clade, at least that we know of. But that suggests selection favoring human mtDNA, not necessarily Neandertal extinction. The idea of selection is supported by the finding of functional variations between human and Neandertal COX2, which I discussed in August.

    The current research seems like it probably adds detail to this comparison, but that's not an argument for Neandertals going extinct in the heat.

  • Poor Ötzi's doomed mitochondria

    Thu, 2008-10-30 23:24 -- John Hawks

    I must have seen a dozen stories today that started this way:

    Reuters:

    "Otzi," Italy's prehistoric iceman, probably does not have any modern day descendants, according to a study published Thursday.

    Washington Post:

    Sparking a new mystery about early man, Italian scientists have unraveled the DNA of the 5,300-year-old "Iceman" mummy, only to discover that he doesn't appear to have modern descendants anywhere near where he was found in Europe.

    I didn't really think about how funny that line is, until I was talking to someone about it this evening -- there's no way that the mtDNA can be informative about Ötzi's descendants, because, well, it's maternally inherited. D'oh!

    Meanwhile, we can ask what it means that a randomly picked Neolithic man would have a now-extinct mtDNA lineage. According to ScienceNews, Antonio Torroni thinks that it is a case of marker loss:

    It’s possible but unlikely that Ötzi belonged to a fourth branch of K1 that is now extinct or rare, Torroni says. He considers it more probable that a random mutation in the Iceman’s mitochondrial DNA erased the only genetic marker currently used to identify members of the most common K1 branch.

    Could be.

    Whether the sequence was a unique branch of K or a slight variation on a well-represented subtype, there's a natural hypothesis for why it no longer exists, that somehow is mentioned in none of the reports. The Iceman is hardly singular: Remember that the mtDNA pool of Central Europe in the Neolithic was dominated by lineages that are now rare. And Medieval Danes had several mtDNA sequences that are now rare or absent in Scandinavia. And the Cambridge sequence has been increasing in frequency in Britain since medieval times. And so on.

    There's no mystery here. These are large populations, and mtDNA haplogroups have been changing in frequencies between ancient DNA samples and the present. MtDNA has functions that plausibly were subject to changing environments after the Neolithic. This seems like a good candidate for recent selection.

    UPDATE (2008-10-31): A reader writes:

    Hi John,

    You raised the hypothesis that recent selection might explain the apparent dearth of modern examples of his haplotype, with private mutations at 3513T and 8137T. Those mutations are synonymous, which would seem to rule that out, leaving drift as the better alternative.

    That's a good point, and one often raised as a criticism of the selection hypothesis. If a sequence differs from some extant (still-existing) variant by only synonymous mutations, then selection can't explain why one is gone and one is still here. Only genetic drift can explain the extinction of one and the survival of the other.

    But this is not the entire story. In this instance, we have synonymy with one major haplogroup (K) which has coding differences from other haplogroups. Those haplogroups have been changing in relative frequencies in Europe over the last few thousand years. A decrease in the frequency of K would naturally cause rare K variants to become rarer or extinct, even though they are neutral with respect to each other. In this instance, lineage extinction would be the result of selection, even though the extinct lineages have no disadvantage relative to some that still survive.

    Now, is that the case with Ötzi's haplotype? I would say it's a good hypothesis, but not yet testable. We really would like to know the frequency of the haplogroup in the Neolithic. For that matter, further ancient DNA sampling will test many hypotheses of genetic drift and selection, because direct observation of ancient frequencies gives us a source of information that does not depend on sampling models from living populations.

  • Sample sizes and the "Neandertal haplogroup"

    Tue, 2008-08-26 11:47 -- John Hawks

    I have an excellent e-mail question about last week’s Neandertal mtDNA paper, which has provoked a lot of commentary.

    I just skimmed over your comments on the recent paper and I have a couple questions. First, how many Neanderthals did they receive mitochondrial DNA from? I think I read somewhere that it was fewer than ten.

    Second if that is true, what the hell does it mean? I wouldn’t try and predict anything based on even fifty humans from that long ago much less 8 or 9 in genetic terms. I don’t think that anyone else would either unless they are grandstanding. You can’t prove a negative so they really can’t say that no modern humans have any Neanderthal DNA. Did they study Neanderthals from Asia? I just think they don’t have a good enough sample and until we can resequence a Neanderthal nucleus and bring the little tyke to term and wait for him or her to marry then wait for those kids to have kids will we really be sure we’ve got the goods.

    Krause et al. (2007) list 15 Neandertal partial mtDNA sequences. Ten of these at that time presented relatively long portions, including the central Asian Okladnikov and Teshik Tash specimens, Mezmaiskaya, Feldhofer 1 and 2, Vindija 75 and 80, Scladina, Monte Lessini, and El Sidrón 1252. The same paper lists five additional specimens for which only a very short sequence had been recovered (just enough to diagnose as part of the Neandertal clade), including Vindija 77, El Sidrón 441, Engis 2, Rochers de Villeneuve, and La Chapelle-aux-Saints.

    We do not know that every Neandertal belonged to the same mtDNA clade as those 15 sequences. Some of them may have looked different, possibly including the new clade otherwise present in later Upper Paleolithic and living people. But based on the 15 sequences we have, we can say that a large fraction of Neandertals must have carried the “Neandertal haplogroup.” Exactly how large a fraction depends on what we are willing to believe about contamination, preservation, and the randomness of our sample.

    Now, let’s consider the question: Can we predict anything about Neandertal evolution and relationships based on this small, possibly unrepresentative sample of mtDNA?

    The answer is that it doesn’t matter very much whether we have 5 sequences or 500. If 15 out of 15 specimens from different sites across Europe preserve a single mtDNA haplogroup, we can’t say it was universal, but we can say it was common. If 40 out of 50, or 400 out of 500 specimens had the same haplogroup, that would increase the precision, but not change the basic fact: Neandertals had at least one common haplogroup that is now so rare it has never been found in a sample of 100,000 or more people. We deserve some explanation.

    The possible explanations are:

    1. Random genetic drift
    2. Accelerated genetic drift due to demographic turnover
    3. Population extinction and replacement
    4. Natural selection


    Drift

    Random genetic drift is fairly easy to refute, although it might not appear so at first. In favor of drift: There were few Neandertals, and the population size of the succeeding Upper Paleolithic, up through the Last Glacial Maximum, was also small—the best estimates are on the order of 2000 people for Western Europe and 5000 for continental Europe to the Urals (Bocquet-Appel et al.2005). There would have been perhaps twice or more that number across the entire Neandertal range. The effective population size represented by this population would have been smaller; perhaps 3000–5000 for Neandertals and Aurignacian-era people, only half, or around 2000, females. Genetic drift in this small mtDNA population would have been much stronger than for autosomal genes, and very much stronger than in most recent human populations.

    But when we plug these numbers into a model of random genetic drift, it starts to appear very unlikely that drift alone could explain the observations. Let’s assume (falsely) that our Neandertal genetic samples all dated to 40,000 years ago, and the female effective size was 2000 individuals between then and 15,000 years ago, and that the population of Neandertal country were a random mating pool. Following these assumptions, on averageall the mtDNA genomes at 15,000 years ago would descend from only 4 or 5 ancestral copies in the population 40,000 years ago. If these five ancestral copies were, by chance, a different haplogroup from the 15 copies we’ve already found, then drift could explain the data.

    However, this still doesn’t appear very likely. So far, every one of the Neandertals shares a single haplogroup. The frequency of this haplogroup was apparently very high, making it very unlikely that all five ancestral copies would have belonged to some other haplogroups of which we have never found any trace.

    Notice that this argument does not depend very much on the number of Neandertal mtDNA sequences that we have found. The fact that there are 15 helps to constrain the frequency of the haplogroup within the population 40,000 years ago, in our model. That frequency is unlikely to be less than around 85%, assuming random sampling. But suppose there were only five. We would still know that the Neandertal haplogroup was very common in its population, even if we thought it was only 50%. It would still be unlikely to draw four or five ancestral copies and have all of them be some other haplogroup that we haven’t found.

    This gives us a considerable confidence margin against drift. We need it. After all, the Neandertals were not randomly sampled at a single time, and it is possible that some of them actually carried a human-like mtDNA sequence, which we now falsely interpret as contamination. But even with these shadows hanging over us, it would still be unlikely that none of the ancestors of today’s mtDNA variation were like the Neandertal haplogroup.

    Also, the population was not a random-mating pool. When we add geographic structure to the story, which tends to reduce the importance of genetic drift, we find that the possibility that drift alone is almost zero, and it remains very unlikely that a single migration of modern humans interbreeding with Neandertals under random drift could explain the observations, either (Currat and Excoffier2004).

    Extinction

    It is at this point that most geneticists turn to the hypothesis of complete Neandertal extinction. They have a point. Genetic drift apparently cannot explain what we have observed, In their point of view, if genetic drift alone cannot explain the Neandertal mtDNA disappearance, then the only other random process at hand is extinction.

    I think that hypothesis is false. It does not account for morphological similarities between Neandertals and later people, genetic evidence that suggests a strong ancient population structure with introgression, or with the apparent behavioral continuity in the Upper Paleolithic.

    Happily, I don’t have a commitment to random processes. Instead, I think that the mtDNA evolution of Europe was driven by nonrandom processes of demographic turnover and selection.

    Demographic turnover

    Here we come to an important point. No one believes that later Europeans evolved from earlier Neandertals by a random process of genetic drift. Yet that is precisely the hypothesis that most studies have set up to refute. Without question it is valuable to set up boundary conditions under the hypothesis of random genetic drift. But the time has come to investigate more interesting models.

    Personally, I am surprised that more complicated metapopulation dynamics have not gotten more attention as an explanation for the Neandertal mtDNA results. Population sources and sinks are a hot topic in biology, and you would think that anthropologists would have picked up on this. To my knowledge, the only time anyone has examined a population sink model was in 2001, when Milford Wolpoff and I worked with mathematician Per Enflo on such an idea for Neandertals (Enflo et al.2001). This idea deserves a fuller treatment (I think I’ll suggest it as a project for one of my classes this year!).

    In a nutshell, a population sink is a region where the average rate of reproduction is below replacement levels. This region can remain populated only if individuals migrate in from other places. The places that reproduce above replacement are called population sources. The continual migration from sources to sinks creates a genetic gradient. Individuals sampled at any given time in the population sink are overwhelmingly likely to have ancestors not in the sink but in one or more source populations.

    Europe today is a population sink. The population of the continent does not produce enough children to replace itself, and immigration from other parts of the world is high. There are several reasons to suggest that Europe may have been a population sink in prehistory as well. In Neandertal and Upper Paleolithic times, climate fluctuations created unique challenges in Europe, where caloric expenditures were high and food harder to obtain than some other regions.

    Continual migration into Europe would provide a simple explanation for why none of today’s mtDNA haplogroups derive from the European Neandertals. The mtDNA population of 15,000 years ago had a few ancestors 40,000 years ago, and none of these ancestors lived in the sink population—all came from the source population in Africa or West Asia. The Neandertal mtDNA variation would have been a short-lived phenomenon, continually being turned over from source populations. Some Neandertal genes would have survived in Europe for hundreds of thousands of years, but some would have come in with more recent migrants from the population source.

    There are points that argue against this source-sink hypothesis. The Neandertal-human divergence time for mtDNA is not very different than that estimated for the autosomal genome. If a European population sink had made genetic drift more powerful, that should have affected mtDNA more than the autosomes, so we might expect a more recent mtDNA divergence. Still, there is nor reason why the source-sink dynamic need have been constant over Neandertal evolution, and there may have been multiple sources in the Pleistocene, not only Africa and West Asia. Investigating the boundary conditions of the source-sink model and its correspondence to autosomal genetic results would be helpful.

    I should note that mtDNA is not special. Neandertals had lots of traits that are now very rare. The horizontal-oval, or “bridged” mandibular foramen is a prominent example. Out of the relatively small sample of Neandertal mandibles, half have this derived form. Fewer than one percent of recent European mandibles have this form. As for mtDNA, a once-common variant is now very rare. And as for mtDNA, we deserve some explanation. A source-sink model would appear consistent with the continued evolution of such traits during the Upper Paleolithic—a time when the extinction and replacement hypothesis predicts no change in these characters.

    Natural selection

    The other nonrandom hypothesis is natural selection, which would presumably have favored one or more modern human types while eliminating the original Neandertal haplogroup. I won’t say much about that hypothesis here, since I discussed it in my initial post about the whole-mtDNA-genome sequencing. Selection has a leg up over the other hypotheses now because it seems like there’s good evidence it happened.

    Still, selection on mtDNA alone could not explain the total pattern of observations about Neandertals. Physical traits that were once frequent in Neandertals were much less common or absent in later Europeans, and some continued to reduce in frequencies over time. To explain these changes, we must invoke either selection on other traits, or continued demographic turnover in the post-Neandertal population (probably more immigration into Europe) or both.

    So selection on mtDNA has never been a sufficient or necessary hypothesis, even if we assume that other genes carried by Neandertals still survive. But given the current evidence that suggests something distinctive about the mtDNA of recent humans, natural selection may receive renewed attention as a factor in the disappearance of the Neandertal mtDNA haplogroup.

    References


       Bocquet-Appel JP, Demars PY, Noiret L, Dobrowsky D. 2005. Estimates of Upper Palaeolithic meta-population size in Europe from archaeological data. J Archaeol Sci 32:1656–1668. doi:10.1016/j.jas.2005.05.006.

       Currat M, Excoffier L. 2004. Modern humans did not admix with Neanderthals during their range expansion into Europe. PLoS Biol 2:e421.

       Enflo P, Hawks J, Wolpoff MH. 2001. A simple reason why Neanderthal ancestry can be consistent with current DNA information. Am J Phys Anthropol 114:S62.

       Krause J, et al. 2007. Neanderthals in central Asia and Siberia. Nature 449:902–904. doi:10.1038/nature06193.

  • The mtDNA sequence of Paglicci 23

    Sat, 2008-07-19 00:26 -- John Hawks

    Is there anything surprising about finding the Cambridge Reference Sequence in Paglicci 23?

    UPDATE follows at the bottom.

    Original post:

    I don't think it's surprising in the least. No European who has yet been sampled carries a mitochondrial DNA sequence that looks like any known Neandertal mtDNA sequence. The Neandertal sequences had to disappear from the European population sometime. The sequence diversity of modern mtDNA haplogroups in Europe suggests that several of them entered Europe during Upper Paleolithic times, although some entered later. Some Upper Paleolithic Europeans must have had sequences like some living Europeans, and the CRS is one of the most common.

    Does that mean that all Upper Paleolithic Europeans carried sequences that are present in living Europeans? Now, this question cannot really be answered without an exhaustive sampling of Upper Paleolithic remains. The sampling of living Europeans has been pretty exhaustive, but not so complete as to rule out the presence of Neandertal mtDNA entirely.

    But David Serre and colleagues (2004) did a fairly thorough sampling of early Upper Paleolithic European remains, finding that five preserved recoverable mtDNA sequences (out of 40 sampled), none of which had a Neandertal-like sequence. If we add in the two Paglicci specimens, that makes seven. That is enough to show that it's unlikely that more than a quarter of Upper Paleolithic Europeans had Neandertal-like mtDNA sequences. Still, the mtDNA distribution may have changed under the influence of selection, so it's not possible to say much more about the ancestry of Upper Paleolithic Europeans for
    other genetic loci. To know that, we'll want to sample more genes.

    Therein lies the problem. We have enough trouble telling whether Neandertal genomic sequence has been affected by contamination (see my earlier entry on that topic). Neandertals must share a substantial sequence similarity with humans, and most randomly-chosen 100-bp fragments will be identical to any randomly-chosen living person. The problem is even worse when we consider Upper Paleolithic and more recent people. It is inevitable that they will be identical to some living people, and the most common alleles for most loci then will often be the same as the most common alleles today.

    The real debate is about how far we should be skeptical when sequences from ancient specimens look like sequences found in living people. This debate is important not because Upper Paleolithic people shouldn't look like living people, but because Neandertals sometimes should. Sequence identity between Neandertal specimens and living people is expected for almost all the nuclear genome sequence. That means that it is impossible to authenticate Neandertal genomic fragments based on sequence alone, and we must resort to other characteristics of the fragments, such as length, proportion of base misincorporations or deamination, or intrasample polymorphism (more than two alleles is usually bad).

    In this regard, I think that the current study is a helpful example. When we deal with any kind of evidence, we must establish its provenience. Forensic scientists call this "chain of custody" -- that is, how do we know that the evidence is really the same thing that was originally found, instead of something caused by events along the way during analysis? If you're doing a radiocarbon sample, you want to make sure that stray radioactivity hasn't affected your sample. If you're extracting DNA, you want to find the sequence of everyone who could be a possible source of contamination. So the fact that Caramelli and colleagues have done this is a great thing.

    In fact, a 2006 paper authored by Maria Lourdes Sampietro and colleagues, including Caramelli, covered a much larger sample of Neolithic teeth and attempted to track down all cases of contamination by the excavation and laboratory teams. Here's the abstract:

    DNA contamination arising from the manipulation of ancient calcified tissue samples is a poorly understood, yet fundamental, problem that affects the reliability of ancient DNA (aDNA) studies. We have typed the mitochondrial DNA hypervariable region I of the only 6 people involved in the excavation, washing, and subsequent anthropological and genetic study of 23 Neolithic remains excavated from Granollers (Barcelona, Spain) and searched for their presence among the 572 clones generated during the aDNA analyses of teeth from these samples. Of the cloned sequences, 17.13% could be unambiguously identified as contaminants, with those derived from the people involved in the retrieval and washing of the remains present in higher frequencies than those of the anthropologist and genetic researchers. This finding confirms, for the first time, previous hypotheses that teeth samples are most susceptible to contamination at their initial excavation. More worrying, the cloned contaminant sequences exhibit substitutions that can be attributed to DNA damage after the contamination event, and we demonstrate that the level of such damage increases with time: contaminants that are >10 years old have approximately 5 times more damage than those that are recent. Furthermore, we demonstrate that in this data set, the damage rate of the old contaminant sequences is indistinguishable from that of the endogenous DNA sequences. As such, the commonly used argument that miscoding lesions observed among cloned aDNA sequences can be used to support data authenticity is misleading in scenarios where the presence of old contaminant sequences is possible. We argue therefore that the typing of those involved in the manipulation of the ancient human specimens is critical in order to ensure that generated results are accurate.

    The second half of that is kind of scary. Post-excavation contamination rapidly converges on the damage pattern of genuine endogenous DNA. That means that patterns of damaged DNA will not authenticate a sequence from an ancient skeleton.

    Yuck.

    We have to find criteria that will let us assign confidence to ancient sequences. To some extent, a lab that has relatively recent skeleton or mummy can drown out contamination by grinding up more material. But some preservation contexts won't allow that kind of sampling, and remains that are more and more ancient preserve less and less DNA.

    Obviously for some samples it will be impossible to provide equivalent information. Bones from important sites like Krapina have been touched by hundreds of people over the years, many of whom are no longer available for DNA extraction. For newer sites, it might be possible to provide a list of all people who have contacted the bones, but it still might not be desirable.

    Imagine a policy requiring credible researchers to put their DNA sequences on file, before they are allowed to touch a specimen. That would greatly restrict access to new finds, at least until DNA testing becomes much more routine than today. More access restrictions are the last thing we need.

    And I still worry about other sources of contamination. When I was familiar with laboratories, the biggest contamination problems were not the people working on the DNA, but instead were the other samples being studied. Yes, yes, I know. Clean rooms, positive air pressure, the whole lot, all those things keep this kind of laboratory contamination. Well, except for one reason or another sometimes they don't. Sometimes the contaminant is from the next room; sometimes it's from another floor of the same building. It's very hard to detect the source of this kind of contamination unless it's some kind of exotic sequence. And when your result is the CRS, it's hard to get a less exotic than that.

    I am especially interested in this question because of my work on recent evolutionary changes. Our findings show that people have many new alleles that should have been very rare or absent even 5000 years ago. Sampling ancient skeletons is an obvious way to test this hypothesis of recent evolutionary change. Already, results from ancient skeletons have confirmed some cases of recent selection -- for example, the absence of lactase persistence in Neolithic Germans (Burger et al. 2007).

    With Neandertals we have the problems that stem from the use of difference as a means of authentication. If you throw out sequences that look similar to modern humans, then you have a biased estimate of population variation. With more recent people, difference may not exist, and changes in the pattern of variation may be subtle. Positively selected alleles are a rare case where we can bend the biases in our favor, since they are prominent cases where the common alleles today may have been rare or absent in the past.

    UPDATE (2008/07/19): A reader writes:

    In the post you commend the authors of the study for genotyping all people who have come into contact with the bone and its DNA extract. Their main line of evidence against contamination is that the sequence they recover is not any of the people who contacted the sample. However, in our experience this is a control that is nearly useless.

    First, it is nearly impossible to know for sure that one has DNA samples for all people who really have come into contact with the samples. Similarly, it's not possible to know how much or how close contact must be in order to introduce a risk for contamination.

    Second, in our experience, contamination introduced from laboratory reagents is also a great danger. Every enzyme, oligo, etc. can potentially introduce contamination. Extraction and PCR blanks, properly done, can control for contamination by lab reagents. However, the details of these controls are quite important to known (at least as important as knowing the sequences of the bone handlers) to evaluate the data presented.

    In any case, I worry that the field of ancient DNA is sliding towards a criteria of authenticity that is not helpful. To imagine that one can round up, genotype, and then rule out all possible contamination suspects seems unreasonable.

    This jibes with my experience, and the point about laboratory reagents is very well taken. Here's what I wrote back:

    You know, I almost wrote that post more critically, because my contact with labs has always included stories about these weird contamination sources. With one case, it was DNA in the next building that had come in due to a faulty ventilation system. In another lab, an early UP specimen looked like Polynesians. Well, they were working on Polynesians down the hall, but they were pretty sure that all their precautions would keep the ancient stuff completely separate.

    There's no chance you can rule out everything, so why try to get all the people who touched it? Especially since that's going to be useless for most of what you can sample, which has been touched by dozens of people.

    On the other hand, it may be worthwhile to get people who excavate and handle things to be more careful, and taking their sequences adds some seriousness. It may be more for show than genuine contamination control, no doubt. I suppose that's the main purpose of all these proposals about protocols -- mainly ways to prevent arguments between labs, about "you can trust my sequence, but who knows about those guys?"

    Looking from the outside, I think the contamination problem may be no worse for ancient DNA than for other kinds of sequencing, and errors are creeping into databases all the time. The metagenomics scheme is a tremendous improvement, since the computers actually can keep track of this contamination and correct for it. So working on Neandertals is really more a question of getting the algorithm right. Probably, we should try to get people to stop doing extractions for single-gene probing, it's a waste of material, and the question of contamination is never going away. It's really a matter of time before we have all these sequences from specimens that preserve them, and we want to keep as much as possible against the chance of future improvements.

    References:

    Burger J, Kirchner M, Bramanti B, Haak W, Thomas MG. 2007. Absence of the lactase-persistence-associated allele in early Neolithic Europeans.

    Caramelli D and 13 others. 2008. A 28,000 years old Cro-Magnon mtDNA sequence differs from all potentially contaminating modern sequences. PLoS ONE 3:e2700. doi:10.1371/journal.pone.0002700

    Sampietro ML, Gilbert MTP, Lao O, Caramelli D, Lari M, Bertranpetit J, Lalueza-Fox C. 2006. Tracking down human contamination in ancient human teeth. Mol Biol Evol 23:1801-1807. doi:10.1093/molbev/msl047

    Serre D and 8 others. 2004. No evidence of Neandertal mtDNA contribution to early modern humans. PLoS Biol 2:313-317. doi:10.1371/journal.pbio.0020057

  • Were ancient Africans divided into small, isolated bands?

    Thu, 2008-05-08 12:05 -- John Hawks

    Last week when I wrote about the study of African mtDNA variation by Behar and colleagues, I focused on the issue of population size. To me, that must be the first parameter that we try to estimate, because the simplest relevant model of population history -- the Wright-Fisher model -- is described by that single parameter: the number of individuals. If we are going to evaluate evidence for population structure, we first must deal with the question of size.

    The claim in the press release is that the African population was divided into separate populations:

    Doron Behar, Rambam Medical Center, Haifa, said: "We see strong evidence of ancient population splits beginning as early as 150,000 years ago, probably giving rise to separate populations localized to Eastern and Southern Africa. It was only around 40,000 years ago that they became part of a single pan-African population, reunited after as much as 100,000 years apart."

    Is it true? Certainly that describes the model tested in the paper. But is it the right model? Is there evidence to justify that model as opposed to simpler alternatives?

    A real population may be structured in many ways -- by age, by caste or class, by space. If we have samples that are taken from different geographic locations, as in this study, it is natural to test hypotheses about structuring across geography. That's what Behar and colleagues did: they tested a hypothesis of panmixia, or random mating across space.

    Panmixia is the simplest model -- the null hypothesis -- about population structure. If everyone mates randomly, then there is no geographic structure. The population would be a single, unstructured gene pool. The paper refutes this model, demonstrating that people did not mate randomly across the geography of Africa during a certain period of time.

    But the question is: which model do we adopt once we have refuted panmixia?

    I rather like isolation-by-distance as a model for human population history. Isolation-by-distance (IBD) assumes that people travel some distance before they reproduce. It's a simple model -- the distance traveled may vary among individuals, but the variance in this value is the only parameter necessary to predict the structure of the population. IBD can explain quite a lot -- why people look like their neighbors, why intermediate populations on the map tend to look intermediate in allele frequencies, and why selected alleles take some time to disperse across space. It is generally consistent with what we know about hunter-gatherer demography. People tend to stay where they are, but a fairly large fraction move to marry into neighboring groups, and a smaller fraction go beyond the neighboring groups to marry further away. So I think this is the null hypothesis once panmixia is refuted. IBD is not a hypothesis of small, isolated bands -- it is a hypothesis of a geographically dispersed population with gene flow.

    The Genographic Project has done more than any other single project to extend the sampling of human populations. The paper by Behar and colleagues is a testament to that -- they are able to work with a broader and deeper sampling of mitochondrial variation in Africa than has yet been available. This is a credit both to the ambitious goals of the project and to today's genetic technology, which has made it possible to sequence more whole mitochondrial genomes on the project's budget. It is a great example of how spending money can circumvent some theoretical problems.

    Still, the Project likely wanted to maximize the effectiveness of its money, so it focused on sequencing only those variants that were underrepresented or rare in previous studies. From the Methods:

    Samples were chosen to include the widest possible range of Hg L(xM,N) internal variation on the basis of the previously available sequence analysis of the mtDNA control region and are, therefore, biased toward rare variants. In addition, we attempted to focus on branches (e.g., L0d, L0k), populations (e.g., Khoisan), and geographic regions (e.g., Chad) for which the current data were scant. Last, we preferred to sequence variants that the current literature suggested to be rare or anecdotal in any given geographic region (e.g., L0k in the Near East).

    Ummm... wait a minute. This is definitely not what you want to do if you're going to test hypotheses of population history. They have deliberately narrowed their sample in a way that distinguishes Khoisan from other peoples, and have excluded some proportion of variants already known to be common. We can predict, based on the sampling scheme alone, that Khoisan and other people ought to be more distinct that would be expected under a random sampling of each population, and certainly more so than expected under a random sampling of the African continent. This means that if the data were to reject IBD, we would have to examine whether that was because of the population history, or instead because of the sampling scheme.

    Do the data reject IBD? Well, we don't actually know from the paper. The study employs an island model, in which Khoisan and all others are assumed to represent either one panmictic population or two isolated ones. They devised a test based on permuting the number of lineages that they inferred to have existed during past time intervals. An island model with isolation of two populations predicts that each will share some gene lineages lacking in the other -- so-called "private" haplotypes. In contrast, two samples taken from a single panmictic population would each have a small proportion of "private" haplotypes, as well as some number of common haplotypes shared by both samples.

    So, the study (reasonably) tests the null hypothesis that the African mtDNA samples derive from a single panmictic population going back to the mtDNA coalescent. They estimate the date of this coalescent (based on their mutation rate model) as around 200,000 years ago, so this is a test of panmixia in Africa across this time period. They use a permutation test to evaluate the likelihood that some number of closely related lineages would all be private to the Khoisan population, under the hypothesis that they are randomly drawn from the African population as a whole. The lineages they examine are the ones they infer to have been present in the Khoisan population at various time intervals in the past -- again, based on their model of mutation rate. They can disprove panmixia across times after 100,000 years and before 80,000 years. Before this time, too few coalescent lineages are inferred to have existed to obtain a significant refutation of the test of panmixia. After 40,000 years, there are obvious shared lineages between Khoisan and other samples that could only have been shared by gene flow.

    I worry that there is a bias in this test. The authors applied it only to a period of time earlier than the coalescence times of recent shared lineages, but after the diversification of the ancient lineages that are not shared. In other words, there appeared to be a gap in the coalescence times of shared haplogroups. Usually, you would correct the test for multiple comparisons not only across haplogroups, but also across time periods. Given that we are considering a range of 150,000 years, across which there is evidence for gene flow both early and late in that history, what is the significance of the fact that we see few shared lineages at intermediate times? That will be less significant than the values reported in the paper, but how much less it is difficult to predict.

    In the end, what do the observations in the paper mean? In the simplest interpretation, either Africans were not random-mating after 100,000 years ago or regional selection differentiated southern and other African mtDNA pools.

    Did ancient Africans live in two isolated groups? I wouldn't say that: the authors didn't test that hypothesis.

    Did ancient Africans live in small bands scattered across the continent? Well, all ancient humans lived in small bands. The question of whether they were scattered is a question about the population size -- and as I showed last week, the population size during this period of time was not small. So we can imagine a population structure like recent historic hunter-gatherers -- with Africa possibly having something like the population size and structure of indigenous Australians.

    What's the bottom line? The results are consistent with isolation-by-distance in ancient Africans. That model, followed by a subsequent global expansion, has been around for a long time. In 1993, Henry Harpending and colleagues called it the "Weak Garden of Eden" model: a geographically structured African population that underwent an expansion and dispersal to other regions. Certainly for the mitochondrial DNA, this seems to be the model that presently best fits the data.

    What remains in question is how much of the subsequent spread of mtDNA was also reflected by spread of nuclear DNA haplotypes, and how much was induced by natural selection on mtDNA haplogroups. As I continue to write about population histories, we will meet this issue again.

    References:

    Behar DM, 14 others, and The Genographic Consortium (consortium again? Whoa). 2008. The dawn of human matrilineal diversity. Am J Hum Genet 82:1-11. doi:10.1016/j.ajhg.2008.04.002

    Synopsis: 
    Revisiting a paper that claims an African bottleneck, I examine the subject of population structure

Pages

Subscribe to mtDNA

Neandertals

For years, I've worked on their bones. Now I'm working on their genes. Read more about the science studying these ancient people.

Denisova

From a finger bone of an ancient human came the record of a completely unexpected population. My lab is working on the science of the Denisova genome.

Acceleration

The advent of agriculture caused natural selection to speed up greatly in humans. We're uncovering some of the ways that populations have rapidly changed during the last 10,000 years.

Malapa

Just outside Johannesburg, the Malapa site is producing some of the most exciting finds in human evolution. This site is the headquarters of the Malapa Soft Tissue Project.