john hawks weblog

paleoanthropology, genetics and evolution

coalescent

  • Mailbag: Y chromosome Adam

    Thu, 2011-02-17 11:45 -- John Hawks
    Hi John,

    I enjoy your blog very much. I’ve been reading a lot recently on human origins and genetics (most recently, for example, Nicholas Wade’s book Before the Dawn).

    One issue that I find does not seem to be clear in popular science accounts, and I thought you could clarify--- around the so-called Y-Chromosome Adam (or Mitochondrial Eve). Have geneticists determined that “Adam” was actually an individual in the ancestral population? Or is this shorthand for what I understand to be a ‘deme’ or a subpopulation within the ancestral population.

    There is a unique ancestor for the Y chromosome, so it is really an individual. As in the case of the mtDNA, this would not be the only man who was alive at that time, it is just inevitable that at some point all the other Y chromosome lineages have become extinct.

    The reason you haven't heard too much about it lately is that there is a huge dispute about how long ago the Y chromosome ancestor lived. Most estimates put it within the last 70,000 years, which is too young. But we don't really know how much too young. I expect we'll have better estimates upon whole-genome sequencing, but not yet.

  • Time to revise the mtDNA timescale?

    Wed, 2010-08-18 23:35 -- John Hawks

    Krzysztof Cyran and Marek Kimmel (2010) have presented a revised set of estimates of the human mtDNA most recent common ancestor (MRCA). It's an interesting theoretical paper, written for the purpose of developing a method that doesn't rely on the same assumptions as the usual coalescent models.

    Their new method gives an estimate of 174,000 years ago for the human MRCA. They report an upper/lower range as 96,000 to 449,000 years ago. That range does not represent a confidence interval on the estimate, it's an upper/lower based on extreme assumptions about human/Neandertal genetic distance and the human/Neandertal MRCA.

    The Neandertal mtDNA has really affected the way we estimate human MRCA, at least for the mitochondrial genome. Chimpanzees are just too distant. When we compare human and chimpanzee mtDNA genomes, there has been a lot of parallelism and reversal on both lineages, because mutations have hit the same place multiple times. Multiple hits and purifying selection make a mess out of rate estimation -- generally, they make the human MRCA seem a lot older than it truly was. The Neandertals are closer, and are therefore less of a problem.

    But the Neandertal-human MRCA itself was poorly known, as long when we had only chimpanzees to calibrate the mutation rate....

    That's what we discovered earlier this year with the mtDNA genome of the Denisova specimen [1] ("The Denisova mtDNA sequence: The X-Woman"). Denisova is an outgroup to the human-Neandertal mtDNA clade, which diverged from our mtDNA ancestors around a million years ago. Sliding in that branch redated the human-Neandertal MRCA down to 460,000 years ago. Unfortunately, that paper came too late for Cyran and Kimmel [2] to use the revised human-Neandertal MRCA in their calculations. They assumed a date of 511,000 years ago for the human-Neandertal MRCA.

    Still, the paper gives enough detail to work out the effect of a lower human-Neandertal MRCA on their estimate. They obtained their lower bound (96,000 years) by assuming a human-Neandertal MRCA of 389,000 years. If we substitute in the Denisova-informed human-Neandertal MRCA, we can figure that the human MRCA will be around 130,000 years ago or so.

    That's awfully recent.

    I don't want to go too far with these numbers. My first objection is that they all assume the total absence of selection, when we have long known that some human mtDNA clades have been selected in some parts of the world. It's entirely possible that the human MRCA is recent because of natural selection on some mitochondrial-linked phenotype ("Complete Neandertal mitochondrial sequence, and selection on human (not Neandertal) mtDNA", "Has the dam broken on mtDNA selection?", "Selection, nuclear genetic variation, and mtDNA").

    And even if we assume no selection at all, there's not a lot to be gained by increased precision of these estimates. Branch lengths of an mtDNA genealogy give only extremely wide estimates of ancient events. Saying that something happened "around 50,000 years ago, plus or minus 35,000", it hardly matters whether we change that to "around 43,200 years ago, plus or minus 35,000." I would even argue that the round estimate is better, because it doesn't communicate a misleading impression of precision.

    Still, it does a lot of good to know whether estimates are systematically biased in one direction. And this work, combined with what we know about the Neandertal and Denisova complete mtDNA genomes, suggests that our mtDNA branch lengths may have been biased too high.

    It remains to be seen how much of the human mtDNA tree will be affected by this logic. The most recent branches can in many cases be calibrated against historical events, and ultimately parent-offspring comparisons. So those aren't likely to change much. What worries me is that critical period around 30,000--80,000 years ago, when human mtDNA lineages were diversifying worldwide. The timescale of mtDNA divergence is already out of whack with the rest of the genome. Pushing these divergences more recent will make the fit between mtDNA and autosomal estimates worse. But given the wide variance on coalescence times, Cyran and Kimmel's estimates are consistent with the hypothesis that these might be substantially higher -- so it's hard to guess whether the apparent mismatch is real or not.

    I might have missed this paper if it weren't for the press release about it from Rice University. But what a misleading release! It's headlined, "Mother of all humans lived 200,000 years ago" -- which the paper doesn't conclude. If that were the conclusion, it wouldn't be news, because it's confirming a widely-used estimate that's more than 20 years old.

    But there are actually several interesting angles to the story that the press release fails to mention. Their estimation method may prove useful for many species for which we have no good demographic model -- a problem that the release alludes to, but doesn't feature. The method they develop came from a similar process, which had formerly led to a much, much higher estimate of human MRCA. Their estimate is a lot lower -- in large part because they can exploit the Neandertal genetic information. And then there's the likely possibility that the actual MRCA may be much lower, which would truly be unexpected compared to most earlier work.

    At the end of their paper, Cyran and Kimmel give a short discussion of the history of the Out of Africa mtDNA story. They mention the idea that some people favoring the multiregional hypothesis had suggested older dates for the human mtDNA MRCA. Aside from O'Connell [3], however, they didn't cite this literature. The conclusion of a short timescale, with a MRCA around 200,000 years ago, was challenged by a number of geneticists [4],[5]. The most common point was that the upper confidence limit on the MRCA estimate must be very high -- potentially 800,000 years ago or more, because of the great uncertainty about rates, coming from the chimpanzee-human branch length. This remains a problem, although the availability of a Neandertal outgroup helps to clarify which changes on the human lineage are actually recent.

    It's sort of interesting that even in the current paper, we still have an upper estimate of the human MRCA that's nearly 450,000 years ago! I don't think that the assumptions going into that value are realistic, but there's no real upper confidence bound on the central estimate. It might well go as high as 450,000 years, given the huge uncertainty in the depth of the deepest branches of that African mtDNA genealogy.

    So I guess I'm not really sure we've advanced very far in 20 years!


    References

    Synopsis: 
    A study of human variation adds precision to the human mtDNA mutation rate; I compare to results from archaic humans.
  • A low human mutation rate may throw everything out of whack

    Thu, 2010-03-18 16:30 -- John Hawks

    Last week, a paper looking for the genetic causes of Miller syndrome reported the whole genomes of four members of a single family: two siblings with the disorder and their two parents without. The idea was that they would simply compare the affected and unaffected genomes. They would then find candidate loci that might account for Miller syndrome in the affected siblings. By exploiting some other sources of information, they found what they were looking for. Daniel MacArthur covered the story in his post, "Disease hunting with whole genome sequences: the good news, and the bad news".

    I got interested in another aspect of the story. With whole-genome sequences of parents and offspring, it becomes possible to directly determine the rate of mutations in each generation. The paper by Roach and colleagues did just that -- they counted 28 in the 2.3 billion bases of sequence they included in their comparison. That makes a per-site mutation rate of 1.1 x 10-8 per generation.

    Which is a pretty interesting number. You see, it's less than half what it ought to be:

    [O]ur estimated human mutation rate is lower than previous estimates, the most widely cited of which is 2.5 x 10-8 per generation (10) based on three parameters: a human-chimpanzee nucleotide divergence per site (Kt) of 0.013, a species divergence time of five million years ago, and an ancestral effective population size of 10,000. More recent estimates indicate a nucleotide divergence of 0.012 (9), species divergence time between six and seven million years ago (11–15), and ancestral effective population size between 40,000 and 148,000 (16–19). With these parameter ranges and a generation length of 15 to 25 years, the mutation rate estimate is between 7.6 x 10-9 and 2.2 x 10-8 per generation, which is consistent with our intergenerational estimate of 1.1 x 10-8. Our estimate is within one standard deviation (SD) of an earlier estimate of 1.7 x 10-8 (SD: 9 x 10-9) based on 20 disease-causing loci (20). The rate we report is for autosomes, and should be several-fold lower than that of the Y chromosome, as in the male germline more cell divisions occur per generation. Though our rate differs approximately as expected from the recently reported estimate of 3.0 x 10-8 (95% CI: 8.9 x 10-9 – 7.0 x 10-8) for the Y chromosome, the error rates make this difference not significant (21).

    You can see the obvious implication: If this mutation rate is accurate, then the average human-chimpanzee gene divergence has to be up around 11 million years ago. That can be accommodated with a 7-million-year-old species divergence only if we assume a very large ancestral population -- on the order of 50,000 or higher. Or, the ancestral effective size could be lower -- but that would make the species divergence substantially older -- 9 million years or more.

    There is a second implication. Most studies of human genetic variation have assumed that 5-million-year-old human-chimpanzee divergence and the high associated rate of mutations. If the true rate is less than half that, then the coalescence times of human genes are more than double most estimates. That would include our estimates of human-Neandertal genetic differences.

    Well, that's a fine pickle.

    I'm not quite ready to believe the very low rate estimate. The analysis in this paper uncovered tens of thousands of false positives, and had to filter through those to arrive at 28 true mutations. The filtering involved resequencing all the positives to determine which were true and which were false, but maybe there's room in there for a substantial number of false negatives, too.

    If this low estimate were true of the human-chimpanzee divergence, it would imply vastly higher ages for other primate divergences, or a much lower rate on the human lineage specifically. So that allows another check on the process.

    But generally, I'll be looking at whole-genome family comparisons with great interest, because they will give us a much more precise understanding of the rate of mutations and recombinations across the genome.

    References:

    Roach JC and 14 others. 2010. Analysis of Genetic Inheritance in a Family Quartet by Whole-Genome Sequencing. Science (early online) doi:10.1126/science.1186802

    Synopsis: 
    Whole genome sequencing of a family finds a very low number of mutations, suggesting evolution doesn't have the timescale we thought.
Subscribe to coalescent

Neandertals

For years, I've worked on their bones. Now I'm working on their genes. Read more about the science studying these ancient people.

Denisova

From a finger bone of an ancient human came the record of a completely unexpected population. My lab is working on the science of the Denisova genome.

Acceleration

The advent of agriculture caused natural selection to speed up greatly in humans. We're uncovering some of the ways that populations have rapidly changed during the last 10,000 years.

Malapa

Just outside Johannesburg, the Malapa site is producing some of the most exciting finds in human evolution. This site is the headquarters of the Malapa Soft Tissue Project.