john hawks weblog

paleoanthropology, genetics and evolution

Holocene

  • Fast selection in high altitude, but how fast?

    Fri, 2010-07-02 15:56 -- John Hawks

    Did the altitude of the Tibetan plateau lead to the fastest instance of human adaptation yet known?

    That's the claim in the new paper by Xin Yi and colleagues [1]:

    Given our estimate that Han and Tibetans diverged 2750 years ago and experienced subsequent migration, it appears that our focal SNP at EPAS1 may have experienced a faster rate of frequency change than even the lactase persistence allele in northern Europe, which rose in frequency over the course of about 7500 years (26). EPAS1 may therefore represent the strongest instance of natural selection documented in a human population, and variation at this gene appears to have had important consequences for human survival and/or reproduction in the Tibetan region.

    I have a significant criticism of that conclusion, but first I want to say I think this is really cool work. They sequenced 50 whole exomes of people of Tibetan ancestry. An exome is the coding fraction of the genome, leaving out the non-coding stuff. This let them do a genome-wide association including every SNP they found. As it turns out, the key gene (EPAS1) has no coding SNPs that differentiate strongly in these samples. It's an intronic SNP that shows a really large frequency difference (87% in Tibetans, 9% in Han Chinese). That's a really big difference.

    And it takes a big difference to test neutrality in this sample. Fifty exomes is a whole lot of sequencing, but it's really a small sample for finding selection. It takes a really big frequency change to exceed chance. Besides that, most new adaptive mutations will be missed because they haven't gotten off the ground yet. Finding one major allele that correlates strongly with population, and then doing the work to show its association with red blood cell production, that's all pretty neat stuff. This paper should be added to the paper last month by Cynthia Beall and colleagues [2], who also found an association with Tibetans and made a functional link with high altitude adaptation. This gene is part of the system that adapts people to hypoxia in the Tibet/Nepal area, although it certainly does not act alone and we don't yet know how the system works. It's a solid first step.

    OK, so what's my problem with the paper? Hypoxia is a strong selective agent, affecting performance, health, and -- maybe most important -- birth weight. As soon as people began living on the Tibetan Plateau, they were in a compromised environment. That makes this a really great example of recent selection associated with a novel environment. But the archaeological evidence suggests that people have been living in this environment for a lot longer than 3000 years. The population model in the paper is a mess.

    People have been living on the Tibetan Plateau for more than 15,000 years. They may have occupied the area intermittently before the Last Glacial Maximum, and certainly were in nearby medium-altitude areas of northwestern China before that time. The Paleolithic-era occupation of northeastern highland Tibet was reviewed by Madsen and colleagues [3] and Brantingham and colleagues [4]. Aldenderfer [5] reviewed what is known about Neolithic-era occupation of highland Tibet. Sites with ceramics, evidence of sedentary village occupation and domesticated animals occur between 4000 and 6500 calendar years B.P. That doesn't mean that today's Tibetan population derives entirely from these early Neolithic settlers or the even earlier Paleolithic occupants. But the archaeological record does show that the opportunity for genetic adaptation would have been present long before 3000 years ago.

    So there's a potential inconsistency. The inconsistency could be resolved by recognizing that selection is stochastic. Selection cannot start changing the frequency of an allele until after the mutation has occurred.

    The following passage comes from Nicholas Wade's account of the research, in the NY Times. Wade also picked up on the problem with the demography in the paper, and probed the authors about it:

    Geneticists have a more elastic view of dates than do archaeologists, and the estimate of a Han-Tibetan population split at 3,000 years ago could probably have been adjusted to 6,000 if the geneticists had taken any account of any other kind of evidence.

    Rasmus Nielsen, a Danish researcher at the University of California, Berkeley, did the statistical calculations for the Beijing study. “We feel fairly confident that something on the order of 3,000 years is correct,” he said. But in a later e-mail message, Dr. Nielsen said, “I cannot with confidence rule out that the divergence time is 6,000 instead of 3,000.”

    There is similar flexibility in the estimates of population sizes. The Beijing team calculates that at the time of divergence there were only 288 Han Chinese and 22,642 Tibetans. These estimates have bewildered archaeologists, given that rice cultivation in southern China started 10,000 years ago and that there was an extensive civilization by 3,000 years ago. Dr. Nielsen said that the figure of 288 people was meant simply to indicate a bottleneck in the Han population, meaning a time when it was very small, and that this bottleneck could just as easily have occurred 10,000 years ago.

    I think that's totally remarkable. "Geneticists have a more elastic view of dates than do archaeologists"! I think that phrase should be framed and hung in every classroom teaching anthropological genetics.

    Look at the expansion model. In what universe were there only 288 ancestors of Han Chinese people in the last 3000 years? We're talking about the late Bronze Age, here! This is just after the end of the Shang Dynasty, whose capital at Anyang had a walled area of 1000 hectares. That's 1000 soccer pitches full of city, within an empire that spanned the northern half of China.

    It is completely lame to claim that the model could represent a bottleneck as long ago as 10,000 years. You see, the size of the population determines the rate of differentiation under genetic drift. If the population was big, it shouldn't have changed very fast, so the present populations shouldn't be very different. Putting it into numbers, if there hasn't been a bottleneck for 10,000 years, then the divergence must be a lot older than 3000 years. Probably older than 10,000 years.

    These hypotheses can be tested directly with genetics, and the data are certainly rich enough now to do it. If they point to a genetic bottleneck in China during the last 10,000 years, we should be very, very surprised. Because then who was farming all the millet and rice, and domesticating pigs?

    Does it matter? For EPAS1, the timing really doesn't affect the interpretation of selection -- there's no way that drift made the populations as different as they are for this one locus. But it seems clear that this is not a new mutation because it has no long, linked haplotype around it that also differs in frequency in the two populations. Selection on a standing variant is indeed newsworthy, as these are hard to find. Since we don't have a long haplotype to date, the only way that we can estimate the timing of selection is with the population model. Use the wrong model, and you get the wrong time. That is probably what has happened here.

    Also, using this weird population model vastly increases the chance that genetic drift could cause large frequency changes in Tibet or China. This makes us much less likely to recognize genes that really have been subject to selection in either population. With respect to EPAS1 the test is conservative, but the genome-wide comparison will miss a lot of genes and give less significant p-values to others. It's a waste, because it means that we have to collect that much more data to get the same result.

    UPDATE (2010-07-06): Rasmus Nielsen has written me to clarify his remarks to the Times and give more information about the demographic model in the paper. I have posted his full remarks along with some comments of my own. It is well worth reading.


    References

  • Humans still evolving...

    Sun, 2009-10-25 22:45 -- John Hawks

    Time has a story about Stephen Stearns and colleagues' work characterizing ongoing selection using the Framingham Heart Study sample:

    If these trends were to continue with no cultural changes in the town for the next 10 generations, by 2409 the average Framingham woman would be 2 cm (0.8 in) shorter, 1 kg (2.2 lb.) heavier, have a healthier heart, have her first child five months earlier and enter menopause 10 months later than a woman today, the study found. "That rate of evolution is slow but pretty similar to what we see in other plants and animals. Humans don't seem to be any exception," Stearns says.

    I haven't had a chance to see the new study yet, and I'll do a little review when I get it. Jerry Coyne has some more information based on a preprint.

    My students have heard me say many times that it would take a sample of thousands of people to test the hypothesis of neutrality within today's population. Well, Framingham is one such sample, and it's not surprising that some things would be found significantly to affect fitness.

    The Time article mentions our work on recent evolution in a very positive way. Of course, the Framingham sample isn't suitable for testing what has been going on during the last 40,000 years; it is about mass selection on phenotypes in the present American population. That will involve mostly selection on standing variants, things that are already common in the population. Some of those may be things that were increasing in the past, others not -- some may even be reversals in direction compared to pre-industrial times. And there's no predicting how they might change in the future, as we continue to change our environment out from under ourselves.

    I've seen a few comments that we shouldn't trust the sample because it's unrepresentative, too small, etc. I think people may be overlooking the fact that the Framingham Heart Study is bigger than the census sizes of many species in nature. You can detect selection on phenotypes in this sample, and they surely know the heritabilities of many of them. But I'll have to see the paper.

  • Neolithic migrationism

    Tue, 2009-10-13 15:34 -- John Hawks

    Dienekes has a nice post about the relation of Neolithic Europeans, migration models, and how anthropological views of migration have changed over the last century. He starts with Carleton Coon, although he might have gone back substantially earlier.

    I'll note that Franz Weidenreich, writing shortly after the cited work by Coon, had a very different view of the essential data underlying migrationism, especially the trend toward brachycephalization.

    Anyway, he traces the move from full-on folk migration to "demic diffusion" and "acculturation" models, back through recent genetic work that suggests some substantial genetic replacement -- either by means of selection or folk migration/demographic expansion.

    We have come full circle. Once again, Paleolithic Europeans assume the status of survivors, as their typical lineages are observed in a small minority of modern Europeans. The evidence for widespread acculturation of European hunter-gatherers or their significant genetic contribution to incoming farmers along a wave of advance is just not there. Hunters and farmers possessed distinctive gene pools, and farmers expanded with barely a trace of absorption of hunter gene pools.

    With the India genetics paper from a couple of weeks ago, I think we're seeing that recent large-scale genetic changes are not limited to Europe.

  • End of the ice age, cored

    Sun, 2008-12-28 09:25 -- John Hawks

    A Danish newspaper reports on some recent ice core research:

    A Danish ice drilling project has conclusively ended the discussion on the exact date of the end of the last ice age.

    The extensive scientific study shows that it was precisely 11,711 years ago - and not the indeterminate figure of ‘some’ 11,000 years ago – that the ice withdrew, allowing humans and animals free reign.

    ...

    “Our new, extremely detailed data from the examination of the ice cores shows that in the transition from the ice age to our current warm, interglacial period the climate shift is so sudden that it is as if a button was pressed”, explains ice core researcher Jørgen Peder Steffensen, Centre for Ice and Climate at NBI at the University of Copenhagen.

    These rapid climate reversals seem to have been a growing theme for the last decade or so. I think a review of chronology of the last 40,000 years might be in order here. I'll put it on my list.

  • Weaknesses of evolution

    Fri, 2008-06-06 10:54 -- John Hawks

    The NY Times reports on evolution and education in Texas:

    Starting this summer, the state education board will determine the curriculum for the next decade and decide whether the "strengths and weaknesses" of evolution should be taught. The benign-sounding phrase, some argue, is a reasonable effort at balance. But critics say it is a new strategy taking shape across the nation to undermine the teaching of evolution, a way for students to hear religious objections under the heading of scientific discourse.

    Already, legislators in a half-dozen states -- Alabama, Florida, Louisiana, Michigan, Missouri and South Carolina -- have tried to require that classrooms be open to "views about the scientific strengths and weaknesses of Darwinian theory," according to a petition from the Discovery Institute, the Seattle-based strategic center of the intelligent design movement.

    The story mainly covers the local Texas aspects of the story, with quotes from the state education board chairman ("I believe a lot of incredible things") and some pro-evolution opponents.

    I looked at the website where the Texans for Better Science Education lay out examples of the "weaknesses" that should be taught. They're pretty weak, all right. I think that most of these could be included in a science course as "common myths about evolutionary theory."

    Consider these:

    The Cambrian explosion quickly produced all of the basically different body structures, and some of these have since become extinct. This is very different from the evolutionary tree of life, which suggests a slow and gradual increase in body structures.

    No, no it doesn't. Evolutionary theory provides no reason to think that body structures should change at a slow constant rate. The synthetic theory emphasizes why bursts of adaptive change should happen episodically.

    Many life forms persist through large expanses of geologic time with essentially no change. Evolution theory suggests that mutations occur randomly over time and are selected to produce continuing change as the environment continually changes.

    No, no it doesn't. Some organisms may well have relatively constant environments for millions of years.

    Selective breeding has produced only very limited change with no new structures occurring over thousands of years and multitudes of generations of selection.

    Umm... teosinte? I think that biology texts should devote a lot more attention to selective breeding, as the best concrete examples of evolution in action.

    So, that reflects on the basic problem with the idea of teaching evolution's "weaknesses": A real weakness is not a matter of ignorance, but a matter of evidence weighing in favor of some alternative hypothesis. We don't have that here.

  • FOXP2 is really recent, it really did introgress (if it's not contamination)

    Fri, 2008-04-18 10:34 -- John Hawks

    That's the thrust of a technical comment by Graham Coop and colleagues, now online in Molecular Biology and Evolution. The letter refers to the extraction of FOXP2 from two Neandertal specimens from El Sidrón, by Johannes Krause and colleagues, reported last year (I wrote about the paper here).

    First, the bad news. The current letter raises the prospect of contamination. Notably, the controls applied by Krause et al. (2007) may be relatively weak evidence against contamination, because of polymorphism within large human comparative samples. The tests rely on the assumption that there is little DNA from living humans in the samples. But if we cannot distinguish Neandertal from human DNA with great accuracy, then we will be mistaken some proportion of the time. Krause et al.'s test, based on derived human alleles absent from the Neandertal genome draft, can still go wrong if the human contaminants happen to have all the ancestral (non-derived) human alleles.

    Well, that seems to be the story these days with Neandertal DNA extraction. No test of contamination is good enough. (And remember, that every "test" of contamination is really a procedure for excluding the hypothesis that ancient sequences are identical to recent ones.)

    Now, the more interesting news. Coop and colleagues verify that the selective sweep affecting human FOXP2 was indeed recent -- they estimate 42,000 years ago:

    To demonstrate this, we estimated the time of the most recent common ancestor (tMRCA) of the selected haplotype (see Figure 1), using an approach sometimes called phylogenetic dating (Thomson et al. 2000; Hudson 2007). This method does not make assumptions about demography and selection, but only requires that the mutations in the intron be neutral or nearly neutral. Taking this approach, we obtained a mean tMRCA of 42 Kya (see SOM for details). While there is considerable uncertainty associated with this estimate, it is surprisingly recent if selection took place over 300 Kya (see SOM). In other words, the selective scenario proposed by the authors cannot account readily for patterns of variation in modern humans. Given that we have no power to detect a beneficial substitution that occurred over 250 Kya, (cf. Sabeti et al. 2006) yet we see a footprint of positive selection at FOXP2, the conclusion of a recent selective sweep at FOXP2 is not surprising (Coop et al. 2008:3-4).

    FOXP2 is in one of the ENCODE regions, so its variation is pretty well known. This is not a problematic case: it has a very limited amount of variation around it, and has a strong excess of rare alleles, both signs of a recent sweep.

    Coop and colleagues suggest that the beneficial human allele spread into Neandertals (or vice versa) by low levels of gene flow coupled with its selective advantage -- in other words, introgression.

    They do allow for an alternative -- perhaps the two amino-acid-coding mutations were not the target of selection, but instead some linked locus. This would not erase the necessity of gene flow from Neandertals, but would question whether this gene flow had involved the FOXP2-language scenario, since it might be some linked gene unrelated to language.

    (CORRECTION (2008/04/18): If selection were on a linked site, then Neandertals might share the human-derived amino acids as a result of ancient shared ancestry with humans, while the linked selected sweep might be absent in Neandertals, not necessitating any gene flow.)

    I doubt this hypothesis of a linked sweep, since the two sites with human-derived substitutions are otherwise very strongly conserved among mammals. This looks like a credible target for recent selection. But the hypothesis of selection on a linked site cannot presently be tested.

    So that's the story. It seems very likely that Neandertals got the language gene from us, or us from them, long after many other genes in the two populations diverged. I write "many" rather than "most" because we haven't really been able to assess the proportion of derived alleles shared by humans and Neandertals. The completion of the draft sequence may help, but I'm afraid that the specter of contamination is going to keep on being raised whenever a part of the Neandertal draft genome looks humanlike.

    (via Dienekes)

    References:

    Coop G, Bullaughey K, Luca F, Przeworski M. 2008. The timing of selection at the human FOXP2 gene. Mol Biol Evol (in press) doi:10.1093/molbev/msn091

  • Why human evolution accelerated

    Wed, 2007-12-12 07:50 -- John Hawks

    n. b. This is a story about my work on recent human evolution, describing some of the main results and how the work came about. The story refers to my paper (with Gregory Cochran, Eric Wang, Henry Harpending, and Robert Moyzis), "Recent acceleration of human adaptive evolution," which came out in December, 2007.

    Like most good stories in biology, this one begins with Darwin. Darwin was always very interested in animal breeding, which he considered the best analogy for the process of natural selection. Of course, if you're breeding livestock and want to select for some characteristics, it is important to select from as large a herd as possible, because large populations have more variation in them. Darwin recognized this as an important condition for natural selection, which relies on sufficient variation in natural populations.

    [A]s variations manifestly useful or pleasing to man appear only occasionally, the chance of their appearance will be much increased by a large number of individuals being kept.... Hence, number is of the highest importance for success.

    These words from the Origin, "number is of the highest importance for success" were influential.

    This is a quick review of the research, based on a presentation I gave earlier this year. It is not complete, and glosses a number of very important details. A close reader looking for how to do genomics would be better served reading the actual research paper. Here, I'm trying to express the science for everyone else.

    By 1930, R. A. Fisher picked up Darwin's idea about numbers, predicting that evolution in large populations could be faster than in small populations. However, this is not in all circumstances, but only where the number of new adaptive mutations is quite small -- in other words, where evolution is "mutation-limited":

    The great contrast between abundant and rare species lies in the number of individuals available in each generation as possible mutants.... The importance of the contrast lies with the extremely rare mutations, in which the number of new mutations occurring must increase proportionately to the number of individuals available.

    A long history of research in plant genetics (corn breeding), microbial chemostat experiments, and the examination of pesticide resistance in insects support Fisher's concept. For example, flies subjected to low doses of pesticide in the laboratory tend to acquire very complicated patterns of resistance -- involving slight changes in many different genes. These usually aren't transmitted perfectly and often have fitness costs; it's a very imperfect adaptation. But if pesticide is sprayed over a large area, flies sometimes appear very quickly with a single mutation that confers very complete resistance. Here, the very advantageous resistance mutation is incredibly rare -- it only occurs in maybe one in a billion flies. It would never occur in the small laboratory population.

    Our growing population

    Human populations have been growing rapidly during the last 50,000 years or so. That increase began around the time of the Upper Paleolithic -- that's documented by archaeological evidence. There was a later massive increase during the Neolithic. This agricultural transition actually was quite heterogeneous: earlier in West Asia and China, later in Europe, and then later still in subsaharan Africa. Last, we have within the last few hundred years seen a massive increase in numbers associated with industrialization and globalization of technology.

    One day a couple of years ago, Greg Cochran and I were talking about brain evolution. You have to understand, this is long before we knew about any of these genome scans -- they hadn't come out yet. One of the main mysteries of human brain evolution is why it happened apparently gradually for such a long period of time. It is one of the best cases of evolutionary gradualism. But this is a problem, because directional selection would have too be too weak to take such a long time. Now, we know that brain size is constrained in two directions -- larger brains cost more energy to maintain, but smaller brains come with some functional disadvantages. So this creates a situation where new variants that satisfy both constraints -- costing little energy, or making great improvements in brain function -- must be very rare. It should be mutation-limited.

    I remember very well, that at precisely the same moment, we both realized -- "Hey, maybe this great increase in human population size made a difference!" Because as we'll see later, the pattern of change in brain size really changed when populations started to get really big.

    You see, this is one of those very rare cases where the theory preceded the data! It is quite simple; the rate of mutations in a population is a linear product of the rate per genome and the population size.

    Not all mutations are advantageous, and not all advantageous mutations will be fixed. The vast majority are lost. If a mutation has a selective advantage, then the chance that it will proceed toward fixation (and attain high frequency) is 2s -- "s" here is the fitness advantage. That means that 90 percent of new mutations with a 5 percent fitness advantage are simply lost.

    The most beneficial mutations are very rare; it is much more likely that a new mutation will be weakly selected. This is another aspect of selection that has been well-known since Fisher. So the chance of fixation increases with s, but the likelihood of the mutation decreases with s -- in fact, the number decreases exponentially as selection is stronger and stronger.

    If you put all these together, you can predict how many selected changes you should see in a population that has been growing in size. This tells us the number of new adaptive mutations that should come into the population each generation. It is still linear with population size -- a larger population should have more mutations in precise proportion to its size.

    Still, a very small fraction of the mutations in any given population will be advantageous. And the longer a population has existed, the more likely it will be close to its adaptive optimum -- the point at which positively selected mutations don't happen because there is no possible improvement. This is the most likely explanation for why very large species in nature don't always evolve rapidly.

    Instead, it is when a new environment is imposed that natural populations respond. And when the environment changes, larger populations have an intrinsic advantage, as Fisher showed, because they have a faster potential response by new mutations.

    From that standpoint, the ecological changes documented in human history and the archaeological record create an exceptional situation. Humans faced new selective pressures during the last 40,000 years, related to disease, agricultural diets, sedentism, city life, greater lifespan, and many other ecological changes. This created a need for selection.

    Larger population sizes allowed the rapid response to selection -- more new adaptive mutations. Together, the the two patterns of historical change have placed humans far from an equilibrium. In that case, we expect that the pace of genetic change due to positive selection should recently have been radically higher than at other times in human evolution.

    Finding selection in the genome

    Now, it comes to a problem of how we can see recent mutations that have been selected. A genome scan is based on things that vary, not things that are fixed. So we are looking at some window of frequencies. In our study, that was a window from around 22 to 78 percent.

    Before we go too far, it is important to point out that an adaptive gene will be in a window where we can detect it for only a short time -- it spends a long time getting up to an appreciable frequency (here 22 percent, which is our lower ascertainment bound) and a long time going from a high frequency (here 78 percent) to fixation -- this is for a dominant. But it spends only a very short time in the window where we can see it.

    And strongly selected genes go through this window quite a lot faster than weakly selected ones.

    The importance of this is that we will see genes with different strengths of selection at different ages. Our constraint is that right now all the things we can see are variable -- but some are variable because they originated a short time ago and were very strongly selected, and others are variable because they originated a long time ago, but were very weakly selected.

    You can guess, that we expect to see more of the weak ones than the strong ones, because there should be more of them! So the window should give us a view of the strength of selection as well as the number of mutations. If we can estimate the ages of our mutations, then we can predict how many there should be at different strengths of selection, and try to quantify the effect of population size.

    Here, we've drawn a graph showing the number of genes in the window, compared with the number that are still variable in the population -- they are on their way to fixation -- but they are outside the window. This is for a growing population, so you see that the number of these genes increases as you get closer to the present.

    Tip of the iceberg

    There are many more that we can't see than the ones we can see -- this is like the tip of the iceberg. That is one aspect of recent selection; these genes are in this intermediate frequency range for a short time, and there will be many more genes that are too rare for us to see with our current methods, but might be very important regionally or locally in some populations.

    Based on a model of population growth, we expect to see a big peak corresponding to the period when humans were growing rapidly during the Neolithic. The distribution should plunge down toward the present, because selection would have to be so strong on such a recent mutation for us to see it -- we're talking about 20 percent or more. Those just almost never happen. The true number, remember, is the iceberg under the water -- but we must make predictions about the part we can see.

    Linkage disequilibrium and selection

    Now, I need to say a few words about how we find these genes when we scan the genome. The International HapMap consists of a list of over 3 million genetic polymorphisms -- SNPs -- taken from a sample of people with ancestry in Northern Europe, West Africa, and East Asia. When we look at a sample of a long stretch of DNA from several people, we will be considering the frequency of many different polymorphisms.

    But more important, we have studied whether each polymorphism is linked to the others. As a new positively selected allele increases in frequency in a population, it is initially linked to a wide region including many nearby polymorphisms. This induces a long-distance association among SNPs, which is called linkage disequilibrium.

    When we are looking at a stretch of chromosome, what we can observe is that there are areas where recombination seems to be very rare around one SNP -- an in particular where one of the two SNP alleles has almost no recombinant chromosomes, but the other allele appears to have been recombining normally. That kind of mismatch is a strong indication of selection.

    I'm not going into the details of that process right now; I'll be posting some real examples of such LD decay analyses later in the week. After applying the analysis, we found more than 3000 in the Yoruba sample, more than 2800 in Europeans, and more than 2300 in Asians.

    These numbers are very large -- they make it look like this aspect of evolution, positive selection on new adaptive alleles, has been going very fast. But how long a time period are we looking at? Based on the local rate of crossing-over, we can say how quickly LD ought to be broken by new recombinations, and that allows us to derive age estimates. The ages represent the time that has elapsed since the initial mutation that established each adaptive allele.

    Here is a comparison between the ages of selected variants in the African HapMap and in the European HapMap. Let's look at this graph a little bit.

    Selected variants

    Each of these dots represents a number of different genes -- the y-axis is number; this is a histogram. The x-axis is the age. So you see, there are many of these selected genes that started around 10,000 years ago; there are many fewer that started around 40,000 years ago, and even fewer starting 80,000 years ago.

    These fitted lines are what you get if you fit a one-parameter model with very strong selection to these curves. You can fit these without considering the effects of population growth.

    But you notice some differences here between the African and European distributions. Africa has a few more total variants, but it especially has more older variants, before 10,000 years ago. You can see that during that time period, Europe has very few. And Europe has this later peak, where we see an earlier peak in Africa.

    These details are a very good match to demographic growth -- Africa had much larger population size during the Late Pleistocene than Europe, but West Asia, and then Europe had earlier Neolithic expansion than Africa -- so we see these early times have a lot more selected variants within Africa, and later on there is a pulse of adaptive variants in Europe.

    Testing acceleration

    At this point, we have a theory that predicts acceleration of new adaptive variants, and we have data that appear to show a very fast recent rate. But we haven't yet directly tested the hypothesis of acceleration.

    We chose a null hypothesis approach. After all, the rate of change looks like it has been very high recently, but what it if were always very high. A constant rate of change is a null hypothesis -- the hypothesis of no change, or in our case, no acceleration. So we worked out the predictions of this hypothesis: a constant, high rate of selection. If we could show that those predictions aren't true, then we could disprove the null hypothesis and show that adaptive human evolution accelerated.

    We took several different approaches, testing predictions on different kinds of data. For one thing, if the null hypothesis were true, then there should be a whole lot more selected mutations that have already reached or approached fixation, than the relatively small number that we see still varying in human populations. So to test the null hypothesis, we should look for evidence of these fixed selected substitutions.

    That's exactly what we did -- we looked at other means of assessing the number of recently fixed and near-fixed variants.

    Fixed variants

    On the bottom of this graph, we have the European age distribution of variants in our window. This should represent a small fraction of the total number that have happened across this time period. But you can see from this graph, that if the rate was constant, the total number should be very, very large -- since we are looking at 10-generation bins, here we have around 150 predicted substitutions every 10 generations, or around 1/2 per year. Most of these should be way above our window, in fact, as we go back toward 40,000 years ago, almost all should be close to or at fixation.

    This large number of completed sweeps should have vastly reduced human genetic variation, because polymorphisms tend to hitchhike along with nearby selected alleles. Hitchhiking up to fixation tends to eliminate variation. When we look at the effect of hitchhiking under this constant selection hypothesis, the genome-wide average diversity should be less than a tenth of what we actually observe. So that also disproves the null hypothesis.

    How much acceleration?

    Down at the bottom of the graph, you see the predicted number of selected variants over our window, under the hypothesis of population growth -- exactly the demographic growth that really happened to humans. And here you see, that there are many, many fewer of these predicted, and in fact over the long course of human evolution, the rate would have been very low.

    We can put a number on just how low, and when we do that, we can see how much human evolution has sped up. For example, if we have 1/2 of a substitution per year, well, there are around 12,000,000 years separating humans and chimpanzees (6 million since the common ancestor, in both these lineages). So if adaptive substitutions had happened at a constant rate as high as the last few thousand years, we should be looking at around 6 million fixed adaptive substitutions between humans and chimpanzees.

    But in reality there have been nowhere near that number. There are only 40,000 total amino acid substitutions between humans and chimps. Not all those were selected -- maybe only a third. We can add in some additional selected sites outside of coding regions, but still we are looking at an increase in the rate of new adaptive mutations in humans that is 100 times faster than could possibly have been true during most of human evolution.

    Our evolution has recently accelerated by around 100-fold. And that's exactly what we would expect from the enormous growth of our population.

    What is all this selection for?

    We know something about the functional categories of genes inferred to be under selection; we are studying this now. We expect it will keep us busy for some time.

    In a general view, they illustrate the idea that changing cultures and ecologies have been important in changing the pattern of selection. For example, many of the selected genes are involved with pathogen defense -- for new pathogens that didn't always exist. Some are apparently related to metabolism or even directly to diet, in terms of processing new food sources. Of course, lactase is an excellent example in this category.

    These are not the kinds of phenotypes that have a lot of visibility in skeletal remains. But we have a skeletal record of these populations during the last 40,000 years. We know a lot about what they looked like and how they changed. So we may try to relate the pattern of genetic, skeletal, archaeological, and other kinds of changes over time.

    One obvious way to test hypotheses about these changes would be to sample ancient DNA from skeletons. In this way, we could see if the new selected alleles are in them or not. This spring, a paper by Burger and colleagues (PNAS) sampled ancient European skeletons, Neolithic skeletons, for the lactase persistence allele. They didn't find any who had that allele -- not a single one, and this is in Neolithic populations where today the allele is up over 90 percent in frequency. What is going on there?

    Lactase allele over time

    In this case, it is quite obvious by considering population genetics. We have a very good date for this lactase persistence allele, from many sources -- it is around 6000-10,000 years old. And you can see in the figure, a new selected allele will remain at a very low frequency for a long, long time after its origin. Here, these skeletons were sampled at a time when the selection pressure favoring the allele was present, but the allele had not yet increased to a substantial frequency. In fact, this allele would have been rapidly increasing through these intermediate frequencies much more recently -- we're talking here about Roman times. And today it is over 90 percent in Scandinavia, but considerably lower in Italy and Southern Europe.

    In the future, we will be able to sample for genes more widely in ancient skeletons. At the same time, we will be able to sample skeletal changes to try to correlate them with allele origins. That is some research that I have applied for a number grants to support, and I think it will be very promising.

    Conclusion

    I hope that this essay gives an introduction to the work we have done. This was based on a presentation about the research I gave earlier this year. There are many missing ends, and I'll be adding more information over the next several days about ways of testing for selection, as well as some of the more surprising implications of our research. I've written it without a bibliography, which I can direct you to the paper for a full set of references.

    Synopsis: 
    I describe the background of our 2007 work on accelerating human evolution.
  • EVOLUTION IS OVER...WATCH MORE TV

    Thu, 2007-09-06 13:54 -- John Hawks

    That was the message that just flashed surreally on my TV screen, from the old U2 "ZooTV" tour. Yes, that's the one where the Edge is wearing a beret.

    So, having this excellent 17-year-old advice from Bono, I decided to Google "EVOLUTION IS OVER" to see what I would find.

    Here's an old article (2002) in The Observer by Robin McKie:

    For those who dream of a better life, science has bad news: this is the best it is going to get. Our species has reached its biological pinnacle and is no longer capable of changing.

    That is the stark, controversial view of a group of biologists who believe a Western lifestyle now protects humanity from the forces that used to shape Homo sapiens.

    'If you want to know what Utopia is like, just look around - this is it,' said Professor Steve Jones, of University College London, who is to present his argument at a Royal Society Edinburgh debate, 'Is Evolution Over?', next week. 'Things have simply stopped getting better, or worse, for our species.'

    There is more, including quotes from Chris Stringer noting the continuing evolutionary change during the past 10,000 years. There seems to have been much emphasis on trying to predict where things are going in the future.

    Then, there is this article by Freeman Dyson in New Perspectives Quaterly from this summer:

    Now, after some 3 billion years, the Darwinian era is over. The epoch of species competition came to an end about 10,000 years ago when a single species, Homo sapiens, began to dominate and reorganize the biosphere. Since that time, cultural evolution has replaced biological evolution as the driving force of change. Cultural evolution is not Darwinian. Cultures spread by horizontal transfer of ideas more than by genetic inheritance. Cultural evolution is running a thousand times faster than Darwinian evolution, taking us into a new era of cultural interdependence that we call globalization. And now, in the last 30 years, Homo sapiens has revived the ancient pre-Darwinian practice of horizontal gene transfer, moving genes easily from microbes to plants and animals, blurring the boundaries between species. We are moving rapidly into the post-Darwinian era, when species will no longer exist, and the evolution of life will again be communal.

    As he explains early in the essay, Dyson is building here on the ideas of Carl Woese, referring to early microbial evolution as "pre-Darwinian" because of the prevalence of horizontal gene transfer.

    Well, there's surely no harm in watching more TV...

  • Iceman pseudo-aneurysm

    Wed, 2007-04-04 22:30 -- John Hawks

    That's the story in this article by Patrizia Pernter and colleagues:

    A possible cause of death of the Iceman -- a ca. 5,300 BP natural human glacier mummy from the Tyrolean Alps -- is an intrathoracic stone arrowhead. The aim of this study was to prove radiologically his enigmatic cause of death. In August 2005, the Iceman underwent his first multislice computed tomography examination. As the main pathologic finding, the left dorsal subclavian artery contures shows a 13 mm-long part where the vessel wall is damaged and a 3 mm-long irregular pseudo-aneurysm -- a typical complication of a laceration of the subclavian artery. In the surrounding soft tissue a large haematoma is visible. Historic records highlight the fatal destiny of subclavian artery injuries e.g. due to massive active bleeding and shock-related cardiac arrest. Therefore, the Iceman's cause of death by an arrowhead lacerating among others the left subclavian artery and leading to a deadly hemorrhagic shock can be now postulated with almost complete certainty, especially when taking the environmental (3,210 meters above sea level) and historic (5,300 BP) settings into account.

    But did he have the lactase persistence allele?

    References:

    Pernter P, Gostner P, Vigl EE, Rühli FJ. 2007. Radiologic proof for the Iceman's cause of death (ca 5,300 BP). J Archaeol Sci (in press) doi:10.1016/j.jas.2006.12.019

    Tags: 
  • Recent megatsunamis

    Tue, 2006-11-14 13:06 -- John Hawks

    The NY Times has an article by Sandra Blakeslee describing geological evidence for recent (i.e. Holocene) megatsunamis:

    The explanation is obvious to some scientists. A large asteroid or comet, the kind that could kill a quarter of the world's population, smashed into the Indian Ocean 4,800 years ago, producing a tsunami at least 600 feet high, about 13 times as big as the one that inundated Indonesia nearly two years ago. The wave carried the huge deposits of sediment to land.

    It's set up as an opposition between these geologists (notably, Dallas Abbott) and astronomers, who don't think there are enough big space rocks to cause that many recent impacts.

    And of course there is the requisite mythical flood connection:

    Dr. Masse analyzed 175 flood myths from around the world, and tried to relate them to known and accurately dated natural events like solar eclipses and volcanic eruptions. Among other evidence, he said, 14 flood myths specifically mention a full solar eclipse, which could have been the one that occurred in May 2807 B.C.

    (via Gene Expression)

Pages

Subscribe to Holocene

Neandertals

For years, I've worked on their bones. Now I'm working on their genes. Read more about the science studying these ancient people.

Denisova

From a finger bone of an ancient human came the record of a completely unexpected population. My lab is working on the science of the Denisova genome.

Acceleration

The advent of agriculture caused natural selection to speed up greatly in humans. We're uncovering some of the ways that populations have rapidly changed during the last 10,000 years.

Malapa

Just outside Johannesburg, the Malapa site is producing some of the most exciting finds in human evolution. This site is the headquarters of the Malapa Soft Tissue Project.