john hawks weblog

paleoanthropology, genetics and evolution

R. A. Fisher

  • Unraveling Fisher's mysteries

    Wed, 2013-04-10 09:11 -- John Hawks

    Haldane's Sieve has a great post by James Lee giving context to a new preprint from him and Carson Chow: "Our paper: The causal meaning of Fisher’s average effect".

    This paradox continued to bother me over the next several years. Soon after my daughter was born, I indulged one of those wild impulses that strike the sleepless: I emailed my questions regarding this matter to Anthony W. F. Edwards, the last student of the great Fisher himself. Anthony very generously sent me some of his unpublished work and also his correspondence with Falconer about the very article that had spurred my thoughts. This correspondence spanned a period of more than 20 years, and it provided a very poignant portrait of Douglas Falconer as a scientist (Hill and Mackay, 2004). I did not immediately find the answers to my questions in the materials that Anthony sent to me, but they set me on the path toward finding the answers. These are presented in the paper, which will shortly appear in Genetics Research.

    Fisher invented a lot of statistical concepts, many of which are used universally by everybody, even far outside genetics. But he also invented some that nobody else has been able to understand. The "average effect" of an allele is one of them. The concept was central in the development of his "Fundamental Theorem" of natural selection, but why it works is not obvious. Lee's post does a great job explaining why this was an interesting and useful project to undertake.

  • Spatial dispersal, parallel adaptation, and the "Stooge effect"

    Thu, 2010-10-14 00:06 -- John Hawks

    Peter Ralph and Graham Coop have an interesting paper in the current Genetics, titled, "Parallel Adaptation: One or Many Waves of Advance of an Advantageous Allele?" [1]

    Fisher [2] famously considered the case in which an advantageous allele is dispersing through a spatially dispersed population, showing that the dispersal forms a "wave of advance". This work was the foundation for a lot of progress in understanding spatial dynamics of organisms.

    As I discussed in 2008 ("Overstating the obvious"), one of the consequences of the Fisher wave model for human evolution is that advantageous alleles will spread very slowly through the population. During the course of the Holocene, a strongly selected mutation might move only across a radius of a thousand or so kilometers. That provides one explanation for why new advantageous alleles haven't spread very far beyond their points of origin -- they just haven't had time yet.

    Another reason why an allele might not have spread widely is interference from other alleles with similar effects. I mentioned this process last year ("Spatial variation and near-fixed selected alleles"):

    Greg Cochran and I have been discussing this idea for some time. We call it the "Stooge effect". Think of the Three Stooges all trying to run through a door at the same time and getting stuck in the middle. That's what these genes are doing -- all of them are competing to respond to selection, but each is slowed by the presence of the others.

    Ralph and Coop have cleverly combined the "Stooge effect" phenomenon with spatial dispersal. They suppose a case in which two separate advantageous mutations arise in different geographic locations, each affecting the same trait. Each begins to spread independently as a Fisher wave of advance. What happens when they meet?

    As they show, the dynamics in this case give rise to a static equilibrium -- once the "waves of advance" meet, they stop moving, forming a stable boundary. A new favorable mutation makes headway only so long as it has no equally favorable mutation to compete against.

    I like the way they used both analytical approaches and simulations to come to this outcome. The appearance of stable boundaries in a reaction-diffusion system has long been known (demonstrated first by Alan Turing, actually!). But to my knowledge, no one has considered this specific case from an analytical perspective.

    The Fisher equation is not all that simple for most students to work with. If you become familiar with the equation, you will notice the key aspect is that it has two separate components -- a logistic (or reaction) component representing the increase in frequency at a single point in space, and a diffusion component representing the dispersal across space.

    The muscle of the dispersal process comes from the logistic component. Without the intrinsic growth of the selected allele, the dispersal of individuals along the boundary would not carry many copies of the selected allele into new geographic areas. If the local selective advantage dies, the wave of advance rapidly stalls. A static equilibrium arises, with the frequency of the selected allele forming a cline that correlates with the local selection pressure.

    Ralph and Coop's model approximates this case, in a dynamical sense. Each new selected mutation forms an increasing zone in which the selective advantage of other mutations is zero. When those other mutations encounter this zone, they form a stable cline. The cline is stable in the short term, but the diffusion component still disperses copies of an allele; they just lack the muscle to continue their deterministic expansion.

    The most interesting simulations by Ralph and Coop show the two-dimensional case, in which the stable boundaries emerge in a "tesselation" pattern.

    Tesselations

    Figure 6 from Ralph and Coop (2010), showing "tesselations" in 2-d simulations of waves of advance.

    The lower three panes in the figure show the stability of the boundaries between the selected alleles. They proceed to fixation locally, but their dispersal stops where they come into contact with other adaptive alleles. Over the very long term, the population will mix -- the diffusion process will slowly carry all these alleles throughout the species' range. Look at the process after a million generations and the entire zone will be gray. But this dispersal occurs at the neutral rate, where the diffusion term is the only factor driving the dispersal.

    What about humans?

    My graduate student Zach Throckmorton and I have been working in this area for a while now. One of the things that impresses us is the way that much more interesting dynamics can emerge when you alter the assumptions. I learned some of this stuff by talking to Frank Livingstone, who gave a lot of thought to these issues of spatial dispersal and selection as applied to malaria resistance alleles.

    In particular, Frank thought about the case where one allele has a slightly larger advantage than another. In some contexts, this allows the "better" allele to overtake and swamp the expansion of the "weaker" (but nonetheless adaptive) one. In others, the two come to a near standstill, one displacing the other only very gradually. Much depends on the timing of the two mutations and the local conditions controlling their initial dispersal.

    Ralph and Coop briefly consider this case in their paper, noting that the difference in fitness advantage of two alleles will allow one to advance into the range of the other, albeit at a slower rate. In humans, we may be seeing a smaller subset of cases, where one or more of the alleles have not yet established a wavefront. In these cases, the arrival of another wave can disrupt the spatial pattern of the rarer allele. The diploid case gives rise to the possibility of more complex epistases. Well-defined boundaries between selected alleles are rare, and where they occur (as may be the case with HbC and HbS in Africa), many have focused on negative epistasis as an explanation.

    Also, alleles are unlikely to substitute perfectly for each other. In many cases, they may work synergistically -- individuals carrying two selected alleles that affect the same function may outperform those carrying only one such allele. At some point, new selected mutations may start to have diminishing returns, even on a trait like skin pigmentation where dozens of alleles may have been selected in widespread human populations. So the current distribution may to some extent be "frozen", but by a more complicated dynamic than the simple intersection of waves of advance.

    As Coop and colleagues showed last year [3], and we discussed in 2007 [4], there are really only few genes that have approached local fixation in recent human evolution. The current spatial pattern of recently selected alleles doesn't look like a tesselation with many alleles near local fixation. Over most of the Old World, it looks like populations have a very large number of very new alleles, far from fixation, and few up over 70 percent in frequency.

    So the specific scenario in this paper by itself probably does not explain the overall empirical pattern in humans. But if we consider the current pattern as a transient, approximating the early stages of dispersal for many selected alleles, we may not be terribly far off the mark.

    Mutation-limited evolution

    This is a long dense paper and there's a lot in it. One further aspect of the paper that I think is essential is the way that Ralph and Coop reiterate the basic point that more people means more mutations. In their case, they focus on population density over space (population number, when you multiply them) as a constraint on the number of possible adaptive mutations. They apply this idea as a hypothesis to account for parallel adaptations that may have emerged in recent human evolution.

    Multiple mutational origins are likely if the characteristic length is shorter than the physical dimensions of the region. Eurasia measures >8000 km across, and so Table 1 suggests that multiple origins at a single base pair are very unlikely at the lower population density. On the other hand, if the mutational target is large, then multiple origins are likely at low densities, while at high densities independent origins are ubiquitous. The complementary cases of (rho = 2, µ = 10–8) and (rho = 0.002, µ = 10–5) give identical characteristic lengths of 3000 km, although the timescale on which the mutations spread differs. Thus for these two parameter combinations we can expect a few mutations to dominate within continents and for multiple mutations to be common in a population spread across an area the size of Eurasia. Obviously these calculations are very crude, as population densities vary through space and time, and dispersal across continents is not simply a function of geographic distance and individual dispersal. Nevertheless, these calculations suggest that it is plausible that for adaptive traits with reasonable mutational targets (e.g., a change anywhere within a gene or pathway) even low population densities can lead to parallel adaptation across an area the size of Eurasia, and higher densities almost certainly will.

    We note that as human population densities have increased dramatically over time, so too has the probability of parallel adaptation. It is interesting therefore to note that a number of recent human adaptations (e.g., sickle cell alleles) involve repeated changes at very small mutational targets in relatively small geographic areas, while older adaptations from single changes (e.g., skin pigmentation) are more broadly spread.

    They are describing a scenario in which small human populations would have been mutation-limited -- that is, the number of new mutations is small, making it unlikely that adaptive mutations will happen in any given generation. In such populations, the rate of adaptation is limited by the availability of new mutations. In an extreme -- in the very small effective sizes of Pleistocene human populations -- the rate of adaptation may be extremely slow and regional populations may come to differ at many weakly selected loci, which spread very slowly.

    As the population grows, strongly adaptive mutations become more and more likely to happen somewhere in the species' range. Yet they are still relatively rare -- meaning that they have an opportunity to spread fairly far before encountering another equally strongly selected mutation affecting the same trait.

    This process can give rise to very large differences on a continental scale, even when the selection pressures in different regions do not differ. In humans, the dispersal of selected alleles across space may have been significantly accelerated by actual dispersals of populations. It is not a mere coincidence that very widespread alleles in Eurasia also tend to be much older than 20,000 years old -- long-distance dispersals prior to that time had a higher chance of leaving a lasting influence on subsequent populations.

    But as the population gets bigger and bigger, parallel mutations are more and more likely to happen. As Ralph and Coop point out, at the extreme of large population size and likely mutations, you shouldn't see any new mutations emerging and spreading over very large areas. Any of these mutations would be very likely to encounter other new mutations that do the same thing.

    Is this likely in humans? Clearly some mutations have happened recurrently. Making a broken gene is easy -- there's a large mutational target, since a large fraction of nonsynonymous substitutions might do the job. So if there's a net selective advantage to breaking a gene, we ought to see that happen recurrently in human populations.

    In contrast, if the mutational target is very small, then mutations will still be rare even in a very large population. If only one base change can have an adaptive effect, that precise change will happen less than once in 109 births (remember that not just any mutation at a site, but some particular mutation is what we may need). If a rare duplication or gene conversion is the necessary change, then it may be much rarer.

    Looking across the last few million years, when human population numbers were much smaller than the Holocene, we can be pretty sure that some aspects of our evolution were mutation-limited. The changes that took hold in our ancestors were the ones that happened, and that survived the winnowing of genetic drift. Many changes that would have been adaptive didn't happen in our ancestors. They just weren't lucky enough.

    But some of those changes would still be adaptive now, if we could get them. And we have had much larger numbers in the last 10,000 years. Homo erectus needed these mutations, but we only now are seeing them selected in the human population.

    Malaria adaptation

    Hemoglobinopathies are among the cases of easy mutations -- where breaking a gene is adaptive. It's not just any broken version of alpha- or beta-globin that does the job, though. The hemoglobin needs to be impaired in certain ways to impede the parasites while maintaining blood function. This provides many of the classic cases of human adaptation, and Ralph and Coop turn to this system for examples of parallel adaptation:

    The sickle cell allele HbS at the β-globin gene in humans provides a particularly interesting case of putative parallel adaptation. The HbS allele (β6 Glu-Val) has been driven to intermediate frequencies by selection within the past 10,000 years due to increased resistance to malaria of heterozygotes for the allele (HALDANE 1949; ALLISON 1954; CURRAT et al. 2002; KWIATKOWSKI 2005). The HbS allele is present on at least four major distinct haplotypes in Africa, each at intermediate frequency within a different geographic region; the haplotypes are named after the population sample where they were first discovered (Central African Republic, Senegal, Benin, and Cameroon). This is consistent with multiple origins of this single-base-pair change. Note that a distinct, malaria resistance allele, HbC (β6 Glu-Lys), has also arisen in Africa at the same codon as the HbS allele (TRABUCHET et al. 1991; AGARWAL et al. 2000; WOOD et al. 2005a), increasing our confidence that the mutational input was high enough to allow multiple types to arise. However, FLINT et al. (1998) thought the hypothesis of multiple new mutations arising at a single base pair was extremely unlikely and proposed that it was more likely that gene conversion had spread a single mutation across multiple haplotypes.

    The theory we have developed can be used to assess the plausibility of the multiple mutational origins of the sickle cell allele, by exhibiting parameter combinations that yield characteristic lengths consistent with the separation of the sample locations. [Recall that the wave of advance, and thus also our model, works in the case of heterozygote advantage (ARONSON and WEINBERGER 1975).] The different HbS haplotypes co-occur within a few thousand kilometers of each other (see Table 5 of FLINT et al. 1998) (noting that these locations are unlikely to reflect the geographic mutational origins, and mutations will have been spread by large population movements). As the HbS changes occur at a single base pair, the mutation rate would have been 10–8, and we take an s = 0.05 (as in CURRAT et al. 2002). If human dispersal at that time was well approximated by a Gaussian kernel with sigma = 100 km, then a characteristic length of 1000 km would require an effective density of individuals of rho = 25 km–2, while if sigma = 10 km, then we would require only rho = 2.5 km–2. This latter set of parameters does not seem unrealistic, considering our knowledge of population density and dispersal parameters, so our model suggests that the hypothesis of multiple origins is not unreasonable.

    I think they've got the basic idea correct here, but there are some additional details to consider. The distribution of HbE is not quite so easy to understand if parallel mutations are really so likely, and of course there is the negative epistasis of different alleles (and the thalassemias) which impacts their dispersal ability when they become moderately common. The dynamic may be of similar form to the one described here, but boundaries between alleles may be reinforced by the fitness costs of carrying multiple ones.

    This situation raises the issue of path dependence. Some mutations have "first mover" advantages. Once they are common, other adaptive mutations may still occur -- even mutations that are better from the standpoint of fitness -- but be lost or grow very slowly because their net fitness advantage over the common mutant is slight. Where HbE is common, new HbS alleles are unlikely to invade quickly. Where HbS is common, new HbE mutants are similarly unlikely to invade -- even though HbE has a higher fitness.

    Network effects among genes may also dominate the spatial dynamics. HbS spread most widely in the context of populations that were already Duffy null, and in which G6PD deficiency was rapidly increasing. The first conditioned the parasite environment -- P. vivax had a strong disadvantage in Duffy null populations, P. falciparum made up most of the parasite load. G6PD deficiency should have impacted the relative advantage of HbS, more and more as it became more common. Those are two loci among many that alter malaria dynamics in Africa compared to South and Southeast Asia.

    Conclusions

    There is much more to say about this paper -- it's 22 journal pages. But I think I've given an impression of what's there and how the ideas may impact our interpretation of recent human evolution. Many of the central concepts were presaged by earlier work in 2007 and 2008, as reviewed here on the blog. The new analytical and simulation work, I really like.

    Hopefully we can get out some shorter papers that will focus on aspects of these problems as applied to humans. A message that comes across very clearly in our work and this new paper is that different time periods in our evolutionary history must have had very different selection dynamics. Pleistocene humans were not only in a different ecology than us, they experienced a radically lower potential for adaptation.


    References

  • Quote: R. A. Fisher on chance and natural selection

    Thu, 2010-03-04 09:40 -- John Hawks

    From p. 37 of the Genetical Theory of Natural Selection (1930):

    The statement of the principle of Natural Selection in the form of a theorem determining the rate of progress of a species in fitness to survive (this term being used for a well-defined statistical attribute of the population), together with the relation between this rate of progress and its standard error [Fisher describes here his Fundamental Theorem], puts us in a position to judge of the validity of the objection which has been made, that the principle of Natural Selection depends on a succession of favourable chances. The objection is more in the nature of an innuendo than of a criticism, for it depends for its force upon the ambiguity of the word chance, in its popular uses. The income derived from a Casino by its proprietor may, in one sense, be said to depend upon a succession of favourable chances, although the phrase contains a suggestion of improbability more appropriate to the hopes of the patrons of his establishment. It is easy without any very profound logical analysis to perceive the difference between a succession of favourable deviations from the laws of chance, and on the other hand, the continuous and cumulative action of these laws. It is on the latter that the principle of Natural Selection relies.

  • R. A. Fisher's model of adaptation

    Mon, 2009-10-26 01:25 -- John Hawks

    Chapter 2 of R. A. Fisher's Genetical Theory of Natural Selection is remarkable for many reasons. In it, he presents a model of selection in an age-structured population, the concept of reproductive value, and the Fundamental Theorem. Toward the end of the chapter, he discusses "The Nature of Adaptation," presenting a geometric model to justify the assertion that the probability of favorable genetic changes declines as the effect size of those changes increases.

    In order to consider in outline the consequences to the organic world of the progressive increase of fitness of each species of organism, it is necessary to consider the abstract nature of the relationship which we term 'adaptation.' This is the more necessary since any simple example of adaptation, such as the lengthened neck and legs of the giraffe as an adaptation to browsing on high levels of foliage, or the conformity in average tint of an animal to its natural background, lose, by the very simplicity of statement, a great part of the meaning which the world really conveys. For the more complex the adaptation, the more numerous the different features of conformity, the more essentially adaptive the situation is recognized to be. An organism is regarded as adapted to a particular situation, or to the totality of situations which constitute its environment, only in so far as we can imagine an assemblage of slightly different situations, or environments, to which the animal would on the whole be less well adapted; and equally only in so far as we can imagine an assemblage of slightly different organic forms, which would be less well adapted to that environment (38).

    I've highlighted that last sentence, which is saying that organisms fit their environments in possibly many different ways, so that their fitness is not actually tied in any single feature, such as "tint," but is instead an optimum of many features, with respect to any single factor the organism may be more or less well adapted. The rest of that paragraph continues on to make the same point.

    Then:

    The statistical requirements of the situation, in which one thing [the organism] is made to conform to another [the environment] in a large number of different respects, may be illustrated geometrically. The degree of conformity may be represented by the closeness with which a point A approaches a fixed point O. In space of three dimensions we can only represent conformity in three different respects, but even with only these the general character of the situation may be represented. The possible positions representing adaptations superior to that represented by A will be enclosed by a sphere passing through A and centered at O.

    This is really very simple, and the geometric model reveals an interesting switch. Suppose that we imagine an organism as a set of a traits, each of which lies at some distance d1, d2, d3, ..., da from the optimum value for that trait, O. We could imagine adaptation as a stepwise process, in which a any one of the a traits may change, and only those changes that reduce da will potentially be selected.

    But there's no reason at all why we should consider every trait as an independent entity. Suppose that a single genetic change could improve two traits, or three, or even all the traits at the same time.

    Or, more interesting, a change might greatly improve trait 1, while making all the rest of the traits marginally worse. Without a word, Fisher has removed the issue of adaptation from the fit of many separate variables, to a single distance in multidimensional space -- a change from cartesian to polar coordinates, as it were.

    If A is shifted through a fixed distance, r, in any direction its translation will improve the adaptation if it is carried to a point within this sphere, but will impair it if the new position is outside.

    The geometric model really assumes very little. It tells us nothing at all about the relationship between fitness and any particular phenotype, aside from assuming that (1) the relation is continuous within the sphere centered on O with radius A, and (2) there are no "holes" of low fitness within that sphere. This is not a fitness landscape. Fisher's view, as will become clear, was that species are well-adapted in nature. He assumes that the distance between A and O is always rather small in comparison to the kinds of phenotypic effects that mutations might cause. So in assuming that the fitness function is continuous, and that the area is small, he more or less automatically arrived at the assumption that there's only one peak at O.

    The next part is the famous one that people remember:

    If r is very small it may be perceived that the chances of these two events are approximately equal, and the chance of an improvement tends to the limit 1/2 as r tends to zero; but if r is as great as the diameter of the sphere or greater, there is no longer any chance whatever of improvement, for all points within the sphere are less than this distance from A.

    In this model, small changes are roughly 50-50 beneficial versus deleterious, but since their effect is very small, it hardly matters. Big changes are much less likely to be beneficial -- and if they exceed twice the distance from A to O, they can never be beneficial.

    After this quote, he gives an exact expression for the probability that a given change r is beneficial ((1/2)(1-(r/diameter))), but this is limited to the three-dimensional model. Then, there's another interesting one-sentence switch:

    The chance of improvement thus decreases steadily from its limiting value 1/2 when r is near zero, to zero when r equals d. Since A in our representation may signify either the organism or its environment, we should conclude that a change on either side has, when this change is extremely minute, an almost equal chance of effective improvement or the reverse; while for greater changes the chance of improvement diminishes progressively, becoming zero, or at least negligible, for changes of a sufficiently pronounced character.

    Remember that A represents the current "degree of conformity" of the organism to its "particular situation." This degree of conformity might be changed either by changing the organism or its environment -- the implication fo the last confusing sentence from the first paragraph, above.

    The point O is not an objective location (as it would be if we assumed it is the optimum within the current environment). Instead, it is a geometric abstraction that exists only in so far as it bears a relation to A in the present multidimensional space of "adaptation". We may change the distance from A to O either by changing the organism or by changing its environment. Fisher refers explicitly to this "assemblage of slightly different situations" with respect to both -- and again, the concept of "slightly different" underlies the assumption that the fitness function within the sphere is continuous.

    It seems to me that the idea of "niche construction" falls very easily within Fisher's model. An organism that systematically alters its environment may thereby change its level of adaptation. So even though mutation is an obvious candidate for a process described by the model, I've continued to refer to "change" as a nonspecific term for the unit of adaptation.

    The representation in three dimensions is evidently inadequate; for even a single organ, in cases in which we know enough to appreciate the relation between structure and function, as is, broadly speaking the case with the eye in vertebrates, often shows this conformity in many more than three respects. It is of interest therefore, that if in our geometrical problem the number of dimensions be increased, the form of the relationship between the magnitude of the change r. and the probability of improvement, tends to a limit which is represented in Fig. 3. The primary facts of the three dimensional problem are conserved in that the chance of improvement, for very small displacements tends to the limiting value 1/2, while it falls off rapidly for increasing displacements, attaining exceedingly small values, however, when the number of dimesnions is large, even while r is still small compared to d.

    Here we see the problem of universal pleiotropy emerging. As the number of dimensions of adaptability increases, the probability that one change will be a net increase in fitness, considering all dimensions, must decrease. This probability remains larger when the distance between A and O is larger, but declines as the number of dimensions increases.

    However, what we would call "universal pleiotropy" is intrinsic to Fisher's assumption that the direction of changes in this multidimensional space is random. If changes may occur in any direction, this is the same as asserting that a mutation may induce correlated changes between any set of phenotypes. With thousands of genes, and therefore thousands of dimensions of genetic "adaptedness", we might guess that the dimensionality is high enough to make this assumption useful.

    If on the other hand, the direction of changes is constrained in some way, then the dimensionality of the space accordingly is smaller by some degree. This is the rough equivalent of modularity in Fisher's model: if we say that some genetic correlations cannot be changed, we are saying that the phenotypic structure of the organism is modularized.

    The remainder of the section discusses the general case in spaces with higher than three dimensions. The basic point is that the probability that a change of a given size will be adaptive increases with the distance from A to O, decreases with the effect size r, and also decreases with the number of dimensions.

    Fisher finishes with a paragraph that, had he begun the section with it, might have made everything much clearer:

    The conformity of these statistical requirements with common experience will be perceived by comparison with the mechanical adaptation of an instrument, such as the microscope, when adjusted for distinct vision. If we imagine a derangement of the system by moving a little each of the lenses, either longitudinally or transversely, or by twisting through an angle, by altering the refractive index and transparency of the different components, or the curvature, or the polish of the interfaces, it is sufficiently obvious that any large derangement will have a very small probability of improving the adjustment, while in the case of alterations much less than the smallest of those intentionally effected by the maker or the operator, the chance of improvement should be almost exactly half.

    But there is an obvious objection: if you twist a knob on a microscope, it may move one of the lenses, but it isn't going to polish it. It's not possible to effect simultaneous changes on distinct elements of the microscope, because a microscope is in fact modular in its construction and controls. That doesn't disprove Fisher's model, but at least in the microscope case, the possible changes are not random in direction, but are constrained to "Cartesian"-like independent axes.

    Fisher's general idea is different from the fitness landscape, in the sense used by Sewall Wright. Fisher assumes a single adaptive optimum; Wright espoused the possibility of a "rugged" landscape in which large genetic changes might diverge from the nearest local optimum yet place the population near an alternate (possibly higher) local optimum. But the geometric model shares many properties with the fitness landscape, including its assumptions about hill-climbing toward the local optimum.

    I pointed to Sergey Gavrilets work a couple of weeks ago. Fisher's geometric model is most similar to the second meaning of fitness landscape, which pertains to the mean fitness of a population given a discrete genotype -- or in this case, a discrete phenotype-environment combination. Fisher's model is entirely about the relationship of two equilibria: the population mean fitness before and after a given change. It does not deal with the dynamics of the transition from one state to another.

    References:

    Fisher RA. 1930. The Genetical Theory of Natural Selection. Clarendon Press, Oxford.

  • Sergey Gavrilets on the two fitness landscapes

    Sat, 2009-10-10 23:48 -- John Hawks

    Sewall Wright's metaphor of the "fitness landscape" is fundamental in the way many biologists think about adaptation. The idea of a population "climbing" toward "adaptive peaks" is a visually compelling image for the increase in mean fitness that results from selection on many genes.

    However, the correspondence between this metaphor and the mathematics of population genetics leaves several ambiguities that tend to confuse people. One of the main sources of ambiguity concerns the meaning of the spatial dimensions in the fitness landscape. Do the dimensions represent the frequencies of alleles in the population? Or do they represent particular genotypes that individuals may have? Wright used mathematics that implied both approaches in different places. For purposes of metaphorical visualization, the difference between these perspectives may not matter. But if we want to guide our thinking about the evolutionary process, it's helpful to know where real-life cases are supposed to fit.

    Sergey Gavrilets' book, Fitness Landscapes and the Origin of Species takes on this problem in chapter 2. This post comes from my notes about the book, which I read some time ago. So although I've brushed them up, many holes remain -- think of it as a synopsis of points I found worth noting. What I don't have is a thesis -- in case you're wondering why you should care.

    For me (and many others), the most important aspect of Gavrilets' work is the demonstration that a "rugged" landscape does not exist if we consider a sufficiently high number of interacting genotypes. The genomes of organisms, from E. coli to humans, don't have that many genes, but the number of combinations among only 1000 biallelic genes is so large that Wright's "rugged landscape" analogy may never apply to them. Never mind our 20,000 multiallelic genes. I'll return to that issue another time, because this question of genomic searches has shaped my thinking about mutation-limited evolution and recent selection.

    In the meantime, back to the fitness landscape metaphor.

    Gavrilets describes two competing mathematical versions of fitness landscape: (1) Fitness landscape as relationship between genotypes and fitness of individuals, and (2) fitness landscape as relationship between genotype frequencies and the mean fitness of the population. In the first, the space is defined by the genotypes, and is therefore discrete. Two neighboring genotypes may have very different fitnesses -- even extremely so, since a single mutation might be lethal. In the second, the space is defined by the possible states of the population, and is (for sufficiently large populations) approximately continuous. Two nearby combinations of frequencies are likely to have nearly the same mean fitnesses.

    After the descriptions, he comes to the issue of whether the two versions are consistent or commensurable with each other. He comes to several very powerful arguments why the mean fitness version of the landscape is not useful for considering evolution across many loci (pp. 31-33):

    I note that [the mean fitness] version of fitness landscapes is often defined as a relationship between the allele (or gene) frequencies in a population and its mean fitness (e.g., Coyne et al. 1997; Fear and Price 1988; Arnold et al. 2001). Because allele frequencies in general are not sufficient for specifying the mean fitness of populations (unless fitness is additive or selection is very weak relative to recombination, so that linkage disequilibrium can be completely neglected), this definition is, strictly speaking, meaningless in multilocus systems.

    To take a concrete case, recent human evolution has involved selection on many genes which interact both with each other and with environments. In the set of genes that have responded to this selection, dominance and overdominance effects are likely. For example, new adaptive mutations are much more likely to escape drift if they are dominant in effect. Also, in this context, the number of genes and strength of selection are large enough that we probably cannot neglect linkage. The probability that a allele would have become more common in this population depends on the fitness of the genotypes that include it, and the correlation between these and other genotypes.

    Another common misconception is that the two versions of fitness landscapes are "wholly incompatible" (e.g., Provine 1986 [this is Provine's history of population genetics]). Yet, it is straightforward to find mathematically the average fitness of the population ... if one knows the individual fitnesses. That is, the first version can be transformed into the second version in a straightforward manner. However, knowing [the mean fitness] indeed does not allow one to find individual fitnesses.

    So the first version appears useful because you can find the mean fitness if you want for any possible frequencies of genotypes. The second version loses information because you can't work back from the population mean to the individual fitnesses of genotypes.

    Evolutionary dynamics of populations on fitness landscapes is a very complex process. In general, all relevant evolutionary factors (e.g., natural and sexual selection, random genetic drift, mutation, spatial structure and migration, environmental variability) and their interactions are expected to play important roles. Excluding some special cases (e.g., one-locus models of viability selection), the feature and patterns of evolutionary dynamics cannot be captured or predicted on the basis of any single characteristic such as the mean fitness. Indeed, it is well known that the mean fitness of the population does not necessarily increase and there are no general multilocus analogs of Wright's equations [predicting changes in frequency for a single locus] that do not rely on rather strong approximations [citations omitted]. Therefore, this second version of fitness landscapes is not particularly useful in a multilocus context and for modeling speciation.

    That's Gavrilets' aim in the book, so it makes sense for him to lay out that reason for avoiding the second version of fitness landscapes.

    But, does that criticism have more general force? I find the next sentences more persuasive:

    The dimensionality of fitness landscapes in this version is very high and increases exponentially rather than linearly with the number of loci. Moreover, even with a relatively small number of loci, the number of possible genotypes is much larger than any reasonable population size. This implies that most genotype frequencies will be equal to zero. In this case, describing populations by listing genotype frequencies becomes equivalent to listing all genotypes present, which is exactly what is done in the first version of fitness landscapes.

    Keep in mind that "genotype frequencies" here refers to the frequencies of multilocus genotypes, across very large sets of loci. A large number of genotypes at different loci can be combined in an extremely large number of ways -- many more than the number of individuals in natural populations. This is the principle behind DNA fingerprinting -- it only takes a couple of dozen multiallelic loci before the probability of two different individuals sharing genotypes converges on zero.

    These are persuasive arguments why the mean fitness interpretation of a fitness landscape doesn't help very much in considering evolution in a multilocus sense. Multilocus genotypes tend to be absent from the population, and probably even more important, independent assortment means that they don't reproduce themselves.

    But if we interpret the fitness landscape as representing the fitness of genotypes, then we can discuss the movement of a population on the landscape only under very constrained circumstances. The "strong selection, weak mutation" model of change (first explicitly discussed by Gillespie) predicts that a population can explore only very small parts of the fitness space from any given location. In that model, the entire population will move to a nearby combination of genotypes, wait for a while until a better mutation can occur, and then move again. Evolution in this model proceeds stepwise through a clear sequence of genotypes, and is mutation-limited not only as regards any single trait, but in fact across all traits.

    By contrast, if selection is weak and mutation is strong, then individuals in the population may occupy far-off locations in the fitness space, and the entire population is smeared across a large region of that space. On the surface, it might seem that evolution in such a population would be more deterministic, because the waiting time to beneficial mutations is much lower. But with many beneficial genotypes available to the population at any given time, the rate and direction of evolution are still affected strongly by stochasticity/

    Metaphors

    Next, Gavrilets describes the metaphor of a fitness landscape as a different issue from the mathematical description of a fitness landscape. The metaphorical version of a fitness landscape is the conventional textbook illustration: one fitness axis varies continuously against a two-dimensional genotype space. Neither the units nor the identities of the genotype dimensions are specified. The metaphor functions as an aid to imagination: Mean fitness changes as a hill-climbing function, without implying any specific idea of how much fitness should change as genotypes change. If you wanted some predictive qualities, you would need to specify the dimensions mathematically.

    Two issues arise with the metaphorical use of fitness landscapes. First, is the metaphor useful at all?

    Gavrilets implies that this distinction doesn't really matter -- that it is an error to try to interpret the metaphor of a fitness landscape in terms of mathematics, when the purpose of the metaphor is to illustrate broad potentialities instead of specific predictions.

    I don't really agree. I think that people familiar with the metaphor are consistently misdirected toward the mean fitness/allele frequency version (that is, Gavrilets' second mathematical version) of the fitness landscape. There are two qualities of the metaphor that lead to this natural conclusion. First, the "genotype dimensions" of the metaphor are continuous, and delimited on both ends. Second, the fitness function, whether drawn as rugged or smooth, is always continuous. Nearby locations on the spatial axes always have nearby fitnesses, without apparent discontinuities. An array of genotypes would not have this quality. Many lethal sequences differ by only one mutation from highly fit ones. In general, there's no reason to expect neighboring genotypes to have nearby fitnesses unless some kind of rank ordering were presupposed. Besides that, if we were considering an indefinitely large number of genotypes, we wouldn't use two spatial axes. It may be a mistake to judge the metaphor in mathematical terms, but if so, I suggest that the metaphor generates more confusion than the math.

    From Gavrilets' discussion of the "rugged fitness landscape", I think he agrees that the metaphor misleads in just this way.

    The value of Wright's metaphor is that it attempts to explain something very complex (multidimensional fitness landscapes and evolutionary dynamics) using something that everybody has a close knowledge of -- geographic landscapes. The view of rugged landscapes explicitly emphasizes the existence of different alternative solutions (alternative fitness peaks) to the problem of survival. In most three-dimensional fitness landscapes, peaks are isolated and there is no way to get from one peak to another without first descending to some valley. The metaphor of rugged fitness landscapes imposes a belief that the same is true in multidimensional fitness landscapes (36).

    But the rest of Gavrilets' work is a demonstration that highly multidimensional fitness landscapes don't have this intuitive three-dimensional shape. It's not obvious to me that the genotype fitness metaphor informs us either, because the strong selection/weak mutation model does not apply to most populations with sufficient fidelity to make evolution look like a simple directed walk through a series of genotypes. Understanding the dynamics requires us to consider how the population's mean fitness evolves, yet Fisher and Wright both had recourse to extreme simplifying assumptions about the relationship of mean fitness and the fitnesses of genotypes.

    Problems, problems...

    References:

    Gavrilets S. 2004. Fitness landscapes and the origin of species. Princeton University Press, Princeton NJ. Amazon

    .

  • Quote: Fisher defining epistasis

    Fri, 2009-06-05 18:19 -- John Hawks

    People often complain that R. A. Fisher wrote in a hard-to-read style; unnecessarily verbose and indirect. Either I don't tend to mind, or I find that the style makes me read with greater care. In either case, there are select passages from his writings that stand out as very clear to me. His description of epistasis and dominance as deviations from additivity, in his famous 1918 paper (p. 404), is one of them:

    The steps from recessive to heterozygote and from heterozygote to dominant are genetically identical, and may change from one to the other in passing from father to son. Somatically the steps are of different importance, and the soma to some extent disguises the true genetic nature. There is in dominance a certain latency. We may say that the somatic effects of identical genetic changes are not additive, and for this reason the genetic similarity of relations is partly obscured in the statistical aggregate. A similar deviation from the addition of superimposed effects may occur between different Mendelian factors. We may use the term Epistacy to describe such deviation, which although potentially more complicated, has similar statistical effects to dominance. If the two sexes are considered as Mendelian alternatives, the fact that other Mendelian factors affect them to different extents may be regarded as an example of epistacy.

    The terms we use today are familiar by use. A biologist doesn't necessary consider how idiosyncratic is the genetic use of term "additive". When I read a passage like this, it brings to mind a long-ago time when the select group of people using a term all had read the same papers. I wonder how many geneticists still read Fisher during their training. I can tell you this: the bound volume of the Proceedings of the Royal Society of Edinburgh in our library didn't look like it's been picked up for 30 years. I mean, serious dust on the cover.

    I wrote last month about how Fisher invented "variance", and noted the very useful property that the variance is a sum of contributions from different causes. It seems remarkable that Fisher could arrive at statistical framework for identifying the interactions of multiple genes on a trait, at a time when only a relative handful of "Mendelian factors" had yet been found.

    Now that we are able to find Mendelian factors in whole-genome association studies, it's remarkable that Fisher's framework is so often forgotten!

    References:

    Fisher RA. 1918. The correlation between relatives on the supposition of Mendelian inheritance. Proc R Soc Edinburgh 52:399-433.

  • Phenotypic variance

    Sat, 2009-04-18 17:38 -- John Hawks

    I've intermittently been reading through William Provine's The Origins of Theoretical Population Genetics. It's related to a project simmering on my back burner.

    Meanwhile, last week I was talking with some students about the recent papers at the AAPA meetings about natural selection as assessed by quantitative traits. The students thought that some of these papers had omitted some basic details that seemed obvious from the point of view of quantitative genetics. Also, George Armelagos had mentioned Raymond Pearl, so I figured as long as I'm reading about Pearl, William Castle, R. A. Fisher and their attitudes toward quantitative genetics, I might as well note a few passages from Provine's account.

    Provine:

    Fisher's express purpose in the paper was to interpret the well-established results of biometry in terms of Mendelian inheritance by ascertaining the biometrical properties of a Mendelian population.

    I'll just pause to note that Fisher's formulation begins almost all textbooks in quantitative genetics and many in population genetics. The model that relates quantitative variation and genotypic variation is essential to all genetic analysis.

    In particular, he wanted to show that Pearson was mistaken in concluding that the correlations between relatives in man contradicted the Mendelian scheme of inheritance. He began by defining a measure of the variability of a character in a population.

    This is an essential step for any introduction to genetics also. I spend some time in all my courses talking about the relationship between genetic and phenotypic variation, using the measures of each as ways to talk about the ways they differ. We can analogize genetic variation to a digital readout -- you have a genotype, or a set of genotypes, and the population's variation has to do with the frequencies of those genotypes or the alleles that comprise them. So the variation is something that emerges from counting genes. You have heterozygosity (expected frequency of heterozygous genotypes), or number of alleles. At the sequence level, you count both alleles and the number of mutations that separate them -- average pairwise difference, number of segregating sites.

    Back to Provine:

    Often the standard deviation σ was used for this purpose. But Fisher noted that

    Now Provine gives a direct quote from Fisher 1918:

    when there are two independent sources of variability capable of producing in an otherwise uniform population distributions with standard deviations σ1 and σ2, it is found that the distribution, when both causes act together, has a standard deviation σ12 + σ22. It is therefore desirable in analysing the causes of variability to deal with the square of the standard deviation as the measure of variability. We shall term this quantity the Variance of the normal population to which it refers, and we may now ascribe to the constituent causes fractions or percentages of the total variance which they together produce (Fisher 1918:399).

    I have always thought that this was a work of magic by Fisher. The additive quality of variance is such a useful characteristic for a measure of variation, it's hard to imagine using anything else. Fisher continues:

    For stature the coefficient of correlation between brothers is about .54, which we may interpret by saying that 54 per cent of their variance is accounted for by ancestry alone, and that 46 per cent must have some other explanation.

    It is not sufficient to ascribe this last residue to the effects of environment. Numerous investigations by Galton and Pearson have shown that all measurable environment has much less effect on such measurements as stature. Further, the facts collected by Galton respecting identical twins show that in this case, where the essential nature is the same, the variance is far less. The simplest hypothesis, and the one which we shall examine, is that such features as stature are determined by a large number of Mendelian factors, and that the large variance among children of the same parents is due to the segregation of those factors in respect to which the parents are heterozygous. Upon this hypothesis we will attempt to determine how much more of the variance, in different measurable features, beyond that which is indicated by the fraternal correlation, is due to innate and heritable factors (Fisher 1918:400).

    And that, in a nutshell, is why the correlation between relatives is not a measure of heritability. Fisher attempted to show that the segregation of Mendelian factors could account for a large fraction of the variance of stature, and substantially succeeded in showing that the environment had much less impact than had been assumed from the correlation between relatives.

    Provine's discussion continues along a different line, but he includes the characteristic line:

    Fisher's 1918 paper was well received by the few geneticists who could understand his mathematics (147).

  • Early concepts of cultural diffusion: the Boasians

    Mon, 2008-10-27 23:39 -- John Hawks

    I went looking for Lowie, because I was curious about the introduction of the diffusion concept into cultural anthropology. The mathematical description of diffusion, originally developed in thermodynamics, became important in statistical genetics during the first half of the twentieth century. In particular, R. A. Fisher introduced diffusion methods to examine the effects of natural selection in his 1922 paper, "On the dominance ratio." Diffusion methods made it possible to derive analytical approximations for many interesting biological parameters, and also came to underlie models of population dynamics outside the field of genetics.

    So I wondered: How did the use of diffusion in cultural anthropology compare to the introduction of the diffusion concept in genetics? Kroeber's systematization of the concept of cultural diffusion was certainly later than Fisher and Wright's major works on diffusion theory.

    It turns out that "diffusion" was initiated into cultural anthropology by E. B. Tylor himself. The OED has the earliest anthropological use of the term in Primitive Culture:

    How good a working analogy there really is between the diffusion of plants and animals and the diffusion of civilization, comes well into view when we notice how far the same causes have produced both at once.

    Later, Boas also makes extensive use of the concept of diffusion, in essentially the modern sense as an alternative explanation to independent invention for a cultural trait. For example, in his 1891 article, "Dissemination of tales among the natives of North America," he compares the transfer of myths and stories among groups in the New World to that in the Old. He writes:

    Then, we may ask, is there no criterion which we may use for deciding the question whether a tale is of independent origin, or whether its occurrence at a certain place is due to diffusion? I believe we may safely assume that, wherever a story which consists of the same combination of several elements is found in two regions, we must conclude that its occurrence in both is due to diffusion. The more complex the story is, which the countries under consideration have in common, the more this conclusion will be justified (Boas 1891:13-14).

    So by the time Lowie was writing his Culture and Ethnology, the concept of cultural diffusion was well-established, and the Boasian school was concerned with classifying culture similarity in terms of diffusion.

    Cultural problems tended to involve the spread of relatively large quantities of information -- reflected by the Boas quote above and its concern with the "combination of several elements." We can interpret this focus as a statistical consequence of observing culture: With so many potential observations, diffusion is difficult to distinguish from the null hypothesis of parallel development (chance similarity) unless the similarities are sufficiently detailed (involve a threshold of information) to prove otherwise.

    I have not yet found anywhere that this concept of assessing diffusion by "information content" was formalized beyond verbal descriptions like Boas' above. I will review Kroeber's contribution later; he does not provide any formalism at all.

    What I want to point out here is that this concept of diffusion is delimited quite differently from the use of diffusion models in genetics. Fisher and Wright initially introduced diffusion methods to deal with the effects of random changes of gene frequencies. In the case of Boasian cultural diffusion, random change will almost always fall short of the information content necessary to identify specific resemblances in a cross-cultural context.

    I can imagine datasets on cultural traits that would be sufficient to test the hypothesis of undirected (non-selected) diffusion. For example, phonological data on dialects is often detailed and coded to geographic locations. If we approached these data with a diffusion model, they would in many cases be sufficient to demonstrate departures from the null model of undirected diffusion.

    But most observations from ethnography are not of this character. For this reason, cultural diffusion is a priori a phenomenon involving direction by some selective mechanism. In genetics, this would be natural selection. In cultural variation, the selective mechanism may be less clear -- some combination of conscious decision, customs concerning borrowed behaviors, and functional efficacy may be involved.

    In the case of natural selection driving a gene substitution through space, Fisher's model of a wave of diffusion assumed only a single parameter determining intrinsic increase -- the "reaction" term in a reaction-diffusion equation. This was sufficient because the only relevant difference between the selected and non-selected alleles was fitness.

    The case of cultural diffusion, in contrast, makes it tempting to suggest that there might be many independent terms contributing to the spatial dispersion of a cultural trait. Traits might be different in that some are more transmissible than others; thereby spreading more widely. Some might have greater functional advantages than others. Some ideas might be impeded because they conflict with widespread taboos; others might be facilitated by the same factors.

    I write all this because I'm curious about why there was not a more formal development of diffusion in the context of cultural theory. There was every potential of it: Boas did not begin with a formalization, but the ideas critical to a formal theory are present in his description. But the development of the field quickly went in the opposite direction -- away from formalization and toward description. This was despite the fact that the concept of diffusion became incredibly important in the conflict between the Boasian school of ethnology and the cultural evolutionism of Leslie White and others.

    Others (non-anthropologists) did develop more formal theory, but this had little (if any) impact within anthropology. As I continue, I'll go back to the early cultural evolutionists Tylor and Morgan, and trace their influence on 20th century neo-evolutionists. Additionally, no account of cultural diffusion can omit the importance of the Kulturkreis school and its concept of the culture area.

  • Why human evolution accelerated

    Wed, 2007-12-12 07:50 -- John Hawks

    n. b. This is a story about my work on recent human evolution, describing some of the main results and how the work came about. The story refers to my paper (with Gregory Cochran, Eric Wang, Henry Harpending, and Robert Moyzis), "Recent acceleration of human adaptive evolution," which came out in December, 2007.

    Like most good stories in biology, this one begins with Darwin. Darwin was always very interested in animal breeding, which he considered the best analogy for the process of natural selection. Of course, if you're breeding livestock and want to select for some characteristics, it is important to select from as large a herd as possible, because large populations have more variation in them. Darwin recognized this as an important condition for natural selection, which relies on sufficient variation in natural populations.

    [A]s variations manifestly useful or pleasing to man appear only occasionally, the chance of their appearance will be much increased by a large number of individuals being kept.... Hence, number is of the highest importance for success.

    These words from the Origin, "number is of the highest importance for success" were influential.

    This is a quick review of the research, based on a presentation I gave earlier this year. It is not complete, and glosses a number of very important details. A close reader looking for how to do genomics would be better served reading the actual research paper. Here, I'm trying to express the science for everyone else.

    By 1930, R. A. Fisher picked up Darwin's idea about numbers, predicting that evolution in large populations could be faster than in small populations. However, this is not in all circumstances, but only where the number of new adaptive mutations is quite small -- in other words, where evolution is "mutation-limited":

    The great contrast between abundant and rare species lies in the number of individuals available in each generation as possible mutants.... The importance of the contrast lies with the extremely rare mutations, in which the number of new mutations occurring must increase proportionately to the number of individuals available.

    A long history of research in plant genetics (corn breeding), microbial chemostat experiments, and the examination of pesticide resistance in insects support Fisher's concept. For example, flies subjected to low doses of pesticide in the laboratory tend to acquire very complicated patterns of resistance -- involving slight changes in many different genes. These usually aren't transmitted perfectly and often have fitness costs; it's a very imperfect adaptation. But if pesticide is sprayed over a large area, flies sometimes appear very quickly with a single mutation that confers very complete resistance. Here, the very advantageous resistance mutation is incredibly rare -- it only occurs in maybe one in a billion flies. It would never occur in the small laboratory population.

    Our growing population

    Human populations have been growing rapidly during the last 50,000 years or so. That increase began around the time of the Upper Paleolithic -- that's documented by archaeological evidence. There was a later massive increase during the Neolithic. This agricultural transition actually was quite heterogeneous: earlier in West Asia and China, later in Europe, and then later still in subsaharan Africa. Last, we have within the last few hundred years seen a massive increase in numbers associated with industrialization and globalization of technology.

    One day a couple of years ago, Greg Cochran and I were talking about brain evolution. You have to understand, this is long before we knew about any of these genome scans -- they hadn't come out yet. One of the main mysteries of human brain evolution is why it happened apparently gradually for such a long period of time. It is one of the best cases of evolutionary gradualism. But this is a problem, because directional selection would have too be too weak to take such a long time. Now, we know that brain size is constrained in two directions -- larger brains cost more energy to maintain, but smaller brains come with some functional disadvantages. So this creates a situation where new variants that satisfy both constraints -- costing little energy, or making great improvements in brain function -- must be very rare. It should be mutation-limited.

    I remember very well, that at precisely the same moment, we both realized -- "Hey, maybe this great increase in human population size made a difference!" Because as we'll see later, the pattern of change in brain size really changed when populations started to get really big.

    You see, this is one of those very rare cases where the theory preceded the data! It is quite simple; the rate of mutations in a population is a linear product of the rate per genome and the population size.

    Not all mutations are advantageous, and not all advantageous mutations will be fixed. The vast majority are lost. If a mutation has a selective advantage, then the chance that it will proceed toward fixation (and attain high frequency) is 2s -- "s" here is the fitness advantage. That means that 90 percent of new mutations with a 5 percent fitness advantage are simply lost.

    The most beneficial mutations are very rare; it is much more likely that a new mutation will be weakly selected. This is another aspect of selection that has been well-known since Fisher. So the chance of fixation increases with s, but the likelihood of the mutation decreases with s -- in fact, the number decreases exponentially as selection is stronger and stronger.

    If you put all these together, you can predict how many selected changes you should see in a population that has been growing in size. This tells us the number of new adaptive mutations that should come into the population each generation. It is still linear with population size -- a larger population should have more mutations in precise proportion to its size.

    Still, a very small fraction of the mutations in any given population will be advantageous. And the longer a population has existed, the more likely it will be close to its adaptive optimum -- the point at which positively selected mutations don't happen because there is no possible improvement. This is the most likely explanation for why very large species in nature don't always evolve rapidly.

    Instead, it is when a new environment is imposed that natural populations respond. And when the environment changes, larger populations have an intrinsic advantage, as Fisher showed, because they have a faster potential response by new mutations.

    From that standpoint, the ecological changes documented in human history and the archaeological record create an exceptional situation. Humans faced new selective pressures during the last 40,000 years, related to disease, agricultural diets, sedentism, city life, greater lifespan, and many other ecological changes. This created a need for selection.

    Larger population sizes allowed the rapid response to selection -- more new adaptive mutations. Together, the the two patterns of historical change have placed humans far from an equilibrium. In that case, we expect that the pace of genetic change due to positive selection should recently have been radically higher than at other times in human evolution.

    Finding selection in the genome

    Now, it comes to a problem of how we can see recent mutations that have been selected. A genome scan is based on things that vary, not things that are fixed. So we are looking at some window of frequencies. In our study, that was a window from around 22 to 78 percent.

    Before we go too far, it is important to point out that an adaptive gene will be in a window where we can detect it for only a short time -- it spends a long time getting up to an appreciable frequency (here 22 percent, which is our lower ascertainment bound) and a long time going from a high frequency (here 78 percent) to fixation -- this is for a dominant. But it spends only a very short time in the window where we can see it.

    And strongly selected genes go through this window quite a lot faster than weakly selected ones.

    The importance of this is that we will see genes with different strengths of selection at different ages. Our constraint is that right now all the things we can see are variable -- but some are variable because they originated a short time ago and were very strongly selected, and others are variable because they originated a long time ago, but were very weakly selected.

    You can guess, that we expect to see more of the weak ones than the strong ones, because there should be more of them! So the window should give us a view of the strength of selection as well as the number of mutations. If we can estimate the ages of our mutations, then we can predict how many there should be at different strengths of selection, and try to quantify the effect of population size.

    Here, we've drawn a graph showing the number of genes in the window, compared with the number that are still variable in the population -- they are on their way to fixation -- but they are outside the window. This is for a growing population, so you see that the number of these genes increases as you get closer to the present.

    Tip of the iceberg

    There are many more that we can't see than the ones we can see -- this is like the tip of the iceberg. That is one aspect of recent selection; these genes are in this intermediate frequency range for a short time, and there will be many more genes that are too rare for us to see with our current methods, but might be very important regionally or locally in some populations.

    Based on a model of population growth, we expect to see a big peak corresponding to the period when humans were growing rapidly during the Neolithic. The distribution should plunge down toward the present, because selection would have to be so strong on such a recent mutation for us to see it -- we're talking about 20 percent or more. Those just almost never happen. The true number, remember, is the iceberg under the water -- but we must make predictions about the part we can see.

    Linkage disequilibrium and selection

    Now, I need to say a few words about how we find these genes when we scan the genome. The International HapMap consists of a list of over 3 million genetic polymorphisms -- SNPs -- taken from a sample of people with ancestry in Northern Europe, West Africa, and East Asia. When we look at a sample of a long stretch of DNA from several people, we will be considering the frequency of many different polymorphisms.

    But more important, we have studied whether each polymorphism is linked to the others. As a new positively selected allele increases in frequency in a population, it is initially linked to a wide region including many nearby polymorphisms. This induces a long-distance association among SNPs, which is called linkage disequilibrium.

    When we are looking at a stretch of chromosome, what we can observe is that there are areas where recombination seems to be very rare around one SNP -- an in particular where one of the two SNP alleles has almost no recombinant chromosomes, but the other allele appears to have been recombining normally. That kind of mismatch is a strong indication of selection.

    I'm not going into the details of that process right now; I'll be posting some real examples of such LD decay analyses later in the week. After applying the analysis, we found more than 3000 in the Yoruba sample, more than 2800 in Europeans, and more than 2300 in Asians.

    These numbers are very large -- they make it look like this aspect of evolution, positive selection on new adaptive alleles, has been going very fast. But how long a time period are we looking at? Based on the local rate of crossing-over, we can say how quickly LD ought to be broken by new recombinations, and that allows us to derive age estimates. The ages represent the time that has elapsed since the initial mutation that established each adaptive allele.

    Here is a comparison between the ages of selected variants in the African HapMap and in the European HapMap. Let's look at this graph a little bit.

    Selected variants

    Each of these dots represents a number of different genes -- the y-axis is number; this is a histogram. The x-axis is the age. So you see, there are many of these selected genes that started around 10,000 years ago; there are many fewer that started around 40,000 years ago, and even fewer starting 80,000 years ago.

    These fitted lines are what you get if you fit a one-parameter model with very strong selection to these curves. You can fit these without considering the effects of population growth.

    But you notice some differences here between the African and European distributions. Africa has a few more total variants, but it especially has more older variants, before 10,000 years ago. You can see that during that time period, Europe has very few. And Europe has this later peak, where we see an earlier peak in Africa.

    These details are a very good match to demographic growth -- Africa had much larger population size during the Late Pleistocene than Europe, but West Asia, and then Europe had earlier Neolithic expansion than Africa -- so we see these early times have a lot more selected variants within Africa, and later on there is a pulse of adaptive variants in Europe.

    Testing acceleration

    At this point, we have a theory that predicts acceleration of new adaptive variants, and we have data that appear to show a very fast recent rate. But we haven't yet directly tested the hypothesis of acceleration.

    We chose a null hypothesis approach. After all, the rate of change looks like it has been very high recently, but what it if were always very high. A constant rate of change is a null hypothesis -- the hypothesis of no change, or in our case, no acceleration. So we worked out the predictions of this hypothesis: a constant, high rate of selection. If we could show that those predictions aren't true, then we could disprove the null hypothesis and show that adaptive human evolution accelerated.

    We took several different approaches, testing predictions on different kinds of data. For one thing, if the null hypothesis were true, then there should be a whole lot more selected mutations that have already reached or approached fixation, than the relatively small number that we see still varying in human populations. So to test the null hypothesis, we should look for evidence of these fixed selected substitutions.

    That's exactly what we did -- we looked at other means of assessing the number of recently fixed and near-fixed variants.

    Fixed variants

    On the bottom of this graph, we have the European age distribution of variants in our window. This should represent a small fraction of the total number that have happened across this time period. But you can see from this graph, that if the rate was constant, the total number should be very, very large -- since we are looking at 10-generation bins, here we have around 150 predicted substitutions every 10 generations, or around 1/2 per year. Most of these should be way above our window, in fact, as we go back toward 40,000 years ago, almost all should be close to or at fixation.

    This large number of completed sweeps should have vastly reduced human genetic variation, because polymorphisms tend to hitchhike along with nearby selected alleles. Hitchhiking up to fixation tends to eliminate variation. When we look at the effect of hitchhiking under this constant selection hypothesis, the genome-wide average diversity should be less than a tenth of what we actually observe. So that also disproves the null hypothesis.

    How much acceleration?

    Down at the bottom of the graph, you see the predicted number of selected variants over our window, under the hypothesis of population growth -- exactly the demographic growth that really happened to humans. And here you see, that there are many, many fewer of these predicted, and in fact over the long course of human evolution, the rate would have been very low.

    We can put a number on just how low, and when we do that, we can see how much human evolution has sped up. For example, if we have 1/2 of a substitution per year, well, there are around 12,000,000 years separating humans and chimpanzees (6 million since the common ancestor, in both these lineages). So if adaptive substitutions had happened at a constant rate as high as the last few thousand years, we should be looking at around 6 million fixed adaptive substitutions between humans and chimpanzees.

    But in reality there have been nowhere near that number. There are only 40,000 total amino acid substitutions between humans and chimps. Not all those were selected -- maybe only a third. We can add in some additional selected sites outside of coding regions, but still we are looking at an increase in the rate of new adaptive mutations in humans that is 100 times faster than could possibly have been true during most of human evolution.

    Our evolution has recently accelerated by around 100-fold. And that's exactly what we would expect from the enormous growth of our population.

    What is all this selection for?

    We know something about the functional categories of genes inferred to be under selection; we are studying this now. We expect it will keep us busy for some time.

    In a general view, they illustrate the idea that changing cultures and ecologies have been important in changing the pattern of selection. For example, many of the selected genes are involved with pathogen defense -- for new pathogens that didn't always exist. Some are apparently related to metabolism or even directly to diet, in terms of processing new food sources. Of course, lactase is an excellent example in this category.

    These are not the kinds of phenotypes that have a lot of visibility in skeletal remains. But we have a skeletal record of these populations during the last 40,000 years. We know a lot about what they looked like and how they changed. So we may try to relate the pattern of genetic, skeletal, archaeological, and other kinds of changes over time.

    One obvious way to test hypotheses about these changes would be to sample ancient DNA from skeletons. In this way, we could see if the new selected alleles are in them or not. This spring, a paper by Burger and colleagues (PNAS) sampled ancient European skeletons, Neolithic skeletons, for the lactase persistence allele. They didn't find any who had that allele -- not a single one, and this is in Neolithic populations where today the allele is up over 90 percent in frequency. What is going on there?

    Lactase allele over time

    In this case, it is quite obvious by considering population genetics. We have a very good date for this lactase persistence allele, from many sources -- it is around 6000-10,000 years old. And you can see in the figure, a new selected allele will remain at a very low frequency for a long, long time after its origin. Here, these skeletons were sampled at a time when the selection pressure favoring the allele was present, but the allele had not yet increased to a substantial frequency. In fact, this allele would have been rapidly increasing through these intermediate frequencies much more recently -- we're talking here about Roman times. And today it is over 90 percent in Scandinavia, but considerably lower in Italy and Southern Europe.

    In the future, we will be able to sample for genes more widely in ancient skeletons. At the same time, we will be able to sample skeletal changes to try to correlate them with allele origins. That is some research that I have applied for a number grants to support, and I think it will be very promising.

    Conclusion

    I hope that this essay gives an introduction to the work we have done. This was based on a presentation about the research I gave earlier this year. There are many missing ends, and I'll be adding more information over the next several days about ways of testing for selection, as well as some of the more surprising implications of our research. I've written it without a bibliography, which I can direct you to the paper for a full set of references.

    Synopsis: 
    I describe the background of our 2007 work on accelerating human evolution.
Subscribe to R. A. Fisher

Neandertals

For years, I've worked on their bones. Now I'm working on their genes. Read more about the science studying these ancient people.

Denisova

From a finger bone of an ancient human came the record of a completely unexpected population. My lab is working on the science of the Denisova genome.

Acceleration

The advent of agriculture caused natural selection to speed up greatly in humans. We're uncovering some of the ways that populations have rapidly changed during the last 10,000 years.

Malapa

Just outside Johannesburg, the Malapa site is producing some of the most exciting finds in human evolution. This site is the headquarters of the Malapa Soft Tissue Project.