john hawks weblog

paleoanthropology, genetics and evolution

natural selection

  • Mitochondrial DNA and "performance enhancement"

    Mon, 2005-09-19 09:40 -- John Hawks

    One of the articles by Douglas Wallace referenced in the previous post covering mtDNA selection is subtitled "On the road to therapeutics and performance enhancement."

    One theme of the review article is the adaptive value of mtDNA variants to different human populations. He discusses in some detail the role of some of this variation in high latitude populations as a possible cold adaptation, with variants that decrease ATP production, but generate more heat as a side effect. These same variants are associated with greater longevity, less degenerative disease, but more disorders of energy metabolism.

    But Wallace doesn't say a single word about the obvious inverse of these observations: if some populations have high-frequency variants that decrease ATP production, doesn't it follow that the other populations have variants that don't decrease it?

    True to its subtitle, the paper does include a section on "performance enhancement". It is a short section, which includes a conclusory paragraph for the paper as a whole, but here are the two relevant paragraphs:

    It is now clear that not all mtDNA variation is deleterious. Indeed, about 25% of all ancient mtDNA variation appears to have caused functional mitochondrial changes and thus been adaptive. Those mtDNA variants that are adapted to warm climates have mtDNA variants that result in tightly coupled OXPHOS, thus maximizing ATP output and minimizing heat production. The presence of these mtDNAs permits maximum muscle performance but also predispose sedentary individuals that consume excess calories to multiple problems. They would be prone to be overweight and their mitochondria would generate excessive ROS, thus making them susceptile to a variety of degenerative diseases, cancer and premature aging. Partially uncoupled mitochondria generate more heat, but at the expense of ATP production. Individual's with these variants are better able to tolerate the cold, and are less prone to obesity. They also generate less ROS making then resistant to degenerative diseases and aging. Finally, the mitochondria are why we breathe. Hence, mitochondrial variation might be an important factor in individual predisposition to altitude sickness.

    All of these factors and numerous others are areas that influence our daily lives. Consequently, some individuals may wish to change their energetic phenotype by changing their mtDNA genotype. If some people will undergo surgery to change their appearance, there will certainly be some who will submit to mtDNA alterations to change their life style, appearance, and physical performance. For example, changing a single mtDNA nucleotide of a high performance athlete to increase mitochondrial ATP production through altered OXPHOS coupling could increase performance by several percent and mean the difference between Olympic immortality versus obscurity. Since such a change would be undetectable by any reasonable standard screening procedure. Why wouldn't a competitive athlete take advantage of such an opportunity?

    So Wallace tells us that there are population differences in mtDNA metabolism, that some variants common at low latitudes have higher ATP production than others, and that genetic engineering an individual to have one of these high-ATP producing mtDNA types might be a form of performance enhancement.

    Am I the only one who senses something missing here? Is Wallace saying that the "difference between Olympic immortality versus obscurity" is already a result of mtDNA metabolic differences?

    If Wallace is really saying that these low-latitude mtDNA variants may "increase performance by several percent", then isn't that the most explosive aspect of his paper?

    And while I'm at it, why should it be that low-latitude populations have high-ATP-producing variants? As far as I can tell, it's a total mystery that isn't addressed in the least.

    References:

    Wallace DC. 2005a. The mitochondrial genome in human adaptive radiation and disease: On the road to therapeutics and performance enhancement. Gene 354:169-180. Full text (subscription)

  • Mitochondrial DNA adaptations in living human populations

    Sun, 2005-09-18 16:15 -- John Hawks

    I had read this paper by Ruiz-Pesini et al. (2004) before, but the particular combination of factors it suggests came together for me in a new way recently:

    Effects of Purifying and Adaptive Selection on Regional Variation in Human mtDNA

    A phylogenetic analysis of 1125 global human mitochondrial DNA (mtDNA) sequences permitted positioning of all nucleotide substitutions according to their order of occurrence. The relative frequency and amino acid conservation of internal branch replacement mutations was found to increase from tropical Africa to temperate Europe and arctic northeastern Siberia. Particularly highly conserved amino acid substitutions were found at the roots of multiple mtDNA lineages from higher latitudes. These same lineages correlate with increased propensity for energy deficiency diseases as well as longevity. Thus, specific mtDNA replacement mutations permitted our ancestors to adapt to more northern climates, and these same variants are influencing our health today.

    Here are the final two paragraphs of the paper:

    This combination of the increased predilection to energy deficiency diseases, but protection from neurodegenerative diseases and aging is consistent with the expectations for mtDNA coupling efficiency mutations. Uncoupling mutations would reduce ATP production, increasing the probability of energetic failure. However, they would also decrease mitochondrial ROS production by increasing the oxidation of the electron transport chain, thus reducing oxidative damage and apoptosis. This could decrease neuronal and other cell loss, thus increasing longevity.

    Our observations support the hypothesis that certain ancient mtDNA variants permitted humans to adapt to colder climates, resulting in the regional enrichment of specific mtDNA lineages (haplogroups). Today these same variants result in differences in energy metabolism and altered mitochondrial oxidative damage, thus affecting health and longevity. Therefore, to understand individual predisposition to modern diseases, we must also understand our genetic past, the goal of the new discipline of evolutionary medicine (Ruiz-Pesini et al. 2004:226).

    So in other words, populations in northern latitudes today are enriched for a number of mtDNA haplogroups that are likely adaptive to cold. Today, these haplogroups (as a class) are largely protective against degenerative diseases of aging, possibly because they reduce oxygen free radical production. But they are also more susceptible to disorders of energy metabolism, because they reduce ATP production.

    Needless to say, this says some interesting things about the relationship of longevity and energy metabolism in recent human populations.

    But at the moment, I'm thinking about Neandertals. They lived in a cold place, but their lifestyle suggests that energy metabolism was at a premium. At the same time, they had a much shorter maximum lifespan than living people. According to the model of mtDNA mutations outlined by Ruiz-Pesini et al. (2004), this would be a very odd combination: cold adaptation today is linked to longevity and lower energy metabolism; Neandertals required high metabolism but had lower longevity.

    Those functional considerations alone suggest that Neandertals needed a highly specialized mtDNA type that would have been unlike those of living people.

    But additionally, the increase in longevity and difference in lifestyle apparent in later Upper Paleolithic people gives a clear reason for the replacement of the Neandertal mtDNA type. These people lived longer, and they had markedly less energy expenditure than Neandertals did. Their dietary and cultural adaptations would have been much more similar to recent arctic peoples (and indeed, might well have been completely identical to the ancestors of recent arctic peoples).

    Would this have been an exceptional event? I don't really think so, because Ruiz-Pesini et al. (2004) outline how similar cold-adaptive mutations occurred in different macrohaplogroups that today are all present at higher latitudes. The occurrence of potentially adaptive mtDNA mutations appears to have been quite a common event throughout human prehistory, because today's haplogroups appear to be separated by many mutations that are adaptive in different contexts.

    The situation is reviewed in two papers by Douglas C. Wallace (2005a, 2005b). The thing that surprised me about these two reviews is that they embrace a positive selection hypothesis for mtDNA migration out of Africa. Consider:

    This mtDNA history is remarkable for the striking discontinuities that exist in mtDNA diversity between climatic zones. Of the extensive mtDNA variation present in Africa only two mtDNA lineages (M and N) succeeded in colonizing all of Eurasia. Of the plethora of Asian mtDNA types that subsequently accumulated, only three haplogroups (A, C, and D) and much later G came to occupy the extreme northeastern Chukotka Peninsula of Siberia. This strikingly correlation between mtDNA lineages and latitude and climate led to the hypothesis that mutations in certain mtDNAs that decreased the coupling efficiency increased mitochondrial heat production and permitted people to survive the cold of the more northern latitudes (Ruiz-Pesini et al., 2004) (Wallace 2005a:173).

    This passage is followed by a section to support it, of which I find several parts very suggestive (my emphasis):

    This hypothesis is supported by the fact that missense mutations in mtDNA protein genes show regional specificity. Missense mutations are prevalent in the ATP6 gene in the arctic, in the cytb gene in Europe, and in the COI gene in Africa. Mutations in different ND genes also show regional correlation (Mishmar et al., 2003). Moreover, many of the ancient missense mutations change amino acids that are as highly evolutionarily conserved as are most known pathogenic mutations, yet have been retained in the human population for tens of thousands of years. Hence, they could not be pathogenic in the environment in which they reside, but rather must be adaptive and thus beneficial. Furthermore, an analysis of the missense mutations in cytb of complex III, for which the crystal structure is known, revealed that many of these missense mutations affected CoQ binding sites which would alter the Q-cycle, proton pumping, and thus OXPHOS coupling (Ruiz-Pesini et al., 2004).

    Finally, when European mtDNA haplogroups were correlated with longevity and predisposition to Alzheimer Disease (AD) and Parkinson Disease (PD), it was found that mtDNAs harboring uncoupling variants were enriched in the elderly and deficient in AD and PD patients. This led to the conclusion that the uncoupling mutations must enhance the flow of electrons through the ETC keeping the ETC carriers oxidized. This, in turn, reduces the spurious transfer of electrons to O2 thus minimizing ROS production and reducing mitochondrial and cellular damage.

    These same uncoupling mutations would also reduce the efficiency of ATP generation which could then exacerbate ATP deficiencies resulting from milder mtDNA mutations. This could account for the predilection of patients with Leber's Hereditary Optic Neuropathy (LHON) that harbor the milder mtDNA mutations to also have haplogroup J mtDNAs which harbor either the np 14798 or np 15257 cytb missense mutations. Thus ancient adaptive mtDNA variants are affecting individual predisposition to degenerative diseases and aging today.

    This last one also would apply to milder slightly deleterious mutations of nuclear genomic loci that contribute to mitochondrial metabolism. One might even conclude that today's mitochondrial degenerative disorders may in part be a legacy of ancient adaptive mtDNA variants that no longer exist.

    Now, if we seriously accept the hypothesis that human mtDNA variation is regionally adaptive, then we have to conclude that a lot of literature that assumes mtDNA neutrality is just wrong. For example:

    1. If mtDNA is neutral, then the dates of mtDNA lineage divergences may tell us about the initial migrations of some human populations. If mtDNA is adaptive to different regions, then the dates of lineage divergences tell us about the times that adaptive mutations occurred.
    2. If mtDNA is neutral, then it is surprising that archaic mtDNA variants are gone. If mtDNA is selected, then this is not at all surprising: the current global mtDNA variation is simply the product of the last globally adaptive mutation.
    3. If mtDNA is neutral, then it is reasonable to explain the lack of ancestral haplogroup L outside of Africa as the consequence of an out-of-Africa population bottleneck. But if mtDNA is selected, this distribution is explained as Wallace (2005b:376) suggests: many mtDNA lineages may have entered Eurasia, but only a few survived local selection.

    Is it possible that there have been no globally adaptive mutations? If the present pattern of variation is fine-tuned to climate and diet, it seems very unlikely that the massive life history, brain, and energetic changes during Pleistocene human evolution had no effect whatsoever.

    The present distribution of adaptive mtDNA variants suggests a scenario for the replacement of Neandertal mtDNA. Variants of human mtDNA that appear to be adaptive in Eurasia, and particularly in the northern parts of Eurasia, evolved recently upon an African background. The present variation of human mtDNA is comparatively recent, but it is ancient enough that some of today's variants were segregating within Africa over 100,000 years ago, and the haplogroup M dispersal from Africa appears to have occurred between 60 and 70 thousand years ago. Ultimately these gave rise to European haplogroups H, T, U, V, W, X, I, J, and K (Wallace 2005b), in the time period between 50,000 and 9,000 years ago.

    We know that these variants were superior to indigenous European mtDNA variants because the Neandertal mtDNA is gone today. Yet, we must suspect that the Neandertal mtDNA would have been very well adapted to their cold climate and high energetic requirements. The advantages of the incoming African-derived mtDNA variants were great, but they would not have been free of disadvantages -- especially with respect to either cold (which has historically restricted non-European mtDNA haplogroups to the south) or energy metabolism (which currently restricts European mtDNA haplogroups to the north).

    Thus, the replacement of Neandertal mtDNA could occur only upon the abandonment of Neandertal lifeways. Only a reduction in energy expenditure and exposure to cold could allow the spread of the African-derived mtDNA variants. Both these changes could be accomplished by a cultural transition, which additionally could increase dietary supply and thereby change selective constraints on energy efficiency.

    In this context, it is very significant that the latest Neandertals adopted Upper Paleolithic tool industries and other cultural elements usually associated with modern humans. This cultural transition may have decreased the selective advantages of endogenous European mtDNA variants and allowed the substitution of newer European variants of African derivation. In other words, it may have been the very process of adopting new cultural and demographic patterns that resulted in the selection against old Neandertal mtDNA, even within the European Neandertal population.

    So far, I have said nothing of what benefit the African-derived mtDNA variants may have provided. It seems likely that it was not related to cold (considering the Neandertals had plenty of time to become cold-adapted), energy (considering that the Neandertals appear to have had higher total energy expenditure than later people), or diet (since Upper Paleolithic people had broadly similar (if slightly different) diets to Neandertals).

    Instead, I would propose that the advantage of the African-derived mtDNA variants was in the one area (out of mtDNA-associated factors) where Neandertals and later humans significantly differ: longevity. It is not at all obvious that living longer is a better adaptation for humans as compared to the shorter lifespan of Neandertals. As a very recent adaptive change, it may have required fairly exceptional demographic conditions, such as large population sizes, a reliance on extensive trade networks, or other behavioral attributes of recent people. Only in such a cultural context can the survival of older individuals provide a fitness advantage to their younger kin.

    I do not think that the mtDNA change was the most important one; it probably followed many other genomic changes in favor of longevity. This would be similar to the effect of mtDNA variants in the face of climate or dietary differences today: no population was likely restricted from inhabiting the arctic by the lack of favorable mtDNA variants, but the fast mutation rate of mtDNA ensured that populations living in the arctic quickly gained new adaptive variants for their cold climates. Likewise, other genetic changes that led to a longer lifespan would quickly have led to mtDNA variants adaptive to the new demographic reality. The global human mtDNA variability likely reflects such trailing adaptive mutations. This might imply that the transition to greater longevity or other aspects of modern human life history would have been accompanied by not one, but multiple adaptive sweeps of global mtDNA variation.

    References:

    Ruiz-Pesini E, Mishmar D, Brandon M, Procaccio V, Wallace DC. 2004. Effects of purifying and adaptive selection on regional variation in human mtDNA. Science 303:223-226. Full text (subscription)

    Wallace DC. 2005a. The mitochondrial genome in human adaptive radiation and disease: On the road to therapeutics and performance enhancement. Gene 354:169-180. Full text (subscription)

    Wallace DC. 2005b. A mitochondrial paradigm of metabolic and degenerative diseases, aging, and cancer: a dawn for evolutionary medicine. Annu Rev Genet (Online before print)

  • How much sex selection is there?

    Fri, 2005-08-26 11:00 -- John Hawks

    I discuss biotechnology and society in my genetics course, and today I wandered across this working paper discussing sex control of offspring, including selective abortion in the US and abroad, preimplantation and prefertilization screening, and possible future effects of the technologies. I'm noting it here because of its inclusion of some numbers:

    Even in just the short time that these various methods of sex control have been available, they have had dramatic effects on sex ratios in many parts of the world. Generally, any variation in the sex ratio exceeding 106 boys born per 100 girls born can be assumed to be evidence of sex control. Here are just a few examples of skewed sex ratios around the world today (most recent figures provided). The sex ratio in Venezuela is 107.5, in Yugoslavia 108.6, in Egypt 108.7, in Hong Kong 109.7, in South Korea 110, in Pakistan 110.9, in Delhi, India 117, in China 117, in Cuba 118, in the Caucuses nations of Azerbaijan, Armenia, and Georgia, the sex ratio has reached as high as 120. While the sex ratio in the United States has remained stable at 104.8, certain American ethnic groups have seen a statistically significant rise in their sex ratios. In 1984, the sex ratio for Chinese Americans was 104.6 and for Japanese Americans 102.6; in 2000, these ratios had risen respectively to 107.7 and 106.4 (citation elided).

    On the subject of commercial application of sex control technologies, there is this:

    Today, sex-control services are openly advertised on the Internet, and sex control could in the future become a big business. Here's how Fortune magazine recently summed up at least the potential market for MicroSort alone: "Each year, some 3.9 million babies are born in the U.S. In surveys, a consistent 25 percent to 35 percent of parents and prospective parents say they would use sex selection if it were available. If just 2 percent of the 25 percent were to use MicroSort, that's 20,000 customersÉ. [and] a $200-million-a-year business in the U.S. alone." (Wadman M, "So you want a girl?" Fortune, Feb. 9, 2001)

    And I was actually quite taken by this almost poetic evocation of parenthood:

    The salient fact about human procreation in its natural context is that children are not made but begotten. By this we mean that children are the issue of our love, not the product of our wills. A man and a woman do not produce or choose a particular child, as they might buy a particular brand of soap; rather, they stand in relation to their child as recipients of a gift. Gifts and blessings we learn to accept as gratefully as we can; products of our wills we try to shape in accordance with our wants and desires. Procreation as traditionally understood invites acceptance, not reshaping or engineering. It encourages us to see that we do not own our children and that our children exist not simply for our fulfillment. Of course, parents seek to shape and nurture their children in a variety of ways; but being a parent also means being open to the unbidden and unelected in life (emphasis in original).

    I think that pretty much sums up my feelings about children.

    Yet it is one way of looking at the issue that comes into direct conflict with other cultural forces, such as the intense pressure to have a son, at least for many families. Those pressures differ with different cultural backgrounds, but they are pretty much present at some level everywhere. People's attitudes about biotechnology ultimately reflect deep cultural divisions between different -- sometimes irreconcilible -- goals (via Althouse).

    UPDATE: This later post discusses the alternative hypothesis that the elevated male sex ratio in many populations may be explained by hepatitis B infection rates. That hypothesis came after the 2003 working paper discussed in this post, and the issue at present seems to be unresolved.

  • Selection on synonymous mutations

    Fri, 2005-08-19 00:03 -- John Hawks

    Here's an interesting thought:

    Background

    In mammals, contrary to what is usually assumed, recent evidence suggests that synonymous mutations may not be selectively neutral. This position has proven contentious, not least because of the absence of a viable mechanism. Here we test whether synonymous mutations might be under selection owing to their effects on the thermodynamic stability of mRNA, mediated by changes in secondary structure.

    Results

    We provide numerous lines of evidence that are all consistent with the above hypothesis. Most notably, by simulating evolution and reallocating the substitutions observed in the mouse lineage, we show that the location of synonymous mutations is non-random with respect to stability. Importantly, the preference for cytosine at 4-fold degenerate sites, diagnostic of selection, can be explained by its effect on mRNA stability. Likewise, by interchanging synonymous codons, we find naturally occurring mRNAs to be more stable than simulant transcripts. Housekeeping genes, whose proteins are under strong purifying selection, are also under the greatest pressure to maintain stability.

    Conclusion

    Taken together, our results provide evidence that, in mammals, synonymous sites do not evolve neutrally, at least in part owing to selection on mRNA stability. This has implications for the application of synonymous divergence in estimating the mutation rate.

    That's the abstract of a study (free text online) in Genome Biology by J. V. Chamary and Laurence D. Hurst.

    What is the net effect of such selection? The short answer is nobody has any idea. Consider:

    The substitution rate at synonymous sites in exons is often used as a measure of the mutation rate [8,9]; however, this assumes neutral evolution of synonymous mutations [1,2]. By providing a parsimonious mechanism by which selection could act on synonymous sites, we can ignore the objection that prior evidence is indirect. Nevertheless, it is presently unclear to what degree synonymous mutations are favored or opposed by selection due to their effects on mRNA stability. Without being able to quantify the latter, as well as the net effect of other biases (for example, splice-associated), it will not be possible to directly estimate the extent to which use of the synonymous substitution rate leads to underestimates of the mutation rate and the mutational load.

    And:

    Indeed, it is quite possible that there exist no preferred codon within a gene while at the same time synonymous mutations are under selection. More generally, a complex set of trade-offs between different forms of selection and mutational biases may render interpretation of patterns of codon usage very difficult.

    It seems possible to me that selection on mRNA stability may allow certain kinds of fine-tuning changes, analogous to selection on promotors or inhibitors. If the half-life of an mRNA within a cell could be decreased slightly, it might well have an adaptive (or conversely maladaptive) result. Would it make enough difference to be selected? Maybe not in most cases, but in some cases it might well do so. And Charmary and Hurst are able to show that genes subject to strong purifying selection appear to have greater constraint on mRNA stability -- in my view the most persuasive of their arguments for the effect.

    It also occurs to me that selection on mRNA structural stability would also be a consideration for the origin of the genetic code. Certain patterns of redundancy in silent sites would potentially be targets of selection, for long-lasting or shorter-lasting RNA codons as options for the same amino acid.

    It just goes to show how many real unknowns we still face when looking at molecular evolution. If synonymous mutations aren't really neutral, what will we discover next?

    References:

    Chamary JV and Hurst LD. 2005. Evidence for selection on synoymous mutations affecting stability of mRNA secondary structure in mammals. Genome Biol 6:R75. Free full text online

  • Kreitman on human mtDNA selection

    Wed, 2005-06-29 12:45 -- John Hawks

    This is an old paper that I ran across today, a review of tests of selection with application to humans. Martin Kreitman is well known as a specialist in the population genetics of selection. The abstract says this:

    Attempts to understand the nonequilibrium configuration of silent polymorphism in human mitochondrial DNA illustrate the difficulty of distinguishing between selection and alternative demographic hypotheses. The range of plausible alternatives to selection will become better defined, however, as additional population genetic data sets become available, allowing better null models to be constructed.

    And the conclusion says this:

    An instructive example of this problem lies in the interpretation of human mitochondrial nucleotide polymorphism. In a very insightful paper, Di Rienzo and Wilson reported that the genealogy of mitochondrial sequences in non-Africans was more starlike in shape than might be expected under neutrality and that the distribution of pairwise differences was Poisson shaped (20; also see 74). Di Rienzo interpreted this apparent departure from neutrality as an indication of recent population expansion. Theoretical treatment of the problem provided additional support for the expansion hypothesis (90), but a bottleneck at ~50,000 -- 100,000 years ago, possibly caused by the selective sweep of a favorable allele, could not be rejected.

    Mitochondrial DNA has been assumed to be nonrecombining (but for evidence of recombination, see 4, 24); the sweep of a favorable mutation anywhere in the mitochondrial genome will cause the fixation of a single haplotype. Support for the selection hypothesis has come from the analysis of nuclear encoded genes. The nuclear genome shows little evidence for a skew towards rare alleles (18, 37, 38, 42, 83, 106), and thus towards a negative Tajima's D, as predicted under the population expansion hypothesis.

    Theoretical investigation of bottlenecks and subsequent expansions (25) shows, however, that Tajima's D can be negative or positive depending on the size of the bottleneck and the timing and magnitude of an expansion. Given that the mitochondrial genome has a smaller effective population size (being maternally inherited and effectively haploid) than the nuclear genome, the conflicting portraits of polymorphism in the two genomes may be consistent with a population bottleneck (25). The exciting possibility of a selective sweep in the modern mitochondrial genome remains, unfortunately, an unresolved issue (Kreitman 2000:553).

    I was happy to run across this reference that I previously missed, and so I'm posting it for others. It's good to read a review that appreciates the difficulties of detecting selection and distinguishing it from demography. The final paragraphs are sobering:

    The only current safeguard against gross misinterpretation of test results vis-a-vis selection vs historical demography is to have an a priori hypothesis about the type and direction of selection that are expected for the locus under investigation. The previously described work on Duffy provides a good example of this approach (37). There are two reasons to hope, however, that the situation for analyzing human polymorphism data sets will improve. First, as additional data sets accumulate, a reduction in the number of plausible historical demographic scenarios will be possible. The specific range of parameter values, for example, allowing mitochondrial genes but not nuclear genes to differ in the observed frequency spectrum of mutations may be shown to be unrealistic. Second, population history, whether it involves ancient bottlenecks, recent expansions, or specific population movements, affects the polymorphism of all nuclear genes equally. From a practical perspective, this means that the common signatures of human history on genetic variation should yield to the avalanche of data expected in future polymorphism studies. Better data mining techniques and sharper theoretical predictions are needed, however, to make this a reality (Kreitman 2000:553-554).

    It should be possible, in principle, to construct a realistic neutral model of human variation that takes into account major features of human history. Such a model would then serve as a null hypothesis, a selectively neutral backdrop, against which to look for evidence of natural selection in individual genes. In no other organism is this possibility likely to be achieved at the high level of resolution possible for humans. Our species, despite its low levels of nucleotide polymorphism, issues in ethical sampling of native populations, and the inability to control matings, may thus replace Drosophila species as the poster child for molecular population genetics.

    On a positive note, I think he's right that humans have become the best model for considering the molecular correlates of microevolution. We clearly know much more about ourselves than we do about other species, and it is hard to ignore evidence for long-term regional or local selective pressures.

    I wonder whether we are at a tipping point now, in 2005, compared to 2000. Have we reached the point where no single demographic hypothesis can explain both mtDNA and other genetic variation? Certainly we are, but there is more than that. Many autosomal genetic loci are inconsistent with each other in their pattern of variation. Selection has affected most areas of the genome in different ways. And some of those changes have been very recent.

    On the one hand, there is nothing surprising about this, since we know that humans have been evolving and still are. On the other hand, the universality of selection is not necessarily something anybody expected to find. We may need to ask ourselves, is there anything that doesn't bear the mark of selection on some linked site?

    References:

    Kreitman M. 2000. Methods to detect selection in populations with applications to the human. Annu Rev Genom Hum Genet 1:539-559. Full text online

  • Evaluating selection and demography in human evolution

    Tue, 2005-06-07 00:41 -- John Hawks

    Williamson et al. (2005) present a new mathematical method for deriving information about population size change and selection from the allele frequency spectrum of variation taken at multiple genetic loci. Their method depends on separating sites that are selected from those that are neutral, and thereby isolating the effects of demography from those of selection. They then apply their technique to human genetic data to derive estimates of the average selection on selected sites, and the timing and magnitude of population size change from nonselected sites.

    Oh, if it were really so easy.

    Selection

    To be fair, the paper states a major concern with accurately identifying selection in the context of species like humans and Drosophila that have experienced recent population growth. In other words, the main interest is not in deriving evidence about human prehistory, but instead about making sure that estimates of selection are not biased by population growth.

    With respect to selection, their major conclusion is as follows:

    We find evidence that negative selection on nonsynonymous mutations is widespread, which implies that deleterious mutations make up a significant proportion of standing nonsynonymous variation. Exactly how this genetic variation contributes to phenotypic variation is a matter of considerable debate, especially for medically interesting phenotypes such as multifactorial genetic disease. Because deleterious mutations, by definition, have phenotypic effects, and because of the widespread nature of negative selection on nonsynonymous mutations, it seems likely that negatively selected, generally rare nonsynonymous SNPs have some negative impact on human health. If there is a general relationship between nonsynonymous polymorphism and human genetic disease, then our genomic estimates of the fitness effects of different types of mutations contain prior information about the likelihood that a mutation contributes to disease. It may be possible to use this information to aid in identifying SNPs that cause disease. Other studies have suggested this approach (e.g., Livingston et al. 2004), but it was unclear which of the many measures of exchangeability to use. We feel that the relative fitness of different amino acid changes is the best way to evaluate exchangeability, and we have done that here by using a model that includes demography and selection (Williamson et al. 2005:7887).

    Readers may note that other studies have found evidence for a very high proportion of positive selection across the human genome (discussed in this post). The test applied in the current paper is not well suited to detecting evidence of positive selection, particularly if it is widespread, because it depends on the difference in frequency spectra between "selected" and "neutral" sites. Why the scare quotes? Because although noncoding sites or synonymous SNPs may well be neutral in the literal functional sense of not being targets of selection, it is impossible to verify that they are unlinked to selected sites. For the purposes of detecting negative (purifying) selection, this is not such a problem, because linkage will affect nearby sites only weakly (although this weak effect, called "background selection," may well influence the average level of variation in Drosophila).

    In any event, even if positive selection has been very common across the genome, most sites that have been subject to positive selection should have been fixed long ago. Only a few should still be under selection now, and these are predominantly very recent mutations.

    Consider the following scenario. The study considered 301 human genes. According to common knowledge, repeated here, positive selection leads to a relative excess of high-frequency alleles, compared to the predictions of neutrality (which predicts that there should be very few high-frequency alleles). But these high-frequency variants represent only a small proportion of the total number of genes currently under positive selection, since an allele being driven to fixation passes through every intermediate frequency, not merely the high ones. To detect evidence for positive selection, this study would have to find dozens of high-frequency variants in excess of neutral theory, representing scores of selected genes. But suppose instead that only one positively selected gene actually was in the sample. If so, then out of the human genome of approximately 20,000 genes, we might expect to find 60 or 70 genes currently under positive selection. In our fictive scenario it would be rash to extrapolate from a sample of 1, but in fact there are good reasons to think the true number is much higher. One such gene might take 1000 generations to transit from its appearance to fixation. There have been 100,000 generations in the 2 million years since the origin of our genus, and at least 300,000 since our divergence from chimpanzees. In other words, the complete transformation of the human genome by positive selection, altering thousands of genes -- or even all of them, multiple times -- would be far from detectable by this test.

    But remarkably, this test does find evidence for positive selection -- in noncoding substitutions! The authors put it less sensationally: "Interestingly, we find marginal evidence for weak positive selection on noncoding indel polymorphisms" (7885). I have no explanation for it. But if there actually is a statistically detectable excess of high-frequency variants for these polymorphisms, it may reflect selection at linked sites, or issues with the composition of the sample. If the level of positive selection is detectable, it is another strong evidence of the power of such selection over the long timespan of human evolution.

    In contrast to positive selection, even very strong purifying selection may leave low-frequency variants within the population for a long time. These variants are picked up within samples in large numbers. Low frequency variants are predicted to make up most genetic variation under neutrality, so the proportion of such variants is always a substantial part of the sample. High numbers make for powerful tests. For the human data examined in this study, the nonsynonymous coding sites have a higher proportion of low-frequency variants than do the noncoding, synonymous sites. Thus, they provide strong evidence of negative (purifying) selection.

    Human demography

    So the results of the method applied to selection are mixed. It detects the weak force of purifying selection strongly; it detects the strong force of positive selection weakly. But as the authors perceptively note, the inference of demographic history and the inference of selection are not independent of each other. Therefore, the inferences about demography are in part subject to the weaknesses in detecting the effects of past selection. This study shares this problem with all previous work that has attempted to estimate past human population size from genetic evidence.

    How can selection affect interpretations of demography? Here's one way: Positive selection occurs rapidly relative to rate of recombination between sites. This means that a selective sweep may affect a relatively large section of a chromosome, including many "neutral" sites. This is the principle behind John Gillespie's (2002) pseudohitchhiking, or "genetic draft" model of neutral evolution. In a nutshell, if positive selection has been common, there is no reason to think that genetic variation at noncoding sites provides any indication of demographic parameters. The current study (by Williamson et al. 2005) assumes that positive selection has not had such an effect, nor has any other force significantly affected the variation of neutral sites.

    These are the kinds of influences that have been suggested to result in the large difference between census population sizes (the number of individuals within living species) and estimates of effective population sizes (measures of the rate of genetic drift) in nature. In humans and in most other animal species, the rate of genetic drift on neutral sites appears to have been much stronger than the census population sizes of those species would predict. This is a systematic difference that leads species to have much lower genetic variation than would be expected if they evolved under genetic drift alone. At present, the relative importance of selection and demographic factors in leading to this systematic difference is unknown. I suspect that selection has been strongly important in this difference, others argue that demographic factors have been the most important.

    In most previous genetic work, the effective population size (denoted as Ne) is around 10,000 individuals. Some scientists have suggested that the human population actually was once that small -- that only a few tens of thousands of people once comprised the entirety of humanity. If this were true, then the human population must have expanded in size massively sometime in the recent past. The evidence for a recent change in the mitochondrial DNA molecule was once suggested to be evidence for this change in population size, which was inferred to have occurred during the Late Pleistocene, perhaps 50,000 years ago. From these estimates comes the scenario of an expansion from a single small African population beginning after 100,000 years ago, reaching Europe and the Far East by 30,000 - 50,000 years ago.

    Recently, it has become clear that a single massive expansion of a global human population cannot explain the pattern of genetic variation in living people. Simply put, the pattern of the 16,000 base pairs of the mtDNA molecule is not replicated by the 3 billion base pairs of the nuclear genome.

    To be sure, some genes do show a pattern of recent ancestry and apparent expansion. The FoxP2 gene, for example, has a recent common ancestor for living people (within the past 200,000 years), and shows strong signs that it has not evolved neutrally. If all other genes looked like this, it would be strong evidence of massive population growth.

    But most genes do emphatically not look like this. This has been understood for several years, following reviews by Molly Przeworski and colleagues (2000), Jeff Wall (2000), and even my own dissertation (Hawks 1999). Many genes show no excess of rare variants, most show only a slight excess. The average gene shows no sign whatsoever of a massive population expansion during the Late Pleistocene. This has been concluded most powerfully by recent genome-wide studies of SNP variation by Marth and colleagues (2003; 2004; reviewed previously in this post).

    Where does the current paper (Williamson et al. 2005) come in? Summarizing evidence from over 300 genes, this study does find evidence of a population expansion. Yes indeed -- a population expansion that happened 18,000 years ago! This expansion took the human population from a previous size of around 8000 individuals to a current size of around 50,000 individuals.

    Of course these estimates are far from realistic in anthropological terms. If anything, 18,000 years ago much of the human population should have been contracting rather than expanding. The idea that the human population could have been as amll as 8000 individuals (or very generously 100,000 individuals) during the LGM is simply ridiculous. By that time, the certain ancestors of living people were present from the western tip of Iberia to the edge of (or possibly well into) Beringia. If a genetic estimate cannot gauge a population that must have numbered several millions of people, it is time to stop talking about genetic estimates.

    To be fair, the demographic conclusions of the paper are phrased cautiously:

    Therefore, although we find it striking that the time of population growth (18,200 years B.P.) roughly corresponds with events in human history that may have induced population growth, such as the end of the last ice age and the origin of agriculture, we feel that our demographic inferences should be interpreted cautiously until the full range of plausible demographic models has been explored in one coherent framework (7887).

    At the same time, this apparently cautious discussion raises the more critical problem of a complete lack of communication or citation from any anthropologist. Hmm, I guess the last glacial maximum does roughly correspond to the "end of the last ice age and the origin of agriculture," in the usual manner of genetics confidence intervals. That is to say, it is only twice as old as either, so it might as well be the same.

    Speaking of confidence intervals, again in this paper there are none. No confidence intervals on the demographic estimates, no confidence intervals in the supporting text, no figure showing the likelihood surface, none, nothing, nada.

    What remains?

    In a sense, the bone I am picking is different from that pursued by Williamson et al. (2005). What I care about is evidence for ancient demography. What they care about is better quantifying selection. I think that their paper is incomplete on their own terms, because of the problem quantifying positive selection, but that it is a credible theoretical effort. In particular, the insights about the frequency of genetic disorders based on their findings are a likely contribution to the future study of genetic variation in coding gene regions.

    But the inclusion of demography in this study confuses much more than it clarifies from the perspective of the anthropologist. Its estimates of demographic changes are clearly false, and the lack of detail about confidence intervals makes them impossible to evaluate. In the face of this fatal problem, it is fair to wonder whether the apparent insights about purifying selection have any value.

    The main importance of the data from these many genes is what they do not show. They do not show an expansion of many orders of magnitude. They do not show a current effective size that is anywhere near the current human population size (or a size sufficient to settle any large part of the world). They do not show evidence for expansion coincident with an "out of Africa" movement of people, over 50,000 years ago.

    Instead, the conclusion is concordant with the discussion of Eswaran and colleagues (2005:3):

    Thus, the nuclear data do not consistently signal expansion, and when they do, the signal is of a mild expansion, perhaps reflecting only post-Pleistocene population growth associated with the spread of agriculture.

    The summary of current work is that we can completely exclude the hypothesis that "neutral" genetic variation in humans is explained entirely by past human population size. It simply cannot be true, because if it were, there should be strong signs of expansion that we do not in fact observe.

    On the other hand, perhaps genetic data may tell us something about past human population size, even if population size is not the only explanation for genetic variation. We might expect that some demographic changes may have influenced genetic variation in distinctive ways that could be separated from the effects of selection. If so, then the results of the current paper may be relevant. If natural selection -- especially purifying selection -- explains most rare alleles at nonsynonymous coding sites, then perhaps the residue of rare alleles at synonymous or noncoding sites is a sign of recent changes in demographic patterns?

    This possibility is suggestive, but it appears at present to be fairly far from the data. If unknown factors (which may include selection) have altered "neutral" genetic variation by an order of magnitude or more from their neutral predictions, then it is hard to believe that a relatively small change in population size will be accurately measured by any genetic observations.

    References:

    Eswaran V, Harpending HC, and Rogers AR. 2005. Genomics refutes an exclusively African origin of humans. J Hum Evol Online advance before print.

    Gillespie JH. 2000. Genetic drift in an infinite population: the pseudohitchhiking model. Genetics 155:909-919.

    Williamson SH, Hernandez R, Fledel-Alon A, Zhu L, Nielsen R, and Bustamante CD. 2005. Simultaneous inference of selection and population growth from patterns of variation in the human genome. Proc Nat Acad Sci USA 102:7882-7887. PNAS online

  • Diet and mtDNA selection

    Sat, 2005-02-26 18:22 -- John Hawks

    Lowell and Shulman (2005) report on the possible links between the metabolic defects underlying type 2 diabetes and mitochondrial dysfunction. These links go through two channels. In the first, decreases in mitochondrial activity in older adults were associated with higher levels of triglycerides in muscle and liver tissue as well as greater insulin resistance in muscle tissues. This observation supports the hypothesis that mitochondrial oxidation of fatty acids becomes less effective in older individuals, "which in turn lead[s] to increases in intracellular fatty acid metabolites...that disrupt insulin signalling'' (384). It is not clear whether this alteration is due to mitochondrial loss or reduction in function, but the authors suggest based on several other studies that there may be a connection with an accumulation of mtDNA mutations in elderly individuals.

    The second channel involves the secretion of insulin by beta cells in the pancreas. In individuals with insulin resistance, the body can sometimes adapt to greater insulin requirements by ramping up the production of insulin in the pancreas. This pathway of insulin secretion depends on the mitochondrial metabolism of the beta cells. This connection has been established by the fact that mtDNA mutations can induce hereditary diabetes by causing beta cell dysfunctions.

    Changes in fatty acid metabolism would likely be necessary at least twice during the evolution of early humans. With a dietary change toward greater meat eating, either at the origin of the habilines or that of early large-bodied Homo, a greater dietary availability of animal fats and focus on those resources might well have driven a selective change in digestive metabolism. The highly meat-dependent diet of people in the northern extremes, including the Neandertals, would have focused most digestive and metabolic resources toward animal protein and fat, and might have required additional changes. Then, a shift from a Neandertal-like diet to a broader diet during the Upper Paleolithic might well have required an additional change. It is not obvious that these shifts occurred globally, and there may well have been regional differences in meat digestion and metabolism based on local selection due to dietary differences. If the mtDNA was one of the genetic regions affected by such selection, there may well have been a very complex pattern of evolutionary changes in this molecule across the human lineage. This could account for changes within the Neandertal lineage, as well as the apparent replacement of Neandertal mtDNA by a type prevalent in recent humans.

    References:

    Lowell BB and Shulman GI. 2005. Mitochondrial dysfunction and Type 2 diabetes. Science 307:384-387.

  • The probability of parallel evolution

    Sat, 2005-02-19 13:52 -- John Hawks

    Orr (2005) considers the likelihood of the same mutants being fixed in two populations as a function of parallel selection, compared to drift. The model used is a very simple one, basically involving a single locus in each population with a limited number of advantageous mutants that may be presented to both populations.

    The argument for the idea that beneficial mutations are limited is probably right:

    Throughout this analysis, I make a major assumption: the number of beneficial mutations is small. This will almost certainly be true for two reasons. First, environments are autocorrelated through time, making it unlike [sic] that a previously highly fit wild-type allele would suddenly plummet in relative fitness; second, random changes in a functional protein are much more likely to worsen than to improve protein function (216).

    The result of the paper is that parallel evolution is likely under such circumstances. This is not especially surprising, and the innovative aspects of the paper are the demonstration that this is true under many models of the distribution of fitnesses of mutations. The equations in the paper are derived from extreme value theory, with the basic theme being that the fittest possible new mutations are also the rarest, so these will preferentially be incorporated into populations.

    Does this study apply to natural populations? Even most closely related populations typically differ in ecology in some respects, so it is hard to say that the model where mutations have the same fitness characteristics in two different populations is always relevant. Likewise, over the long term it is likely that a natural population will be as near to an optimum allele as is practicable. That is to say, the argument above that wild-type alleles are unlikely to plummet in relative fitness, carried to its logical extreme, would predict that any natural population of substantial size would already have had the opportunity to explore all the adaptive space available to it by recurring mutations.

    Only in fairly unusual circumstances will populations be limited from achieving higher fitness (for any single gene) because mutations don't occur often enough. Instead, they will be limited by the fact that the mutations that do occur are never more adaptive than the current wild-type. The unusual circumstances would include cases in which the adaptive landscape really is complex; for example, where the phenotypic characters influenced by the gene are themselves subject to complex patterns of stabilizing selection. Here, the possibility for stepped advantages among many genes creates the opportunity for a progression of mutations. That is to say, many genes that interact with each other are all highly optimized and adaptive mutations at each of them are incredibly rare. But when an adaptive mutation occurs at one of these genes, it may shift the interaction in ways that make a new (perhaps recurring and previously neutral or deleterious) mutation at one or more of the other genes more likely to be adaptive. In this way, a highly polygenic trait might be mutation-limited in its evolution, while no individual gene can be said to be mutation-limited.

    References:

    Orr HA. 2005. The probability of parallel evolution. Evolution 59(1):216-220.

  • How much selection does it take?

    Wed, 2005-02-02 23:58 -- John Hawks

    I was involved in a discussion this weekend that I think reveals much about the current state of evolutionary genomics. The forum was the "Neanderthals Revisited" conference at NYU, although I might have had the same discussion almost anywhere. Earlier, I had given my presentation about the importance of selection when considering Neandertal relationships. My point was that ignoring selection leads to a preordained result, since a small amount of selection can have the same effect as very extreme hypotheses of demographic change. I applied to both morphological examples and mitochondrial genetics as examples of the way that small magnitudes of selection can have great influence on the pattern of variation. Needless to say, this line of argument did not go over very well with geneticists who had previously been engaged in mitochondrial research, employing the paradigm that assumes that mtDNA is neutral.

    The comment that surprised me quite a bit (and I can say that at least a few others in the crowd were surprised as well) was given the next day by Mark Stoneking, a geneticist at the Max-Planck Institute for Evolutionary Anthropology in Leipzig. Stoneking was commenting on research the Leipzig lab has been doing in the area of regulatory control across the genome. His observation--and this was the surprise--was that regulatory changes in the genome appear consistent with the hypothesis of neutral change over time. In other words, his argument was that neutral change was the predominant mode of genomic evolution leading to living humans. He used this as a point of departure to suggest that a neutral evolution of phenotypes during the course of human evolution was likely productive hypothesis to pursue when considering the pattern of variation in ancient hominids.

    I must say that I was fairly stunned by this statement. It really struck me as being inconsistent with my knowledge of genomic variation in humans. So I looked up some of the recent research from the Leipzig lab, and compared it with the papers that I had been aware of that examined the level of positive selection responsible for human evolutionary change.

    The promise of evolutionary genomics toward driving information about the pattern of selection leading to human-specific traits lies mostly in the comparison of human genetic sequences with those of other primates. By examining the rate of evolution for different genes and parts of genes, it is possible to address whether the changes responsible for human phenotypic evolution were mainly due to changes in coding regions of genes, regulatory elements, or other regions. And by comparing genes with each other, it is possible to judge whether most of the changes were concentrated at a few important genes, or whether they were more broadly distributed across the genome.

    But most important is the need to discover exactly what types of mutations were common in the human lineage, and for that matter in the chimpanzee lineage and other primate lineages as well. For example, a gene that has repeatedly experienced positive selection during the course of human evolution will show a greater difference between humans and chimpanzees in whatever kinds of changes were under selection. This might mean that the gene will exhibit a greater degree of the amino acid substitutions then expected at random from the total number of mutational changes. It might mean that the certain areas of the gene, such as upstream regulatory regions, will exhibit more divergence than others.

    Geneticists have developed tests for these different patterns of departure from neutrality. They can test whether the number of amino acid changes is significantly higher than expected given a certain rate of mutations. They can test for equality of rates between different genetic loci. And they can test directly for violation of mutation-drift equilibrium. It is this last test that is violated by human mitochondrial DNA, for example, which leads me along with many other geneticists to believe that the molecule has been under positive selection during human prehistory.

    But there are problems in applying tests of neutrality to genome-wide questions of the abundance or frequency of positive selection. Tests of neutrality are notoriously conservative, meaning that it is difficult for genetic data to demonstrate that selection actually took place. One of the reasons why tests of neutrality are so conservative is that the effects of positive selection are actually counterbalanced by other forms of selection on the genome. For example, if the coding sequence of a gene has been under positive selection during evolutionary history, then one expected effect on the distribution of variation is that the number of amino acid changes separating two species will be relatively high, especially compared to the number of amino acid differences noted within each of the species. In other words, one of the species or both of them should have been driven further apart by selection on the amino acid sequence than we would expect from their neutral level of variation. But exactly the opposite effect is expected for purifying selection. As purifying selection consistently eliminates new amino acid mutations, it should leave to species relatively close to each other in their amino acid sequences compared to the level of differences within each of those species. What this means is that if both positive selection and purifying selection have affected the gene over the course of its evolutionary history, the two opposing forces will to some extent cancel each other out. The effect of this cancellation causes an underestimation of the rate of positive selection, because purifying selection is usually continuous and positive selection is much more episodic. In essence, for most genes we may predict that the positive selection never happened, at the same time we are estimating that it purifying selection is substantially weaker than actually was.

    This is the point raised by Justin Fay and colleagues (2001). Their paper was principally concerned with estimating the rate of negative, or purifying, selection across the human genome. Their research was motivated by the question of what the typical effects of mutations are--are mutations usually neutral, or they usually deleterious? In setting out to answer this question they realized that the rate of negative selection could not be independently estimated without first considering the effect of positive selection on the same genes. To estimate the rate of positive selection, Fay and colleagues devised a test that depended upon dividing polymorphisms into three subsets. One subset consisted of polymorphisms at low frequency (

    In a review article in Nature, Sean Carroll (2003) applied this estimate of 35 percent adaptive substitutions to gain an understanding of the number of selected changes during the past 5 to 7 million years. Based on our understanding of a subset of human genes, Carroll extrapolated that approximately 200,000 amino acid changes have occurred on the human lineage during human evolution. If 35 percent of these changes were positively selected, then this number implies that some 70,000 adaptive substitutions happened during human evolution. This is a stunningly large number. It evens out to over two adaptive changes for every one of our 30,000 genes, or one adaptive change per every hundred years. If these were evenly distributed over time, then any time period of 10,000 years during our evolution was likely to have seen 100 adaptive substitutions underway. Nor does the figure of 70,000 include the adaptive changes in regulatory elements and other non-coding portions of the DNA.

    An independent line of inquiry has provided support for the idea that human genes have been regularly under positive selection. Vallender and Lahn (2004) review a list of genes that are now believed to have been under repeated positive selection during the course of hominoid evolution. Some of these display strong evidence of selection during the evolution of ancestral anthropoids or hominoids, others apparently have been under selection during the course of human evolution during the past seven million years or less.

    The work of the Leipzig lab that Stoneking was referring to is reflected by the paper by Hellmann and colleagues (2003). The innovation of this research is the ability to compare human genomic regions with the same regions in chimpanzees. In contrast to earlier studies like that of Fay and colleagues (2001), who relied mainly upon macaque comparisons, this study gives a very close analogue as a comparison sample for human variation. But unlike the earlier study, Hellmann and colleagues (2003) basically ignore the issue of positive selection on the genome. They do not examine the ratio of amino acid changing variants for different frequency sets, nor do they attempt to quantify the number of adaptive substitutions in humans as compared to chimpanzees. The information in the data that would address positive selection is therefore hidden by the overall pattern of purifying selection against deleterious mutations that the researchers find.

    One interesting addition to our information about positive selection is present in Hellmann and colleagues' data, however. They find that the 5' untranslated region of the genes in their study is significantly more divergent between humans and chimpanzees than other parts of the genes. This observation is consistent with the idea that this region has been under positive selection in many of these genes. Presumably the source of this positive selection is adaptive change in regulatory elements upstream of the coding regions of these genes. If this pattern is widespread, it offers an additional large number of episodes of positive selection beyond those necessary to explain the pattern of amino acid changes in humans.

    So the data from the Leipzig lab do not contradict the findings of high levels of positive selection on human genes. They do confirm the idea that tests of neutrality will not pick up evidence of positive selection when purifying selection against deleterious mutations has also been acting on the genes. These data do not suggest that genetic drift has been the predominant force affecting human evolutionary change. Instead, purifying selection is shown to be a very strong force affecting the current variation of most genes, while the same estimate of positive selection made by Fay and colleagues (2001), added to the evidence for positive selection on 5' untranslated regions, are applicable to these data.

    What are the stakes in this inquiry? That is, why does it matter what the level of positive selection has been in human evolution or the evolution of any other lineage?

    One implication of a very high rate of adaptive evolution is that most of the molecular changes affect cellular metabolism, homeostatic processes, and other small-scale molecular features of the cell and organism. This conclusion stems from the idea that there just have not been that many changes in gross structural and anatomical aspects of organisms to require tens of thousands of selected changes. Carroll (2003) goes so far as to suggest that developmental genes may have been relatively conserved compared to the molecular processes that account for this widespread adaptive evolution. Vallender and Lahn (2003:R245) say, "Many other aspects of human biology not necessarily related to the 'branding' of our species, such as host-pathogen interactions, reproduction, dietary adaptation, and physical appearance, have also been the substrate of varying levels of positive selection."

    The most important implication of a high rate of selection, at least to me, is that ancient demography is likely not a major cause of human genetic evolution and differentiation. John Gillespie's work has made clear over the past five years that positive selection can explain the pattern of genetic variation in many species. In essence, he explains a low level of polymorphism in most species as the possible result of widespread positive selection and linkage between selected and neutral sites. As large areas of the genome undergo genetic hitchhiking (reductions in variation caused by linkage to positively selected sites), the variation ultimately becomes greatly limited in ways that resemble the effects of genetic drift in a small population. The difference is that with positive selection and linkage, the level of polymorphism that can persist in a population has little if any relation to the size of the population. This theory is called "genetic draft."

    If the variation of most human genes is limited by selection across the genome, then it follows that genes cannot be used to estimate ancient population size or other demographic characteristics.

    Likewise, if positive selection on human genes is common enough, it appears very likely that some areas of the genome often used for demographic inquiry are themselves direct targets of selection. The most obvious are the mitochondrial DNA and the Y chromosome, both of which are completely linked over their entire lengths. For example, the Y chromosome contains around 80 genes. If these were selected at the rate typical of nuclear genes in the rest of the genome, then we can estimate that the Y chromosome underwent some 200 adaptive substitutions during the past 7 million years, or one in approximately 35,000 years. In this context, it is interesting that the most recent human ancestor of human Y chromosomes appears to have lived within the past hundred thousand years (and possibly less), and that the genetic sequences thus far studies show clear violations of neutrality. This has previously been considered evidence of a large-scale demographic replacement in recent human evolution, but it is better explained as the consequence of positive selection on the Y chromosome. A similar line of argument could apply to the mtDNA, although estimates of the frequency of adaptive changes are more difficult because the mtDNA has a substantially different rate of mutations compared to nuclear DNA.

    If positive selection is such a common force in human evolution, then the possibility is clearly opened that many of the genetic differences among human populations are the result of local positive selection. We have tended to examine interpopulation differences under the assumption that genetic drift and migration are the only important parameters. But if selection plays an important role in diversifying populations, then we can expect that the level of genetic differences (for example Fst) between populations has little to do with classical correlates of genetic drift, such as population size. Instead of drift-migration equilibrium, the important factor is migration-selection equilbrium, probably constantly changing in a dynamical sense.

    What mysteries remain?

    Although it appears that positive selection has been very common, this says little about the distribution of such selection across the genome. Evidence from some genes and sets of genes suggests that they have been subjected to repeated episodes of adaptive evolution during the course of primate evolution. The study by Dorus and colleagues on the adaptive evolution of brain-related genes provides an example of the way that positive selection has been focused on some areas of the genome. Certainly some individual genes have probably undergone dozens of instances of positive selection--probably including many genes involved in host-pathogen interactions.

    If positive selection is really so common across the genome, then a greater frequency of selection in the human lineage as opposed to other animal species might explain the level of human genetic variation. But if positive selection has been so common, then why do some animal genes appear to have fairly great variation? For example, why do so many animal species have relatively great mtDNA variation, when the mitochondrial DNA appears to be a very likely target of selective sweeps?

    References:

    Carroll SB. 2003. Genetics and the making of Homo sapiens. Nature 422:849-857.

    Fay JC, Wyckoff GJ, Wu CI. 2001. Positive and negative selection on the human genome. Genetics 158:1227-1254.

    Hellmann I, Zollner S, Enard W, Ebersberger I, Nickel B, Paabo S. 2003. Selection on human genes as revealed by comparisons to chimpanzee cDNA. Genome Res 13:831-837.

    Vallender EJ, Lahn BT. 2004. Positive selection on the human genome. Hum Mol Genet 13:R245-R254.

Pages

Subscribe to natural selection

Neandertals

For years, I've worked on their bones. Now I'm working on their genes. Read more about the science studying these ancient people.

Denisova

From a finger bone of an ancient human came the record of a completely unexpected population. My lab is working on the science of the Denisova genome.

Acceleration

The advent of agriculture caused natural selection to speed up greatly in humans. We're uncovering some of the ways that populations have rapidly changed during the last 10,000 years.

Malapa

Just outside Johannesburg, the Malapa site is producing some of the most exciting finds in human evolution. This site is the headquarters of the Malapa Soft Tissue Project.