john hawks weblog

paleoanthropology, genetics and evolution

recent selection

  • Inversion biology

    Tue, 2010-04-06 12:52 -- John Hawks

    Razib Khan's post at his new digs (Discover blogs) about the 17q21 inversion is worth reading for anyone interested in the complexities of discovering the history of genes in populations ("The many lives of an inverted genetic region").

    The inversion in this region, common in Europeans, was described in 2005 as one of the earliest clear examples of recent positive selection determined from genotype data alone. It appeared that the apparently selected allele had diverged quite a long time ago from the wild type, leading to the hypothesis that it had sojourned in some ancient species of hominins before re-entering the human population by introgression.

    But complexities followed -- for one thing, the region has repeatedly undergone inversions in other primate lineages. Now, as Razib points out, it looks like the inversion isn't all that old.

    It's all very curious...

  • Selection's genome-wide effect on population differentiation

    Sun, 2010-03-28 08:30 -- John Hawks

    Alon Keinan and David Reich [1] have tested an obvious prediction of the hypothesis that recent selection has had a major effect on variation across the genome, and in doing so have provided some strong support for our hypothesis of a recent acceleration.

    A new mutation that increases rapidly under positive selection will carry with it a lot of nearby variants that are physically linked to it. The region of this "genetic hitchhiking" will depend on the local rate of recombination -- the lower the recombination rate, the longer the extent of the hitchhiking region.

    Meanwhile, a new mutation takes a while, sometimes many thousands of years, to spread widely beyond its population of origin. We can measure population differences for a single locus as FST. The FST attained by a new selected variant depends on what frequency it has reached in different populations. For many selected alleles, they have not yet attained high frequencies anywhere, and so FST is low. But for a few, the selected variant has reached a high frequency in a few populations, but remains rare elsewhere. These are recognizable as high FST loci.

    What is true of the selected allele itself will also be true, to a lesser extent, of the linked haplotype that is hitchhiking along with it. And so, if selection has been sufficiently common in recent human history, there should be a relationship between the local rate of recombination and measures of population differentiation like FST.

    Which is exactly what Keinan and Reich found.

    Further, they found that this relationship is true of regions of the genome that contain a lot of coding loci, and much less true of gene-poor regions:

    We cannot envision any demographic or mechanistic explanation that would produce a correlation between recombination rate and allele frequency differentiation as observed and we hypothesize that our observations reflect a history of natural selection. Natural selection is usually expected to increase population differentiation at linked neutral sites, an effect that is expected to extend over longer physical distances in regions of lower recombination rate. A prediction of an explanation based on natural selection is that the effect would be more marked in regions that are more likely to be influenced by selection, such as genes.

    The observed FST in these categories is not super-high -- we're not looking predominantly at genes for which more than 20 percent of the variation is the between-population component. Therefore, the comparison can encompass quite a bit more of the selected variation in the genome, instead of the extremely stringent cutoffs required to identify an individual candidate gene. It's a bit like getting a measure of wind speed as opposed to looking at the few highest-flying kites.

    Perhaps the most interesting aspect of the study is that they compared the Phase 3 HapMap samples, which include some pairs of nearby populations. They found that the apparent effect of selection on population differentiation was much higher for those nearby pairs of populations:

    In addition to qualitatively replicating our findings, analysis of HapMap 3 data allows us to generalize them to additional populations. A striking result is that the relationship between FST and recombination rate is stronger for FST between pairs of closely-related populations, whether within or outside Africa: FST between a West African sample and Maasai (of mixed West African and East African ancestry [57]) decreases by an average of 6% for every 1 cM/Mb (Figure 4D), FST between Italians and individuals of North-Western European ancestry decreases by 10% for every cM/Mb (Figure 4E), and FST between Japanese and individuals of Chinese ancestry decreases by 4% (Figure 4E). In view of the large effective population size in recent human history since each of these pairs of populations have split, these observations support the possibility that the different patterns observed between different pairs of populations are due to natural selection operating more efficiently in the context of larger population sizes.

    That's a direct sign, in other words, of the recent acceleration of positive selection in human populations. There are a lot more genes that are geographically circumscribed and low in frequency affecting FST at a more localized level, and fewer affecting major allele frequencies between continental regions. It's a neat comparison, and it helps to answer the comment that selection is somehow "weak", or insignificantly different from drift, because the new selected alleles haven't spread very far. The point is, most of them are so new that they haven't had time to disperse widely and reach appreciable frequencies very far from their origins.

    UPDATE (2010-03-28): A reader pointed out an error in the post; I had written "lower" recombination rate at one point that should have been "higher". I have corrected the text.


    References

  • Pigment controller OCA2

    Fri, 2010-03-05 15:23 -- John Hawks

    Razib: "OCA2 makes East Asians white and Europeans blue."He discusses a study out of Esteban Parra's lab in PLoS Genetics (open access), which characterizes a non-synonymous polymorphism in China that lies on a recently selected haplotype. The genotypes correlate additively with skin melanin -- a very different effect from the European new mutant allele.

    Edwards M, Bigham A, Tan J, Li S, Gozdzik A, et al. (2010) Association of the OCA2 Polymorphism His615Arg with Melanin Content in East Asian Populations: Further Evidence of Convergent Evolution of Skin Pigmentation. PLoS Genet 6(3): e1000867. doi:10.1371/journal.pgen.1000867

  • Mailbag: Hidden lactase persistence alleles?

    Sat, 2010-02-20 12:15 -- John Hawks

    I just noticed this new article that I thought you might be interested in, suggesting that lactase persistence known genetics can't currently wholly explain the trait in large regions around Africa and Europe:

    http://www.biomedcentral.com/1471-2148/10/36/abstract

    Thanks. I've been studying that one for a couple of days. It looks like pretty suggestive evidence that there should be some as-yet-undiscovered lactase persistence allele in West Africa, or maybe a gene conversion from one of the already-known African mutations.

    See also Razib on this paper:

    http://scienceblogs.com/gnxp/2010/02/what_genes_cant_tell_us_about.php

  • You are what your ancestors ate, part 1

    Fri, 2009-12-11 13:20 -- John Hawks

    Ann Gibbons has a long news article in the current Science reporting on an interdisciplinary conference on recent human diet evolution ("What's for Dinner? Researchers Seek Our Ancestors' Answers"). The article covers a lot of ground, from Michael Richards' work on the isotopic signature of diet in early Upper Paleolithic people, to Bill Leonard's work on diet adaptations in Siberian reindeer herders, to Jonathan Wells' work on maternal nutritional status and epigenetics.

    It's a good "why evolution matters to today's nutritional choices" article.

    A section of interest to me:

    The agricultural revolution favored people lucky enough to have gene variants that helped them digest milk, alcohol, and starch. Those mutations therefore spread among farmers. But other populations remained more carnivorous, such as the Saami of frigid northern Norway, whose ancestors herded reindeer. Among Saami ancestors, genes to digest meat and fat efficiently were apparently favored. One gene variant, for example, makes living Saami less likely to get uric acid kidney stones—common in people who eat high-protein diets—than are people whose ancestors were vegetarian Hindus and lack this gene variant, says geneticist Mark Thomas of University College London (UCL).

    I'll have more on a similar topic later -- recent shifts in genes due to agricultural subsistence has become a favorite subject of local interest. One would think I might get some funding from the Wisconsin dairy industry for this, but nothing so far...

    There is an unresolved tension in the article: Is there a better diet for everyone? Clearly some populations have undergone large recent diet changes with bad consequences; the same bad outcomes occur in some people despite possibly adapting to new diets for thousands of years. And yet, every metabolic or diet-related syndrome is variable, and we know that some genes related to digestion and metabolism have rapidly changed. "Westernization" is not as simple as it seems, nor is agriculture (or, for that matter, pastoralism) -- and the responses to each vary for stochastic reasons in different populations.

    It's a good interesting complexity, in a field where simple categorical statements can get a lot of attention.

  • Humans still evolving...

    Sun, 2009-10-25 22:45 -- John Hawks

    Time has a story about Stephen Stearns and colleagues' work characterizing ongoing selection using the Framingham Heart Study sample:

    If these trends were to continue with no cultural changes in the town for the next 10 generations, by 2409 the average Framingham woman would be 2 cm (0.8 in) shorter, 1 kg (2.2 lb.) heavier, have a healthier heart, have her first child five months earlier and enter menopause 10 months later than a woman today, the study found. "That rate of evolution is slow but pretty similar to what we see in other plants and animals. Humans don't seem to be any exception," Stearns says.

    I haven't had a chance to see the new study yet, and I'll do a little review when I get it. Jerry Coyne has some more information based on a preprint.

    My students have heard me say many times that it would take a sample of thousands of people to test the hypothesis of neutrality within today's population. Well, Framingham is one such sample, and it's not surprising that some things would be found significantly to affect fitness.

    The Time article mentions our work on recent evolution in a very positive way. Of course, the Framingham sample isn't suitable for testing what has been going on during the last 40,000 years; it is about mass selection on phenotypes in the present American population. That will involve mostly selection on standing variants, things that are already common in the population. Some of those may be things that were increasing in the past, others not -- some may even be reversals in direction compared to pre-industrial times. And there's no predicting how they might change in the future, as we continue to change our environment out from under ourselves.

    I've seen a few comments that we shouldn't trust the sample because it's unrepresentative, too small, etc. I think people may be overlooking the fact that the Framingham Heart Study is bigger than the census sizes of many species in nature. You can detect selection on phenotypes in this sample, and they surely know the heritabilities of many of them. But I'll have to see the paper.

  • Neolithic migrationism

    Tue, 2009-10-13 15:34 -- John Hawks

    Dienekes has a nice post about the relation of Neolithic Europeans, migration models, and how anthropological views of migration have changed over the last century. He starts with Carleton Coon, although he might have gone back substantially earlier.

    I'll note that Franz Weidenreich, writing shortly after the cited work by Coon, had a very different view of the essential data underlying migrationism, especially the trend toward brachycephalization.

    Anyway, he traces the move from full-on folk migration to "demic diffusion" and "acculturation" models, back through recent genetic work that suggests some substantial genetic replacement -- either by means of selection or folk migration/demographic expansion.

    We have come full circle. Once again, Paleolithic Europeans assume the status of survivors, as their typical lineages are observed in a small minority of modern Europeans. The evidence for widespread acculturation of European hunter-gatherers or their significant genetic contribution to incoming farmers along a wave of advance is just not there. Hunters and farmers possessed distinctive gene pools, and farmers expanded with barely a trace of absorption of hunter gene pools.

    With the India genetics paper from a couple of weeks ago, I think we're seeing that recent large-scale genetic changes are not limited to Europe.

  • "The worm in the fruit of the mitochondrial DNA tree"

    Thu, 2009-09-17 09:39 -- John Hawks

    François Balloux (2009) has a polemic in the online access area of Heredity presenting references about mtDNA selection, and arguing that the use of this single genetic marker is no longer warranted without support from other loci.

    Yay! I've been saying that both here, and in peer-reviewed articles, for several years. I think serious workers know that one gene is not enough; two genes (mtDNA and Y chromosome, for example) aren't enough -- we have to integrate information across every possible source, genetic, skeletal, and anthropological, to really test hypotheses about the past.

    Still, an industry of mtDNA sequencing has grown up, reviewing each others' grants and papers, and shutting down any discussion of adaptive changes. Balloux's commentary addresses this problem -- I'm going to quote the same paragraph as Dienekes:

    Let us assume I gave a seminar. I would tell the audience about my latest results on the population history of the pigmy shrew. My findings would be based on a stretch of DNA comprising several metabolic genes, showing no signs of genetic recombination. Armed with sequences from a large number of individuals sampled over a broad geographical area, I would make some inference on the colonization routes and times. To make life easier, I would restrict my analysis to the mutations I liked best, with nice names having been given to related sequences, rather than relying on dull mathematical quantities. As I reach one of the key conclusions of the lecture, which would go as follows: 'It is obvious from the distribution of haplotypes Amanda, Eugenie* and Hector_2 that the Outer Hebrides were colonised about 50,000 years ago, this was followed by considerable population fluctuations, a bottleneck during the last Ice Age, a swift recovery and a dramatic recent expansion over the last 200 years and...'. Imagine that, at that climactic stage I was interrupted by someone in the audience. The impertinent would say, 'Sir, can I just ask you whether this confidence in your conclusions may not be misplaced; your analysis is based on a single genetic marker, which comprises genes with a central role in metabolism and is thus likely to have been affected by natural selection'. An awkward silence may ensue, as I would find it difficult to dismiss this criticism easily.

    Well, let me tell you, I've been in dozens of audiences, and have raised that exact point. Here is a sample of the bogus responses I've gotten to this question:

    Bogus answer 1: There are no functional differences between humans and chimpanzees in the mtDNA, so it can't have been selected during human evolution. False, false false!

    Bogus answer 2: Metabolic processes are highly conserved, and humans couldn't have changed much. Hello? Have you noticed that your breakfast didn't exist in the Paleolithic?

    Bogus answer 3: But the pattern of variation can be equally explained by a bottleneck. Some aspects can, others can't so easily.

    Bogus answer 4: We examined only noncoding parts of the mtDNA, so there could be no selection. Yes, believe it or not, this is the most common response. I guess they don't teach people about linkage anymore.

    Bogus answer 5: There's little or no evidence of selection on any gene in recent human evolution. Human evolution may have stopped entirely. Oh, lord. Yes, I've gotten this one many times.

    There have been others over the years. Yet mtDNA is a big business -- people seem to be worried that the slightest criticism will bring down the whole thing like a house of cards. That's not true, even if mtDNA has sometimes been selected during human prehistory or history, that doesn't mean it isn't a useful marker for many purposes. But many seem more comfortable avoiding the issue entirely.

    I think that taking the hypothesis of selection seriously would improve most of the work in this field. The possibility of selection doesn't eliminate demographic interpretation -- for example, the high ancient African mtDNA variation allows us to test hypotheses about African demography before 50,000 years ago, and there the data appear to reject the hypothesis of selection, at least after around 150,000 years ago. Gene genealogies don't allow us to see the whole past, just the time and forces that they experienced. If we ignore one of the major forces, we are reducing our knowledge.

    There is an obvious problem testing the hypothesis of selection with mtDNA. When we consider any one single locus, it's always possible to find some demographic scenario that yields exactly the same predictions as selection. It's just a mathematical necessity -- selection is fundamentally a demographic phenomenon, and the increase in frequency of selected alleles looks similar to exponential growth of a small population.

    So what can we do? Fortunately we have lots of options. We can test the proposed demographic hypotheses against the historical record. When we make observations that show that people 1000 years ago had very different frequencies of common haplotypes, well, we know it was selection. There hasn't been any genetically significant bottleneck in the last 1000 years! When we see small Neolithic population samples dominated by haplotypes that are very rare today, again, no historically possible bottleneck could have caused that.

    Balloux with his colleagues (2009) has shown that one aspect of mtDNA patterning -- the association of haplogroup diversity with geography -- is very unlikely to have arisen by genetic drift. Here's part of their abstract:

    We show that populations living in colder environments have lower mitochondrial diversity and that the genetic differentiation between pairs of populations correlates with difference in temperature. These associations were unique to mtDNA; we could not find a similar pattern in any other genetic marker. We were able to identify two correlated non-synonymous point mutations in the ND3 and ATP6 genes characterized by a clear association with temperature, which appear to be plausible targets of natural selection producing the association with climate. The same mutations have been previously shown to be associated with variation in mitochondrial pH and calcium dynamics. Our results indicate that natural selection mediated by climate has contributed to shape the current distribution of mtDNA sequences in humans.

    They took a dual approach to testing the hypothesis of selection. First, they modeled the evolution of haplotype diversity under neutrality, and showed that the empirical distribution lies significantly outside that range of results. But even so, we might imagine some bottleneck scenario that would cause low diversity in high-latitude peoples, and this would be difficult to refute historically because many of those populations have poor historical documentation. But demography should have similar effects on other genes, and they were able to show that the rest of the genome doesn't share the mtDNA pattern.

    It's really not that hard to test demographic hypotheses, using comparative genomics and anthropological knowledge. That's what anthropological genetics should be doing more and more. There was a time when obtaining a reasonable sample of mtDNA was an accomplishment, and comparing that sample to other genes was not feasible. But that time is past, and hopefully the review process -- journals and grants -- will start demanding some integration of mtDNA phylogeography with results from the rest of the genome.

    Back to Balloux's conclusion:

    Exploiting these new resources of autosomal variation will present significant challenges, but it will not help overcoming them if a large fraction of the community of human population biologists persists in sticking to mtDNA as the marker of choice.

    Mitochondrial DNA isn't the tip of the iceberg -- it's an ice cube on top of the tip of the iceberg.

    Related:

    "Mitochondrial DNA selection review"

    "Mitochondrial DNA and sperm"

    "mtDNA selection in Iceland?"

    "Complete Neandertal mitochondrial sequence, and selection on human (not Neandertal) mtDNA"

    "Did Neandertals need better mitochondria?"

    "Has the dam broken on mtDNA selection?"

    Mitochondrial DNA adaptations in living human populations"

    OK, that's enough related posts. But you can find a whole lot more by searching the topic!

    References:

    Balloux F. 2009. Mitochondrial phylogeography: The worm in the fruit of the mitochondrial DNA tree. Heredity (advance online): doi:10.1038/hdy.2009.122

    Balloux F, Lawson Handley L-J, Jombart T, Liu H, Manica A. 2009. Climate shaped the worldwide distribution of human mitochondrial DNA sequence variation. Proc Roy Soc Lond B 276:3447-3455. doi:10.1098/rspb.2009.0752

  • Mailbag: Blue eyes and sexual selection

    Wed, 2009-09-16 08:26 -- John Hawks

    "Nobody 10,000 years ago had blue eyes," Hawks said. "Why is it that blue-eyed people had a 5% advantage in reproducing compared to non-blue-eyed people? I have no idea."

    I was thinking about this yesterday looking at someone's eyes and was wondering if it was as simple as blue eyes being pretty. I know we usually find as 'pretty' the things that have an evolutionary advantage (hips, muscles, etc).

    But what about the other way around operating also? If someone had very rare, lovely blue sapphire-like eyes (instead of brown, which is a much more common color in nature), wouldn't that person have appeared more special in the past?

    i've heard boys like blue because it's part of our training, to find water (and girls like pink to help find ripening fruits). we're predisposed to like the color, and it happened rarely, we mated more with those people, and hence the number of blue eyes increased dramatically?

    You describe Darwin's hypothesis, that blue eyes were sexually selected. It's a fair possibility. A problem with the hypothesis is that blue eyes are mostly recessive, meaning that most people who have blue eyes have two copies of the allele. That wouldn't happen after the allele first originated because there would have been too few people carrying the gene.

    Possibly the mutation's initial success was due to chance, and when it go common enough sexual selection took hold. Or maybe there was selection on some other phenotype correlated with the allele -- in which case we have yet to identify the actual target of selection.

  • Quote: Peter Turchin on the "bugbear" of randomness

    Sun, 2009-08-30 16:18 -- John Hawks

    I'll probably have some more material on quantitative analysis of dispersal in the few days. Here's a quote from Peter Turchin (1998:17-18):

    Of course, we do not know that animals truly move at random, like flipping coins to decide whether to turn right or left. Each individual could be a perfect automaton, rigidly reacting to environmental cues and its internatl states in accordance with some set of behavioral rules. However, even if this were true, we might still choose to model behavior of such animals stochastically, because we would not have the perfect knowledge of all the deterministic rules driving these animals. Even if we did, we might not want to include them all in our dispersal model, since such a model would have an enormous number of parameters and would require a very accurate representation of all environmental "micro-cues." The point is that randomness is a modeling convention. Because it is impractical, and not even helpful, to attempt to model individual movement deterministically, we use a more parsimonious probabilistic model.

    I'm pausing the quote to point out my boldface. It has become computationally feasible in the last few years to model enormously complicated scenarios with individuals acting pseudo-deterministically. The most popular use of such modeling is to try to constrain dispersal models by some geographic conditions, such as local habitat richness, rainfall, or altitude (see also, "One model, hold the extra parameters"). Of course, animals really do disperse in ways that depend on such geographic parameters. The question is whether any datasets are sufficient to test models involving so many parameters.

    This approach is aptly termed behavioral minimalism (Lima and Zollner 1996). In essence, we adopt a thermodynamic approach: the behavior of individuals is erratic, or irregular, but the redistibution process at the population level has many regular features. There is a direct analogy with with thermodynamic theory. The motion of each gas molecule is chaotic and essentially unpredictable, and can only be described probabilistically. When dealing with large numbers of molecules, however, the laws at the aggregate level are for all intents and purposes deterministic. Similarly, the problem of biological dispersal can be treated by starting with a probabilistic description of individual movements (in other words, formulating the problem as a random walk), and then approximating the redistribution process of the ensemble of individuals with a deterministic equation, diffusion.

    The effective scale of stochastic versus deterministic processes is important. I'm chiefly interested in the dispersal of adaptive genes in human populations, for which the deterministic approximation may be considered to have become more and more relevant over time, as the population sizes of regional populations grew. Still, the present pattern in many cases may reflect the stochasticity of populations from earlier time periods, when they were smaller. And formerly important deterministic processes, such as the adoption of agriculture, may no longer be directly observable. So how do we model variance?

    The thermodynamic approach to dispersal does not have to assume that the movement of each "particle" is completely random. The important feature of this approach is that we can control the degree of realism in the model. Environmental factors that have strong effects on movement can be included explicitly in the model, while other factors that have weak effects (or about which we have no information) are included in the stochastic component.

    This would incorporate the geographic modeling approaches mentioned above -- deterministic processes related to spatial variance of habitat or dispersal potential. But then the important step must be to find a minimal deterministic model to account for the data, and then test it with other observations -- such as more extensive genetic sampling, archaeological information, or historical documentation.

    References:

    Turchin P. 1998. Quantitative Analysis of Movement. Sinauer, Sunderland MA.

Pages

Subscribe to recent selection

Neandertals

For years, I've worked on their bones. Now I'm working on their genes. Read more about the science studying these ancient people.

Denisova

From a finger bone of an ancient human came the record of a completely unexpected population. My lab is working on the science of the Denisova genome.

Acceleration

The advent of agriculture caused natural selection to speed up greatly in humans. We're uncovering some of the ways that populations have rapidly changed during the last 10,000 years.

Malapa

Just outside Johannesburg, the Malapa site is producing some of the most exciting finds in human evolution. This site is the headquarters of the Malapa Soft Tissue Project.