john hawks weblog

paleoanthropology, genetics and evolution

Neandertal DNA

  • Archaic genome snooping from GWAS

    Tue, 2011-10-18 22:08 -- John Hawks

    The 23andMe blog reports on a recent genome-wide association study of type 2 diabetes in South Asian people: "SNPWatch: Genetic Variants Associated with Type 2 Diabetes in South Asians and Europeans". The study was published in August in Nature Genetics, by Kooner and colleagues [1]. As described in the post:

    The authors behind this study carried out one of the largest type 2 diabetes studies to date, scanning the genomes of nearly 19,000 people with the disease and 40,000 without it, all of South Asian descent. Their analysis identified six SNPs linked to this condition. When they combined their results with previously published findings in other ethnicities, they found suggestive evidence that five of the six SNPs were also associated with type 2 diabetes in European populations. Similarly, there was some evidence that the majority of the genetic risk factors in Europeans were also linked to disease in South Asians. Only three genetic factors were not shared at all between the two groups.

    Type 2 diabetes is presently a very interesting topic from an evolutionary viewpoint, and we're beginning to think about it very seriously now. Whenever I see a study like this, I quickly look at the Neandertal and Denisovan genomes to see if any interesting patterns emerge. Sharing GWAS SNP alleles is not necessarily very interesting, because the GWAS risk alleles are mostly not causative themselves; each may be linked to some causative allele that remains to be discovered. The linkage is a function of the evolutionary history of that chromosome region, and many of the key historical events that affect linkage happened within the last 10,000 years. So we really shouldn't expect GWAS alleles to be predictive of phenotypes in Neandertals or Denisovans.

    Still, these alleles are associated with disease in living people, and their genotypes in ancient humans may illuminate cases where the evolutionary history links the population across the gene networks that influence disease. A closer examination of the genealogy around these loci will be more informative, but as a first look I often just genotype the archaic genomes for SNPs in a study. The six SNPs reported here include two cases where the archaic genomes have the derived risk alleles, one of them present in Neandertals but not the Denisova genome. Again, that doesn't tell us anything about the phenotype of the ancient people, but worth a closer look to see if one or both of these is an introgressive allele.

    We have here the GWAS Catalog genotypes for all the archaic genomes. Not much actionable information but there are some interesting phenotypes in there. I'll share some more of those later this week.


    References

  • Hawks lecture at University of Birmingham Sept. 22

    Mon, 2011-09-19 19:25 -- John Hawks

    I'll be in the U.K. the rest of this week. The University of Birmingham has invited me to give a lecture for their "Great Read" event as they begin the new academic year. If you're in the area, the talk is at 3:30 on Thursday, September 22, in the Concert Hall of the Barber Institute. I'll be appearing after Ken Miller, widely known for his work in evolutionary biology and his advocacy of evolution education in the U.S.

    As for myself, I'll be talking about Neandertal and Denisovan DNA and what they tell us about human evolution. All my talks have new, unpublished stuff in them, and this is no exception.

    I notice that the topic of evolution education has really hit the news this week in the U.K, as a group of 30 prominent scientists, including Paul Nurse and Richard Dawkins, have signed a letter protesting lax evolution education standards ("David Attenborough joins campaign against creationism in schools", "Scientists demand tougher guidelines on teaching creationism in schools"). Looks like I'll be going there just in time.

    My host has planned some exceptional activities later this week for us, and I'll plan to report back when I can.

  • No Neandertal safe sex

    Wed, 2011-09-14 09:39 -- John Hawks

    Laurent Excoffier and colleagues' work has investigated how range expansions may have affected human genetic diversity. I've commented on this work several times ("One model, hold the extra parameters", "The Neandertal mtDNA story, 2004 edition, "Surfing and recent selection). They have applied a "geographically explicit" model to questions of human population history, modeling how populations expand and interact in the face of a simulated model of the Old World.

    In the past, I've found some things I like in this work, and other points where I disagree with the models' assumptions. Personally, I like to examine analytical models first, because the assumptions are often much more explicit, so we can see more easily how the results follow from them.

    This week, a new paper by Mathias Currat and Excoffier in PNAS claims to find evidence for some degree of reproductive incompatibility between Neandertals and modern humans. This is another case where I think the approach is very clever but I disagree with the model's assumptions. I just don't believe that today's distribution of genetic variation can tell us about "reproductive incompatibility" with Neandertals or other archaic people.

    Today, if you take a large random sample of people within the continental U.S. and look for a DNA legacy from Precolumbian American people, you will find it in your sample at a level somewhat under 2 percent. This percentage results from the differential growth of European and African-derived peoples during the last 500 years of American history. Whatever else it may be, the current percentage is notevidence for hybrid incompatibility of the world's populations before 500 years ago.

    It's not a perfect analogy. Today, Native American ancestry is heterogeneous in the continental U.S., with some people carrying very high fractions. After 30,000 years, such heterogeneity would likely have balanced out. With the Neandertals, we are looking at a much longer history, and different events.

    I would contend that the events that have affected today's representation of Neandertal-derived genes were demographically larger than those leading to the European colonization of the Americas. The contraction of the European population during the Last Glacial Maximum, the subsequent movements of Late Upper Paleolithic and settlement of Mesolithic peoples, followed by the introduction of agriculture and waves of population growth and invasions, have partially erased the genetic patterns of the initial Upper Paleolithic. We know that the mtDNA complement of Europe changed markedly both before and after the Neolithic. Today's Europeans are not the people who encountered the Neandertals 35,000 years ago. The genes of those initial Upper Paleolithic people may be almost as rare today as Neandertal genes.

    Range expansions and surfing

    Nevertheless, I think the analysis in this paper gives us some valuable information about how populations may have interacted at that final stage of population mixture among archaic populations.

    A range expansion occurs when a population that is initially limited to some small area begins to expand outward across a larger area. The expansion may include interbreeding with other populations who already occupy those areas, for example, the movement of Neolithic agriculturalists into Europe. Or the range expansion may go into territory where nobody lives, like the initial habitation of the Americas some 14,000 years ago.

    Range expansions can distort allele frequencies beyond the pattern expected in a random-mating population. As the population pushes its boundary outward, individuals at the frontier carry with them a slightly skewed sample of the alleles in the population as a whole. Pushing further and further along, this skewed sample gives rise to a founder effect. This phenomenon has been called "allele surfing", by analogy with a spreading wave of population expansion.

    When a population expands its range into that of another population, the invaders usually mate with the natives. As the "wave" of migration continues to spread, more and more of the natives' genes are picked up into the expanding population. As a result, you expect to see a gradient of genetic contribution from the original native population, higher and higher as you look farther from the invaders' point of origin.

    Currat and Excoffier [1] assume that a group of 50 people originated in Northeast Africa 50,000 years ago and then began to spread throughout the Old World. This population (moderns) expands into the range of another human population (Neandertals) by virtue of a higher carrying capacity: in fact they assume that modern humans existed at four times the density of Neandertal populations. The modern human value is set at 1 person per 10 square kilometers, which is very low compared to ethnographically described hunter-gatherers. The population as a whole is made up of demes that occupy an area 100 km on a side (in some trials, four times as many demes 50 km on a side). The outcome is inevitable: the higher carrying capacity leads modern humans to replace Neandertals, while incorporating some amount of Neandertal ancestry.

    Any model is unrealistic to some extent. An unrealistic model generally leads to results that are very different from reality. In modeling, there's a common strategy to deal with this problem: Leave one free parameter and change it until the results fit reality. In this case, the free parameter is migration rate, the probability that an individual will move to an adjoining deme. Currat and Excoffier used values for this parameter that caused their "modern" population to displace Neandertals in Europe over a span of 6000 years. The value that made this dispersal speed was 20 percent per generation for the dispersing modern human population.

    I'm a little concerned that a whole literature of geographically explicit population models has emerged in human genetics without any apparent reference to the anthropological literature on human demography. If you know ethnography, a migration rate of 20 percent per generation over 100 km distances seems very high. It's more than double the observed rate of intertribe marriages among precontact Aboriginal Australian people, for example. The value of one person per 10 square kilometers for population density is near the low end ever observed for hunter-gatherers. If it's a stretch to make a model fit with parameters found in known hunter-gatherers, that's when I go back to the drawing board. But then, my philosophy about this is different from most human geneticists. I'm an anthropologist.

    Anyway, with these values the result is foreordained: modern humans will replace Neandertals, and fast. What Currat and Excoffier observe in their simulated populations is that the modern humans tend to pick up a larger fraction of Neandertal genes, especially in Europe. How can we explain why our population today has a relatively small fraction of Neandertal genes? In particular, how can we explain why Europeans have no more Neandertal genes than any other population? They conclude that some kind of reproductive incompatibility must have existed.

    Where I think the method falls short

    I think this paper would be perfectly reasonable if I was willing to assume that the range expansion of modern humans was the last major event in our evolution. If this were true, then echoes of this range expansion would be the most highly visible today — just as astronomers can still find echoes of the Big Bang in the cosmic microwave background.

    But I would offer that our genetic diversity today is not the result of a single Big Bang of movement out of Africa. Many population movements of comparable or even larger scale have happened during the last 30,000 years.

    The paper presents the hypothesis of reproductive incompatibility as an attempt to solve two problems: First, Chinese, New World peoples, Southeast Asians and Europeans today have approximately the same amount of Neandertal ancestry. Second, the amount of Neandertal ancestry in Europe is only around 2-4 percent. A 6000-year wave of population growth and mixture as modern humans entered Europe might have left more Neandertal genes, and a higher proportion in this Neandertal-rich area of the world than in East Asia.

    Here's how I currently see those problems. Europeans today are not the Europeans of the past. They have undergone massive population movements and replacements since the initial Upper Paleolithic people encountered Neandertals. That's not only the result of archaeology, it's also clear from the paleogenetics. If we recognize this subsequent history, then we will find it easy to explain why the rest of the population outside Africa has basically the same small amount of Neandertal ancestry: they received a massive influx of genes from some West Asian population with Neandertal mixture. Europe also got these genes, mostly long after the initial Upper Paleolithic.

    So I don't think the present fraction of Neandertal genes tells us anything about sex between Neandertals and humans, except that it happened. Many times. Hooba-hooba.

    There is some irony in the timing of this publication, since only last week PNAS published a paper claiming that today's African populations derive some of their DNA from a population fully twice as different from non-Africans as Neandertals were.

    I don't fully believe that, either.


    References

    Synopsis: 
    A new paper claims humans and Neandertals were reproductively incompatible. I don't think so.
  • HLA class-I loci in Neandertals and Denisova

    Thu, 2011-08-25 21:08 -- John Hawks

    With draft sequences of genomes from several Neandertals and from Denisova, we can begin to investigate known human variations that affect phenotypes. In practice, this is a very simple approach -- take alleles that we know exist in recent human populations, and see if they are in the DNA sequences of these ancient people. My lab has been following this line of research, trying to get information about aspects of biology that are not evident from the skeleton. The immune system is one of the most fascinating, both because of its extensive variation in living people, and because we might be able to test hypotheses about the diseases and parasites that ancient humans faced.

    Today Science has released an early manuscript edition of a paper by Laurent Abi-Rached and colleagues (bibliographic information not yet available), which identifies the HLA class-I alleles present in the three highest-coverage Neandertal genomes from Vindija (Vi 33.16, 33.25, and 33.26) and the Denisova pinky genome. The paper is very brief and fairly straightforward, providing provisional HLA class 1 allele types for these individuals, discussing possible haplotype associations among these alleles that may have been in the ancient genomes, and providing the frequency of those alleles in present-day human populations.

    These archaic individuals carried HLA types that are presently rare in Africa and more common outside of Africa, supporting the hypothesis that these alleles in living people originated in those archaic populations. The linkage between alleles at different HLA class-I genes also supports that hypothesis. The present immune system biology of humans was strongly shaped by the interaction of different regional populations of archaic humans.

    The title of the paper calls this "multiregional admixture", and the word "introgression" appears 8 times. Good for us!

    (This is the point where I grumble about the lack of citations in this paper....OK, done grumbling.)

    Selected genes may have a very different pattern from neutral genes

    This paper is the first demonstration that gene variants of functional importance were not only inherited from Neandertals and Denisovans but were valuable and selected in later populations.

    We already knew that humans today have gene variants from these archaic humans. Neandertal genes presently account for around 3 percent of the genomes of people outside Subsaharan Africa. My lab has been studying the pattern of frequency of these genes ("Europe and China have different Neandertal genes"). Most of the genes shared between the Neandertal genome and living people outside Africa are presently very rare -- most occur only in a single individual in our sample of Europeans and Chinese people, for example.

    These HLA class-I alleles are different. Some of them are quite common today. If they came from the Neandertals and Denisovans -- that is, if they were not present in the African people who make up most of our ancestry genome-wide -- then these alleles must have increased quite a lot during the recent evolution of people outside Africa.

    The best explanation for the large increase in frequency of these genes in modern human populations is selection. If readers want to get an introduction to the scientific literature on the topic of functional genes, I can suggest a detailed review paper I wrote with Greg Cochran on the dynamics of introgression and selection as applied to Neandertals [1], and a review paper we wrote in Trends in Genetics about identifying genes in living humans that that may have come from archaic populations [2]. In both papers, we discuss the dynamics of functional genes that may be affected by selection in modern human populations and how they differ from the predictions for neutral loci affected only by genetic drift. The new paper by Abi-Rached and colleagues follows on that line of inquiry.

    I think the hypothesis of adaptive introgression is very likely, and that we shouldn't be at all surprised that the immune system might house many good examples of it.

    A look at the most extreme examples, involving the Denisova genome, shows the extent that these functional genes might reflect introgression well beyond that indicated by most of the genome. The HLA class-I alleles present in the Denisova genome are most common today in South Asia (HLA-C*12:02, HLA-C*15 which is also common in Australia) and Southeast Asia (HLA-A*11). These regions of the Old World have no substantial evidence of Denisova inheritance across their genomes. Yet they may very well have substantial frequencies (up to 48 percent for HLA-A*11) of HLA class-I alleles from the archaic Denisovan population.

    Reasons to be cautious

    This is the point where I have to make a note of caution. Even though I personally think it is likely that these HLA alleles really did introgress into the modern population from Neandertals and Denisovans, their geographic pattern really isn't enough to demonstrate this without question.

    Reports earlier this summer described some of the work this group was doing on HLA class-I loci, including a public lecture by PI Peter Parham. I noted at the time that the geographic distribution of the alleles mentioned in that lecture seemed a mismatch for the hypothesis of a Denisovan origin for the alleles ("The immune systems of archaic humans"). For example:

    HLA-A*11 is very common in Papua New Guinea, but it is also very common in north India and in China. These two areas otherwise show no significant evidence of Denisova ancestry. We might conclude that the HLA-A gene just has an unusually high level of introgression into Asian populations, not typical of the genome as a whole. That's certainly possible. But without finding any substantial number of derived mutations in the HLA-A*11 variant in the Denisova genome and in living Asians, it is hard to rule out that the sharing of HLA-A*11 in all these populations is just coincidence.

    Of course, if the allele were absent in Africa, that would weigh in favor of the idea it is shared by Late Pleistocene interbreeding outside Africa. But HLA-A*11 is in Africa, just very rare. And it's in Europe. This is the kind of locus that is difficult to interpret: if it has any tiny disadvantage against malaria, for instance, its rarity in Africa is easily explained as a function of recent evolution, while its presence almost everywhere outside Africa would be no surprise even if there were never any interbreeding.

    The story of HLA-C*12:02 is similar. It's common in PNG, but also broadly across South Asia and into Iran, areas where no substantial evidence of Denisovan ancestry has been demonstrated.

    Introgression under selection is a good hypothesis for why these alleles should be so much more broadly distributed than the evidence from the rest of the genome. But introgression isn't the only explanation, because the alleles might have been retained by balancing selection, with recombinant haplotypes suppressed by purifying selection. We might use haplotype age to test the hypothesis. If the alleles were retained by ILS, they would look much older than if they came in from an archaic population by introgression. But as I'll describe below, in this case we actually have the opposite problem: these haplotypes look too young to have come in by introgression, likely a consequence of selection long after the Neandertals and Denisovans had contributed their genes to us.

    The curious case of HLA-B*73

    If I agree that the results of this paper are pretty likely, why am I still cautious? Well, the most confusing thing in this paper is an allele described in great detail that they didn't find in the archaic genomes. And I know from experience that not finding things is a pretty common occurrence when we go looking for odd things that might have come from Neandertals.

    There's a detective story here, that probably explains the initial interest of this group in the Neandertal genome, but that just didn't pan out in their search through the archaic genomes. The allele is HLA-B*73.

    Parham and colleagues [3] first characterized this allele, which is remarkably different from other HLA-B alleles. Homologs of HLA-B*73 are present in living apes, suggesting that the different human alleles originated before we diverged from gorillas. The retention of such an ancient allele in humans isn't a surprise in the HLA system, because many very divergent alleles have been kept in the population across evolutionary time by balancing selection. What's a bit surprising about HLA-B*73 is its limited diversity in living people. It appears to have persisted in humans throughout our evolution, but people today who carry the allele have very similar sequences, and it is nearly always linked to one single allele at the nearby gene, HLA-C (HLA-C*15). Also, the allele is very rare inside Africa and reaches its highest frequency in West Asia., where it occurs in only 4.5 percent of people. Because of this strange pattern, Parham and colleagues suggested that the allele may have been inherited from Neandertals.

    When I was in graduate school working on modern human origins, I took a special interest in genes that had this pattern of variation. HLA-B*73 was not the only one, there are others.

    The variation of the HLA-B*73 allele and its association with HLA-C*15 correspond very well to the predictions we presented in our paper on identifying introgression from archaic humans [2]. It's a highly divergent allele in humans compared to others, and it appears not to have recombined much with nearby genes, suggesting it was sequestered in another population through much of the diversification of present-day HLA alleles. But the HLA system is actually a rotten place to look for this kind of evidence, because there are many, many instances where ancient alleles have been retained in human populations by balancing selection. As we pointed out in 2008, a deep root to the gene tree and a rarity of recombination can be good evidence of introgression, but balancing selection and inhibitions to recombination are alternatives to introgression for explaining this pattern of variation.

    There's no necessary contradiction between the two processes, and ancient DNA in this case could establish that the allele was both under selection and came from archaic humans. The problem: they didn't find the allele in the archaic genomes.

    So why did they spend so much time in this paper discussing this allele? My guess is that they were surprised not to find it. But they did find HLA-C*15 in the Denisova genome, which is often linked to HLA-B*73 in living people who carry it. That makes for an indirect argument:

    C*12:02 and C*15 were formed before the Out-of-Africa migration (Fig. 2H and fig. S15) and exhibit much higher haplotype diversity in Asia than in Africa (fig. S16), contrasting with the usually higher African genetic diversity (20). These properties fit with C*12:02 and C*15 having been introduced to modern humans through admixture with Denisovans in west Asia, with later spreading to Africa (21, 22) (Fig. 1F and fig. S11 for C*15). Given our minimal sampling of the Denisovan population it is remarkable that C*15:05 and C*12:02 are the two modern HLA-C alleles in strongest LD with B*73 (Fig. 1E). Although B*73 was not carried by the Denisovan individual studied, the presence of these two associated HLA-C alleles provide strong circumstantial evidence that B*73 was passed from Denisovans to modern humans.

    I would go one simpler: Given that HLA-B*73 is most common today in West Asia, I suspect it came from West Asian Neandertals. There's no reason why the HLA genes of European Neandertals should have been identical to West Asian Neandertals. Today's Europeans are different from today's West Asians in the frequencies of these alleles, so why not in the past as well? For that matter, we really only have two alleles from European Neandertals for HLA-B (since the paper finds that

    Why do the Vindija Neandertals all have the same HLA types?

    It's a pretty good question. The paper cannot distinguish the genotypes from these three individuals. That's not the same as saying they're exactly the same type, since the sequences are very low coverage, but probably they were. Here's what the paper says:

    Genome-wide analysis showing three Vindija Neandertals exhibited limited genetic diversity (3) is reflected in our HLA analysis: each individual has the same HLA class I alleles (fig. S17). Because these HLA identities could not be the consequence of modern human DNA contamination of Neandertal samples, which is <1% (3), they indicate these individuals likely belonged to a small and isolated population (fig. S18).

    Still, I think this indicates a pretty high degree of inbreeding among these individuals. I wonder what the organ registry for Neandertals would have looked like.

    (Not so) final words

    I have more to write on the topic of linkage disequilibrium among these genes. The rate of recombination between HLA-B and HLA-C is high enough that a haplotype between these genes should have mostly decayed in the time since our mixture with archaic humans. HLA-C and HLA-A are an order of magnitude further apart, so linkage between alleles of these genes should have been totally erased in the time since any archaic admixture.

    That means that the extended haplotypes reported in this study must reflect selection in the period since the population mixture and introgression. The story isn't a simple case of inheritance from archaic humans, it is rather more complex. But more on that later.

    I think this paper confirms that it will be really productive to look at archaic genomes for variants present in living humans. Identifying modern human alleles in a Neandertal isn't really very exciting science, though. I've been doing this on my blog for a year now. It's a tricky job to type these HLA alleles, compared to genotyping many other genes, as we discovered. Still, I never really expected that reporting on genotypes in the public domain would be sufficient to get printed in Science.

    Still, this set of three genes is particularly interesting. And the paper does add evidence from one additional locus, KIR3DS1, which also has the pattern where an allele rare in Africa but common in Asia is present in the Denisova genome.

    If it turns out that we have widespread adaptive introgression in Asia today from Denisovans, that will change the game of studying the origins of these populations. Based on the genome-wide comparison, it looks like the genetic interaction that led to the habitation of Asia did not involve Denisovans, who contributed only to populations at the most eastern extreme of habitation in island Southeast Asia. But the only Denisovans we know about lived near the geographic center of the Asian landmass, not at the extreme southeastern extreme.

    The HLA pattern may suggest a more widespread pattern of mixture across Asia, which was later overwritten by population movements of people who didn't have Denisovan ancestry. That means that the habitation of Asia was a process of successive migrations and replacements, which imperfectly covered up the evidence of archaic intermixture. The genes that remain as signs of this intermixture are those that had selective advantages in later populations.


    References

    Synopsis: 
    Abi-Rached and colleagues report that the human system owes much to the Neandertals and Denisovans.
  • Mailbag: mtDNA ancestor and speciation?

    Wed, 2011-08-24 23:39 -- John Hawks

    I've got a question about something I wrote in a newsgroup in 1995. Okay, that doesn't sound overly urgent, right? The general subject has come up again for me though, and so I would like to find out if I am right about this, and figured the best way to be sure is to ask someone who will likely know right off. Hence this email.

    One other problem has been the assumption (I don't remember any
    compelling reason being given to assume this) that the end point (going
    backwards) of the MtDNA trail *must* be a speciation point. This sort
    of thing also happens with changes in tool industries; there is
    often an unsupported assumption that it must mark a change in species.
    The MtDNA trail is just that, it's a trail like tracing surnames that
    always pass through one side of a family. The trail just fades out, but
    that doesn't necessarily mean that it marks a major change (mind you, it
    *might*, but it doesn't *necessarily* do so).

    The first paragraph is where I wonder if I am right or wrong, or some muddled middle ground that I'm not aware of.

    Heh..that's taking it to a new level -- someone was WRONG on the INTERNET in 1995!

    Nowadays it's pretty clear that the mtDNA ancestor was not a speciation point, because Neandertals didn't have the same mtDNA ancestor and they interbred with us (new paper tomorrow reports that a large fraction of people today have Neandertal and Denisovan-derived HLA types, for example).

    There's still a serious disagreement about the meaning of these recent common ancestors. Most genes aren't like this, but it's not clear whether mtDNA and the Y chromosome have these recent ancestors because of a population size bottleneck, or natural selection, or some kind of population structure. Humans don't look very much like most other primates in this aspect of our biology, but when you combine us with Neandertals and Denisovans, we do look pretty much like ordinary apes in population structure. So maybe this is an aspect of how we became modern humans, something about our population structure or biology.

    Here's a recent review paper where I discuss these issues in some more detail.

    http://johnhawks.net/research/hawks-2008-genes-numbers-effective-size

    Hope that helps --

    --John

  • Mailbag: Where did Neandertals come from?

    Thu, 2011-08-18 17:51 -- John Hawks

    Dr. Hawks,
    I greatly enjoyed your course on the rise of humans I purchased through the Teaching Company.
    I could not find the answer to this question: if humans migrated out of Africa and met Neandertals and interbred, where did the Neandertals originally come from?
    I am sure you are a busy man but I find this puzzling me. Thank you in advance for answering this question.

    Thank you so much for your kind words!

    We don't strictly know where Neandertals originated. We do know that their population and the African population began to differentiate sometime before 250,000 years ago. I think it is likely that the ancestors of Neandertals migrated out of Africa at that time and began to evolve within western Eurasia, later to come into contact with Africans again. But there are fossil humans who seem to have some Neandertal-like features in Europe far earlier, as early as 600,000 years ago. One possibility is that the ancestors of Africans and Neandertals actually lived outside of Africa, and Neandertals stayed there as other people moved into Africa. Another is that a population representing most of the ancestry of Neandertals left Africa more recently, maybe within the last 150,000 years, and mixed with an earlier European population. It is even possible that the Neandertal and African ancestors lived long-term in Europe and Africa, respectively, with a high rate of gene flow between them for their entire history.

    At this level things seem uncertain and will remain that way until we have a better fossil record in Africa. It's an exciting time for those of us who study that time period!

  • Ancient genomes review

    Thu, 2011-08-18 12:21 -- John Hawks

    Mark Stoneking and Johannes Krause present a review article in the current Nature Reviews Genetics [1] that gives an overview of the science of ancient genomes.

    I think the article is very good about presenting aspects of ancient genome sequencing and assembly, and the attendant problems and biases. I find myself explaining this stuff a lot and it's useful to have the concise descriptions that Stoneking and Krause provide here. For example, here's a paragraph that describes mapping bias:

    However, there are important limitations to current approaches to ancient genome assembly owing to the short length of ancient DNA fragments and the repetitive nature of large parts of mammalian genomes (which creates ambiguities in sequence read mapping). For example, short fragments can cause mapping bias, as highly divergent short fragments cannot be accurately mapped to a reference genome. Fragments may also map to different locations in different reference genomes depending on the completeness and accuracy of the reference genomes. For example, to calculate divergence times between an ancient hominin genome sequence, modern humans and chimpanzees, it is important to first verify that the ancient DNA sequences map to orthologous positions in both the human and chimpanzee genomes. These issues mean that even at 20-fold coverage (which was the coverage obtained for the Saqqaq genome) not more than 85% of the genome could be reconstructed; full genome sequences from fossil samples can probably never be achieved with current methods.

    The article discusses chemical changes in ancient genomes, methods to detect contamination, and specialized methods such as targeted DNA hybridization capture.

    I'm less happy with the second half of the article, which discusses population genetics. A few computational techniques are very briefly described (for example, unsupervised versus model-based approaches) and Stoneking and Krause give quick synopses of some population genetic inferences reported during the last year.

    I guess where I perceive a difference between the first (sequencing) and second (population genetics) parts of the article, is that the sequencing part emphasizes the many problems with analysis and describes approaches to overcome them. It seems as if there's a vibrant discussion of sequencing and biochemistry, giving rise to a fuller account. Meanwhile, the second part, discussing human population history, seems to accept results relatively uncritically. There is very little citation of anthropological or archaeological work, and little indication that the methods of population genetic inference may have weaknesses or assumptions that color their results.

    It's great to see review articles on this topic, given the broad interest I expect we'll see more of them soon. A flood of ancient genetic data means a lot of new results that need to be summarized. But a summary is really not enough -- we need critical examination of the assumptions underlying population genetic inferences and a discussion of how they accord with what we know from archaeology and paleontology.


    References

    Synopsis: 
    A new review article by Mark Stoneking and Johannes Krause presents some useful information.
  • Ancient genomes giving up their secrets

    Wed, 2011-08-17 13:05 -- John Hawks

    Last week, Nature ran a story by Ewen Callaway [1] that hits the highlights of research on ancient genomes this year: "Ancient DNA reveals secrets of human history". The news hook is a bevy of studies trying to estimate the time that Neandertals contributed their genes to recent human populations.

    By comparing individual DNA letters in multiple modern human genomes with those in the Neanderthal genome, the date of that interbreeding has now been pinned down to 65,000–90,000 years ago. Montgomery Slatkin and Anna-Sapfo Malaspinas, theoretical geneticists from the University of California, Berkeley, presented the finding at the Society for Molecular Biology and Evolution meeting in Kyoto, Japan, held on 26–30 July.

    It will be great to dig into the details of these comparisons, which Callaway reports have also been carried out by some other groups. The basic idea is that over time, recombination would have broken up chunks of chromosome inherited from Neandertals. If we know the intensity of recombination across the genome, we should be able to use the lengths of Neandertal-derived blocks to estimate the time that their population contributed to ours. But much depends on the details of the population model. The simplest model supposes that all the gene flow from Neandertals happened at a single moment in time. If we doubt that assumption (as I do) then the estimate will not represent reality. But until we see papers reporting these results, it's hard to know how much they may be affected by different assumptions.

    The article turns toward functional insights from the ancient genomes, and my lab puts in an appearance:

    Unlike most scientists mining the ancient genomes, Hawks has reported some of his more prosaic findings — Denisovans didn't have red hair, for example — on his blog (see go.nature.com/irclra). "These genomes are publicly available. There's nothing stopping high-school students from doing this, and the kind of stuff that I'm putting out on my blog is the stuff that a smart high-school student could do."

    I was really pleased that the article featured some of the research from my graduate student, Aaron Sams, who has been investigating the evolution of celiac disease. What does celiac disease have to do with Neandertals? Short answer: We're feeling our way though the study of gene networks that connect genetic evolution with phenotypes. We can use our knowledge of human evolution during the last 100,000 years to figure out how genes work. And we can use knowledge from living humans about how genes work to investigate their evolution in archaic people. Ancient genomes give us that potential, because genes that work together have evolved in ways that reflect selection on the system as a whole.


    References

    1. Callaway E. Ancient DNA reveals secrets of human history. Nature. 2011;476(7359):136 - 137.
    Synopsis: 
    A news article in Nature features our lab's research in a report on ancient genomes.
  • Mailbag: Where did Neanderancestors live?

    Sat, 2011-07-16 08:54 -- John Hawks

    Re: European Middle Plesitocene (via Twitter):

    Maybe a dumb question....how do you know Neandergenes derive from Africa vs. f.e. recent Africa and Neandergenes both derive from Atapuerca?

    Not a dumb question at all. I discussed this exact issue with David Reich last week. There is no strong fossil argument for an African ancestor at that time, Europe and West Asia are anatomically and archaeologically just as plausible. My inclination is to suspect Africa because of the deep genetic variation still retained in that population, but that variation could have been retained in other ways -- particularly since every scenario of human origins now must involve population mixture.

  • Mailbag: Neandertal fraction

    Mon, 2011-06-27 23:17 -- John Hawks

    Dear Professor Hawks,

    I enjoy reading your webpage and musing about the data. I have long thought our family were part Neanderthal. Recent vindication is sweet.

    The estimate of 1-4% Neanderthal SNP's in non African DNA begs the question: What part of the Neanderthal contribution resides within the 2% of the genome that codes for proteins? Alternately phrased, what part of the Neanderthal contribution is neutral, and what part is under selection pressure?

    I expect you would say I have no clue. Up to 100% of non African DNA protein coding regions could contain Neanderthal SNP's somewhere in the population.

    We're working on this problem very actively now. The initial answer is that the 1-4% is largely different in everybody. As more and more people are examined, I think it very likely that most of the protein-coding complement of the Neandertal genome is preserved in somebody living today at low frequency.

    We cannot yet say whether the proportion surviving in protein-coding regions is larger than the proportion in gene deserts, which is an interesting question...

Pages

Subscribe to Neandertal DNA

Neandertals

For years, I've worked on their bones. Now I'm working on their genes. Read more about the science studying these ancient people.

Denisova

From a finger bone of an ancient human came the record of a completely unexpected population. My lab is working on the science of the Denisova genome.

Acceleration

The advent of agriculture caused natural selection to speed up greatly in humans. We're uncovering some of the ways that populations have rapidly changed during the last 10,000 years.

Malapa

Just outside Johannesburg, the Malapa site is producing some of the most exciting finds in human evolution. This site is the headquarters of the Malapa Soft Tissue Project.