john hawks weblog

paleoanthropology, genetics and evolution

Denisovans

  • New Denisova and Neandertal DNA results reported

    Fri, 2013-05-17 08:37 -- John Hawks

    Elizabeth Pennisi reports from the Biology of Genomes conference at Cold Spring Harbor, New York: "More Genomes From Denisova Cave Show Mixing of Early Human Groups". The article describes a talk by Svante Pääbo about new results from Neandertal DNA, as well as new analyses of the Denisovan genome. It has lots of details for those interested in these topics, but the article is paywalled, so I can only share a little of it here:

    From the detailed genomes of both Neandertals and Denisovans, Pääbo and Montgomery Slatkin of the University of California, Berkeley, estimated that 17% of the Denisovan DNA was from the local Neandertals. And the comparison revealed another surprise: Four percent of the Denisovan genome comes from yet another, more ancient, human—"something unknown," Pääbo reported. "Getting better coverage and more genomes, you can start to see the networks of interactions in a world long ago," says David Kingsley, an evolutionary biologist at Stanford University in Palo Alto, California.

    With all the interbreeding, "it's more a network than a tree," points out Carles Lalueza-Fox, a paleogeneticist from the Institute of Evolutionary Biology in Barcelona, Spain. Pääbo hesitates to call Denisovans a distinct species, and the picture is getting more complicated with each new genome.

    We have been finding some of this in our comparisons of the genomes also. These were not isolated groups of ancient people, and some of them were more similar to living people than others. It is just wonderful to have more and more DNA coming out -- although that makes it hard to think we won't learn something new from high-coverage data that will require us to re-run various comparisons. That's the cost of discovery!

    Meanwhile, the article sheds light on two interesting contradictions in the Denisova data. The analysis of the high-coverage data last fall [1] noted that the pinky bone genome is consistent with a very small long-term effective size, because of its limited genetic variation ("Denisova at high coverage". These results included a "drastic decline in size" around the time the Denisovans were estimated to have separated their population from the ancestors of living sub-Saharan Africans.

    That result was curious in comparison with the mtDNA evidence. The Denisovan mtDNA is substantially more divergent from living human and Neandertal mtDNA, with an estimated time for the last common ancestor of mtDNA among these groups a bit more than a million years ago. In the initial analysis of the Denisova genome, Reich and colleagues [2] pointed out that even a deep divergence might be consistent with a neutral population history in a single population. But a population of radically reduced size, with a substantially more recent common ancestry shared with Neandertals and other ancestors of living people? Seems odd.

    Now, we may be learning that the Denisovan genome itself represents different ancestral groups -- not only a more ancient "something unknown" population, but substantially the local Neandertals. That kind of mixture is not the population history described by papers on the Denisova genome so far. And a third Denisovan mtDNA from one of the third molars at the site is substantially different from the other two, pointing to greater mtDNA diversity within the Denisovan population than now known from either Neandertals or living people.

    What does it mean? I don't think there's a contradiction here in the data. What this shows is that the methods applied to the data have been too simplistic. The methods will come to a result, but that result may not fit the data as well as a population model with more complexity. Looking only at one kind of comparison -- as the Li and Durbin model applied to the Denisova genome by Meyer and colleagues last year [1] -- will probably not give a result that describes the true population history. We need to keep our minds open to more complex population histories that may be more consistent with other sources of data, including archaeological and fossil information.


    References

    Synopsis: 
    A talk on new ancient DNA results at the Biology of Genomes conference
  • Geno2 users showing unexpected Denisovan ancestry

    Sun, 2013-01-06 23:08 -- John Hawks

    I have been excited to hear in the last few days from several readers who have gotten results from the new Genographic Geno2 genotyping chip. One aspect of the result reporting is a person's estimated proportion of Neandertal ancestry, which is a simple percentage. This is like the report from 23andMe, and should be a pretty straightforward estimate given a model of Neandertal-human genetic similarity from complete genomes.

    Another aspect of the Genographic results is an estimated proportion of Denisovan ancestry. This might seem a bit surprising, as for most participants in the project who lack Polynesian or Melanesian ancestry this proportion should be extremely low. I've written about Denisovan DNA similarity with living peoples a few times ("Denisovan DNA in the islands, and an Australian genome", "How widespread is Denisovan ancestry today?"). Based on the science published to date, I would have expected the Geno2 calculations just to confirm the very low ancestry estimation found in last year's research based on genotyping Asian and Australasian populations.

    So I have been extraordinarily surprised to see that people are getting Geno2 results with up to 6% Denisovan ancestry!

    What gives? None of my correspondents so far has anything other than European self-reported ancestry, making it seem very unlikely that have substantial Denisovan ancestry.

    The first time I heard from a reader with this result, my immediate reaction was that there must be some problem with the algorithm. This one in particular wouldn't be to hard to get wrong considering the rarity of whole genome evidence from populations known to have substantial Denisovan ancestry. Or possibly, some problem with an individual's genotype chip data might trigger the algorithm to look more Denisovan. For many loci that vary among humans, Denisovans are very unlikely to have the derived human variant; so an individual with an unusual proportion of ancestral homozygote loci might look Denisovan in a human-Denisovan comparison.

    However, this is all speculation without knowing the details of the Genographic analysis. And as I hear from more people with varied results, I am having trouble thinking of how data errors could be patterned. It's a tough one to think about because of the unique aspects of the Geno2 chip, and until I've gotten a feel for results from that platform compared to other datasets I probably won't have a solid idea.

    I should point out that if there is a problem with the algorithm underlying ancestry prediction from Denisova, it almost certainly affects the Neandertal ancestry estimate also. The estimation from both these ancient genomes involves the same procedure, although with Denisovan DNA it requires subtracting out the DNA similarity with Neandertals first.

    So I would be interested in hearing from anyone who is surprised to find that they have Denisovan ancestry. My preliminary assumption it that the result is spurious but I'll try to figure out if there is a possibility of some Denisovan fraction beyond what has been shown in published work.

    Synopsis: 
    Trying to diagnose the odd results from a new genotyping project
  • Mailbag: Where did Neanderancestors live?

    Sat, 2011-07-16 08:54 -- John Hawks

    Re: European Middle Plesitocene (via Twitter):

    Maybe a dumb question....how do you know Neandergenes derive from Africa vs. f.e. recent Africa and Neandergenes both derive from Atapuerca?

    Not a dumb question at all. I discussed this exact issue with David Reich last week. There is no strong fossil argument for an African ancestor at that time, Europe and West Asia are anatomically and archaeologically just as plausible. My inclination is to suspect Africa because of the deep genetic variation still retained in that population, but that variation could have been retained in other ways -- particularly since every scenario of human origins now must involve population mixture.

  • Did Denisovans have genetic adaptations to high altitude?

    Tue, 2011-06-21 12:26 -- John Hawks

    We don't really know the extent of territory that might have been occupied by the population represented by the Denisova genome. The signs of mixture into the Melanesian/New Guinea population suggests that the Denisova individual shared many genes with people who lived somewhere along the South or Southeast Asian coast. Denisova itself, however, is in the Altai Mountains.

    Last week I wrote some thoughts about the possible introgression of HLA alleles from Denisovans into more recent populations. HLA genes pose many problems for testing this hypothesis -- including the difficulty of identifying the alleles in a low-coverage genome and the high chance of incomplete lineage sorting of ancient alleles in recent populations. Other parts of the genome in principle may be much easier to find evidence of introgression.

    If an allele that originated in Denisovans had some advantage in later populations, it might today be found very widely spread across Asian populations, even if the amount of Denisovan ancestry in most of these populations is very small. This was the theme of my paper with Gregory Cochran several years ago [1] ("The inevitability of introgression"). The probability that a single copy of an advantageous allele will survive and increase in the population is roughly 2s, where s is the fitness advantage in a heterozygote carrying the allele. A relatively small number of copies of an allele might have entered a recent human population by introgression from some ancient population, but these few copies would have a high likelihood of surviving and increasing in frequency, possibly toward fixation. HLA alleles could easily be in this category, but the challenges identifying them and high chance of ILS make the hypothesis hard to test.

    Another strategy is to identify genes that have been selected in recent populations and see if the linked haplotype shows up in the Denisova genome. Recently, several studies have attempted to identify genes related to high altitude adaptation in Tibetans. At least some Denisovans lived in the mountainous areas of central Asia, and so I'm curious whether they might have some alleles adapted to this environment. The Altai are not nearly as high as the Tibetan plateau (in fact Denisova itself is not much higher than western Kansas), and we don't know how long Denisovan people might have been resident in Central Asia, but if we're looking for selected alleles there are some strong candidates in this category of genes.

    So let's look at some of them. All positions here are mapped to the hg18 human genome assembly.

    Yi and colleagues [2] find a strong frequency difference between China and Tibet for a SNP in EPAS1, at chr2:46441523. The derived allele, G, has a frequency of 87% in their Tibetan sample but only 9% in their Chinese sample (and zero in Denmark). The Denisova genome is represented by two reads at this site, both C, the ancestral allele. We don't necessarily have to accept that this is a functional site, but as the marker most strongly differentiating the high altitude population it would likely be closely linked to any functional variant. So the Denisova allele suggests that this ancient individual lacked whatever functional variant might currently be common in Tibetans for this gene.

    Simonson and colleagues [3] took a different approach, focusing on candidate genes that they argued a priori were likely to be involved in adaptation to hypoxia because of their physiological role. They evaluated these genes for evidence of positive selection in Tibetans, finding several candidate haplotypes for recent adaptive evolution to high altitude.

    For each of five genes, they identified a three-locus "core selection haplotype" that shows signs of selection within Tibet. The purpose of these three-SNP haplotypes was to examine the correlation of haplotypes and phenotypes in a sample of people where physiological data were taken. So they are intended as tags, not as comprehensive and unique identifiers of the candidates at the genetic level. But the three-locus haplotypes are the only ones reported in the supplement to the paper, so that's what I have to compare.

    EGLN1: The three-allele candidate selected haplotype consists of A at chr1:229793717, T at chr1:229667980 and T at chr1:229665156. Denisova apparently has the selected haplotype with A at chr1:229793717 (2/2 reads), T at chr1:229667980 (3/3 reads) and T at chr1:229665156 (1/1 reads). However, it is not obvious whether this is significant. All three alleles on the candidate selected haplotype are the ancestral (present in chimpanzees and gorillas) alleles, which are much more likely to show up in the archaic genomes than derived alleles. These ancestral alleles are also present in several of the whole genomes provided along with the Denisova sequence reads. So it's not clear to me how good a candidate for selection the haplotype really is.

    CYP17A1: Here the three-allele candidate selected haplotype includes G at chr10:104568521, G at chr10:104594906, and C at chr10:104517420. Denisova has C (5/5 reads, ancestral), T (4/4 reads, ancestral), and C (3/3 reads, ancestral). Again, Denisova has the all-ancestral haplotype here, but in this case it is not the selection candidate.

    PTEN: The selected candidate haplotype is G at chr10:89770364, C at chr10:89790851 and C at chr10:89778618. Denisova has G (5/5 reads, ancestral), T (2/2 reads, derived), and C (4/4 reads, ancestral). Not selected.

    I always find it interesting when the Denisova genome has a derived allele at an interesting site -- it is the shared derived alleles between these archaic genomes and living people that constitute evidence of genetic persistence of the archaic people. No single site carries that information (any one allele may be shared by incomplete lineage sorting), but I still like to note them. The Papuan and half the Native American, Sardinian and Mongolian reads share the derived T at chr10:89790851 with Denisova.

    HMOX2: The candidate selected haplotype has C at chr16:4456093, T at chr16:4465266, T at chr16:4442515. Denisova has this candidate selected haplotype: C (3/3 reads, ancestral), T (4/4 reads, ancestral), T (5/5 reads, ancestral). That haplotype may also be in the Cambodian whole genome accompanying the Denisova data, and can't be ruled out for the Mongolian. Again, the all-ancestral haplotype and wider distribution argue against the hypothesis that this haplotype was specifically selected in Tibet.

    PPARA: The core candidate selected haplotype has A at chr22:44827140, C at chr22:44832376 and T at chr22:44842095. Denisova has A (8/8 reads, ancestral), A (5/5 reads, ancestral), and C (2/2 reads, ancestral). Notice again, Denisova has the all-ancestral haplotype. As an ancient sequence, we are finding this is the usual case, human-derived alleles are just rarer in this genome.

    OK, where are we? Out of six genes that are candidates for selection on altitude adaptation in Tibetans, the Denisova genome has two -- at ELGN1 and HMOX2. In both cases, the core selected haplotype consists entirely of ancestral alleles, and so I think they are actually poor evidence of introgression on the surface. I would test them by looking at more SNPs linked to the presumed selected haplotype, hoping to find some derived SNPs shared by the Denisovan genome and the presumed selected haplotypes. Unfortunately, publications do not yet routinely report long haplotypes, so it will take some more digging to test these cases.


    References

    Synopsis: 
    Noodling through the Denisova genome data for signs of candidate altitude adaptations.
  • The immune systems of archaic humans

    Fri, 2011-06-17 09:37 -- John Hawks

    I've just submitted an abstract for a conference in the fall, with the title, "Immunogenetics of archaic humans."

    Ten years ago, it would have been beyond imagining that this kind of science would be possible. Now, my graduate student Aaron Sams has been working directly with HLA and other immune system genes in ancient DNA sequences. It's pretty tough to work with the HLA region because of the low coverage of the ancient genomes and the high variation and repetitiveness of the HLA. But it is possible to find some of the basic human alleles in the ancient sequences, and those open the possibility of examining the coevolution of pathogens and human immunity in our recent evolution.

    Turns out we're not alone: According to New Scientist, Peter Parham has also been looking at HLA in archaic humans: "Breeding with Neanderthals helped humans go global".

    One allele, HLA-C*0702, is common in modern Europeans and Asians but never seen in Africans; Parham found it in the Neanderthal genome, suggesting it made its way into H. sapiens of non-African descent through interbreeding. HLA-A*11 had a similar story: it is mostly found in Asians and never in Africans, and Parham found it in the Denisovan genome, again suggesting its source was interbreeding outside of Africa.

    HLA-A*11 is actually the most common allele of HLA-A in Papua New Guinea, the population that otherwise shows significant evidence of ancestry from a Denisova-like genome. However, I don't agree with the main idea of the article. The major human HLA alleles are evolutionarily ancient -- most of them predate the origins of modern human groups and are older than the founding of the Denisova-Neandertal populations. This is actually perhaps the worst region to look for evidence of interbreeding among these populations because the probability of incomplete lineage sorting (maintained by balancing selection) is very high.

    As a case in point, HLA-A*11 is very common in Papua New Guinea, but it is also very common in north India and in China. These two areas otherwise show no significant evidence of Denisova ancestry. We might conclude that the HLA-A gene just has an unusually high level of introgression into Asian populations, not typical of the genome as a whole. That's certainly possible. But without finding any substantial number of derived mutations in the HLA-A*11 variant in the Denisova genome and in living Asians, it is hard to rule out that the sharing of HLA-A*11 in all these populations is just coincidence.

    Of course, if the allele were absent in Africa, that would weigh in favor of the idea it is shared by Late Pleistocene interbreeding outside Africa. But HLA-A*11 is in Africa, just very rare. And it's in Europe. This is the kind of locus that is difficult to interpret: if it has any tiny disadvantage against malaria, for instance, its rarity in Africa is easily explained as a function of recent evolution, while its presence almost everywhere outside Africa would be no surprise even if there were never any interbreeding. This is not a case where the geographic distribution is an unusual coincidence -- it's present in Africa and relatively more common everywhere outside sub-Saharan Africa. So the distribution outside Africa cannot simply be explained by interbreeding with Denisovans -- not without selection -- leaving us stuck. Parham's hypothesis may be correct, but the data are really not sufficient to decide.

    HLA-C*07:02 -- the one apparently mentioned in the story -- is all over sub-Saharan Africa at low frequencies. Allelefrequencies.net has a dozen entries for the frequency of HLA-C*07:02 in sub-Saharan Africa, they all have it at frequencies up to around 7 percent (except for the small (n

    What about the question of hybrid vigor that the article raises? Is it possible that modern humans got HLA mojo from Neandertals and Denisovans?

    While only 6 per cent of the non-African modern human genome comes from other hominins, the share of HLAs acquired during interbreeding is much higher. Half of European HLA-A alleles come from other hominins, says Parham, and that figure rises to 72 per cent for people in China, and over 90 per cent for those in Papua New Guinea.

    I just don't think it's clear that these HLA alleles in humans have actually come from the archaic genomes.

    We've tried to match these at more precise levels (in the HLA system, that would be four- or six-digit haplotypes) and have not found the quality of the data high enough to manage a close match. That leaves us with the most superficial classification, which isn't enough to argue that the present human types are derived from the archaic genomes. Incomplete lineage sorting remains a good explanation for the similarities. In fact, we're thinking it makes a nice case study of just how hard it is to work with these genomes, which have lower than 2x coverage. Just typing the Denisova genome requires an assumption about whether the individual was a homozygote or heterozygote across the locus -- an assumption that we can test easily with higher coverage, but not so much with 1x and many gaps. It also requires greater trust in the mapping quality of the reads than we probably should have. With those caveats, the match to HLA-A*11 is likely but not totally solid. Saying that HLA-A*11 in modern humans came from Denisovans is simply premature. And while I've focused here on HLA-A, this is also true of all the other loci. There's a tipping point at higher coverage where typing becomes more secure, and the archaic data are not there.

    Anyway, I imagine that anyone typing HLA in whole genome data knows all this. The press account isn't going to go into the complexity, and I think it's worth noting the real difficulty of making inferences in this region of the genome on the archaic data. It's a tough problem and I've spoken to many human geneticists who thought we were foolhardy to start. But with the first information about the immune systems of archaic humans as the goal, you can see it's a worthwhile problem to tackle.

    Synopsis: 
    We're gathering the first information on the immune systems of ancient humans. Some challenges await.
Subscribe to Denisovans

Neandertals

For years, I've worked on their bones. Now I'm working on their genes. Read more about the science studying these ancient people.

Denisova

From a finger bone of an ancient human came the record of a completely unexpected population. My lab is working on the science of the Denisova genome.

Acceleration

The advent of agriculture caused natural selection to speed up greatly in humans. We're uncovering some of the ways that populations have rapidly changed during the last 10,000 years.

Malapa

Just outside Johannesburg, the Malapa site is producing some of the most exciting finds in human evolution. This site is the headquarters of the Malapa Soft Tissue Project.