john hawks weblog

paleoanthropology, genetics and evolution

Neandertals

  • Neandertal anti-defamation files, 13

    Fri, 2011-04-29 10:06 -- John Hawks

    From The Economist, "What's wrong with America's economy?"

    Of course, plenty more could be done to spur innovation. The system of corporate taxation is a mess and deters domestic investment. Mr Obama is right that America’s infrastructure is creaking (see article). But the solution there has as much to do with reforming Neanderthal funding systems as it does with the greater public spending he advocates. Too much of the “competitiveness” talk is a canard—one that justifies misguided policies, such as subsidies for green technology, and diverts attention from the country’s real to-do list.

    There's a Neanderthal funding system?! Holy lucre, Batman!

    Obviously if the Neanderthal funding system isn't directing government funding to my lab, it is badly in need of reform!

    Or...wait a minute. We all know that Her Majesty's Secret Service has information the rest of us don't. What if they've stumbled onto a black op Neanderthal project? OMG, THE CLONES!

    I swear, I won't tell a soul. Just bring me under the Neanderthal funding umbrella. It's classified.

  • Mailbag: Diet and isotopes

    Tue, 2011-03-29 19:34 -- John Hawks

    Re: "Tartar control and Neandertal plant use".

    In your review of the study on Neadertals and grain in dental calculi, you wrote the following:

    "The remains of starch grains and phytoliths tell us about diet breadth but not the proportions of different foods. They do note that nitrogen stable isotopes are most informative about protein-rich food sources, so that a substantial consumption of starchy plants such as grains and USOs might be hidden by isotope analysis."

    I also read this study, and I was curious about this comment in the discussion, as this is way outside my field of expertise. I was wondering if you could write a blog post commenting in more detail about what isotope data can and cannot tell us about the proportion of foods or food groups eaten by pre-historic populations, or if you have already done so, if you could direct me there.

    Many thanks for writing! You've been doing some nice work there.

    I have a long essay on the stable isotopes and diet:

    http://johnhawks.net/weblog/reviews/early_hominids/diet/stable_isotopes_...

    And two that deal more extensively with Neandertals and nitrogen isotopes:

    http://johnhawks.net/weblog/fossils/neandertal/neandertal_mammoth_diet_2...

    http://johnhawks.net/weblog/fossils/neandertal/neandertal_fish_drucker_2...

    And here's one about cave bears:

    http://johnhawks.net/weblog/reviews/behavior/non-primate/ursus/cave_bear...

    But all these are out of date in some respects. I've since had several conversations about the nitrogen isotopes. One thing that elevates 15N is breastfeeding, so the time of enamel formation relative to weaning makes a big difference. In more recent populations, the isotopes are often employed to give a picture of the place of birth relative to where the bones were found. A more comprehensive review is in order, but I'm not sure what the next find will be in hominins.

    Anyway, I hope that helps, and thanks again for the kind words!

  • Neandertal anti-defamation files, 12

    Tue, 2011-03-29 00:31 -- John Hawks

    British wine guru, Oz Clarke, of Top Gear-head James May as he teaches him how to judge wine and squash grapes:

    Time to show the wine Neandertal how it's done.

    Now, I think there's a more than decent chance that a Neandertal would be the ultimate sommelier. Humans have a whole raft of olfactory receptors that have been deactivated during our evolution, and some people have functional olfactory receptor genes that other human lack. I'd say going to Neandertals, we're likely to find that they have more functional olfactory receptors than most living people. "Wine Neandertal" could be a real compliment, in other words.

  • Mac attack

    Sat, 2011-03-26 09:25 -- John Hawks

    From satirical site Glossy News: "Painting of Big Mac Found in Neanderthal Cave".

    “What is most striking about this painting,” says Bouisquet, “is that this is precisely the time period during which Neanderthals went extinct. One naturally wonders, Could the consumption of fast food have contributed to the extinction? As yet we simply don’t know.”

  • Neandertal taste

    Thu, 2011-03-24 00:13 -- John Hawks

    I didn't comment on this study when it came out in 2009, but as I'm reviewing some materials I thought it worth taking down a note. Carles Lalueza-Fox and colleagues [1] intensively sequenced the TAS2R38 gene in a Neandertal bone flake from El Sidrón cave. This is a gene in humans that enhances bitter taste perception, and is responsible for the classic taster/nontaster polymorphism for the substance PTC. Some 1.5 million years ago, a mutant version of this gene arose with less sensitivity to bitter substances; both the high-sensitivity and low-sensitivity versions exist in human populations today. People who carry two copies of the less active allele are often called "non-tasters", in contrast to "tasters" who have at least one copy of the more active allele.

    Part of the difficulty of studying Neandertal genetic information is the low coverage of the genome reads available to date. Some parts of the sequence are not covered at all, and only a small fraction of sites in the genome are covered multiple times. If we want to study polymorphisms in a single Neandertal individual, we are limited to those areas with high read coverage, and even then we shouldn't put much confidence in them.

    For TAS2R38, Lalueza-Fox and colleagues did much deeper sampling of a single relevant site by PCR amplification. They ended with thousands of reads of the site they targeted:

    A total of 4307 sequences were generated for the TAS2R38 gene F142-R166 fragment (figure 1). Of the total, 2391 (55.51%) showed a C in nucleotide position 145, corresponding to a proline amino acid (taster haplotype), and 1916 showed a G (44.49%), corresponding to an alanine amino acid (non-taster haplotype). Three clones show singleton C to T or G to A substitutions that are the most common form of postmortem DNA damage (Briggs et al. 2007). The main researcher involved in the laboratory analysis (C.L.-F.) is proline homozygous. All the Y-chromosome sequences identified (n = 141) showed the ancestral allele and, thus, no male contamination of European origin could be detected in this amplification.

    That's a pretty good argument in favor of this individual having the polymorphism in question. There is of course a high probability that all these reads actually come from a very small number of template molecules, so it's not as convincing as it might look. But it's a picture of the kind of work involved in confirming polymorphisms from ancient sequence data. We will probably be reasonably confident when we have read coverage up to 15-20x coverage for most loci.

    Then we'll just have to worry about phasing. Maybe El Sidrón, with its related individuals, will turn out to be the perfect site for genomics.


    References

  • Did humans colonize the northern latitudes without fire?

    Mon, 2011-03-14 20:47 -- John Hawks

    Wil Roebroeks and Paola Villa [1] review the evidence for human control and use of fire in the archaeology of Europe during the Middle Pleistocene (130,000-780,000 years ago) and earlier. They observe that no evidence of human-controlled fire occurs in Europe before 400,000 years ago. This raises a puzzle: How did humans occupy the northern part of Europe without fire?

    The argument about the antiquity of fire is not new. There is very early evidence of fire at Swartkrans, Koobi Fora, and Chesowanja, which includes burned bones and heated artifacts, along with clay nodules that show evidence of heating as high as 400 degrees Celsius. The criticism of these early finds (reviewed by James [2]) centers around the difficulty of distinguishing human-made fire from natural bush fires. The association of the fire with artifacts can be readily explained: archaeologists only look for evidence of fire where they already have artifacts. The remaining question is whether artifacts or bones have been heated to temperatures hotter than those possible in bush fires, thereby providing evidence of human involvement. Burned bone from Swartkrans at least did reach such temperatures, seemingly unlikely without human involvement given their presence in the cave. I tend to think that humans did control fire early in some cases.

    Roebroeks and Villa do not dispute possible earlier evidence of fire, but claim that it was not habitual. Or to put it another way, some early humans may have used fire, but many or most did not do so. The lack of fire seems particularly surprising in the northern latitudes of Europe, where sites like Happisburgh (and Pakefield) show evidence of human habitation in the late Lower Pleistocene. Their review of the early sites is really worth reading and impressively compact. Nonetheless, I can't quote it in full; it's just too much text to extract. After a discussion of the earliest archaeological occurrences, they turn to the long sequences from Arago and Gran Dolina, where we really should expect to see some evidence of fire if people were using it.

    Arago and Gran Dolina contain long sequences and large quantities of lithic and faunal remains, subjected to taphonomic analyses (34–36). Their settings are comparable to the ones that, in later times, have often provided strong evidence of fire, such as Bau de l’Aubesier, Grotte XVI, and Lazaret in France; Bolomor Cave in Spain (Dataset S1); and Middle Paleolithic/Middle Stone age caves in Israel and in South Africa. Traces of fire have been found in the upper part of the sequence at Arago, in layers younger than 350 ka. No charcoal, no burnt bones, nor any other evidence of fire have been reported from any of the assemblages from the lower levels (dated to MIS 10–14). No charred bones or heated artifacts have been reported from the Gran Dolina sequence (TD4– TD10). Rare charcoal particles have been found in thin sections of the TD6 sediments, but these sediments originate from the exterior of the cave, and there is evidence of low-energy transport (37); thus, the charcoal may not be in situ. However, the high density of human, faunal, and lithic remains as well as their state of preservation and refitting studies (38, 39) clearly indicate an occupation in situ with little postdepositional disturbance. The absence of any heated material from the long sequences of Gran Dolina and Arago, both documenting hominin occupations over several hundred thousand years (36, 40), is striking. This is a strong pattern, which can be tested by future work at other hominin habitation sites. We suggest that the European record displays a strong signal, in the sense that, from ~400 to 300 ka ago, many proxies indicate a habitual use of fire, but from the preceding 700 ka of hominin presence in Europe, we have no evidence for fire use.

    One thing that really impressed me visiting Roc de Marsal last summer was that the site preserves a long archaeological sequence in which some levels are densely packed with charcoal and the remains of hearths, and at least one well-defined layer, with abundant evidence of tools and debitage, just has hardly any evidence of fire at all. These were Neandertals, not Middle Pleistocene Homo, and they managed to get by without leaving any clear evidence of fire even though many Neandertal populations clearly did control and use fire extensively, including at this very site at other times.

    There really were people living in the Pleistocene of Europe who didn't use fire very much, at least as evidenced by relatively long cultural deposits in well-stratified rock shelters and caves. Unfavorable preservation can explain the lack of charcoal or hearths at some sites, but not all of them. If we don't have a single good instance of fire in Europe before 400,000 years ago, people may well have done without it.

    The authors' review of fire evidence after 400,000 years ago in Europe is also very useful, and they include supplementary data table with fuller information and references for all the sites they discuss. It is impressive just how much evidence has accumulated over the years, and Roebroeks and Villa have doggedly tracked it down. They conclude that Neandertals had essentially the same degree of control of fire as Upper Paleolithic humans, and consider the use of fire as a processing step in the manufacture of complex tools:

    A recent study provides evidence of early modern humans at the site of Pinnacle Point in Southern Africa regular use of heat treatment to increase the quality and efficiency of their stone tool manufacture process 164 ka ago (13). The authors infer that the technology required a novel association between fire, its heat, and a structural change in stone with consequent flaking benefits that demanded “an elevated cognitive ability.” They also suggest that, when these early modern humans moved into Eurasia, their ability to alter and improve available raw material may have been a behavioral advantage in their encounters with the Neandertals. However, this interpretation ignores that Neandertals used fire as an engineering tool to synthesize birch bark pitch tens of thousands of years before some modern humans at Pinnacle Point decided to put their stone raw material in it. In more general terms, a greater control and more extensive use of fire is sometimes (12) seen as one of the behavioral innovations that emerged in Africa among modern humans and favored the spread of modern humans throughout the world. As stressed by Daniau et al. (52), if extensive fire use for ecosystem management were indeed a component of the modern human technical and cognitive package, one would expect to find major disturbances in the natural biomass burning variability associated with and after the colonization of Eurasia by modern humans. In their study of microcharcoal particles from two deep-sea cores off of Iberia and France, spanning the 70- to 10-ka period of biomass burning, the authors did not recover any sign that Upper Paleolithic humans made any difference: either Neandertals and modern humans did not affect the natural fire regime, or they did so in comparable ways.

    I do think the silcrete processing is interesting, but so is the pitch processing. For that matter, the possibility of fire-hardening in the Schoeningen spears would be a case of deliberate production of a complex tool using fire (complex, in that the fire-processing adds a step).

    Gesher Benot Ya'aqov, in Israel dating to around 800,000 years ago, is a highly compelling site in terms of evidence of fire. There are distinct hearth areas that correlate with archaeological scatter and have burned nut hulls and other foodstuffs. While Roebroeks and Villa express skepticism about the earlier evidence from Africa (specifically pointing to the high likelihood of bush fire as an explanation), they do accept Gesher Benot Ya'aqov as a likely fire location, while discussing the strength of the evidence. It's not such a high threshold to set; it seems like other sites should be able to meet it if fire was common.

    Personally, I am quite ready to accept that fire was invented many times by Lower Pleistocene humans and may have occurred in some regions of the world ephemerally. The maintenance of this tradition may have been a challenge that these early humans couldn't meet over long spans of time. This view does imply that the advantages of fire, including cooking, were not a typical part of the repertoire of Early Paleolithic people. But that would be consistent with what we understand of traditions in other species of primates; where one population may be pursuing complex and apparently valuable extractive foraging that another population lacks, despite otherwise being ecologically similar.


    References

    1. Roebroeks W, Villa P. On the earliest evidence for habitual use of fire in Europe. Proceedings of the National Academy of Sciences [Internet]. 2011;108:5209–5214. Available from: http://dx.doi.org/10.1073/pnas.1018116108
    2. James SR. Hominid use of fire in the {Lower} and {Middle Pleistocene}: a review of the evidence. Current Anthropology. 1989;30:1–26.
    Synopsis: 
    Wil Roebroeks and Paola Villa claim the archaeological record doesn't provide evidence for systematic fire use in Europe before 400,000 years ago.
  • Neandertal anti-defamation files, 11

    Wed, 2011-03-02 18:39 -- John Hawks

    Slate has an editorial by Farhad Manjoo, featuring the idiocy of people who write crank letters to NPR ("We Listen to NPR Precisely To Avoid This Sort of Stupidity"). Yes, I know some of my readers probably sympathize with the letter writers, plaintive plaints to keep their highbrow high.

    But lo, there at the end of the dyspeptic quote-mine, we find this:

    "You can't mention sports without someone saying, 'Why are you covering sports—it's just a bunch of Neanderthals, it's just a bunch of fascists!' " says NPR sports correspondent (and Slate sports podcast "Hang Up and Listen" panelist) Mike Pesca.

    Imagine! I ask you: What kind of quisling would stack Neandertals and fascists in the same breath?

  • Neandertal segments of X chromosomes

    Wed, 2011-02-23 16:06 -- John Hawks

    Last year, this Neandertal genome came out. No doubt you've heard about it. So maybe by now you're wondering where the new science is that's being done on this genetic information.

    We've been ramping up here in my lab for a few months, working with these data. My students have a couple of projects that we'll be keeping close to the vest. But for the most part I think we'll share stuff as we go along. This is all open access data, and there are some questions of fundamental interest that are actually pretty easy to resolve.

    The initial Neandertal genome draft publication [1] came with some analysis of the genome-wide similarity of the Neandertal draft genome and a few human genomes. A new review of the basic method of comparison has appeared in Molecular Biology and Evolution, by Eric Durand and colleagues [2]. The basic idea is that a branching model between populations without gene flow predicts that two members of one population have equal amounts of sequence similarity to a third individual in another population. If that third individual turns out to be closer to one or the other of the first two, we can reject the hypothesis that those first two are part of a population that has branched without gene flow away from the third individual's population. When we bring an African and a European as the first two individuals, and a Neandertal as the third, we find that the European is in fact closer to the Neandertal. So we can infer gene flow from Neandertals into the ancestors of Europeans. This comparison is nearly equally significant when we compare an African and a Chinese individual, or an African and an individual from Papua New Guinea. Thus we can infer that Neandertals contributed genes to the ancestors generally of present-day non-Africans, not specifically present-day Europeans. The amount of gene flow that can explain the pattern of genetic similarities adds up to around 2.5% of the total ancestry of non-Africans today. Again, it's not a direct observation; it's a model that explains the greater similarity of the Neandertal genome to people outside Africa than within sub-Saharan Africa.

    As you can see, this leaves open a key question. We don't know whether genetic similarities between Neandertals and present non-Africans are the same in different areas outside Africa.

    The whole-genome comparisons have great statistical power to test the hypothesis of gene flow in general. With a hundred thousand or so actual sequence differences between Neandertals and any given human genome, the method can potentially detect very small amounts of gene flow. What we're seeing in the Neandertal data is anything but small -- it amounts to greater non-African similarity to Neandertals at thousands and thousands of sites.

    But comparison of three whole genomes gives us very little power to identify the specific loci affected by gene flow. If a French genome has three percent ancestry from Neandertals, we can predict that other genomes in France probably do also. That's a consequence of independent assortment -- we're not looking at people who actually have Neandertal grandparents, we're looking at a population that had Neandertal ancestors thousands of generations ago. So all French genomes are probably more-or-less alike in the Neandertal quotient. But will they have the same three percent of Neandertal-derived alleles? Almost certainly not: each Neandertal-derived locus would have to be fixed in France for them to be identical in all genomes. Much more likely, a much larger number of Neandertal-derived alleles exist at an average frequency of three percent. Such a distribution would predict that the average Neandertal-derived variant found in our first French genome has only a 3 percent probability of showing up in a second genome. Looking at one genome in one population will find only a small fraction of loci that have been affected by Neandertal gene flow.

    Hence, if we want to answer the question about different populations, we need to look at a reasonably large sample of individuals. We need to know whether a Neandertal-derived variant in France occurs at the same frequency in China, and vice-versa. Are there loci where a Neandertal allele occurs at 10 percent in France, but never in China? Does a full list of loci with one or more Neandertal-derived variants include any interesting functional genes? Answering these questions would tell us a lot about the demographic and adaptive conditions that led to our Neandertal heritage.

    Enter the HapMap

    You'd think that a genome-wide set of SNP genotypes would be useful for testing hypotheses of population history. The HapMap has more than 3 million SNP genotypes from hundreds of individuals from China, Japan, Utah, and Nigeria, and more than a million genotypes from nearly a thousand other individuals from other populations. In other words, it's the kind of sample that could tell us a lot about the frequencies of Neandertal-derived alleles if we could find them.

    But the HapMap project didn't identify its set of genotypes to help us reconstruct population history. The aim was to find most common variants, and secondarily to add more variants in low-variation regions to allow linkage mapping of medically interesting phenotypes. SNP sites were disproportionately found in some populations (first, Europeans) more than others. These processes of SNP discovery led to ascertainment biases, in which the difference between samples depends not only on their histories, but also on where we chose to look.

    Ascertainment bias is a real pain if we want to test the hypothesis of Neandertal genetic contribution to today's humans. Look at it this way: Suppose we find a rare SNP allele in Europeans, absent in Africans, but present in the Neandertal genome. Looks like a piece of support for Neandertal ancestry of Europeans. If those sites outnumber the sites where we find a rare allele in Africans shared with Neandertals, not in Europeans, then that would seem like the same scenario outlined above -- a case where one of the living populations carries more Neandertal similarities than the other. Evidence of gene flow, right?

    Ascertainment bias leaves another possibility: Maybe we looked harder for rare variants in one of the living populations. If so, the lack of rare Neandertal-shared variants in the other population may be an accident of our SNP discovery procedure.

    There are ways around this problem. For instance, if the Neandertal genome carries many derived alleles for SNPs shared with Europeans, it weighs strongly in favor of recent genetic exchanges instead of ancient incomplete lineage sorting. But this basic question of "which population has more Neandertal ancestry" may still be hard to resolve.

    Haplotypes from Neandertals

    Green and colleagues [1] also presented a second approach for testing Neandertal ancestry. They used SNP data to identify regions of the genome where non-African populations appear to have a "deep root" to their genealogy, but Africans do not. These regions are rare across the genome; they focused on 100-kb intervals, finding only a dozen genome-wide that fit their criteria. But each of these is a case where non-Africans appear to have an ancient genealogical split between two haplotypes, all the SNPs lining up to distinguish one branch of the genealogy from another. If both are not represented in Africa, then presumably one of them came from some non-African ancient population. And indeed, they found ten of the deep branches within the Neandertal genome.

    This approach makes use of the information that SNP data provide about linkage. A segment of a chromosome from a living human that is similar to a Neandertal segment may be explained either by recent ancestry from Neandertals or from incomplete lineage sorting from the ancient human-Neandertal common ancestors. But if that segment is long, it probably isn't from the ancient common ancestors of humans and Neandertals, because recombination should have broken up the linkage across that long interval. Hence, long haplotypes shared by living humans and Neandertals are best explained by recent mixture. If those long haplotypes are predominantly found in non-Africans but not Africans, it tends to confirm that they have come from recent population mixture with Neandertals.

    But how long should these intervals be? This is an area where we can improve on the approach taken by Green and colleagues [1]. A hundred kilobases is way too long to represent the average Neandertal-derived haplotype. The average rate of recombination across the genome is around one centimorgan per megabase -- meaning that an interval of one million base pairs has a one percent chance of recombination per generation. That's a chance 1/1000 of recombination per 100 kb per generation, meaning that half the linkage across 100 kb should be broken up in roughly 700 generations. For humans, half the linkage at that distance decays after only 18,000 years or so, except in regions of low recombination. If we go as far back as 100,000 years ago, half of the linkage decays across regions as short as 18 kilobases. That means if we look at windows 20 kb long for evidence of Neandertal-derived haplotypes, we are likely to miss a large fraction of them. Hundred-kilobase intervals will miss nearly all of them.

    Bottom line, we want to look at intervals as short as we can. But if we look too short, we won't have much evidence to work with. The 3-million SNPs in the HapMap version 2 give us one site every kilobase on average. Ten kilobases will give us around 10 SNPs. A 10-SNP haplotype may sound impressive, but if most of those SNPs have a derived allele at low frequency (say, less than 10 percent), then it starts to become more likely that a given haplotype resembles the Neandertal genome just because they share ancestral SNP alleles. Ideally we'd like more SNPs, but in reality the Neandertal sequence draft is likely to lack several, so if we want 10 SNPs worth of comparison, we'll need to look at longer intervals.

    And really, HapMap 2 is a small sample to try to find low-frequency haplotypes from Neandertals. By analogy with the method used by Green and colleagues, we can find haplotypes that are present in the CEU (European ancestry) sample, present in the Neandertal genome draft, but absent in the YRI (West African ancestry) sample. But HapMap 2 includes only 120 genomes from each of the YRI CEU samples. If we have a variant at in Europe at 1 percent, we're pretty likely to miss it. Worse, if we find a haplotype in Europe at 1 percent, we're really not able to reject the hypothesis that it's in Africa at the same frequency, even if no copies of it are in YRI. We can help fix this problem by looking at HapMap phase 3 samples, which include two more African populations, bringing the total sample up to more than 300 within Africa. But there are fewer SNPs in HapMap 3, limiting our comparisons to longer windows. One could even contemplate the HGDP sample as a way to add even more individuals to our comparative samples. But that sample has many fewer SNPs, so we would need really long intervals to test the hypothesis of Neandertal ancestry for particular haplotypes.

    By the end of this I'll surely be pining for sequence data. Of course for that we haven't long to wait. But I have an aim for which genotype data are at the moment the only feasible approach. So I'm a bit stuck: Using a bigger sample means using longer intervals, which means I'll miss more and more Neandertal-derived haplotypes. But we should thereby get reasonable power to find any common haplotypes derived from Neandertals.

    Phasing and the haploid Neandertal

    The HapMap 2 samples and some of the HapMap 3 samples were taken from pairs of parents, where a child was also genotyped. Those trios make it possible to determine which SNP alleles were linked on the parents' chromosomes, providing a natural "phase" for the haplotypes. For some other samples, the phase was inferred algorithmically, using assumptions about population history and knowledge about which haplotypes are present in the populations with trios. Phasing algorithms are not ideal, because the assumptions about population history (inferred in many cases from the data) may be false. But over the relatively short intervals we're considering here, phasing will probably not lead to false positives.

    Neandertal draft genomes are themselves more of a problem. Each sampled individual is known from a large number of short reads, which (with some luck) can be aligned with the human genome map. The present data include many gaps. More important, there are only a very small number of places where the number of reads is high enough to determine whether a Neandertal individual was a homozygote or not. The Neandertal consensus sequences are built by taking the most frequent base from these reads aligned to any given site in the human genome. That means that the Neandertal "haplotype" across any set of SNP loci may well be a jumbled chimera of two different haplotypes carried by the Neandertal individual. For the current analyses, I have kept the Neandertal individuals separate -- so the haplotypes here were derived only from the Vindija 33.16 individual. If we use a consensus sequence taken from multiple individuals, we will have fewer gaps but potentially more jumble of different haplotypes.

    There's not much to be done about this problem. It should mostly cause us to miss true instances of Neandertal genetic ancestry, and we may be able to quantify the extent of this error in some high-coverage areas.

    (UPDATE 2011-02-24): I should mention, my lab has found that the Neandertal consensus sequences themselves have issues; the consensus-building algorithm appears in many cases to have included the human reference genome SNP allele in the place of the allele found in the majority of Neandertal reads. We are not yet sure how extensive this phenomenon is across the genome, but we have found it recurrently. We hypothesize that this is because of the priors on accepting calls with low read quality; the reference sequence seems to heavily bias the algorithm even in the presence of multiple contrary reads. We will have to check SNP calls manually in candidate regions.

    OK, so let's find the Neandertal regions!

    The strategy is fairly clear. I'll take a 10-SNP window from the HapMap, determine the haplotype of the Vindija 33.16 genome, see if that Neandertal haplotype occurs in the CEU HapMap sample, and then see if it also occurs in the YRI, MKK and LWK samples. When I find a haplotype shared with the Neandertal in Europe but not in Africa, I'll take that as a candidate haplotype for Neandertal ancestry.

    I probably want to be a little more permissive than that, actually. A Neandertal haplotype that is present in Europe, and present but rare in Africa may still be a good candidate. A Neandertal haplotype that does not match at all SNPs may also nonetheless be a good candidate, considering that the consensus is often merging two true haplotypes together. There's not much I can do about the consensus problem, because I don't have any way of figuring out the missing information except in rare cases with multiple sequence reads. But to address the first problem I can relax my criteria a bit with respect to variation inside Africa.

    Sliding the window down the chromosome will allow me to find the length of Neandertal-identical haplotypes in each individual, which could lead to an estimate of linkage decay. Across the genome, this will yield an estimate of the time that population mixture with Neandertals took place.

    Several other observations should lend some confidence in particular candidate haplotypes. The more a candidate includes derived alleles that are not themselves common in Africa, the more convincing it will be. If it does represent a "deep root" -- that is, if no close relative of the Neandertal haplotype occurs in the African sample, that also helps. The region with Neandertal identity shouldn't be too long. It might be quite common -- a few Neandertal-derived alleles may have been positively selected in later populations. But most of them are likely to be rare -- so I should expect to see many of them in only one or two copies in the CEU sample.

    I'm obviously interested in whether different populations (for example, Europe and China) have the same Neandertal-derived haplotypes. I'll leave that off for now -- there's much too much in this post already.

    So to be clear, this procedure will find haplotypes that are likely to have come into non-African populations from Neandertals. No single test will confirm these; but a combination of factors may be compelling for individual haplotypes. We can identify which genes may be in or near an interval where a candidate haplotype is found, but in all likelihood we will not have any known functional polymorphisms in the SNP data. This procedure then will provide no evidence that a particular Neandertal-derived allele has any functional effect in any living people.

    Some results

    I'll be reporting an awful lot more about results over the next few days. My first series of comparisons was the X chromosome, for reasons that will become clear shortly. On the X, there are 396 intervals where a 10-SNP Neandertal haplotype is identical to some CEU phased haplotypes and two or fewer within African HapMap samples.

    They vary in frequency in more or less the expected way -- a few of them are relatively common (10 or more copies out of the CEU sample, for example) most have only one or two copies in CEU.

    These vary substantially in length, mostly because some areas have very low Neandertal coverage. A few are more than 100-kb in length, most are 30 kb or less.

    The haplotype with the strongest signature -- 100-kb interval encompassing 26 SNPs in the Vindija 33.16 genome, is found in more than 15 (and centrally, in 22) CEU individuals and in no African individuals. The interval spans across part of the DMD gene (associated with Duchenne's muscular dystrophy). Conveniently, this is precisely the interval identified by Yotova and colleagues [3] as a site with Neandertal-derived alleles in non-African populations. They used comparisons at the sequence level, finding the Neandertal-derived variant at a frequency of 9% overall outside Africa. I have not yet confirmed that the SNP haplotype corresponds to this Neandertal-derived allele at the sequence level, but we should be able to manage that using public genomes. It's a nice confirmation that we're looking at the right kind of candidate loci.


    References

    Synopsis: 
    My research is outlining regions of human genomes that were derived from Neandertals. Here are some of the methods.
  • Neandertal anti-defamation files, 10

    Tue, 2011-02-22 17:20 -- John Hawks

    Today, it was Senator John Kerry:

    And in what might have might been the most bizarre exchange of the two-hour gathering, Kerry apologized to one woman who claimed he had called her a Neanderthal for not believing in global warming.

    Hmmm.... I think we can forgive the Neandertals for being skeptical about global warming.

  • Neandertal anti-defamation files, 9

    Fri, 2011-02-11 00:09 -- John Hawks

    Why do they have to bring poor Neandertals into it?

    The eyebrow-raising slap came in response to [former senator Rick] Santorum's recent comments that Palin was likely skipping an annual gathering of conservatives in Washington this weekend because of other "business opportunities" and her mothering responsibilities.

    ...

    "I will not call him the knuckle-dragging Neanderthal," Palin continued. "I'll let his wife call him that instead."

    Oh, well, this is so easy a caveman could do it.

Pages

Subscribe to Neandertals

Neandertals

For years, I've worked on their bones. Now I'm working on their genes. Read more about the science studying these ancient people.

Denisova

From a finger bone of an ancient human came the record of a completely unexpected population. My lab is working on the science of the Denisova genome.

Acceleration

The advent of agriculture caused natural selection to speed up greatly in humans. We're uncovering some of the ways that populations have rapidly changed during the last 10,000 years.

Malapa

Just outside Johannesburg, the Malapa site is producing some of the most exciting finds in human evolution. This site is the headquarters of the Malapa Soft Tissue Project.