john hawks weblog

paleoanthropology, genetics and evolution

The diploid genome sequence of an individual human.

Sat, 2013-02-09 10:27 -- John Hawks
Title: The diploid genome sequence of an individual human.
Publication Type: Journal Article
Year of Publication: 2007
Authors: Levy, S, Sutton, G, Ng, PC, Feuk, L, Halpern, AL, Walenz, BP, Axelrod, N, Huang, J, Kirkness, EF, Denisov, G, Lin, Y, Macdonald, JR, Pang, AWC, Shago, M, Stockwell, TB, Tsiamouri, A, Bafna, V, Bansal, V, Kravitz, SA, Busam, DA, Beeson, KY, McIntosh, TC, Remington, KA, Abril, JF, Gill, J, Borman, J, Rogers, Y-H, Frazier, ME, Scherer, SW, Strausberg, RL, Venter, CJ
Journal: PLoS Biol
Volume: 5
Issue: 10
Pagination: e254
Date Published: 2007 Sep 4
ISSN: 1545-7885
Keywords: chromosomes, J. Craig Venter, sequencing, whole-genome
Abstract

Presented here is a genome sequence of an individual human. It was produced from approximately 32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb) of contiguous sequence with approximately 7.5-fold coverage for any given region. We developed a modified version of the Celera assembler to facilitate the identification and comparison of alternate alleles within this individual diploid genome. Comparison of this genome and the National Center for Biotechnology Information human reference assembly revealed more than 4.1 million DNA variants, encompassing 12.3 Mb. These variants (of which 1,288,319 were novel) included 3,213,401 single nucleotide polymorphisms (SNPs), 53,823 block substitutions (2-206 bp), 292,102 heterozygous insertion/deletion events (indels)(1-571 bp), 559,473 homozygous indels (1-82,711 bp), 90 inversions, as well as numerous segmental duplications and copy number variation regions. Non-SNP DNA variation accounts for 22% of all events identified in the donor, however they involve 74% of all variant bases. This suggests an important role for non-SNP genetic alterations in defining the diploid genome structure. Moreover, 44% of genes were heterozygous for one or more variants. Using a novel haplotype assembly strategy, we were able to span 1.5 Gb of genome sequence in segments >200 kb, providing further precision to the diploid nature of the genome. These data depict a definitive molecular portrait of a diploid human genome that provides a starting point for future genome comparisons and enables an era of individualized genomic information.
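The 22% figure in the abstract can be checked against the event counts it lists. The sketch below is a rough check only: it assumes the total is the sum of the five itemized categories (SNPs, block substitutions, heterozygous indels, homozygous indels, inversions) and leaves out the segmental duplications and copy number variation regions, which the abstract does not count. The variable names are illustrative and do not come from the paper.

```python
# Event counts as itemized in the abstract. Segmental duplications and
# copy number variation regions are not enumerated there, so they are
# excluded here (an assumption of this sketch).
variant_counts = {
    "snp": 3_213_401,
    "block_substitution": 53_823,
    "heterozygous_indel": 292_102,
    "homozygous_indel": 559_473,
    "inversion": 90,
}

total_events = sum(variant_counts.values())
non_snp_events = total_events - variant_counts["snp"]
non_snp_share = non_snp_events / total_events

print(f"total events:   {total_events:,}")
print(f"non-SNP events: {non_snp_events:,}")
print(f"non-SNP share:  {non_snp_share:.1%}")
```

Under these assumptions the itemized counts sum to just over 4.1 million events, and the non-SNP share rounds to 22%, in line with the abstract.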

DOI: 10.1371/journal.pbio.0050254
Alternate Journal: PLoS Biol.
Citation Key: Levy:genome:2007
PubMed ID: 17803354
