| Title | Identifying Personal Genomes by Surname Inference |
| Publication Type | Journal Article |
| Year of Publication | 2013 |
| Authors | Gymrek, M, McGuire, AL, Golan, D, Halperin, E, Erlich, Y |
| Journal | Science |
| Volume | 339 |
| Issue | 6117 |
| Pagination | 321 - 324 |
| Date Published | 01/2013 |
| ISSN | 0036-8075 |
| Keywords | genomics, privacy, testing, whole-genome |
| Abstract | Sharing sequencing data sets without identifiers has become a common practice in genomics. Here, we report that surnames can be recovered from personal genomes by profiling short tandem repeats on the Y chromosome (Y-STRs) and querying recreational genetic genealogy databases. We show that a combination of a surname with other types of metadata, such as age and state, can be used to triangulate the identity of the target. A key feature of this technique is that it entirely relies on free, publicly accessible Internet resources. We quantitatively analyze the probability of identification for U.S. males. We further demonstrate the feasibility of this technique by tracing back with high probability the identities of multiple participants in public sequencing projects. |
| DOI | 10.1126/science.1229566 |
| Short Title | Science |
| Citation Key | Gymrek:2013 |
Identifying Personal Genomes by Surname Inference
Neandertals
For years, I've worked on their bones. Now I'm working on their genes. Read more about the science studying these ancient people.
Denisova
From a finger bone of an ancient human came the record of a completely unexpected population. My lab is working on the science of the Denisova genome.
Acceleration
The advent of agriculture caused natural selection to speed up greatly in humans. We're uncovering some of the ways that populations have rapidly changed during the last 10,000 years.
Malapa
Just outside Johannesburg, the Malapa site is producing some of the most exciting finds in human evolution. This site is the headquarters of the Malapa Soft Tissue Project.






