john hawks weblog

paleoanthropology, genetics and evolution

blogging

  • Opening a bibliography database for human evolution

    Tue, 2010-08-10 13:53 -- John Hawks

    I'm announcing today the new availability of a bibliography section here on the weblog. At present this database includes more than 11,500 entries. These represent a large fraction of the historical and contemporary literature in human evolution.

    The database as it exists today owes to the work of many, many people. Foremost among these is Milford Wolpoff, who compiled and has curated an immense bibliography as a flat text file over many years. It includes entries that have been cited in many papers by Milford, his students, and many coauthors.

    Milford says that these folks are too numerous to remember. That has the ring of truth. Just by looking through the entries you get a picture of an active group of people over more than forty years.

    To these "legacy" references, I have begun to add fuller bibliographic information during the last few years. This began with Digital Object Identifier (DOI) tags for new, and later for older entries. In the last few months, I have begun to archive further information, including abstract.

    I continue to update and backfill references that belong in the database. There are many cited here on the blog that haven't been properly archived, and I will be adding these along with new citations.

    What you'll find

    The bibliography has a search filter, search terms will match author, keyword, title or abstract (where present). With more than 11,000 entries, you want to be a little selective about how you search. Author names work really well, and yield a list separated by year of publication.

    You'll find each reference preceded by a unique citation key in brackets. I did this purely for my own convenience, but for those who may want to download lists of citations, it may also prove useful.

    A list of search results can be exported to BibTeX or RTF format for download.

    There is also a "filter" tab that allows keyword, author, and year filtering of the list. This is really not very useful; the size of the database makes it much simpler to search than to filter all entries. I have entered keywords for only very recent entries, so keyword filtering is not especially helpful.

    Each citation in the reference list comes accompanied by several links. You can click on the title, taking you to a reference page with full bibliographic information. For recent entries, the reference page includes DOI, direct links to journals, and the abstract. For all entries, the links include Google Scholar lookups and citation downloads (in BibTeX or RTF format). Author links give a list of all my database entries from that author. These require exact name matching, so don't rely too much on them -- a search of the database for an author's last name is usually more complete.

    My bibliography section has its own feed, which lists new entries in reverse chronological order. I tend to upload new references in batches, so you'll frequently see a dozen or so show up at a time. I've put a short excerpt of the feed in a block on the left side of every page, just above my blogroll.

    Snafus

    On the topic of author names: I know that many of you will immediately search for your own name. You'll find some of your publications there, but you almost certainly will see that many are missing. A few of you may not find a single paper!

    Please don't be offended. Remember that the entire list was typed by somebody. If a paper isn't there, it's because nobody typed it into the list. Really, only a few of mine are in there!

    I'm not volunteering to add another 11,000 entries to the list right away. It will take me 5 years or so to manage that, at the current rate. But if you find yourself wanting to pitch in, to help add some of your references to the database, then let me know and I'll work with you.

    There are countless typos. I do not intend to fix these, unless I cite the paper moving forward. It would literally take me weeks of work to fix the typos in these entries. Please don't contact me to fix these, because I won't do it!

    Some errors in the database have resulted from a script gone wrong. I scripted them all into BibTeX. When I did this, my script langage was Pascal, which was no picnic. That's a lot of pattern matching on citations. To the credit of all the people who entered data, the overwhelming majority of citations fell into a few common patterns. But there were oddballs, which didn't get translated properly. Over the years I've fixed many of these script-induced errors, but not all. Some of these are ugly and weird. I'm sorry!

    A few quirks come from the software that presents the database. I haven't begun hacking on this, and I've decided to leave the defaults for awhile to see how well they handle the requirements.

    Most critically -- inline links to the references get mixed up when there are more than one reference list on the page. Since this is usually true of my front page, that's a problem! This is a minor annoyance, since the inline links only save a bit of scrolling, but it may not be easy to fix.

    Google Scholar lookups work pretty well, but I've run into some issues. For some older entries, the links fail to find papers that Google actually knows about. The problem is that the system passes the full title on to Google as a "quote", requiring an exact match. These often fail. Removing the quotes from these will often recover the citation in Google Scholar.

    If you do a search, the biblio page will continue to filter on your search phrase until you reset it. There's always a "reset search" link right up at the top of the page.

    Keyword searching works quite well with recent entries to the database, but not with the legacy references. If you are working on a research paper and want to do a lot of keyword searches, you'll be much better off starting with Google Scholar or Web of Science.

    The inner workings

    I got to talking with some researchers from Microsoft a few months ago, who were interested in the ways that I curate information. During our conversation, I came to realize that maintaining bibliographic data is the worst bottleneck in my work.

    I use BibTeX in my research work, but I've never had a good workflow that would encompass both research articles and blog posts under the same bibliography system. Often, I'd blog about a paper but fail to get it into my research database. If I wanted to blog about a historical paper, I'd have to go looking for it in my bibliography.

    Once I talked this through, I saw what I needed to do. I had to find a better way.

    Drupal has really significantly added to its bibliographic capabilities in the last couple of years. I found that the Biblio module now has almost everything I need to import and share references. With this module, Drupal can provide inline citations for blog posts and automatically compile formatted bibliographies. It also provides all those cool links to Google Scholar and other online sources.

    One more sed script to replace some macros, and I had all my BibTeX database ready to upload.

    But that didn't solve my problems with data input. I found that CiteULike could take on this task pretty well. The site automatically scrapes bibliography information from the websites of journals. Increasingly, everything with a Digital Object Identifier link has some way for CiteULike to get its information --- even many edited volumes.

    I've tried CiteULike and many other bibliographic tools in the past. I know that other folks like Zotero, or Papers, or Mendeley, or RefWorks, or CiteSeerX. I've tried them all -- every few months I've given them another look to see if they've reached that critical point where they're useful. I never had much luck. There was always some snag keeping me from getting back value for the effort I put in.

    I can't say whether CiteULike is the best of them today, but it did what I needed done -- crucially, accurate BibTeX import of my whole reference file, including citation keys, easy setting citation keys for new references, and bulk export so I can update both my local file and Drupal without extra typing. Plus, the recommendation list is really useful.

    Final thoughts

    Most of the database is also available under my CiteULike profile, so if you're a user of that service, you may find that useful. It's more searchable in some respects, less so in others. CiteULike also offers a feed of my new entries, which run with abstracts. It's like a whole abstracts blog!

    Meanwhile, I'll be filling in references very quickly for the next few months. It remains a work in progress, especially as I continue to merge blog citations with the database. If you find it useful, please drop me a note so I'll know how you're using it!

  • Siphoning the firehose

    Tue, 2010-07-20 14:22 -- John Hawks

    Bora Zivkovic leaves ScienceBlogs and reminds us of the imprint that blogging has made on some careers in the last five years. Reading his thoughts on blogging and media, I found some similarities on something written by Richard Fernandez today: "The best of times".

    Editors of the future, if they still exist, will be graduated from Carnegie Mellon, Caltech or MIT rather than the Columbia School of Journalism. The journalists themselves I think, will be replaced by what may be called embedded sensors in place. The age of scribbler is over and the age of the literate practitioner and whistle blower has just begun. Interior debates within the industry, the professions and government will soon become the primary source of news. The primary challenge of reporting in the future will be to find entree into a circle to which one does not belong in order to write a story as an outsider. Absent that the insiders will generate the story on their own.

    In science, the firehose is open: Everyone publishes research, some very good, some bad, much useless. It's written in an obfuscating language, much of it in journals that are accessible only to the select.

    The "embedded sensors" are important -- giving perspective on how science is done and what may be coming next. The best science journalists are writing those stories, some of them almost are anthropologists of science.

    At the same time, every scientist of note is already an aggregator, choosing articles to read, to discuss in journal clubs, and possibly to cite. We're all editors of a sort, but few take the time to be good translators.

  • Snark and trade

    Mon, 2010-07-12 10:28 -- John Hawks

    If you haven't had your fill of angsty petulance, then Scienceblogs and its stable of writers have been wading through Edward versus Jacob territory. Jonathan Eisen's parody leverages the other summer drama, LeBron's departure from Cleveland:

    Morris, MN — For the final act of the drama that has captured the imagination of science blogging fans around the world, PZ Myers of Pharyngula plans to announce his decision live on the Discovery Channel at 9 on Thursday night.

    The arrangement, first reported by Discovery, was confirmed by a person close to Myers, who requested anonymity because he was not authorized to speak on Myers behalf.

  • Meeting casts

    Mon, 2010-04-19 22:10 -- John Hawks

    I'm just back from the physical anthropology meetings. What a lot of interesting things there were -- a few in the sessions, and many outside of them!

    A few people asked me what I saw or heard that I would be writing up for the blog. I had to explain that a long time ago I decided not to blog about stuff I saw at conferences. It makes life easier in several ways -- I don't have to sit around taking notes, and people don't have to worry what they tell me.

    Naturally, there are exceptions to every rule. Sometimes people have already published their stuff, or they're hoping to publicize work that won't be appearing in "embargo" journals. And sometimes there's stuff that isn't science, at least not directly, but deserves a record of some kind.

    About these meetings, I want to write one thing -- these were outstanding for sharing casts of new things with the field at large. Darryl de Ruiter brought casts of many of the Malapa specimens, and made many opportunities to share them with everybody interested. Scott Simpson had brought a cast of his new reconstruction of the pelvis of the Nariokotome skeleton, and was showing it along with his poster about it.

    For readers who don't usually study anatomy, I have to emphasize just how important it is to be able to look at a physical object. Descriptions and photos are, of course, necessary for publication, and they give a formalized account of anatomy. But the sheer size and three-dimensional appearance of a physical object carries tremendous information, not easily conveyed in words. I have worked with bone and fossils for many years, so that handling a cast allows me to place it next to thousands of objects in my tactile mental catalog. I have a much better understanding of those fossils now that I've gotten a chance to handle the casts, and that memory will stay with me.

    I haven't seen such an availability of new casts at the meetings since 2001, when Maeve Leakey had casts of Kenyanthropus.

    So I want to recognize how open and accessible those objects were at these meetings. These guys are real class acts, and their willingness to share and talk about the new fossils will advance the science. The quality of the reviews for their upcoming research papers will certainly be higher, since many of the potential reviewers will have a much greater familiarity with the fossils. Besides that, the mere opportunity to look at things along with a wide range of experts is really unique. I congratulate them.

  • Has Technorati suddenly gotten useful again?

    Sun, 2010-04-04 21:52 -- John Hawks

    I had a surprise this weekend. After years of declining value through increasing noise, I'd essentially stopped checking my Technorati stats. What had degraded almost beyond usefulness took a leap into unrecognizable territory last year, as the team started revamping its software. Yet, it now seems that the tinkering has worked: Technorati got its mojo back.

    Back in the early days of blogging, all the kool kids "claimed" their blogs on Technorati. In those days, pre-2005, there was no Google Blogsearch. Not everybody had a feed yet, and the diversity of blog software was very uneven in its support for feeds, trackbacks, and blogrolling.

    Topical "communities" grew mostly by the blogrolls of well-trafficked sites -- those who knew the ropes led by example, others picked up the techniques as they read. A few commercial ventures started to promote the idea of shared hosting for blogs with similar interests -- leading to the birth of the "science blog" concept, both in the trademarked and more ecumenical versions. In the blogoverse, one ruling credo came to the fore: It wasn't how many readers you had, it was how many inbound links. Links gave an air of reciprocity to the entire enterprise -- you read other people, linked to what you liked, and thereby shared their work with others. Link-love was born.

    Technorati did two things well. Most important, the site provided a report of inbound links. For those (like me) who never did get trackbacks to work right, having a third-party site provide link data was the best way to find new blogs, return links to them, add them to the blogroll, and build the community. I checked the site every day, to find the conversations that my posts started elsewhere.

    But Technorati's more visible accomplishment was its blog ranking. One numbered list, from one to twenty-four million, listing every blog in order of inbound links. In the early days, the list was almost aristocratic. Latecomers like me could aspire to get the number of links of more established blogs, but it wasn't too easy to climb the charts because they tracked the total number of links, not the number in the last few months. It took an interesting writer to climb the chart. Technorati wasn't the only ranking or link-tracker -- N. Z. Bear's "Ecosystem" memorably gave a cute name to everyone in its tiers of blog ranks, and other upstart rankings would appear over the next several years.

    In 2006, mainstream science started to take notice of blogs. Nature ran a news story about the phenomenon, listing 50 "top" science blogs, taken straight from the Technorati rankings at the time. My position on the list, at number 14, proved to be a great advantage as I tried to explain to skeptical colleagues what the heck I was doing. It's one thing for Technorati to say you're number 14 at something, but when Nature agrees -- well, that's scientific. It made a difference.

    But in many ways, that was the high water mark of the independent ranking and link-counting engine. The growth of blog "communities" gave new ways to game the system. Repeated blogrolls and aggregators guaranteed some blogs a link count vastly out of kilter with the conversation that they actually provoked. "Link trading" among completely unrelated blogs further distorted the picture. Over time, Technorati started to degrade. The worst of the problem came from spam blogs. As the spambots ruthlessly interlinked with each other, they climbed the rankings and pushed the real blogs down the list. By 2008, most of my inbound links turned out to be either spam comments (not me, I swear!) or automated blogrolls from sites that have nothing to do with science. With all this static, somebody badly needed to adjust the rabbit ears.

    New aggregation services, like Postgenomic, Wikio, and others, tried to create topical lists for science. I tried many of them over the years, but each fell short -- either leaving out big chunks of the conversation, or failing to fix problems with incoming feeds. I'm running stock Drupal, but for some reason many third-party sites can't index my feed and won't give me any service to figure out why. Does anybody care? Heck no -- the point of these sites is either to test algorithms or to place advertisements. Technorati was just as mercenary as any of the others, but at least they got it right -- even when I was running Bloxsom with RSS 0.85 cobbled in Perl. But this same accessibility may have made the site's Achilles heel: spambots were pinging ten times as much as the real blogs.

    Last year I'd almost given up on finding incoming links. Google gives me a daily digest, but their indexing misses a surprising number of people, even those with well-established sites who write every day. I would sometimes discover that somebody had linked to me weeks before. Often I'd write, or go and put up a comment -- but after days the opportunity for conversation may already have passed by.

    There were signs that other people had the same problem. Blogs no longer seemed the conversational tools they once had been -- people were taking true conversations onto Twitter. The social web started to draw people into backchannel conversations, inaccessible to ordinary readers. Fewer and fewer of the established science blogs seemed to explain things from the beginning. For those of us who may not want to "follow" or "friend" each other all the time, the web started to seem like a lonely place. Was there still room for a real correspondence, with revision and thought over time, instead of jotted into 140 characters?

    Seeing the last few days of incoming links, all in one place, makes me feel a lot better. It was just chance -- really, looking for some blog coverage of how Sketchbook Pro works on the new iPad -- and there they all were. My inbound links! No spamblogs, no endless list of blogroll links. Now I'm back to discovering new blogs, from people who are engaging with my field. Old friends may have gone over the last few years, or have cut back their online writing as they moved from graduate school into industry or the tenure track. But new ones are rising to take their place. The aristocratic element of the ranking has relaxed -- the new Technorati algorithm is ignoring blogrolls and is instead reading feeds for new links. That gives it an immediate quality, and a blog's rank will ebb and flow with the conversation it provokes.

    After all this time, I'm still in the top 14 for science.

  • Bloggers and embargoes

    Wed, 2010-03-03 13:30 -- John Hawks

    An interesting factoid from Bora Zivkovic, writing about PLoS media coverage:

    First, as you probably already know, PLoS makes no distinction between Old and New media. We have bloggers on our press list who apply/sign-up in the same way and abide by the same rules as traditional journalists (and, unlike mainstream media, bloggers NEVER break embargos, not once in the past three years since we started adding bloggers to our press list).

  • Your online presence

    Fri, 2009-09-25 08:30 -- John Hawks

    The Times Higher Education supplement:

    I'm a celebrity academic ... in the blogosphere

    British universities have been encouraged to embrace the concept of the "celebrity academic" and follow in the footsteps of their "shamelessly" self-promoting peers in North America.

    Chris Brauer, lecturer in online journalism at City University London, said academics should be encouraged to use the blogosphere to raise their profiles.

    If the goal is to "raise your profile" to celebrity academic, good luck with that. The first-mover advantage of blogging is long gone. Martin Weller thinks more rationally:

    This isn't the first time I've heard the celebrity argument, but I think it misunderstands the aim, or benefits of blogging. It assumes that becoming a celebrity is the only goal for an academic blogger. This seems to me to exhibit a lack of imagination and makes a straightforward analogy of print journalism to blogging. Sure, there are some good academic bloggers who perform the role of interpreting events for the general public, but there are many more who write about their subject in detail, where the intended audience is that of their peers or community. If have a very specialised area of expertise, medieval dance (say), then it's not about becoming a celebrity by blogging about this, but rather having influence and being recognised within your (probably quite small) community.

    Personally, I Google for people when I want to find out about their work. When they don't have a presence on the first page of results, that discourages me from following up further. You don't have to blog to be engaged and accessible, but you do need some kind of online presence.

    (via Clock)

    Tags: 
  • Book notes: Free, by Chris Anderson

    Tue, 2009-09-22 11:22 -- John Hawks

    I read Chris Anderson's book because it was, well, "Free". The book's thesis is simple: Sometimes people profit by giving things away.

    I have been, for several years now, making scientific knowledge available for no cost to any readers who care to come by my site. In academic circles, this practice is ordinarily considered to be insane. Therefore, whenever I come across anything explaining why blogging isn't such a stupid idea, I put it right into my files. That's for Luddites on future promotion committees.

    How do I review the book without making points like a Slashdot comment thread?

    Somewhere in the book, Anderson wrote his plan for making money from Free: Get businesses to pay for the Chris Anderson "Free" seminar. The short business profiles and catchy anecdotes in the book were pretty well crafted as advertisements for the seminar. But beneath the chrome, there are some interesting -- sometimes wacky -- ideas about the nature of human economic interactions.

    Anderson describes the $10,000 wager between economist Julian Simon and Paul Ehrlich. Simon believed that commodity prices would not rise over the long term, betting that the "substitution effect" would spur people to find new technological solutions to replace expensive raw materials, thus lowering the commodity prices. Ehrlich believed that resource shortages were inevitable as population and economic pressures grew. In 1990, Simon collected on the bet, as the five commodity metals chosen by the two had all fallen, many substantially.

    Others have cast this story as a parable about bad predictions, or the inherent fallacy of future pessimism. Anderson gives the story a sociobiological spin:

    Humans are wired to understand scarcity better than abundance. Just as we've evolved to overreact to threats and danger, one of our survival tactics is to focus on the risk that supplies are going to run out. Abundance, from an evolutionary perspective, resolves itself, while scarcity needs to be fought over. The result is that despite Simon's victory, the world seemed to assume that Ehrlich, on some level, was still right.

    As [Wired's Ed] Regis noted, "Simon complained that, for some reason he could never comprehend, people were inclined to believe the very worst about anything and everything; they were immune to contrary evidence just as if they'd been medically vaccinated against the force of fact." Ehrlich's gloomy predictions continued (and continue) to have influence. Meanwhile Simon's own observations seem to be of interest only to commodities traders (49-50).

    If we really want to explain the phenomenon of "Free", we need to turn to psychology and sociology. At several points in the book, Anderson does connect to these fields -- mentioning the "Dunbar number" in the context of MySpace "friends", Lewis Hyde's work The Gift in the context of non-monetary economies, and Abraham Maslow's "Theory of human motivations" in the context of why bloggers write for free. But Anderson's goal is not to explain, but to popularize. So his use of academic sources is, well, eclectic. The "Dunbar number" is mostly an anthropological urban myth. There's a very deep literature on the gift in ethnology. Sure, there's no market for Mauss seminars on Anderson's lecture circuit, but there are some entertaining classic stories about confusion, gifts, and cross-cultural contacts.

    Anderson's theme simplifies this complexity of social interactions into a binary:

    There is a reason why economics is defined as the science of "choice under scarcity": In abundance you don't have to make choices, which means that you don't have to think about it at all (50).

    There's an anthropological claim -- that humans are "wired" as a "survival tactic" to perceive scarcity. It makes a good story. But is it true?

    Human lives are long. Sure, there are some essential resources so abundant that we don't need to think about them -- air, for example. But most resources vary over time or space. People have always needed to consider whether to stay or move, whether to hunt today or wait until tomorrow, to gather more firewood or risk the cold. It's the ant and the grasshopper.

    We might imagine a version of the Ehrlich-Simon bet during the Pleistocene. Imagine humans occupying a few abundant habitats with plenty of food. As the population grows, they put pressure on these habitats. What happens? On the Ehrlich side, resource scarcity might trigger a demographic crisis, with hunger, warfare, and a population crash. On the Simon side, people might expand their behavioral niche, moving into less favorable habitat with more complex cultural adaptations.

    Humans are tricky creatures, and our potential for increasing complexity depends on the level of complexity we've already reached. Only some parts of a complex system may be amenable to measurement. Anderson points out this problem from the standpoint of business information technology:

    When your phone company tells you that your voice mail box is full, that's artificial scarcity -- it costs less than a nickel to store one hundred voice messages, and the average iPod could store thirty thousand of them (voice messages are recorded at lower quality than music, so they take less space). By forcing subscribers to take the time to delete voice mails, the phone companies are saving a little money in storage costs by spending a lot of consumer time. They managed the scarcity they could measure (storage) but neglected to manage the much larger scarcity of their customers' goodwill. No wonder phone companies are second only to cable TV companies in the "most hated" rankings (191).

    Reading this made me think of behavioral science in the role of the stupid phone company. Natural selection optimizes fitness, that much is algorithmic. But how does this optimization process affect any given behavioral trait? That depends how the trait is connected to fitness, how heritable it is, and whether anything else correlated with the trait exerts its own independent negative effect on fitness. It's a mess, and we generally can't figure it out. So, we measure what we can. The fallacy is that what we can measure may have little connection to the important output -- for the phone company, profit per customer; for the biologist fitness.

    Today with an embarrassment of abundance of food and goods, people still agonize over choices. From one point of view, this is just a waste of time -- why argue over arbitrary markers of status, when the essential resources are super-abundant?

    But from a social point of view, seeking out limited information may be in our nature. This leads me to question whether Anderson is right about the value of information itself. Does it really trend toward free?

    Humans evolved to be users (and broadcasters) of social information, but in the past our communication was limited by many of the same constraints that other animals face. Animal communication is not free -- it comes with direct and indirect costs. The direct costs are energetic and developmental -- animals have to build and maintain the organs of communication, and supply the power to run them. The indirect costs are the perils of advertising: an animal that reveals itself runs a greater risk of predation. Worse, the peril of honest advertising is that potential mates may see what a loser you really are.

    The Internet might have enabled more complex systems of information presentation, but for the most part, people use it for old-fashioned reading. At its best, it has enabled a social transformation, empowering millions of people to use very simple means of information transfer -- from short-form blogs to messaging in World of Warcraft.

    That's "messaging", not "massaging".

    If the cost of information appears to be trending downward, it may be that's because the production of information is increasing with a lot higher slope than the production of money that might purchase it. Consider genomics: What would you be willing to pay for your genome today? Whatever your answer, you can expect that you would be willing to pay even less 10 years from now, unless the health value of that information radically increases.

    [T]he more products are made of ideas, rather than stuff, the faster they can get cheap. This is the root of the abundance that leads to Free in the digital world, which we today shorthand as Moore's Law.

    However, this is not limited to digital products. Any industry where information becomes the main ingredient will tend to follow this compound learning curve and accelerate in performance while it drops in price. Take medicine, which is shifting from "we don't know why it woks, it just does" (there's a reason it's called drug "discovery") to a process that starts with the first principles of molecular biology ("now we know why it works"). The underlying science is information, while observed efficacy is just anecdote. Once you understand the basics, you can create an abundance of better drugs, faster.

    DNA sequencing is falling in price by 50 percent every 1.9 years, and soon our individual genetic makeup will be another information industry. More and more medical and diagnostic services will be provided by software (which get cheaper, to the point of being free) as opposed to doctors (who get more expensive) (84).

    Once upon a time, the only diagnostic service was a doctor. Doctors offloaded some of their diagnostic effort as lab tests became more and more important. Nowadays, one of the major reasons for the increase in health care costs is the routine ordering of expensive tests. Doctors order these tests because they reduce risk -- risk of bad outcomes, and risk of malpractice suits. Risk is money.

    The least satisfactory chapter for me was about science fiction and abundance. Anderson argues throughout the book that humans are "wired" for scarcity; that we just don't understand abundance. In chapter 15, he turns to fictional worlds -- from E. M. Forster to Cory Doctorow -- in which some machine (or other invention) had created endless abundance. Invariably, these works describe how society degenerates when freed from scarcity -- freed from "striving", people are robbed of purpose.

    Anderson misses a darker connection. The people who worried about the degeneration of human moral purpose in the face of abundance also worried about our genetic degeneration. The eugenics movement was born in the same post-industrial society as science fiction, and for the same reason. Later in the book Anderson references the similarity:

    [R]eplace "free" with "steam" and you can imagine the Victorian concern about flabby muscles and minds (229).

    I wonder whether there is something inherently dystopian about a society where genetic information is too cheap to meter. In a world where risk is money, and very small risks are increasingly quantifiable, it is not hard to imagine an inexorable process toward removing freedom and imposing control. Certainly that has been the theme of many science fiction works.

    But rather than end on that depressing note, I'll point instead to a happy consequence of free information exchange: the creativity expressed in online communities:

    RuneScape, yet another Web-based world of orcs and elves, counts more than 1 million subscribers (out of more than 6 million users) paying $5 a month, creating a $60 million annual business. As a point of reference, that's about the same size as the subscriber user base and annual revenues of the Wall Street Journal's subscription-based Web site, which is the biggest paid site of all the world's newspapers. It's also larger than the New York Times's paid online subscriber base was before the paper dropped the model in favor of Free in 2008. It appears that people would rather pay to cast pretend spells than to read Pulitzer Prize-winning news. (I'll leave whether that's a good thing or a bad thing to others.) (150).

    People are using their power to make new things -- sometimes frivolous, fictitious things, but things that make them happy. It's possible that genetic information can serve this purpose, too -- a point I'll return to on a different topic tomorrow.

  • Write with a knife

    Sun, 2009-09-06 12:48 -- John Hawks

    It's that time of year again, when students all over the country are facing their first writing assignment. I always encourage a bloggy style -- concise, journalistic, and thesis-driven.

    Well, I don't even manage that ideal myself a lot of the time, but here's some useful writing advice from Copyblogger. First, "Do long blog posts scare away readers?". Well, they don't scare away mine, but I can always use suggestions for how to punch things up (or, like overleavened dough, down).

    So I highly recommend the follow-up, "How to write with a knife". One piece of advice I like:

    2. Cut the first paragraph

    ...Try cutting the first paragraph or two from your post and see what happens. You may find a much more powerful opening.

    That technique would work wonders for more than half the student papers I grade. I always underline the thesis statement (or at least the best facsimile of one I can find) and an awful lot of the time, it's there at the beginning or end of the second paragraph.

    There are six more recommendations at the link, and I can see my students in every one of them. (Not to mention myself).

  • Nature on conference blogging

    Wed, 2009-07-08 13:45 -- John Hawks

    Nature's editorial, "How to stop blogging" (which might sound like a self-help piece), takes a position on the conference blogging issue:

    We are in the midst of a clash of conference-going cultures. Attendees who have taken to blogs and other social-media applications such as Twitter and Friend Feed will value the instantaneous communication of fact, conjecture and commentary as a way to network beyond badge-holders. Most researchers, in contrast, will focus on the science and ways to network with fellow attendees. If they are aware of social-networking applications, they are likely to regard them as distractions at best. At worst, they will fear them as tools to undermine and scoop, to release data not ready for consumption by anyone other than the trusted colleagues who bothered to make it to their talk or walk up to their poster and start asking questions.

    Conference organizers are stuck in the middle. They want to let the world know that their meetings are worthwhile, and yet they also want to attract speakers presenting the newest and most cutting-edge findings. So how to protect speakers from having sensitive, unfinished or 'scoopable' work broadcast to the world?

    There is a problem of attribution worth considering. Suppose some young blood watching a presentation has a great idea. Maybe it's an idea the presenter has already thought about, maybe not. Now, she blogs it. Now anyone can see how great (or obvious, or terrible) the idea is, and how it applies to the topic of the presentation. What are the presenter's obligations? As she prepares the publication, does she need to cite the blogger? Does she need to invite the blogger as a coauthor? Will reviewers know about the blogger's idea and demand that the manuscript be altered?

    If we're going to open science conferences, we have to think about the meaning of authorship. Comment systems on scientific papers may help address the issue, by giving more opportunities for sharing ideas. But in the worst cases, a topic may draw such broad interest -- and at the bottom be seemingly so simple -- as to create a tangled mess of irresolvable ideas. Imagine crowdsourcing the hobbits....

Pages

Subscribe to blogging

Neandertals

For years, I've worked on their bones. Now I'm working on their genes. Read more about the science studying these ancient people.

Denisova

From a finger bone of an ancient human came the record of a completely unexpected population. My lab is working on the science of the Denisova genome.

Acceleration

The advent of agriculture caused natural selection to speed up greatly in humans. We're uncovering some of the ways that populations have rapidly changed during the last 10,000 years.

Malapa

Just outside Johannesburg, the Malapa site is producing some of the most exciting finds in human evolution. This site is the headquarters of the Malapa Soft Tissue Project.