Saturday, April 17, 2010

and a thousand new dissertations were born...

The U.S. Library of Congress will be creating "a digital archive of Twitter as a historical record." Money quote:

In an extraordinary agreement with Twitter's founders, the Library of Congress – the world's largest library and America's oldest federal institution – is to create a digital archive of the several billion tweets publicly posted on the social networking site since its inception in 2006.

Sounds like one deeeeeeelicious linguistic corpus to me. Me want.
