<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-520807396714463309</id><updated>2012-01-27T17:03:19.383-05:00</updated><category term='it&apos;s always sunny in philadelphia'/><category term='Penélope Cruz'/><category term='celebrity names'/><category term='college costs'/><category term='descriptivism'/><category term='millionth word'/><category term='Cristina Kirchner'/><category term='oddball'/><category term='interesting'/><category term='lexicon'/><category term='crazy english'/><category term='derek bickerton'/><category term='community'/><category term='snowcones'/><category term='prizes'/><category term='narrow syntax'/><category term='weird queries'/><category term='constructions'/><category term='web 2.0 names'/><category term='quine'/><category term='ASL'/><category term='analogy'/><category term='origins of speech'/><category term='alien language'/><category term='middle east name pronunciation'/><category term='category theory'/><category term='emotional words'/><category term='AI'/><category term='natural language processing'/><category term='FOXP2'/><category term='semantics'/><category term='Let&apos;s Make a Deal'/><category term='stanford nlp'/><category term='sound symbolism'/><category term='cognition'/><category term='dolphin'/><category term='nbc community'/><category term='A World Lit Only by Fire'/><category term='pinker'/><category term='Joel McHale'/><category term='historical semantics'/><category term='Bob carpenter'/><category term='gays in the military'/><category term='antecedent'/><category term='hereos'/><category term='baseball'/><category term='google wave'/><category term='word of the year'/><category term='law and linguistics'/><category term='panini'/><category term='voice synthgesizers'/><category term='sotu linguistiucs'/><category term='smack'/><category term='conversation analysis'/><category term='language change'/><category term='online'/><category term='aspect'/><category term='funny sentences'/><category term='german'/><category term='why linguists should study math.'/><category term='txting'/><category term='pragmatics'/><category term='flickr'/><category term='google. search algorithm'/><category term='The Journal of Serendipitous and Unexpected Results'/><category term='braille'/><category term='Nicole Kidman'/><category term='constraint based'/><category term='sign language phonemes'/><category term='statistics'/><category term='coexist bumpersticker'/><category term='orland ca'/><category term='linguistics society of america'/><category term='conferences'/><category term='google'/><category term='singular they'/><category term='technology'/><category term='goofiness'/><category term='linguistic relativity'/><category term='glottal stop'/><category term='snowtoriousBIG'/><category term='nate silver'/><category term='buffalo'/><category term='ut austin'/><category term='invented language'/><category term='sign language'/><category term='lexical entries'/><category term='grammar'/><category term='corpus'/><category term='indefinite article'/><category term='web linguistics'/><category term='skynet'/><category term='naming experiment'/><category term='Cognitive linguistics'/><category term='high speed internet'/><category term='burma'/><category term='wh clause'/><category term='lexical decision task'/><category term='Obama'/><category term='grammar checker'/><category term='podcasts'/><category term='misogyny'/><category term='Astadhyayi'/><category term='talmy'/><category term='#linguistics'/><category term='naming'/><category term='ABC'/><category term='Saussure'/><category term='cko'/><category term='news headlines'/><category term='eugene o&apos;neil'/><category term='empirical'/><category term='talking brains'/><category term='sports analogy'/><category term='sarcasm'/><category term='ghoti'/><category term='crash blossoms'/><category term='particle-verbs'/><category term='emotional language'/><category term='city of languages'/><category term='language murder'/><category term='question order'/><category term='language and thought'/><category term='Language Log'/><category term='NYT'/><category term='funny search queries'/><category term='google microsoft'/><category term='defintite reference'/><category term='Grice'/><category term='farts'/><category term='blogosphere'/><category term='discourse analysis'/><category term='stewie'/><category term='twitter'/><category term='evolutionary biology'/><category term='yo xochitl'/><category term='language evolution. blogs'/><category term='online experiments'/><category term='ramsey test'/><category term='film'/><category term='color terms'/><category term='kitten mittons'/><category term='daily dish'/><category term='code-switching'/><category term='tuition costs'/><category term='swadesh'/><category term='gmail'/><category term='drug slang'/><category term='journals'/><category term='child language'/><category term='mankiew'/><category term='Indexed'/><category term='avatar'/><category term='John McWhorter'/><category term='pustejovsky'/><category term='christian bale'/><category term='language loss'/><category term='methodology'/><category term='great eskimo vocabulary hoax'/><category term='ontology'/><category term='neologism'/><category term='verbs'/><category term='lexicalist'/><category term='hypercorrection'/><category term='language silliness'/><category term='awlobf'/><category term='self aware'/><category term='cynical'/><category term='Douglas Hofstadter'/><category term='advice for linguists'/><category term='hypotheticals'/><category term='Swine Flu In The Hundred Acre Wood'/><category term='conversation linguistic analysis'/><category term='coordination'/><category term='george bernard shaw'/><category term='meaning of cynical'/><category term='multilingualism'/><category term='sleeping language'/><category term='drug language'/><category term='yiddish'/><category term='pop culture'/><category term='tschüß'/><category term='star trek'/><category term='goethe institute'/><category term='tacos'/><category term='swine flu'/><category term='accents'/><category term='anthropological linguistics'/><category term='economist'/><category term='christian bale rant'/><category term='stephen colbet'/><category term='speech synthesis'/><category term='internet connection'/><category term='neurolinguistics'/><category term='word of the decade'/><category term='Danny Pudi'/><category term='language'/><category term='fMRI'/><category term='typology'/><category term='online tests'/><category term='Buffalo bills'/><category term='grammatical illusions'/><category term='southwest agrivculture'/><category term='Marc Changizi'/><category term='commas'/><category term='conversation'/><category term='tweet'/><category term='implicature'/><category term='errors'/><category term='why linguists should study math'/><category term='disfluency'/><category term='regular expressions'/><category term='nine'/><category term='words evolve'/><category term='plural apostrophe'/><category term='evolution of language'/><category term='crowdsourcing'/><category term='peter griffen'/><category term='puns'/><category term='cognates'/><category term='Freakonomics'/><category term='fact checking'/><category term='animal language'/><category term='obama reid'/><category term='orthography'/><category term='family guy linguistics'/><category term='garden path'/><category term='linguistic myths'/><category term='linguistic hegemony'/><category term='crash blossom'/><category term='comics'/><category term='eskimo'/><category term='fuckin'/><category term='how we think'/><category term='London Summit'/><category term='phonetics'/><category term='new blogger'/><category term='Kate Hudson'/><category term='frame semantics'/><category term='kenneth the page'/><category term='frikkin'/><category term='ambiguity'/><category term='attentional blink'/><category term='opinion poll wording'/><category term='Frequently Asked Questions'/><category term='nfl'/><category term='linguistic annotation'/><category term='sex'/><category term='inguistics society of america'/><category term='academics'/><category term='peer review'/><category term='obama rhetoric'/><category term='spell check'/><category term='homesexuals in the military'/><category term='linguistics on the web'/><category term='stanford.'/><category term='google linguistics'/><category term='football'/><category term='wave'/><category term='Family guy'/><category term='garden path linguistics'/><category term='taboo vocabulary'/><category term='language and politics'/><category term='david foster wallace'/><category term='linguistics'/><category term='bad journalism'/><category term='spinvox'/><category term='crowd sourcing'/><category term='TV cancelation'/><category term='snub'/><category term='language quiz'/><category term='empirical linguistics'/><category term='gibberish'/><category term='chinese braille'/><category term='volcano'/><category term='melbourne cup'/><category term='LSA'/><category term='theater'/><category term='etymology'/><category term='adam&apos;s tongue'/><category term='speech therapy'/><category term='Erin McKean'/><category term='Edmund Blair Bolles'/><category term='monkey threat down'/><category term='online learning'/><category term='economics'/><category term='blogger'/><category term='fixed'/><category term='word meaning'/><category term='conor clark'/><category term='geoffrey pullum'/><category term='probabiltiy'/><category term='poetry'/><category term='blends'/><category term='myanmar'/><category term='IR'/><category term='language game'/><category term='universal translator'/><category term='restrictive query'/><category term='hiro speaks like yoda'/><category term='President Obama'/><category term='as your linguist'/><category term='brain maps'/><category term='how are Frequently Asked Questions counted'/><category term='meta book'/><category term='brand names'/><category term='crowds'/><category term='Gillian Jacobs'/><category term='urban planning'/><category term='mind machine project'/><category term='NLTK'/><category term='frak'/><category term='berlin and kay'/><category term='collaboration'/><category term='the iceman cometh'/><category term='conceptual processing'/><category term='linguistics math'/><category term='Louwerse'/><category term='sitemeter'/><category term='drug guidelines'/><category term='john oliver'/><category term='parsing'/><category term='spelling'/><category term='football rant'/><category term='adieu'/><category term='sigh...'/><category term='HATProject'/><category term='online psycholinguistics'/><category term='tuition'/><category term='sense vs metaphor'/><category term='gavagai'/><category term='dfw grammar'/><category term='language death'/><category term='chinese tone'/><category term='theory of meaning'/><category term='pronouncify'/><category term='modal verb'/><category term='30 rock'/><category term='Oldest Example of Written English Discovered'/><category term='iceland'/><category term='rhetoric'/><category term='googling'/><category term='causative. semantics'/><category term='the nightman cometh'/><category term='why linguists should study math. Steven Strogatz'/><category term='funny headlines'/><category term='POS tagging'/><category term='gestures linguistics'/><category term='speaking in tongues'/><category term='names of famous events'/><category term='utpoia'/><category term='womens langauge'/><category term='State of the nation'/><category term='dadt'/><category term='daily show'/><category term='Juan Uriagereka'/><category term='memory'/><category term='double quote'/><category term='Judi Dench'/><category term='embodied linguistics'/><category term='psycholinguistics'/><category term='Daume'/><category term='ipa'/><category term='research blogging'/><category term='machine translation'/><category term='google dictionary'/><category term='annotation'/><category term='yoda japanese'/><category term='philosophy of science'/><category term='how many linguists are there'/><category term='web corp'/><category term='blog scoops'/><category term='chomsky'/><category term='state of the union'/><category term='The Full Liberman'/><category term='texting'/><category term='avoidance characters'/><category term='searching the web'/><category term='preschool blogger'/><category term='education'/><category term='obama linguistics'/><category term='conditionals'/><category term='infinite recursion podcast'/><category term='arabic brand names'/><category term='butter'/><category term='demetri martin'/><category term='flatulence'/><category term='Snowmageddon'/><category term='Barack Obama Obama'/><category term='embodied cognition'/><category term='jeff runner'/><category term='language and relationships'/><category term='London G20'/><category term='bullshit'/><category term='tumblr'/><category term='hal daume'/><category term='home alone twitter'/><category term='perfect game'/><category term='innateness hypothesis'/><category term='kottke'/><category term='punctuation'/><category term='One Drawing for Every Page of Moby-Dick'/><category term='winnie the pooh'/><category term='testable hyposthese'/><category term='nuance'/><category term='anaphoric'/><category term='college bubble'/><category term='binged'/><category term='Unsolved Problems in Linguistics'/><category term='learning'/><category term='Stephen Harper'/><category term='artificial intelligence'/><category term='branding'/><category term='internet slang'/><category term='9/11'/><category term='NLP'/><category term='dfw'/><category term='language as metaphor'/><category term='english'/><category term='community ahmed'/><category term='anaphor'/><category term='pronouns'/><category term='morphology'/><category term='nouns'/><category term='brain fart'/><category term='eugene volokh'/><category term='word learning'/><category term='Bushism'/><category term='David Crystal'/><category term='humanities'/><category term='vintage ad'/><category term='inference'/><category term='Cognitive daily'/><category term='qatar'/><category term='one language'/><category term='linguistic diversity'/><category term='hyper-correction'/><category term='web demos'/><category term='vegetarian'/><category term='gender'/><category term='thomas more'/><category term='national grammar day'/><category term='gender gap'/><category term='Ta-Nehisi Coate'/><category term='TED'/><category term='christopher manning'/><category term='political rhgetoric'/><category term='meaning of arbitrary'/><category term='adjectives'/><category term='lsa annual meeting'/><category term='brodman&apos;s area'/><category term='spread or agriculture'/><category term='toporaphy of language'/><category term='Phonetics and phonology'/><category term='causative semantics'/><category term='forensic linguistics'/><category term='heros'/><category term='elephant listening project'/><category term='fucking'/><category term='speech errors'/><category term='discourse'/><category term='metaphor'/><category term='poker'/><category term='cognitive language'/><category term='Zwicky'/><category term='a pod of whales'/><category term='na&apos;vi'/><category term='infinte jest'/><category term='syntax'/><category term='colin phillips'/><category term='iconicity effect'/><category term='lingpipe'/><category term='soever'/><category term='the three reallys construction'/><category term='teaching phonetics'/><category term='netflix'/><category term='online collaboration tools'/><category term='reflexives'/><category term='james cameron'/><category term='arbitrary'/><category term='sports'/><category term='metanomy'/><category term='kitchen experiments'/><category term='adult language'/><category term='ben zimmer'/><category term='grammaticality judgements'/><category term='piss poor'/><category term='grammar test'/><category term='corpus linguistics'/><category term='Yoani Sánchez'/><category term='synonyms snowmageddon snowpocalypse'/><category term='English manuscript'/><category term='danish phonology'/><category term='humor'/><category term='word formation'/><category term='horse'/><category term='computational linguistics'/><category term='apostrophe'/><category term='eskimo words for snow'/><category term='TV'/><category term='famous dates in history'/><category term='American Heritage dictionary'/><category term='language learning'/><category term='logic'/><category term='phonemes'/><category term='elephant language'/><category term='really really? really.'/><category term='McGurk effect'/><category term='semantic web'/><category term='dialects'/><category term='writing systems'/><category term='andrew sullivan'/><category term='cognitive science'/><category term='usage'/><category term='Macro-economics'/><category term='frequency'/><category term='collective nouns'/><category term='kumquat'/><category term='colbert nation'/><category term='nine movie review'/><category term='medieval europe'/><category term='kanji'/><category term='prosodic phrase'/><category term='color naming experiment'/><category term='language talks'/><category term='bad math'/><category term='Monty hall'/><category term='truthiness'/><category term='neuroscience'/><category term='fun'/><category term='language revitalization'/><category term='Barack Obama'/><category term='testing'/><category term='sociolinguistics'/><category term='powerset'/><category term='wheatsoever'/><category term='dissertation'/><category term='prescriptivism'/><category term='snowclone'/><category term='advice for grad students'/><category term='lexical ambiguity'/><category term='glossolalia'/><category term='Chitimacha'/><category term='anawiki'/><category term='search query syntax'/><category term='snowpocalypse'/><category term='538'/><category term='language philosophy'/><category term='black english'/><category term='renaissance'/><category term='tranlation'/><category term='evolution'/><category term='offsets'/><category term='Marion Cotillard'/><category term='William Manchester'/><category term='oldest man dies'/><category term='headlines'/><category term='bing'/><category term='excelletn blogs'/><category term='diphone frequency'/><category term='efl'/><category term='bad science reporting'/><category term='bilinguialism'/><category term='taco bell'/><category term='linguistic research'/><category term='internet'/><category term='Kanye west'/><category term='deixis'/><category term='the bills'/><category term='Patrick J. Gannon'/><category term='describing tacos'/><category term='scientific breakthroughs'/><category term='games with words'/><category term='twitter project'/><category term='science'/><category term='science blogging'/><category term='lakoff'/><category term='Sophia Loren'/><category term='hashtags'/><category term='synesthesia'/><category term='The World&apos;s Lousy Farts'/><category term='bayes theorum'/><category term='sotu'/><category term='daniel hamermesh'/><category term='politics'/><category term='hotel internet'/><category term='adams tongue'/><category term='high speed'/><category term='fart jokes'/><category term='prepositions'/><category term='communication'/><category term='boroditsky'/><category term='complex systems'/><category term='linguistics departments'/><category term='Jeuniaux'/><category term='#awlobf'/><category term='bumperstickers'/><category term='christina applegate'/><category term='sapir whorf'/><category term='dictionaries'/><category term='historical linguistics'/><category term='language evolution'/><category term='next blog'/><category term='love language'/><category term='dictionary'/><category term='cupertino effect'/><category term='samantha who'/><category term='microsoft'/><category term='PGSLTSS'/><category term='Daniel Day-Lewis'/><category term='Neuroplasticity'/><category term='barrio linguistics'/><category term='framenet'/><category term='typos'/><category term='corpora'/><title type='text'>The Lousy Linguist</title><subtitle type='html'>Notes on linguistics and cognition</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><link rel='next' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default?start-index=101&amp;max-results=100'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>552</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-6890434518068734898</id><published>2012-01-24T22:53:00.003-05:00</published><updated>2012-01-24T22:53:39.479-05:00</updated><title type='text'>my annual SOTU &amp; word clouds rant</title><content type='html'>&lt;span style="background-color: white; color: #333333; font-family: HelveticaNeue, 'Helvetica Neue', Helvetica, Arial, sans-serif; font-size: 14px; line-height: 18px;"&gt;&amp;nbsp;Good to revisit: word clouds are the Fox News of linguistics (see &lt;a href="http://www.economist.com/blogs/johnson/2011/01/word_counts"&gt;here &lt;/a&gt;and &lt;a href="http://thelousylinguist.blogspot.com/2011/01/obamas-state-of-union-and-word.html"&gt;here&lt;/a&gt;)&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-6890434518068734898?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/6890434518068734898/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=6890434518068734898' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/6890434518068734898'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/6890434518068734898'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2012/01/my-annual-sotu-word-clouds-rant.html' title='my annual SOTU &amp; word clouds rant'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-4723063513504544064</id><published>2012-01-24T08:30:00.000-05:00</published><updated>2012-01-24T22:07:51.107-05:00</updated><title type='text'>CompuPolitics or Wonkonomics?</title><content type='html'>Philip Resnik, computational linguist extraordinaire, has requested name candidates for a “new domain of activity” he described recently in a &lt;a href="http://languagelog.ldc.upenn.edu/nll/?p=3712"&gt;Language Log post&lt;/a&gt;.&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/-TOHzgSTmyuw/Tx4_ND1B9YI/AAAAAAAAAZw/s62uuYu-dT0/s1600/philly-dog+resnik+tweet.JPG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="51" src="http://2.bp.blogspot.com/-TOHzgSTmyuw/Tx4_ND1B9YI/AAAAAAAAAZw/s62uuYu-dT0/s320/philly-dog+resnik+tweet.JPG" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;But naming is not easy. It’s fun to mock bad names like Netflix’s dud &lt;i&gt;Qwikster&lt;/i&gt;, but try coming up with a better one. Go on. I’ll wait …&lt;br /&gt;&lt;br /&gt;Yeah, see. Not easy. &lt;br /&gt;&lt;br /&gt;&lt;b&gt;Money quote&lt;/b&gt;:&lt;br /&gt;“…&lt;i&gt;&lt;span style="color: #6aa84f;"&gt;people are starting to suggest that volume and sentiment analysis on tweets … might produce useful information about people's viewpoints, or even predict the success of political campaigns. Indeed, it's been suggested that numbers derived from Twitter traffic might be better than polls, or at least better than pundits&lt;/span&gt;&lt;/i&gt;.”&lt;br /&gt;&lt;br /&gt;So, what to call this pursuit? Resnik suggests &lt;i&gt;CompuPolitics&lt;/i&gt;. But, I mean, really? It’s so 90s. Evoking &lt;i&gt;CompuServe&lt;/i&gt;. It’s dead... stale... dry. The stench of moth-infested dust permeates my nostrils just reading it. &lt;br /&gt;&lt;br /&gt;Resnik gave it a shot. He misfired, but I respect the effort. So what are we looking for in a name? An analogy could be drawn with &lt;a href="http://www.culturomics.org/"&gt;Culturomics&lt;/a&gt;, the Google Ngrams inspired NLP endeavor. But let's be honest, that's a gawd-awful name reeking of middle-aged literary professors amazed that &lt;a href="http://en.wikipedia.org/wiki/ELIZA"&gt;Eliza talks back&lt;/a&gt;. Rather, a better analogy might be &lt;a href="http://www.freakonomics.com/"&gt;Freakonomics&lt;/a&gt;, one of the best academic coinages of the last few decades (maybe ever). It has captured the spirit of its practitioners in a way that is immediately obvious to the lay person*. &lt;br /&gt;&lt;br /&gt;But just what is Resnik trying to name? His Language Log post seems to be referencing a new field of academic pursuit, like &lt;i&gt;behavioral economics&lt;/i&gt; or &lt;i&gt;bioinformatics&lt;/i&gt;. But it’s also deeply embedded in industry practices which evoke terms like &lt;i&gt;big data&lt;/i&gt; and &lt;i&gt;text analytics&lt;/i&gt; Again, this suggests that Freakonomics is indeed an apt analogy as it refers not just to the style of analysis, but also a for-profit business model involving books, a NYT blog, and now a movie. &lt;br /&gt;&lt;br /&gt;Let’s assume Resnik’s goal is to find a name for a business-savvy methodology.  Let us recognize that many academic field names that we currently take as meaningful evocations of deep thinking actually have ridiculous names (the prevalence of Latin and Greek terms and blends alone  should make us all giggle).&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;i&gt;Philosophy &lt;/i&gt;– blend of Greek &lt;i&gt;philo&lt;/i&gt;- "loving" and &lt;i&gt;sophia &lt;/i&gt;"knowledge” (‘love-knowledge’?? I mean, how stupid is that…)&lt;/li&gt;&lt;li&gt;&lt;i&gt;Biology &lt;/i&gt;– blend of Greek &lt;i&gt;bios &lt;/i&gt;"life" and –&lt;i&gt;logia &lt;/i&gt;“to speak” (what? ‘life speaks’? huh? ‘the talking life’? dumb dumb dumb)&lt;/li&gt;&lt;li&gt;&lt;i&gt;English&lt;/i&gt; – I mean, frik, really? Is this a real academic department? Next, you’ll tell me there’s a real &lt;i&gt;Department of French and Romance Philology&lt;/i&gt;.&lt;/li&gt;&lt;/ul&gt;To coin a new term, we want some understanding of the themes we want the new term to evoke. Here’s a stab at what I think Resnik is looking for:&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Technical Themes&lt;/b&gt;:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Computational (&lt;i&gt;oooh, “science”&lt;/i&gt;)&lt;/li&gt;&lt;li&gt;Political  (&lt;i&gt;ugh, windbags&lt;/i&gt;)&lt;/li&gt;&lt;li&gt;New (&lt;i&gt;oooh, shiny&lt;/i&gt;)&lt;/li&gt;&lt;li&gt;Search/discovery (&lt;i&gt;ya mean like the googles&lt;/i&gt;?)&lt;/li&gt;&lt;/ol&gt;&lt;b&gt;Emotional Themes&lt;/b&gt;: &lt;br /&gt;&lt;ol&gt;&lt;li&gt;Pulse of opinion (&lt;i&gt;I care about what other people care about&lt;/i&gt;)&lt;/li&gt;&lt;li&gt;Gravitas (&lt;i&gt;really smart people care about this&lt;/i&gt;)&lt;/li&gt;&lt;li&gt;Better than polls or pundits (&lt;i&gt;I hate them anyway, now I know why&lt;/i&gt;)&lt;/li&gt;&lt;li&gt;Value (&lt;i&gt;who doesn’t love a bargain&lt;/i&gt;)&lt;/li&gt;&lt;li&gt;Honesty (&lt;i&gt;fake tweets and Twitter-bots are not my friends&lt;/i&gt;)&lt;/li&gt;&lt;/ol&gt;&lt;b&gt;Big Picture Themes&lt;/b&gt;:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Rising above the noise (&lt;i&gt;Needle? Check. Haystack? Check&lt;/i&gt;.)&lt;/li&gt;&lt;li&gt;Making a difference (&lt;i&gt;like Morgan Freeman in &lt;/i&gt;&lt;a href="http://www.imdb.com/title/tt0097722/"&gt;Lean on Me&lt;/a&gt;)&lt;/li&gt;&lt;li&gt;Finding the truth (“the truth is out there”)&lt;/li&gt;&lt;li&gt;This is the future, and it’s good (&lt;i&gt;like Justin Beiber&lt;/i&gt;?)&lt;/li&gt;&lt;/ol&gt;Now, to find a single word or phrase that captures all of that at once… hmmmm. Here's a half hearted attempt:&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Name Candidates&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Literal&lt;/b&gt; (typically the most clunky, clumsy, and goofy, worth avoiding)&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;CompuPolitics &lt;/b&gt;– blend of &lt;i&gt;computer &lt;/i&gt;and &lt;i&gt;politics&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;SentiMent&lt;/b&gt; – riff on one word &lt;i&gt;sentiment&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Sentics &lt;/b&gt;– blend of &lt;i&gt;sentiment &lt;/i&gt;and &lt;i&gt;politics&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Poliments&lt;/b&gt; – blend of &lt;i&gt;politics &lt;/i&gt;and &lt;i&gt;sentiment&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;PoliTude &lt;/b&gt;– blend of &lt;i&gt;politics&lt;/i&gt; and &lt;i&gt;attitude&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Crowditude &lt;/b&gt;– blend of &lt;i&gt;crowd &lt;/i&gt;and &lt;i&gt;attitude&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;PoliInformatics &lt;/b&gt;– blend of &lt;i&gt;politics&lt;/i&gt; and &lt;i&gt;informatics &lt;/i&gt;ala &lt;i&gt;bioinformatics&lt;/i&gt;&lt;/li&gt;&lt;/ul&gt;&lt;b&gt;&lt;b&gt;Associative&lt;/b&gt;&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Flutter&lt;/b&gt; – ala “Twitter”, this evokes temporary changes in sentiment captured by the technology.&lt;/li&gt;&lt;li&gt;&lt;b&gt;CanaryStats &lt;/b&gt;– evokes canary in coal mine analogy&lt;/li&gt;&lt;li&gt;&lt;b&gt;Wonkonomics&lt;/b&gt; - Fun, professional, evokes both George Stephanopoulos and Willy Wonka in the same breath&lt;/li&gt;&lt;/ul&gt;&lt;b&gt;Homage&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Freakoment &lt;/b&gt;- blend of &lt;i&gt;Freakonomics&lt;/i&gt; and&lt;i&gt; sentiment&lt;/i&gt; (I mean, fuck it, those guys are making bank, right?)&lt;/li&gt;&lt;li&gt;&lt;b&gt;Groupthink &lt;/b&gt;– a little creepy, but ya know, Orwell knew language…&lt;/li&gt;&lt;/ul&gt;Clearly, I favor the awesomely evocative &lt;i&gt;Wonkonomics&lt;/i&gt;. But you be the judge...&lt;br /&gt;&lt;br /&gt;* Let us put aside the inevitable debate over whether or not Freakonomics is good science. It’s a great name.&lt;br /&gt;&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-4723063513504544064?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/4723063513504544064/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=4723063513504544064' title='5 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/4723063513504544064'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/4723063513504544064'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2012/01/compupolitics-or-wonkonomics.html' title='CompuPolitics or Wonkonomics?'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/-TOHzgSTmyuw/Tx4_ND1B9YI/AAAAAAAAAZw/s62uuYu-dT0/s72-c/philly-dog+resnik+tweet.JPG' height='72' width='72'/><thr:total>5</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-6790727865627145961</id><published>2011-11-25T10:00:00.001-05:00</published><updated>2011-11-25T10:00:55.294-05:00</updated><title type='text'>the incoherent accents of Hugo</title><content type='html'>&lt;br /&gt;Saw &lt;a href="http://www.imdb.com/title/tt0970179/"&gt;Hugo&lt;/a&gt;. It's more for big kids than little ones, imho. I enjoyed it and found its retro-whimsy entertaining, but the 3D is frustrating for someone who wears prescription glasses. It has a talented cast of veteran British actors who don't get nearly enough screen time (Kingsley may get a best supporting actor nod come Oscar time). Also, I'm an unabashed &lt;a href="http://www.imdb.com/name/nm1631269/"&gt;Chloë Moretz&lt;/a&gt; fan, and she is every bit as good in this film as I expected. She is one of the best teen actors in history and she keeps getting better. &amp;nbsp;She has the facial expressiveness of Brando, and I don't say that lightly. Unfortunately, the film rests on the performance of its lead, Asa Butterfield, who is, sorry to say, flat and unconvincing as the orphan genius tinkerer Hugo. Partly this is because his dialogue is awful and clunky. No one talks that way, especially not a scared orphan. He also lack the facial expressiveness of Moretz but is in constant juxtaposition with her, so he pales even the greater in comparison.&lt;br /&gt;&lt;br /&gt;For a little linguistic aside (since this is The Lousy Linguist, not The Lousy Film Critic), the film is set in Paris, but everyone has an English accent, including the Atlanta born American actress Moretz. Rumor has it she auditioned for Scorsese with the accent and he didn't know she was American (though I find this hard to believe since she has been a well known actress for several years now).&lt;br /&gt;&lt;br /&gt;Moretz did an acceptable job affecting the British accent, as far as I could tell, though I'm not that good at spotting phonies unless they're really bad. I did detect the occasional break, though. More to the point, &lt;i&gt;why is everyone speaking with a British accent in Paris&lt;/i&gt;!!!&lt;br /&gt;&lt;br /&gt;Accents in movies are a storyteller's way to &lt;i&gt;set the mood&lt;/i&gt;, so to speak. I find it to be one of the most incoherent, yet successful, tricks in the movie biz. When Hollywood makes a movie about WW2, the Nazi's all have German accents.&lt;br /&gt;&lt;br /&gt;But here's the linguistically incoherent part. Back then, during WW2, when Nazis spoke to each other ... they didn't have accents! Not to each others' ears. When French people speak to each other in French, they don't sound foreign to each other. They sound like native speakers. Yet, Hollywood (and other film markets too, I'm sure) has decided that "sounding foreign" sets the mood for a film set in a foreign land. &amp;nbsp;Even more incoherent is when film makers think they are being more "authentic" by having actors speak in foreign accents, when, linguistically speaking, this is about as INauthentic as you can get. When native speakers of any language speak to each other, they don't sound foreign. Yet, when audiences watch films set in foreign lands, the key to making the audience feel the sense of authenticity, is to make the actors sound foreign. Cognitive dissonance anyone?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-6790727865627145961?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/6790727865627145961/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=6790727865627145961' title='9 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/6790727865627145961'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/6790727865627145961'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/11/incoherent-accents-of-hugo.html' title='the incoherent accents of Hugo'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>9</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-397964852080713569</id><published>2011-11-11T09:53:00.001-05:00</published><updated>2011-11-11T10:13:36.557-05:00</updated><title type='text'>Perry's tip-of-the-tongue flubb</title><content type='html'>Much virtual ink is being spilled/spilt about US Presidential candidate Rick Perry's tip-of-the-tongue gaff at Wednesday night's GOP debate. His inability to remember a third government department he would cut is being decried as the ultimate end of his candidacy. This may be the case, I honestly don't know. But I think the linguists of the world should point out that tip-of-the-tongue speech errors are universal and say nothing about a person's intelligence or even their preparedness. They are entirely a function of neuro-biological processes which we all encounter. Luckily one (and as far as I can tell right now, only one) journalist bothered to follow up on &lt;a href="http://www.thedailybeast.com/articles/2011/11/10/the-science-behind-rick-perry-s-debate-brain-freeze.html"&gt;The Science Behind Rick Perry’s Debate Brain Freeze&lt;/a&gt;. Money quote:&lt;br /&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;When the brain juggles a reasonable quantity of information and tries to make sense of it—as Perry was presumably trying to do as he channeled what he knew, and began to answer CNBC moderator Maria Bartiromo’s question—activity in the dorsolateral prefrontal cortex, just behind the forehead, increases: this is the circuitry that handles decision making and emotional control.&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;But as you attempt to tap into more and more information, as Perry was presumably trying to do (imagine him desperately going down the list of cabinet departments and other federal agencies trying to come up with the third one on his hit list), activity in the dorsolateral PFC drops like a stone. It’s as if a circuit breaker pops as a result of “cognitive and information overload,” Angelika Dimoka of Temple University told me for a recent story&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;And that's all I have to say about that.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-397964852080713569?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/397964852080713569/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=397964852080713569' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/397964852080713569'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/397964852080713569'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/11/perrys-tip-of-tongue-flubb.html' title='Perry&apos;s tip-of-the-tongue flubb'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-9093472556600888719</id><published>2011-11-03T21:23:00.000-04:00</published><updated>2011-11-03T21:31:08.829-04:00</updated><title type='text'>Calling all Brits stuck in the colonies...</title><content type='html'>The Economist's &lt;a href="http://www.economist.com/blogs/johnson/2011/11/british-and-american-english"&gt;Johnson blog&lt;/a&gt; wants Brits living in the US to take &lt;a href="https://docs.google.com/a/economistgroup.com/spreadsheet/viewform?formkey=dDYtU3hWTEpMUUM4bU02X0N3eVRkcmc6MQ"&gt;a dialect survey&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;i&gt;One of the set-piece conversations that Britons living in America have with each other, besides how cold it is, how hot it is, or how interesting it is that people here don't talk about the weather all the time, is about which British words or pronunciations they have shed in favour of their American equivalents...If you're a Brit living in the United States, please take a minute (no longer) to fill out &lt;a href="https://docs.google.com/spreadsheet/viewform?formkey=dDYtU3hWTEpMUUM4bU02X0N3eVRkcmc6MQ"&gt;this web form&lt;/a&gt;. Just put in how long you've lived in the United States, and mark which Americanisms you use. I'll post the results in a few days.&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;The survey, in case you missed it, is &lt;a href="https://docs.google.com/a/economistgroup.com/spreadsheet/viewform?formkey=dDYtU3hWTEpMUUM4bU02X0N3eVRkcmc6MQ"&gt;here&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-9093472556600888719?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/9093472556600888719/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=9093472556600888719' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/9093472556600888719'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/9093472556600888719'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/11/calling-all-brits-stuck-in-colonies.html' title='Calling all Brits stuck in the colonies...'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-1137525884997024753</id><published>2011-10-31T16:18:00.005-04:00</published><updated>2011-10-31T16:19:01.806-04:00</updated><title type='text'>So, you want a real job?</title><content type='html'>My guest post about the job market for linguistics outside academia at&amp;nbsp;Zoltan Varju's blog is available &lt;a href="http://szamitogepesnyelveszet.blogspot.com/2011/10/so-you-want-real-job.html"&gt;here&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-1137525884997024753?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/1137525884997024753/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=1137525884997024753' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/1137525884997024753'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/1137525884997024753'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/10/so-you-want-real-job.html' title='So, you want a real job?'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-5357414649377135500</id><published>2011-10-28T22:45:00.000-04:00</published><updated>2011-10-28T22:45:23.731-04:00</updated><title type='text'>no one is scared of Johnny Depp (a film review)</title><content type='html'>I just saw &lt;a href="http://www.imdb.com/title/tt0376136/"&gt;The Rum Diary&lt;/a&gt; starring Johnny Depp. I enjoyed this film a lot. It is well paced, fun, beautiful, caustic, and funny ...yet... Depp just never seemed right as Thompson (thinly veiled as Paul Kemp, Puerto Rico journalist in the early 1960s).&lt;br /&gt;&lt;br /&gt;Before critiquing Depp, let me run through a few non-Depp related points:&lt;br /&gt;&lt;br /&gt;I can recommend this film without hesitation as unquestionably worth your time and money. &lt;b&gt;&lt;span class="Apple-style-span" style="color: red;"&gt;Go see this film&lt;/span&gt;&lt;/b&gt;. This film is better than 95% of the films that have come out in the last 20 years. I'm not joking, &lt;b&gt;&lt;span class="Apple-style-span" style="color: red;"&gt;Go see this film&lt;/span&gt;&lt;/b&gt;.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Brilliant dialogue&lt;/i&gt;&lt;/b&gt;: Kudos to writer/direct Bruce Robinson for allowing Thompson's witty yet brutally incisive social critiques to highlight the dialogue. This is a beautifully rendered script. It was an honor to listen to characters say interesting things again after decades of trite piffle littering our movie screens. This is the best screenplay since 2009's &lt;a href="http://www.imdb.com/title/tt1174732/"&gt;An Education&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;b&gt;Giovanni Ribisi&lt;/b&gt;&lt;/i&gt; is fantastic as the weird, drunk, Hitler-loving Moburg. His performance alone is worth the price of admission.&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;b&gt;Amber Heard&lt;/b&gt;&lt;/i&gt; is visually stunning, but she is rarely given more than 6 words to say at a time (same is true for her performance in &lt;a href="http://en.wikipedia.org/wiki/Pineapple_Express_(film)"&gt;Pineapple Express&lt;/a&gt;). With so few lines, it's not clear if she is a bad actor or if no one will give her the chance to act. I couldn't help but be reminded of Scarlett Johansson, who is truly beautiful, but is an utterly cringe-worthy actor (only Sofia Coppola has managed to wring a competent performance out of her).&lt;br /&gt;&lt;br /&gt;That said...&lt;br /&gt;&lt;br /&gt;I am a Johnny Depp fan. I like most of his films. I have known Depp to be a Thompson devotee for some time, but only this week learned he actually lived with Thompson for a couple years in the early 1990s (if I read &lt;a href="http://www.thedailybeast.com/newsweek/2011/10/23/johnny-depp-on-hunter-s-thompson.html"&gt;this &lt;/a&gt;correctly).&lt;br /&gt;&lt;br /&gt;But! Depp has something of the vaudevillian in him. A knowing performer, deft at character, but always with a nod to the audience about the façade, the game, a wink to their suspension of disbelief. &lt;i&gt;Enjoy the show folks, glad you could spare some time for my little charade&lt;/i&gt;. It is precisely this man-behind-the-curtain nod that troubled me throughout the film. I had a similar response to Depp's turn as Thompson in the &lt;a href="http://en.wikipedia.org/wiki/Fear_and_Loathing_in_Las_Vegas_(film)"&gt;1998 film&lt;/a&gt;, but I couldn't quite put it to words.&lt;br /&gt;&lt;br /&gt;I grew up in a family of Hunter Thompson fans. My three older brothers lived Thompson-esque lives (complete with hilarious, madcap drug addled stories, but also to horrible ends in one case). I read &lt;a href="http://en.wikipedia.org/wiki/Fear_and_Loathing_in_Las_Vegas"&gt;Fear and Loathing in Las Vegas&lt;/a&gt; when I was 12 and loved it. I read everything Thompson I could find. My dad once found about 30 old Rolling Stone magazines at a yard sell for like $2 and bought the lot and they were filled with Thompson articles. I read them all. I read &lt;a href="http://en.wikipedia.org/wiki/Hell%27s_Angels:_The_Strange_and_Terrible_Saga_of_the_Outlaw_Motorcycle_Gangs"&gt;Hell Angels&lt;/a&gt;, The &lt;a href="http://en.wikipedia.org/wiki/The_Great_Shark_Hunt"&gt;Great Shark Hunt&lt;/a&gt;. Hell, I even read &lt;a href="http://en.wikipedia.org/wiki/The_Curse_of_Lono"&gt;The Curse of Lono&lt;/a&gt;. Like all Thompson fans, I knew of the existence of &lt;a href="http://en.wikipedia.org/wiki/The_Rum_Diary_(novel)"&gt;The Rum Diary&lt;/a&gt;, his first novel written in the 1960s but not published until the late 1990s, and I yearned for the chance to read the unpublished manuscript. When it finally did get published, I jumped at it. Some of you line up for Harry Potter movies, I lined up for &lt;i&gt;The Rum Diaries&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;So, I respect Johnny Depp's talent and sincerity and I defer to the fact that he actually knew the man quite well (and I didn't); nonetheless, Hunter S. Thompson played a remarkably important role in my formative years and I feel that entitles me to a certain level of critique (I could be wrong about that...entitlements are typically bullshit ... but this is my bullshit).&lt;br /&gt;&lt;br /&gt;And my critique is this: Johnny Depp lacks authenticity. I cannot buy him as Thompson. At least, I cannot buy him as the embodiment of my conception of Thompson. Depp is fun, quirky, smarmy, but always superficial. Thompson is dangerously loose, and frighteningly deep. Depp is never dangerous because his knowing smirk lets us know it's just a game. Thompson never lets us off the hook this way. You may question the truth of Thompson's claims secretly, in your heart, but Thompson never lets you question him to his face, you wouldn't dare. You'd be too scared. But no one is scared of Johnny Depp. He's a kid playing dress up.&lt;br /&gt;&lt;br /&gt;This is not a losing battle. Bill Murray's 1980 performance as Thompson in &lt;a href="http://en.wikipedia.org/wiki/Where_the_Buffalo_Roam"&gt;Where the Buffalo Roam&lt;/a&gt; is dead on. Murray nails Thompson's duality of &lt;i&gt;hilarious and dangerous&lt;/i&gt;, &lt;i&gt;fun and scary&lt;/i&gt;, &lt;i&gt;fake and real&lt;/i&gt;. Johnny Depp nails the &lt;i&gt;hilarious, fun, fake&lt;/i&gt; parts, but fails at the &lt;i&gt;dangerous, scary, real&lt;/i&gt; parts. No duality. Perhaps my early, formative exposure to that 1980 film has poisoned me to all others. Maybe.&lt;br /&gt;&lt;br /&gt;Regardless, I end with my opening plea:  &lt;b&gt;&lt;span class="Apple-style-span" style="color: red;"&gt;Go see this film&lt;/span&gt;&lt;/b&gt;. You will be happy you did.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-5357414649377135500?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/5357414649377135500/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=5357414649377135500' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5357414649377135500'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5357414649377135500'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/10/no-one-is-scared-of-johnny-depp-film.html' title='no one is scared of Johnny Depp (a film review)'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-2542803350560357124</id><published>2011-08-13T12:22:00.003-04:00</published><updated>2011-09-09T20:47:36.055-04:00</updated><title type='text'>Stanford professors and grammatical illusions</title><content type='html'>I started reading &lt;a href="http://fukuyama.stanford.edu/origins_of_political_order"&gt;The Origins of Political Order&lt;/a&gt; by Stanford professor Francis Fukuyama when I stumbled across this sentence:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #6aa84f;"&gt;But it is clear that the political job of finding the right regulatory mechanisms to tame capitalism's volatility have not yet been found&lt;/span&gt;&lt;/i&gt;.&lt;/li&gt;&lt;/ul&gt;I read it a second time. Then a third. It just didn't seem right. The verb "have" is wrong, right? It should be "has", I think. But it was difficult to see why until I broke the sentence down to its essential elements. The linguistic dependencies are obscured by multiple embedded PPs, in this case, making it difficult to get the agreement right.&lt;br /&gt;&lt;br /&gt;Distilling a complex sentence down to its essential elements is a technique I taught to students for years because it helps us see what the sentence "means" in a basic sense, and all we need is our intuition. No linguistic theory or technical terms are required. One way to do begin is to simply replace noun phrases with variables like X and Y:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;i&gt;But it is clear that the X of finding the right Y to tame Z have not yet been found&lt;/i&gt;.&lt;/li&gt;&lt;/ul&gt;Already we see that "have" sounds more clearly wrong. Next we can delete the introductory phrase "But..."&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;i&gt;The X of finding the right Y to tame Z have not yet been found&lt;/i&gt;.&lt;/li&gt;&lt;/ul&gt;Now we have distilled the sentence to its basic elements. To figure out why "have" sounds wrong, we can delete all prepositional phrases, as they are not essential to the basic meaning of the sentence:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;i&gt;The X have not yet been found&lt;/i&gt;.&lt;/li&gt;&lt;/ul&gt;Finally, we need only put the original X phrase back in:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;i&gt;The political job have not yet been found&lt;/i&gt;.&lt;/li&gt;&lt;/ul&gt;Using just our intuition, two things should be obvious:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;The subject "political job" is singular and requires the verb to be "has" (syntax).&lt;/li&gt;&lt;li&gt;It is odd to speak about finding a political job (when it clearly does not mean job in the sense of getting paid to do something). Rather, this is referring to political will (semantics).&lt;/li&gt;&lt;/ol&gt;A little analysis of the original and we can guess that Fukuyama was probably trying to construct a linguistic dependency between "have" and "regulatory mechanisms." But this dependency is ungrammatical in the strict sense that "regulatory mechanisms" is not the subject of the sentence. Yet the sentence is not obviously ungrammatical (like "Bob feet have."). I taught 18 year old public school freshmen about this, but we can see that this kind of thing gets the best of even highly educated, accomplished Stanford professors. It is a function of how our brains construct and parse sentences.&lt;br /&gt;&lt;br /&gt;This sentence presents what Colin Phillips has called a grammatical illusion. He explains them thusly:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #6aa84f;"&gt;Research on the online implementation of grammatical constraints reveals a strikingly uneven profile. The parser shows impressive accuracy in the application of some rather complex constraints, but makes many errors in the implementation of some relatively simple constraints&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;His plenary talk at LSA 2010 provided a variety of examples (many involving "have", curiously enough). Here's a pre-print PDF of his work on the topic:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://ling.umd.edu/~colin/research/papers/phillips_wagers_lau_2010.pdf"&gt;Grammatical illusions and selective fallibility in real-time language comprehension&lt;/a&gt;. Colin Phillips, Matt Wagers, &amp;amp; Ellen Lau. 26pp. June 2009. To appear in Language and Linguistics Compass.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-2542803350560357124?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/2542803350560357124/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=2542803350560357124' title='7 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/2542803350560357124'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/2542803350560357124'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/08/stanford-professors-and-grammatical.html' title='Stanford professors and grammatical illusions'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>7</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-3857123553074569013</id><published>2011-07-08T07:33:00.002-04:00</published><updated>2011-07-08T07:33:00.569-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='metaphor'/><title type='text'>"the definition of “metaphoricity” is problematic in itself"</title><content type='html'>One of the metaphor recognition papers I read this week had an interesting finding wrt inter-annotator agreement and metaphor: &lt;a href="http://www.abstract-project.eu/papers/metaphor_malta_2.2.1.pdf"&gt;The Automatic Identification of Conceptual Metaphors in Hungarian Texts: A Corpus-based Analysis &lt;/a&gt;(Babarczy et a., LREC 2010 Workshop). &lt;br /&gt;&lt;br /&gt;The purpose of the paper was to run a sort-of bake-off between three methods of creating source/target word lists (to be used by selection preference metaphor recognition system):&amp;nbsp;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Three different methods of compiling the word lists were tested: a) word association experiment, b) dictionary of synonyms, and c) reference corpus&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;Ultimately they found that their corpus based method was most successful as measured by recall/precision, but there was a more striking result rather buried in the paper that I feel deserves more analysis. They created a gold standard by hand-tagging a 30,000 word "baseline" corpus. Here's what they found:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;At the first attempt, inter-annotator agreement was only 17%. After refining the annotation instructions, we made a second attempt, which resulted in an agreement level of 48%, which is still a strikingly low value. These results indicate that &lt;b&gt;the definition of “metaphoricity” is problematic in itself&lt;/b&gt;&lt;/span&gt;&lt;/i&gt; [emphasis added]. &lt;br /&gt;&lt;br /&gt;They reported three general sources of inter-annotator DISagreement:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Direct vs. Indirect Reference&lt;/b&gt;: &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;For example, in the case of the conceptual metaphors ANGER IS HEAT or CONFLICT IS FIRE, the source domain should be an expression referring to a sort of “heated thing”. However, in some cases, one or the other annotator included words indirectly suggesting the presence of heat, such as kiolt ('extinguish'), kihől ( 'get cold') etc.&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Lexical Ambiguity&lt;/b&gt;: &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;For example, the expression eljutottam a mai napig ('I've gotten to this day') may or may not represent a CHANGE IS MOTION metaphor depending on whether the Hungarian verb jut (literally: get somewhere, reach a place by moving the entire body) is taken only to denote physical movement or to be ambiguous&lt;/span&gt;&lt;/i&gt;.&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Discrepancies in Classification&lt;/b&gt;:  &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;...it is difficult to make an informed decision on whether the following example contains a CHANGE IS MOTION or a PROGRESS IS MOTION &lt;/span&gt;&lt;/i&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;FORWARD metaphor, neither of which appear to be an intuitively correct choice: a járvány végigsöpört szülıvárosukon ('the epidemic swept through their hometown')&lt;/span&gt;&lt;/i&gt;.&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;Of the four or five articles I've reviewed on automatic metaphor identification, this is the only one which reported on the results of human-tagging a corpus for metaphor. This strikes me as the sort of thing that should be a first step for anyone seriously interested in this program (certainly anyone interested in the &lt;a href="http://www.iarpa.gov/solicitations_metaphor.html"&gt;IARPA Metaphor Program&lt;/a&gt;).I don't doubt that others have done this, but it seems to be under-reported, suggesting it is not be treated as a core part of the problem.&lt;br /&gt;&lt;br /&gt;I've complained in &lt;a href="http://thelousylinguist.blogspot.com/search/label/metaphor"&gt;my previous posts&lt;/a&gt; that there is an overly restricted definition of metaphor underlying contemporary approaches to auto identification, but even within a highly restricted definition like those used by Babarczy et al. and others, there appears to be problems at the heart of identification for humans. So what exactly is being identified?&lt;br /&gt;&lt;br /&gt;&lt;span style="float: left; padding-bottom: 5px; padding-left: 5px; padding-right: 5px; padding-top: 5px;"&gt;&lt;a href="http://www.researchblogging.org/"&gt;&lt;img alt="ResearchBlogging.org" src="http://www.researchblogging.org/public/citation_icons/rb2_large_gray.png" style="border: 0;" /&gt;&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Z3988" title="ctx_ver=Z39.88-2004&amp;amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;amp;rft.jtitle=LREC+2010+Workshop.+Proceedings&amp;amp;rft_id=info%3A%2F&amp;amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;amp;rft.atitle=The+Automatic+Identification+of+Conceptual+Metaphors+in+Hungarian+Texts%3A+A+Corpus-Based+Analysis&amp;amp;rft.issn=&amp;amp;rft.date=2010&amp;amp;rft.volume=&amp;amp;rft.issue=&amp;amp;rft.spage=&amp;amp;rft.epage=&amp;amp;rft.artnum=http%3A%2F%2Fwww.abstract-project.eu%2Fpapers%2Fmetaphor_malta_2.2.1.pdf&amp;amp;rft.au=Anna+Babarczy&amp;amp;rft.au=Ildik%C3%B3+Bencze+M.&amp;amp;rft.au=Istv%C3%A1n+Fekete&amp;amp;rft.au=Eszter+Simon&amp;amp;rfe_dat=bpr3.included=1;bpr3.tags=Computer+Science+%2F+Engineering%2CPhilosophy%2CPsychology%2CSocial+Science%2COther%2CLinguistics%2C+Metaphor"&gt;Anna Babarczy, Ildikó Bencze M., István Fekete, &amp;amp; Eszter Simon (2010). The Automatic Identification of Conceptual Metaphors in Hungarian Texts: A Corpus-Based Analysis &lt;span style="font-style: italic;"&gt;LREC 2010 Workshop. Proceedings&lt;/span&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-3857123553074569013?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/3857123553074569013/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=3857123553074569013' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3857123553074569013'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3857123553074569013'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/07/definition-of-metaphoricity-is.html' title='&quot;the definition of “metaphoricity” is problematic in itself&quot;'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-1949009086764318085</id><published>2011-07-07T07:33:00.012-04:00</published><updated>2011-07-07T10:52:26.110-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='metaphor'/><title type='text'>more on auto metaphor recognition methods</title><content type='html'>A quick follow-up to my previous post on &lt;a href="http://thelousylinguist.blogspot.com/2011/07/big-picture-automatic-metaphor.html"&gt;automatic metaphor recognition&lt;/a&gt; wrt the IARPA &lt;a href="http://www.iarpa.gov/solicitations_metaphor.html"&gt;Metaphor Program&lt;/a&gt;. The paper &lt;a href="http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5681552&amp;amp;tag=1"&gt;Automatic Metaphor Recognition Based on Semantic Relation Patterns&lt;/a&gt; by Tang et al. challenges the dominant selectional preferences method by substituing their own Semantic Relations Patterns. They point out the problems with Selection Preferences (unfortunately I don't think they solved the problems with their own method, more on that in a bit).&lt;br /&gt;&lt;br /&gt;Again I'll give the Ling 101, &lt;i&gt;computational linguistics for dummies&lt;/i&gt; version (as I understand it ...): Selection Preferences assumes that words frequently co-occur with other words that are literally associated with the same semantic domain. For example,&lt;br /&gt;&lt;ol&gt;&lt;li&gt;&lt;i&gt;That ship has sailed the mighty ocean&lt;/i&gt;.&lt;/li&gt;&lt;li&gt;&lt;i&gt;That boat has sailed across lake Erie&lt;/i&gt;.&lt;/li&gt;&lt;li&gt;&lt;i&gt;That captain has sailed many seas&lt;/i&gt;.&lt;/li&gt;&lt;/ol&gt;In these three sentences, the verb &lt;i&gt;sailed&lt;/i&gt; occurs with three different subjects (&lt;i&gt;ship, boat, captain&lt;/i&gt;) and three different objects (&lt;i&gt;ocean, lake, seas&lt;/i&gt;), but all of them evoke the SAILING domain. So a computer could use this info to create a model of the verb &lt;i&gt;sail &lt;/i&gt;that would match up the semantics of its expected subjects and objects, then compare them to a new sentence. If the computer encountered the new sentence&lt;br /&gt;&lt;br /&gt;&amp;nbsp; &amp;nbsp; 4. &lt;i&gt;That student sailed through final exams&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;It could automatically use the model created from sentences 1-3 above to recognize that the verb &lt;i&gt;sailed &lt;/i&gt;occurs with a subject and object not from the SAILING domain, but rather from the STUDENT domain. Then it could use a metaphor mapping component to recognize that &lt;i&gt;HUMANS as MACHINES&lt;/i&gt; is an acceptable mapping and thus recognize that #4 might be coherent under a metaphorical interpretation.&lt;br /&gt;&lt;br /&gt;Tang et al. rightly point out that matching frequency-based selectional preferences is not the same thing as literal meaning. First, they note that some times, a metaphorical pairing is actually MORE FREQUENT than a litertal pairing. They use some Chinese examples, but I think the English translation makes the point. Take the following two uses of &lt;i&gt;close&lt;/i&gt;:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;i&gt;The plane is &lt;b&gt;close &lt;/b&gt;to the tower&lt;/i&gt;.&lt;/li&gt;&lt;li&gt;&lt;i&gt;Opinion are &lt;b&gt;close&lt;/b&gt;&lt;/i&gt;.&lt;/li&gt;&lt;/ul&gt;In their corpus, Chinese uses like '&lt;i&gt;opinions are close&lt;/i&gt;' were more frequent, even though this is a non-literal use of &lt;i&gt;close&lt;/i&gt;. Frequency would lead the Selectional Preference method to believe that the &lt;i&gt;opinions-&lt;/i&gt;type use is literal simply because it is more frequent. This outcome is predicted by Lakoff &amp;amp; Johnson, btw, because one of the core tenants of their seminal work on metaphors was that metaphors are NOT special uses of language, but rather quite common and normal.&lt;br /&gt;&lt;br /&gt;Tang et al.'s solution is a new method they call Semantic Relation Patterns. Their explanation is brief and highly technical, making it a slog to get through, but it hinges on incorporating an existing semantic relations knowledge base, HowNet, and adding a probabalistic model. Note, I had trouble getting the HowNet website to load, but here is a &lt;a href="http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=01276017"&gt;PDF explanation&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;How Net is an on-line common-sense knowledge base unveiling inter-conceptual relations and inter-attribute relations of concepts as connoting in Chinese and English bilingual lexicons. &lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;In my quick read the two methods differed only minimally in the crucial ways (namely, they are both lexalist and local). Semantic Relation patterns are still based on lexical semantics and still derived entirely locally. I don't see how SRP would handle this metaphor from my earlier post any better than SP:&lt;br /&gt;&lt;br /&gt;Imagine a situation in a biology class where two students, Alger and Miriam, were originally going to be partners for a lab assignment. Then they got into an argument. A third student, Annette, asks Miriam:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Annette&lt;/b&gt;: &lt;i&gt;Are you still going to be lab partners with Alger&lt;/i&gt;?&lt;/li&gt;&lt;li&gt;&lt;b&gt;Miriam&lt;/b&gt;: &lt;i&gt;No. That ship has sailed&lt;/i&gt;.&lt;/li&gt;&lt;/ul&gt;In this scenario, the sentence "&lt;i&gt;That ship has sailed&lt;/i&gt;" is entirely coherent and literal from a selectional preferences perspective (i.e., ships really do sail). Yet it is clearly being used metaphorically (there is literally no ship). Here, the metaphor is only detectable if we link two sentences together via co-reference. The phrase "the ship" does not co-refer to a real ship in the discourse. Rather, it refers to the possible event of &lt;i&gt;be-lab-partners-with-Alger&lt;/i&gt;. Unless we can link phrases between sentences and between types (i.e., allowing an NP to co-refer to an event), then we are not going to get a computer to recognize these types of metaphors (which I suspect are quite common).&lt;br /&gt;&lt;br /&gt;I appreciate Tang et al.'s critique of the SP method and their attempt to get beyond it, but I think their methodology fails to make the critical improvements to automatic metaphor recognition that will be crucial to creating a full scale tool that handles real world metaphor.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="float: left; padding-bottom: 5px; padding-left: 5px; padding-right: 5px; padding-top: 5px;"&gt;&lt;a href="http://www.researchblogging.org/"&gt;&lt;img alt="ResearchBlogging.org" src="http://www.researchblogging.org/public/citation_icons/rb2_large_gray.png" style="border: 0;" /&gt;&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Z3988" title="ctx_ver=Z39.88-2004&amp;amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;amp;rft.jtitle=International+Conference+on+Asian+Language+Processing&amp;amp;rft_id=info%3A%2F&amp;amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;amp;rft.atitle=Automatic+Metaphor+Recognition+Based+on+Semantic+Relation+Patterns&amp;amp;rft.issn=&amp;amp;rft.date=2010&amp;amp;rft.volume=&amp;amp;rft.issue=&amp;amp;rft.spage=&amp;amp;rft.epage=&amp;amp;rft.artnum=http%3A%2F%2Fieeexplore.ieee.org%2Fxpls%2Fabs_all.jsp%3Farnumber%3D5681552%26tag%3D1&amp;amp;rft.au=Xuri+Tang&amp;amp;rft.au=Weiguang+Qu&amp;amp;rft.au=Xiaohe+Chen&amp;amp;rft.au=Shiwen+Yu&amp;amp;rfe_dat=bpr3.included=1;bpr3.tags=Computer+Science+%2F+Engineering%2CPhilosophy%2CPsychology%2CSocial+Science%2CResearch+%2F+Scholarship%2CLinguistics%2C+Artificial+Intelligence%2C+Philosophy+of+Mind"&gt;Xuri Tang, Weiguang Qu, Xiaohe Chen, &amp;amp; Shiwen Yu (2010). Automatic Metaphor Recognition Based on Semantic Relation Patterns &lt;span style="font-style: italic;"&gt;International Conference on Asian Language Processing&lt;/span&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-1949009086764318085?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/1949009086764318085/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=1949009086764318085' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/1949009086764318085'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/1949009086764318085'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/07/more-on-auto-metaphor-recognition.html' title='more on auto metaphor recognition methods'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-5484084086318602255</id><published>2011-07-05T09:49:00.000-04:00</published><updated>2011-07-05T09:49:15.387-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='metaphor'/><title type='text'>the big picture: automatic metaphor identification</title><content type='html'>The recently popularized &lt;a href="http://www.iarpa.gov/solicitations_metaphor.html"&gt;IARPA Metaphor Program&lt;/a&gt; piqued my curiosity, so I've been reviewing a variety of articles on contemporary approaches to automatic metaphor identification. I've read three articles so far and one thing is somewhat dissapointing: they all severely restrict the notion of metaphor to mean local metaphors within single sentences.&lt;br /&gt;&lt;br /&gt;They all pay considerable lip service to Lakoff &amp;amp; Johnson's seminal 1980 work &lt;a href="http://www.amazon.com/Metaphors-We-Live-George-Lakoff/dp/0226468011"&gt;Metaphors We Live By&lt;/a&gt;, taking as gospel the notion that metaphor is defined as a mapping from one conceptual domain to another. But their examples are all of a limited type. Here are three representative examples from the papers I've been reading:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;i&gt;Achilles was a lion&lt;/i&gt;. (Babarczy et al.)&lt;/li&gt;&lt;li&gt;&lt;i&gt;The sky is sad&lt;/i&gt;. (Tang et al.)&lt;/li&gt;&lt;li&gt;&lt;i&gt;I attacked his arguments&lt;/i&gt; (Baumer)&lt;/li&gt;&lt;/ul&gt;What struck me is the methods used to identify metaphor are remarkably lexalist. The dominant strategy is Selectional Preferences whereby a list of source and target conceptual domains is created. Then from each, a list of words typically associated with that domain is culled from corpora or intuition or dictionaries. Then, each word is given a set of selectional preferences which constrain what kinds of subjects or predicates it typically occurs with.&lt;br /&gt;&lt;br /&gt;Here is my Ling 101 version of this methodology: If I understand correctly (and I may not), for Tang et al.'s example "The sky is sad", we would have a concept like THE ENVIRONMENT IS HUMAN. We would have a list of words typically associated with the environment (e.g., "sky") and a list of words typically associated with being human (for example "sad"). A computer could then recognize the following:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;The subject (&lt;i&gt;the sky&lt;/i&gt;) is associated with the environment.&lt;/li&gt;&lt;li&gt;The predicate (&lt;i&gt;sad&lt;/i&gt;) is associated with humans.&lt;/li&gt;&lt;li&gt;This subject (&lt;i&gt;the sky&lt;/i&gt;) is not typical for this predicate (&lt;i&gt;sad&lt;/i&gt;).&lt;/li&gt;&lt;li&gt;This sentence is incoherent on first analysis.&lt;/li&gt;&lt;li&gt;The concept THE ENVIRONMENT IS HUMAN links these non-typical phrases coherently.&lt;/li&gt;&lt;li&gt;This sentence is only coherent using conceptual mapping, therefore it is probably metaphorical.&lt;/li&gt;&lt;/ol&gt;This is a gross oversimplification, but I think it gets the big picture about right.&lt;br /&gt;&lt;br /&gt;At first blush, I'm impressed with the simplicity and elegance of this solution. However, it seems to me that much metaphorical language is not local like this (local here = within a single sentence). For example, imagine a situation in a biology class where two students, Alger and Miriam, were originally going to be partners for a lab assignment. Then they got into an argument. A third student, Annette, asks Miriam:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Annette&lt;/b&gt;: &lt;i&gt;Are you still going to be lab partners with Alger&lt;/i&gt;?&lt;/li&gt;&lt;li&gt;&lt;b&gt;Miriam&lt;/b&gt;: &lt;i&gt;No. That ship has sailed&lt;/i&gt;.&lt;/li&gt;&lt;/ul&gt;In this scenario, the sentence "&lt;i&gt;That ship has sailed&lt;/i&gt;" is entirely coherent from a selectional preferences perspective (i.e., ships really do sail). Yet it is clearly being used metaphorically (there is literally no ship). Here, the metaphor is only detectable if we link two sentences together via co-reference. The phrase "the ship" does not co-refer to a real ship in the discourse. Rather, it refers to the possible event of &lt;i&gt;be-lab-partners-with-Alger&lt;/i&gt;. Unless we can link phrases between sentences and between types (i.e., allowing an NP to co-refer to an event), then we are not going to get a computer to recognize these types of metaphors (which I suspect are quite common).&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="float: left; padding-bottom: 5px; padding-left: 5px; padding-right: 5px; padding-top: 5px;"&gt;&lt;a href="http://www.researchblogging.org/"&gt;&lt;img alt="ResearchBlogging.org" src="http://www.researchblogging.org/public/citation_icons/rb2_large_gray.png" style="border: 0;" /&gt;&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Z3988" title="ctx_ver=Z39.88-2004&amp;amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;amp;rft.jtitle=International+Conference+on+Asian+Language+Processing&amp;amp;rft_id=info%3A%2F&amp;amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;amp;rft.atitle=Automatic+Metaphor+Recognition+Based+on+Semantic+Relation+Patterns&amp;amp;rft.issn=&amp;amp;rft.date=2010&amp;amp;rft.volume=&amp;amp;rft.issue=&amp;amp;rft.spage=&amp;amp;rft.epage=&amp;amp;rft.artnum=http%3A%2F%2Fieeexplore.ieee.org%2Fxpls%2Fabs_all.jsp%3Farnumber%3D5681552%26tag%3D1&amp;amp;rft.au=Xuri+Tang&amp;amp;rft.au=Weiguang+Qu&amp;amp;rft.au=Xiaohe+Chen&amp;amp;rft.au=Shiwen+Yu&amp;amp;rfe_dat=bpr3.included=1;bpr3.tags=Computer+Science+%2F+Engineering%2CPhilosophy%2CPsychology%2CSocial+Science%2CResearch+%2F+Scholarship%2CLinguistics%2C+Artificial+Intelligence%2C+Philosophy+of+Mind"&gt;Xuri Tang, Weiguang Qu, Xiaohe Chen, &amp;amp; Shiwen Yu (2010). Automatic Metaphor Recognition Based on Semantic Relation Patterns &lt;span style="font-style: italic;"&gt;International Conference on Asian Language Processing&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Other citations:&lt;br /&gt;&lt;a href="http://www.abstract-project.eu/papers/metaphor_malta_2.2.1.pdf"&gt;The Automatic Identification of Conceptual Metaphors in Hungarian Texts&lt;/a&gt;: A&lt;br /&gt;Corpus-Based Analysis. Anna Babarczy, Ildikó Bencze M.1, István Fekete1, Eszter Simon1&lt;br /&gt;&lt;br /&gt;&lt;a href="http://ericbaumer.com/publications/Baumer-Dissertation.pdf"&gt;Computational Metaphor Identification to Foster Critical Thinking and Creativity&lt;/a&gt;. ERIC BAUMER (dissertation). 2009.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-5484084086318602255?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/5484084086318602255/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=5484084086318602255' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5484084086318602255'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5484084086318602255'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/07/big-picture-automatic-metaphor.html' title='the big picture: automatic metaphor identification'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-1616813287256732734</id><published>2011-07-01T08:33:00.000-04:00</published><updated>2011-07-01T08:33:00.627-04:00</updated><title type='text'>the largest whorfian study EVER! (and why it matters)</title><content type='html'>Let me take the ball Mark Liberman threw on Monday and run with it a bit. Liberman posted &lt;a href="http://languagelog.ldc.upenn.edu/nll/?p=2592"&gt;a thorough discussio&lt;/a&gt;n of Fausey and Broditsky's neo-Whorfian &lt;a href="http://csjarchive.cogsci.rpi.edu/proceedings/2008/pdfs/p205.pdf"&gt;English and Spanish speakers remember causal agents differently&lt;/a&gt;. Specifically, he invited readers to carefully examine the methodology of the&amp;nbsp;experiments themselves, and not just focus on the conclusions. It turns out that a few years ago another set of neo-Whorfians, &lt;a href="http://www.acsu.buffalo.edu/~jb77/"&gt;Jürgen Bohnemeyer&lt;/a&gt; and company, published a paper that addressed similar methodological concerns:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.acsu.buffalo.edu/~jb77/WaysToGo.pdf"&gt;Ways to go: Methodological considerations in Whorfian studies on motion events&lt;/a&gt;. (With S. Eisenbeiss and B. Narasimhan) Colchester: University of Essex, Department of Language and Linguistics (Essex Research Reports in Linguistics 50: 1-19). 2006.&lt;br /&gt;&lt;br /&gt;This paper addressed experiments involving motion events like &lt;i&gt;rolling&lt;/i&gt; and &lt;i&gt;falling &lt;/i&gt;whereas Fausey and Broditsky's work addressed agentivity like &lt;i&gt;breaking &lt;/i&gt;and &lt;i&gt;popping&lt;/i&gt;, but there's enough overlap to warrant some comparison, particularly since the Bohnemeyer et al. paper specifically addresses methodology wrt Whorfian experiments.&lt;br /&gt;&lt;br /&gt;But before I get into the details, let me state clearly why I think this is important. In other posts, I have dismissed popular lingo-topics like language evolution as outside the mainstream of linguistics because they don't bear directly on what I consider to be the center of the linguistics universe: How the brain does language. But linguistic relativity (aka, The Whorfian hypothesis) is one of the great questions of linguistics and cognitive science precisely because it bears directly on the question of how the brain does language. And we're only just now developing the proper tools and methodologies to study the question with scientific rigor. It may turn out that language does not affect other cognitive processes or the effect is minor. I don't care. I just want to know one way or the other. And it's work like Bohnemeyer's and Broditsky's that will lead us to knowing, eventually.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;Now the fun stuff&lt;/span&gt;&lt;/b&gt;.&lt;br /&gt;&lt;a name='more'&gt;&lt;/a&gt;&lt;br /&gt;Bohnemeyer et al. start with an assumption about language types based on Talmy's cognitive semantics typology that classes languages&amp;nbsp;as either satellite-framing or verb-framing, From &lt;a href="http://linguistics.buffalo.edu/people/faculty/talmy/talmyweb/Recent/verb.html#Introduction"&gt;Talmy&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;...&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;languages fall into two main types on the basis of where the Path is represented in a sentence expressing a Motion event [...]. In this two-category typology, if the Path is characteristically represented in the main verb or verb root of a sentence, the language is "verb framed", but if it is characteristically represented in the satellite and/or preposition, the language is "satellite framed"&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;In this typology, English is a satellite-frame language and Spanish is a verb-frame language. Bohnemeyer et al. &amp;nbsp;conclude that “&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;verb-framed” (V) languages lexicalize path information in the verb root. Consequently, they require &lt;b&gt;a separate expression&lt;/b&gt; for the “manner” of the motion event [...] The additional syntactic position renders manner in V–languages less “codable”. Consequently, manner is encoded more routinely in S–language discourse. Thus, the question arises &lt;b&gt;whether S–language native speakers also pay more attention to manner&lt;/b&gt; when committing a motion event to memory and/or comparing it to other events&lt;/span&gt;&lt;/i&gt;." (emphasis added).&lt;br /&gt;&lt;br /&gt;So satellite-frame languages like English push path info into separate phrases and verb-frame languages like Spanish push manner into separate phrases. Take the following English sentences:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;i&gt;The ball rolled down the hill.&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;The ball bounced down the hill&lt;/i&gt;.&lt;/li&gt;&lt;/ul&gt;In&amp;nbsp;English, the manner of motion (&lt;i&gt;rolling &lt;/i&gt;vs. &lt;i&gt;bouncing&lt;/i&gt;) is encoded on the verb (i.e., different verbs), but the path is encoded as a separate phrase ("&lt;i&gt;down the hill&lt;/i&gt;") as opposed to something like this:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;i&gt;The ball moved down the hill rollingly&lt;/i&gt;.&lt;/li&gt;&lt;/ul&gt;Here the verb &lt;i&gt;move &lt;/i&gt;is fairly simple and encodes no manner info by itself. As a note, Talmy's typology is based on colloquial and high frequency language use. Many languages allow both kinds of constructions, it's just that one is highly frequent in "everyday" speech.&lt;br /&gt;&lt;br /&gt;And there's the crux of Bohnemeyer et al.'s experiments. But, they also noticed conflicting results in several other neo-Whorfian studies that they believed were a result of methodology so they set out to investigate methodology. Those other papers used a similarity-judgement task, so Bohememeyer et al. used that task as well. Particularly, they gave particpants a series of three short animated videos to watch. For example, one video was a tomato rolling down a hill. Their methodology is difficult to understand if you just read through, so I'm going to try to help with some bold-facing and re-structuring of paragraphs and a few re-wordings. What follows is a near-quote:&lt;br /&gt;&lt;br /&gt;...&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;we conducted a similarity-judgment task analogous in design to those reviewed above with native speakers of 17 genetically and typologically diverse languages – to our knowledge, &lt;b&gt;the largest sample of languages ever used in a Whorfian study&lt;/b&gt;. To control for the effects of individual manner or path contrasts, we cross-classified six path types with four manner types, realizing all possible combinations in our stimulus set and counterbalancing for frequency of occurrence&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;[... FYI, the 17 languages were &lt;i&gt;Basque, Catalan, Dutch, French, German, Hindi, Italian, Jalonke, Japanese, Lao, Polish, Spanish, Tamil, Tidore, Tiriyó, Turkish,&lt;/i&gt; and &lt;i&gt;Yukatek&lt;/i&gt;]&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/i&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;The materials consisted of 72 triads&lt;span class="Apple-style-span" style="font-style: normal;"&gt; &lt;span class="Apple-style-span" style="color: black;"&gt;(i.e., 72 sets of three videos).&lt;/span&gt;&lt;/span&gt; The targets&lt;span class="Apple-style-span" style="font-style: normal;"&gt;&lt;span class="Apple-style-span" style="color: black;"&gt; (i.e., main videos) &lt;/span&gt;&lt;/span&gt;were 24 motion-event video-animations which systematically varied:&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;ol&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;four manners of motion (SPIN, ROLL, BOUNCE, SLIDE),&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;three scenarios with different “ground” objects (inclined ramp; field with tree and rock; field with hut and cave), and&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;two directed paths (motion UP/RIGHT, DOWN/LEFT)&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;/ol&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;For each of these targets (e.g. tomato-ROLLs-UP-RAMP, see Figure1), we created a same-manner (different-path) variant (e.g. tomato-ROLLs-DOWN-RAMP), and three types of same-path (different-manner) variants (here, OUNCE/SLIDE/SPIN-UP-RAMP).&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/i&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;This resulted in 72 triads with a target clip, a same-manner variant and one of the three same-path variants. The variants were presented side by side, 1 second after the target-clip presentation ended (see Figure 1)&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_uqqKLCxLtII/TH1ESwauzuI/AAAAAAAAAVc/ov0yp1Me2JQ/s1600/bohnemeyer_tomato.JPG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="180" src="http://1.bp.blogspot.com/_uqqKLCxLtII/TH1ESwauzuI/AAAAAAAAAVc/ov0yp1Me2JQ/s320/bohnemeyer_tomato.JPG" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div style="text-align: center;"&gt;(figure&amp;nbsp;from page 7)&lt;/div&gt;&lt;br /&gt;Their methodology was as follows (emphasis added to help readability):&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;The 72 triads were distributed across &lt;b&gt;6 randomized presentation lists&lt;/b&gt; in a Latin-square design.&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Each list was given to &lt;b&gt;two participants per language&lt;/b&gt; (in reverse presentation order).&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Each list contained &lt;b&gt;12 triads&lt;/b&gt;, with the target clips combining the four manners of motions with the three scenarios so that each participant saw all 12 combinations in the target clip.&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;The number of UP/RIGHT and DOWN/LEFT motions in the target- and variant-clips as well as the manners of motions in the different-manner variants was &lt;b&gt;counterbalanced across the lists&lt;/b&gt;, as was the position in which the variants were presented on the screen.&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;The &lt;b&gt;position of the ground objects remained the same&lt;/b&gt; in all clips.&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Minimal variations in the triad clips allow us to take into account the effects of different manners, paths and scenarios, but make our &lt;b&gt;test triads quite similar&lt;/b&gt;.&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Added &lt;b&gt;38 filler triads&lt;/b&gt; to each list, which involved other types of events and variations (e.g. replacing either the agent or the goal in a possession-transfer event with another character) and served to prevent the participants from settling into a fixed response pattern.&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Instructions to participants were &lt;b&gt;translated into their native languages&lt;/b&gt;.&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;/ul&gt;So participants were watching a bunch of short videos in sets of three [A, B, C]. If I understand correctly, for each set, they were asked to determine if B or C is more similar to A than the other. &amp;nbsp;For example, imagine I show you three cartoons:&lt;br /&gt;&lt;br /&gt;A) a tomato rolling up a hill (forget physics, it's cartoon world).&lt;br /&gt;B) a tomato rolling down a hill.&lt;br /&gt;C) a tomato bouncing up a hill.&lt;br /&gt;&lt;br /&gt;Then I ask you to chose B or C as more similar to A (forced choice, you gotta pick one).&amp;nbsp;If you say B, then the manner of rolling is more salient for you than directional path. If you say C, then directional path is more salient than manner.&lt;br /&gt;&lt;br /&gt;Now, if your native language forces you to push path information into separate phrases like prepositional phrases (e.g., "&lt;i&gt;up a hill&lt;/i&gt;"), it may be the case that this would cause you to say C is more similar because the path is the same. Hence your language caused your perception of similarity to behave in a particular way, hence language affects thought.&lt;br /&gt;&lt;br /&gt;So what did they discover?&lt;br /&gt;&lt;ul&gt;&lt;li&gt;First, they found no simple categorical distinction between verb-frame language speakers and satellite-language speakers decisions. If they had, it would have been fairly strong evidence for linguistic relativity and maybe then they'd be the ones writing WSJ articles instead of Broditsky, haha.&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;They did find that manner of motion by itself seemed to influence similarity judgement significantly (in the technical sense), but this influence was independent of language (i.e., it didn't have anything clearly to do with verb-framing or satellite-framing).&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;They found that the data they gathered could not be accounted for by socio-cultural factors because they had such a diverse participant pool.&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;Interestingly, they did find that contrasts between certain manner pairs DID correlate significantly with language. So, Turkish speakers (a verb-frame language) chose the video with the same manner 88% of the time when the contrast was &lt;i&gt;slide &lt;/i&gt;vs. &lt;i&gt;roll&lt;/i&gt;, but only 50% of the time when it was &lt;i&gt;roll &lt;/i&gt;vs. &lt;i&gt;bounce&lt;/i&gt;. They left open the question of why this is so.&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;The most striking conclusion, however, was that more is better in the sense that they discovered that "&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;strong claims regarding the (in)validity of the Whorfian hypothesis in the encoding of motion events &lt;b&gt;cannot be made on the basis of a limited number of languages&lt;/b&gt; or a restricted range of manner and path contrasts&lt;/span&gt;&lt;/i&gt;." (emphasis added).&lt;/li&gt;&lt;/ul&gt;I found the following&amp;nbsp;passage to be &lt;b&gt;the most important&lt;/b&gt; in the paper, especially wrt Fausey and Broditsky's work:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;If only Polish (an S-language) and Yukatek (a V-language) had been selected to test for language-specific performance effects in our nonlinguistic task, the highly significant difference between speakers of these two languages in their degree of manner preference would have supported a strong version of the Whorfian claim. Conversely, had we chosen only to contrast German (S) with Spanish (V) using the identical stimuli and experimental procedure, we would have reached the opposite conclusion, since speakers of these languages do not differ significantly from one another in the frequency with which they base their similarity judgements on manner of motion&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;The lesson here is that methodology counts.&amp;nbsp;Fausey and Broditsky used a very small participant pool and it's hard to tell how that affected their results.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;FYI: the word &lt;i&gt;profound&lt;/i&gt; does not occur in Bohnemeyer's article.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-1616813287256732734?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/1616813287256732734/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=1616813287256732734' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/1616813287256732734'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/1616813287256732734'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/07/largest-whorfian-study-ever-and-why-it.html' title='the largest whorfian study EVER! (and why it matters)'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_uqqKLCxLtII/TH1ESwauzuI/AAAAAAAAAVc/ov0yp1Me2JQ/s72-c/bohnemeyer_tomato.JPG' height='72' width='72'/><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-7346599310627051648</id><published>2011-06-29T16:18:00.000-04:00</published><updated>2011-06-29T16:18:13.020-04:00</updated><title type='text'>Through the Language Glass (Part 2) [reposted]</title><content type='html'>This is part 2 of my review of Guy Deutscher's new book &lt;a href="http://www.amazon.com/Through-Language-Glass-Different-Languages/dp/080508195X"&gt;Through the Language Glass: Why the World Looks Different in Other Languages&lt;/a&gt;. This covers &lt;i&gt;The Language Lens&lt;/i&gt; (129-249). Part 1 is &lt;a href="http://thelousylinguist.blogspot.com/2010/09/through-language-glass-part-1.html"&gt;here&lt;/a&gt;. This review will cover the scientific evidence that Deutscher reviews suggesting that language affects thought, and will end with&lt;i&gt; a shocking proposal&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;To sum up my review of part one: &lt;i&gt;meh&lt;/i&gt;. Okay, we've established that culture can influence language. This is a lot less controversial than Deutscher makes it seem and he spent a large amount of text defending that position. Okay, whatever, time to move on. In part 2 he again begins with historical review explaining why he thinks Whorf was a con man, but also why he thinks the core insights of early linguist relativity deserve closer, honest investigation. He complains that based his Hopi claims on just one lonely informant (p142). We'll see later that Deutscher himself falls for the same trap. He replaces Whorf with the Boas-Jakobson principle that languages differ in what they &lt;i&gt;must &lt;/i&gt;convey, not what they &lt;i&gt;may &lt;/i&gt;convey” (151). I respect Deutscher for making this a central theme in his book because I think he's right. To parrot his own recitation of Humbolt: &lt;i&gt;any thought can be expressed in any language&lt;/i&gt;. It is what our native language forces us to foreground that makes linguistic relativity an interesting topic.&lt;br /&gt;&lt;br /&gt;Deutscher spends most of the second part of the book reviewing three areas of language that have provided evidence that language affects thought: &lt;i&gt;spatial coordinates&lt;/i&gt;, &lt;i&gt;grammatical gender&lt;/i&gt;, and &lt;i&gt;color terms&lt;/i&gt; (familiar from part 1). The general point I want to make about his evidence is that it is far weaker than he maintains. But is is interesting. A brief set of reactions:&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;Spatial Coordinates -- everything is embodied&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;Most of his argumentation about the affect of spacial coordinate terms on thought stems from Levinson's evidence from speakers of the Australian language Guugu Yimithirr which is famous for giving us the word “kangaroo.” Speakers of GY do not generally use ego-centric terms like "right" and "left" but rather use cardinal direction terms like "east" and "west." As a result, Deutscher claims, they remember information about situations differently than speakers of English. They have, so the argument goes, a &lt;i&gt;perfect pitch&lt;/i&gt; for direction and they are always attuned to where north is. Deutscher's claim is that only the linguistic repetition of such terms can possibly account for this. Hence, their language affects what they pay attention to and what they remember, hence language affects thought.&lt;br /&gt;&lt;br /&gt;I've never found this line of research all that convincing regarding linguistic relativity and Deutscher does not really add much to the debate. Like Deutscher's complaint above regarding Whorf's one lonely Hopi speaker, it turns out there are not many native speakers of Guugu Yimithirr left and haven't been for a while. These experiments on directional language involve very few speakers, and most of them have both cardinal direction and ego-centric direction in their dialect. If we're going to complain about Whorf's restricted subject pool, we must complain about Levinson's too.&lt;br /&gt;&lt;br /&gt;But more to the point, I believe all direction terms are ultimately ego-centric insofar as they are &lt;i&gt;embodied&lt;/i&gt;. The terms "north" and "south" are not magically universal. They are based on a human being's body and orientation (i.e., ego-centric). Don't believe me, ask yourself, what does "north" mean in space? What does "north" mean to an amoeba? Mostly what Deutscher does in his discussions of direction terms is reiterate the point he belabored in Part 1: culture affects language. Yeah, we got that already.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: orange;"&gt;The rise of similarity judgments&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;That is until he discusses the table experiments. These experiments show subjects tables with objects on them and ask them to arrange them in accordance with a target. Basically, they ask for similarity judgement. How can you make this table arrangement &lt;i&gt;similar to&lt;/i&gt; the previous table. This methodological paradigm has become prominent in psycholinguistics and cognitive linguistics, especially studies testing linguistic relativity. In fact, all of the studies Deutscher discusses are similarity judgment studies of one sort or another. The point is that I show you one target thing, then two test things and ask, which test is MORE SIMILAR to the target than the other? Ultimately Deutscher himself problematizes spatial coordinate terms so much, they fall flat and remain unconvincing as a base of evidence for linguistic relativity.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;Grammatical Gender&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;Most languages have terms for classifying things. Some languages have more elaborate classifier systems than others. In German, the term for &lt;i&gt;the fork&lt;/i&gt; is &lt;i&gt;die Gabel&lt;/i&gt;, marked by feminine &lt;i&gt;die&lt;/i&gt;. Ultimately, most languages with elaborate classifiers have systems that can be described as incoherent in so far as most things given one classification have no inherent properties that signify that classification (there is nothing inherently feminine about a fork). However, Deutscher provides evidence that speakers of languages with grammatical&amp;nbsp;gender will evoke properties of things in keeping with their gender classifier, suggesting that the classifier is causing them to imagine a fork would speak with a female voice, for example. But these experiments mainly test vague associations of imagination, not linguistic causality, as Deutscher admits.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;Color Terms&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;It is not until chapter 9 &lt;i&gt;Russian Blues&lt;/i&gt; that Deutscher really delivers the goods. It is this chapter which provides the most interesting evidence for the effect of language on thought. Pity it is only about 15 pages of the book. The whole book should have been more like this. The facts he discusses involve the basic point that the brain sees what it wants to see. It turns out our perception of color has little to do with any objective feature of the thing we're looking at (he explains this fact brilliantly in the Appendix which I highly recommend, and frankly, should have been the first chapter, not relegated to the attic of an appendix). The point is that our brains change the input. As our eyes take in objective photons, our brain photoshops the input (a great analogy from Deutscher which really brings the point home).&lt;br /&gt;&lt;br /&gt;The experimental results Deutscher discusses involve more similarity judgements, albeit with a twist. Instead of relying solely on the similarity judgments, researchers studied the more objective reaction time. They showed people different color patches and asked them to judge the sameness. Despite the various and clever variations on this theme, they all relied on subjective judgements of similarity. And this is where they fail to extricate themselves from the problem of strategizing.&lt;br /&gt;&lt;br /&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;Unfortunately they all share the critical flaw that making a similarity judgment is a logical reason act and may be mitigated by strategizing. Deutscher discusses this fact, but doesn't realize that none of the fixes work. A similarity judgment is always a logical process susceptible to the effects of strategizing. This will be a major issue in my &lt;i&gt;Shocking Proposal&lt;/i&gt; at the end. You see, regardless of how clever the test, as long as you are basically asking a subject to make a similarity judgment, you are asking them to reason about the task. So your results will be tinged by the strategizing of human subjects as they logically try to game the system. This is well known in psycholinguistics and difficult to avoid. So how do you objectively test what colors a person considers blue?&lt;/div&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;A Shocking Proposal&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;The paradigm already exists. How can you objectively prove that English speakers really do consider aspirated /kh/ and unaspirated&amp;nbsp;/k/ both the same phoneme? You condition them to fear aspirated /kh/ by shocking them every time they hear it (measuring their galvanic skin response). Once they are conditioned, you then play them unaspirated /k/ (with no shock) and check to see if you get the same GSR spike (in anticipation).&lt;br /&gt;&lt;br /&gt;Okay, now apply this to color terms. Condition subjects to fear center of the category blue, then show them gradations. What causes the GSR spike? That's what they consider blue. now do that with speakers of 40 different languages.&lt;br /&gt;&lt;br /&gt;If the hippies on the human subjects review board let you do it, there's your dissertation.&lt;br /&gt;&lt;br /&gt;&lt;span style="float: left; padding-bottom: 5px; padding-left: 5px; padding-right: 5px; padding-top: 5px;"&gt;&lt;a href="http://www.researchblogging.org/"&gt;&lt;img alt="ResearchBlogging.org" src="http://www.researchblogging.org/public/citation_icons/rb2_large_gray.png" style="border: 0;" /&gt;&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Z3988" title="ctx_ver=Z39.88-2004&amp;amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;amp;rft.jtitle=Metropolitan+Books&amp;amp;rft_id=info%3A%2F&amp;amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;amp;rft.atitle=Through+the+Language+Glass%3A+Why+the+World+Looks+Different+in+Other+Languages.&amp;amp;rft.issn=&amp;amp;rft.date=2010&amp;amp;rft.volume=&amp;amp;rft.issue=&amp;amp;rft.spage=&amp;amp;rft.epage=&amp;amp;rft.artnum=http%3A%2F%2Fus.macmillan.com%2Fthroughthelanguageglass&amp;amp;rft.au=Guy+deutscher&amp;amp;rfe_dat=bpr3.included=1;bpr3.tags=Anthropology%2CPhilosophy%2CPsychology%2CSocial+Science%2CLinguistics"&gt;Guy Deutscher (2010). Through the Language Glass: Why the World Looks Different in Other Languages. &lt;span style="font-style: italic;"&gt;Metropolitan Books&lt;/span&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-7346599310627051648?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/7346599310627051648/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=7346599310627051648' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7346599310627051648'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7346599310627051648'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/06/through-language-glass-part-2-reposted.html' title='Through the Language Glass (Part 2) [reposted]'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-3707597345152157621</id><published>2011-06-18T10:29:00.000-04:00</published><updated>2011-06-18T10:29:00.265-04:00</updated><title type='text'>Super 8: more homework than movie</title><content type='html'>A rare non-linguistics post: Saw &lt;i&gt;Super 8&lt;/i&gt;. Critics are spot on: it's a movie that shows how to construct a coherent narrative like they used to make. It is simple, straight forward. It makes contemporary movies like &lt;i&gt;Transformers&lt;/i&gt;, &lt;i&gt;Thor&lt;/i&gt;, and &lt;i&gt;Pirates &lt;/i&gt;seem amateurish and sad. It's like a tutorial in narrative film-making.&lt;br /&gt;&lt;br /&gt;Unfortunately, it's like a tutorial in narrative film-making ... more like homework than enjoyment. I felt like I should be taking notes. Like there'd be a quiz at the end.&lt;br /&gt;&lt;br /&gt;One reason this is so is simple modern history. 30 years ago, good narrative films were strictly the domain of Hollywood. TV was the domain of kitsch and schmaltz. But since the 1990s, TV (largely due to HBO, and more recently Showtime, TNT, AMC...) has taken over the mantel of prime story teller. We know from good narratives. We know from good structure. We know from good plot. We know from good formula (we loved &lt;i&gt;Law and Order&lt;/i&gt; for 20 years, after all).&lt;br /&gt;&lt;br /&gt;So now, in 2011, the &lt;i&gt;ET/Goonies&lt;/i&gt; routine rings a bit hollow. Like movie-plot-by-the-book; paint-by-numbers film making. Sure, it's more coherent than &lt;i&gt;Thor &lt;/i&gt;(a truly gawdawful film), but &lt;i&gt;Breaking Bad&lt;/i&gt; and &lt;i&gt;Mad Men&lt;/i&gt; are still way better, and they don't cost me $11 and a special trip. Don't even get me started on how much better the revived &lt;i&gt;Doctor Who&lt;/i&gt; is than this Hollywood trash.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-3707597345152157621?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/3707597345152157621/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=3707597345152157621' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3707597345152157621'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3707597345152157621'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/06/super-8-more-homework-than-movie.html' title='Super 8: more homework than movie'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-4757634390731672561</id><published>2011-06-15T10:03:00.000-04:00</published><updated>2011-06-15T10:03:05.729-04:00</updated><title type='text'>simple english wikipedia</title><content type='html'>I didn't know this existed until it was referenced briefly in an unrelated article. &lt;a href="http://simple.wikipedia.org/wiki/Main_Page"&gt;Simple English Wikipedia&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;This is the front page of the Simple English Wikipedia. Wikipedias are places where people work together to write encyclopedias in different languages. We use Simple English words and grammar here. The Simple English Wikipedia is for everyone! That includes children and adults who are learning English.&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;Great idea! I perused some of the articles relating to linguistics, language, and grammar. Not bad. Under &lt;a href="http://simple.wikipedia.org/wiki/Grammar"&gt;Grammar&lt;/a&gt;, they write:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;When we speak, we use the native-person's grammar, or as near as we can. When we write, we try to write with correct usage grammar. So, speaking and writing a language each have their own style.&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;While one may want to quibble a little with this (I don't quite understand the "as near as we can" hedge), it's actually a nice way to explain the difference between speaking and writing to, say, a 10 year old.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-4757634390731672561?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/4757634390731672561/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=4757634390731672561' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/4757634390731672561'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/4757634390731672561'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/06/simple-english-wikipedia.html' title='simple english wikipedia'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-287824114463063932</id><published>2011-04-28T09:50:00.001-04:00</published><updated>2011-04-29T00:09:52.086-04:00</updated><title type='text'>Shakespeare and the brain? maybe not...</title><content type='html'>&lt;div class="MsoNormal"&gt;Jezebel, &lt;a href="http://www.guardian.co.uk/books/2011/apr/24/shakespeare-anniversary-neuroscience-robert-mccrum"&gt;The Guardian&lt;/a&gt;, and other sources have been promoting recent &lt;a href="http://jezebel.com/#!5795876/this-is-your-brain-on-shakespeare"&gt;neuroscience involving reading Shakespeare&lt;/a&gt;. This research has popped up in the blogosphere before and was widely misunderstood. I fear this time is no better, so I offer this re-posting of my original response from &lt;a href="http://thelousylinguist.blogspot.com/2007/12/andrew-sullivan-please-take-cog-sci.html"&gt;late 2007&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;Even though he blogs at a mere &lt;i&gt;undergrad&lt;/i&gt; level I basically respect Andrew Sullivan as a blogger.  He blogs about a diverse set of topics and has thoughtful and intelligent (even if controversial) comments and analysis.  And he’s prolific, to say the least (surely the advantage of being a professional blogger, rather than stealing the spare moment at work while your test suite runs its course).  That said, he can sometimes really come across as a snobbish little twit.  Like yesterday when he &lt;a href="http://andrewsullivan.theatlantic.com/the_daily_dish/2007/12/the-neuroscienc.html"&gt;linked &lt;/a&gt;to an &lt;a href="http://moreintelligentlife.com/node/298"&gt;article&lt;/a&gt; about Shakespearean language which talks about a psycholinguistics study initiated by an English professor, &lt;a href="http://www.liv.ac.uk/english/staff/philipdavis.htm"&gt;Philip Davis&lt;/a&gt;&lt;a href="http://www.liv.ac.uk/english/staff/philipdavis.htm"&gt;&lt;/a&gt;; as is so often the case, the professor has wildly exaggerated the meaning of the study.  Please see Language Log’s post &lt;a href="http://itre.cis.upenn.edu/~myl/languagelog/archives/004578.html"&gt;&lt;i&gt;Distracted By The Brain&lt;/i&gt;&lt;/a&gt; for related discussion.  Here’s crucial quote from that post:&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="-moz-background-clip: -moz-initial; -moz-background-inline-policy: -moz-initial; -moz-background-origin: -moz-initial; background: rgb(224, 224, 224) none repeat scroll 0% 50%;"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;b&gt;&lt;span style="color: green;"&gt;The neuroscience information had a particularly striking effect on non-experts’ judgments of bad explanations, masking otherwise salient problems in these explanations.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;b&gt;&lt;span style="color: blue;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;b&gt;&lt;span style="color: blue;"&gt;My claim&lt;/span&gt;&lt;/b&gt;:  the neuroscience study discussed in the &lt;st1:city st="on"&gt;Davis&lt;/st1:city&gt; article distracts the reader from &lt;st1:place st="on"&gt;&lt;st1:city st="on"&gt;Davis&lt;/st1:city&gt;&lt;/st1:place&gt;’s essentially absurd interpretations, and Andrew Sullivan takes the bait, hook, line and sinker (and looks like a twit in the end). &lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;o:p&gt;&lt;/o:p&gt;The article does not go into the crucial details of the study, but it says that it involves EEG (electroencephalogram) &lt;span style="color: blue;"&gt;and&lt;/span&gt; MEG (magnetoencephalograhy) &lt;span style="color: blue;"&gt;and&lt;/span&gt; fMRI (Functional Magnetic Resonance Imaging) noting that only the EEG portion has been completed.  A pretty impressive array of tools for a single psycholinguistics study, I must say.  Most published articles in the field would involve one or maybe two of these, but all three for a single study?  Wow, impressive. &lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;o:p&gt;&lt;/o:p&gt;It’s not clear to me if this was a well designed study or not (my hunch is, no, it is a poorly designed study, but without the crucial details, I really don’t know).  However, it is undeniable that professor Davis has gone off the deep end of interpretation.  The study does not even involve Shakespearean English!!!  It involves Modern English!  Then &lt;st1:city st="on"&gt;&lt;st1:place st="on"&gt;Davis&lt;/st1:place&gt;&lt;/st1:city&gt; makes the following claims (false, all of them, regardless of the study):&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="-moz-background-clip: -moz-initial; -moz-background-inline-policy: -moz-initial; -moz-background-origin: -moz-initial; background: rgb(224, 224, 224) none repeat scroll 0% 50%;"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;b&gt;&lt;span style="color: green;"&gt;["word class conversion"] is an &lt;u&gt;economically compressed form of speech&lt;/u&gt;, as from an age when the &lt;u&gt;language was at its most dynamically fluid and formatively mobile&lt;/u&gt;; an age in which &lt;u&gt;a word could move quickly from one sense to another&lt;/u&gt;… &lt;/span&gt;&lt;/b&gt;&lt;span style="color: black;"&gt;(underlines added)&lt;/span&gt;&lt;b&gt;&lt;span style="color: green;"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;o:p&gt;&lt;/o:p&gt;This is the classic English professor bullshit. I don’t even know what “economically compressed” means (&lt;st1:place st="on"&gt;&lt;st1:city st="on"&gt;Davis&lt;/st1:city&gt;&lt;/st1:place&gt; gives no definition); it has no meaning to linguistics that I know of.  The quote also suggests Shakespeare’s English had some sort of magical linguistic qualities that today’s English does not possess.  FALSE!  Modern English allows tremendous productivity of constructions, neologisms, and ambiguity.  A nice introduction to ambiguity can be found here: &lt;a href="http://www.kurzweilai.net/meme/frame.html?main=/articles/art0186.html"&gt;&lt;i&gt;Ambiguous Words&lt;/i&gt;&lt;/a&gt; by &lt;a href="http://www.kurzweilai.net/bios/frame.html?main=/bios/bio0058.html"&gt;George A. Miller&lt;/a&gt;.&lt;a href="http://www.kurzweilai.net/bios/frame.html?main=/bios/bio0058.html" target="_blank"&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;st1:city st="on"&gt;&lt;st1:place st="on"&gt;Davis&lt;/st1:place&gt;&lt;/st1:city&gt; ends with a flourish of artistic bullshit hypothesizing:&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="-moz-background-clip: -moz-initial; -moz-background-inline-policy: -moz-initial; -moz-background-origin: -moz-initial; background: rgb(224, 224, 224) none repeat scroll 0% 50%;"&gt;&lt;b&gt;&lt;span style="color: green;"&gt;For my guess, more broadly, remains this: that Shakespeare's syntax, its shifts and movements, can lock into the existing pathways of the brain and actually move and change them—away from old and aging mental habits and easy long-established sequences.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;a href="http://en.wikipedia.org/wiki/Neuroplasticity"&gt;Neuroplasticity&lt;/a&gt;&lt;a href="http://en.wikipedia.org/wiki/Neuroplasticity"&gt;&lt;/a&gt; is only just now being studied in depth and it’s far from well understood, but the study in question says NOTHING about plasticity!!!  There’s also no reason to believe that Shakespeare’s language does anything that other smart, well crafted language does not do.  And we’re a generation at least away from having the tools to study any of this.&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;o:p&gt;&lt;/o:p&gt;I’m accustomed to simply letting these all too common chunks of silliness go without comment, but then Andrew had to slip in his unfortunate bit of snooty arrogance.  After pasting a chunk of the obvious linguistics bullshit on his site (then follow-up comments), he has to finish with &lt;i&gt;"I knew all that already&lt;/i&gt;".  Exactly what did you know, Andrew?  Since all of the major claims Davis makes are obvious bullshit, what exactly do you claim to have had prior knowledge of?  What did Andrew know, and when did he know it?&lt;/div&gt;&lt;div class="MsoNormal"&gt;&lt;br /&gt;&lt;o:p&gt;&lt;/o:p&gt;Really, Andrew, did you never take so much as a single linguistics course during all your years at Harvard and &lt;st1:place st="on"&gt;&lt;st1:city st="on"&gt;Oxford&lt;/st1:city&gt;&lt;/st1:place&gt;?  The University at &lt;st1:state st="on"&gt;Maryland&lt;/st1:state&gt; has excellent &lt;a href="http://www.ling.umd.edu/cnl/"&gt;psycholinguists &lt;/a&gt;as does &lt;a href="http://explore.georgetown.edu/people/michael/?PageTemplateID=129"&gt;&lt;st1:place st="on"&gt;&lt;st1:city st="on"&gt;Georgetown&lt;/st1:city&gt;&lt;/st1:place&gt;&lt;/a&gt;.  Please, consider sitting in on a course, won’t you?&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-287824114463063932?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/287824114463063932/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=287824114463063932' title='5 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/287824114463063932'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/287824114463063932'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/04/shakespeare-and-brain-maybe-not.html' title='Shakespeare and the brain? maybe not...'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>5</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-6283861525224022291</id><published>2011-04-11T10:53:00.006-04:00</published><updated>2011-04-11T15:34:54.837-04:00</updated><title type='text'>Google linguist interview</title><content type='html'>Purpose:  This post reviews my experience interviewing for a Linguist position at Google in Santa Monica, CA on February 29, 2008. I've long meant to post this but only now got around to it. There are lots of &lt;a href="http://www.google.com/search?sourceid=chrome&amp;amp;ie=UTF-8&amp;amp;q=google+interview"&gt;Google interview stories&lt;/a&gt; on the web. It appears to be its own genre. This is my contribution to the genre.&lt;br /&gt;&lt;br /&gt;I originally wrote it as an email to a friend who wanted to know how my big day at Google went. It’s rather long, but then again, you don’t have to read it, you clearly have better things to do…&lt;br /&gt;&lt;br /&gt;I found a job posting on the Google jobs board for a full time Linguist. I applied and was given a phone interview with a recruiter around late January, 2008:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;Thank you for your interest in Google.  I'd like to set up a time for us to discuss Google Linguist opportunities and your qualifications. Please let me know a day/time when you would be available to speak with me as well as the best phone number for me to contact you. I'll email you back to confirm.&lt;br /&gt;&lt;br /&gt;I hope to hear from you soon!&lt;br /&gt;&lt;br /&gt;Cheers,&lt;br /&gt;JF&lt;br /&gt;Google Staffing&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;During that phone interview the recruiter shared a Google doc which I was instructed to complete in about 45 minutes…&lt;br /&gt;&lt;br /&gt;&lt;a name='more'&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;I do not recall the exact questions, but they mostly dealt with asking me to rank ads with web pages results (e.g., if I do a Google search for “lock” and an ad for a locksmith appears, how relevant to the search term was that ad?). It also gave me topics and asked me to provide example web pages that matched the topics.&lt;br /&gt;&lt;br /&gt;Once done with this task it was clear that the position I was applying for was not a linguist in any traditional sense. &lt;br /&gt;&lt;br /&gt;After they reviewed my answers I was granted a phone interview with a “Linguist” at Google that occurred about a week after the online doc interview.&lt;br /&gt;&lt;br /&gt;During this week I snooped around and discovered some interesting facts: The back story is that Google acquired a company called &lt;a href="http://newsbreaks.infotoday.com/nbreader.asp?ArticleID=16713"&gt;Applied Semantics in 2003&lt;/a&gt;.  This company was based in Santa Monica, so the office stayed there.  Applied Semantics created AdSense, a Latent Semantic Analysis-style algorithm to compare the linguistic similarity of web pages and ads.  That company hired people they called "linguists" to 1) evaluate the quality of the comparisons and 2) build and test taxonomies of web pages and ads.  These people now work for Google.  They also routinely hire people on 1 year contracts to act as test evaluators (see &lt;a href="http://www.google.com/intl/en/jobs/uslocations/santa-monica/custsupport/quality-evaluator-adsense-policy-temporary-santa-monica/"&gt;here&lt;/a&gt;), but I was interviewing for a full-time, permanent position.&lt;br /&gt;&lt;br /&gt;Before the trip, I had a phone interview with LS in the taxonomy group. He had a PhD in formal semantics (I’m pretty sure he was a UCLA linguistics alum). I would say his academic background more closely matched his Google work than anyone else.  I don’t recall the specifics of that interview, but I must have done well enough to be invited to fly to Santa Monica for a full day of interviews in late February 2008.&lt;br /&gt;&lt;br /&gt;&lt;i&gt;Hi Christopher,&lt;br /&gt;&lt;br /&gt;We are very excited that you are coming to talk with us about opportunities here at Google!  I've scheduled your interviews per the availability that you gave to JF. We look forward to seeing you on Friday, February 29, 2008 at 10:45am.  Your interviews will last approximately 2 to 5 hours.  Please ask for MC upon arrival.&lt;br /&gt;&lt;br /&gt;Google takes an academic approach to its interviewing process.  This means that we are interested in your thought process, your approach to problem solving, and in your programming skills as well.  You may also be asked questions that relate to algorithms, data structures, and distributed systems.  The dress is business casual/something you are comfortable in (we are more interested in what you have to say than what you are wearing).&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;The trip began oddly. I was booked into a more expensive hotel than I was originally told (I had to pay for it myself and wait for re-imbursement). Plus, my whole coast-to-coast trip was 36 hours long. Little time to adjust. &lt;br /&gt;&lt;br /&gt;I want to give my general impressions of those Google interviews.  Nothing specific (because I don’t recall the specifics, hehe) but rather the "vibe" I got.  Keep in mind I interviewed with what they call "linguists", not with any engineers, so these are people outside their core employee constituency.  I’m pretty sure most, if not all, of the people I interviewed with were hired by the previous company Applied Semantics prior to the Google acquisition.&lt;br /&gt;&lt;br /&gt;Generally speaking, I walked away thinking I was qualified to perform any and all of the tasks they described (at least as qualified as anyone I met with given their own backgrounds).  I felt like all of these people were smart, but performing tasks for which they had no special training. &lt;br /&gt;&lt;br /&gt;I met with five people on-site in addition to my phone interview with LS earlier.  One had a PhD in linguistics but on a theoretical topic and one was ABD in art history (both non-empirical methods training) and the phone interview guy had a PhD in linguistics and formal semantics (again, non-empirical training) but he had done empirical, lab-based linguistics work as a post-doc at a well respected east coast university.  One person had a B.A in linguistics and another had a B.A. in literature and classics.  One guy had a B.A. in French and literature and was a professional translator before being hired.&lt;br /&gt;&lt;br /&gt;Almost all of them had been hired prior to the Google acquisition, so they mostly had 6 or more years of experience doing the job, but none seemed to have prior training that matched. I was encouraged by this as my own background did not match what they did. Apparently, that was fine with them.&lt;br /&gt;&lt;br /&gt;There were two "groups" of linguists that I met with (literally in different buildings a couple blocks from each other in downtown Santa Monica (near the 3rd street promenade) and very close to the beach.  Group one dealt mostly with the taxonomy, or categorization, of web pages and ads (e.g., is web page X a kind of sports page or auction page, etc).  The second group dealt with human evaluation of relevance of ad + search correspondence (e.g., ‘how relevant is ad X to web page Y?’).&lt;br /&gt;&lt;br /&gt;The first interview I had during the trip was with a taxonomy guy M. who had a BA in French and was a translator who was hired pre-acquisition.  He had been with this group for almost 10 years, and he was now a "project manager" and he had the authority to originate his own projects.  We discussed a series of things that I felt were close to what I am good at and I liked that interview the best.  But, I realized that this guy I was talking to had had little background for the stuff he was doing.  He must have learned entirely on-the-job.  He asked me two semi-technical questions:  He asked me to define “precision” and “recall” (easy) and then he asked me which would be better to use on a new project and why, ASCII or UTF-8 (UTF-8).  He described two projects he was working on (one involved trying to auto-detect compound nouns and their relationships, like the difference between “hotdog” and “wedding cake”) and I thought to myself, "why is a guy like you working on that?  These are computational linguistics problems. There are people who have solid training in this.  Why is a French translator doing this?" &lt;br /&gt;&lt;br /&gt;Part of the answer to this is the corporate culture of Google.  They let anyone at his level initiate any project they want.  He just has to show some kind of results at some point.  That probably works great when you have a small group of talented engineers working on tasks within their field of specialization, but when you get so big that you let French translators try to solve problems that PhDs in computational linguistics try to solve, it's bound to go bad.&lt;br /&gt;&lt;br /&gt;The second guy I met with, J., only had lunch with me so it wasn't really an "interview".  He had a B.A. in lit and classics. I got the worst vibe from him.  He wasn't very talkative and I got the impression he wasn't very ambitious.  He had been there as long as the first guy, but seemed to be doing lower level tasks, and I think he liked it that way.  I felt I had to work a bit too hard to get him to talk, like he was barely tolerating my presence.&lt;br /&gt;&lt;br /&gt;After lunch I moved to the second group who deal with human evaluators where I met with three people.  I liked everyone I met. They were all responsible for designing online studies to test relevance.  These tests were web based human evaluations.  I had spent a few years tangentially associated with a psycholinguistics lab in grad school.  That lab ran experiments on human subjects testing the natural processing of sentences.  I also taught a psycholinguistic course.  I have a modest background in testing methodology.  I'm not a pro by any means, but at least I've had some exposure to experimental methods.  &lt;br /&gt;&lt;br /&gt;I felt like most of these people had nothing even close to training in experimental design.  The first person, TW, had a BA in linguistics from Berkeley.  That’s a nice degree, but it’s unlikely she had much experimental design experience at that level. Unfortunately, I was also suffering a bit of a post-lunch lull in energy and jet lag during her interview, so my concentration and energy was off and I fumbled a bit with some of her hypothetical situations. I felt I did the worst with her.  &lt;br /&gt;&lt;br /&gt;The second evaluation guy, C., was ABD in art history.  I tended to over-talk with him and he had to cut me off a few times in order to ask more questions.  I may have been over-compensating given my weak performance with TW just before. The final guy, A.M., had a PhD in linguistics (again, from UCLA).  His diss was an OT analysis of something.  His advisors were Steriade and Hayes as I recall.  We seemed to have the most fun together and laughed quite a bit (I was also loopy from jet lag and a full day of interviews).  He accidentally stabbed me with his pencil at one point.  It didn’t hurt, but I joked about it.  “The guy’s almost outta here and you killed him!”&lt;br /&gt;&lt;br /&gt;All of the second group’s interviews went roughly the same.  Each one kept asking me how I would design an evaluation for specific kinds of relevancy tasks.  Essentially, it was a bunch of what-if questions.  They all seemed most interested in my ability to understand experimental methodology.&lt;br /&gt;&lt;br /&gt;After the day was done, I walked to the beach (it was over 70 mdegrees and I was about to fly back to the east coast with temps in the teens). I literary took my shoes off and just stood in the warm Pacific ocean for an hour.&lt;br /&gt;&lt;br /&gt;So, they’ve got a French translator doing computational linguistics and an art historian doing experimental human evaluation.  Hmmmm?  What to make of this?&lt;br /&gt;&lt;br /&gt;I don’t want to give the impression that I think a person’s college degree pigeon-holes their career.  Not at all; my own career proves the opposite, but Google is famous for hiring highly trained subject matter experts.  The Google Linguists are different.  They are not doing linguistics. Therefore, my own education and experience should have been no more of an issue than the people with whom I interviewed.  I feel my resume was at least as good as anyone there for the tasks they described.&lt;br /&gt;&lt;br /&gt;Another  suspicious part was the fact that at least four of these interviewers asked me if I would be comfortable doing all the little, day-to-day tasks associated with the job.  M. even went so far as to say “we support the engineers”.  The message I received was that they are low men on the totem pole.  I gave the same answer I always give to this question: This is what being a professional means.  You do whatever tasks it takes to get the job done. &lt;br /&gt;&lt;br /&gt;I left feeling very good. I felt my resume matched well with the people I met and the interviews mostly went well. Three years later, I still haven’t heard from them. Any chance they’re still mulling it over?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-6283861525224022291?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/6283861525224022291/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=6283861525224022291' title='7 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/6283861525224022291'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/6283861525224022291'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/04/google-linguist-interview.html' title='Google linguist interview'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>7</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-2495254720172959385</id><published>2011-03-16T07:33:00.003-04:00</published><updated>2011-03-23T09:19:58.146-04:00</updated><title type='text'>open science</title><content type='html'>Recently in North Carolina, &lt;a href="http://twitter.com/#!/moximer"&gt;moximer&lt;/a&gt; &amp;amp; David Dobbs and others &lt;a href="http://www.wired.com/wiredscience/?p=54257&amp;amp;preview=true"&gt;discussed&lt;/a&gt; the value of opening up science research (such that all research is freely available for searching and interpretation, even draft versions and failed experiments, at least under the strong proposal). It's an interesting discussion (audio is a bit crappy, but whaddayagonnado?):&lt;br /&gt;&lt;br /&gt;&lt;iframe frameborder="0" height="225" src="http://player.vimeo.com/video/20039742" width="400"&gt;&lt;/iframe&gt;&lt;br /&gt;&lt;a href="http://vimeo.com/20039742"&gt;What's Keeping Us from Open Science? Is It the Powers That Be, Or Is It... Us?&lt;/a&gt; from &lt;a href="http://vimeo.com/user2419982"&gt;Smartley-Dunn&lt;/a&gt; on &lt;a href="http://vimeo.com/"&gt;Vimeo&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Hence, I thought it might be nice to list some open source journals offering free access to scientific research:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://www.plos.org/"&gt;PLoS &lt;/a&gt;is a nonprofit organization of scientists and physicians committed to making the world's scientific and medical literature a freely available public resource.&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://www.archive.org/"&gt;The Internet Archive&lt;/a&gt;, a 501(c)(3) non-profit, is building a digital library of Internet sites and other cultural artifacts in digital form. Like a paper library, we provide free access to researchers, historians, scholars, and the general public.&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://citeseer.ist.psu.edu/"&gt;CiteSeer&lt;/a&gt;: The NEC Scientific Literature Digital Library incorporating autonomous citation indexing, awareness and tracking, citation context, related document retrieval.&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://arxiv.org/"&gt;arXiv.org&lt;/a&gt; e-Print archive: Open access to 664,014 e-prints in Physics, Mathematics, Computer Science, Quantitative Biology, Quantitative Finance and Statistics.&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://www.doaj.org/"&gt;Directory of open access journals&lt;/a&gt;:&amp;nbsp;This service covers free, full text, quality controlled scientific and scholarly journals. We aim to cover all subjects and languages. There are now 6271 journals in the directory. Currently 2722 journals are searchable at article level.&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/Category:Open_access_journals"&gt;Wikipedia Open access journals&lt;/a&gt;. A list of open access journals...&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;Google "&lt;a href="http://www.google.com/search?rlz=1C1RNNN_enUS357US361&amp;amp;sourceid=chrome&amp;amp;ie=UTF-8&amp;amp;q=open+journals"&gt;open journals&lt;/a&gt;"&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://www.ssrn.com/csn/index.html"&gt;Cognitive Science Network&lt;/a&gt; directed by Mark Turner (HT &lt;a href="http://twitter.com/#!/SportLinguist"&gt;Sport Linguist&lt;/a&gt;).&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://www.knowmade.com/free-fulltext-pdf.html"&gt;Free Full Text&lt;/a&gt;: a search engine returning full text scientific articles with no access fees.&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-2495254720172959385?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/2495254720172959385/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=2495254720172959385' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/2495254720172959385'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/2495254720172959385'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/03/open-science.html' title='open science'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-8354149882682043659</id><published>2011-03-12T12:40:00.000-05:00</published><updated>2011-03-12T12:40:31.123-05:00</updated><title type='text'>Korean in Killeen</title><content type='html'>Having spent nearly 4 months of the last year and a half working at Fort Hood, in &lt;a href="http://en.wikipedia.org/wiki/Killeen,_Texas"&gt;Killeen Texas&lt;/a&gt;, I finally decided to leave the safe confines of the hotel-centric chain restaurants and Target/Wal-Mart shopping centers and take a drive to historic downtown Killeen. I found pretty much what I expected to find, empty one storey store fronts, dusty unused parking spaces, and lots of lots of &lt;a href="http://en.wikipedia.org/wiki/Hangul"&gt;Hangul &lt;/a&gt;... (screeching sound) ... huh?&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh5.googleusercontent.com/-J7PoidQh6jg/TXuuyozj1II/AAAAAAAAAZY/APQ5q9999Eg/s1600/2011-03-12_09-59-29_952.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="180" src="https://lh5.googleusercontent.com/-J7PoidQh6jg/TXuuyozj1II/AAAAAAAAAZY/APQ5q9999Eg/s320/2011-03-12_09-59-29_952.jpg" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;Yep, turns out historic downtown Killeen, heartland of America, is being somewhat revitalized by Korean immigration. My favorite grocery store by far is the Korean &lt;a href="http://www.yelp.com/biz/o-mart-killeen"&gt;O-Mart&lt;/a&gt; (not the one pictured above, btw), where I can find genuinely fresh vegetables and dumplings (as well as shitake mushrooms, plenty of seaweed for soup, and a wide array of spicy sauces that I have been eagerly experimenting with).&lt;br /&gt;&lt;br /&gt;It was a nice lesson in American multi-linguialism.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-8354149882682043659?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/8354149882682043659/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=8354149882682043659' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8354149882682043659'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8354149882682043659'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/03/korean-in-killeen.html' title='Korean in Killeen'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='https://lh5.googleusercontent.com/-J7PoidQh6jg/TXuuyozj1II/AAAAAAAAAZY/APQ5q9999Eg/s72-c/2011-03-12_09-59-29_952.jpg' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-879313540919528342</id><published>2011-03-07T08:33:00.001-05:00</published><updated>2011-03-07T08:33:00.492-05:00</updated><title type='text'>turning gaga into water = 200 terabytes</title><content type='html'>How much storage would it take to store the first 5 years of a child's linguistic environment? Apparently, 200 terabytes. From &lt;a href="http://www.fastcompany.com/1733627/mit-scientist-captures-his-sons-first-90000-hours-on-video?partner=rss&amp;amp;utm_source=feedburner&amp;amp;utm_medium=feed&amp;amp;utm_campaign=Feed:+fastcompany/headlines+(Fast+Company+Headlines)"&gt;Fast Company&lt;/a&gt;: &lt;br /&gt;&lt;br /&gt;...&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;cognitive scientist &lt;a href="http://web.media.mit.edu/~dkroy/bio/index.html"&gt;Deb Roy&lt;/a&gt; Wednesday shared a remarkable experiment that hearkens back to an earlier era of science using brand-new technology. From the day he and his wife brought their son home five years ago, the family's every movement and word was captured and tracked with a series of fisheye lenses in every room in their house. The purpose was to understand how we learn language, in context, through the words we hear. A combination of new software and human transcription called Blitzscribe allowed them to parse &lt;b&gt;200 terabytes of data&lt;/b&gt; to capture the emergence and refinement of specific words in Roy’s son’s vocabulary&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;The data visualization techniques he uses are pretty cutting edge ... and awesome! I love the fact that he is trying to use visualization techniques to help us understand something beyond raw statistics (which is where most graphs and pie charts die &amp;nbsp;miserable deaths). Statistics are like molecules. Visualize them one by one and it's difficult for the average person to conceptualize the big picture of how they work together to create a grander whole. Roy appears to be trying to get beyond the yawn-inducing graphs that plague modern science. I mean, he uses freaky-deaky &lt;i&gt;time-worms&lt;/i&gt;! How cool is that!&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh6.googleusercontent.com/-7DcQ7lsAKhs/TXRWtANxVbI/AAAAAAAAAZU/TkV1q0KjZkM/s1600/time+worm.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="64" src="https://lh6.googleusercontent.com/-7DcQ7lsAKhs/TXRWtANxVbI/AAAAAAAAAZU/TkV1q0KjZkM/s320/time+worm.jpg" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;Roy talk's about feed-back loops as well:&lt;br /&gt;&lt;br /&gt;..."&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Caregiver speech dipped to a minimum and slowly ascended back out in complexity.” In other words, when mom and dad and nanny first hear a child speaking a word, they unconsciously stress it by repeating it back to him all by itself or in very short sentences. Then as he gets the word, the sentences lengthen again. The infant shapes the caregivers’ behavior, the better to learn&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;He gave a TED talk recently, but the video is not yet available.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-879313540919528342?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/879313540919528342/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=879313540919528342' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/879313540919528342'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/879313540919528342'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/03/turning-gaga-into-water-200-terabytes.html' title='turning &lt;i&gt;gaga&lt;/i&gt; into &lt;i&gt;water&lt;/i&gt; = 200 terabytes'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='https://lh6.googleusercontent.com/-7DcQ7lsAKhs/TXRWtANxVbI/AAAAAAAAAZU/TkV1q0KjZkM/s72-c/time+worm.jpg' height='72' width='72'/><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-812885820223064192</id><published>2011-03-03T08:33:00.001-05:00</published><updated>2011-03-03T08:33:00.498-05:00</updated><title type='text'>Hosni prefers "Hosny" in transliterated attire</title><content type='html'>Rachel Maddow et al. &lt;a href="http://www.msnbc.msn.com/id/26315908/#41880301"&gt;discovered a delicious gem&lt;/a&gt; fit for the annals of transliteration. Namely, how to write a specific Arabic name in the &lt;a href="http://en.wikipedia.org/wiki/Latin_alphabet"&gt;Roman alphabet&lt;/a&gt; (what we English speakers like to call "regular spelling"). She (and her staff) reported that &lt;i&gt;Hosni Mubarak&lt;/i&gt; attended a head-of-state meeting in Albania a couple years ago wearing the world's most narcissistic pinstriped suit*, where the pin stripes were actually composed of lines of his name written in Roman alphabetic transliteration (this man really knows how to live the life of a tyrant, am I right?):&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh4.googleusercontent.com/-aXHlAVp4cIs/TW8bFoJiDJI/AAAAAAAAAZE/NlTWDaMLkDk/s1600/hosni-hosny.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="218" src="https://lh4.googleusercontent.com/-aXHlAVp4cIs/TW8bFoJiDJI/AAAAAAAAAZE/NlTWDaMLkDk/s320/hosni-hosny.jpg" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;It is a troublesome fact of human language that writing the damned thing down is never easy. It's difficult enough to construct a writing system that is consistent for a single language, more difficult still to take a linguistic term (like a person's name) and write it down in a script which was not designed for that particular language. So when English language writers (like journalists) have to write down Arabic names in "regular spelling" they inevitably face difficult choices about which letters to use to represent particular sounds. Vowels are particularly difficult creatures to pin down with alphabetic rope (e.g., the whole &lt;i&gt;and sometimes y&lt;/i&gt; fiasco).&lt;br /&gt;&lt;br /&gt;The act of writing a linguistic term in a foreign script is called &lt;a href="http://en.wikipedia.org/wiki/Transliteration"&gt;transliteration&lt;/a&gt;, and it's troublesome enough to have spawned a cottage industry sub-field within computational linguistics. For example, if you wanted to Google information about the currently exiled president of Egypt, you would be wise to Google the term "Hosni Mubarak." That is by far the most common spelling of the man's name on the internet (by a better than 20-1 margin, at least according to Google hit counts). Even if you choose the "Hosny" variant, you're basically just redirected to the "Honsi" results anyway. Yet the tyrant himself, ever the maverick, prefers the road less traveled.&lt;br /&gt;&lt;br /&gt;Sadly, there's not much more to say about this than to emphasize the simple fact that transliteration is largely arbitrary and disputes about guidelines are largely trivial. Just flip a coin and move on ... (I just seriously pissed off the world's four transliteration experts).&lt;br /&gt;&lt;br /&gt;...and in closing I'd like to &lt;a href="http://twitpic.com/42sekk"&gt;repeat my assertion&lt;/a&gt; that Hosni/y Mubarak looks suspiciously like The Face of Bo**:&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh4.googleusercontent.com/-qCqoGB6q_kQ/TW8c0oeC74I/AAAAAAAAAZQ/762euouHWHE/s1600/hosni+face+of+bo.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="240" src="https://lh4.googleusercontent.com/-qCqoGB6q_kQ/TW8c0oeC74I/AAAAAAAAAZQ/762euouHWHE/s320/hosni+face+of+bo.jpg" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;*FYI, I have no independent verification of the truth of this story. If Maddow's staff got punk'd, their bad.&lt;br /&gt;**Damn you Captain Jack!!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-812885820223064192?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/812885820223064192/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=812885820223064192' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/812885820223064192'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/812885820223064192'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/03/hosni-prefers-hosny-in-transliterated.html' title='Hosni prefers &quot;Hosny&quot; in transliterated attire'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='https://lh4.googleusercontent.com/-aXHlAVp4cIs/TW8bFoJiDJI/AAAAAAAAAZE/NlTWDaMLkDk/s72-c/hosni-hosny.jpg' height='72' width='72'/><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-1571370406543300803</id><published>2011-02-23T10:33:00.001-05:00</published><updated>2011-02-23T10:33:00.457-05:00</updated><title type='text'>the linguistics of 404 FILE NOT FOUND</title><content type='html'>&lt;a href="http://www.ibiblio.org/gutenberg/cgi-bin/sdb/t9.cgi"&gt;A cute site providing humorous translations&lt;/a&gt; of the world's most frustrating search result. Personal favs:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;American South&lt;/b&gt; - &lt;i&gt;Ah cain't find th' page yer lookin' fer&lt;/i&gt;.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Australia &lt;/b&gt;- &lt;i&gt;Strewth mate yer bloody page has shot through&lt;/i&gt;.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Blond&lt;/b&gt; - &lt;i&gt;like omg! ur file has not been found, go paint ur nails and try back later, lol^^....I FOUND A QUARTER&lt;/i&gt;!&lt;/li&gt;&lt;li&gt;&lt;b&gt;Cockney&lt;/b&gt; - &lt;i&gt;No chance luv, carrnt find it neever&lt;/i&gt;.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Pirate&lt;/b&gt; - &lt;i&gt;Haaarr, Lubber! I've sailed yon seas with toil and trial, and yet I cannot find ye file&lt;/i&gt;!&lt;/li&gt;&lt;li&gt;&lt;b&gt;Pittsburghese&lt;/b&gt; - &lt;i&gt;This page needs fixed n'at... it's all caddywhompus! Yinz needs look somewheres else&lt;/i&gt;.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Zombie&lt;/b&gt; - &lt;i&gt;Arrgrg 404 BrAiNs aAAArrggh No ggrrgrh page brAiNz heRe BrAAAAIIINNSSSS&lt;/i&gt;!&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-1571370406543300803?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/1571370406543300803/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=1571370406543300803' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/1571370406543300803'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/1571370406543300803'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/02/linguistics-of-404-file-not-found.html' title='the linguistics of 404 FILE NOT FOUND'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-8954055565294675042</id><published>2011-02-23T08:33:00.001-05:00</published><updated>2011-02-23T08:33:00.172-05:00</updated><title type='text'>fuck C++</title><content type='html'>&lt;a href="http://andrewvos.com/2011/02/21/amount-of-profanity-in-git-commit-messages-per-programming-language/"&gt;Andrew Vos&lt;/a&gt; provides us with valuable data analysis of the correlation between programming languages and profanity:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;The plan was to find out how much profanity I could find in commit messages, and then show the stats by language. These are my findings:&amp;nbsp;Out of 929857 commit messages, I found 210 swear words (using George Carlin's Seven dirty words)&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-0yKTk-KGjF4/TWRzXiB4x8I/AAAAAAAAAZA/Kdhnrb5M_bk/s1600/profanity-bar-chart.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="192" src="http://4.bp.blogspot.com/-0yKTk-KGjF4/TWRzXiB4x8I/AAAAAAAAAZA/Kdhnrb5M_bk/s320/profanity-bar-chart.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;Oh, Python, beautiful Python ... no wonder the &lt;a href="http://www.nltk.org/"&gt;NLTK &lt;/a&gt;guys chose it as their NLP language of choice.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-8954055565294675042?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/8954055565294675042/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=8954055565294675042' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8954055565294675042'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8954055565294675042'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/02/fuck-c.html' title='fuck C++'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/-0yKTk-KGjF4/TWRzXiB4x8I/AAAAAAAAAZA/Kdhnrb5M_bk/s72-c/profanity-bar-chart.png' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-8347772411568181411</id><published>2011-02-20T10:16:00.000-05:00</published><updated>2011-02-20T10:16:38.339-05:00</updated><title type='text'>economists are bad linguists</title><content type='html'>&lt;a href="http://twitter.com/#!/techczech"&gt;Dominik Lukes&lt;/a&gt; at &lt;a href="http://metaphorhacker.net/2011/02/the-most-ridiculous-metaphor-of-education/"&gt;Metaphor Hacker&lt;/a&gt; has a thorough discussion of Harvard economist Ed Glaeser's mis-use of metaphor theory by trying to use NYC restaurants as a metaphor for schools. Lukes teases out the mis-mappings that Glaeser fails to recognize. Money quote:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;[Restaurants] also use a number of tricks to make the dining experience better – cheat on ingredients, serve small portions on large plates, etc. They rely on ‘secret recipes’ – the last thing we want to see in education. And this is exactly the experience of schools that compete in the market. They fudge, cheat and flat out lie to protect their competitive advantage.  They provide the minimum of education that they can get away with to look good. Glaeser, as he conveniently forgets, there is a huge amount of centralized oversight of New York restaurants – much more, in some ways, than on charter schools. &lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;The full discussion is thorough and well worth reading.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-8347772411568181411?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/8347772411568181411/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=8347772411568181411' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8347772411568181411'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8347772411568181411'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/02/economists-are-bad-linguists.html' title='economists are bad linguists'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-3367396036403440332</id><published>2011-02-18T08:33:00.000-05:00</published><updated>2011-02-18T08:33:00.322-05:00</updated><title type='text'>evolution = chaos?</title><content type='html'>&lt;a href="http://kottke.org/11/02/the-evolution-of-lines"&gt;Kottke&lt;/a&gt; points to a graphical variation of the &lt;a href="http://en.wikipedia.org/wiki/Chinese_whispers"&gt;Chinese whispers&lt;/a&gt; game whereby an original sign (in this case, a line drawn by a human) is rapidly degraded by multiple repetitions (the more people try to repeat the original line, the less &lt;i&gt;line-like&lt;/i&gt; it becomes, eventually degrading into chaos). &lt;br /&gt;&lt;br /&gt;&lt;iframe frameborder="0" height="300" src="http://player.vimeo.com/video/18998570" width="400"&gt;&lt;/iframe&gt;&lt;br /&gt;&lt;a href="http://vimeo.com/18998570"&gt;A Sequence of Lines Traced by Five Hundred Individuals&lt;/a&gt; from &lt;a href="http://vimeo.com/clementvalla"&gt;clement valla&lt;/a&gt; on &lt;a href="http://vimeo.com/"&gt;Vimeo&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Kottke marvels that "&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;The lines get really messy surprisingly fast [...] this is a nice demonstration of evolution&lt;/span&gt;&lt;/i&gt;."&lt;br /&gt;&lt;br /&gt;But is it? Is it the case that evolution leads to chaos*? I don't think so. Evolution leads to variation and change, sure, but chaos? The difference between evolution and this line transformation, I think, is pressures. In evolution there are pressures that greatly effect which changes last more than one generation and hence become &lt;s&gt;permanent&lt;/s&gt; stable. But in this game, there are no pressures, as far as I can tell. There is no survival of the fittest because each turn gets to survive for exactly one generation with no pressure to be fitter than another in order to persist beyond one generation. So this exercise, cute as it may be, does not resemble evolution at all, I don't think.&lt;br /&gt;&lt;br /&gt;*or &lt;i&gt;messiness&lt;/i&gt; in Kottke's phrasing&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-3367396036403440332?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/3367396036403440332/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=3367396036403440332' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3367396036403440332'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3367396036403440332'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/02/evolution-chaos.html' title='evolution = chaos?'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-2687800344493718339</id><published>2011-02-14T22:18:00.000-05:00</published><updated>2011-02-14T22:18:56.635-05:00</updated><title type='text'>Ngram Viewer sucks, true dat</title><content type='html'>proof positive....&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-QfYodT9yxYo/TVnwjHjH8HI/AAAAAAAAAY8/s1600/ngram_tru_dat.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://4.bp.blogspot.com/-QfYodT9yxYo/TVnwjHjH8HI/AAAAAAAAAY8/s1600/ngram_tru_dat.png" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/-dmhJnQETs7s/TVnwZhHShTI/AAAAAAAAAY4/t-aToE0b7UI/s1600/ngram_tru_dat.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="117" src="http://1.bp.blogspot.com/-dmhJnQETs7s/TVnwZhHShTI/AAAAAAAAAY4/t-aToE0b7UI/s320/ngram_tru_dat.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;a href="http://www.urbandictionary.com/define.php?term=true%20dat"&gt;true dat&lt;/a&gt;...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-2687800344493718339?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/2687800344493718339/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=2687800344493718339' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/2687800344493718339'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/2687800344493718339'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/02/ngram-viewer-sucks-true-dat.html' title='Ngram Viewer sucks, &lt;i&gt;true dat&lt;/i&gt;'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/-QfYodT9yxYo/TVnwjHjH8HI/AAAAAAAAAY8/s1600/s72-c/ngram_tru_dat.png' height='72' width='72'/><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-1639162600949370409</id><published>2011-02-08T09:33:00.001-05:00</published><updated>2011-02-08T09:33:00.104-05:00</updated><title type='text'>Linguist List FAIL</title><content type='html'>I've been kicked around a few NLP blocks in my time so I've developed a sixth sense about what employers are looking for when they post job announcements. When I read &lt;a href="http://linguistlist.org/jobs/get-jobs.cfm?JobID=83202&amp;amp;SubID=4493277"&gt;this one from Intelius&lt;/a&gt; on The Linguist List today, my reaction was clear, concise, and unconditional: &lt;b&gt;This is NOT for linguists&lt;/b&gt;. &lt;br /&gt;&lt;br /&gt;This posting says &lt;i&gt;engineers only&lt;/i&gt; to me! There's nothing wrong with that, but why use the Linguist Lists' job postings board with a job that no actual linguist will be considered for? My reaction is based on what I consider to be engineering &lt;a href="http://en.wikipedia.org/wiki/Dog-whistle_politics"&gt;dog-whistles&lt;/a&gt; that are designed to encourage the "right" people to apply (i.e., engineers) and the wrong people to go away (i.e., linguists).&lt;br /&gt;&lt;br /&gt;A quick breakdown of their rhetorical dog-whistles:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;The Data Research Group is a team of &lt;b&gt;scientists&lt;/b&gt; at Intelius&lt;/span&gt;&lt;/i&gt;... Much as I would like linguists to be considered scientists, the truth is, in the "real world" of job announcements, they are not. This is a red flag.&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Team members have published papers in top &lt;b&gt;research &lt;/b&gt;conferences&lt;/span&gt;&lt;/i&gt;...Ah hah, not "conferences" per se, but "research conferences". This means &lt;a href="http://www.aclweb.org/"&gt;ACL&lt;/a&gt;.&lt;/li&gt;&lt;li&gt;Mentors will include Dr. &lt;a href="http://www.cs.cmu.edu/~vitor/"&gt;Vitor R. Carvalho&lt;/a&gt; and Dr. &lt;a href="http://www.cs.nyu.edu/web/Research/Theses/borthwick_andrew.pdf"&gt;Andrew Borthwick&lt;/a&gt; (diss PDF)... NOT linguists.&lt;/li&gt;&lt;li&gt;Required Skills: &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Strong hands-on skills in Java and/or Python&lt;/span&gt;&lt;/i&gt;... i.e., we assume you lay awake at night worrying about &lt;i&gt;arrays &lt;/i&gt;and &lt;i&gt;functions&lt;/i&gt;, not &lt;i&gt;unnaccusative &lt;/i&gt;marking and &lt;i&gt;tone sandhi&lt;/i&gt;.&amp;nbsp;&lt;/li&gt;&lt;li&gt;Required Skills: &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Self-motivated, creative, and independent researching skills&lt;/span&gt;&lt;/i&gt; ... we will teach you nothing. You are on your own. Your teachers are gone. What can you give us?&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;FYI: Recently, &lt;a href="http://www.blogger.com/profile/14505565281151328789"&gt;bulbul&lt;/a&gt; has quite rightly &lt;a href="https://www.blogger.com/comment.g?blogID=520807396714463309&amp;amp;postID=4123261285193149310"&gt;taken me to task&lt;/a&gt; for being a tad hypocritical in arguing two seemingly contradictory points: (1) that 21st Century &lt;a href="http://thelousylinguist.blogspot.com/2010/01/why-linguists-should-study-math.html"&gt;linguists should study math&lt;/a&gt; and (2) that the time consuming effort of learning computational tools is a &lt;a href="http://thelousylinguist.blogspot.com/2011/02/why-we-need-good-tools.html"&gt;deterrent to being a linguist&lt;/a&gt;.  I can imagine this post as falling victim to that same complaint. My pre-defense is that I believe there is a skill set distinct to linguists that is valuable and worthy of investment by NLP capitalists that has been largely ignored.&lt;br /&gt;&lt;br /&gt;Engineers alone will not solve the critical language issues necessary to create the great products of the next generation of NLP tools. I believe in team building where linguists and engineers work together as equals&amp;nbsp;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-1639162600949370409?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/1639162600949370409/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=1639162600949370409' title='9 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/1639162600949370409'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/1639162600949370409'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/02/linguist-list-fail.html' title='Linguist List &lt;b&gt;FAIL&lt;/b&gt;'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>9</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-4615430295592592119</id><published>2011-02-08T07:33:00.001-05:00</published><updated>2011-02-08T07:33:00.050-05:00</updated><title type='text'>our foundational tongues?</title><content type='html'>A commentator at &lt;a href="http://andrewsullivan.theatlantic.com/the_daily_dish/2011/02/physical-health-break-update-ctd.html"&gt;The Daily Dish&lt;/a&gt; writes: &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;I recently learned that in &lt;b&gt;our foundational tongues&lt;/b&gt; of Latin, Greek, and Hebrew the words for breath and spirit are one and the same: spiritus, pneuma, and ruach&lt;/span&gt;&lt;/i&gt; [emphasis added].&lt;br /&gt;&lt;br /&gt;I'm not sure what the author had in mind for "our foundational tongues." Assuming the author is referring to English, then Latin, okay sure, Romance languages have had an important influence on English. Greek, less so. But Hebrew??? What's most striking is the notable lack of Germanic languages as "foundational." This author needs a Ling 101 class.&lt;br /&gt;&lt;br /&gt;And as for the author's claim about words for &lt;i&gt;breath&lt;/i&gt; and &lt;i&gt;spirit&lt;/i&gt; being the same, there is a related poetic pairing common to good ol' fashioned English. The word &lt;i&gt;breath&lt;/i&gt; is often used as a metonymy for &lt;i&gt;life&lt;/i&gt; or &lt;i&gt;spirit&lt;/i&gt;. Here are a few choice examples:&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;The Bard&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;b&gt;Henry V&lt;/b&gt; -- King Henry's &lt;a href="http://shakespeare.mit.edu/henryv/henryv.3.1.html"&gt;Once more unto the breach, dear friends&lt;/a&gt; speech (III, 1):&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Now set the teeth and stretch the nostril wide,&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;b&gt;Hold hard the breath and bend up every spirit&lt;/b&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;To his full height. On, on, you noblest English.&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Whose blood is fet from fathers of war-proof!&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;In my reading of this line, King Henry pairs &lt;i&gt;holding of the breath&lt;/i&gt; with &lt;i&gt;spiritual courage&lt;/i&gt; to draw a parallel between the two.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Hamlet &lt;/b&gt;-- Hamlet's Mother, Queen Gertrude, whilst arguing with her tortured son (III, 4):&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Be thou assured, if words be made of breath,&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;And breath of life, I have no life to breathe&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;What thou hast said to me&lt;/span&gt;&lt;/i&gt; .&lt;br /&gt;&lt;br /&gt;Prior to this line, Hamlet prods his mother to stop sleeping with his uncle/king and to "&lt;i&gt;break your own neck down.&lt;/i&gt;" In my reading of her lines, Gertrude connects the dots between words, breath, and spirit because of her son's harsh words. She is saying &lt;i&gt;it is not in my spirit to do what you are asking of me&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;And here is a really nice 2009 discussion of poetry and breath by Melissa Zeiger: &lt;a href="http://cww.oxfordjournals.org/content/3/2/158.full"&gt;Grace Paley's Poetics of Breath&lt;/a&gt;. Money quote:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;The Romantic poets reemphasized breath as a force in poetry, liking to imagine that poetic breath mediated between the human and the transcendent, as, famously, in Coleridge's “The Eolian Harp,” where the wind joins breath to participate in “one Life within us and abroad,/ Which meets all motion and becomes its soul&lt;/span&gt;&lt;/i&gt;” &lt;br /&gt;&lt;br /&gt;And this trope is not limited to Western literature either. The traditional Chinese concept of Qi is deeply rooted in an analogy of &lt;i&gt;breath = life&lt;/i&gt;. From &lt;a href="http://en.wikipedia.org/wiki/Qi"&gt;the Wikipedia page&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Qi is frequently translated as "energy flow". Qi is often compared to Western notions of energeia or élan vital (vitalism), as well as the yogic notion of prana, meaning vital life or energy, and pranayama, meaning control of breath or energy. The literal translation of "qi" is air, breath, or gas. Compare this to the original meaning of the Latin word "spiritus", meaning breathing; or the Koine Greek "πνεῦμα", meaning air, breath, or spirit; and the Sanskrit term "prana", meaning breath&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;What this suggests to me is that there is something deeply natural to our cognitive perceptions about this analogy between breath and life. It is natural for humans to perceive breathing and thinking to be related somehow. Without breath, you cannot think. Fair enough. But this might be a deeply human logic insofar as ants or dolphins may not conceive of this relationship in the same way. I blogged about this last year in&amp;nbsp;&lt;a href="http://thelousylinguist.blogspot.com/2010/02/dolphin-bikes-and-iconicity-effect.html#more"&gt;Dolphin-Bikes and The Iconicity Effect&lt;/a&gt;. I'm still waiting for a dolphin bike.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-4615430295592592119?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/4615430295592592119/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=4615430295592592119' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/4615430295592592119'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/4615430295592592119'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/02/our-foundational-tongues.html' title='our foundational tongues?'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-4123261285193149310</id><published>2011-02-06T09:33:00.003-05:00</published><updated>2011-02-06T09:43:57.222-05:00</updated><title type='text'>why we need good tools...</title><content type='html'>Because we're not all interested in being &lt;a href="http://www.r-project.org/"&gt;R&lt;/a&gt; experts. By far, the single most frustrating part of my own graduate linguistics experience was the fact that in order to study the kinds of linguistic phenomena I wanted to, I had to spend most of my time learning tools that I didn't actually care about, like Tgrep2, Perl, Python*, R, etc. As a linguist, I don't really give a damn about any of those things. They were all obstacles in my way. The more time I spent learning tools, the less interested in linguistics I became. I respect the hell out of engineers who build great tools that are valuable to linguists, but if those tools are not user friendly, I might as well scream into the darkness.&lt;br /&gt;&lt;br /&gt;Which is why I am impressed with The Stanford Visualization Group's recent &lt;a href="http://www.readwriteweb.com/hack/2011/02/datawrangler.php"&gt;Visualization Tool for Cleaning Up Data&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Another thing I often hear is that a large fraction of the time spent by analysts -- some say the majority of time -- involves data preparation and cleaning: transforming formats, rearranging nesting structures, removing outliers, and so on. (If you think this is easy, you've never had a stack of ad hoc Excel spreadsheets to load into a stat package or database!)&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;Yes, more help please.&lt;br /&gt;&lt;br /&gt;HT &lt;a href="http://twitter.com/#!/LingFan1"&gt;LingFan1&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;*Mad props to the &lt;a href="http://www.nltk.org/"&gt;NLTK&lt;/a&gt;!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-4123261285193149310?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/4123261285193149310/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=4123261285193149310' title='6 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/4123261285193149310'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/4123261285193149310'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/02/why-we-need-good-tools.html' title='why we need good tools...'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>6</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-8389848178463329668</id><published>2011-02-02T08:33:00.002-05:00</published><updated>2011-02-02T19:08:17.539-05:00</updated><title type='text'></title><content type='html'>Neuro-blogger Bradley Voytek posts a &lt;a href="http://blog.ketyov.com/2011/01/how-to-be-neuroscientist.html"&gt;&lt;span id="goog_1012523228"&gt;&lt;/span&gt;nice discussion&lt;span id="goog_1012523229"&gt;&lt;/span&gt;&lt;/a&gt; helping us all understand how to consume neuroscience in the news:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;In this post, I will teach you all how to be proper, skeptical neuroscientists. By the end of this post, not only will you be able to spot "neuro nonsense" statements, but you'll also be able to spot nonsense neuroscience questions&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;Well worth the read.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-8389848178463329668?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/8389848178463329668/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=8389848178463329668' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8389848178463329668'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8389848178463329668'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/02/neuro-blogger-bradley-voytek-posts-nice.html' title=''/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-8615036981101669712</id><published>2011-02-01T10:23:00.002-05:00</published><updated>2011-02-01T10:23:00.400-05:00</updated><title type='text'>my classic snowclone rant</title><content type='html'>As yet another winter storm threatens the US, lingo-tweeter cum lingo-grad student &lt;a href="http://gradstudents.wcas.northwestern.edu/~lma777/"&gt;Lauren Ackerman&lt;/a&gt; &lt;a href="http://twitter.com/#!/VerbingNouns"&gt;marvels&lt;/a&gt; at the media's lust for snowmageddon and terms of its ilk, and I was reminded of my own ruminations on the many words for snow in my own peculiar dialect (it helped that I spent 6 hours in near motionless traffic a few days ago while the DC metro region was castrated by a vicious and sudden sleet storm that halted traffic as well as sanity). So I offer this re-post from &lt;a href="http://thelousylinguist.blogspot.com/2010/02/my-many-words-for-snow.html"&gt;February 5 2010&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;As the snow descends upon Northern Virginia in the latest winter storm, and as DC's elite line-up at their local Whole Foods and Trader Joe's clutching their reusable bags filled with heavily packaged prepared meals, cardboard-container salads, 6 bottles of wine, and one bottle of water ('cause, ya know, it's an "emergency"), I am struck by the fact that &lt;a href="http://users.utu.fi/freder/Pullum-Eskimo-VocabHoax.pdf"&gt;the great Eskimo vocabulary hoax&lt;/a&gt; (pdf) is no hoax at all! It turns out that I too have a great many words for &lt;i&gt;snow&lt;/i&gt;. This evening, while running a few modest errands before the night's predicted 20 inch snow drop, I meticulously recorded the various terms I uttered as synonyms for the fluffy white stuff which descended, rather gracefully, upon the landscape. &lt;br /&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;&lt;b&gt;A few choice examples (NSFW):&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: black; font-weight: normal;"&gt;&lt;i&gt;&lt;/i&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: black; font-weight: normal;"&gt;&lt;i&gt;&lt;/i&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;a name='more'&gt;&lt;/a&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: black; font-weight: normal;"&gt;&lt;i&gt;shit&lt;/i&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-style: normal;"&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-style: normal;"&gt;"Why do people drive like such morons in this shit?"&lt;/span&gt;&lt;/i&gt;&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;"Hey asshole! This shit's not Vasoline! You can drive faster that 6 miles an hour!"&lt;/li&gt;&lt;/ul&gt;&lt;i&gt;crap&lt;/i&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;"This crap's gonna be piled up in disgusting dirty brown heaps for weeks."&lt;/li&gt;&lt;/ul&gt;&lt;i&gt;fuck&lt;/i&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;"Fuck these fucking fuckers who can't drive in this fuck!"&lt;/li&gt;&lt;/ul&gt;&lt;i&gt;asshole-shit-motherfucker*&lt;/i&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;"Ahhhh! You drive on this&amp;nbsp;asshole-shit-motherfucker like it's nuclear!"&lt;/li&gt;&lt;/ul&gt;&lt;i&gt;fucking-fuck-fuck&lt;/i&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;(directed at a plow driver) "push the fucking fuck fuck onto the curb, not back into the road!"&lt;/li&gt;&lt;/ul&gt;&lt;i&gt;grrrrrrr&lt;/i&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;"gawd I hate everybody! All of you! All because of this ... grrrrrrr!" (picture head exploding)&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;*&lt;i&gt;asshole-shit-motherfucker&lt;/i&gt; is actually quite productive in my dialect. It replaces a great many phrases.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: orange;"&gt;Addendum &lt;/span&gt;&lt;/b&gt;(1-31-2011): I want to see a&amp;nbsp;&lt;a href="http://www.visualthesaurus.com/howitworks/"&gt;Visual Thesaurus&lt;/a&gt;&amp;nbsp;word map of my words for snow!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-8615036981101669712?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/8615036981101669712/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=8615036981101669712' title='5 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8615036981101669712'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8615036981101669712'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/02/my-classic-snowclone-rant.html' title='my classic snowclone rant'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>5</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-336396382550683501</id><published>2011-01-31T20:41:00.004-05:00</published><updated>2011-01-31T23:38:57.108-05:00</updated><title type='text'>the false narrative of small town slang</title><content type='html'>There is a common critique of journalists that they often let an internal narrative color their reporting, to the point where they simply parrot back the narrative in their head rather than report the facts on the ground (see here for a &lt;a href="http://andrewsullivan.theatlantic.com/the_daily_dish/2011/01/dissent-of-the-day-1.html"&gt;discussion &lt;/a&gt;of this).&lt;br /&gt;&lt;br /&gt;My hometown of &lt;a href="http://en.wikipedia.org/wiki/Chico,_California"&gt;Chico&lt;/a&gt; got its spotlight in the sun recently because its favorite son Aaron Rogers is the star quarterback of the Packers about to play in the Super Bowl. Unfortunately, the &lt;a href="http://www.nytimes.com/2011/01/31/sports/football/31rodgers.html"&gt;NYT's article&lt;/a&gt; is a near perfect example of journalists letting a narrative do the talking when the facts blatantly contradict their claims:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;The usual slang words like awesome or cool are not heard much. Nice is in. As in: “You won the lottery? Nice.&lt;/span&gt;&lt;/i&gt;”&lt;br /&gt;&lt;br /&gt;The narrative this spins is that small towns are all &lt;a href="http://en.wikipedia.org/wiki/The_Andy_Griffith_Show"&gt;Mayberrys&lt;/a&gt; where everyone is pure and innocent and righteous and &lt;i&gt;better&lt;/i&gt; than them damned city-folk. It has been evoked routinely in political reporting.&lt;br /&gt;&lt;br /&gt;I'm a Chico boy. I graduated from Chico jr. High and walked across the street and graduated from Chico High then walked across the street and graduated from Chico State*. And I can assure you that &lt;i&gt;awesome &lt;/i&gt;and &lt;i&gt;cool &lt;/i&gt;are every bit as frequent there as anywhere else (personally, I had an unhealthy fondness for &lt;i&gt;hella &lt;/i&gt;back in 1987). And believe me, if you won the lottery in Chico, no one would say &lt;i&gt;nice&lt;/i&gt;. They would say, "No fukkin way! No fukkin way! Really! No fukkin way!" ... just like everywhere else.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;*no joke, those three schools are literally across the street from each other.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-336396382550683501?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/336396382550683501/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=336396382550683501' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/336396382550683501'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/336396382550683501'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/false-narrative-of-small-town-slang.html' title='the false narrative of small town slang'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-5265964386669440137</id><published>2011-01-31T09:34:00.001-05:00</published><updated>2011-01-31T22:53:38.377-05:00</updated><title type='text'>how (not) to do linguistics</title><content type='html'>Jonah Lehrer, the neuro-blogger, has a mixed track record, as far as I'm concerned. His initial blogging was nice, but a tad lightweight, then he started to sound a bit too &lt;i&gt;Malcom Gladwell&lt;/i&gt;-ee (in that I wasn't entirely sure he knew what he was talking about beyond having a few short phone calls with one or two scientists then babbling on about a topic).&lt;br /&gt;&lt;br /&gt;But he's hit a home run with this long New Yorker piece about the failure of the journal review process in science: &lt;a href="http://www.newyorker.com/reporting/2010/12/13/101213fa_fact_lehrer?currentPage=all"&gt;The Truth Wears Off&lt;/a&gt;. He draws examples from medicine, physics, and psychology.&lt;br /&gt;&lt;br /&gt;Perhaps the most disappointing part is the realization the the standards of testing and conclusiveness in linguistics are so far from those in more established science. &lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Before the effectiveness of a drug can be confirmed, it must be tested and tested again. Different scientists in different labs need to repeat the protocols and publish their results. The test of replicability, as it’s known, is the foundation of modern research. Replicability is how the community enforces itself. It’s a safeguard for the creep of subjectivity&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;Repeating studies is virtually unheard of in linguistics. Also, Lehrer mentions the publication bias in journals. When a result is discovered, there is a bias towards positive results. After a while, once the result is accepted, then only negative results are published because only that is "interesting" anymore. But I would expand this point to say this same bias exists at every stage of the research process. We want to find things that happen, we don't care about spending 5 years and thousands of hours discovering that X does NOT cause Y! So when young grad students begin scoping out a new study, they throw away anything that doesn't seem fruitful, where fruitful is defined as yielding positive results. This bias affects the very foundation of the research process, namely  answering the basic question: what should I study?&lt;br /&gt;&lt;br /&gt;As a side note, engineers seem perfectly happy to follow through on null results. They need to know the full scope of their problem before solving it. Scientists can learn a lot from engineers (and vice versa).&lt;br /&gt;&lt;br /&gt;[Psychology professor Jonathan] &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Schooler recommends the establishment of an open-source database, in which researchers are required to outline their planned investigations and document all their results. “I think this would provide a huge increase in access to scientific work and give us a much better way to judge the quality of an experiment,” Schooler says. “It would help us finally deal with all these issues that the decline effect is exposing&lt;/span&gt;&lt;/i&gt;.”&lt;br /&gt;&lt;br /&gt;Coincidentally, I was recently tweeting with &lt;a href="http://twitter.com/#!/moximer"&gt;moximer&lt;/a&gt; and &lt;a href="http://twitter.com/#!/jasonpriem"&gt;jasonpriem&lt;/a&gt; about this and we agreed that research wikis are worth explolring. My vision would be something akin to Wikipedia but where a researcher stores all of their data, stimuli, results, etc, finished or not. The data could be tagged as &lt;i&gt;tentative, draft, failed, successful&lt;/i&gt;, etc. As the research goes on, the data get updated. Not only would this record failure (which, as Leherer points out in the article) is as valuable as success, it also records change. How did a study evolve over time?&lt;br /&gt;&lt;br /&gt;True, the data would become huge over time across many disciplines, but that just means means we need better and better data mining tools (and the boys at &lt;a href="http://alias-i.com/lingpipe/"&gt;LingPipe&lt;/a&gt; are working away at those tools).&lt;br /&gt;&lt;br /&gt;HT &lt;a href="http://twitter.com/#!/rapella"&gt;rapella&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-5265964386669440137?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/5265964386669440137/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=5265964386669440137' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5265964386669440137'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5265964386669440137'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/how-not-to-do-linguistics.html' title='how (not) to do linguistics'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-2265825675497353394</id><published>2011-01-28T11:52:00.002-05:00</published><updated>2011-01-31T23:58:42.382-05:00</updated><title type='text'>the sociolinguistics of height in China</title><content type='html'>Ingrid at &lt;a href="http://www.languageonthemove.com/language-learning-gender-identity/language-learning-and-height?utm_source=rss=rss=language-learning-and-height"&gt;Language on the Move&lt;/a&gt; has some thoughtful comments on the relationship between height and learning English in China. If you're under 1.6 meters, forget it. There are subtle but very real socioeconomic barriers in your way. Money quote:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;I have supervised research related to English language learning and teaching in China for almost a decade and have read most of the research on the topic published in English. However, never before have I come across the importance of height. I take this as evidence for the importance of doing ethnographic research. Otherwise, what is the point of doing sociolinguistic research if you can’t discover anything you hadn’t already decided in advance would be important&lt;/span&gt;&lt;/i&gt;?!&lt;br /&gt;&lt;br /&gt;I taught EFL in China back in 1998 at a private school in Guangzhou catering mostly to working professionals. Much has changed since then, as China has changed so much. I don't recall ever talking about height as a factor, but certainly cost and hours were a significant issue that made it virtually impossible for any poor workers to consider taking English classes. As a 6 foot 4 blond American, though, I was treated like a rock star. It was kinda weird.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-2265825675497353394?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/2265825675497353394/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=2265825675497353394' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/2265825675497353394'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/2265825675497353394'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/sociolinguistics-of-height-in-china.html' title='the sociolinguistics of height in China'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-3445990871143468459</id><published>2011-01-28T08:06:00.000-05:00</published><updated>2011-01-28T08:06:26.933-05:00</updated><title type='text'>chomsky and performance art</title><content type='html'>Artist Annie Dorsen has created a chatbot performance piece around the debate between Noam Chomsky and Michele Foucault on Dutch TV in 1971 (videos &lt;a href="http://www.google.com/search?rlz=1C1RNNN_enUS357US361&amp;amp;sourceid=chrome&amp;amp;ie=UTF-8&amp;amp;q=chomsky+foucault+tv#q=chomsky+foucault+tv&amp;amp;hl=en&amp;amp;rlz=1C1RNNN_enUS357US361&amp;amp;prmd=ivns&amp;amp;source=univ&amp;amp;tbs=vid:1&amp;amp;tbo=u&amp;amp;ei=M71CTdqLB47TgQev4J3PAQ&amp;amp;sa=X&amp;amp;oi=video_result_group&amp;amp;ct=title&amp;amp;resnum=1&amp;amp;ved=0CCcQqwQwAA&amp;amp;fp=514833c2eeb483db"&gt;here&lt;/a&gt;). &lt;br /&gt;&lt;br /&gt;This snippet is mildly interesting, but I couldn't help wondering what technology was used, especially the speech synthesis, because, frankly, it's a bit clunky and old-fashioned. Perhaps that's part of the point. The computer screens appear to be running DOS shells too. Nothing wrong with that, purists will likely prefer it even, but combined with the clunky speech, the performance appears to be trading on a very outdated computational linguistic aesthetic. Truly the desert of the real? (okay, I had to through in some Baudrillard)&lt;br /&gt;&lt;br /&gt;&lt;iframe allowfullscreen="" class="youtube-player" frameborder="0" height="390" src="http://www.youtube.com/embed/3PiwEQQNnBk" title="YouTube video player" type="text/html" width="640"&gt;&lt;/iframe&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-3445990871143468459?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/3445990871143468459/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=3445990871143468459' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3445990871143468459'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3445990871143468459'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/chomsky-and-performance-art.html' title='chomsky and performance art'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://img.youtube.com/vi/3PiwEQQNnBk/default.jpg' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-830248203531945419</id><published>2011-01-27T11:30:00.001-05:00</published><updated>2011-01-27T11:48:18.678-05:00</updated><title type='text'>can a machine learn jazz?</title><content type='html'>There's a contest dedicated to trying to answer that question: &lt;a href="http://tunedit.org/challenge/music-retrieval"&gt;ISMIS 2011 Contest: Music Information Retrieval&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Computer scientists and engineers have long used contests and &lt;i&gt;bake-offs&lt;/i&gt; to stimulate cutting edge research in linguistics (e.g., &lt;a href="http://en.wikipedia.org/wiki/Message_Understanding_Conference"&gt;MUC&lt;/a&gt;), but linguists have lagged in this department. You rarely if ever hear about contests that  pit one linguistic theory against another using a standardized data set (or maybe I've just missed them).&lt;br /&gt;&lt;br /&gt;Nobel prize winning economist Joseph Stiglitz argues &lt;a href="http://www.project-syndicate.org/commentary/stiglitz81/English"&gt;here &lt;/a&gt;that prizes are good for stimulating academic research. I agree whole heartedly and would like to see more direct competition between theorists. Exactly how a contest would be constructed is up for debate (I have a vague memory of some group trying to devise criteria by which to evaluate linguistic theories, maybe out of UCLA, but I can't seem to track it down; it's a remarkably difficult Google query to form).&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;HT: &lt;a href="http://jochenleidner.posterous.com/machine-learning-of-jazz-music"&gt;Jochen L. Leidner&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-830248203531945419?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/830248203531945419/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=830248203531945419' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/830248203531945419'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/830248203531945419'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/can-machine-learn-jazz.html' title='can a machine learn jazz?'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-7275150081016571724</id><published>2011-01-27T07:33:00.008-05:00</published><updated>2011-01-27T12:48:29.176-05:00</updated><title type='text'>the linguistics of heaven and hell</title><content type='html'>The value of pop culture data for legitimate research is being put to the test. Exactly what, if anything, can the reality show &lt;a href="http://www.channel4.com/bigbrother/"&gt;Big Brother&lt;/a&gt; tell us about language change over time?&lt;br /&gt;&lt;br /&gt;&lt;a href="http://en.wikipedia.org/wiki/Voice_onset_time"&gt;Voice Onset Time&lt;/a&gt; is a measure of how long you wait to begin vibrating your vocal folds after you release a stop consonant. Voiced stop consonants like /b/ and /d/ require two things: 1) stop all airflow from escaping the airway by closing the glottis and 2) after the air is released, begin vibrating the glottis (by using the rushing air). For non-linguists, think of a garden hose. Imagine you use your thumb to stop the water for a second and you let the pressure build, then you let go and water rushes out, but then you use your thumb to clamp down just a bit on the water to spray it. This is kinda like the speech production of voiced stop consonants in human language.&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_uqqKLCxLtII/TUCZDGuhQsI/AAAAAAAAAYo/AH9XkaCRlX8/s1600/water_hose.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="150" src="http://1.bp.blogspot.com/_uqqKLCxLtII/TUCZDGuhQsI/AAAAAAAAAYo/AH9XkaCRlX8/s200/water_hose.jpg" width="200" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div style="text-align: center;"&gt;(image from &lt;a href="http://www.kval.com/news/52001622.html"&gt;Kval.com&lt;/a&gt;)&lt;/div&gt;&lt;br /&gt;Though I’m no phoneticist, I really like VOT as a target of linguistic study for one crucial reason: it’s a clear example of a linguistic feature that varies according to your human language system but which you do NOT have conscious control over. What that means is that you cannot consciously change the length of your own personal VOT. Go ahead, try it. Make your VOT 20 milliseconds longer. Go ahead, I’ll wait…&lt;br /&gt;&lt;br /&gt;Of course you can’t. Well, not consciously, but what researchers have found is that your brain, quite independent of conscious will or knowledge, can! Lab studies have found that people will unknowingly alter their VOTs according to certain situations, and the results are predictable. For example, they found that when listening to a set of long VOT stimuli, subjects will begin to lengthen their own VOTs, in essence accommodating the longer VOTs. Over the longer term it has also been shown that people will lengthen their VOT over their lifetime to accommodate cultural shifts. It has been shown that The Queen Mother herself had a longer VOT in her later life than during her younger days (few other people have been recorded consistently over a long period to provide such valuable data).&lt;br /&gt;&lt;br /&gt;Here’s what Bane et al. did: They took recordings of confessional sequences from the UK reality TV show Big Brother (where groups of strangers are made to live with each other and occasionally speak to a camera alone like a video diary) and tested what happened to 4 crucial individuals (the ones that stayed on the show long enough to provide several months worth of data points). What they found was that their VOTs did in fact change, though no linear pattern was discovered (i.e., they did not simply get longer in a steady line).  This paper is labeled as a &lt;i&gt;progress report&lt;/i&gt; because they don't have a firm hypothesis about what actually is happening. Nice trick there boys, ;)&lt;br /&gt;&lt;br /&gt;They did find one interesting thing: During part of the show, the house mates were physically divided into basically a caste system where half the people were low caste and half were high (a &lt;i&gt;heaven&lt;/i&gt; and a &lt;i&gt;hell&lt;/i&gt;). And this seemed to have an effect on VOT as well (sociolinguists are slap happy about this, I'm sure).&lt;br /&gt;&lt;br /&gt;I haven’t looked at the actual numbers very closely, but in section 6, they say “Housemate trajectories seem to diverge when the divide is present…” However, just taking a glance at the Figure 3, it looks like they diverge at the beginning, then converge at the end, episode 65 (and remain somewhat similar until several episodes of non-DIVIDE have gone by). If my cursory glance is correct, I would assume it takes awhile for the convergence to manifest, and then it persists for awhile after DIVIDE is gone. But this is just me looking at the picture, not the actual data.&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_uqqKLCxLtII/TUCb3h8E_TI/AAAAAAAAAYw/be4QYX4TpRM/s1600/VOT+Divide.JPG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="152" src="http://2.bp.blogspot.com/_uqqKLCxLtII/TUCb3h8E_TI/AAAAAAAAAYw/be4QYX4TpRM/s320/VOT+Divide.JPG" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;Finally, and this is just a readability point, but I would order the names in Figure 3 in the same order as the end point of each trajectory, making it easier to follow who is doing what.&lt;br /&gt;&lt;br /&gt;&lt;span style="float: left; padding-bottom: 5px; padding-left: 5px; padding-right: 5px; padding-top: 5px;"&gt;&lt;a href="http://www.researchblogging.org/"&gt;&lt;img alt="ResearchBlogging.org" src="http://www.researchblogging.org/public/citation_icons/rb2_large_gray.png" style="border: 0;" /&gt;&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Z3988" title="ctx_ver=Z39.88-2004&amp;amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;amp;rft.jtitle=+Linguistic+Society+of+America&amp;amp;rft_id=info%3A%2F&amp;amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;amp;rft.atitle=Longitudinal+phonetic+variation+in+a+closed+system&amp;amp;rft.issn=&amp;amp;rft.date=2011&amp;amp;rft.volume=&amp;amp;rft.issue=&amp;amp;rft.spage=&amp;amp;rft.epage=&amp;amp;rft.artnum=http%3A%2F%2Fweb.mit.edu%2Fgraff%2Fwww%2FBaneGraffSonderegger2010LongitudinalPhoneticVariationInAClosedSystem.pdf&amp;amp;rft.au=Max+Ban&amp;amp;rft.au=Peter+Graf&amp;amp;rft.au=Morgan+Sonderegge&amp;amp;rfe_dat=bpr3.included=1;bpr3.tags=Computer+Science+%2F+Engineering%2CMathematics%2CPsychology%2CSocial+Science%2CResearch+%2F+Scholarship%2CLinguistics"&gt;Max Bane, Peter Graff, &amp;amp; Morgan Sonderegger (2011). Longitudinal phonetic variation in a closed system &lt;span style="font-style: italic;"&gt; Linguistic Society of America 2011 Annual Meeting&lt;/span&gt;&lt;/span&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-7275150081016571724?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/7275150081016571724/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=7275150081016571724' title='9 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7275150081016571724'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7275150081016571724'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/linguistics-of-heaven-and-hell.html' title='the linguistics of heaven and hell'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_uqqKLCxLtII/TUCZDGuhQsI/AAAAAAAAAYo/AH9XkaCRlX8/s72-c/water_hose.jpg' height='72' width='72'/><thr:total>9</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-3825027539198962489</id><published>2011-01-26T15:46:00.000-05:00</published><updated>2011-01-26T15:46:20.007-05:00</updated><title type='text'>more jobs for linguists</title><content type='html'>As the economy continues to grow (Dow over 12,000), so do the non-academic opportunities for linguists. Here's &lt;a href="http://linguistlist.org/jobs/get-jobs.cfm?JobID=83100&amp;amp;SubID=4493096"&gt;an interesting one&lt;/a&gt; for an &lt;i&gt;Analyst&lt;/i&gt; in the California* Bay Area :&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;The Analyst looks for opportunities to improve our Natural Language and Directed-dialog applications using the data logged by them. The Analyst is primarily the responsible team member charged with analyzing the data and making new implementation recommendations [...] Besides analyzing our speech applications and improving our Analytics framework, you will also have the opportunity to carry out independent research, which forms a big part of the success of our speech applications&lt;/span&gt;&lt;/i&gt; [...]&lt;br /&gt;&lt;br /&gt;*living in the metro DC region has taught this Northern California boy that there's more than one "Bay Area."&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-3825027539198962489?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/3825027539198962489/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=3825027539198962489' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3825027539198962489'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3825027539198962489'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/more-jobs-for-linguists.html' title='more jobs for linguists'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-3338349260798273131</id><published>2011-01-26T13:06:00.002-05:00</published><updated>2011-01-26T16:16:17.796-05:00</updated><title type='text'>Call For Participation</title><content type='html'>More and more researchers are using the web to gather data for serious research, but they need your help as participants. As a proponent, I like to do my part and share the calls for participation that I know about. If you know of any others, I'm happy to add:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://lux09.mpi.nl/cgi-bin/exp/default.pl"&gt;MPI&lt;/a&gt; -- The Max Planck Institute for Psycholinguistics: &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Investigates how people use language&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.surveygizmo.com/s3/401477/Cue-word-Memories"&gt;Cue-word memories&lt;/a&gt; -- Clare Rathbone, University of Reading: &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;The study is specifically interested in the way people remember events from their lives. You will be asked to recall 16 memories and then rate them for details, such as vividness and how often you have thought about the memories before. You will also be asked to fill in two short questionnaires. Please note, this questionnaire is for people over the age of 40 only - please do not take part if you are aged 39 or younger&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://anawiki.essex.ac.uk/phrasedetectives/"&gt;Phrase Detectives&lt;/a&gt; -- University of Essex: &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Lovers of literature, grammar and language, this is the place where you can work together to improve future generations of technology. By indicating relationships between words and phrases you will help to create a resource that is rich in linguistic information&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.gameswithwords.org/"&gt;Games With Words&lt;/a&gt; -- Joshua Hartshorne, Harvard University: &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Test your language sense! Play a game while participating in cutting-edge research. How good is your language sense&lt;/span&gt;&lt;/i&gt;?&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.colornaming.net/"&gt;Color Naming&lt;/a&gt; --&amp;nbsp;Dimitris Mylonas,&amp;nbsp;London College of Communication: &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;This is a multi-lingual colour naming experiment. It is part of research on colour naming and colour categorisation within different cultures, and aims to improve the inter-cultural colour dialogue. By taking part you are helping us to develop an online colour naming model which will be based on the "natural" language provided from your responses&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://coglab.wadsworth.com/"&gt;CogLab 2.0&lt;/a&gt; -- The Cognitive Psychology Online Laboratory: Aggregated set of many online research projects.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-3338349260798273131?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/3338349260798273131/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=3338349260798273131' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3338349260798273131'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3338349260798273131'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/call-for-participation.html' title='Call For Participation'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-885787522616178765</id><published>2011-01-26T08:48:00.000-05:00</published><updated>2011-01-26T08:48:58.303-05:00</updated><title type='text'>do we need parsed corpora?</title><content type='html'>Maybe not, according to &lt;a href="http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en/us/pubs/archive/35179.pdf"&gt;Google&lt;/a&gt;: &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;For many tasks, words and word combinations provide all the representational machinery we need to learn from text...invariably, simple models and a lot of data trump more elaborate models based on less data&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;I've been wondering about this very issue for 5 years or so. When I first started collecting parsed BNC data for my defunct &lt;a href="http://thelousylinguist.blogspot.com/search?q=barrier+verbs"&gt;dissertation&lt;/a&gt;, I needed sentences involving various verbs and prepositions, but the examples I found were often of the wrong structural type because of preposition attachment ambiguity. I used &lt;a href="http://tedlab.mit.edu/~dr/Tgrep2/"&gt;Tgrep2&lt;/a&gt; queries to find proper examples, but even then there were false positives, so I did some error correction. One of the more interesting discoveries I made was a relationship between a verb's role in its semantic class and its error rate.&lt;br /&gt;&lt;br /&gt;I was trying to find a way to objectively define core members of a semantic verb class and peripheral members. I had a pretty good intuition about which were which, but I wanted to get beyond intuition (yes yes, it's all very &lt;a href="http://www.stanford.edu/~bclevin/"&gt;Beth Levin&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;For example, one of the objective clues for barrier verbs (a class of &lt;a href="http://www.amazon.com/Syntax-Negation-Outstanding-Dissertations-Linguistics/dp/081531728X/ref=sr_1_1?ie=UTF8&amp;amp;s=books&amp;amp;qid=1197996223&amp;amp;sr=8-1"&gt;negative verbs&lt;/a&gt; encoding obstruction, like &lt;i&gt;prevent, ban, exclude&lt;/i&gt;, etc) was the unusual role of the preposition &lt;i&gt;from&lt;/i&gt; in sentences like these:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;i&gt;She prevented them from entering the pub&lt;/i&gt;.&lt;/li&gt;&lt;li&gt;&lt;i&gt;He banned them from the pub&lt;/i&gt;.&lt;/li&gt;&lt;li&gt;&lt;i&gt;They were excluded from the pub&lt;/i&gt;.&lt;/li&gt;&lt;/ul&gt;The preposition &lt;i&gt;from&lt;/i&gt; is usually used to mark sources (&lt;i&gt;He drove here from Buffalo&lt;/i&gt;) but in these sentences it's acting much more like a complementizer. This is fairly unique to barrier verbs and I felt it was distinctive of the verb class, so I wanted a bunch of examples. Because I needed to exclude examples involving old-fashioned source &lt;i&gt;from&lt;/i&gt;, I used a Tgrep2 search that required the PP to be in a particular relationship to the verb (the BNC parse was a bit odd as I recall, and required some gymnastics).&lt;br /&gt;&lt;br /&gt;Again, I had a lot of false positives even with Tgrep2 so I did some manual error analysis and discovered that certain verbs had very low error rates while others had very high rates and the difference coincided nicely with my intuition about which verbs were core members of the class and which were peripheral: core members like &lt;i&gt;prevent &lt;/i&gt;had very low error rates. This means that when &lt;i&gt;prevent&lt;/i&gt; is followed by a &lt;i&gt;from&lt;/i&gt;-PP, it's almost always the complementizer &lt;i&gt;from&lt;/i&gt;; obvious to adults, the meaning of a barrier verb doesn't easily include source (necessary for old-fashioned &lt;i&gt;from)&lt;/i&gt;, but how would a kid learn that? If I &lt;i&gt;ban you from the pub&lt;/i&gt;, how does a kid know the pub is NOT where you started (source) but rather the opposite, it's where you're not allowed to end up (goal)? Cool little learning problem, I thought ... and with a data set other than frikkin dative (which Pinker and Levin have, let's face it, done to death).&lt;br /&gt;&lt;br /&gt;I assumed there was something central to the meaning of the verb class that caused this special use of &lt;i&gt;from&lt;/i&gt;. Then it occurred to me, if this is true, why do I need the parse? Imagine I ignore structure, take all sentences where &lt;i&gt;from&lt;/i&gt; follows a relevant verb, then sample for false positives. That should give me basically the same thing.&lt;br /&gt;&lt;br /&gt;I became increasingly fascinated with this methodology. I was now interested in &lt;i&gt;how&lt;/i&gt; I was studying language, not what I was studying. And that led me to ask whether or not the parse info was all that valuable for other linguistic studies? But then I realized that when big news stories start getting old, the media always, always starts reporting on themselves, on &lt;i&gt;how&lt;/i&gt; the news gets made ... I didn't like where I was heading ...&lt;br /&gt;&lt;br /&gt;...and then I got a job and that was that ...&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;HT: &lt;a href="http://twitter.com/#!/moximer"&gt;Melodye&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-885787522616178765?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/885787522616178765/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=885787522616178765' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/885787522616178765'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/885787522616178765'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/do-we-need-parsed-corpora.html' title='do we need parsed corpora?'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-8262510765103377679</id><published>2011-01-25T08:36:00.001-05:00</published><updated>2011-01-25T15:40:40.774-05:00</updated><title type='text'>Obama's State Of The Union and word frequency</title><content type='html'>In anticipation of President Obama's 2011 State Of The Union speech tonight, and the inevitable bullshit word frequency analysis to follow, I am re-posting &lt;a href="http://thelousylinguist.blogspot.com/2010/01/bad-linguistics-sigh.html"&gt;my post&lt;/a&gt; from last year's SOTU reaction, in hope that maybe, just maybe, some political pundit might be slightly less stupid than they were last year ... sigh .. here's to hope ...&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_uqqKLCxLtII/S2Gx93Dz-mI/AAAAAAAAAPk/mKmHMal1Lbc/s1600-h/obama_sotu_2010_small.JPG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://2.bp.blogspot.com/_uqqKLCxLtII/S2Gx93Dz-mI/AAAAAAAAAPk/mKmHMal1Lbc/s320/obama_sotu_2010_small.JPG" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;(cropped image from &lt;a href="http://www.huffingtonpost.com/2010/01/28/obama-emphasizes-bipartis_n_439753.html"&gt;Huffington Post&lt;/a&gt;) &lt;/div&gt;&lt;br /&gt;It has long been a grand temptation to use simple word frequency* counts to judge a person's mental state. Like &lt;i&gt;Freudian Slips&lt;/i&gt;, there is an assumption that this will give us a glimpse into what a person "really" believes and feels, deep inside. This trend came and went within linguistics when digital corpora were first being compiled and analyzed several decades ago. Linguists quickly realized that this was, in fact, a bogus methodology when they discovered that many (most) claims or hypotheses based solely on a person's simple word frequency data were easily refuted upon deeper inspection. Nonetheless, the message of the weakness of this technique never quite reached the outside world and word counts continue to be cited, even by reputable people, as a window into the mind of an individual. Geoff Nunberg recently railed against the practice here: &lt;a href="http://people.ischool.berkeley.edu/~nunberg/countingwords.html"&gt;The I's Dont Have It&lt;/a&gt;. &lt;br /&gt;&lt;br /&gt;The latest victim of this scam is one of the blogging world's most respected statisticians, Nate Silver who performed a word frequency experiment on a variety of U.S. presidential State Of The Union speeches going back to 1962 &lt;a href="http://www.fivethirtyeight.com/2010/01/obamas-sotu-clintonian-in-good-way.html"&gt;HERE&lt;/a&gt;. I have a lot of respect for Silver, but I believe he's off the mark on this one. Silver leads into his analysis talking about his own &lt;i&gt;pleasant surprise &lt;/i&gt;at the fact that the speech demonstrated "&lt;i&gt;an awareness of the difficult situation in which the President now finds himself&lt;/i&gt;." Then, he justifies his linguistic analysis by stating that "&lt;i&gt;subjective evaluations of Presidential speeches are notoriously useless. So let's instead attempt something a bit more rigorous, which is a word frequency analysis..&lt;/i&gt;." He explains his methodology this way:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span style="color: #38761d;"&gt;To investigate, we'll compare the President's speech to the State of the Union addresses delivered by each president since John F. Kennedy in 1962 in advance of their respective midterm elections. We'll also look at the address that Obama delivered -- not technically a State of the Union -- to the Congress in February, 2009. I've highlighted a total of about 70 buzzwords from these speeches, which are broken down into six categories. The numbers you see below reflect the number of times that each President used term in his State of the Union address.&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;The comparisons and analysis he reports are bogus and at least as "subjective" as his original intuition. &lt;b style="color: blue;"&gt;Here's why:&lt;/b&gt;&lt;br /&gt;&lt;a name='more'&gt;&lt;/a&gt;&lt;ol&gt;&lt;li&gt;We don't know what causes word frequencies.&lt;/li&gt;&lt;li&gt;We don't know what the effects of word frequencies are. &lt;/li&gt;&lt;li&gt;His sample is skewed.&lt;/li&gt;&lt;li&gt;Silver invented categories that have no cognitive reality. &lt;/li&gt;&lt;li&gt;There are good alternatives.&lt;br /&gt;&lt;/li&gt;&lt;/ol&gt;&lt;br /&gt;&lt;div style="color: blue;"&gt;&lt;b&gt;We don't know what causes word frequencies.&lt;/b&gt;&lt;/div&gt;&lt;div style="color: black;"&gt;Why does a person use one word more than another? WE. DON'T. KNOW. I understand the simple intuition that this should mean something, but no one actually knows what it means. We simply don't understand the workings of the brain well enough to study the speech production system well enough to answer this question (despite &lt;a href="http://thelousylinguist.blogspot.com/2009/12/on-linguistic-fingerprinting.html"&gt;these guys' suspect claims&lt;/a&gt;). So we are left with pure &lt;b&gt;intuition &lt;/b&gt;(which is generally bad in the cognitive sciences because &lt;i&gt;we don't think the way we think we do&lt;/i&gt;). So, again, this methodology is not "objective" as Silver claims (not the simplistic way he implemented it, anyway).&lt;/div&gt;&lt;div style="color: blue;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="color: blue;"&gt;&lt;b&gt;We don't know what the effects of word frequencies are.&lt;/b&gt;&lt;/div&gt;The correlate to #1: When a person hears another person use one word more than another, what effect does it have? WE. DON'T. KNOW. Same reasons as above. This remains the realm of intuition and guesswork.&lt;span style="color: blue;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;div style="color: black;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;b&gt;&lt;span style="color: blue;"&gt;His sample is skewed.&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;While I understand that to the lay person, the set of SOTU speeches seems like a coherent category to analyze, it is in fact a linguistically incoherent grouping because these sorts of speeches are constructed slowly, painfully, over time, by teams of individuals, NOT spoken extemporaneously by a single individual. Silver could spin this as a positive in the sense that the speeches represent presidential administrations as a whole, but this makes the "evidence" (i.e., word frequency) extremely messy. What factor is driving the frequency of a particular word in a speech? No clue. The variables are numerous and unknown (two bad things for "rigorous" analysis). Having such a messy data set makes interpretation nearly impossible even if we DID know the answers to #1 and #2 (which we don't).&lt;br /&gt;&lt;br /&gt;&lt;div style="color: blue;"&gt;&lt;b&gt;Silver invented categories that have no cognitive reality. &lt;/b&gt;&lt;/div&gt;&lt;div style="color: black;"&gt;Silver's 70 buzzwords are shoved into six arbitrary categories. Linguists have bee keen on word categories for ... well ... let's say at least &lt;a href="http://en.wikipedia.org/wiki/P%C4%81%E1%B9%87ini"&gt;2500 years&lt;/a&gt;. This we care about. Deeply. William Labov famously wrote, "&lt;i style="color: #38761d;"&gt;If linguistics can be said to be any one thing it is the study of categories&lt;/i&gt;" (full text &lt;a href="http://books.google.com/books?id=kbfhf6U_7goC&amp;amp;pg=PA29&amp;amp;lpg=PA29&amp;amp;dq=THE+BOUNDARIES+OF+WORDS+AND+THEIR+MEANINGS+WILLIAM+LABOV&amp;amp;source=bl&amp;amp;ots=7lNkdOQObK&amp;amp;sig=mMs0BjXiAWArav0XPEhZIsjLQJ8&amp;amp;hl=en&amp;amp;ei=ucVhS43fKozl8QbC1PmVBw&amp;amp;sa=X&amp;amp;oi=book_result&amp;amp;ct=result&amp;amp;resnum=3&amp;amp;ved=0CBYQ6AEwAg#v=onepage&amp;amp;q=THE%20BOUNDARIES%20OF%20WORDS%20AND%20THEIR%20MEANINGS%20WILLIAM%20LABOV&amp;amp;f=false"&gt;here&lt;/a&gt;). More recently, in the last few decades, linguists have expanded their repertoire of tools for analyzing lexical categories using &lt;a href="http://www.gameswithwords.org/Learning/index.html"&gt;psycholinguistic&lt;/a&gt;, &lt;a href="http://www2.hawaii.edu/~bergen/ling640G/lec/lec3.htm"&gt;cognitive linguistic&lt;/a&gt;, and &lt;a href="http://verbs.colorado.edu/verb-index/index.php"&gt;computational linguistic &lt;/a&gt;tools and methods. None of these were employed by Silver in determining whether or not his six categories have any coherence or cognitive reality. He just made them up. How is this MORE objective than intuition?&lt;/div&gt;&lt;div style="color: blue;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="color: blue;"&gt;&lt;b&gt;There are alternatives.&lt;/b&gt;&lt;/div&gt;Let me be clear. I am a fan of corpus linguistics. Counting words is good (as Nunberg says, and as many linguists say. We like this). But this is just the beginning of&amp;nbsp; a long road of analysis. It must be done in a systematic and sophisticated way to be of any use. There are numerous software tools and methodologies that Silver could have made use of that would have given him a more nuanced analysis. There are whole books that teach people how to do this, such as &lt;a href="http://www.amazon.com/Corpora-Cognitive-Linguistics-Corpus-Based-Approaches/dp/3110198266/ref=sr_1_1?ie=UTF8&amp;amp;s=books&amp;amp;qid=1264700002&amp;amp;sr=8-1"&gt;Corpora in Cognitive Linguistics &lt;/a&gt;(just one of many).&lt;br /&gt;&lt;br /&gt;Again, I have a lot of respect for Silver and his advanced skill set in stats. I would love to see Silver bring the full weight of his skills to bear on linguist analysis (as I've said, &lt;a href="http://thelousylinguist.blogspot.com/2010/01/why-linguists-should-study-math.html"&gt;every linguist should study math and stats&lt;/a&gt;), but this experiment falls far short of the mark and he should know better.&lt;br /&gt;&lt;br /&gt;To a certain extent, this critique is unfair to Silver because he implicitly seemed to be acknowledging many of these deficits. All he wanted to do was get a more objective picture of what the SOTU speech meant and how it fits into a bigger picture. On the other hand, it's a fair critique because he put in a lot of effort and posted the results to his popular and influential blog (yes, I note my blog is neither); one ought not to waste such effort. There is the glaringly negative possibility that his popularity and influence as a statistician will actually serve to further strengthen the popular but wrong notion that simple word counts are somehow meaningful. &lt;b&gt;This would be bad.&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;*By "simple word frequency counts" I mean counting the words a person uses (say, in a speech) without counting anything else or adding any other data to give the frequency counts meaning and context.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-8262510765103377679?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/8262510765103377679/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=8262510765103377679' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8262510765103377679'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8262510765103377679'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/obamas-state-of-union-and-word.html' title='Obama&apos;s State Of The Union and word frequency'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_uqqKLCxLtII/S2Gx93Dz-mI/AAAAAAAAAPk/mKmHMal1Lbc/s72-c/obama_sotu_2010_small.JPG' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-8398376780117862992</id><published>2011-01-22T08:33:00.004-05:00</published><updated>2011-01-22T08:33:00.199-05:00</updated><title type='text'>the perils of translation: does und mean well?</title><content type='html'>I'm watching the truly powerful 2009 Oscar winning German film &lt;a href="http://www.imdb.com/title/tt1149362/"&gt;The White Ribbon&lt;/a&gt; on Netflix. Even after a few minutes it has grabbed me and impressed me with its simplicity and power, in the style of many great films. Hollywood used to make films like this. Films that mattered. Films that taught deep truths about what it means to be human. Films like "Inherit The Wind", "Guess Who's Coming To Dinner", "To Kill A Mockingbird". Now Hollywood makes three Jennifer Anister rom-coms a year and panders to fan boys... but I digress ...&lt;br /&gt;&lt;br /&gt;My German is pretty rusty, but the film's dialogue is simple enough (in a good way) for me to catch most of it even without the subtitles, which is exactly the source of the linguistic point I want to discuss. In one early scene, the narrator, a teacher, recounts an incident involving himself* and a student wherein the student was endangering himself, so the teacher demands the student explain his actions. When he fails to get a proper response, he says repeatedly &lt;i&gt;Und? ... Und? &lt;/i&gt;... German &lt;i&gt;und&lt;/i&gt; is easily translated as &lt;i&gt;and&lt;/i&gt; but the film's translators choose to use the English word &lt;i&gt;well&lt;/i&gt; instead.&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_uqqKLCxLtII/TTo_AA39YmI/AAAAAAAAAYg/yCkW_QzcKZI/s1600/white+ribbon+well.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="172" src="http://1.bp.blogspot.com/_uqqKLCxLtII/TTo_AA39YmI/AAAAAAAAAYg/yCkW_QzcKZI/s320/white+ribbon+well.jpg" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div style="text-align: center;"&gt;(screen grab from Netflix)&lt;/div&gt;&lt;br /&gt;As a native speaker of English, I can see the reasoning behind &lt;i&gt;well&lt;/i&gt;, yet I must say, it's equally plausible to use &lt;i&gt;and &lt;/i&gt;in that situation as well, maybe even more so. The use of &lt;i&gt;well&lt;/i&gt; in English would suggest a certain formality that the translator felt was proper, but it also makes me, as an English speaker, feel a bit awkward, like I'm being fed an anachronism. Perhaps that's appropriate for the movie, I'm not sure, it just struck me as an interesting linguistic choice. It's a nice example of the beautiful ambiguity of lexical items, really. For example, just a few scenes later the teacher encounters Eva and asks her about who she is and what he's heard about her, namely that she's a new nanny in town, and her response is &lt;i&gt;und&lt;/i&gt;, but it is translated as English &lt;i&gt;so&lt;/i&gt;:&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_uqqKLCxLtII/TTpElHo2A9I/AAAAAAAAAYk/yoxw9yP9o_8/s1600/white+ribbon+so.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="188" src="http://1.bp.blogspot.com/_uqqKLCxLtII/TTpElHo2A9I/AAAAAAAAAYk/yoxw9yP9o_8/s320/white+ribbon+so.jpg" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div style="text-align: center;"&gt;(screen grab from Netflix)&lt;/div&gt;&lt;div style="text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;Again, as a native speaker of English, I can "get" the translation, but still, I'd be perfectly happy with &lt;i&gt;and&lt;/i&gt; in both. I've never been a translator and I have much respect for the difficult job professional translators do navigating these treacherous waters. I don't mean to second guess. Rather, it strikes my as an interesting point of discussion.&lt;br /&gt;&lt;br /&gt;*why can't I say &lt;i&gt;hisself&lt;/i&gt;? Oh, where are you Jeff Runner when I need you!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-8398376780117862992?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/8398376780117862992/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=8398376780117862992' title='5 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8398376780117862992'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8398376780117862992'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/perils-of-translation-does-und-mean.html' title='the perils of translation: does &lt;i&gt;und&lt;/i&gt; mean &lt;i&gt;well&lt;/i&gt;?'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_uqqKLCxLtII/TTo_AA39YmI/AAAAAAAAAYg/yCkW_QzcKZI/s72-c/white+ribbon+well.jpg' height='72' width='72'/><thr:total>5</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-540691605521918231</id><published>2011-01-21T11:46:00.003-05:00</published><updated>2011-01-21T11:52:32.223-05:00</updated><title type='text'>like wikipedia with a voice?</title><content type='html'>It can be difficult to get a feel for what some tech start-ups are going for. This demo of Qwiki at a Tech Crunch event asks us to think of information as an &lt;i&gt;experience&lt;/i&gt;. I'm pretty sure the voice is synthesized because of some odd prosody and the weird way &lt;i&gt;Yelp&lt;/i&gt; is pronounced (oh, and the unlikelihood that they could pre-record all the possible narration ... yeah, that too). At the end, all I could think of was "it's like Wikipedia with a voice..."&lt;br /&gt;&lt;br /&gt;&lt;iframe frameborder="0" height="225" src="http://player.vimeo.com/video/15444551" width="400"&gt;&lt;/iframe&gt;&lt;br /&gt;&lt;a href="http://vimeo.com/15444551"&gt;Qwiki at TechCrunch Disrupt&lt;/a&gt; from &lt;a href="http://vimeo.com/qwiki"&gt;Qwiki&lt;/a&gt; on &lt;a href="http://vimeo.com/"&gt;Vimeo&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-540691605521918231?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/540691605521918231/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=540691605521918231' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/540691605521918231'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/540691605521918231'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/like-wikipedia-with-voice.html' title='like wikipedia with a voice?'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-1043228255293149535</id><published>2011-01-19T22:11:00.000-05:00</published><updated>2011-01-19T22:11:57.459-05:00</updated><title type='text'>worst word play of the year?</title><content type='html'>Your call...&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_uqqKLCxLtII/TTenv_fO5jI/AAAAAAAAAYc/sJxnTsVXTj4/s1600/guess+hu.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="148" src="http://1.bp.blogspot.com/_uqqKLCxLtII/TTenv_fO5jI/AAAAAAAAAYc/sJxnTsVXTj4/s320/guess+hu.jpg" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-1043228255293149535?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/1043228255293149535/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=1043228255293149535' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/1043228255293149535'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/1043228255293149535'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/worst-word-play-of-year.html' title='worst word play of the year?'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_uqqKLCxLtII/TTenv_fO5jI/AAAAAAAAAYc/sJxnTsVXTj4/s72-c/guess+hu.jpg' height='72' width='72'/><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-6310998455396374673</id><published>2011-01-18T19:31:00.003-05:00</published><updated>2011-01-18T23:34:38.285-05:00</updated><title type='text'>oh snap! daume talkin trash 'bout "stupid" penn tree bank</title><content type='html'>Hal Daume at his excellent &lt;a href="http://nlpers.blogspot.com/2011/01/parsing-with-transformations.html"&gt;NLPers blog&lt;/a&gt; is wondering aloud about parsing algorithms doing "real" syntax:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;One thing that stands in our way, of course, is the &lt;b&gt;stupid&lt;/b&gt; Penn Treebank, which was annotated only with very simple transformations (mostly noun phrase movements) and not really "deep" transformations as most Chomskyan linguists would recognize them&lt;/span&gt;&lt;/i&gt; [emphasis added].&lt;br /&gt;&lt;br /&gt;Oh no he di'nt!&lt;br /&gt;&lt;br /&gt;[&lt;b&gt;&lt;span class="Apple-style-span" style="color: orange;"&gt;UPDATE&lt;/span&gt;&lt;/b&gt;: hal &lt;a href="https://www.blogger.com/comment.g?blogID=520807396714463309&amp;amp;postID=6310998455396374673"&gt;responds&amp;nbsp;thoughtfully&lt;/a&gt;&amp;nbsp;in the comments and properly corrects my misunderstandings of his post.]&lt;br /&gt;&lt;br /&gt;It's certainly fair to say that the &lt;a href="http://www.cis.upenn.edu/~treebank/"&gt;Penn Treebank&lt;/a&gt; is not annotated for everything. Sure. But show me the perfect resource and I'll let you throw all the stones you want. More to the point, once you get beyond deciding what the basic chunks are (NPs,VPs, PPs, etc), there's little agreement on what is and what is not a "real" syntactic thing. In order to annotate anything above this level, you have to choose a theoretical camp to park your tent in. You have to take sides.&lt;s&gt; Daume is happy to be a Chomskyan. He's taken his side. Good for him.&lt;/s&gt;&lt;br /&gt;&lt;br /&gt;In order to annotate Daume's &lt;s&gt;beloved&lt;/s&gt; deep transformations, one must first admit such things exist. I do not. And if Daume started annotating the Penn Treebank with such things, I wouldn't care. I would argue he is wasting his time chasing unicorns.&lt;br /&gt;&lt;br /&gt;Daume may believe that Chomskyan theory is "real" syntax, but I do not. Nor do most linguists (if you surveyed all linguists throughout the world, yes I do believe a majority would disagree with the statement &lt;i&gt;I believe in Chomskyan deep structure&lt;/i&gt;).&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: orange;"&gt;UPDATE&lt;/span&gt;&lt;/b&gt;: Daume's &lt;a href="https://www.blogger.com/comment.g?blogID=19803222&amp;amp;postID=8527687550339950613"&gt;comments &lt;/a&gt;and his responses are well worth reading.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-6310998455396374673?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/6310998455396374673/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=6310998455396374673' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/6310998455396374673'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/6310998455396374673'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/oh-snap-daume-talkin-trash-bout-stupid.html' title='oh snap! daume talkin trash &apos;bout &quot;stupid&quot; penn tree bank'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-6921521448663029769</id><published>2011-01-18T08:33:00.000-05:00</published><updated>2011-01-18T08:33:00.052-05:00</updated><title type='text'>the most difficult linguistics sentence ever?</title><content type='html'>Imagine I give you the sentence template that follows:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;i&gt;If speakers omit X to avoid Y, optional Z should be less likely if W&lt;/i&gt;.&lt;/li&gt;&lt;/ul&gt;&lt;b&gt;Question&lt;/b&gt;: What X, Y, Z and W could possibly make that sentence EASIER to understand?&lt;br /&gt;&lt;br /&gt;For no particular reason other than (that) I love linguistics and will read any free article that catches my fancy, I've been reading Florian Jaeger's &lt;a href="http://www.bcs.rochester.edu/people/fjaeger/"&gt;Phonological Optimization and Syntactic Variation: The Case of Optional that&lt;/a&gt;. &lt;i&gt;Submitted for Proceedings of 32nd BLS&lt;/i&gt; (pdf). I have nothing but respect for Jaeger as a linguist* and this is a very interesting paper that I have enjoyed reading**. But &lt;i&gt;flo&lt;/i&gt;*** has a knack for producing very difficult to read sentences. Here's the original that produced the template above:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;If speakers omit optional that to avoid segmental OCP violations with the immediately preceding or following segment, optional that should be less likely if the segments was to share some articulatory feature with the adjacent segment of that&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;It actually got worse WITH context, right? And I read the actual paper, with all kinds o' context. And I still had to re-read that sentence many many times. I'm still not sure I understand it. I may have to whip out PowerPoint, a laser pointer, and a flashlight before I figure it out for sure. Now, I'm prepared to admit that the three pints of &lt;a href="http://beeradvocate.com/beer/profile/395/29687/?sort=low&amp;amp;start=0"&gt;BBC Bourbon Barrel Stout&lt;/a&gt; at &lt;a href="http://www.galaxyhut.com/"&gt;Galaxy Hut&lt;/a&gt; may have influenced my critique ...&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_uqqKLCxLtII/TTUGmgf5UFI/AAAAAAAAAYY/I2z4Vob3EPc/s1600/2011-01-17_18-28-12_968%255B1%255D.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="180" src="http://4.bp.blogspot.com/_uqqKLCxLtII/TTUGmgf5UFI/AAAAAAAAAYY/I2z4Vob3EPc/s320/2011-01-17_18-28-12_968%255B1%255D.jpg" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;...but not entirely for the worse. If I ever get around to typing up my awesome and prodigious commentary, it might make a great blog post ... but don't hold your breath. I have a stack of linguistics articles I've read and reviewed over the last 12 months and yet somehow, I just never get around to typing up my truly awesome comments (including in-depth discussion of &lt;i&gt;flo&lt;/i&gt;'s partner-in-crime Peter Graff's &lt;a href="http://web.mit.edu/graff/www/BaneGraffSonderegger2010LongitudinalPhoneticVariationInAClosedSystem.pdf"&gt;Longitudinal Phonetic Variation in a Closed System&lt;/a&gt; -- I got mad comments on that one). Maybe I should have called this blog &lt;i&gt;The Lazy Linguist&lt;/i&gt;?&lt;br /&gt;&lt;br /&gt;*I've never met the guy so maybe he's a bastard in person, I dunno, I hope not...&lt;br /&gt;**Not in the least because it has some tangential connection to my somewhat defunct dissertation research.&lt;br /&gt;***Hey, he &lt;a href="http://hlplab.wordpress.com/wheres-flo/"&gt;calls himself that&lt;/a&gt; on his site...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-6921521448663029769?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/6921521448663029769/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=6921521448663029769' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/6921521448663029769'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/6921521448663029769'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/most-difficult-linguistics-sentence.html' title='the most difficult linguistics sentence ever?'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_uqqKLCxLtII/TTUGmgf5UFI/AAAAAAAAAYY/I2z4Vob3EPc/s72-c/2011-01-17_18-28-12_968%255B1%255D.jpg' height='72' width='72'/><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-1159500932941666822</id><published>2011-01-16T10:40:00.000-05:00</published><updated>2011-01-16T10:40:15.022-05:00</updated><title type='text'>god awful is an odd phrase</title><content type='html'>I used the phrase &lt;i&gt;god awful&lt;/i&gt; in a comment at &lt;a href="http://languagelog.ldc.upenn.edu/nll/?p=2908#comments"&gt;Language Log&lt;/a&gt; and it occurs to me that it's an odd little creature. From the &lt;a href="http://www.oed.com/view/Entry/79630?redirectedFrom=god-awful#"&gt;OED&lt;/a&gt;*:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Pronunciation: &amp;nbsp;/ˌgɒdˈɔːfʊl/&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Forms: &amp;nbsp;Also God awful, Godawful.(Show More)&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Etymology: &amp;nbsp;&amp;lt; god n. + awful adj.&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;slang (orig. U.S.).&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&amp;nbsp;&amp;nbsp;Terrible; &lt;b&gt;extremely unpleasant&lt;/b&gt;. (In quot. 1878 the sense is ‘&lt;b&gt;impressively large&lt;/b&gt;’.)&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;1878 &amp;nbsp; &amp;nbsp;J. H. Beadle Western Wilds xxxvii. 611 &amp;nbsp; Put thirty acres‥into wheat, and went to work with a hurrah in 1874 to make a God-awful crop.&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;1897 &amp;nbsp; &amp;nbsp;C. M. Flandrau Harvard Episodes 88 &amp;nbsp; Ellis is such a God awful fool.&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;1930 &amp;nbsp; &amp;nbsp;W. S. Maugham Breadwinner ii. 124 &amp;nbsp; Your affairs are in a god-awful mess.&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;1946 &amp;nbsp; &amp;nbsp;‘S. Russell’ To Bed with Grand Music i. 14 &amp;nbsp; Listen to the most godawful programmes on the radio.&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;1958 &amp;nbsp; &amp;nbsp;R. Graves in Times Lit. Suppl. 15 Aug. p. x/4 &amp;nbsp; The credible and vivid story that any context (red-brick, yellow-brick, or otherwise God-awful) offers.&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;1959 &amp;nbsp; &amp;nbsp;P. McCutchan Storm South iv. 63, &amp;nbsp; I heard the most God-awful racket above my head.&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;The meaning is derived from using &lt;i&gt;god &lt;/i&gt;as an intensifier like &lt;i&gt;very&lt;/i&gt;. Fine, I get this analysis, it makes sense. But &lt;b&gt;is &lt;i&gt;god &lt;/i&gt;ever used in any other construction to intensifier a negative quality like &lt;i&gt;awful&lt;/i&gt;&lt;/b&gt;?&lt;br /&gt;&lt;br /&gt;This is a case where corpora are not terribly useful because the instances of god are so frequent, and so frequently NOT in this kind of construction, it's difficult to discover automatically. I could go all qualitative and just read a million phrases with god in them, but that would take a really long time and still have a low probability of success.&lt;br /&gt;&lt;br /&gt;HT to the OED for making their site freely available this month! Use name/password trynwoed/trynewoed.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-1159500932941666822?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/1159500932941666822/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=1159500932941666822' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/1159500932941666822'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/1159500932941666822'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/god-awful-is-odd-phrase.html' title='&lt;i&gt;god awful&lt;/i&gt; is an odd phrase'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-7839963945276249348</id><published>2011-01-15T22:27:00.001-05:00</published><updated>2011-01-15T22:33:23.864-05:00</updated><title type='text'>true grit phonological ambiguity</title><content type='html'>Thanks to Jeff Bridges' now infamous mumbling performance, the clever folks at College Humor give True Grit a version of the &lt;a href="http://thelousylinguist.blogspot.com/2009/12/lip-reading-response.html"&gt;lip reading treatment&lt;/a&gt; that Star Trek received not too long ago.&lt;br /&gt;&lt;br /&gt;&lt;object data="http://www.collegehumor.com/moogaloop/moogaloop.swf?clip_id=1945368&amp;amp;fullscreen=1" height="360" type="application/x-shockwave-flash" width="640"&gt;&lt;param name="allowfullscreen" value="true"/&gt;&lt;param name="wmode" value="transparent"/&gt;&lt;param name="allowScriptAccess" value="always"/&gt;&lt;param name="movie" quality="best" value="http://www.collegehumor.com/moogaloop/moogaloop.swf?clip_id=1945368&amp;fullscreen=1"/&gt;&lt;embed src="http://www.collegehumor.com/moogaloop/moogaloop.swf?clip_id=1945368&amp;fullscreen=1" type="application/x-shockwave-flash" wmode="transparent"  width="640" height="360"  allowScriptAccess="always"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;br /&gt;&lt;div style="padding: 5px 0; text-align: center; width: 640px;"&gt;See more &lt;a href="http://www.collegehumor.com/videos"&gt;funny videos&lt;/a&gt; and &lt;a href="http://www.collegehumor.com/pictures"&gt;funny pictures&lt;/a&gt; at &lt;a href="http://www.collegehumor.com/"&gt;CollegeHumor&lt;/a&gt;.&lt;/div&gt;&lt;br /&gt;apologies&amp;nbsp;for the weird embedding, I don't know how to fix it (I just pasted the &lt;i&gt;embed &lt;/i&gt;code into the Blogger HTML with no option to adjust size)...and yes, I'll have some more of that woop woop, please...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-7839963945276249348?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/7839963945276249348/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=7839963945276249348' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7839963945276249348'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7839963945276249348'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/true-grit-phonological-ambiguity.html' title='true grit phonological ambiguity'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-2115713102416527111</id><published>2011-01-13T15:57:00.000-05:00</published><updated>2011-01-13T15:57:27.208-05:00</updated><title type='text'>how distinctive is app store?</title><content type='html'>Microsoft is &lt;a href="http://www.huffingtonpost.com/2011/01/12/microsoft-apple-app-store_n_807827.html"&gt;arguing&lt;/a&gt; that Apple cannot trademark the term &lt;i&gt;app store&lt;/i&gt; because it is a generic term.&lt;br /&gt;&lt;br /&gt;"&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;An 'app store' is an 'app store'," Russell Pangborn, Microsoft's associate general counsel, said, according to the BBC. "Like 'shoe store' or 'toy store', it is a generic term that is commonly used by companies, governments and individuals that offer apps&lt;/span&gt;&lt;/i&gt;."&lt;br /&gt;&lt;br /&gt;A commenter at &lt;a href="http://news.ycombinator.com/item?id=2097156"&gt;Hacker News&lt;/a&gt; begs to differ:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Ngram data shows no usage of "App Store" or "app store" from the time of 1800 to 2008. I was suspicious of this, but using the terms "app,store" separately produced lots of data points. My tentative hypothesis is that Ngram is using data that existed before the App Store went public and thus will not show up in Ngram&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;I'm no trademark expert, but the basic idea, as Wikipedia defines it, is &lt;a href="http://en.wikipedia.org/wiki/Trademark_distinctiveness"&gt;distinctiveness&lt;/a&gt;:&amp;nbsp;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;A trademark may be eligible for registration, or registrable, if amongst other things it performs the essential trademark function, and has distinctive character. Registrability can be understood as a continuum, with "inherently distinctive" marks at one end, "generic" and "descriptive" marks with no distinctive character at the other end, and "suggestive" and "arbitrary" marks lying between these two points&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;First, I used BYU's Corpus of Contemporary American English and found an instance in 2009 of 'app store" being used to describe Zune's product: &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Oh, the Zune has an app store, all right. As of today, there are exactly nine programs in the Zune App Store&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;A quick google search reveals that it commonly gets applied to non-Apple related products as well:&amp;nbsp;&lt;a href="http://techcrunch.com/2010/09/27/amazon-android-app-store/"&gt;Yep, Amazon Launching Their Own App Store For Android Too&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;While it may be the case that Apple introduced the term in 2008, it seems to have expanded to generic use in less than a year and now gets used at least semi-regularly for non-Apple products. I'm not an Apple user myself and my own reading of &lt;i&gt;app store&lt;/i&gt; is definitely generic. It does not distinctly mean Apple's product at all, to me. I have no clue if a court would agree.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-2115713102416527111?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/2115713102416527111/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=2115713102416527111' title='11 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/2115713102416527111'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/2115713102416527111'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/how-distinctive-is-app-store.html' title='how distinctive is &lt;i&gt;app store&lt;/i&gt;?'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>11</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-3695340553839749494</id><published>2011-01-13T11:33:00.002-05:00</published><updated>2011-01-13T11:36:24.744-05:00</updated><title type='text'>do you despise eReaders and have tons of extra cash</title><content type='html'>...then this is for you: &lt;i&gt;&lt;a href="http://kottke.org/11/01/penguin-classics-ultra-mega-super-121-gigawatts-box-set"&gt;The Penguin Classics Complete Library&lt;/a&gt; is a massive box set consisting of nearly every Penguin Classics book ever published and is available on Amazon for only (only!) $13,413.30&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;A rundown:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;1,082 titles&lt;/li&gt;&lt;li&gt;laid end to end they would hit the 52-mile mark&lt;/li&gt;&lt;li&gt;700 pounds in weight&lt;/li&gt;&lt;li&gt;828 feet if you stacked them&amp;nbsp;&lt;/li&gt;&lt;li&gt;They arrived in 25 boxes&lt;/li&gt;&lt;/ul&gt;My only complaint would be that Penguin Classics tend to be crappy books physically.&lt;br /&gt;&lt;br /&gt;HT Kottke.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-3695340553839749494?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/3695340553839749494/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=3695340553839749494' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3695340553839749494'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3695340553839749494'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/do-you-despise-ereaders-and-have-tons.html' title='do you despise eReaders and have tons of extra cash'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-4686785693755412644</id><published>2011-01-11T12:07:00.001-05:00</published><updated>2011-01-13T10:04:05.913-05:00</updated><title type='text'>doggie do do at the the HuffPo</title><content type='html'>The Huffington Post is resetting the bar for astoundingly stupid science reporting: &lt;a href="http://www.huffingtonpost.com/2011/01/11/dog-breaks-vocabulary-rec_n_804728.html"&gt;They report on &lt;/a&gt;a dog, Chaser, who has been trained to accurately fetch over 1000 toys by sound of the name and conclude that the dog's abilities, wait for it, &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;place her at an intelligence level equivalent to a three-year-old human child&lt;/span&gt;&lt;/i&gt;!&lt;br /&gt;&lt;br /&gt;My oh my, their view of the cognitive ability of 3 year olds is as depressing as it is profoundly wrong. Sorry, 3 year old humans can do more than make one-to-one correspondences between sounds and objects. They can, for example, recognize that the sound &lt;i&gt;swing&lt;/i&gt; can mean an object with a seat attached to ropes OR the action you perform when you move your body back and forth on that thing with the ropes, they can watch TV and follow plot developments, ... sigh, I mean fuck it, it's not worth debunking ...&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: orange;"&gt;UPDATE&lt;/span&gt;&lt;/b&gt;: Sean at &lt;i&gt;Replicated Typo&lt;/i&gt; &lt;a href="http://www.replicatedtypo.com/uncategorized/dog-exhibits-mutual-exclusivity-bias/?utm_source=feedburner&amp;amp;utm_medium=feed&amp;amp;utm_campaign=Feed:+ResearchBloggingAllEnglish+(Research+Blogging+-+English+-+All+Topics)"&gt;reviews the original research&lt;/a&gt; involving Chaser.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-4686785693755412644?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/4686785693755412644/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=4686785693755412644' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/4686785693755412644'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/4686785693755412644'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/doggie-do-do-at-the-huffpo.html' title='doggie do do at the the HuffPo'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-5079812176716695479</id><published>2011-01-11T09:22:00.001-05:00</published><updated>2011-01-11T09:23:13.255-05:00</updated><title type='text'>Do rich families talk to their kids more than poor families?</title><content type='html'>Are &lt;i&gt;Children in professional families talked to three times as much as the children in welfare families?&lt;/i&gt; That's the underlying assumption behind &lt;a href="http://www.med.nyu.edu/biosketch/alm5"&gt;a new program at Bellevue hospital&lt;/a&gt; designed to coach "&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;poor families on how to talk to their infant children, encouraging more interaction&lt;/span&gt;&lt;/i&gt;."&lt;br /&gt;&lt;br /&gt;At least, that's how the &lt;a href="http://www.huffingtonpost.com/2011/01/10/alan-mendelsohn-of-new-yo_n_806911.html"&gt;Huffington Post&lt;/a&gt; wants you to think about this story:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;University of Kansas graduate student Betty Hart and her professor, Todd Risley, wanted to figure out the cause of the education gap between the rich and poor. So, they targeted early education and headed a study that recorded the first three years of 40 infants' lives. The conclusion? Rich families talk to their kids more than poor families&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;Pretty impressive, huh? Sounds cutting edge, right? With a little searching &lt;a href="http://www.gpb.org/news/2011/01/10/closing-the-achievement-gap-with-baby-talk"&gt;I discovered the following&lt;/a&gt;:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Betty Hart was a grad student at KU in the 1960s.&lt;/li&gt;&lt;li&gt;The research data for this study was collected in the early 1980s.&lt;/li&gt;&lt;li&gt;The paper publishing these results was &lt;a href="http://www.childrenofthecode.org/interviews/risley.htm"&gt;published&lt;/a&gt; in 1995.&lt;/li&gt;&lt;/ul&gt;I have no problem with the common sense underlying these notions: talking to babies a lot helps them achieve higher success in academics later in life. Good advice all around, no doubt. But I'm suspicious of several assumptions about the finding of the original paper. From &lt;a href="http://www.gpb.org/news/2011/01/10/closing-the-achievement-gap-with-baby-talk"&gt;Alix Spiegel&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;According to their research, the average child in a welfare home heard about 600 words an hour while a child in a professional home heard 2,100. "Children in professional families are &lt;b&gt;talked to&lt;/b&gt; three times as much as the average child in a welfare family," Hart says&lt;/span&gt;&lt;/i&gt; [emphasis added].&lt;br /&gt;&lt;br /&gt;Hearing words in your environment and &lt;i&gt;talking to&lt;/i&gt; children are two different things and need to be distinguished, as well as &lt;a href="http://en.wikipedia.org/wiki/Baby_talk"&gt;child-directed speech&lt;/a&gt;. All I have are secondary sources not the 1995 book (Spiegel's article is the most thorough) so I can't tell how the data was coded and what they looked for (did the make the above three distinctions?).&lt;br /&gt;&lt;br /&gt;But more to the point is the contemporary rush to paint these old findings as rationale to create new programs aimed at poor parents as if &lt;i&gt;being poor makes your language use wrong somehow&lt;/i&gt;. It strikes me as convoluted logic to take a 15 year old book (based on 20 year old data) and decide that poor parents need linguistic intervention. Exactly how much grant money did Dr. Mendelsohn spend on this program? Even if the 3-1 ratio holds true (I suspect it would not under close scrutiny), what other factors might be affecting this?&lt;br /&gt;&lt;br /&gt;It struck me that people with basically good intentions took a small amount of science out of context and used it to reinforce class stereotypes and class pressure.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-5079812176716695479?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/5079812176716695479/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=5079812176716695479' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5079812176716695479'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5079812176716695479'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/do-rich-families-talk-to-their-kids.html' title='&lt;i&gt;Do rich families talk to their kids more than poor families?&lt;/i&gt;'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-8900725606159166319</id><published>2011-01-10T08:33:00.000-05:00</published><updated>2011-01-10T08:33:00.301-05:00</updated><title type='text'>The Psychological Functions of Function Words</title><content type='html'>Here is Chung &amp;amp; Pennebaker's 2007 paper on function words which crucially relies on Pennebakers' LWIC data:&amp;nbsp;&lt;a href="http://homepage.psy.utexas.edu/homepage/faculty/pennebaker/reprints/Chung&amp;amp;JWP.pdf"&gt;The Psychological Functions of Function Words&lt;/a&gt; (pdf). I have long felt that function words have been wrongly ignored by computational linguists and SEO specialists. While the use of &lt;a href="http://en.wikipedia.org/wiki/Stop_words"&gt;stop lists&lt;/a&gt; have sped up processing time considerably, they have also wiped out huge amounts of semantically meaningful data.&amp;nbsp;Nonetheless, I also feel the Pennebaker's &lt;a href="http://www.liwc.net/"&gt;LWIC corpus&lt;/a&gt; is not as transparent or as comprehensive as I would prefer it to be.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-8900725606159166319?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/8900725606159166319/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=8900725606159166319' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8900725606159166319'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8900725606159166319'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/psychological-functions-of-function.html' title='The Psychological Functions of Function Words'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-4330536189591556925</id><published>2011-01-08T21:32:00.000-05:00</published><updated>2011-01-08T21:32:55.899-05:00</updated><title type='text'>biggest linguistics story of 2010?</title><content type='html'>I have nothing but respect and admiration for Erin McKean,&amp;nbsp;CEO and Co-Founder of the awesome &lt;a href="http://www.wordnik.com/"&gt;Wordnik&lt;/a&gt; project as well as the person who has given by far the single greatest &lt;a href="http://www.ted.com/talks/view/id/161"&gt;lingo-TED-talk ever&lt;/a&gt;; nonetheless, I take exception to her most recent column in the Boston Globe titled &lt;a href="http://www.boston.com/bostonglobe/ideas/articles/2011/01/09/the_year_in_language/"&gt;The year in language&lt;/a&gt; which is an article about the best and worst language stories of 2010. She notes many worthy events, yet...&lt;br /&gt;&lt;br /&gt;With no offense meant, I can say that I was shocked, SHOCKED! to discover that no mention whatsoever was made of what I consider to be the single most important and shocking linguistics related story of 2010: the revelation that Harvard's Marc Hauser fabricated data regarding rule learning by monkeys. For years, Hauser has posed as a giant in the Chomsky camp, and created an ivy-league cottage industry based on his research. 2010's revelations of his &lt;i&gt;still-unclear-yet-nonetheless-obvious-forgery&lt;/i&gt; is a shock-wave whose full power and ramifications have yet to be fully understood. Plus, it was the Boston Globe itself, the paper Erin publishes in, that broke &lt;a href="http://www.boston.com/news/local/breaking_news/2010/08/journal_editor.html"&gt;the original story&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Language Log's extensive discussions of the Hauser story can be found &lt;a href="http://www.google.com/search?q=hauser+site:http://languagelog.ldc.upenn.edu/nll/&amp;amp;hl=en&amp;amp;num=10&amp;amp;lr=&amp;amp;ft=i&amp;amp;cr=&amp;amp;safe=images&amp;amp;tbs="&gt;here&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-4330536189591556925?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/4330536189591556925/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=4330536189591556925' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/4330536189591556925'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/4330536189591556925'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/biggest-linguistics-story-of-2010.html' title='biggest linguistics story of 2010?'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-1965805585340117155</id><published>2011-01-08T13:34:00.000-05:00</published><updated>2011-01-08T13:34:24.683-05:00</updated><title type='text'>replace QWERTY with little circles?</title><content type='html'>Android users can look forward to a new typing layout specifically designed for one handed, hand-held device typing by &lt;a href="http://www.the8pen.com/"&gt;8pen&lt;/a&gt;. There have long been alternatives to the traditional QWERTY layout, but this one replaces keys with hand motion, so rather than landing your finger on the letter you want to type (the conceptual foundation of most keyboard concepts) this one rests on the idea that you make little circles on the screen while different letters are accessed.&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_uqqKLCxLtII/TSitwimU_EI/AAAAAAAAAYU/pTUPeGGtuuQ/s1600/android_typing.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="239" src="http://2.bp.blogspot.com/_uqqKLCxLtII/TSitwimU_EI/AAAAAAAAAYU/pTUPeGGtuuQ/s320/android_typing.jpg" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;In the words of the horse from &lt;a href="http://www.google.com/images?rlz=1C1RNNN_enUS357US361&amp;amp;q=ren+and+stimpy+horse&amp;amp;um=1&amp;amp;ie=UTF-8&amp;amp;source=univ&amp;amp;ei=DqsoTavpLoSKlweC3NSsAQ&amp;amp;sa=X&amp;amp;oi=image_result_group&amp;amp;ct=title&amp;amp;resnum=4&amp;amp;ved=0CDsQsAQwAw&amp;amp;biw=1280&amp;amp;bih=709"&gt;Ren and Stimpy&lt;/a&gt;, &lt;i&gt;no sir, I don't like it&lt;/i&gt;. Why not?&lt;br /&gt;&lt;br /&gt;While inefficient and clumsy, the classic idea of touching the letter you want is fundamentally natural and clear. Any child or lazy adult can grasp it immediately. The little circles idea creates an artificial and unnatural interface that puts you multiple steps away from what you want. I'm not trying to make circles, I'm trying to type a frikkin &lt;i&gt;k&lt;/i&gt;. I'm sure with practice anyone could get good at this, but I don't wanna practice typing for frik's sake! That's why I've been a clumsy hunt and pecker for 30 years with the damned QWERTY. I could have practiced typing on this damn thing also, but I didn't for the same reason I'm not gonna practice the little circles: I'm lazy. But at least with keys I can just touch the letter I want and get it. It's clear and obvious. I'm sure the little circles would drive me mad.&lt;br /&gt;&lt;br /&gt;&lt;object height="385" width="640"&gt;&lt;param name="movie" value="http://www.youtube.com/v/q3OuCR0EpGo?fs=1&amp;amp;hl=en_US"&gt;&lt;/param&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;/param&gt;&lt;param name="allowscriptaccess" value="always"&gt;&lt;/param&gt;&lt;embed src="http://www.youtube.com/v/q3OuCR0EpGo?fs=1&amp;amp;hl=en_US" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="640" height="385"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-1965805585340117155?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/1965805585340117155/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=1965805585340117155' title='11 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/1965805585340117155'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/1965805585340117155'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/replace-qwerty-with-little-circles.html' title='replace QWERTY with little circles?'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_uqqKLCxLtII/TSitwimU_EI/AAAAAAAAAYU/pTUPeGGtuuQ/s72-c/android_typing.jpg' height='72' width='72'/><thr:total>11</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-4475734472020730041</id><published>2011-01-07T12:24:00.000-05:00</published><updated>2011-01-07T12:24:35.896-05:00</updated><title type='text'>adults process language in a baby way!</title><content type='html'>Do babies process language in a "grown-up" way? First, read this from &lt;a href="http://www.eurekalert.org/pub_releases/2011-01/uoc--bpl010611.php"&gt;UCSD&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Babies, even those too young to talk, can understand many of the words that adults are saying – and their brains process them in a &lt;b&gt;grown-up &lt;/b&gt;way.&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Combining the cutting-edge technologies of MRI and MEG, scientists at the University of California, San Diego show that babies just over a year old process words they hear with &lt;b&gt;the same&lt;/b&gt; brain structures as adults, and in &lt;b&gt;the same&lt;/b&gt; amount of time. Moreover, the researchers found that babies were not merely processing the words as sounds, but were capable of grasping their meaning&lt;/span&gt;&lt;/i&gt;&amp;nbsp;[emphasis added].&lt;br /&gt;&lt;br /&gt;It certainly is an interesting finding to discover that infant and adult lexical processing may be similar, but why couch it in asymmetrical&amp;nbsp;phrasing? Given the facts as this press release states them, could we equally as well say that &lt;i&gt;adults process language in a baby way&lt;/i&gt;? This wouldn't get any press attention,&amp;nbsp;though, would it. Or worse, it would be mocked. The author of the press release,&amp;nbsp;Debra Kain, is referred to as a &lt;i&gt;spokesperson &lt;/i&gt;for the&amp;nbsp;UCSD Medical Center in &lt;a href="http://www.signonsandiego.com/news/2010/jan/21/ucsd-medical-center-bonuses-create-stir/"&gt;this article&lt;/a&gt;. But it's not clear she consulted Jeff Elmen, a very well respected cognitive scientist who participated in the research. I'm not sure how comfortable he would have been with the somewhat excitable language.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-4475734472020730041?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/4475734472020730041/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=4475734472020730041' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/4475734472020730041'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/4475734472020730041'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/adults-process-language-in-baby-way.html' title='adults process language in a baby way!'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-8230538020647369897</id><published>2011-01-06T11:35:00.001-05:00</published><updated>2011-01-06T11:37:48.597-05:00</updated><title type='text'>annals of unnecessary censorship, literary canon edition</title><content type='html'>&lt;a href="http://www.publishersweekly.com/pw/by-topic/industry-news/publisher-news/article/45645-upcoming-newsouth-huck-finn-eliminates-the-n-word.html"&gt;Upcoming NewSouth 'Huck Finn' Eliminates the 'N' Word&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Twain scholar Alan Gribben and NewSouth Books plan to release a version of Huckleberry Finn, in a single volume with The Adventures of Tom Sawyer, that does away with the "n" word (as well as the "in" word, "Injun") by replacing it with the word "slave."&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;div&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&amp;nbsp;[...]&lt;/span&gt;&lt;/i&gt;&lt;/div&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;"What he suggested," said La Rosa, "was that there was a market for a book in which the n-word was switched out for something less hurtful, less controversial. We recognized that some people would say that this was censorship of a kind, but our feeling is that there are plenty of other books out there—all of them, in fact—that faithfully replicate the text, and that this was simply an option for those who were increasingly uncomfortable, as he put it, insisting students read a text which was so incredibly hurtful&lt;/span&gt;&lt;/i&gt;."&lt;br /&gt;&lt;br /&gt;I'm curious about this notion of replacement as an "option" for two reasons. First, it reminds me of Ted Turner's infamous and ill-fated 1980s colorization project whereby he went back and artificially colorized black and white movies. As I recall, Turner also spoke of it as an "option", but it failed miserably as a cultural movement. Second, now that eReaders are becoming commonplace I wonder if publishers will begin to offer &lt;i&gt;sanitized &lt;/i&gt;versions of books as an option. I don't have an eReader, so maybe this is already available, but I could imagine a filter that you click on and magically Henry Miller's &lt;a href="http://en.wikipedia.org/wiki/Tropic_of_Cancer_(novel)"&gt;Tropic of Cancer&lt;/a&gt; becomes a weirdly different novel.&lt;br /&gt;&lt;br /&gt;HT &lt;a href="http://kottke.org/11/01/no-nigger-in-new-edition-of-huck-finn"&gt;kottke&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-8230538020647369897?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/8230538020647369897/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=8230538020647369897' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8230538020647369897'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8230538020647369897'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/annals-of-unnecessary-censorship.html' title='annals of unnecessary censorship, literary canon edition'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-952452042025897281</id><published>2011-01-05T12:07:00.000-05:00</published><updated>2011-01-05T12:07:09.535-05:00</updated><title type='text'>jobs for linguists</title><content type='html'>As the economy slowly starts to wake, I hope and expect to see more &lt;a href="http://linguistlist.org/jobs/get-jobs.cfm?JobID=82340&amp;amp;SubID=3797862"&gt;jobs like this one&lt;/a&gt; popping up where general linguistics skills are being sought by innovative tech companies (these were a dime a dozen in the glory days of the tech boom 90s). Were I a bit younger, and less well-payed, I'd probably consider applying myself.&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;We are seeking a Linguist interested in joining a rapidly growing organization. The Linguist will work closely with our NLP Team in researching and developing lexica and grammars specific to various  languages (“Language Packs”) that will be used for various NLP tasks. She/he will be expected to&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;contribute substantive insight/action with regard to developing language packs and must have a keen eye for understanding the end-user experience.&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Specific responsibilities include:&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;- Research specific languages for their lexical, morphological, and&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;grammatical structures&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;- Develop original lexicons and reformat acquired lexicons&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;- Create grammatical rules using the research done above or other sources&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;- Analyze results from the system for mistakes and plan for improvement&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;- Willingness to focus research and development of Language Packs on&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;meeting the end-user’s needs&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;If you're a linguist interested in a non-academic career, you could do worse than &lt;a href="http://tbe.taleo.net/NA4/ats/careers/requisition.jsp?org=CLARABRIDGE&amp;amp;cws=2&amp;amp;rid=106"&gt;apply here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;And for the record, I have no association with this company, have never worked for them, get nothing from posting this, but I do know one of their employees (we went to grad school together).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-952452042025897281?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/952452042025897281/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=952452042025897281' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/952452042025897281'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/952452042025897281'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/jobs-for-linguists.html' title='jobs for linguists'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-7606173639485999000</id><published>2011-01-04T10:53:00.000-05:00</published><updated>2011-01-04T10:53:00.997-05:00</updated><title type='text'>the germans fear my language too, muahahaha</title><content type='html'>It's a mighty era to be a native speaker of English. It seems the world fears my language and is instituting fruitless policies to protect their languages against my own. First &lt;a href="http://languagelog.ldc.upenn.edu/nll/?p=2858"&gt;the Chinese banned English words&lt;/a&gt; and phrases. Now, the Germans are getting on the &lt;a href="http://www.independent.co.uk/news/world/europe/denglish-now-verboten-2171795.html"&gt;banning bandwagon&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Germany's Transport Minister claimed to have struck an important blow for the preservation of the German language yesterday after enforcing a strict ban on the use of all English words and phrases within his ministry.&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/i&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Peter Ramsauer stopped his staff from using more than 150 English words and expressions that have crept into everyday German shortly after being appointed in late 2009.&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/i&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;His aim, which was backed by Chancellor Angela Merkel, was to defend his language against the spread of "Denglish" – the corruption of German with words such as "handy" for mobile phone and other expressions including "babysitten" and "downloaden". As a result, words such as "laptop", "ticket" and "meeting" are verboten in Mr Ramsauer's ministry. Instead, staff must use their German equivalents: "Klapprechner", "Fahrschein" and "Besprechung" as well as many other common English words that the minister has translated back into German&lt;/span&gt;&lt;/i&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-7606173639485999000?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/7606173639485999000/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=7606173639485999000' title='16 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7606173639485999000'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7606173639485999000'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/germans-fear-my-language-too-muahahaha.html' title='the germans fear my language too, muahahaha'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>16</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-6634005465135047992</id><published>2011-01-04T10:28:00.002-05:00</published><updated>2011-01-04T10:33:14.365-05:00</updated><title type='text'>naive bayes knows restaurants better than 5,000 mechanical turks</title><content type='html'>Yelp recently sponsored &lt;a href="http://www.kdnuggets.com/2011/01/ai-defeats-hivemind.html?utm_source=twitterfeed&amp;amp;utm_medium=twitter&amp;amp;utm_campaign=Feed:+kdnuggets-data-mining-analytics+(KDnuggets:+Data+Mining+and+Analytics)&amp;amp;utm_content=Twitter"&gt;a bake-off&lt;/a&gt; between a&amp;nbsp;Naive Bayes classifier and the&amp;nbsp;online&amp;nbsp;crowd-sourcing&amp;nbsp;site Mechanical Turk. The task was classifying web sites according to their business&amp;nbsp;category (i.e., is it a restaurant or a doctors office?). &amp;nbsp;The classifier beat the turkers handily:&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_uqqKLCxLtII/TSM7i2HArtI/AAAAAAAAAYQ/AiCV5aafn2Q/s1600/turk_classifier.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="169" src="http://2.bp.blogspot.com/_uqqKLCxLtII/TSM7i2HArtI/AAAAAAAAAYQ/AiCV5aafn2Q/s320/turk_classifier.jpg" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;Money quote:&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;In almost every case, the algorithm, which was trained on a pool of 12 million user-submitted Yelp reviews, correctly identified the category of a business a third more often than the humans. In the automotive category, the computer was twice as likely as the assembled masses to correctly identify a business&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;There are a variety of qualifications (why did 99% of Turkers who applied for the task fail the basic test? ESL issues perhaps?). But it's an interesting result.&lt;br /&gt;&lt;br /&gt;HT &lt;a href="http://www.kdnuggets.com/2011/01/ai-defeats-hivemind.html?utm_source=twitterfeed&amp;amp;utm_medium=twitter&amp;amp;utm_campaign=Feed:+kdnuggets-data-mining-analytics+(KDnuggets:+Data+Mining+and+Analytics)&amp;amp;utm_content=Twitter"&gt;kdnuggets&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-6634005465135047992?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/6634005465135047992/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=6634005465135047992' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/6634005465135047992'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/6634005465135047992'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/naive-bayes-knows-restaurants-better.html' title='naive bayes knows restaurants better than 5,000 mechanical turks'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_uqqKLCxLtII/TSM7i2HArtI/AAAAAAAAAYQ/AiCV5aafn2Q/s72-c/turk_classifier.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-6261111360965686310</id><published>2011-01-03T14:13:00.000-05:00</published><updated>2011-01-03T14:13:32.954-05:00</updated><title type='text'>how we hear ourselves speak</title><content type='html'>Science Daily has &lt;a href="http://www.sciencedaily.com/releases/2010/12/101209101515.htm"&gt;a nice article&lt;/a&gt; on new neurolinguistic research out of Cal linking auditory and speech processes:&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;"&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;We used to think that the human auditory system is mostly suppressed during speech, but we found closely knit patches of cortex with very different sensitivities to our own speech that paint a more complicated picture," said Adeen Flinker, a doctoral student in neuroscience at UC Berkeley and lead author of the study.&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;"We found evidence of millions of neurons firing together every time you hear a sound right next to millions of neurons ignoring external sounds but firing together every time you speak," Flinker added. "Such a mosaic of responses could play an important role in how we are able to distinguish our own speech from that of others&lt;/span&gt;&lt;/i&gt;."&lt;br /&gt;&lt;br /&gt;HT &lt;a href="http://linguisticsnewsfeeds.com/"&gt;Linguistic News Feeds&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-6261111360965686310?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/6261111360965686310/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=6261111360965686310' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/6261111360965686310'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/6261111360965686310'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/how-we-hear-ourselves-speak.html' title='how we hear ourselves speak'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-4449300435575910448</id><published>2011-01-03T10:42:00.000-05:00</published><updated>2011-01-03T10:42:37.923-05:00</updated><title type='text'>the evolution of journalistic quotes</title><content type='html'>They're getting &lt;a href="http://www.boston.com/bostonglobe/ideas/articles/2011/01/02/the_incredible_shrinking_sound_bite/?page=full"&gt;shorter&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;According to a new article in the academic journal Journalism Studies by David M. Ryfe and Markus Kemmelmeier, both professors at the University of Nevada, newspaper quotations evolved in much the same way as TV sound bites. By 1916, they found, the average political quotation in a newspaper story had fallen to about half the length of the average quotation in 1892&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;(HT &lt;a href="http://andrewsullivan.theatlantic.com/the_daily_dish/2011/01/well-trimmed-quotes.html"&gt;Daily Dish&lt;/a&gt;)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-4449300435575910448?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/4449300435575910448/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=4449300435575910448' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/4449300435575910448'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/4449300435575910448'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2011/01/evolution-of-journalistic-quotes.html' title='the evolution of journalistic quotes'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-8512861860026498069</id><published>2010-12-30T14:33:00.002-05:00</published><updated>2010-12-30T15:39:30.993-05:00</updated><title type='text'>soccer vs. football</title><content type='html'>Too late for The World Cup, but thanks to Stan Carey at &lt;a href="http://stancarey.wordpress.com/2010/12/29/the-banshee-lives-in-the-handball-alley/"&gt;Sentence First&lt;/a&gt;, I only just now discovered that we Yanks are not the only English speakers who use &lt;i&gt;soccer&lt;/i&gt; to refer to, ya know, that game where you can't touch the ball with your hands (tennis? no... the one that Ronaldo plays). In fact, there are about 74 million OTHER English speakers in this world who use &lt;i&gt;soccer&lt;/i&gt; to refer to Ronaldo's game too. Add the USA's 308 million, and it is almost certainly the case that more English speakers use &lt;i&gt;soccer&lt;/i&gt; than &lt;i&gt;football&lt;/i&gt;. With that, I say &lt;i&gt;thppppt&lt;/i&gt; to the English...&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_uqqKLCxLtII/TRy_DuGduBI/AAAAAAAAAYM/QoWVeopahdI/s1600/soccer+vs.+football.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="140" src="http://2.bp.blogspot.com/_uqqKLCxLtII/TRy_DuGduBI/AAAAAAAAAYM/QoWVeopahdI/s320/soccer+vs.+football.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div style="text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: center;"&gt;(image from &lt;a href="http://en.wikipedia.org/wiki/Names_for_association_football"&gt;Wikipedia&lt;/a&gt;)&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: left;"&gt;&lt;span class="Apple-style-span" style="color: orange;"&gt;UPDATE &lt;/span&gt;[3:38PM eastern]: reader vp points out the following passage from the same Wikipedia article the image came from: &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;several official publications of the English Football Association have the word "soccer" in the title. Simon Kuper and Stefan Szymanski write that soccer was the most common name for the game in Britain from the 1890s until the 1970s, and suggest that the decline of the word soccer in the UK, and the incorrect perception that it is an Americanism, were linked to awareness of the North American Soccer League in the 1970s&lt;/span&gt;&lt;/i&gt;.&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-8512861860026498069?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/8512861860026498069/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=8512861860026498069' title='5 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8512861860026498069'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8512861860026498069'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/soccer-vs-football.html' title='&lt;i&gt;soccer&lt;/i&gt; vs. &lt;i&gt;football&lt;/i&gt;'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_uqqKLCxLtII/TRy_DuGduBI/AAAAAAAAAYM/QoWVeopahdI/s72-c/soccer+vs.+football.png' height='72' width='72'/><thr:total>5</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-5529698095274606683</id><published>2010-12-30T13:33:00.001-05:00</published><updated>2011-01-01T13:01:26.305-05:00</updated><title type='text'>plagiarism and n-grams</title><content type='html'>Big media plagiarism is once again in the news as ESPN has suspended an on-air host for &lt;a href="http://www.huffingtonpost.com/2010/12/30/will-selva-plagiarism-esp_n_802608.html"&gt; plagiarizing three sentences&lt;/a&gt; from a newspaper columnist. The on air host has admitted the plagiarism*, issued an apology, and asked for forgiveness.&lt;br /&gt;&lt;br /&gt;The multiple and confusing ethical standards for plagiarism &lt;s&gt;has&lt;/s&gt; have been the subject of of several LL posts (recently &lt;a href="http://languagelog.ldc.upenn.edu/nll/?p=2441"&gt;here&lt;/a&gt;) and this led me to wonder about what counts as plagiarism in the first place. Clearly a three sentence, 45 word passage, almost word for word identical with another, in the same semantic domain with the same referents, is a case of plagiarism. But what about a 20 word passage? 10 word? 4 word**?&lt;br /&gt;&lt;br /&gt;Many short phrases are highly frequent, right? You couldn't felicitously accuse me of plagiarism for using the phrase "&lt;i&gt;I am going&lt;/i&gt;..." could you? Even though, there can be no doubt, that someone else before me used it first. Yes, I know you can find guidelines for plagiarism in college student handbooks and such. I dealt with those for years when I taught college writing courses (and I recall flunking at least three students for plagiarism, but those were whole papers, really stupid stuff).&lt;br /&gt;&lt;br /&gt;But I wonder, now that we have a 500 million word corpus available to us, couldn't we simply compare all n-grams to discover how likely it is that any given 5-gram is repeated? I'd prefer to do this up to 20-gram and such, but wouldn't we predict that there comes a point at which the likelihood that a particular phrase was plagiarized (given that we had found two alike) would be based solely on the general likelihood that n-grams of that size are repeated. The situation would be this: you discover that a particular 11 word passage has an identical twin from 2 years ago. Without bothering to look into whether or not the author had access to the previous work, you simply look up the likelihood that any 11-gram passage is repeated and discover that there is a 0.0002% chance that a phrase that long will be repeated.&lt;br /&gt;&lt;br /&gt;With some effort, you could then derive predictions for &lt;i&gt;near identical&lt;/i&gt; passages (using WordNet and similar resources)....&lt;br /&gt;&lt;br /&gt;..just thinking out loud...&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;*I am ignorant of the role ESPN's producers play in the writing of on air speeches, but the quote seems clearly to have been written on a teleprompter at the time of speaking, which means someone else was involved, even if unwittingly. Nonetheless, the host is taking the fall willingly.&lt;br /&gt;&lt;br /&gt;**Excluding obviously famous phrases like &lt;a href="http://www.google.com/search?sourceid=chrome&amp;amp;ie=UTF-8&amp;amp;q=Ich+bin+ein+Berliner"&gt;Ich bin ein Berliner&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-5529698095274606683?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/5529698095274606683/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=5529698095274606683' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5529698095274606683'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5529698095274606683'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/plagiarism-and-n-grams.html' title='plagiarism and n-grams'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-7533019907840241861</id><published>2010-12-30T11:12:00.002-05:00</published><updated>2010-12-30T11:14:23.871-05:00</updated><title type='text'>does asbestos really mean 'unquenchable'?</title><content type='html'>Yes, at least etymologically. The &lt;a href="http://www.etymonline.com/index.php?search=asbestos&amp;amp;searchmode=none"&gt;Online Etymology Dictionary&lt;/a&gt; explains its&amp;nbsp;etymology&amp;nbsp;this way:&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;...from O.Fr. abeste, from L. asbestos "quicklime" (which "burns" when cold water is poured on it), from Gk. asbestos, lit. "inextinguishable," from &lt;b&gt;a- "not" + sbestos&lt;/b&gt;, verbal adj. from sbennynai "to quench," from PIE base *(s)gwes- "to quench, extinguish" (cf. Lith. gestu "to go out," O.C.S. gaso, Hittite kishtari "is being put out")&lt;/span&gt;&lt;/i&gt; (emphasis added).&lt;br /&gt;&lt;br /&gt;Like people, every word has lived its own peculiar and unique life. Riffing on &lt;a href="http://thelousylinguist.blogspot.com/2010/12/etymologists-unite.html"&gt;my post below&lt;/a&gt; regarding words that have the opposite meaning of their etymology, my friend Andy (who did graduate work in Classics, and hence, actually reads Greek) challenged me to help him understand why the word &lt;i&gt;asbestos&lt;/i&gt;, whose etymology literally means 'unquenchable' is used today to mean a substance that cannot burn.&lt;br /&gt;&lt;br /&gt;With some Googling, I found &lt;a href="http://virlab.virginia.edu/Nanoscience_class/lecture_notes/Lecture_14_Materials/Asbestos_CNT/Sci%20Am%20-%20Asbestos%20Revisited%20-%20July%201997.pdf"&gt;this &lt;/a&gt;(PDF): &amp;nbsp;"&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;First mention of asbestos appeared in the Greek text On Stones, written by Theophrastus, one of Aristotle’s students. Theophrastus referred to a substance that resembled rotten wood and burned (right) without being harmed when doused with oil&lt;/span&gt;&lt;/i&gt;."&lt;br /&gt;&lt;br /&gt;So, Ol' Theophrastus kept pouring oil onto this stuff, but it never burnt, so he kept pouring, but the stuff was never &lt;i&gt;quenched &lt;/i&gt;by oil/fire. Hence, it was &lt;i&gt;unquenchable&lt;/i&gt;. That's my story and I'm sticking to it (for now).&lt;br /&gt;&lt;br /&gt;Andy did some follow-up of his own and provides the following:&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Yes, that's one of the more likely explanations. In my research I came across the use of asbestos as permanent wicks in lamps, but never noted the bit about being unquenchable with oil. &amp;nbsp;That Theophrastos citation really belongs in the dictionary entry below, as it's the only cite that explains the meaning under A. &amp;nbsp;The lexicon below is massively comprehensive (if you couldn't tell) so it's odd they missed Theo.&lt;/span&gt;&lt;/i&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/i&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;The other possible explanation is II. or "unslaked lime", as quick lime burns underwater. &amp;nbsp;This was a key component in later "Greek fire", but so far I haven't been able to find any ancient source that cites an unquenchable substance (Greek Fire dates to 500 AD, white phosphorus, which also burns underwater, dates to 1600 AD, and sodium, which explodes on contact with water, dates to 1800 AD).&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/i&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;If I had the time and language skill I used to have I would search my CD of all Greek text up to 600 AD for cites of asbestos and then comb thru them, but that would be a day's worth of work &amp;nbsp;I'm pleased that we got close to the meaning in online research and I'm not sure that looking up every instance of asbestos would change anything&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;Andy also provided the following reference&lt;br /&gt;&lt;br /&gt;&lt;div&gt;&lt;/div&gt;&lt;div&gt;&lt;/div&gt;&lt;div&gt;&lt;span style="font-size: x-small;"&gt;&lt;span style="font-family: arial, helvetica, sans-serif;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;div&gt;&lt;div&gt;&lt;span style="font-size: x-small;"&gt;&lt;span style="font-family: arial, helvetica, sans-serif;"&gt;&lt;b&gt;&lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=a%29%2Fsbestos&amp;amp;la=greek" target="_blank"&gt;ἄσβεστος&lt;/a&gt;&lt;/b&gt; , &lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=on&amp;amp;la=greek&amp;amp;prior=a%29/sbestos" target="_blank"&gt;ον&lt;/a&gt;, also &lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=h&amp;amp;la=greek&amp;amp;prior=on" target="_blank"&gt;η&lt;/a&gt;, &lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=on&amp;amp;la=greek&amp;amp;prior=h" target="_blank"&gt;ον&lt;/a&gt; &lt;b&gt;&lt;a href="http://www.perseus.tufts.edu/hopper/text?doc=Perseus:abo:tlg,0012,001:16:123&amp;amp;lang=original" target="_blank"&gt;Il.16.123&lt;/a&gt;&lt;/b&gt;:— &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;/div&gt;&lt;div&gt;&lt;span style="font-size: x-small;"&gt;&lt;span style="font-family: arial, helvetica, sans-serif;"&gt;&lt;b&gt;A. &lt;/b&gt;&lt;i&gt;unquenchable, inextinguishable,&lt;/i&gt; “&lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=flo%2Fc&amp;amp;la=greek&amp;amp;prior=on" target="_blank"&gt;φλόξ&lt;/a&gt;” &lt;i&gt;Il.&lt;/i&gt; l. c.; &lt;i&gt;not quenched,&lt;/i&gt; “&lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=pu%3Dr&amp;amp;la=greek&amp;amp;prior=flo/c" target="_blank"&gt;πῦρ&lt;/a&gt; &lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=a%29%2F&amp;amp;la=greek&amp;amp;prior=pu=r" target="_blank"&gt;ἄ&lt;/a&gt;.” &lt;i&gt;D.H.3.67&lt;/i&gt;, &lt;b&gt;&lt;a href="http://www.perseus.tufts.edu/hopper/text?doc=Plut.+Num.+9&amp;amp;lang=original" target="_blank"&gt;Plu.&lt;u&gt;Num.&lt;/u&gt;9&lt;/a&gt;&lt;/b&gt;; “&lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=kle%2Fos&amp;amp;la=greek&amp;amp;prior=a%29/" target="_blank"&gt;κλέος&lt;/a&gt;” &lt;b&gt;&lt;a href="http://www.perseus.tufts.edu/hopper/text?doc=Perseus:abo:tlg,0012,002:4:584&amp;amp;lang=original" target="_blank"&gt;Od.4.584&lt;/a&gt;&lt;/b&gt;; “&lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=ge%2Flws&amp;amp;la=greek&amp;amp;prior=kle/os" target="_blank"&gt;γέλως&lt;/a&gt;” &lt;b&gt;&lt;a href="http://www.perseus.tufts.edu/hopper/text?doc=Perseus:abo:tlg,0012,001:1:599&amp;amp;lang=original" target="_blank"&gt;Il.1.599&lt;/a&gt;&lt;/b&gt;; “&lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=boh%2F&amp;amp;la=greek&amp;amp;prior=ge/lws" target="_blank"&gt;βοή&lt;/a&gt;” &lt;b&gt;&lt;a href="http://www.perseus.tufts.edu/hopper/text?doc=Perseus:abo:tlg,0012,001:11:50&amp;amp;lang=original" target="_blank"&gt;11.50&lt;/a&gt;&lt;/b&gt;; “&lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=e%29rgma%2Ftwn&amp;amp;la=greek&amp;amp;prior=boh/" target="_blank"&gt;ἐργμάτων&lt;/a&gt; &lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=a%29kti%5Cs&amp;amp;la=greek&amp;amp;prior=e%29rgma/twn" target="_blank"&gt;ἀκτὶς&lt;/a&gt; &lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=kalw%3Dn&amp;amp;la=greek&amp;amp;prior=a%29kti%5Cs" target="_blank"&gt;καλῶν&lt;/a&gt; &lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=a%29%2F&amp;amp;la=greek&amp;amp;prior=kalw=n" target="_blank"&gt;ἄ&lt;/a&gt;. &lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=ai%29ei%2F&amp;amp;la=greek&amp;amp;prior=a%29/" target="_blank"&gt;αἰεί&lt;/a&gt;” &lt;b&gt;&lt;a href="http://www.perseus.tufts.edu/hopper/text?doc=Perseus:abo:tlg,0033,004:4%283%29.42&amp;amp;lang=original" target="_blank"&gt;Pi.&lt;u&gt;I.&lt;/u&gt;4(3).42&lt;/a&gt;&lt;/b&gt;; &lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=a%29%2F&amp;amp;la=greek&amp;amp;prior=ai%29ei/" target="_blank"&gt;ἄ&lt;/a&gt;. &lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=po%2Fros&amp;amp;la=greek&amp;amp;prior=a%29/" target="_blank"&gt;πόρος&lt;/a&gt; &lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=w%29keanou%3D&amp;amp;la=greek&amp;amp;prior=po/ros" target="_blank"&gt;ὠκεανοῦ&lt;/a&gt; ocean's &lt;i&gt;ceaseless&lt;/i&gt; flow, &lt;b&gt;&lt;a href="http://www.perseus.tufts.edu/hopper/text?doc=Perseus:abo:tlg,0085,003:532&amp;amp;lang=original" target="_blank"&gt;A.&lt;u&gt;Pr.&lt;/u&gt;532&lt;/a&gt;&lt;/b&gt; (lyr.); &lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=pu%3Dr&amp;amp;la=greek&amp;amp;prior=w%29keanou=" target="_blank"&gt;πῦρ&lt;/a&gt;, of hell, &lt;b&gt;&lt;a href="http://www.perseus.tufts.edu/hopper/text?doc=Perseus:abo:tlg,0031,002:9:43&amp;amp;lang=original" target="_blank"&gt;&lt;u&gt;Ev.Marc.&lt;/u&gt;9.43&lt;/a&gt;&lt;/b&gt;. &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;/div&gt;&lt;div&gt;&lt;span style="font-size: x-small;"&gt;&lt;span style="font-family: arial, helvetica, sans-serif;"&gt;&lt;b&gt;II. &lt;/b&gt;as Subst., &lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=a%29%2Fsbestos&amp;amp;la=greek&amp;amp;prior=pu=r" target="_blank"&gt;ἄσβεστος&lt;/a&gt; (sc. &lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=ti%2Ftanos&amp;amp;la=greek&amp;amp;prior=a%29/sbestos" target="_blank"&gt;τίτανος&lt;/a&gt;), h(, &lt;i&gt;unslaked lime,&lt;/i&gt; &lt;i&gt;Dsc.5.115&lt;/i&gt;, &lt;b&gt;&lt;a href="http://www.perseus.tufts.edu/hopper/text?doc=Perseus:abo:tlg,0007,042:17&amp;amp;lang=original" target="_blank"&gt;Plu.&lt;u&gt;Sert.&lt;/u&gt;17&lt;/a&gt;&lt;/b&gt;, &lt;b&gt;&lt;a href="http://www.perseus.tufts.edu/hopper/text?doc=Perseus:abo:tlg,0007,041:16&amp;amp;lang=original" target="_blank"&gt;&lt;u&gt;Eum.&lt;/u&gt;16&lt;/a&gt;&lt;/b&gt;; “&lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=a%29%2F&amp;amp;la=greek&amp;amp;prior=ti/tanos" target="_blank"&gt;ἄ&lt;/a&gt;. &lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=koni%2Fa&amp;amp;la=greek&amp;amp;prior=a%29/" target="_blank"&gt;κονία&lt;/a&gt;” &lt;i&gt;Lyc.&lt;/i&gt; ap. &lt;i&gt;Orib.8.25.16&lt;/i&gt;. &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;/div&gt;&lt;div&gt;&lt;span style="font-size: x-small;"&gt;&lt;span style="font-family: arial, helvetica, sans-serif;"&gt;&lt;b&gt;2. &lt;/b&gt;a mineral or gem, &lt;b&gt;&lt;a href="http://www.perseus.tufts.edu/hopper/text?doc=Plin.+Nat.+37.146&amp;amp;lang=original" target="_blank"&gt;Plin.&lt;u&gt;HN&lt;/u&gt;37.146&lt;/a&gt;&lt;/b&gt;. &lt;b&gt;&lt;a href="http://www.perseus.tufts.edu/hopper/morph?l=a%29sbestw%2Fdhs&amp;amp;la=greek&amp;amp;prior=koni/a" target="_blank"&gt;ἀσβεστώδης&lt;/a&gt;:&lt;/b&gt; &lt;i&gt;tofus,&lt;/i&gt; &lt;i&gt;Gloss.&lt;/i&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;Henry George Liddell. Robert Scott. A Greek-English Lexicon. revised and augmented throughout by. Sir Henry Stuart Jones. with the assistance of. Roderick McKenzie. Oxford. Clarendon Press. 1940.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-7533019907840241861?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/7533019907840241861/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=7533019907840241861' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7533019907840241861'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7533019907840241861'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/does-asbestos-really-mean-unquenchable.html' title='does &lt;i&gt;asbestos&lt;/i&gt; really mean &apos;unquenchable&apos;?'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-2581285147040897408</id><published>2010-12-29T15:23:00.000-05:00</published><updated>2010-12-29T15:23:14.441-05:00</updated><title type='text'>etymologists , unite!</title><content type='html'>A buddy wrote me an interesting question (to which I did not have an answer):&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;It's been driving me crazy, is there a term of art for when the etymological root of a word is the opposite of the word's modern meaning? &amp;nbsp;For example, asbestos means "an unquenchable fire"; philander means "a lover of men" etc. Cheers, A.&lt;/span&gt;&lt;/i&gt;,&lt;br /&gt;&lt;br /&gt;Anyone know this?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-2581285147040897408?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/2581285147040897408/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=2581285147040897408' title='5 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/2581285147040897408'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/2581285147040897408'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/etymologists-unite.html' title='etymologists , unite!'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>5</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-7072184359917615073</id><published>2010-12-29T09:34:00.001-05:00</published><updated>2010-12-29T15:39:17.003-05:00</updated><title type='text'>dialects map</title><content type='html'>Extremely detailed &lt;a href="http://aschmann.net/AmEng/#Au_BritishColumbia"&gt;North American English Dialects, Based on Pronunciation Patterns&lt;/a&gt;. The site could use a bit of a web re-design ... looks circa 1999. Anyone care to offer free web design help to clean up this otherwise useful resource a little?&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_uqqKLCxLtII/TRtGUVzYg7I/AAAAAAAAAYI/eVh64-QBvfo/s1600/dialect+map.gif" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="275" src="http://1.bp.blogspot.com/_uqqKLCxLtII/TRtGUVzYg7I/AAAAAAAAAYI/eVh64-QBvfo/s320/dialect+map.gif" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-7072184359917615073?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/7072184359917615073/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=7072184359917615073' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7072184359917615073'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7072184359917615073'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/extremely-detailed-north-american.html' title='dialects map'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_uqqKLCxLtII/TRtGUVzYg7I/AAAAAAAAAYI/eVh64-QBvfo/s72-c/dialect+map.gif' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-3042984762542271438</id><published>2010-12-28T13:52:00.001-05:00</published><updated>2010-12-29T08:55:24.661-05:00</updated><title type='text'>not any or not one??</title><content type='html'>The NYTs recent&amp;nbsp;&lt;a href="http://topics.blogs.nytimes.com/2010/12/28/the-number-of-none/"&gt;The Number of None&lt;/a&gt; grammar blog post brings up an interesting question: is &lt;i&gt;none &lt;/i&gt;semantically closer to &lt;i&gt;not any&lt;/i&gt; or &lt;i&gt;not one&lt;/i&gt;? And what should its morphosyntactic agreement be, singular or plural?&lt;br /&gt;&lt;br /&gt;The Times takes the &lt;i&gt;not any&lt;/i&gt;, plural position, but I am inclined to disagree based on my intuition about substitution. Below are the two sentences the Times uses to illustrate:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;None of the interim employers or temporary agencies have contributed to a 401(k)&lt;/li&gt;&lt;li&gt;None of the works have gained a foothold in the seasonal repertory.&lt;/li&gt;&lt;/ul&gt;Now, with the substitutions and my personal acceptability rating (where * means mildly unacceptable/not sure and ** means completely unacceptable).&lt;br /&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;/div&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Not one&lt;/b&gt; of the interim employers or temporary agencies &lt;b&gt;has&lt;/b&gt; contributed to a 401(k)&lt;/li&gt;&lt;li&gt;&lt;b&gt;Not one&lt;/b&gt; of the works &lt;b&gt;has&lt;/b&gt; gained a foothold in the seasonal repertory.&lt;/li&gt;&lt;li&gt;**&lt;b&gt;Not any&lt;/b&gt; of the interim employers or temporary agencies &lt;b&gt;have&lt;/b&gt; contributed to a 401(k)&lt;/li&gt;&lt;li&gt;**&lt;b&gt;Not any&lt;/b&gt; of the works &lt;b&gt;have&lt;/b&gt; gained a foothold in the seasonal repertory.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Not one&lt;/b&gt;&amp;nbsp;of the interim employers or temporary agencies &lt;b&gt;have&lt;/b&gt; contributed to a 401(k)&lt;/li&gt;&lt;li&gt;&lt;b&gt;Not one&lt;/b&gt; of the works &lt;b&gt;have&lt;/b&gt; gained a foothold in the seasonal repertory.&lt;/li&gt;&lt;li&gt;**&lt;b&gt;Not any&lt;/b&gt; of the interim employers or temporary agencies &lt;b&gt;has&lt;/b&gt; contributed to a 401(k)&lt;/li&gt;&lt;li&gt;**&lt;b&gt;Not any&lt;/b&gt; of the works &lt;b&gt;has&lt;/b&gt; gained a foothold in the seasonal repertory.&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;The above ratings suggest that I make no distinction in acceptability between &lt;i&gt;none has&lt;/i&gt; and &lt;i&gt;none have&lt;/i&gt;. But wait, there's more. Let's remove the lengthy PP and see how this pans out:&lt;/div&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;*Not one&lt;/b&gt; of them &lt;b&gt;has&lt;/b&gt; contributed to a 401(k)&lt;/li&gt;&lt;li&gt;&lt;b&gt;*Not one&lt;/b&gt; of them &lt;b&gt;has&lt;/b&gt; gained a foothold in the seasonal repertory.&lt;/li&gt;&lt;li&gt;**&lt;b&gt;Not any&lt;/b&gt; of them &lt;b&gt;have&lt;/b&gt; contributed to a 401(k)&lt;/li&gt;&lt;li&gt;**&lt;b&gt;Not any&lt;/b&gt; of them &lt;b&gt;have&lt;/b&gt; gained a foothold in the seasonal repertory.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Not one&lt;/b&gt; of them &lt;b&gt;have&lt;/b&gt; contributed to a 401(k)&lt;/li&gt;&lt;li&gt;&lt;b&gt;Not one&lt;/b&gt; of them &lt;b&gt;have &lt;/b&gt; gained a foothold in the seasonal repertory.&lt;/li&gt;&lt;li&gt;**&lt;b&gt;Not any&lt;/b&gt; of them &lt;b&gt;has&lt;/b&gt; contributed to a 401(k)&lt;/li&gt;&lt;li&gt;**&lt;b&gt;Not any&lt;/b&gt; of them &lt;b&gt;has&lt;/b&gt; gained a foothold in the seasonal repertory.&lt;/li&gt;&lt;/ul&gt;I seem to slightly prefer the singular reading when the word &lt;i&gt;none &lt;/i&gt;is close to the verb but with a plural noun heading the PP. But this is not true if we delete the PP altogether:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Not one has&lt;/b&gt; contributed to a 401(k)&lt;/li&gt;&lt;li&gt;&lt;b&gt;Not one has&lt;/b&gt; gained a foothold in the seasonal repertory.&lt;/li&gt;&lt;li&gt;&lt;b&gt;*Not one have&lt;/b&gt; contributed to a 401(k)&lt;/li&gt;&lt;li&gt;&lt;b&gt;*Not one have&lt;/b&gt; gained a foothold in the seasonal repertory.&lt;/li&gt;&lt;/ul&gt;It would appear I have an incoherent grammar (surely this is true as I believe all grammars are, in some way, incoherent. As Sapir said, &lt;i&gt;&lt;a href="http://linguistlist.org/issues/4/4-85.html"&gt;all grammars leak&lt;/a&gt;&lt;/i&gt;). But, there's at least one other factor muddying the linguistic waters. The fact that &lt;i&gt;one &lt;/i&gt;also acts a pronoun as in &lt;i&gt;one does one's duty&lt;/i&gt;. When acting as a pronoun, it takes 2nd pers, SG agreement, as in &lt;i&gt;one has to do one's duty&lt;/i&gt; (think &lt;i&gt;he has to do his duty&lt;/i&gt;), not *&lt;i&gt;one have to do one's duty&lt;/i&gt;. It may be that this pronoun agreement is interfering with my reading when &lt;i&gt;one &lt;/i&gt;occurs right next to the verb. Also, I did this pretty fast, so I wouldn't be surprised if I change my mind by COB...&lt;br /&gt;&lt;br /&gt;Of course, how could I resist:&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_uqqKLCxLtII/TRovtZOPywI/AAAAAAAAAYE/GBCJEZylvfc/s1600/none+ALL.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="117" src="http://3.bp.blogspot.com/_uqqKLCxLtII/TRovtZOPywI/AAAAAAAAAYE/GBCJEZylvfc/s320/none+ALL.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;I believe I got the full paradigm:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;not one of them has&lt;/li&gt;&lt;li&gt;not one of them have&lt;/li&gt;&lt;li&gt;not any of them has&lt;/li&gt;&lt;li&gt;not any of them have&lt;/li&gt;&lt;li&gt;none of them has&lt;/li&gt;&lt;li&gt;none of them have&lt;/li&gt;&lt;li&gt;none has&lt;/li&gt;&lt;li&gt;none have&lt;/li&gt;&lt;/ul&gt;It appears as though &lt;i&gt;none have&lt;/i&gt; had a hell of a start to the &lt;s&gt;18th&lt;/s&gt;&amp;nbsp;19th Century, but got killed off along with the Buffalo.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-3042984762542271438?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/3042984762542271438/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=3042984762542271438' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3042984762542271438'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3042984762542271438'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/not-any-or-not-one.html' title='&lt;i&gt;not any&lt;/i&gt; or &lt;i&gt;not one&lt;/i&gt;??'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_uqqKLCxLtII/TRovtZOPywI/AAAAAAAAAYE/GBCJEZylvfc/s72-c/none+ALL.png' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-7127640043114687008</id><published>2010-12-28T11:21:00.000-05:00</published><updated>2010-12-28T11:21:47.598-05:00</updated><title type='text'>refudiate, the word that won't die</title><content type='html'>Thanks in no small measure to the Oxford University Press naming &lt;i&gt;refudiate&lt;/i&gt; its Word Of The Year plus The Daily Dish rekindling &lt;a href="http://andrewsullivan.theatlantic.com/the_daily_dish/2010/12/odd-lie-watch.html"&gt;its favorite topic&lt;/a&gt;, we have a new round of he-said-she-said to deal with. Made famous by Sarah Palin this past summer (see Liberman's original post &lt;a href="http://languagelog.ldc.upenn.edu/nll/?p=2463"&gt;here&lt;/a&gt;, and others &lt;a href="http://www.google.com/search?q=refudiate+site:http://languagelog.ldc.upenn.edu/nll/&amp;amp;hl=en&amp;amp;num=10&amp;amp;lr=&amp;amp;ft=i&amp;amp;cr=&amp;amp;safe=images&amp;amp;tbs="&gt;here&lt;/a&gt;), it is yet again the object of speculation as to why Palin used the form to begin with.&lt;br /&gt;&lt;br /&gt;Palin herself poured fuel on this fire two days ago by tweeting that it was a typo. Liberman thinks that explanation didn't hold water the first time around because she first said it aloud on teevee:  &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;the original example&lt;/span&gt;&lt;/i&gt; [on teevee] &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;wasn't a slip of the tongue, but a symptom of the fact that Ms. Palin had a blend of repudiate and refute as a well-established entry in her mental lexicon&lt;/span&gt;&lt;/i&gt; [note added].&lt;br /&gt;&lt;br /&gt;Why the fuss? There's nothing particularly interesting or telling about the linguistic blending of &lt;i&gt;repudiate &lt;/i&gt;and &lt;i&gt;refute&lt;/i&gt;. Everyone does this kind of thing now and again and sometimes it sticks. Some people like to beat up on public figures any time they can, so something like this is a target. But the more serious speculation is that the Palin Camp's public responses expose something important about Sarah Palin's inner circle and consultation. I'll leave it to the political pundits to fight that one out.&lt;br /&gt;&lt;br /&gt;For now,&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_uqqKLCxLtII/TRoN1sKf25I/AAAAAAAAAYA/WC-0vwb6ym4/s1600/refudiate.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="116" src="http://1.bp.blogspot.com/_uqqKLCxLtII/TRoN1sKf25I/AAAAAAAAAYA/WC-0vwb6ym4/s320/refudiate.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-7127640043114687008?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/7127640043114687008/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=7127640043114687008' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7127640043114687008'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7127640043114687008'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/refudiate-word-that-wont-die.html' title='&lt;i&gt;refudiate&lt;/i&gt;, the word that won&apos;t die'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_uqqKLCxLtII/TRoN1sKf25I/AAAAAAAAAYA/WC-0vwb6ym4/s72-c/refudiate.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-701201490717728459</id><published>2010-12-28T09:50:00.001-05:00</published><updated>2010-12-28T09:59:23.137-05:00</updated><title type='text'>the linguistics of brand names</title><content type='html'>&lt;a href="http://neurocritic.blogspot.com/2010/12/why-are-letters-z-and-x-so-popular-in.html?utm_source=feedburner&amp;amp;utm_medium=feed&amp;amp;utm_campaign=Feed:+TheNeurocritic+(The+Neurocritic)"&gt;The Neurocritic&lt;/a&gt; reviews &lt;a href="http://www.bmj.com/content/341/bmj.c6895.full"&gt;evidence &lt;/a&gt;for the whopping increase in drug brand names beginning with the letters &lt;i&gt;z&lt;/i&gt; and &lt;i&gt;x&lt;/i&gt; starting in 1986 and quotes the conclusion of the study's authors:&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_uqqKLCxLtII/TRnz_oN2AbI/AAAAAAAAAX8/OihkPIsgcYM/s1600/Z+and+X+drugs.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="195" src="http://2.bp.blogspot.com/_uqqKLCxLtII/TRnz_oN2AbI/AAAAAAAAAX8/OihkPIsgcYM/s320/Z+and+X+drugs.jpg" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Reflecting their infrequent occurrence in English words, x and z count for 8 and 10 points in Scrabble, the highest values (along with j and q) in the game. So names that contain them are likely to seem special and be memorable. “If you meet them in running text, they stand out,” is the way one industry insider explained. Generally, they are also easy to pronounce&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;The last point about being easy to pronounce is basically nonesense, so forgive them that, but their basic point that infrequent sounds are more memorable is basically a restatement of &lt;a href="http://en.wikipedia.org/wiki/Zipf's_law"&gt;Zipf's Law&lt;/a&gt; and may have some truth to it.&lt;br /&gt;&lt;br /&gt;I can tell you this, there are entire companies that charge high fees to help manufacturers develop brand names (see &lt;a href="http://nancyfriedman.typepad.com/away_with_words/2009/06/what-can-a-name-developer-do-for-you.html"&gt;here &lt;/a&gt;for a&amp;nbsp;discussion&amp;nbsp;of what brand name developers do). I worked at one of them ever so briefly and I found there to be a mix of legitimate linguistics and voodoo linguistics mixed together in the "research" they prepared for their customers. I also found a resistance to serious linguistics for two reasons: 1) the customers didn't like science (I'm not joking; this was a serious obstacle) and 2) serious linguistics took too long and didn't come to firm conclusions. Typically, we were asked to initiate, perform, and complete linguistic research on brand names in a matter of weeks.&lt;br /&gt;&lt;br /&gt;Ultimately, though, it was my conclusion that a product's name simply was not that crucial to its success, which teetered on the manufacturers overall marketing strategy more than the name. Think about &lt;i&gt;Google &lt;/i&gt;vs. &lt;i&gt;Microsoft&lt;/i&gt;. So, the rise in z and x named drug products is a fad based more in the board room than in the marketplace.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-701201490717728459?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/701201490717728459/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=701201490717728459' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/701201490717728459'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/701201490717728459'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/linguistics-of-brand-names.html' title='the linguistics of brand names'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_uqqKLCxLtII/TRnz_oN2AbI/AAAAAAAAAX8/OihkPIsgcYM/s72-c/Z+and+X+drugs.jpg' height='72' width='72'/><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-4149807301008624027</id><published>2010-12-28T09:07:00.000-05:00</published><updated>2010-12-28T09:07:12.035-05:00</updated><title type='text'>non-linguistic CAPTCHA</title><content type='html'>David Bradley, writing at &lt;a href="http://www.sciencetext.com/face-based-captcha.html?utm_source=feedburner&amp;amp;utm_medium=feed&amp;amp;utm_campaign=Feed:+ResearchBloggingAllEnglish+(Research+Blogging+-+English+-+All+Topics)"&gt;sciencetech&lt;/a&gt;, reports on a new face-based CAPTCHA process, quoting the team that created it, "&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Unlike a text-based CAPTCHA, a major benefit of the proposed image-based face detection CAPTCHA is that it does not have any language barriers&lt;/span&gt;&lt;/i&gt;..."&lt;br /&gt;&lt;br /&gt;I guess it never really occurred to me that there would be language barriers in CAPTCHAs because so many of the strings are in fact nonesense words, but I guess language specific phonotactics are helpful (often the identity of a single letter is quite ambiguous).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-4149807301008624027?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/4149807301008624027/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=4149807301008624027' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/4149807301008624027'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/4149807301008624027'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/non-linguistic-captcha.html' title='non-linguistic CAPTCHA'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-5752992442051181440</id><published>2010-12-27T15:48:00.000-05:00</published><updated>2010-12-27T15:48:14.153-05:00</updated><title type='text'>history of writing tech</title><content type='html'>American Scientist has &lt;a href="http://www.americanscientist.org/bookshelf/pub/electrifying-language"&gt;a review&lt;/a&gt; of a new book on the history of writing technologies with a focus on how computers fit in. &lt;a href="http://www.amazon.com/Better-Pencil-Readers-Writers-Revolution/dp/0195388445/ref=sr_1_1?ie=UTF8&amp;amp;s=books&amp;amp;qid=1293482792&amp;amp;sr=8-1"&gt;A BETTER PENCIL: Readers, Writers, and the Digital Revolution&lt;/a&gt;&amp;nbsp;by Dennis Baron.&lt;br /&gt;&lt;br /&gt;Money quote:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Will this shift in the technology of writing and reading be a positive development in human culture? Will it promote literacy, or impair it? Baron takes a moderate position on these questions. On the one hand, he acknowledges that the computer offers remarkable opportunities for self-expression and communication (at least for those of us in the wealthier parts of the world). Suddenly, we can all be published authors, and we all have access to the writings—or if nothing else the Twitterings—of millions of other authors. On the other hand, much of what these new channels of communication bring us is mere noise and distraction, and we may lose touch with more serious kinds of reading and writing. (Another recent book—The Shallows, by Nicholas Carr—argues this point strenuously.) Baron remarks: “That position incorrectly assumes that when we’re not online we throw ourselves into high-culture mode, reading Tolstoi spelled with an i and writing sestinas and villanelles instead of shopping lists&lt;/span&gt;&lt;/i&gt;.”&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-5752992442051181440?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/5752992442051181440/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=5752992442051181440' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5752992442051181440'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5752992442051181440'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/history-of-writing-tech.html' title='history of writing tech'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-5465132235020821605</id><published>2010-12-27T11:51:00.000-05:00</published><updated>2010-12-27T11:51:52.500-05:00</updated><title type='text'>another lingo toy...</title><content type='html'>I love free online lingo toys like &lt;a href="http://corpus.byu.edu/"&gt;BYU's Corpora&lt;/a&gt; and Google's &lt;a href="http://ngrams.googlelabs.com/"&gt;Ngram Viewer&lt;/a&gt; and now there's a new one: The &lt;a href="http://www.media.mit.edu/cogmac/projects/hsp.html"&gt;Human Speechome Project&lt;/a&gt; from MIT"&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt; provides a look into the most complete record of a single child’s speech development ever created. The data has been organized to show the age of the child when he spoke each of his first 400 words.&lt;/span&gt;&lt;/i&gt;" It's&amp;nbsp;profiled in Forbes &lt;a href="http://www.forbes.com/2010/12/21/human-speechome-mit-media-lab-children-language-acquisition.html"&gt;here&lt;/a&gt;. And they provide a nifty interactive graph to sort the data:&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_uqqKLCxLtII/TRjD5VmZ_DI/AAAAAAAAAX4/iCBkIrHr7L4/s1600/word+births.JPG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="193" src="http://3.bp.blogspot.com/_uqqKLCxLtII/TRjD5VmZ_DI/AAAAAAAAAX4/iCBkIrHr7L4/s320/word+births.JPG" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-5465132235020821605?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/5465132235020821605/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=5465132235020821605' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5465132235020821605'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5465132235020821605'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/another-lingo-toy.html' title='another lingo toy...'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_uqqKLCxLtII/TRjD5VmZ_DI/AAAAAAAAAX4/iCBkIrHr7L4/s72-c/word+births.JPG' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-5507081972453991501</id><published>2010-12-26T09:00:00.007-05:00</published><updated>2010-12-26T09:00:03.024-05:00</updated><title type='text'>true grit</title><content type='html'>I posted recently about the phrase "&lt;a href="http://thelousylinguist.blogspot.com/2010/12/bustin-cap.html"&gt;bust a cap&lt;/a&gt;" occurring in the original 1969 John Wayne movie &lt;a href="http://www.imdb.com/title/tt0065126/"&gt;True Grit&lt;/a&gt;. I got a chance to see the new &lt;a href="http://www.imdb.com/title/tt1403865/"&gt;Coen Bros&lt;/a&gt; version and my reactions are worth airing...or not, you decide...&lt;br /&gt;&lt;br /&gt;First, it turns out the phrase &lt;i&gt;true grit&lt;/i&gt; has &lt;a href="http://ngrams.googlelabs.com/graph?content=true+grit&amp;amp;year_start=1800&amp;amp;year_end=2000&amp;amp;corpus=0&amp;amp;smoothing=3"&gt;a storied history&lt;/a&gt; in the history of English letters:&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_uqqKLCxLtII/TRawvRZ0fjI/AAAAAAAAAX0/bUBeTOwEQYE/s1600/true_grit_ngram.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="146" src="http://4.bp.blogspot.com/_uqqKLCxLtII/TRawvRZ0fjI/AAAAAAAAAX0/bUBeTOwEQYE/s400/true_grit_ngram.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;But this review is destined to be of the non-linguistic kind...&lt;br /&gt;&lt;br /&gt;I also had the chance to re-watch the original John Wayne version just a couple days before watching the new one. While it may be the case that this is a bit unfair because it means the recent version is asked to &lt;i&gt;live up to&lt;/i&gt; the original is some ways, nonetheless, it is instructive (insofar as it does NOT). I hereby forgive the Coen Bros for not watching the original again in preparation for their version. Surely this would have scuttled their project.&lt;br /&gt;&lt;br /&gt;Let me make it clear that the individual performances in the Coen Bros movie alone make it worth watching. Each actor is given great opportunity to breath life into their character and I respect the Coen Bros for allowing that. They are truly dedicated to the fine craft of acting and I enjoyed watching their version of &lt;i&gt;True Grit&lt;/i&gt;. Frankly, I could watch Jeff Bridges eat oatmeal and be amazed at how weird and wonderfully he did it.&lt;br /&gt;&lt;br /&gt;Nonetheless, my primary complaint is devastating: the new Coen Bros version lacks the basic narrative structure and emotional depth that made the original so fundamentally enjoyable and satisfying. For the record, I have never read the novel, so I have no clue what it says and the Coen Bros based their new version entirely on that. However, I can say that one of the most deeply satisfying elements of the John Wayne movie is the development of the relationships that evolve between the child Mattie Ross, the drunken but courageous Rooster Cogburn, and the goofy, but basically decent La Boeuf. Throughout the original movie, those three characters find a way to forge a sort of dysfunctional, yet basically good and meaningful family unit between them. This family unit is completely absent from the new version. And I missed it.&lt;br /&gt;&lt;br /&gt;One of the most touching and important moments of the original movie involves Rooster finally opening up to Mattie about his past and his wife and son while the two sit and wait for Ned Pepper's gang to arrive. This scene reveals Rooster's humanity and deeply emotional character. It is this scene that helps forge a familial bond, almost like an uncle/niece relationship, between Rooster and Mattie. And this deep relationship is played out for the rest of the movie. Developing this scene during a crucial moment of patience and waiting is pure narrative brilliance. Yet, the Coen Bros took this and turned it into camp and parody. The lines about his wife and son are basically thrown away in a drunken mumbling as his horse barely manages to contain his heavy frame while they trod along meaninglessly. What should be a deeply emotional connection forged in a tense moment of expectation becomes slapstick and meaningless. Why throw this away?&lt;br /&gt;&lt;br /&gt;I would need a copy of the new film to point out all of the moments lacking narrative continuity, but here are a few to suffice:&lt;br /&gt;&lt;br /&gt;Late in both movies, Mattie stumbles upon her nemesis Tom Chaney while gathering water from a river. In the original film, the proximity of Ned Pepper's gang is made clear and ominous. The likelihood that she would find trouble while going for water is made plain. But in the new version, it plays out like some wildly random coincidence. The ending of both movies requires these events to take place, but the original movie at least gives us some reasons behind the events, not just chaos and random nothingness.&lt;br /&gt;&lt;br /&gt;Ned Pepper is a critical character in the story. In the original movie, the truly great actor Robert Duvall is given the chance to give the man some decency and honor. He is a killer, yes, but he also saves Mattie's life, despite claiming to be willing to end it. In fact, it is Ned Pepper, more than anyone else (in the original), who keeps Mattie alive (until the snake-hole scene at least). Robert Duvall was given the opportunity to create a Ned Pepper who is full and complex. In the Coen Bros version the actor &lt;a href="http://www.imdb.com/name/nm0001608/"&gt;Barry Pepper&lt;/a&gt; (seriously, no joke, that's his name, weird right?) is barely a grubby and dirty (really seriously dirty, nasty dirty, disgustingly dirty...) killer. The pathos of Ned Pepper is gone.&lt;br /&gt;&lt;br /&gt;By far, the most iconic moment of the original movie is the scene where Rooster takes the reigns of his horse in his mouth and single handedly draws down against four armed opponents. This is one of the greatest moments of American Western lore, involving the single greatest actor of American Western mythology. It is truly a moment of&amp;nbsp;cinematic&amp;nbsp;greatness. &amp;nbsp;Leading up to this, Rooster describes a previous moment in his storied life much like this (earlier in both films) and it forms a crucial part of his legend and character. When the ultimate moment arrives in the original version, it is a moment of destiny, built up by the dialogue and scenes that have come before it. But in the Coen Bros version, the whole &lt;i&gt;raison d'etre&lt;/i&gt; has been obscured by mumbling and misdirection. It's almost as if this were every bit as random as everything else that came before it. You may well argue that randomeness and chaos is in fact the Coen Bros' &lt;i&gt;raison d'etre&lt;/i&gt;, and I can't argue against that. Fair enough. But then, why bother making a movie about a story for which destiny and courage is so crucial a factor? Without the great inevitable showdown of Rooster's grit against the despots' manpower, well, why make this movie at all? If you believe in pure chaos, fine, make &lt;i&gt;No Country For Old Men&lt;/i&gt; over and over, got it. That makes sense. That's coherent. But why take this novel and make a movie? If your primary goal as movie makers is to take previous material well loved by the public and trash it for your own philosophical gain, that's just pure douchebaggery, so screw you Joel and Ethan.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-5507081972453991501?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/5507081972453991501/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=5507081972453991501' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5507081972453991501'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5507081972453991501'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/true-grit.html' title='true grit'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_uqqKLCxLtII/TRawvRZ0fjI/AAAAAAAAAX0/bUBeTOwEQYE/s72-c/true_grit_ngram.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-3804713892365940360</id><published>2010-12-25T12:02:00.000-05:00</published><updated>2010-12-25T12:02:56.770-05:00</updated><title type='text'>Choose Your Own Career in Linguistics</title><content type='html'>&lt;a href="http://www.linkedin.com/in/treyj"&gt;Trey Jones&lt;/a&gt;* at Speculative Grammarian invites y'all to play his cute, and yet somewhat depressing, game: &lt;a href="http://specgram.com/choose/"&gt;Choose Your Own Career in Linguistics&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;i&gt;As a service to our young and impressionable readers who are considering pursuing a career in linguistics, Speculative Grammarian is pleased to provide the following Gedankenexperiment to help you understand the possibilities and consequences of doing so. For our old and bitter readers who are too far along in their careers to have any real hope of changing the eventual outcome, we provide the following as a cruel reminder of what might have been.&lt;/i&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;i&gt;Let the adventure begin&lt;/i&gt;&lt;/span&gt;...&lt;br /&gt;&lt;br /&gt;*hehe, he used to work at &lt;a href="http://www.cyc.com/"&gt;Cycorp&lt;/a&gt;, hehe...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-3804713892365940360?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/3804713892365940360/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=3804713892365940360' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3804713892365940360'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3804713892365940360'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/choose-your-own-career-in-linguistics.html' title='Choose Your Own Career in Linguistics'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-5851081897449718023</id><published>2010-12-23T08:33:00.002-05:00</published><updated>2010-12-23T08:33:00.274-05:00</updated><title type='text'>bustin' a cap</title><content type='html'>Watching the original &lt;a href="http://www.imdb.com/title/tt0065126/"&gt;True Grit&lt;/a&gt; on teevee and what do I hear? Ned Pepper (Robert Duvall) says something to the effect "&lt;i&gt;I ain't never busted a cap in no girl before&lt;/i&gt;." I thought only contemporary gansta movies and rap lyrics used that phrase (and yes, I did find some examples of &lt;a href="http://ngrams.googlelabs.com/graph?content=bust+a+cap,busted+a+cap&amp;amp;year_start=1800&amp;amp;year_end=2000&amp;amp;corpus=0&amp;amp;smoothing=3"&gt;bust(-ed) a cap&lt;/a&gt; using the Ngram Viewer).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-5851081897449718023?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/5851081897449718023/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=5851081897449718023' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5851081897449718023'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5851081897449718023'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/bustin-cap.html' title='&lt;i&gt;bustin&apos; a cap&lt;/i&gt;'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-8143431528782451260</id><published>2010-12-22T09:23:00.000-05:00</published><updated>2010-12-22T09:23:58.881-05:00</updated><title type='text'>my bad, global edition</title><content type='html'>&lt;a href="http://en.wikipedia.org/wiki/Manute_Bol"&gt;Manute Bol&lt;/a&gt; is often credited with coining the phrase &lt;i&gt;my bad&lt;/i&gt; (see &lt;a href="http://www.randomhouse.com/wotd/index.pperl?date=20000111"&gt;here&lt;/a&gt; and &lt;a href="http://www.unlikelywords.com/2010/06/21/manute-bol-popularized-the-phrase-my-bad/"&gt;here&lt;/a&gt;, or &lt;a href="http://itre.cis.upenn.edu/~myl/languagelog/archives/001927.html"&gt;here&lt;/a&gt; for alternate hypotheses). It has apparently made the jump, in some way, to international usage, it's just not clear to me how.&lt;br /&gt;&lt;br /&gt;While watching &lt;a href="http://www.imdb.com/title/tt1216487/"&gt;The Girl Who played with Fire&lt;/a&gt; again last night, I noticed Lisbeth says something that is translated as &lt;i&gt;my bad&lt;/i&gt;, but what she actually says is in Swedish, of course.&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_uqqKLCxLtII/TRIHSG_gu-I/AAAAAAAAAXs/C-7HQ7Y26yk/s1600/my_bad_girl_fire.JPG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="178" src="http://4.bp.blogspot.com/_uqqKLCxLtII/TRIHSG_gu-I/AAAAAAAAAXs/C-7HQ7Y26yk/s320/my_bad_girl_fire.JPG" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div style="text-align: center;"&gt;(screen shot from Netflix)&lt;/div&gt;&lt;br /&gt;To my non-Swedish speaking ears, it sounds like she says &lt;i&gt;mitt viel&lt;/i&gt;, which would mean something closer to &lt;i&gt;my very&lt;/i&gt;, if Google translate is any help. Google translates &lt;i&gt;my bad&lt;/i&gt; into Swedish as &lt;i&gt;mitt dåliga&lt;/i&gt; (&lt;i&gt;dåliga&lt;/i&gt; appears to be a literal translation of &lt;i&gt;bad&lt;/i&gt;). I'm pretty sure that's not what she said, but I'd have to re-listen to be sure.&lt;br /&gt;&lt;br /&gt;So, the linguistic questions are these:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;What does she say in Swedish?&lt;/li&gt;&lt;li&gt;What is the history of the Swedish phrase?&lt;/li&gt;&lt;li&gt;Is &lt;i&gt;my bad&lt;/i&gt; the best English translation (given its history in slang and in pop culture)?&lt;/li&gt;&lt;/ol&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-8143431528782451260?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/8143431528782451260/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=8143431528782451260' title='8 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8143431528782451260'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8143431528782451260'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/my-bad-global-edition.html' title='&lt;i&gt;my bad&lt;/i&gt;, global edition'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_uqqKLCxLtII/TRIHSG_gu-I/AAAAAAAAAXs/C-7HQ7Y26yk/s72-c/my_bad_girl_fire.JPG' height='72' width='72'/><thr:total>8</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-3552910804088438243</id><published>2010-12-21T12:59:00.000-05:00</published><updated>2010-12-21T12:59:55.852-05:00</updated><title type='text'>i know your email address...so what?</title><content type='html'>Cory Doctorow over at &lt;a href="http://www.boingboing.net/2010/12/21/why-i-have-a-public.html"&gt;Boing Boing&lt;/a&gt; makes the bold claim that there's no compelling evidence that obscuring your email address online using techniques like &lt;i&gt;john DOT smith at host DOT com&lt;/i&gt; actually reduces the amount of spam you recieve. As long as his spam filters are catching the spam effectively, he doesn't mind sharing his email address with the world.&lt;br /&gt;&lt;br /&gt;Are you willing to follow his lead?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-3552910804088438243?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/3552910804088438243/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=3552910804088438243' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3552910804088438243'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3552910804088438243'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/i-know-your-email-addressso-what.html' title='i know your email address...so what?'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-1063066936059969918</id><published>2010-12-21T10:51:00.000-05:00</published><updated>2010-12-21T10:51:58.010-05:00</updated><title type='text'>half a million language deaths?</title><content type='html'>Lera Boroditsky's recent concluding statement in &lt;a href="http://www.economist.com/debate/days/view/628"&gt;The Economist's debate&lt;/a&gt; about how language shapes thought states "&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;At the moment we have good linguistic descriptions of only about 10% of the world's existing languages (and we know even less about the &lt;b&gt;half a million or so&lt;/b&gt; languages that have existed in the past)&lt;/span&gt;&lt;/i&gt; (emphasis added).&lt;br /&gt;&lt;br /&gt;In my previous post on language death &lt;a href="http://thelousylinguist.blogspot.com/2010/11/94000-language-deaths.html"&gt;here&lt;/a&gt;, I used the number 100,000 to estimate how many languages have previously existed and related it favorably to David Crystal's 64,000 to 140,000 &lt;i&gt;reasonable guesstimate&lt;/i&gt;. I'm just curious to know where Boroditsky came up with the half million number? I've managed to come up with a few references to this 500,000 number, but they claim it's a "radical estimate" (e.g., see &lt;a href="http://hubpages.com/hub/Languages-How-Many-Languages"&gt;here&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;My hunch is that this is yet another example of Boroditsky's &lt;i&gt;profound&lt;/i&gt;-problem. She has a tendency to call modest results &lt;i&gt;profound&lt;/i&gt; when they are not. She is, I suspect, a tad prone to hyperbole.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-1063066936059969918?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/1063066936059969918/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=1063066936059969918' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/1063066936059969918'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/1063066936059969918'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/half-million-language-deaths.html' title='half a million language deaths?'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-4265433850949974174</id><published>2010-12-20T16:17:00.000-05:00</published><updated>2010-12-20T16:17:50.048-05:00</updated><title type='text'>language and thought votes</title><content type='html'>On the eve of the conclusion to Mark Liberman and Lera Boroditsky's &lt;a href="http://www.economist.com/debate/overview/190"&gt;debate at The Economist&lt;/a&gt;, there are two vote totals that are interesting to compare.&lt;br /&gt;&lt;br /&gt;The obvious one is the lopsided results so far on the main question: &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Do you agree with the motion&lt;/span&gt;&lt;/i&gt;? Here, Boroditsky has a 77%-23% advantage. However, if you mouse-over each day's vote, it tells you how many &lt;i&gt;yes&lt;/i&gt;'s have switched to &lt;i&gt;no&lt;/i&gt; and vice versa. The totals there are the near exact opposite: by a 5-1 margin &lt;i&gt;yes&lt;/i&gt;'s have switched to &lt;i&gt;no&lt;/i&gt;. You are free to interpret this as you wish.&lt;br /&gt;&lt;br /&gt;Unfortunately I don't see any raw totals for the number of people voting, so it's anyone's guess what proportion of votes the 6 changes represent (likely, a very small percentage).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-4265433850949974174?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/4265433850949974174/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=4265433850949974174' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/4265433850949974174'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/4265433850949974174'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/language-and-thought-votes.html' title='&lt;i&gt;language and thought&lt;/i&gt; votes'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-2434765373911094923</id><published>2010-12-20T12:01:00.002-05:00</published><updated>2010-12-20T15:52:34.778-05:00</updated><title type='text'>digg's c**ktail</title><content type='html'>[&lt;b&gt;&lt;span class="Apple-style-span" style="color: orange;"&gt;UPDATE &lt;/span&gt;&lt;/b&gt;below)&lt;br /&gt;&lt;br /&gt;I couldn't help but notice a story on Digg: &lt;a href="http://digg.com/news/science/images_of_alcoholic_drinks_under_the_microscope_from_vodka_cocktails_to_pina_colada"&gt;Images of alcoholic drinks under the microscope from vodka c**ktails to pina colada&lt;/a&gt;. I checked the original &lt;a href="http://www.dailymail.co.uk/sciencetech/article-1339967/Images-alcoholic-drinks-microscope-vodka-cocktails-pina-colada.html"&gt;Daily Mail&lt;/a&gt; story and saw that the word &lt;i&gt;cocktail &lt;/i&gt;was not censored. I looked for other instances of &lt;i&gt;cocktail&lt;/i&gt; on Digg's site and found that all instances look censored, except when the string &lt;i&gt;c-o-c-k&lt;/i&gt; occurs in a user name, as the image below demonstrates:&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_uqqKLCxLtII/TQ-K0CVH7iI/AAAAAAAAAXo/OeKltzk0McI/s1600/cocktail_search_digg_cropped.JPG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="153" src="http://3.bp.blogspot.com/_uqqKLCxLtII/TQ-K0CVH7iI/AAAAAAAAAXo/OeKltzk0McI/s320/cocktail_search_digg_cropped.JPG" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;This appears to be a candidate for unnecessary censorship. I sent an email to Digg asking them if this is intentional censorship or an inside joke within the site. I'll report any response (don't hold your breath).&lt;br /&gt;&lt;br /&gt;[&lt;b&gt;&lt;span class="Apple-style-span" style="color: orange;"&gt;UPDATE&lt;/span&gt;&lt;/b&gt;: 3:01 Eastern)&lt;br /&gt;&lt;br /&gt;Digg support did in fact reply, noting that it was a function of a profanity filter that can be turned off:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Hello ,&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;You see that because you have the profanity filter enabled. &amp;nbsp;To disable it just log in and go to:&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;http://digg.com/settings/preferences&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;--Digg Support&lt;/span&gt;&lt;/i&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-2434765373911094923?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/2434765373911094923/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=2434765373911094923' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/2434765373911094923'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/2434765373911094923'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/diggs-cktail.html' title='digg&apos;s c**ktail'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_uqqKLCxLtII/TQ-K0CVH7iI/AAAAAAAAAXo/OeKltzk0McI/s72-c/cocktail_search_digg_cropped.JPG' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-8186492965125285667</id><published>2010-12-19T20:21:00.001-05:00</published><updated>2010-12-19T20:24:07.869-05:00</updated><title type='text'>the linguistics of the simpsons</title><content type='html'>The magnificent and admiral Snowclone &lt;a href="http://snowclones.org/2007/12/18/x-is-the-y-of-z/"&gt;X is the Y of Z&lt;/a&gt; made a surprise and instructive appearance on The Simpsons tonight*:&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Marge&amp;nbsp;&lt;/b&gt;--&amp;nbsp;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Don't worry Lisa, you could still go to McGill, it's the Harvard of Canada&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;b&gt;Lisa &lt;/b&gt;--&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Anything that is the something of the something isn't the anything of anything&lt;/span&gt;&lt;/i&gt;...&lt;br /&gt;&lt;br /&gt;Too true, Lisa, too true. It's never good to be the shadow of something else.&lt;br /&gt;&lt;br /&gt;*This appears to have been a repeat of the 10-10-2010 episode &lt;a href="http://en.wikipedia.org/wiki/MoneyBART"&gt;MoneyBART &lt;/a&gt;(a nice allusion to &lt;a href="http://en.wikipedia.org/wiki/Moneyball"&gt;Moneyball&lt;/a&gt;, btw).&lt;br /&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-8186492965125285667?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/8186492965125285667/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=8186492965125285667' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8186492965125285667'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8186492965125285667'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/linguistics-of-simpsons.html' title='the linguistics of the simpsons'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-293889547235312923</id><published>2010-12-19T10:18:00.000-05:00</published><updated>2010-12-19T10:18:12.957-05:00</updated><title type='text'>ngram roundup</title><content type='html'>It's not difficult to find glee and excitement surrounding Google's new &lt;a href="http://ngrams.googlelabs.com/"&gt;Ngram Viewer&lt;/a&gt;. Hyperbolic praise is&amp;nbsp;whirling around the innerwebz like mad. As an antidote and a nod to the role skepticism should play in our contemporary society, I present a brief&amp;nbsp;round up of criticisms:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://chronicle.com/article/Counting-on-Google-Books/125735/"&gt;Geoffrey Nunberg&lt;/a&gt;:&lt;br /&gt;...&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;there are still a fair number of misdated works, and there's no way to restrict a query by genre or topic.&amp;nbsp;But in the end, the most important consequence of the Science paper, and of allowing public access to the data, is that it puts "culturomics" into conversational play&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://corpus.byu.edu/coha/compare-culturomics.asp"&gt;Mark Davies&lt;/a&gt;:&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Google Books can't use wildcards to search for parts of words. For example, try searching for freak* out (all forms: freak_, freaked, freaking, etc) or even a simple search like teenager* ... if Google Books doesn't know about part of speech tags or variant forms of a word, then how can it look at change in grammar? ... To use collocates with Google Books, you would have to manually download thousands or millions of hits to your hard drive, and then use another program to look for and categorize the collocates&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://languagelog.ldc.upenn.edu/nll/?p=2848#more-2848"&gt;Mark Liberman&lt;/a&gt;:&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;The Science paper says that "Culturomics is the application of high-throughput data collection and analysis to the study of human culture". &amp;nbsp;But as long as the historical text corpus itself remains behind a veil at Google Books, then "culturomics" will be restricted to a very small corner of that definition, unless and until the scholarly community can reproduce an open version of the underlying collection of historical texts&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://david-crystal.blogspot.com/2010/12/on-culturomics.html"&gt;David Crystal&lt;/a&gt;:&lt;br /&gt;.&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;..this is just a collection of books - no newspapers, magazines, advertisements, or other orthographic places where culture resides. No websites, blogs, social networking sites. No spoken language, of course, so over 90 percent of the daily linguistic usage of the world isn't here...The approach, in other words, shows trends but can't interpret or explain them. It can't handle ambiguity or idiomaticity.&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://thebinderblog.com/2010/12/18/google-ngrams-thin-description/"&gt;The Binder Blog&lt;/a&gt;:&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;The value of the Ngrams Viewer rests on a bold conceit:  that the number of times a word is used at certain periods of time has some kind of relationship to the culture of the time.  For example, the fact that the word “slavery” peaks around 1860 suggests that people in 1860 had a lot to say about slavery.  Another spike around the 1970s meshes nicely with the Civil Rights Movement. Well, that’s sort of interesting.  However, I didn’t need ngrams to tell me that a lot of people were writing about slavery in 1860.  These data are broad but not deep, which makes them relatively useless to most humanities majors interested in intensive study&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;The one positive comment that I think bears repeating is the role this fun little tool might play is sparking the imagination of young students interested in the role technology can play in the humanities.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://chronicle.com/article/Counting-on-Google-Books/125735/"&gt;Geoffrey Nunberg&lt;/a&gt;:&lt;br /&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Whatever misgivings scholars may have about the larger enterprise, the data will be a lot of fun to play around with. And for some—especially students, I imagine—it will be a kind of gateway drug that leads to more-serious involvement in quantitative research&lt;/span&gt;&lt;/i&gt;.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-293889547235312923?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/293889547235312923/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=293889547235312923' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/293889547235312923'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/293889547235312923'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/ngram-roundup.html' title='ngram roundup'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-3159853191677729515</id><published>2010-12-18T09:19:00.000-05:00</published><updated>2010-12-18T09:19:24.735-05:00</updated><title type='text'>how NOT to interpret ngrams</title><content type='html'>Andrew Sullivan has predictably misunderstood the value of Google's &lt;a href="http://ngrams.googlelabs.com/graph?content=try+it&amp;amp;year_start=1985&amp;amp;year_end=2008&amp;amp;corpus=0&amp;amp;smoothing=3"&gt;Ngram Viewer&lt;/a&gt;. He spent all day yesterday posting trite and simplistic mis-interpretations of the data. For example,&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://andrewsullivan.theatlantic.com/the_daily_dish/2010/12/time-suck-of-the-day-ctd.html"&gt;the concept of ideology is a relatively recent one&lt;/a&gt; because the word &lt;i&gt;ideology &lt;/i&gt;has become more frequent recently (this is almost certainly false).&lt;/li&gt;&lt;li&gt;&lt;a href="http://andrewsullivan.theatlantic.com/the_daily_dish/2010/12/jesus-vs-the-beatles.html"&gt;Jesus "wins" (his word, not mine) against the Beatles&lt;/a&gt; because the word &lt;i&gt;Jesus &lt;/i&gt;is more frequent.&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;I like the Ngram Viewer, but simply plotting the frequency of words against each other to determine something about culture or concepts is a very weak technique that leads to massive mis-interpretations, as we've seen recently with things like counting the number of times President Obama uses pronouns in his speeches. I discussed the failings of simple word counts as a technique &lt;a href="http://thelousylinguist.blogspot.com/2010/01/bad-linguistics-sigh.html"&gt;here&lt;/a&gt;. To sum up,&lt;/div&gt;&lt;div&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;We don't know what causes word frequencies.&lt;/li&gt;&lt;li&gt;We don't know what the effects of word frequencies are.&lt;/li&gt;&lt;li&gt;There are good alternatives.&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;/div&gt;&lt;div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-3159853191677729515?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/3159853191677729515/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=3159853191677729515' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3159853191677729515'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3159853191677729515'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/how-not-to-interpret-ngrams.html' title='how NOT to interpret ngrams'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-2860321781379275467</id><published>2010-12-17T13:59:00.000-05:00</published><updated>2010-12-17T13:59:22.232-05:00</updated><title type='text'>magical machine translation</title><content type='html'>This is un-fucking-believable:&lt;br /&gt;&lt;br /&gt;&lt;object height="385" width="640"&gt;&lt;param name="movie" value="http://www.youtube.com/v/h2OfQdYrHRs?fs=1&amp;amp;hl=en_US"&gt;&lt;/param&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;/param&gt;&lt;param name="allowscriptaccess" value="always"&gt;&lt;/param&gt;&lt;embed src="http://www.youtube.com/v/h2OfQdYrHRs?fs=1&amp;amp;hl=en_US" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="640" height="385"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;br /&gt;&lt;br /&gt;The future has arrived.&lt;br /&gt;&lt;br /&gt;HT &lt;a href="http://kottke.org/10/12/word-lens"&gt;kottke&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-2860321781379275467?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/2860321781379275467/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=2860321781379275467' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/2860321781379275467'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/2860321781379275467'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/magical-machine-translation.html' title='magical machine translation'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-2447831576185768832</id><published>2010-12-17T11:18:00.000-05:00</published><updated>2010-12-17T11:18:08.914-05:00</updated><title type='text'>ngram or n-gram?</title><content type='html'>The hottest story of the day is clearly Google's &lt;a href="http://ngrams.googlelabs.com/"&gt;Ngram Viewer&lt;/a&gt;. It's all over blogs, twitter and even the MSM. But why did Google call it the &lt;i&gt;Ngram Viewer&lt;/i&gt; and not the &lt;i&gt;N-gram Viewer&lt;/i&gt;?&lt;br /&gt;&lt;br /&gt;The hyphenated form is more common in the NLP industry and in general search results (by a 10-1 margin at that). Nunberg's &lt;a href="http://languagelog.ldc.upenn.edu/nll/?p=2847"&gt;LL post&lt;/a&gt; and Languagehat's &lt;a href="http://www.languagehat.com/archives/004086.php"&gt;post&lt;/a&gt; both prefer &lt;i&gt;n-gram&lt;/i&gt; when speaking about the tokens themselves and only use &lt;i&gt;Ngram&lt;/i&gt; when referencing Google's named product. Even Google's own people used &lt;i&gt;n-gram&lt;/i&gt; in a blog post &lt;a href="http://googleresearch.blogspot.com/2006/08/all-our-n-gram-are-belong-to-you.html"&gt;here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;You gotta wonder what kind of branding process Google went through to decide on &lt;i&gt;ngram &lt;/i&gt;(they are notoriously conscious about that kind of thing). The popularity of this story also demonstrates how much more media savvy Google is because Microsoft has almost exactly the same tool, but no one knows about it. See &lt;a href="http://research.microsoft.com/en-us/collaboration/focus/cs/web-ngram.aspx"&gt;here&lt;/a&gt;. The difference is that Microsoft didn't link its use to studying culture and history and give us a nifty online tool to play with, making it more dull sounding than perhaps it otherwise would.&lt;br /&gt;&lt;br /&gt;Also, note Microsoft uses &lt;i&gt;N-gram&lt;/i&gt; ... frikkin Microsoft.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-2447831576185768832?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/2447831576185768832/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=2447831576185768832' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/2447831576185768832'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/2447831576185768832'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/ngram-or-n-gram.html' title='ngram or n-gram?'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-7907004266385268211</id><published>2010-12-16T21:39:00.000-05:00</published><updated>2010-12-16T21:39:46.613-05:00</updated><title type='text'>google has a huge tool</title><content type='html'>NPR ran a story today called &lt;a href="http://www.npr.org/2010/12/16/132106374/google-book-tool-tracks-cultural-change-with-words"&gt;Google Book Tool Tracks Cultural Change With Words&lt;/a&gt;. It's about "&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;the biggest collection of words ever assembled&lt;/span&gt;&lt;/i&gt;*", Google's 500 billion word corpus is drawn from the books they've scanned, but here's the catch: many of those books are copywrited, so what Google did is pull a trick that goes back to the very beginnings of computational linguistics, they present the words as an unordered set, or &lt;a href="http://en.wikipedia.org/wiki/Bag_of_words_model"&gt;bag o' words&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Many of these books are covered by copyright, and publishers aren't letting people read them online. But the new database gets around that problem: It's just a collection of words and phrases, stripped of all context except the date in which they appeared&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;I &amp;nbsp;first learned about this technique back in 1999 in an intro to computational linguistics course (bit of trivia: we we're using an incomplete pre-print of Martin and Jurafsky; as I recall, the discourse chapter was composed entirely of one page that read &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;21 Computational Discourse write something here...&lt;/span&gt;&lt;/i&gt;) and I remember being appalled at its crass simplicity. I mean, how dare those idiot engineers reduce language down to simple lists of words. How dare they try to use simple word lists to discover important facts about language and devise important linguistic tools.&lt;br /&gt;&lt;br /&gt;It took less than a week for me to change my tune. The fact is, the bag o' words technique is remarkably powerful and useful. No, it doesn't solve all problems in one swoop, but it solves a hell of a lot more than I could possibly predict as a naive 2nd year linguistics grad student. For example:&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_uqqKLCxLtII/TQq38JvOqXI/AAAAAAAAAXg/QlE1ppnoCLw/s1600/google_verbs.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="126" src="http://4.bp.blogspot.com/_uqqKLCxLtII/TQq38JvOqXI/AAAAAAAAAXg/QlE1ppnoCLw/s400/google_verbs.jpg" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Irregular verbs are used as a model of grammatical evolution. For each verb, researchers plotted the usage frequency of its irregular form in red ("thrived"), and the usage frequency of its regular past-tense form in blue ("throve/thriven"). Virtually all irregular verbs are found from time to time used in a regular form, but those used more often tend to be used in a regular way more rarely&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;Google labs lets you play with its tool &lt;a href="http://ngrams.googlelabs.com/"&gt;here&lt;/a&gt; (hehe).&lt;br /&gt;&lt;br /&gt;*Not sure where this claim originated, but Google has already released a 1 trillion word corpus via LDC, the &lt;a href="http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2006T13"&gt;Web 1T 5-gram Version 1&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-7907004266385268211?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/7907004266385268211/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=7907004266385268211' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7907004266385268211'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7907004266385268211'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/google-has-huge-tool.html' title='google has a huge tool'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_uqqKLCxLtII/TQq38JvOqXI/AAAAAAAAAXg/QlE1ppnoCLw/s72-c/google_verbs.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-6322403760455482206</id><published>2010-12-16T09:54:00.000-05:00</published><updated>2010-12-16T09:54:20.696-05:00</updated><title type='text'></title><content type='html'>From &lt;a href="http://www.stephenfry.com/"&gt;Stephen Fry's&lt;/a&gt; twitter feed:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Just had an fMRI scan at UCL (part of BBC doc on language I'm making). Had to play Just A Minute while being scanned. Fun&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;A BBC&amp;nbsp;documentary&amp;nbsp;about language?&amp;nbsp;Ugh...I don't think even the talents of Stephen Fry can save that one.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-6322403760455482206?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/6322403760455482206/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=6322403760455482206' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/6322403760455482206'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/6322403760455482206'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/from-stephen-frys-twitter-feed-just-had.html' title=''/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-3316796081654286517</id><published>2010-12-15T11:18:00.000-05:00</published><updated>2010-12-15T11:18:10.297-05:00</updated><title type='text'>harvard jumps the linguistic shark</title><content type='html'>Harvard Business Review editor Julia Kirby &lt;a href="http://blogs.hbr.org/hbr/hbreditors/2010/12/could_a_bad_accent_help_you_se.html#comments"&gt;adds to the mountain &lt;/a&gt;of pseudo-scientific bullshit filling the innerwebz by taking the modest results of &lt;a href="http://www.psychologicalscience.org/index.php/news/releases/imitating-someones-accent-makes-it-easier-to-understand-them.html"&gt;a small study&lt;/a&gt;&amp;nbsp;(about the fact that&amp;nbsp;mimicking&amp;nbsp;accents helps sentence comprehension) and&amp;nbsp;jumping&amp;nbsp;to the wild and unfounded conclusion that&amp;nbsp;salespeople&amp;nbsp;should start faking accents. It would make a great Monty Python skit, but it's a sad blog post from an editor of a&amp;nbsp;prestigious&amp;nbsp;business magazine. Money quote:&lt;br /&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;But this study suggests another possibility. Perhaps part of why mirroring and matching works is not because of how it operates on the prospect in a sales conversation, but how it operates on the salesperson. When we switch into another person's mode, however superficially, perhaps our brains are triggered to do so on a deeper level, and we become more able to receive the information that person is trying to convey.&lt;/span&gt;&lt;/i&gt;&lt;/div&gt;&lt;div&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/i&gt;&lt;/div&gt;&lt;div&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;We all know the key to empathy is to walk a mile in another's shoes. That can never literally be done, especially in brief sales encounters. But at least we can put on their brogues&lt;/span&gt;&lt;/i&gt;.&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;sigh...&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-3316796081654286517?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/3316796081654286517/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=3316796081654286517' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3316796081654286517'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3316796081654286517'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/harvard-jumps-linguistic-shark.html' title='harvard jumps the linguistic shark'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-7988911762112262394</id><published>2010-12-14T08:33:00.002-05:00</published><updated>2010-12-14T23:30:33.352-05:00</updated><title type='text'>so you want to study linguistics?</title><content type='html'>Recently, a reader asked me for advice about studying linguistics. She is an undergraduate in the USA at a college that does not offer a BA in linguistics and she likes math and language, particularly historical linguistics. I've posted advice to students before &lt;a href="http://thelousylinguist.blogspot.com/2008/04/on-jobs-and-nlp-degrees.html"&gt;here&lt;/a&gt;, but this new request was a particularly interesting variation. What do you do if you're a smart 20 year old at a school that does not quite offer what you want? What follows is an edited version of the email I sent back:&lt;br /&gt;&lt;br /&gt;I must begin with a &amp;nbsp;warning: academic linguistics is a small field, there is precious little room for mediocrity. There are two kinds of academic linguists, the top 15% and the unemployed.&lt;br /&gt;&lt;br /&gt;With that said, if your school doesn't offer linguistics as a degree, then I suggest psychology (the experimental, lab-based kind) or computer science. Get hands-on experience in lab settings where you are collecting and analyzing data. Learn basic scientific method. Both psychology and computer science can offer that. Computational linguistics is a hot field with lots of opportunities in all sub-fields of linguistics. Plus, they can get jobs, hehe. High paying jobs! Computational linguists are one the the few who can get jobs outside of academia, but the truth is most industry CL jobs are really programming jobs where your programing skills are the real reason you get a job; your Natural Language Processing (NLP) skills are little more than icing on the cake. The industry is really looking for engineers with some NLP experience, not linguists with some programming skills.&lt;br /&gt;&lt;br /&gt;There's nothing wrong with majoring in math (I definitely think all 21st Century linguists &lt;a href="http://thelousylinguist.blogspot.com/2010/01/why-linguists-should-study-math.html"&gt;should study math&lt;/a&gt;), though I think knowing stats is preferable, and that's really a separate field. There is some controversy regarding whether linear algebra or calculus is better for linguistics (see &lt;a href="http://languagelog.ldc.upenn.edu/nll/?p=1461"&gt;here&lt;/a&gt;, especially the comments), but I really do think stats is key.&lt;br /&gt;&lt;br /&gt;Studying biology or genetics is a possibility (neurolinguistics is a hot field). Liberman posted about&amp;nbsp;genetics&amp;nbsp;and linguistics&amp;nbsp;&lt;a href="http://languagelog.ldc.upenn.edu/nll/?p=2835"&gt;here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Probably the single best thing you can do for yourself right now is work your way through the &lt;a href="http://www.nltk.org/"&gt;NLTK book&lt;/a&gt;. This will teach you about basic concepts, plus teach you basic tools as well, and it's completely free! You could also start learning the &lt;a href="http://www.r-project.org/"&gt;R language&lt;/a&gt;, a great stats based language that many linguists are using these days.&lt;br /&gt;&lt;br /&gt;You could also work your way through &lt;a href="http://www.amazon.com/Tarskis-World-Expanded-Language-Information/dp/1575864843"&gt;Tarski's World&lt;/a&gt; because basic logic is a sound foundation for all disciplines.&lt;br /&gt;&lt;br /&gt;If you want a serious challenge, get your hands on the late Partha Niyogi's '&lt;a href="http://mitpress.mit.edu/catalog/author/default.asp?aid=7467"&gt; The Computational Nature of Language Learning and Evolution&lt;/a&gt;'. He passed away recently, far too young for a rising star. He was a pioneer in using mathematical models to understand linguistics.&lt;br /&gt;&lt;br /&gt;If you're interested in cognitive science and linguistics, I suggest regularly reading the &lt;a href="http://scientopia.org/blogs/childsplay/"&gt;Child's Play&lt;/a&gt; blog, written by two Stanford cognitive science grad students.&lt;br /&gt;&lt;br /&gt;My general advice to any undergrad is simple: don't sweat your undergrad too much; it's the least important part of your education. Just get it done, regardless of which major you choose, and move on to the good stuff in grad school.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-7988911762112262394?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/7988911762112262394/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=7988911762112262394' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7988911762112262394'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7988911762112262394'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/so-you-want-to-study-linguistics.html' title='so you want to study linguistics?'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-7828028646708270435</id><published>2010-12-13T11:27:00.001-05:00</published><updated>2010-12-13T11:27:43.594-05:00</updated><title type='text'>a debate!</title><content type='html'>From The Economist:&amp;nbsp;&lt;i&gt;This house believes that the language we speak shapes how we think&lt;/i&gt;. &lt;a href="http://www.economist.com/debate/debates/overview/190"&gt;Discuss&lt;/a&gt;...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-7828028646708270435?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/7828028646708270435/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=7828028646708270435' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7828028646708270435'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7828028646708270435'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/very-l.html' title='a debate!'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-938311818305086705</id><published>2010-12-13T10:30:00.001-05:00</published><updated>2010-12-13T10:30:45.722-05:00</updated><title type='text'></title><content type='html'>A new drink from the&amp;nbsp;&lt;a href="http://specgram.com/CLX.3/10.mixologists.cocktails.html"&gt;SPECULATIVE&amp;nbsp;GRAMMARIAN&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;i&gt;The Psycholinguist&lt;/i&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;i&gt;wine (any kind: color is not a dependent variable in this study)&lt;/i&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;i&gt;several glasses&lt;/i&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;i&gt;1 stopwatch&lt;/i&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;&lt;i&gt;Pour the wine into a glass while whining about how no one has properly modeled the process of wine pouring. Observe the wine under controlled conditions for an hour. Present a wordy but content-less paper to an international conference on what wine might look like in infants. Rerun the analysis in a different glass in case the receptor affects the nature of the process. Wait another hour. Drink the wine. Drink more wine. Fall onto the floor drunk, bumping your head on a pipe on the way down. Write an even less coherent paper on the effects of head bumping on linguistic processing. Gain professorship&lt;/i&gt;&lt;/span&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-938311818305086705?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/938311818305086705/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=938311818305086705' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/938311818305086705'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/938311818305086705'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/new-drink-from-speculative.html' title=''/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-3286912579471095913</id><published>2010-12-13T08:33:00.000-05:00</published><updated>2010-12-13T08:33:01.050-05:00</updated><title type='text'>pimp grammar</title><content type='html'>There's a pimp's &lt;a href="http://www.youthradio.org/news/pimp-business-plan"&gt;handwritten business plan&lt;/a&gt; floating around the interwebz. While the soundness of its basic logic cannot be denied ("&lt;i&gt;Treat This Pimpin Like it's a Business&lt;/i&gt;" indeed), the former writing teacher in me could not help but pull out the old red pen and make a few suggestions. But here's the thing, it's a fact of contemporary college education that most writing teachers are loath to outright criticize or correct their students (they're paying tuition after all). You see, outside of the Ivy League, most college writing teachers are faced with whole classrooms filled with pimps like &lt;i&gt;Keep It Pimpin&lt;/i&gt;', and they're our bread and butter (we can't all be blessed with students like the &lt;a href="http://mashable.com/2010/12/09/winklevoss-twins-lawsuit-facebook/"&gt;Winkelvi&lt;/a&gt;, can we?). As a result, we are careful to word our feedback delicately, so as not to offend the senses of the ones who pad our, admittedly thin, paychecks*.&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_uqqKLCxLtII/TQWYr9oTD0I/AAAAAAAAAXc/YsMDYfSiPcg/s1600/pimp+grammar.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="480" src="http://1.bp.blogspot.com/_uqqKLCxLtII/TQWYr9oTD0I/AAAAAAAAAXc/YsMDYfSiPcg/s640/pimp+grammar.jpg" width="640" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;*Absolute truth: I taught college level research writing courses for the whopping total price of $1250/semester. The MOST I ever got paid for teaching a college level course was $2800. In the (modified) words of my literary hero &lt;a href="http://www.lyred.com/lyrics/Soundtracks/Hustle+&amp;amp;+Flow/It's+Hard+Out+Here+For+A+Pimp/"&gt;DJay&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;You know it's hard out here for a [rhetoric &amp;amp; writing instructor].&lt;/i&gt;&lt;br /&gt;&lt;i&gt;When he tryin to get this money for the rent.&lt;/i&gt;&lt;br /&gt;&lt;i&gt;For the Cadillacs and gas money spent&lt;/i&gt;&lt;br /&gt;&lt;i&gt;Because a whole lotta [students] talkin [nonsense]&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;HT &lt;a href="http://kottke.org/10/12/pimp-business-plan"&gt;kottke&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-3286912579471095913?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/3286912579471095913/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=3286912579471095913' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3286912579471095913'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/3286912579471095913'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/pimp-grammar.html' title='pimp grammar'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_uqqKLCxLtII/TQWYr9oTD0I/AAAAAAAAAXc/YsMDYfSiPcg/s72-c/pimp+grammar.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-5259207466130448133</id><published>2010-12-09T10:44:00.000-05:00</published><updated>2010-12-09T10:44:27.783-05:00</updated><title type='text'>a brief history of stanford linguistics dissertations</title><content type='html'>&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_uqqKLCxLtII/TQD2ww1QyUI/AAAAAAAAAXY/NbCMz6x_htQ/s1600/stanford_diss_ling.JPG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="298" src="http://1.bp.blogspot.com/_uqqKLCxLtII/TQD2ww1QyUI/AAAAAAAAAXY/NbCMz6x_htQ/s400/stanford_diss_ling.JPG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;The above image comes from the &lt;a href="http://nlp.stanford.edu/projects/dissertations/"&gt;Stanford Dissertation Browser&lt;/a&gt; and is centered on Linguistics. This tool performs some kind of textual analysis of Stanford dissertations: &lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;every dissertation is taken as a weighted mixture of a unigram language model associated with every Stanford department. This lets us infer, that, say, dissertation X is 60% computer science, 20% physics, and so on...Essentially, the visualization shows word overlap between departments measured by letting the dissertations in one department borrow words from another department.&lt;/span&gt;&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;Thus, the image above suggests that Linguistics borrows more words from Computer Science, Education, and Psychology than it does from other disciplines. What was most interesting was using the Back button to creating a moving picture of dissertation language over the last 15 years. you'll see a lot of bouncing back and forth. Stats makes a couple jumps here and there.&lt;br /&gt;&lt;br /&gt;HT &lt;a href="http://blogs.discovermagazine.com/gnxp/2010/12/how-close-are-scientific-disciplines/"&gt;Razib Khan&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-5259207466130448133?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/5259207466130448133/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=5259207466130448133' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5259207466130448133'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5259207466130448133'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/brief-history-of-stanford-linguistics.html' title='a brief history of stanford linguistics dissertations'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_uqqKLCxLtII/TQD2ww1QyUI/AAAAAAAAAXY/NbCMz6x_htQ/s72-c/stanford_diss_ling.JPG' height='72' width='72'/><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-6057388972833338244</id><published>2010-12-08T08:33:00.001-05:00</published><updated>2010-12-08T08:33:00.269-05:00</updated><title type='text'>the baffling linguistics of job postings</title><content type='html'>While Googling around for other things, I caught this odd fish contained within &lt;a href="http://sandiego.ebayclassifieds.com/sales-biz-dev/san-diego/account-manager-manufacturing-control-process-flow-software/?ad=7638045#ixzz17ShzOJ9u"&gt;a job posting&lt;/a&gt; for an &lt;i&gt;Account Manager&lt;/i&gt;:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;DISCLAIMER: ... &lt;b&gt;Linguistics used herein may use First Person Singular and First Person Plural grammatical person construction for and with the meaning of Third Person Singular and Third Person Plural references. We reserves&lt;/b&gt; the right to amend and change responsibilities to meet business and organizational needs as necessary &lt;/span&gt;&lt;/i&gt;(emphasis added).&lt;br /&gt;&lt;br /&gt;If I understand this correctly, the bold faced passage says that the authors are allowing themselves to use constructions like "&lt;i&gt;we walk&lt;b&gt;s&lt;/b&gt;&lt;/i&gt;..." and "&lt;i&gt;we talk&lt;b&gt;s&lt;/b&gt;&lt;/i&gt;..."&lt;br /&gt;&lt;br /&gt;But, if you look at the uses of "we" within the text of the actual job posting, nowhere do they actually do this, EXCEPT in the disclaimer itself. I find this baffling. What is the purpose of this? Simply to allow them to write "&lt;i&gt;We reserve&lt;b&gt;s&lt;/b&gt;&lt;/i&gt;..." I Googled the sentence and found it popping up in all kinds of &lt;a href="http://www.google.com/search?sourceid=chrome&amp;amp;ie=UTF-8&amp;amp;q=Linguistics+used+herein+may+use+First+Person+Singular+and+First+Person+Plural+grammatical+person+construction+for+and+with+the+meaning+of+Third+Person+Singular+and+Third+Person+Plural+references.#sclient=psy&amp;amp;hl=en&amp;amp;q=%22Linguistics+used+herein+may+use+First+Person+Singular+and+First+Person+Plural+grammatical+person+construction+for+and+with+the+meaning+of+Third+Person+Singular+and+Third+Person+Plural+references.%22&amp;amp;aq=f&amp;amp;aqi=&amp;amp;aql=&amp;amp;oq=&amp;amp;gs_rfai=&amp;amp;pbx=1&amp;amp;fp=9282fdae014bcd4b"&gt;job postings&lt;/a&gt; and the same thing is true. The only time a posting invokes its self-appointed right to this grammatical modification, is within the disclaimer. It appears to be boiler-plate job-speak of some kind. I'm remarkably freaked out by this.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-6057388972833338244?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/6057388972833338244/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=6057388972833338244' title='7 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/6057388972833338244'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/6057388972833338244'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/baffling-linguistics-of-job-postings.html' title='the baffling linguistics of job postings'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>7</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-5558608415639342979</id><published>2010-12-02T11:44:00.001-05:00</published><updated>2010-12-02T14:04:15.880-05:00</updated><title type='text'>how to spot an academic con artist</title><content type='html'>If you've been to college, you were taught how to scrutinize research sources at some point. Let's test your skills, shall we? Imagine you run across a popularized article and the author promotes his own expertise using the following:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;"Ph.D" after his name.&lt;/li&gt;&lt;li&gt;Referencing his multiple books.&lt;/li&gt;&lt;li&gt;Noting his academic appointments.&lt;/li&gt;&lt;li&gt;You look at his personal web page list of publications and you see dozens of articles and books going back several decades.&lt;/li&gt;&lt;/ul&gt;Must be an expert, right? Must be legit, right? This is what I saw for &lt;a href="http://www.johnmedina.com/?q=bio"&gt;John Medina&lt;/a&gt;, Ph.D., author of the HuffPo article '&lt;a href="http://www.huffingtonpost.com/john-medina-phd/parentese-baby-talk_b_789023.html"&gt;Parentese': Can Speaking To Your Baby This Way Make Her Smarter&lt;/a&gt;? But I quickly became suspicious about this man's credentials. Why? Let's look more closely at those bullet points:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;"Ph.D" after his name.&lt;/span&gt;&lt;/li&gt;I recall a professor once saying something like "&lt;i&gt;Once you've been to grad school, everyone you know has a Ph.D. It's just not that special&lt;/i&gt;." This may sound elitist, but the truth is, most people with Ph.Ds don't use the alphabet to promote themselves. They use their body of work. I'm almost always suspicious of people who promote themselves using their degrees. Plus, nowhere on his own site does he list a CV or even where he got his Ph.D. I had to find this at the UW web page, listing "PhD, Molecular Biology, Washington State University, 1988" and impressive degree, no doubt, but why hide this? It has become common practice for serious academics to provide their full CV on their web page. Medina fails to follow this practice.&lt;li&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;Referencing his multiple books&lt;/span&gt;.&lt;/li&gt;All of his books are aimed at non-academics. There's nothing wrong with trying to explain your expertise to a lay audience, but at some point you should also be trying to explain your expertise top other experts.&lt;li&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;Noting his academic appointments&lt;/span&gt;.&lt;/li&gt;Here, Medina does seem to have some impressive qualifications. He is an "Affiliate Professor, Bioengineering" at &lt;a href="http://depts.washington.edu/bioe/people/affiliate/people_affiliate.html"&gt;The University of Washington&lt;/a&gt;. As well as director of the &lt;a href="https://banweb.spu.edu/pls/prod/wwgkwhit.p_white_pages?submit_action=similar&amp;amp;search_string=~PIDM:10689591."&gt;Brain Center for Applied Learning Research&lt;/a&gt; at Seattle Pacific University (which, as far as I can tell, is &lt;a href="http://www.spu.edu/info/buildings/center-for-brain-research/"&gt;a house&lt;/a&gt; and has exactly two members, the director and his assistant).&lt;li&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;You look at his personal web page list of publications and you see dozens of articles and books going back several decades&lt;/span&gt;.&lt;/li&gt;This is the most suspicious by far. Yes he lists &lt;a href="http://www.johnmedina.com/?q=publications"&gt;dozens of publications&lt;/a&gt;, but almost all of them are short, 2-4 page articles IN THE SAME MAGAZINE, &lt;a href="http://www.psychiatrictimes.com/home"&gt;Psychiatric Times&lt;/a&gt;, a dubious looking magazine at best. The only others are in the equally dubious looking &lt;a href="http://www.cmellc.com/geriatrictimes/"&gt;Geriatric Times&lt;/a&gt;. His publications page does list a REFEREED PAPERS section with some more legitimate academic articles, but he's second or third author on almost all.&lt;/ul&gt;&lt;div&gt;Add to this the fact that his recommendations seem to be little more than common sense (i.e., talk to your kids more...no duh!). I have no problem with someone making money off their education, but this seems to be an example of trying to con people into believing he has more to say than he really does simply because of the letters P-h-D after his name.&lt;/div&gt;&lt;br /&gt;&lt;div&gt;Despite writing this, I don't feel terribly comfortable casting aspersions on someone who may indeed be a serious, legitimate academic. If I have made mistakes in this critique, I will apologize. But then again it is incumbent upon Medina to do a better job of representing his credentials. And it is incumbent upon us as us a lay readers (hey, I ain't no molecular biologists either) to scrutinize supposed experts who are asking us to pay for their expertise (in the form of book prices and speaking fees).&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-5558608415639342979?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/5558608415639342979/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=5558608415639342979' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5558608415639342979'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/5558608415639342979'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/12/how-to-spot-academic-con-artist.html' title='how to spot an academic con artist'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-6991628703924813358</id><published>2010-12-01T08:33:00.000-05:00</published><updated>2010-11-30T11:06:19.022-05:00</updated><title type='text'>94,000 language deaths!</title><content type='html'>History's only Emmy-nominated linguist* K. David Harrison answers questions over at &lt;a href="http://www.economist.com/blogs/johnson/2010/11/interview"&gt;The Johnson blog&lt;/a&gt; about language death, his favorite topic. He repeats what he's been saying for the last few years about language death, and he generally makes good points; however, he says two things worth responding to:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;"&lt;span class="Apple-style-span" style="color: #38761d; font-style: italic;"&gt;The human knowledge base is eroding as we lose languages&lt;/span&gt;"&lt;/li&gt;&lt;li&gt;"...&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;bilingualism strengthens the brain&lt;/span&gt;&lt;/i&gt;"&lt;/li&gt;&lt;/ol&gt;The first one is a vague and complicated claim often promoted by language-deathers** and the second is a goofy metaphor (at best). Let's walk through the reasons why these statements should not be a part of a serious discussion of language death:&lt;br /&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: #38761d; font-style: italic;"&gt;The human knowledge base is eroding as we lose languages&lt;/span&gt;&lt;br /&gt;My primary critique of this claim is that it's just not clear what it really means. In what way does a language uniquely encode information? Harrison provides a few simple examples, mostly lexical items that show us how a particular language fore-fronted particular features to encode, and the argument is that that tells us something about that culture's perceptions of what was important to them. This is probably true to some extant, but honestly, we still do not understand language well enough to truly understand what lexical features tell us about a culture. This is hyperbole at best. But this is NOT an argument against language death per se, it's just a fact.&lt;br /&gt;&lt;br /&gt;So what if we lose some facts about a culture's perceptions of the world? Let's assume there are 6000 language alive today. How many have already died? We don't know. For a rough estimate, let's draw an analogy and ask the question, how many humans have ever lived? A few years ago, the Population Reference Bureau did a "&lt;a href="http://www.prb.org/Articles/2002/HowManyPeopleHaveEverLivedonEarth.aspx"&gt;semi-scientific" guesstimate&lt;/a&gt; of this question and determined that less than 6% of all people who had ever lived, were still alive in 2002. If we assume that languages come and go at a pace that correlates with populations, then we can assume that the current 6000 living languages are about 6% of the total number of languages that ever existed. That means the total number of languages that have ever existed is around 100,000***. This means we've already lost 94,000 languages that were never documented. 94,000 language deaths. 94,000 lost knowledge bases. Oh, the horror, the horror!&lt;br /&gt;&lt;br /&gt;Exactly how bad off should we currently be if Harrison is correct about the ill effects of language death now that we know we've lost 94,000 languages? Are we really that bad off? Clearly the answer is no, we're not that bad off. If losing 94,000 languages has not caused grave danger to humanity, why would losing another 3,000?&lt;br /&gt;&lt;br /&gt;Yes, I agree that all languages have unique linguistic properties that are worth studying in themselves. But just because we find interesting data in every language does NOT mean we should stop language death per se. We need a broader understanding of the system of language interaction and language evolution, otherwise stopping language death may be as irresponsible as causing language death. Genetics blogger Razib Khan has made &lt;a href="http://blogs.discovermagazine.com/gnxp/2010/07/linguistic-diversity-poverty/"&gt;a compelling argument&lt;/a&gt;&amp;nbsp;that "&lt;i&gt;high linguistic diversity is not conducive to economic growth, social cooperation, and amity&lt;/i&gt;." This is just one speculative claim, but at least it's a voice on the other side of this issue.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;bilingualism strengthens the brain&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;This is just goofy phrasing. He's referencing important neurolinguistic research, so why trivialize it by using such patently absurd language?&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;*I actually don't know this to be true, definitively.&lt;br /&gt;**Ooooh, I'm being a little caustic there, hehe.&lt;br /&gt;***This estimate is remarkably similar to the ones David Crystal&amp;nbsp;discusses&amp;nbsp;in his book &lt;a href="http://www.amazon.com/Language-Death-David-Crystal/dp/0521012716"&gt;Language Death&lt;/a&gt;. In that book, he says anywhere from 64,000 to 140,000 is a reasonable&amp;nbsp;guesstimate. My 100,000 splits that damn near down the middle.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-6991628703924813358?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/6991628703924813358/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=6991628703924813358' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/6991628703924813358'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/6991628703924813358'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/11/94000-language-deaths.html' title='94,000 language deaths!'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-7392758581359503286</id><published>2010-11-23T07:33:00.004-05:00</published><updated>2010-11-23T09:43:53.307-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='research blogging'/><title type='text'>purple pain and a gene called 'straightjacket'</title><content type='html'>Dr. Kevin Mitchell, a neuroscientist at Smurfit Institute of Genetics, University of Dublin, posted at his excellent blog &lt;a href="http://wiringthebrain.blogspot.com/2010/11/synaesthetic-mouse.html?utm_source=feedburner&amp;amp;utm_medium=feed&amp;amp;utm_campaign=Feed:+ResearchBloggingAllEnglish+(Research+Blogging+-+English+-+All+Topics)"&gt;Wiring the Brain&lt;/a&gt; about a weird, interesting study* that points to a possible genetic explanation of synaesthesia** (e.g., hearing a word and experiencing the color &lt;span class="Apple-style-span" style="color: red;"&gt;red&lt;/span&gt;). The authors were studying pain mechanisms in fruit flies (turns out the mechanisms are similar to us mammals, whuddathunk?). Once they identified a particular gene they dubbed &lt;i&gt;straightjacket&lt;/i&gt;*** which is "&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;involved in modulating neurotransmission,&lt;/span&gt;&lt;/i&gt;" they systematically deleted it in test flies and discovered that the test subjects**** no longer processed the pain stimuli, even though the pain stimuli was following the pathway. In Mitchell's words: &lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Somehow, deletion of CACNA2D3 alters connectivity within the thalamus or from thalamus to cortex in a way that precludes transmission of the signal to the pain matrix areas. This is where the story really gets interesting. While they did not observe responses of the pain matrix areas in response to painful stimuli, they did observe something very unexpected – responses of the &lt;b&gt;visual &lt;/b&gt;and &lt;b&gt;auditory &lt;/b&gt;areas of the cortex! What’s more, they observed similar responses to &lt;b&gt;tactile &lt;/b&gt;stimuli administered to the whiskers. Whatever is going on clearly affects more than just the pain circuitry &lt;/span&gt;&lt;/i&gt; (emphasis added).&lt;br /&gt;&lt;br /&gt;So, if I understand this, they turned off the ability to recognize pain, but when they administered painful stimuli (heat), the test subjects had visual, auditory, and tactile experiences. Imagine putting a flame to your hand and seeing &lt;span class="Apple-style-span" style="color: #741b47;"&gt;purple&lt;/span&gt;. Pretty frikkin awesome. Dr. Mitchell's post does more justice to this complex study, I just thought it was awesome.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;*Geez! Take a look at the author list of the publication. Do you have a place for &lt;i&gt;12th author&lt;/i&gt; on YOUR CV?&lt;br /&gt;&lt;br /&gt;**FYI: Synaesthesia is NOT the same thing as sound symbolism, necessarily. True synaesthesia is a rare phenomenon that appears to have biophysical roots. Sound symbolism is mostly hippie-dippy bullshit exploited by marketing professionals to sell stuff.&lt;br /&gt;&lt;br /&gt;***I have no clue why they called it this, but it's a hell of a lot more awesome than CACNA2D3.&lt;br /&gt;&lt;br /&gt;****There were multiple studies referenced, some involving fruit flies, some involving mice, and it wasn't clear to me which evidence came from which studies, so I have chosen to use the cover term "test subjects."&lt;br /&gt;&lt;br /&gt;&lt;span style="float: left; padding-bottom: 5px; padding-left: 5px; padding-right: 5px; padding-top: 5px;"&gt;&lt;a href="http://www.researchblogging.org/"&gt;&lt;img alt="ResearchBlogging.org" src="http://www.researchblogging.org/public/citation_icons/rb2_large_gray.png" style="border: 0;" /&gt;&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;span class="Z3988" title="ctx_ver=Z39.88-2004&amp;amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;amp;rft.jtitle=Cell&amp;amp;rft_id=info%3Apmid%2F21074052&amp;amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;amp;rft.atitle=A+Genome-wide+Drosophila+Screen+for+Heat+Nociception+Identifies+%CE%B12%CE%B43+as+an+Evolutionarily+Conserved+Pain+Gene.&amp;amp;rft.issn=0092-8674&amp;amp;rft.date=2010&amp;amp;rft.volume=143&amp;amp;rft.issue=4&amp;amp;rft.spage=628&amp;amp;rft.epage=38&amp;amp;rft.artnum=&amp;amp;rft.au=Neely+GG&amp;amp;rft.au=Hess+A&amp;amp;rft.au=Costigan+M&amp;amp;rft.au=Keene+AC&amp;amp;rft.au=Goulas+S&amp;amp;rft.au=Langeslag+M&amp;amp;rft.au=Griffin+RS&amp;amp;rft.au=Belfer+I&amp;amp;rft.au=Dai+F&amp;amp;rft.au=Smith+SB&amp;amp;rft.au=Diatchenko+L&amp;amp;rft.au=Gupta+V&amp;amp;rft.au=Xia+CP&amp;amp;rft.au=Amann+S&amp;amp;rft.au=Kreitz+S&amp;amp;rft.au=Heindl-Erdmann+C&amp;amp;rft.au=Wolz+S&amp;amp;rft.au=Ly+CV&amp;amp;rft.au=Arora+S&amp;amp;rft.au=Sarangi+R&amp;amp;rft.au=Dan+D&amp;amp;rft.au=Novatchkova+M&amp;amp;rft.au=Rosenzweig+M&amp;amp;rft.au=Gibson+DG&amp;amp;rft.au=Truong+D&amp;amp;rft.au=Schramek+D&amp;amp;rft.au=Zoranovic+T&amp;amp;rft.au=Cronin+SJ&amp;amp;rft.au=Angjeli+B&amp;amp;rft.au=Brune+K&amp;amp;rft.au=Dietzl+G&amp;amp;rft.au=Maixner+W&amp;amp;rft.au=Meixner+A&amp;amp;rft.au=Thomas+W&amp;amp;rft.au=Pospisilik+JA&amp;amp;rft.au=Alenius+M&amp;amp;rft.au=Kress+M&amp;amp;rft.au=Subramaniam+S&amp;amp;rft.au=Garrity+PA&amp;amp;rft.au=Bellen+HJ&amp;amp;rft.au=Woolf+CJ&amp;amp;rft.au=Penninger+JM&amp;amp;rfe_dat=bpr3.included=1;bpr3.tags=Biology%2CPsychology%2CSocial+Science%2CResearch+%2F+Scholarship%2COther%2CNeuroscience%2CLinguistics"&gt;Neely GG, Hess A, Costigan M, Keene AC, Goulas S, Langeslag M, Griffin RS, Belfer I, Dai F, Smith SB, Diatchenko L, Gupta V, Xia CP, Amann S, Kreitz S, Heindl-Erdmann C, Wolz S, Ly CV, Arora S, Sarangi R, Dan D, Novatchkova M, Rosenzweig M, Gibson DG, Truong D, Schramek D, Zoranovic T, Cronin SJ, Angjeli B, Brune K, Dietzl G, Maixner W, Meixner A, Thomas W, Pospisilik JA, Alenius M, Kress M, Subramaniam S, Garrity PA, Bellen HJ, Woolf CJ, &amp;amp; Penninger JM (2010). A Genome-wide Drosophila Screen for Heat Nociception Identifies α2δ3 as an Evolutionarily Conserved Pain Gene. &lt;span style="font-style: italic;"&gt;Cell, 143&lt;/span&gt; (4), 628-38 PMID: &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/21074052" rev="review"&gt;21074052&lt;/a&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-7392758581359503286?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/7392758581359503286/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=7392758581359503286' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7392758581359503286'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/7392758581359503286'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/11/purple-pain-and-gene-called.html' title='purple pain and a gene called &apos;straightjacket&apos;'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-8786163048492483863</id><published>2010-11-22T15:29:00.000-05:00</published><updated>2010-11-22T15:29:37.428-05:00</updated><title type='text'>unaxseptable</title><content type='html'>kottke &lt;a href="http://kottke.org/10/11/hard-coding-bias-in-google-algorithmic-search-results"&gt;rants against&lt;/a&gt; Google's biased calculator (because it provides an answer rather than search results) then finishes with this:&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #38761d;"&gt;Google! This. Is. Un. Acce. Ptable!&lt;/span&gt;&lt;/i&gt;&lt;/div&gt;&lt;br /&gt;Huh? Does kottke have dyslexia*? I get the staccato pronunciation he's representing, but that's not at all how I would say it. I would say (and write): &lt;i&gt;Un. Ac. Cept. Able&lt;/i&gt;! Even if you want to pose your own variation, can we agree that kottke's is simply not a viable analysis of the syllable structure of the word? The double "c" spelling makes it a bit odd looking, but it actually helps in the analysis. Each "c" represents a different sound, and each sound should go in a different syllable. And how does he start the final syllable with "pt"! That is unaxseptable!&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;*Dyslexics can have difficulty counting syllables.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-8786163048492483863?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/8786163048492483863/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=8786163048492483863' title='6 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8786163048492483863'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/8786163048492483863'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/11/unaxseptable.html' title='unaxseptable'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>6</thr:total></entry><entry><id>tag:blogger.com,1999:blog-520807396714463309.post-2645395402317559530</id><published>2010-11-20T08:33:00.004-05:00</published><updated>2010-11-20T20:15:51.631-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='pronouncify'/><title type='text'>Pronouncify.com and the fictional Princeton Linguistics department</title><content type='html'>I spent Thursday night on a plane so I missed &lt;i&gt;30 Rock&lt;/i&gt; and the most linguistics oriented sit-com episode since ... since ... um ... okay, the most linguistics oriented sitcom episode EVER! But thanks to the innerwebz, I have caught up on my TV addiction.&lt;br /&gt;&lt;br /&gt;The set-up has Jack Donaghy being the voice of &lt;a href="http://www.nbc.com/30-rock/exclusives/pronouncify/"&gt;Pronouncify.com&lt;/a&gt;, (I BEG you to sign up, PLEASE!!!!) &amp;nbsp;a website that demonstrates the &lt;i&gt;correct &lt;/i&gt;pronunciation of all English words.&amp;nbsp;Apparently, when Jack was a poor undergrad at Princeton, he was hired by the "Linguistics Department" to pronounce every word in an English dictionary to preserve the &lt;i&gt;correct &lt;/i&gt;pronunciation for generations to come. But they sold his readings, and hence his voice is now the voice of Pronouncify.com (as well as the first perfect microwave...).&lt;br /&gt;&lt;br /&gt;Here is as faithful a transcript of the critical dialogue as I can muster:&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: red;"&gt;Jack&lt;/span&gt;&lt;/b&gt;: &lt;i&gt;Those bastards&lt;/i&gt;!&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;Liz&lt;/span&gt;&lt;/b&gt;: &lt;i&gt;Who bastards&lt;/i&gt;?&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: red;"&gt;Jack&lt;/span&gt;&lt;/b&gt;: &lt;i&gt;Part of my Princeton scholarship included work for the &lt;b&gt;Linguistics department&lt;/b&gt;. They wanted me to record every word in the dictionary to preserve the perfect American accent in case of nuclear war. Well, the cold war ended, and Princeton began selling the recordings&lt;/i&gt;.&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;Liz&lt;/span&gt;&lt;/b&gt;: &lt;i&gt;So people can just buy your voice&lt;/i&gt;?&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: red;"&gt;Jack&lt;/span&gt;&lt;/b&gt;: &lt;i&gt;Ohhhh, the things it's been dragged into. Thomas the Tank Engine; Wu-Tang songs&lt;/i&gt;...&lt;br /&gt;&lt;br /&gt;This must have been the glory days before the hippies took over and started "protecting" undergrads from "exploitation." Whatever...&lt;br /&gt;&lt;br /&gt;In any case, it's understandable that this trivial tid-bit of academic minutia blew right by most people, but it is a fact of the world we live in that Princeton University does not have a linguistics department per se. They do offer an &lt;a href="http://www.princeton.edu/linguistics/about/"&gt;Undergraduate Program in Linguistics&lt;/a&gt; in which students can "pursue a Certificate in Linguistics," but this is not an official department as far as I understand it. Jack, if he is the same age as the actor Alec Baldwin, would have been at Princeton in late 1970s. Maybe they had a full fledged department back then, I honestly don't know.&lt;br /&gt;&lt;br /&gt;You can watch the episode &lt;a href="http://www.nbc.com/30-rock/video/college/1261073/"&gt;College &lt;/a&gt;at NBC, or wherever else you prefer. BTW, there's an awesome ode to color perception conundrums at the end as well. It's all kinda linguisticee/cog sciencee (I never know how to add the -ee morpheme?).&lt;br /&gt;&lt;br /&gt;Random after-point: Near the end of Thursday's episode of &lt;i&gt;Community&lt;/i&gt;, Dean Pelton actually utilized the Shakespearean subjunctive construction &lt;i&gt;Would that X were Y&lt;/i&gt;... He says "&lt;i&gt;Would that this hoodie were a time hoodie&lt;/i&gt;" around the 19:20 mark (see Hamlet, &lt;i&gt;would it were not so, you are my mother&lt;/i&gt;). Just thought that was kinda awesome.&lt;br /&gt;&lt;br /&gt;And not for nuthin', but if you haven't seen Tina Fey's Mark Twain Prize speech, it's a gem: &lt;a href="http://www.huffingtonpost.com/2010/11/16/tina-fey-mark-twain-speech_n_784178.html"&gt;HERE&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/520807396714463309-2645395402317559530?l=thelousylinguist.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://thelousylinguist.blogspot.com/feeds/2645395402317559530/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=520807396714463309&amp;postID=2645395402317559530' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/2645395402317559530'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/520807396714463309/posts/default/2645395402317559530'/><link rel='alternate' type='text/html' href='http://thelousylinguist.blogspot.com/2010/11/pronuncifycom-and-fictional-princeton.html' title='Pronouncify.com and the fictional Princeton Linguistics department'/><author><name>Chris</name><uri>http://www.blogger.com/profile/09558846279006287148</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='27' height='32' src='http://4.bp.blogspot.com/_uqqKLCxLtII/Si_d4pyMBVI/AAAAAAAAAIs/-knM8aWUTMo/S220/me_us_news.jpg'/></author><thr:total>3</thr:total></entry></feed>
