Tuesday, October 16, 2007

Data and Models

Mankiw on Greenspan and macro-economics:

Better monetary policy, he suggests, is more likely to follow from better data than from better models. Relatively little modern macro has been directed at improving data sources. Perhaps that is a mistake.

Methinks this same sentiment could be said of linguistics. However, I am ambivalent. On the one hand, I am trained in a department long dedicated to descriptive linguistics, so I’m frightened by the lack of good description for most of the world’s languages. I believe in supporting field linguists and old fashioned grammar writing tasks. But I’m equally frightened by the lack of good models of language, particularly of language change and evolution. I’m sympathetic to the recent flood of computationally minded engineers into the field of linguistics who have brought fresh approaches (e.g., statistical). Here’s a representative sample of very smart people bringing mathematical/computational modeling into linguistics:

Sandiway Fong -- U. Arizona
Partha Niyogi -- U. Chicago
Josh Tenenbaum -- MIT
Charles Yang -- U. Penn


Jason Adams said...

Great list of people bringing computational/mathematical models to linguistics. I'm doing a literature review in a class on computational methods being applied to historical linguistics. Any other pointers?

Chris said...

Jason, thanks for the post. Partha Niyogi is spot on for this, but other than him, I'm not sure. I would check Joan Bybee's page to see if she references anything. Hope this helps.

A linguist asks some questions about word vectors

I have at best a passing familiarity with word vectors, strictly from a 30,000 foot view. I've never directly used them outside a handfu...