For some time now, the English speaking linguistics world has anxiously awaited the arrival of our millionth word in English (see here and here). I have a bottle of Freixenet permanently on ice just for that wondrous day. But alas! It appears that Inuktitut has beaten my native language to the prize of all prizes. According to this story about Microsoft's Inuit language software,
More than one million words have been programmed in Inuktitut through the collaboration, about 5,000 of which are new Inuktitut words (emphasis added).
I'm a gracious loser. Congratulations Inuktitut. See you at the 2 millionth mark.
Subscribe to:
Post Comments (Atom)
TV Linguistics - Pronouncify.com and the fictional Princeton Linguistics department
[reposted from 11/20/10] I spent Thursday night on a plane so I missed 30 Rock and the most linguistics oriented sit-com episode since ...
-
The commenters over at Liberman's post Apico-labials in English all clearly prefer the spelling syncing , but I find it just weird look...
-
Matt Damon's latest hit movie Elysium has a few linguistic oddities worth pointing out. The film takes place in a dystopian future set i...
-
(image from Slate.com ) I tend to avoid Slate.com these days because, frankly, I typically find myself scoffing at some idiot article they...
5 comments:
You can have a look at the IT terminology that was standardized by Microsoft in collaboration with local experts, governments and universities in the Microsoft Community Glossary Project site, which is part of the Language Interface Pack (LIP) project. The list of terms is the same for all LIP languages and obviously it does not include function words or the standard lexicon that make up the bulk of the million words in the localized software strings.
Licia, thanks for the links! I'll have a closer look tomorrow. I'm still suspicious of the whole "million words" concept, but I'm open minded, and the links you provided look like a fruitful area to peruse.
I also should say that I'm impressed with Microsoft's cross-linguistics efforts, just as I was with Rosetta Stones' here
LIP is indeed a very interesting project.
BTW, a product like Office easily exceeds the “million words” mark :-)
In this press release about Afrikaans, isiXhosa, Setswana, isiZulu and Sesotho sa Leboa, the number of words quoted for the localization of Windows and Office is four million.
You need to take into account the large number of duplicate strings; words like OK and Cancel can occur thousand of times in the same product. Just to give you an example, if you look up “shared” with Inuktitut as target language in the Microsoft Language Portal, you’ll get about 80 strings / 200 source words and quite some repetition (and you need to bear in mind that exact duplicates are removed from the search results).
Ahhhhh, okay, it's the old type/token distinction. I wasn't making that connection. I assumed they meant 1 million unique types. Thanks!
There's also a LIP blog, http://blogs.technet.com/office_language_interface_pack/default.aspx and a Microsoft Language Portal blog, http://blogs.technet.com/terminology/archive/2010/02/22/sharing-it-terminology-with-local-language-communities.aspx
Post a Comment