NATIVE AMERICAN LANGUAGES.

The useful Native Languages of the Americas site compiles as many links as they can find:

We are a small non-profit organization dedicated to the survival of Native American languages, particularly through the use of Internet technology. Our website is not beautiful. Probably, it never will be. But this site has inner beauty, for it is, or will be, a compendium of online materials about more than 800 indigenous languages of the Western Hemisphere and the people that speak them.

I’m particularly taken with their faq page, linked on the front page from the question “Why aren’t there any links about how American Indian languages are descended from Ancient Egyptian?” Their discussion of this issue includes a very good chart explaining how languages are grouped into families and the role of coincidence:

As you can probably see even from this small amount of data, English is related to Dutch and German; Hebrew is related to Arabic and Maltese; and Ojibwe is related to Algonquin and Cree. On the other hand, if I had taken only the English word “seven” and the Hebrew word “sheva,” maybe I could have convinced you English was related to Hebrew. And if I had shown you only Hebrew “shalosh” and Arabic “thalatha,” you might not have noticed they were related.

(Via plep [scroll down to 1st November].)

PIRAHA AND MUNDURUKU.

I have previously written about the Pirahã and the issues raised by their lack of number words (update here); Mark Liberman has now posted at Language Log an extensive collection of links and analysis about them and the Mundurukú, another Amazonian tribe whose language (part of the Tupí family) has words only for 1, 2, 3, and 4, with pogbi ‘hand’ being used for ‘five or so’ and everything else being ‘some’ or ‘many.’ Mark’s MetaFilter post contains more links and discussion; my favorite bit from Brian Butterworth’s Guardian story is: “…even in the range of their vocabulary, the Munduruku are approximate – ‘ebadipdip’ is typically used for four, but also used for three, five and six.” I just love that word ebadipdip; if it were part of my language, I’d use it for as many numbers as I could.

LANGUAGE GUESSER.

Maciej Cegłowski of Idle Words has created something called Languid (langu– ID, get it?):

I’ve set up a little web service for identifying language. If you paste in some text (the more the better), it will tell you what language it’s in. Not rocket science, but perhaps useful to somebody.

There’s an API for people who like to do things programatically.

Note that I’m logging all the queries, so you don’t have to email me and say “I pasted BLAH and it gave me the wrong answer”. But any other feedback is welcome.

Me, I pasted Inuit (the text string from my Last Samurai post) and it told me it was Cebuano; this perplexed me less when I saw that on the right of the Languid page is a vertical list of all the languages he’s programmed into it, which includes Cebuano but not Inuit. Anyway, it’s a lot of fun, and I thank Margaret Marks (of Transblawg) for alerting me to it (via this Blethers post).

SEXIST DECLENSION.

Just when you think you’ve seen everything:

Can sexist ideologies be reflected in inflectional classes? On the basis of a detailed discussion of the Russian a-declension, the present paper answers this question in the affirmative. More specifically the central claims are:
— The a-declension reflects the Idealized Cognitive Models of “women as the second sex” and “woman as Madonna and whore”.
— Cognitive linguistics provides an adequate account for the category structure in terms of schematicity and metaphorical extension.

As Alexei, from whom I swiped this absurdity, says, it’s “an abstract of a paper in what’s called Cognitive Linguistics… Yes, the author is a man: Tore Nesset is a male professor at the University of Tromsø, Norway.” He is “tempted to suppose this piece appeared in an April 1 issue of a linguistics journal,” but I’m afraid it’s just another example of academic silliness run amok.

JOHN DOE’S COUSINS.

From a wide-ranging Transblawg post on names of anonymous litigants:

Wilbur H. FRIEDMAN, Plaintiff-Appellant, v. Thomas B. FERGUSON, Director, Department of Animal Control, a state actor, in his official and individual capacities; Brett Boe; Carla Coe; Donna Doe; Frank Foe; Grace Goe; Harry Hoe; State Actors, Advisors To Defendant Ferguson, In Their Official and Individual Capacities (identities currently unknown); Marta Moe; Norma Noe; Paula Poe; Ralph Roe; Sammy Soe; Tommy Toe; Private Individuals Who Conspired With the Foregoing State Actors (identities currently unknown); Roger W. Galvin, Chairman, Animal Matters Hearing Board; Vince Voe; William Woe; Xerxes Xoe; Members of the Animal Matters Hearing Board, State Actors, In Their Official and Individual Capacities (identities currently unknown), Defendants-Appellees

Xerxes Xoe?!

ENGLISH IN MALAY.

Jordan MacVay, a Canadian (or as he puts it “Caper, Bluenoser, Canuck, former Haligonian”) living in Malaysia, discusses many things in his blog MACVAYSIA, some of them language-related; he has, for instance, an excellent post about the “invasion” of English words in Malay, sensibly pooh-poohing the doom-cryers and pointing out the usefulness of loanwords:

First of all, the ‘purists’ who decry the use of English words have to realize that stripping words of foreign origin out of Malay would leave it with hardly any words at all. One prominent historian has pointed out that only three Malay words are exclusively Malay: kayu (wood), batu (stone) and babi (pig). Another historian has added padi (rice field) and two or three other words to that list. So where the heck did all the other Malay words come from? Most Malay words came from other languages including (but not limited to) Sanskrit, Arabic, Javanese, Portuguese and, you guessed it, English. For many centuries Malays have had a flair for adopting foreign words and adapting them to suit their language needs. A closer look at the English words in the above list shows that while some of them are in their original English form (bank, hospital, hotel), this is only because their spelling suits Malay conventions of spelling and pronunciation. Other words are altered to reflect these conventions, and these alterations make the words uniquely Malay despite their English origins. This is the case of the word bajet, which has prompted some purists to question why the government didn’t use the Malay term, anggaran belanja (at least I think that’s the official term, I’ll have to check a dictionary). The government has explained that the old term does not adequately express the exact meaning of a budget being tabled by the a government, so the English word has been adopted, albeit in a modified form. So who’s to say bajet is not a Malay word? It serves a purpose, and now there it is.

I suspect the “three Malay words” thing is a rhetorical exaggeration, but the point is a good one. And for a fascinating example of just how useful borrowings can be, check out his followup post on English pronouns (yes, pronouns) in Malay!

VAPNYAR.

The latest New Yorker has a story by Lara Vapnyar; when I saw the name I guessed it was Indian, but it turns out to be Russian—or to be more accurate, from one of the many nationalities that were bundled into the USSR. My question is, which one? The name is not in any of my reference works, even Unbegaun’s magnificent Russian Surnames, and its indecipherability is eating away at my composure. Is it Udmurt? Bashkir? Some remote Caucasian nationality? Google has failed me, but I have confidence in my readership.

Update (Sept. 2025). I was annoyed that the link now got a 404, and decided to replace it with an archived version, which I did, but I thought I’d link to the story as well, and that turns out to be impossible — it was in the November 1, 2004 issue, and they don’t even mention it on that page. It was turned into a novel, and I guess one of the conditions was that the magazine scrub it from its website. Bah.

UYEZWA NA?

Bill Poser at Language Log describes a project to translate Free Software into the eleven official languages of South Africa, giving the following excellent quote from an article in the Toronto Globe and Mail:

Dwayne Bailey hears the question all the time. “Why bother translating software into isiZulu?” people ask him. “Who needs it? English is the language of global business — you’d be better off spending your energy teaching people English. To which Mr. Bailey replies, quite simply, “Izixhobo kufuneka zisebenzele abantu, hayi abantu izixhobo. Isoftware sisixhobo ngoko ke kumele sisebenzele abantu ngolwimi lwabo lwasemzini!

For translation, see Bill’s post.

TALKIN’ CAPE BRETON.

Cape Breton (French: île du Cap-Breton, Scottish Gaelic: Eilean Cheap Breatuinn, Mi’kmaq: U’namakika) is a linguistically complex place. Many Mi’kmaq (Micmac) still speak their Algonquian language; it’s “the only area in the world – outside of Scotland itself – where Gaelic continues as a living language and culture” (the “grouping of people according to their place of origin in Scotland allowed for the transfer, whole and intact, of localized dialects, of music, song and dance traditions, and of patterns of religious adherence”); there’s a community of Acadian French speakers (“When we speak about a cat we pronounce ‘chat’, but when we refer… to a mess or to a wad of chewing tobacco, we pronounce ‘tchat'”); and of course there’s the local variety of English, about which you can get a lively report here. A sample:

The plural of the word “you” is, of course, “youz”. Thus, the phrase “Each of you appears to be quite intoxicated.” becomes “Youz are all fuckin’ right out of ‘er.” If a native Cape Breton is out of work, they are entitled to collect Unemployment Insurance. This is colloquially known as “The Pogey”. Therefore: “Buddy got his pogey, picked up a few points of Keets and got right fuckin’ out of ‘er” is a very common local sentiment.

In Cape Breton it is customary to greet an acquaintance with a warm phrase. Often an inquiry wondering how the other person is faring is rendered as “What’s goin’ on, b’y?” In the fashion of so many cultures, the proper answer to this greeting is “What’s goin’ on?” Everyone else may safely be called “Buddy”. An exchange of pleasantries between two strangers may begin with “Eh? Buddy.” and be reciprocated with “Eh!”

This last is via jb’s MonkeyFilter thread, where you can find more links in the comments.

MORE BOREDOM.

As a followup to my earlier entry on the construction “boring of the task” and the Language Log entries by Mark Liberman linked therein, Mark has posted Horror and boredom in Castile, a summary of Christopher J. Pountain’s paper “The Castilian reflexes of ABHORRERE/ABHORRESCERE: a case-study in valency“:

The basic observation is that Latin abhorrere started out meaning “to shrink back from, have an aversion for, shudder at, abhor”, but one of the Spanish descendents, aburrir, wound up meaning “to bore”. So not only did the meaning change, but also the “valency” (in the sense of which verbal arguments go where). “I abhor you” turned into “you bore me”.

The original paper has several useful diagrams showing semantic ranges, and Mark reproduces the one showing the historical development in Castilian.

I would also like to second the recommendation in Mark’s more recent post for Pountain’s book (coauthored with R. E. Batchelor) Using Spanish: A Guide to Contemporary Usage. It’s part of a Cambridge series (of which I also have the German volume by Martin Durrell), and it’s extremely well done, with lists of “misleading similarities,” fields of meaning, complex verbal expressions, and the like, all with careful attention to register and geographical restrictions. (I assume, by the way, that Pountain rhymes with fountain, but if anyone knows for sure, please comment.) [It does: see Y’s comment below.]