languagehat.com

Thus Spoke Hesiod.

February 26, 2026 by languagehat 3 Comments

Matthew Scarborough, LH’s house Indo-Europeanist (e.g.), has a new project going, which I’ll let him introduce:

A small project that I’ve started doing for fun – and to be frank give me a small outlet in this current moment of being between jobs – is an account I’ve made over on Bluesky posting a line-by-line translation of Hesiod’s Theogony (for the non-Hellenists out there, a roughly seventh to sixth-century BCE short epic on the origins of the Greek pantheon) together with metrical analyses and, since line three, phonological transcriptions of the Greek text.

I am currently aiming to post one line per day, together with parsed vocabulary, and I’ve already put together a full translation and transcription with metrical analyses and parsing of the entire text on my computer, so the only thing preventing me from not doing the whole text over the course of the next two and a half years is simply me remembering to post a line per day (and Bluesky continuing to be a viable online platform, which in this current social media ecosystem, who knows?).

I do have an ulterior goal from all this, however. I aim to add philological and etymological commentary to the master file that I’ve prepared and add as an introduction a sketch of Greek historical grammar. In something more of a book-like format, it could serve as an introduction to Greek historical linguistics and Indo-European to students of Greek or students of Indo-European, and possibly a reliable linguistically annotated text for general linguists who might want illustrative examples of Ancient Greek in the heterogeneous variety of East Ionic that is typically used for early Greek epic poetry.

Anyway, if this is the sort of thing you might be interested in check it out. As of today I’ve posted up to line 43 of 1021, so there’s already a month’s backlog for reading. I think the expanded historical linguistic commentary and grammar outline is a good idea for a textbook project, but I don’t know how big of an audience there is for it. I can’t be the only person who thinks that the textbook would be a good thing to be out there if I can just manage to finish it in the first place. Let me know if the comments what you think.

I for one think this is a great idea and hope he keeps it up.

A New Irritant.

February 25, 2026 by languagehat 8 Comments

As I said here, my wife and I are reading Paul Scott’s Raj Quartet, and I’ve gotten to another passage of sufficient Hattic interest to post. One of the protagonists is Hari Kumar, who was raised by his father Duleep to be a perfect Englishman; in this section we are learning about Duleep’s background:

Duleep Kumar was the youngest of a family of four boys and three girls. Perhaps a family of seven children was counted auspicious, for after his birth (in 1888) it seemed for several years that his parents were satisfied and intended no further addition. […] Of the four brothers only Duleep completed the course at the Government Higher School and went on to the Government College. His family thought that to study at the college was a waste of time, so they opposed the plan, but finally gave in. In later years he was fond of quoting figures from the provincial census taken round about this time, that showed a male population of twenty-four and a half million and a female population of twenty-three million. Of the males one and a half million were literate; of the females less than fifty-six thousand, a figure which did not include his three elder sisters. His father and brothers were literate in the vernacular, semi-literate in English. It was because as a youth Duleep had acquired a good knowledge of the language of the administrators that he began to accompany his father on visits to petition the sub-divisional officer, and had the first intimations of the secrets hidden behind the bland face of the white authority. There grew in him a triple determination – to break away from a landlocked family tradition, to become a man who instead of requesting favours, granted them, and to save Shalini from the ignorance and domestic tyranny not only his other sisters but his two elder brothers’ wives seemed to accept uncomplainingly as all that women could hope for from the human experience. When Shalini was three years old he began to teach her her letters in Hindi. When she was five she could read in English.

Duleep was now sixteen years old. The Government College to which he had gained admittance was at the other end of the earth: a hundred miles away. His mother wept at his going. His brothers scoffed. His elder sisters and sisters-in-law looked at him as if he were setting out on some shameful errand. His father did not understand, but gave him his blessing the night before his departure and in the morning accompanied him to the railway station in a doolie drawn by bullocks.

[Read more…]

Periodic Table of Swearing.

February 24, 2026 by languagehat 10 Comments

Via Pengio’s MeFi post on the 30th anniversary of Trainspotting, “the intro to Glaswegian slang that is Modern Toss’s Periodic Table of Swearing (The Scottish Field Report).” It goes from C 1 Cunt to Sbb 102 Shitting A Fucking Breeze Block, and if you click on a square you will hear the sweary element spoken.

Has it really been thirty years? I loved that movie when it came out, and I obviously need to see it again or I’ll be a Prat in a Hat (48).

Ta dobra.

February 23, 2026 by languagehat 48 Comments

A fine post from bulbulistan redivivus, that collection of writings of bulbul/Slavo (see this LH post):

Hans Stumme (1864-1936) was a German linguist whose work is is probably known to anyone interested in Berber and North-African varieties of Arabic. Stumme travelled a lot and collected huge amounts of spoken data from – inter alia – Tunisians, Išelḥiyen and the Maltese. […] It is quite clear that Stumme was particularly interested in collecting folk literature, such fairytales and songs, where his books remain an invaluable source of data for folklorists. At the same time, Stumme’s work is extremely valuable for the study of the languages involved […]

This applies doubly to Maltese where there have been at least two major studies of the fairytales (1, 2). As far as I can tell, there is little focus in reevaluating Stumme’s dialectological work (but that might change soon), which is a shame, because there is so much fascinating stuff in there. Like for example song no. 70 from the collection of Maltese songs (Kössler-Ilg and Stumme 1909, p. 27). I am reproducing the text below in standard Maltese orthography and Stumme’s original German translation accompanied by my English one based on the Maltese text.

Ta’ dobra sejrin jsiefru […]

Die Slawen wollen abreisen […]

The Slavs are about to leave […]

[Read more…]

Who Hit John on the Picketwire.

February 22, 2026 by languagehat 29 Comments

I just rewatched The Man Who Shot Liberty Valance after decades, and it was just as good (and cynical) as I remembered — I especially enjoyed Edmond O’Brien as Dutton Peabody, editor of the Shinbone Star who orates about the power of the press as he swigs from a jug of booze. But what drove me to post was the name someone uses for that booze: who-hit-John, which Wiktionary defines as “Hard liquor; whiskey.” Nobody seems to know the origin; it’s not in the OED, and Green’s has only one cite for it, from the ridiculously late date of 1980 (“But without a hangover and a headful of Who-hit-John, it is a different light”), but I like it and will try to remember to use it when the occasion arises.

Also, the river which plays such a role in the movie (gun-slinging cattlemen to the north, law-abiding farmers to the south) is called the Picketwire, which is the wonderfully anglicized name of the Purgatoire River in southeastern Colorado (though the territory-turned-state in the movie is never named):

The Purgatoire River, also known as Rio de las Ánimas, has had multiple names. It was named by New Mexican Governor Antonio Valverde y Cosío in 1719 during his exploration of the region. Valverde named it “Rio de las Ánimas,” meaning “River of the Spirits,” as a warning to subsequent explorers of the dangers of crossing the nearby Ratón Pass. Surviving the crossing, they found water and firewood at the river. Over time, the true meaning of the river’s name became lost, and various interpretations emerged. By the end of the 18th-century Spanish traders believed it to be “Rio de las Ánimas en Purgatorio,” or “River of the Souls in Purgatory,” after a supposed massacre that occurred on its banks. This led to the birth of a legend of the same name that explained its history. French trappers learned the name and later translated it as “Rivière des âmes au Purgatoire.” They related their translation to members of the Stephen H. Long expedition in 1820 who renamed it “Purgatory Creek” by removing all references to souls. Mexican traders on the old Santa Fe Trail expanded on the legend and named the river “Rio de las Ánimas Perdidas en Purgatorio,” or “River of the Souls Lost in Purgatory,” believing the souls to have become lost. Mountain Men had difficulty pronouncing the French translation and called it “Picatoire,” while Anglophone settlers during the Colorado Gold Rush anglicized it to “Picketwire,” despite the river having no relation to any fence.

Oh, and if you’re wondering about the surname Valance, it’s a variant of Vallance: “English and Scottish: of Norman origin a habitational name from Valence in Drôme France named with Latin valentia ‘strength capacity’.”

The Science of Blunders.

February 21, 2026 by languagehat 51 Comments

I will not say James Willis’s “The Science of Blunders: Confessions of a Textual Critic” is the best thing ever written about textual criticism; that would be absurd, since I’ve read very little about the subject and Willis would probably rise from his grave and smite me for blasphemy. But it is so much fun to read I am tempted to reproduce the whole thing; instead I will just quote a few bits that delight me irresistibly as I scroll down. After a biographical introduction (which ends, sadly, “We regret that we have found neither obituary nor likeness of Willis to share with curious readers”), Willis’s text begins:

Some apology is sure to be demanded for a life largely devoted to what has been often called “mere verbal criticism” and regarded as no more than fiddling with letters and words which are of no importance in the wider horizon of the historian or the literary critic. Now while it would be useless to attempt to apologize for the lack of success with which I personally have practised the trade of a critic, for the trade itself much may be said in its defence. That textual criticism is a waste of time will be always believed by those who accept the texts of Greek and Latin authors as coming from heaven above by permission of the Syndics of the Oxford University Press, and therefore I will preach only to the convertible – to those who are willing to ask the simple question, “How do we have any knowledge of the Greek and Roman world?”

He discusses the difficult path to survival of literary texts, saying:

At every copying there is the possibility of human error. I say “the possibility”, but it is nearer to certainty. Copying is usually a boring task; boredom breeds inattention; inattention breeds mistakes. Therefore the manuscripts of classical authors contain mistakes. The detection and correction of mistakes in texts is the function of textual criticism. Therefore textual criticism is necessary, Q.E.D.

And he provides a splendid catalogue of examples, beginning:
[Read more…]

Éireannach sa spás.

February 20, 2026 by languagehat 12 Comments

I’m stealing ShooBoo’s MeFi post, title and all (it means ‘Irishman in space’), because I can’t improve on it:

Manannán, written in 1940 by Máiréad Ní Ghráda is an Irish-language young adult sci-fi space travel book. It may contain the first use of a Mecha outside of Japan. And the first mention of a gravity assist in literature. A machine translation to English of the first 20 pages.

Hacker News discussion.

As a lover of both Irish and sf, I can only applaud. (That last link has responses from actual Irish speakers, e.g. “I suspect these are actually mistranscribed by the project. […] Comparing the transcription of the first chapter with the source in the PDF they’re missing a fada.”)

Dumbwaiter.

February 19, 2026 by languagehat 63 Comments

This is one of those words I thought I knew, but it turns out I had only a partial view of. My wife and I were watching the making-of extra for I, Claudius when one of the actors talked about how food was brought to the dining table from “dumbwaiters.” From the context it didn’t seem possible that the word was used in the sense familiar to me (Wikipedia: “a small freight elevator or lift intended to carry food”), so I looked it up and discovered a quite different sense — to quote the OED (which has it as two words, dumb waiter; the entry was revised in 2023) “1. Chiefly British. A movable table, typically with revolving shelves, used for serving food and drink” (first citation from ?1730: “Two fine India japan dumb Waiters”). Then we get:

2. Originally North American. A movable platform or compartment inside a vertical shaft, used to deliver items between floors in a building, esp. food or empty plates between a kitchen and a dining area, and accessed through a hatch in a wall. Also occasionally: such a hatch itself.

1833 On the side and in the centre of the main stairway, the dumb waiters rise, by the aid of the steam-engine in the basement, to the tower.
G. M. Davison, Traveller’s Guide Middle & Northern States & Provinces Canada 109

So what Brits call a dumb waiter, we Yanks call a lazy Susan; is there an alternate UK word for the American sense? And are you familiar with both meanings, or does each side of the pond rest in blissful ignorance of the other side’s usage?

Cateran.

February 18, 2026 by languagehat 67 Comments

A MeFi post by clavdivs (who’s been a member even longer than I have, and whom I think of when my wife and I watch an episode of I, CLAVDIVS) sent me to Wilfred Scawen Blunt’s long poem “Satan Absolved,” written in odd-for-English Alexandrine couplets, and there I found this passage:

Tʜᴇ Lᴏʀᴅ Gᴏᴅ
And thou wouldst be incarnate?

Sᴀᴛᴀɴ
As the least strong thing,
The frailest, the most fond, an insect on the wind,
Which shall prevail by love, by ignorance, by lack
Of all that Man most trusteth to secure his back,
To arm his hand with might. What Thy Son dreamed of Man
Will I work out anew as some poor cateran,
The weakest of the Earth, with only beauty’s power
And Thy good grace to aid, the creature of an hour
Too fugitive for fight, too frail even far to fly,
And at the hour’s end, Lord, to close my wings and die.
Such were the new redemption.

I was pretty sure I’d never seen the word cateran before; the OED (entry from 1889) has:
[Read more…]

Tsez Annotated Corpus.

February 17, 2026 by languagehat 16 Comments

I was intrigued by Martin Haspelmath’s Facebook post:

What’s the most user-friendly corpus site? Maybe Abdulaev et al. (2022)’s corpus of the Dagestanian language Tsez (78 texts, almost 5000 text units, 2388 morphemes)? Which other corpus site is as user-friendly as this one? (Admittedly, it does not include sound.)

Since I presume I’m not the only one curious about Dagestanian languages, I thought I’d share The Tsez Annotated Corpus Project:

The texts that constitute this corpus were collected by Arsen Kurbanovič Abdulaev and Isa Kurbanovič Abdullaev and published with Russian translation as Abdulaev and Abdullaev (2010). The intended audience of this book publication was primarily the Tsez-speaking community and Russian-speaking readers interested in folklore. Work on the book was sponsored by the Max Planck Institute for Evolutionary Anthropology (MPI-EVA) and part of the agreement was that the institute would be allowed to post on-line a version of the text suitable for scientific use by linguists, with morpheme glosses and an English translation added to the materials available in the book.

The first text is Allahes ašuni: The rainbow; click through and you’ll see it is indeed beautifully presented. I presume esin šebi xecin šebi ‘What is to be said, what is to be left out’ is the local equivalent of Georgian იყო და არა იყო რა [iqo da ara iqo ra] ‘it was and it wasn’t’ and similar “once upon a time” formulas mentioned by me here and discussed later in the thread, beginning here.