Just a map, but a nicely done one, with some interesting discussion (and explanation of obscure abbreviations) in the comments. Thanks, Trevor!
Comments
Trackbacks
-
[…] Hat links to this 2008 map showing lexical différences between Europe’s […]
Just a map, but a nicely done one, with some interesting discussion (and explanation of obscure abbreviations) in the comments. Thanks, Trevor!
[…] Hat links to this 2008 map showing lexical différences between Europe’s […]
Commented-On Language Hat Posts (courtesy of J.C.; contains useful Random Link feature)
E-mail:
languagehat AT gmail DOT com
My name is Steve Dodson; I’m a retired copyeditor currently living in western Massachusetts after many years in New York City.
If your preferred feed is Twitter, you can follow @languagehat to get
links to new posts here as they appear. (I don’t otherwise participate
in Twitter.)
If you’re feeling generous:
my Amazon wish list
And you can support my book habit without even spending money on me by following my Amazon links to do your shopping (if, of course, you like shopping on Amazon); As an Amazon Associate I earn from qualifying purchases (I get a small percentage of every dollar spent while someone is following my referral links), and every month I get a gift certificate that allows me to buy a few books (or, if someone has bought a big-ticket item, even more). You will not only get your purchases, you will get my blessings and a karmic boost!
If your comment goes into moderation (which can happen if it has too many links or if the software just takes it into its head to be suspicious), I will usually set it free reasonably quickly… unless it happens during the night, say between 10 PM and 8 AM Eastern Time (US), in which case you’ll have to wait. And occasionally the software will decide a comment is spam and it won’t even go into moderation; if a comment disappears on you, send me an e-mail and I’ll try to rescue it. You have my apologies in advance. Also, my posts should be taken as conversation-starters; there is no expectation of “staying on topic,”and some of the best threads have gone in entirely unexpected directions. I have strong opinions and sometimes express myself more sharply than an ideal interlocutor might, but I try to avoid personal attacks, and I hope you will do the same.
Songdog
Kaleidoscope
The Daily Growler †
wood s lot †
MetaFilter
an eudæmonist
Avva (Russian)
No-sword
The Cassandra Pages
Transblawg
Epigrues
Far Outliers
paperpools
Lizok’s Bookshelf
A Bad Guide †
Poemas del río Wang
The Flaxen Wave
ТЕТРАДКИ: Что о нас думают в Европе?
Russian Dinosaur
XIX век
Wuthering Expectations
Boris Dralyuk
Laudator Temporis Acti
The Untranslated
The Fate of Books
The Millions
Linguablogs:
Language Log
Anggarrgoon
Jabal al-Lughat
Dick & Garlick
bulbulovo
Ἡλληνιστεύκοντος (in English)
Word Routes
Sentence first
Balashon
Separated by a common language
Ozwords (a blog from the Australian National Dictionary Centre)
The *Bʰlog (“A blog devoted to all matters Indo-European”)
Strong Language (“a sweary blog about swearing”)
Language resources:
Arnold Zwicky’s list of blogs and resources
Multitran
American Heritage Dictionary
Green’s Dictionary of Slang
Wiktionary
bab.la
TypeIt (IPA keyboards, language character sets)
Clickable IPA chart (by Weston Ruter)
Wordorigins
Wikipedia:Reference desk/Language
The sci.lang FAQ
Omniglot
ScriptSource
BibleOnline
Jewish Lexicon Project
Cambridge International Dictionary of Idioms
TITUS: Thesaurus Indogermanischer Text- und Sprachmaterialien
American Heritage Dictionary Indo-European Roots Appendix
Andras Rajki’s Etymological Dictionary of Arabic
Germanic Lexicon Project
Dictionary of the Scots Language
Das Deutsche Wörterbuch von Jacob und Wilhelm Grimm
Wortschatz Deutsch
Etymologisches Wörterbuch des Deutschen
DWDS (Der deutsche Wortschatz von 1600 bis heute)
etymologiebank.nl (Dutch etymology)
Den Danske Ordbog (Danish dictionary, with audio)
Latin Dictionary and Grammar Aid
Thesaurus Linguae Latinae (Open Access PDF version of volumes A – M and O – P)
Trésor de la langue française informatisé
Bob, dictionnaire d’argot
Dictionnaire de l’Académie francaise
Centre National de Ressources Textuelles et Lexicales
Lexilogos (French)
Dictionnaires d’autrefois
Französisches Etymologisches Wörterbuch
Real Academia dictionary (Spanish)
Diccionari català
Vocabolario Etimologico della Lingua Italiana
RAI Dizionario d’Ortografia e di Pronunzia (includes proper names)
Dizy: Il dizionario pratico con curiosità e informazioni utili
Dicționare ale limbii române
electronic Dictionary of the Irish Language
Cadhan Irish Dictionary (bridge to eDIL)
MacBain’s Etymological Dictionary of the Gaelic Language (1911 edition)
Cornish dictionary online
Arak-29 (Armenian links)
Verb Conjugator
World Wide Words
Online Etymology Dictionary
Tower of Babel etymological database
Perseus Digital Library 4.0
Logeion (Greek-to-English and Latin-to-English dictionary search)
Greek language and linguistics
Λεξικό της κοινής νεοελληνικής [Modern Greek Dictionary] (comprehensive; includes etymologies)
LBG (Lexikon zur byzantinischen Gräzität = Lexicon of Byzantine Greek)
Orbis Latinus
Slovopedia (links to Russian dictionaries; sidebar has links to comparable pages for German, Ukrainian, Belorussian, Georgian, and Kazakh)
dic.academic.ru (Russian dictionary search)
ПоискСлов (Russian dictionary search)
Русский этимологический словарь (A.E. Anikin’s new Russian etymological dictionary, now on letter д)
etymolog.ruslang.ru (Russian etymology and word history links)
Philology.ru
Philolog.ru
Slavenitsa (converts from modern Russian to pre-reform orthography)
Minority Languages of Russia
Ru_slang (Russian)
Vasmer’s etymological dictionary(Russian)
Russian language links
Russian literature online
Национальный корпус русского языка (Corpus of the Russian Language)
Большой толково-фразеологический словарь Михельсона (1896-1912)
Словарь русских народных говоров [Russian dialect dictionary]
Старославянский словарь [Old Church Slavic dictionary]
Словарь русского языка XVIII в [Dictionary of 18th-c. Russian]
Russian Word of the Day
Ukrainian etymological dictionary
Речник на личните и фамилни имена у българите (Bulgarian names)
A Dictionary of Tocharian B (with etymologies)
Chinese Character Dictionary
Zhongwen.com
The Kanji Site
Mongolian/English dictionary
Digital Dictionaries of South Asia
Monier-Williams Sanskrit Dictionary
Nişanyan’s Turkish Etymological Dictionary
The Austronesian Comparative Dictionary
The Polynesian Lexicon Project (POLLEX)
An ka taa (resources and lessons for Bambara, Dioula, Malinké, and Mandinka)
Bargery’s web site (contains A Hausa-English Dictionary and English-Hausa Vocabulary)
Movies listed by language at IMDB
Languages online
Historical Dictionary of Science Fiction
Speculative Grammarian
Word Oddities
Jan Freeman’s Boston Globe column
Character converter
Mailing list
Hattics mailing list
Visual pleasures
Nick Jainschigg’s blog
Citrus Moon
Ramage
Favorite rave review, by Teju Cole:
“Evidence that the internet is not as idiotic as it often looks. This site is called Language Hat and it deals with many issues of a linguistic flavor. It’s a beacon of attentiveness and crisp thinking, and an excellent substitute for the daily news.”
From “commonbeauty”
(Cole’s blog circa 2003)
All comments are copyright their original posters. Only messages signed “languagehat” are property of and attributable to languagehat.com. All other messages and opinions expressed herein are those of the author and do not necessarily state or reflect those of languagehat.com. Languagehat.com does not endorse any potential defamatory opinions of readers, and readers should post opinions regarding third parties at their own risk. Languagehat.com reserves the right to alter or delete any questionable material posted on this site.
Copyright © 2025 · languagehat.com
What’s a Greek?
It’s just a mix of French and Albanian!
@SFR: With a dash of Dutch and Lithuanian, apparently.
I haven’t read the many comments below the map. I understand that the line styles indicate lexical distance, but it’s not clear to me what the actual distances between the languages on the diagram signify, and what determines which languages are connected by a line and which aren’t.
Any realistic representation of lexical impact, it should be directional. For example, there’s much more French in English, German in Polish or Latin in Albanian than vice versa. And… whassat? Hungarian influence in Lithuanian, Latvian and Estonian? I beg your pardon?
“The original research data for the chart comes from K. Tyshchenko (1999), Metatheory of Linguistics.” Not from real-world text corpora?
I also wonder why certain lines are not drawn. Lack of available data? Catalan-Provençal, Romanian and Greek-any Slavic language in the Balkans, Greek-everywhere, Hungarian-any neighbour, Dutch-Scandinavian languages, German-Estonian…
Did some googling and found that the author of the original research for the map is this great Ukrainian linguist (in every sense of the word):
Тищенко, Константин Николаевич (укр. Костянти́н (Кость) Микола́йович Ти́щенко) (30 июля 1941, Глухов Сумская область) — украинский лингвист, педагог, переводчик. Доктор филологических наук (1992), профессор (1995). Автор более 180 трудов по метатеории языкознания, знаковой теории языка, лингвистических закономерностей, вопросов оптимизации морфологического описания языка, преподавания языков, проблем развития языка, романского и восточного языкознания, а также цикла статей по германистике, славистике, кельтологии, баскскому языку, финскому языку, балканистике и алтаистике. Преподаватель нескольких десятков различных языков. Читает лекции по общему языкознанию, ведет практические занятия по французскому, фарси, финскому, итальянскому, баскскому, валлийскому и др. языкам. В 2001–2010 гг. был заведующим основанного им Лингвистического музея при Киевском университете. С 2011 — профессор кафедры Ближнего Востока.
check the Ukrainian Wikipedia article on Tyshchenko.
https://uk.wikipedia.org/wiki/Тищенко_Костянтин_Миколайович
It’s simply overwhelming…
Found some English review of Tyschenko’s career and his work on the theory of lexical distance among European languages here
http://wantedinlinguistics.blog.tiscali.it/2014/01/14/the-theory-of-lexical-distance-among-european-languages-by-kostiantyn-tyshchenko/
Professor Tyshchenko of Taras Shevchenko Kyiv National University has several dozen languages in his arsenal, including exotic like Basque, Welsh, Finnish, Persian and others. He does not view his multilingualism as worthy of special note, because it is merely a research tool to him. Tyshchenko is known for several accomplishments, such as a meta-theory of linguistics which one of his colleagues aptly dubbed “the periodic system” for linguistics. He is also the founder of the world’s first and only Linguistic Educational Museum (located in the Red Building of Kyiv University) where he eagerly acquaints students and interested outsiders with the achievements of the modern linguistic science. Finally, Professor Tyshchenko has authored a series of works that trace the influence of other peoples in the language, history and genotype of Ukrainians.
I have strange feelings about this character.
His credentials are amazing, but so far I’ve never heard of him (and I bet none of the Hatters have). And his metatheory of linguistics or theory of lexical distances or theory of Avestan influence on Ukrainian and so on all sound like the stuff of usual nationalistic pseudo-science.
I don’t know.
Yeah, from a cursory look at his video lectures at YouTube I understand he has found evidence of a mighty satrapy of the Achaemenid Empire in the Dnieper basin (which is why Ukrainian is so Iranian), and he believes the westernmost Slavic languages (Sorbian and Polabian) have particularly close ties with Ukrainian.
Uh-oh! Well, it’s still a pretty map…
never mind. we’ll forgive him for the pretty map and as far as I am concerned, having learned 50 languages does make him a great linguist….
Sounds like Balkan-style nationalism to me, except that would tend to go in the other direction: I bet somebody has proposed Serbian influence on Avestan somewhere.
The Wikipedia map of Persian satrapies shows one on the northern coast of the Black Sea, including Crimea. But the Black Sea Scythians were never subjugated by the Persians. The Sakā paradraya from the Old Persian lists of administrative units stands for the main satrapy of the Dahae, and the only minor satrapy on the continent of Europe, Skudra (comprising Thrace and Macedon), probably did not extend north of the Danube.
I tried to identify the author of the map, and found this:
But his maps are still there, confusing Wikipedia users.
Well, one thing that pretty map indicates is an empirical contention that there is less lexical distance between Ukrainian and Polish than there is between Ukrainian and Russian. Which seems a bit implausible . . .
Not that much, Ukrainian words are exact cognates to Polish words, only with East Slavic sound changes.
No surprises but one link caught my attention, the link between Irish and Portuguese. It makes sense, it’s just that you don’t often see it noted.
Well, one thing that pretty map indicates is an empirical contention that there is less lexical distance between Ukrainian and Polish than there is between Ukrainian and Russian. Which seems a bit implausible . . .
Well, Ukrainian looks like Russian with lots of Polish loanwords, so that’s not that implausible…
Let me try another essentialist definition: Russian is East Slavic spoken as the L form of Church Slavonic, and Ukrainian is East Slavic spoken as the L form of Polish.
Russian is East Slavic spoken as the L form of Church Slavonic, and Ukrainian is East Slavic spoken as the L form of Polish.
I love it!
And both are on the queue for the next Essentialist Explanations edition.
*applause*
I certainly don’t find it implausible that Ukrainian would be lexically closer to Polish than Russian is. I only find it implausible that it diverges from Russian toward Polish by greater than 50% of the total “distance” between Russian and Polish, at least if your metric for distance is a sensible one. And that perhaps not coincidentally seems like the exactly the sort of implausible claim you would make if you were incentivized to overstate the degree of difference between Ukrainian and Russian because your nationalism was overriding your scholarship. Note that the same pretty map shows English as closer to German than to French, despite the massive number of French-origin loanwords in English.
An anecdote re Polish and Ukrainian. When I was in the US Army–in a military intelligence unit in Germany–we had a number of men who were native speakers of one of the two languages. Some were young men of my age at the time (early 20s; this was 1970-1), who had come to the US as young children and had grown up in Polish or Ukrainian communities in the US; others were somewhat older non-commissioned officers who had managed to enlist in the US Army in the decades after WWII. The Polish speakers and the Ukrainian speakers could understand one another to some extent, and of course they shared a common cultural background that set them off from the rest of us (though this wasn’t a source of tension).
Our unit fielded a volleyball team, in which most if not all of the members were Polish or Ukrainian, against teams from other US Army units in Germany, for whom the Polish-Ukrainian back-and-forth that went on among our team’s members on the court was a great source of puzzlement–exactly what army did the team they were playing come from?
The question is, of course, what vocabulary list is used to make these comparisons. It can’t be the whole vocabulary of every language; it might be just one of the Swadesh lists. The Ethnologue says: “The percentage of lexical similarity between two linguistic varieties is determined by comparing a set of standardized wordlists and counting those forms that show similarity in both form and meaning”, but it doesn’t say what wordlists.
I can’t imagine a sensible Swadesh-type list yielding more “lexical similarity” between Hungarian and Lithuanian than between Lithuanian and Polish. And any such list would be bound to show a lot of similarity between German and Polish (mostly due to one-way borrowing). And why should Slovene have some similarity with Albanian than isn’t shared by Serbo-Croatian? The map looks rubbish to me.
Thanks!
J. W. Brewer: I don’t know about the data either. I still consider such a result not intrinsically improbable, because classification and lexical similarity measure very different things. Classification in general prefers shared sound changes, while lexical similarity would be based on cognacy and/or superposability. Ukrainian koróva “cow” is of course more similar to Russian koróva than to Polish krowa, but any sane measure of lexical similarity will give an equal point for Ukrainian-Russian and Ukrainian-Polish. Slavic languages being too similar one another (we Chinese would consider all of them subdialects of a single Chinese macrodialect), with only a couple of examples of Ukrainian robíti “to do” = robić ≠ delát’, the balance will be seriously tilted towards Polish.
minus273 – that’s a fair point re how finely-tuned the measuring instrument is or isn’t assumed to be given the high percentage of situations where the words in all three languages are obviously cognates. My sense is also that there’s some regional variation in the percentage of Polish loanwords in active use in Ukrainian, and that could also affect the result. Quite possibly also some variation in register, with a “literary” register plausibly (given the cultural/political dynamics) shaped by some extent by a self-conscious desire to maximize distinctions from Russian, because less nationalistically-minded writers would have been more likely to just write in Russian. NB that the most recent Nobel laureate in literature (Svetlana Alexievich), who was born in the Ukrainian SSR and raised in the Byelorussian SSR, writes in Russian despite not being an ethnic Russian. She gave her acceptance speech in Russian, except for one brief portion where she deliberately code-switched into Belarusian and then back out again, which was sort of beautiful and sort of heartbreaking.
An old joke: a Polish and a Russian (Soviet) border guards walk opposite each other on two sides of the border.
– How do you say “zhopa” [ass] in Polish? – cries the Russian guard.
– Dupa!
– That’s just as good.
I think the absence of stippled lines means that comparisons were not made; and which ones were made appears to be pretty random.
“She gave her acceptance speech in Russian, except for one brief portion where she deliberately code-switched into Belarusian and then back out again, which was sort of beautiful and sort of heartbreaking.”
Does anyone actually speak Belarusan? Correct me if I’m wrong, but my understanding is that Belarusan is a artifact of 19th century nationalism that was cobbled together out of several west-Russian rural dialects and perpetuated in the Soviet era as a token nod to nationalism within the centralized Russocentric Soviet state; and that business in Belarus is universally conducted in Russian, with Belarusan as essentially window-dressing–even more so than Irish in most of Ireland.
Maybe someone who knows about this can give me a more nuanced picture.
From what I can gather online, it appears that roughly one eighth of the population uses Belarusian as their primary language, while another eighth uses something intermediate between the two – so I suppose its position is marginally stronger than that of Irish. Wiki also claims, based on a dead hyperlink from the Belarusian government, that Poles are the ethnic group most likely to use Belarusian at home, having abandoned Polish almost entirely.
See also this comment by SFReader.
“Our unit fielded a volleyball team, in which most if not all of the members were Polish or Ukrainian, against teams from other US Army units in Germany, for whom the Polish-Ukrainian back-and-forth that went on among our team’s members on the court was a great source of puzzlement–exactly what army did the team they were playing come from?”
Think of the fun you would have had if someone had thought to put them on various command nets at different echelons.
It would have been great if the author had included Turkish, since many Balkan languages have many Turkish loanwords. Also, Romanian has a lot of Slavic loanwords (e.g. even da = yes), but it’s somehow invisible.
An old joke
Where does the humor come in? The Polish guard’s answer is straightforwardly correct, and the Soviet guard’s answer makes no sense to me.
Perhaps less surprising given the fact that there was no Latin word for “yes”; the Romance languages have innovated words for this new concept in different ways, and Romanian borrowed one instead of repurposing an existing word or contracting a longer phrase.
Sometimes Romanian is the only one to have kept a Latin word; Italian and everything to the west of it has imported “fresh”, while Romanian uses rece, Latin recens.
Where does the humor come in? The Polish guard’s answer is straightforwardly correct, and the Soviet guard’s answer makes no sense to me.
I assumed the funny part was just the guard’s pleasure in learning another dirty word, but I’ll let Russian readers provide a more knowledgeable answer. Here’s a version in Russian:
And here’s one where the nationalities are reversed (the Pole thinks the Russian word is nice):
A rather etymological rendering of pociąg. 🙂
Here’s a prettier and more extensive version of the map, with pictures of Tyshchenko in front of his enormous original hand-drawn map, which reaches farther into Asia. And with a link to an interview with him. (Added: bad link for the interview. I can’t find an archived version.)
«dupa – дупа – zhopa
Where does the humor come in? The Polish guard’s answer is straightforwardly correct, and the Soviet guard’s answer makes no sense to me.»
Hat, John, the [additional] humour here is that the Russian word zhopa means asshole, and the Polish word dupa to a Russian ear also sounds vaguely like ‘hole’ as in duplo – hollow, cavity in a tree (or in a tooth). That’s why the Russian soldier says ‘it’s also nice’. Back passage – asshole, zhopa, dupa.
In fact, Vasmer puts them as cognates: укр. дупло́, ст.-слав. доупина σπήλαιον (Супр.), русск.цслав. дупль “пустой, полый”, болг. ду́пъл “полый”, сербохорв. ду́пља “дупло”, словен. dúpǝlj “полый”, dúplo “дупло”, чеш. doupa, doupě ж. – то же, польск. dupa “задница”, dupel м. “дупло”. (here: https://classes.ru/all-russian/russian-dictionary-Vasmer-term-3479.htm)
Also, I didn’t know but the word dupa apparently circulates in Russian in regional dialects and in youth slangs.
And! This is the best bit. some commenters suggest that zhopa comes from the common French colloquial jais pas meaning je ne sais pas – dunno. see here: https://словарь-синонимов.рф/blog/istoriya-proishozhdeniya-slova-zhopa-
That’s chais pas (j’sais pas) and would have given you a ш.
That’s decadent Parisian garble! Same as shuui for je suis.
In my generation and younger, the whole country, plus half each of Switzerland and Belgium, speaks Parisian garble now, a few word choices aside. The only difference is that the Parisians speak it twice as fast as everyone else.