Intense but irregular NWIE and Indo-Iranian contacts show Uralic disintegrated in the West


Open access PhD thesis Indo-Iranian borrowings in Uralic: Critical overview of sound substitutions and distribution criterion, by Sampsa Holopainen, University of Helsinki (2019), under the supervision of Forsberg, Saarikivi, and Kallio.

Interesting excerpts (emphasis mine):

The gap between Russian and Western scholarship

Many scholars in the Soviet Union and later the Russian Federation also have researched this topic over the last five decades. Notably the eminent Eugene Helimski dealt with this topic in several articles: his 1992 article (republished in Helimski 2000) on the emergence of Uralic consonantal stems used Indo-Iranian and other Indo-European loans as key evidence, and it was one of the first serious attempts to stratify the loanwords, paying attention to the non-initial syllables as well. Helimski (1997b) discusses Indo-Iranian loanwords more generally, but it is especially notable for the introduction of the “Andronovo Aryan” idea: Helimski argues that some loanwords in Ob-Ugric and Permic are derived from an unattested, third branch of Indo-Iranian. Helimski’s idea has been supported by at least Mikhail Zhivlov in a 2013 article, but otherwise it has not received wide acceptance. Helimski was also known for his criticism (see especially Helimski 2001) of Jorma Koivulehto’s etymological work: although the main targets of Helimski’s criticism were Koivulehto’s writings on Proto-Indo-European and Germanic borrowings (which fitted poorly with Helimski’s ideas of the Nostratic roots of Proto-Uralic and his other theories on Uralic linguistic prehistory), also some of his Indo-Iranian ideas received unnecessarily sharp criticism in Helimski (2001).

Vladimir Napol’skikh is another important Russian scholar who has written on several occasions about Indo-Iranian–Uralic contacts. His 2014 article is notable for its criticism on Helimski’s Andronovo Aryan theory and his arguments in favour of Indo-Aryan loanwords. Napol’skikh also considered some of the traditional Indo-Iranian loanwords to be borrowings from Tocharian (see below) in some of his earlier works, an idea which has been criticized by Kallio (2004) and Widmer (2002) and which Napol’skikh himself has since dropped in later publications (2010, 2014), where many of these alleged Tocharian loans are again considered Indo-Iranian.

Some of the main characteristics of Russian research is that the earliest Indo-European loanwords are usually considered to represent an inheritance from the Nostratic proto-language (Helimski [2001]; Kassian, Zhivlov & Starostin [2015]), an idea which is not widely accepted by scholars of Uralic in the West. Although this often does not concern the Indo-Iranian loanwords at all, or it concerns only a part of them, the works of Jorma Koivulehto, who dealt with both earlier Indo-European and Indo-Iranian loans, receive so much criticism from the Russian scholars that his important ideas are often totally rejected or left unmentioned in Russian research.

This kind of rejection of central etymological research literature can be considered one of the most pressing problems in Uralic loanword studies, and it leaves a regrettable gap between Russian and Western European scholars in this perspective.




Among the Indo-Iranian loanwords in Uralic, one can easily mention examples that follow the classification of semantic change as described above. For widening or generalization, vasara ‘hammer’ is a good example: the Indo-Iranian original denotes ‘the weapon of the god Indra’ in Indic and ‘the weapon of the god Mithra’ in Avestan, whereas Finnish ‘hammer’ (and the Mordvin meaning ‘axe’) are more general meanings of tools. Fi huhta is a good example of narrowing: Iranian *tsuxta- means simply ‘burned’, whereas in Finnic huhta means specifically ‘a burned patch used in slash-and-burn agriculture’. Metonomy has taken place in Mordvin, where čuvto denotes simply ‘tree’; this probably developed through the meaning ‘wood burned for agriculture’. Khanty (South) wǟrəs denotes ‘horse’s mane’, but its Iranian original probably had a more general meaning of hair (cf. Avestan varəsa- ‘hair of human and animal, mostly hair of the head’).

An interesting example of degeneration is the etymology of Finnic orja ‘slave’, probably borrowed from the Indo-Iranian ethnonym *(H)ārya- ‘Aryan’ (for the original semantics of this word, see the entry *orja in Chapter 2). A similar development is seen in English slave which is etymologically connected to the ethnonym Slav.

Distribution as a criterion in the dating of loanwords

(…) some of the Indo-Iranian loans seem to have a wide distribution, but upon a closer look it becomes clear that they include phonological irregularities, which can only be explained by assuming that they are parallel loans. The ability to recognize parallel borrowings is extremely important in Uralic loanword studies, and it has been developed with success in the research of Germanic and Baltic loanwords (see Junttila 2015).

Interestingly, K. Häkkinen (1983: 207) argues that although words disappear from languages, the most basic words often remain stable and are maintained for longer periods. Although this is probably true, here the notion of “basicness” is something that is open to different interpretations. Many central concepts in culture and livelihoods are often described with prestige words that are borrowed, and these central words can be very easily replaced. In determining the age of the loanwords one has to always keep in mind that a reflex of a very early cultural borrowing from Indo-Iranian to Proto-Uralic/Proto- West Uralic etc. can easily have been lost in some daughter language, if a later prestige loan for the same concept has been borrowed from some later contact language (such as from some form of Germanic or Baltic into Finnic or from some Turkic language into Udmurt, Mari or Mordvin).

In Uralic linguistics the common loanword layers shared by some intermediary proto-language have often been seen as giving support to the reconstruction of these stages, but K. Häkkinen (100–108) considers this problematic. It should also be noted that the distribution of Indo-Iranian loanwords very rarely matches the assumed taxonomic divisions: there are some loanwords confined to the Finno-Permic, Finno-Volgaic or Ugric languages, but very few loanwords that would be Finno-Permic, Finno-Volgaic or Ugric in the way that the word is found in all the languages that belong to the branch.



There are only very few possible examples of a consonantal substitution of the word-initial laryngeal. It seems probable that the word-initial laryngeal, if it was retained, was not substituted in any way in Uralic. *karšV (> Fi karhu), an uncertain etymology, is the only possible example.

(…) Even if *k was a result of laryngeal hardening, the development would probably be earlier than Proto-Indo-Iranian, meaning that by the time the word was borrowed, the Indo-Iranian word simply had the stop *k that was regularly substituted by Uralic *k.

Evidence for Andronovo Aryan and Indo-Aryan loanwords?

None of the loanwords have to be considered as Andronovo Aryan or Proto-Indo-Aryan based on the criteria that were presented in the Introduction. The Uralic palatal affricate *ć or sibilant *ś can in all cases be explained from Proto-Indo-Iranian *ć, and there is no need to assume that it should reflect Andronovo Aryan *ć or PIA *ś. In the etymological material of this study, no further positive evidence was found for the distinction of PU *ś and *ć as substitutions of the Proto-Indo-Iranian affricates. This means that at least in word-initial position there probably was no difference between *ć and *ś, and even though we do not know what this sound was phonetically, it is safe to assume that Uralic words showing *ś reflect a sound substitution of Indo-Iranian *ć and *Ʒ́.

Regarding the distribution of the etymologies within Indo-Iranian, all the loanwords which cannot be from Iranian because of the lack of attested Iranian cognates have a more or less secure Proto-Indo-Iranian etymology, and nothing prevents us from assuming that these words reflect Proto-Indo-Iranian borrowings. It is also possible that some words with solid Proto-Indo-Iranian etymologies were present in Iranian but were lost before the first Old Iranian texts were composed.



List of Indo-European and Indo-Iranian Etymologies


*ertä ‘side’, *kekrä ‘wheel’, *kečrä ‘spindle’, *mekši ‘bee’, (*meti ‘honey’), *ońća ‘part’, (*orpa ‘orphan’), *peijas ‘feast’, *pejmä ‘milk’, Pre-P *pertä ‘wing’, *repä ‘fox’, *rećmä ‘rope’, *sejti ‘bridge’


*aćtara ‘whip’, *anti/onta, *ora ‘awl’, *orja ‘slave; south’, (*orpa ‘orphan’), *pośi ‘penis’, *śaŋka ‘handle’, Pre-Md *śaγa ‘goat’, *śarwi ‘horn’, *śaδa- ‘to rain’, śara- ‘shit’, *śi̮ta ‘hundred’, Pre-P *śVta ‘hundred’, *śasra ‘thousand’, *śišta ‘wax’, *śoma- ‘sad’, *waćara ‘hammer’, *woraći ‘boar’

Ambiguous early loans (can be either from PII or PI)

*ajša ‘shaft’, *asVra ‘lord’, *iha ‘yearning. passion’, *ihta ‘lust’, *jama ‘twin’, *jawi/jowa (> Mo juv) ‘awn’, *jawi (> PS *jäə̑) ‘flour’, *ji̮ni ‘way, path’, *juma ‘god’, *kana- ‘to dig’, *kara- ‘to dig’, *kata- ‘to graze’, *kertä- ‘to bind’, *ki̮ntaw ‘tree stump’, *kürtńV ‘iron’, PKh *kǟrtV ‘iron’, *kärtä ‘iron’, *martas ‘dead’, *ńātV- ‘to help’, *pakas ‘god’, *para ‘good’, Kh pĕnt ‘way’, PMs *pē̮ńtV ‘brother-in-law’, *pora ‘old’, *poči- ‘to boil’, Pre-P *porta ‘vessel’, *puntaksi ‘bottom’, Pre-Ma *pänti- ‘to bind’, PMa *pärća ‘ear of corn’, *pätäri- ‘to flee’, *saγi- ‘to get, obtain’, *sampas ‘pillar’, *saŋka ‘old’, *sara ‘lake’, *sasara ‘sister’, *säptä ‘seven’, *tajwas ‘sky’, *takra ‘piece of flesh’, *tarna ‘grass’, *tojwV ‘wish’, *toraksi ‘through’, *tora- ‘to fight’, *täjV ‘milk’, *täjinV ‘cow’, *täši, *uška ‘bull’, *wakša- (> PS *wåtå-) ‘to grow’, *wajna- ‘to see’, *wojna- ‘to see’, *wiša ‘venom’, *wi̮rna ‘wool’, *wärkä ‘kidney’, PS *wǝ̑rkǝ̑ ‘wolf’, *wirtV- ‘to hold, raise’, *äŋkärä ‘coal’

List of uncertain Indo-Iranian etymologies

PFi *aiwa (← Germanic ?), Ma *arša ‘mane’, PMs *ǟrV ‘fire’, *aštira ‘barren earth’, POug *ćakV ‘hammer’, *ćara- ‘brown; ? to dawn’, *ćero ‘hill-top’, *ćerti ‘group’, *itä- ‘to appear’, Pre-Fi *karšV ‘bear’, PMs *kīrV ‘iron’, *kota ‘chum’, Pre-Sa *kupa ‘pit’, PFi *kärsä ‘snout’, *maksa- ‘to pay’, PFi *mana-, PUg ? *mańći, Ma marij ‘Mari; man; husband’, *mē̮ja ‘wedding’, *mykkä ‘dumb’, PP *oč ‘corn’, *orpV ‘relative’, PFi *paksu ‘thick’, *peji- ‘to milk’, *pi̮ŋka ‘psychedelic mushroom’ POUg *porV ‘phratry’, Pre-Sa *poti ‘against’, Pre-Fi *šatas ‘germ’, *sentü- ‘to be born’, *šerä- ‘to wake up’, Ms šVšwǝŋ ‘hare’, PUg *śeŋkV ‘nail’, Pre-Sa *soma/sami ‘some’, PP *sur ‘beer’, PFi *süte- ‘to hit’ (< ? *sewči-), Hu szekér ‘wagon’, Kh ʌīkər ‘Narte’ PUg *taja- ‘secret’, Pre-Fi *terni ‘young’, *terwV ‘healthy’, ? *towkV ‘spring’, PWU *utarV ‘udder’ (← Germanic ?; Mari *waδar ← II), *waŋka ‘hook’, Mo E v́eŕges, M vərǵas ‘wolf’

Etymologies that were probably borrowed from another Indo-European source (PIE, PBSl, Germanic, Baltic)

*aisa ‘shaft’ ← Balto-Slavic, PFi *aiwa (← Germanic ?), *apV ‘help’ ← Germanic, *jewä ‘grain’ ← Balto-Slavic, Ma karaš etc. ‘honeycomb’ ← Baltic, (*meti ‘honey’ ← ? PIE,) Fi *ojas ‘shaft’ ← Slavic, *ola ← Baltic, *oŋki ← Germanic, *porćas ← Balto-Slavic, Pre-Sa *porta ‘vessel’ ← Germanic, *salV ‘salt’ (cannot be reconstructed for PU, various later parallel loans), *śi̮lkaw ← Balto-Slavic, *sammu- ← Germanic, *śuka ← Balto-Slavic, Mari *šŭžar ← Baltic/Balto-Slavic or Slavic, *tejniš ‘pregnant animal’ ← Baltic/Balto-Slavic, PWU *utarV ‘udder’ (? ← Germanic)

Early loans into differentiated branches

Proto-West Uralic

Only in Finnic:

*aćnas ‘voracious’, *iha ‘wish’, *ihta ‘lust’, PFi *isV ‘appetite’, *martas ‘dead’, *očra ‘barley’, *peijas ‘feast’, *pejmä ‘milk’, *pe̮rna ‘spleen’, *sampas ‘pillar’, *sooja ‘shelter’, *tajwas ‘sky’, *takra ‘piece of flesh’, *terwV ‘healthy’, *tojwV ‘wish’

All of these words, with the exception of *sooja ‘shelter’, were clearly borrowed into Early Proto-Finnic (Pre-Finnic) at the latest. Formally most of the loans could be from PII or PI.

Only in Saami:

*kata- ‘to graze’, *kertä- ‘to bind’, *pora ‘old’, *wojna- ‘to see’

All of the loans were acquired before the Saami vowel changes. Formally all could be either from Proto-Indo-Iranian or Proto-Iranian.

Only in Finnic and Saami:

*asma ‘voracious’, *jama ‘twin’, *kekrä ‘wheel’, *mača ‘insect’

*asma ‘voracious’, *jama ‘twin’, *kekrä ‘wheel’, *mača ‘insect’ Of these, *mača from Proto-Iranian and *jama is ambiguous. As the -sm- in asma does not point to Proto-Indo-Iranian *ć, this is probably an Iranian loan too. It is possible that these words were borrowed into Proto-West Uralic, as there is no general support for a Finno-Saamic proto-language today. As the cognates within Finnic and Saami are regular, there is no need to assume parallel borrowings. *kekrä has to be from Proto-Indo-Iranian.

NOTE. Based on the discussion of stages of borrowing from Indo-Iranian, and of the distribution of *kekrä among Uralic dialects in particular, Holopainen probably means Pre-Indo-Iranian for this example.

Only in Mordvin and/or Finnic and/or Saami (can point to a borrowing into Proto-West Uralic):

*ji̮ni ‘way’, *kečrä ‘spindle’, *rećmä ‘rope’, *śaŋka, *waćara ‘hammer’, *warsa ‘foal’, *wasa ‘calf’, *woraći ‘pig’

Based on phonological criteria, these loans do not form a chronologically coherent layer, but probably their modern distribution is accidental (their original distribution can have been wider). *kečrä ‘spindle’ and *rećmä ‘rope’ are from Pre-II, *śaŋka, *waćara and *woraći from PII, *warsa and *wasa from later Iranian (Alanic). *ji̮ni is ambiguous. Also the loans confined to Finnic and Saami mentioned above probably were borrowed into Proto-West Uralic, as it is a more convincing taxonomic entity than Proto-Finno-Saamic.


Only in Mordvin, Finnic and/or Saami and Mari

*juma ‘good’

This loan can be either from PII or PI. As it is obvious that these four branches do not form any taxonomical entity (Salminen 2002; J. Häkkinen 2009), it is only logical that there are no other loanwords with a “Finno-Volgaic” distribution.

Only in Mari:

*kVrtnV ‘metal’ (← PII, PI or later), Pre-Ma *pänti- ‘to bind’, PMa *pärća ‘ear of corn’, *si̮rńa ‘gold’ (← Old Iranian)

Only very few early Indo-Iranian loans can be found in Mari and in no other Uralic language. It is unclear what the reason for this is. It is, of course, possible that some uncertain loanwords like marij ‘man; Mari’ turn out to be correct after all, but even that does not make the number of loans in Mari very high. The situation has to be explained either with loss of vocabulary and replacement by later loans (from Turkic, and also perhaps from Permic) or with Mari’s location on the periphery at the time of the later contacts with the Iranian languages. Agyagási (2019: 254–258) argues that the current area where Mari is spoken was formed only relatively late, after the Mongol invasion in the High Middle Ages. If this is indeed correct, and Mari was spoken in more northern areas before that, it can be assumed that Pre-Mari had only sporadic contacts with the Iranian languages after it split off from Proto-Uralic.

Only in Permic (early loans; for later loans confined to Permic)

*a(č)wa ‘stallion’, PP *ju ‘awn’, *kertä ‘house’, *kärtä ‘metal’, *kada- ~ *gada- ‘to steal’, *karka ‘chicken’, *parśa ~ *barśa ‘mane’, *parta ‘knife’, *pertä ‘wing’, *poči- ‘to boil’, *porta ‘vessel’, *dura ‘long’, *domV ‘to tame’, PP *śumi̮s ‘band’, PP *šud‘luck’, *uška ‘bull’, *wi̮rna ‘wool’, *wirä ‘man, husband’, *äŋkärä ‘coal’

The number of loanwords in Permic is relatively high, and many of these can be considered to be Iranian loanwords. Technically many loans are ambiguous, but as some of the words were borrowed late due to historical reasons (‘iron’), and some were borrowed into a Pre-Permic which already had a phonological system that was different from Proto-Uralic (*šud- has d which cannot reflect PU *δ).

It is probable that the Permic languages were in continuous contact with the Indo-Iranian languages from the time they split from Proto-Uralic until the early mediaeval era.


Only in Khanty and Mansi (regular cases):

POUg *ēräɣ ‘song’, POUg *eträ ‘clear sky’, POug *mɔ̈ŋki ‘forest-spirit’, *ńātV- ‘to help’, *päčäɣ ‘reindeer’

The number of these etymologies is so low that it is very difficult to determine whether these words were borrowed into Proto-Ob-Ugric or some earlier proto-language, such as Proto-Ugric.

Only in Khanty and/or Mansi and/or Hungarian (regular cases):

*säptä ‘seven’ (Khanty + Hungarian regular), *sara ‘lake’

There are so few convincing loanwords with a “Ugric” distribution that they provide very little evidence. Either of these loans could be from Proto-Indo-Iranian or Proto-Iranian, if we assume that *s > *h was a common Iranian sound change. Both loans were acquired

Only in Samoyed:

*jäwi (> PS *jäə̑), PS *pulə̑ ~ *pi̮lə̑ ‘bridge’, *täjki ‘spear’, PS *wǝ̑rkə̑ ‘wolf’, Pre-S *täši (> PS *tät), *wakša- (> PS *wåtå) ‘to grow’

Of these, only *wåtå- has to be a very early loan because of *s > *t. *jäwi (> PS *jäə̑) and PS *wə̑rkə̑ were possibly acquired before the Proto-Samoyed vowel developments, making them probably early loanwords too. Formally all of them could be either from PII or PI. *pulə̑ ~ *pi̮lə̑ could have been borrowed into Proto-Samoyed (with Iranian *u corresponding to Samoyed *u), and because of the *l the word is probably from a relatively late, Middle Iranian language.

The following loanwords have a distribution with a cognate in both Samoyed and some other branch:

*śaδa- ‘to rain’, *tora- ‘to fight’ (also *itä-, which is more uncertain, belongs here)

Pan-Uralic loans

The following loanwords have a distribution with regular cognates with at least one Ugric branch and some other branch, which points to early borrowing. Although formally *kana- and *kara- are ambiguous, they are probably from Proto-Indo-Iranian because of their distribution. The rest of the loans are from Pre-II or PII.

*kana- ‘to dig’, *kara- ‘to dig’, *meti ‘honey’, *mekši ‘bee’, *orpV ‘orphan’, *ora ‘awl’, *peji- ‘to milk’, *pätäri- ‘to flee’, *śara- ‘shit’, *śoma- ‘sad’

The following loanwords are found in at least two non-adjacent branches of Uralic (the ones listed in the above categories are not counted). As there are no widely accepted criteria for a word to be considered “Uralic”, all of these could be considered loanwords into Proto-Uralic, in this case probably from Proto-Indo-Iranian or Pre-Indo-Iranian.

*ajša ‘shaft’, *anti/onta ‘grass’, *ertä ‘side’, *ki̮ntaw ‘tree stump’, *mertä ‘human’, *orja ‘slave’, *para ‘good’, *počaw ‘reindeer’, *puntaksi ‘bottom’, *saγi- ‘to get, obtain’, *repä ‘fox’, *si̮ŋka ‘old’, *sasara ‘sister’, *sejti ‘bridge’, *śišta ‘wax’, *tarna ‘grass’, *toraksi ‘through’, *wiša ‘venom’



Discussion about the distribution and its impact on Uralic taxonomy

(…) there are Proto-Iranian loanwords which were borrowed simultaneously into several early branches of Uralic, making it likely that Uralic had split into several branches by the time of these contacts.

Also the fact that many of the Proto-Indo-Iranian loanwords either show a restricted distribution (such as West Uralic *waćara, *woraći) or irregular correspondences (*asVra, *śasra, *śi̮ta) can point to the conclusion that Proto-Uralic was fragmenting by the time when contacts with Proto-Indo-Iranian took place.

The earlier, Pre-Indo-Iranian loanwords usually show a wider distribution and regular sound correspondences. Although the number of these earliest loans is quite small, based on their distribution and regular correspondences it can be assumed that the Pre-Indo-Iranian stage (after RUKI, *l > *r and the merger of velars and labiovelars but before the merger of non-high vowels) was concurrent with Proto-Uralic, with the changes leading to Proto-Indo-Iranian happening after the dispersal of Proto-Uralic.

The distribution of loanwords reinforces the old idea that Samoyed is a lexical outlier, as only few convincing Indo-Iranian etymologies for Proto-Uralic words (*saδa- ‘to rain’, *tora- ‘to fight’) have a convincing reflex in Samoyed. However, the fact that such etymologies exist means rather that the situation is due to lexical loss in Samoyed, and that the earliest contact occurred before Samoyed split off from Proto-Uralic.

There are very few loanwords that have a Ugric distribution (being found in at least one Ob-Ugric branch and Hungarian), and likewise rather few in Ob-Ugric. The few loans that have a distribution confined to Ugric were borrowed before the change *s > *θ took place. This means that the Ugric distribution does not mean much from the point of view of chronology or taxonomy, as the words were borrowed into a language that was still identical to Proto-Uralic. Even some loans borrowed into Khanty and Mansi have to be so early.

Impacts on dating and the location of the contact zones

Because of the very limited number of convincing etymologies found only in Finnic or Saami, it is probable that there were not (extensive) contacts with Pre-Finnic or Pre-Saami after the split of Proto-West Uralic.

The great number of loanwords of varying ages in Permic inevitably points to the conclusion that the pre-form of the Permic branch had been constantly spoken in an area that was adjacent to the Iranian languages. The different layers of loanwords in Permic clearly point to chronological differences in the donor languages, but it also seems that Permic was in contact with various forms of Iranian and not with different diachronic stages of the same language.

In general, the words that have been borrowed are typical cultural words, and the contacts between Indo-Iranian and Uralic seems to have been a typical contact situation in which a culturally less-advanced language group borrows various cultural terms from a more “advanced” group. The words in various loanword layers related to horse and cattle breeding show obvious cultural influence in the field of domesticated animals, and the borrowing of some names of grains points to agricultural influence from the Indo-Iranians on the speakers of Uralic.

Needless to say, many of the borrowings I listed in A Song of Sheep and Horses suffer from the same ailment attributed to Indo-Europeanists in general:

With slight exaggeration one can agree with the remark by Koivulehto (1999a: 209–210) that the Indo-Europeanists often use outdated sources or are simply uninterested in the topic. The problem is further complicated by the various and often obsolete views expressed in even relatively modern Uralicist works, such as those of Rédei (1986c; 1988) or Katz (2003); (…) Mallory & Adams (2006) adequately refer to the importance of the early loanwords, but they use mostly Rédei’s outdated reconstructions and stratigraphy in support of their theories.

I need to review all related texts with this thesis and the works recently published by Kümmel, as well as the recent book of the Leiden school on Indo-Uralic.

Also, does anyone know the (traditional?) why of the resistance to the Indo-Uralic concept among Uralicists? Maybe it’s a reaction against the Nostraticist and Siberian views of Uralic espoused by the Soviets?


European hydrotoponymy (IV): tug of war between Balto-Slavic and West Uralic


In his recent paper on Late Proto-Indo-European migrations, when citing Udolph to support his model, Frederik Kortlandt failed to mention that the Old European hydrotoponymy in northern Central-East Europe evolved into Baltic and Slavic layers, and both take part in some Northern European (i.e. Germanic – Balto-Slavic) commonalities.


From Expansion slavischer Stämme aus namenkundlicher und bodenkundlicher sicht, by Udolph, Onomastica (2016), translated into English (emphasis mine):

NOTE. An archived version is available here. The DOI references for Onomastica do not work.

(…) there is a clear center of Slavic names in the area north of the Carpathians. Among them are root words of the Slavic languages such as reka / rzeka, potok u. a. m.

Even more important than this mapping is the question of how the dispersion of ancient Slavic names happened. What is meant by ancient Slavic names? I elaborated on this in this journal years ago (Udolph, 1997):

(1)Ancient suffixes that are no longer productive today.

This clearly includes Slavic *-(j)ava as in Vir-ava, Vod-ava, Il-ava, Glin-iawa, Breg-ava, Ljut-ava, Mor-ava, Orl-java among others. It has clear links to the ancient common Indo-European language (Lupawa, Morava-March-Moravia, Orava, Widawa). They have a center north of the Carpathians.


(2) Unproductive appellatives (water words), which have disappeared from the language, are certain witnesses of ancient Slavic settlements. A nice example of this is Ukr. bahno, Pol. bagno ‘swamp, bog, morass’ etc. The word has long been missing in South Slavic, although it appears in South Slavic names, but only in very specific areas (see Udolph, 1979, pp. 324-336).

(3) Names that go back to different sound shifts. [Examples:]

  • (…) the Slavic clan around Old Sorbian brna ‘feces, earth’, Bulgarian OCS brьnije ‘feces, loam’, OCS brъna ‘feces’, Slovenian brn, ‘river mud’, etc. is solved with the inclusion of onomastic materials (Udolph, 1979, p. 499-514). (…) Toponymic mapping shows important details.
  • bryn-slavic
    Karte 4. brъn < *brŭn und bryn- < *brūn- in slavischen Namen
  • (…)We also have an ablauting *krŭn-:*krūn- in front of us. Map 5 shows the distribution of both variants in Slavic names.
  • The next case is quite similar. It concerns Russ. appellative grjaz’ ‘dirt, feces, mud’, (…) for which an Old Slavic form *gręz exists. Slavic also knows the ablauting variant *grǫz.

    These maps (see Map 6, p. 222) show that a homeland of Slavic tribes can only be inferred north of the Carpathians.

    (4) Place-names formed by Slavic suffixes of Pre-Slavic nature, i.e. derived from Old European hydronyms.

    (a) The largest river in Poland, the Wisła, German Vistula, bears a clearly Pre-Slavic name, no matter how one explains it (Babik, 2001, pp. 311-315; Bijak, 2013, p. 34, Udolph, 1990 , Pp. 303-311).

    (b) With the same suffix are formed Sanok, place on the southwest of Przemyśl; Sanoka, a no longer known waters name, 1448 as fluvium Szanoka, near the place Sanoka and with a diminutive suffix -ok- a tributary of the Sanok, which is called Sanoczek (for details see Udolph, 1990, pp. 264-270; Rymut / Majtan, 1998, p. 222). The San also has a single-language name, but that does not change anything about the right etymology. The suffix variant -očь also includes Liwocz and Liwoczka, river names near Cracow; also a mountain range of the Beskydy is mentioned at Długosz as Lywocz.

    According to the opinion of the “Słownik prasłowiański” (Sławski (red.), 1974, p. 92), the suffix -ok- represents a Proto-Slavic archaism. It appears, for example, in sъvědokъ, snubokъ, vidokъ, edok, igrok, inok among others, but its antiquity also shows, among other things, that it started at archaic athematic tribes.

    Mapping of older and younger East Slavic place-names and translation into settlement evolution.

    Slavonic Urheimat

    If we apply this to the loess distribution in western Ukraine and south-eastern Poland, it is very noticeable that the center of the Old Slavic place names lies in the area where loess dispersal is gradually “frayed out”, i.e. for example, in the area west of Kiev between Krakow in the west and Winnycja and Moldavia in the east. In short, the distribution of good soils coincides with ancient Slavic names. If that is correct, we can expect a homeland in the Pre-Carpathian region, or better, a core landscape of Slavic settlement.

    The existence of Pre-Slavic Indo-European place names and water names whose structure indicates that they originated from an Indo-European basis, but then also developed Slavic peculiarities, can now – as stated above – only be understood to mean that the language group that we call today Slavic emerged in a century-long process from an Indo-European dialectal area.

    Loess areas between Poland and Ukraine. Image from Jary et al. (2018).

    From a genetic point of view, the scarce data published to date show a clear shift of central-east populations from more Corded Ware-like groups in the EBA towards more BBC-derived ancestry in the common era, to the point where ancient DNA samples from East Germany, Poland and Lithuania evolve from clustering between Corded Ware and Sub-Neolithic peoples to clustering close to Bell Beaker-derived groups, such as West Germanic peoples, Tollense samples, etc. (see below)

    Furthermore, sampled Early Slavs show bottlenecks under “Dinaric” I2a-L621 and central-eastern E1b-V13, which – in combination with the known phylogeography of Únětice and Urnfield – is compatible with its late expansion from a central-east European Slavonic homeland, such as the Pomeranian culture, in turn likely derived from Lusatian culture groups.

    This doesn’t preclude a more immediate expansion of Common Slavic in Antiquity closer to the northern Carpathians, which is also supported by the available Early Slavic sampling, apart from samples from the Avar and Hungarian polities.

    Likely Baltic (yellow-green) and Slavic (orange) groups ca. 500 AD on, with Finnic (cyan) and Mordvinic (blue) groups roughly divided through hydrotoponymy line ca. 1000 AD Top Left: Late Iron Age cultures. Top right: PCA of groups from the Iron Age to the Middle Ages. Y-DNA haplogroups during the Germanic migrations (Bottom left) and during the Middle Ages (Bottom right). Notice a majority non-R1a lineages among sampled Early Slavs. See full maps and PCAs.

    Proto-Baltic / Proto-Slavic

    Northern European hydronymy

    From Alteuropäische Hydronymie und urslavische Gewässernamen, by Udolph, Onomastica (1997), translated into English (emphasis mine):

    NOTE. An HTML version is available at Jurgen Udolph’s personal site.

    Because of the already striking similarities as the well-known “-m-case”, the number-words for ‘1000’, ’11’ and ’12’ and so on, J. Grimm had already assumed a close relationship between Germanic and Baltic and Slavic. (…)

    In my own search, I approached this trinity from the nomenclature side. In doing so, I noticed some name groups that can speak for a certain common context:

    1.* bhelgh-, *bholgh-.

    Map 10, p. 64, shows that a root * bhelgh- occurs in the name material of a region from which later Germanic, Baltic and Slavic originated. The Balkans play no role in this.


    2. *dhelbh-, *dholbh-, *dhl̥bh-

    The proof of the three ablauting * dhelbh, * dholbh, * dhl̥bh- within a limited area shows the close relationship that this root has with the Indo-European basis. Again it is significant in which area the names meet (…)


    3. An Indo-European root extension *per-s- with the meaning ‘spray, splash, dust, drop’ is detectable in several languages (…). From a Baltic-Slavic-Germanic peculiarity cannot therefore be spoken from the toponymic point of view. The picture changes, however, if one includes the derived water names.

    4. The root extension *pel-t-, *pol-t-, *pl̥-t- of a tribe widely spread in the Indo-European languages around *pel-, pol- ‘pour, flow, etc.’, whose reflexes are found Armenian through Baltic and Slavic to the Celtic area, is found in the Baltic toponymy, cf. Latv. palts, palte ‘puddle, pool’.

    The dynamics of stylistic changes of the form of the “Trzciniec pot” in the lowland regions of Central Europe, and spreading routes of the Trzciniec package in Central Europe. A good proxy for contacts through the Northern European Plain during the Early Bronze Age. Modified from Czebreszuk (1998).

    Early Balto-Finnic

    In order to properly delimit (geographically and chonologically) the Proto-Baltic and Proto-Slavic expansions, it is necessary to understand where the late Balto-Finnic homeland was located during the Bronze Age. The following are excerpts from the comprehensive hydrotoponymic study by Pauli Rahkonen (2013):

    In any case, Finnic probably had its origin somewhere around the Gulf of Finland. Names of large and central rivers such as Vuoksi (< Finnic vuo ‘stream’) and Neva (< Finnic neva ‘marsh, river’) must be very old and might represent Proto-Finnic hydronyms. In the southern coastal area of Finland, the names Kymi and Nietoo < *Niet|oja (id. later Porvoonjoki) may also be of Finnic origin and derive from, respectively, kymi ‘stream’ (see SSA I s.v. *kymi; see however SPK s.v. Kemijärvi; Rahkonen 2013: 24) and nieto(s) ‘heap of snow’ (SSA II s.v. nietos), in hydronyms probably ‘high (snowy?) banks of a river’. Mustion|joki is clearly a Finnish name < *must|oja ‘black river’. The river name Vantaa remains somewhat obscure, although Nissilä (see SPK s.v. Vantaanjoki) has derived it from the Finnic word vana ‘water route’. In western Finland the names of large rivers, such as Aura and Eura, are supposedly of Germanic origin (Koivulehto 1987).

    In Estonia the names of many of the most important rivers might be of Finnic origin: e.g. Ema|jõgi Est. ema ‘mother’ [Tartu district] (?? cf. the Lake Piiga|ndi < Est. piiga ‘maiden’), Pärnu [Pärnu district] < Est. pärn ‘linden’, Valge|jõgi [Loksa district] < Est. valge ‘white’, Must|jõgi [Võru district] < Est. must ‘black’. It is possible that Emajogi and especially Piigandi are the result of later folk etymologizing of a name with some unknown origin. However, as a naming motif there exist in Finland numerous toponyms with the stems Finnic *emä (e.g. 3 Emäjoki), *neit(V)- ‘maiden’ (e.g. Neitijärvi, Neittävänjoki, Neittävänjärvi) and Saami stems that can be derived from Proto Saami *nejte̮ ‘id’ (GT2000; NA).

    The historical southern boundary of Finnic hydronyms, excluding hydronyms produced by the Karelian refugees of the 17th century.

    These seemingly very old names of relatively large rivers in southern Finland, modern Leningrad oblast and Estonia support the hypothesis that Proto-Finnic was spoken for a long time on both sides of the Gulf of Finland and it thus basically corresponds to the hypothesis of Terho Itkonen (see below). In the Novgorod, Tver or Vologda oblasts of Russia, Finnic names for large rivers cannot be found (Rahkonen 2011: 229). For this reason, it is likely that the Late Proto-Finnic homeland was the area around the Gulf of Finland.

    Beyond the southeastern boundary of the modern or historically known Finnic-speaking area, there exists a toponymic layer belonging to the supposedly non-Finnic Novgorodian Čudes (see Rahkonen 2011). In theory it is possible that Proto-Finnic and Proto-Čudian separated from each other at an early stage or it is even possible that Proto-Čudian was identical with Proto-Finnic. However, this cannot be proven, because there is not enough material available describing what Novgorodian Čudic was like exactly.

    Yakhr-, -khra, yedr-, -dra and yer-/yar, -er(o), -or(o) names of lakes in Central and North Russia and the possible boundary of the proto-language words *jäkra/ä and *järka/ä. Rahkonen (2013)

    A summary of the data is then:

    • The Daugava River and the Gulf of Livonia formed the most stable south-western Balto-Finnic border (up until ca. 1000 AD): the Daugava shows a likely Indo-European etymology, while some of its tributaries are best explained as derived from Uralic.
    • The first layer of “Early Baltic” loans in Early Balto-Finnic are of a non-attested Baltic dialect closest to Proto-Balto-Slavic (read more about this early layer).
    • The latest samples of the Trzciniec culture (or derived Iron Age group) from its easternmost group in Turlojiškė (ca. 1000-800 BC?) show a western shift towards Bell Beaker, although they show a majority of hg. R1a-Z280; while the earliest sample from Gustorzyn (ca. 1900 BC), likely from Trzciniec/Iwno, from the westernmost area of the culture, shows a Corded Ware-like ancestry (and hg. R1a-Z280, likely S24902+) among a BA sampling from Poland clearly derived from Bell Beaker groups.

    One can therefore infer that the expansion of the Trzciniec culture – as the earliest expansion of central-west European peoples into the Baltic after the Bell Beaker period – represented either the whole disintegrating Balto-Slavic community, or at least an Early Baltic-speaking community expanding from the West Baltic area to the east.

    The similarity of Early Slavs and the Trzciniec outlier with the Czech BA cluster, formed by samples from Bohemia (ca. 2200–1700 BC), and the varied haplogroups found among Early Slavs – reminiscent of the variability of the Unetice/Urnfield sampling – may help tentatively connect the early Proto-Slavic homeland more strongly with a Proto-Lusatian community immediately to the south-west of the Iwno/Proto-Trzciniec core.

    Top Left:Likely Baltic, Slavic, and Balto-Finnic-speaking territories (asynchronous), overlaid over Late Bronze Age cultures. Balto-Slavic in green: West(-East?) Baltic (B1), unattested early Baltic (B2), and Slavic (S). Late Balto-Finnic (F) in cyan. In red, Tollense and Turlojiškė sampling. Dashed black line: Balto-Slavic/West Uralic hydrotoponymy border until ca. 1000 AD. Top right: PCA of groups from the Early Bronze Age to the Late Bronze Age. Marked are Iwno/Pre-Trzciniec of Gustorzyn (see below), Late Trzciniec/Iron Age samples from Turlojiškė, and in dashed line approximate extent of Tollense cluster; Y-DNA haplogroups during the Late Bronze Age (Bottom left) and during the Early Iron Age (Bottom right). Notice a majority non-R1a lineages among sampled Early Slavs. See full maps and PCAs.

    Proto-Balto-Slavic homeland

    Disconnected western border: Germanic

    The common Balto-Slavic – Germanic community must necessarily be traced back to the West Baltic. From Udolph’s Namenkundliche Studien zum Germanenproblem, de Gruyter (1994), translated from German (emphasis mine):

    My work [Namenkundliche Studien zum Germanenproblem] has shown how strong the Germanic toponymy is related to the East, less to Slavic, much more to Baltic. It confirms the recent thesis by W.P. Schmid on the special relationship Germanic and Baltic, according to which “the formation of the typical Germanic linguistic characteristics…must have taken place in the neighborhood of Baltic“.

    If one starts from a Germanic core area whose eastern boundary is to be set on the middle Elbe between the Erzgebirge and Altmark, there are little more than 400 km. to the undoubtedly Baltic settlement area east of the Vistula. Stretching the Baltic area westwards over the Vistula (as far as the much-cited Persante), the distance is reduced to less than 300 km. Assuming further that Indo-European tribes between the developing Germanic and the Baltic groups represent the connection between the two language groups, so can one understand well the special relationship proposed by W.P. Schmid between Germanic and Baltic. In an earlier period shared Slavic evidently the same similarities (Baltic-Slavic-Germanic peculiarities).

    Top: Palaeo-Germanic (G2, blue area), Proto-Balto-Slavic/Pre-Baltic (PBSL, green area) and Early Proto-Balto-Finnic (PBF, cyan area) homelands superimposed over Early Bronze Age cultures. Persante hydronym and Gustorzyn ancient DNA sample location marked. Y-DNA haplogroups during the Early Bronze Age (Bottom left) and during the Middle Bronze Age (Bottom right). Notice a mix of R1b-L151 samples from the west and the process of integration of R1a-Z645 lineages from the the north-east. See full maps and PCAs.

    Substrate and immediate eastern border: Early Balto-Finnic

    While Balto-Finnic shows a late Balto-Slavic adstrate, Balto-Slavic has a Balto-Finnic(-like) substrate, also found later in Baltic and Slavic, which implies that Balto-Slavic (and later Baltic and Slavic) replaced the language of peoples who spoke Balto-Finnic(-like) languages, influencing at the same time the language of neighbouring peoples, who still spoke Balto-Finnic (or were directly connected to the Balto-Finnic community).

    For more on this relative chronology in Balto-Slavic – Balto-Finnic contacts, see e.g. the recent posts on Kallio (2003), Olander (2019), or a summary of this substrate.

    While Rahkonen (2013) entertains Parpola’s theory of a West-Uralic-speaking Netted Ware area (ca. 1900-500 BC), due to the Uralic-like hydrotoponymy of its territory, he also supports Itkonen’s idea of the ancient presence of almost exclusively Balto-Finnic place and river names in the Eastern Baltic and the Gulf of Finland since at least the Corded Ware period, due to the lack of Indo-European layers there:

    NOTE. This idea was also recently repeated by Kallio (2015), who can’t find a non-Uralic layer of hydrotoponymy in Balto-Finnic-speaking areas.

    It should be observed that the territory between the historical Finnic and Mordvin-speaking areas matches quite well with the area of the so-called Textile Ceramics [circa 1900–800 BC] (cf. Parpola 2012: 288). The culture of Textile Ceramics could function as a bridge between these two extreme points. Languages that were spoken later in this vast territory between Finland–Estonia and Mordovia seem to derive from Western Uralic (WU) as well. I have called those languages Meryan-Muroma, Eastern and Western Čudian and an unknown “x” language spoken in inland Finland, Karelia and the Lake Region of the Russian North (Rahkonen 2011; 241; 2012a: 19–27; 2013: 5– 43). This might mean that the territory of the Early Textile Ceramics reflects to some extent the area of late Western Uralic.

    The archaeologically problematic area is Estonia, Livonia and Coastal Finland – the area traditionally assumed to have been populated by the late Proto-Finns. The Textile Ceramics culture was absent there. It is very difficult to believe that the Textile Ware population in inland Finland migrated or was even the main factor bringing the Pre- or Early Proto-Finnic language to Estonia or Livonia. There are no archaeological or toponymic signs of it. Therefore, I am forced to believe that Textile Ceramics did not bring Uralic-speaking people to those regions. This makes it possible, but not absolutely proven, to assume that some type of Uralic language was spoken in the region of the Gulf of Finland already before Textile Ceramics spread to the northwest (circa 1900 BC).

    Top Left: Corded Ware culture expansion. Top right: PCA of Corded Ware and Sub-Neolithic groups. Y-DNA haplogroups during the Corded Ware expansion (Bottom left) and during the subsequent Bell Beaker expansion (Bottom right). Notice the rapid population replacement of typical Corded Ware R1a-Z645 lineages by expanding Bell Beakers of hg. R1b-L23 in central-east Europe, while they show continuity in the described ancestral Fennoscandian West-Uralic-speaking territory. See full maps and PCAs.

    The Corded Ware population in Finland is thought to have been NW Indo-European by many scholars (e.g. Koivulehto 2006: 154–155; Carpelan & Parpola 2001: 84). At least, it is probable that the Corded Ware culture was brought to Finland by waves of migration, because the representatives of the former Late Comb Ceramics partially lived at the same time side by side with the Corded Ware population. However, it is possible that the immigrants were a population that spoke Proto-Uralic, who had adopted the Corded Ware culture from their Indo-European neighbors, possibly from the population of the Fatjanovo culture, e.g. in the Valdai region. This was suggested by Terho Itkonen (1997: 251) as well. In that case the population of the Typical and Late Comb Ceramics may have spoken some Paleo European language (see Saarikivi 2004a). In the Early Bronze Age, the Baltic Pre-Finnic language that I have suggested must have been very close to late WU and therefore no substantial linguistic differences existed between the Baltic Pre-Finns and the population of Textile Ceramics in inland Finland. I admit that this model is difficult to prove, but I have presented it primarily in order to offer new models of thinking.16 At least, there is no archaeological or linguistic reason against this idea.

    This dubitative attribution of Proto-Uralic to the expansion of Corded Ware groups in eastern Europe, which is what hydrotoponymic data suggests in combination with archaeology, has to be understood as a consequence of how striking Rahkonen finds the results of his research, despite Itkonen’s previous proposal, in the context of an overwhelming majority of Indo-Europeanists who, until very recently, simplistically associated Corded Ware with the Indo-European expansion.


    Even Kortlandt accepts at this point the identification of expanding East Bell Beakers from the Carpathian Basin as those who left the Alteuropäische layer reaching up to the Baltic. However, he identified Udolph’s data solely with West Indo-European, forgetting to mention the commonly agreed upon western Proto-Balto-Slavic homeland, most likely because it contradicts two of his main tenets:

    1. that Balto-Slavic split from a hypothetical Indo-Slavonic (i.e. Satem) group expanding from the east; and
    2. that laryngeals can be reconstructed for Balto-Slavic – unlike for North-West Indo-European.
    Indo-European hydrotoponymy in Europe and the Middle East (scarce Central Asian data). Baltic data compensated, statistical method RBF: intermediate regions devoid of Indo-European toponyms are inferred to have them; it compensates thus e.g. for the scarce Indo-European hydrotoponyms in Poland by assuming ‘soft’ continuity from West Germany to the Baltic.

    A hypothetic “Pre-Indo-Slavonic” laryngeal Indo-European layer reaching Fennoscandia and the Forest Zone with Corded Ware is fully at odds with all known data:

    • in comparative grammar, since the one feature that characterizes Graeco-Aryan is precisely its set of innovations relative to Northern Indo-European, which presupposes a longer contact (and further laryngeal loss) once Tocharian and North-West Indo-European had separated – hence probably represented by Palaeo-BalkanCatacomb-Poltavka contacts once Afanasevo and Yamna settlers from the Carpathian Basin / East Bell Beakers had become isolated;
    • in hydrotoponymy, because of the prehistoric linguistic areas that can be inferred from (1) the distribution of Old European hydrotoponymy; (2) Udolph’s work on Germanic and the likely non-Indo-European substrate in Scandinavia and land contacts with Balto-Finnic; (3) from the Northern European traits in the Northern European Plain; or (4) from the decreasing proportion of Indo-European place and river names from central Europe towards the east and north.
    • NOTE. An alternative explanation of Old European/Balto-Slavic layers, e.g. by a ‘Centum’ Temematic – even if one obviates the general academic rejection to Holzer’s proposal – couldn’t account for the absolute lack of an ancestral layer of Indo-European hydrotoponymy in North-Eastern Europe (i.e. the longest-lasting Corded Ware territory), in sharp contrast with Western Europe, South-Eastern Europe, and South Asia. All of that contradicts an Eastern Indo-European community, even without a need to recall that the oldest hydrotoponymic layers common to Fennoscandia and the Forest Zone are of Uralic nature.

    • in archaeology, because cultural expansions of the Eastern European Early Bronze Age province since the Bell Beaker period (viz. Mierzanowice, Trzciniec, Lusatian, Pomeranian, West Baltic Culture of Cairns) suggest once and again west-east movements, most (if not all) of which – based on the presence of Indo-European speakers during the common era – were likely associated with Indo-European-speaking communities replacing or displacing previous ones.
    • in palaeogenomics, because of the late and different association of Corded Ware ancestry and haplogroups among Balto-Slavic and Indo-Iranian communities, in turn corresponding to the different satemization processes found in both dialects, which may have actually been related to the Uralic substrate that is found in both (read more on Uralic influences on Balto-Slavic and on Indo-Iranian).

    On the other hand, a careful combination of Uralic and Indo-European comparative grammar, hydrotoponymic data, and population genomics fits perfectly well Itkonen’s and Rahkonen’s association of Corded Ware in Eastern Europe with Uralic languages, as well as the traditional mainstream view of Uralic before Indo-European in Fennoscandia and in the Forest Zone, as I explained in a recent post about genetic continuity in the East Baltic area.

    Population genomics is not the main reason to reject the Indo-European Corded Ware theory – or any other prehistoric ethnolinguistic identification, for that matter. It can’t be. This new field offers just the occasional confirmation of a well-founded theory or, alternatively, another nail in the coffin of fringe theories that were actually never that likely, but seemed impossible to fully dismiss on purely theoretical grounds.

    The problem with Corded Ware was that we couldn’t see how unlikely its association with Indo-European languages was until we had ancient DNA to corroborate archaeological models, because few (if any) Indo-Europeanists really cared about the linguistic prehistory of eastern and northern Europe, or about Uralic languages in general (contrary to the general trend among Uralicists to be well-versed in Indo-European studies). Now they will.


    Waves of Palaeolithic ANE ancestry driven by P subclades; new CWC-like Finnish Iron Age

    New preprint The population history of northeastern Siberia since the Pleistocene, by Sikora et al. bioRxiv (2018).

    Interesting excerpts (emphasis mine; most internal references removed):

    ANE ancestry

    The earliest, most secure archaeological evidence of human occupation of the region comes from the artefact-rich, high-latitude (~70° N) Yana RHS site dated to ~31.6 kya (…)

    The Yana RHS human remains represent the earliest direct evidence of human presence in northeastern Siberia, a population we refer to as “Ancient North Siberians” (ANS). Both Yana RHS individuals were unrelated males, and belong to mitochondrial haplogroup U, predominant among ancient West Eurasian hunter-gatherers, and to Y chromosome haplogroup P1, ancestral to haplogroups Q and R, which are widespread among present-day Eurasians and Native Americans.

    Symmetry tests using f4 statistics reject tree-like clade relationships with both Early West Eurasians (EWE; Sunghir) and Early East Asians (EEA; Tianyuan); however, Yana is genetically closer to EWE, despite its geographic location in northeastern Siberia

    Using admixture graphs (qpGraph) and outgroup-based estimation of mixture proportions (qpAdm), we find that Yana can be modelled as EWE with ~25% contribution from EEA

    Among all ancient individuals, Yana shares the most genetic drift with Mal’ta, and f4 statistics show that Mal’ta shares more alleles with Yana than with EWE (e.g. f4(Mbuti,Mal’ta;Sunghir,Yana) = 0.0019, Z = 3.99). Mal’ta and Yana also exhibit a similar pattern of genetic affinities to both EWE and EEA, consistent with previous studies.The ANE lineage can thus be considered a descendant of the ANS lineage, demonstrating that by 31.6 kya early representatives of this lineage were widespread across northern Eurasia, including far northeastern Siberia.


    Ancient Palaeosiberian

    (…) the 9.8 kya Kolyma1 individual, representing a group we term “Ancient Paleosiberians” (AP). Our results indicate that AP are derived from a first major genetic shift observed in the region. Principal component analysis (PCA), outgroup f3-statistics and mtDNA and Y chromosome haplogroups (G1b and Q1a1a, respectively) demonstrate a close affinity between AP and present-day Koryaks, Itelmen and Chukchis, as well as with Native Americans.

    For both AP and Native Americans, ANS ancestry appears more closely related to Mal’ta than Yana, therefore rejecting a direct contribution of Yana to later AP or Native American groups.

    Lake Baikal Neolithic – Bronze Age

    (…) the newly reported genomes from Ust’Belaya and recently published neighbouring Neolithic and Bronze Age sites show a succession of three distinct genetic ancestries over a ~6 ky time span. The earliest individuals show predominantly East Asian ancestry, closely related to the ancient individuals from DGC. In the early Bronze Age (BA), we observe a resurgence of AP ancestry (up to ~50% ancestry fraction), as well as influence of West Eurasian Steppe ANE ancestry represented by the early BA individuals from Afanasievo in the Altai region (~10%) This is consistent with previous reports of gene flow from an unknown ANE-related source into Lake Baikal hunter-gatherers.

    Our results suggest a southward expansion of AP as a possible source, which is also consistent with the replacement of Y chromosome lineages observed at Lake Baikal, from predominantly haplogroup N in the Neolithic to haplogroup Q in the BA. Finally, the most recent individual from Ust’Belaya, dated to ~600 years ago, falls along the Neosiberian cline, similar to the ~760 year-old ‘Young Yana’ individual from northeastern Siberia, demonstrating the widespread distribution of Neosiberian ancestry in the most recent epoch.

    Genetic structure of ancient northeast Siberians. PCA of ancient individuals projected onto a set of modern Eurasian and American individuals. Abbreviations in group labels: UP – Upper Palaeolithic; LP – Late Palaeolithic; M – Mesolithic; EN – Early Neolithic; MN – Middle Neolithic; LN – Late Neolithic; EBA – Early Bronze Age; LBA – Late Bronze Age; IA – Iron Age; PE – Paleoeskimo; MED – Medieval

    Finland Saami

    At the western edge of northern Eurasia, genetic and strontium isotope data from ancient individuals at the Levänluhta site documents the presence of Saami ancestry in Southern Finland in the Late Holocene 1.5 kya. This ancestry component is currently limited to the northern fringes of the region, mirroring the pattern observed for AP ancestry in northeastern Siberia. However, while the ancient Saami individuals harbour East Asian ancestry, we find that this is better modelled by DGC rather than AP, suggesting that AP influence was likely restricted to the eastern side of the Urals. Comparison of ancient Finns and Saami with their present-day counterparts reveals additional gene flow over the past 1.6 kya, with evidence for West Eurasian admixture into modern Saami. The ancient Finn from Levänluhta shows lower Siberian ancestry than modern Finns .

    EDIT (27 OCT 2018): By comparing the three, I see these are samples published already (at least two) in Lamnidis et al. (2018), but here with added (1) specific radiocarbon dates, (2) comparison with Neosiberian populations and (3) strontium isotope analyses.

    Finnish_IA (ca. 350 AD) is probably a Saami-speaking individual, just like the Saami_IA with newly reported radiocarbon dates from Levänluhta ca. 400-600 AD (since Fennic peoples were then likely around the Gulf of Finland).

    The conflicting strontium isotope data on marine dietary resources on certain samples from the supplementary material hint at possible external origin of the diet of some of the previously reported (and possibly one newly reported) Saami Iron Age individuals, from some 25-30 km. to the northwest through the river up to hundreds of km. to the southwest of Levänluhta (i.e. the whole coast of the Bothnian Sea). It is unclear why they would prefer an origin of the dietary source in southern Baltic regions instead of some km. to the west, though, unless that’s what they want to propose based on the sample’s admixture…

    The coast of the Bothnian Sea (=the northern part of the Baltic Sea, between Sweden and Finland) lay only 25-30 km to the northwest, and accessible to the Iron Age people of the Levänluhta region via the Kyrönjoki river. (…) For individual JA2065/DA236, the low 87Sr/86Sr value (0.71078) would imply an exceptionally heavy reliance on Baltic Sea resources. The δ13C and δ15N values of the individual are near comparable (especially considering within-Baltic latitudinal gradients in δ13C; Torniainen et al. 2017) to the δ13C and δ15N values of a Middle Neolithic population on the Baltic island of Gotland (Eriksson, 2004) interpreted to have subsisted primarily on seals.

    These new data on the samples give us some more information than what we already had, because the early date of Finnish_IA implies that there was few East Asian admixture (if any at all) in west Finland during the Roman Iron Age, which pushes still farther forward in time the expected appearance of Siberian ancestry among Saamic (first) and Fennic populations (later). It is unclear whether this East Asian ancestry found in Finnish_IA is actually related to DGC, or it is rather related to the ENA-like ancestry found already in Baltic hunter-gatherers (i.e. in some EHG samples from Karelia), for which Baikal_EN is a good proxy in Lazaridis et al. (2018).

    Since Bronze Age and Iron Age samples from Estonia show more Baltic_HG drift compared to Corded Ware samples, it is likely that this supposedly DGC-related ancestry (here considered part of the ‘Siberian ancestry’) is actually an EHG-related ENA component of north-east European hunter-gatherers, with whom Finno-Saamic peoples admixed during the expansion of the Corded Ware culture into Finland.

    The paper finds thus increased (probably the actual) Siberian ancestry in modern Finns compared to this Iron Age Saami individual. Coupled with the later Saami Iron Age samples, from between one to three centuries later – showing the start of Siberian ancestry influx – , we can begin to establish when the expansion of Siberian ancestry happened in central Finland, and thus quite likely when the Saami began to expand to the north and east and admix with Palaeo-Laplandic peoples.

    Admixture modelling using qpAdm. Maps showing locations and ancestry proportions of ancient (left) and modern (right) groups.

    One sample of haplogroup N1a1a1a1a4a1-M1982, Yana_MED, is found in the Arctic region (north-eastern Yakutia) ca. 1100 AD. Since it is derived from N1a1a1a1a-L392, it might be a surprise for some to find it in a clearly non-Uralic speaking environment at the same time other subclades of this haplogroup were admixing in the west with well-established Finno-Saamic, Volga-Finnic, Ugric, and Samoyedic populations…

    On the growing doubts that these data – contradicting the CWC=IE theory – are creating among geneticists (from the supplementary materials):

    NOTE. This paper comes from the Copenhagen group, also signed by Kristiansen, one of today’s strongest supporters of this connection

    The Proto-Saami language evolved in southern Finland and Karelia in the Early Iron Age, an area now host to Finnish and the closely related Karelian, but with Saami toponyms showing that the latter two languages are intrusive here (Saarikivi 2004). Saami-speaking populations are thought to have retreated to Lapland during the Middle Iron Age (300–800 AD), where it diverged into the modern Saami dialects. Genetically, the northward retreat of the Saami language correlates with the documented decrease of Saami ancestry in Southern Finland between the Iron Age and the modern period (cf. Lamnidis et al. 2018).

    On the way to Lapland, the Saami replaced at least two linguistically obscure groups. This can be inferred from 1) an influx of non-Uralic loanwords into Proto-Saami in the Finnish Lakeland area, and 2) an influx of non-Uralic, non-Germanic words into Saami dialects in Lapland (Aikio 2012). Both of these borrowing events imply contact with non-Saami-speaking groups, e.g. non-Uralic-speaking hunter-gatherers that may have left a genetic and linguistic footprint on modern Saami populations.

    The linguistic prehistory of Finland thus does not allow for a straightforward interpretation of the genetic data. The detection of East Asian ancestry in the genetically Saami individual is indicative of a population movement from the east (cf. Lamnidis et al. 2018, Rootsi et al. 2007), one that given the affinities with the ~7.6 ky old individuals from the Devil’s Gate Cave may have been a western extension of the Neosiberian turnover. However, it remains unclear whether this gene flow should be associated with the arrival of Uralic speakers, thus providing further support for a Uralic homeland in Eastern Eurasia, or with an earlier immigration of pre-Uralic, so-called “Paleo-Lakelandic” groups.

    I think the genetic interpretation is already straightforward, though. We had a sneak peek at how this late admixture with non-Uralians (mainly Palaeo-Lakelandic and Palaeo-Laplandic peoples from Lovozero and related asbestos ware cultures) is going to unfold among expanding Saami-speaking populations thanks to Lamnidis et al. (2018):

    PCA plot of 113 Modern Eurasian populations, with individuals from this study projected on the principal components. Uralic speakers are highlighted in light purple. Image modified from Lamnidis et al. (2018)

    Also, still no trace of R1a in far East Asia (reported as M17 ca. 5300 BC near Lake Baikal by Moussa et al. 2016), so I still have doubts about my previous assessment that R1a split into M17 (and thus also M417) in Siberia, with those expanding hunter-gatherer pottery.


    Iron Age bottleneck of the Proto-Fennic population in Estonia


    Demographic data and figures derived from Estonian Iron Age graves, by Raili Allmäe, Papers on Anthropology (2018) 27(2).

    Interesting excerpts (emphasis mine):


    Inhumation and cremation burials were both common in Iron Age Estonia; however, the pattern which burials were prevalent has regional as well temporal peculiarities. In Estonia, cremation burials appear in the Late Bronze Age (1100–500 BC), for example, in stone-cist graves and ship graves, although inhumation is still characteristic of the period [28, 18]. Cremation burials have occasionally been found beneath the Late Bronze Age cists and the Early Iron Age (500 BC–450 AD) tarand graves [30, 28]. In south-eastern Estonia, including Setumaa, the tradition to bury cremated human remains in pit graves also appeared in the Bronze Age and lasted during the Pre-Roman period (500 BC–50 AD) and the Roman Iron Age (50–450 AD), and even up to the medieval times [30, 23, 33, 9]. During the Early Iron Age, cremations appear in cairn graves and have occasionally been found in many Pre-Roman early tarand graves where they appear with inhumations [27, 28, 19, 20, 21, 22, 24]. In Roman Iron Age tarand graves, cremation as well inhumation were practiced [28, 36, 37]; however, cremation was the prevailing burial practice during the Roman Iron Age, for example, in tarand graves in south-eastern Estonia [30, 28]. Roman Iron Age (50 AD–450 AD) burial sites have not been found in continental west Estonia [28, 38]). At the beginning of the Middle Iron Age (450–800 AD), burial sites, for example stone graves without a formal structure, like Maidla I, Lihula and Ehmja ‘Varetemägi’, appear in Läänemaa, west Estonia; in these graves cremations as well inhumations have been found [39, 48]. Like underground cremation burial, the stone grave without a formal structure was the most common grave type during the Late Iron Age (800– 1200 AD) in west Estonia [39, 35, 48]. In south-eastern and eastern Estonia, sand barrows with cremation burials appeared at the beginning of the Middle Iron Age. Cremation barrows are attributed to the Culture of Long Barrows and are most numerous in the villages Laossina and Rõsna in northern Setomaa, on the western shore of Lake Peipsi [8, 48]. Apparently during the Iron Age, the practiced burial customs varied in Estonia.

    Typical prehistoric Estonian graves. Top: Cist-graves common during the Bronze Age, by Terker (GNU FDL 1.2). Bottom: Tarand graves of the Iron Age, by Marika Mägi (2017)


    Three Iron Age cremation graves from south-eastern Estonia and four graves including cremations as well inhumations from western Estonia were analysed by osteological and palaeodemographic methods in order to estimate the age and sex composition of burial sites, and to propose some possible demographic figures and models for living communities.

    The crude birth/death rate estimated on the basis of juvenility indices varied between 55.1‰ and 60.0‰ (58.5‰ on average) at Rõsna village in south-eastern Estonia in the Middle Iron Age. The birth/death rates based on juvenility indices for south eastern graves varied to a greater degree. The estimated crude birth/death rate was somewhat lower (38.9‰) at Maidla in the Late Iron Age and extremely high (92.1‰) at Maidla in the Middle Iron Age, which indicates an unsustainable community. High crude birth/death rates are also characteristic of Poanse tarand graves from the Pre-Roman Iron Age – 92.3‰ for the 1st grave and 69.6‰for the 2nd grave. Expectedly, newborn life expectancies are extremely low in both communities – 10.8 years at Poanse I and 14.4 years at Poanse II. Most likely, both Maidla I and Poanse I were unsustainable communities.

    Locations of the investigated Estonian Iron Age graves. Map by R. Allmäe

    According to the main model where the given period of grave usage is 150 years, the burial grounds were most likely exploited by communities of 3–14 people. In most cases, this corresponds to one family or household. In comparison with other graves, Maidla II stone grave in western Estonia and Rõsna-Saare I barrow cemetery in south-eastern Estonia could have been used by a somewhat larger community, which may mean an extended family, a larger household or usage by two nuclear families.

    More papers on the same subject by the author – who participated in the recent Mittnik et al. (2018) paper – include Observations On Estonian Iron Age Cremations (2013), and The demography of Iron Age graves in Estonia (2014).

    Fast life history in Iron Age Estonia

    While the demographic situation in the Gulf of Finland during the Iron Age is not well known – and demography is always difficult to estimate based on burials, especially when cremation is prevalent – , there is a clear genetic bottleneck in Finns, which has been estimated precisely to this period, coincident with the expansion of Proto-Fennic.

    PCA of Estonian samples from the Bronze Age, Iron Age and Medieval times. Tambets et al. (2018, upcoming).

    The infiltration of N1c lineages in Estonia – the homeland of Proto-Fennic – happened during the Iron Age – as of yet found in 3 out of 5 sampled Tarand graves – , while the previous period was dominated by 100% R1a and Corded Ware + Baltic HG ancestry. With the Iron Age, a slight shift towards Corded Ware ancestry can be seen, which probably signals the arrival of warrior-traders from the Alanino culture, close to the steppe. They became integrated through alliances and intermarriages in a culture of chiefdoms dominated by hill forts. Their origin in the Mid-Volga area is witnessed by their material culture, such as Tarand-like graves (read here a full account of events).

    This new political structure, reminiscent of the chiefdom system in Sintashta (with a similar fast life history causing a bottleneck of R1a-Z645 lineages), coupled with the expansion of Fennic (and displaced Saamic) peoples to the north, probably caused the spread of N1c-L392 among Finnic peoples. The linguistic influence of these early Iron Age trading movements from the Middle Volga region can be seen in similarities between Fennic and Mordvinic, which proves that the Fenno-Saamic community was by then not only separated linguistically, but also physically (unlike the period of long-term Palaeo-Germanic influence, where loanwords could diffuse from one to the other).

    NOTE. Either this, or the alternative version: an increase in Corded Ware ancestry in Estonia during the Iron Age marks the arrival of the first Fennic speakers ca. 800 BC or later, splitting from Mordvinic? A ‘Mordvin-Fennic’ group in the Volga, of mainly Corded Ware ancestry…?? Which comes in turn from a ‘Volga-Saamic’ population of Siberian ancestry in the Artic region??? And, of course, Palaeo-Germanic widely distributed in North-Eastern Europe with R1a during the Bronze Age! Whichever model you find more logical.


    Pre-Germanic born out of a Proto-Finnic substrate in Scandinavia


    A commenter, Old Europe, drew my attention to the Uralic (Finnic-Saamic) substrate in Germanic proposed by Schrijver in Chapter V. Origins of Language Contact and the Origins of the Germanic Languages, Routledge (2014).

    I wanted to share here some interesting excerpts (emphasis mine):

    NOTE. I have avoided many detailed linguistic discussions. You should read the whole chapter to check them out.

    The origins of the Germanic subfamily of Indo-European cannot be understood without acknowledging its interactions with a language group that has been its long-time neighbour: the Finnic subgroup of the Uralic language family. Indo-European and Uralic are linked to one another in two ways: they are probably related to one another in deep time — how deep is impossible to say3 — and Indo-European has been a constant source from which words were borrowed into Uralic languages, from the fourth millennium BC up to the present day.4 The section of the Uralic family that has always remained in close proximity to the Indo-European dialects which eventually turned into Germanic is Finnic. I use the term Finnic with a slightly idiosyncratic meaning : it covers the Finno-Saamic protolanguage and both of its children, Saami and Balto-Finnic.(…)

    Schrijver (2014). The Finnic family tree (simplified)

    Linguistically, the relationship between Indo-European and Uralic has always been asymmetrical. While hundreds of loanwords flowed into Uralic languages from Indo-European languages such as Germanic, Balto-Slavic, Iranian, and Proto-Indo-European itself, hardly any Uralic loanwords have entered the Indo-European languages (apart from a few relatively late dialectal loans into e.g. Russian and the Scandinavian languages). This strongly suggests that Uralic speakers have always been more receptive to ideas coming from Indo-European–speaking areas than the other way around. This inequality probably began when farming and the entire way of life that accompanies it reached Uralic-speaking territory via Indo-European–speaking territory, so that Uralic speakers, who traditionally were hunter-gatherers of the mixed and evergreen forest zone of northeastern Europe and gradually switched to an existence as sedentary farmers, were more likely to pick up ideas and the words that go with them from Indo-European than from anywhere else.

    Farming requires a different mind-set from a hunter-gatherer existence. Farmers are generally sedentary, model the landscape, and have an agricultural calendar to determine their actions. Hunter-gatherers of the northern forest zone are generally nomadic, and rather than themselves modelling the natural environment they are modelled by it: their calendar depends on when and where a particular natural resource is available.(…)

    All of this is no doubt a simplification of the thousands of years of associations between speakers of Uralic and speakers of Indo-European, but the loanword evidence strongly suggests that by and large relations between the two groups were highly unequal. The single direction in which loanwords flowed, and the mass of loanwords involved, can be compared with the relation between Latin and the vernacular languages in the Roman Empire, almost all of which disappeared in favour of Latin. It is therefore certain that groups of Uralic speakers switched to Indo-European. The question is whether we can trace those groups and, more particularly, whether Finnic speakers switching to Indo-European were involved in creating the Indo-European dialect we now know as Germanic.

    Convergence of Finnic and Germanic

    What both have in common is that the sound structures of Finnic and Germanic, which started from very different beginnings, apparently came to resemble one another significantly. If that is what we observe, we must conclude that both languages converged as a result of contact.

    During the approximately five to six millennia that separate Proto-Uralic from Modern Finnish, there was only one episode during which the consonantal system underwent a dramatic overhaul. This episode separates the Finno-Saamic protolanguage, which is phonologically extremely conservative, from the Balto-Finnic protolanguage, which is very innovative.


    By the time Finno-Saamic developed into Balto-Finnic, the consonant system was very different:


    In Balto-Finnic, the entire palatal series has been lost, apart from j, and the contrast between dentals and alveolars has disappeared: out of three different s-sounds only one remains. The fricatives ð and γ have been lost, and so has the velar nasal ŋ. The only increase has been in the number of long (geminate) consonants by the appearance of ss, mm, nn, and ll. The loss of separate alveolar and palatal series and the disappearance of ŋ could be conceived as convergences towards Proto-Germanic, which lacked such consonants. This is not obvious for the loss of the voiced fricatives γ, ð, which Proto-Germanic did possess. However, this way of comparing Balto-Finnic and Germanic is flawed in an important respect: what we are doing is assessing convergence by comparing the dynamic development from Finno-Saamic to Balto-Finnic to the static system of Proto-Germanic, as if Proto-Germanic is not itself the result of a set of changes to the ancestral Pre-Germanic consonantal system. If we wish to find out whether there was convergence and which language converged on which, what we should do, therefore, is to compare the dynamic development of Finno-Saamic to Balto-Finnic to the dynamic development of Pre-Germanic to Proto-Germanic, because only that procedure will allow us to state whether Balto-Finnic moved towards Proto-Germanic, or Proto-Germanic moved towards Balto-Finnic, or both moved towards a third language. The Pre-Germanic consonantal system can be reconstructed as follows: 7


    The slashes in the second and third rows indicate the uncertainty about the Proto-Indo-European nature of the sounds involved. (…)

    What resulted was the following Proto-Germanic consonant system:


    We are now in a better position to answer the question whether Proto-Germanic and Balto-Finnic have converged. Three striking developments affected both languages:

    • Both languages lost the palatalized series of consonants (apart from j), which in both languages became non-palatalized.
    • Both languages developed an extensive set of long (geminate) consonants; Pre-Germanic had none, while Finno-Saamic already had a few.
    • Both languages developed an h.

    These similarities between the languages are considerable.

    The idea that perhaps both languages moved towards a lost third language, whose speakers may have been assimilated to both Balto-Finnic and Germanic, provides a fuller explanation but suffers from the drawback that it shifts the full burden of the explanation to a mysterious ‘language X’ that is called upon only in order to explain the developments in Proto-Germanic and Balto-Finnic. That comes dangerously close to circular reasoning.

    Verner’s Law in Pre-Germanic

    As we have seen in the preceding section, Verner’s law is a sound change that affected originally voiceless consonants, so *p , t , k , kj , kw, s of the Pre-Germanic system. These normally became the Proto-Germanic voiceless fricatives *f, θ, h, h, hw, s, respectively. But if *p, t, k etc. were preceded by an originally unstressed syllable, Verner’s law intervened and they were turned into voiced consonants. Those voiced consonants merged with the series *bh, dh, gh of the Pre-Germanic system and therefore subsequently underwent all changes that the latter did, turning out as *b/v , *d/ð , g/γ in the Proto-Germanic system (that is, v, ð, γ after a vowel and b, d, g in all other environments in the word). When *s was affected by Verner’s Law, a new phoneme *z arose. In a diagram:


    While it is very common in the history of European languages for stress to influence the development of vowels, it only very rarely affected consonants in this part of the world. Verner’s law is a striking exception. It resembles a development which, on a much larger scale, affected Finno-Saamic: consonant gradation.(…)

    In all Finno-Saamic languages, rhythmic gradation has become phonemic and fossilized. The connection between rhythmic gradation and Verner’s law is relatively straightforward: both processes involve changing a voiceless consonant after an unstressed syllable. (…)

    We can therefore repeat for Proto-Uralic the argument that persuaded us earlier that gradation in Saami and Balto-Finnic must go back to the common Finno-Saamic protolanguage: the similarity of the gradation rules in Nganasan to those in Finno-Saamic is so specific and so detailed, and the phenomenon of gradation so rare in the languages of the world, that gradation must be reconstructed for the Uralic protolanguage.

    Verner’s law turns all voiceless obstruents (Pre-Germanic *p, t, k, kj, kw, s) into voiced obstruents (ultimately Proto-Germanic *b/v , d/ð, g/γ, g/γ, gw, z) after a Pre-Germanic unstressed syllable. Rhythmic gradation turns all voiceless obstruents after an unstressed syllable into weak-grade consonants, which means that *p, t, k, s become Finnic *b/v , d/ð , g/γ, z. This is striking. Given the geographical proximity of Balto-Finnic and Germanic and given the rare occurrence of stress-related consonant changes in European languages, it would be unreasonable to think that Verner’s law and rhythmic gradation have nothing to do with one another.

    It is very hard to accept, however, that gradation is the result of copying Verner’s law into Finnic. First of all, Verner’s law, which might account for rhythmic gradation, in no way accounts for syllabic gradation in Finnic. And, second, gradation can be shown to be an inherited feature of Finnic which goes all the way back to Proto-Uralic. Once one acknowledges that Verner’s law and gradation are causally linked and that gradation cannot be explained as a result of copying Verner’s law into Finnic, there remains only one possibility: Verner’s law is a copy of Finnic rhythmic gradation into Germanic. That means that we have finally managed to find what we were looking for all along: a Finnic sound feature in Germanic that betrays that Finnic speakers shifted to Germanic and spoke Germanic with a Finnic accent. The consequence of this idea is dramatic: since Verner’s law affected all of Germanic, all of Germanic has a Finnic accent.

    Late Chalcolithic migrations ca. 2600-2250 BC.

    On the basis of this evidence for Finnic speakers shifting to Germanic, it is possible to ascribe other, less specifically Finnic traits in Germanic to the same source. The most obvious trait is the fixation of the main stress on the initial syllable of the word. Initial stress is inherited in Finno-Saamic but was adopted in Germanic only after the operation of Verner’s law, quite probably under Finnic influence. The consonantal changes described in section V.3.1 can be attributed to Finnic with less confidence. The best case can be made for the development of geminate (double) consonants in Germanic, which did not inherit any of them, while Finno-Saamic inherited *pp, tt, kk, cc and took their presence as a cue to develop other geminates such as *nn and *ll . Possibly geminates developed so easily in Proto-Germanic because Finnic speakers (who switched to Germanic) were familiar with them. Other consonantal changes, such as the loss of the palatalized series in both Germanic and Balto-Finnic and the elimination of the different s- and c-phonemes, might have occurred for the same reason: if Balto-Finnic had undergone them earlier than Germanic, which we do not know, they could have constituted part of the Balto-Finnic accent in Germanic. An alternative take on those changes starts from the observation that they all constitute simplifications of an older, richer system of consonants. While simplifications can be and often are caused by language shift if the new speakers lacked certain phonemes in their original language, simplifications do not require an explanation by shift: languages are capable of simplifying a complex system all by themselves. Yet the similarities between the simplifications in Germanic and in Balto-Finnic are so obvious that one would not want to ascribe their co-occurrence to accidental circumstances.

    Grimm’s Law in Proto-Germanic (speculative)

    Voiceless lenis pronunciation of b, d, g is typical of the majority of German and Scandinavian dialects, so may well have been inherited from Proto-Germanic. Voiceless lenis is also the pronunciation that has been assumed to underlie the weak grades of Finno-Saamic single *p, t, k. If Proto-Germanic *b, d, g were indeed voiceless lenis, the single most striking result of the Germanic consonant shift is that it eliminated the phonological difference between voiced and voiceless consonants that Germanic had inherited from Proto-Indo-European (…) Since neither Finno-Saamic nor Balto-Finnic possessed a phonological difference between voiced and voiceless obstruents, its loss in Proto-Germanic can be regarded as yet another example of a Finnic feature in Germanic.


    It is clear that this account of the first Germanic consonant shift as yet another example of Finnic influence is to some degree speculative. The point I am making is not that the Germanic consonant shift must be explained on the basis of Finnic influence, like Verner’s law and word-initial stress, only that it can be explained in this way, just like other features of the Germanic sound system discussed earlier, such as the loss of palatalized consonants and the rise of geminates.

    A consequence of this account of the origins of the Proto-Germanic consonantal system is that the transition from Pre-Germanic to Proto-Germanic was entirely directed by Finnic. Or, to put it in less subtle words: Indo-European consonants became Germanic consonants when they were pronounced by Finnic speakers.

    Post-Bell-Beaker Europe, after ca. 2200 BC.

    The vocalic system, on the other hand, presented less difficulties for both, Indo-European and Uralic speakers, since it was quite similar.

    Schrijver goes on to postulate certain asymmetric differences in loans, especially with regard to Proto-Germanic, Balto-Finnic, Proto-Saamic, Proto-Baltic, and later contacts, including a potential non-Uralic, non-IE substrate language to justify some of these, which may in turn be connected with Kroonen’s agricultural substrate hypothesis of Proto-Germanic, and thus also with the other surviving Scandinavian Neolithic cultures before the eventual simplification of the cultural landscape during the Bronze Age.

    Conclusion on the origin of Germanic

    The Finnic-Germanic contact situation has turned out to be of a canonical type. To Finnic speakers, people who spoke prehistoric Germanic and its ancestor, Pre-Germanic, must have been role models. Why they were remains unclear. In the best traditions of Uralic–Indo-European contacts, Finnic speakers adopted masses of loanwords from (Pre-)Germanic. Some Finnic speakers even went a crucial step further and became bilingual: they spoke Pre-Germanic according to the possibilities offered by the Finnic sound system, which meant they spoke with a strong accent. The accent expressed itself as radical changes in the Pre-Germanic consonantal system and no changes in the Pre-Germanic vowel system. This speech variety became very successful and turned an Indo-European dialect into what we now know as Germanic. Bilingual speakers became monolingual speakers of Germanic.

    What we do not know is for how long Finnic-Germanic bilingualism persisted. It is possible that it lasted for some time because both partners grew more alike even with respect to features whose origin we cannot assign to either of them (loss of palatalized consonants): this suggests, perhaps, that both languages became more similar because generally they were housed in the same brain. What we can say with more confidence is that the bilingual situation ultimately favoured Germanic over Finnic: loanwords continued to flow in one direction only, from Germanic to Finnic, hence it is clear that Germanic speakers remained role models.

    This is as far as the linguistic evidence can take us for the moment.

    Based on archaeology and genetics, I think we can say that the close North-West Indo-European – Proto-Finnic interaction in Scandinavia lasted for hundreds of years, during the time when a unifying Nordic culture and language developed from Bell Beaker maritime elites dominating over Corded Ware groups.

    As we know, Uralic languages were in close contact with Middle PIE, and also later with Proto-Indo-Iranian. This Pre-Germanic development in Scandinavia is therefore another hint at the identification of a rather early Proto-Finnic spoken in the Baltic area – potentially then by Battle Axe groups – , and thus the general identification of Uralic expansion with the different Corded Ware groups.

    NOTE. The ‘common’ loss of certain palatals, which Schrijver interprets as a change of Pre-Germanic from the inherited Proto-Indo-European, may in fact not be such – in the opinion of bitectalists, including us, and especially taking the North-West Indo-European reconstruction and the Corded Ware substrate hypothesis into account – , so this effect would be a rather unidirectional shift from Finnic to Germanic. On the other hand, certain palatalization trends which some have described for Germanic could in fact be explained precisely by this bidirectional influence.


    On Proto-Finnic language guesstimates, and its western homeland


    Recent chapter The Indo-Europeans and the Non-Indo-Europeans in Prehistoric Northern Europe, by Petri Kallio, In: Language and Prehistory of the Indo- European Peoples: A Cross-Disciplinary Perspective, Copenhagen (2017).

    Interesting excerpts (emphasis mine), especially when read in combination with the most recent papers on Early Indo-Iranian, Corded Ware, and Fennoscandian genomes:

    Like the Indo-Europeanists, also the Uralicists suffer from their “school who wants it large and wants it early”. This time, however, the desired homeland is even larger and earlier, covering the whole northern half of Europe already at the end of the Ice Age (Wiik 2002). As a Finn, admittedly, I find such an idea very flattering indeed. As a historical linguist, however, I also find it absurd for the same reasons which I already gave above in the case of the Indo-Europeans.

    True, linguistic palaeontology is less helpful in the case of the Uralians, even though especially Common Uralic *pata ‘clay pot’ and *wäśkä ‘copper’ indicate that Proto-Uralic was not spoken before the Subneolithic period, which in the East-Baltic area is dated about 5300–3200 BC. However, the most valuable evidence comes from the earliest Indo-European loanwords in the Uralic languages, which show that Proto-Uralic cannot have been spoken much earlier than Proto-Indo-European dated about 3500 BC (Koivulehto 2001: 235, 257).

    As the same loanword evidence naturally also shows that the Uralic and Indo-European homelands were not located far from one another, the Uralic homeland can most likely be located in the Middle and Upper Volga region, right north of the Indo-European homeland*. From the beginning of the Subneolithic period about 5900 BC onwards, this region was an important innovation centre, from where several cultural waves spread to the Finnish Gulf area, such as the Sperrings Ware wave about 4900 BC, the Combed Ware wave about 3900 BC, and the Netted Ware wave about 1900 BC (Carpelan & Parpola 2001: 78–90).

    * Interestingly, the only Uralicists who generally reject the Central Russian homeland are the Russian ones who prefer the Siberian homeland instead. Some Russians even advocate that the Central Russian homeland is only due to Finnish nationalism or, as one of them put it a bit more tactfully, “the political and ideological situation in Finland in the first decades of the 20th century” (Napolskikh 1995: 4). Still, some Finns (and especially those who also belong to the “school who wants it large and wants it early”) simultaneously advocate that exactly the same Central Russian homeland is due to Finnlandisierung (Wiik 2001: 466). Fortunately, I do not even need to resort to playing the politics card myself, because there is enough convincing evidence for the Central Russian homeland anyway.

    Remarkably, the loanword evidence furthermore suggests that the ancestors of Finnic and Saamic had at least phonologically remained very close to Proto-Uralic as late as the Bronze Age (ca. 1700–500 BC). In particular, certain loanwords, whose Baltic and Germanic sources point to the first millennium BC, after all go back to the Finno-Saamic proto-stage, which is phonologically almost identical to the Uralic proto-stage (see especially the table in Sammallahti 1998: 198–202). This being the case, Dahl’s wave model could perhaps have some use in Uralic linguistics, too.

    Even though Bronze-Age Finnic and Saamic were still two dialects rather than two languages, it does not mean that they would still have been spoken in a geographically limited area. On the contrary, their Indo-European loanwords dating to this period indicate that their speech areas were already geographically separate. The fact that at that time both Baltic and Germanic influenced Finnic much more strongly than Saamic must be considered a crucial piece of information when we are trying to locate the Finnic and Saamic homelands.

    Iron Age migrations in Europe.

    (…)the fact that Palaeo-Germanic loanwords are much more numerous in Finnic than in Saamic must lead to the same conclusion. As I noted above, the most likely Palaeo-Germanic speaking carriers of the Nordic Bronze culture (ca. 1700–500 BC) spread from Scandinavia to the Finnish and Estonian coastal areas. As they never spread any further to the east than as far as the bottom of the Finnish Gulf, the idea that the Finnic homeland included neither Finland nor Estonia completely fails to explain the very existence of Palaeo-Germanic loanwords, whose quantity and quality in Finnic presuppose a superstrate rather than an adstrate.

    (…) as the Nordic Bronze culture influenced coastal Finland much more strongly than it did coastal Estonia, the idea that the Finnic homeland did not include Finland but Estonia alone similarly fails to explain the very strength of the Bronze-Age Palaeo-Germanic superstrate in Finnic, which can indeed be compared with the Medieval French superstrate in English, for instance (Kallio 2000: 96–97). From a Germanicist point of view, therefore, Itkonen’s theory concerning the Finnic homeland does not only seem to be the best but also the only alternative (Koivulehto 1984: 198–200).

    As the same can now also be said about the Indo-Europeanization of the Baltic speech area, the fact that Baltic and Finnic are the most conservative branches of their language families and that they have relatively few substrate words may really be due to exactly the same reason, namely that before their arrival the East-Baltic region was still very sparsely populated by Subneolithic hunter-fisher-gatherers, whose linguistic influence on the newcomers was therefore rather limited. On the other hand, as these language shifts already took place millennia ago, there has been a lot of time for the Baltic and Finnic speakers to replace most of their old substrate words by all kinds of new lexical innovations.

    Speaking of loanword evidence, the Aikios and especially Saarikivi (2004b) have furthermore argued that the Indo-Iranian loanwords occurring in Finnic and/or Saamic alone force us to locate the Finnic and Saamic homelands further to the east (e.g. near the White Lake). Still, I fail to see why the Indo-Iranian loanwords counted in dozens should be more relevant in locating these two homelands than the Germanic loanwords counted in hundreds. Besides, the Indo-Iranian loanwords mainly consist of cultural borrowings which do not necessarily presuppose a superstrate but only an adstrate. Moreover, they must be dated so much earlier than Vedic Sanskrit (ca. 1500–1000 BC) and Gathic Avestan (ca. 1000–800 BC) anyway that their spread can very well be connected with the abovementioned Netted Ware wave about 1900 BC.

    An interesting read, where the author expressly refers to the many political (nationalist) and xenophobic overtones (including his own) that arise in ethnolinguistic identifications of prehistoric cultures.

    We are seeing how the newest dialectalisation trends want it ‘late and small’, and ‘late’ corresponds smoothly with the most recent genomic findings involving Chalcolithic and Bronze Age expansions.

    In the Uralic case, in North-Eastern Europe only Corded Ware migrants are known to have expanded within a suitable time frame into the region, and their patrilineal descendants show a widespread distribution in the region during the Bronze Age.

    Also, if Proto-Finnic is coeval with Pre-Proto-Germanic, and expanded from the western part of North-East Europe (necessarily including the Gulf of Finland), well… You know the drill.

    Of course, regarding Proto-Indo-European and Uralic, there are also a lot of people who still want itlarge and early‘ – and the most recent research won’t deter them from such proposals.