Intense but irregular NWIE and Indo-Iranian contacts show Uralic disintegrated in the West

chalcolithic-early-uralic-indo-european

Open access PhD thesis Indo-Iranian borrowings in Uralic: Critical overview of sound substitutions and distribution criterion, by Sampsa Holopainen, University of Helsinki (2019), under the supervision of Forsberg, Saarikivi, and Kallio.

Interesting excerpts (emphasis mine):

The gap between Russian and Western scholarship

Many scholars in the Soviet Union and later the Russian Federation also have researched this topic over the last five decades. Notably the eminent Eugene Helimski dealt with this topic in several articles: his 1992 article (republished in Helimski 2000) on the emergence of Uralic consonantal stems used Indo-Iranian and other Indo-European loans as key evidence, and it was one of the first serious attempts to stratify the loanwords, paying attention to the non-initial syllables as well. Helimski (1997b) discusses Indo-Iranian loanwords more generally, but it is especially notable for the introduction of the “Andronovo Aryan” idea: Helimski argues that some loanwords in Ob-Ugric and Permic are derived from an unattested, third branch of Indo-Iranian. Helimski’s idea has been supported by at least Mikhail Zhivlov in a 2013 article, but otherwise it has not received wide acceptance. Helimski was also known for his criticism (see especially Helimski 2001) of Jorma Koivulehto’s etymological work: although the main targets of Helimski’s criticism were Koivulehto’s writings on Proto-Indo-European and Germanic borrowings (which fitted poorly with Helimski’s ideas of the Nostratic roots of Proto-Uralic and his other theories on Uralic linguistic prehistory), also some of his Indo-Iranian ideas received unnecessarily sharp criticism in Helimski (2001).

Vladimir Napol’skikh is another important Russian scholar who has written on several occasions about Indo-Iranian–Uralic contacts. His 2014 article is notable for its criticism on Helimski’s Andronovo Aryan theory and his arguments in favour of Indo-Aryan loanwords. Napol’skikh also considered some of the traditional Indo-Iranian loanwords to be borrowings from Tocharian (see below) in some of his earlier works, an idea which has been criticized by Kallio (2004) and Widmer (2002) and which Napol’skikh himself has since dropped in later publications (2010, 2014), where many of these alleged Tocharian loans are again considered Indo-Iranian.

Some of the main characteristics of Russian research is that the earliest Indo-European loanwords are usually considered to represent an inheritance from the Nostratic proto-language (Helimski [2001]; Kassian, Zhivlov & Starostin [2015]), an idea which is not widely accepted by scholars of Uralic in the West. Although this often does not concern the Indo-Iranian loanwords at all, or it concerns only a part of them, the works of Jorma Koivulehto, who dealt with both earlier Indo-European and Indo-Iranian loans, receive so much criticism from the Russian scholars that his important ideas are often totally rejected or left unmentioned in Russian research.

This kind of rejection of central etymological research literature can be considered one of the most pressing problems in Uralic loanword studies, and it leaves a regrettable gap between Russian and Western European scholars in this perspective.

11-chalcolithic-late-cultures

11-chalcolithic-late-uralic

Semantics

Among the Indo-Iranian loanwords in Uralic, one can easily mention examples that follow the classification of semantic change as described above. For widening or generalization, vasara ‘hammer’ is a good example: the Indo-Iranian original denotes ‘the weapon of the god Indra’ in Indic and ‘the weapon of the god Mithra’ in Avestan, whereas Finnish ‘hammer’ (and the Mordvin meaning ‘axe’) are more general meanings of tools. Fi huhta is a good example of narrowing: Iranian *tsuxta- means simply ‘burned’, whereas in Finnic huhta means specifically ‘a burned patch used in slash-and-burn agriculture’. Metonomy has taken place in Mordvin, where čuvto denotes simply ‘tree’; this probably developed through the meaning ‘wood burned for agriculture’. Khanty (South) wǟrəs denotes ‘horse’s mane’, but its Iranian original probably had a more general meaning of hair (cf. Avestan varəsa- ‘hair of human and animal, mostly hair of the head’).

An interesting example of degeneration is the etymology of Finnic orja ‘slave’, probably borrowed from the Indo-Iranian ethnonym *(H)ārya- ‘Aryan’ (for the original semantics of this word, see the entry *orja in Chapter 2). A similar development is seen in English slave which is etymologically connected to the ethnonym Slav.

Distribution as a criterion in the dating of loanwords

(…) some of the Indo-Iranian loans seem to have a wide distribution, but upon a closer look it becomes clear that they include phonological irregularities, which can only be explained by assuming that they are parallel loans. The ability to recognize parallel borrowings is extremely important in Uralic loanword studies, and it has been developed with success in the research of Germanic and Baltic loanwords (see Junttila 2015).

Interestingly, K. Häkkinen (1983: 207) argues that although words disappear from languages, the most basic words often remain stable and are maintained for longer periods. Although this is probably true, here the notion of “basicness” is something that is open to different interpretations. Many central concepts in culture and livelihoods are often described with prestige words that are borrowed, and these central words can be very easily replaced. In determining the age of the loanwords one has to always keep in mind that a reflex of a very early cultural borrowing from Indo-Iranian to Proto-Uralic/Proto- West Uralic etc. can easily have been lost in some daughter language, if a later prestige loan for the same concept has been borrowed from some later contact language (such as from some form of Germanic or Baltic into Finnic or from some Turkic language into Udmurt, Mari or Mordvin).

In Uralic linguistics the common loanword layers shared by some intermediary proto-language have often been seen as giving support to the reconstruction of these stages, but K. Häkkinen (100–108) considers this problematic. It should also be noted that the distribution of Indo-Iranian loanwords very rarely matches the assumed taxonomic divisions: there are some loanwords confined to the Finno-Permic, Finno-Volgaic or Ugric languages, but very few loanwords that would be Finno-Permic, Finno-Volgaic or Ugric in the way that the word is found in all the languages that belong to the branch.

Consontants

Laryngeals

There are only very few possible examples of a consonantal substitution of the word-initial laryngeal. It seems probable that the word-initial laryngeal, if it was retained, was not substituted in any way in Uralic. *karšV (> Fi karhu), an uncertain etymology, is the only possible example.

(…) Even if *k was a result of laryngeal hardening, the development would probably be earlier than Proto-Indo-Iranian, meaning that by the time the word was borrowed, the Indo-Iranian word simply had the stop *k that was regularly substituted by Uralic *k.

Evidence for Andronovo Aryan and Indo-Aryan loanwords?

None of the loanwords have to be considered as Andronovo Aryan or Proto-Indo-Aryan based on the criteria that were presented in the Introduction. The Uralic palatal affricate *ć or sibilant *ś can in all cases be explained from Proto-Indo-Iranian *ć, and there is no need to assume that it should reflect Andronovo Aryan *ć or PIA *ś. In the etymological material of this study, no further positive evidence was found for the distinction of PU *ś and *ć as substitutions of the Proto-Indo-Iranian affricates. This means that at least in word-initial position there probably was no difference between *ć and *ś, and even though we do not know what this sound was phonetically, it is safe to assume that Uralic words showing *ś reflect a sound substitution of Indo-Iranian *ć and *Ʒ́.

Regarding the distribution of the etymologies within Indo-Iranian, all the loanwords which cannot be from Iranian because of the lack of attested Iranian cognates have a more or less secure Proto-Indo-Iranian etymology, and nothing prevents us from assuming that these words reflect Proto-Indo-Iranian borrowings. It is also possible that some words with solid Proto-Indo-Iranian etymologies were present in Iranian but were lost before the first Old Iranian texts were composed.

12-bronze-age-early-cultures

12-bronze-age-early-uralic

List of Indo-European and Indo-Iranian Etymologies

Pre-Indo-Iranian

*ertä ‘side’, *kekrä ‘wheel’, *kečrä ‘spindle’, *mekši ‘bee’, (*meti ‘honey’), *ońća ‘part’, (*orpa ‘orphan’), *peijas ‘feast’, *pejmä ‘milk’, Pre-P *pertä ‘wing’, *repä ‘fox’, *rećmä ‘rope’, *sejti ‘bridge’

Proto-Indo-Iranian

*aćtara ‘whip’, *anti/onta, *ora ‘awl’, *orja ‘slave; south’, (*orpa ‘orphan’), *pośi ‘penis’, *śaŋka ‘handle’, Pre-Md *śaγa ‘goat’, *śarwi ‘horn’, *śaδa- ‘to rain’, śara- ‘shit’, *śi̮ta ‘hundred’, Pre-P *śVta ‘hundred’, *śasra ‘thousand’, *śišta ‘wax’, *śoma- ‘sad’, *waćara ‘hammer’, *woraći ‘boar’

Ambiguous early loans (can be either from PII or PI)

*ajša ‘shaft’, *asVra ‘lord’, *iha ‘yearning. passion’, *ihta ‘lust’, *jama ‘twin’, *jawi/jowa (> Mo juv) ‘awn’, *jawi (> PS *jäə̑) ‘flour’, *ji̮ni ‘way, path’, *juma ‘god’, *kana- ‘to dig’, *kara- ‘to dig’, *kata- ‘to graze’, *kertä- ‘to bind’, *ki̮ntaw ‘tree stump’, *kürtńV ‘iron’, PKh *kǟrtV ‘iron’, *kärtä ‘iron’, *martas ‘dead’, *ńātV- ‘to help’, *pakas ‘god’, *para ‘good’, Kh pĕnt ‘way’, PMs *pē̮ńtV ‘brother-in-law’, *pora ‘old’, *poči- ‘to boil’, Pre-P *porta ‘vessel’, *puntaksi ‘bottom’, Pre-Ma *pänti- ‘to bind’, PMa *pärća ‘ear of corn’, *pätäri- ‘to flee’, *saγi- ‘to get, obtain’, *sampas ‘pillar’, *saŋka ‘old’, *sara ‘lake’, *sasara ‘sister’, *säptä ‘seven’, *tajwas ‘sky’, *takra ‘piece of flesh’, *tarna ‘grass’, *tojwV ‘wish’, *toraksi ‘through’, *tora- ‘to fight’, *täjV ‘milk’, *täjinV ‘cow’, *täši, *uška ‘bull’, *wakša- (> PS *wåtå-) ‘to grow’, *wajna- ‘to see’, *wojna- ‘to see’, *wiša ‘venom’, *wi̮rna ‘wool’, *wärkä ‘kidney’, PS *wǝ̑rkǝ̑ ‘wolf’, *wirtV- ‘to hold, raise’, *äŋkärä ‘coal’

List of uncertain Indo-Iranian etymologies

PFi *aiwa (← Germanic ?), Ma *arša ‘mane’, PMs *ǟrV ‘fire’, *aštira ‘barren earth’, POug *ćakV ‘hammer’, *ćara- ‘brown; ? to dawn’, *ćero ‘hill-top’, *ćerti ‘group’, *itä- ‘to appear’, Pre-Fi *karšV ‘bear’, PMs *kīrV ‘iron’, *kota ‘chum’, Pre-Sa *kupa ‘pit’, PFi *kärsä ‘snout’, *maksa- ‘to pay’, PFi *mana-, PUg ? *mańći, Ma marij ‘Mari; man; husband’, *mē̮ja ‘wedding’, *mykkä ‘dumb’, PP *oč ‘corn’, *orpV ‘relative’, PFi *paksu ‘thick’, *peji- ‘to milk’, *pi̮ŋka ‘psychedelic mushroom’ POUg *porV ‘phratry’, Pre-Sa *poti ‘against’, Pre-Fi *šatas ‘germ’, *sentü- ‘to be born’, *šerä- ‘to wake up’, Ms šVšwǝŋ ‘hare’, PUg *śeŋkV ‘nail’, Pre-Sa *soma/sami ‘some’, PP *sur ‘beer’, PFi *süte- ‘to hit’ (< ? *sewči-), Hu szekér ‘wagon’, Kh ʌīkər ‘Narte’ PUg *taja- ‘secret’, Pre-Fi *terni ‘young’, *terwV ‘healthy’, ? *towkV ‘spring’, PWU *utarV ‘udder’ (← Germanic ?; Mari *waδar ← II), *waŋka ‘hook’, Mo E v́eŕges, M vərǵas ‘wolf’

Etymologies that were probably borrowed from another Indo-European source (PIE, PBSl, Germanic, Baltic)

*aisa ‘shaft’ ← Balto-Slavic, PFi *aiwa (← Germanic ?), *apV ‘help’ ← Germanic, *jewä ‘grain’ ← Balto-Slavic, Ma karaš etc. ‘honeycomb’ ← Baltic, (*meti ‘honey’ ← ? PIE,) Fi *ojas ‘shaft’ ← Slavic, *ola ← Baltic, *oŋki ← Germanic, *porćas ← Balto-Slavic, Pre-Sa *porta ‘vessel’ ← Germanic, *salV ‘salt’ (cannot be reconstructed for PU, various later parallel loans), *śi̮lkaw ← Balto-Slavic, *sammu- ← Germanic, *śuka ← Balto-Slavic, Mari *šŭžar ← Baltic/Balto-Slavic or Slavic, *tejniš ‘pregnant animal’ ← Baltic/Balto-Slavic, PWU *utarV ‘udder’ (? ← Germanic)

Early loans into differentiated branches

Proto-West Uralic

Only in Finnic:

*aćnas ‘voracious’, *iha ‘wish’, *ihta ‘lust’, PFi *isV ‘appetite’, *martas ‘dead’, *očra ‘barley’, *peijas ‘feast’, *pejmä ‘milk’, *pe̮rna ‘spleen’, *sampas ‘pillar’, *sooja ‘shelter’, *tajwas ‘sky’, *takra ‘piece of flesh’, *terwV ‘healthy’, *tojwV ‘wish’

All of these words, with the exception of *sooja ‘shelter’, were clearly borrowed into Early Proto-Finnic (Pre-Finnic) at the latest. Formally most of the loans could be from PII or PI.

Only in Saami:

*kata- ‘to graze’, *kertä- ‘to bind’, *pora ‘old’, *wojna- ‘to see’

All of the loans were acquired before the Saami vowel changes. Formally all could be either from Proto-Indo-Iranian or Proto-Iranian.

Only in Finnic and Saami:

*asma ‘voracious’, *jama ‘twin’, *kekrä ‘wheel’, *mača ‘insect’

*asma ‘voracious’, *jama ‘twin’, *kekrä ‘wheel’, *mača ‘insect’ Of these, *mača from Proto-Iranian and *jama is ambiguous. As the -sm- in asma does not point to Proto-Indo-Iranian *ć, this is probably an Iranian loan too. It is possible that these words were borrowed into Proto-West Uralic, as there is no general support for a Finno-Saamic proto-language today. As the cognates within Finnic and Saami are regular, there is no need to assume parallel borrowings. *kekrä has to be from Proto-Indo-Iranian.

NOTE. Based on the discussion of stages of borrowing from Indo-Iranian, and of the distribution of *kekrä among Uralic dialects in particular, Holopainen probably means Pre-Indo-Iranian for this example.

Only in Mordvin and/or Finnic and/or Saami (can point to a borrowing into Proto-West Uralic):

*ji̮ni ‘way’, *kečrä ‘spindle’, *rećmä ‘rope’, *śaŋka, *waćara ‘hammer’, *warsa ‘foal’, *wasa ‘calf’, *woraći ‘pig’

Based on phonological criteria, these loans do not form a chronologically coherent layer, but probably their modern distribution is accidental (their original distribution can have been wider). *kečrä ‘spindle’ and *rećmä ‘rope’ are from Pre-II, *śaŋka, *waćara and *woraći from PII, *warsa and *wasa from later Iranian (Alanic). *ji̮ni is ambiguous. Also the loans confined to Finnic and Saami mentioned above probably were borrowed into Proto-West Uralic, as it is a more convincing taxonomic entity than Proto-Finno-Saamic.

Proto-Mari-Permic

Only in Mordvin, Finnic and/or Saami and Mari

*juma ‘good’

This loan can be either from PII or PI. As it is obvious that these four branches do not form any taxonomical entity (Salminen 2002; J. Häkkinen 2009), it is only logical that there are no other loanwords with a “Finno-Volgaic” distribution.

Only in Mari:

*kVrtnV ‘metal’ (← PII, PI or later), Pre-Ma *pänti- ‘to bind’, PMa *pärća ‘ear of corn’, *si̮rńa ‘gold’ (← Old Iranian)

Only very few early Indo-Iranian loans can be found in Mari and in no other Uralic language. It is unclear what the reason for this is. It is, of course, possible that some uncertain loanwords like marij ‘man; Mari’ turn out to be correct after all, but even that does not make the number of loans in Mari very high. The situation has to be explained either with loss of vocabulary and replacement by later loans (from Turkic, and also perhaps from Permic) or with Mari’s location on the periphery at the time of the later contacts with the Iranian languages. Agyagási (2019: 254–258) argues that the current area where Mari is spoken was formed only relatively late, after the Mongol invasion in the High Middle Ages. If this is indeed correct, and Mari was spoken in more northern areas before that, it can be assumed that Pre-Mari had only sporadic contacts with the Iranian languages after it split off from Proto-Uralic.

Only in Permic (early loans; for later loans confined to Permic)

*a(č)wa ‘stallion’, PP *ju ‘awn’, *kertä ‘house’, *kärtä ‘metal’, *kada- ~ *gada- ‘to steal’, *karka ‘chicken’, *parśa ~ *barśa ‘mane’, *parta ‘knife’, *pertä ‘wing’, *poči- ‘to boil’, *porta ‘vessel’, *dura ‘long’, *domV ‘to tame’, PP *śumi̮s ‘band’, PP *šud‘luck’, *uška ‘bull’, *wi̮rna ‘wool’, *wirä ‘man, husband’, *äŋkärä ‘coal’

The number of loanwords in Permic is relatively high, and many of these can be considered to be Iranian loanwords. Technically many loans are ambiguous, but as some of the words were borrowed late due to historical reasons (‘iron’), and some were borrowed into a Pre-Permic which already had a phonological system that was different from Proto-Uralic (*šud- has d which cannot reflect PU *δ).

It is probable that the Permic languages were in continuous contact with the Indo-Iranian languages from the time they split from Proto-Uralic until the early mediaeval era.

Proto-Ugro-Samoyedic

Only in Khanty and Mansi (regular cases):

POUg *ēräɣ ‘song’, POUg *eträ ‘clear sky’, POug *mɔ̈ŋki ‘forest-spirit’, *ńātV- ‘to help’, *päčäɣ ‘reindeer’

The number of these etymologies is so low that it is very difficult to determine whether these words were borrowed into Proto-Ob-Ugric or some earlier proto-language, such as Proto-Ugric.

Only in Khanty and/or Mansi and/or Hungarian (regular cases):

*säptä ‘seven’ (Khanty + Hungarian regular), *sara ‘lake’

There are so few convincing loanwords with a “Ugric” distribution that they provide very little evidence. Either of these loans could be from Proto-Indo-Iranian or Proto-Iranian, if we assume that *s > *h was a common Iranian sound change. Both loans were acquired

Only in Samoyed:

*jäwi (> PS *jäə̑), PS *pulə̑ ~ *pi̮lə̑ ‘bridge’, *täjki ‘spear’, PS *wǝ̑rkə̑ ‘wolf’, Pre-S *täši (> PS *tät), *wakša- (> PS *wåtå) ‘to grow’

Of these, only *wåtå- has to be a very early loan because of *s > *t. *jäwi (> PS *jäə̑) and PS *wə̑rkə̑ were possibly acquired before the Proto-Samoyed vowel developments, making them probably early loanwords too. Formally all of them could be either from PII or PI. *pulə̑ ~ *pi̮lə̑ could have been borrowed into Proto-Samoyed (with Iranian *u corresponding to Samoyed *u), and because of the *l the word is probably from a relatively late, Middle Iranian language.

The following loanwords have a distribution with a cognate in both Samoyed and some other branch:

*śaδa- ‘to rain’, *tora- ‘to fight’ (also *itä-, which is more uncertain, belongs here)

Pan-Uralic loans

The following loanwords have a distribution with regular cognates with at least one Ugric branch and some other branch, which points to early borrowing. Although formally *kana- and *kara- are ambiguous, they are probably from Proto-Indo-Iranian because of their distribution. The rest of the loans are from Pre-II or PII.

*kana- ‘to dig’, *kara- ‘to dig’, *meti ‘honey’, *mekši ‘bee’, *orpV ‘orphan’, *ora ‘awl’, *peji- ‘to milk’, *pätäri- ‘to flee’, *śara- ‘shit’, *śoma- ‘sad’

The following loanwords are found in at least two non-adjacent branches of Uralic (the ones listed in the above categories are not counted). As there are no widely accepted criteria for a word to be considered “Uralic”, all of these could be considered loanwords into Proto-Uralic, in this case probably from Proto-Indo-Iranian or Pre-Indo-Iranian.

*ajša ‘shaft’, *anti/onta ‘grass’, *ertä ‘side’, *ki̮ntaw ‘tree stump’, *mertä ‘human’, *orja ‘slave’, *para ‘good’, *počaw ‘reindeer’, *puntaksi ‘bottom’, *saγi- ‘to get, obtain’, *repä ‘fox’, *si̮ŋka ‘old’, *sasara ‘sister’, *sejti ‘bridge’, *śišta ‘wax’, *tarna ‘grass’, *toraksi ‘through’, *wiša ‘venom’

12-bronze-age-middle-cultures

12-bronze-age-middle-uralic

Discussion about the distribution and its impact on Uralic taxonomy

(…) there are Proto-Iranian loanwords which were borrowed simultaneously into several early branches of Uralic, making it likely that Uralic had split into several branches by the time of these contacts.

Also the fact that many of the Proto-Indo-Iranian loanwords either show a restricted distribution (such as West Uralic *waćara, *woraći) or irregular correspondences (*asVra, *śasra, *śi̮ta) can point to the conclusion that Proto-Uralic was fragmenting by the time when contacts with Proto-Indo-Iranian took place.

The earlier, Pre-Indo-Iranian loanwords usually show a wider distribution and regular sound correspondences. Although the number of these earliest loans is quite small, based on their distribution and regular correspondences it can be assumed that the Pre-Indo-Iranian stage (after RUKI, *l > *r and the merger of velars and labiovelars but before the merger of non-high vowels) was concurrent with Proto-Uralic, with the changes leading to Proto-Indo-Iranian happening after the dispersal of Proto-Uralic.

The distribution of loanwords reinforces the old idea that Samoyed is a lexical outlier, as only few convincing Indo-Iranian etymologies for Proto-Uralic words (*saδa- ‘to rain’, *tora- ‘to fight’) have a convincing reflex in Samoyed. However, the fact that such etymologies exist means rather that the situation is due to lexical loss in Samoyed, and that the earliest contact occurred before Samoyed split off from Proto-Uralic.

There are very few loanwords that have a Ugric distribution (being found in at least one Ob-Ugric branch and Hungarian), and likewise rather few in Ob-Ugric. The few loans that have a distribution confined to Ugric were borrowed before the change *s > *θ took place. This means that the Ugric distribution does not mean much from the point of view of chronology or taxonomy, as the words were borrowed into a language that was still identical to Proto-Uralic. Even some loans borrowed into Khanty and Mansi have to be so early.

Impacts on dating and the location of the contact zones

Because of the very limited number of convincing etymologies found only in Finnic or Saami, it is probable that there were not (extensive) contacts with Pre-Finnic or Pre-Saami after the split of Proto-West Uralic.

The great number of loanwords of varying ages in Permic inevitably points to the conclusion that the pre-form of the Permic branch had been constantly spoken in an area that was adjacent to the Iranian languages. The different layers of loanwords in Permic clearly point to chronological differences in the donor languages, but it also seems that Permic was in contact with various forms of Iranian and not with different diachronic stages of the same language.

In general, the words that have been borrowed are typical cultural words, and the contacts between Indo-Iranian and Uralic seems to have been a typical contact situation in which a culturally less-advanced language group borrows various cultural terms from a more “advanced” group. The words in various loanword layers related to horse and cattle breeding show obvious cultural influence in the field of domesticated animals, and the borrowing of some names of grains points to agricultural influence from the Indo-Iranians on the speakers of Uralic.

Needless to say, many of the borrowings I listed in A Song of Sheep and Horses suffer from the same ailment attributed to Indo-Europeanists in general:

With slight exaggeration one can agree with the remark by Koivulehto (1999a: 209–210) that the Indo-Europeanists often use outdated sources or are simply uninterested in the topic. The problem is further complicated by the various and often obsolete views expressed in even relatively modern Uralicist works, such as those of Rédei (1986c; 1988) or Katz (2003); (…) Mallory & Adams (2006) adequately refer to the importance of the early loanwords, but they use mostly Rédei’s outdated reconstructions and stratigraphy in support of their theories.

I need to review all related texts with this thesis and the works recently published by Kümmel, as well as the recent book of the Leiden school on Indo-Uralic.

Also, does anyone know the (traditional?) why of the resistance to the Indo-Uralic concept among Uralicists? Maybe it’s a reaction against the Nostraticist and Siberian views of Uralic espoused by the Soviets?

Related

Samoyedic shows Yeniseic substrate; both influenced Tocharian

chalcolithic-late-tocharian

Open access paper The deviant typological profile of the Tocharian branch of Indo-European may be due to Uralic substrate influence by Peyrot, Indo-European Linguistics (2019).

NOTE. This seems to be part of the master’s thesis by Abel Warries, but the paper is authored only by Peyrot.

Interesting excerpts (emphasis mine):

1. The stop system

The loss in Tocharian of the Proto-Indo-European obstruent distinctions conventionally noted as voice and aspiration is a very strong indication of foreign influence. Since Proto-Indo-European roots mostly have at least one stop, and often two, the merger of all three stop series into one must have led to massive homonymy and subsequently to heavy restructuring of the lexicon. It is difficult to see how these changes could be motivated language-internally.

tocharian-consonants

It is this innovative typological feature of Tocharian that is the strongest indication of Uralic influence (cf. e.g. Bednarczuk 2015:56). A single stop series as found in Tocharian is reconstructed for Proto-Uralic as well as for Proto-Samoyedic, while other possibly relevant languages all show a system with a contrast between voiced and unvoiced stops, i.e. Proto-Yeniseian, Old Iranian and Yukaghir, or, in Proto-Turkic, a contrast between strong and weak obstruents (see also below).

For Proto-Uralic, Janhunen (1982:23) reconstructs the following obstruents: *k, *c, *t, *p; *δ, *δ´; and *ś, *s. With the development of *s to *t, *ś to *s, *δ to *r and *δ´ to *j, the Proto-Samoyedic obstruent system had become: *k, *c, *t, *p, *s (a secondary *ś arose later). The Tocharian obstruent system is much closer to both these reconstructed obstruent systems than to the Proto-Indo-European system that is commonly assumed.

tocharian-uralic-consonants

Interestingly, from the perspective of a two-velar series reconstructed for the parent Late Proto-Indo-European, Tocharian shows thus a satemization trend and Uralic influence similar to (but qualitatively different than) the one seen in Balto-Slavic and Indo-Iranian, probably due to the less marked population replacement evidenced by the continuity of Afanasievo-related ancestry among Iron Age Common Tocharians.

2. The vowel system

(…) the development of the Tocharian vowel system can be understood very well in light of a South Siberian vowel system today represented by the Yeniseian language Ket. This South Siberian vowel system is different from both the Proto-Tocharian and the Proto-Uralic and Proto-Samoyedic vowel systems. However, a successful comparison is possible when intermediate phases are taken into account: a Pre-Proto-Tocharian phase between Proto-Indo-European and Proto-Tocharian; and a Pre-Proto-Samoyedic phase between Proto-Uralic and Proto-Samoyedic. For a Pre-Proto-Tocharian phase, a vowel system identical to that of Ket can be reconstructed. For Proto-Samoyedic, several different reconstructions of the vowel system have been proposed. Depending on which reconstruction turns out to be correct, a Pre-Proto-Samoyedic vowel system can be reconstructed that is close to the Ket system or perhaps even identical to it.

The basic vowel changes from Proto-Indo-European to Proto-Tocharian are the following (Ringe 1996; Hackstein 2017):

tocharian-vowel-system

It is the seven-vowel system of Pre-Proto-Tocharian stage 5 above that is structurally identical to the South Siberian system represented by Ket. According to Vajda (2004:5), Ket ɨ and ə are further back than IPA central [ɨ] and [ə], but not as far back as the unrounded back vowels [ɯ] and [ɤ] of IPA. The allophonic variation in the mid vowels e, ə, o is correlated with tone: they are pronounced as high-mid [e, ə, o] with high-even tone, and as low-mid [ɛ, ʌ, ɔ] elsewhere (Vadja l.c.).

Obviously, this parallel with Ket can only be meaningful for Tocharian linguistic prehistory if the same vowel system can be reconstructed for earlier stages. Indeed, Vajda assumes an original Pre-Proto-Yeniseian five-vowel system with i, a, ʌ, o, u that was in Common Yeniseian enlarged with *e and *ɨ (2010:78–79).

tocharian-ket-siberian-vowel

Pre-Proto-Samoyedic

Of the eleven vowels reconstructed for Proto-Samoyedic by Janhunen and Sammallahti, the following arose in the course of Pre-Proto-Samoyedic:

  • *ö is rare and was clearly added at a late stage;
  • *ü arose secondarily, amongst others from PU *i, while PU *ü changed to PSam. *i;
  • *ä arose secondarily, while PU *ä changed to PSam. *e;
  • *ə in first syllables, or back *ə̑ and front *ə̈, arose secondarily from *u and *i.

Since these four vowels arose secondarily, the following seven-vowel system can be assumed for a very early stage of Pre-Proto-Samoyedic. This system is structurally identical to the system of Ket and to that reconstructed for Pre-Proto-Tocharian:

pre-proto-samoyedic-tocharian-vowels

The vowel system of Ket, which has also been reconstructed for Pre-Proto-Tocharian, and which may possibly be reconstructed for Pre-Proto-Samoyedic as well, has a further parallel in Siberia: it is very close to that reconstructed for Proto-Yukaghir by Nikolaeva (2006:57).

It is attractive to think that the imbalances of the Yukaghir vowel system and vowel harmony reflect the adaptation of an original system with front rounded *ü and *ö to a system very similar to that seen in Yeniseian, Pre-Proto-Samoyedic and Pre-Proto-Tocharian.

ket-substrate-river-names
Location of contemporary speakers of Ket (shown in black) and of Yeniseian groups in
1600 as well as Yeniseian substrate river names (marked by labels such as -ses)

3. Agglutinative case marking and case functions

Although other Indo-European languages also occasionally show agglutinative case markers, one of the most striking typological characteristics of Tocharian are the agglutinative so-called “secondary” cases. It is obvious that for such a major shift in language type substrate influence must be considered as a serious option.

The key to identifying the model of the Tocharian case system is to be found in the functions of the cases. On the functional level, the Tocharian case system shows the following non-Indo-European peculiarities: it lacks a dative, whose functions are fulfilled by the genitive; and it has a local case termed “perlative” which denotes movement along, through or over something, as well as a comitative case denoting accompaniment.

Another interesting functional phenomenon is the lack of a dative in Tocharian. Here the best match is offered by Uralic, where nominative, accusative and genitive are generally analysed as being the “grammatical cases,” while the remaining cases are the “local cases.”

Tocharian, in spite of its comitative, agrees better with the Samoyedic case system than with the more elaborate sets of e.g. Finnish and Hungarian: there is no inessive : adessive or ablative : elative contrast. The Ket system, too, is more elaborate than the Tocharian set.

Evaluation and interpretation of the parallels

I consider the evidence from the stop system (§ 2.1), the vowel system (§ 2.2) and the agglutinative case system (§ 2.3) as the strongest indications of language contact. The Tocharian stop system with only voiceless stops is the best evidence for Uralic influence. The vowel system shows neat parallels with Yeniseian and Pre-Proto-Samoyedic. Taken together, this suggests that the Uralic variety with which Tocharian was in contact was a form of Pre-Proto-Samoyedic. Agglutinative case systems are widely found in Siberia and Eastern Central Asia, but the case functions, in particular the Tocharian perlative, best match Uralic and comparable systems in South Siberia.

The perlative is the strongest indication of Siberian, and most probably Uralic or Pre-Proto-Samoyedic influence. A similar local case is widely found across Uralic and in Samoyedic, and also in Yukaghir and Ket, but not in Turkic.

bronze-age-early-chemurchek

bronze-age-early-tocharian

Conclusion

The author ends by trying to fit the relative chronology of a Samoyedic and Tocharian spread from the Cis-Urals with the ideas set forth (mainly) by the Copenhagen group, with which he has participated in the past interpreting their results from a linguistic perspective. Hence the difficulties in finding potentially fitting settings to the proposed contacts.

I think it is self-evident that the push of the Abashevo-related, Seima-Turbino-mediated Andronovo-like cultural horizon perfectly explains the expansion of Pre-Proto-Samoyedic into the previous Afanasievo territory, strongly influencing the Chemurchek and related populations that most likely represent the evolving Tocharian-speaking community.

My recent video of expansion of Indo-Europeans illustrates this linguistic evolution quite accurately (although, admittedly, I didn’t put much effort in the maps to follow Palaeosiberian languages).

bronze-age-middle-chemurchek

bronze-age-middle-tocharian

The finding of Afanasievo-like population in Iron Age Tian Shan, including haplogroup replacement (among them hg. R1a-M417, proper of Disintegrating Uralians), as well as the finding of R1a-Z645 subclades up to the Deer Stone-Khirigsuur Complex in northern Mongolia, confirms this setting of a sudden expansion of (originally) Uralic-speaking populations through northern Eurasia up to Lake Baikal, disrupting the Afanasievo-like Tocharian-speaking community.

Similarly, the highly divergent genetic make-up of the Samoyedic population relative to other Uralic groups is consistent with the dilution of their typically Uralic Corded Ware ancestry among Siberian populations, on top of the multiple acculturation events of traditionally multilingual North Siberian populations (especially among Northern Samoyeds, similar to other Circum-Arctic groups).

This paper is not the first, and certainly not the last to confirm strong language contacts between Uralic and Indo-European dialects with the previous native speakers of Siberia, such as Palaeosiberians and Altaic peoples, causing the aberrant (but seemingly closely related) traits of Samoyedic and Tocharian, proper of European languages introduced into an area foreign to Indo-Uralic languages.

Related

Corded Ware ancestry in North Eurasia and the Uralic expansion

uralic-clines-nganasan

Now that it has become evident that Late Repin (i.e. Yamnaya/Afanasevo) ancestry was associated with the migration of R1b-L23-rich Late Proto-Indo-Europeans from the steppe in the second half of the the 4th millennium BC, there’s still the question of how R1a-rich Uralic speakers of Corded Ware ancestry expanded , and how they spread their languages throughout North Eurasia.

Modern North Eurasians

I have been collecting information from the supplementary data of the latest papers on modern and ancient North Eurasian peoples, including Jeong et al. (2019), Saag et al. (2019), Sikora et al. (2018), or Flegontov et al. (2019), and I have tried to add up their information on ancestral components and their modern and historical distributions.

Fortunately, the current obsession with simplifying ancestry components into three or four general, atemporal groups, and the common use of the same ones across labs, make it very simple to merge data and map them.

Corded Ware ancestry

There is no doubt about the prevalent ancestry among Uralic-speaking peoples. A map isn’t needed to realize that, because ancient and modern data – like those recently summarized in Jeong et al. (2019) – prove it. But maps sure help visualize their intricate relationship better:

natural-modern-srubnaya-ancestry
Natural neighbor interpolation of Srubnaya ancestry among modern populations. See full map.
kriging-modern-srubnaya-ancestry
Kriging interpolation of Srubnaya ancestry among modern populations. See full map

Interestingly, the regions with higher Corded Ware-related ancestry are in great part coincident with (pre)historical Finno-Ugric-speaking territories:

uralic-languages-modern
Modern distribution of Uralic languages, with ancient territory (in the Common Era) labelled and delimited by a red line. For more information on the ancient territory see here.

Edit (29/7/2019): Here is the full Steppe_MLBA ancestry map, including Steppe_MLBA (vs. Indus Periphery vs. Onge) in modern South Asian populations from Narasimhan et al. (2018), apart from the ‘Srubnaya component’ in North Eurasian populations. ‘Dummy’ variables (with 0% ancestry) have been included to the south and east of the map to avoid weird interpolations of Steppe_MLBA into Africa and East Asia.

modern-steppe-mlba-ancestry2
Natural neighbor interpolation of Steppe MLBA-like ancestry among modern populations. See full map.

Anatolia Neolithic ancestry

Also interesting are the patterns of non-CWC-related ancestry, in particular the apparent wedge created by expanding East Slavs, which seems to reflect the intrusion of central(-eastern) European ancestry into Finno-Permic territory.

NOTE. Read more on Balto-Slavic hydrotoponymy, on the cradle of Russians as a Finno-Permic hotspot, and about Pre-Slavic languages in North-West Russia.

natural-modern-lbk-en-ancestry
Natural neighbor interpolation of LBK EN ancestry among modern populations. See full map.
kriging-modern-lbk-en-ancestry
Kriging interpolation of LBK EN ancestry among modern populations. See full map

WHG ancestry

The cline(s) between WHG, EHG, ANE, Nganasan, and Baikal HG are also simplified when some of them excluded, in this case EHG, represented thus in part by WHG, and in part by more eastern ancestries (see below).

modern-whg-ancestry
Natural neighbor interpolation of WHG ancestry among modern populations. See full map.
kriging-modern-whg-ancestry
Kriging interpolation of WHG ancestry among modern populations. See full map.

Arctic, Tundra or Forest-steppe?

Data on Nganasan-related vs. ANE vs. Baikal HG/Ulchi-related ancestry is difficult to map properly, because both ancestry components are usually reported as mutually exclusive, when they are in fact clearly related in an ancestral cline formed by different ancient North Eurasian populations from Siberia.

When it comes to ascertaining the origin of the multiple CWC-related clines among Uralic-speaking peoples, the question is thus how to properly distinguish the proportions of WHG-, EHG-, Nganasan-, ANE or BaikalHG-related ancestral components in North Eurasia, i.e. how did each dialectal group admix with regional groups which formed part of these clines east and west of the Urals.

The truth is, one ought to test specific ancient samples for each “Siberian” ancestry found in the different Uralic dialectal groups, but the simplistic “Siberian” label somehow gets a pass in many papers (see a recent example).

Below qpAdm results with best fits for Ulchi ancestry, Afontova Gora 3 ancestry, and Nganasan ancestry, but some populations show good fits for both and with similar proportions, so selecting one necessarily simplifies the distribution of both.

Ulchi ancestry

modern-ulchi-ancestry
Natural neighbor interpolation of Ulchi ancestry among modern populations. See full map.
kriging-modern-ulchi-ancestry
Kriging interpolation of Ulchi ancestry among modern populations. See full map.

ANE ancestry

natural-modern-ane-ancestry
Natural neighbor interpolation of ANE ancestry among modern populations. See full map.
kriging-modern-ane-ancestry
Kriging interpolation of ANE ancestry among modern populations. See full map.

Nganasan ancestry

modern-nganasan-ancestry
Natural neighbor interpolation of Nganasan ancestry among modern populations. See full map.
kriging-modern-nganasan-ancestry
Kriging interpolation of Nganasan ancestry among modern populations. See full map.

Iran Chalcolithic

A simplistic Iran Chalcolithic-related ancestry is also seen in the Altaic cline(s) which (like Corded Ware ancestry) expanded from Central Asia into Europe – apart from its historical distribution south of the Caucasus:

modern-iran-chal-ancestry
Natural neighbor interpolation of Iran Neolithic ancestry among modern populations. See full map.
kriging-modern-iran-neolithic-ancestry
Kriging interpolation of Iran Chalcolithic ancestry among modern populations. See full map.

Other models

The first question I imagine some would like to know is: what about other models? Do they show the same results? Here is the simplistic combination of ancestry components published in Damgaard et al. (2018) for the same or similar populations:

NOTE. As you can see, their selection of EHG vs. WHG vs. Nganasan vs. Natufian vs. Clovis of is of little use, but corroborate the results from other papers, and show some interesting patterns in combination with those above.

EHG

damgaard-modern-ehg-ancestry
Natural neighbor interpolation of EHG ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-ehg-ancestry
Kriging interpolation of EHG ancestry among modern populations. See full map.

Natufian ancestry

damgaard-modern-natufian-ancestry
Natural neighbor interpolation of Natufian ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-natufian-ancestry
Kriging interpolation of Natufian ancestry among modern populations. See full map.

WHG ancestry

damgaard-modern-whg-ancestry
Natural neighbor interpolation of WHG ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-whg-ancestry
Kriging interpolation of WHG ancestry among modern populations. See full map.

Baikal HG ancestry

damgaard-modern-baikalhg-ancestry
Natural neighbor interpolation of Baikal hunter-gatherer ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-baikal-hg-ancestry
Kriging interpolation of Baikal HG ancestry among modern populations. See full map.

Ancient North Eurasians

Once the modern situation is clear, relevant questions are, for example, whether EHG-, WHG-, ANE, Nganasan-, and/or Baikal HG-related meta-populations expanded or became integrated into Uralic-speaking territories.

When did these admixture/migration events happen?

How did the ancient distribution or expansion of Palaeo-Arctic, Baikalic, and/or Altaic peoples affect the current distribution of the so-called “Siberian” ancestry, and of hg. N1a, in each specific population?

NOTE. A little excursus is necessary, because the calculated repetition of a hypothetic opposition “N1a vs. R1a” doesn’t make this dichotomy real:

  1. There was not a single ethnolinguistic community represented by hg. R1a after the initial expansion of Eastern Corded Ware groups, or by hg. N1a-L392 after its initial expansion in Siberia:
  2. Different subclades became incorporated in different ways into Bronze Age and Iron Age communities, most of which without an ethnolinguistic change. For example, N1a subclades became incorporated into North Eurasian populations of different languages, reaching Uralic- and Indo-European-speaking territories of north-eastern Europe during the late Iron Age, at a time when their ancestral origin or language in Siberia was impossible to ascertain. Just like the mix found among Proto-Germanic peoples (R1b, R1a, and I1)* or among Slavic peoples (I2a, E1b, R1a)*, the mix of many Uralic groups showing specific percentages of R1a, N1a, or Q subclades* reflect more or less recent admixture or acculturation events with little impact on their languages.

*other typically northern and eastern European haplogroups are also represented in early Germanic (N1a, I2, E1b, J, G2), Slavic (I1, G2, J) and Finno-Permic (I1, R1b, J) peoples.

ananino-culture-new
Map of archaeological cultures in north-eastern Europe ca. 8th-3rd centuries BC. [The Mid-Volga Akozino group not depicted] Shaded area represents the Ananino cultural-historical society. Fading purple arrows represent likely stepped movements of subclades of haplogroup N for centuries (e.g. Siberian → Ananino → Akozino → Fennoscandia [N-VL29]; Circum-Arctic → forest-steppe [N1, N2]; etc.). Blue arrows represent eventual expansions of Uralic peoples to the north. Modified image from Vasilyev (2002).

The problem with mapping the ancestry of the available sampling of ancient populations is that we lack proper temporal and regional transects. The maps that follow include cultures roughly divided into either “Bronze Age” or “Iron Age” groups, although the difference between samples may span up to 2,000 years.

NOTE. Rough estimates for more external groups (viz. Sweden Battle Axe/Gotland_A for the NW, Srubna from the North Pontic area for the SW, Arctic/Nganasan for the NE, and Baikal EBA/”Ulchi-like” for the SE) have been included to offer a wider interpolated area using data already known.

Bronze Age

Similar to modern populations, the selection of best fit “Siberian” ancestry between Baikal HG vs. Nganasan, both potentially ± ANE (AG3), is an oversimplification that needs to be addressed in future papers.

Corded Ware ancestry

bronze-age-corded-ware-ancestry
Natural neighbor interpolation of Srubnaya ancestry among Bronze Age populations. See full map.

Nganasan-like ancestry

bronze-age-nganasan-like-ancestry
Natural neighbor interpolation of Nganasan-like ancestry among Bronze Age populations. See full map.

Baikal HG ancestry

bronze-age-baikal-hg-ancestry
Natural neighbor interpolation of Baikal Hunter-Gatherer ancestry among Bronze Age populations. See full map.

Afontova Gora 3 ancestry

bronze-age-afontova-gora-ancestry
Natural neighbor interpolation of Afontova Gora 3 ancestry among Bronze Age populations. See full map.

Iron Age

Corded Ware ancestry

Interestingly, the moderate expansion of Corded Ware-related ancestry from the south during the Iron Age may be related to the expansion of hg. N1a-VL29 into the chiefdom-based system of north-eastern Europe, including Ananyino/Akozino and later expanding Akozino warrior-traders around the Baltic Sea.

NOTE. The samples from Levänluhta are centuries older than those from Estonia (and Ingria), and those from Chalmny Varre are modern ones, so this region has to be read as a south-west to north-east distribution from the Iron Age to modern times.

iron-age-corded-ware-ancestry
Natural neighbor interpolation of Srubnaya ancestry among Iron Age populations. See full map.

Baikal HG-like ancestry

The fact that this Baltic N1a-VL29 branch belongs in a group together with typically Avar N1a-B197 supports the Altaic origin of the parent group, which is possibly related to the expansion of Baikalic ancestry and Iron Age nomads:

iron-age-baikal-ancestry
Natural neighbor interpolation of Baikal HG ancestry among Iron Age populations. See full map.

Nganasan-like ancestry

The dilution of Nganasan-like ancestry in an Arctic region featuring “Siberian” ancestry and hg. N1a-L392 at least since the Bronze Age supports the integration of hg. N1a-Z1934, sister clade of Ugric N1a-Z1936, into populations west and east of the Urals with the expansion of Uralic languages to the north into the Tundra region (see here).

The integration of N1a-Z1934 lineages into Finnic-speaking peoples after their migration to the north and east, and the displacement or acculturation of Saami from their ancestral homeland, coinciding with known genetic bottlenecks among Finns, is yet another proof of this evolution:

iron-age-nganasan-ancestry
Natural neighbor interpolation of Nganasan ancestry among Iron Age populations. See full map.

WHG ancestry

Similarly, WHG ancestry doesn’t seem to be related to important population movements throughout the Bronze Age, which excludes the multiple North Eurasian populations that will be found along the clines formed by WHG, EHG, ANE, Nganasan, Baikal HG ancestry as forming part of the Uralic ethnogenesis, although they may be relevant to follow later regional movements of specific populations.

iron-age-whg-ancestry
Natural neighbor interpolation of WHG ancestry among Iron Age populations. See full map.

Conclusion

It seems natural that people used to look at maps of haplogroup distribution from the 2000s, coupled with modern language distributions, and would try to interpret them in a certain way, reaching thus the wrong conclusions whose consequences are especially visible today when ancient DNA keeps contradicting them.

In hindsight, though, assuming that Balto-Slavs expanded with Corded Ware and hg. R1a, or that Uralians expanded with “Siberian” ancestry and hg. N1a, was as absurd as looking at maps of ancestry and haplogroup distribution of ancient and modern Native Americans, trying to divide them into “Germanic” or “Iberian”…

The evolution of each specific region and cultural group of North Eurasia is far from being clear. However, the general trend speaks clearly in favour of an ancient, Bronze Age distribution of North Eurasian ancestry and haplogroups that have decreased, diluted, or become incorporated into expanding Uralians of Corded Ware ancestry, occasionally spreading with inter-regional expansions of local groups.

Given the relatively recent push of Altaic and Indo-European languages into ancestral Uralic-speaking territories, only the ancient Corded Ware expansion remains compatible with the spread of Uralic languages into their historical distribution.

Related

A Song of Sheep and Horses, revised edition, now available as printed books

cover-song-sheep-and-horses

As I said 6 months ago, 2019 is a tough year to write a blog, because this was going to be a complex regional election year and therefore a time of political promises, hence tenure offers too. Now the preliminary offers have been made, elections have passed, but the timing has slightly shifted toward 2020. So I may have the time, but not really any benefit of dedicating too much effort to the blog, and a lot of potential benefit of dedicating any time to evaluable scientific work.

On the other hand, I saw some potential benefit for publishing texts with ISBNs, hence the updates to the text and the preparation of these printed copies of the books, just in case. While Spain’s accreditation agency has some hard rules for becoming a tenured professor, especially for medical associates (whose years of professional experience are almost worthless compared to published peer-reviewed papers), it is quite flexible in assessing one’s merits.

However, regional and/or autonomous entities are not, and need an official identifier and preferably printed versions to evaluate publications, such as an ISBN for books. I took thus some time about a month ago to update the texts and supplementary materials, to publish a printed copy of the books with Amazon. The first copies have arrived, and they look good.

series-song-sheep-horses-cover

Corrections and Additions

Titles
I have changed the names and order of the books, as I intended for the first publication – as some of you may have noticed when the linguistic book was referred to as the third volume in some parts. In the first concept I just wanted to emphasize that the linguistic work had priority over the rest. Now the whole series and the linguistic volume don’t share the same name, and I hope this added clarity is for the better, despite the linguistic volume being the third one.

Uralic dialects
I have changed the nomenclature for Uralic dialects, as I said recently. I haven’t really modified anything deeper than that, because – unlike adding new information from population genomics – this would require for me to do a thorough research of the most recent publications of Uralic comparative grammar, and I just can’t begin with that right now.

Anyway, the use of terms like Finno-Ugric or Finno-Samic is as correct now for the reconstructed forms as it was before the change in nomenclature.

west-east-uralic-schema

Mediterranean
The most interesting recent genetic data has come from Iberia and the Mediterranean. Lacking direct data from the Italian Peninsula (and thus from the emergence of the Etruscan and Rhaetian ethnolinguistic community), it is becoming clearer how some quite early waves of Indo-Europeans and non-Indo-Europeans expanded and shrank – at least in West Iberia, West Mediterranean, and France.

Finno-Ugric
Some of the main updates to the text have been made to the sections on Finno-Ugric populations, because some interesting new genetic data (especially Y-DNA) have been published in the past months. This is especially true for Baltic Finns and for Ugric populations.

ananino-culture-new

Balto-Slavic
Consequently, and somehow unsurprisingly, the Balto-Slavic section has been affected by this; e.g. by the identification of Early Slavs likely with central-eastern populations dominated by (at least some subclades of) hg. I2a-L621 and E1b-V13.

Maps
I have updated some cultural borders in the prehistoric maps, and the maps with Y-DNA and mtDNA. I have also added one new version of the Early Bronze age map, to better reflect the most likely location of Indo-European languages in the Early European Bronze Age.

As those in software programming will understand, major changes in the files that are used for maps and graphics come with an increasing risk of additional errors, so I would not be surprised if some major ones would be found (I already spotted three of them). Feel free to communicate these errors in any way you see fit.

bronze-age-early-indo-european
European Early Bronze Age: tentative langage map based on linguistics, archaeology, and genetics.

SNPs
I have selected more conservative SNPs in certain controversial cases.

I have also deleted most SNP-related footnotes and replaced them with the marking of each individual tentative SNP, leaving only those footnotes that give important specific information, because:

  • My way of referencing tentative SNP authors did not make it clear which samples were tentative, if there were more than one.
  • It was probably not necessary to see four names repeated 100 times over.
  • Often I don’t really know if the person I have listed as author of the SNP call is the true author – unless I saw the full SNP data posted directly – or just someone who reposted the results.
  • Sometimes there are more than one author of SNPs for a certain sample, but I might have added just one for all.
ancient-dna-all
More than 6000 ancient DNA samples compiled to date.

For a centralized file to host the names of those responsible for the unofficial/tentative SNPs used in the text – and to correct them if necessary -, readers will be eventually able to use Phylogeographer‘s tool for ancient Y-DNA, for which they use (partly) the same data I compiled, adding Y-Full‘s nomenclature and references. You can see another map tool in ArcGIS.

NOTE. As I say in the text, if the final working map tool does not deliver the names, I will publish another supplementary table to the text, listing all tentative SNPs with their respective author(s).

If you are interested in ancient Y-DNA and you want to help develop comprehensive and precise maps of ancient Y-DNA and mtDNA haplogroups, you can contact Hunter Provyn at Phylogeographer.com. You can also find more about phylogeography projects at Iain McDonald’s website.

Graphics
I have also added more samples to both the “Asian” and the “European” PCAs, and to the ADMIXTURE analyses, too.

I previously used certain samples prepared by amateurs from BAM files (like Botai, Okunevo, or Hittites), and the results were obviously less than satisfactory – hence my criticism of the lack of publication of prepared files by the most famous labs, especially the Copenhagen group.

Fortunately for all of us, most published datasets are free, so we don’t have to reinvent the wheel. I criticized genetic labs for not releasing all data, so now it is time for praise, at least for one of them: thank you to all responsible at the Reich Lab for this great merged dataset, which includes samples from other labs.

NOTE. I would like to make my tiny contribution here, for beginners interested in working with these files, so I will update – whenever I have time – the “How To” sections of this blog for PCAs, PCA3d, and ADMIXTURE.

-iron-age-europe-romans
Detail of the PCA of European Iron Age populations. See full versions.

ADMIXTURE
For unsupervised ADMIXTURE in the maps, a K=5 is selected based on the CV, giving a kind of visual WHG : NWAN : CHG/IN : EHG : ENA, but with Steppe ancestry “in between”. Higher K gave worse CV, which I guess depends on the many ancient and modern samples selected (and on the fact that many samples are repeated from different sources in my files, because I did not have time to filter them all individually).

I found some interesting component shared by Central European populations in K=7 to K=9 (from CEU Bell Beakers to Denmark LN to Hungarian EBA to Iberia BA, in a sort of “CEU BBC ancestry” potentially related to North-West Indo-Europeans), but still, I prefer to go for a theoretically more correct visualization instead of cherry-picking the ‘best-looking’ results.

Since I made fun of the search for “Siberian ancestry” in coloured components in Tambets et al. 2018, I have to be consistent and preferred to avoid doing the same here…

qpAdm
In the first publication (in January) and subsequent minor revisions until March, I trusted analyses and ancestry estimates reported by amateurs in 2018, which I used for the text adding my own interpretations. Most of them have been refuted in papers from 2019, as you probably know if you have followed this blog (see very recent examples here, here, or here), compelling me to delete or change them again, and again, and again. I don’t have experience from previous years, although the current pattern must have been evidently repeated many times over, or else we would be still talking about such previous analyses as being confirmed today…

I wanted to be one step ahead of peer-reviewed publications in the books, but I prefer now to go for something safe in the book series, rather than having one potentially interesting prediction – which may or may not be right – and ten huge mistakes that I would have helped to endlessly redistribute among my readers (online and now in print) based on some cherry-picked pairwise comparisons. This is especially true when predictions of “Steppe“- and/or “Siberian“-related ancestry have been published, which, for some reason, seem to go horribly wrong most of the time.

I am sure whole books can be written about why and how this happened (and how this is going to keep happening), based on psychology and sociology, but the reasons are irrelevant, and that would be a futile effort; like writing books about glottochronology and its intermittent popularity due to misunderstood scientist trends. The most efficient way to deal with this problem is to avoid such information altogether, because – as you can see in the current revised text – they wouldn’t really add anything essential to the content of these books, anyway.

Continue reading

Official site of the book series:
A Song of Sheep and Horses: eurafrasia nostratica, eurasia indouralica

Genetic continuity among Uralic-speaking cultures in north-eastern Europe

east-europe-bronze-age

The recent study of Estonian Late Bronze Age/Iron Age samples has shown, as expected, large genetic continuity of Corded Ware populations in the East Baltic area, where West Uralic is known to have been spoken since at least the Early Bronze Age.

The most interesting news was that, unexpectedly for many, the impact of “Siberian ancestry” (whatever that actually means) was small, slow, and gradual, with slight increases found up to the Middle Ages, compatible with multiple contact events in north-eastern Europe. Haplogroup N became prevalent among Finnic populations only through late bottlenecks, as research of modern populations have long suggested, and as ancient DNA research hinted since at least 2015.

I risked to correlate the arrival of chiefs from the south-west with the infiltration of N1c-VL29 subclades during the transition to the Iron Age, coupled with that minimal “Siberian” ancestry (see e.g. here and here). Now we know that the penetration of this non-CW ancestry started, as predicted, in the Iron Age; that it was highly variable in the few samples where it appeared, with ca. 1-4%, while most Iron Age individuals show 0%; and that it was not especially linked to individuals of N1c-Vl29 lineages.

It is also basically confirmed, based on the (ancient and Modern Swedish) N1c-L550 subclades found among Iron Age Estonians, that N1c-VL29 lineages and the so-called “Siberian” ancestry will be found simultaneously around the Baltic coastal areas, and that different lineages must have suffered later founder effects among Finns, which suggests that these alliances through exogamy brought exactly as much language change in Sweden, Lithuania, or Poland, as they did in the East Baltic region…

On the other hand, the paper has also shown a potential movement of Corded Ware-derived peoples, if the change from LBA to IA samples is meaningful; in fact, even more Corded Ware-like than Baltic and Estonian BA populations. The exact origin of that movement is difficult to pinpoint, and it may not be related to the arrival of Akozino warrior-traders from the south-east, since theirs seems to be a minor impact proper of elites in a chiefdom system around the Baltic.

fortified-settlements-lba-ia
Distribution of fortified settlements (filled circles) and other hilltop sites (empty circles) of the Late Bronze Age and Pre-Roman Iron Ages in the East Baltic region. Tentative area of most intensive contacts between Baltic and Balto-Finnic communities marked with a dashed line. Image modified from (Lang 2016).

Also suggesting a potential movement is the ‘southern’ shift observed in the West and East Baltic areas, likely showing the arrival of Proto-East Baltic speakers (such as the Trzciniec outlier), as we have already discussed in this blog. The unexpected increase in Corded Ware-like ancestry in the Eastern Baltic, coupled with the expected large continuity of hg. R1a-Z283 in the homeland of Balto-Finnic expansions, gives even more support to the known complex system of exogamy along the Baltic coasts, and offers another potential reason for the rise of Baltic-speaking territories in the West Baltic: elite domination.

It is nevertheless important to understand that, even among the most “genetic continuous” regions like Estonia, not a single population in Europe is heir of some ancestral, immutable people. Not in terms of haplogroups, and not in terms of admixture. Balto-Finnic speakers, however continuous they might seem (e.g. in Southern Estonians) aren’t an exception.

After all, this blog was (re)born to fight the currently prevalent sheer stupidity surrounding the simplistic “R1a/steppe ancestry=Indo-European” association, so I wouldn’t like to see it replaced with some other stupid continuity or purity ideas within 10 to 20 years…

Late Uralic stems from East Corded Ware groups

With the currently available tools – linguistics, archaeology, and now genetics -, I don’t think there is any argument to date to question the direct connection of the Late Proto-Uralic expansion with all Eastern Corded Ware groups (i.e. Battle Axe, Fatyanovo-Balanovo, and Abashevo), and thus at least with the unifying A-horizon of Corded Ware and the bottlenecks under R1a-Z645.

NOTE. The only out-group among Corded Ware cultures is the Single Grave culture. It appears to be an early Corded Ware offshoot, reflected in their non-unitary cultural traits (distinct from later unifying waves), in their varied patrilineal clans, and in the short-lasting cultural effect in northern Europe before their complete demise under pressure of expanding Yamna/Bell Beaker peoples from the Danube. The culture’s minimal (if any) effects on succeeding peoples might be seen mostly in the (mainly phonetic) Uralic substrate found in Balto-Slavic – although this may also stem from a more eastern influence, close to the Baltic – and in the contacts of Celtic with Uralic. The huge time depth between this early hypothetic Uralic layer in northern Europe and the emergence of peoples inhabiting these territories in recorded history have no doubt been erroneously interpreted as a lack of Uralic presence in the area.

1) That connection was evident in the Yamna – CWC differences in archaeology, and especially later, with at least Fatyanovo-Balanovo and Abashevo representing the obvious replacement of the Volosovo culture before further expansions of CWC-related groups west and east of the Urals.

The mythical millennia-long continuity of Volosovo hunter-gatherers, including centuries among Corded Ware peoples, as expected lately by the Copenhagen group (and anyone who doesn’t want to question the 1960s association of Indo-European with CWC) must be rejected today in population genomics, as the recent studies of ancient and modern populations show, and as ancient DNA from the region will confirm.

2) In linguistics, the survival of Volosovo as The Uralic-speaking culture was also hardly believable. From Kallio (2015):

While we can say at least something about Uralic substrates in Northeastern Europe, non-Uralic substrates cannot at all easily be identified, because of multiple language shifts, viz. first from non-Uralic to Uralic and then from Uralic to Russian. Yet the Soviet Uralicist Boris Serebrennikov (1956, 1959) argued that there are some non-Uralic substrate toponyms in the Volga-Oka region, but his idea was never taken seriously in the west (cf. Sauvageot 1958), and it pretty soon also sank into oblivion in Russia, even though it can still occasionally pop up there in non-onomastic circles (cf. Napolskikh 1995: 18–19). However, not all the hypotheses on non-Uralic substrates in Northeastern Europe should be rejected (see e.g. Helimski 2001b).

bronze-age-early-languages-east-europe
Tentative map of the distribution of known languages in Eastern Europe during the Early Bronze Age. See full map.

Helimski (2001) argues for a non-Uralic topo-hydronomy in Northern Russia, whose population may have kept their languages up to the Common Era despite the Corded Ware expansion, which is in line with the survival of some non-Indo-European languages everywhere in Europe after the expansion of Yamna and its offshoots:

It should be borne in mind that these [Uralic] hydronyms reached us mainly through Northern Russian and, accordingly, with a tendency to phonetic-morphological adaptation and unification (for river names it is “natural” to be, like the word ‘river’ itself, feminine and to end in -a). Taking into account this circumstance, it may turn out to be non-useless for etymological identification of at least some of the hydronyms on the Finno-Ugric basis.

On the other hand, I wouldn’t exclude the possibility that some parts of this large geographical area were never (completely) Finno-Ugric. The population that created the most important part of the hydronymy of the Russian North could be finally pushed aside or assimilated only at the end of the 1st – beginning of the 2nd millennium AD, during the Russian colonization, retaining the memory of the White-Eyed Chude in its own memory.

NOTE. For more on this non-IE substrate in (especially West) Uralic, see e.g. Zhivlov (2015),

The same non-Uralic substrate is most likely behind most of the shared traits by Mordvinic and Balto-Finnic (see below).

3) In genetics, I don’t think the picture could get any clearer. I don’t know what “Steppe ancestry = Indo-European” proponents expected from 2019, if they expected anything at all (I haven’t seen any coherent model, proposal, or prediction for a long time now), but I doubt the recent results are compatible with any of their implied expectations.

corded-ware-pca-sub-neolithic-europe
Detail of the PCA of the Corded Ware expansion. See full PCA and more related files.

Notice, from the PCA above, how this Baltic Late Neolithic group shows actually a shift from Sredni Stog (see PCA with Sredni Stog) towards typical Khvalynsk-Urals-related ancestry, i.e. populations from eastern European forested regions, derived from hunter-gatherer pottery groups, as I have proposed for a very long time, since the first time a Baltic LN “outlier” appeared. It’s amazing how some amateurs can find 0.1% of any Siberian outlier’s ancestry among Uralians 4,000 years later, but fail to see the direct connection here. The esoteric uses of qpAdm, I guess…

Especially noticeable is the extra WHG-like ancestry and corresponding shift, seen especially marked in late Polish CWC samples, but also in Baltic CWC and especially in one Sweden Battle Axe sample, all of them shifting apparently closer to Pitted Ware and SHG. While that may have been interpreted as an in situ admixture in Scandinavia before, the late Polish CWC samples show likely a resurgence of local populations, so we can assume that both shifts (to SHG- and EHG-like populations) of available CWC samples around the Baltic are clearly part of the WHG:EHG continuum that will be found in the eastern European sub-Neolithic cultures, from Narva to Volosovo.

This WHG-related ancestry is clearly predominant in groups with which Battle Axe peoples admixed, based on the shift towards Pitted Ware, which – I can only guess based on modern Volga Finns – is different from the shift we will see in Netted Ware, more towards the Khvalynsk-Urals cluster. This is in line with the expansion of Battle Axe eastward through coastal areas (West to East Baltic and Finland into Sweden), while Fatyanovo peoples probably emerged from a slightly different route, but also a northern one, if one is to follow archaological similarities and their chronology.

bronze-age-europe-baltic
Detail of the PCA of European Bronze Age populations. See full PCA and more related files.

During the Iron Age, the only peoples that probably shifted strongly (based on modern populations) are West Baltic ones, getting closer to the available Late Trzciniec samples, and even closer to the Trzciniec outlier, i.e. away from the earlier Eastern Corded Ware cluster, and towards Central European groups like Czech EBA or Poland EBA, both of them clearly derived from Bell Beakers, but also admixed with (and thus shifted toward) CW-like populations.

If one looks carefully at the previous PCA on Bronze Age populations, and the next one on Iron Age clusters, it is evident that adding the Swedish LN outlier to East Baltic BA (both strongly related to Battle Axe populations) essentially gives us the continuity of East Baltic BA into the Iron Age. This cluster is continued also in two outliers from Sigtuna, a Viking town close to the Gulf of Finland, known to be an important trading site, 1,500 years later. Not much of a change around the Gulf of Finland, then:

iron-age-eastern-europe
Detail of the PCA of East and North European Iron Age populations. See full PCA and more related files.

Based on the two simplistic Uralic clines one might see described (among the many that certainly existed, from Corded Ware to different Eurasian populations), and just like BOO was for some months fashionable as “Samic”, some may be tempted to say that certain Sintashta or Srubna outliers close to the Urals mark the True Uralic™ peoples. Because, of course they do. Ghost haplogroup N and stuff. And Corded Ware never ever Uralic. Because Gimbutas, and my IE R1a grandfather.

NOTE. Funny thing here: there might be Corded Ware, Iranian, Slavic, Germanic, etc… outliers or out-groups, and they might form the widest genetic clusters ever seen, but they are all of one language, because archaeology and linguistics; however, one “outlier” (also, put your own definition of “outlier” here, let’s say 1% of whatever, and strontium isotope potentially from 100 km away) ca. 600 BC in the Baltic who (surprise!) happens to show hg. N, and he signals the first incoming True Uralic™ speaker from wherever… It won’t be the first or the last time some people resort to “the complexity of Uralic-speaking peoples” in ancestry, just to look for “hg. N = Uralic” like crazy. You only need common sense to understand that this is not how this works. Amateur genomics can’t get more embarrassing than the current “let’s look for ‘Siberian ancestry’ in every individual of haplogroup N” trend. Or maybe it can, and it will, but I can’t see it yet.

If one were to insist on looking for ‘foreign’ contributions among Iron Age Estonians, though, I think one should also check out first archaeology, and then the PC3 (or, more graphically, a 3D plot), to understand what might be happening with the many Uralic clines derived from Corded Ware, before starting to play around with bioinformatic tools to discover a teeny tiny 1% admixture of the wrong population, and rushing to build far-fetched narratives. Apparently, one of the different clines formed roughly between southern (steppe – forest-steppe) and northern (tundra-taiga) populations in Uralians is also seen in some Iron Age Estonian individuals – especially in some late samples from Ingria…This is not my main interest, so I will leave this here for others to keep wasting their time chasing the white whale of the 0.5% of True Uralic™ ancestry in ancient Baltic samples of hg. N.

pca-3d-estonians-iron-age-boo-samic
Still images of the 3D plot of Eurasian samples. Typical PC1 vs. PC2 visualization to the left, and shift of the view to PC3 on the right image. See full PCA and more related files.

An exclusive Volga-Kama homeland for Disintegrating Uralic?

Since I don’t believe in macro-regions of largely continuous ethnolinguistic communities, as I have often said about Slavic (naively associated with prehistoric tribes of Eastern Europe) or Germanic (absurdly considered to be represented by Battle Axe), it is difficult for me to believe that Battle Axe-derived cultures remained of the same Finno-Samic dialects since the Corded Ware expansion…unless we live in Westeros, where everything happens “for thousands of years”.

I have to admit, then, that the now prevalent identification among Uralicists has become quite attractive:

  • Fatyanovo-Balanovo as Finno-Permic:
    • Fatyanovo/Netted Ware with West Uralic (also called Finno-Mordvinic).
    • Balanovo/Chirkovo-Kazan with Central Uralic (Mari-Permic).
  • Abashevo, into the Andronovo-like Horizon through the Seima-Turbino phenomenon, with East Uralic (also Ugro-Samoyedic).

Exactly like the identification of Yamna Hungary – Bell Beaker transition as the North-West Indo-European homeland, it gives us simplicity and small and late ethnolinguistic communities, away from the traditionally overused big and early language territories.

This late homeland would be supported, among others, by:

  • The presence of Indo-Iranian loanwords in Finno-Permic and Ugric (probably also in Samoyedic, either lost, or – much more likely – underresearched), compatible with the immediate contact between Abashevo – Sintashta-Potapovka-Filatovka and Fatyanovo-Balanovo.
  • The supposed expansion of Netted Ware from Fatyanovo to the north-west, which may be explained as the split and expansion of Balto-Finnic and Samic ca. 1900 BC.
  • A longer-lasting Finno-Permic (West+Central Uralic) community contrasting with the early separation of East Uralic.
  • The compatibility of this late expansion with the late expansion of Pre-Germanic from Denmark with the Dagger Period, and of Balto-Slavic with Trzciniec, which puts all three dialects reaching the Baltic Sea in the EBA.

NOTE. I meant to update the linguistic text to include the most recently favoured phylogenetic tree of Uralic languages after Häkkinen (2007, 2009, 2014), which has very quickly become the new normal among Uralicists, but I don’t think I will have enough time to review the necessary papers for that. I am rushing to publish a printed edition, so the text will wind up being a mixture of “traditional” (meaning, basically, pre-2010s) description of Uralic dialects but using modern divisions; say, “West Uralic” instead of “Finno-Samic”. By the way, I am still amazed that none of my reader-haters (or any online user discussing Uralic migrations, for that matter) have come up with the questions that the new division pose, and it supports my suspicion about the complete lack of interest in linguistics of most (a)DNA fans, except for the occasional use of old and free PDFs Googled to support new narratives invented expressly for some qpAdm results…

textile-ceramics-europe-bronze-age
Textile ceramic styles and influence of Bronze Age cultures divided in clusters.

Problems with this Parpola-Carpelan’s (2012-2018) interpretation include:

  • The differentiation between Fennoscandian Textile Ceramics vs. Netted Ware, which is not warranted in archaeology. The assumption that Netted Ware expanded to the Baltic Sea (as Kallio does, following the traditional view) is thus weak, and it was probably a question of cultural contacts coupled with short-distance population movements/exchange in both directions (from the Baltic to the Volga and vice versa). In fact, the culture division relies on some fairly common and technically simple ornamentation patterns, widespread all over northern Europe, even before the Corded Ware expansion, and it is very difficult to separate certain neighboring Textile Ceramics from Netted Ware groups in southern Finland (i.e. Sarsa-Tomitsa groups).
  • The strict and radical direction described for the Netted Ware by Carpelan, as an eastward and northward expansion, within a very short time frame (ca. 1900-1800 BC), based on few radiocarbon dates, which seems to me like a very risky assumption. We know how this kind of descriptions of direction of culture expansion based on radiocarbon dates has turned out in much more complex “packages”, like the Bell Beaker culture… In fact, the earliest dates for Textile Ware are from the East Baltic, earlier than those of Netted Ware.
  • The assumption that Balto-Finnic traits shared with Mordvinic are a) late and b) meaningful for dialectalization of two closely related dialects, when it is clear that both dialects separated quite early. Phonologically Finnic is more conservative, morphologically less so, and the shared traits include a handful of non-Uralic substrate words which can’t be traced to a single common source, hence they were adopted when both languages had already separated… All in all, Finnic – Mordvinic correspondances are not even close to Italo-Celtic ones, which is clearly fully incompatible with a proposal of a Finnic separation from Mordvinic coinciding with the LBA-IA transition.

Especially problematic for Parpola’s model is the lack of genetic impact in Bronze Age or Iron Age Estonians, not reaching a significant level under any possible statistical threshold – which I am sure was quite disappointing for some of my readers -, but is in line with major archaeological continuity of groups the from region, only disturbed in cultural (and Y-chromosome) terms by the expansion of Akozino warrior-traders all over the Baltic Sea. Any proposed population movement will be very difficult to support in genetics, given the Corded Ware-derived populations that we will see in both regions, and the continued Baltic-Volga contacts since the Corded Ware expansion.

Problems with an interpretation of such a small impact in population genomics includes the similarly weak impacts and haplogroup infiltrations that can be seen among populations basically everywhere in Eurasia, during any given period, and much greater genetic impacts that are supposed to be (or that were certainly) followed by ethnolinguistic continuity.

akozino-malar-axes-fennoscandia
Distribution of the Akozino-Mälar axes according to Sergej V. Kuz’minykh (1996: 8, Abb. 2).

The Battle Axe question

From Kallio (2015), about choosing a tentative homeland for Proto-Uralic:

(…) linguistically uniform Proto-Uralic would have been spoken in the Volga-Oka region until the mid-third millennium BC when the Proto-Uralic-speaking area would have expanded to the Volga-Kama region as well. By the end of the same millennium, this expansion would have led to the earliest dialectal splits within Uralic into Finno-Mordvin, Mari-Permic, and Ugro-Samoyed. The splitting up of these three soon followed during the early second millennium BC when the Uralic-speaking area finally stretched from the Baltic Sea in the west to the Altai mountains in the east. Indeed, no matter where Proto-Uralic was spoken, the branching into the nine well-attested subgroups (viz. Finnic, Saami, Mordvin, Mari, Permic, Hungarian, Mansi, Khanty, and Samoyed) must have taken less than a millennium, because their shared phonological and morphosyntactic isoglosses are rather limited (see Salminen 2002). The traditional view that all this branching would have taken several millennia violates everything linguistic typology teaches us about the rate of language change.

The basic problem of this identification of Fatyanovo-Balanovo as West-Central Uralic and Abashevo as East Uralic is the nature of the Battle Axe culture, including the Bronze Age East Baltic and Gulf of Finland area. Even if it is accepted that Fatyanovo-Balanovo represented all Western groups, Battle Axe must have represented West Uralic-like dialects.

The ethnolinguistic identification of Battle Axe depends ultimately on the nature of contacts of Fatyanovo/Netted Ware with Battle Axe/Textile Ceramics. If both groups were close and interacted profusely, as it seems, it doesn’t seem granted that we will be able to distinguish a close Para-West Uralic dialect of Scandinavia from the actual expanding Balto-Finnic and Samic dialects, if they were actually linked to the Netted Ware expansion. Also from Kallio (2015):

No doubt the most convincing substrate theory has recently been put forward by the Saami Uralicist Ante Aikio (2004), who has not only rehabilitated but also improved the old idea of a non-Uralic substrate in Saami. His study shows that there were still non-Uralic languages spoken in Northern Fennoscandia as recently as the first millennium AD. Most of all, they were not only genetically non-Uralic but also typologically non-Uralic-looking, bearing a closer resemblance to the so-called Palaeo-European substrates (for which see e.g. Schrijver 2001; Vennemann 2003).

In comparison, the case of Finnic is much more difficult. The fact that Proto-Uralic was not spoken in the East Baltic region means that this area must have originally been non-Uralic-speaking, but so far the evidence for a non-Uralic substrate in Finnic has consisted of appellatives and proper names with no etymology (cf. Ariste 1971; Saarikivi 2004a). Contrary to the proposed substrate words in Saami, those in Finnic show no structural non-Uralisms, as if they had indeed been borrowed from some genetically related or at least typologically similar languages, as I suggested above. Also none of them is more recent than the Middle Proto-Finnic stage, which makes them at least two millennia old. All this agrees with archaeological evidence discussed earlier that the Uralicization of the East Baltic region occurred during the Bronze Age (ca. 1900–500 BC).

The discussion of the paper continues with an unsuccessful attempt to find a hypothetical ancient Indo-European substrate that Kallio believes must be associated with the expansion of Corded Ware, in line with the traditional belief. For example, the often mentioned – almost folk etymology-like, unsurprisingly popular among amateurs – ‘Neva’ as derived from IE “young” is logically rejected…Unlike Parpola, Kallio’s view seems to be confident that Netted Ware (as Textile Ware) expanded into the East Baltic, on both sides of the Gulf of Finland, already during the Bronze Age.

As it has become apparent in population genomics, none of them was right, and Textile Ceramics will essentially show – like Netted Ware – a large genetic continuity of Corded Ware peoples in the whole north-eastern European forest zone – despite small regional population movements, obviously -, which necessarily implies that the whole Corded Ware culture – and not only Fatyanovo-Balanovo and Abashevo – were Uralic-speaking territories.

The similarities in terms of culture and Y-DNA bottlenecks between Battle Axe and Fatyanovo-Balanovo also imply that the linguistic differences between these groups were probably not many, and became strongly divided only after their territorial division. Continued contacts between Battle Axe- and Fatyanovo-derived groups can explain the proposed contacts (Finnic with Samic, Finnic with Mordvinic) after their linguistic-but-not-physical separation.

east-european-fatyanovocwc
East European movement directions (arrows) of the representatives of the Central European Corded Ware Culture (according to I.I. Artemenko).

Battle Axe spoke “Para-Balto-Finnic”?

The Balto-Finnic-speaking nature of Battle Axe is thus supported by:

  • The lack of non-Uralic substrates in Balto-Finnic territory (Kallio 2015).
  • The early separation of Samic and Finnic from Mordvinic, and the virtual identity of Proto-West-Uralic and Proto-Uralic, which suggests that Proto-Uralic spread fast (Parpola 2012).
  • The scarce non-Uralic topo-hydronymy in the East Baltic and around the Gulf of Finland (Saarikivi 2004), comparable to that on the Upper Volga region.
  • The strong influence of a Balto-Finnic-like substrate on Pre-Germanic (or, in Kallio’s opinion, the same Scandinavian substrate influencing both Germanic and Balto-Finnic at the same time), and the continued influence of Balto-Finnic on Proto-Baltic and Proto-Slavic.
  • The continued influence of Corded Ware-derived groups in central-east Sweden in Finland and the East Baltic in terms of agricultural innovations appearing in the LBA, compatible with Schrijver’s proposal of intermediate Germanic-shifted Balto-Finnic groups and Balto-Finnic groups influenced by their pronunciation.
  • The intense Palaeo-Germanic and late Balto-Slavic / early Proto-Baltic superstrate on Balto-Finnic, which place all three dialects around the Baltic Sea since the Early Bronze Age.
  • The easy replacement of a hypothetic Para-Balto-Finnic dialect by incoming Proto-Balto-Finnic-speaking peoples (say, with textile ceramics), without much linguistic impact.

In fact, the continuous contacts of the East Baltic with the Volga, and especially the close interaction with Akozino warrior-traders just before the Tarand-grave period, could be the actual origin of the recent (if any) Finnic-Mordvinic connections that need to be traced back to the LBA-IA (maybe here the number ‘ten’), since most of them can be related to a Pit-Comb Ware culture substrate and earlier contacts through the forest zone, which Samic (due to its early split and presence to the north of the Gulf of Finland during the BA) does not share. In fact, some of them can be traced back to Balto-Finnic first

These are the most often mentioned, in order of descending relevance for a shared ancient community:

  • Noun paradigms and the form and function of individual cases.
  • The geminate *mm (foreign to Proto-Uralic before the development of Fennic under Germanic influence) and other non-Uralic consonant clusters.
  • The change of numeral *luka ‘ten’ with (non-Uralic) *kümmen.
  • The presence of loanwords of non-Uralic origin, related to farming and trees, potentially Palaeo-European in nature.

It’s not only a question of quantity. Are these shared Mordvinic – Balto-Finnic traits really more relevant than, say, those between Italo-Celtic, which are supposed to have formed a community for a very short period at the end of the 3rd millennium around the Alps? Are these traits even sufficient to propose a common early Mordvinic-Finnic group within West Uralic, rather than loose Mordvinic – Balto-Finnic contacts, i.e. contacts between East Baltic (Textile Ceramics) and Volga-Kama (Netted Ware)?

Based on the alternative (Kallio’s) view of continued contacts between Textile Ceramics groups, even without knowing anything about linguistics, you can guess that Parpola is spinning very thin when assuming that these changes suggest that Balto-Finnic may have expanded with Akozino warrior-traders, separating thus ca. 800 BC from Mordvinic…

Genetic findings now clearly help dismiss any meaningful population impact in the LBA-IA transition, although any linguist can obviously argue for linguistic change in spite of major genetic continuity. But then we are stuck in the pre-ancient DNA era, so what’s ancient DNA for.

netted-ware-textile-ceramics
Middle Bronze Age cultures of Eastern Europe.

Genetic continuity = language continuity?

In the end, it’s very difficult to say how much language continuity there is around Estonia since the arrival of Corded Ware peoples. Looking at Modern Estonians, they have been clearly influenced by recent contacts with Baltic- and Germanic-speaking peoples clustering to the south-west in the PCA. They seem to have also received contacts from north(-east)ern peoples, likely from Finland, evidenced by their shifts toward the modern Estonian cluster during and after the Middle Ages, with a slight increase in Siberian ancestry and N1c subclades associated with Lovozero Ware. How much language change did these contacts bring? Maybe an expansion of Gulf of Finland Finnic (Northern Estonian) over Inland Finnic (Southern Estonian) and Gulf of Riga Finnic (Livonian)? Difficult to know, exactly, but, in the traditional view of Balto-Finnic dialectal distribution among Uralicists like Kallio, possibly no change at all.

So, if the obvious changes in the Estonia_MA cluster relative to Estonia_IA cluster and Estonia_Modern relative to Estonia_MA do not represent radical language change…Why would Estonia_IA represent a change relative to Estonia_BA, when it is statistically basically the same? Or Estonia_BA relative to CWC_Baltic? Because of the infiltration of haplogroup N1c around the whole Baltic? Because of the occasional 1% “Siberian” ancestry in some non-locals of varied haplogroups across the whole Baltic area?

In spite of all this, the amount of special pleading we are seeing among openly Nordicist amateurs when discussing the Uralic homeland relative to the Indo-European question in genetics has become a matter of plain willful ignorance. Like the living corpses of the Anatolian homeland, the Armenian homeland, the OIT proponents, or the nativist Basque R1b association, the personal involvement in the revival of “R1a=Indo-European” and “N=Uralic” trends is just painful to watch.

[Next post in this line, if I manage to make time for it: “Genetic (dis)continuity in Central Europe“. Let’s see if early Balts and early Slavs, as well as Germanic peoples, show a cluster closer to Danubian EBA (viz. Maros), Hungary-Balkans BA, and Urnfield-related samples than their predecessors in their areas, i.e. away from East Corded Ware groups… If you want, you can enjoy for the moment the new PCAs I could get done and the tentative map of languages in the Early Bronze Age, that will probably give you the right idea about early Indo-European and Uralic population movements]

bronze-age-early-indo-european
European Early Bronze Age: tentative language map based on linguistics, archaeology, and genetics. See full map.

Related

Baltic Finns in the Bronze Age, of hg. R1a-Z283 and Corded Ware ancestry

estonian-bronze-age-dna

Open access The Arrival of Siberian Ancestry Connecting the Eastern Baltic to Uralic Speakers further East, by Saag et al. Current Biology (2019).

Interesting excerpts:

In this study, we present new genomic data from Estonian Late Bronze Age stone-cist graves (1200–400 BC) (EstBA) and Pre-Roman Iron Age tarand cemeteries (800/500 BC–50 AD) (EstIA). The cultural background of stone-cist graves indicates strong connections both to the west and the east [20, 21]. The Iron Age (IA) tarands have been proposed to mirror “houses of the dead” found among Uralic peoples of the Volga-Kama region [22].

(…) The 33 individuals included 15 from EstBA, 6 from EstIA, 5 from Pre-Roman to Roman Iron Age Ingria (500 BC–450 AD) (IngIA), and 7 from Middle Age Estonia (1200–1600 AD) (EstMA) and yielded endogenous DNA ∼4%–88%, average genomic coverages ∼0.017–0.734×, and contamination estimates <4% (Table S1). We analyzed the data in the context of modern and other ancient individuals, including from Neolithic Estonia [13].

estonian-y-dna-bronze-iron-age
Archaeological Information, Genetic Sex, mtDNA and Y Chromosome Haplogroups, and Average Coverage of the Individuals of This Study. Modified from the paper to mark distinct Y-DNA haplogroups in the LBA and IA.

We identified chrY hgs for 30 male individuals (Tables 1 and S2; STAR Methods). All 16 successfully haplogrouped EstBA males belonged to hg R1a, showing no change from the CWC period, when this was also the only chrY lineage detected in the Eastern Baltic [11, 13, 30, 31]. Three EstIA and two IngIA individuals also belonged to hg R1a, but three EstIA males belonged to hg N3a, the earliest so far observed in the Eastern Baltic. Three EstMA individuals belonged to hg N3a, two to hg R1a, and one to hg J2b. ChrY lineages found in the Baltic Sea region before the CWC belong to hgs I, R1b, R1a5, and Q [10, 11, 12, 13, 17, 32]. Thus, it appears that these lineages were substantially replaced in the Eastern Baltic by hg R1a [10, 11, 12, 13], most likely through steppe migrations from the east [30, 31]. (…) Our results enable us to conclude that, although the expansion time for R1a1 and N3a3′5 in Eastern Europe is similar [25], hg N3a likely reached Estonia or at least became comparably frequent to modern Estonia [1] only during the BA-IA transition.

A clear shift toward West Eurasian hunter-gatherers is visible between European LN and BA (including Baltic CWC) and EstBA individuals, the latter clustering together with Latvian and Lithuanian BA individuals [11]. EstIA, IngIA, and EstMA individuals project between BA individuals and modern Estonians, partially overlapping with both.

(…) EstBA individuals are clearly distinguishable from Estonian CWC individuals as the former have more of the blue component most frequent in WHGs and less of the brown and yellow components maximized in Caucasus hunter-gatherers and modern Khanty, respectively. The individuals of EstBA, EstIA, IngIA, EstMA, and modern Estonia are quite similar to each other on average, indicating that the relatively high proportion of WHG ancestry in modern Eastern Baltic populations compared to other present-day Europeans [15] traces back to the BA.

estonian-pca-published
Detail of the PCA, modified from the paper to label populations. Estonian Bronze Age and Iron Age samples cluster close to Early Corded Ware from the Baltic.. Principal-component analysis results of modern West Eurasians with ancient individuals projected onto the first two components (PC1 and PC2). BA, Bronze Age; EF, early farmers; HG, hunter-gatherers; IA, Iron Age; IMA, Iron/Middle Ages; LN, Late Neolithic; LNBA, Late Neolithic/Bronze Age; MA, Middle Ages

When comparing Estonian CWC and EstBA using autosomal outgroup f3 and Patterson’s D statistics (Table S3), the latter is more similar to other Baltic BA populations, to Baltic IA and Middle Age (MA) populations, and also to populations similar to WHGs and Scandinavian hunter-gatherers (SHGs), but not to Estonian CCC (Figures 2A and S2A; Data S1). The increase in WHG or SHG ancestry could be connected to western influences seen in material culture [20, 21] and facilitated by a decline in local population after the CCC-CWC period [20]. A slight trend of bigger similarity of Estonian CWC to forest or steppe zone populations and of EstBA to European early farmer populations can also be seen.

(…) When comparing to modern populations, Estonian CWC is slightly more similar to Caucasus individuals but EstBA to Baltic populations and Finnic speakers (Figure 2B; Data S1). Outgroup f3 and D statistics do not reveal apparent differences when comparing EstBA to EstIA, EstIA to IngIA, and EstIA to EstMA (Data S1).

estonian-ba-ia-ancestry
qpAdm results. Error bars indicate one SE. Central MN, Central European Middle Neolithic; EstBA, Estonian Bronze Age; EstIA, Estonian Iron Age; IngIA, Ingrian Iron Age; EstMA, Estonian Middle Ages; WHG, western hunter-gatherers.

These results highlight how uniparental and autosomal data can lead to different demographic inferences—the genetic change between CWC and BA not seen in uniparental lineages is clear in autosomal data and the appearance of chrY hg N in the IA is not matched by a clear shift in autosomal profiles.

EstBA individuals have no Nganasan-related ancestry and EstIA, IngIA, and EstMA individuals on average have 2% or 4% (Figure 3; Data S1). The differentiation remains when using BA or IA Fennoscandian populations [26] instead of Nganasans (Data S1). Notably, the proportion of Nganasan-related ancestry varies between 0% and 12% among sampled EstIA, IngIA, and EstMA individuals (Data S1), which may suggest its relatively recent admixture into the target population. Moreover, two individuals from Kunda (0LS10 and V10) have the highest proportions of Nganasan ancestry among EstIA (6% and 8%), one of them has chrY hg N3a, and isotopic analysis suggests neither individual being born in Kunda [34].

About these two males from Tarand-graves, ‘foreign’ to Kunda:

0LS10: Male from tarand III (burial 9; TÜ 1325: L777), age 17–25 years [34]. He had a fragment of a sheep/goat bone and ceramics as grave goods. This burial has two radiocarbon dates: 2430 ± 35 BP (Poz-10801; 760–400 cal BC) and 2530 ± 41 BP (UBA-26114; 800–530 cal BC) [34]. According to the isotopic analysis, the person was not born in the vicinity of Kunda; his place of birth is still unknown (but south-western Finland and Sweden are excluded) [34]. Sampled tooth r P1.

V10: Male from tarand XI (burial 24; TÜ 1325: L1925), age 25–35 years [34], date 2484 ± 40 BP (UBA-26115; 790–430 cal BC) [34]. He had a few potsherds near the skull. Likewise, this person was not locally born [34]. Sampled tooth l P1.

estonia-bronze-iron-age-steppe-siberian
Autosomal Analyses’ Results for Gyvakarai1 as the closest available Corded Ware source for Balto-Finnic populations.

The paper shows thus:

  • Major continuity of ancestry from Corded Ware to modern Estonians, with only slight changes in different periods. In fact, one of the best fits for the Late Bronze Age ancestry is Gyvakarai1, one of the Corded Ware “outliers” described as “closer to Yamna”, which I already said may be closer to Sredni Stog/EHG populations instead. Another interesting take is that the change from Bronze Age to Iron Age corresponds to an increase in Baltic Corded Ware-related ancestry, rather than being driven by Siberian ancestry.
  • pca-mittnik-gyvakarai
    File modified by me from Mittnik et al. (2018) to include the approximate position of the most common ancestral components, and an identification of potential outliers. Zoomed-in version of the European Late Neolithic and Bronze Age samples. “Principal components analysis of 1012 present-day West Eurasians (grey points, modern Baltic populations in dark grey) with 294 projected published ancient and 38 ancient North European samples introduced in this study (marked with a red outline). From Mittnik et al. (2018).
  • A Volosovo-related migration of hg. N1c with Netted Ware into the area seems to be discarded, based on the full replacement of paternal lines and continuity of R1a-Z283. It is only during the Tarand-grave period when a system of chiefdoms (spread from Ananyino/Akozino) brings haplogroup N1c to the Gulf of Finland. During the Iron Age, the proportion of paternal lineages is still clearly in favour of R1a (50% in the coast, 100% in Ostrobothnia), which indicates a gradual replacement led by elites, likely because of the incorporation of Akozino warrior-traders spreading all over the Baltic, bringing the described shared Mordvinic traits in Fennic.
  • finno-ugric-haplogroup-n
    Map of archaeological cultures in north-eastern Europe ca. 8th-3rd centuries BC. [The Mid-Volga Akozino group not depicted] Shaded area represents the Ananino cultural-historical society. Fading purple arrows represent likely stepped movements of subclades of haplogroup N for centuries (e.g. Siberian → Ananino → Akozino → Fennoscandia [N-VL29]; Circum-Arctic → forest-steppe [N1, N2]; etc.). Blue arrows represent eventual expansions of Uralic peoples to the north. Modified image from Vasilyev (2002).
  • The arrival of Akozino warrior-traders (bringing N1c and R1a lineages) was probably linked to this minimal “Nganasan-like” ancestry of some samples in the transition to the Iron Age. This arrival is supported by samples 0LS10 (the earliest hg. N1c) and V10 (of hg. R1a), both dated to ca. 800-400 BC, with V10 showing the highest “Nganasan-like” ancestry with 4.8%, both of them neighbouring samples showing 0%. This variable admixture among local and foreign paternal lineages might support the described social system of family alliances with intermarriages. In fact, a medieval sample, 0LS03_1 (hg. R1a) also shows a recent “Nganasan-like” ancestry, which probably points to the integration of different Arctic-related ancestry components among Modern Estonians, in this case related to Finnish expansions and thus integration of Levänluhta-related ancestry, as per the supplementary data.
  • NOTE. Such minimal proportions of “Nganasan-like” ancestry evidence the process of admixture of Volga Finns in Akozino territory through their close interactions with Permians of Ananyino, who in turn acquired this Palaeo-Arctic admixture most likely during the expansion of the linguistic community to hunter-gatherer territories, to the north of the Cis-Urals. This process of stepped infiltration and expansion without language change is not dissimilar to the one seen among Indo-Iranians and Balto-Slavs of hg. R1b, or Vasconic speakers of hg. I2a, although in the case of Baltic Finns of hg. R1a the process of infiltration and expansion of hg. N1c is much less dramatic, with no radical replacement anywhere before the huge bottlenecks observable in Finns.

  • The expansion of haplogroup N1c among Finnic populations, as we are going to see in samples from the Middle Ages such as Luistari, is the consequence of late founder effects after huge bottlenecks expected based on the analysis of modern populations. The expansion of N1c-VL29 is different in origin from that of N1c-Z1936 among Samic (later integrated into Finnish populations), most likely from the east and originally associated with Lovozero Ware.
haplogroup_n3a3
Frequency-Distribution Maps of Individual Subclade N3a3 / N1a1a1a1a1a-CTS2929/VL29, probably initially with Akozino warrior-traders. Map from Ilumäe et al. (2016).

In spite of all this, the conclusion of the paper is (surprise!) that Siberian ancestry and hg. N heralded the arrival of Finnic to the Gulf of Finland in the Iron Age… However, this conclusion is supposedly* supported, not by their previous papers, but by a recent phylogenetic study by Honkola et al. (2013), which doesn’t actually argue for such a late ‘arrival’: it argues for the split of Balto-Finnic around 1500 BC.

NOTE. I say ‘supposedly’ because Kristiina Tambets, for example, has been following the link of Uralic with haplogroup N since the 2000s, so this is not some conclusion they just happened to misread from some random paper they Googled. In those initial assessments, she argued that the “ancient homeland” of the Tat C mutation suggested that Finno-Ugrians were in Fennoscandia before Indo-Europeans. Apparently, since haplogroup N appears later and from the east, it is now more important to follow this haplogroup than what is established in archaeology and linguistics.

Even in the referred paper, this split is considered an in situ development, since the phylogenetic study takes the information – among others – 1) from Parpola and Carpelan, who consider Netted Ware, a culture derived from Fatyanovo/Abashevo and Volosovo, as the culprit of the Finno-Ugric expansion; and 2) from Kallio (2006), who clearly states that Proto-Balto-Finnic (like Proto-Finno-Samic) was spoken around the Gulf of Finland during the Bronze Age. Both of them set the terminus ante quem of the language presence in the Baltic ca. 1900 BC.

Anyways, as a consequence of geneticists keeping these untenable pre-ancient DNA haplogroup-based arguments today, I expect to see this “Finnic” language expansion also described for the Western Baltic, Scandinavia or northern Europe, when this same proportion of hg. N1c and “Nganasan” ancestry is observed in Iron Age samples around the Baltic Sea. The nativist trends that this domination of “Finns” all over Northern Europe 2,500 years ago will create will be even more fun to read than the current ones…

EDIT (10 May 2019) How I see the reaction of many to ancient DNA, in keeping their old theories:

Related

Uralic speakers formed clines of Corded Ware ancestry with WHG:ANE populations

steppe-forest-tundra-biomes-uralic

The preprint by Jeong et al. (2018) has been published: The genetic history of admixture across inner Eurasia Nature Ecol. Evol. (2019).

Interesting excerpts, referring mainly to Uralic peoples (emphasis mine):

A model-based clustering analysis using ADMIXTURE shows a similar pattern (Fig. 2b and Supplementary Fig. 3). Overall, the proportions of ancestry components associated with Eastern or Western Eurasians are well correlated with longitude in inner Eurasians (Fig. 3). Notable outliers include known historical migrants such as Kalmyks, Nogais and Dungans. The Uralic- and Yeniseian-speaking populations, as well as Russians from multiple locations, derive most of their Eastern Eurasian ancestry from a component most enriched in Nganasans, while Turkic/Mongolic speakers have this component together with another component most enriched in populations from the Russian Far East, such as Ulchi and Nivkh (Supplementary Fig. 3). Turkic/Mongolic speakers comprising the bottom-most cline have a distinct Western Eurasian ancestry profile: they have a high proportion of a component most enriched in Mesolithic Caucasus hunter-gatherers and Neolithic Iranians and frequently harbour another component enriched in present-day South Asians (Supplementary Fig. 4). Based on the PCA and ADMIXTURE results, we heuristically assigned inner Eurasians to three clines: the ‘forest-tundra’ cline includes Russians and all Uralic and Yeniseian speakers; the ‘steppe-forest’ cline includes Turkic- and Mongolic-speaking populations from the Volga and Altai–Sayan regions and Southern Siberia; and the ‘southern steppe’ cline includes the rest of the populations.

eurasian-clines-uralic-altaic
The first two PCs summarizing the genetic structure within 2,077 Eurasian individuals. The two PCs generally mirror geography. PC1 separates western and eastern Eurasian populations, with many inner Eurasians in the middle. PC2 separates eastern Eurasians along the northsouth cline and also separates Europeans from West Asians. Ancient individuals (color-filled shapes), including two Botai individuals, are projected onto PCs calculated from present-day individuals.

For the forest-tundra populations, the Nganasan + Srubnaya model is adequate only for the two Volga region populations, Udmurts and Besermyans (Fig. 5 and Supplementary Table 8).

For the other populations west of the Urals, six from the northeastern corner of Europe are modelled with additional Mesolithic Western European hunter-gatherer (WHG) contribution (8.2–11.4%; Supplementary Table 8), while the rest need both WHG and early Neolithic European farmers (LBK_EN; Supplementary Table 2). Nganasan-related ancestry substantially contributes to their gene pools and cannot be removed from the model without a significant decrease in the model fit (4.1–29.0% contribution; χ2 P ≤ 1.68 × 10−5; Supplementary Table 8).

west-urals-finno-ugrians-qpadm
Supplementary Table 8. QpAdm-based admixture modeling of the forest-tundra cline populations. For the 13 populations west of the Urals, we present a four-way admixture model, Nganasan+Srubnaya+WHG+LBK_EN, or its minimal adequate subset. Modified from the article, to include colors for cultures, and underlined best models for Corded Ware ancestry among Uralians.

NOTE. It doesn’t seem like Hungarians can be easily modelled with Nganasan ancestry, though…

For the 4 populations east of the Urals (Enets, Selkups, Kets and Mansi), for which the above models are not adequate, Nganasan + Srubnaya + AG3 provides a good fit (χ2 P ≥ 0.018; Fig. 5 and Supplementary Table 8). Using early Bronze Age populations from the Baikal Lake region (‘Baikal_EBA’; Supplementary Table 2) as a reference instead of Nganasan, the two-way model of Baikal_EBA + Srubnaya provides a reasonable fit (χ2 P ≥ 0.016; Supplementary Table 8) and the three-way model of Baikal_EBA + Srubnaya + AG3 is adequate but with negative AG3 contribution for Enets and Mansi (χ2 P ≥ 0.460; Supplementary Table 8).

east-urals-ugric-samoyedic-qpadm
Supplementary Table 8. QpAdm-based admixture modeling of the forest-tundra cline populations. For the four populations east of the Urals, we present three admixture models: Baikal_EBA+Srubnaya, Baikal_EBA+Srubnaya+AG3 and Nganasan+Srubnaya+AG3. For each model, we present qpAdm p-value, admixture coefficient estimates and associated 5 cM jackknife standard errors (estimate ± SE). Modified from the article, to include colors for cultures, and underlined best models for Corded Ware ancestry among Uralians.

Bronze/Iron Age populations from Southern Siberia also show a similar ancestry composition with high ANE affinity (Supplementary Table 9). The additional ANE contribution beyond the Nganasan + Srubnaya model suggests a legacy from ANE-ancestry-rich clines before the Late Bronze Age.

bronze-age-iron-age-karasuk-mezhovska-tagar-qpadm
Supplementary Table 9. QpAdm-based admixture modeling of Bronze and Iron Age populations of southern Siberia. For ancieint individuals associated with Karasuk and Tagar cultures, Nganasan+Srubnaya model is insufficient. For all five groups, adding AG3 as the third ancestry or substituting Nganasan with Baikal_EBA with higher ANE affinity provides an adequate model. For each model, we present qpAdm p-value, admixture coefficient estimates and associated 5 cM jackknife standard errors (estimate ± SE). Models with p-value ≥ 0.05 are highlighted in bold face. Modified from the article, to include colors for cultures, and underlined best models for Corded Ware ancestry among Uralians.

Lara M. Cassidy comments the results of the study in A steppe in the right direction (you can read it here):

Even among the earliest available inner Eurasian genomes, east–west connectivity is evident. These, too, form a longitudinal cline, characterized by the easterly increase of a distinct ancestry, labelled Ancient North Eurasian (ANE), lowest in western European hunter-gatherers (WHG) and highest in Palaeolithic Siberians from the Baikal region. Flow-through from this ANE cline is seen in steppe populations until at least the Bronze Age, including the world’s earliest known horse herders — the Botai. However, this is eroded over time by migration from west and east, following agricultural adoption on the continental peripheries (Fig. 1b,c).

Strikingly, Jeong et al. model the modern upper steppe cline as a simple two-way mixture between western Late Bronze Age herders and Northeast Asians (Fig. 1c), with no detectable residue from the older ANE cline. They propose modern steppe peoples were established mainly through migrations post-dating the Bronze Age, a sequence for which has been recently outlined using ancient genomes. In contrast, they confirm a substantial ANE legacy in modern Siberians of the northernmost cline, a pattern mirrored in excesses of WHG ancestry west of the Urals (Fig. 1b). This marks the inhospitable biome as a reservoir for older lineages, an indication that longstanding barriers to latitudinal movement may indeed be at work, reducing the penetrance of gene flows further south along the steppe.

eurasian-clines-uralic-turkic-mongol-altaic
The genomic formation of inner Eurasians. b–d, Depiction of the three main clines of ancestry identified among Inner Eurasians. Sources of admixture for each cline are represented using proxy ancient populations, both sampled and hypothesised, based on the study’s modelling results. The major eastern and western ancestries used to model each cline are shown in bold; the peripheral admixtures that gave rise to these are also shown. Additional contributions to subsections of each cline are marked with dashed lines. b, The northernmost cline, illustrating the legacy of WHG and ANE-related populations. c,d, The upper (c) and lower (d) steppe clines are shown, both of which have substantial eastern contributions related to modern Tungusic speakers. The authors propose these populations are themselves the result of an admixture between groups related to the Nganasan, whose ancestors potentially occupied a wider range, and hunter-gatherers (HGs) from the Amur River Basin. While the upper steppe cline in c can be described as a mixture between this eastern ancestry and western steppe herders, the current model for the southern steppe cline as shown in d is not adequate and is likely confounded by interactions with diverse bordering ancestries. Credit: Ecoregions 2017, Resolve https://ecoregions2017.appspot.com/

Given the findings as reported in the paper, I think it should be much easier to describe different subclines in the “northernmost cline” than in the much more recent “Turkic/Mongolic cline”, which is nevertheless subdivided in this paper in two clines. As an example, there are at least two obvious clines with “Nganasan-related meta-populations” among Uralians, which converge in a common Steppe MLBA (i.e. Corded Ware) ancestry – one with Palaeo-Laplandic peoples, and another one with different Palaeo-Siberian populations:

siberian-clines-uralic-altaic
PCA of ancient and modern Eurasian samples. Ancient Palaeo-Laplandic, Palaeosiberian, and Altai clines drawn, with modern populations labelled. See a version with higher resolution.

The inclusion of certain Eurasian groups (or lack thereof) in the PCA doesn’t help to distinguish these subclines visually, and I guess the tiny “Naganasan-related” ancestral components found in some western populations (e.g. the famous ~5% among Estonians) probably don’t lend themselves easily to further subdivisions. Notice, nevertheless, the different components of the Eastern Eurasian source populations among Finno-Ugrians:

uralic-admixture-qpadm
Characterization of the Western and Eastern Eurasian source ancestries in inner Eurasian populations. [Modified from the paper, includes only Uralic populations]. a, Admixture f3 values are compared for different Eastern Eurasian (Mixe, Nganasan and Ulchi; green) and Western Eurasian references (Srubnaya and Chalcolithic Iranians (Iran_ChL); red). For each target group, darker shades mark more negative f3 values. b, Weights of donor populations in two sources characterizing the main admixture signal (date 1 and PC1) in the GLOBETROTTER analysis. We merged 167 donor populations into 12 groups (top right). Target populations were split into five groups (from top to bottom): Aleuts; the forest-tundra cline populations; the steppe-forest cline populations; the southern steppe cline populations; and ‘others’.

Also remarkable is the lack of comparison of Uralic populations with other neighbouring ones, since the described Uralic-like ancestry of Russians was already known, and is most likely due to the recent acculturation of Uralic-speaking peoples in the cradle of Russians, right before their eastward expansions.

west-eurasian-east-eurasian-ancestry
Supplementary Fig. 4. ADMIXTURE results qualitatively support PCA-based grouping of inner Eurasians into three clines. (A) Most southern steppe cline populations derive a higher proportion of their total Western Eurasian ancestry from a source related to Caucasus, Iran and South Asian populations. (B) Turkic- and Mongolic-speaking populations tend to derive their Eastern Eurasian ancestry more from the Devil’s Gate related one than from Nganasan-related one, while the opposite is true for Uralic- and Yeiseian-speakers. To estimate overall western Eurasian ancestry proportion, we sum up four components in our ADMIXTURE results (K=14), which are the dominant components in Neolithic Anatolians (“Anatolia_N”), Mesolithic western European hunter-gatherers (“WHG”), early Holocene Caucasus hunter-gatherers (“CHG”) and Mala from southern India, respectively. The “West / South Asian ancestry” is a fraction of it, calculated by summing up the last two components. To estimate overall Eastern Eurasian ancestry proportion, we sum up six components, most prevalent in Surui, Chipewyan, Itelmen, Nganasan, Atayal and early Neolithic Russian Far East individuals (“Devil’s Gate”). Eurasians into three clines. (A) Most southern steppe cline populations derive a higher proportion of their total Western Eurasian ancestry from a source related to Caucasus, Iran and South Asian populations. (B) Turkic- and Mongolic-speaking populations tend to derive their Eastern Eurasian ancestry more from the Devil’s Gate related one than from Nganasan-related one, while the opposite is true for Uralic- and Yeiseian-speakers. To estimate overall western Eurasian ancestry proportion, we sum up four components in our ADMIXTURE results (K=14), which are the dominant components in Neolithic Anatolians (“Anatolia_N”), Mesolithic western European hunter-gatherers (“WHG”), early Holocene Caucasus hunter-gatherers (“CHG”) and Mala from southern India, respectively. The “West / South Asian ancestry” is a fraction of it, calculated by summing up the last two components. To estimate overall Eastern Eurasian ancestry proportion, we sum up six components, most prevalent in Surui, Chipewyan, Itelmen, Nganasan, Atayal and early Neolithic Russian Far East individuals (“Devil’s Gate”).

A comparison of Estonians and Finns with Balts, Scandinavians, and Eastern Europeans would have been more informative for the division of the different so-called “Nganasan-like meta-populations”, and to ascertain which one of these ancestral peoples along the ancient WHG:ANE cline could actually be connected (if at all) to the Cis-Urals.

Because, after all, based on linguistics and archaeology, geneticists are not supposed to be looking for populations from the North Asian Arctic region, for “Siberian ancestry”, or for haplogroup N1c – despite previous works by their peers – , but for the Bronze Age Volga-Kama region…

Related

N1c-L392 associated with expanding Turkic lineages in Siberia

haplogroup-n1c-tat

Second in popularity for the expansion of haplogroup N1a-L392 (ca. 4400 BC) is, apparently, the association with Turkic, and by extension with Micro-Altaic, after the Uralic link preferred in Europe; at least among certain eastern researchers.

New paper in a recently created journal, by the same main author of the group proposing that Scythians of hg. N1c were Turkic speakers: On the origins of the Sakhas’ paternal lineages: Reconciliation of population genetic / ancient DNA data, archaeological findings and historical narratives, by Tikhonov, Gurkan, Demirdov, and Beyoglu, Siberian Research (2019).

Interesting excerpts:

According to the views of a number of authoritative researchers, the Yakut ethnos was formed in the territory of Yakutia as a result of the mixing of people from the south and the autochthonous population [34].

These three major Sakha paternal lineages may have also arrived in Yakutia at different times and/ or from different places and/or with a difference in several generations instead, or perhaps Y-chromosomal STR mutations may have taken place in situ in Yakutia. Nevertheless, the immediate common ancestor(s) from the Asian Steppe of these three most prevalent Sakha Y-chromosomal STR haplotypes possibly lived during the prominence of the Turkic Khaganates, hence the near-perfect matches observed across a wide range of Eurasian geography, including as far as from Cyprus in the West to Liaoning, China in the East, then Middle Lena in the North and Afghanistan in the South (Table 3 and Figure 5). There may also be haplotypes closely-related to ‘the dominant Elley line’ among Karakalpaks, Uzbeks and Tajiks, however, limitations in the loci coverage for the available dataset (only eight Y-chromosomal STR loci) precludes further conclusions on this matter [25].

yakutia-haplogroup-n1c
17-loci median-joining network analysis of the original/dominant Elley, Unknown and Omogoy Y-chromosomal STR haplotypes with the YHRD matches from outside Yakutia populations.

According to the results presented here, very similar Y-STR haplotypes to that of the original Elley line were found in the west: Afghanistan and northern Cyprus, and in the east: Liaoning Province, China and Ulaanbaator, Northern Mongolia. In the case of the dominant Omogoy line, very closely matching haplotypes differing by a single mutational step were found in the city of Chifen of the Jirin Province, China. The widest range of similar haplotypes was found for the Yakut haplotype Unknown: In Mongolia, China and South Korea. For instance, haplotypes differing by a single step mutation were found in Northern Mongolia (Khalk, Darhad, Uryankhai populations), Ulaanbaator (Khalk) and in the province of Jirin, China (Han population).

n1c-uralic-altaic-siberia
14-loci median-joining network analysis for the original/dominant Elley (Ell), Unknown Clan
(Vil), Omogoy (Omo), Eurasian (Eur) and Xiongnu (Xuo) Y-chromosomal STR haplotypes and that for a representative ancient DNA sample (Ch0 or DSQ04) from the Upper Xiajiadian Culture
recovered from the Inner Mongolia Autonomous Region, China.

Notably, Tat-C-bearing Y-chromosomes were also observed in ancient DNA samples from the 2700-3000 years-old Upper Xiajiadian culture in Inner Mongolia, as well as those from the Serteya II site at the Upper Dvina region in Russia and the ‘Devichyi gory’ culture of long barrow burials at the Nevel’sky district of Pskovsky region in Russia. A 14-loci Y-chromosomal STR median-joining network of the most prevalent Sakha haplotypes and a Tat-C-bearing haplotype from one of the ancient DNA samples recovered from the Upper Xiajiadian culture in Inner Mongolia (DSQ04) revealed that the contemporary Sakha haplotype ‘Xuo’ (Table 2, Haplotype ID “Xuo”) classified as that of ‘the Xiongnu clan’ in our current study, was the closest to the ancient Xiongnu haplotype (Figure 6). TMRCA estimate for this 14-loci Y-chromosomal STR network was 4357 ± 1038 years or 2341 ± 1038 BCE, which correlated well with the Upper Xiajiadian culture that was dated to the Late Bronze Age (700-1000 BCE).

eurasian-n-subclades
Geographical location of ancient samples belonging to major clade N of the Y-chromosome.

NOTE. Also interesting from the paper seems to be the proportion of E1b1b among admixed Russian populations, in a proportion similar to R1a or I2a(xI2a1).

It is tempting to associate the prevalent presence of N1c-L392 in ancient Siberian populations with the expansion of Altaic, by simplistically linking the findings (in chronological order) near Lake Baikal (Damgaard et al. 2018), Upper Xiajiadian (Cui et al. 2013), among Khövsgöl (Jeong et al. 2018), in Huns (Damgaard et al. 2018), and in Mongolic-speaking Avars (Csáky et al. 2019).

However, its finding among Palaeo-Laplandic peoples in the Kola peninsula ca. 1500 BC (Lamnidis et al. 2018) and among Palaeo-Siberian populations near the Yana River (Sikora et al. 2018) ca. AD 1200 should be enough to accept the hypothesis of ancestral waves of expansion of the haplogroup over northern Eurasia, with acculturation and further expansions in the different regions since the Iron Age (see more on its potential expansion waves).

Also, a simple look at the TMRCA and modern distribution was enough to hypothesize long ago the lack of connection of N1c-L392 with Altaic or Uralic peoples. From Ilumäe et al. (2016):

Previous research has shown that Y chromosomes of the Turkic-speaking Yakuts (Sakha) belong overwhelmingly to hg N3 (formerly N1c1). We found that nearly all of the more than 150 genotyped Yakut N3 Y chromosomes belong to the N3a2-M2118 clade, just as in the Turkic-speaking Dolgans and the linguistically distant Tungusic-speaking Evenks and Evens living in Yakutia (Table S2). Hence, the N3a2 patrilineage is a prime example of a male population of broad central Siberian ancestry that is not intrinsic to any linguistically defined group of people. Moreover, the deepest branch of hg N3a2 is represented by a Lebanese and a Chinese sample. This finding agrees with the sequence data from Hallast et al., where one Turkish Y chromosome was also assigned to the same sub-clade. Interestingly, N3a2 was also found in one Bhutan individual who represents a separate sub-lineage in the clade. These findings show that although N3a2 reflects a recent strong founder effect primarily in central Siberia (Yakutia, Sakha), the sub-clade has a much wider distribution area with incidental occurrences in the Near East and South Asia.

haplogroup-n1a-M2118
Frequency-Distribution Maps of Individual Sub-clades of hg N3a2, by Ilumäe et al. (2016).

The most striking aspect of the phylogeography of hg N is the spread of the N3a3’6-CTS6967 lineages. Considering the three geographically most distant populations in our study—Chukchi, Buryats, and Lithuanians—it is remarkable to find that about half of the Y chromosome pool of each consists of hg N3 and that they share the same sub-clade N3a3’6. The fractionation of N3a3’6 into the four sub-clades that cover such an extraordinarily wide area occurred in the mid-Holocene, about 5.0 kya (95% CI = 4.4–5.7 kya). It is hard to pinpoint the precise region where the split of these lineages occurred. It could have happened somewhere in the middle of their geographic spread around the Urals or further east in West Siberia, where current regional diversity of hg N sub-lineages is the highest (Figure 1B). Yet, it is evident that the spread of the newly arisen sub-clades of N3a3’6 in opposing directions happened very quickly. Today, it unites the East Baltic, East Fennoscandia, Buryatia, Mongolia, and Chukotka-Kamchatka (Beringian) Eurasian regions, which are separated from each other by approximately 5,000–6,700 km by air. N3a3’6 has high frequencies in the patrilineal pools of populations belonging to the Altaic, Uralic, several Indo-European, and Chukotko-Kamchatkan language families. There is no generally agreed, time-resolved linguistic tree that unites these linguistic phyla. Yet, their split is almost certainly at least several millennia older than the rather recent expansion signal of the N3a3’6 sub-clade, suggesting that its spread had little to do with linguistic affinities of men carrying the N3a3’6 lineages.

haplogroup_n3a3
Frequency-Distribution Maps of Individual Subclade N3a3 / N1a1a1a1a1a-CTS2929/VL29.

It was thus clear long ago that N1c-L392 lineages must have expanded explosively in the 5th millennium through Northern Eurasia, probably from a region to the north of Lake Baikal, and that this expansion – and succeeding ones through Northern Eurasia – may not be associated to any known language group until well into the common era.

Related