Corded Ware ancestry in North Eurasia and the Uralic expansion

uralic-clines-nganasan

Now that it has become evident that Late Repin (i.e. Yamnaya/Afanasevo) ancestry was associated with the migration of R1b-L23-rich Late Proto-Indo-Europeans from the steppe in the second half of the the 4th millennium BC, there’s still the question of how R1a-rich Uralic speakers of Corded Ware ancestry expanded , and how they spread their languages throughout North Eurasia.

Modern North Eurasians

I have been collecting information from the supplementary data of the latest papers on modern and ancient North Eurasian peoples, including Jeong et al. (2019), Saag et al. (2019), Sikora et al. (2018), or Flegontov et al. (2019), and I have tried to add up their information on ancestral components and their modern and historical distributions.

Fortunately, the current obsession with simplifying ancestry components into three or four general, atemporal groups, and the common use of the same ones across labs, make it very simple to merge data and map them.

Corded Ware ancestry

There is no doubt about the prevalent ancestry among Uralic-speaking peoples. A map isn’t needed to realize that, because ancient and modern data – like those recently summarized in Jeong et al. (2019) – prove it. But maps sure help visualize their intricate relationship better:

natural-modern-srubnaya-ancestry
Natural neighbor interpolation of Srubnaya ancestry among modern populations. See full map.
kriging-modern-srubnaya-ancestry
Kriging interpolation of Srubnaya ancestry among modern populations. See full map

Interestingly, the regions with higher Corded Ware-related ancestry are in great part coincident with (pre)historical Finno-Ugric-speaking territories:

uralic-languages-modern
Modern distribution of Uralic languages, with ancient territory (in the Common Era) labelled and delimited by a red line. For more information on the ancient territory see here.

Edit (29/7/2019): Here is the full Steppe_MLBA ancestry map, including Steppe_MLBA (vs. Indus Periphery vs. Onge) in modern South Asian populations from Narasimhan et al. (2018), apart from the ‘Srubnaya component’ in North Eurasian populations. ‘Dummy’ variables (with 0% ancestry) have been included to the south and east of the map to avoid weird interpolations of Steppe_MLBA into Africa and East Asia.

modern-steppe-mlba-ancestry2
Natural neighbor interpolation of Steppe MLBA-like ancestry among modern populations. See full map.

Anatolia Neolithic ancestry

Also interesting are the patterns of non-CWC-related ancestry, in particular the apparent wedge created by expanding East Slavs, which seems to reflect the intrusion of central(-eastern) European ancestry into Finno-Permic territory.

NOTE. Read more on Balto-Slavic hydrotoponymy, on the cradle of Russians as a Finno-Permic hotspot, and about Pre-Slavic languages in North-West Russia.

natural-modern-lbk-en-ancestry
Natural neighbor interpolation of LBK EN ancestry among modern populations. See full map.
kriging-modern-lbk-en-ancestry
Kriging interpolation of LBK EN ancestry among modern populations. See full map

WHG ancestry

The cline(s) between WHG, EHG, ANE, Nganasan, and Baikal HG are also simplified when some of them excluded, in this case EHG, represented thus in part by WHG, and in part by more eastern ancestries (see below).

modern-whg-ancestry
Natural neighbor interpolation of WHG ancestry among modern populations. See full map.
kriging-modern-whg-ancestry
Kriging interpolation of WHG ancestry among modern populations. See full map.

Arctic, Tundra or Forest-steppe?

Data on Nganasan-related vs. ANE vs. Baikal HG/Ulchi-related ancestry is difficult to map properly, because both ancestry components are usually reported as mutually exclusive, when they are in fact clearly related in an ancestral cline formed by different ancient North Eurasian populations from Siberia.

When it comes to ascertaining the origin of the multiple CWC-related clines among Uralic-speaking peoples, the question is thus how to properly distinguish the proportions of WHG-, EHG-, Nganasan-, ANE or BaikalHG-related ancestral components in North Eurasia, i.e. how did each dialectal group admix with regional groups which formed part of these clines east and west of the Urals.

The truth is, one ought to test specific ancient samples for each “Siberian” ancestry found in the different Uralic dialectal groups, but the simplistic “Siberian” label somehow gets a pass in many papers (see a recent example).

Below qpAdm results with best fits for Ulchi ancestry, Afontova Gora 3 ancestry, and Nganasan ancestry, but some populations show good fits for both and with similar proportions, so selecting one necessarily simplifies the distribution of both.

Ulchi ancestry

modern-ulchi-ancestry
Natural neighbor interpolation of Ulchi ancestry among modern populations. See full map.
kriging-modern-ulchi-ancestry
Kriging interpolation of Ulchi ancestry among modern populations. See full map.

ANE ancestry

natural-modern-ane-ancestry
Natural neighbor interpolation of ANE ancestry among modern populations. See full map.
kriging-modern-ane-ancestry
Kriging interpolation of ANE ancestry among modern populations. See full map.

Nganasan ancestry

modern-nganasan-ancestry
Natural neighbor interpolation of Nganasan ancestry among modern populations. See full map.
kriging-modern-nganasan-ancestry
Kriging interpolation of Nganasan ancestry among modern populations. See full map.

Iran Chalcolithic

A simplistic Iran Chalcolithic-related ancestry is also seen in the Altaic cline(s) which (like Corded Ware ancestry) expanded from Central Asia into Europe – apart from its historical distribution south of the Caucasus:

modern-iran-chal-ancestry
Natural neighbor interpolation of Iran Neolithic ancestry among modern populations. See full map.
kriging-modern-iran-neolithic-ancestry
Kriging interpolation of Iran Chalcolithic ancestry among modern populations. See full map.

Other models

The first question I imagine some would like to know is: what about other models? Do they show the same results? Here is the simplistic combination of ancestry components published in Damgaard et al. (2018) for the same or similar populations:

NOTE. As you can see, their selection of EHG vs. WHG vs. Nganasan vs. Natufian vs. Clovis of is of little use, but corroborate the results from other papers, and show some interesting patterns in combination with those above.

EHG

damgaard-modern-ehg-ancestry
Natural neighbor interpolation of EHG ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-ehg-ancestry
Kriging interpolation of EHG ancestry among modern populations. See full map.

Natufian ancestry

damgaard-modern-natufian-ancestry
Natural neighbor interpolation of Natufian ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-natufian-ancestry
Kriging interpolation of Natufian ancestry among modern populations. See full map.

WHG ancestry

damgaard-modern-whg-ancestry
Natural neighbor interpolation of WHG ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-whg-ancestry
Kriging interpolation of WHG ancestry among modern populations. See full map.

Baikal HG ancestry

damgaard-modern-baikalhg-ancestry
Natural neighbor interpolation of Baikal hunter-gatherer ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-baikal-hg-ancestry
Kriging interpolation of Baikal HG ancestry among modern populations. See full map.

Ancient North Eurasians

Once the modern situation is clear, relevant questions are, for example, whether EHG-, WHG-, ANE, Nganasan-, and/or Baikal HG-related meta-populations expanded or became integrated into Uralic-speaking territories.

When did these admixture/migration events happen?

How did the ancient distribution or expansion of Palaeo-Arctic, Baikalic, and/or Altaic peoples affect the current distribution of the so-called “Siberian” ancestry, and of hg. N1a, in each specific population?

NOTE. A little excursus is necessary, because the calculated repetition of a hypothetic opposition “N1a vs. R1a” doesn’t make this dichotomy real:

  1. There was not a single ethnolinguistic community represented by hg. R1a after the initial expansion of Eastern Corded Ware groups, or by hg. N1a-L392 after its initial expansion in Siberia:
  2. Different subclades became incorporated in different ways into Bronze Age and Iron Age communities, most of which without an ethnolinguistic change. For example, N1a subclades became incorporated into North Eurasian populations of different languages, reaching Uralic- and Indo-European-speaking territories of north-eastern Europe during the late Iron Age, at a time when their ancestral origin or language in Siberia was impossible to ascertain. Just like the mix found among Proto-Germanic peoples (R1b, R1a, and I1)* or among Slavic peoples (I2a, E1b, R1a)*, the mix of many Uralic groups showing specific percentages of R1a, N1a, or Q subclades* reflect more or less recent admixture or acculturation events with little impact on their languages.

*other typically northern and eastern European haplogroups are also represented in early Germanic (N1a, I2, E1b, J, G2), Slavic (I1, G2, J) and Finno-Permic (I1, R1b, J) peoples.

ananino-culture-new
Map of archaeological cultures in north-eastern Europe ca. 8th-3rd centuries BC. [The Mid-Volga Akozino group not depicted] Shaded area represents the Ananino cultural-historical society. Fading purple arrows represent likely stepped movements of subclades of haplogroup N for centuries (e.g. Siberian → Ananino → Akozino → Fennoscandia [N-VL29]; Circum-Arctic → forest-steppe [N1, N2]; etc.). Blue arrows represent eventual expansions of Uralic peoples to the north. Modified image from Vasilyev (2002).

The problem with mapping the ancestry of the available sampling of ancient populations is that we lack proper temporal and regional transects. The maps that follow include cultures roughly divided into either “Bronze Age” or “Iron Age” groups, although the difference between samples may span up to 2,000 years.

NOTE. Rough estimates for more external groups (viz. Sweden Battle Axe/Gotland_A for the NW, Srubna from the North Pontic area for the SW, Arctic/Nganasan for the NE, and Baikal EBA/”Ulchi-like” for the SE) have been included to offer a wider interpolated area using data already known.

Bronze Age

Similar to modern populations, the selection of best fit “Siberian” ancestry between Baikal HG vs. Nganasan, both potentially ± ANE (AG3), is an oversimplification that needs to be addressed in future papers.

Corded Ware ancestry

bronze-age-corded-ware-ancestry
Natural neighbor interpolation of Srubnaya ancestry among Bronze Age populations. See full map.

Nganasan-like ancestry

bronze-age-nganasan-like-ancestry
Natural neighbor interpolation of Nganasan-like ancestry among Bronze Age populations. See full map.

Baikal HG ancestry

bronze-age-baikal-hg-ancestry
Natural neighbor interpolation of Baikal Hunter-Gatherer ancestry among Bronze Age populations. See full map.

Afontova Gora 3 ancestry

bronze-age-afontova-gora-ancestry
Natural neighbor interpolation of Afontova Gora 3 ancestry among Bronze Age populations. See full map.

Iron Age

Corded Ware ancestry

Interestingly, the moderate expansion of Corded Ware-related ancestry from the south during the Iron Age may be related to the expansion of hg. N1a-VL29 into the chiefdom-based system of north-eastern Europe, including Ananyino/Akozino and later expanding Akozino warrior-traders around the Baltic Sea.

NOTE. The samples from Levänluhta are centuries older than those from Estonia (and Ingria), and those from Chalmny Varre are modern ones, so this region has to be read as a south-west to north-east distribution from the Iron Age to modern times.

iron-age-corded-ware-ancestry
Natural neighbor interpolation of Srubnaya ancestry among Iron Age populations. See full map.

Baikal HG-like ancestry

The fact that this Baltic N1a-VL29 branch belongs in a group together with typically Avar N1a-B197 supports the Altaic origin of the parent group, which is possibly related to the expansion of Baikalic ancestry and Iron Age nomads:

iron-age-baikal-ancestry
Natural neighbor interpolation of Baikal HG ancestry among Iron Age populations. See full map.

Nganasan-like ancestry

The dilution of Nganasan-like ancestry in an Arctic region featuring “Siberian” ancestry and hg. N1a-L392 at least since the Bronze Age supports the integration of hg. N1a-Z1934, sister clade of Ugric N1a-Z1936, into populations west and east of the Urals with the expansion of Uralic languages to the north into the Tundra region (see here).

The integration of N1a-Z1934 lineages into Finnic-speaking peoples after their migration to the north and east, and the displacement or acculturation of Saami from their ancestral homeland, coinciding with known genetic bottlenecks among Finns, is yet another proof of this evolution:

iron-age-nganasan-ancestry
Natural neighbor interpolation of Nganasan ancestry among Iron Age populations. See full map.

WHG ancestry

Similarly, WHG ancestry doesn’t seem to be related to important population movements throughout the Bronze Age, which excludes the multiple North Eurasian populations that will be found along the clines formed by WHG, EHG, ANE, Nganasan, Baikal HG ancestry as forming part of the Uralic ethnogenesis, although they may be relevant to follow later regional movements of specific populations.

iron-age-whg-ancestry
Natural neighbor interpolation of WHG ancestry among Iron Age populations. See full map.

Conclusion

It seems natural that people used to look at maps of haplogroup distribution from the 2000s, coupled with modern language distributions, and would try to interpret them in a certain way, reaching thus the wrong conclusions whose consequences are especially visible today when ancient DNA keeps contradicting them.

In hindsight, though, assuming that Balto-Slavs expanded with Corded Ware and hg. R1a, or that Uralians expanded with “Siberian” ancestry and hg. N1a, was as absurd as looking at maps of ancestry and haplogroup distribution of ancient and modern Native Americans, trying to divide them into “Germanic” or “Iberian”…

The evolution of each specific region and cultural group of North Eurasia is far from being clear. However, the general trend speaks clearly in favour of an ancient, Bronze Age distribution of North Eurasian ancestry and haplogroups that have decreased, diluted, or become incorporated into expanding Uralians of Corded Ware ancestry, occasionally spreading with inter-regional expansions of local groups.

Given the relatively recent push of Altaic and Indo-European languages into ancestral Uralic-speaking territories, only the ancient Corded Ware expansion remains compatible with the spread of Uralic languages into their historical distribution.

Related

European hydrotoponymy (IV): tug of war between Balto-Slavic and West Uralic

germanic-balto-slavic-expansion

In his recent paper on Late Proto-Indo-European migrations, when citing Udolph to support his model, Frederik Kortlandt failed to mention that the Old European hydrotoponymy in northern Central-East Europe evolved into Baltic and Slavic layers, and both take part in some Northern European (i.e. Germanic – Balto-Slavic) commonalities.

Proto-Slavic

From Expansion slavischer Stämme aus namenkundlicher und bodenkundlicher sicht, by Udolph, Onomastica (2016), translated into English (emphasis mine):

NOTE. An archived version is available here. The DOI references for Onomastica do not work.

(…) there is a clear center of Slavic names in the area north of the Carpathians. Among them are root words of the Slavic languages such as reka / rzeka, potok u. a. m.

Even more important than this mapping is the question of how the dispersion of ancient Slavic names happened. What is meant by ancient Slavic names? I elaborated on this in this journal years ago (Udolph, 1997):

(1)Ancient suffixes that are no longer productive today.

This clearly includes Slavic *-(j)ava as in Vir-ava, Vod-ava, Il-ava, Glin-iawa, Breg-ava, Ljut-ava, Mor-ava, Orl-java among others. It has clear links to the ancient common Indo-European language (Lupawa, Morava-March-Moravia, Orava, Widawa). They have a center north of the Carpathians.

ava-slavic

(2) Unproductive appellatives (water words), which have disappeared from the language, are certain witnesses of ancient Slavic settlements. A nice example of this is Ukr. bahno, Pol. bagno ‘swamp, bog, morass’ etc. The word has long been missing in South Slavic, although it appears in South Slavic names, but only in very specific areas (see Udolph, 1979, pp. 324-336).

(3) Names that go back to different sound shifts. [Examples:]

  • (…) the Slavic clan around Old Sorbian brna ‘feces, earth’, Bulgarian OCS brьnije ‘feces, loam’, OCS brъna ‘feces’, Slovenian brn, ‘river mud’, etc. is solved with the inclusion of onomastic materials (Udolph, 1979, p. 499-514). (…) Toponymic mapping shows important details.
  • bryn-slavic
    Karte 4. brъn < *brŭn und bryn- < *brūn- in slavischen Namen
  • (…)We also have an ablauting *krŭn-:*krūn- in front of us. Map 5 shows the distribution of both variants in Slavic names.
  • The next case is quite similar. It concerns Russ. appellative grjaz’ ‘dirt, feces, mud’, (…) for which an Old Slavic form *gręz exists. Slavic also knows the ablauting variant *grǫz.

    These maps (see Map 6, p. 222) show that a homeland of Slavic tribes can only be inferred north of the Carpathians.

    (4) Place-names formed by Slavic suffixes of Pre-Slavic nature, i.e. derived from Old European hydronyms.

    (a) The largest river in Poland, the Wisła, German Vistula, bears a clearly Pre-Slavic name, no matter how one explains it (Babik, 2001, pp. 311-315; Bijak, 2013, p. 34, Udolph, 1990 , Pp. 303-311).

    (b) With the same suffix are formed Sanok, place on the southwest of Przemyśl; Sanoka, a no longer known waters name, 1448 as fluvium Szanoka, near the place Sanoka and with a diminutive suffix -ok- a tributary of the Sanok, which is called Sanoczek (for details see Udolph, 1990, pp. 264-270; Rymut / Majtan, 1998, p. 222). The San also has a single-language name, but that does not change anything about the right etymology. The suffix variant -očь also includes Liwocz and Liwoczka, river names near Cracow; also a mountain range of the Beskydy is mentioned at Długosz as Lywocz.

    According to the opinion of the “Słownik prasłowiański” (Sławski (red.), 1974, p. 92), the suffix -ok- represents a Proto-Slavic archaism. It appears, for example, in sъvědokъ, snubokъ, vidokъ, edok, igrok, inok among others, but its antiquity also shows, among other things, that it started at archaic athematic tribes.

    east-slavic-language-expansion
    Mapping of older and younger East Slavic place-names and translation into settlement evolution.

    Slavonic Urheimat

    If we apply this to the loess distribution in western Ukraine and south-eastern Poland, it is very noticeable that the center of the Old Slavic place names lies in the area where loess dispersal is gradually “frayed out”, i.e. for example, in the area west of Kiev between Krakow in the west and Winnycja and Moldavia in the east. In short, the distribution of good soils coincides with ancient Slavic names. If that is correct, we can expect a homeland in the Pre-Carpathian region, or better, a core landscape of Slavic settlement.

    The existence of Pre-Slavic Indo-European place names and water names whose structure indicates that they originated from an Indo-European basis, but then also developed Slavic peculiarities, can now – as stated above – only be understood to mean that the language group that we call today Slavic emerged in a century-long process from an Indo-European dialectal area.

    Loess areas between Poland and Ukraine. Image from Jary et al. (2018).

    From a genetic point of view, the scarce data published to date show a clear shift of central-east populations from more Corded Ware-like groups in the EBA towards more BBC-derived ancestry in the common era, to the point where ancient DNA samples from East Germany, Poland and Lithuania evolve from clustering between Corded Ware and Sub-Neolithic peoples to clustering close to Bell Beaker-derived groups, such as West Germanic peoples, Tollense samples, etc. (see below)

    Furthermore, sampled Early Slavs show bottlenecks under “Dinaric” I2a-L621 and central-eastern E1b-V13, which – in combination with the known phylogeography of Únětice and Urnfield – is compatible with its late expansion from a central-east European Slavonic homeland, such as the Pomeranian culture, in turn likely derived from Lusatian culture groups.

    This doesn’t preclude a more immediate expansion of Common Slavic in Antiquity closer to the northern Carpathians, which is also supported by the available Early Slavic sampling, apart from samples from the Avar and Hungarian polities.

    pca-balto-slavic-iron-age
    Likely Baltic (yellow-green) and Slavic (orange) groups ca. 500 AD on, with Finnic (cyan) and Mordvinic (blue) groups roughly divided through hydrotoponymy line ca. 1000 AD Top Left: Late Iron Age cultures. Top right: PCA of groups from the Iron Age to the Middle Ages. Y-DNA haplogroups during the Germanic migrations (Bottom left) and during the Middle Ages (Bottom right). Notice a majority non-R1a lineages among sampled Early Slavs. See full maps and PCAs.

    Proto-Baltic / Proto-Slavic

    Northern European hydronymy

    From Alteuropäische Hydronymie und urslavische Gewässernamen, by Udolph, Onomastica (1997), translated into English (emphasis mine):

    NOTE. An HTML version is available at Jurgen Udolph’s personal site.

    Because of the already striking similarities as the well-known “-m-case”, the number-words for ‘1000’, ’11’ and ’12’ and so on, J. Grimm had already assumed a close relationship between Germanic and Baltic and Slavic. (…)

    In my own search, I approached this trinity from the nomenclature side. In doing so, I noticed some name groups that can speak for a certain common context:

    1.* bhelgh-, *bholgh-.

    Map 10, p. 64, shows that a root * bhelgh- occurs in the name material of a region from which later Germanic, Baltic and Slavic originated. The Balkans play no role in this.

    bholgh-germanic-balto-slavic

    2. *dhelbh-, *dholbh-, *dhl̥bh-

    The proof of the three ablauting * dhelbh, * dholbh, * dhl̥bh- within a limited area shows the close relationship that this root has with the Indo-European basis. Again it is significant in which area the names meet (…)

    dhelbh-germanic-balto-slavic

    3. An Indo-European root extension *per-s- with the meaning ‘spray, splash, dust, drop’ is detectable in several languages (…). From a Baltic-Slavic-Germanic peculiarity cannot therefore be spoken from the toponymic point of view. The picture changes, however, if one includes the derived water names.

    4. The root extension *pel-t-, *pol-t-, *pl̥-t- of a tribe widely spread in the Indo-European languages around *pel-, pol- ‘pour, flow, etc.’, whose reflexes are found Armenian through Baltic and Slavic to the Celtic area, is found in the Baltic toponymy, cf. Latv. palts, palte ‘puddle, pool’.

    trzciniec-riesenbecher-culture
    The dynamics of stylistic changes of the form of the “Trzciniec pot” in the lowland regions of Central Europe, and spreading routes of the Trzciniec package in Central Europe. A good proxy for contacts through the Northern European Plain during the Early Bronze Age. Modified from Czebreszuk (1998).

    Early Balto-Finnic

    In order to properly delimit (geographically and chonologically) the Proto-Baltic and Proto-Slavic expansions, it is necessary to understand where the late Balto-Finnic homeland was located during the Bronze Age. The following are excerpts from the comprehensive hydrotoponymic study by Pauli Rahkonen (2013):

    In any case, Finnic probably had its origin somewhere around the Gulf of Finland. Names of large and central rivers such as Vuoksi (< Finnic vuo ‘stream’) and Neva (< Finnic neva ‘marsh, river’) must be very old and might represent Proto-Finnic hydronyms. In the southern coastal area of Finland, the names Kymi and Nietoo < *Niet|oja (id. later Porvoonjoki) may also be of Finnic origin and derive from, respectively, kymi ‘stream’ (see SSA I s.v. *kymi; see however SPK s.v. Kemijärvi; Rahkonen 2013: 24) and nieto(s) ‘heap of snow’ (SSA II s.v. nietos), in hydronyms probably ‘high (snowy?) banks of a river’. Mustion|joki is clearly a Finnish name < *must|oja ‘black river’. The river name Vantaa remains somewhat obscure, although Nissilä (see SPK s.v. Vantaanjoki) has derived it from the Finnic word vana ‘water route’. In western Finland the names of large rivers, such as Aura and Eura, are supposedly of Germanic origin (Koivulehto 1987).

    In Estonia the names of many of the most important rivers might be of Finnic origin: e.g. Ema|jõgi Est. ema ‘mother’ [Tartu district] (?? cf. the Lake Piiga|ndi < Est. piiga ‘maiden’), Pärnu [Pärnu district] < Est. pärn ‘linden’, Valge|jõgi [Loksa district] < Est. valge ‘white’, Must|jõgi [Võru district] < Est. must ‘black’. It is possible that Emajogi and especially Piigandi are the result of later folk etymologizing of a name with some unknown origin. However, as a naming motif there exist in Finland numerous toponyms with the stems Finnic *emä (e.g. 3 Emäjoki), *neit(V)- ‘maiden’ (e.g. Neitijärvi, Neittävänjoki, Neittävänjärvi) and Saami stems that can be derived from Proto Saami *nejte̮ ‘id’ (GT2000; NA).

    finnic-toponyms
    The historical southern boundary of Finnic hydronyms, excluding hydronyms produced by the Karelian refugees of the 17th century.

    These seemingly very old names of relatively large rivers in southern Finland, modern Leningrad oblast and Estonia support the hypothesis that Proto-Finnic was spoken for a long time on both sides of the Gulf of Finland and it thus basically corresponds to the hypothesis of Terho Itkonen (see below). In the Novgorod, Tver or Vologda oblasts of Russia, Finnic names for large rivers cannot be found (Rahkonen 2011: 229). For this reason, it is likely that the Late Proto-Finnic homeland was the area around the Gulf of Finland.

    Beyond the southeastern boundary of the modern or historically known Finnic-speaking area, there exists a toponymic layer belonging to the supposedly non-Finnic Novgorodian Čudes (see Rahkonen 2011). In theory it is possible that Proto-Finnic and Proto-Čudian separated from each other at an early stage or it is even possible that Proto-Čudian was identical with Proto-Finnic. However, this cannot be proven, because there is not enough material available describing what Novgorodian Čudic was like exactly.

    finno-saamic-mordvin
    Yakhr-, -khra, yedr-, -dra and yer-/yar, -er(o), -or(o) names of lakes in Central and North Russia and the possible boundary of the proto-language words *jäkra/ä and *järka/ä. Rahkonen (2013)

    A summary of the data is then:

    • The Daugava River and the Gulf of Livonia formed the most stable south-western Balto-Finnic border (up until ca. 1000 AD): the Daugava shows a likely Indo-European etymology, while some of its tributaries are best explained as derived from Uralic.
    • The first layer of “Early Baltic” loans in Early Balto-Finnic are of a non-attested Baltic dialect closest to Proto-Balto-Slavic (read more about this early layer).
    • The latest samples of the Trzciniec culture (or derived Iron Age group) from its easternmost group in Turlojiškė (ca. 1000-800 BC?) show a western shift towards Bell Beaker, although they show a majority of hg. R1a-Z280; while the earliest sample from Gustorzyn (ca. 1900 BC), likely from Trzciniec/Iwno, from the westernmost area of the culture, shows a Corded Ware-like ancestry (and hg. R1a-Z280, likely S24902+) among a BA sampling from Poland clearly derived from Bell Beaker groups.

    One can therefore infer that the expansion of the Trzciniec culture – as the earliest expansion of central-west European peoples into the Baltic after the Bell Beaker period – represented either the whole disintegrating Balto-Slavic community, or at least an Early Baltic-speaking community expanding from the West Baltic area to the east.

    The similarity of Early Slavs and the Trzciniec outlier with the Czech BA cluster, formed by samples from Bohemia (ca. 2200–1700 BC), and the varied haplogroups found among Early Slavs – reminiscent of the variability of the Unetice/Urnfield sampling – may help tentatively connect the early Proto-Slavic homeland more strongly with a Proto-Lusatian community immediately to the south-west of the Iwno/Proto-Trzciniec core.

    pca-late-bronze-age-balto-slavic-finnic
    Top Left:Likely Baltic, Slavic, and Balto-Finnic-speaking territories (asynchronous), overlaid over Late Bronze Age cultures. Balto-Slavic in green: West(-East?) Baltic (B1), unattested early Baltic (B2), and Slavic (S). Late Balto-Finnic (F) in cyan. In red, Tollense and Turlojiškė sampling. Dashed black line: Balto-Slavic/West Uralic hydrotoponymy border until ca. 1000 AD. Top right: PCA of groups from the Early Bronze Age to the Late Bronze Age. Marked are Iwno/Pre-Trzciniec of Gustorzyn (see below), Late Trzciniec/Iron Age samples from Turlojiškė, and in dashed line approximate extent of Tollense cluster; Y-DNA haplogroups during the Late Bronze Age (Bottom left) and during the Early Iron Age (Bottom right). Notice a majority non-R1a lineages among sampled Early Slavs. See full maps and PCAs.

    Proto-Balto-Slavic homeland

    Disconnected western border: Germanic

    The common Balto-Slavic – Germanic community must necessarily be traced back to the West Baltic. From Udolph’s Namenkundliche Studien zum Germanenproblem, de Gruyter (1994), translated from German (emphasis mine):

    My work [Namenkundliche Studien zum Germanenproblem] has shown how strong the Germanic toponymy is related to the East, less to Slavic, much more to Baltic. It confirms the recent thesis by W.P. Schmid on the special relationship Germanic and Baltic, according to which “the formation of the typical Germanic linguistic characteristics…must have taken place in the neighborhood of Baltic“.

    If one starts from a Germanic core area whose eastern boundary is to be set on the middle Elbe between the Erzgebirge and Altmark, there are little more than 400 km. to the undoubtedly Baltic settlement area east of the Vistula. Stretching the Baltic area westwards over the Vistula (as far as the much-cited Persante), the distance is reduced to less than 300 km. Assuming further that Indo-European tribes between the developing Germanic and the Baltic groups represent the connection between the two language groups, so can one understand well the special relationship proposed by W.P. Schmid between Germanic and Baltic. In an earlier period shared Slavic evidently the same similarities (Baltic-Slavic-Germanic peculiarities).

    balto-slavic-balto-finnic-homeland
    Top: Palaeo-Germanic (G2, blue area), Proto-Balto-Slavic/Pre-Baltic (PBSL, green area) and Early Proto-Balto-Finnic (PBF, cyan area) homelands superimposed over Early Bronze Age cultures. Persante hydronym and Gustorzyn ancient DNA sample location marked. Y-DNA haplogroups during the Early Bronze Age (Bottom left) and during the Middle Bronze Age (Bottom right). Notice a mix of R1b-L151 samples from the west and the process of integration of R1a-Z645 lineages from the the north-east. See full maps and PCAs.

    Substrate and immediate eastern border: Early Balto-Finnic

    While Balto-Finnic shows a late Balto-Slavic adstrate, Balto-Slavic has a Balto-Finnic(-like) substrate, also found later in Baltic and Slavic, which implies that Balto-Slavic (and later Baltic and Slavic) replaced the language of peoples who spoke Balto-Finnic(-like) languages, influencing at the same time the language of neighbouring peoples, who still spoke Balto-Finnic (or were directly connected to the Balto-Finnic community).

    For more on this relative chronology in Balto-Slavic – Balto-Finnic contacts, see e.g. the recent posts on Kallio (2003), Olander (2019), or a summary of this substrate.

    While Rahkonen (2013) entertains Parpola’s theory of a West-Uralic-speaking Netted Ware area (ca. 1900-500 BC), due to the Uralic-like hydrotoponymy of its territory, he also supports Itkonen’s idea of the ancient presence of almost exclusively Balto-Finnic place and river names in the Eastern Baltic and the Gulf of Finland since at least the Corded Ware period, due to the lack of Indo-European layers there:

    NOTE. This idea was also recently repeated by Kallio (2015), who can’t find a non-Uralic layer of hydrotoponymy in Balto-Finnic-speaking areas.

    It should be observed that the territory between the historical Finnic and Mordvin-speaking areas matches quite well with the area of the so-called Textile Ceramics [circa 1900–800 BC] (cf. Parpola 2012: 288). The culture of Textile Ceramics could function as a bridge between these two extreme points. Languages that were spoken later in this vast territory between Finland–Estonia and Mordovia seem to derive from Western Uralic (WU) as well. I have called those languages Meryan-Muroma, Eastern and Western Čudian and an unknown “x” language spoken in inland Finland, Karelia and the Lake Region of the Russian North (Rahkonen 2011; 241; 2012a: 19–27; 2013: 5– 43). This might mean that the territory of the Early Textile Ceramics reflects to some extent the area of late Western Uralic.

    The archaeologically problematic area is Estonia, Livonia and Coastal Finland – the area traditionally assumed to have been populated by the late Proto-Finns. The Textile Ceramics culture was absent there. It is very difficult to believe that the Textile Ware population in inland Finland migrated or was even the main factor bringing the Pre- or Early Proto-Finnic language to Estonia or Livonia. There are no archaeological or toponymic signs of it. Therefore, I am forced to believe that Textile Ceramics did not bring Uralic-speaking people to those regions. This makes it possible, but not absolutely proven, to assume that some type of Uralic language was spoken in the region of the Gulf of Finland already before Textile Ceramics spread to the northwest (circa 1900 BC).

    corded-ware-west-uralic
    Top Left: Corded Ware culture expansion. Top right: PCA of Corded Ware and Sub-Neolithic groups. Y-DNA haplogroups during the Corded Ware expansion (Bottom left) and during the subsequent Bell Beaker expansion (Bottom right). Notice the rapid population replacement of typical Corded Ware R1a-Z645 lineages by expanding Bell Beakers of hg. R1b-L23 in central-east Europe, while they show continuity in the described ancestral Fennoscandian West-Uralic-speaking territory. See full maps and PCAs.

    The Corded Ware population in Finland is thought to have been NW Indo-European by many scholars (e.g. Koivulehto 2006: 154–155; Carpelan & Parpola 2001: 84). At least, it is probable that the Corded Ware culture was brought to Finland by waves of migration, because the representatives of the former Late Comb Ceramics partially lived at the same time side by side with the Corded Ware population. However, it is possible that the immigrants were a population that spoke Proto-Uralic, who had adopted the Corded Ware culture from their Indo-European neighbors, possibly from the population of the Fatjanovo culture, e.g. in the Valdai region. This was suggested by Terho Itkonen (1997: 251) as well. In that case the population of the Typical and Late Comb Ceramics may have spoken some Paleo European language (see Saarikivi 2004a). In the Early Bronze Age, the Baltic Pre-Finnic language that I have suggested must have been very close to late WU and therefore no substantial linguistic differences existed between the Baltic Pre-Finns and the population of Textile Ceramics in inland Finland. I admit that this model is difficult to prove, but I have presented it primarily in order to offer new models of thinking.16 At least, there is no archaeological or linguistic reason against this idea.

    This dubitative attribution of Proto-Uralic to the expansion of Corded Ware groups in eastern Europe, which is what hydrotoponymic data suggests in combination with archaeology, has to be understood as a consequence of how striking Rahkonen finds the results of his research, despite Itkonen’s previous proposal, in the context of an overwhelming majority of Indo-Europeanists who, until very recently, simplistically associated Corded Ware with the Indo-European expansion.

    Conclusion

    Even Kortlandt accepts at this point the identification of expanding East Bell Beakers from the Carpathian Basin as those who left the Alteuropäische layer reaching up to the Baltic. However, he identified Udolph’s data solely with West Indo-European, forgetting to mention the commonly agreed upon western Proto-Balto-Slavic homeland, most likely because it contradicts two of his main tenets:

    1. that Balto-Slavic split from a hypothetical Indo-Slavonic (i.e. Satem) group expanding from the east; and
    2. that laryngeals can be reconstructed for Balto-Slavic – unlike for North-West Indo-European.
    old-european-asian-hydro-toponymy
    Indo-European hydrotoponymy in Europe and the Middle East (scarce Central Asian data). Baltic data compensated, statistical method RBF: intermediate regions devoid of Indo-European toponyms are inferred to have them; it compensates thus e.g. for the scarce Indo-European hydrotoponyms in Poland by assuming ‘soft’ continuity from West Germany to the Baltic.

    A hypothetic “Pre-Indo-Slavonic” laryngeal Indo-European layer reaching Fennoscandia and the Forest Zone with Corded Ware is fully at odds with all known data:

    • in comparative grammar, since the one feature that characterizes Graeco-Aryan is precisely its set of innovations relative to Northern Indo-European, which presupposes a longer contact (and further laryngeal loss) once Tocharian and North-West Indo-European had separated – hence probably represented by Palaeo-BalkanCatacomb-Poltavka contacts once Afanasevo and Yamna settlers from the Carpathian Basin / East Bell Beakers had become isolated;
    • in hydrotoponymy, because of the prehistoric linguistic areas that can be inferred from (1) the distribution of Old European hydrotoponymy; (2) Udolph’s work on Germanic and the likely non-Indo-European substrate in Scandinavia and land contacts with Balto-Finnic; (3) from the Northern European traits in the Northern European Plain; or (4) from the decreasing proportion of Indo-European place and river names from central Europe towards the east and north.
    • NOTE. An alternative explanation of Old European/Balto-Slavic layers, e.g. by a ‘Centum’ Temematic – even if one obviates the general academic rejection to Holzer’s proposal – couldn’t account for the absolute lack of an ancestral layer of Indo-European hydrotoponymy in North-Eastern Europe (i.e. the longest-lasting Corded Ware territory), in sharp contrast with Western Europe, South-Eastern Europe, and South Asia. All of that contradicts an Eastern Indo-European community, even without a need to recall that the oldest hydrotoponymic layers common to Fennoscandia and the Forest Zone are of Uralic nature.

    • in archaeology, because cultural expansions of the Eastern European Early Bronze Age province since the Bell Beaker period (viz. Mierzanowice, Trzciniec, Lusatian, Pomeranian, West Baltic Culture of Cairns) suggest once and again west-east movements, most (if not all) of which – based on the presence of Indo-European speakers during the common era – were likely associated with Indo-European-speaking communities replacing or displacing previous ones.
    • in palaeogenomics, because of the late and different association of Corded Ware ancestry and haplogroups among Balto-Slavic and Indo-Iranian communities, in turn corresponding to the different satemization processes found in both dialects, which may have actually been related to the Uralic substrate that is found in both (read more on Uralic influences on Balto-Slavic and on Indo-Iranian).

    On the other hand, a careful combination of Uralic and Indo-European comparative grammar, hydrotoponymic data, and population genomics fits perfectly well Itkonen’s and Rahkonen’s association of Corded Ware in Eastern Europe with Uralic languages, as well as the traditional mainstream view of Uralic before Indo-European in Fennoscandia and in the Forest Zone, as I explained in a recent post about genetic continuity in the East Baltic area.

    Population genomics is not the main reason to reject the Indo-European Corded Ware theory – or any other prehistoric ethnolinguistic identification, for that matter. It can’t be. This new field offers just the occasional confirmation of a well-founded theory or, alternatively, another nail in the coffin of fringe theories that were actually never that likely, but seemed impossible to fully dismiss on purely theoretical grounds.

    The problem with Corded Ware was that we couldn’t see how unlikely its association with Indo-European languages was until we had ancient DNA to corroborate archaeological models, because few (if any) Indo-Europeanists really cared about the linguistic prehistory of eastern and northern Europe, or about Uralic languages in general (contrary to the general trend among Uralicists to be well-versed in Indo-European studies). Now they will.

    Related

    Volosovo hunter-gatherers started to disappear earlier than previously believed

    volosovo-corded-ware

    Recent paper (behind paywall) Marmot incisors and bear tooth pendants in Volosovo hunter-gatherer burials. New radiocarbon and stable isotope data from the Sakhtysh complex, Upper-Volga region, by Macānea, Nordqvist, and Kostyleva, J. Archaeol. Sci. (2019) 26:101908.

    Interesting excerpts (emphasis mine):

    The Sakhtysh micro-region is located in the Volga-Oka interfluve, along the headwaters of the Koyka River in the Ivanovo Region, central European Russia (Fig. 1). The area has evidence of human habitation from the Early Mesolithic to the Iron Age, and includes altogether 11 long-term and seasonal settlements (Sakhtysh I–II, IIa, III–IV, VII–XI, XIV) and four artefact scatters (sites V–VI, XII–XIII), in addition to which burials have been detected at five sites (I–II, IIa, VII, VIII) (Kostyleva and Utkin, 2010). The locations have been known since the 1930s and intensively studied since the 1960s under the leadership of D.A. Kraynov, M.G. Zhilin, E.L. Kostyleva, and A.V. Utkin.

    Sakhtysh II and IIa are the most extensively studied sites of the complex, with ca. 1500m2 and around 800m2 excavated, respectively. The burial grounds at both sites are considered as fully investigated.

    volosovo-sakhtysh-dates
    AMS datings from the sites Sakhtysh II and IIa. Sampled contexts are given in parentheses (burial/hoard), “crust” indicates samples of charred organic
    residues on pottery from cultural layer. For data, see Tables 1–2.

    Sakhtysh chronology

    The AMS dates do not support the previously proposed phasing of the Sakhtysh burials to early (4750–4375 BP/3600–3000 cal BCE), late (or developed; 4375–4000 BP/3000–2500 cal BCE), and final (4000–3750 BP/2500–2200 cal BCE): the early and late burials at Sakhtysh IIa do not stand out as two separate groups, and also the burials and hoards from Sakhtysh II, connected to the final phase, are temporally overlapping with these. Neither the use sequence, where the settlement and burial phases are non-overlapping and also complementary between the sites (Kostyleva and Utkin, 2010, 2014), finds support in the present material.

    The AMS datings indicate that the Volosovo people started to bury their dead at Sakhtysh IIa after 3700 cal BCE; dates earlier than this may be affected by FRE or suffer from mixed contexts and poor quality of dates. The present data questions the interpretation that the Sakhtysh IIa cemetery was used without interruptions between 4800 and 4080 BP (Kostyleva and Utkin, 2010), i.e. for a millennium between 3550 and 2600 cal BCE. The AMS dates rather suggest a use period of some centuries only around the mid-4th millennium cal BCE, tentatively 3650–3400 cal BCE. This would also be more realistic considering the number of burials at the site.

    volosovo-sakhtysh-sites
    The core area of Volosovo culture (after Kraynov, 1987) and the sites of the Sakhtysh complex (after Kostyleva and Utkin, 2010). Eurasian map base made with Natural Earth. Illustration: K. Nordqvist.

    Volosovo chronology

    The absolute dating of Volosovo culture was for a long time hampered by the small number of radiocarbon dates (see Kraynov, 1987). Today,>100 datings connected with it can be found in literature (Korolev and Shalapinin, 2010; Chernykh et al., 2011; Nikitin, 2012; Mosin et al., 2014). Unfortunately, the available dates do not form solid grounds for dating the cultural phenomenon, as many of them have quality-related issues, large measurement errors, and ambiguous cultural or physical contexts. Consequently, particular datings may be connected to different cultural phases by different scholars. Finally, a large part of the newly-published datings are obtained through direct dating of potsherds (Kovaliukh and Skripkin, 2007; Zaitseva et al., 2009), and therefore, their cogency must be faced with reservation (see Van der Plicht et al., 2016; Dolbunova et al., 2017).

    The datings connected with Volosovo cover a wide time range between ca. 5500 BP (4400 cal BCE) and ca. 3700 BP (2100 cal BCE). However, datings from secure contexts, with good quality (error ca. 50 years or below) and no probable FRE, place the beginning of Volosovo culture to the first half of the 4th millennium cal BCE, around 3700–3600 cal BCE. This is also supported by the roughly coeval terminal dates given for the preceding Lyalovo (Zaretskaya and Kostyleva, 2011) and Volga-Kama cultures (Lychagina, 2018), as well as the appearance of related neighbouring cultures, for example, in the Kama region (Nikitin, 2012; Lychagina, 2018), the southern forest steppe area (Korolev and Shalapinin, 2014), and north-western Russia and Finland (Nordqvist, 2018). Still, the dating of many of these cultural phases suffers from the same problems as of Volosovo.

    A handful of contested datings place the end of Volosovo culture to the final centuries of the 3rd millennium cal BCE, or even later (Kostyleva and Utkin, 2010; Chernykh et al., 2011; Nikitin, 2012). On the other hand, the new AMS dates indicate that Volosovo activities at Sakhtysh II and IIa ceased before or towards the early 3rd millennium cal BCE; if this reflects the general decline of Volosovo culture must be still confirmed by more dates from Sakhtysh and elsewhere. In this context, the general cultural development must be accounted for. To what extent – if at all – the Volosovo people were present after the arrival of the Corded Ware culture-related Fatyanovo-Balanovo populations? Based on the current, albeit scant and inconclusive radiocarbon data this took place from ca. 2700 cal BCE onwards (Krenke et al., 2013).

    volosovo-fatyanovo-balanovo
    Corded Ware and Comb Ware hunter-gatherer-related populations in north-eastern Europe from ca. 2600 BC. See full map.

    Comments

    One of the interesting genetic papers in the near future will be the one that finally includes samples from Corded Ware groups in the forest zone (i.e. Fatyanovo-Balanovo and Abashevo), which will most likely confirm that they are the origin of the known genetic profile of Central and East Uralic-speaking peoples, seeing how West Uralic peoples show genetic continuity in the East Baltic area, coinciding with the Battle Axe culture.

    Uralicists have come a long way from the 1990s, when the picture of Uralic before Balto-Slavic in the Baltic was already evident, and Uralians were identified with Comb Ware peoples. The linguistic data and relative chronology are still valid, despite the now outdated interpretations of absolute archaeological chronology, as happens with interpretations of Krahe or Villar about Old European.

    As an example, here are some relevant excerpts from Languages in the Prehistoric Baltic Sea Region, by Kallio (2003):

    NOTE. Kallio’s contribution appeared in the book Languages in Prehistoric Europe (2003), which I hold nostalgically close in my Indo-European library (now almost impossible to read fully). It is still one of my preferred books (from those made up of mostly unconnected chunks on European linguistic prehistory), because it contains Oettinger’s essential update of North-West Indo-European common vocabulary, which led us indirectly to our Modern Indo-European project from 2005 on.

    In any case, the Uralic arrival in the region east of the Baltic Sea preceded the Indo-European one (…).

    This theory that the ancestors of Finno-Saamic speakers arrived in the Baltic Sea region earlier than those of Balto-Slavic speakers is still rejected by some scholars (e.g. Napolskikh 1993: 41-44), who claim, for instance, that Finno-Saamic speakers would not have known salmons before they met Balts because the Finno-Saamic word for ‘salmon’ (i.e. *losi) is a borrowing from Baltic. Similarly, one could claim that English speakers would not have known salmons before they met Frenchmen because English salmon is a borrowing from French. In other words, Worter und Sachen are not necessarily borrowed hand in hand. Otherwise, it would not be so easy to explain how many Finnish names of body parts are borrowings from Baltic (e.g. hammas ‘tooth’, kaula ‘neck’, reisi ‘thigh’) and from Germanic (e.g. hartia ‘shoulder’, lantio ‘loin’, maha ‘stomach’).

    A more probative argument is the fact that Balto-Slavic features in Finno-Saamic are mostly lexical ones (i.e. typical superstrate features), where Finno-Saamic features in Balto-Slavic are mostly non-lexical ones (i.e. typical substrate features). Note that there are more Balto-Slavic features in Finnic than in Saamic and more Finno-Saamic features in Baltic than in Slavic. This fact could be explained by presuming that Pre-Saamic was spoken north of the Corded Ware area and Pre-Slavic was spoken south of the Typical Pit-Comb Ware area, whereas Pre-Finnic and Pre-Baltic alone were spoken in the area, where both the Typical Pit-Comb Ware culture (ca. 4000-3600 BC) and the Corded Ware culture (ca. 3200-2300 BC) were situated. This area was most probably bilingual, until Finnic and Baltic won in the north and in the south, respectively.

    As is well-known, the idea of Uralic substrate features in Balto-Slavic is not new (cf. e.g. Pokorny 1936/1968: 181-185). As recent studies (e.g. Bednarczuk 1997) have shown, their density is the most remarkable in the four Balto-Slavic languages spoken in the earlier Pit-Comb Ware area (i.e. Latvian, Lithuanian, Belorussian, Russian). On the other hand, occasional Uralisms in the other Balto-Slavic languages spoken west of the Vistula and south of the Pripyat may rather be considered adstrate features spread from the northeast.

    comb-ware-uralic
    Our beliefs from the 2000s. A hypothetic Uralic Comb Ware distribution before the arrival of a hypothetic North-West Indo-European-speaking Corded Ware. “Generalized distribution of the Pit-Comb Ware cultural complex (Mallory & Adams 1997: 430, Carpelan 1999: 257) and the most probable homelands of Saamic, Finnic, Mordvin, Mari, and Permic.”

    The idea of Indo-European superstrate features in Finnic is not new either (cf. e.g. Posti 1953). As Jorma Koivulehto (1983) has recently shown, the earliest Indo-European loanword stratum in the westernmost Uralic branches alone can be considered Northwest Indo-European and connected with the Corded Ware culture (ca. 3200-2300 BC). Since this layer, there have been continuous contacts between Baltic and Finnic. According to Koivulehto (1990), the following stratum can be called Proto-Balt(o-Slav)ic and dated to the Late Neolithic period (ca. 2300-1500 BC). Note that this Proto-Balt(o-Slav)ic dating agrees with the established ones (cf. e.g. Shevelov 1964: 613-614, Kortlandt 1982: 181), when we remember the fact that archaeologists have also moved their datings back by centuries during the last decades.

    Finally, there is also a Baltic loanword stratum which was not borrowed from the ancestral stage of Latvian, Lithuanian and/or Old Prussian but from some extinct Baltic language or dialect (Nieminen 1957). However, as these words still go back to the early Proto-Finnic stage, they can hardly be dated later than Bronze-Age ( ca. 1500-500 BC). Therefore, we may conclude that they were probably borrowed from a Baltic superstrate, which arrived in the Finnish Gulf area during the Corded Ware period and survived there until the Bronze Age, when it was no longer identical with other Baltic dialects. In any case, as later Baltic loanword strata concern southern Finnic languages alone, we may presume that this ‘North Baltic’superstrate had become extinct.

    The traditional association of Uralic with Volosovo hunter-gatherers doesn’t make sense, since they neither miraculously survived for thousands of years nor mixed for hundreds of years with Corded Ware peoples, so we can now more confidently reject the recent assumption by Carpelan & Parpola that their language was adopted by incoming Fatyanovo, Balanovo and Abashevo groups, to develop into the known Uralic languages (more here). This includes one of the many models of the the Copenhagen group, who simplistically follow “Steppe ancestry” for Indo-Europeannes.

    If one combines the known relative linguistic chronology with the North-West Indo-European hydrotoponymy layer, now more clearly identified as Old Europeans expanding with East Bell Beakers and derived Early Bronze Age groups, I think there is little space left for maneuvering out of the overwhelming evidence for a Uralic homeland in the forest-steppes, linked to the spread of late Sredni Stog/Corded Ware ancestry into north-eastern Europe and beyond the Urals.

    Related

    Genetic continuity among Uralic-speaking cultures in north-eastern Europe

    east-europe-bronze-age

    The recent study of Estonian Late Bronze Age/Iron Age samples has shown, as expected, large genetic continuity of Corded Ware populations in the East Baltic area, where West Uralic is known to have been spoken since at least the Early Bronze Age.

    The most interesting news was that, unexpectedly for many, the impact of “Siberian ancestry” (whatever that actually means) was small, slow, and gradual, with slight increases found up to the Middle Ages, compatible with multiple contact events in north-eastern Europe. Haplogroup N became prevalent among Finnic populations only through late bottlenecks, as research of modern populations have long suggested, and as ancient DNA research hinted since at least 2015.

    I risked to correlate the arrival of chiefs from the south-west with the infiltration of N1c-VL29 subclades during the transition to the Iron Age, coupled with that minimal “Siberian” ancestry (see e.g. here and here). Now we know that the penetration of this non-CW ancestry started, as predicted, in the Iron Age; that it was highly variable in the few samples where it appeared, with ca. 1-4%, while most Iron Age individuals show 0%; and that it was not especially linked to individuals of N1c-Vl29 lineages.

    It is also basically confirmed, based on the (ancient and Modern Swedish) N1c-L550 subclades found among Iron Age Estonians, that N1c-VL29 lineages and the so-called “Siberian” ancestry will be found simultaneously around the Baltic coastal areas, and that different lineages must have suffered later founder effects among Finns, which suggests that these alliances through exogamy brought exactly as much language change in Sweden, Lithuania, or Poland, as they did in the East Baltic region…

    On the other hand, the paper has also shown a potential movement of Corded Ware-derived peoples, if the change from LBA to IA samples is meaningful; in fact, even more Corded Ware-like than Baltic and Estonian BA populations. The exact origin of that movement is difficult to pinpoint, and it may not be related to the arrival of Akozino warrior-traders from the south-east, since theirs seems to be a minor impact proper of elites in a chiefdom system around the Baltic.

    fortified-settlements-lba-ia
    Distribution of fortified settlements (filled circles) and other hilltop sites (empty circles) of the Late Bronze Age and Pre-Roman Iron Ages in the East Baltic region. Tentative area of most intensive contacts between Baltic and Balto-Finnic communities marked with a dashed line. Image modified from (Lang 2016).

    Also suggesting a potential movement is the ‘southern’ shift observed in the West and East Baltic areas, likely showing the arrival of Proto-East Baltic speakers (such as the Trzciniec outlier), as we have already discussed in this blog. The unexpected increase in Corded Ware-like ancestry in the Eastern Baltic, coupled with the expected large continuity of hg. R1a-Z283 in the homeland of Balto-Finnic expansions, gives even more support to the known complex system of exogamy along the Baltic coasts, and offers another potential reason for the rise of Baltic-speaking territories in the West Baltic: elite domination.

    It is nevertheless important to understand that, even among the most “genetic continuous” regions like Estonia, not a single population in Europe is heir of some ancestral, immutable people. Not in terms of haplogroups, and not in terms of admixture. Balto-Finnic speakers, however continuous they might seem (e.g. in Southern Estonians) aren’t an exception.

    After all, this blog was (re)born to fight the currently prevalent sheer stupidity surrounding the simplistic “R1a/steppe ancestry=Indo-European” association, so I wouldn’t like to see it replaced with some other stupid continuity or purity ideas within 10 to 20 years…

    Late Uralic stems from East Corded Ware groups

    With the currently available tools – linguistics, archaeology, and now genetics -, I don’t think there is any argument to date to question the direct connection of the Late Proto-Uralic expansion with all Eastern Corded Ware groups (i.e. Battle Axe, Fatyanovo-Balanovo, and Abashevo), and thus at least with the unifying A-horizon of Corded Ware and the bottlenecks under R1a-Z645.

    NOTE. The only out-group among Corded Ware cultures is the Single Grave culture. It appears to be an early Corded Ware offshoot, reflected in their non-unitary cultural traits (distinct from later unifying waves), in their varied patrilineal clans, and in the short-lasting cultural effect in northern Europe before their complete demise under pressure of expanding Yamna/Bell Beaker peoples from the Danube. The culture’s minimal (if any) effects on succeeding peoples might be seen mostly in the (mainly phonetic) Uralic substrate found in Balto-Slavic – although this may also stem from a more eastern influence, close to the Baltic – and in the contacts of Celtic with Uralic. The huge time depth between this early hypothetic Uralic layer in northern Europe and the emergence of peoples inhabiting these territories in recorded history have no doubt been erroneously interpreted as a lack of Uralic presence in the area.

    1) That connection was evident in the Yamna – CWC differences in archaeology, and especially later, with at least Fatyanovo-Balanovo and Abashevo representing the obvious replacement of the Volosovo culture before further expansions of CWC-related groups west and east of the Urals.

    The mythical millennia-long continuity of Volosovo hunter-gatherers, including centuries among Corded Ware peoples, as expected lately by the Copenhagen group (and anyone who doesn’t want to question the 1960s association of Indo-European with CWC) must be rejected today in population genomics, as the recent studies of ancient and modern populations show, and as ancient DNA from the region will confirm.

    2) In linguistics, the survival of Volosovo as The Uralic-speaking culture was also hardly believable. From Kallio (2015):

    While we can say at least something about Uralic substrates in Northeastern Europe, non-Uralic substrates cannot at all easily be identified, because of multiple language shifts, viz. first from non-Uralic to Uralic and then from Uralic to Russian. Yet the Soviet Uralicist Boris Serebrennikov (1956, 1959) argued that there are some non-Uralic substrate toponyms in the Volga-Oka region, but his idea was never taken seriously in the west (cf. Sauvageot 1958), and it pretty soon also sank into oblivion in Russia, even though it can still occasionally pop up there in non-onomastic circles (cf. Napolskikh 1995: 18–19). However, not all the hypotheses on non-Uralic substrates in Northeastern Europe should be rejected (see e.g. Helimski 2001b).

    bronze-age-early-languages-east-europe
    Tentative map of the distribution of known languages in Eastern Europe during the Early Bronze Age. See full map.

    Helimski (2001) argues for a non-Uralic topo-hydronomy in Northern Russia, whose population may have kept their languages up to the Common Era despite the Corded Ware expansion, which is in line with the survival of some non-Indo-European languages everywhere in Europe after the expansion of Yamna and its offshoots:

    It should be borne in mind that these [Uralic] hydronyms reached us mainly through Northern Russian and, accordingly, with a tendency to phonetic-morphological adaptation and unification (for river names it is “natural” to be, like the word ‘river’ itself, feminine and to end in -a). Taking into account this circumstance, it may turn out to be non-useless for etymological identification of at least some of the hydronyms on the Finno-Ugric basis.

    On the other hand, I wouldn’t exclude the possibility that some parts of this large geographical area were never (completely) Finno-Ugric. The population that created the most important part of the hydronymy of the Russian North could be finally pushed aside or assimilated only at the end of the 1st – beginning of the 2nd millennium AD, during the Russian colonization, retaining the memory of the White-Eyed Chude in its own memory.

    NOTE. For more on this non-IE substrate in (especially West) Uralic, see e.g. Zhivlov (2015),

    The same non-Uralic substrate is most likely behind most of the shared traits by Mordvinic and Balto-Finnic (see below).

    3) In genetics, I don’t think the picture could get any clearer. I don’t know what “Steppe ancestry = Indo-European” proponents expected from 2019, if they expected anything at all (I haven’t seen any coherent model, proposal, or prediction for a long time now), but I doubt the recent results are compatible with any of their implied expectations.

    corded-ware-pca-sub-neolithic-europe
    Detail of the PCA of the Corded Ware expansion. See full PCA and more related files.

    Notice, from the PCA above, how this Baltic Late Neolithic group shows actually a shift from Sredni Stog (see PCA with Sredni Stog) towards typical Khvalynsk-Urals-related ancestry, i.e. populations from eastern European forested regions, derived from hunter-gatherer pottery groups, as I have proposed for a very long time, since the first time a Baltic LN “outlier” appeared. It’s amazing how some amateurs can find 0.1% of any Siberian outlier’s ancestry among Uralians 4,000 years later, but fail to see the direct connection here. The esoteric uses of qpAdm, I guess…

    Especially noticeable is the extra WHG-like ancestry and corresponding shift, seen especially marked in late Polish CWC samples, but also in Baltic CWC and especially in one Sweden Battle Axe sample, all of them shifting apparently closer to Pitted Ware and SHG. While that may have been interpreted as an in situ admixture in Scandinavia before, the late Polish CWC samples show likely a resurgence of local populations, so we can assume that both shifts (to SHG- and EHG-like populations) of available CWC samples around the Baltic are clearly part of the WHG:EHG continuum that will be found in the eastern European sub-Neolithic cultures, from Narva to Volosovo.

    This WHG-related ancestry is clearly predominant in groups with which Battle Axe peoples admixed, based on the shift towards Pitted Ware, which – I can only guess based on modern Volga Finns – is different from the shift we will see in Netted Ware, more towards the Khvalynsk-Urals cluster. This is in line with the expansion of Battle Axe eastward through coastal areas (West to East Baltic and Finland into Sweden), while Fatyanovo peoples probably emerged from a slightly different route, but also a northern one, if one is to follow archaological similarities and their chronology.

    bronze-age-europe-baltic
    Detail of the PCA of European Bronze Age populations. See full PCA and more related files.

    During the Iron Age, the only peoples that probably shifted strongly (based on modern populations) are West Baltic ones, getting closer to the available Late Trzciniec samples, and even closer to the Trzciniec outlier, i.e. away from the earlier Eastern Corded Ware cluster, and towards Central European groups like Czech EBA or Poland EBA, both of them clearly derived from Bell Beakers, but also admixed with (and thus shifted toward) CW-like populations.

    If one looks carefully at the previous PCA on Bronze Age populations, and the next one on Iron Age clusters, it is evident that adding the Swedish LN outlier to East Baltic BA (both strongly related to Battle Axe populations) essentially gives us the continuity of East Baltic BA into the Iron Age. This cluster is continued also in two outliers from Sigtuna, a Viking town close to the Gulf of Finland, known to be an important trading site, 1,500 years later. Not much of a change around the Gulf of Finland, then:

    iron-age-eastern-europe
    Detail of the PCA of East and North European Iron Age populations. See full PCA and more related files.

    Based on the two simplistic Uralic clines one might see described (among the many that certainly existed, from Corded Ware to different Eurasian populations), and just like BOO was for some months fashionable as “Samic”, some may be tempted to say that certain Sintashta or Srubna outliers close to the Urals mark the True Uralic™ peoples. Because, of course they do. Ghost haplogroup N and stuff. And Corded Ware never ever Uralic. Because Gimbutas, and my IE R1a grandfather.

    NOTE. Funny thing here: there might be Corded Ware, Iranian, Slavic, Germanic, etc… outliers or out-groups, and they might form the widest genetic clusters ever seen, but they are all of one language, because archaeology and linguistics; however, one “outlier” (also, put your own definition of “outlier” here, let’s say 1% of whatever, and strontium isotope potentially from 100 km away) ca. 600 BC in the Baltic who (surprise!) happens to show hg. N, and he signals the first incoming True Uralic™ speaker from wherever… It won’t be the first or the last time some people resort to “the complexity of Uralic-speaking peoples” in ancestry, just to look for “hg. N = Uralic” like crazy. You only need common sense to understand that this is not how this works. Amateur genomics can’t get more embarrassing than the current “let’s look for ‘Siberian ancestry’ in every individual of haplogroup N” trend. Or maybe it can, and it will, but I can’t see it yet.

    If one were to insist on looking for ‘foreign’ contributions among Iron Age Estonians, though, I think one should also check out first archaeology, and then the PC3 (or, more graphically, a 3D plot), to understand what might be happening with the many Uralic clines derived from Corded Ware, before starting to play around with bioinformatic tools to discover a teeny tiny 1% admixture of the wrong population, and rushing to build far-fetched narratives. Apparently, one of the different clines formed roughly between southern (steppe – forest-steppe) and northern (tundra-taiga) populations in Uralians is also seen in some Iron Age Estonian individuals – especially in some late samples from Ingria…This is not my main interest, so I will leave this here for others to keep wasting their time chasing the white whale of the 0.5% of True Uralic™ ancestry in ancient Baltic samples of hg. N.

    pca-3d-estonians-iron-age-boo-samic
    Still images of the 3D plot of Eurasian samples. Typical PC1 vs. PC2 visualization to the left, and shift of the view to PC3 on the right image. See full PCA and more related files.

    An exclusive Volga-Kama homeland for Disintegrating Uralic?

    Since I don’t believe in macro-regions of largely continuous ethnolinguistic communities, as I have often said about Slavic (naively associated with prehistoric tribes of Eastern Europe) or Germanic (absurdly considered to be represented by Battle Axe), it is difficult for me to believe that Battle Axe-derived cultures remained of the same Finno-Samic dialects since the Corded Ware expansion…unless we live in Westeros, where everything happens “for thousands of years”.

    I have to admit, then, that the now prevalent identification among Uralicists has become quite attractive:

    • Fatyanovo-Balanovo as Finno-Permic:
      • Fatyanovo/Netted Ware with West Uralic (also called Finno-Mordvinic).
      • Balanovo/Chirkovo-Kazan with Central Uralic (Mari-Permic).
    • Abashevo, into the Andronovo-like Horizon through the Seima-Turbino phenomenon, with East Uralic (also Ugro-Samoyedic).

    Exactly like the identification of Yamna Hungary – Bell Beaker transition as the North-West Indo-European homeland, it gives us simplicity and small and late ethnolinguistic communities, away from the traditionally overused big and early language territories.

    This late homeland would be supported, among others, by:

    • The presence of Indo-Iranian loanwords in Finno-Permic and Ugric (probably also in Samoyedic, either lost, or – much more likely – underresearched), compatible with the immediate contact between Abashevo – Sintashta-Potapovka-Filatovka and Fatyanovo-Balanovo.
    • The supposed expansion of Netted Ware from Fatyanovo to the north-west, which may be explained as the split and expansion of Balto-Finnic and Samic ca. 1900 BC.
    • A longer-lasting Finno-Permic (West+Central Uralic) community contrasting with the early separation of East Uralic.
    • The compatibility of this late expansion with the late expansion of Pre-Germanic from Denmark with the Dagger Period, and of Balto-Slavic with Trzciniec, which puts all three dialects reaching the Baltic Sea in the EBA.

    NOTE. I meant to update the linguistic text to include the most recently favoured phylogenetic tree of Uralic languages after Häkkinen (2007, 2009, 2014), which has very quickly become the new normal among Uralicists, but I don’t think I will have enough time to review the necessary papers for that. I am rushing to publish a printed edition, so the text will wind up being a mixture of “traditional” (meaning, basically, pre-2010s) description of Uralic dialects but using modern divisions; say, “West Uralic” instead of “Finno-Samic”. By the way, I am still amazed that none of my reader-haters (or any online user discussing Uralic migrations, for that matter) have come up with the questions that the new division pose, and it supports my suspicion about the complete lack of interest in linguistics of most (a)DNA fans, except for the occasional use of old and free PDFs Googled to support new narratives invented expressly for some qpAdm results…

    textile-ceramics-europe-bronze-age
    Textile ceramic styles and influence of Bronze Age cultures divided in clusters.

    Problems with this Parpola-Carpelan’s (2012-2018) interpretation include:

    • The differentiation between Fennoscandian Textile Ceramics vs. Netted Ware, which is not warranted in archaeology. The assumption that Netted Ware expanded to the Baltic Sea (as Kallio does, following the traditional view) is thus weak, and it was probably a question of cultural contacts coupled with short-distance population movements/exchange in both directions (from the Baltic to the Volga and vice versa). In fact, the culture division relies on some fairly common and technically simple ornamentation patterns, widespread all over northern Europe, even before the Corded Ware expansion, and it is very difficult to separate certain neighboring Textile Ceramics from Netted Ware groups in southern Finland (i.e. Sarsa-Tomitsa groups).
    • The strict and radical direction described for the Netted Ware by Carpelan, as an eastward and northward expansion, within a very short time frame (ca. 1900-1800 BC), based on few radiocarbon dates, which seems to me like a very risky assumption. We know how this kind of descriptions of direction of culture expansion based on radiocarbon dates has turned out in much more complex “packages”, like the Bell Beaker culture… In fact, the earliest dates for Textile Ware are from the East Baltic, earlier than those of Netted Ware.
    • The assumption that Balto-Finnic traits shared with Mordvinic are a) late and b) meaningful for dialectalization of two closely related dialects, when it is clear that both dialects separated quite early. Phonologically Finnic is more conservative, morphologically less so, and the shared traits include a handful of non-Uralic substrate words which can’t be traced to a single common source, hence they were adopted when both languages had already separated… All in all, Finnic – Mordvinic correspondances are not even close to Italo-Celtic ones, which is clearly fully incompatible with a proposal of a Finnic separation from Mordvinic coinciding with the LBA-IA transition.

    Especially problematic for Parpola’s model is the lack of genetic impact in Bronze Age or Iron Age Estonians, not reaching a significant level under any possible statistical threshold – which I am sure was quite disappointing for some of my readers -, but is in line with major archaeological continuity of groups the from region, only disturbed in cultural (and Y-chromosome) terms by the expansion of Akozino warrior-traders all over the Baltic Sea. Any proposed population movement will be very difficult to support in genetics, given the Corded Ware-derived populations that we will see in both regions, and the continued Baltic-Volga contacts since the Corded Ware expansion.

    Problems with an interpretation of such a small impact in population genomics includes the similarly weak impacts and haplogroup infiltrations that can be seen among populations basically everywhere in Eurasia, during any given period, and much greater genetic impacts that are supposed to be (or that were certainly) followed by ethnolinguistic continuity.

    akozino-malar-axes-fennoscandia
    Distribution of the Akozino-Mälar axes according to Sergej V. Kuz’minykh (1996: 8, Abb. 2).

    The Battle Axe question

    From Kallio (2015), about choosing a tentative homeland for Proto-Uralic:

    (…) linguistically uniform Proto-Uralic would have been spoken in the Volga-Oka region until the mid-third millennium BC when the Proto-Uralic-speaking area would have expanded to the Volga-Kama region as well. By the end of the same millennium, this expansion would have led to the earliest dialectal splits within Uralic into Finno-Mordvin, Mari-Permic, and Ugro-Samoyed. The splitting up of these three soon followed during the early second millennium BC when the Uralic-speaking area finally stretched from the Baltic Sea in the west to the Altai mountains in the east. Indeed, no matter where Proto-Uralic was spoken, the branching into the nine well-attested subgroups (viz. Finnic, Saami, Mordvin, Mari, Permic, Hungarian, Mansi, Khanty, and Samoyed) must have taken less than a millennium, because their shared phonological and morphosyntactic isoglosses are rather limited (see Salminen 2002). The traditional view that all this branching would have taken several millennia violates everything linguistic typology teaches us about the rate of language change.

    The basic problem of this identification of Fatyanovo-Balanovo as West-Central Uralic and Abashevo as East Uralic is the nature of the Battle Axe culture, including the Bronze Age East Baltic and Gulf of Finland area. Even if it is accepted that Fatyanovo-Balanovo represented all Western groups, Battle Axe must have represented West Uralic-like dialects.

    The ethnolinguistic identification of Battle Axe depends ultimately on the nature of contacts of Fatyanovo/Netted Ware with Battle Axe/Textile Ceramics. If both groups were close and interacted profusely, as it seems, it doesn’t seem granted that we will be able to distinguish a close Para-West Uralic dialect of Scandinavia from the actual expanding Balto-Finnic and Samic dialects, if they were actually linked to the Netted Ware expansion. Also from Kallio (2015):

    No doubt the most convincing substrate theory has recently been put forward by the Saami Uralicist Ante Aikio (2004), who has not only rehabilitated but also improved the old idea of a non-Uralic substrate in Saami. His study shows that there were still non-Uralic languages spoken in Northern Fennoscandia as recently as the first millennium AD. Most of all, they were not only genetically non-Uralic but also typologically non-Uralic-looking, bearing a closer resemblance to the so-called Palaeo-European substrates (for which see e.g. Schrijver 2001; Vennemann 2003).

    In comparison, the case of Finnic is much more difficult. The fact that Proto-Uralic was not spoken in the East Baltic region means that this area must have originally been non-Uralic-speaking, but so far the evidence for a non-Uralic substrate in Finnic has consisted of appellatives and proper names with no etymology (cf. Ariste 1971; Saarikivi 2004a). Contrary to the proposed substrate words in Saami, those in Finnic show no structural non-Uralisms, as if they had indeed been borrowed from some genetically related or at least typologically similar languages, as I suggested above. Also none of them is more recent than the Middle Proto-Finnic stage, which makes them at least two millennia old. All this agrees with archaeological evidence discussed earlier that the Uralicization of the East Baltic region occurred during the Bronze Age (ca. 1900–500 BC).

    The discussion of the paper continues with an unsuccessful attempt to find a hypothetical ancient Indo-European substrate that Kallio believes must be associated with the expansion of Corded Ware, in line with the traditional belief. For example, the often mentioned – almost folk etymology-like, unsurprisingly popular among amateurs – ‘Neva’ as derived from IE “young” is logically rejected…Unlike Parpola, Kallio’s view seems to be confident that Netted Ware (as Textile Ware) expanded into the East Baltic, on both sides of the Gulf of Finland, already during the Bronze Age.

    As it has become apparent in population genomics, none of them was right, and Textile Ceramics will essentially show – like Netted Ware – a large genetic continuity of Corded Ware peoples in the whole north-eastern European forest zone – despite small regional population movements, obviously -, which necessarily implies that the whole Corded Ware culture – and not only Fatyanovo-Balanovo and Abashevo – were Uralic-speaking territories.

    The similarities in terms of culture and Y-DNA bottlenecks between Battle Axe and Fatyanovo-Balanovo also imply that the linguistic differences between these groups were probably not many, and became strongly divided only after their territorial division. Continued contacts between Battle Axe- and Fatyanovo-derived groups can explain the proposed contacts (Finnic with Samic, Finnic with Mordvinic) after their linguistic-but-not-physical separation.

    east-european-fatyanovocwc
    East European movement directions (arrows) of the representatives of the Central European Corded Ware Culture (according to I.I. Artemenko).

    Battle Axe spoke “Para-Balto-Finnic”?

    The Balto-Finnic-speaking nature of Battle Axe is thus supported by:

    • The lack of non-Uralic substrates in Balto-Finnic territory (Kallio 2015).
    • The early separation of Samic and Finnic from Mordvinic, and the virtual identity of Proto-West-Uralic and Proto-Uralic, which suggests that Proto-Uralic spread fast (Parpola 2012).
    • The scarce non-Uralic topo-hydronymy in the East Baltic and around the Gulf of Finland (Saarikivi 2004), comparable to that on the Upper Volga region.
    • The strong influence of a Balto-Finnic-like substrate on Pre-Germanic (or, in Kallio’s opinion, the same Scandinavian substrate influencing both Germanic and Balto-Finnic at the same time), and the continued influence of Balto-Finnic on Proto-Baltic and Proto-Slavic.
    • The continued influence of Corded Ware-derived groups in central-east Sweden in Finland and the East Baltic in terms of agricultural innovations appearing in the LBA, compatible with Schrijver’s proposal of intermediate Germanic-shifted Balto-Finnic groups and Balto-Finnic groups influenced by their pronunciation.
    • The intense Palaeo-Germanic and late Balto-Slavic / early Proto-Baltic superstrate on Balto-Finnic, which place all three dialects around the Baltic Sea since the Early Bronze Age.
    • The easy replacement of a hypothetic Para-Balto-Finnic dialect by incoming Proto-Balto-Finnic-speaking peoples (say, with textile ceramics), without much linguistic impact.

    In fact, the continuous contacts of the East Baltic with the Volga, and especially the close interaction with Akozino warrior-traders just before the Tarand-grave period, could be the actual origin of the recent (if any) Finnic-Mordvinic connections that need to be traced back to the LBA-IA (maybe here the number ‘ten’), since most of them can be related to a Pit-Comb Ware culture substrate and earlier contacts through the forest zone, which Samic (due to its early split and presence to the north of the Gulf of Finland during the BA) does not share. In fact, some of them can be traced back to Balto-Finnic first

    These are the most often mentioned, in order of descending relevance for a shared ancient community:

    • Noun paradigms and the form and function of individual cases.
    • The geminate *mm (foreign to Proto-Uralic before the development of Fennic under Germanic influence) and other non-Uralic consonant clusters.
    • The change of numeral *luka ‘ten’ with (non-Uralic) *kümmen.
    • The presence of loanwords of non-Uralic origin, related to farming and trees, potentially Palaeo-European in nature.

    It’s not only a question of quantity. Are these shared Mordvinic – Balto-Finnic traits really more relevant than, say, those between Italo-Celtic, which are supposed to have formed a community for a very short period at the end of the 3rd millennium around the Alps? Are these traits even sufficient to propose a common early Mordvinic-Finnic group within West Uralic, rather than loose Mordvinic – Balto-Finnic contacts, i.e. contacts between East Baltic (Textile Ceramics) and Volga-Kama (Netted Ware)?

    Based on the alternative (Kallio’s) view of continued contacts between Textile Ceramics groups, even without knowing anything about linguistics, you can guess that Parpola is spinning very thin when assuming that these changes suggest that Balto-Finnic may have expanded with Akozino warrior-traders, separating thus ca. 800 BC from Mordvinic…

    Genetic findings now clearly help dismiss any meaningful population impact in the LBA-IA transition, although any linguist can obviously argue for linguistic change in spite of major genetic continuity. But then we are stuck in the pre-ancient DNA era, so what’s ancient DNA for.

    netted-ware-textile-ceramics
    Middle Bronze Age cultures of Eastern Europe.

    Genetic continuity = language continuity?

    In the end, it’s very difficult to say how much language continuity there is around Estonia since the arrival of Corded Ware peoples. Looking at Modern Estonians, they have been clearly influenced by recent contacts with Baltic- and Germanic-speaking peoples clustering to the south-west in the PCA. They seem to have also received contacts from north(-east)ern peoples, likely from Finland, evidenced by their shifts toward the modern Estonian cluster during and after the Middle Ages, with a slight increase in Siberian ancestry and N1c subclades associated with Lovozero Ware. How much language change did these contacts bring? Maybe an expansion of Gulf of Finland Finnic (Northern Estonian) over Inland Finnic (Southern Estonian) and Gulf of Riga Finnic (Livonian)? Difficult to know, exactly, but, in the traditional view of Balto-Finnic dialectal distribution among Uralicists like Kallio, possibly no change at all.

    So, if the obvious changes in the Estonia_MA cluster relative to Estonia_IA cluster and Estonia_Modern relative to Estonia_MA do not represent radical language change…Why would Estonia_IA represent a change relative to Estonia_BA, when it is statistically basically the same? Or Estonia_BA relative to CWC_Baltic? Because of the infiltration of haplogroup N1c around the whole Baltic? Because of the occasional 1% “Siberian” ancestry in some non-locals of varied haplogroups across the whole Baltic area?

    In spite of all this, the amount of special pleading we are seeing among openly Nordicist amateurs when discussing the Uralic homeland relative to the Indo-European question in genetics has become a matter of plain willful ignorance. Like the living corpses of the Anatolian homeland, the Armenian homeland, the OIT proponents, or the nativist Basque R1b association, the personal involvement in the revival of “R1a=Indo-European” and “N=Uralic” trends is just painful to watch.

    [Next post in this line, if I manage to make time for it: “Genetic (dis)continuity in Central Europe“. Let’s see if early Balts and early Slavs, as well as Germanic peoples, show a cluster closer to Danubian EBA (viz. Maros), Hungary-Balkans BA, and Urnfield-related samples than their predecessors in their areas, i.e. away from East Corded Ware groups… If you want, you can enjoy for the moment the new PCAs I could get done and the tentative map of languages in the Early Bronze Age, that will probably give you the right idea about early Indo-European and Uralic population movements]

    bronze-age-early-indo-european
    European Early Bronze Age: tentative language map based on linguistics, archaeology, and genetics. See full map.

    Related

    Corded Ware—Uralic (IV): Hg R1a and N in Finno-Ugric and Samoyedic expansions

    haplogroup-uralians

    This is the fourth of four posts on the Corded Ware—Uralic identification:

    Let me begin this final post on the Corded Ware—Uralic connection with an assertion that should be obvious to everyone involved in ethnolinguistic identification of prehistoric populations but, for one reason or another, is usually forgotten. In the words of David Reich, in Who We Are and How We Got Here (2018):

    Human history is full of dead ends, and we should not expect the people who lived in any one place in the past to be the direct ancestors of those who live there today.

    Haplogroup N

    Another recurrent argument – apart from “Siberian ancestry” – for the location of the Uralic homeland is “haplogroup N”. This is as serious as saying “haplogroup R1” to refer to Indo-European migrations, but let’s explore this possibility anyway:

    Ancient haplogroups

    We have now a better idea of how many ancient migrations (previously hypothesized to be associated with westward Uralic migrations) look like in genetic terms. From Damgaard et al. (Science 2018):

    These serial changes in the Baikal populations are reflected in Y-chromosome lineages (Fig. SA; figs. S24 to S27, and tables S13 and SI4). MAI carries the R haplogroup, whereas the majority of Baikal_EN males belong to N lineages, which were widely distributed across Northern Eurasia (29), and the Baikal_LNBA males all carry Q haplogroups, as do most of the Okunevo_EMBA as well as some present-day Central Asians and Siberians.

    The only N1c1 sample comes from Ust’Ida Late Neolithic, 180km to the north of Lake Baikal, which – together with the Bronze Age sample from the Kola peninsula, and the medieval sample from Ust’Ida – gives a good idea of the overall expansion of N subclades and Siberian ancestry among the Circum-Arctic peoples of Eurasia, speakers of Palaeo-Siberian languages.

    eurasian-n-subclades
    Geographical location of ancient samples belonging to major clade N of the Y-chromosome.

    Modern haplogroups

    What we should expect from Uralic peoples expanding with haplogroup N – seeing how Yamna expands with R1b-L23, and Corded Ware expands with R1a-Z645 – is to find a common subclade spreading with Uralic populations. Let’s see if it works like that for any N-X subclade, in data from Ilumäe et al. (2016):

    haplogroup_n1
    Geographic-Distribution Map of hg N3 / N1c / N1a.

    Within the Eurasian circum-Arctic spread zone, N3 and N2a reveal a well-structured spread pattern where individual sub-clades show very different distributions:

    N1a1-M46 (or N-TAT), formed ca. 13900 BC, TMRCA 9800 BC

       N1a1a2-B187, formed ca. 9800 BC, TMRCA 1050 AD:

    The sub-clade N3b-B187 is specific to southern Siberia and Mongolia, whereas N3a-L708 is spread widely in other regions of northern Eurasia.

         N1a1a1a-L708, formed ca. 6800 BC, TMRCA 5400 BC.

           N1a1a1a2-B211/Y9022, formed ca. 5400 BC, TMRCA 1900 BC:

    The deepest clade within N3a is N3a1-B211, mostly present in the Volga-Uralic region and western Siberian Khanty and Mansi populations.

             N1a1a1a1a-L392/L1026), formed ca. 4400 BC, TMRCA 2800 BC:

    The neighbor clade, N3a3’6-CTS6967, spreads from eastern Siberia to the eastern part of Fennoscandia and the Baltic States

    haplogroup_n3a3
    Frequency-Distribution Maps of Individual Subclade N3a3 / N1a1a1a1a1a-CTS2929/VL29, probably initially with Akozino warrior-traders.

               N1a1a1a1a1a-CTS2929/VL29, formed ca. 2100 BC, TMRCA 1600 BC:

    In Europe, the clade N3a3-VL29 encompasses over a third of the present-day male Estonians, Latvians, and Lithuanians but is also present among Saami, Karelians, and Finns (Table S2 and Figure 3). Among the Slavic-speaking Belarusians, Ukrainians, and Russians, about three-fourths of their hg N3 Y chromosomes belong to hg N3a3.

    In the post on Finno-Permic expansions, I depicted what seems to me the most likely way of infiltration of N1c-L392 lineages with Akozino warrior-traders into the western Finno-Ugric populations, with an origin around the Barents sea.

    This includes the potential spread of (a minority of) N1c-B211 subclades due to contacts with Anonino on both sides of the Urals, through a northern route of forest and forest-steppe regions (equivalent to the distribution of Cherkaskul compared to Andronovo), given the spread of certain subclades in Ugric populations.

    NOTE. An alternative possibility is the association of certain B211 subclades with a southern route of expansion with Pre-Scythian and Scythian populations, under whose influence the Ananino culture emerged -which would imply a very quick infiltration of certain groups of haplogroup N everywhere among Finno-Ugrics on both sides of the Urals – , and also the expansion of some subclades with Turkic-speaking peoples, who apparently expanded with alliances of different peoples. Both (Scythian and Turkic) populations expanded from East Asia, where haplogroup N (including N1c) was present since the Neolithic. I find this a worse model of expansion for upper clades, but – given the YFull estimates and the presence of this haplogroup among Turkic peoples – it is a possibility for many subclades.

               N1a1a1a1a2-Z1936, formed ca. 2800 BC, TMRCA 2400 BC:

    The only notable exception from the pattern are Russians from northern regions of European Russia, where, in turn, about two-thirds of the hg N3 Y chromosomes belong to the hg N3a4-Z1936—the second west Eurasian clade. Thus, according to the frequency distribution of this clade, these Northern Russians fit better among other non-Slavic populations from northeastern Europe. N3a4 tends to increase in frequency toward the northeastern European regions but is also somewhat unexpectedly a dominant hg N3 lineage among most Turcic-speaking Volga Tatars and South-Ural Bashkirs.

    haplogroup_n3a4
    Frequency-Distribution Maps of Individual Subclade N3a4 / N1a1a1a1a2-Z1936, probably with the Samic (first) and Fennic (later) expansions into Paleo-Lakelandic and Palaeo-Laplandic territories.

    The expansion of N1a-Z1936 in Fennoscandia is most likely associated with the expansion of Saami into asbestos ware-related territory (like the Lovozero culture) during the Late Iron Age – and mixture with its population – , and with the later Fennic expansion to the east and north, replacing their language, as well as with Arctic and forest populations assimilated during Permic, Ugric, and Samoyedic expansions to the north.

               N1a1a1a1a4-M2019 (previously N3a2), formed ca. 4400 BC, TMRCA 1700 BC:

    Sub-hg N3a2-M2118 is one of the two main bifurcating branches in the nested cladistic structure of N3a2’6-M2110. It is predominantly found in populations inhabiting present-day Yakutia (Republic of Sakha) in central Siberia and at lower frequencies in the Khanty and Mansi populations, which exhibit a distinct Y-STR pattern (Table S7) potentially intrinsic to an additional clade inside the sub-hg N3a2

    The second widespread sub-clade of hg N is N2a. (…):

       N1a2b-P43 (B523/FGC10846/Y3184), formed ca. 6800 BC, TMRCA ca. 2700 BC:

    The absolute majority of N2a individuals belong to the second sub-clade, N2a1-B523, which diversified about 4.7 kya (95% CI = 4.0–5.5 kya). Its distribution covers the western and southern parts of Siberia, the Taimyr Peninsula, and the Volga-Uralic region with frequencies ranging from from 10% to 30% and does not extend to eastern Siberia (…)

    haplogroup_n2
    Geographic-Distribution Map of hg N2a1 / N1a2b-P43

    The “European” branch suggested earlier from Y-STR patterns turned out to consist of two clades

         N1a2b2a-Y3185/FGC10847, formed ca. 2200 BC, TMRCA 800 BC:

    N2a1-L1419, spread mainly in the northern part of that region.

         N1a2b2b1-B528/Y24382, formed ca. 900 BC, TMRCA ca. 900 BC:

    N2a1-B528, spread in the southern Volga-Uralic region.

    Haplogroup R1a

    We also have a good idea of the distribution of haplogroup R1a-Z645 in ancient samples. Its subclades were associated with the Corded Ware expansion, and some of them fit quite well the early expansion of Finno-Permic, Ugric, and Samoyedic peoples to the east.

    r1a-z282-z280-z2125-distribution
    Modified image, from Underhill et al. (2015). Spatial frequency distributions of Z282 (green) and Z93 (blue) affiliated haplogroups.. Notice the potential Finno-Ugric-associated distribution of Z282 (especially R1a-M558, a Z280 subclade), the expansion of R1a-Z2123 subclades with Central Asian forest-steppe groups.

    This is how the modern distribution of R1a among Uralians looks like, from the latest report in Tambets et al. (2018):

    • Among Fennic populations, Estonians and Karelians (ca. 1.1 million) have not suffered the greatest bottleneck of Finns (ca. 6-7 million), and show thus a greater proportion of R1a-Z280 than N1c subclades, which points to the original situation of Fennic peoples before their expansion. To trust Finnish Y-DNA to derive conclusions about the Uralic populations is as useful as relying on the Basque Y-DNA for the language spread by R1b-P312
    • Among Volga-Finnic populations, Mordovians (the closest to the original Uralic cluster, see above) show a majority of R1a lineages (27%).
    • Hungarians (ca. 13-15 million) represent the majority of Ugric (and Finno-Ugric) peoples. They are mainly R1a-Z280, also R1a-Z2123, have little N1c, and lack Siberian ancestry, and represent thus the most likely original situation of Ugric peoples in 4th century AD (read more on Avars and Hungarians).
    • Among Samoyedic peoples, the Selkup, the southernmost ones and latest to expand – that is, those not heavily admixed with Siberian populations – , also have a majority of R1a-Z2123 lineages (see also here for the original Samoyedic haplogroups to the south).

    To understand the relevance of Hungarians for Ugric peoples, as well as Estonians, Karelians, and Mordovians (and northern Russians, Finno-Ugric peoples recently Russified) for Finno-Permic peoples, as opposed to the Circum-Arctic and East Siberian populations, one has to put demographics in perspective. Even a modern map can show the relevance of certain territories in the past:

    population-density
    Population density (people per km2) map of the world in 1994. From Wikipedia.

    Summary of ancestry + haplogroups

    Fennic and Samic populations seem to be clearly influenced by Palaeo-Laplandic peoples, whereas Volga-Finnic and especially Permic populations may have received gene flow from both, but essentially Palaeo-Siberian influence from the north and east.

    The fact that modern Mansis and Khantys offer the highest variation in N1a subclades, and some of the highest “Siberian ancestry” among non-Nganasans, should have raised a red flag long ago. The fact that Hungarians – supposedly stemming from a source population similar to Mansis – do not offer the same amount of N subclades or Siberian ancestry (not even close), and offer instead more R1a, in common with Estonians (among Finno-Samic peoples) and Mordvins (among Volga-Finnic peoples) should have raised a still bigger red flag. The fact that Nganasans – the model for Siberian ancestry – show completely different N1a2b-P43 lineages should have been a huge genetic red line (on top of the anthropological one) to regard them as the Uralian-type population.

    We know now that ethnolinguistic groups have usually expanded with massive (usually male-biased) migrations, and that neighbouring locals often ‘resurge’ later without changing the language. That is seen in Europe after the spread of Bell Beakers, with the increase of previous ancestry and lineages in Scandinavia during the formation of the Nordic ethnolinguistic community; in Central-West Europe, with the resurgence of Neolithic ancestry (and lineages) during the Bronze Age over steppe ancestry; and in Central-East Europe (with Unetice or East European Bronze Age groups like Mierzanowice, Trzciniec, or Lusatian) showing an increase in steppe ancestry (and resurge of R1a subclades); none of them represented a radical ethnolinguistic change.

    finno-ugric-haplogroup-n
    Map of archaeological cultures in north-eastern Europe ca. 8th-3rd centuries BC. [The Mid-Volga Akozino group not depicted] Shaded area represents the Ananino cultural-historical society. Fading purple arrows represent likely stepped movements of subclades of haplogroup N for centuries (e.g. Siberian → Ananino → Akozino → Fennoscandia [N-VL29]; Circum-Arctic → forest-steppe [N1, N2]; etc.). Blue arrows represent eventual expansions of Uralic peoples to the north. Modified image from Vasilyev (2002).

    It is not hard to model the stepped arrival, infiltration, and/or resurge of N subclades and “Siberian ancestries”, as well as their gradual expansion in certain regions, associated with certain migrations first – such as the expansions to the Circum-Arctic region, and later the Scythian- and Turkic-related movements – , as well as limited regional developments, like the known bottleneck in Finns, or the clear late expansion of Ugric and Samoyedic languages to the north among nomadic Palaeo-Siberians due to traditions of exogamy and multilingualism. This fits quite well with the different arrival of N (N1c and xN1c) lineages to the different Uralic-speaking groups, and to the stepped appearance of “Siberian ancestry” in the different regions.

    The aternative

    It is evident that a lot of people were too attached to the idea of Palaeolithic R1b lineages ‘native’ to western Europe speaking Basque languages; of R1a lineages speaking Indo-European and spreading with Yamna; and N lineages ‘native’ to north-eastern Europe and speaking Uralic, and this is causing widespread weeping and gnashing of teeth (instead of the joy of discovering where one’s true patrilineal ancestors come from, and what language they spoke in each given period, which is the supposed objective of genetic genealogy…)

    Since an Indo-Germanic branch (as revived now by some in the Copenhaguen group to fit Kristiansen’s theory of the 1980s with recent genetic data) does not make any sense in linguistics, the finding of R1a in Yamna would not have led where some think it would have, because North-West Indo-European would still be the main Late PIE branch in Europe. Don’t take my word for it; take James P. Mallory’s (2013).

    mallory-adams-tree
    The levels of Indo-European reconstruction, from Mallory & Adams (2006).

    If an (unlikely) Indo-Slavonic group were posited, though, such a group would still be bound (with Indo-Iranian) to the steppes with East Yamna/Poltavka (admixing with Abashevo migrants, but retaining its language), developing Sintashta/Potapovka → Srubna/Andronovo, and R1a lineages would have equally undergone the known bottlenecks of the steppes where they replaced R1b-Z2103 – which this eastern group shares with Balkan languages, a haplogroup that links therefore together the Graeco-Aryan group.

    As far as I know – and there might be many other similar pet theories out there – there have been proposals of “modern Balto-Slavic-like” populations (in an obvious circular reasoning based on modern populations) in some Scythian clusters of the Iron Age.

    NOTE. I will not enter into “Balto-Slavic-like R1a” of the Late Bronze Age or earlier because no one can seriously believe at this point of development of Population Genetics that autosomal similarity predating 1,500+ years the appearance of Slavs equates to their (ethnolinguistic) ancestral population, without a clear intermediate cultural and genetic trail – something we lack today in the Slavic case even for the late Roman period…

    finno-saamic-palaeo-germanic-substratum
    The Finnic and Saamic separation looks shallower than it actually is. Invisible convergence can be ‘triangulated’ with the help of Germanic layers of mutual loanwords (Häkkinen 2012).

    We also know of R1a-Z280 lineages in Srubna, probably expanding to the west. With that in mind, and knowing that Palaeo-Germanic was in close contact with Finno-Samic while both were already separated but still in contact, and that Palaeo-Germanic was also in contact and closely related to a ‘Temematic’ distinct from Balto-Slavic (and also that early Proto-Baltic and Proto-Slavic from the Roman Iron Age and later were in contact with western Uralic) this will be the linguistic map of the Iron Age if R1a is considered to expand Indo-European from some kind of “patron-client” relationship with west Yamna:

    palaeo-germanic-italo-celtic
    Eastern European language map during the Late Bronze Age / Iron Age, if R1a spread Indo-European languages and Eastern Yamna spoke Indo-Slavonic. Palaeo-Germanic (i.e. Pre- to Proto-Germanic) needs to be in contact with both the Samic Lovozero population and the Fennic west Circum-Arctic one. Italic and Celtic in contact with Pre-Germanic. Germanic in contact with Temematic. Balto-Slavic in contact with Iranian, and near Fennic to allow for later loanwords. For Germanic and Temematic, see Kortlandt (2018).

    You might think I have some personal or political reason against this kind of proposals. I haven’t. We have been proposing Indo-European to be the language of the European Union for more than 10 years, so to support R1b-Italo-Celtic in the whole Western Europe, R1a-Germanic in Central and Eastern Europe, and R1a-Indo-Slavonic in the steppes (as the Danish group seems to be doing) has nothing inherently bad (or good) for me. If anything, it gives more reason to support the revival of North-West Indo-European in Europe.

    My problem with this proposal is that it is obviously beholden to the notion of the uninterrupted cultural, historic and ethnic continuity in certain territories. This bias is common in historiography (von Falkenhausen 1993), but it extends even more easily into the lesser known prehistory of any territory, and now more than ever some people feel the need to corrupt (pre)history based on their own haplogroups (or the majority haplogroups of their modern countries). However, more than on philosophical grounds, my rejection is based on facts: this picture is not what the combination of linguistic, archaeological, and genetic data shows. Period.

    Nevertheless, if Yamna + Corded Ware represented the “big and early expansion” of Germanic and Italo-Celtic peoples proper of the dream Nazi’s Lebensraum and Fascist’s spazio vitale proposals; Uralians were Siberian hunter-gatherers that controlled the whole eastern and northern Russia, and miraculously managed to push (ethnolinguistically) Neolithic agropastoralists to the west during and after the Iron Age, with gradual (and often minimal) genetic impact; and Balto-Slavic peoples were represented by horse riders from Pokrovka/Srubna, hiding then somewhere around the forest-steppe until after the Scythian expansion, and then spreading their language (without much genetic impact) during the early Middle Ages…so be it.

    See also

    Related

    Corded Ware—Uralic (III): “Siberian ancestry” and Ugric-Samoyedic expansions

    siberian-ancestry-tambets

    This is the third of four posts on the Corded Ware—Uralic identification. See

    An Eastern Uralic group?

    Even though proposals of an Eastern Uralic (or Ugro-Samoyedic) group are in the minority – and those who support it tend to search for an origin of Uralic in Central Asia – , there is nothing wrong in supporting this from the point of view of a western homeland, because the eastward migration of both Proto-Ugric and Pre-Samoyedic peoples may have been coupled with each other at an early stage. It’s like Indo-Slavonic: it just doesn’t fit the linguistic data as well as the alternative, i.e. the expansion of Samoyedic first, different from a Finno-Ugric trunk. But, in case you are wondering about this possibility, here is Häkkinen’s (2012) phonological argument:

    ugro-samoyedic-uralic

    The case of Samoyedic is quite similar to that of Hungarian, although the earliest Palaeo-Siberian contact languages have been lost. There were contacts at least with Tocharian (Kallio 2004), Yukaghir (Rédei 1999) and Turkic (Janhunen 1998). Samoyedic also:

    a) has moved far from the related languages and has been exposed to strong foreign influence

    b) shares a small number of common words with other branches (from Sammallahti 1988: only 123 ‘Uralic’ words, versus 390 ‘Uralic’ + ‘Finno-Ugric’ words found in other branches than Samoyedic = 31,5 %)

    c) derives phonologically from the East Uralic dialect.

    The phonological level is taxonomically more reliable, since it lacks the distortion caused by invisible convergence and false divergence at the lexical level. Thus we can conclude that the traditional taxonomic model, according to which Samoyedic was the first branch to split off from the Proto-Uralic unity, is just as incorrect as the view that Hungarian was the first branch to split off.

    Seima-Turbino

    Late Uralic can be traced back to metallurgical cultures thanks to terms like PU *wäśka ‘copper/bronze’ (borrowed from Proto-Samoyedic *wesä into Tocharian); PU *äsa and *olna/*olni, ‘lead’ or ‘tin’, found in *äsa-wäśka ‘tin-bronze’; and e.g. *weŋći ‘knife’, borrowed into Indo-Iranian (through the stage of vocalization of nasals), appearing later as Proto-Indo-Aryan *wāćī ‘knife, awl, axe’.

    It is known that the southern regions of the Abashevo culture developed Proto-Indo-Iranian-speaking Sintashta-Petrovka and Pokrovka (Early Srubna). To the north, however, Abashevo kept its Uralic nature, with continuous contacts allowing for the spread of lexicon – mainly into Finno-Ugric – , and phonetic influence – mainly Uralisms into Proto-Indo-Iranian phonology (read more here).

    The northern part of Abashevo (just like the south) was mainly a metallurgical society, with Abashevo metal prospectors found also side by side with Sintashta pioneers in the Zeravshan Valley, near BMAC, in search of metal ores. About the Seima-Turbino phenomenon, from Parpola (2013):

    From the Urals to the east, the chain of cultures associated with this network consisted principally of the following: the Abashevo culture (extending from the Upper Don to the Mid- and South Trans-Urals, including the important cemeteries of Sejma and Turbino), the Sintashta culture (in the southeast Urals), the Petrovka culture (in the Tobol-Ishim steppe), the Taskovo-Loginovo cultures (on the Mid- and Lower Tobol and the Mid-Irtysh), the Samus’ culture (on the Upper Ob, with the important cemetery of Rostovka), the Krotovo culture (from the forest steppe of the Mid-Irtysh to the Baraba steppe on the Upper Ob, with the important cemetery of Sopka 2), the Elunino culture (on the Upper Ob just west of the Altai mountains) and the Okunevo culture (on the Mid-Yenissei, in the Minusinsk plain, Khakassia and northern Tuva). The Okunevo culture belongs wholly to the Early Bronze Age (c. 2250–1900 BCE), but most of the other cultures apparently to its latter part, being currently dated to the pre-Andronovo horizon of c. 2100–1800 BCE (cf. Parzinger 2006: 244–312 and 336; Koryakova & Epimakhov 2007: 104–105).

    post-eneolithic-steppe-asia
    Schematic map of the Middle Bronze Age cultures (steppe and foreststeppe
    zone)

    The majority of the Sejma-Turbino objects are of the better quality tin-bronze, and while tin is absent in the Urals, the Altai and Sayan mountains are an important source of both copper and tin. Tin is also available in southern Central Asia. Chernykh & Kuz’minykh have accordingly suggested an eastern origin for the Sejma-Turbino network, backing this hypothesis also by the depiction on the Sejma-Turbino knives of mountain sheep and horses characteristic of that area. However, Christian Carpelan has emphasized that the local Afanas’evo and Okunevo metallurgy of the Sayan-Altai area was initially rather primitive, and could not possibly have achieved the advanced and difficult technology of casting socketed spearheads as one piece around a blank. Carpelan points out that the first spearheads of this type appear in the Middle Bronze Age Caucasia c. 2000 BCE, diffusing early on to the Mid-Volga-Kama-southern Urals area, where “it was the experienced Abashevo craftsmen who were able to take up the new techniques and develop and distribute new types of spearheads” (Carpelan & Parpola 2001: 106, cf. 99–106, 110). The animal argument is countered by reference to a dagger from Sejma on the Oka river depicting an elk’s head, with earlier north European prototypes (Carpelan & Parpola 2001: 106–109). Also the metal analysis speaks for the Abashevo origin of the Sejma-Turbino network. Out of 353 artefacts analyzed, 47% were of tin-bronze, 36% of arsenical bronze, and 8.5% of pure copper. Both the arsenical bronze and pure copper are very clearly associated with the Abashevo metallurgy.

    seima-turbino-phenomenon-parpola
    Find spots of artefacts distributed by the Sejma-Turbino intercultural trader network, and the areas of the most important participating cultures: Abashevo, Sintashta, Petrovka. Based on Chernykh 2007: 77.

    The Abashevo metal production was based on the Volga-Kama-Belaya area sandstone ores of pure copper and on the more easterly Urals deposits of arsenical copper (Figure 9). The Abashevo people, expanding from the Don and Mid-Volga to the Urals, first reached the westerly sandstone deposits of pure copper in the Volga and Kama basins, and started developing their metallurgy in this area, before moving on to the eastern side of the Urals to produce harder weapons and tools of arsenical copper. Eventually they moved even further south, to the area richest in copper in the whole Urals region, founding there the very strong and innovative Sintashta culture.

    Regarding the most likely expansion of Eastern Uralic peoples:

    Nataliya L’vovna Chlenova (1929–2009; cf. Korenyako & Ku’zminykh 2011) published in 1981 a detailed study of the Cherkaskul’ pottery. In her carefully prepared maps of 1981 and 1984 (Figure 10), she plotted Cherkaskul’ monuments not only in Bashkiria and the Trans-Urals, but also in thick concentrations on the Upper Irtysh, Upper Ob and Upper Yenissei, close to the Altai and Sayan mountains, precisely where the best experts suppose the homeland of Proto-Samoyed to be.

    cherkaskul-andronovo
    Distribution of Srubnaya (Timber Grave, early and late), Andronovo (Alakul’ and Fëdorovo variants) and Cherkaskul’ monuments. After Parpola 1994: 146, fig. 8.15, based on the work of N. L. Chlenova (1984: map facing page 100).

    Ugric

    The Cherkaskul’ culture was transformed into the genetically related Mezhovka culture (c. 1500–1000 BCE), which occupied approximately the same area from the Mid-Kama and Belaya rivers to the Tobol river in western Siberia (cf. Parzinger 2006: 444–448; Koryakova & Epimakhov 2007: 170–175). The Mezhovka culture was in close contact with the neighbouring and probably Proto-Iranian speaking Alekseevka alias Sargary culture (c. 1500–900 BCE) of northern Kazakhstan (Figure 4 no. 8) that had a Fëdorovo and Cherkaskul’ substratum and a roller pottery superstratum (cf. Parzinger 2006: 443–448; Koryakova & Epimakhov 2007: 161–170). Both the Cherkaskul’ and the Mezhovka cultures are thought to have been Proto-Ugric linguistically, on the basis of the agreement of their area with that of Mansi and Khanty speakers, who moreover in their Fëdorovo-like ornamentation have preserved evidence of continuity in material culture (cf. Chlenova 1984; Koryakova & Epimakhov 2007: 159, 175).

    mezhovska-sargary-irmen
    Cultures of the Final Bronze Age of the Urals and western Siberia (steppe
    and forest-steppe zone).

    The Mezhovka culture was succeeded by the genetically related Gamayun culture (c. 1000–700 BCE) (cf. Parzinger 2006: 446; 542–545).

    From the Gamayun culture descend Trans-Urals cultures in close contact with Finno-Permic populations of the Cis-Ural region:

    • [Proto-Mansi] Itkul’ culture (c. 700–200 BCE) distributed along the eastern slope of the Ural Mountains (cf. Parzinger 2006: 552–556). Known from its walled forts, it constituted the principal Trans-Uralian centre of metallurgy in the Iron Age, and was in contact with both the Anan’ino and Akhmylovo cultures (the metallurgical centres of the Mid-Volga and Kama-Belaya region) and the neighbouring Gorokhovo culture.
      • [Proto-Hungarian] via the Vorob’evo Group (c. 700–550 BCE) (cf. Parzinger 2006: 546–549), to the Gorokhovo culture (c. 550–400 BCE) of the Trans-Uralian forest steppe (cf. Parzinger 2006: 549–552). For various reasons the local Gorokhovo people started mobile pastoral herding and became part of the multicomponent pastoralist Sargat culture (c. 500 BCE to 300 CE), which in a broader sense comprized all cultural groups between the Tobol and Irtysh rivers, succeeding here the Sargary culture. The Sargat intercommunity was dominated by steppe nomads belonging to the Iranian-speaking Saka confederation, who in the summer migrated northwards to the forest steppe
    • [Proto-Khanty] Late Bronze Age and Early Iron Age cultures related to the Gamayunskoe and Itkul’ cultures that extended up to the Ob: the Nosilovo, Baitovo, Late Irmen’, and Krasnoozero cultures (c. 900–500 BCE). Some were in contact with the Akhmylovo on the Mid-Volga.
    sargat-gorokhovo-bolscherechye
    Cultural groups of the Iron Age in the forest-steppe zone of western
    Siberia. (

    Samoyedic

    Parpola (2012) connects the expansion of Samoyedic with the Cherkaskul variant of Andronovo. As we know, Andronovo was genetically diverse, which speaks in favour of different groups developing similar material cultures in Central Asia.

    Juha Janhunen, author of the etymological dictionary of the Samoyed languages (1977), places the homeland of Proto-Samoyedic in the Minusinsk basin on the Upper Yenissei (cf. Janhunen 2009: 72). Mainly on the basis of Bulghar Turkic loanwords, Janhunen (2007: 224; 2009: 63) dates Proto-Samoyedic to the last centuries BCE. Janhunen thinks that the language of the Tagar culture (c. 800–100 BCE) ought to have been Proto-Samoyedic (cf. Janhunen 1983: 117– 118; 2009: 72; Parzinger 2001: 80 and 2006: 619–631 dates the Tagar culture c. 1000–200 BCE; Svyatko et al. 2009: 256, based on human bone samples, c. 900 BCE to 50 CE). The Tagar culture largely continues the traditions of the Karasuk culture (c. 1400–900 BCE), (…)

    chicha-irmen-tagar-baraba-forest-siberian
    Map showing the location of Chicha-1.

    For the most recent expansions of Samoyedic languages to the north, into Palaeo-Siberian populations, read more about the traditional multilingualism of Siberian populations.

    Genetics

    Siberian ancestry

    The use of a map of “Siberian ancestry” peaking in the arctic to show a supposedly late Uralic population movement (starting in the Iron Age!) seems to be the latest trend in population genomics:

    siberian-ancestry-map
    Frequency map of the so-called ‘Siberian’ component. From Tambets et al. (2018) (see below for ADMIXTURE in specific populations).

    I guess that would make this map of Neolithic farmer ancestry represent an expansion of Indo-European from the south, because Anatolia, Greece, Italy, southern France, and Iberia – where this ancestry peaks in modern populations – are among the oldest territories where Indo-European languages were recorded:

    reich-farmer-ancestry
    Modern genome-wide data shows that the primary gradient of farmer ancestry in Europe does not flow southeast-to-northwest but instead in an almost perpendicular direction, a result of a major migration of pastoralists from the east that displaced much of the ancestry of the first farmers.

    Probably not the right interpretation of this kind of simplistic data about modern populations, though…

    The most striking thing about the “Siberian ancestry” white whale is that nobody really knows what it is; just like we did not know what “Yamnaya ancestry” was, until the most recent data is making the picture clearer. Its nature is changing with each new paper, and it can be summed up by “some ancestry we want to find that is common to Uralic-speaking peoples, and should not be CWC-related”. Tambets et al. (2018) explain quite well how they “found it”:

    Overall, and specifically at lower values of K, the genetic makeup of Uralic speakers resembles that of their geographic neighbours. The Saami and (a subset of) the Mansi serve as exceptions to that pattern being more similar to geographically more distant populations (Fig. 3a, Additional file 3: S3). However, starting from K = 9, ADMIXTURE identifies a genetic component (k9, magenta in Fig. 3a, Additional file 3: S3), which is predominantly, although not exclusively, found in Uralic speakers. This component is also well visible on K = 10, which has the best cross-validation index among all tests (Additional file 3: S3B). The spatial distribution of this component (Fig. 3b) shows a frequency peak among Ob-Ugric and Samoyed speakers as well as among neighbouring Kets (Fig. 3a). The proportion of k9 decreases rapidly from West Siberia towards east, south and west, constituting on average 40% of the genetic ancestry of FU speakers in Volga-Ural region (VUR) and 20% in their Turkic-speaking neighbours (Bashkirs, Tatars, Chuvashes; Fig. 3a).

    siberian-ancestry-modern
    Population structure of Uralic-speaking populations inferred from ADMIXTURE analysis on autosomal SNPs in Eurasian context. Individual ancestry estimates for populations of interest for selected number of assumed ancestral populations (K3, K6, K9, K11). Ancestry components discussed in a main text (k2, k3, k5, k6, k9, k11) are indicated and have the same colours throughout. The names of the Uralic-speaking populations are indicated with blue (Finno-Ugric) or orange (Samoyedic). Image from Tambets et al. (2018).

    However, this ‘something’ that some people occasionally find in some Uralic populations is also common to other modern and ancient groups, and not so common in some other Uralic peoples. Simply put:

    siberian-ancestry-modern-populations
    Image modified from Lamnidis et al. (2018). Red line representing maximum “Siberian admixture” in Eastern European hunter-gatherers. In blue, Uralic-speaking groups. “Plot of ADMIXTURE (K=3) results containing West Eurasian populations and the Nganasan. Ancient individuals from this study are represented by thicker bars.”

    I already said this in the recent publication of Siberian samples, where a renamed and radiocarbon dated Finnish_IA clearly shows that Late Iron Age Saami (ca. 400 AD) had little “Siberian ancestry”, if any at all, representing the most likely Fennic (and Samic) ancestral components before their expansion into central and northern Finland, where they admixed with circum-polar peoples of asbestos ware cultures.

    I will say that again and again, any time they report the so-called “Siberian ancestry” in Uralic samples, no matter how it is defined each time: it does not seem to be that special something people are looking for, but rather (at least in a great part) a quite old ancestral component forming an evident cline with EHG, whose best proximate source are Baikal_EN (and/or Devil’s Gate) at this moment, and thus also East European hunter-gatherers for Western Uralic peoples:

    dzudzuana-baikal-en-admixture
    Image modified from Lazaridis et al. (2018). In red: samples with Baikal_EN ancestry in speculative estimates. In pink: samples with Baikal_EN ancestry in conservative estimates (probably marking a recent arrival of Baikal_En ancestry, see here). Modeling present-day and ancient West-Eurasians. Mixture proportions computed with qpAdm (Supplementary Information section 4). The proportion of ‘Mbuti’ ancestry represents the total of ‘Deep’ ancestry from lineages that split prior to the split of Ust’Ishim, Tianyuan, and West Eurasians and can include both ‘Basal Eurasian’ and other (e.g., Sub-Saharan African) ancestry. (Left) ‘Conservative’ estimates. Each population 367 cannot be modeled with fewer admixture events than shown. (Right) ‘Speculative’ estimates. The highest number of sources (≤5) with admixture estimates within [0,1] are shown for each population. Some of the admixture proportions are not significantly different from 0 (Supplementary Information section 4).

    So either Samara_HG, Karelia_HG, and many other groups from eastern Europe all spoke Uralic according to this ADMIXTURE graphic (and the formation of steppe ancestry in the Volga-Ural region brought the Proto-Indo-European language to the steppes through the CHG/ANE expansion), or a great part of this “Siberian ancestry” found in modern Uralic-speaking populations is not what some people would like to think it is…

    Modern populations

    PCA clines can be looked for to represent expansions of ancient populations. Most recently, Flegontov et al. (2018) are attempting to do this with Asian populations:

    For some Turkic groups in the Urals and the Altai regions and in the Volga basin, a different admixture model fits the data: the same West Eurasian source + Uralic- or Yeniseian-speaking Siberians. Thus, we have revealed an admixture cline between Scythians and the Iranian farmer genetic cluster, and two further clines connecting the former cline to distinct ancestry sources in Siberia. Interestingly, few Wusun-period individuals harbor substantial Uralic/Yeniseian-related Siberian ancestry, in contrast to preceding Scythians and later Turkic groups characterized by the Tungusic/Mongolic-related ancestry. It remains to be elucidated whether this genetic influx reflects contacts with the Xiongnu confederacy. We are currently assembling a collection of samples across the Eurasian steppe for a detailed genetic investigation of the Hunnic confederacies.

    jeong-population-clines
    Three distinct East/West Eurasian clines across the continent with some interesting linguistic correlates, as earlier reported by Jeong et al. (2018). Alexander M. Kim.

    There are potential errors with this approach:

    The main one is practical – does a modern cline represent an ancestral language? The answer is: sometimes. It depends on the anthropological context that we have, and especially on the precision of the PCA:

    clines-himalayan
    Genetic structure of the Himalayan region populations from analyses using unlinked SNPs. (A) PCA of the Himalayan and HGDP-CEPH populations. Each dot represents a sample, coded by region as indicated. The Himalayan region samples lie between the HGDP-CEPH East Asian and South Asian samples on the right-hand side of the plot. From Arciero et al. (2018).

    The ‘Europe’, ‘Middle East’, etc. clines of the above PCA do not represent one language, but many. For starters, the PCA includes too many (and modern) populations, its precision is useless for ethnolinguistic groups. Which is the right level? Again, it depends.

    The other error is one of detail of the clines drawn (which, in turn, depends on the precision of the PCA). For example, we can draw two paralell lines (or even one line, as in Flegontov et al. above) in one PCA graphic, but we still don’t have the direction of expansion. How do we know if this supposed “Uralic-speaking cline” goes from one region to the other? For that level of detail, we should examine closely modern Uralic-speaking peoples and Circum-Arctic populations:

    uralic-cline
    Modified from Tambets et al. (2018). Principal component analysis (PCA) and genetic distances of Uralic-speaking populations. a PCA (PC1 vs PC2) of the Uralic-speaking populations

    The real ancient Uralic cluster (drawn above in blue) is thus probably from a North-East European source (probably formed by Battle Axe / Fatyanovo-Balanovo / Abashevo) to the east into Siberian populations, and to the north into Laplandic populations (see below also on Mezhovska ancestry for the drawn ‘European cline’, which some may a priori wrongly assume to be quite late).

    The fact that the three formed clines point to an admixture of CWC-related populations from North-Eastern Europe, and that variation is greater at the Palaeo-Laplandic and Palaeo-Siberian extremities compared to the CWC-related one, also supports this as the correct interpretation.

    However, judging by the two main clines formed, one could be alternatively inclined to interpret that Palaeo-Laplandic and Palaeo-Siberian populations formed a huge ancestral “Uralic” ghost cluster in Siberia (spanning from the Palaeo-Laplandic to the Palaeo-Siberian one), and from there expanded Finno-Samic on one hand, and “Volga-Ugro-Samoyed” on the other. That poses different problems: an obvious linguistic and archaeological one – which I assume a lot of people do not really care about – , and a not-so-obvious genetic one (see below for ancient samples and for the expansion of haplogroup N).

    To understand the simplest solution better, one can just have a look at the PCA from Bell Beaker samples in Olalde et al. (2018), which (as Reich has already explained many times) expanded directly from Yamna R1b-L23 lineages:

    olalde_pca_clines
    Image modified from Olalde et al. (2018). PCA of 999 Eurasian individuals. Marked is the Espersted Outlier with the approximate position of Yamna Hungary, probably the source of its admixture. Different Bell Beaker clines have been drawn, to represent approximate source of expansions from Central European sources into the different regions.

    Unlike this PCA with ancient samples, where Bell Beaker clines could be a rough approximation to the real sources for each population, and where a cluster spanning all three depicted Early Bronze Age clusters could give a rough proximate source of European Bell Beakers in Hungary (and where one can even distinguish the Y-DNA bottlenecks in the L23 trunk created by each cline) the PCA of modern Uralic populations is probably not suitable for a good estimate of the ancient situation, which may be found shifted up or down of the drawn “Uralic” cluster along East European groups.

    After all, we already know that the Siberian cline shows probably as much an ancient admixture event – from the original Uralic expansion to the east with Corded Ware ancestry – as another more recent one – a westward migration of Siberian ancestry (or even more than one). While we know with more or less exactitude what happened with the Palaeo-Laplandic admixture by expanding Proto-Finno-Samic populations (see here), the Proto-Ugric and Pre-Samoyedic populations formed probably more than one cline during the different ancient migrations through central Asia.

    Ancient populations

    Apparently, the Corded Ware expansion to the east was not marked by a huge change in ancestry. While the final version of Narasimhan et al. (2018) may show a little more detail about other forest-steppe Seima-Turbino/Andronovo-related migrations (and thus also Eastern Uralic peoples), we have already had enough information for quite some time to get a good idea.

    mezhovska-pca
    Principal component analysis. PCA of ancient individuals (according colours see legend) projected on modern West Eurasians (grey). Iron Age Scythians are shown in black; CHG, Caucasus hunter-gatherer; LNBA, late Neolithic/Bronze Age; MN, middle Neolithic; EHG, eastern European huntergatherer; LBK_EN, early Neolithic Linearbandkeramik; HG, hunter-gatherer; EBA, early Bronze Age; IA, Iron Age; LBA, late Bronze Age; WHG, western hunter-gatherer.dataset (grey). Iron Age Scythians are shown in black; CHG, Caucasus hunter-gatherer; LNBA, late Neolithic/Bronze Age; MN, middle Neolithic; EHG, eastern European hunter-gatherer; LBK_EN, early Neolithic Linearbandkeramik; HG, hunter-gatherer; EBA, early Bronze Age; IA, Iron Age; LBA, late Bronze Age; WHG, western hunter-gatherer.

    Mezhovska‘s position is similar to the later Pre-Scythian and Scythian populations. There are some interesting details: apart from haplogroup R1a-Z280 (CTS1211+), there is one R1b-M269 (PF6494+), probably Z2103, and an outlier (out of three) in a similar position to the recently described central/southern Scythian clusters.

    NOTE. The finding of R1b-M269 in the forest-steppe is probably either 1) from an Afanasevo-Okunevo origin, or 2) from an admixture with neighbouring Andronovo-related populations, such as Sargary. A third, maybe less likely option is that this haplogroup admixed with Abashevo directly (as it happened in Sintashta, Potapovka, or Pokrovka) and formed part of early Uralic migrations. In any case, since Mezhovska is a Bronze Age society from the Urals region, its association with R1b-Z2103 – like the association of R1b-Z2103 in Scythian clusters – cannot be attributed to “Thracian peoples”, a link which is (as I already said) too simplistic.

    The drawn “European cline” of Hungarians (see above), leading from ‘west-like’ Mansi to Hungarian populations – and hosting also Finnic and Estonian samples – , cannot therefore be attributed simply to late “Slavic/Balkan-like” admixture.

    Karasuk – located further to the east – is basically also Corded Ware peoples showing clearly a recent admixture with local ANE / Baikal_EN-like populations. In terms of haplogroups it shows haplogroup Q, R1a-Z2124, and R1a-Z2123, later found among early Hungarians, and present also in ancient Samoyedic populations now acculturated.

    The most interesting aspect of both Mezhovska and Karasuk is that they seem to diverge from a point close to Ukraine_Eneolithic, which is the supposed ancestral source of Corded Ware peoples (read more about the formation of “steppe ancestry”). This means that Eastern Uralians derive from a source closer to Middle Dnieper/Abashevo populations, rather than Battle Axe (shifted to Latvian Neolithic), which is more likely the source prevalent in Finno-Permic peoples.

    Their initial admixture with (Palaeo-)Siberian populations is thus seen already starting by this time in Mezhovska and especially in Karasuk, but this process (compared to modern populations) is incomplete:

    f4-test-karasuk-mezhovska
    Visualization of f-statistics results. f4(Test, LBK; Han, Mbuti) values are plotted on x axis and f4(Test, LBK; EHG, Mbuti) values on y axis, positive deviations from zero show deviations from a clade between Test and LBK. A red dashed line is drawn between Yamnaya from Samara and Ami. Iron Age populations that can be modelled as mixtures of Yamnaya and East Eurasians (like the Ami) are arrayed around this line and appear to be distinct from the main North/South European cline (blue) on the left of the x axis.
    karasuk-mezhovska-admixture
    ADMIXTURE results for ancient populations. Red arrows point to the Iron Age Scythian individuals studied. LBK_EN: Early Neolithic Linearbandkeramik; EHG: Eastern European hunter-gatherer; Motala_HG: hunter-gatherer from Motala (Sweden); WHG: western hunter-gatherer; CHG: Caucasus hunter-gatherer; IA: Iron Age; EBA: Early Bronze Age; LBA: Late Bronze Age.

    We know now that Samic peoples expanded during the Late Iron Age into Palaeo-Laplandic populations, admixing with them and creating this modern cline. Finns expanded later to the north (in one of their known genetic bottlenecks), admixing with (and displacing) the Saami in Finland, especially replacing their male lines.

    So how did Ugric and Samoyedic peoples admix with Palaeo-Siberian populations further, to obtain their modern cline? The answer is, logically, with East Asian migrations related to forest-steppe populations of Central Asia after the Mezhovska and Karasuk periods, i.e. during the Iron Age and later. Other groups from the forest-steppe in Central Asia show similar East Asian (“Siberian”) admixture. We know this from Narasimhan et al. (2018):

    (…) we observe samples from multiple sites dated to 1700-1500 BCE (Maitan, Kairan, Oy_Dzhaylau and Zevakinsikiy) that derive up to ~25% of their ancestry from a source related to present-day East Asians and the remainder from Steppe_MLBA. A similar ancestry profile became widespread in the region by the Late Bronze Age, as documented by our time transect from Zevakinsikiy and samples from many sites dating to 1500-1000 BCE, and was ubiquitous by the Scytho-Sarmatian period in the Iron Age.

    We already have some information about these later migrations:

    siberian-genetic-component-chronology
    Very important observation with implication of population turnover is that pre-Turkic Inner Eurasian populations’ Siberian ancestry appears predominantly “Uralic-Yeniseian” in contrast to later dominance of “Tungusic-Mongolic” sort (which does sporadically occur earlier). Alexander M. Kim

    The Ugric-speaking Sargat culture in Western Siberia shows the expected mixture of haplogroups (ca. 500 BC – 500 AD), with 5 samples of hg N and 2 of hg R1a1, in Pilipenko et al. (2017). Although radiocarbon dates and subclades are lacking, N lineages probably spread late, because of the late and gradual admixture of Siberian cultures into the Sargat melting pot.

    The Samoyedic-speaking Tagar culture also shows signs of a genetic turnover in Pilipenko et al. (2018):

    The observed reduction in the genetic distance between the Middle Tagar population and other Scythian like populations of Southern Siberia(Fig 5; S4 Table), in our opinion, is primarily associated with an increase in the role of East Eurasian mtDNA lineages in the gene pool (up to nearly half of the gene pool) and a substantial increase in the joint frequency of haplogroups C and D (from 8.7% in the Early Tagar series to 37.5% in the Middle Tagar series). These features are characteristic of many ancient and modern populations of Southern Siberia and adjacent regions of Central Asia, including the Pazyryk population of the Altai Mountains.

    Before the Iron Age, the Karasuk and Mezhovska population were probably already somehow ‘to the north’ within the ancient Steppe-Altai cline (see image below9 created by expanding Seima-Turbino- and Andronovo-related populations. During the Iron Age, further Siberian contributions with Iranian expansions must have placed Uralians of the Central Asian forest-steppe areas much closer to today’s Palaeo-Siberian cline.

    However, the modern genetic picture was probably fully developed only in historic times, when Samoyedic and Ugric languages expanded to the north, only in part admixing further with Palaeo-Siberian-speaking nomads from the Circum-Arctic region (see here for a recent history of Samoyedic Enets), which justifies their more recent radical ‘northern shift’.

    east-uralic-clines
    Modified image from Jeong et al. (2018), supplementary materials. The first two PCs summarizing the genetic structure within 2,077 Eurasian individuals. The two PCs generally mirror geography. PC1 separates western and eastern Eurasian populations, with many inner Eurasians in the middle. PC2 separates eastern Eurasians along the north-south cline and also separates Europeans from West Asians. Ancient individuals (color-filled shapes), including two Botai individuals, are projected onto PCs calculated from present-day individuals.

    This late acquisition of the language by Palaeo-Siberian nomads (without much population replacement) also justifies the wide PCA clusters of very small Siberian populations. See for example in the PCA from Tambets et al. (2018):

    uralic-ugric-samoyedic-modern-clines
    Approximate Ugric and Samoyedic clines (exluding apparent outliers). Modified from Tambets et al. (2018). Principal component analysis (PCA) and genetic distances of Uralic-speaking populations. a PCA (PC1 vs PC2) of the Uralic-speaking populations

    For their relationship with modern Mansi, we have information on Hungarian conqueror populations from Neparáczki et al. (2018):

    Moreover, Y, B and N1a1a1a1a Hg-s have not been detected in Finno-Ugric populations [80–84], implying that the east Eurasian component of the Conquerors and Finno-Ugric people are probably not directly related. The same inference can be drawn from phylogenetic data, as only two Mansi samples appeared in our phylogenetic trees on the side branches (S1 Fig, Networks; 1, 4) suggesting that ancestors of the Mansis separated from Asian ancestors of the Conquerors a long time ago. This inference is also supported by genomic Admixture analysis of Siberian and Northeastern European populations [85], which revealed that Mansis received their eastern Siberian genetic component approximately 5–7 thousand years ago from ancestors of modern Even and Evenki people. Most likely the same explanation applies to the Y-chromosome N-Tat marker which originated from China [86,87] and its subclades are now widespread between various language groups of North Asia and Eastern Europe [88].

    The genetic picture of Hungarians (their formed cline with Mansi and their haplogroups) may be quite useful for the true admixture found originally in Mansi peoples at the beginning of the Iron Age. By now it is clear even from modern populations that Steppe_MLBA ancestry accompanied the Uralic expansion to the east (roughly approximated in the graphic with Afanasievo_EBA + Bichon_LP EasternHG_M):

    siberian-population-expansions
    Admixture modelling using qpAdm. Maps showing locations and ancestry proportions of ancient (left) and modern (right) groups. From Sikora et al. (2018).

    Continue reading the final post of the series: Corded Ware—Uralic (IV): Haplogroups R1a and N in Finno-Ugric and Samoyedic.

    See also

    Related

  • The traditional multilingualism of Siberian populations
  • Iron Age bottleneck of the Proto-Fennic population in Estonia
  • Y-DNA haplogroups of Tuvinian tribes show little effect of the Mongol expansion
  • Corded Ware—Uralic (I): Differences and similarities with Yamna
  • Haplogroup R1a and CWC ancestry predominate in Fennic, Ugric, and Samoyedic groups
  • The Iron Age expansion of Southern Siberian groups and ancestry with Scythians
  • Evolution of Steppe, Neolithic, and Siberian ancestry in Eurasia (ISBA 8, 19th Sep)
  • Mitogenomes from Avar nomadic elite show Inner Asian origin
  • On the origin and spread of haplogroup R1a-Z645 from eastern Europe
  • Oldest N1c1a1a-L392 samples and Siberian ancestry in Bronze Age Fennoscandia
  • Consequences of Damgaard et al. 2018 (III): Proto-Finno-Ugric & Proto-Indo-Iranian in the North Caspian region
  • The concept of “Outlier” in Human Ancestry (III): Late Neolithic samples from the Baltic region and origins of the Corded Ware culture
  • Genetic prehistory of the Baltic Sea region and Y-DNA: Corded Ware and R1a-Z645, Bronze Age and N1c
  • More evidence on the recent arrival of haplogroup N and gradual replacement of R1a lineages in North-Eastern Europe
  • Another hint at the role of Corded Ware peoples in spreading Uralic languages into north-eastern Europe, found in mtDNA analysis of the Finnish population
  • New Ukraine Eneolithic sample from late Sredni Stog, near homeland of the Corded Ware culture