Corded Ware ancestry in North Eurasia and the Uralic expansion

uralic-clines-nganasan

Now that it has become evident that Late Repin (i.e. Yamnaya/Afanasevo) ancestry was associated with the migration of R1b-L23-rich Late Proto-Indo-Europeans from the steppe in the second half of the the 4th millennium BC, there’s still the question of how R1a-rich Uralic speakers of Corded Ware ancestry expanded , and how they spread their languages throughout North Eurasia.

Modern North Eurasians

I have been collecting information from the supplementary data of the latest papers on modern and ancient North Eurasian peoples, including Jeong et al. (2019), Saag et al. (2019), Sikora et al. (2018), or Flegontov et al. (2019), and I have tried to add up their information on ancestral components and their modern and historical distributions.

Fortunately, the current obsession with simplifying ancestry components into three or four general, atemporal groups, and the common use of the same ones across labs, make it very simple to merge data and map them.

Corded Ware ancestry

There is no doubt about the prevalent ancestry among Uralic-speaking peoples. A map isn’t needed to realize that, because ancient and modern data – like those recently summarized in Jeong et al. (2019) – prove it. But maps sure help visualize their intricate relationship better:

natural-modern-srubnaya-ancestry
Natural neighbor interpolation of Srubnaya ancestry among modern populations. See full map.
kriging-modern-srubnaya-ancestry
Kriging interpolation of Srubnaya ancestry among modern populations. See full map

Interestingly, the regions with higher Corded Ware-related ancestry are in great part coincident with (pre)historical Finno-Ugric-speaking territories:

uralic-languages-modern
Modern distribution of Uralic languages, with ancient territory (in the Common Era) labelled and delimited by a red line. For more information on the ancient territory see here.

Edit (29/7/2019): Here is the full Steppe_MLBA ancestry map, including Steppe_MLBA (vs. Indus Periphery vs. Onge) in modern South Asian populations from Narasimhan et al. (2018), apart from the ‘Srubnaya component’ in North Eurasian populations. ‘Dummy’ variables (with 0% ancestry) have been included to the south and east of the map to avoid weird interpolations of Steppe_MLBA into Africa and East Asia.

modern-steppe-mlba-ancestry2
Natural neighbor interpolation of Steppe MLBA-like ancestry among modern populations. See full map.

Anatolia Neolithic ancestry

Also interesting are the patterns of non-CWC-related ancestry, in particular the apparent wedge created by expanding East Slavs, which seems to reflect the intrusion of central(-eastern) European ancestry into Finno-Permic territory.

NOTE. Read more on Balto-Slavic hydrotoponymy, on the cradle of Russians as a Finno-Permic hotspot, and about Pre-Slavic languages in North-West Russia.

natural-modern-lbk-en-ancestry
Natural neighbor interpolation of LBK EN ancestry among modern populations. See full map.
kriging-modern-lbk-en-ancestry
Kriging interpolation of LBK EN ancestry among modern populations. See full map

WHG ancestry

The cline(s) between WHG, EHG, ANE, Nganasan, and Baikal HG are also simplified when some of them excluded, in this case EHG, represented thus in part by WHG, and in part by more eastern ancestries (see below).

modern-whg-ancestry
Natural neighbor interpolation of WHG ancestry among modern populations. See full map.
kriging-modern-whg-ancestry
Kriging interpolation of WHG ancestry among modern populations. See full map.

Arctic, Tundra or Forest-steppe?

Data on Nganasan-related vs. ANE vs. Baikal HG/Ulchi-related ancestry is difficult to map properly, because both ancestry components are usually reported as mutually exclusive, when they are in fact clearly related in an ancestral cline formed by different ancient North Eurasian populations from Siberia.

When it comes to ascertaining the origin of the multiple CWC-related clines among Uralic-speaking peoples, the question is thus how to properly distinguish the proportions of WHG-, EHG-, Nganasan-, ANE or BaikalHG-related ancestral components in North Eurasia, i.e. how did each dialectal group admix with regional groups which formed part of these clines east and west of the Urals.

The truth is, one ought to test specific ancient samples for each “Siberian” ancestry found in the different Uralic dialectal groups, but the simplistic “Siberian” label somehow gets a pass in many papers (see a recent example).

Below qpAdm results with best fits for Ulchi ancestry, Afontova Gora 3 ancestry, and Nganasan ancestry, but some populations show good fits for both and with similar proportions, so selecting one necessarily simplifies the distribution of both.

Ulchi ancestry

modern-ulchi-ancestry
Natural neighbor interpolation of Ulchi ancestry among modern populations. See full map.
kriging-modern-ulchi-ancestry
Kriging interpolation of Ulchi ancestry among modern populations. See full map.

ANE ancestry

natural-modern-ane-ancestry
Natural neighbor interpolation of ANE ancestry among modern populations. See full map.
kriging-modern-ane-ancestry
Kriging interpolation of ANE ancestry among modern populations. See full map.

Nganasan ancestry

modern-nganasan-ancestry
Natural neighbor interpolation of Nganasan ancestry among modern populations. See full map.
kriging-modern-nganasan-ancestry
Kriging interpolation of Nganasan ancestry among modern populations. See full map.

Iran Chalcolithic

A simplistic Iran Chalcolithic-related ancestry is also seen in the Altaic cline(s) which (like Corded Ware ancestry) expanded from Central Asia into Europe – apart from its historical distribution south of the Caucasus:

modern-iran-chal-ancestry
Natural neighbor interpolation of Iran Neolithic ancestry among modern populations. See full map.
kriging-modern-iran-neolithic-ancestry
Kriging interpolation of Iran Chalcolithic ancestry among modern populations. See full map.

Other models

The first question I imagine some would like to know is: what about other models? Do they show the same results? Here is the simplistic combination of ancestry components published in Damgaard et al. (2018) for the same or similar populations:

NOTE. As you can see, their selection of EHG vs. WHG vs. Nganasan vs. Natufian vs. Clovis of is of little use, but corroborate the results from other papers, and show some interesting patterns in combination with those above.

EHG

damgaard-modern-ehg-ancestry
Natural neighbor interpolation of EHG ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-ehg-ancestry
Kriging interpolation of EHG ancestry among modern populations. See full map.

Natufian ancestry

damgaard-modern-natufian-ancestry
Natural neighbor interpolation of Natufian ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-natufian-ancestry
Kriging interpolation of Natufian ancestry among modern populations. See full map.

WHG ancestry

damgaard-modern-whg-ancestry
Natural neighbor interpolation of WHG ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-whg-ancestry
Kriging interpolation of WHG ancestry among modern populations. See full map.

Baikal HG ancestry

damgaard-modern-baikalhg-ancestry
Natural neighbor interpolation of Baikal hunter-gatherer ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-baikal-hg-ancestry
Kriging interpolation of Baikal HG ancestry among modern populations. See full map.

Ancient North Eurasians

Once the modern situation is clear, relevant questions are, for example, whether EHG-, WHG-, ANE, Nganasan-, and/or Baikal HG-related meta-populations expanded or became integrated into Uralic-speaking territories.

When did these admixture/migration events happen?

How did the ancient distribution or expansion of Palaeo-Arctic, Baikalic, and/or Altaic peoples affect the current distribution of the so-called “Siberian” ancestry, and of hg. N1a, in each specific population?

NOTE. A little excursus is necessary, because the calculated repetition of a hypothetic opposition “N1a vs. R1a” doesn’t make this dichotomy real:

  1. There was not a single ethnolinguistic community represented by hg. R1a after the initial expansion of Eastern Corded Ware groups, or by hg. N1a-L392 after its initial expansion in Siberia:
  2. Different subclades became incorporated in different ways into Bronze Age and Iron Age communities, most of which without an ethnolinguistic change. For example, N1a subclades became incorporated into North Eurasian populations of different languages, reaching Uralic- and Indo-European-speaking territories of north-eastern Europe during the late Iron Age, at a time when their ancestral origin or language in Siberia was impossible to ascertain. Just like the mix found among Proto-Germanic peoples (R1b, R1a, and I1)* or among Slavic peoples (I2a, E1b, R1a)*, the mix of many Uralic groups showing specific percentages of R1a, N1a, or Q subclades* reflect more or less recent admixture or acculturation events with little impact on their languages.

*other typically northern and eastern European haplogroups are also represented in early Germanic (N1a, I2, E1b, J, G2), Slavic (I1, G2, J) and Finno-Permic (I1, R1b, J) peoples.

ananino-culture-new
Map of archaeological cultures in north-eastern Europe ca. 8th-3rd centuries BC. [The Mid-Volga Akozino group not depicted] Shaded area represents the Ananino cultural-historical society. Fading purple arrows represent likely stepped movements of subclades of haplogroup N for centuries (e.g. Siberian → Ananino → Akozino → Fennoscandia [N-VL29]; Circum-Arctic → forest-steppe [N1, N2]; etc.). Blue arrows represent eventual expansions of Uralic peoples to the north. Modified image from Vasilyev (2002).

The problem with mapping the ancestry of the available sampling of ancient populations is that we lack proper temporal and regional transects. The maps that follow include cultures roughly divided into either “Bronze Age” or “Iron Age” groups, although the difference between samples may span up to 2,000 years.

NOTE. Rough estimates for more external groups (viz. Sweden Battle Axe/Gotland_A for the NW, Srubna from the North Pontic area for the SW, Arctic/Nganasan for the NE, and Baikal EBA/”Ulchi-like” for the SE) have been included to offer a wider interpolated area using data already known.

Bronze Age

Similar to modern populations, the selection of best fit “Siberian” ancestry between Baikal HG vs. Nganasan, both potentially ± ANE (AG3), is an oversimplification that needs to be addressed in future papers.

Corded Ware ancestry

bronze-age-corded-ware-ancestry
Natural neighbor interpolation of Srubnaya ancestry among Bronze Age populations. See full map.

Nganasan-like ancestry

bronze-age-nganasan-like-ancestry
Natural neighbor interpolation of Nganasan-like ancestry among Bronze Age populations. See full map.

Baikal HG ancestry

bronze-age-baikal-hg-ancestry
Natural neighbor interpolation of Baikal Hunter-Gatherer ancestry among Bronze Age populations. See full map.

Afontova Gora 3 ancestry

bronze-age-afontova-gora-ancestry
Natural neighbor interpolation of Afontova Gora 3 ancestry among Bronze Age populations. See full map.

Iron Age

Corded Ware ancestry

Interestingly, the moderate expansion of Corded Ware-related ancestry from the south during the Iron Age may be related to the expansion of hg. N1a-VL29 into the chiefdom-based system of north-eastern Europe, including Ananyino/Akozino and later expanding Akozino warrior-traders around the Baltic Sea.

NOTE. The samples from Levänluhta are centuries older than those from Estonia (and Ingria), and those from Chalmny Varre are modern ones, so this region has to be read as a south-west to north-east distribution from the Iron Age to modern times.

iron-age-corded-ware-ancestry
Natural neighbor interpolation of Srubnaya ancestry among Iron Age populations. See full map.

Baikal HG-like ancestry

The fact that this Baltic N1a-VL29 branch belongs in a group together with typically Avar N1a-B197 supports the Altaic origin of the parent group, which is possibly related to the expansion of Baikalic ancestry and Iron Age nomads:

iron-age-baikal-ancestry
Natural neighbor interpolation of Baikal HG ancestry among Iron Age populations. See full map.

Nganasan-like ancestry

The dilution of Nganasan-like ancestry in an Arctic region featuring “Siberian” ancestry and hg. N1a-L392 at least since the Bronze Age supports the integration of hg. N1a-Z1934, sister clade of Ugric N1a-Z1936, into populations west and east of the Urals with the expansion of Uralic languages to the north into the Tundra region (see here).

The integration of N1a-Z1934 lineages into Finnic-speaking peoples after their migration to the north and east, and the displacement or acculturation of Saami from their ancestral homeland, coinciding with known genetic bottlenecks among Finns, is yet another proof of this evolution:

iron-age-nganasan-ancestry
Natural neighbor interpolation of Nganasan ancestry among Iron Age populations. See full map.

WHG ancestry

Similarly, WHG ancestry doesn’t seem to be related to important population movements throughout the Bronze Age, which excludes the multiple North Eurasian populations that will be found along the clines formed by WHG, EHG, ANE, Nganasan, Baikal HG ancestry as forming part of the Uralic ethnogenesis, although they may be relevant to follow later regional movements of specific populations.

iron-age-whg-ancestry
Natural neighbor interpolation of WHG ancestry among Iron Age populations. See full map.

Conclusion

It seems natural that people used to look at maps of haplogroup distribution from the 2000s, coupled with modern language distributions, and would try to interpret them in a certain way, reaching thus the wrong conclusions whose consequences are especially visible today when ancient DNA keeps contradicting them.

In hindsight, though, assuming that Balto-Slavs expanded with Corded Ware and hg. R1a, or that Uralians expanded with “Siberian” ancestry and hg. N1a, was as absurd as looking at maps of ancestry and haplogroup distribution of ancient and modern Native Americans, trying to divide them into “Germanic” or “Iberian”…

The evolution of each specific region and cultural group of North Eurasia is far from being clear. However, the general trend speaks clearly in favour of an ancient, Bronze Age distribution of North Eurasian ancestry and haplogroups that have decreased, diluted, or become incorporated into expanding Uralians of Corded Ware ancestry, occasionally spreading with inter-regional expansions of local groups.

Given the relatively recent push of Altaic and Indo-European languages into ancestral Uralic-speaking territories, only the ancient Corded Ware expansion remains compatible with the spread of Uralic languages into their historical distribution.

Related

Genetic continuity among Uralic-speaking cultures in north-eastern Europe

east-europe-bronze-age

The recent study of Estonian Late Bronze Age/Iron Age samples has shown, as expected, large genetic continuity of Corded Ware populations in the East Baltic area, where West Uralic is known to have been spoken since at least the Early Bronze Age.

The most interesting news was that, unexpectedly for many, the impact of “Siberian ancestry” (whatever that actually means) was small, slow, and gradual, with slight increases found up to the Middle Ages, compatible with multiple contact events in north-eastern Europe. Haplogroup N became prevalent among Finnic populations only through late bottlenecks, as research of modern populations have long suggested, and as ancient DNA research hinted since at least 2015.

I risked to correlate the arrival of chiefs from the south-west with the infiltration of N1c-VL29 subclades during the transition to the Iron Age, coupled with that minimal “Siberian” ancestry (see e.g. here and here). Now we know that the penetration of this non-CW ancestry started, as predicted, in the Iron Age; that it was highly variable in the few samples where it appeared, with ca. 1-4%, while most Iron Age individuals show 0%; and that it was not especially linked to individuals of N1c-Vl29 lineages.

It is also basically confirmed, based on the (ancient and Modern Swedish) N1c-L550 subclades found among Iron Age Estonians, that N1c-VL29 lineages and the so-called “Siberian” ancestry will be found simultaneously around the Baltic coastal areas, and that different lineages must have suffered later founder effects among Finns, which suggests that these alliances through exogamy brought exactly as much language change in Sweden, Lithuania, or Poland, as they did in the East Baltic region…

On the other hand, the paper has also shown a potential movement of Corded Ware-derived peoples, if the change from LBA to IA samples is meaningful; in fact, even more Corded Ware-like than Baltic and Estonian BA populations. The exact origin of that movement is difficult to pinpoint, and it may not be related to the arrival of Akozino warrior-traders from the south-east, since theirs seems to be a minor impact proper of elites in a chiefdom system around the Baltic.

fortified-settlements-lba-ia
Distribution of fortified settlements (filled circles) and other hilltop sites (empty circles) of the Late Bronze Age and Pre-Roman Iron Ages in the East Baltic region. Tentative area of most intensive contacts between Baltic and Balto-Finnic communities marked with a dashed line. Image modified from (Lang 2016).

Also suggesting a potential movement is the ‘southern’ shift observed in the West and East Baltic areas, likely showing the arrival of Proto-East Baltic speakers (such as the Trzciniec outlier), as we have already discussed in this blog. The unexpected increase in Corded Ware-like ancestry in the Eastern Baltic, coupled with the expected large continuity of hg. R1a-Z283 in the homeland of Balto-Finnic expansions, gives even more support to the known complex system of exogamy along the Baltic coasts, and offers another potential reason for the rise of Baltic-speaking territories in the West Baltic: elite domination.

It is nevertheless important to understand that, even among the most “genetic continuous” regions like Estonia, not a single population in Europe is heir of some ancestral, immutable people. Not in terms of haplogroups, and not in terms of admixture. Balto-Finnic speakers, however continuous they might seem (e.g. in Southern Estonians) aren’t an exception.

After all, this blog was (re)born to fight the currently prevalent sheer stupidity surrounding the simplistic “R1a/steppe ancestry=Indo-European” association, so I wouldn’t like to see it replaced with some other stupid continuity or purity ideas within 10 to 20 years…

Late Uralic stems from East Corded Ware groups

With the currently available tools – linguistics, archaeology, and now genetics -, I don’t think there is any argument to date to question the direct connection of the Late Proto-Uralic expansion with all Eastern Corded Ware groups (i.e. Battle Axe, Fatyanovo-Balanovo, and Abashevo), and thus at least with the unifying A-horizon of Corded Ware and the bottlenecks under R1a-Z645.

NOTE. The only out-group among Corded Ware cultures is the Single Grave culture. It appears to be an early Corded Ware offshoot, reflected in their non-unitary cultural traits (distinct from later unifying waves), in their varied patrilineal clans, and in the short-lasting cultural effect in northern Europe before their complete demise under pressure of expanding Yamna/Bell Beaker peoples from the Danube. The culture’s minimal (if any) effects on succeeding peoples might be seen mostly in the (mainly phonetic) Uralic substrate found in Balto-Slavic – although this may also stem from a more eastern influence, close to the Baltic – and in the contacts of Celtic with Uralic. The huge time depth between this early hypothetic Uralic layer in northern Europe and the emergence of peoples inhabiting these territories in recorded history have no doubt been erroneously interpreted as a lack of Uralic presence in the area.

1) That connection was evident in the Yamna – CWC differences in archaeology, and especially later, with at least Fatyanovo-Balanovo and Abashevo representing the obvious replacement of the Volosovo culture before further expansions of CWC-related groups west and east of the Urals.

The mythical millennia-long continuity of Volosovo hunter-gatherers, including centuries among Corded Ware peoples, as expected lately by the Copenhagen group (and anyone who doesn’t want to question the 1960s association of Indo-European with CWC) must be rejected today in population genomics, as the recent studies of ancient and modern populations show, and as ancient DNA from the region will confirm.

2) In linguistics, the survival of Volosovo as The Uralic-speaking culture was also hardly believable. From Kallio (2015):

While we can say at least something about Uralic substrates in Northeastern Europe, non-Uralic substrates cannot at all easily be identified, because of multiple language shifts, viz. first from non-Uralic to Uralic and then from Uralic to Russian. Yet the Soviet Uralicist Boris Serebrennikov (1956, 1959) argued that there are some non-Uralic substrate toponyms in the Volga-Oka region, but his idea was never taken seriously in the west (cf. Sauvageot 1958), and it pretty soon also sank into oblivion in Russia, even though it can still occasionally pop up there in non-onomastic circles (cf. Napolskikh 1995: 18–19). However, not all the hypotheses on non-Uralic substrates in Northeastern Europe should be rejected (see e.g. Helimski 2001b).

bronze-age-early-languages-east-europe
Tentative map of the distribution of known languages in Eastern Europe during the Early Bronze Age. See full map.

Helimski (2001) argues for a non-Uralic topo-hydronomy in Northern Russia, whose population may have kept their languages up to the Common Era despite the Corded Ware expansion, which is in line with the survival of some non-Indo-European languages everywhere in Europe after the expansion of Yamna and its offshoots:

It should be borne in mind that these [Uralic] hydronyms reached us mainly through Northern Russian and, accordingly, with a tendency to phonetic-morphological adaptation and unification (for river names it is “natural” to be, like the word ‘river’ itself, feminine and to end in -a). Taking into account this circumstance, it may turn out to be non-useless for etymological identification of at least some of the hydronyms on the Finno-Ugric basis.

On the other hand, I wouldn’t exclude the possibility that some parts of this large geographical area were never (completely) Finno-Ugric. The population that created the most important part of the hydronymy of the Russian North could be finally pushed aside or assimilated only at the end of the 1st – beginning of the 2nd millennium AD, during the Russian colonization, retaining the memory of the White-Eyed Chude in its own memory.

NOTE. For more on this non-IE substrate in (especially West) Uralic, see e.g. Zhivlov (2015),

The same non-Uralic substrate is most likely behind most of the shared traits by Mordvinic and Balto-Finnic (see below).

3) In genetics, I don’t think the picture could get any clearer. I don’t know what “Steppe ancestry = Indo-European” proponents expected from 2019, if they expected anything at all (I haven’t seen any coherent model, proposal, or prediction for a long time now), but I doubt the recent results are compatible with any of their implied expectations.

corded-ware-pca-sub-neolithic-europe
Detail of the PCA of the Corded Ware expansion. See full PCA and more related files.

Notice, from the PCA above, how this Baltic Late Neolithic group shows actually a shift from Sredni Stog (see PCA with Sredni Stog) towards typical Khvalynsk-Urals-related ancestry, i.e. populations from eastern European forested regions, derived from hunter-gatherer pottery groups, as I have proposed for a very long time, since the first time a Baltic LN “outlier” appeared. It’s amazing how some amateurs can find 0.1% of any Siberian outlier’s ancestry among Uralians 4,000 years later, but fail to see the direct connection here. The esoteric uses of qpAdm, I guess…

Especially noticeable is the extra WHG-like ancestry and corresponding shift, seen especially marked in late Polish CWC samples, but also in Baltic CWC and especially in one Sweden Battle Axe sample, all of them shifting apparently closer to Pitted Ware and SHG. While that may have been interpreted as an in situ admixture in Scandinavia before, the late Polish CWC samples show likely a resurgence of local populations, so we can assume that both shifts (to SHG- and EHG-like populations) of available CWC samples around the Baltic are clearly part of the WHG:EHG continuum that will be found in the eastern European sub-Neolithic cultures, from Narva to Volosovo.

This WHG-related ancestry is clearly predominant in groups with which Battle Axe peoples admixed, based on the shift towards Pitted Ware, which – I can only guess based on modern Volga Finns – is different from the shift we will see in Netted Ware, more towards the Khvalynsk-Urals cluster. This is in line with the expansion of Battle Axe eastward through coastal areas (West to East Baltic and Finland into Sweden), while Fatyanovo peoples probably emerged from a slightly different route, but also a northern one, if one is to follow archaological similarities and their chronology.

bronze-age-europe-baltic
Detail of the PCA of European Bronze Age populations. See full PCA and more related files.

During the Iron Age, the only peoples that probably shifted strongly (based on modern populations) are West Baltic ones, getting closer to the available Late Trzciniec samples, and even closer to the Trzciniec outlier, i.e. away from the earlier Eastern Corded Ware cluster, and towards Central European groups like Czech EBA or Poland EBA, both of them clearly derived from Bell Beakers, but also admixed with (and thus shifted toward) CW-like populations.

If one looks carefully at the previous PCA on Bronze Age populations, and the next one on Iron Age clusters, it is evident that adding the Swedish LN outlier to East Baltic BA (both strongly related to Battle Axe populations) essentially gives us the continuity of East Baltic BA into the Iron Age. This cluster is continued also in two outliers from Sigtuna, a Viking town close to the Gulf of Finland, known to be an important trading site, 1,500 years later. Not much of a change around the Gulf of Finland, then:

iron-age-eastern-europe
Detail of the PCA of East and North European Iron Age populations. See full PCA and more related files.

Based on the two simplistic Uralic clines one might see described (among the many that certainly existed, from Corded Ware to different Eurasian populations), and just like BOO was for some months fashionable as “Samic”, some may be tempted to say that certain Sintashta or Srubna outliers close to the Urals mark the True Uralic™ peoples. Because, of course they do. Ghost haplogroup N and stuff. And Corded Ware never ever Uralic. Because Gimbutas, and my IE R1a grandfather.

NOTE. Funny thing here: there might be Corded Ware, Iranian, Slavic, Germanic, etc… outliers or out-groups, and they might form the widest genetic clusters ever seen, but they are all of one language, because archaeology and linguistics; however, one “outlier” (also, put your own definition of “outlier” here, let’s say 1% of whatever, and strontium isotope potentially from 100 km away) ca. 600 BC in the Baltic who (surprise!) happens to show hg. N, and he signals the first incoming True Uralic™ speaker from wherever… It won’t be the first or the last time some people resort to “the complexity of Uralic-speaking peoples” in ancestry, just to look for “hg. N = Uralic” like crazy. You only need common sense to understand that this is not how this works. Amateur genomics can’t get more embarrassing than the current “let’s look for ‘Siberian ancestry’ in every individual of haplogroup N” trend. Or maybe it can, and it will, but I can’t see it yet.

If one were to insist on looking for ‘foreign’ contributions among Iron Age Estonians, though, I think one should also check out first archaeology, and then the PC3 (or, more graphically, a 3D plot), to understand what might be happening with the many Uralic clines derived from Corded Ware, before starting to play around with bioinformatic tools to discover a teeny tiny 1% admixture of the wrong population, and rushing to build far-fetched narratives. Apparently, one of the different clines formed roughly between southern (steppe – forest-steppe) and northern (tundra-taiga) populations in Uralians is also seen in some Iron Age Estonian individuals – especially in some late samples from Ingria…This is not my main interest, so I will leave this here for others to keep wasting their time chasing the white whale of the 0.5% of True Uralic™ ancestry in ancient Baltic samples of hg. N.

pca-3d-estonians-iron-age-boo-samic
Still images of the 3D plot of Eurasian samples. Typical PC1 vs. PC2 visualization to the left, and shift of the view to PC3 on the right image. See full PCA and more related files.

An exclusive Volga-Kama homeland for Disintegrating Uralic?

Since I don’t believe in macro-regions of largely continuous ethnolinguistic communities, as I have often said about Slavic (naively associated with prehistoric tribes of Eastern Europe) or Germanic (absurdly considered to be represented by Battle Axe), it is difficult for me to believe that Battle Axe-derived cultures remained of the same Finno-Samic dialects since the Corded Ware expansion…unless we live in Westeros, where everything happens “for thousands of years”.

I have to admit, then, that the now prevalent identification among Uralicists has become quite attractive:

  • Fatyanovo-Balanovo as Finno-Permic:
    • Fatyanovo/Netted Ware with West Uralic (also called Finno-Mordvinic).
    • Balanovo/Chirkovo-Kazan with Central Uralic (Mari-Permic).
  • Abashevo, into the Andronovo-like Horizon through the Seima-Turbino phenomenon, with East Uralic (also Ugro-Samoyedic).

Exactly like the identification of Yamna Hungary – Bell Beaker transition as the North-West Indo-European homeland, it gives us simplicity and small and late ethnolinguistic communities, away from the traditionally overused big and early language territories.

This late homeland would be supported, among others, by:

  • The presence of Indo-Iranian loanwords in Finno-Permic and Ugric (probably also in Samoyedic, either lost, or – much more likely – underresearched), compatible with the immediate contact between Abashevo – Sintashta-Potapovka-Filatovka and Fatyanovo-Balanovo.
  • The supposed expansion of Netted Ware from Fatyanovo to the north-west, which may be explained as the split and expansion of Balto-Finnic and Samic ca. 1900 BC.
  • A longer-lasting Finno-Permic (West+Central Uralic) community contrasting with the early separation of East Uralic.
  • The compatibility of this late expansion with the late expansion of Pre-Germanic from Denmark with the Dagger Period, and of Balto-Slavic with Trzciniec, which puts all three dialects reaching the Baltic Sea in the EBA.

NOTE. I meant to update the linguistic text to include the most recently favoured phylogenetic tree of Uralic languages after Häkkinen (2007, 2009, 2014), which has very quickly become the new normal among Uralicists, but I don’t think I will have enough time to review the necessary papers for that. I am rushing to publish a printed edition, so the text will wind up being a mixture of “traditional” (meaning, basically, pre-2010s) description of Uralic dialects but using modern divisions; say, “West Uralic” instead of “Finno-Samic”. By the way, I am still amazed that none of my reader-haters (or any online user discussing Uralic migrations, for that matter) have come up with the questions that the new division pose, and it supports my suspicion about the complete lack of interest in linguistics of most (a)DNA fans, except for the occasional use of old and free PDFs Googled to support new narratives invented expressly for some qpAdm results…

textile-ceramics-europe-bronze-age
Textile ceramic styles and influence of Bronze Age cultures divided in clusters.

Problems with this Parpola-Carpelan’s (2012-2018) interpretation include:

  • The differentiation between Fennoscandian Textile Ceramics vs. Netted Ware, which is not warranted in archaeology. The assumption that Netted Ware expanded to the Baltic Sea (as Kallio does, following the traditional view) is thus weak, and it was probably a question of cultural contacts coupled with short-distance population movements/exchange in both directions (from the Baltic to the Volga and vice versa). In fact, the culture division relies on some fairly common and technically simple ornamentation patterns, widespread all over northern Europe, even before the Corded Ware expansion, and it is very difficult to separate certain neighboring Textile Ceramics from Netted Ware groups in southern Finland (i.e. Sarsa-Tomitsa groups).
  • The strict and radical direction described for the Netted Ware by Carpelan, as an eastward and northward expansion, within a very short time frame (ca. 1900-1800 BC), based on few radiocarbon dates, which seems to me like a very risky assumption. We know how this kind of descriptions of direction of culture expansion based on radiocarbon dates has turned out in much more complex “packages”, like the Bell Beaker culture… In fact, the earliest dates for Textile Ware are from the East Baltic, earlier than those of Netted Ware.
  • The assumption that Balto-Finnic traits shared with Mordvinic are a) late and b) meaningful for dialectalization of two closely related dialects, when it is clear that both dialects separated quite early. Phonologically Finnic is more conservative, morphologically less so, and the shared traits include a handful of non-Uralic substrate words which can’t be traced to a single common source, hence they were adopted when both languages had already separated… All in all, Finnic – Mordvinic correspondances are not even close to Italo-Celtic ones, which is clearly fully incompatible with a proposal of a Finnic separation from Mordvinic coinciding with the LBA-IA transition.

Especially problematic for Parpola’s model is the lack of genetic impact in Bronze Age or Iron Age Estonians, not reaching a significant level under any possible statistical threshold – which I am sure was quite disappointing for some of my readers -, but is in line with major archaeological continuity of groups the from region, only disturbed in cultural (and Y-chromosome) terms by the expansion of Akozino warrior-traders all over the Baltic Sea. Any proposed population movement will be very difficult to support in genetics, given the Corded Ware-derived populations that we will see in both regions, and the continued Baltic-Volga contacts since the Corded Ware expansion.

Problems with an interpretation of such a small impact in population genomics includes the similarly weak impacts and haplogroup infiltrations that can be seen among populations basically everywhere in Eurasia, during any given period, and much greater genetic impacts that are supposed to be (or that were certainly) followed by ethnolinguistic continuity.

akozino-malar-axes-fennoscandia
Distribution of the Akozino-Mälar axes according to Sergej V. Kuz’minykh (1996: 8, Abb. 2).

The Battle Axe question

From Kallio (2015), about choosing a tentative homeland for Proto-Uralic:

(…) linguistically uniform Proto-Uralic would have been spoken in the Volga-Oka region until the mid-third millennium BC when the Proto-Uralic-speaking area would have expanded to the Volga-Kama region as well. By the end of the same millennium, this expansion would have led to the earliest dialectal splits within Uralic into Finno-Mordvin, Mari-Permic, and Ugro-Samoyed. The splitting up of these three soon followed during the early second millennium BC when the Uralic-speaking area finally stretched from the Baltic Sea in the west to the Altai mountains in the east. Indeed, no matter where Proto-Uralic was spoken, the branching into the nine well-attested subgroups (viz. Finnic, Saami, Mordvin, Mari, Permic, Hungarian, Mansi, Khanty, and Samoyed) must have taken less than a millennium, because their shared phonological and morphosyntactic isoglosses are rather limited (see Salminen 2002). The traditional view that all this branching would have taken several millennia violates everything linguistic typology teaches us about the rate of language change.

The basic problem of this identification of Fatyanovo-Balanovo as West-Central Uralic and Abashevo as East Uralic is the nature of the Battle Axe culture, including the Bronze Age East Baltic and Gulf of Finland area. Even if it is accepted that Fatyanovo-Balanovo represented all Western groups, Battle Axe must have represented West Uralic-like dialects.

The ethnolinguistic identification of Battle Axe depends ultimately on the nature of contacts of Fatyanovo/Netted Ware with Battle Axe/Textile Ceramics. If both groups were close and interacted profusely, as it seems, it doesn’t seem granted that we will be able to distinguish a close Para-West Uralic dialect of Scandinavia from the actual expanding Balto-Finnic and Samic dialects, if they were actually linked to the Netted Ware expansion. Also from Kallio (2015):

No doubt the most convincing substrate theory has recently been put forward by the Saami Uralicist Ante Aikio (2004), who has not only rehabilitated but also improved the old idea of a non-Uralic substrate in Saami. His study shows that there were still non-Uralic languages spoken in Northern Fennoscandia as recently as the first millennium AD. Most of all, they were not only genetically non-Uralic but also typologically non-Uralic-looking, bearing a closer resemblance to the so-called Palaeo-European substrates (for which see e.g. Schrijver 2001; Vennemann 2003).

In comparison, the case of Finnic is much more difficult. The fact that Proto-Uralic was not spoken in the East Baltic region means that this area must have originally been non-Uralic-speaking, but so far the evidence for a non-Uralic substrate in Finnic has consisted of appellatives and proper names with no etymology (cf. Ariste 1971; Saarikivi 2004a). Contrary to the proposed substrate words in Saami, those in Finnic show no structural non-Uralisms, as if they had indeed been borrowed from some genetically related or at least typologically similar languages, as I suggested above. Also none of them is more recent than the Middle Proto-Finnic stage, which makes them at least two millennia old. All this agrees with archaeological evidence discussed earlier that the Uralicization of the East Baltic region occurred during the Bronze Age (ca. 1900–500 BC).

The discussion of the paper continues with an unsuccessful attempt to find a hypothetical ancient Indo-European substrate that Kallio believes must be associated with the expansion of Corded Ware, in line with the traditional belief. For example, the often mentioned – almost folk etymology-like, unsurprisingly popular among amateurs – ‘Neva’ as derived from IE “young” is logically rejected…Unlike Parpola, Kallio’s view seems to be confident that Netted Ware (as Textile Ware) expanded into the East Baltic, on both sides of the Gulf of Finland, already during the Bronze Age.

As it has become apparent in population genomics, none of them was right, and Textile Ceramics will essentially show – like Netted Ware – a large genetic continuity of Corded Ware peoples in the whole north-eastern European forest zone – despite small regional population movements, obviously -, which necessarily implies that the whole Corded Ware culture – and not only Fatyanovo-Balanovo and Abashevo – were Uralic-speaking territories.

The similarities in terms of culture and Y-DNA bottlenecks between Battle Axe and Fatyanovo-Balanovo also imply that the linguistic differences between these groups were probably not many, and became strongly divided only after their territorial division. Continued contacts between Battle Axe- and Fatyanovo-derived groups can explain the proposed contacts (Finnic with Samic, Finnic with Mordvinic) after their linguistic-but-not-physical separation.

east-european-fatyanovocwc
East European movement directions (arrows) of the representatives of the Central European Corded Ware Culture (according to I.I. Artemenko).

Battle Axe spoke “Para-Balto-Finnic”?

The Balto-Finnic-speaking nature of Battle Axe is thus supported by:

  • The lack of non-Uralic substrates in Balto-Finnic territory (Kallio 2015).
  • The early separation of Samic and Finnic from Mordvinic, and the virtual identity of Proto-West-Uralic and Proto-Uralic, which suggests that Proto-Uralic spread fast (Parpola 2012).
  • The scarce non-Uralic topo-hydronymy in the East Baltic and around the Gulf of Finland (Saarikivi 2004), comparable to that on the Upper Volga region.
  • The strong influence of a Balto-Finnic-like substrate on Pre-Germanic (or, in Kallio’s opinion, the same Scandinavian substrate influencing both Germanic and Balto-Finnic at the same time), and the continued influence of Balto-Finnic on Proto-Baltic and Proto-Slavic.
  • The continued influence of Corded Ware-derived groups in central-east Sweden in Finland and the East Baltic in terms of agricultural innovations appearing in the LBA, compatible with Schrijver’s proposal of intermediate Germanic-shifted Balto-Finnic groups and Balto-Finnic groups influenced by their pronunciation.
  • The intense Palaeo-Germanic and late Balto-Slavic / early Proto-Baltic superstrate on Balto-Finnic, which place all three dialects around the Baltic Sea since the Early Bronze Age.
  • The easy replacement of a hypothetic Para-Balto-Finnic dialect by incoming Proto-Balto-Finnic-speaking peoples (say, with textile ceramics), without much linguistic impact.

In fact, the continuous contacts of the East Baltic with the Volga, and especially the close interaction with Akozino warrior-traders just before the Tarand-grave period, could be the actual origin of the recent (if any) Finnic-Mordvinic connections that need to be traced back to the LBA-IA (maybe here the number ‘ten’), since most of them can be related to a Pit-Comb Ware culture substrate and earlier contacts through the forest zone, which Samic (due to its early split and presence to the north of the Gulf of Finland during the BA) does not share. In fact, some of them can be traced back to Balto-Finnic first

These are the most often mentioned, in order of descending relevance for a shared ancient community:

  • Noun paradigms and the form and function of individual cases.
  • The geminate *mm (foreign to Proto-Uralic before the development of Fennic under Germanic influence) and other non-Uralic consonant clusters.
  • The change of numeral *luka ‘ten’ with (non-Uralic) *kümmen.
  • The presence of loanwords of non-Uralic origin, related to farming and trees, potentially Palaeo-European in nature.

It’s not only a question of quantity. Are these shared Mordvinic – Balto-Finnic traits really more relevant than, say, those between Italo-Celtic, which are supposed to have formed a community for a very short period at the end of the 3rd millennium around the Alps? Are these traits even sufficient to propose a common early Mordvinic-Finnic group within West Uralic, rather than loose Mordvinic – Balto-Finnic contacts, i.e. contacts between East Baltic (Textile Ceramics) and Volga-Kama (Netted Ware)?

Based on the alternative (Kallio’s) view of continued contacts between Textile Ceramics groups, even without knowing anything about linguistics, you can guess that Parpola is spinning very thin when assuming that these changes suggest that Balto-Finnic may have expanded with Akozino warrior-traders, separating thus ca. 800 BC from Mordvinic…

Genetic findings now clearly help dismiss any meaningful population impact in the LBA-IA transition, although any linguist can obviously argue for linguistic change in spite of major genetic continuity. But then we are stuck in the pre-ancient DNA era, so what’s ancient DNA for.

netted-ware-textile-ceramics
Middle Bronze Age cultures of Eastern Europe.

Genetic continuity = language continuity?

In the end, it’s very difficult to say how much language continuity there is around Estonia since the arrival of Corded Ware peoples. Looking at Modern Estonians, they have been clearly influenced by recent contacts with Baltic- and Germanic-speaking peoples clustering to the south-west in the PCA. They seem to have also received contacts from north(-east)ern peoples, likely from Finland, evidenced by their shifts toward the modern Estonian cluster during and after the Middle Ages, with a slight increase in Siberian ancestry and N1c subclades associated with Lovozero Ware. How much language change did these contacts bring? Maybe an expansion of Gulf of Finland Finnic (Northern Estonian) over Inland Finnic (Southern Estonian) and Gulf of Riga Finnic (Livonian)? Difficult to know, exactly, but, in the traditional view of Balto-Finnic dialectal distribution among Uralicists like Kallio, possibly no change at all.

So, if the obvious changes in the Estonia_MA cluster relative to Estonia_IA cluster and Estonia_Modern relative to Estonia_MA do not represent radical language change…Why would Estonia_IA represent a change relative to Estonia_BA, when it is statistically basically the same? Or Estonia_BA relative to CWC_Baltic? Because of the infiltration of haplogroup N1c around the whole Baltic? Because of the occasional 1% “Siberian” ancestry in some non-locals of varied haplogroups across the whole Baltic area?

In spite of all this, the amount of special pleading we are seeing among openly Nordicist amateurs when discussing the Uralic homeland relative to the Indo-European question in genetics has become a matter of plain willful ignorance. Like the living corpses of the Anatolian homeland, the Armenian homeland, the OIT proponents, or the nativist Basque R1b association, the personal involvement in the revival of “R1a=Indo-European” and “N=Uralic” trends is just painful to watch.

[Next post in this line, if I manage to make time for it: “Genetic (dis)continuity in Central Europe“. Let’s see if early Balts and early Slavs, as well as Germanic peoples, show a cluster closer to Danubian EBA (viz. Maros), Hungary-Balkans BA, and Urnfield-related samples than their predecessors in their areas, i.e. away from East Corded Ware groups… If you want, you can enjoy for the moment the new PCAs I could get done and the tentative map of languages in the Early Bronze Age, that will probably give you the right idea about early Indo-European and Uralic population movements]

bronze-age-early-indo-european
European Early Bronze Age: tentative language map based on linguistics, archaeology, and genetics. See full map.

Related

Pre-Germanic and Pre-Balto-Finnic shared vocabulary from Pitted Ware seal hunters

corded-ware-pitted-ware

I said I would write a post about topo-hydronymy in Europe and Iberia based on the most recent research, but it seems we can still enjoy some more discussions about the famous Vasconic Beakers, by people longing for days of yore. I don’t want to spoil that fun with actual linguistic data (which I already summarized) so let’s review in the meantime one of the main Uralic-Indo-European interaction zones: Scandinavia.

Seal hunting

One of the many eye-catching interpretations – and one of the few interesting ones – that could be found in the relatively recent article Talking Neolithic: Linguistic and Archaeological Perspectives on How Indo-European Was Implemented in Southern Scandinavia, by Iversen & Kroonen AJA (2017) was this:

The borrowing of lexical items from hunter-gatherers into Germanic refers to the potential adoption of Proto-Germanic *selhaz “seal” (Old Norse selr, Old English seolh, Old High German selah) as well as Early Proto-Balto-Finnic *šülkeš “seal” (Finnish hylje, Estonian hüljes) from the marine-oriented Sub-Neolithic Pitted Ware culture.

kroonen-iversen
Modified from Kristiansen et al. (2017), with red circle around the hypothesized interaction of Germanic with hunter-gatherers. “Schematic representation of how different Indo-European branches have absorbed words (circles) from a lost Neolithic language or language group (dark fill) in the reconstructed European linguistic setting of the third millennium BC, possibly involving one or more hunter gatherer languages (light fill) (after Kroonen & Iversen 2017)”.

This is what Kroonen thought about this word in his Etymological Dictionary of Proto-Germanic (2006):

Gmc. *selha– m. ‘seal’ – ON selr m. ‘id.’, Far. selur m. ‘id.’, OSw. siæl m. ‘id.’, Sw. själ c. ‘id.’, OE seolh m. ‘id.’, E seal, OS selah m. ‘id.’, EDu. seel, seel-hont m. ‘id.’, Du. zee-hond c. ‘id.’, OHG selah m. ‘id.’, MHG sele m. ‘id.’ (GM).

A Germanic word with no certain IE etymology. The link with Lith. selė́ti ‘to crawl’ (Torp 1909: 436) is erroneous, as this verb corresponds to PGm. *stelan- (q.v.). The *h may nevertheless correspond to the PIE animal suffix *-ko-, for which see *elha{n)- ‘elk’ and *baruga- ‘boar’.

Focusing on this substrate etymon, coupled with archaeology and ancient DNA, in the recent SAA 84th Annual Meeting (Abstracts in PDF):

Kroonen, Guus (Leiden University) and Rune Iversen

[196] The Linguistic Legacy of the Pitted Ware Culture

The Scandinavian hunter-, fisher- and gatherer-based Pitted Ware culture is chronologically situated in the Neolithic. However, it challenges our traditional view on cultural and social evolution by representing a return to an otherwise abandoned hunter-gatherer lifestyle. In general, the Pitted Ware culture must be seen as an offshoot of the “Sub-Neolithic” societies inhabiting wide parts of northern and northeastern Europe in the fourth and third millennium B.C.E.

Isotopic and aDNA studies have shown that people of the east Swedish Pitted Ware culture, both dietarily and genetically were distinct from the early farmers in this region, the Funnel Beaker culture. Isotopic data shows a marked predominance of seal in the diet, which has given the Pitted Ware people the nickname “Inuit of the Baltic”.

As regards language, it is to be expected that people practicing a Pitted Ware lifestyle spoke a non-Indo-European language. In fact, there is some linguistic evidence that can support this claim. It is conceivable that both the Germanic and Finnish word for “seal” were ultimately borrowed from a language spoken in a Pitted Ware context. Once more, the linguistic evidence turns out to offer important information complementary to that of archaeology and archaeo-genetics.

prehistoric-seal-hunters
Stone Age Seal Hunters, by Måns Sjöberg.

Apparently, the idea of non-IE substrate languages in contact with Germanic in Scandinavia is fashionable for the Copenhagen group, probably due to their particular interpretation of the recent genetic papers, hence the multiple Germanic-Fennic connections to be reviewed through this new prism. While the ulterior motive of this proposal may be to try and connect yet again Germanic with CWC Denmark, I would argue that the effect is actually the opposite.

An early borrowing via Uralic

The word has always been considered a more likely loan from one language to the other, and – because of the quite popular idea of Uralic native to Fennoscandia – it was often seen as a likely borrowing of Germanic from Balto-Finnic. In any possible case, the borrowing in either direction must be quite early, for obvious reasons:

  • If the borrowing had been via late Palaeo-Germanic, the ending in *-xa– would have been reflected in Balto-Finnic, hence an early Palaeo-Germanic to Pre-Balto-Finnic stage would be necessary.
  • If the borrowing had been via late Balto-Finnic, the initial sibilant would be already aspirated, being adopted as *-x– in Palaeo-Germanic, while the ending in *-k– would have remained as such if it was adopted after Grimm’s law ceased to be active.
  • Similarly, a borrowing from a common, non-Indo-European & non-Uralic source would require that it happened during the early stages of both proto-languages to have undergone their respective phonetic changes, and both borrowings chronologically close to each other, to assume a similar vocalism and consonantism of the ultimate source.
wiik-indo-european-uralic-substrate
The idea of seal-hunting Uralic substrate of Pitted Ware is not new. Image modified from The Uralic and Finno-Ugric Phonetic Substratum, by Kalevi Wiik, Linguistica Uralica (1997).

Furthermore, regarding the most likely way of expansion of this loanword, due to the different vowels and sibilants present in Uralic but not in Indo-European:

  • A direct loan from Pre-Germanic **selkos – which shows a regular thematic declension – to Pre-Balto-Finnic *šülkeš doesn’t seem to be a reasonable assumption.
  • NOTE. A Germanic borrowing from alternative Gmc. genitive *silxis could only work in a Pre-Germanic to Pre-Balto-Finnic model, hence only if the Gmc. form can be reconstructed for an earlier stage. Even then, for the same reason stated above, the opposite could be more reasonably argued, i.e. that this form is the original one adopted in Germanic: Pre-PBF *šülkeš > Pre-Gmc. *silkis, reinterpreted as an -o- stem in its declension.

  • If we reconstruct an older Pre-Finno-Samic (i.e. with Finno-Permic-like vocalism) **šëlkëš, a borrowing into Pre-Germanic **selkos would work. Even though no Saami derivative exists to confirm such a possibility, this would be supported by the known common evolution of Finno-Samic dialects in close contact with Pre-Germanic.
  • Admittedly, even accepting the existence of a Finno-Samic stem, a potential substrate word could not be discarded. In fact, while **šëlkë- could perfectly be a Uralic root, the ending in *-š can’t be easily interpreted. Therefore, a third, non-Indo-European & non-Uralic source is a plausible explanation.

NOTE. Arguably, Proto-Finno-Samic could have adopted Gmc. *kh or *x exceptionally as PFS *k. However, early Palaeo-Germanic borrowings in Finno-Samic show a consistent regular consonant change as described above. For more on this, see Finno-Samic borrowings.

This likely Uralic first nature of the loanword is important for the discussion below.

Pitted Ware culture

pitted-ware-pyheensilta-ware-culture
Middle Neolithic A period. Distribution of Pyheensilta Ware, Funnel Beaker Culture in Sweden, and Pitted Ware Culture in northern Europe during the Middle Neolithic A period, c. 3300–2800 cal BC. Find locations with numbers demarcate sites where cereal grains have been found and later AMS radiocarbon dated. Figure was created by SV using QGIS 3.4. (https://www.qgis.org/) and Natural Earth data (https://www.naturalearthdata.com/). Image from Vanhanen et al. (2019).

About the Pitted Ware culture, this is what the recent paper by Vanhanen et al. (2019), from the University of Finland (including Volker Heyd) had to say:

The origins of the PWC are controversial. In one likely scenario, Comb Ceramic and Mesolithic hunter-gatherers first interacted with FBC during the last centuries of the EN and became specialized maritime hunter-gatherers. The PWC pushed south and westwards during the Middle Neolithic (MN), c. 3300–2300 BC, along the northern Baltic shoreline and adjacent islands, eventually reaching as far west as Denmark and southern Norway. Around 2800 BC, after the FBC ceased to exist, the Corded Ware Culture (CWC) migrated into the PWC area. The end date for the PWC and CWC is approximately 2300 BC, when the material culture was replaced by the Late Neolithic (LN) culture<. Spanning nearly a millennium virtually unchanged, the PWC maintained a coherent society and a successful economic model. PWC people lived in marine-oriented settlements, commonly dwelled in huts and produced relatively large amounts of ceramic vessels. This speaks to the partly sedentary nature of their habitation, at least for their base camps. These specialist hunter-gatherers obtained the great majority of their subsistence from maritime sources, such as seal, fish, and sea birds. Considering the amount of bones, sealing was of paramount importance, causing these peoples to be labelled ‘hard-core sealers’ or even the ‘Inuit of the Baltic’.

The Middle Neolithic Pitted Ware culture is dated ca. 3500–2300 BC, so we would be seeing here Pre-Germanic and Pre-Balto-Finnic peoples arriving near the Pitted Ware culture. That would leave us with one of both languages expanding with Corded Ware peoples, and the other with Bell Beakers. Since Battle Axe-derived cultures around the Gulf of Finland are associated with Balto-Finnic groups, and Bell Beakers arriving ca. 2400 started the Dagger Period, commonly associated with the Pre-Germanic community, I think the connection of each group with their language is self-evident.

pitted-ware-cored-ware-culture
Middle Neolithic B period. Distribution of Corded Ware Culture and Pitted Ware Culture in northern Europe during the Middle Neolithic B period, c. 2800–2300 cal BC. Find locations with numbers demarcate sites where cereal grains have been found and later AMS radiocarbon dated. Figure was created by SV using QGIS 3.4. (https://www.qgis.org/) and Natural Earth data (https://www.naturalearthdata.com/). Modified from Vanhanen et al. (2019).

NOTE. You can read some interesting information about prehistoric and recent seal hunting in the Baltic in the blog post “Själen” – Seal Hunting in the Northern Baltic Sea.

Germanic-Fennic phonetic evolution

The common Germanic – Balto-Finnic phonetic evolution, especially Verner’s law in Palaeo-Germanic and qualitative gradation in Proto-Balto-Finnic, has been variably interpreted as:

  • Uralic in Scandinavia influenced by Germanic (Verner’s law source of the gradation), by Koivulehto and Vennemann (1996).
  • Germanic over a Uralic substratum in Scandinavia, by Wiik (1997).
  • Both Germanic and Balto-Finnic influenced by a third language, an “extinct non-Uralic source” spoken in Fennoscandia before the arrival of Uralic and Indo-European, by Kallio (2001); maybe the same substrate proposed to have influenced the accent shift in Germanic similar to Uralic.
  • Balto-Finnic speakers adopting Pre-Germanic in Scandinavia, in contact with Balto-Finnic speakers retaining their language, by Schrijver in Language Contact and the Origins of the Germanic Languages (2014)– although first suggested by him in the 1990s.

NOTE. There are other (some much older) proposals of a Uralic substrate in Scandinavia, but I think those above summarize the most common positions tenable today.

If you add all linguistic, archaeological, and now genetic connections, it is really strange to keep arguing for so many surprisingly fitting common substrates and/or contact languages for both. Especially because the Pre-Germanic community – if originally from southern Scandinavia and not further south (see e.g. Kortlandt’s theory) – was marked by the Dagger Period, as accepted by most archaeologists (including Kristiansen), and we know that Bell Beakers – who triggered the Dagger period – might have arrived a little late to the Pitted Ware disintegration in most seal-hunting areas of southern Scandinavia.

bell-beaker-density
Density analysis based (Bell Beaker per km2) on the distribution of Bell Beaker per region (ca. 2700-2200 BC). Combination of different levels of b-spline interpolation. Exaltation of the values through square root usage. Modified from Michael Bilger (2018).

In other words, how many common substrate languages can we propose for Germanic (and Balto-Finnic)? Just from Kroonen we have already the Semitic-like TRB, and the seal-hunting Pitted Ware culture. Apparently, the culprit of the common phonetic evolution must be some (other?) culture that both Pre-Germanic and Pre-Balto-Finnic assimilated (or with which both were in contact) in Fennoscandia.

NOTE. I believe no data supports the attribution of those Germanic borrowings to the TRB culture, especially if one assumes they belong to an Afroasiatic branch, as did Kroonen. His initial assumption about an expansion of R1b-M269 associated with the Neolithic from Anatolia, and thus with Afroasiatic, must today be rejected. Much more likely is the incorporation of most of these loanwords during the expansion of North-West Indo-Europeans from Yamna Hungary.

How many “common” substrates from different regions and cultures is too much? Arguably, it’s not a question of quantity (because the overall probability remains the same), but a question of quality of arguments.

In my opinion, both a) the marked seal-hunting subsistence economy of the Pitted Ware culture and b) the difficult reconstruction of a fitting ‘natural’ PIE or PU stem warrant this proposal of a third source, just like the European agricultural substrate of North-West Indo-European and Palaeo-Balkan languages, as well as the Asian agricultural substrate of Indo-Iranian are the most logical interpretation of words not found in other IE dialects. The only problem in this case is the lack of other Scandinavian substrate words to compare its typology against.

scandinavia-neolithic-flint-daggers
Close contacts in Fennoscandia. The distribution of Scandinavian flint daggers (A) in the east and south Baltic region and possible trends of “down the line” trade (B). Good size and quality flint zone in the south-west Baltic region is hatched (C). According to: Wojciechowski 1976; Olausson 1983, fig. 1; Madsen 1993, 126; Libera 2001; Kriiska & Tvauri 2002, 86. Image modified from Piličiauskas (2010).

Common Scandinavian substratum

The theory of a Pitted Ware borrowing is therefore quite convincing from a cultural point of view, at the same time as it fits the linguistic data. However, one reason why I dislike the interpretation of a dual origin is that our knowledge of Uralic languages is fairly limited, whereas that of Indo-European branches and hence Proto-Indo-European is huge. To put it otherwise: if a common word appears in both, and it is most likely (culturally and linguistically) not Indo-European, it certainly means that it was borrowed in Germanic. What are the a priori chances of it coming directly from a third substrate language for both dialects, instead of coming directly from Pre-Balto-Finnic?

From Schrijver (2014):

What did happen, apparently, is that Finnic speakers had enough access to the way in which Germanic speakers pronounced Balto-Finnic in order to model their own pronunciation of Balto-Finnic on it. In other words, Balto-Finns conversed with bilingual speakers of Germanic and Balto-Finnic whose pronunciation of both was essentially Germanic. But access to the Germanic language itself was not sufficient to allow Balto-Finns to become bilingual themselves, either because social segregation prevented this or because contact with Germanic was severed before widespread bilingualism set in. This limited access to Germanic would allow us to understand why Balto-Finnic did not go the way of the vernacular languages that came in contact with Latin in the Roman Empire, where access to Latin was open to almost everybody and massive language shift in favour of Latin ensued.

NOTE. For a more detailed discussion, you can read the whole chapter dedicated to this question. I summarized it in Pre-Germanic born out of a Proto-Finnic substrate in Scandinavia.

On the other hand, about the ad hoc interpretation by Kallio (2001) of hypothetic third languages strongly influencing in the same way both the Palaeo-Germanic- and Balto-Finnic-speaking communities, Schrijver (2014) comments:

The idea that perhaps both languages moved towards a lost third language, whose speakers may have been assimilated to both Balto-Finnic and Germanic, provides a fuller explanation but suffers from the drawback that it shifts the full burden of the explanation to a mysterious ‘language X’ that is called upon only in order to explain the developments in Proto-Germanic and Balto-Finnic. That comes dangerously close to circular reasoning.

early-bronze-age-nordic-dagger-period
Early Bronze Age cultures of Northern Europe (roughly ca. 2200-1750). Dagger period representing the expansion of BBC-derived groups from southern Scandinavia.

NOTE. The proposal of some kind of “SHG/EHG-based Fennoscandian substrate” seems funny to me, for two reasons: firstly, there is usually no talk about which culture spread that common language, how it survived, how it was in contact with both groups and until when, etc. (see below for possibilities); secondly, apparently the evident survival of West European EEF communities driven by at least two cultural groups – El Argar and the poorly known groups from the Atlantic façade north of the Pyrenees – is, for the same people proposing this simplistic SHG/EHG idea, somehow not fitting for the prehistory of Proto-Iberian and Proto-Aquitanian, respectively…

The same argument that one could use against the direct borrowing of both dialects from Pitted Ware, but much more strongly, can be thus wielded against a common, centuries-long phonetic evolution of both Balto-Finnic and Germanic caused by close interactions with (and/or substrate influence of) some third language. Which unitary culture and when exactly could that have happened around the Baltic Sea?

  • Was it Pitted Ware the mysterious substrate language? Seems rather unlikely, due to the early demise of the Pitted Ware culture in contrast to the long-lasting common influence seen in both dialects.
  • Was it Pitted Ware in southern Scandinavia, but Comb Ware in the Gulf of Finland? Is there a direct genetic connection between both cultures? And how likely is a common phonology of an ancestral Comb Ware-like substrate language surviving separately in Finland and Sweden? Even accepting these assumptions, we would be stuck again in the Indo-European Beakers vs. Uralic Battle Axe model.
  • Was it a succession of cultures, from some Scandinavian culture that was replaced by some incoming ethnolinguistic group, then influencing the other? This non-IE, non-Uralic substrate would then need to be proposed, given the chronological and archaeological constraints, as an effect of Pitted Ware over Pre-Finno-Baltic spoken by Battle Axe peoples in Scandinavia, then replaced by Pre-Germanic peoples arriving later with Bell Beakers. A reverse direction and later chronology (say, Germanic replaced by Balto-Finnic from Netted Ware arriving from the Volga) wouldn’t work as well.
  • Was it Asbestos Ware as a late Comb Ware group influencing both? How likely is such a continued influence in Southern Scandinavia and the Gulf of Finland? Even if we accepted this influence that miraculously didn’t affect Samic (most likely located between the Balto-Finnic-speaking Gulf of Finland and northern Fennoscandian Asbestos Ware groups), it would necessarily mean that Germanic and Balto-Finnic were spoken neighbouring exactly the same Asbestos Ware groups in Scandinavia. That is, essentially, that the BBC-derived Dagger Period represented Pre-Germanic, while Battle Axe-derived groups around the Gulf of Finland were Balto-Finnic.

Mixing linguistics with archaeology (now complemented with genetics) also risks circular reasoning. But, how else can someone propose a third substrate language for a phonetic change, necessarily represented by Fennoscandian groups potentially separated by thousands of years? In this age of population genomics we can’t simply talk about theoretical models anymore: we must refer to Fennoscandian cultures and populations in a very specific time frame, as Kronen & Iversen do in their proposal. Not only is such a third unknown language usually a weak explanation for a common development of two unrelated languages; in this case it finds no support whatsoever.

Seals and the Arctic

Another interesting aspect about this Fennic-Germanic comparandum is its relevance to the Uralic homeland problem.

uralic-languages-modern
Current distribution of Uralic languages. Nenets and Saami are among the best positioned to retain the ‘original’ Uralic seal-hunting vocabulary.

Since the publication of Mittnik et al. (2018), Lamnidis et al. (2018), and Sikora et al. (2018), the new normal is apparently to consider Corded Ware Finland as Germanic-speaking, the Gulf of Finland as Balto-Slavic-speaking, while the Kola peninsula and whichever Palaeo-Arctic peoples preceded Nganasans and Nenets as ancient Uralians. Uh-huh, OK.

But, if prehistoric Arctic peoples practiced specialized seal-hunting economies, and Uralians were one among such populations – supposedly one widespread from the Barents Sea to the Lapteve Sea…how come no common Uralic word for ‘seal’ exists? In other words, why would these True™ Uralic peoples expanding from the Arctic need to borrow a word for ‘seal’ from neighbouring populations in every single seal-hunting region they are attested?

grey-seal-distribution
Historical distribution of grey seals, an important part of the diet around the Baltic Sea. Image modified from Wikimedia to include Skagerrak and Kattegat regions.

About Saami, which some have recklessly proposed to be derived from Bronze Age N1c-L392 samples from the Kola Peninsula (against the good judgment of the authors of the paper), this is what we know from their word for ‘seal’, from Grünthal (2004):

Ter Saami vīrre ‘seal; wolf’ displays two meanings that refer to clearly different animals. Neither of them is borrowed from the source language because the word descends from Russian zver’ ‘animal’ (T.I.Itkonen 1958: 756). Another word, Skolt Saami näúdd ‘seal, wolf’, has been similarly used in the two meanings. The evidence of North Saami návdi ‘wolf; creature, fur animal; beast’ (Sammallahti 1989: 305; Lagercrantz (1939: 518) presents the alternative meanings in the opposite order; E. Itkonen (1969: 148) lists the meanings ‘wildes Tier; Raubtier (bes. Wolf); Pelztier’) suggesting that ‘wolf’ is the primary sense and ‘seal’ is a metaphorical extension of it. More precisely, it is an example of a mythic metaphor (cf. Siikala 1992). According to the old folk belief, seal was a wolf and the Skolt Saamis preferred not to eat its meat (T.I.Itkonen 1958: 906). Before that the metonymic meaning ‘wolf’ rose from the less specified meanings, and originally návdi is a Scandinavian or Finnic loan word in Saamic, cf. Old Norse naut ‘vieh, rind’, Icelandic and Norwegian naut, Swedish nöt < Germanic *nauta ‘property’ (Hellquist 1980: 721, T.I.Itkonen 1958: 275, Lagercrantz 1939: 518, de Vries 1961: 406; E. Itkonen (1969: 148) considers Finnic, cf. Finnish nauta ‘bovine’ (< Germanic) as a possible alternative source for the Saamic word).

NOTE. Possibly comparable, for the mythic metaphor proper of Scandinavian folk belief, are Germanic derivatives built as ‘seal-hound’ and/or ‘sea-hound’.

sea-distribution-arctic
Seals formed a great part of the diet for Palaeo-Arctic populations. Boundaries of regions used to predict sea ice, superimposed over the distributions of the five ringed seal subspecies. Image modified from Kelly et al. (2010).

About Nenets (quite close to the Naganasans of pure “Siberian ancestry”), here is what Edward Vajda, an expert in Palaeo-Siberian languages, has to say:

Nenets techniques for hunting the animals of the Arctic Ocean seem to have been borrowed from the first Arctic aborigines. Thus, the Nenets word for seal is nyak, the Eskimo word is nesak. Also, the Nenets word for a one-piece Arctic clothing is lu; the Korak word on the Kamchatka peninsula for clothing is l’ku. All of these groups may have borrowed the words from some original circumpolar aborigines. More probably, the first settlers of Arctic Europe were cousins of the present-day Eskimo, Chukchi and other residents of the far northeast region of Asia. Nenets folklore also speaks of the aborigines living in ice dugouts (igloos).

On the other hand, Proto-Uralic shows a Chalcolithic steppe-like culture, with common words for metal and metalworking, for agriculture, and for domesticated animals, most likely including cattle. They were close to Indo-Europeans since at least before the Tocharian split, and probably earlier than that (even if one does not accept the Indo-Uralic phylum). And there were clearly strong contacts of Finno-Ugric with Indo-Iranian, and especially of Finno-Samic with Germanic.

uralic-cline
Uralic clines from Corded Ware groups to the east. A clear reason for the lack of common seal-hunting vocabulary. Modified from Tambets et al. (2018). Principal component analysis (PCA) and genetic distances of Uralic-speaking populations. a PCA (PC1 vs PC2) of the Uralic-speaking populations. You can see another PCA including ancient samples.

Some among my readers may now be thinking about these totally believable proposals of prehistoric cultures around Lake Baikal representing the True™ Uralic homeland; because haplogroup N1c, and because some 0.5% more “Devil’s Gate Cave ancestry” in Estonians than in Lithuanians; despite the fact that 1) the so-called “Siberian ancestry” formed an ancestral cline with EHG in North Eurasia, that 2) N1c-L392 lineages seem to appear among many Asian peoples of different languages, and that 3) recent prehistoric N1c-L392 lines expanded clearly with Micro-Altaic languages.

Like, who would have hunted seals in Lake Baikal, right? The problem is, seals represented one of their main game, essential for their subsistence economy. From Novokonova et al. (2015):

One of the key reasons for the density of human settlement in the Baikal region compared to adjacent areas of Siberia is that the lake and its nearby rivers offer an abundance of aquatic food resources, including several endemic species, with perhaps the most well known being the Baikal seal. This freshwater seal is only found in Lake Baikal and portions of its tributaries. It shares lifecycle and behavioral patterns with other small northern ice-adapted seals, and is genetically and morphologically most closely related to the ringed seal (Pusa hispida). The nerpa can grow up to 1.8 m long and weigh as much as 130 kg, with the males tending to be slightly larger than the females.

Zooarchaeological analyses of the 16,000 Baikal seal remains from this well-dated site clearly show that sealing began here at least 9000 calendar years ago. The use of these animals at Sagan-Zaba appears to have peaked in the Middle Holocene, when foragers used the site as a spring hunting and processing location for yearling and juvenile seals taken on the lake ice. After 4800 years ago, seal use declined at the site, while the relative importance of ungulate hunting and fishing increased. Pastoralists began occupying Sagan-Zaba at some point during the Late Holocene, and these groups too utilized the lake’s seals. Domesticated animals are increasingly common after about 2000 years ago, a pattern seen elsewhere in the region, but spring and some summer hunting of seals was still occurring. This use of seals by prehistoric herders mirrors patterns of seal use among the region’s historic and modern groups.

Bronze Age movements in Fennoscandia

Regarding the shrinkage and expansion of different farming economic strategies in Scandinavia since the Neolithic, with potential relevance for population movements and thus ethnolinguistic change – either from Balto-Finnic peoples migrating back from eastern Sweden, or Germanic peoples moving to eastern Finland – from Vanhanen et al. (2019):

Cultivated plants at CWC sites in Finland were not discovered in the current investigation (Supplementary Results) or earlier studies. In Finland, the keeping of domestic animals is indicated by the evidence of dairy lipids and mineralized goat hairs. Charred remains and impressions of cultivated plants have been discovered at CWC sites in Estonia and east-central Sweden (Fig. 3: 12). In the eastern Baltic region, the earliest bones of domestic animals and a shift in subsistence occurred with the CWC. Whether CWC produced the cereals and other agricultural products found at PWC sites is difficult to estimate because only small amounts of plant remains have ever been discovered at CWC sites. The CWC seemingly reached east-central Sweden from regions further to the east, where there is evidence of animal husbandry, but only very few signs of plant cultivation.

For the Late Neolithic (LN), cereal grains have been found north of Mälaren and along the Norrland coast. In mainland Finland, the first cereal grains occur during the LN or Bronze Age, c. 1900–1250 cal BC. The earliest bones of sheep/goat from mainland Finland are earlier, dating back to 2200–1950 cal BC. Finds of Scandinavian bronze artefacts indicate an influx from east-central Sweden, which might well be a source area for these agricultural innovations. A similar development is found in the eastern Baltic region, where the earliest directly radiocarbon-dated cereals originate from the Bronze Age, 1392–1123 cal BC (2 sigma). Thus, agriculture was evident during the Bronze Age in the eastern Baltic, but at least animal keeping and probably crop cultivation were present earlier during the CWC phase.

It has been known for a while already that the only options left for the expansion of Finno-Saami into Fennoscandia are either Battle Axe (continued in Textile Ceramics) or Netted Ware (as proposed e.g. by Parpola), based, among other data, on language contacts, language estimates, cultural evolution, and population genomics. Data like this one on seal-hunting vocabulary also support the most likely option, which entails the identification of Corded Ware as the vector of expansion of Uralic languages.

NOTE. Also interesting in this regard is the lack of Slavic words for ‘seal’ – borrowed, in Russian from Samic, and in other Slavic dialects from Russian, Latin, or other languages -, and the coinage of a new term in East Baltic. Rather odd for an “autochthonous” Proto-Baltic (supposedly in contact with Pitted Ware, Germanic, and Balto-Finnic, then), and for a Proto-Slavic stemming from the Baltic. Quite appropriate, though, for a Proto-East Baltic arriving in the Baltic with Trzciniec and for a Proto-Slavic community evolving further south.

So, what new episode in this renewed 2000s R1b/R1a/N1c soap opera is it going to be, when eastern Fennoscandia shows Corded Ware-derived peoples of “steppe ancestry” (and mainly R1a-Z645 lineages) continue during the Bronze Age? Will the resurge and/or infiltration of I2 – maybe even N1c – lineages among Corded Ware-derived cultures of north-eastern Europe support or challenge this model, and why? Make your bet below.

Related

N1c-L392 associated with expanding Turkic lineages in Siberia

haplogroup-n1c-tat

Second in popularity for the expansion of haplogroup N1a-L392 (ca. 4400 BC) is, apparently, the association with Turkic, and by extension with Micro-Altaic, after the Uralic link preferred in Europe; at least among certain eastern researchers.

New paper in a recently created journal, by the same main author of the group proposing that Scythians of hg. N1c were Turkic speakers: On the origins of the Sakhas’ paternal lineages: Reconciliation of population genetic / ancient DNA data, archaeological findings and historical narratives, by Tikhonov, Gurkan, Demirdov, and Beyoglu, Siberian Research (2019).

Interesting excerpts:

According to the views of a number of authoritative researchers, the Yakut ethnos was formed in the territory of Yakutia as a result of the mixing of people from the south and the autochthonous population [34].

These three major Sakha paternal lineages may have also arrived in Yakutia at different times and/ or from different places and/or with a difference in several generations instead, or perhaps Y-chromosomal STR mutations may have taken place in situ in Yakutia. Nevertheless, the immediate common ancestor(s) from the Asian Steppe of these three most prevalent Sakha Y-chromosomal STR haplotypes possibly lived during the prominence of the Turkic Khaganates, hence the near-perfect matches observed across a wide range of Eurasian geography, including as far as from Cyprus in the West to Liaoning, China in the East, then Middle Lena in the North and Afghanistan in the South (Table 3 and Figure 5). There may also be haplotypes closely-related to ‘the dominant Elley line’ among Karakalpaks, Uzbeks and Tajiks, however, limitations in the loci coverage for the available dataset (only eight Y-chromosomal STR loci) precludes further conclusions on this matter [25].

yakutia-haplogroup-n1c
17-loci median-joining network analysis of the original/dominant Elley, Unknown and Omogoy Y-chromosomal STR haplotypes with the YHRD matches from outside Yakutia populations.

According to the results presented here, very similar Y-STR haplotypes to that of the original Elley line were found in the west: Afghanistan and northern Cyprus, and in the east: Liaoning Province, China and Ulaanbaator, Northern Mongolia. In the case of the dominant Omogoy line, very closely matching haplotypes differing by a single mutational step were found in the city of Chifen of the Jirin Province, China. The widest range of similar haplotypes was found for the Yakut haplotype Unknown: In Mongolia, China and South Korea. For instance, haplotypes differing by a single step mutation were found in Northern Mongolia (Khalk, Darhad, Uryankhai populations), Ulaanbaator (Khalk) and in the province of Jirin, China (Han population).

n1c-uralic-altaic-siberia
14-loci median-joining network analysis for the original/dominant Elley (Ell), Unknown Clan
(Vil), Omogoy (Omo), Eurasian (Eur) and Xiongnu (Xuo) Y-chromosomal STR haplotypes and that for a representative ancient DNA sample (Ch0 or DSQ04) from the Upper Xiajiadian Culture
recovered from the Inner Mongolia Autonomous Region, China.

Notably, Tat-C-bearing Y-chromosomes were also observed in ancient DNA samples from the 2700-3000 years-old Upper Xiajiadian culture in Inner Mongolia, as well as those from the Serteya II site at the Upper Dvina region in Russia and the ‘Devichyi gory’ culture of long barrow burials at the Nevel’sky district of Pskovsky region in Russia. A 14-loci Y-chromosomal STR median-joining network of the most prevalent Sakha haplotypes and a Tat-C-bearing haplotype from one of the ancient DNA samples recovered from the Upper Xiajiadian culture in Inner Mongolia (DSQ04) revealed that the contemporary Sakha haplotype ‘Xuo’ (Table 2, Haplotype ID “Xuo”) classified as that of ‘the Xiongnu clan’ in our current study, was the closest to the ancient Xiongnu haplotype (Figure 6). TMRCA estimate for this 14-loci Y-chromosomal STR network was 4357 ± 1038 years or 2341 ± 1038 BCE, which correlated well with the Upper Xiajiadian culture that was dated to the Late Bronze Age (700-1000 BCE).

eurasian-n-subclades
Geographical location of ancient samples belonging to major clade N of the Y-chromosome.

NOTE. Also interesting from the paper seems to be the proportion of E1b1b among admixed Russian populations, in a proportion similar to R1a or I2a(xI2a1).

It is tempting to associate the prevalent presence of N1c-L392 in ancient Siberian populations with the expansion of Altaic, by simplistically linking the findings (in chronological order) near Lake Baikal (Damgaard et al. 2018), Upper Xiajiadian (Cui et al. 2013), among Khövsgöl (Jeong et al. 2018), in Huns (Damgaard et al. 2018), and in Mongolic-speaking Avars (Csáky et al. 2019).

However, its finding among Palaeo-Laplandic peoples in the Kola peninsula ca. 1500 BC (Lamnidis et al. 2018) and among Palaeo-Siberian populations near the Yana River (Sikora et al. 2018) ca. AD 1200 should be enough to accept the hypothesis of ancestral waves of expansion of the haplogroup over northern Eurasia, with acculturation and further expansions in the different regions since the Iron Age (see more on its potential expansion waves).

Also, a simple look at the TMRCA and modern distribution was enough to hypothesize long ago the lack of connection of N1c-L392 with Altaic or Uralic peoples. From Ilumäe et al. (2016):

Previous research has shown that Y chromosomes of the Turkic-speaking Yakuts (Sakha) belong overwhelmingly to hg N3 (formerly N1c1). We found that nearly all of the more than 150 genotyped Yakut N3 Y chromosomes belong to the N3a2-M2118 clade, just as in the Turkic-speaking Dolgans and the linguistically distant Tungusic-speaking Evenks and Evens living in Yakutia (Table S2). Hence, the N3a2 patrilineage is a prime example of a male population of broad central Siberian ancestry that is not intrinsic to any linguistically defined group of people. Moreover, the deepest branch of hg N3a2 is represented by a Lebanese and a Chinese sample. This finding agrees with the sequence data from Hallast et al., where one Turkish Y chromosome was also assigned to the same sub-clade. Interestingly, N3a2 was also found in one Bhutan individual who represents a separate sub-lineage in the clade. These findings show that although N3a2 reflects a recent strong founder effect primarily in central Siberia (Yakutia, Sakha), the sub-clade has a much wider distribution area with incidental occurrences in the Near East and South Asia.

haplogroup-n1a-M2118
Frequency-Distribution Maps of Individual Sub-clades of hg N3a2, by Ilumäe et al. (2016).

The most striking aspect of the phylogeography of hg N is the spread of the N3a3’6-CTS6967 lineages. Considering the three geographically most distant populations in our study—Chukchi, Buryats, and Lithuanians—it is remarkable to find that about half of the Y chromosome pool of each consists of hg N3 and that they share the same sub-clade N3a3’6. The fractionation of N3a3’6 into the four sub-clades that cover such an extraordinarily wide area occurred in the mid-Holocene, about 5.0 kya (95% CI = 4.4–5.7 kya). It is hard to pinpoint the precise region where the split of these lineages occurred. It could have happened somewhere in the middle of their geographic spread around the Urals or further east in West Siberia, where current regional diversity of hg N sub-lineages is the highest (Figure 1B). Yet, it is evident that the spread of the newly arisen sub-clades of N3a3’6 in opposing directions happened very quickly. Today, it unites the East Baltic, East Fennoscandia, Buryatia, Mongolia, and Chukotka-Kamchatka (Beringian) Eurasian regions, which are separated from each other by approximately 5,000–6,700 km by air. N3a3’6 has high frequencies in the patrilineal pools of populations belonging to the Altaic, Uralic, several Indo-European, and Chukotko-Kamchatkan language families. There is no generally agreed, time-resolved linguistic tree that unites these linguistic phyla. Yet, their split is almost certainly at least several millennia older than the rather recent expansion signal of the N3a3’6 sub-clade, suggesting that its spread had little to do with linguistic affinities of men carrying the N3a3’6 lineages.

haplogroup_n3a3
Frequency-Distribution Maps of Individual Subclade N3a3 / N1a1a1a1a1a-CTS2929/VL29.

It was thus clear long ago that N1c-L392 lineages must have expanded explosively in the 5th millennium through Northern Eurasia, probably from a region to the north of Lake Baikal, and that this expansion – and succeeding ones through Northern Eurasia – may not be associated to any known language group until well into the common era.

Related

The cradle of Russians, an obvious Finno-Volgaic genetic hotspot

pskov-novgorod-russia

First look of an accepted manuscript (behind paywall), Genome-wide sequence analyses of ethnic populations across Russia, by Zhernakova et al. Genomics (2019).

Interesting excerpts:

There remain ongoing discussions about the origins of the ethnic Russian population. The ancestors of ethnic Russians were among the Slavic tribes that separated from the early Indo-European Group, which included ancestors of modern Slavic, Germanic and Baltic speakers, who appeared in the northeastern part of Europe ca. 1,500 years ago. Slavs were found in the central part of Eastern Europe, where they came in direct contact with (and likely assimilation of) the populations speaking Uralic (Volga-Finnish and Baltic- Finnish), and also Baltic languages [11–13]. In the following centuries, Slavs interacted with the Iranian-Persian, Turkic and Scandinavian peoples, all of which in succession may have contributed to the current pattern of genome diversity across the different parts of Russia. At the end of the Middle Ages and in the early modern period, there occurred a division of the East Slavic unity into Russians, Ukrainians and Belarusians. It was the Russians who drove the colonization movement to the East, although other Slavic, Turkic and Finnish peoples took part in this movement, as the eastward migrations brought them to the Ural Mountains and further into Siberia, the Far East, and Alaska. During that interval, the Russians encountered the Finns, Ugrians, and Samoyeds speakers in the Urals, but also the Turkic, Mongolian and Tungus speakers of Siberia. Finally, in the great expanse between the Altai Mountains on the border with Mongolia, and the Bering Strait, they encountered paleo-Asiatic groups that may be genetically closest to the ancestors of the Native Americans. Today’s complex patchwork of human diversity in Russia has continued to be augmented by modern migrations from the Caucasus, and from Central Asia, as modern economic migrations take shape.

pskov-novgorod-pca-eurasia-yakut
Sample relatedness based on genotype data. Eurasia: Principal Component plot of 574 modern Russian genomes. Colors reflect geographical regions of collection; shapes reflect the sample source. Red circles show the location of Genome Russia samples.

In the current study, we annotated whole genome sequences of individuals currently living on the territory of Russia and identifying themselves as ethnic Russian or as members of a named ethnic minority (Fig. 1). We analyzed genetic variation in three modern populations of Russia (ethnic Russians from Pskov and Novgorod regions and ethnic Yakut from the Sakha Republic), and compared them to the recently released genome sequences collected from 52 indigenous Russian populations. The incidence of function-altering mutations was explored by identifying known variants and novel variants and their allele frequencies relative to variation in adjacent European, East Asian and South Asian populations. Genomic variation was further used to estimate genetic distance and relationships, historic gene flow and barriers to gene flow, the extent of population admixture, historic population contractions, and linkage disequilibrium patterns. Lastly, we present demographic models estimating historic founder events within Russia, and a preliminary HapMap of ethnic Russians from the European part of Russia and Yakuts from eastern Siberia.

pskov-novgorod-pca-finno-permic
Sample relatedness based on genotype data. Western Russia and neighboring countries: Principal Component plot of 574 modern Russian genomes. Colors reflect geographical regions of collection; shapes reflect the sample source. Red circles show the location of Genome Russia samples.

The collection of identified SNPs was used to inspect quantitative distinctions among 264 individuals from across Eurasia (Fig. 1) using Principal Component Analysis (PCA) (Fig. 2). The first and the second eigenvectors of the PCA plot are associated with longitude and latitude, respectively, of the sample locations and accurately separate Eurasian populations according to geographic origin. East European samples cluster near Pskov and Novgorod samples, which fall between northern Russians, Finno-Ugric peoples (Karelian, Finns, Veps etc.), and other Northeastern European peoples (Swedes, Central Russians, Estonian, Latvians, Lithuanians, and Ukrainians) (Fig. 2b). Yakut individuals map into the Siberian sample cluster as expected (Fig. 2a). To obtain an extended view of population relationships, we performed a maximum likelihood-based estimation of ancestry and population structure using ADMIXTURE [46](Fig. 2c). The Novgorod and Pskov populations show similar profiles with their Northeastern European ancestors while the Yakut ethnic group showed mixed ancestry similar to the Buryat and Mongolian groups.

pskov-novgorod-yakut-admixture
Population structure across samples in 178 populations from five major geographic regions (k=5). Samples are pooled across three different studies that covered the territory of Russian Federation (Mallick et al. 2016 [36], Pagani et al. 2016 [37], this study). The optimal k-value was selected by value of cross validation error. Russian samples from all studies (highlighted in bold dark blue) show a slight gradient from Eastern European (Ukrainian, Belorussian, Polish) to North European (Estonian Karelian, Finnish) structures, reflecting population history of northward expansion. Yakut samples from different studies (highlighted in bold red) also show a slight gradient from Mongolian to Siberian people (Evens), as expected from their original admixture and northward expansions. The samples originated from this study are highlighted, and plotted in separated boxes below.

Possible admixture sources of the Genome Russia populations were addressed more formally by calculating F3 statistics, which is an allele frequency-based measure, allowing to test if a target population can be modeled as a mixture of two source populations [48]. Results showed that Yakut individuals are best modeled as an admixture of Evens or Evenks with various European populations (Supplemental Table S4). Pskov and Novgorod showed admixture of European with Siberian or Finno-Ugric populations, with Lithuanian and Latvian populations being the dominant European sources for Pskov samples.

direction-expansion-russians
The heatmaps of gene flow barriers show for each point at the geographical map the interpolated differences in allele frequencies (AF) between the estimated AF at the point with AFs in the vicinity of this point. The direction of the maximal difference in allele frequencies is coded by colors and arrows.

So, Russians expanding in the Middle Ages as acculturaded Finno-Volgaic peoples.

Or maybe the true Germano-Slavonic™-speaking area was in north-eastern Europe, until the recent arrival of Finno-Permians with the totally believable Nganasan-Saami horde, whereas Yamna -> Bell Beaker represented Vasconic-Caucasian expanding all over Europe in the Bronze Age. Because steppe ancestry in Fennoscandia and Modern Basques in Iberia.

A really hard choice between equally plausible models.

Related

Magyar tribes brought R1a-Z645, I2a-L621, and N1a-L392(xB197) lineages to the Carpathian Basin

hungarian-conquerors-turks

The Nightmare Week of “N1c=Uralic” proponents (see here) continues, now with preprint Y-chromosome haplogroups from Hun, Avar and conquering Hungarian period nomadic people of the Carpathian Basin, by Neparaczki et al. bioRxiv (2019).

Abstract:

Hun, Avar and conquering Hungarian nomadic groups arrived into the Carpathian Basin from the Eurasian Steppes and significantly influenced its political and ethnical landscape. In order to shed light on the genetic affinity of above groups we have determined Y chromosomal haplogroups and autosomal loci, from 49 individuals, supposed to represent military leaders. Haplogroups from the Hun-age are consistent with Xiongnu ancestry of European Huns. Most of the Avar-age individuals carry east Eurasian Y haplogroups typical for modern north-eastern Siberian and Buryat populations and their autosomal loci indicate mostly unmixed Asian characteristics. In contrast the conquering Hungarians seem to be a recently assembled population incorporating pure European, Asian and admixed components. Their heterogeneous paternal and maternal lineages indicate similar phylogeographic origin of males and females, derived from Central-Inner Asian and European Pontic Steppe sources. Composition of conquering Hungarian paternal lineages is very similar to that of Baskhirs, supporting historical sources that report identity of the two groups.

Interesting excerpts (emphasis mine):

All N-Hg-s identified in the Avars and Conquerors belonged to N1a1a-M178. We have tested 7 subclades of M178; N1a1a2-B187, N1a1a1a2-B211, N1a1a1a1a3-B197, N1a1a1a1a4-M2118, N1a1a1a1a1a-VL29, N1a1a1a1a2-Z1936 and the N1a1a1a1a2a1c1-L1034 subbranch of Z1936. The European subclades VL29 and Z1936 could be excluded in most cases, while the rest of the subclades are prevalent in Siberia 23 from where this Hg dispersed in a counter-clockwise migratory route to Europe (…). All the 5 other Avar samples belonged to N1a1a1a1a3-B197, which is most prevalent in Chukchi, Buryats, Eskimos, Koryaks and appears among Tuvans and Mongols with lower frequency.

haplogroup-n-pca
First two components of PCA from Hg N1a subbranch distribution in 51 populations including Avars and Conquerors. Colors indicate geographic regions. Three letter codes are given in Supplementary Table S5.

By contrast two Conquerors belonged to N1a1a1a1a4-M2118, the Y lineage of nearly all Yakut males, being also frequent in Evenks, Evens and occurring with lower frequency among Khantys, Mansis and Kazakhs.

Three Conqueror samples belonged to Hg N1a1a1a1a2-Z1936 , the Finno-Permic N1a branch, being most frequent among northeastern European Saami, Finns, Karelians, as well as Komis, Volga Tatars and Bashkirs of the Volga-Ural region.Nevertheless this Hg is also present with lower frequency among Karanogays, Siberian Nenets, Khantys, Mansis, Dolgans, Nganasans, and Siberian Tatars.

The west Eurasian R1a1a1b1a2b-CTS1211 subclade of R1a is most frequent in Eastern Europe especially among Slavic people. This Hg was detected just in the Conqueror group (K2/18, K2/41 and K1/10). Though CTS1211 was not covered in K2/36 but it may also belong to this sub-branch of Z283.

Hg I2a1a2b-L621 was present in 5 Conqueror samples, and a 6th sample form Magyarhomorog (MH/9) most likely also belongs here, as MH/9 is a likely kin of MH/16 (see below). This Hg of European origin is most prominent in the Balkans and Eastern Europe, especially among Slavic speaking groups. It might have been a major lineage of the Cucuteni-Trypillian culture and it was present in the Baden culture of the Chalcolithic Carpathian Basin.

hungarian-conquerors-y-dna
Image modified from the paper, with drawn red square around lineages of likely Ugric origin, and squares around R1a-Z93, R1a-Z283, N1a-Z1936, and N1a-M2004 samples. Y-Hg-s determined from 46 males grouped according to sample age, cemetery and Hg. Hg designations are given according to ISOGG Tree 2019. Grey shading designate distinguished individuals with rich grave goods, color shadings denote geographic origin of Hg-s according to Fig. 1. For samples K3/1 and K3/3 the innermost Hg defining marker U106* was not covered, but had been determined previously.

We identified potential relatives within Conqueror cemeteries but not between them. The uniform paternal lineages of the small Karos3 (19 graves) and Magyarhomorog (17 graves) cemeteries approve patrilinear organization of these communities. The identical I2a1a2b Hg-s of Magyarhomorog individuals appears to be frequent among high-ranking Conquerors, as the most distinguished graves in the Karos2 and 3 cemeteries also belong to this lineage. The Karos2 and Karos3 leaders were brothers with identical mitogenomes 11 and Y-chromosomal STR profiles (Fóthi unpublished). The Sárrétudvari commoner cemetery seems distinct from the others, containing other sorts of European Hg-s. Available Y-chromosomal and mtDNA data from this cemetery suggest that common people of the 10th century rather represented resident population than newcomers. The great diversity of Y Hg-s, mtDNA Hg-s, phenotypes and predicted biogeographic classifications of the Conquerors indicate that they were relatively recently associated from very diverse populations.

Surprising about the Hungarian conquerors – although in line with the historical accounts – is the varied patrilineal origin of clans, including Q1a, G2a2b, I1, E1b1b, R1b, J1, or J2 – some of which (depending on specific lineages) may have appeared earlier in the Carpathian Basin or south-eastern Europe.

However, out of the 27 conqueror elite samples, 17 are of haplogroups most likely related to Ugric populations beyond the Urals: R1a-Z645, I2-L621, and two specific N1a-L392 lineages (see below). In fact, there are three high-ranking conqueror elites of hg. I2-L621 (one of them termed a “leader”, brother to an unpublished leader of Karos3, and all of them possibly family), one of hg. R1a-Z280, one of hg. R1a-Z93 (which should be added to the Árpáds), and one of hg. N1a-Z1936, which gives a good idea of the ruling class among the elite Ugric settlers.

NOTE. The Q1a sample is also likely to be found in the mixed population of the West Siberian forest-steppes, since it was found in Mesolithic-Neolithic samples from eastern Europe to Lake Baikal, and in Bronze Age Siberian groups, although admittedly it may have formed part of an Avar Transtisza group, or even earlier Hunnic or Scythian groups along the steppes. Without precise subclades it’s impossible to know.

arrival-of-hungarians-arpad
The seven chieftains of the Hungarians, detail of Arrival of the Hungarians, from Árpád Feszty’s and his assistants’ vast (1800 m2) cyclorama, painted to celebrate the 1000th anniversary of the Magyar conquest of Hungary, now displayed at the Ópusztaszer National Heritage Park in Hungary. Image from Wikipedia.

I2a-L621

I2a-L621 (xS17250) or I2a1b2 in the old nomenclature, is found in 6 early conquerors (including one leader), on a par with R1a and N samples. This haplogroup is found widely distributed in ancient samples, due to its early split (formed ca. 9200 BC, TMRCA ca. 4500 BC) and expansion, probably with Neolithic populations. I can’t seem to find samples of this early haplogroup from the Carpathian Basin, as mentioned in the text, although it wouldn’t be strange, because it appears also in Neolithic Iberia, and in modern populations from western Europe.

Nevertheless, I2a-L621 samples seem to be concentrated mainly in Mesolithic-Neolithic cultures of Fennoscandia, and appeared also in Sikora et al. (2017) in a sample of the High Middle Ages from Sunghir (ca. AD 1100-1200), probably from the Vladimir-Suzdalian Rus’, in a region where clearly tribes of Volga Finns were being assimilated at the time. The reported SNP call by Genetiker is A16681 (see Yfull), deep within I2a-CTS10228. It is possibly also behind a modern Saami from Chalmny Varre (ca. AD 1800) of hg. I2a in Lamnidis et al. (2018).

Lacking precise subclades from Hungarian conquerors this is pure speculation, but modern samples may also point to I2a-CTS10228 (formed ca. 3100 BC, TMRCA ca. 1800 BC) as a Finno-Ugric lineage in common with R1a, which must have expanded to the Urals and beyond with eastern Corded Ware groups or (more likely) succeeding cultures. This is in line with the association of certain I2a lineages with modern Uralic peoples or populations from their historical regions in eastern Europe, and linked thus to the most likely homeland of Uralians in the eastern European forests:

uralic-groups-haplogroup-r1a
Additional file 6: Table S5. Y chromosome haplogroup frequencies in Eurasia. Modified by me: in bold haplogroup N1c and R1a from Uralic-speaking populations, with those in red showing where R1a is the major haplogroup. Observe that all Uralic subgroups – Finno-Permic, Ugric, and Samoyedic – have some populations with a majority of R1a, and also of I lineages. Data from Tambets et al. (2018).

R1a-Z645

Regarding the important question of the ethnic makeup of Ugric populations stemming from the Urals, the most interesting (and expected) data is the presence of R1a-Z645 lineages among high-ranking conquerors, in particular four R1a-Z280 subclades proper of Finno-Ugrians.

This proves that, in line with the old split and expansion of R1a-CTS1211 (formed ca. 2600 BC, TMRCA ca. 2400 BC), and its finding in Bronze Age Fennoscandian samples, only some late R1a-Z280 (xZ92) lineages (see Z280 on YFull) may show a clear identification with early acculturated Uralic speakers, with the main early acculturated Balto-Slavic R1a haplogroup remaining R1a-M458.

I recently hypothesized this late connection of Slavs with very specific R1a-Z280 (xZ92) lineages based on analyses of modern populations (like Slovenians), because the connection of ancient Finno-Ugrians with modern Z92 samples was already evident:

(…) subclades of hg. R1a1a1b1a2-Z280 (xR1a1a1b1a2a-Z92) seem to have also been involved in early Slavic expansions, like R1a1a1b1a2b3a-CTS3402 (formed ca. 2200 BC, TMRCA ca. 2200 BC), found among modern West, South, and East Slavic populations and in Fennoscandia, prevalent e.g. among modern Slovenians which points to a northern origin of its expansion (Maisano Delser et al. 2018).

This finding also supports the expected shared R1a-Z280 lineages among ancient Finno-Ugric populations, as predicted from the study of modern Permic and Ugric peoples in Dudás et al. (2019).

r1a-z282-z280-z2125-distribution
Modified image, from Underhill et al. (2015). Spatial frequency distributions of Z282 (green) and Z93 (blue) affiliated haplogroups. Notice the distribution of R1a-Z280 (xZ92), i.e. R1a-M558, compared to the ancient Finno-Ugric distribution.

Furthermore, while we don’t have precise R1a-Z93 lineages to compare with the new Hunnic sample reported, we already know that some archaic R1a-Z2124 subclades stem from the forest-steppe areas of the Cis- and Trans-Urals, and the two newly reported R1a-Z93 Hungarian conqueror elites, like those of the Árpád dynasty, probably belong to them.

There is an obvious lack of continuity in specific paternal lineages among the Hunnic, the Avar, and the Conqueror periods, which makes any simplistic identification of all R1a-Z93 lineages as stemming from Avars, Huns, or the Iron Age Pontic-Caspian steppes clearly flawed. Comparing R1a-Z93 in Hungarian Conquerors with Huns is like comparing them with samples of the Srubna or earlier periods… Similarly, comparing the Hunnic R1b-U106 or the early Avar I1 to later Hungarian samples is not warranted without precise subclades, because they most likely correspond to different Germanic populations: Goths among Huns, then Longobards, then likely peoples descended from Franks and Irish Monks (the latter with R1b-P312).

N1a-L392

Second behind R1a subclades are, as expected, N1a-L392 (N1c in the old nomenclature).

Avars are dominated by a specific N1a-L392 subclade, N1a-B197, as we recently discovered in Csáky et al. (2019).

Hungarian conquerors show three N1a-Z1936 subclades, which is known to stem from the northern Ural region, including the Arctic (likely Palaeo-Laplandic peoples) and cross-stamped cultures of the northern Eurasian forests.

haplogroup_n3a4
Frequency-Distribution Maps of Individual Subclade N3a4 / N1a1a1a1a2-Z1936, probably with the Samic (first) and Fennic (later) expansions into Paleo-Lakelandic and Palaeo-Laplandic territories.

On the other hand, the two N1a-M2118 lineages are more clearly associated with Palaeo-Siberian populations east of the Urals, but became incorporated into the Ugric stock in the Trans-Urals region probably in the same way as N1a-Z1936, by infiltration from (and acculturation of) hunter-gatherers of forest and taiga cultures.

NOTE. You can read more about the infiltration of N1a lineages in the recent post Corded Ware—Uralic (IV): Hg R1a and N in Finno-Ugric and Samoyedic expansions, and in the specific sections for each Uralic group in A Clash of Chiefs.

haplogroup-n1a-M2118
Frequency-Distribution Maps of Individual Sub-clades of hg N3a2, by Ilumäe et al. (2016).

Conclusion

The picture offered by the paper on Hungarian Conquerors, while in line with historical accounts of multi-ethnic tribes incorporating regional lineages, shows nevertheless patrilineal clans clearly associated with Uralic peoples, in a distribution which could have been easily inferred from ancient Trans-Uralian forest-steppe cultures and modern samples (even regarding I2a-L621).

In spite of this, there is a great deal of discussion in the paper about specific N1a subclades in Hungarian conquerors, while the presence of R1a-Z280 (among early Magyar elites!) is interpreted, as always, as recently acculturated Slavs. This is sadly coupled with the simplistic identification of I2a-L621 as of local origin around the Carpathians.

The introduction of the paper to the history of Hungarians is also weird, for example giving credibility to the mythic accounts of the Árpád dynasty’s origin in Attila, which is in line, I guess, with what the authors intended to support all along, i.e. the association of Magyars with Turks from the Eurasian steppes, which they are apparently willing to achieve by relating them to haplogroup R1a-Z93

The conclusion is thus written to appease modern nation-building myths more than anything else, like many other papers before it:

It is generally accepted that the Hungarian language was brought to the Carpathian Basin by the Conquerors. Uralic speaking populations are characterized by a high frequency of Y-Hg N, which have often been interpreted as a genetic signal of shared ancestry. Indeed, recently a distinct shared ancestry component of likely Siberian origin was identified at the genomic level in these populations, modern Hungarians being a puzzling exception36. The Conqueror elite had a significant proportion of N Hgs, 7% of them carrying N1a1a1a1a4-M2118 and 10% N1a1a1a1a2-Z1936, both of which are present in Ugric speaking Khantys and Mansis. At the same time none of the examined Conquerors belonged to the L1034 subclade of Z1936, while all of the Khanty Z1936 lineages reported in 37 proved to be L1034 which has not been tested in the 23 study. Population genetic data rather position the Conqueror elite among Turkic groups, Bashkirs and Volga Tatars, in agreement with contemporary historical accounts which denominated the Conquerors as “Turks”. This does not exclude the possibility that the Hungarian language could also have been present in the obviously very heterogeneous, probably multiethnic Conqueror tribal alliance.

So, back to square one, and new circular reasoning: If ancient populations from north-eastern Europe believed to represent ancient Finno-Ugrians are of R1a-Z645 lineages, it’s because they were not Finno-Ugric speakers. If ancient and modern populations known to be of Finno-Ugric language show clear connections with R1a-Z645, it’s because they are “multi-ethnic”.

The only stable basis for discussion in genetic papers, apparently, is the own making of geneticists, with their traditional 2000s “R1a=Indo-European” and “N1c=Uralic”, coupled with national beliefs. It does not matter how many predictions based on that have been proven wrong, or how many predictions based on the Corded Ware = Uralic expansion have been proven right.

Related

Minimal gene flow from western pastoralists in the Bronze Age eastern steppes

jeong-steppes-mongolia

Open access paper Bronze Age population dynamics and the rise of dairy pastoralism on the eastern Eurasian steppe, by Jeong et al. PNAS (2018).

Interesting excerpts (emphasis mine):

To understand the population history and context of dairy pastoralism in the eastern Eurasian steppe, we applied genomic and proteomic analyses to individuals buried in Late Bronze Age (LBA) burial mounds associated with the Deer Stone-Khirigsuur Complex (DSKC) in northern Mongolia. To date, DSKC sites contain the clearest and most direct evidence for animal pastoralism in the Eastern steppe before ca. 1200 BCE.

Most LBA Khövsgöls are projected on top of modern Tuvinians or Altaians, who reside in neighboring regions. In comparison with other ancient individuals, they are also close to but slightly displaced from temporally earlier Neolithic and Early Bronze Age (EBA) populations from the Shamanka II cemetry (Shamanka_EN and Shamanka_EBA, respectively) from the Lake Baikal region. However, when Native Americans are added to PC calculation, we observe that LBA Khövsgöls are displaced from modern neighbors toward Native Americans along PC2, occupying a space not overlapping with any contemporary population. Such an upward shift on PC2 is also observed in the ancient Baikal populations from the Neolithic to EBA and in the Bronze Age individuals from the Altai associated with Okunevo and Karasuk cultures.

pca-eurasians-karasuk-khovsgol
Image modified from the article. Karasuk cluster in green, closely related to sample ARS026 in red. Principal Component Analysis (PCA) of selected 2,077 contemporary Eurasians belonging to 149 groups. Contemporary individuals are plotted using three-letter abbreviations for operational group IDs. Group IDs color coded by geographic region. Ancient Khövsgöl individuals and other selected ancient groups are represented on the plot by filled shapes. Ancient individuals are projected onto the PC space using the “lsqproject: YES” option in the smartpca program to minimize the impact of high genotype missing rate.

(…) two individuals fall on the PC space markedly separated from the others: ARS017 is placed close to ancient and modern northeast Asians, such as early Neolithic individuals from the Devil’s Gate archaeological site (22) and present-day Nivhs from the Russian far east, while ARS026 falls midway between the main cluster and western Eurasians.

Upper Paleolithic Siberians from nearby Afontova Gora and Mal’ta archaeological sites (AG3 and MA-1, respectively) (25, 26) have the highest extra affinity with the main cluster compared with other groups, including the eastern outlier ARS017, the early Neolithic Shamanka_EN, and present-day Nganasans and Tuvinians (Z > 6.7 SE for AG3). Main cluster Khövsgöl individuals mostly belong to Siberian mitochondrial (A, B, C, D, and G) and Y (all Q1a but one N1c1a) haplogroups.

mongolia-botai-ehg-ane-cline
The genetic affinity of the Khövsgöl clusters measured by outgroup-f3 and -f4 statistics. (A) The top 20 populations sharing the highest amount of >genetic drift with the Khövsgöl main cluster measured by f3(Mbuti; Khövsgöl, X). (B) The top 15 populations with the most extra affinity with each of the three Khövsgöl clusters in contrast to Tuvinian (for the main cluster) or to the main cluster (for the two outliers), measured by f4(Mbuti, X; Tuvinian/Khövsgöl, Khövsgöl/ARS017/ARS026). Ancient and contemporary groups are marked by squares and circles, respectively. Darker shades represent a larger f4 statistic.

Previous studies show a close genetic relationship between WSH populations and ANE ancestry, as Yamnaya and Afanasievo are modeled as a roughly equal mixture of early Holocene Iranian/ Caucasus ancestry (IRC) and Mesolithic Eastern European hunter-gatherers, the latter of which derive a large fraction of their ancestry from ANE. It is therefore important to pinpoint the source of ANE-related ancestry in the Khövsgöl gene pool: that is, whether it derives from a pre-Bronze Age ANE population (such as the one represented by AG3) or from a Bronze Age WSH population that has both ANE and IRC ancestry.

The amount of WSH contribution remains small (e.g., 6.4 ± 1.0% from Sintashta). Assuming that the early Neolithic populations of the Khövsgöl region resembled those of the nearby Baikal region, we conclude that the Khövsgöl main cluster obtained ∼11% of their ancestry from an ANE source during the Neolithic period and a much smaller contribution of WSH ancestry (4–7%) beginning in the early Bronze Age.

khovsgol-shamanka-sintashta
Admixture modeling of Altai populations and the Khövsgöl main cluster using qpAdm. For the archaeological populations, (A) Shamanka_EBA and (B and C) Khövsgöl, each colored block represents the proportion of ancestry derived from a corresponding ancestry source in the legend. Error bars show 1 SE. (A) Shamanka_EBA is modeled as a mixture of Shamanka_EN and AG3. The Khövsgöl main cluster is modeled as (B) a two-way admixture of Shamanka_EBA+Sintashta and (C) a three-way admixture Shamanka_EN+AG3+Sintashta.

Apparently, then, the first individual with substantial WSH ancestry in the Khövsgöl population (ARS026, of haplogroup R1a-Z2123), directly dated to 1130–900 BC, is consistent with the first appearance of admixed forest-steppe-related populations like Karasuk (ca. 1200-800 BC) in the Altai. Interestingly, haplogroup N1a1a-M178 pops up (with mtDNA U5a2d1) among the earlier Khövsgöl samples.

I will repeat what I wrote recently here: Samoyedic arrived in the Altai with Karasuk and hg R1a-Z645 + Steppe_MLBA-like ancestry, admixed with Altai populations, clustering thus within an Ancient Altai cline. Only later did N1a1a subclades infiltrate Samoyedic (and Ugric) populations, bringing them closer to their modern Palaeo-Siberian cline. The shared mtDNA may support an ancestral EHG-“Siberian” cline, or else a more recent Afanasevo-related origin.

east-uralic-clines
Modified image from Jeong et al. (2018), supplementary materials. The first two PCs summarizing the genetic structure within 2,077 Eurasian individuals. The two PCs generally mirror geography. PC1 separates western and eastern Eurasian populations, with many inner Eurasians in the middle. PC2 separates eastern Eurasians along the north-south cline and also separates Europeans from West Asians. Ancient individuals (color-filled shapes), including two Botai individuals, are projected onto PCs calculated from present-day individuals. Read more.

Also interesting, Q1a2 subclades and ANE ancestry making its appearance everywhere among ancestral Eurasian peoples, as Chetan recently pointed out.

Related

Corded Ware—Uralic (IV): Hg R1a and N in Finno-Ugric and Samoyedic expansions

haplogroup-uralians

This is the fourth of four posts on the Corded Ware—Uralic identification:

Let me begin this final post on the Corded Ware—Uralic connection with an assertion that should be obvious to everyone involved in ethnolinguistic identification of prehistoric populations but, for one reason or another, is usually forgotten. In the words of David Reich, in Who We Are and How We Got Here (2018):

Human history is full of dead ends, and we should not expect the people who lived in any one place in the past to be the direct ancestors of those who live there today.

Haplogroup N

Another recurrent argument – apart from “Siberian ancestry” – for the location of the Uralic homeland is “haplogroup N”. This is as serious as saying “haplogroup R1” to refer to Indo-European migrations, but let’s explore this possibility anyway:

Ancient haplogroups

We have now a better idea of how many ancient migrations (previously hypothesized to be associated with westward Uralic migrations) look like in genetic terms. From Damgaard et al. (Science 2018):

These serial changes in the Baikal populations are reflected in Y-chromosome lineages (Fig. SA; figs. S24 to S27, and tables S13 and SI4). MAI carries the R haplogroup, whereas the majority of Baikal_EN males belong to N lineages, which were widely distributed across Northern Eurasia (29), and the Baikal_LNBA males all carry Q haplogroups, as do most of the Okunevo_EMBA as well as some present-day Central Asians and Siberians.

The only N1c1 sample comes from Ust’Ida Late Neolithic, 180km to the north of Lake Baikal, which – together with the Bronze Age sample from the Kola peninsula, and the medieval sample from Ust’Ida – gives a good idea of the overall expansion of N subclades and Siberian ancestry among the Circum-Arctic peoples of Eurasia, speakers of Palaeo-Siberian languages.

eurasian-n-subclades
Geographical location of ancient samples belonging to major clade N of the Y-chromosome.

Modern haplogroups

What we should expect from Uralic peoples expanding with haplogroup N – seeing how Yamna expands with R1b-L23, and Corded Ware expands with R1a-Z645 – is to find a common subclade spreading with Uralic populations. Let’s see if it works like that for any N-X subclade, in data from Ilumäe et al. (2016):

haplogroup_n1
Geographic-Distribution Map of hg N3 / N1c / N1a.

Within the Eurasian circum-Arctic spread zone, N3 and N2a reveal a well-structured spread pattern where individual sub-clades show very different distributions:

N1a1-M46 (or N-TAT), formed ca. 13900 BC, TMRCA 9800 BC

   N1a1a2-B187, formed ca. 9800 BC, TMRCA 1050 AD:

The sub-clade N3b-B187 is specific to southern Siberia and Mongolia, whereas N3a-L708 is spread widely in other regions of northern Eurasia.

     N1a1a1a-L708, formed ca. 6800 BC, TMRCA 5400 BC.

       N1a1a1a2-B211/Y9022, formed ca. 5400 BC, TMRCA 1900 BC:

The deepest clade within N3a is N3a1-B211, mostly present in the Volga-Uralic region and western Siberian Khanty and Mansi populations.

         N1a1a1a1a-L392/L1026), formed ca. 4400 BC, TMRCA 2800 BC:

The neighbor clade, N3a3’6-CTS6967, spreads from eastern Siberia to the eastern part of Fennoscandia and the Baltic States

haplogroup_n3a3
Frequency-Distribution Maps of Individual Subclade N3a3 / N1a1a1a1a1a-CTS2929/VL29, probably initially with Akozino warrior-traders.

           N1a1a1a1a1a-CTS2929/VL29, formed ca. 2100 BC, TMRCA 1600 BC:

In Europe, the clade N3a3-VL29 encompasses over a third of the present-day male Estonians, Latvians, and Lithuanians but is also present among Saami, Karelians, and Finns (Table S2 and Figure 3). Among the Slavic-speaking Belarusians, Ukrainians, and Russians, about three-fourths of their hg N3 Y chromosomes belong to hg N3a3.

In the post on Finno-Permic expansions, I depicted what seems to me the most likely way of infiltration of N1c-L392 lineages with Akozino warrior-traders into the western Finno-Ugric populations, with an origin around the Barents sea.

This includes the potential spread of (a minority of) N1c-B211 subclades due to contacts with Anonino on both sides of the Urals, through a northern route of forest and forest-steppe regions (equivalent to the distribution of Cherkaskul compared to Andronovo), given the spread of certain subclades in Ugric populations.

NOTE. An alternative possibility is the association of certain B211 subclades with a southern route of expansion with Pre-Scythian and Scythian populations, under whose influence the Ananino culture emerged -which would imply a very quick infiltration of certain groups of haplogroup N everywhere among Finno-Ugrics on both sides of the Urals – , and also the expansion of some subclades with Turkic-speaking peoples, who apparently expanded with alliances of different peoples. Both (Scythian and Turkic) populations expanded from East Asia, where haplogroup N (including N1c) was present since the Neolithic. I find this a worse model of expansion for upper clades, but – given the YFull estimates and the presence of this haplogroup among Turkic peoples – it is a possibility for many subclades.

           N1a1a1a1a2-Z1936, formed ca. 2800 BC, TMRCA 2400 BC:

The only notable exception from the pattern are Russians from northern regions of European Russia, where, in turn, about two-thirds of the hg N3 Y chromosomes belong to the hg N3a4-Z1936—the second west Eurasian clade. Thus, according to the frequency distribution of this clade, these Northern Russians fit better among other non-Slavic populations from northeastern Europe. N3a4 tends to increase in frequency toward the northeastern European regions but is also somewhat unexpectedly a dominant hg N3 lineage among most Turcic-speaking Volga Tatars and South-Ural Bashkirs.

haplogroup_n3a4
Frequency-Distribution Maps of Individual Subclade N3a4 / N1a1a1a1a2-Z1936, probably with the Samic (first) and Fennic (later) expansions into Paleo-Lakelandic and Palaeo-Laplandic territories.

The expansion of N1a-Z1936 in Fennoscandia is most likely associated with the expansion of Saami into asbestos ware-related territory (like the Lovozero culture) during the Late Iron Age – and mixture with its population – , and with the later Fennic expansion to the east and north, replacing their language, as well as with Arctic and forest populations assimilated during Permic, Ugric, and Samoyedic expansions to the north.

           N1a1a1a1a4-M2019 (previously N3a2), formed ca. 4400 BC, TMRCA 1700 BC:

Sub-hg N3a2-M2118 is one of the two main bifurcating branches in the nested cladistic structure of N3a2’6-M2110. It is predominantly found in populations inhabiting present-day Yakutia (Republic of Sakha) in central Siberia and at lower frequencies in the Khanty and Mansi populations, which exhibit a distinct Y-STR pattern (Table S7) potentially intrinsic to an additional clade inside the sub-hg N3a2

The second widespread sub-clade of hg N is N2a. (…):

   N1a2b-P43 (B523/FGC10846/Y3184), formed ca. 6800 BC, TMRCA ca. 2700 BC:

The absolute majority of N2a individuals belong to the second sub-clade, N2a1-B523, which diversified about 4.7 kya (95% CI = 4.0–5.5 kya). Its distribution covers the western and southern parts of Siberia, the Taimyr Peninsula, and the Volga-Uralic region with frequencies ranging from from 10% to 30% and does not extend to eastern Siberia (…)

haplogroup_n2
Geographic-Distribution Map of hg N2a1 / N1a2b-P43

The “European” branch suggested earlier from Y-STR patterns turned out to consist of two clades

     N1a2b2a-Y3185/FGC10847, formed ca. 2200 BC, TMRCA 800 BC:

N2a1-L1419, spread mainly in the northern part of that region.

     N1a2b2b1-B528/Y24382, formed ca. 900 BC, TMRCA ca. 900 BC:

N2a1-B528, spread in the southern Volga-Uralic region.

Haplogroup R1a

We also have a good idea of the distribution of haplogroup R1a-Z645 in ancient samples. Its subclades were associated with the Corded Ware expansion, and some of them fit quite well the early expansion of Finno-Permic, Ugric, and Samoyedic peoples to the east.

r1a-z282-z280-z2125-distribution
Modified image, from Underhill et al. (2015). Spatial frequency distributions of Z282 (green) and Z93 (blue) affiliated haplogroups.. Notice the potential Finno-Ugric-associated distribution of Z282 (especially R1a-M558, a Z280 subclade), the expansion of R1a-Z2123 subclades with Central Asian forest-steppe groups.

This is how the modern distribution of R1a among Uralians looks like, from the latest report in Tambets et al. (2018):

  • Among Fennic populations, Estonians and Karelians (ca. 1.1 million) have not suffered the greatest bottleneck of Finns (ca. 6-7 million), and show thus a greater proportion of R1a-Z280 than N1c subclades, which points to the original situation of Fennic peoples before their expansion. To trust Finnish Y-DNA to derive conclusions about the Uralic populations is as useful as relying on the Basque Y-DNA for the language spread by R1b-P312
  • Among Volga-Finnic populations, Mordovians (the closest to the original Uralic cluster, see above) show a majority of R1a lineages (27%).
  • Hungarians (ca. 13-15 million) represent the majority of Ugric (and Finno-Ugric) peoples. They are mainly R1a-Z280, also R1a-Z2123, have little N1c, and lack Siberian ancestry, and represent thus the most likely original situation of Ugric peoples in 4th century AD (read more on Avars and Hungarians).
  • Among Samoyedic peoples, the Selkup, the southernmost ones and latest to expand – that is, those not heavily admixed with Siberian populations – , also have a majority of R1a-Z2123 lineages (see also here for the original Samoyedic haplogroups to the south).

To understand the relevance of Hungarians for Ugric peoples, as well as Estonians, Karelians, and Mordovians (and northern Russians, Finno-Ugric peoples recently Russified) for Finno-Permic peoples, as opposed to the Circum-Arctic and East Siberian populations, one has to put demographics in perspective. Even a modern map can show the relevance of certain territories in the past:

population-density
Population density (people per km2) map of the world in 1994. From Wikipedia.

Summary of ancestry + haplogroups

Fennic and Samic populations seem to be clearly influenced by Palaeo-Laplandic peoples, whereas Volga-Finnic and especially Permic populations may have received gene flow from both, but essentially Palaeo-Siberian influence from the north and east.

The fact that modern Mansis and Khantys offer the highest variation in N1a subclades, and some of the highest “Siberian ancestry” among non-Nganasans, should have raised a red flag long ago. The fact that Hungarians – supposedly stemming from a source population similar to Mansis – do not offer the same amount of N subclades or Siberian ancestry (not even close), and offer instead more R1a, in common with Estonians (among Finno-Samic peoples) and Mordvins (among Volga-Finnic peoples) should have raised a still bigger red flag. The fact that Nganasans – the model for Siberian ancestry – show completely different N1a2b-P43 lineages should have been a huge genetic red line (on top of the anthropological one) to regard them as the Uralian-type population.

We know now that ethnolinguistic groups have usually expanded with massive (usually male-biased) migrations, and that neighbouring locals often ‘resurge’ later without changing the language. That is seen in Europe after the spread of Bell Beakers, with the increase of previous ancestry and lineages in Scandinavia during the formation of the Nordic ethnolinguistic community; in Central-West Europe, with the resurgence of Neolithic ancestry (and lineages) during the Bronze Age over steppe ancestry; and in Central-East Europe (with Unetice or East European Bronze Age groups like Mierzanowice, Trzciniec, or Lusatian) showing an increase in steppe ancestry (and resurge of R1a subclades); none of them represented a radical ethnolinguistic change.

finno-ugric-haplogroup-n
Map of archaeological cultures in north-eastern Europe ca. 8th-3rd centuries BC. [The Mid-Volga Akozino group not depicted] Shaded area represents the Ananino cultural-historical society. Fading purple arrows represent likely stepped movements of subclades of haplogroup N for centuries (e.g. Siberian → Ananino → Akozino → Fennoscandia [N-VL29]; Circum-Arctic → forest-steppe [N1, N2]; etc.). Blue arrows represent eventual expansions of Uralic peoples to the north. Modified image from Vasilyev (2002).

It is not hard to model the stepped arrival, infiltration, and/or resurge of N subclades and “Siberian ancestries”, as well as their gradual expansion in certain regions, associated with certain migrations first – such as the expansions to the Circum-Arctic region, and later the Scythian- and Turkic-related movements – , as well as limited regional developments, like the known bottleneck in Finns, or the clear late expansion of Ugric and Samoyedic languages to the north among nomadic Palaeo-Siberians due to traditions of exogamy and multilingualism. This fits quite well with the different arrival of N (N1c and xN1c) lineages to the different Uralic-speaking groups, and to the stepped appearance of “Siberian ancestry” in the different regions.

The aternative

It is evident that a lot of people were too attached to the idea of Palaeolithic R1b lineages ‘native’ to western Europe speaking Basque languages; of R1a lineages speaking Indo-European and spreading with Yamna; and N lineages ‘native’ to north-eastern Europe and speaking Uralic, and this is causing widespread weeping and gnashing of teeth (instead of the joy of discovering where one’s true patrilineal ancestors come from, and what language they spoke in each given period, which is the supposed objective of genetic genealogy…)

Since an Indo-Germanic branch (as revived now by some in the Copenhaguen group to fit Kristiansen’s theory of the 1980s with recent genetic data) does not make any sense in linguistics, the finding of R1a in Yamna would not have led where some think it would have, because North-West Indo-European would still be the main Late PIE branch in Europe. Don’t take my word for it; take James P. Mallory’s (2013).

mallory-adams-tree
The levels of Indo-European reconstruction, from Mallory & Adams (2006).

If an (unlikely) Indo-Slavonic group were posited, though, such a group would still be bound (with Indo-Iranian) to the steppes with East Yamna/Poltavka (admixing with Abashevo migrants, but retaining its language), developing Sintashta/Potapovka → Srubna/Andronovo, and R1a lineages would have equally undergone the known bottlenecks of the steppes where they replaced R1b-Z2103 – which this eastern group shares with Balkan languages, a haplogroup that links therefore together the Graeco-Aryan group.

As far as I know – and there might be many other similar pet theories out there – there have been proposals of “modern Balto-Slavic-like” populations (in an obvious circular reasoning based on modern populations) in some Scythian clusters of the Iron Age.

NOTE. I will not enter into “Balto-Slavic-like R1a” of the Late Bronze Age or earlier because no one can seriously believe at this point of development of Population Genetics that autosomal similarity predating 1,500+ years the appearance of Slavs equates to their (ethnolinguistic) ancestral population, without a clear intermediate cultural and genetic trail – something we lack today in the Slavic case even for the late Roman period…

finno-saamic-palaeo-germanic-substratum
The Finnic and Saamic separation looks shallower than it actually is. Invisible convergence can be ‘triangulated’ with the help of Germanic layers of mutual loanwords (Häkkinen 2012).

We also know of R1a-Z280 lineages in Srubna, probably expanding to the west. With that in mind, and knowing that Palaeo-Germanic was in close contact with Finno-Samic while both were already separated but still in contact, and that Palaeo-Germanic was also in contact and closely related to a ‘Temematic’ distinct from Balto-Slavic (and also that early Proto-Baltic and Proto-Slavic from the Roman Iron Age and later were in contact with western Uralic) this will be the linguistic map of the Iron Age if R1a is considered to expand Indo-European from some kind of “patron-client” relationship with west Yamna:

palaeo-germanic-italo-celtic
Eastern European language map during the Late Bronze Age / Iron Age, if R1a spread Indo-European languages and Eastern Yamna spoke Indo-Slavonic. Palaeo-Germanic (i.e. Pre- to Proto-Germanic) needs to be in contact with both the Samic Lovozero population and the Fennic west Circum-Arctic one. Italic and Celtic in contact with Pre-Germanic. Germanic in contact with Temematic. Balto-Slavic in contact with Iranian, and near Fennic to allow for later loanwords. For Germanic and Temematic, see Kortlandt (2018).

You might think I have some personal or political reason against this kind of proposals. I haven’t. We have been proposing Indo-European to be the language of the European Union for more than 10 years, so to support R1b-Italo-Celtic in the whole Western Europe, R1a-Germanic in Central and Eastern Europe, and R1a-Indo-Slavonic in the steppes (as the Danish group seems to be doing) has nothing inherently bad (or good) for me. If anything, it gives more reason to support the revival of North-West Indo-European in Europe.

My problem with this proposal is that it is obviously beholden to the notion of the uninterrupted cultural, historic and ethnic continuity in certain territories. This bias is common in historiography (von Falkenhausen 1993), but it extends even more easily into the lesser known prehistory of any territory, and now more than ever some people feel the need to corrupt (pre)history based on their own haplogroups (or the majority haplogroups of their modern countries). However, more than on philosophical grounds, my rejection is based on facts: this picture is not what the combination of linguistic, archaeological, and genetic data shows. Period.

Nevertheless, if Yamna + Corded Ware represented the “big and early expansion” of Germanic and Italo-Celtic peoples proper of the dream Nazi’s Lebensraum and Fascist’s spazio vitale proposals; Uralians were Siberian hunter-gatherers that controlled the whole eastern and northern Russia, and miraculously managed to push (ethnolinguistically) Neolithic agropastoralists to the west during and after the Iron Age, with gradual (and often minimal) genetic impact; and Balto-Slavic peoples were represented by horse riders from Pokrovka/Srubna, hiding then somewhere around the forest-steppe until after the Scythian expansion, and then spreading their language (without much genetic impact) during the early Middle Ages…so be it.

See also

Related