Corded Ware ancestry in North Eurasia and the Uralic expansion

uralic-clines-nganasan

Now that it has become evident that Late Repin (i.e. Yamnaya/Afanasevo) ancestry was associated with the migration of R1b-L23-rich Late Proto-Indo-Europeans from the steppe in the second half of the the 4th millennium BC, there’s still the question of how R1a-rich Uralic speakers of Corded Ware ancestry expanded , and how they spread their languages throughout North Eurasia.

Modern North Eurasians

I have been collecting information from the supplementary data of the latest papers on modern and ancient North Eurasian peoples, including Jeong et al. (2019), Saag et al. (2019), Sikora et al. (2018), or Flegontov et al. (2019), and I have tried to add up their information on ancestral components and their modern and historical distributions.

Fortunately, the current obsession with simplifying ancestry components into three or four general, atemporal groups, and the common use of the same ones across labs, make it very simple to merge data and map them.

Corded Ware ancestry

There is no doubt about the prevalent ancestry among Uralic-speaking peoples. A map isn’t needed to realize that, because ancient and modern data – like those recently summarized in Jeong et al. (2019) – prove it. But maps sure help visualize their intricate relationship better:

natural-modern-srubnaya-ancestry
Natural neighbor interpolation of Srubnaya ancestry among modern populations. See full map.
kriging-modern-srubnaya-ancestry
Kriging interpolation of Srubnaya ancestry among modern populations. See full map

Interestingly, the regions with higher Corded Ware-related ancestry are in great part coincident with (pre)historical Finno-Ugric-speaking territories:

uralic-languages-modern
Modern distribution of Uralic languages, with ancient territory (in the Common Era) labelled and delimited by a red line. For more information on the ancient territory see here.

Edit (29/7/2019): Here is the full Steppe_MLBA ancestry map, including Steppe_MLBA (vs. Indus Periphery vs. Onge) in modern South Asian populations from Narasimhan et al. (2018), apart from the ‘Srubnaya component’ in North Eurasian populations. ‘Dummy’ variables (with 0% ancestry) have been included to the south and east of the map to avoid weird interpolations of Steppe_MLBA into Africa and East Asia.

modern-steppe-mlba-ancestry2
Natural neighbor interpolation of Steppe MLBA-like ancestry among modern populations. See full map.

Anatolia Neolithic ancestry

Also interesting are the patterns of non-CWC-related ancestry, in particular the apparent wedge created by expanding East Slavs, which seems to reflect the intrusion of central(-eastern) European ancestry into Finno-Permic territory.

NOTE. Read more on Balto-Slavic hydrotoponymy, on the cradle of Russians as a Finno-Permic hotspot, and about Pre-Slavic languages in North-West Russia.

natural-modern-lbk-en-ancestry
Natural neighbor interpolation of LBK EN ancestry among modern populations. See full map.
kriging-modern-lbk-en-ancestry
Kriging interpolation of LBK EN ancestry among modern populations. See full map

WHG ancestry

The cline(s) between WHG, EHG, ANE, Nganasan, and Baikal HG are also simplified when some of them excluded, in this case EHG, represented thus in part by WHG, and in part by more eastern ancestries (see below).

modern-whg-ancestry
Natural neighbor interpolation of WHG ancestry among modern populations. See full map.
kriging-modern-whg-ancestry
Kriging interpolation of WHG ancestry among modern populations. See full map.

Arctic, Tundra or Forest-steppe?

Data on Nganasan-related vs. ANE vs. Baikal HG/Ulchi-related ancestry is difficult to map properly, because both ancestry components are usually reported as mutually exclusive, when they are in fact clearly related in an ancestral cline formed by different ancient North Eurasian populations from Siberia.

When it comes to ascertaining the origin of the multiple CWC-related clines among Uralic-speaking peoples, the question is thus how to properly distinguish the proportions of WHG-, EHG-, Nganasan-, ANE or BaikalHG-related ancestral components in North Eurasia, i.e. how did each dialectal group admix with regional groups which formed part of these clines east and west of the Urals.

The truth is, one ought to test specific ancient samples for each “Siberian” ancestry found in the different Uralic dialectal groups, but the simplistic “Siberian” label somehow gets a pass in many papers (see a recent example).

Below qpAdm results with best fits for Ulchi ancestry, Afontova Gora 3 ancestry, and Nganasan ancestry, but some populations show good fits for both and with similar proportions, so selecting one necessarily simplifies the distribution of both.

Ulchi ancestry

modern-ulchi-ancestry
Natural neighbor interpolation of Ulchi ancestry among modern populations. See full map.
kriging-modern-ulchi-ancestry
Kriging interpolation of Ulchi ancestry among modern populations. See full map.

ANE ancestry

natural-modern-ane-ancestry
Natural neighbor interpolation of ANE ancestry among modern populations. See full map.
kriging-modern-ane-ancestry
Kriging interpolation of ANE ancestry among modern populations. See full map.

Nganasan ancestry

modern-nganasan-ancestry
Natural neighbor interpolation of Nganasan ancestry among modern populations. See full map.
kriging-modern-nganasan-ancestry
Kriging interpolation of Nganasan ancestry among modern populations. See full map.

Iran Chalcolithic

A simplistic Iran Chalcolithic-related ancestry is also seen in the Altaic cline(s) which (like Corded Ware ancestry) expanded from Central Asia into Europe – apart from its historical distribution south of the Caucasus:

modern-iran-chal-ancestry
Natural neighbor interpolation of Iran Neolithic ancestry among modern populations. See full map.
kriging-modern-iran-neolithic-ancestry
Kriging interpolation of Iran Chalcolithic ancestry among modern populations. See full map.

Other models

The first question I imagine some would like to know is: what about other models? Do they show the same results? Here is the simplistic combination of ancestry components published in Damgaard et al. (2018) for the same or similar populations:

NOTE. As you can see, their selection of EHG vs. WHG vs. Nganasan vs. Natufian vs. Clovis of is of little use, but corroborate the results from other papers, and show some interesting patterns in combination with those above.

EHG

damgaard-modern-ehg-ancestry
Natural neighbor interpolation of EHG ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-ehg-ancestry
Kriging interpolation of EHG ancestry among modern populations. See full map.

Natufian ancestry

damgaard-modern-natufian-ancestry
Natural neighbor interpolation of Natufian ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-natufian-ancestry
Kriging interpolation of Natufian ancestry among modern populations. See full map.

WHG ancestry

damgaard-modern-whg-ancestry
Natural neighbor interpolation of WHG ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-whg-ancestry
Kriging interpolation of WHG ancestry among modern populations. See full map.

Baikal HG ancestry

damgaard-modern-baikalhg-ancestry
Natural neighbor interpolation of Baikal hunter-gatherer ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-baikal-hg-ancestry
Kriging interpolation of Baikal HG ancestry among modern populations. See full map.

Ancient North Eurasians

Once the modern situation is clear, relevant questions are, for example, whether EHG-, WHG-, ANE, Nganasan-, and/or Baikal HG-related meta-populations expanded or became integrated into Uralic-speaking territories.

When did these admixture/migration events happen?

How did the ancient distribution or expansion of Palaeo-Arctic, Baikalic, and/or Altaic peoples affect the current distribution of the so-called “Siberian” ancestry, and of hg. N1a, in each specific population?

NOTE. A little excursus is necessary, because the calculated repetition of a hypothetic opposition “N1a vs. R1a” doesn’t make this dichotomy real:

  1. There was not a single ethnolinguistic community represented by hg. R1a after the initial expansion of Eastern Corded Ware groups, or by hg. N1a-L392 after its initial expansion in Siberia:
  2. Different subclades became incorporated in different ways into Bronze Age and Iron Age communities, most of which without an ethnolinguistic change. For example, N1a subclades became incorporated into North Eurasian populations of different languages, reaching Uralic- and Indo-European-speaking territories of north-eastern Europe during the late Iron Age, at a time when their ancestral origin or language in Siberia was impossible to ascertain. Just like the mix found among Proto-Germanic peoples (R1b, R1a, and I1)* or among Slavic peoples (I2a, E1b, R1a)*, the mix of many Uralic groups showing specific percentages of R1a, N1a, or Q subclades* reflect more or less recent admixture or acculturation events with little impact on their languages.

*other typically northern and eastern European haplogroups are also represented in early Germanic (N1a, I2, E1b, J, G2), Slavic (I1, G2, J) and Finno-Permic (I1, R1b, J) peoples.

ananino-culture-new
Map of archaeological cultures in north-eastern Europe ca. 8th-3rd centuries BC. [The Mid-Volga Akozino group not depicted] Shaded area represents the Ananino cultural-historical society. Fading purple arrows represent likely stepped movements of subclades of haplogroup N for centuries (e.g. Siberian → Ananino → Akozino → Fennoscandia [N-VL29]; Circum-Arctic → forest-steppe [N1, N2]; etc.). Blue arrows represent eventual expansions of Uralic peoples to the north. Modified image from Vasilyev (2002).

The problem with mapping the ancestry of the available sampling of ancient populations is that we lack proper temporal and regional transects. The maps that follow include cultures roughly divided into either “Bronze Age” or “Iron Age” groups, although the difference between samples may span up to 2,000 years.

NOTE. Rough estimates for more external groups (viz. Sweden Battle Axe/Gotland_A for the NW, Srubna from the North Pontic area for the SW, Arctic/Nganasan for the NE, and Baikal EBA/”Ulchi-like” for the SE) have been included to offer a wider interpolated area using data already known.

Bronze Age

Similar to modern populations, the selection of best fit “Siberian” ancestry between Baikal HG vs. Nganasan, both potentially ± ANE (AG3), is an oversimplification that needs to be addressed in future papers.

Corded Ware ancestry

bronze-age-corded-ware-ancestry
Natural neighbor interpolation of Srubnaya ancestry among Bronze Age populations. See full map.

Nganasan-like ancestry

bronze-age-nganasan-like-ancestry
Natural neighbor interpolation of Nganasan-like ancestry among Bronze Age populations. See full map.

Baikal HG ancestry

bronze-age-baikal-hg-ancestry
Natural neighbor interpolation of Baikal Hunter-Gatherer ancestry among Bronze Age populations. See full map.

Afontova Gora 3 ancestry

bronze-age-afontova-gora-ancestry
Natural neighbor interpolation of Afontova Gora 3 ancestry among Bronze Age populations. See full map.

Iron Age

Corded Ware ancestry

Interestingly, the moderate expansion of Corded Ware-related ancestry from the south during the Iron Age may be related to the expansion of hg. N1a-VL29 into the chiefdom-based system of north-eastern Europe, including Ananyino/Akozino and later expanding Akozino warrior-traders around the Baltic Sea.

NOTE. The samples from Levänluhta are centuries older than those from Estonia (and Ingria), and those from Chalmny Varre are modern ones, so this region has to be read as a south-west to north-east distribution from the Iron Age to modern times.

iron-age-corded-ware-ancestry
Natural neighbor interpolation of Srubnaya ancestry among Iron Age populations. See full map.

Baikal HG-like ancestry

The fact that this Baltic N1a-VL29 branch belongs in a group together with typically Avar N1a-B197 supports the Altaic origin of the parent group, which is possibly related to the expansion of Baikalic ancestry and Iron Age nomads:

iron-age-baikal-ancestry
Natural neighbor interpolation of Baikal HG ancestry among Iron Age populations. See full map.

Nganasan-like ancestry

The dilution of Nganasan-like ancestry in an Arctic region featuring “Siberian” ancestry and hg. N1a-L392 at least since the Bronze Age supports the integration of hg. N1a-Z1934, sister clade of Ugric N1a-Z1936, into populations west and east of the Urals with the expansion of Uralic languages to the north into the Tundra region (see here).

The integration of N1a-Z1934 lineages into Finnic-speaking peoples after their migration to the north and east, and the displacement or acculturation of Saami from their ancestral homeland, coinciding with known genetic bottlenecks among Finns, is yet another proof of this evolution:

iron-age-nganasan-ancestry
Natural neighbor interpolation of Nganasan ancestry among Iron Age populations. See full map.

WHG ancestry

Similarly, WHG ancestry doesn’t seem to be related to important population movements throughout the Bronze Age, which excludes the multiple North Eurasian populations that will be found along the clines formed by WHG, EHG, ANE, Nganasan, Baikal HG ancestry as forming part of the Uralic ethnogenesis, although they may be relevant to follow later regional movements of specific populations.

iron-age-whg-ancestry
Natural neighbor interpolation of WHG ancestry among Iron Age populations. See full map.

Conclusion

It seems natural that people used to look at maps of haplogroup distribution from the 2000s, coupled with modern language distributions, and would try to interpret them in a certain way, reaching thus the wrong conclusions whose consequences are especially visible today when ancient DNA keeps contradicting them.

In hindsight, though, assuming that Balto-Slavs expanded with Corded Ware and hg. R1a, or that Uralians expanded with “Siberian” ancestry and hg. N1a, was as absurd as looking at maps of ancestry and haplogroup distribution of ancient and modern Native Americans, trying to divide them into “Germanic” or “Iberian”…

The evolution of each specific region and cultural group of North Eurasia is far from being clear. However, the general trend speaks clearly in favour of an ancient, Bronze Age distribution of North Eurasian ancestry and haplogroups that have decreased, diluted, or become incorporated into expanding Uralians of Corded Ware ancestry, occasionally spreading with inter-regional expansions of local groups.

Given the relatively recent push of Altaic and Indo-European languages into ancestral Uralic-speaking territories, only the ancient Corded Ware expansion remains compatible with the spread of Uralic languages into their historical distribution.

Related

Volosovo hunter-gatherers started to disappear earlier than previously believed

volosovo-corded-ware

Recent paper (behind paywall) Marmot incisors and bear tooth pendants in Volosovo hunter-gatherer burials. New radiocarbon and stable isotope data from the Sakhtysh complex, Upper-Volga region, by Macānea, Nordqvist, and Kostyleva, J. Archaeol. Sci. (2019) 26:101908.

Interesting excerpts (emphasis mine):

The Sakhtysh micro-region is located in the Volga-Oka interfluve, along the headwaters of the Koyka River in the Ivanovo Region, central European Russia (Fig. 1). The area has evidence of human habitation from the Early Mesolithic to the Iron Age, and includes altogether 11 long-term and seasonal settlements (Sakhtysh I–II, IIa, III–IV, VII–XI, XIV) and four artefact scatters (sites V–VI, XII–XIII), in addition to which burials have been detected at five sites (I–II, IIa, VII, VIII) (Kostyleva and Utkin, 2010). The locations have been known since the 1930s and intensively studied since the 1960s under the leadership of D.A. Kraynov, M.G. Zhilin, E.L. Kostyleva, and A.V. Utkin.

Sakhtysh II and IIa are the most extensively studied sites of the complex, with ca. 1500m2 and around 800m2 excavated, respectively. The burial grounds at both sites are considered as fully investigated.

volosovo-sakhtysh-dates
AMS datings from the sites Sakhtysh II and IIa. Sampled contexts are given in parentheses (burial/hoard), “crust” indicates samples of charred organic
residues on pottery from cultural layer. For data, see Tables 1–2.

Sakhtysh chronology

The AMS dates do not support the previously proposed phasing of the Sakhtysh burials to early (4750–4375 BP/3600–3000 cal BCE), late (or developed; 4375–4000 BP/3000–2500 cal BCE), and final (4000–3750 BP/2500–2200 cal BCE): the early and late burials at Sakhtysh IIa do not stand out as two separate groups, and also the burials and hoards from Sakhtysh II, connected to the final phase, are temporally overlapping with these. Neither the use sequence, where the settlement and burial phases are non-overlapping and also complementary between the sites (Kostyleva and Utkin, 2010, 2014), finds support in the present material.

The AMS datings indicate that the Volosovo people started to bury their dead at Sakhtysh IIa after 3700 cal BCE; dates earlier than this may be affected by FRE or suffer from mixed contexts and poor quality of dates. The present data questions the interpretation that the Sakhtysh IIa cemetery was used without interruptions between 4800 and 4080 BP (Kostyleva and Utkin, 2010), i.e. for a millennium between 3550 and 2600 cal BCE. The AMS dates rather suggest a use period of some centuries only around the mid-4th millennium cal BCE, tentatively 3650–3400 cal BCE. This would also be more realistic considering the number of burials at the site.

volosovo-sakhtysh-sites
The core area of Volosovo culture (after Kraynov, 1987) and the sites of the Sakhtysh complex (after Kostyleva and Utkin, 2010). Eurasian map base made with Natural Earth. Illustration: K. Nordqvist.

Volosovo chronology

The absolute dating of Volosovo culture was for a long time hampered by the small number of radiocarbon dates (see Kraynov, 1987). Today,>100 datings connected with it can be found in literature (Korolev and Shalapinin, 2010; Chernykh et al., 2011; Nikitin, 2012; Mosin et al., 2014). Unfortunately, the available dates do not form solid grounds for dating the cultural phenomenon, as many of them have quality-related issues, large measurement errors, and ambiguous cultural or physical contexts. Consequently, particular datings may be connected to different cultural phases by different scholars. Finally, a large part of the newly-published datings are obtained through direct dating of potsherds (Kovaliukh and Skripkin, 2007; Zaitseva et al., 2009), and therefore, their cogency must be faced with reservation (see Van der Plicht et al., 2016; Dolbunova et al., 2017).

The datings connected with Volosovo cover a wide time range between ca. 5500 BP (4400 cal BCE) and ca. 3700 BP (2100 cal BCE). However, datings from secure contexts, with good quality (error ca. 50 years or below) and no probable FRE, place the beginning of Volosovo culture to the first half of the 4th millennium cal BCE, around 3700–3600 cal BCE. This is also supported by the roughly coeval terminal dates given for the preceding Lyalovo (Zaretskaya and Kostyleva, 2011) and Volga-Kama cultures (Lychagina, 2018), as well as the appearance of related neighbouring cultures, for example, in the Kama region (Nikitin, 2012; Lychagina, 2018), the southern forest steppe area (Korolev and Shalapinin, 2014), and north-western Russia and Finland (Nordqvist, 2018). Still, the dating of many of these cultural phases suffers from the same problems as of Volosovo.

A handful of contested datings place the end of Volosovo culture to the final centuries of the 3rd millennium cal BCE, or even later (Kostyleva and Utkin, 2010; Chernykh et al., 2011; Nikitin, 2012). On the other hand, the new AMS dates indicate that Volosovo activities at Sakhtysh II and IIa ceased before or towards the early 3rd millennium cal BCE; if this reflects the general decline of Volosovo culture must be still confirmed by more dates from Sakhtysh and elsewhere. In this context, the general cultural development must be accounted for. To what extent – if at all – the Volosovo people were present after the arrival of the Corded Ware culture-related Fatyanovo-Balanovo populations? Based on the current, albeit scant and inconclusive radiocarbon data this took place from ca. 2700 cal BCE onwards (Krenke et al., 2013).

volosovo-fatyanovo-balanovo
Corded Ware and Comb Ware hunter-gatherer-related populations in north-eastern Europe from ca. 2600 BC. See full map.

Comments

One of the interesting genetic papers in the near future will be the one that finally includes samples from Corded Ware groups in the forest zone (i.e. Fatyanovo-Balanovo and Abashevo), which will most likely confirm that they are the origin of the known genetic profile of Central and East Uralic-speaking peoples, seeing how West Uralic peoples show genetic continuity in the East Baltic area, coinciding with the Battle Axe culture.

Uralicists have come a long way from the 1990s, when the picture of Uralic before Balto-Slavic in the Baltic was already evident, and Uralians were identified with Comb Ware peoples. The linguistic data and relative chronology are still valid, despite the now outdated interpretations of absolute archaeological chronology, as happens with interpretations of Krahe or Villar about Old European.

As an example, here are some relevant excerpts from Languages in the Prehistoric Baltic Sea Region, by Kallio (2003):

NOTE. Kallio’s contribution appeared in the book Languages in Prehistoric Europe (2003), which I hold nostalgically close in my Indo-European library (now almost impossible to read fully). It is still one of my preferred books (from those made up of mostly unconnected chunks on European linguistic prehistory), because it contains Oettinger’s essential update of North-West Indo-European common vocabulary, which led us indirectly to our Modern Indo-European project from 2005 on.

In any case, the Uralic arrival in the region east of the Baltic Sea preceded the Indo-European one (…).

This theory that the ancestors of Finno-Saamic speakers arrived in the Baltic Sea region earlier than those of Balto-Slavic speakers is still rejected by some scholars (e.g. Napolskikh 1993: 41-44), who claim, for instance, that Finno-Saamic speakers would not have known salmons before they met Balts because the Finno-Saamic word for ‘salmon’ (i.e. *losi) is a borrowing from Baltic. Similarly, one could claim that English speakers would not have known salmons before they met Frenchmen because English salmon is a borrowing from French. In other words, Worter und Sachen are not necessarily borrowed hand in hand. Otherwise, it would not be so easy to explain how many Finnish names of body parts are borrowings from Baltic (e.g. hammas ‘tooth’, kaula ‘neck’, reisi ‘thigh’) and from Germanic (e.g. hartia ‘shoulder’, lantio ‘loin’, maha ‘stomach’).

A more probative argument is the fact that Balto-Slavic features in Finno-Saamic are mostly lexical ones (i.e. typical superstrate features), where Finno-Saamic features in Balto-Slavic are mostly non-lexical ones (i.e. typical substrate features). Note that there are more Balto-Slavic features in Finnic than in Saamic and more Finno-Saamic features in Baltic than in Slavic. This fact could be explained by presuming that Pre-Saamic was spoken north of the Corded Ware area and Pre-Slavic was spoken south of the Typical Pit-Comb Ware area, whereas Pre-Finnic and Pre-Baltic alone were spoken in the area, where both the Typical Pit-Comb Ware culture (ca. 4000-3600 BC) and the Corded Ware culture (ca. 3200-2300 BC) were situated. This area was most probably bilingual, until Finnic and Baltic won in the north and in the south, respectively.

As is well-known, the idea of Uralic substrate features in Balto-Slavic is not new (cf. e.g. Pokorny 1936/1968: 181-185). As recent studies (e.g. Bednarczuk 1997) have shown, their density is the most remarkable in the four Balto-Slavic languages spoken in the earlier Pit-Comb Ware area (i.e. Latvian, Lithuanian, Belorussian, Russian). On the other hand, occasional Uralisms in the other Balto-Slavic languages spoken west of the Vistula and south of the Pripyat may rather be considered adstrate features spread from the northeast.

comb-ware-uralic
Our beliefs from the 2000s. A hypothetic Uralic Comb Ware distribution before the arrival of a hypothetic North-West Indo-European-speaking Corded Ware. “Generalized distribution of the Pit-Comb Ware cultural complex (Mallory & Adams 1997: 430, Carpelan 1999: 257) and the most probable homelands of Saamic, Finnic, Mordvin, Mari, and Permic.”

The idea of Indo-European superstrate features in Finnic is not new either (cf. e.g. Posti 1953). As Jorma Koivulehto (1983) has recently shown, the earliest Indo-European loanword stratum in the westernmost Uralic branches alone can be considered Northwest Indo-European and connected with the Corded Ware culture (ca. 3200-2300 BC). Since this layer, there have been continuous contacts between Baltic and Finnic. According to Koivulehto (1990), the following stratum can be called Proto-Balt(o-Slav)ic and dated to the Late Neolithic period (ca. 2300-1500 BC). Note that this Proto-Balt(o-Slav)ic dating agrees with the established ones (cf. e.g. Shevelov 1964: 613-614, Kortlandt 1982: 181), when we remember the fact that archaeologists have also moved their datings back by centuries during the last decades.

Finally, there is also a Baltic loanword stratum which was not borrowed from the ancestral stage of Latvian, Lithuanian and/or Old Prussian but from some extinct Baltic language or dialect (Nieminen 1957). However, as these words still go back to the early Proto-Finnic stage, they can hardly be dated later than Bronze-Age ( ca. 1500-500 BC). Therefore, we may conclude that they were probably borrowed from a Baltic superstrate, which arrived in the Finnish Gulf area during the Corded Ware period and survived there until the Bronze Age, when it was no longer identical with other Baltic dialects. In any case, as later Baltic loanword strata concern southern Finnic languages alone, we may presume that this ‘North Baltic’superstrate had become extinct.

The traditional association of Uralic with Volosovo hunter-gatherers doesn’t make sense, since they neither miraculously survived for thousands of years nor mixed for hundreds of years with Corded Ware peoples, so we can now more confidently reject the recent assumption by Carpelan & Parpola that their language was adopted by incoming Fatyanovo, Balanovo and Abashevo groups, to develop into the known Uralic languages (more here). This includes one of the many models of the the Copenhagen group, who simplistically follow “Steppe ancestry” for Indo-Europeannes.

If one combines the known relative linguistic chronology with the North-West Indo-European hydrotoponymy layer, now more clearly identified as Old Europeans expanding with East Bell Beakers and derived Early Bronze Age groups, I think there is little space left for maneuvering out of the overwhelming evidence for a Uralic homeland in the forest-steppes, linked to the spread of late Sredni Stog/Corded Ware ancestry into north-eastern Europe and beyond the Urals.

Related

Uralic speakers formed clines of Corded Ware ancestry with WHG:ANE populations

steppe-forest-tundra-biomes-uralic

The preprint by Jeong et al. (2018) has been published: The genetic history of admixture across inner Eurasia Nature Ecol. Evol. (2019).

Interesting excerpts, referring mainly to Uralic peoples (emphasis mine):

A model-based clustering analysis using ADMIXTURE shows a similar pattern (Fig. 2b and Supplementary Fig. 3). Overall, the proportions of ancestry components associated with Eastern or Western Eurasians are well correlated with longitude in inner Eurasians (Fig. 3). Notable outliers include known historical migrants such as Kalmyks, Nogais and Dungans. The Uralic- and Yeniseian-speaking populations, as well as Russians from multiple locations, derive most of their Eastern Eurasian ancestry from a component most enriched in Nganasans, while Turkic/Mongolic speakers have this component together with another component most enriched in populations from the Russian Far East, such as Ulchi and Nivkh (Supplementary Fig. 3). Turkic/Mongolic speakers comprising the bottom-most cline have a distinct Western Eurasian ancestry profile: they have a high proportion of a component most enriched in Mesolithic Caucasus hunter-gatherers and Neolithic Iranians and frequently harbour another component enriched in present-day South Asians (Supplementary Fig. 4). Based on the PCA and ADMIXTURE results, we heuristically assigned inner Eurasians to three clines: the ‘forest-tundra’ cline includes Russians and all Uralic and Yeniseian speakers; the ‘steppe-forest’ cline includes Turkic- and Mongolic-speaking populations from the Volga and Altai–Sayan regions and Southern Siberia; and the ‘southern steppe’ cline includes the rest of the populations.

eurasian-clines-uralic-altaic
The first two PCs summarizing the genetic structure within 2,077 Eurasian individuals. The two PCs generally mirror geography. PC1 separates western and eastern Eurasian populations, with many inner Eurasians in the middle. PC2 separates eastern Eurasians along the northsouth cline and also separates Europeans from West Asians. Ancient individuals (color-filled shapes), including two Botai individuals, are projected onto PCs calculated from present-day individuals.

For the forest-tundra populations, the Nganasan + Srubnaya model is adequate only for the two Volga region populations, Udmurts and Besermyans (Fig. 5 and Supplementary Table 8).

For the other populations west of the Urals, six from the northeastern corner of Europe are modelled with additional Mesolithic Western European hunter-gatherer (WHG) contribution (8.2–11.4%; Supplementary Table 8), while the rest need both WHG and early Neolithic European farmers (LBK_EN; Supplementary Table 2). Nganasan-related ancestry substantially contributes to their gene pools and cannot be removed from the model without a significant decrease in the model fit (4.1–29.0% contribution; χ2 P ≤ 1.68 × 10−5; Supplementary Table 8).

west-urals-finno-ugrians-qpadm
Supplementary Table 8. QpAdm-based admixture modeling of the forest-tundra cline populations. For the 13 populations west of the Urals, we present a four-way admixture model, Nganasan+Srubnaya+WHG+LBK_EN, or its minimal adequate subset. Modified from the article, to include colors for cultures, and underlined best models for Corded Ware ancestry among Uralians.

NOTE. It doesn’t seem like Hungarians can be easily modelled with Nganasan ancestry, though…

For the 4 populations east of the Urals (Enets, Selkups, Kets and Mansi), for which the above models are not adequate, Nganasan + Srubnaya + AG3 provides a good fit (χ2 P ≥ 0.018; Fig. 5 and Supplementary Table 8). Using early Bronze Age populations from the Baikal Lake region (‘Baikal_EBA’; Supplementary Table 2) as a reference instead of Nganasan, the two-way model of Baikal_EBA + Srubnaya provides a reasonable fit (χ2 P ≥ 0.016; Supplementary Table 8) and the three-way model of Baikal_EBA + Srubnaya + AG3 is adequate but with negative AG3 contribution for Enets and Mansi (χ2 P ≥ 0.460; Supplementary Table 8).

east-urals-ugric-samoyedic-qpadm
Supplementary Table 8. QpAdm-based admixture modeling of the forest-tundra cline populations. For the four populations east of the Urals, we present three admixture models: Baikal_EBA+Srubnaya, Baikal_EBA+Srubnaya+AG3 and Nganasan+Srubnaya+AG3. For each model, we present qpAdm p-value, admixture coefficient estimates and associated 5 cM jackknife standard errors (estimate ± SE). Modified from the article, to include colors for cultures, and underlined best models for Corded Ware ancestry among Uralians.

Bronze/Iron Age populations from Southern Siberia also show a similar ancestry composition with high ANE affinity (Supplementary Table 9). The additional ANE contribution beyond the Nganasan + Srubnaya model suggests a legacy from ANE-ancestry-rich clines before the Late Bronze Age.

bronze-age-iron-age-karasuk-mezhovska-tagar-qpadm
Supplementary Table 9. QpAdm-based admixture modeling of Bronze and Iron Age populations of southern Siberia. For ancieint individuals associated with Karasuk and Tagar cultures, Nganasan+Srubnaya model is insufficient. For all five groups, adding AG3 as the third ancestry or substituting Nganasan with Baikal_EBA with higher ANE affinity provides an adequate model. For each model, we present qpAdm p-value, admixture coefficient estimates and associated 5 cM jackknife standard errors (estimate ± SE). Models with p-value ≥ 0.05 are highlighted in bold face. Modified from the article, to include colors for cultures, and underlined best models for Corded Ware ancestry among Uralians.

Lara M. Cassidy comments the results of the study in A steppe in the right direction (you can read it here):

Even among the earliest available inner Eurasian genomes, east–west connectivity is evident. These, too, form a longitudinal cline, characterized by the easterly increase of a distinct ancestry, labelled Ancient North Eurasian (ANE), lowest in western European hunter-gatherers (WHG) and highest in Palaeolithic Siberians from the Baikal region. Flow-through from this ANE cline is seen in steppe populations until at least the Bronze Age, including the world’s earliest known horse herders — the Botai. However, this is eroded over time by migration from west and east, following agricultural adoption on the continental peripheries (Fig. 1b,c).

Strikingly, Jeong et al. model the modern upper steppe cline as a simple two-way mixture between western Late Bronze Age herders and Northeast Asians (Fig. 1c), with no detectable residue from the older ANE cline. They propose modern steppe peoples were established mainly through migrations post-dating the Bronze Age, a sequence for which has been recently outlined using ancient genomes. In contrast, they confirm a substantial ANE legacy in modern Siberians of the northernmost cline, a pattern mirrored in excesses of WHG ancestry west of the Urals (Fig. 1b). This marks the inhospitable biome as a reservoir for older lineages, an indication that longstanding barriers to latitudinal movement may indeed be at work, reducing the penetrance of gene flows further south along the steppe.

eurasian-clines-uralic-turkic-mongol-altaic
The genomic formation of inner Eurasians. b–d, Depiction of the three main clines of ancestry identified among Inner Eurasians. Sources of admixture for each cline are represented using proxy ancient populations, both sampled and hypothesised, based on the study’s modelling results. The major eastern and western ancestries used to model each cline are shown in bold; the peripheral admixtures that gave rise to these are also shown. Additional contributions to subsections of each cline are marked with dashed lines. b, The northernmost cline, illustrating the legacy of WHG and ANE-related populations. c,d, The upper (c) and lower (d) steppe clines are shown, both of which have substantial eastern contributions related to modern Tungusic speakers. The authors propose these populations are themselves the result of an admixture between groups related to the Nganasan, whose ancestors potentially occupied a wider range, and hunter-gatherers (HGs) from the Amur River Basin. While the upper steppe cline in c can be described as a mixture between this eastern ancestry and western steppe herders, the current model for the southern steppe cline as shown in d is not adequate and is likely confounded by interactions with diverse bordering ancestries. Credit: Ecoregions 2017, Resolve https://ecoregions2017.appspot.com/

Given the findings as reported in the paper, I think it should be much easier to describe different subclines in the “northernmost cline” than in the much more recent “Turkic/Mongolic cline”, which is nevertheless subdivided in this paper in two clines. As an example, there are at least two obvious clines with “Nganasan-related meta-populations” among Uralians, which converge in a common Steppe MLBA (i.e. Corded Ware) ancestry – one with Palaeo-Laplandic peoples, and another one with different Palaeo-Siberian populations:

siberian-clines-uralic-altaic
PCA of ancient and modern Eurasian samples. Ancient Palaeo-Laplandic, Palaeosiberian, and Altai clines drawn, with modern populations labelled. See a version with higher resolution.

The inclusion of certain Eurasian groups (or lack thereof) in the PCA doesn’t help to distinguish these subclines visually, and I guess the tiny “Naganasan-related” ancestral components found in some western populations (e.g. the famous ~5% among Estonians) probably don’t lend themselves easily to further subdivisions. Notice, nevertheless, the different components of the Eastern Eurasian source populations among Finno-Ugrians:

uralic-admixture-qpadm
Characterization of the Western and Eastern Eurasian source ancestries in inner Eurasian populations. [Modified from the paper, includes only Uralic populations]. a, Admixture f3 values are compared for different Eastern Eurasian (Mixe, Nganasan and Ulchi; green) and Western Eurasian references (Srubnaya and Chalcolithic Iranians (Iran_ChL); red). For each target group, darker shades mark more negative f3 values. b, Weights of donor populations in two sources characterizing the main admixture signal (date 1 and PC1) in the GLOBETROTTER analysis. We merged 167 donor populations into 12 groups (top right). Target populations were split into five groups (from top to bottom): Aleuts; the forest-tundra cline populations; the steppe-forest cline populations; the southern steppe cline populations; and ‘others’.

Also remarkable is the lack of comparison of Uralic populations with other neighbouring ones, since the described Uralic-like ancestry of Russians was already known, and is most likely due to the recent acculturation of Uralic-speaking peoples in the cradle of Russians, right before their eastward expansions.

west-eurasian-east-eurasian-ancestry
Supplementary Fig. 4. ADMIXTURE results qualitatively support PCA-based grouping of inner Eurasians into three clines. (A) Most southern steppe cline populations derive a higher proportion of their total Western Eurasian ancestry from a source related to Caucasus, Iran and South Asian populations. (B) Turkic- and Mongolic-speaking populations tend to derive their Eastern Eurasian ancestry more from the Devil’s Gate related one than from Nganasan-related one, while the opposite is true for Uralic- and Yeiseian-speakers. To estimate overall western Eurasian ancestry proportion, we sum up four components in our ADMIXTURE results (K=14), which are the dominant components in Neolithic Anatolians (“Anatolia_N”), Mesolithic western European hunter-gatherers (“WHG”), early Holocene Caucasus hunter-gatherers (“CHG”) and Mala from southern India, respectively. The “West / South Asian ancestry” is a fraction of it, calculated by summing up the last two components. To estimate overall Eastern Eurasian ancestry proportion, we sum up six components, most prevalent in Surui, Chipewyan, Itelmen, Nganasan, Atayal and early Neolithic Russian Far East individuals (“Devil’s Gate”). Eurasians into three clines. (A) Most southern steppe cline populations derive a higher proportion of their total Western Eurasian ancestry from a source related to Caucasus, Iran and South Asian populations. (B) Turkic- and Mongolic-speaking populations tend to derive their Eastern Eurasian ancestry more from the Devil’s Gate related one than from Nganasan-related one, while the opposite is true for Uralic- and Yeiseian-speakers. To estimate overall western Eurasian ancestry proportion, we sum up four components in our ADMIXTURE results (K=14), which are the dominant components in Neolithic Anatolians (“Anatolia_N”), Mesolithic western European hunter-gatherers (“WHG”), early Holocene Caucasus hunter-gatherers (“CHG”) and Mala from southern India, respectively. The “West / South Asian ancestry” is a fraction of it, calculated by summing up the last two components. To estimate overall Eastern Eurasian ancestry proportion, we sum up six components, most prevalent in Surui, Chipewyan, Itelmen, Nganasan, Atayal and early Neolithic Russian Far East individuals (“Devil’s Gate”).

A comparison of Estonians and Finns with Balts, Scandinavians, and Eastern Europeans would have been more informative for the division of the different so-called “Nganasan-like meta-populations”, and to ascertain which one of these ancestral peoples along the ancient WHG:ANE cline could actually be connected (if at all) to the Cis-Urals.

Because, after all, based on linguistics and archaeology, geneticists are not supposed to be looking for populations from the North Asian Arctic region, for “Siberian ancestry”, or for haplogroup N1c – despite previous works by their peers – , but for the Bronze Age Volga-Kama region…

Related

N1c-L392 associated with expanding Turkic lineages in Siberia

haplogroup-n1c-tat

Second in popularity for the expansion of haplogroup N1a-L392 (ca. 4400 BC) is, apparently, the association with Turkic, and by extension with Micro-Altaic, after the Uralic link preferred in Europe; at least among certain eastern researchers.

New paper in a recently created journal, by the same main author of the group proposing that Scythians of hg. N1c were Turkic speakers: On the origins of the Sakhas’ paternal lineages: Reconciliation of population genetic / ancient DNA data, archaeological findings and historical narratives, by Tikhonov, Gurkan, Demirdov, and Beyoglu, Siberian Research (2019).

Interesting excerpts:

According to the views of a number of authoritative researchers, the Yakut ethnos was formed in the territory of Yakutia as a result of the mixing of people from the south and the autochthonous population [34].

These three major Sakha paternal lineages may have also arrived in Yakutia at different times and/ or from different places and/or with a difference in several generations instead, or perhaps Y-chromosomal STR mutations may have taken place in situ in Yakutia. Nevertheless, the immediate common ancestor(s) from the Asian Steppe of these three most prevalent Sakha Y-chromosomal STR haplotypes possibly lived during the prominence of the Turkic Khaganates, hence the near-perfect matches observed across a wide range of Eurasian geography, including as far as from Cyprus in the West to Liaoning, China in the East, then Middle Lena in the North and Afghanistan in the South (Table 3 and Figure 5). There may also be haplotypes closely-related to ‘the dominant Elley line’ among Karakalpaks, Uzbeks and Tajiks, however, limitations in the loci coverage for the available dataset (only eight Y-chromosomal STR loci) precludes further conclusions on this matter [25].

yakutia-haplogroup-n1c
17-loci median-joining network analysis of the original/dominant Elley, Unknown and Omogoy Y-chromosomal STR haplotypes with the YHRD matches from outside Yakutia populations.

According to the results presented here, very similar Y-STR haplotypes to that of the original Elley line were found in the west: Afghanistan and northern Cyprus, and in the east: Liaoning Province, China and Ulaanbaator, Northern Mongolia. In the case of the dominant Omogoy line, very closely matching haplotypes differing by a single mutational step were found in the city of Chifen of the Jirin Province, China. The widest range of similar haplotypes was found for the Yakut haplotype Unknown: In Mongolia, China and South Korea. For instance, haplotypes differing by a single step mutation were found in Northern Mongolia (Khalk, Darhad, Uryankhai populations), Ulaanbaator (Khalk) and in the province of Jirin, China (Han population).

n1c-uralic-altaic-siberia
14-loci median-joining network analysis for the original/dominant Elley (Ell), Unknown Clan
(Vil), Omogoy (Omo), Eurasian (Eur) and Xiongnu (Xuo) Y-chromosomal STR haplotypes and that for a representative ancient DNA sample (Ch0 or DSQ04) from the Upper Xiajiadian Culture
recovered from the Inner Mongolia Autonomous Region, China.

Notably, Tat-C-bearing Y-chromosomes were also observed in ancient DNA samples from the 2700-3000 years-old Upper Xiajiadian culture in Inner Mongolia, as well as those from the Serteya II site at the Upper Dvina region in Russia and the ‘Devichyi gory’ culture of long barrow burials at the Nevel’sky district of Pskovsky region in Russia. A 14-loci Y-chromosomal STR median-joining network of the most prevalent Sakha haplotypes and a Tat-C-bearing haplotype from one of the ancient DNA samples recovered from the Upper Xiajiadian culture in Inner Mongolia (DSQ04) revealed that the contemporary Sakha haplotype ‘Xuo’ (Table 2, Haplotype ID “Xuo”) classified as that of ‘the Xiongnu clan’ in our current study, was the closest to the ancient Xiongnu haplotype (Figure 6). TMRCA estimate for this 14-loci Y-chromosomal STR network was 4357 ± 1038 years or 2341 ± 1038 BCE, which correlated well with the Upper Xiajiadian culture that was dated to the Late Bronze Age (700-1000 BCE).

eurasian-n-subclades
Geographical location of ancient samples belonging to major clade N of the Y-chromosome.

NOTE. Also interesting from the paper seems to be the proportion of E1b1b among admixed Russian populations, in a proportion similar to R1a or I2a(xI2a1).

It is tempting to associate the prevalent presence of N1c-L392 in ancient Siberian populations with the expansion of Altaic, by simplistically linking the findings (in chronological order) near Lake Baikal (Damgaard et al. 2018), Upper Xiajiadian (Cui et al. 2013), among Khövsgöl (Jeong et al. 2018), in Huns (Damgaard et al. 2018), and in Mongolic-speaking Avars (Csáky et al. 2019).

However, its finding among Palaeo-Laplandic peoples in the Kola peninsula ca. 1500 BC (Lamnidis et al. 2018) and among Palaeo-Siberian populations near the Yana River (Sikora et al. 2018) ca. AD 1200 should be enough to accept the hypothesis of ancestral waves of expansion of the haplogroup over northern Eurasia, with acculturation and further expansions in the different regions since the Iron Age (see more on its potential expansion waves).

Also, a simple look at the TMRCA and modern distribution was enough to hypothesize long ago the lack of connection of N1c-L392 with Altaic or Uralic peoples. From Ilumäe et al. (2016):

Previous research has shown that Y chromosomes of the Turkic-speaking Yakuts (Sakha) belong overwhelmingly to hg N3 (formerly N1c1). We found that nearly all of the more than 150 genotyped Yakut N3 Y chromosomes belong to the N3a2-M2118 clade, just as in the Turkic-speaking Dolgans and the linguistically distant Tungusic-speaking Evenks and Evens living in Yakutia (Table S2). Hence, the N3a2 patrilineage is a prime example of a male population of broad central Siberian ancestry that is not intrinsic to any linguistically defined group of people. Moreover, the deepest branch of hg N3a2 is represented by a Lebanese and a Chinese sample. This finding agrees with the sequence data from Hallast et al., where one Turkish Y chromosome was also assigned to the same sub-clade. Interestingly, N3a2 was also found in one Bhutan individual who represents a separate sub-lineage in the clade. These findings show that although N3a2 reflects a recent strong founder effect primarily in central Siberia (Yakutia, Sakha), the sub-clade has a much wider distribution area with incidental occurrences in the Near East and South Asia.

haplogroup-n1a-M2118
Frequency-Distribution Maps of Individual Sub-clades of hg N3a2, by Ilumäe et al. (2016).

The most striking aspect of the phylogeography of hg N is the spread of the N3a3’6-CTS6967 lineages. Considering the three geographically most distant populations in our study—Chukchi, Buryats, and Lithuanians—it is remarkable to find that about half of the Y chromosome pool of each consists of hg N3 and that they share the same sub-clade N3a3’6. The fractionation of N3a3’6 into the four sub-clades that cover such an extraordinarily wide area occurred in the mid-Holocene, about 5.0 kya (95% CI = 4.4–5.7 kya). It is hard to pinpoint the precise region where the split of these lineages occurred. It could have happened somewhere in the middle of their geographic spread around the Urals or further east in West Siberia, where current regional diversity of hg N sub-lineages is the highest (Figure 1B). Yet, it is evident that the spread of the newly arisen sub-clades of N3a3’6 in opposing directions happened very quickly. Today, it unites the East Baltic, East Fennoscandia, Buryatia, Mongolia, and Chukotka-Kamchatka (Beringian) Eurasian regions, which are separated from each other by approximately 5,000–6,700 km by air. N3a3’6 has high frequencies in the patrilineal pools of populations belonging to the Altaic, Uralic, several Indo-European, and Chukotko-Kamchatkan language families. There is no generally agreed, time-resolved linguistic tree that unites these linguistic phyla. Yet, their split is almost certainly at least several millennia older than the rather recent expansion signal of the N3a3’6 sub-clade, suggesting that its spread had little to do with linguistic affinities of men carrying the N3a3’6 lineages.

haplogroup_n3a3
Frequency-Distribution Maps of Individual Subclade N3a3 / N1a1a1a1a1a-CTS2929/VL29.

It was thus clear long ago that N1c-L392 lineages must have expanded explosively in the 5th millennium through Northern Eurasia, probably from a region to the north of Lake Baikal, and that this expansion – and succeeding ones through Northern Eurasia – may not be associated to any known language group until well into the common era.

Related

The cradle of Russians, an obvious Finno-Volgaic genetic hotspot

pskov-novgorod-russia

First look of an accepted manuscript (behind paywall), Genome-wide sequence analyses of ethnic populations across Russia, by Zhernakova et al. Genomics (2019).

Interesting excerpts:

There remain ongoing discussions about the origins of the ethnic Russian population. The ancestors of ethnic Russians were among the Slavic tribes that separated from the early Indo-European Group, which included ancestors of modern Slavic, Germanic and Baltic speakers, who appeared in the northeastern part of Europe ca. 1,500 years ago. Slavs were found in the central part of Eastern Europe, where they came in direct contact with (and likely assimilation of) the populations speaking Uralic (Volga-Finnish and Baltic- Finnish), and also Baltic languages [11–13]. In the following centuries, Slavs interacted with the Iranian-Persian, Turkic and Scandinavian peoples, all of which in succession may have contributed to the current pattern of genome diversity across the different parts of Russia. At the end of the Middle Ages and in the early modern period, there occurred a division of the East Slavic unity into Russians, Ukrainians and Belarusians. It was the Russians who drove the colonization movement to the East, although other Slavic, Turkic and Finnish peoples took part in this movement, as the eastward migrations brought them to the Ural Mountains and further into Siberia, the Far East, and Alaska. During that interval, the Russians encountered the Finns, Ugrians, and Samoyeds speakers in the Urals, but also the Turkic, Mongolian and Tungus speakers of Siberia. Finally, in the great expanse between the Altai Mountains on the border with Mongolia, and the Bering Strait, they encountered paleo-Asiatic groups that may be genetically closest to the ancestors of the Native Americans. Today’s complex patchwork of human diversity in Russia has continued to be augmented by modern migrations from the Caucasus, and from Central Asia, as modern economic migrations take shape.

pskov-novgorod-pca-eurasia-yakut
Sample relatedness based on genotype data. Eurasia: Principal Component plot of 574 modern Russian genomes. Colors reflect geographical regions of collection; shapes reflect the sample source. Red circles show the location of Genome Russia samples.

In the current study, we annotated whole genome sequences of individuals currently living on the territory of Russia and identifying themselves as ethnic Russian or as members of a named ethnic minority (Fig. 1). We analyzed genetic variation in three modern populations of Russia (ethnic Russians from Pskov and Novgorod regions and ethnic Yakut from the Sakha Republic), and compared them to the recently released genome sequences collected from 52 indigenous Russian populations. The incidence of function-altering mutations was explored by identifying known variants and novel variants and their allele frequencies relative to variation in adjacent European, East Asian and South Asian populations. Genomic variation was further used to estimate genetic distance and relationships, historic gene flow and barriers to gene flow, the extent of population admixture, historic population contractions, and linkage disequilibrium patterns. Lastly, we present demographic models estimating historic founder events within Russia, and a preliminary HapMap of ethnic Russians from the European part of Russia and Yakuts from eastern Siberia.

pskov-novgorod-pca-finno-permic
Sample relatedness based on genotype data. Western Russia and neighboring countries: Principal Component plot of 574 modern Russian genomes. Colors reflect geographical regions of collection; shapes reflect the sample source. Red circles show the location of Genome Russia samples.

The collection of identified SNPs was used to inspect quantitative distinctions among 264 individuals from across Eurasia (Fig. 1) using Principal Component Analysis (PCA) (Fig. 2). The first and the second eigenvectors of the PCA plot are associated with longitude and latitude, respectively, of the sample locations and accurately separate Eurasian populations according to geographic origin. East European samples cluster near Pskov and Novgorod samples, which fall between northern Russians, Finno-Ugric peoples (Karelian, Finns, Veps etc.), and other Northeastern European peoples (Swedes, Central Russians, Estonian, Latvians, Lithuanians, and Ukrainians) (Fig. 2b). Yakut individuals map into the Siberian sample cluster as expected (Fig. 2a). To obtain an extended view of population relationships, we performed a maximum likelihood-based estimation of ancestry and population structure using ADMIXTURE [46](Fig. 2c). The Novgorod and Pskov populations show similar profiles with their Northeastern European ancestors while the Yakut ethnic group showed mixed ancestry similar to the Buryat and Mongolian groups.

pskov-novgorod-yakut-admixture
Population structure across samples in 178 populations from five major geographic regions (k=5). Samples are pooled across three different studies that covered the territory of Russian Federation (Mallick et al. 2016 [36], Pagani et al. 2016 [37], this study). The optimal k-value was selected by value of cross validation error. Russian samples from all studies (highlighted in bold dark blue) show a slight gradient from Eastern European (Ukrainian, Belorussian, Polish) to North European (Estonian Karelian, Finnish) structures, reflecting population history of northward expansion. Yakut samples from different studies (highlighted in bold red) also show a slight gradient from Mongolian to Siberian people (Evens), as expected from their original admixture and northward expansions. The samples originated from this study are highlighted, and plotted in separated boxes below.

Possible admixture sources of the Genome Russia populations were addressed more formally by calculating F3 statistics, which is an allele frequency-based measure, allowing to test if a target population can be modeled as a mixture of two source populations [48]. Results showed that Yakut individuals are best modeled as an admixture of Evens or Evenks with various European populations (Supplemental Table S4). Pskov and Novgorod showed admixture of European with Siberian or Finno-Ugric populations, with Lithuanian and Latvian populations being the dominant European sources for Pskov samples.

direction-expansion-russians
The heatmaps of gene flow barriers show for each point at the geographical map the interpolated differences in allele frequencies (AF) between the estimated AF at the point with AFs in the vicinity of this point. The direction of the maximal difference in allele frequencies is coded by colors and arrows.

So, Russians expanding in the Middle Ages as acculturaded Finno-Volgaic peoples.

Or maybe the true Germano-Slavonic™-speaking area was in north-eastern Europe, until the recent arrival of Finno-Permians with the totally believable Nganasan-Saami horde, whereas Yamna -> Bell Beaker represented Vasconic-Caucasian expanding all over Europe in the Bronze Age. Because steppe ancestry in Fennoscandia and Modern Basques in Iberia.

A really hard choice between equally plausible models.

Related

Magyar tribes brought R1a-Z645, I2a-L621, and N1a-L392(xB197) lineages to the Carpathian Basin

hungarian-conquerors-turks

The Nightmare Week of “N1c=Uralic” proponents (see here) continues, now with preprint Y-chromosome haplogroups from Hun, Avar and conquering Hungarian period nomadic people of the Carpathian Basin, by Neparaczki et al. bioRxiv (2019).

Abstract:

Hun, Avar and conquering Hungarian nomadic groups arrived into the Carpathian Basin from the Eurasian Steppes and significantly influenced its political and ethnical landscape. In order to shed light on the genetic affinity of above groups we have determined Y chromosomal haplogroups and autosomal loci, from 49 individuals, supposed to represent military leaders. Haplogroups from the Hun-age are consistent with Xiongnu ancestry of European Huns. Most of the Avar-age individuals carry east Eurasian Y haplogroups typical for modern north-eastern Siberian and Buryat populations and their autosomal loci indicate mostly unmixed Asian characteristics. In contrast the conquering Hungarians seem to be a recently assembled population incorporating pure European, Asian and admixed components. Their heterogeneous paternal and maternal lineages indicate similar phylogeographic origin of males and females, derived from Central-Inner Asian and European Pontic Steppe sources. Composition of conquering Hungarian paternal lineages is very similar to that of Baskhirs, supporting historical sources that report identity of the two groups.

Interesting excerpts (emphasis mine):

All N-Hg-s identified in the Avars and Conquerors belonged to N1a1a-M178. We have tested 7 subclades of M178; N1a1a2-B187, N1a1a1a2-B211, N1a1a1a1a3-B197, N1a1a1a1a4-M2118, N1a1a1a1a1a-VL29, N1a1a1a1a2-Z1936 and the N1a1a1a1a2a1c1-L1034 subbranch of Z1936. The European subclades VL29 and Z1936 could be excluded in most cases, while the rest of the subclades are prevalent in Siberia 23 from where this Hg dispersed in a counter-clockwise migratory route to Europe (…). All the 5 other Avar samples belonged to N1a1a1a1a3-B197, which is most prevalent in Chukchi, Buryats, Eskimos, Koryaks and appears among Tuvans and Mongols with lower frequency.

haplogroup-n-pca
First two components of PCA from Hg N1a subbranch distribution in 51 populations including Avars and Conquerors. Colors indicate geographic regions. Three letter codes are given in Supplementary Table S5.

By contrast two Conquerors belonged to N1a1a1a1a4-M2118, the Y lineage of nearly all Yakut males, being also frequent in Evenks, Evens and occurring with lower frequency among Khantys, Mansis and Kazakhs.

Three Conqueror samples belonged to Hg N1a1a1a1a2-Z1936 , the Finno-Permic N1a branch, being most frequent among northeastern European Saami, Finns, Karelians, as well as Komis, Volga Tatars and Bashkirs of the Volga-Ural region.Nevertheless this Hg is also present with lower frequency among Karanogays, Siberian Nenets, Khantys, Mansis, Dolgans, Nganasans, and Siberian Tatars.

The west Eurasian R1a1a1b1a2b-CTS1211 subclade of R1a is most frequent in Eastern Europe especially among Slavic people. This Hg was detected just in the Conqueror group (K2/18, K2/41 and K1/10). Though CTS1211 was not covered in K2/36 but it may also belong to this sub-branch of Z283.

Hg I2a1a2b-L621 was present in 5 Conqueror samples, and a 6th sample form Magyarhomorog (MH/9) most likely also belongs here, as MH/9 is a likely kin of MH/16 (see below). This Hg of European origin is most prominent in the Balkans and Eastern Europe, especially among Slavic speaking groups. It might have been a major lineage of the Cucuteni-Trypillian culture and it was present in the Baden culture of the Chalcolithic Carpathian Basin.

hungarian-conquerors-y-dna
Image modified from the paper, with drawn red square around lineages of likely Ugric origin, and squares around R1a-Z93, R1a-Z283, N1a-Z1936, and N1a-M2004 samples. Y-Hg-s determined from 46 males grouped according to sample age, cemetery and Hg. Hg designations are given according to ISOGG Tree 2019. Grey shading designate distinguished individuals with rich grave goods, color shadings denote geographic origin of Hg-s according to Fig. 1. For samples K3/1 and K3/3 the innermost Hg defining marker U106* was not covered, but had been determined previously.

We identified potential relatives within Conqueror cemeteries but not between them. The uniform paternal lineages of the small Karos3 (19 graves) and Magyarhomorog (17 graves) cemeteries approve patrilinear organization of these communities. The identical I2a1a2b Hg-s of Magyarhomorog individuals appears to be frequent among high-ranking Conquerors, as the most distinguished graves in the Karos2 and 3 cemeteries also belong to this lineage. The Karos2 and Karos3 leaders were brothers with identical mitogenomes 11 and Y-chromosomal STR profiles (Fóthi unpublished). The Sárrétudvari commoner cemetery seems distinct from the others, containing other sorts of European Hg-s. Available Y-chromosomal and mtDNA data from this cemetery suggest that common people of the 10th century rather represented resident population than newcomers. The great diversity of Y Hg-s, mtDNA Hg-s, phenotypes and predicted biogeographic classifications of the Conquerors indicate that they were relatively recently associated from very diverse populations.

Surprising about the Hungarian conquerors – although in line with the historical accounts – is the varied patrilineal origin of clans, including Q1a, G2a2b, I1, E1b1b, R1b, J1, or J2 – some of which (depending on specific lineages) may have appeared earlier in the Carpathian Basin or south-eastern Europe.

However, out of the 27 conqueror elite samples, 17 are of haplogroups most likely related to Ugric populations beyond the Urals: R1a-Z645, I2-L621, and two specific N1a-L392 lineages (see below). In fact, there are three high-ranking conqueror elites of hg. I2-L621 (one of them termed a “leader”, brother to an unpublished leader of Karos3, and all of them possibly family), one of hg. R1a-Z280, one of hg. R1a-Z93 (which should be added to the Árpáds), and one of hg. N1a-Z1936, which gives a good idea of the ruling class among the elite Ugric settlers.

NOTE. The Q1a sample is also likely to be found in the mixed population of the West Siberian forest-steppes, since it was found in Mesolithic-Neolithic samples from eastern Europe to Lake Baikal, and in Bronze Age Siberian groups, although admittedly it may have formed part of an Avar Transtisza group, or even earlier Hunnic or Scythian groups along the steppes. Without precise subclades it’s impossible to know.

arrival-of-hungarians-arpad
The seven chieftains of the Hungarians, detail of Arrival of the Hungarians, from Árpád Feszty’s and his assistants’ vast (1800 m2) cyclorama, painted to celebrate the 1000th anniversary of the Magyar conquest of Hungary, now displayed at the Ópusztaszer National Heritage Park in Hungary. Image from Wikipedia.

I2a-L621

I2a-L621 (xS17250) or I2a1b2 in the old nomenclature, is found in 6 early conquerors (including one leader), on a par with R1a and N samples. This haplogroup is found widely distributed in ancient samples, due to its early split (formed ca. 9200 BC, TMRCA ca. 4500 BC) and expansion, probably with Neolithic populations. I can’t seem to find samples of this early haplogroup from the Carpathian Basin, as mentioned in the text, although it wouldn’t be strange, because it appears also in Neolithic Iberia, and in modern populations from western Europe.

Nevertheless, I2a-L621 samples seem to be concentrated mainly in Mesolithic-Neolithic cultures of Fennoscandia, and appeared also in Sikora et al. (2017) in a sample of the High Middle Ages from Sunghir (ca. AD 1100-1200), probably from the Vladimir-Suzdalian Rus’, in a region where clearly tribes of Volga Finns were being assimilated at the time. The reported SNP call by Genetiker is A16681 (see Yfull), deep within I2a-CTS10228. It is possibly also behind a modern Saami from Chalmny Varre (ca. AD 1800) of hg. I2a in Lamnidis et al. (2018).

Lacking precise subclades from Hungarian conquerors this is pure speculation, but modern samples may also point to I2a-CTS10228 (formed ca. 3100 BC, TMRCA ca. 1800 BC) as a Finno-Ugric lineage in common with R1a, which must have expanded to the Urals and beyond with eastern Corded Ware groups or (more likely) succeeding cultures. This is in line with the association of certain I2a lineages with modern Uralic peoples or populations from their historical regions in eastern Europe, and linked thus to the most likely homeland of Uralians in the eastern European forests:

uralic-groups-haplogroup-r1a
Additional file 6: Table S5. Y chromosome haplogroup frequencies in Eurasia. Modified by me: in bold haplogroup N1c and R1a from Uralic-speaking populations, with those in red showing where R1a is the major haplogroup. Observe that all Uralic subgroups – Finno-Permic, Ugric, and Samoyedic – have some populations with a majority of R1a, and also of I lineages. Data from Tambets et al. (2018).

R1a-Z645

Regarding the important question of the ethnic makeup of Ugric populations stemming from the Urals, the most interesting (and expected) data is the presence of R1a-Z645 lineages among high-ranking conquerors, in particular four R1a-Z280 subclades proper of Finno-Ugrians.

This proves that, in line with the old split and expansion of R1a-CTS1211 (formed ca. 2600 BC, TMRCA ca. 2400 BC), and its finding in Bronze Age Fennoscandian samples, only some late R1a-Z280 (xZ92) lineages (see Z280 on YFull) may show a clear identification with early acculturated Uralic speakers, with the main early acculturated Balto-Slavic R1a haplogroup remaining R1a-M458.

I recently hypothesized this late connection of Slavs with very specific R1a-Z280 (xZ92) lineages based on analyses of modern populations (like Slovenians), because the connection of ancient Finno-Ugrians with modern Z92 samples was already evident:

(…) subclades of hg. R1a1a1b1a2-Z280 (xR1a1a1b1a2a-Z92) seem to have also been involved in early Slavic expansions, like R1a1a1b1a2b3a-CTS3402 (formed ca. 2200 BC, TMRCA ca. 2200 BC), found among modern West, South, and East Slavic populations and in Fennoscandia, prevalent e.g. among modern Slovenians which points to a northern origin of its expansion (Maisano Delser et al. 2018).

This finding also supports the expected shared R1a-Z280 lineages among ancient Finno-Ugric populations, as predicted from the study of modern Permic and Ugric peoples in Dudás et al. (2019).

r1a-z282-z280-z2125-distribution
Modified image, from Underhill et al. (2015). Spatial frequency distributions of Z282 (green) and Z93 (blue) affiliated haplogroups. Notice the distribution of R1a-Z280 (xZ92), i.e. R1a-M558, compared to the ancient Finno-Ugric distribution.

Furthermore, while we don’t have precise R1a-Z93 lineages to compare with the new Hunnic sample reported, we already know that some archaic R1a-Z2124 subclades stem from the forest-steppe areas of the Cis- and Trans-Urals, and the two newly reported R1a-Z93 Hungarian conqueror elites, like those of the Árpád dynasty, probably belong to them.

There is an obvious lack of continuity in specific paternal lineages among the Hunnic, the Avar, and the Conqueror periods, which makes any simplistic identification of all R1a-Z93 lineages as stemming from Avars, Huns, or the Iron Age Pontic-Caspian steppes clearly flawed. Comparing R1a-Z93 in Hungarian Conquerors with Huns is like comparing them with samples of the Srubna or earlier periods… Similarly, comparing the Hunnic R1b-U106 or the early Avar I1 to later Hungarian samples is not warranted without precise subclades, because they most likely correspond to different Germanic populations: Goths among Huns, then Longobards, then likely peoples descended from Franks and Irish Monks (the latter with R1b-P312).

N1a-L392

Second behind R1a subclades are, as expected, N1a-L392 (N1c in the old nomenclature).

Avars are dominated by a specific N1a-L392 subclade, N1a-B197, as we recently discovered in Csáky et al. (2019).

Hungarian conquerors show three N1a-Z1936 subclades, which is known to stem from the northern Ural region, including the Arctic (likely Palaeo-Laplandic peoples) and cross-stamped cultures of the northern Eurasian forests.

haplogroup_n3a4
Frequency-Distribution Maps of Individual Subclade N3a4 / N1a1a1a1a2-Z1936, probably with the Samic (first) and Fennic (later) expansions into Paleo-Lakelandic and Palaeo-Laplandic territories.

On the other hand, the two N1a-M2118 lineages are more clearly associated with Palaeo-Siberian populations east of the Urals, but became incorporated into the Ugric stock in the Trans-Urals region probably in the same way as N1a-Z1936, by infiltration from (and acculturation of) hunter-gatherers of forest and taiga cultures.

NOTE. You can read more about the infiltration of N1a lineages in the recent post Corded Ware—Uralic (IV): Hg R1a and N in Finno-Ugric and Samoyedic expansions, and in the specific sections for each Uralic group in A Clash of Chiefs.

haplogroup-n1a-M2118
Frequency-Distribution Maps of Individual Sub-clades of hg N3a2, by Ilumäe et al. (2016).

Conclusion

The picture offered by the paper on Hungarian Conquerors, while in line with historical accounts of multi-ethnic tribes incorporating regional lineages, shows nevertheless patrilineal clans clearly associated with Uralic peoples, in a distribution which could have been easily inferred from ancient Trans-Uralian forest-steppe cultures and modern samples (even regarding I2a-L621).

In spite of this, there is a great deal of discussion in the paper about specific N1a subclades in Hungarian conquerors, while the presence of R1a-Z280 (among early Magyar elites!) is interpreted, as always, as recently acculturated Slavs. This is sadly coupled with the simplistic identification of I2a-L621 as of local origin around the Carpathians.

The introduction of the paper to the history of Hungarians is also weird, for example giving credibility to the mythic accounts of the Árpád dynasty’s origin in Attila, which is in line, I guess, with what the authors intended to support all along, i.e. the association of Magyars with Turks from the Eurasian steppes, which they are apparently willing to achieve by relating them to haplogroup R1a-Z93

The conclusion is thus written to appease modern nation-building myths more than anything else, like many other papers before it:

It is generally accepted that the Hungarian language was brought to the Carpathian Basin by the Conquerors. Uralic speaking populations are characterized by a high frequency of Y-Hg N, which have often been interpreted as a genetic signal of shared ancestry. Indeed, recently a distinct shared ancestry component of likely Siberian origin was identified at the genomic level in these populations, modern Hungarians being a puzzling exception36. The Conqueror elite had a significant proportion of N Hgs, 7% of them carrying N1a1a1a1a4-M2118 and 10% N1a1a1a1a2-Z1936, both of which are present in Ugric speaking Khantys and Mansis. At the same time none of the examined Conquerors belonged to the L1034 subclade of Z1936, while all of the Khanty Z1936 lineages reported in 37 proved to be L1034 which has not been tested in the 23 study. Population genetic data rather position the Conqueror elite among Turkic groups, Bashkirs and Volga Tatars, in agreement with contemporary historical accounts which denominated the Conquerors as “Turks”. This does not exclude the possibility that the Hungarian language could also have been present in the obviously very heterogeneous, probably multiethnic Conqueror tribal alliance.

So, back to square one, and new circular reasoning: If ancient populations from north-eastern Europe believed to represent ancient Finno-Ugrians are of R1a-Z645 lineages, it’s because they were not Finno-Ugric speakers. If ancient and modern populations known to be of Finno-Ugric language show clear connections with R1a-Z645, it’s because they are “multi-ethnic”.

The only stable basis for discussion in genetic papers, apparently, is the own making of geneticists, with their traditional 2000s “R1a=Indo-European” and “N1c=Uralic”, coupled with national beliefs. It does not matter how many predictions based on that have been proven wrong, or how many predictions based on the Corded Ware = Uralic expansion have been proven right.

Related

R1a-Z280 and R1a-Z93 shared by ancient Finno-Ugric populations; N1c-Tat expanded with Micro-Altaic

Two important papers have appeared regarding the supposed link of Uralians with haplogroup N.

Avars of haplogroup N1c-Tat

Preprint Genetic insights into the social organisation of the Avar period elite in the 7th century AD Carpathian Basin, by Csáky et al. bioRxiv (2019).

Interesting excerpts (emphasis mine):

After 568 AD the Avars settled in the Carpathian Basin and founded the Avar Qaganate that was an important power in Central Europe until the 9th century. Part of the Avar society was probably of Asian origin, however the localisation of their homeland is hampered by the scarcity of historical and archaeological data.

Here, we study mitogenome and Y chromosomal STR variability of twenty-six individuals, a number of them representing a well-characterised elite group buried at the centre of the Carpathian Basin more than a century after the Avar conquest.

The Y-STR analyses of 17 males give evidence on a surprisingly homogeneous Y chromosomal composition. Y chromosomal STR profiles of 14 males could be assigned to haplogroup N-Tat (also N1a1-M46). N-Tat haplotype I was found in four males from Kunpeszér with identical alleles on at least nine loci. The full Y-STR haplotype I, reconstructed from AC17 with 17 detected STRs, is rare in our days. Only nine matches were found among haplotypes in YHRD database, such as samples from the Ural Region, Northern Europe (Estonia, Finland), and Western Alaska (Yupiks). We performed Median Joining (MJ) network analysis using N-Tat haplotypes with ten shared STR loci (Fig. 3, Table S9). All modern N-Tat samples included in the network had derived allele of L708 as well. Haplotype I (Cluster 1 in Fig. 3) is shared by eight populations on the MJ network among the 24 identical haplotypes. Cluster 1 represents the founding lineage, as it is described in Siberian populations, because this haplotype is shared by the most populations and it is more diverse than Cluster 2.

Nine males share N-Tat haplotype II (on a minimum of eight detected alleles), all of them buried in the Danube-Tisza Interfluve. We found 30 direct matches of this N-Tat haplotype II in the YHRD database, using the complete 17 STR Y-filer profile of AC1, AC12, AC14, AC15, AC19 samples. Most hits came from Mongolia (seven Buryats and one Khalkh) and from Russia (six Yakuts), but identical haplotypes also occur in China (five in Xinjiang and four in Inner Mongolia provinces). On the MJ network, this haplotype II is represented by Cluster 2 and is composed of 45 samples (including 32 Buryats) from six populations (Fig. 3).

y-str-haplogroup-n-mongolian-ugrians
Median Joining network of 162 N-Tat Y-STR haplotypes Allelic information of ten Y-STR loci were used for the network. Only those Avar samples were included, which had results for these ten Y-STR loci. The founder haplotype I (Cluster 1) is shared by eight populations including three Mongolian, three Székely, three northern Mansi, two southern Mansi, two Hungarian, eight Khanty, one Finn and two Avar (AC17, AC26) chromosomes. Haplotype II (Cluster 2) includes 45 haplotypes from six populations studied: 32 Buryats, two Mongolians, one Székely, one Uzbek, one Uzbek Madjar, two northern Mansi and six Avars (AC1, AC12, AC14, AC15, AC19 and KSZ 37). Haplotype III (indicated by a red arrow) is AC8. Information on the modern reference samples is seen in Table S9.

A third N-Tat lineage (type III) was represented only once in the Avar dataset (AC8), and has no direct modern parallels from the YHRD database. This haplotype on the MJ network (see red arrow in Fig. 3) seems to be a descendent from other haplotype cluster that is shared by three populations (two Buryat from Mongolia, three Khanty and one Northern Mansi samples). This haplotype cluster also differs one molecular step (locus DYS393) from haplotype II. We classified the Avar samples to downstream subgroup N-F4205 within the N-Tat haplogroup, based on the results of ours and Ilumäe et al.18 and constructed a second network (Fig. S4). The N-F4205 network results support the assumption that the N-Tat Avar samples belong to N-F4205 subgroup (see SI chapter 1d for more details).

Based on our calculation, the age of accumulated STR variance (TMRCA) within N-Tat lineage for all samples is 7.0 kya (95% CI: 4.9 – 9.2 kya), considering the core haplotype (Cluster 1) to be the founding lineage. Y haplogroup N-Tat was not detected by large scale Eurasian ancient DNA studies but it occurs in late Bronze Age Inner Mongolia and late medieval Yakuts, among them N-Tat has still the highest frequency.

Two males (AC4 and AC7) from the Transtisza group belong to two different haplotypes of Y-haplogroup Q1. Both Q1a-F1096 and Q1b-M346 haplotypes have neither direct nor one step neighbour matches in the worldwide YHRD database. A network of the Q1b-M346 haplotype shows that this male had a probable Altaian or South Siberian paternal genetic origin.

EDIT (5 APR 2019): The paper offers an interesting late sample before the arrival of Hungarian conquerors, although we don’t know which precise lineage the sample belongs to:

One sample in our dataset (HC9) comes from this population, and both his mtDNA (T1a1b) and Y chromosome (R1a) support Eastern European connections. (…) Furthermore, we excluded sample HC9 from population-genetic statistical analyses because it belongs to a later period (end of 7th – early 9th centuries)

Apparently, then, results are consistent with what was already known from studies of modern populations:

According to Ilumäe et al. study, the frequency peak of N-F4205 (N3a5-F4205) chromosomes is close to the Transbaikal region of Southern Siberia and Mongolia, and we conclude that most Avar N-Tat chromosomes probably originated from a common source population of people living in this area, completely in line with the results of Ilumäe et al.

haplogroup_n1
Geographic-Distribution Map of hg N3 from Ilumäe et al.

Finno-Ugrians share haplogroup R1a-Z280

Another paper, behind paywall, Genetic history of Bashkirian Mari and Southern Mansi ethnic groups in the Ural region, by Dudás et al. Molecular Genetics and Genomics (2019).

Interesting excerpts (emphasis mine):

Y‑chromosome diversity

The most frequent haplogroups of the Bashkirian Maris were N1b-P43 (42%), R1a-Z280 (16%), R1a-Z93 (16%), N1c-Tat (13%), and J2-M172 (7%). Furthermore, subgroup R1b-M343 accounted for 4% and I2a-P37 covered 2% of the lineages. None of the Mari N1c Y chromosomes belonged to the N1c subgroups investigated (L1034, VL29, Z1936).

In the case of the Southern Mansi males, the most frequent haplogroups were N1b-P43 (33%), N1c-L1034 (28%) and R1a-Z280 (19%). The frequencies of the remaining haplogroups were as follows: R1a-M458 (6%), I1-L22 (3%), I2a-P37 (3%), and R1b-P312 (3%). The haplotype and haplogroup diversities of the Bashkirian Mari group were 0.9929 and 0.7657, whereas these values for the Southern Mansi were 0.9984 and 0.7873, respectively. The results show that, in both populations, haplotypes are much more diverse than haplogroups.

bashkir-mari-southern-mansi
Haplogroup frequencies of the Bashkirian Mari and the Southern Mansi ethnic groups in Ural region

Genetic structure

(..) the studied Bashkirian Mari and Southern Mansi population groups formed a compact cluster along with two Khanty, Northern Mansi, Mari, and Estonian populations based on close Fst-genetic distances (< 0.05), with nonsignificant p values (p > 0.05) except for the Estonian population. All of these populations belong to the Finno-Ugric language family. Interestingly, the other Mansi population studied by Pimenoff et al. (2008) (pop # 38) was located a great distance from the Southern Mansi group (0.268). In addition, the Bashkir population (pop # 6) did not show a close genetic affinity to the Bashkirian Mari group (0.194), even though it is the host population. However, the Russian population from the Eastern European region of Russia (pop # 49) showed a genetic distance of 0.055 with the Southern Mansi group. All Hungarian speaking populations (pops 13, 22, 23, 24, 50, and 51) showed close genetic affinities to each other and to the neighbouring populations, but not to the two studied populations.

y-dna-hungarians-ugric-mansi
Multidimensional scaling (MDS) plot constructed on Fstgenetic distances of Y haplogroup frequencies of 63 populations compared. The haplogroup frequency data used for population comparison together with references are seen in Online Resource 2 (ESM_2). Pairwise Fst-genetic distances and p values between 63 populations were calculated as shown in Online Resource 3 (ESM_3) Fig. 4 Multidimensional scaling (MDS) plot constructed on Rstgenetic distances of 10 STR-based Y haplotype frequencies of 21 populations compared. Image modified to include labels of modern populations.

Phylogenetic analysis

Median-joining networks were constructed for:

N-P43 (earlier N1b):

(…) TMRCA estimates for this haplogroup were made for all P43 samples (n = 157) 8.7 kya (95% CI 6.7–10.8 kya), for the N-P43 Asian.

N1c-Tat:

(…) 75% of Buryats belonged to Haplotype 2, indicating that the Buryats studied by us is a young and isolated population (Bíró et al. 2015). Bashkirian Mari samples derive from Haplotype 2 via Haplotype 3 (see dark purple circles on the top of Fig. 6a). Haplotype 3 contained six males (2 Buryat, 1 Northern Mansi, and 3 Khanty samples from Pimenoff et al. 2008). The biggest Bashkirian Mari haplotype node (3 Mari samples) was positioned three mutational steps away from Haplotype 1 and the remaining Mari samples can be derived from this haplotype. Southern Mansi haplotypes were scattered within the network except for two, which formed a smaller haplotype node with two Northern Mansi and two Khanty samples from Pimenoff et al. (2008).

n1c-n-tat-uralic-ugric
Median-Joining Networks (MJ) of 153 N-Tat (a) and 26 N-L1034 (b) haplotypes constructed. The circle sizes are proportional to the haplotype frequencies. The smallest area is equivalent to one individual. For N-Tat network, we used data from Southern Mansi (n = 11), Bashkirian Mari (n = 6) samples with Hungarian (n = 12), Hungarian speaking Székely (n = 6), Northern Mansi (n = 14), Mongolian (n = 16), Buryat (n = 44), Finnish (n = 13), Uzbek Madjar (n = 2), Uzbek (n = 3), Khanty (n = 4) populations studied earlier by us (Fehér et al. 2015; Bíró et al. 2015) and Khanty (n = 18) and Mansi (n = 4) studied by Pimenoff et al. (2008)

R1a-Z280 haplotypes, shared by Maris, Mansis, and Hungarians, hence ancient Finno-Ugrians:

The founder R1a-Z280 haplotype was shared by four samples from four populations (1 Bashkirian Mari; 1 Southern Mansi; 1 Hungarian speaking Székely; and 1 Hungarian), as presented in Fig. 7 (Haplotype 1). Haplotype 2 included five males (3 Bashkirian Mari and 2 Hungarian), as it can be seen in Fig. 7. Haplotype 4 included two shared haplotypes (1 Bashkirian Mari and one Hungarian speaking Csángó). The remaining two Bashkirian Mari haplotypes differ from the founder haplotype (Haplotype 1) by two mutational steps via Hungarian or Hungarian and Bashkirian Mari shared haplotypes. Beside Haplotype 1, the remaining Southern Mansi haplotypes were shared with Hungarians (Haplotype 5 or turquoise blue and red-coloured circles above Haplotype 7) or with Hungarians and Hungarian speaking Székely group (Haplotypes 3, 5, and 6). Haplotype 7 included ten Hungarian speakers (Hungarian, Székely, and Csángó). One Hungarian and one Uzbek Khwarezm shared haplotype can be found in Fig. 7 as well (red and white-coloured circle). All the other haplotypes were scattered in the network. The age of accumulated STR variation within R1a-Z280 lineage for 93 samples is estimated to be 9.4 kya (95% CI 6.5–12.4 kya) considering Haplotype 1 (Fig. 7) to be the founder.

r1a-z280-ugrians
Median-Joining Networks (MJ) of 93 R1a-Z280 haplotypes constructed. The circle sizes are proportional to the haplotype frequencies. The smallest area is equivalent to one individual. We used haplotype data from Bashkirian Mari (n = 7), Southern Mansi (n = 7), Hungarian (n = 52), Hungarian speaking Székely (n = 11), Hungarian speaking Csángó (n = 10), Uzbek Ferghana (n = 2), Uzbek Tashkent (n = 1), Uzbek Khwarezm (n = 1) and Northern Mansi (n = 2) populations

R1a-Z93 as isolated lineages among Permic and Ugric populations:

Figure 8 depicts an MJ network of R1a-Z93* samples using 106 haplotypes from the 14 populations (Fig. 8). All of the Bashkirian Mari samples (7 haplotypes) formed a very isolated branch and differed from the one Hungarian haplotype (Fig. 8, see Haplotype 1) by seven mutational steps as well from two Uzbek Tashkent samples (see Haplotype 3). Another Hungarian sample shared two haplotypes of Uzbek Khwarezm samples in Haplotype 4. This haplotype can be derived from Haplotype 3 (Uzbek Tashkent). Haplotype 2 included one Hungarian and one Khakassian male. The remaining three Hungarian haplotypes are outliers in the network and are not shared by any sample. The other population samples included in the network either form independent clusters such as Altaians, Khakassians, Khanties, and Uzbek Madjars or were scattered in the network. The age of accumulated STR variation (TMRCA) within R1a-Z93* lineage for 106 samples is estimated as 11.6 kya (95% CI 9.3–14.0 kya) considering an Armenian haplotype (Fig. 8, “A”) to be the founder and the median haplotype.

r1a-z93-ugrians
Median-Joining Networks (MJ) of 106 R1a-Z93 haplotypes constructed. The circle sizes are proportional to the haplotype frequencies. The smallest area is equivalent to one individual. We used the next haplotype data: 7 Bashkirian Mari, 6 Khanty, 4 Uzbek Madjar, 5 Uzbek Ferghana, 9 Uzbek Tashkent, 7 Uzbek Khwarezm, 2 Mongolian, 2 Buryat, 6 Hungarian samples tested by us for this study or published earlier (Bíró et al. 2015) and populations (3 Armenian; 3 Afghan Tajik;
16 Altaian; 24 Khakassian; 12 Kyrgyz) from Underhill et al. (2015)

Comments

The results of modern populations for N (especially N1c) subclades show really wide clusters and ancient TMRCA, consistent with their known ancient and wide distribution in northern and eastern Eurasian groups, and thus with infiltration of different lineages with eastern nomads (and northern Arctic populations) coupled with later bottlenecks, as well as acculturation of groups.

EDIT (2 APR): Interesting is the specific subclade to which ancient Mongolic-speaking Avars belong (information from Yfull) N1c-F4205 (TMRCA ca. 500 BC), subclade of N1c-Y6058 (formed ca. 2800 BC, TMRCA ca. 2800 BC). This branch also gives the “European” branch N1c-CTS10760 (formed ca. 2800 BC, TMRCA ca. 2100 BC), and is subclade of a branch of N1c-L392 (formed ca. 4400 BC, TMRCA ca. 2800 BC). A northern expansion of N1c-L392 is probably represented by its branch N1c-Z1936 (formed ca. 2800, TMRCA ca. 2100 BC), the most likely candidate to appear in the Kola Peninsula in the Bronze Age as the Palaeo-Laplandic population (see here). Read more about potential routes of expansion of haplogroup N.

On the other hand, R1a-Z280 lineages form a tight cluster connecting Permic with Ugric groups, with R1a-Z93 showing early isolation (probably) between Cis-Urals and Trans-Urals regions. While both Corded Ware lineages in Finno-Ugrians are most likely related to the Abashevo expansion through Seima-Turbino and the Andronovo-like Horizon (and potentially later Eurasian expansions), a plausible hypothesis would be that Finno-Ugrians are related to an expansion of R1a-Z283 haplogroups (we already knew about the Finno-Permic connection), while the ancient connection between Permians and Hungarians with R1a-Z93 would correspond to this haplogroup’s potentially tighter link with an early Samoyedic split.

I don’t think that an explosive expansion of eastern Corded Ware groups of R1a-Z645 lineages will show a clear-cut division of haplogroups among Eastern Uralic groups, though, and culturally I doubt we will have such a clear image, either (similar to how the explosive expansion of Bell Beakers cannot be easily divided by regional/language group into R1b-L151 subclades before the known bottlenecks). Relevant in this regard are the known Z93 samples from the Árpád dynasty.

Nevertheless, this data may represent a slightly more recent wave of R1a-Z280 lineages linked to the expansion of Ugric into the Trans-Uralian region, after their split from Finno-Permic, still in close contact with Indo-Iranians in Poltavka and Sintashta-Potapovka, evident from the early and late Indo-Iranian borrowings, during a common period when Samoyedic had already separated.

Such a “Z283 over Z93” layer in the Trans-Urals (and Cis-Urals?) forest-steppes would be similar to the apparent replacement of Z284 by Z282 in the Eastern Baltic during the Bronze Age (possibly with the second or Estonian Battle Axe wave or, much more likely during later population movements). Such an early R1a-Z93 split could potentially be supported also by the separation into bottlenecks under “Northern” (R1a-Z283) Finno-Ugric-speaking Abashevo-related groups and “Southern” (R1a-Z93) acculturated Indo-Iranian-speaking Abashevo migrants developing Sintashta-Potapovka admixing with Poltavka R1b-Z2103 herders.

r1a-z282-z280-z2125-distribution
Modified image, from Underhill et al. (2015). Spatial frequency distributions of Z282 (green) and Z93 (blue) affiliated haplogroups.. Notice the potential Finno-Ugric-associated distribution of Z282 (especially R1a-M558, a Z280 subclade), the expansion of R1a-Z2123 subclades with Central Asian forest-steppe groups.

Conclusion

Let’s review some of the most common myths about Hungarians (and Finno-Ugrians in general) repeated ad nauseam, side by side with my assertions:

❌ N (especially N1c-Tat) in ancient and modern samples represent the True Uralic™ N1c peoples including Magyar tribes? Nope.

✅ Ancient N (especially N1c-Tat) lineages among Uralic populations expanded relatively recently, and differently in different regions (including eastern steppe nomads and northern arctic populations) not associated with a particular language or language group? Yep (read the series on Corded Ware = Uralic expansion).

❌ Modern Hungarian R1a-Z280 lineages represent the majority of the native population, poor Slavic ‘peasants’ from the Carpathian Basin, forcibly acculturated by a minority of bad bad Hungarian hordes? Nope.

✅ Modern Hungarian R1a-Z280 subclades represent Ugric lineages in common with ancient R1a-Z645 Finno-Ugric populations from north-eastern Europe and the Trans-Urals? Yep (see Avars and Ugrians).

❌ Modern Hungarian R1a-Z93 lineages represent acculturated Iranian/Turkic peoples from the steppes? Not likely.

✅ Modern Hungarian R1a-Z93 lineages represent a remnant of the expansion of Corded Ware to the east, potentially more clearly associated with Samoyedic? Much more likely.

finno-ugric-haplogroup-n
Map of archaeological cultures in north-eastern Europe ca. 8th-3rd centuries BC. [The Mid-Volga Akozino group not depicted] Shaded area represents the Ananino cultural-historical society. Fading purple arrows represent likely stepped movements of subclades of haplogroup N for centuries (e.g. Siberian → Ananino → Akozino → Fennoscandia [N-VL29]; Circum-Arctic → forest-steppe [N1, N2]; etc.). Blue arrows represent eventual expansions of Uralic peoples to the north. Modified image from Vasilyev (2002).

Sooo, the theory of a “diluted” Y-DNA in Modern Hungarians from originally fully N-dominated conquerors subjugating native R1a-Z280 Slavs from the Carpathian Basin is not backed up by genetic studies? The ethnic Iranian-Turkic R1a-Z93 federation in the steppes that ended up speaking Magyar is not real?? Who would’ve thunk.

Another true story whose rejection in genetics could not be predicted, like, not at all.

Totally unexpected, too, the drift of “R1a=IE” fans with the newest genetic findings towards a Molgen-like “Yamna/R1b = Vasconic-Caucasian”, “N1c = Uralic-Altaic”, and “R1a = the origin of the white world in Mother Russia”. So much for the supposed interest in “Steppe ancestry” and fancy statistics.

Related

Corded Ware—Uralic (IV): Hg R1a and N in Finno-Ugric and Samoyedic expansions

haplogroup-uralians

This is the fourth of four posts on the Corded Ware—Uralic identification:

Let me begin this final post on the Corded Ware—Uralic connection with an assertion that should be obvious to everyone involved in ethnolinguistic identification of prehistoric populations but, for one reason or another, is usually forgotten. In the words of David Reich, in Who We Are and How We Got Here (2018):

Human history is full of dead ends, and we should not expect the people who lived in any one place in the past to be the direct ancestors of those who live there today.

Haplogroup N

Another recurrent argument – apart from “Siberian ancestry” – for the location of the Uralic homeland is “haplogroup N”. This is as serious as saying “haplogroup R1” to refer to Indo-European migrations, but let’s explore this possibility anyway:

Ancient haplogroups

We have now a better idea of how many ancient migrations (previously hypothesized to be associated with westward Uralic migrations) look like in genetic terms. From Damgaard et al. (Science 2018):

These serial changes in the Baikal populations are reflected in Y-chromosome lineages (Fig. SA; figs. S24 to S27, and tables S13 and SI4). MAI carries the R haplogroup, whereas the majority of Baikal_EN males belong to N lineages, which were widely distributed across Northern Eurasia (29), and the Baikal_LNBA males all carry Q haplogroups, as do most of the Okunevo_EMBA as well as some present-day Central Asians and Siberians.

The only N1c1 sample comes from Ust’Ida Late Neolithic, 180km to the north of Lake Baikal, which – together with the Bronze Age sample from the Kola peninsula, and the medieval sample from Ust’Ida – gives a good idea of the overall expansion of N subclades and Siberian ancestry among the Circum-Arctic peoples of Eurasia, speakers of Palaeo-Siberian languages.

eurasian-n-subclades
Geographical location of ancient samples belonging to major clade N of the Y-chromosome.

Modern haplogroups

What we should expect from Uralic peoples expanding with haplogroup N – seeing how Yamna expands with R1b-L23, and Corded Ware expands with R1a-Z645 – is to find a common subclade spreading with Uralic populations. Let’s see if it works like that for any N-X subclade, in data from Ilumäe et al. (2016):

haplogroup_n1
Geographic-Distribution Map of hg N3 / N1c / N1a.

Within the Eurasian circum-Arctic spread zone, N3 and N2a reveal a well-structured spread pattern where individual sub-clades show very different distributions:

N1a1-M46 (or N-TAT), formed ca. 13900 BC, TMRCA 9800 BC

   N1a1a2-B187, formed ca. 9800 BC, TMRCA 1050 AD:

The sub-clade N3b-B187 is specific to southern Siberia and Mongolia, whereas N3a-L708 is spread widely in other regions of northern Eurasia.

     N1a1a1a-L708, formed ca. 6800 BC, TMRCA 5400 BC.

       N1a1a1a2-B211/Y9022, formed ca. 5400 BC, TMRCA 1900 BC:

The deepest clade within N3a is N3a1-B211, mostly present in the Volga-Uralic region and western Siberian Khanty and Mansi populations.

         N1a1a1a1a-L392/L1026), formed ca. 4400 BC, TMRCA 2800 BC:

The neighbor clade, N3a3’6-CTS6967, spreads from eastern Siberia to the eastern part of Fennoscandia and the Baltic States

haplogroup_n3a3
Frequency-Distribution Maps of Individual Subclade N3a3 / N1a1a1a1a1a-CTS2929/VL29, probably initially with Akozino warrior-traders.

           N1a1a1a1a1a-CTS2929/VL29, formed ca. 2100 BC, TMRCA 1600 BC:

In Europe, the clade N3a3-VL29 encompasses over a third of the present-day male Estonians, Latvians, and Lithuanians but is also present among Saami, Karelians, and Finns (Table S2 and Figure 3). Among the Slavic-speaking Belarusians, Ukrainians, and Russians, about three-fourths of their hg N3 Y chromosomes belong to hg N3a3.

In the post on Finno-Permic expansions, I depicted what seems to me the most likely way of infiltration of N1c-L392 lineages with Akozino warrior-traders into the western Finno-Ugric populations, with an origin around the Barents sea.

This includes the potential spread of (a minority of) N1c-B211 subclades due to contacts with Anonino on both sides of the Urals, through a northern route of forest and forest-steppe regions (equivalent to the distribution of Cherkaskul compared to Andronovo), given the spread of certain subclades in Ugric populations.

NOTE. An alternative possibility is the association of certain B211 subclades with a southern route of expansion with Pre-Scythian and Scythian populations, under whose influence the Ananino culture emerged -which would imply a very quick infiltration of certain groups of haplogroup N everywhere among Finno-Ugrics on both sides of the Urals – , and also the expansion of some subclades with Turkic-speaking peoples, who apparently expanded with alliances of different peoples. Both (Scythian and Turkic) populations expanded from East Asia, where haplogroup N (including N1c) was present since the Neolithic. I find this a worse model of expansion for upper clades, but – given the YFull estimates and the presence of this haplogroup among Turkic peoples – it is a possibility for many subclades.

           N1a1a1a1a2-Z1936, formed ca. 2800 BC, TMRCA 2400 BC:

The only notable exception from the pattern are Russians from northern regions of European Russia, where, in turn, about two-thirds of the hg N3 Y chromosomes belong to the hg N3a4-Z1936—the second west Eurasian clade. Thus, according to the frequency distribution of this clade, these Northern Russians fit better among other non-Slavic populations from northeastern Europe. N3a4 tends to increase in frequency toward the northeastern European regions but is also somewhat unexpectedly a dominant hg N3 lineage among most Turcic-speaking Volga Tatars and South-Ural Bashkirs.

haplogroup_n3a4
Frequency-Distribution Maps of Individual Subclade N3a4 / N1a1a1a1a2-Z1936, probably with the Samic (first) and Fennic (later) expansions into Paleo-Lakelandic and Palaeo-Laplandic territories.

The expansion of N1a-Z1936 in Fennoscandia is most likely associated with the expansion of Saami into asbestos ware-related territory (like the Lovozero culture) during the Late Iron Age – and mixture with its population – , and with the later Fennic expansion to the east and north, replacing their language, as well as with Arctic and forest populations assimilated during Permic, Ugric, and Samoyedic expansions to the north.

           N1a1a1a1a4-M2019 (previously N3a2), formed ca. 4400 BC, TMRCA 1700 BC:

Sub-hg N3a2-M2118 is one of the two main bifurcating branches in the nested cladistic structure of N3a2’6-M2110. It is predominantly found in populations inhabiting present-day Yakutia (Republic of Sakha) in central Siberia and at lower frequencies in the Khanty and Mansi populations, which exhibit a distinct Y-STR pattern (Table S7) potentially intrinsic to an additional clade inside the sub-hg N3a2

The second widespread sub-clade of hg N is N2a. (…):

   N1a2b-P43 (B523/FGC10846/Y3184), formed ca. 6800 BC, TMRCA ca. 2700 BC:

The absolute majority of N2a individuals belong to the second sub-clade, N2a1-B523, which diversified about 4.7 kya (95% CI = 4.0–5.5 kya). Its distribution covers the western and southern parts of Siberia, the Taimyr Peninsula, and the Volga-Uralic region with frequencies ranging from from 10% to 30% and does not extend to eastern Siberia (…)

haplogroup_n2
Geographic-Distribution Map of hg N2a1 / N1a2b-P43

The “European” branch suggested earlier from Y-STR patterns turned out to consist of two clades

     N1a2b2a-Y3185/FGC10847, formed ca. 2200 BC, TMRCA 800 BC:

N2a1-L1419, spread mainly in the northern part of that region.

     N1a2b2b1-B528/Y24382, formed ca. 900 BC, TMRCA ca. 900 BC:

N2a1-B528, spread in the southern Volga-Uralic region.

Haplogroup R1a

We also have a good idea of the distribution of haplogroup R1a-Z645 in ancient samples. Its subclades were associated with the Corded Ware expansion, and some of them fit quite well the early expansion of Finno-Permic, Ugric, and Samoyedic peoples to the east.

r1a-z282-z280-z2125-distribution
Modified image, from Underhill et al. (2015). Spatial frequency distributions of Z282 (green) and Z93 (blue) affiliated haplogroups.. Notice the potential Finno-Ugric-associated distribution of Z282 (especially R1a-M558, a Z280 subclade), the expansion of R1a-Z2123 subclades with Central Asian forest-steppe groups.

This is how the modern distribution of R1a among Uralians looks like, from the latest report in Tambets et al. (2018):

  • Among Fennic populations, Estonians and Karelians (ca. 1.1 million) have not suffered the greatest bottleneck of Finns (ca. 6-7 million), and show thus a greater proportion of R1a-Z280 than N1c subclades, which points to the original situation of Fennic peoples before their expansion. To trust Finnish Y-DNA to derive conclusions about the Uralic populations is as useful as relying on the Basque Y-DNA for the language spread by R1b-P312
  • Among Volga-Finnic populations, Mordovians (the closest to the original Uralic cluster, see above) show a majority of R1a lineages (27%).
  • Hungarians (ca. 13-15 million) represent the majority of Ugric (and Finno-Ugric) peoples. They are mainly R1a-Z280, also R1a-Z2123, have little N1c, and lack Siberian ancestry, and represent thus the most likely original situation of Ugric peoples in 4th century AD (read more on Avars and Hungarians).
  • Among Samoyedic peoples, the Selkup, the southernmost ones and latest to expand – that is, those not heavily admixed with Siberian populations – , also have a majority of R1a-Z2123 lineages (see also here for the original Samoyedic haplogroups to the south).

To understand the relevance of Hungarians for Ugric peoples, as well as Estonians, Karelians, and Mordovians (and northern Russians, Finno-Ugric peoples recently Russified) for Finno-Permic peoples, as opposed to the Circum-Arctic and East Siberian populations, one has to put demographics in perspective. Even a modern map can show the relevance of certain territories in the past:

population-density
Population density (people per km2) map of the world in 1994. From Wikipedia.

Summary of ancestry + haplogroups

Fennic and Samic populations seem to be clearly influenced by Palaeo-Laplandic peoples, whereas Volga-Finnic and especially Permic populations may have received gene flow from both, but essentially Palaeo-Siberian influence from the north and east.

The fact that modern Mansis and Khantys offer the highest variation in N1a subclades, and some of the highest “Siberian ancestry” among non-Nganasans, should have raised a red flag long ago. The fact that Hungarians – supposedly stemming from a source population similar to Mansis – do not offer the same amount of N subclades or Siberian ancestry (not even close), and offer instead more R1a, in common with Estonians (among Finno-Samic peoples) and Mordvins (among Volga-Finnic peoples) should have raised a still bigger red flag. The fact that Nganasans – the model for Siberian ancestry – show completely different N1a2b-P43 lineages should have been a huge genetic red line (on top of the anthropological one) to regard them as the Uralian-type population.

We know now that ethnolinguistic groups have usually expanded with massive (usually male-biased) migrations, and that neighbouring locals often ‘resurge’ later without changing the language. That is seen in Europe after the spread of Bell Beakers, with the increase of previous ancestry and lineages in Scandinavia during the formation of the Nordic ethnolinguistic community; in Central-West Europe, with the resurgence of Neolithic ancestry (and lineages) during the Bronze Age over steppe ancestry; and in Central-East Europe (with Unetice or East European Bronze Age groups like Mierzanowice, Trzciniec, or Lusatian) showing an increase in steppe ancestry (and resurge of R1a subclades); none of them represented a radical ethnolinguistic change.

finno-ugric-haplogroup-n
Map of archaeological cultures in north-eastern Europe ca. 8th-3rd centuries BC. [The Mid-Volga Akozino group not depicted] Shaded area represents the Ananino cultural-historical society. Fading purple arrows represent likely stepped movements of subclades of haplogroup N for centuries (e.g. Siberian → Ananino → Akozino → Fennoscandia [N-VL29]; Circum-Arctic → forest-steppe [N1, N2]; etc.). Blue arrows represent eventual expansions of Uralic peoples to the north. Modified image from Vasilyev (2002).

It is not hard to model the stepped arrival, infiltration, and/or resurge of N subclades and “Siberian ancestries”, as well as their gradual expansion in certain regions, associated with certain migrations first – such as the expansions to the Circum-Arctic region, and later the Scythian- and Turkic-related movements – , as well as limited regional developments, like the known bottleneck in Finns, or the clear late expansion of Ugric and Samoyedic languages to the north among nomadic Palaeo-Siberians due to traditions of exogamy and multilingualism. This fits quite well with the different arrival of N (N1c and xN1c) lineages to the different Uralic-speaking groups, and to the stepped appearance of “Siberian ancestry” in the different regions.

The aternative

It is evident that a lot of people were too attached to the idea of Palaeolithic R1b lineages ‘native’ to western Europe speaking Basque languages; of R1a lineages speaking Indo-European and spreading with Yamna; and N lineages ‘native’ to north-eastern Europe and speaking Uralic, and this is causing widespread weeping and gnashing of teeth (instead of the joy of discovering where one’s true patrilineal ancestors come from, and what language they spoke in each given period, which is the supposed objective of genetic genealogy…)

Since an Indo-Germanic branch (as revived now by some in the Copenhaguen group to fit Kristiansen’s theory of the 1980s with recent genetic data) does not make any sense in linguistics, the finding of R1a in Yamna would not have led where some think it would have, because North-West Indo-European would still be the main Late PIE branch in Europe. Don’t take my word for it; take James P. Mallory’s (2013).

mallory-adams-tree
The levels of Indo-European reconstruction, from Mallory & Adams (2006).

If an (unlikely) Indo-Slavonic group were posited, though, such a group would still be bound (with Indo-Iranian) to the steppes with East Yamna/Poltavka (admixing with Abashevo migrants, but retaining its language), developing Sintashta/Potapovka → Srubna/Andronovo, and R1a lineages would have equally undergone the known bottlenecks of the steppes where they replaced R1b-Z2103 – which this eastern group shares with Balkan languages, a haplogroup that links therefore together the Graeco-Aryan group.

As far as I know – and there might be many other similar pet theories out there – there have been proposals of “modern Balto-Slavic-like” populations (in an obvious circular reasoning based on modern populations) in some Scythian clusters of the Iron Age.

NOTE. I will not enter into “Balto-Slavic-like R1a” of the Late Bronze Age or earlier because no one can seriously believe at this point of development of Population Genetics that autosomal similarity predating 1,500+ years the appearance of Slavs equates to their (ethnolinguistic) ancestral population, without a clear intermediate cultural and genetic trail – something we lack today in the Slavic case even for the late Roman period…

finno-saamic-palaeo-germanic-substratum
The Finnic and Saamic separation looks shallower than it actually is. Invisible convergence can be ‘triangulated’ with the help of Germanic layers of mutual loanwords (Häkkinen 2012).

We also know of R1a-Z280 lineages in Srubna, probably expanding to the west. With that in mind, and knowing that Palaeo-Germanic was in close contact with Finno-Samic while both were already separated but still in contact, and that Palaeo-Germanic was also in contact and closely related to a ‘Temematic’ distinct from Balto-Slavic (and also that early Proto-Baltic and Proto-Slavic from the Roman Iron Age and later were in contact with western Uralic) this will be the linguistic map of the Iron Age if R1a is considered to expand Indo-European from some kind of “patron-client” relationship with west Yamna:

palaeo-germanic-italo-celtic
Eastern European language map during the Late Bronze Age / Iron Age, if R1a spread Indo-European languages and Eastern Yamna spoke Indo-Slavonic. Palaeo-Germanic (i.e. Pre- to Proto-Germanic) needs to be in contact with both the Samic Lovozero population and the Fennic west Circum-Arctic one. Italic and Celtic in contact with Pre-Germanic. Germanic in contact with Temematic. Balto-Slavic in contact with Iranian, and near Fennic to allow for later loanwords. For Germanic and Temematic, see Kortlandt (2018).

You might think I have some personal or political reason against this kind of proposals. I haven’t. We have been proposing Indo-European to be the language of the European Union for more than 10 years, so to support R1b-Italo-Celtic in the whole Western Europe, R1a-Germanic in Central and Eastern Europe, and R1a-Indo-Slavonic in the steppes (as the Danish group seems to be doing) has nothing inherently bad (or good) for me. If anything, it gives more reason to support the revival of North-West Indo-European in Europe.

My problem with this proposal is that it is obviously beholden to the notion of the uninterrupted cultural, historic and ethnic continuity in certain territories. This bias is common in historiography (von Falkenhausen 1993), but it extends even more easily into the lesser known prehistory of any territory, and now more than ever some people feel the need to corrupt (pre)history based on their own haplogroups (or the majority haplogroups of their modern countries). However, more than on philosophical grounds, my rejection is based on facts: this picture is not what the combination of linguistic, archaeological, and genetic data shows. Period.

Nevertheless, if Yamna + Corded Ware represented the “big and early expansion” of Germanic and Italo-Celtic peoples proper of the dream Nazi’s Lebensraum and Fascist’s spazio vitale proposals; Uralians were Siberian hunter-gatherers that controlled the whole eastern and northern Russia, and miraculously managed to push (ethnolinguistically) Neolithic agropastoralists to the west during and after the Iron Age, with gradual (and often minimal) genetic impact; and Balto-Slavic peoples were represented by horse riders from Pokrovka/Srubna, hiding then somewhere around the forest-steppe until after the Scythian expansion, and then spreading their language (without much genetic impact) during the early Middle Ages…so be it.

See also

Related