More Celts of hg. R1b, more Afanasievo ancestry, more maps


Interesting recent developments:

Celts and hg. R1b


Recent paper (behind paywall) Multi-scale archaeogenetic study of two French Iron Age communities: From internal social- to broad-scale population dynamics, by Fischer et al. J Archaeol Sci (2019).

In it, Fischer and colleagues update their previous data for the Y-DNA of Gauls from the Urville-Nacqueville necropolis, Normandy (ca. 300-100 BC), with 8 samples of hg. R, at least 5 of them R1b. They also report new data from the Gallic cemetery at Gurgy ‘Les Noisats’, Southern Paris Basin (ca. 120-80 BC), with 19 samples of hg. R, at least 13 of them R1b.

In both cases, it is likely that both communities belonged (each) to the same paternal lineages, hence the patrilocal residence rules and patrilineality described for Gallic groups, also supported by the different maternal gene pools.

The interesting data would be whether these individuals were of hg. R1b-L21, hence mainly local lineages later replaced or displaced to the west, or – a priori much more likely – of some R1b-U152 and/or R1b-DF27 subclades from Central Europe that became less and less prevalent as Celts expanded into more isolated regions south of the Pyrenees and into the British Isles. Such information is lacking in the paper, probably due to the poor coverage of the samples.

Y-DNA haplogroups in Europe during the Early Iron Age. See full map.

Other Celts

As for early Celts, we already have:

Celtiberians from the Basque Country (one of hg. I2a) and likely Celtic genetic influence in north-east Iberia (all R1b), where Iberian languages spread later, showing that Celts expanded from some place in Central Europe, probably already with the Urnfield culture (ca. 1300 BC on).

Two Hallstatt samples from Bylany, Bohemia (ca. 836-780 BC), by Damgaard et al. Nature (2018), one of them of hg. R1b-U152.

Photo and diagram of burial HÜ-I/8, Mitterkirchen, Oberösterreich, Leskovar 1998.

Another Hallstatt HaC/D1 sample from Mittelkirchen, Austria (ca. 850-650/600), by Kiesslich et al. (2012), with predicted hg. G2a (see Athey’s haplogroup prediction).

One sample of early La Tène culture A from Putzenfeld am Dürrnberg, Hallein, Austria (ca 450–380 BC), by Kiesslich et al. (2012), with predicted hg. R1b (see Athey’s haplogroup prediction).

NOTE. For potential unreliability of haplogroup prediction with Whit Atheys’ haplogroup predictor, see e.g. Zhang et al. (2017).

Photo and diagram of Burial 376, Putzenfeld, Dürrnberg bei Hallein, Moser 2007.

Three Britons from Hinxton, South Cambridgeshire (ca. 170 BC – AD 80) from Schiffels et al. (2016), two of them of local hg. R1b-S461.

Indirectly, data of Vikings by Margaryan et al. (2019) from the British Isles and beyond show hg. R1b associated with modern British-like ancestry, also linked to early “Picts”, hence likely associated with Britons even after the Anglo-Saxon settlement. Supporting both (1) my recent prediction of hg. R1b-M167 expanding with Celts and (2) the reason for its presence among modern Scandinavians, is the finding of the first ancient sample of this subclade (VK166) among the Vikings of St John’s College Oxford, associated with the ‘St Brice’s Day Massacre’ (see Margaryan et al. 2019 supplementary materials).

The R1b-M167 sample shows 23.5% British-like ancestry, hence autosomally closer to other local samples (and related to the likely Picts from Orkney) than to some of his deceased partners at the site. Other samples with sizeable British-like ancestry include VK177 (32.6%, hg. R1b-U152), VK173 (33.3%, hg. I2a1b1a), or VK150 (25.6%, hg. I2a1b1a), while typical Germanic subclades like I1 or R1b-U106 – which may be associated with Anglo-Saxons, too – tend to show less.

Y-DNA haplogroups in Europe during the Late Iron Age. See full map.

I remember some commenter asking recently what would happen to the theory of Proto-Indo-European-speaking R1b-rich Yamnaya culture if Celts expanded with hg. R1a, because there were only one hg. R1b and one (possibly) G2a from Hallstatt. As it turns out, they were mostly R1b. However, the increasingly frequent obsession of searching for specific haplogroups and ancestry during the Iron Age and the Middle Ages is weird, even as a desperate attempt, because:

  1. it is evident that the more recent the ancient DNA samples are, the more they are going to resemble modern populations of the same area, so ancient DNA would become essentially useless;
  2. cultures from the early Iron Age onward (and even earlier) were based on increasingly complex sociopolitical systems everywhere, which is reflected in haplogroup and ancestry variability, e.g. among Balts, East Germanic peoples, Slavs (of hg. E1b-V13, I2a-L621), or Tocharians.

In fact, even the finding of hg. R1b among Celts of central and western Europe during the Iron Age is rather unenlightening, because more specific subclades and information on ancestry changes are needed to reach any meaningful conclusion as to migration vs. acculturation waves of expanding Celtic languages, which spread into areas that were mostly Indo-European-speaking since the Bell Beaker expansion.

Afanasevo ancestry in Asia

Wang and colleagues continue to publish interesting analyses, now in the preprint Inland-coastal bifurcation of southern East Asians revealed by Hmong-Mien genomic history, by Xia et al. bioRxiv (2019).

Interesting excerpt (emphasis mine):

Although the Devil’s Cave ancestry is generally the predominant East Asian lineage in North Asia and adjacent areas, there is an intriguing discrepancy between the eastern [Korean, Japanese, Tungusic (except northernmost Oroqen), and Mongolic (except westernmost Kalmyk) speakers] and the western part [West Xiōngnú (~2,150 BP), Tiānshān Hun (~1,500 BP), Turkic-speaking Karakhanid (~1,000 BP) and Tuva, and Kalmyk]. Whereas the East Asian ancestry of populations in the western part has entirely belonged to the Devil’s Cave lineage till now, populations in the eastern part have received the genomic influence from an Amis-related lineage (17.4–52.1%) posterior to the presence of the Devil’s Cave population roughly in the same region (~7,600 BP)12. Analogically, archaeological record has documented the transmission of wet-rice cultivation from coastal China (Shāndōng and/or Liáoníng Peninsula) to Northeast Asia, notably the Korean Peninsula (Mumun pottery period, since ~3,500 BP) and the Japanese archipelago (Yayoi period, since ~2,900 BP)2. Especially for Japanese, the Austronesian-related linguistic influence in Japanese may indicate a potential contact between the Proto-Japonic speakers and population(s) affiliating to the coastal lineage. Thus, our results imply that a southern-East-Asian-related lineage could be arguably associated with the dispersal of wet-rice agriculture in Northeast Asia at least to some extent.

Spatial and temporal distribution of ancestries in East Asians. Reference populations and corresponding hypothesized ancestral populations: (1) Devil’s Cave (~7,600 BP), the northern East Asian lineage; (2) Amis, the southern East Asian lineage (= AHM + AAA + AAN); (3) Hòabìnhian (~7,900 BP), a lineage related to Andamanese and indigenous hunter-gatherer of MSEA; (4) Kolyma (~9,800 BP), “Ancient Palaeo-Siberians”; (5) Afanasievo (~4,800 BP), steppe ancestry; (6) Namazga (~5,200 BP), the lineage of Chalcolithic Central Asian. Here, we report the best-fitting results of qpAdm based on following criteria: (1) a feasible p-value (&mt; 0.05), (2) feasible proportions of all the ancestral components (mean &mt; 0 and standard error < mean), and (3) with the highest p-value if meeting previous conditions.

In this case, the study doesn’t compare Steppe_MLBA, though, so the findings of Afanasievo ancestry have to be taken with a pinch of salt. They are, however, compared to Namazga, so “Steppe ancestry” is there. Taking into account the limited amount of Yamnaya-like ancestry that could have reached the Tian Shan area with the Srubna-Andronovo horizon in the Iron Age (see here), and the amount of Yamnaya-like ancestry that appears in some of these populations, it seems unlikely that this amount of “Steppe ancestry” would emerge as based only on Steppe_MLBA, hence the most likely contacts of Turkic peoples with populations of both Afanasievo (first) and Corded Ware-derived ancestry (later) to the west of Lake Baikal.

(1) The simplification of ancestral components into A vs. B vs. C… (when many were already mixed), and (2) the simplistic selection of one OR the other in the preferred models (such as those published for Yamnaya or Corded Ware), both common strategies in population genomics pose evident problems when assessing the actual gene flow from some populations into others.

Also, it seems that when the “Steppe”-like contribution is small, both Yamnaya and Corded Ware ancestry will be good fits in admixed populations of Central Asia, due to the presence of peoples of EHG-like (viz. West Siberia HG) and/or CHG-like (viz. Namazga) ancestry in the area. Unless and until these problems are addressed, there is little that can be confidently said about the history of Yamnaya vs. Corded Ware admixture among Asian peoples.

Maps, maps, and more maps

As you have probably noticed if you follow this blog regularly, I have been experimenting with GIS software in the past month or so, trying to map haplogroups and ancestry components (see examples for Vikings, Corded Ware, and Yamnaya). My idea was to show the (pre)historical evolution of ancestry and haplogroups coupled with the atlas of prehistoric migrations, but I have to understand first what I can do with GIS statistical tools.

My latest exercise has been to map modern haplogroup distribution (now added to the main menu above) using data from the latest available reports. While there have been no great surprises – beyond the sometimes awful display of data by some papers – I think it is becoming clearer with each new publication how wrong it was for geneticists to target initially those populations considered “isolated” – hence subject to strong founder effects – to extrapolate language relationships. For example:

  • The mapping of R1b-M269, in particular basal subclades, corresponds nicely with the Indo-European expansions.
  • There is no clear relationship of R1b, not even R1b-DF27 (especially basal subclades), with Basques. There is no apparent relationship between the distribution of R1b-M269 and some mythical non-Indo-European “Old Europeans”, like Etruscans or Caucasian speakers, either.
  • Basal R1a-M417 shows an interesting distribution, as do maps of basal Z282 and Z93 subclades, despite the evident late bottlenecks and acculturation among Slavs.
  • The distribution of hg. N1a-VL29 (and other N1a-L392 subclades) is clearly dissociated from Uralic peoples, and their expansion in the whole Baltic Sea during the Iron Age doesn’t seem to be related to any specific linguistic expansion.
  • haplogroup-n1a-vl29
    Modern distribution of haplogroup N1a-VL29. See full map.
  • Even the most recent association in Post et al. (2019) with hg. N1a-Z1639 – due to the lack of relationship of Uralic with N1a-VL29 – seems like a stretch, seeing how it probably expanded from the Kola Peninsula and the East Urals, and neither the Lovozero Ware nor forest hunter-fishers of the Cis- and Trans-Urals regions were Uralic-speaking cultures.
  • The current prevalence of hg. R1b-M73 supports its likely expansion with Turkic-speaking peoples.
  • The distribution of haplogroup R1b-V88 in Africa doesn’t look like it was a mere founder effect in Chadic peoples – although they certainly underwent a bottleneck under it.
  • The distribution of R1a-M420 (xM198) and hg. R1b-M343 (possibly not fully depicted in the east) seem to be related to expansions close to the Caucasus, supporting once more their location in Eastern Europe / West Siberia during the Mesolithic.
  • The mapping of E1b-V13 and I-M170 (I haven’t yet divided it into subclades) are particularly relevant for the recent eastward expansion of early Slavic peoples.

All in all, modern haplogroup distribution might have been used to ascertain prehistoric language movements even in the 2000s. It was the obsession with (and the wrong assumptions about) the “purity” of certain populations – say, Basques or Finns – what caused many of the interpretation problems and circular reasoning we are still seeing today.

I have also updated maps of Y-chromosome haplogroups reported for ancient samples in Europe and/or West Eurasia for the Early Eneolithic, Early Chalcolithic, Late Chalcolithic, Early Bronze Age, Middle Bronze Age, Late Bronze Age, Early Iron Age, Late Iron Age, Antiquity, and Middle Ages.

Haplogroup inference

I have also tried Yleaf v.2 – which seems like an improvement over the infamous v.1 – to test some samples that hobbyists and/or geneticists have reported differently in the past. I have posted the results in this ancient DNA haplogroup page. It doesn’t mean that the inferences I obtain are the correct ones, but now you have yet another source to compare.

Not many surprises here, either:

  • M15-1 and M012, two Proto-Tocharians from Shirenzigou, are of hg. R1b-PH155, not R1b-M269.
  • I0124, the Samara HG, is of hg. R1b-P297, but uncertain for both R1b-M73 and R1b-M269.
  • I0122, the Khvalynsk chieftain, is of hg. R1b-V1636.
  • I2181, the Smyadovo outlier of poor coverage, is possibly of hg. R, and could be of hg. R1b-M269, but could also be even non-P.
  • I6561 from Alexandria is probably of hg. R1a-M417, likely R1a-Z645, maybe R1a-Z93, but can’t be known beyond that, which is more in line with the TMRCA of R1a subclades and the radiocarbon date of the sample.
  • I2181, the Yamnaya individual (supposedly Pre-R1b-L51) at Lopatino II is R1b-M269, negative for R1b-L51. Nothing beyond that.

You can ask me to try mapping more data or to test the haplogroup of more samples, provided you give me a proper link to the relevant data, they are interesting for the subject of this blog…and I have the time to do it.


Yamnaya ancestry: mapping the Proto-Indo-European expansions


The latest papers from Ning et al. Cell (2019) and Anthony JIES (2019) have offered some interesting new data, supporting once more what could be inferred since 2015, and what was evident in population genomics since 2017: that Proto-Indo-Europeans expanded under R1b bottlenecks, and that the so-called “Steppe ancestry” referred to two different components, one – Yamnaya or Steppe_EMBA ancestry – expanding with Pro-Indo-Europeans, and the other one – Corded Ware or Steppe_MLBA ancestry – expanding with Uralic speakers.

The following maps are based on formal stats published in the papers and supplementary materials from 2015 until today, mainly on Wang et al. (2018 & 2019), Mathieson et al. (2018) and Olalde et al. (2018), and others like Lazaridis et al. (2016), Lazaridis et al. (2017), Mittnik et al. (2018), Lamnidis et al. (2018), Fernandes et al. (2018), Jeong et al. (2019), Olalde et al. (2019), etc.

NOTE. As in the Corded Ware ancestry maps, the selected reports in this case are centered on the prototypical Yamnaya ancestry vs. other simplified components, so everything else refers to simplistic ancestral components widespread across populations that do not necessarily share any recent connection, much less a language. In fact, most of the time they clearly didn’t. They can be interpreted as “EHG that is not part of the Yamnaya component”, or “CHG that is not part of the Yamnaya component”. They can’t be read as “expanding EHG people/language” or “expanding CHG people/language”, at least no more than maps of “Steppe ancestry” can be read as “expanding Steppe people/language”. Also, remember that I have left the default behaviour for color classification, so that the highest value (i.e. 1, or white colour) could mean anything from 10% to 100% depending on the specific ancestry and period; that’s what the legend is for… But, fere libenter homines id quod volunt credunt.


  1. Neolithic or the formation of Early Indo-European
  2. Eneolithic or the expansion of Middle Proto-Indo-European
  3. Chalcolithic / Early Bronze Age or the expansion of Late Proto-Indo-European
  4. European Early Bronze Age and MLBA or the expansion of Late PIE dialects

1. Neolithic

Anthony (2019) agrees with the most likely explanation of the CHG component found in Yamnaya, as derived from steppe hunter-fishers close to the lower Volga basin. The ultimate origin of this specific CHG-like component that eventually formed part of the Pre-Yamnaya ancestry is not clear, though:

The hunter-fisher camps that first appeared on the lower Volga around 6200 BC could represent the migration northward of un-admixed CHG hunter-fishers from the steppe parts of the southeastern Caucasus, a speculation that awaits confirmation from aDNA.

Natural neighbor interpolation of CHG ancestry among Neolithic populations. See full map.

The typical EHG component that formed part eventually of Pre-Yamnaya ancestry came from the Middle Volga Basin, most likely close to the Samara region, as shown by the sampled Samara hunter-gatherer (ca. 5600-5500 BC):

After 5000 BC domesticated animals appeared in these same sites in the lower Volga, and in new ones, and in grave sacrifices at Khvalynsk and Ekaterinovka. CHG genes and domesticated animals flowed north up the Volga, and EHG genes flowed south into the North Caucasus steppes, and the two components became admixed.

Natural neighbor interpolation of EHG ancestry among Neolithic populations. See full map.

To the west, in the Dnieper-Dniester area, WHG became the dominant ancestry after the Mesolithic, at the expense of EHG, revealing a likely mating network reaching to the north into the Baltic:

Like the Mesolithic and Neolithic populations here, the Eneolithic populations of Dnieper-Donets II type seem to have limited their mating network to the rich, strategic region they occupied, centered on the Rapids. The absence of CHG shows that they did not mate frequently if at all with the people of the Volga steppes (…)

Natural neighbor interpolation of WHG ancestry among Neolithic populations. See full map.

North-West Anatolia Neolithic ancestry, proper of expanding Early European farmers, is found up to border of the Dniester, as Anthony (2007) had predicted.

Natural neighbor interpolation of Anatolia Neolithic ancestry among Neolithic populations. See full map.

2. Eneolithic

From Anthony (2019):

After approximately 4500 BC the Khvalynsk archaeological culture united the lower and middle Volga archaeological sites into one variable archaeological culture that kept domesticated sheep, goats, and cattle (and possibly horses). In my estimation, Khvalynsk might represent the oldest phase of PIE.

(…) this middle Volga mating network extended down to the North Caucasian steppes, where at cemeteries such as Progress-2 and Vonyuchka, dated 4300 BC, the same Khvalynsk-type ancestry appeared, an admixture of CHG and EHG with no Anatolian Farmer ancestry, with steppe-derived Y-chromosome haplogroup R1b. These three individuals in the North Caucasus steppes had higher proportions of CHG, overlapping Yamnaya. Without any doubt, a CHG population that was not admixed with Anatolian Farmers mated with EHG populations in the Volga steppes and in the North Caucasus steppes before 4500 BC. We can refer to this admixture as pre-Yamnaya, because it makes the best currently known genetic ancestor for EHG/CHG R1b Yamnaya genomes.

From Wang et al (2019):

Three individuals from the sites of Progress 2 and Vonyuchka 1 in the North Caucasus piedmont steppe (‘Eneolithic steppe’), which harbour EHG and CHG related ancestry, are genetically very similar to Eneolithic individuals from Khvalynsk II and the Samara region. This extends the cline of dilution of EHG ancestry via CHG-related ancestry to sites immediately north of the Caucasus foothills

Natural neighbor interpolation of Pre-Yamnaya ancestry among Neolithic populations. See full map. This map corresponds roughly to the map of Khvalynsk-Novodanilovka expansion, and in particular to the expansion of horse-head pommel-scepters (read more about Khvalynsk, and specifically about horse symbolism)

NOTE. Unpublished samples from Ekaterinovka have been previously reported as within the R1b-L23 tree. Interestingly, although the Varna outlier is a female, the Balkan outlier from Smyadovo shows two positive SNP calls for hg. R1b-M269. However, its poor coverage makes its most conservative haplogroup prediction R-M343.

The formation of this Pre-Yamnaya ancestry sets this Volga-Caucasus Khvalynsk community apart from the rest of the EHG-like population of eastern Europe.

Natural neighbor interpolation of non-Pre-Yamnaya EHG ancestry among Eneolithic populations. See full map.

Anthony (2019) seems to rely on ADMIXTURE graphics when he writes that the late Sredni Stog sample from Alexandria shows “80% Khvalynsk-type steppe ancestry (CHG&EHG)”. While this seems the most logical conclusion of what might have happened after the Suvorovo-Novodanilovka expansion through the North Pontic steppes (see my post on “Steppe ancestry” step by step), formal stats have not confirmed that.

In fact, analyses published in Wang et al. (2019) rejected that Corded Ware groups are derived from this Pre-Yamnaya ancestry, a reality that had been already hinted in Narasimhan et al. (2018), when Steppe_EMBA showed a poor fit for expanding Srubna-Andronovo populations. Hence the need to consider the whole CHG component of the North Pontic area separately:

Natural neighbor interpolation of non-Pre-Yamnaya CHG ancestry among Eneolithic populations. See full map. You can read more about population movements in the late Sredni Stog and closer to the Proto-Corded Ware period.

NOTE. Fits for WHG + CHG + EHG in Neolithic and Eneolithic populations are taken in part from Mathieson et al. (2019) supplementary materials (download Excel here). Unfortunately, while data on the Ukraine_Eneolithic outlier from Alexandria abounds, I don’t have specific data on the so-called ‘outlier’ from Dereivka compared to the other two analyzed together, so these maps of CHG and EHG expansion are possibly showing a lesser distribution to the west than the real one ca. 4000-3500 BC.

Natural neighbor interpolation of WHG ancestry among Eneolithic populations. See full map.

Anatolia Neolithic ancestry clearly spread to the east into the north Pontic area through a Middle Eneolithic mating network, most likely opened after the Khvalynsk expansion:

Natural neighbor interpolation of Anatolia Neolithic ancestry among Eneolithic populations. See full map.
Natural neighbor interpolation of Iran Chl. ancestry among Eneolithic populations. See full map.

Regarding Y-chromosome haplogroups, Anthony (2019) insists on the evident association of Khvalynsk, Yamnaya, and the spread of Pre-Yamnaya and Yamnaya ancestry with the expansion of elite R1b-L754 (and some I2a2) individuals:

Y-DNA haplogroups in West Eurasia during the Early Eneolithic in the Pontic-Caspian steppes. See full map, and see culture, ADMIXTURE, Y-DNA, and mtDNA maps of the Early Eneolithic and Late Eneolithic.

3. Early Bronze Age

Data from Wang et al. (2019) show that Corded Ware-derived populations do not have good fits for Eneolithic_Steppe-like ancestry, no matter the model. In other words: Corded Ware populations show not only a higher contribution of Anatolia Neolithic ancestry (ca. 20-30% compared to the ca. 2-10% of Yamnaya); they show a different EHG + CHG combination compared to the Pre-Yamnaya one.

Supplementary Table 13. P values of rank=2 and admixture proportions in modelling Steppe ancestry populations as a three-way admixture of Eneolithic steppe Anatolian_Neolithic and WHG using 14 outgroups.
Left populations: Test, Eneolithic_steppe, Anatolian_Neolithic, WHG.
Right populations: Mbuti.DG, Ust_Ishim.DG, Kostenki14, MA1, Han.DG, Papuan.DG, Onge.DG, Villabruna, Vestonice16, ElMiron, Ethiopia_4500BP.SG, Karitiana.DG, Natufian, Iran_Ganj_Dareh_Neolithic.

Yamnaya Kalmykia and Afanasievo show the closest fits to the Eneolithic population of the North Caucasian steppes, rejecting thus sizeable contributions from Anatolia Neolithic and/or WHG, as shown by the SD values. Both probably show then a Pre-Yamnaya ancestry closest to the late Repin population.

Modelling results for the Steppe and Caucasus cluster. Admixture proportions based on (temporally and geographically) distal and proximal models, showing additional AF ancestry in Steppe groups and additional gene flow from the south in some of the Steppe groups as well as the Caucasus groups. See tables above. Modified from Wang et al. (2019). Within a blue square, Yamnaya-related groups; within a cyan square, Corded Ware-related groups. Green background behind best p-values. In red circle, SD of AF/WHG ancestry contribution in Afanasevo and Yamnaya Kalmykia, with ranges that almost include 0%.

EBA maps include data from Wang et al. (2018) supplementary materials, specifically unpublished Yamnaya samples from Hungary that appeared in analysis of the preprint, but which were taken out of the definitive paper. Their location among Yamnaya settlers from Hungary is speculative, although most uncovered kurgans in Hungary are concentrated in the Tisza-Danube interfluve.

Natural neighbor interpolation of Pre-Yamnaya ancestry among Early Bronze Age populations. See full map. This map corresponds roughly with the known expansion of late Repin/Yamnaya settlers.

The Y-chromosome bottleneck of elite males from Proto-Indo-European clans under R1b-L754 and some I2a2 subclades, already visible in the Khvalynsk sampling, became even more noticeable in the subsequent expansion of late Repin/early Yamnaya elites under R1b-L23 and I2a-L699:

Y-DNA haplogroups in West Eurasia during the Yamnaya expansion. See full map and maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Early Chalcolithic and Yamnaya Hungary.

Maps of CHG, EHG, Anatolia Neolithic, and probably WHG show the expansion of these components among Corded Ware-related groups in North Eurasia, apart from other cultures close to the Caucasus:

NOTE. For maps with actual formal stats of Corded Ware ancestry from the Early Bronze Age to the modern times, you can read the post Corded Ware ancestry in North Eurasia and the Uralic expansion.

Natural neighbor interpolation of non-Pre-Yamnaya CHG ancestry among Early Bronze Age populations. See full map.
Natural neighbor interpolation of non-Pre-Yamnaya EHG ancestry among Early Bronze Age populations. See full map.
Natural neighbor interpolation of WHG ancestry among Early Bronze Age populations. See full map.
Natural neighbor interpolation of Anatolia Neolithic ancestry among Early Bronze Age populations. See full map.
Natural neighbor interpolation of Iran Chl. ancestry among Early Bronze Age populations. See full map.

4. Middle to Late Bronze Age

The following maps show the most likely distribution of Yamnaya ancestry during the Bell Beaker-, Balkan-, and Sintashta-Potapovka-related expansions.

4.1. Bell Beakers

The amount of Yamnaya ancestry is probably overestimated among populations where Bell Beakers replaced Corded Ware. A map of Yamnaya ancestry among Bell Beakers gets trickier for the following reasons:

  • Expanding Repin peoples of Pre-Yamnaya ancestry must have had admixture through exogamy with late Sredni Stog/Proto-Corded Ware peoples during their expansion into the North Pontic area, and Sredni Stog in turn had probably some Pre-Yamnaya admixture, too (although they don’t appear in the simplistic formal stats above). This is supported by the increase of Anatolia farmer ancestry in more western Yamna samples.
  • Later, Yamnaya admixed through exogamy with Corded Ware-like populations in Central Europe during their expansion. Even samples from the Middle to Upper Danube and around the Lower Rhine will probably show increasing contributions of Steppe_MLBA, at the same time as they show an increasing proportion of EEF-related ancestry.
  • To complicate things further, the late Corded Ware Espersted family (from ca. 2500 BC or later) shows, in turn, what seems like a recent admixture with Yamnaya vanguard groups, with the sample of highest Yamnaya ancestry being the paternal uncle of other individuals (all of hg. R1a-M417), suggesting that there might have been many similar Central European mating networks from the mid-3rd millennium BC on, of (mainly) Yamnaya-like R1b elites displaying a small proportion of CW-like ancestry admixing through exogamy with Corded Ware-like peoples who already had some Yamnaya ancestry.
Natural neighbor interpolation of Yamnaya ancestry among Middle to Late Bronze Age populations (Esperstedt CWC site close to BK_DE, label is hidden by BK_DE_SAN). See full map. You can see how this map correlated with the map of Late Copper Age migrations and Yamanaya into Bell Beaker expansion.

NOTE. Terms like “exogamy”, “male-driven migration”, and “sex bias”, are not only based on the Y-chromosome bottlenecks visible in the different cultural expansions since the Palaeolithic. Despite the scarce sampling available in 2017 for analysis of “Steppe ancestry”-related populations, it appeared to show already a male sex bias in Goldberg et al. (2017), and it has been confirmed for Neolithic and Copper Age population movements in Mathieson et al. (2018) – see Supplementary Table 5. The analysis of male-biased expansion of “Steppe ancestry” in CWC Esperstedt and Bell Beaker Germany is, for the reasons stated above, not very useful to distinguish their mutual influence, though.

Based on data from Olalde et al. (2019), Bell Beakers from Germany are the closest sampled ones to expanding East Bell Beakers, and those close to the Rhine – i.e. French, Dutch, and British Beakers in particular – show a clear excess “Steppe ancestry” due to their exogamy with local Corded Ware groups:

Only one 2-way model fits the ancestry in Iberia_CA_Stp with P-value>0.05: Germany_Beaker + Iberia_CA. Finding a Bell Beaker-related group as a plausible source for the introduction of steppe ancestry into Iberia is consistent with the fact that some of the individuals in the Iberia_CA_Stp group were excavated in Bell Beaker associated contexts. Models with Iberia_CA and other Bell Beaker groups such as France_Beaker (P-value=7.31E-06), Netherlands_Beaker (P-value=1.03E-03) and England_Beaker (P-value=4.86E-02) failed, probably because they have slightly higher proportions of steppe ancestry than the true source population.


The exogamy with Corded Ware-like groups in the Lower Rhine Basin seems at this point undeniable, as is the origin of Bell Beakers around the Middle-Upper Danube Basin from Yamnaya Hungary.

To avoid this excess “Steppe ancestry” showing up in the maps, since Bell Beakers from Germany pack the most Yamnaya ancestry among East Bell Beakers outside Hungary (ca. 51.1% “Steppe ancestry”), I equated this maximum with BK_Scotland_Ach (which shows ca. 61.1% “Steppe ancestry”, highest among western Beakers), and applied a simple rule of three for “Steppe ancestry” in Dutch and British Beakers.

NOTE. Formal stats for “Steppe ancestry” in Bell Beaker groups are available in Olalde et al. (2018) supplementary materials (PDF). I didn’t apply this adjustment to Bk_FR groups because of the R1b Bell Beaker sample from the Champagne/Alsace region reported by Samantha Brunel that will pack more Yamnaya ancestry than any other sampled Beaker to date, hence probably driving the Yamnaya ancestry up in French samples.

The most likely outcome in the following years, when Yamnaya and Corded Ware ancestry are investigated separately, is that Yamnaya ancestry will be much lower the farther away from the Middle and Lower Danube region, similar to the case in Iberia, so the map above probably overestimates this component in most Beakers to the north of the Danube. Even the late Hungarian Beaker samples, who pack the highest Yamnaya ancestry (up to 75%) among Beakers, represent likely a back-migration of Moravian Beakers, and will probably show a contribution of Corded Ware ancestry due to the exogamy with local Moravian groups.

Despite this decreasing admixture as Bell Beakers spread westward, the explosive expansion of Yamnaya R1b male lineages (in words of David Reich) and the radical replacement of local ones – whether derived from Corded Ware or Neolithic groups – shows the true extent of the North-West Indo-European expansion in Europe:

Y-DNA haplogroups in West Eurasia during the Bell Beaker expansion. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Late Copper Age and of the Yamnaya-Bell Beaker transition.

4.2. Palaeo-Balkan

There is scarce data on Palaeo-Balkan movements yet, although it is known that:

  1. Yamnaya ancestry appears among Mycenaeans, with the Yamnaya Bulgaria sample being its best current ancestral fit;
  2. the emergence of steppe ancestry and R1b-M269 in the eastern Mediterranean was associated with Ancient Greeks;
  3. Thracians, Albanians, and Armenians also show R1b-M269 subclades and “Steppe ancestry”.

4.3. Sintashta-Potapovka-Filatovka

Interestingly, Potapovka is the only Corded Ware derived culture that shows good fits for Yamnaya ancestry, despite having replaced Poltavka in the region under the same Corded Ware-like (Abashevo) influence as Sintashta.

This proves that there was a period of admixture in the Pre-Proto-Indo-Iranian community between CWC-like Abashevo and Yamnaya-like Catacomb-Poltavka herders in the Sintashta-Potapovka-Filatovka community, probably more easily detectable in this group because of the specific temporal and geographic sampling available.

Supplementary Table 14. P values of rank=3 and admixture proportions in modelling Steppe ancestry populations as a four-way admixture of distal sources EHG, CHG, Anatolian_Neolithic and WHG using 14 outgroups.
Left populations: Steppe cluster, EHG, CHG, WHG, Anatolian_Neolithic
Right populations: Mbuti.DG, Ust_Ishim.DG, Kostenki14, MA1, Han.DG, Papuan.DG, Onge.DG, Villabruna, Vestonice16, ElMiron, Ethiopia_4500BP.SG, Karitiana.DG, Natufian, Iran_Ganj_Dareh_Neolithic.

Srubnaya ancestry shows a best fit with non-Pre-Yamnaya ancestry, i.e. with different CHG + EHG components – possibly because the more western Potapovka (ancestral to Proto-Srubnaya Pokrovka) also showed good fits for it. Srubnaya shows poor fits for Pre-Yamnaya ancestry probably because Corded Ware-like (Abashevo) genetic influence increased during its formation.

On the other hand, more eastern Corded Ware-derived groups like Sintashta and its more direct offshoot Andronovo show poor fits with this model, too, but their fits are still better than those including Pre-Yamnaya ancestry.

Natural neighbor interpolation of non-Pre-Yamnaya EHG ancestry among Middle to Late Bronze Age populations. See full map.
Natural neighbor interpolation of non-Pre-Yamnaya CHG ancestry among Middle to Late Bronze Age populations. See full map.
Natural neighbor interpolation of Anatolia Neolithic ancestry among Middle to Late Bronze Age populations. See full map.
Natural neighbor interpolation of Iran Chl. ancestry among Middle to Late Bronze Age populations. See full map.

NOTE For maps with actual formal stats of Corded Ware ancestry from the Early Bronze Age to the modern times, you should read the post Corded Ware ancestry in North Eurasia and the Uralic expansion instead.

The bottleneck of Proto-Indo-Iranians under R1a-Z93 was not yet complete by the time when the Sintashta-Potapovka-Filatovka community expanded with the Srubna-Andronovo horizon:

Y-DNA haplogroups in West Eurasia during the European Early Bronze Age. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Early Bronze Age.

4.4. Afanasevo

At the end of the Afanasevo culture, at least three samples show hg. Q1a2-M25 (ca. 2900-2500 BC), which seemed to point to a resurgence of local lineages, despite continuity of the prototypical Pre-Yamnaya ancestry. On the other hand, Anthony (2019) makes this cryptic statement:

Yamnaya men were almost exclusively R1b, and pre-Yamnaya Eneolithic Volga-Caspian-Caucasus steppe men were principally R1b, with a significant Q1a minority.

Since the only available samples from the Khvalynsk community are R1b (x3), Q1a(x1), and R1a(x1), it seems strange that Anthony would talk about a “significant minority”, unless Q1a will pop up in some more individuals of those ca. 30 new to be published. Because he also mentions I2a2 as appearing in one elite burial, it seems Q1a (like R1a-M459) will not appear under elite kurgans, although it is still possible that hg. Q1a was involved in the expansion of Afanasevo to the east.

Y-DNA haplogroups in West Eurasia during the Middle Bronze Age. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Middle Bronze Age and the Late Bronze Age.

Okunevo, which replaced Afanasevo in the Altai region, shows a majority of hg. Q1a2-M25, and at least one Q1a1-B284, but also some R1b-M269 samples proper of Afanasevo, suggesting partial genetic continuity.

NOTE. Other sampled Siberian populations clearly show a variety of Q subclades that likely expanded during the Palaeolithic, such as Baikal EBA samples from Ust’Ida and Shamanka with a majority of Q1a2-M25 (in particular Q1a2-L712), and hg. Q reported from Elunino, Sagsai, Khövsgöl, and also among peoples of the Srubna-Andronovo horizon (the Krasnoyarsk MLBA outlier), and in Karasuk. Q1a-M25 was earlier found in a Baltic hunter-gatherer, which supports a widespread distribution of Q1a2 and Q1a1 in North Eurasia during the Neolithic and Bronze Age.

From Damgaard et al. Science (2018):

(…) in contrast to the lack of identifiable admixture from Yamnaya and Afanasievo in the CentralSteppe_EMBA, there is an admixture signal of 10 to 20% Yamnaya and Afanasievo in the Okunevo_EMBA samples, consistent with evidence of western steppe influence. This signal is not seen on the X chromosome (qpAdm P value for admixture on X 0.33 compared to 0.02 for autosomes), suggesting a male-derived admixture, also consistent with the fact that 1 of 10 Okunevo_EMBA males carries a R1b1a2a2 Y chromosome related to those found in western pastoralists. In contrast, there is no evidence of western steppe admixture among the more eastern Baikal region region Bronze Age (~2200 to 1800 BCE) samples.

This Yamnaya ancestry has been also recently found to be the best fit for the Iron Age population of Shirenzigou in Xinjiang – where Tocharian languages were attested centuries later – despite the haplogroup diversity acquired during their evolution, likely through an intermediate Chemurchek culture (see a recent discussion on the elusive Proto-Tocharians).

Haplogroup diversity seems to be common in Iron Age populations all over Eurasia, most likely due to the spread of different types of sociopolitical structures where alliances played a more relevant role in the expansion of peoples. A well-known example of this is the spread of Akozino warrior-traders in the whole Baltic region under a partial N1a-VL29-bottleneck associated with the emerging chiefdom-based systems under the influence of expanding steppe nomads.

Y-DNA haplogroups in West Eurasia during the Early Iron Age. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Early Iron Age and Late Iron Age.

Surprisingly, then, Proto-Tocharians from Shirenzigou pack up to 74% Yamnaya ancestry, in spite of the 2,000 years that separate them from the demise of the Afanasevo culture. They show more Yamnaya ancestry than any other population by that time, being thus a sort of Late PIE fossils not only in their archaic dialect, but also in their genetic profile:


The recent intrusion of Corded Ware-like ancestry, as well as the variable admixture with Siberian and East Asian populations, both point to the known intense Old Iranian and Old/Middle Chinese contacts. The scarce Proto-Samoyedic and Proto-Turkic loans in Tocharian suggest a rather loose, probably more distant connection with East Uralic and Altaic peoples from the forest-steppe and steppe areas to the north (read more about external influences on Tocharian).

Interestingly, both R1b samples, MO12 and M15-2 – likely of Asian R1b-PH155 branch – show a best fit for Andronovo/Srubna + Hezhen/Ulchi ancestry, suggesting a likely connection with Iranians to the east of Xinjiang, who later expanded as the Wusun and Kangju. How they might have been related to Huns and Xiongnu individuals, who also show this haplogroup, is yet unknown, although Huns also show hg. R1a-Z93 (probably most R1a-Z2124) and Steppe_MLBA ancestry, earlier associated with expanding Iranian peoples of the Srubna-Andronovo horizon.

All in all, it seems that prehistoric movements explained through the lens of genetic research fit perfectly well the linguistic reconstruction of Proto-Indo-European and Proto-Uralic.


Corded Ware ancestry in North Eurasia and the Uralic expansion


Now that it has become evident that Late Repin (i.e. Yamnaya/Afanasevo) ancestry was associated with the migration of R1b-L23-rich Late Proto-Indo-Europeans from the steppe in the second half of the the 4th millennium BC, there’s still the question of how R1a-rich Uralic speakers of Corded Ware ancestry expanded , and how they spread their languages throughout North Eurasia.

Modern North Eurasians

I have been collecting information from the supplementary data of the latest papers on modern and ancient North Eurasian peoples, including Jeong et al. (2019), Saag et al. (2019), Sikora et al. (2018), or Flegontov et al. (2019), and I have tried to add up their information on ancestral components and their modern and historical distributions.

Fortunately, the current obsession with simplifying ancestry components into three or four general, atemporal groups, and the common use of the same ones across labs, make it very simple to merge data and map them.

Corded Ware ancestry

There is no doubt about the prevalent ancestry among Uralic-speaking peoples. A map isn’t needed to realize that, because ancient and modern data – like those recently summarized in Jeong et al. (2019) – prove it. But maps sure help visualize their intricate relationship better:

Natural neighbor interpolation of Srubnaya ancestry among modern populations. See full map.
Kriging interpolation of Srubnaya ancestry among modern populations. See full map

Interestingly, the regions with higher Corded Ware-related ancestry are in great part coincident with (pre)historical Finno-Ugric-speaking territories:

Modern distribution of Uralic languages, with ancient territory (in the Common Era) labelled and delimited by a red line. For more information on the ancient territory see here.

Edit (29/7/2019): Here is the full Steppe_MLBA ancestry map, including Steppe_MLBA (vs. Indus Periphery vs. Onge) in modern South Asian populations from Narasimhan et al. (2018), apart from the ‘Srubnaya component’ in North Eurasian populations. ‘Dummy’ variables (with 0% ancestry) have been included to the south and east of the map to avoid weird interpolations of Steppe_MLBA into Africa and East Asia.

Natural neighbor interpolation of Steppe MLBA-like ancestry among modern populations. See full map.

Anatolia Neolithic ancestry

Also interesting are the patterns of non-CWC-related ancestry, in particular the apparent wedge created by expanding East Slavs, which seems to reflect the intrusion of central(-eastern) European ancestry into Finno-Permic territory.

NOTE. Read more on Balto-Slavic hydrotoponymy, on the cradle of Russians as a Finno-Permic hotspot, and about Pre-Slavic languages in North-West Russia.

Natural neighbor interpolation of LBK EN ancestry among modern populations. See full map.
Kriging interpolation of LBK EN ancestry among modern populations. See full map

WHG ancestry

The cline(s) between WHG, EHG, ANE, Nganasan, and Baikal HG are also simplified when some of them excluded, in this case EHG, represented thus in part by WHG, and in part by more eastern ancestries (see below).

Natural neighbor interpolation of WHG ancestry among modern populations. See full map.
Kriging interpolation of WHG ancestry among modern populations. See full map.

Arctic, Tundra or Forest-steppe?

Data on Nganasan-related vs. ANE vs. Baikal HG/Ulchi-related ancestry is difficult to map properly, because both ancestry components are usually reported as mutually exclusive, when they are in fact clearly related in an ancestral cline formed by different ancient North Eurasian populations from Siberia.

When it comes to ascertaining the origin of the multiple CWC-related clines among Uralic-speaking peoples, the question is thus how to properly distinguish the proportions of WHG-, EHG-, Nganasan-, ANE or BaikalHG-related ancestral components in North Eurasia, i.e. how did each dialectal group admix with regional groups which formed part of these clines east and west of the Urals.

The truth is, one ought to test specific ancient samples for each “Siberian” ancestry found in the different Uralic dialectal groups, but the simplistic “Siberian” label somehow gets a pass in many papers (see a recent example).

Below qpAdm results with best fits for Ulchi ancestry, Afontova Gora 3 ancestry, and Nganasan ancestry, but some populations show good fits for both and with similar proportions, so selecting one necessarily simplifies the distribution of both.

Ulchi ancestry

Natural neighbor interpolation of Ulchi ancestry among modern populations. See full map.
Kriging interpolation of Ulchi ancestry among modern populations. See full map.

ANE ancestry

Natural neighbor interpolation of ANE ancestry among modern populations. See full map.
Kriging interpolation of ANE ancestry among modern populations. See full map.

Nganasan ancestry

Natural neighbor interpolation of Nganasan ancestry among modern populations. See full map.
Kriging interpolation of Nganasan ancestry among modern populations. See full map.

Iran Chalcolithic

A simplistic Iran Chalcolithic-related ancestry is also seen in the Altaic cline(s) which (like Corded Ware ancestry) expanded from Central Asia into Europe – apart from its historical distribution south of the Caucasus:

Natural neighbor interpolation of Iran Neolithic ancestry among modern populations. See full map.
Kriging interpolation of Iran Chalcolithic ancestry among modern populations. See full map.

Other models

The first question I imagine some would like to know is: what about other models? Do they show the same results? Here is the simplistic combination of ancestry components published in Damgaard et al. (2018) for the same or similar populations:

NOTE. As you can see, their selection of EHG vs. WHG vs. Nganasan vs. Natufian vs. Clovis of is of little use, but corroborate the results from other papers, and show some interesting patterns in combination with those above.


Natural neighbor interpolation of EHG ancestry among modern populations, data from Damgaard et al. (2018). See full map.
Kriging interpolation of EHG ancestry among modern populations. See full map.

Natufian ancestry

Natural neighbor interpolation of Natufian ancestry among modern populations, data from Damgaard et al. (2018). See full map.
Kriging interpolation of Natufian ancestry among modern populations. See full map.

WHG ancestry

Natural neighbor interpolation of WHG ancestry among modern populations, data from Damgaard et al. (2018). See full map.
Kriging interpolation of WHG ancestry among modern populations. See full map.

Baikal HG ancestry

Natural neighbor interpolation of Baikal hunter-gatherer ancestry among modern populations, data from Damgaard et al. (2018). See full map.
Kriging interpolation of Baikal HG ancestry among modern populations. See full map.

Ancient North Eurasians

Once the modern situation is clear, relevant questions are, for example, whether EHG-, WHG-, ANE, Nganasan-, and/or Baikal HG-related meta-populations expanded or became integrated into Uralic-speaking territories.

When did these admixture/migration events happen?

How did the ancient distribution or expansion of Palaeo-Arctic, Baikalic, and/or Altaic peoples affect the current distribution of the so-called “Siberian” ancestry, and of hg. N1a, in each specific population?

NOTE. A little excursus is necessary, because the calculated repetition of a hypothetic opposition “N1a vs. R1a” doesn’t make this dichotomy real:

  1. There was not a single ethnolinguistic community represented by hg. R1a after the initial expansion of Eastern Corded Ware groups, or by hg. N1a-L392 after its initial expansion in Siberia:
  2. Different subclades became incorporated in different ways into Bronze Age and Iron Age communities, most of which without an ethnolinguistic change. For example, N1a subclades became incorporated into North Eurasian populations of different languages, reaching Uralic- and Indo-European-speaking territories of north-eastern Europe during the late Iron Age, at a time when their ancestral origin or language in Siberia was impossible to ascertain. Just like the mix found among Proto-Germanic peoples (R1b, R1a, and I1)* or among Slavic peoples (I2a, E1b, R1a)*, the mix of many Uralic groups showing specific percentages of R1a, N1a, or Q subclades* reflect more or less recent admixture or acculturation events with little impact on their languages.

*other typically northern and eastern European haplogroups are also represented in early Germanic (N1a, I2, E1b, J, G2), Slavic (I1, G2, J) and Finno-Permic (I1, R1b, J) peoples.

Map of archaeological cultures in north-eastern Europe ca. 8th-3rd centuries BC. [The Mid-Volga Akozino group not depicted] Shaded area represents the Ananino cultural-historical society. Fading purple arrows represent likely stepped movements of subclades of haplogroup N for centuries (e.g. Siberian → Ananino → Akozino → Fennoscandia [N-VL29]; Circum-Arctic → forest-steppe [N1, N2]; etc.). Blue arrows represent eventual expansions of Uralic peoples to the north. Modified image from Vasilyev (2002).

The problem with mapping the ancestry of the available sampling of ancient populations is that we lack proper temporal and regional transects. The maps that follow include cultures roughly divided into either “Bronze Age” or “Iron Age” groups, although the difference between samples may span up to 2,000 years.

NOTE. Rough estimates for more external groups (viz. Sweden Battle Axe/Gotland_A for the NW, Srubna from the North Pontic area for the SW, Arctic/Nganasan for the NE, and Baikal EBA/”Ulchi-like” for the SE) have been included to offer a wider interpolated area using data already known.

Bronze Age

Similar to modern populations, the selection of best fit “Siberian” ancestry between Baikal HG vs. Nganasan, both potentially ± ANE (AG3), is an oversimplification that needs to be addressed in future papers.

Corded Ware ancestry

Natural neighbor interpolation of Srubnaya ancestry among Bronze Age populations. See full map.

Nganasan-like ancestry

Natural neighbor interpolation of Nganasan-like ancestry among Bronze Age populations. See full map.

Baikal HG ancestry

Natural neighbor interpolation of Baikal Hunter-Gatherer ancestry among Bronze Age populations. See full map.

Afontova Gora 3 ancestry

Natural neighbor interpolation of Afontova Gora 3 ancestry among Bronze Age populations. See full map.

Iron Age

Corded Ware ancestry

Interestingly, the moderate expansion of Corded Ware-related ancestry from the south during the Iron Age may be related to the expansion of hg. N1a-VL29 into the chiefdom-based system of north-eastern Europe, including Ananyino/Akozino and later expanding Akozino warrior-traders around the Baltic Sea.

NOTE. The samples from Levänluhta are centuries older than those from Estonia (and Ingria), and those from Chalmny Varre are modern ones, so this region has to be read as a south-west to north-east distribution from the Iron Age to modern times.

Natural neighbor interpolation of Srubnaya ancestry among Iron Age populations. See full map.

Baikal HG-like ancestry

The fact that this Baltic N1a-VL29 branch belongs in a group together with typically Avar N1a-B197 supports the Altaic origin of the parent group, which is possibly related to the expansion of Baikalic ancestry and Iron Age nomads:

Natural neighbor interpolation of Baikal HG ancestry among Iron Age populations. See full map.

Nganasan-like ancestry

The dilution of Nganasan-like ancestry in an Arctic region featuring “Siberian” ancestry and hg. N1a-L392 at least since the Bronze Age supports the integration of hg. N1a-Z1934, sister clade of Ugric N1a-Z1936, into populations west and east of the Urals with the expansion of Uralic languages to the north into the Tundra region (see here).

The integration of N1a-Z1934 lineages into Finnic-speaking peoples after their migration to the north and east, and the displacement or acculturation of Saami from their ancestral homeland, coinciding with known genetic bottlenecks among Finns, is yet another proof of this evolution:

Natural neighbor interpolation of Nganasan ancestry among Iron Age populations. See full map.

WHG ancestry

Similarly, WHG ancestry doesn’t seem to be related to important population movements throughout the Bronze Age, which excludes the multiple North Eurasian populations that will be found along the clines formed by WHG, EHG, ANE, Nganasan, Baikal HG ancestry as forming part of the Uralic ethnogenesis, although they may be relevant to follow later regional movements of specific populations.

Natural neighbor interpolation of WHG ancestry among Iron Age populations. See full map.


It seems natural that people used to look at maps of haplogroup distribution from the 2000s, coupled with modern language distributions, and would try to interpret them in a certain way, reaching thus the wrong conclusions whose consequences are especially visible today when ancient DNA keeps contradicting them.

In hindsight, though, assuming that Balto-Slavs expanded with Corded Ware and hg. R1a, or that Uralians expanded with “Siberian” ancestry and hg. N1a, was as absurd as looking at maps of ancestry and haplogroup distribution of ancient and modern Native Americans, trying to divide them into “Germanic” or “Iberian”…

The evolution of each specific region and cultural group of North Eurasia is far from being clear. However, the general trend speaks clearly in favour of an ancient, Bronze Age distribution of North Eurasian ancestry and haplogroups that have decreased, diluted, or become incorporated into expanding Uralians of Corded Ware ancestry, occasionally spreading with inter-regional expansions of local groups.

Given the relatively recent push of Altaic and Indo-European languages into ancestral Uralic-speaking territories, only the ancient Corded Ware expansion remains compatible with the spread of Uralic languages into their historical distribution.


Iron Age Tocharians of Yamnaya ancestry from Afanasevo show hg. R1b-M269 and Q1a1

New open access Ancient Genomes Reveal Yamnaya-Related Ancestry and a Potential Source of Indo-European Speakers in Iron Age Tianshan, by Ning et al. Cell (2019).

Interesting excerpts (emphasis mine, changes for clarity):

Here, we report the first genome-wide data of 10 ancient individuals from northeastern Xinjiang. They are dated to around 2,200 years ago and were found at the Iron Age Shirenzigou site. We find them to be already genetically admixed between Eastern and Western Eurasians. We also find that the majority of the East Eurasian ancestry in the Shirenzigou individuals is related to northeastern Asian populations, while the West Eurasian ancestry is best presented by ∼20% to 80% Yamnaya-like ancestry. Our data thus suggest a Western Eurasian steppe origin for at least part of the ancient Xinjiang population. Our findings furthermore support a Yamnaya-related origin for the now extinct Tocharian languages in the Tarim Basin, in southern Xinjiang.


The dominant mtDNA lineages of the Shirenzigou people are commonly found in modern and ancient West Eurasian populations, such as U4, U5, and H, while they also have East Eurasian-specific haplogroups A, D4, and G3, preliminarily documenting admixed ancestry from eastern and western Eurasia.

The admixture profile is also shown on the paternal Y chromosome side that 4 out of 6 males in Shirenzigou (Figure S2) belong to the West Eurasian-specific haplogroup R1b (n = 2) and East Eurasian-specific haplogroup Q1a (n = 2), the former is predominant in ancient Yamnaya and nearly 100% in Afanasievo, different from the Middle and Late Bronze Age Steppe groups (Steppe_MLBA) such as Andronovo, [Potapovka], Srubnaya, and Sintashta whose Y chromosomal haplogroup is mainly R1a.



We first carried out principal component analysis (PCA) to assess the genetic affinities of the ancient individuals qualitatively by projecting them onto present-day Eurasian variation (Figure 2). We observed a distinct separation between East and West Eurasians. Our ancient Shirenzigou samples and present-day populations from Central Asia and northwestern China form a genetic cline from East to West in the first PC. The distribution of Shirenzigou samples on the cline is relatively scattered with two major clusters, one being closer to modern-day Uygurs and Kazakhs and the other being closer to recently published ancient Saka and Huns from the Tianshan in Kazakhstan (…).

We applied a formal admixture test using f3 statistics in the form of f3 (Shirenzigou; X, Y) where X and Y are worldwide populations that might be the genetic sources for the Shirenzigou individuals. We observed the most significant signals of admixture in the Shirenzigou samples when using Yamnaya_Samara or Srubnaya as the West Eurasian source and some Northern Asians or Koreans as the East Eurasian source (Table S1). We also plotted the outgroup f3 statistics in the form of f3 (Mbuti; X, Anatolia_Neolithic) and f3 (Mbuti; X, Kostenki14) to visualize the allele sharing between population X and Anatolian farmers. As shown in Figure S3, the Steppe_MLBA populations including Srubnaya, Andronovo, and Sintashta were shifted toward farming populations compared with Yamnaya groups and the Shirenzigou samples. This observation is consistent with ADMIXTURE analysis that Steppe_MLBA populations have an Anatolian and European farmer-related component that Yamnaya groups and the Shirenzigou individuals do not seem to have. The analysis consistently suggested Yamnaya-related Steppe populations were the better source in modeling the West Eurasian ancestry in Shirenzigou.

PCA and ADMIXTURE for Shirenzigou Samples. Modified from the original to include in black squares samples related to Yamnaya.

Genetic Composition of Iron Age Shirenzigou Individuals

We continued to use qpAdm to estimate the admixture proportions in the Shirenzigou samples by using different pairs of source populations, such as Yamnaya_Samara, Afanasievo, Srubnaya, Andronovo, BMAC culture (Bustan_BA and Sappali_Tepe_BA) and Tianshan_Hun as the West Eurasian source and Han, Ulchi, Hezhen, Shamanka_EN as the East Eurasian source. In all cases, Yamnaya, Afanasievo, or Tianshan_Hun always provide the best model fit for the Shirenzigou individuals, while Srubnaya, Andronovo, Bustan_BA and Sappali_Tepe_BA only work in some cases. The Yamnaya_Samara or Afanasievo-related ancestry ranges from ∼20% to 80% in different Shirenzigou individuals, consistent with the scattered distribution on the East-West cline in the PCA


(…) we then modeled Shirenzigou as a three-way admixture of Yamnaya_Samara, Ulchi (or Hezhen) and Han to infer the source from the East Eurasia side that contributed to Shirenzigou. We found the Ulchi or Hezhen and Han-related ancestry had a complicated and unevenly distribution in the Shirenzigou samples. The most Shirenzigou individuals derived the majority of their East Eurasian ancestry from Ulchi or Hezhen-related populations, while the following two individuals M820 and M15-2 have more Han related than Ulchi/Hezhen-related ancestry.

One important question remains, though: how and when did these Proto-Tocharian speakers migrate from the Afanasevo culture in the Altai into the Tarim Basin? The traditional answer, now more likely than ever, is through the Chemurchek culture. See e.g. A re-analysis of the Qiemu’erqieke (Shamirshak) cemeteries, Xinjiang, China, by Jia and Betts JIES (2010) 38(4).

Also, given the apparent lack of (extra farmer ancestry that characterizes) Corded Ware ancestry, if the results were already suspicious before, how likely are now the published R1a(xZ93) and/or radiocarbon dates of the Xiaohe mummies from Li et al. (2010, 2015)? Because, after all, one should have expected in such a late date a generalized admixture with neighbouring Srubna/Andronovo-like populations.


Złota a GAC-CWC transitional group…but not the origin of Corded Ware peoples


Open access Unraveling ancestry, kinship, and violence in a Late Neolithic mass grave, by Schroeder et al. PNAS (2019).

Interesting excerpts of the paper and supplementary materials, about the Złota group variant of Globular Amphora (emphasis mine):

A special case is the so-called Złota group, which emerged around 2,900 BCE in the northern part of the Małopolska Upland and existed until 2,600-2,500 BCE. Originally defined as a separate archaeological “culture” (15), this group is mainly defined by the rather local introduction of a distinct form of burial in the area mentioned. Distinct Złota settlements have not yet been identified. Nonetheless, because of the character of its burial practices and material culture, which both retain many elements of the GAC and yet point forward to the Corded Ware tradition, and because of its geographical location, the Złota group has attracted significant archaeological attention (15, 16).

The Złota group buried their dead in a new, distinct type of funerary structure; so-called niche graves (also called catacomb graves). These structures featured an entrance shaft or pit and, below that, a more or less extensive niche, sometimes connected to the entrance area by a narrow corridor. Local limestone was used to seal off the entrance shaft and to pave the floor of the niche, on which the dead were usually placed along with grave goods. This specific and relatively sophisticated form of burial probably reflects contacts between the northern Małopolska Upland and the steppe and forest-steppe communities further to the east, who also buried their dead in a form of catacomb graves. Individual cases of the use of ochre and of deformation of skulls in Złota burials provide further indications of such a connection (15). At the same time, the Złota niche grave practice also retains central elements of the GAC funerary tradition, such as the frequent practice of multiple burials in one grave, often entailing redeposition and violation of the anatomical order of corpses, and thus differs from the catacomb grave customs found on the steppes which are strongly dominated by single graves. Nonetheless, at Złota group cemeteries single burial graves appear, and even in multiple burial graves the identity of each individual is increasingly emphasized, e.g. by careful deposition of the body and through the personal nature of grave goods (16).

Correspondence analysis of amphorae from the Złota-graveyards reveals that there is no typological break between Globular Amphorae and Corded Ware Amphorae, including ‘Strichbündelamphorae’ (after Furholt 2008)

Just like its burial practices, the material culture and grave goods of the Złota group combine elements of the GAC, such as amber ornaments and central parts of the ceramic inventory, with elements also found in the Corded Ware tradition, such as copper ornaments, stone shaft-hole axes, bone and shell ornaments, and other stylistic features of the ceramic inventory. In particular, Złota group ceramic styles have been seen as a clear transitional phenomenon between classical GAC styles and the subsequent Corded Ware ceramics, probably playing a key role in the development of the typical cord decoration patterns that came to define the latter (17).

As briefly summarized above, the Złota group displays a distinct funerary tradition and combination of material culture traits, which give the clear impression of a cultural “transitional situation”. While the group also appears to have had long-distance contacts directed elsewhere (e.g. to Baden communities to the south), it is the combination of Globular Amphora traits, on the one hand, and traits found among late Yamnaya or Catacomb Grave groups to the east as well as the closely related Corded Ware groups that emerged around 2,800 BCE, on the other hand, that is such a striking feature of the Złota group and which makes it interesting when attempting to understand cultural and demographic dynamics in Central and Eastern Europe during the early 3rd millennium BCE.

Catacomb grave no. 2a/06 from Książnice, Złota culture (acc. to Wilk 2013). Image from Włodarczak (2017)

Książnice (site 2, grave 3ZC), Świętokrzyskie province. This burial, a so-called niche grave of the Złota type (with a vertical entrance shaft and perpendicularly situated niche), was excavated in 2006 and contained the remains of 8 individuals, osteologically identified as three adult females and five children, positioned on limestone pavement in the niche part of the grave. Radiocarbon dating of the human remains indicates that the grave dates to 2900-2630 BCE, 95.4% probability (Dataset S1). The grave had an oval entrance shaft with a diameter of 60 cm and depth of 130 cm; the depth of the niche reached to 170 cm (both measured from the modern surface), and it also contained a few animal bones, a few flint artefacts and four ceramic vessels typical of the Złota group. Książnice is located in the western part of the Małopolska Upland, which only has a few Złota group sites but a stronger presence of other, contemporary groups (including variants of the Baden culture).

Wilczyce (site 90, grave 10), Świętokrzyskie province. A rescue excavation in 2001 uncovered a niche grave of the Złota type, which had a round entrance shaft measuring 90 cm in diameter. The grave was some 60-65 cm deep below the modern surface and the bottom of the niche was paved with thin limestone plates, on which remains of three individuals had been placed; two adults, one female and one male, and one child. Four ceramic vessels of Złota group type were deposited in the niche along with the bodies. Wilczyce is located in the Sandomierz Upland, an area with substantial presence of both the Globular Amphora culture and Złota group, as well as the Corded Ware culture from 2800 BCE.

Genetic affinities of the Koszyce individuals and other GAC groups (here including Złota) analyzed in this study. (A) Principal component analysis of previously published and newly sequenced ancient individuals. Ancient genomes were projected onto modern reference populations, shown in gray. (B) Ancestry proportions based on supervised ADMIXTURE analysis (K = 3), specifying Western hunter-gatherers, Anatolian Neolithic farmers, and early Bronze Age steppe populations as ancestral source populations. LP, Late Paleolithic; M, Mesolithic; EN, Early Neolithic; MN, Middle Neolithic; LN, Late Neolithic; EBA, Early Bronze Age; PWC, Pitted Ware culture; TRB, Trichterbecherkultur/Funnelbeaker culture; LBK, Linearbandkeramik/Linear Pottery culture; GAC, Globular Amphora culture; Złota, Złota culture. Image modified to outline in red GAC and Złota groups.

To further investigate the ancestry of the Globular Amphora individuals, we performed a supervised ADMIXTURE (6) analysis, specifying typical western European hunter-gatherers (Loschbour), early Neolithic Anatolian farmers (Barcın), and early Bronze Age steppe populations (Yamnaya) as ancestral source populations (Fig. 2B). The results indicate that the Globular Amphora/Złota group individuals harbor ca. 30% western hunter-gatherer and 70% Neolithic farmer ancestry, but lack steppe ancestry. To formally test different admixture models and estimate mixture proportions, we then used qpAdm (7) and find that the Polish Globular Amphora/Złota group individuals can be modeled as a mix of western European hunter-gatherer (17%) and Anatolian Neolithic farmer (83%) ancestry (SI Appendix, Table S2), mirroring the results of previous studies.

Table S2. qpADM results. The ancestry of most Globular Amphora/Złota group individuals
can be modelled as a two-way mixture of Mesolithic western hunter-gatherers (WHG), and early Anatolian Neolithic farmers (Barcın). The five individuals from Książnice (Złota group) show evidence for additional gene flow, most likely from an eastern source.

The lack of a direct genetic connection of Corded Ware peoples with the Złota group despite their common “steppe-like traits” – shared with Yamna – reveals, once more, how the few “Yamna-like” traits of Corded Ware do not support a direct connection with Indo-Europeans, and are the result of the expansion of the so-called steppe package all over Europe, and particularly among cultures closely related to the Khvalynsk expansion, and later under the influence of expanding Yamna peoples.

The results from Książnice may support that early Corded Ware peoples were in close contact with GAC peoples in Lesser Poland during the complex period of GAC-Trypillia-CWC interactions, and especially close to the Złota group at the beginning of the 3rd millennium BC. Nevertheless, patrilineal clans of Złota apparently correspond to Globular Amphorae populations, with the only male sample available yet being within haplogroup I2a-L801, prevalent in GAC.

NOTE. The ADMIXTURE of Złota samples in common with GAC samples (and in contrast with the shared Sredni Stog – Corded Ware “steppe ancestry”) makes the possibility of R1a-M417 popping up in the Złota group from now on highly unlikely. If it happened, that would complicate further the available picture of unusually diverse patrilineal clans found among Uralic speakers expanding with early Corded Ware groups, in contrast with the strict patrilineal and patrilocal culture of Indo-Europeans as found in Repin, Yamna and Bell Beakers.

Once again the traditional links between groups hypothesized by archaeologists – like Gimbutas and Kristiansen in this case – are wrong, as is the still fashionable trend in descriptive archaeology, of supporting 1) wide cultural relationships in spite of clear-cut inter-cultural differences (and intra-cultural uniformity kept over long distances by genetically-related groups), 2) peaceful interactions among groups based on few common traits, and 3) regional population continuities despite cultural change. These generalized ideas made some propose a steppe language shared between Pontic-Caspian groups, most of which have been proven to be radically different in culture and genetics.

The background shading indicates the tree migratory waves proposed by Marija Gimbutas, and personally checked by her in 1995. Image from Tassi et al. (2017).

Furthermore, paternal lines show once again marked bottlenecks in expanding Neolithic cultures, supporting their relevance to follow the ethnolinguistic identity of different cultural groups. The steppe- or EHG-related ancestry (if it is in fact from early Corded Ware peoples) in Książnice was thus probably, as in the case of Trypillia, in the form of exogamy with females of neighbouring groups:

The presence of unrelated females and related males in the grave is interesting because it suggests that the community at Koszyce was organized along patrilineal lines of descent, adding to the mounting evidence that this was the dominant form of social organization among Late Neolithic communities in Central Europe. Usually, patrilineal forms of social organization go hand in hand with female exogamy (i.e., the practice of women marrying outside their social group). Indeed, several studies (11, 12) have shown that patrilocal residence patterns and female exogamy prevailed in several parts of Central Europe during the Late Neolithic. (…) the high diversity of mtDNA lineages, combined with the presence of only a single Y chromosome lineage, is certainly consistent with a patrilocal residence system.

Map of territorial ranges of Funnel Beaker Culture (and its settlement concentrations in Lesser Poland), local Tripolyan groups and Corded Ware Culture settlements (■) at the turn of the 4th/3rd millennia BC.

Since ancient and modern Uralians show predominantly Corded Ware ancestry, and Proto-Uralic must have been in close contact with Proto-Indo-European for a very long time – given the different layers of influence that can be distinguished between them -, it follows as logical consequence that the North Pontic forest-steppes (immediately to the west of the PIE homeland in the Don-Volga-Ural steppes) is the most likely candidate for the expansion of Proto-Uralic, accompanying the spread of Sredni Stog ancestry and a bottleneck under R1a-M417 lineages.

The early TMRCAs in the 4th millennium BC for R1a-M417 and R1a-Z645 support this interpretation, like the R1a-M417 sample found in Sredni Stog. On the other hand, the resurgence of typical GAC-like ancestry in late Corded Ware groups, with GAC lineages showing late TMRCAs in the 3rd millennium BC, proves the disintegration of Corded Ware all over Europe (except in Textile Ceramics- and Abashevo-related groups) as the culture lost its cohesion and different local patrilineal clans used the opportunity to seize power – similar to how eventually I2a-L621 infiltrated eastern (Finno-Ugrian) groups.


Uralic speakers formed clines of Corded Ware ancestry with WHG:ANE populations


The preprint by Jeong et al. (2018) has been published: The genetic history of admixture across inner Eurasia Nature Ecol. Evol. (2019).

Interesting excerpts, referring mainly to Uralic peoples (emphasis mine):

A model-based clustering analysis using ADMIXTURE shows a similar pattern (Fig. 2b and Supplementary Fig. 3). Overall, the proportions of ancestry components associated with Eastern or Western Eurasians are well correlated with longitude in inner Eurasians (Fig. 3). Notable outliers include known historical migrants such as Kalmyks, Nogais and Dungans. The Uralic- and Yeniseian-speaking populations, as well as Russians from multiple locations, derive most of their Eastern Eurasian ancestry from a component most enriched in Nganasans, while Turkic/Mongolic speakers have this component together with another component most enriched in populations from the Russian Far East, such as Ulchi and Nivkh (Supplementary Fig. 3). Turkic/Mongolic speakers comprising the bottom-most cline have a distinct Western Eurasian ancestry profile: they have a high proportion of a component most enriched in Mesolithic Caucasus hunter-gatherers and Neolithic Iranians and frequently harbour another component enriched in present-day South Asians (Supplementary Fig. 4). Based on the PCA and ADMIXTURE results, we heuristically assigned inner Eurasians to three clines: the ‘forest-tundra’ cline includes Russians and all Uralic and Yeniseian speakers; the ‘steppe-forest’ cline includes Turkic- and Mongolic-speaking populations from the Volga and Altai–Sayan regions and Southern Siberia; and the ‘southern steppe’ cline includes the rest of the populations.

The first two PCs summarizing the genetic structure within 2,077 Eurasian individuals. The two PCs generally mirror geography. PC1 separates western and eastern Eurasian populations, with many inner Eurasians in the middle. PC2 separates eastern Eurasians along the northsouth cline and also separates Europeans from West Asians. Ancient individuals (color-filled shapes), including two Botai individuals, are projected onto PCs calculated from present-day individuals.

For the forest-tundra populations, the Nganasan + Srubnaya model is adequate only for the two Volga region populations, Udmurts and Besermyans (Fig. 5 and Supplementary Table 8).

For the other populations west of the Urals, six from the northeastern corner of Europe are modelled with additional Mesolithic Western European hunter-gatherer (WHG) contribution (8.2–11.4%; Supplementary Table 8), while the rest need both WHG and early Neolithic European farmers (LBK_EN; Supplementary Table 2). Nganasan-related ancestry substantially contributes to their gene pools and cannot be removed from the model without a significant decrease in the model fit (4.1–29.0% contribution; χ2 P ≤ 1.68 × 10−5; Supplementary Table 8).

Supplementary Table 8. QpAdm-based admixture modeling of the forest-tundra cline populations. For the 13 populations west of the Urals, we present a four-way admixture model, Nganasan+Srubnaya+WHG+LBK_EN, or its minimal adequate subset. Modified from the article, to include colors for cultures, and underlined best models for Corded Ware ancestry among Uralians.

NOTE. It doesn’t seem like Hungarians can be easily modelled with Nganasan ancestry, though…

For the 4 populations east of the Urals (Enets, Selkups, Kets and Mansi), for which the above models are not adequate, Nganasan + Srubnaya + AG3 provides a good fit (χ2 P ≥ 0.018; Fig. 5 and Supplementary Table 8). Using early Bronze Age populations from the Baikal Lake region (‘Baikal_EBA’; Supplementary Table 2) as a reference instead of Nganasan, the two-way model of Baikal_EBA + Srubnaya provides a reasonable fit (χ2 P ≥ 0.016; Supplementary Table 8) and the three-way model of Baikal_EBA + Srubnaya + AG3 is adequate but with negative AG3 contribution for Enets and Mansi (χ2 P ≥ 0.460; Supplementary Table 8).

Supplementary Table 8. QpAdm-based admixture modeling of the forest-tundra cline populations. For the four populations east of the Urals, we present three admixture models: Baikal_EBA+Srubnaya, Baikal_EBA+Srubnaya+AG3 and Nganasan+Srubnaya+AG3. For each model, we present qpAdm p-value, admixture coefficient estimates and associated 5 cM jackknife standard errors (estimate ± SE). Modified from the article, to include colors for cultures, and underlined best models for Corded Ware ancestry among Uralians.

Bronze/Iron Age populations from Southern Siberia also show a similar ancestry composition with high ANE affinity (Supplementary Table 9). The additional ANE contribution beyond the Nganasan + Srubnaya model suggests a legacy from ANE-ancestry-rich clines before the Late Bronze Age.

Supplementary Table 9. QpAdm-based admixture modeling of Bronze and Iron Age populations of southern Siberia. For ancieint individuals associated with Karasuk and Tagar cultures, Nganasan+Srubnaya model is insufficient. For all five groups, adding AG3 as the third ancestry or substituting Nganasan with Baikal_EBA with higher ANE affinity provides an adequate model. For each model, we present qpAdm p-value, admixture coefficient estimates and associated 5 cM jackknife standard errors (estimate ± SE). Models with p-value ≥ 0.05 are highlighted in bold face. Modified from the article, to include colors for cultures, and underlined best models for Corded Ware ancestry among Uralians.

Lara M. Cassidy comments the results of the study in A steppe in the right direction (you can read it here):

Even among the earliest available inner Eurasian genomes, east–west connectivity is evident. These, too, form a longitudinal cline, characterized by the easterly increase of a distinct ancestry, labelled Ancient North Eurasian (ANE), lowest in western European hunter-gatherers (WHG) and highest in Palaeolithic Siberians from the Baikal region. Flow-through from this ANE cline is seen in steppe populations until at least the Bronze Age, including the world’s earliest known horse herders — the Botai. However, this is eroded over time by migration from west and east, following agricultural adoption on the continental peripheries (Fig. 1b,c).

Strikingly, Jeong et al. model the modern upper steppe cline as a simple two-way mixture between western Late Bronze Age herders and Northeast Asians (Fig. 1c), with no detectable residue from the older ANE cline. They propose modern steppe peoples were established mainly through migrations post-dating the Bronze Age, a sequence for which has been recently outlined using ancient genomes. In contrast, they confirm a substantial ANE legacy in modern Siberians of the northernmost cline, a pattern mirrored in excesses of WHG ancestry west of the Urals (Fig. 1b). This marks the inhospitable biome as a reservoir for older lineages, an indication that longstanding barriers to latitudinal movement may indeed be at work, reducing the penetrance of gene flows further south along the steppe.

The genomic formation of inner Eurasians. b–d, Depiction of the three main clines of ancestry identified among Inner Eurasians. Sources of admixture for each cline are represented using proxy ancient populations, both sampled and hypothesised, based on the study’s modelling results. The major eastern and western ancestries used to model each cline are shown in bold; the peripheral admixtures that gave rise to these are also shown. Additional contributions to subsections of each cline are marked with dashed lines. b, The northernmost cline, illustrating the legacy of WHG and ANE-related populations. c,d, The upper (c) and lower (d) steppe clines are shown, both of which have substantial eastern contributions related to modern Tungusic speakers. The authors propose these populations are themselves the result of an admixture between groups related to the Nganasan, whose ancestors potentially occupied a wider range, and hunter-gatherers (HGs) from the Amur River Basin. While the upper steppe cline in c can be described as a mixture between this eastern ancestry and western steppe herders, the current model for the southern steppe cline as shown in d is not adequate and is likely confounded by interactions with diverse bordering ancestries. Credit: Ecoregions 2017, Resolve

Given the findings as reported in the paper, I think it should be much easier to describe different subclines in the “northernmost cline” than in the much more recent “Turkic/Mongolic cline”, which is nevertheless subdivided in this paper in two clines. As an example, there are at least two obvious clines with “Nganasan-related meta-populations” among Uralians, which converge in a common Steppe MLBA (i.e. Corded Ware) ancestry – one with Palaeo-Laplandic peoples, and another one with different Palaeo-Siberian populations:

PCA of ancient and modern Eurasian samples. Ancient Palaeo-Laplandic, Palaeosiberian, and Altai clines drawn, with modern populations labelled. See a version with higher resolution.

The inclusion of certain Eurasian groups (or lack thereof) in the PCA doesn’t help to distinguish these subclines visually, and I guess the tiny “Naganasan-related” ancestral components found in some western populations (e.g. the famous ~5% among Estonians) probably don’t lend themselves easily to further subdivisions. Notice, nevertheless, the different components of the Eastern Eurasian source populations among Finno-Ugrians:

Characterization of the Western and Eastern Eurasian source ancestries in inner Eurasian populations. [Modified from the paper, includes only Uralic populations]. a, Admixture f3 values are compared for different Eastern Eurasian (Mixe, Nganasan and Ulchi; green) and Western Eurasian references (Srubnaya and Chalcolithic Iranians (Iran_ChL); red). For each target group, darker shades mark more negative f3 values. b, Weights of donor populations in two sources characterizing the main admixture signal (date 1 and PC1) in the GLOBETROTTER analysis. We merged 167 donor populations into 12 groups (top right). Target populations were split into five groups (from top to bottom): Aleuts; the forest-tundra cline populations; the steppe-forest cline populations; the southern steppe cline populations; and ‘others’.

Also remarkable is the lack of comparison of Uralic populations with other neighbouring ones, since the described Uralic-like ancestry of Russians was already known, and is most likely due to the recent acculturation of Uralic-speaking peoples in the cradle of Russians, right before their eastward expansions.

Supplementary Fig. 4. ADMIXTURE results qualitatively support PCA-based grouping of inner Eurasians into three clines. (A) Most southern steppe cline populations derive a higher proportion of their total Western Eurasian ancestry from a source related to Caucasus, Iran and South Asian populations. (B) Turkic- and Mongolic-speaking populations tend to derive their Eastern Eurasian ancestry more from the Devil’s Gate related one than from Nganasan-related one, while the opposite is true for Uralic- and Yeiseian-speakers. To estimate overall western Eurasian ancestry proportion, we sum up four components in our ADMIXTURE results (K=14), which are the dominant components in Neolithic Anatolians (“Anatolia_N”), Mesolithic western European hunter-gatherers (“WHG”), early Holocene Caucasus hunter-gatherers (“CHG”) and Mala from southern India, respectively. The “West / South Asian ancestry” is a fraction of it, calculated by summing up the last two components. To estimate overall Eastern Eurasian ancestry proportion, we sum up six components, most prevalent in Surui, Chipewyan, Itelmen, Nganasan, Atayal and early Neolithic Russian Far East individuals (“Devil’s Gate”). Eurasians into three clines. (A) Most southern steppe cline populations derive a higher proportion of their total Western Eurasian ancestry from a source related to Caucasus, Iran and South Asian populations. (B) Turkic- and Mongolic-speaking populations tend to derive their Eastern Eurasian ancestry more from the Devil’s Gate related one than from Nganasan-related one, while the opposite is true for Uralic- and Yeiseian-speakers. To estimate overall western Eurasian ancestry proportion, we sum up four components in our ADMIXTURE results (K=14), which are the dominant components in Neolithic Anatolians (“Anatolia_N”), Mesolithic western European hunter-gatherers (“WHG”), early Holocene Caucasus hunter-gatherers (“CHG”) and Mala from southern India, respectively. The “West / South Asian ancestry” is a fraction of it, calculated by summing up the last two components. To estimate overall Eastern Eurasian ancestry proportion, we sum up six components, most prevalent in Surui, Chipewyan, Itelmen, Nganasan, Atayal and early Neolithic Russian Far East individuals (“Devil’s Gate”).

A comparison of Estonians and Finns with Balts, Scandinavians, and Eastern Europeans would have been more informative for the division of the different so-called “Nganasan-like meta-populations”, and to ascertain which one of these ancestral peoples along the ancient WHG:ANE cline could actually be connected (if at all) to the Cis-Urals.

Because, after all, based on linguistics and archaeology, geneticists are not supposed to be looking for populations from the North Asian Arctic region, for “Siberian ancestry”, or for haplogroup N1c – despite previous works by their peers – , but for the Bronze Age Volga-Kama region…


How the genocidal Yamnaya men loved to switch cultures


After some really interesting fantasy full of arrows, it seems Kristiansen & friends are coming back to their most original idea from 2015, now in New Scientist’s recent clickbait Story of most murderous people of all time revealed in ancient DNA (2019):

Teams led by David Reich at Harvard Medical School and Eske Willerslev at the University of Copenhagen in Denmark announced, independently, that occupants of Corded Ware graves in Germany could trace about three-quarters of their genetic ancestry to the Yamnaya. It seemed that Corded Ware people weren’t simply copying the Yamnaya; to a large degree they actually were Yamnayan in origin.

If you think you have seen that movie, it’s because you have. They are at it again, Corded Ware from Yamna, and more “steppe ancestry” = “more Indo-European. It seems we haven’t learnt anything about “Steppe ancestry” since 2015. But there’s more:

Genocidal peoples who “switch cultures”

Burial practices shifted dramatically, a warrior class appeared, and there seems to have been a sharp upsurge in lethal violence. “I’ve become increasingly convinced there must have been a kind of genocide,” says Kristian Kristiansen at the University of Gothenburg, Sweden.

The collaboration revealed that the origin and initial spread of Bell Beaker culture had little to do – at least genetically – with the expansion of the Yamnaya or Corded Ware people into central Europe. “It started in It is in that region that the earliest Bell Beaker objects – including arrowheads, copper daggers and distinctive Bell-shaped pots – have been found, in archaeological sites carbon-dated to 4700 years ago. Then, Bell Beaker culture began to spread east, although the people more or less stayed put. By about 4600 years ago, it reached the most westerly Corded Ware people around where the Netherlands now lies. For reasons still unclear, the Corded Ware people fully embraced it. “They simply take on part of the Bell Beaker package and become Beaker people,” says Kristiansen.

The fact that the genetic analysis showed the Britons then all-but disappeared within a couple of generations might be significant. It suggests the capacity for violence that emerged when the Yamnaya lived on the Eurasia steppe remained even as these people moved into Europe, switched identity from Yamnaya to Corded Ware, and then switched again from Corded Ware to Bell Beaker.

Notice what Kristiansen did there? Yamnaya men “switched identities” into Corded Ware, then “switched identities” into Bell Beakers…So, the most aggresive peoples who have ever existed, exterminating all other Europeans, were actually not so violent when embracing wholly different cultures whose main connection is that they built kurgans (yes, Gimbutas lives on).

NOTE. By the way, just so we are clear, only Indo-Europeans are “genocidal”. Not like Neolithic farmers, or Palaeolithic or Mesolithic populations, or more recent Bronze Age or Iron Age peoples, who also replaced Y-DNA from many regions…


In fact, there is much stronger evidence that these Yamnaya Beakers were ruthless. By about 4500 years ago, they had pushed westwards into the Iberian Peninsula, where the Bell Beaker culture originated a few centuries earlier. Within a few generations, about 40 per cent of the DNA of people in the region could be traced back to the incoming Yamnaya Beakers, according to research by a large team including Reich that was published this month. More strikingly, the ancient DNA analysis reveals that essentially all the men have Y chromosomes characteristic of the Yamnaya, suggesting only Yamnaya men had children.

“The collision of these two populations was not a friendly one, not an equal one, but one where the males from outside were displacing local males and did so almost completely,” Reich told New Scientist Live in September. This supports Kristiansen’s view of the Yamnaya and their descendants as an almost unimaginably violent people. Indeed, he is about to publish a paper in which he argues that they were responsible for the genocide of Neolithic Europe’s men. “It’s the only way to explain that no male Neolithic lines survived,” he says.

So these unimaginably violent Yamnaya men had children exclusively with their Y chromosomes…but not Dutch Single Grave peoples. These great great steppe-like northerners switched culture, cephalic index…and Y-chromosome from R1a (and others) to R1b-L151 to expand Italo-Celtic From The West™.

It’s hilarious how (exactly like their latest funny episode of PIE from south of the Caucasus) this new visionary idea copied by Copenhagen from amateur friends (or was it the other way around?) had been already rejected before this article came out, in Olalde et al. (2019), and that “Corded Ware=Indo-European” fans have become a parody of themselves.

What’s not to love about 2019 with all this back-and-forth hopping between old and new pet theories?

NOTE. I would complain (again) that the obsessive idea of the Danes is that Denmark CWC is (surprise!) the Pre-Germanic community, so it has nothing to do with “steppe ancestry = Indo-European” (or even with “Corded Ware = Indo-European”, for that matter), but then again you have Koch still arguing for Celtic from the West, Kortlandt still arguing for Balto-Slavic from the east, and – no doubt worst of all – “R1a=IE / R1b=Vasconic / N1c=Uralic” ethnonationalists arguing for whatever is necessary right now, in spite of genetic research.

So prepare for the next episode in the nativist and haplogroup fetishist comedy, now with western and eastern Europeans hand in hand: Samara -> Khvalynsk -> Yamnaya -> Bell Beaker spoke Vasconic-Tyrsenian, because R1b. Wait for it…

Vanguard Yamnaya groups

On a serious note, interesting comment by Heyd in the article:

A striking example of this distinction is a discovery made near the town of Valencina de la Concepción in southern Spain. Archaeologists working there found a Yamnaya-like kurgan, below which was the body of a man buried with a dagger and Yamnaya-like sandals, and decorated with red pigment just as Yamnaya dead were. But the burial is 4875 years old and genetic information suggests Yamnaya-related people didn’t reach that far west until perhaps 4500 years ago. “Genetically, I’m pretty sure this burial has nothing to do with the Yamnaya or the Corded Ware,” says Heyd. “But culturally – identity-wise – there is an aspect that can be clearly linked with them.” It would appear that the ideology, lifestyle and death rituals of the Yamnaya could sometimes run far ahead of the migrants.

NOTE. I have been trying to find which kurgan is this, reviewing this text on the archaeological site, but didn’t find anything beyond occasional ochre and votive sandals, which are usual. Does some reader know which one is it?

Yamna expansion and succeeding East Bell Beaker expansion, without color on Bell Beaker territories. Notice vanguard Yamna groups in blue where East Bell Beakers later emerge. See original image with Bell Beaker territories.

Notice how, if you add all those vanguard Yamna findings of Central and Western Europe, including this one from southern Spain, you begin to get a good idea of the territories occupied by East Bell Beakers expanding later. More or less like vanguard Abashevo and Sintashta finds in the Zeravshan valley heralded the steppe-related Srubna-Andronovo expansions in Turan…

It doesn’t seem like Proto-Beaker and Yamna just “crossed paths” at some precise time around the Lower Danube, and Yamna men “switched cultures”. It seems that many Yamna vanguard groups, probably still in long-distance contact with Yamna settlers from the Carpathian Basin, were already settled in different European regions in the first half of the 3rd millennium BC, before the explosive expansion of East Bell Beakers ca. 2500 BC. As Heyd says, there are potentially many Yamna settlements along the Middle and Lower Danube and tributaries not yet found, connecting the Carpathian Basin to Western and Northern Europe.

These vanguard groups would have more easily transformed their weakened eastern Yamna connections with the fashionable Proto-Beaker package expanding from the west (and surrounding all of these loosely connected settlements), just like the Yamna materials from Seville probably represent a close cultural contact of Chalcolithic Iberia with a Yamna settlement (the closest known site with Yamna traits is near Alsace, where high Yamna ancestry is probably going to be found in a Bell Beaker R1b-L151 individual).

This does not mean that there wasn’t a secondary full-scale migration from the Carpathian Basin and nearby settlements, just like Corded Ware shows a secondary (A-horizon?) migration to the east with R1a-Z645. It just means that there was a complex picture of contacts between Yamna and European Chalcolithic groups before the expansion of Bell Beakers. Doesn’t seem genocidal enough for a popular movie, tho.


Arrival of steppe ancestry with R1b-P312 in the Mediterranean: Balearic Islands, Sicily, and Iron Age Sardinia


New preprint The Arrival of Steppe and Iranian Related Ancestry in the Islands of the Western Mediterranean by Fernandes, Mittnik, Olalde et al. bioRxiv (2019)

Interesting excerpts (emphasis in bold; modified for clarity):

Balearic Islands: The expansion of Iberian speakers

Mallorca_EBA dates to the earliest period of permanent occupation of the islands at around 2400 BCE. We parsimoniously modeled Mallorca_EBA as deriving 36.9 ± 4.2% of her ancestry from a source related to Yamnaya_Samara; (…). We next used qpAdm to identify “proximal” sources for Mallorca_EBA’s ancestry that are more closely related to this individual in space and time, and found that she can be modeled as a clade with the (small) subset of Iberian Bell Beaker culture associated individuals who carried Steppe-derived ancestry (p=0.442).

Suppl. Materials: The model used was with Bell_Beaker_Iberia_highsteppe, a group of outliers from Iberia buried in a Bell Beaker mortuary context who unlike most individuals from this context in that region had high proportions of Steppe ancestry (p=0.442).

Our estimates of Steppe ancestry in the two later Balearic Islands individuals are lower than the earlier one: 26.3 ± 5.1% for Formentera_MBA and 23.1 ± 3.6% for Menorca_LBA, but the Middle to Late Bronze Age Balearic individuals are not a clade relative to non-Balearic groups. Specifically, we find that f4(Mbuti.DG, X; Formentera_MBA, Menorca_LBA) is positive when X=Iberia_Chalcolithic (Z=2.6) or X=Sardinia_Nuragic_BA (Z=2.7). While it is tempting to interpret the latter statistic as suggesting a genetic link between peoples of the Talaiotic culture of the Balearic islands and the Nuragic culture of Sardinia, the attraction to Iberia_Chalcolithic is just as strong, and the mitochondrial haplogroup U5b1+16189+@16192 in Menorca_LBA is not observed in Sardinia_Nuragic_BA but is observed in multiple Iberia_Chalcolithic individuals. A possible explanation is that both the ancestors of Nuragic Sardinians and the ancestors of Talaiotic people from the Balearic Islands received gene flow from an unsampled Iberian Chalcolithic-related group (perhaps a mainland group affiliated to both) that did not contribute to Formentera_MBA.

This sample, like another one in El Argar, is of hg. R1b-P312. So there you are, the data that connects the Proto-Iberian expansion (replacing IE-speaking Bell Beakers) to the Iberian Chalcolithic population, signaled by the increase in Iberian Chalcolithic ancestry after the arrival of Bell Beakers, most likely connected originally to the Argaric and post-Argaric expansions during the MBA.

PCA with previously published ancient individuals (non-filled symbols), projected onto variation from present-day populations (gray squares).

Steppe in Sardinia IA: Phocaeans from Italy?

Most Sardinians buried in a Nuragic Bronze Age context possessed uniparental haplogroups found in European hunter-gatherers and early farmers, including Y-haplogroup R1b1a[xR1b1a1a] which is different from the characteristic R1b1a1a2a1a2 spread in association with the Bell Beaker complex. An exception is individual I10553 (1226-1056 calBCE) who carried Y-haplogroup J2b2a, previously observed in a Croatian Middle Bronze Age individual bearing Steppe ancestry, suggesting the possibility of genetic input from groups that arrived from the east after the spread of first farmers. This is consistent with the evidence of material culture exchange between Sardinians and mainland Mediterranean groups, although genome-wide analyses find no significant evidence of Steppe ancestry so the quantitative demographic impact was minimal.

Another interesting data, these (Mesolithic) remnant R1b-V88 lineages closely related to the Italian Peninsula, the most likely region of expansion of these lineages into Africa, in turn possibly connected to the expansion of Proto-Afroasiatic.

We detect definitive evidence of Iranian-related ancestry in an Iron Age Sardinian I10366 (391-209 calBCE) with an estimate of 11.9 ± 3.7.% Iran_Ganj_Dareh_Neolithic related ancestry, while rejecting the model with only Anatolian_Neolithic and WHG at p=0.0066 (Supplementary Table 9). The only model that we can fit for this individual using a pair of populations that are closer in time is as a mixture of Iberia_Chalcolithic (11.9 ± 3.2%) and Mycenaean (88.1 ± 3.2%) (p=0.067). This model fits even when including Nuragic Sardinians in the outgroups of the qpAdm analysis, which is consistent with the hypothesis that this individual had little if any ancestry from earlier Sardinians.

Proportions of ancestry using a distal qpAdm framework on an individual basis (a), and based on qpWave clusters

Sicily EBA: The Lusitanian/Ligurian connection?

(…) While a previously reported Bell Beaker culture-associated individual from Sicily had no evidence of Steppe ancestry, (…) we find evidence of Steppe ancestry in the Early Bronze Age by ~2200 BCE. In distal qpAdm, the outlier Sicily_EBA11443 is parsimoniously modeled as harboring 40.2 ± 3.5% Steppe ancestry, and the outlier Sicily_EBA8561 is parsimoniously modeled as harboring 23.3 ± 3.5% Steppe ancestry. (…) The presence of Steppe ancestry in Early Bronze Age Sicily is also evident in Y chromosome analysis, which reveals that 4 of the 5 Early Bronze Age males had Steppe-associated Y-haplogroup R1b1a1a2a1a2. (Online Table 1). Two of these were Y-haplogroup R1b1a1a2a1a2a1 (Z195) which today is largely restricted to Iberia and has been hypothesized to have originated there 2500-2000 BCE. This evidence of west-to-east gene flow from Iberia is also suggested by qpAdm modeling where the only parsimonious proximate source for the Steppe ancestry we found in the main Sicily_EBA cluster is Iberians.

What’s this? An ancestral connection between Sicel Elymian and Galaico-Lusitanian or Ligurian (based on an origin in NE Iberia)? Impossible to say, especially if the languages of these early settlers were replaced later by non-Indo-European speakers from the eastern Mediterranean, and by Indo-European speakers from the mainland closely related to Proto-Italic during the LBA, but see below.

Regarding the comment on R1b-Z195, it is associated with modern Iberians, as DF27 in general, due to founder effects beyond the Pyrenees. It is a very old subclade, split directly from DF27 roughly at the same time as it split from the parent P312, i.e. it can be found anywhere in Europe, and it almost certainly accompanied the expansion of Celts from Central Europe under the subclade R1b-M167/SRY2627.

The connection is thus strong only because of the qpAdm modeling, since R1b-DF27 and subclade R1b-Z195 are certainly lineages expanded quite early, most likely with Yamna settlers in Hungary and East Bell Beakers.

In this case, if stemming from Iberia, it is most likely of subclade R1b-Z220 – or another Z195 (xM167) lineage – originally associated with the Old European substrate found in topo-hydronymy in Iberia, whose most likely remnants attested during the Iron Age were Lusitanians.

Left: Modern distribution of R1b-Z195 (YFull estimate 2700 BC); Right: Modern distribution of DF27. Both include later founder effects within Iberia, so the increase in the Basque country and the Crown of Aragon and the decrease in Portugal can safely be ignored. Contour maps of the derived allele frequencies of the SNPs analyzed in Solé-Morata et al. (2017).

We detect Iranian-related ancestry in Sicily by the Middle Bronze Age 1800-1500 BCE, consistent with the directional shift of these individuals toward Mycenaeans in PCA. Specifically, two of the Middle Bronze Age individuals can only be fit with models that in addition to Anatolia_Neolithic and WHG, include Iran_Ganj_Dareh_Neolithic. The most parsimonious model for Sicily_MBA3125 has 18.0 ± 3.6% Iranian-related ancestry (p=0.032 for rejecting the alternative model of Steppe rather than Iranian-related ancestry), and the most parsimonious model for Sicily_MBA has 14.9 ± 3.9% Iranian-related ancestry (p=0.037 for rejecting the alternative model).

The modern southern Italian Caucasus-related signal identified in Raveane et al. (2018) is plausibly related to the same Iranian-related spread of ancestry into Sicily that we observe in the Middle Bronze Age (and possibly the Early Bronze Age).

The non-Indo-European Sicanians and Elymians were possibly then connected to eastern Mediterranean groups before the expansion of the Sea Peoples.

For the Late Bronze Age group of individuals, qpAdm documented Steppe-related ancestry, modeling this group as 80.2 ± 1.8% Anatolia_Neolithic, 5.3 ± 1.6% WHG, and 14.5 ± 2.2% Yamnaya_Samara. Our modeling using sources more closely related in space and time also supports Sicily_LBA having Minoan-related ancestry or being derived from local preceding populations or individuals with ancestries similar to those of Sicily_EBA3123 (p=0.527), Sicily_MBA3124 (p=0.352), and Sicily_MBA3125 (p=0.095).

This increase in Steppe-related ancestry in a western site during the LBA most likely represents either an expansion from the Aegean or – maybe more likely, given the archaeological finds – a regional population similar to Sicily EBA re-emerging or rather being displaced from the eastern part of the island because of a westward movement from nearby Calabria.

Whether this population sampled spoke Indo-European or not at this time is questionable, since the Iron Age accounts show non-IE Elymians in this region.

Actually, Elymians seem to have spoken Indo-European, which fits well with the increase in steppe ancestry.

EDIT (21 MAR): Interesting about a proposed incoming Minoan-like ancestry is the potential origin of the Iran Neolithic-related ancestry that is going to appear in Central Italy during the LBA. This could then be potentially associated with Tyrsenians passing through the area, although the traditional description may be more more compatible with an arrival of Sea Peoples from the Adriatic.

Sad to read this:

This manuscript is dedicated to the memory of Sebastiano Tusa of the Soprintendenza del Mare in Palermo, who would have been an author of this study had he not tragically died in the crash of Ethiopia Airlines flight 302 on March 10.