Predictions about the genetic change from Single Grave to the Late Neolithic in Denmark


New open access paper Mapping human mobility during the third and second millennia BC in present-day Denmark by Frei et al. PLOS One (2019), from the Copenhagen group (including Allentoft, Sikora, and Kristiansen) of samples whose genomic profile will probably be published soon.

Interesting excerpts (emphasis mine):

We present results of the largest multidisciplinary human mobility investigation to date of skeletal remains from present-day Denmark encompassing the 3rd and 2nd millennia BC. Through a multi-analytical approach based on 88 individuals from 37 different archaeological localities in which we combine strontium isotope and radiocarbon analyses together with anthropological investigations, we explore whether there are significant changes in human mobility patterns during this period. Overall, our data suggest that mobility of people seems to have been continuous throughout the 3rd and 2nd millennia BC. However, our data also indicate a clear shift in mobility patterns from around 1600 BC onwards, with a larger variation in the geographical origin of the migrants, and potentially including more distant regions. This shift occurred during a transition period at the beginning of the Nordic Bronze Age at a time when society flourished, expanded and experienced an unprecedented economic growth, suggesting that these aspects were closely related.

Map of present-day Denmark illustrating locations of the burial sites.
The dashed black line marks the maximum advance stage of the last glaciation (Weichselian). Drafted with public domain data from Natural Earth (

Strontium isotope analyses

The results of our strontium isotope analyses are presented in Table 2 and listed in chronological order according to the radiocarbon dates (in sites with multiple individuals we start with the oldest radiocarbon individual). The strontium isotope data set reveals a wide range of values from 87Sr/86Sr = 0.70871 (RISE 23, from the site of Debel) to 87Sr/86Sr = 0.71788 (RISE 20, from the site of Karlstrup). Despite the difficulties of establishing the baseline range some of the herein investigated individuals may be classified as non-locals. A few individuals have tooth enamel signatures that lie just above the upper baseline limit of 87Sr/86Sr = 0.711 and therefore, the classification of these humans as non-locals should be considered with caution. Nevertheless, the significant proportion of individuals with relatively radiogenic values suggest that about a quarter of the individuals studied herein seem to have originated from other places than from those they were buried, and hence implying a continuous degree of mobility during the 3rd and 2nd millennia BC.

Diagram plotting results of strontium isotope ratios versus calibrated radiocarbon dates of the individuals investigated. The grey band shows the “local” baseline.

Middle Neolithic

From the Single Grave Culture (SGC) which is closely related to the Corded Ware Complex in central and eastern Europe and dates from c. 2800 BC to 2200 BC, we analyzed seven of the at least ten individuals who were buried at the site of Gjerrild in eastern Jutland (Fig 1). Gjerrild is a key SGC site, as to date it has provided the most substantial skeletal material pertaining to this culture from present-day Denmark. However, it is not a typical SGC grave, but a megalithic chamber of the so-called “Bøstrup type”. The SGC pottery was decorated with cord or stamp impressions and the stone battle axes were a common feature of male equipment. Such shared traits in the Corded Ware Complex probably reflected shared occupational, social and religious characteristics. Apart from one individual who yielded a Bronze Age date, five individuals date within the period that spans from c. 2600 BC to 2200 BC, hence representing the middle and late SGC phases (Table 1 and S1 File). Of the seven individuals, three males, one female, two infants and one adult (only represented by a disarticulated mandible, and dated to the Bronze Age), all but one yielded strontium isotope signatures that fall within the local baseline range. Only the female (RISE 1283) has a more radiogenic strontium isotope signature of 87Sr/86Sr = 0.7127, which is similar to that of the male from Kyndeløse and might indicate non-local provenance. One of the individuals at Gjerrild, a mature-old adult male, who yielded a local signature (RISE 432) was accompanied by a D-type arrowhead and an amber bead which lay on his right side. He showed signs of inflammation on his lower legs, in particular on the left one. He had a healed trepanation (Fig 2). Another individual (RISE 73a, 1282), an adult male, was found with a type D arrowhead in the sternum (Fig 3).

Strontium isotope, 14C results and sex and age determinations from individuals from the 3rd and 2nd millennia BC from Denmark presented in chronological order. Modified from the paper, see full image.

Late Neolithic I

We sampled individuals from a total of twelve different sites that date to the Late Neolithic period (2300/2250-1700 BC).

One of these sites is Hellested on Zealand (Fig 1 and S1 File), with four flat graves containing five individuals, four young males and one mature adult female. We conducted strontium isotope analyses of enamel from all five individuals, and our results point to two individuals being characterized by local strontium isotope values. One of these individuals, the female, was buried with no grave goods (RISE 53, grave B) while the other, a young male, was buried with a fragmented bone pin (RISE 56, grave F). The other three male individuals (RISE 54, 55, 57) yielded similar strontium isotopic values that lie slightly above the local baseline range. All these individuals had been buried with early flint daggers (type I and II), and one of them (RISE 57, grave A) additionally had a ring-headed pin (Ringkopfnadel) [56]. On the basis of the presence of this ring-headed pin, Lomborg [56] suggested that these individuals had connections with the Únětice culture. Furthermore, three of them have radiocarbon dates that overlap (RISE 55, 56 and 57; Table 1).

Another Late Neolithic site is Juelsberg on the island of Funen (central Denmark, Fig 1 and S1 File) which is a gallery grave that contained at least 19 individuals. We conducted strontium isotope analyses of tooth enamel on 8 out of the 19 individuals and two of them, a male and female, yielded ratios that suggest a non-local origin (RISE 30 and 32). The grave goods comprise a (Lomborg) type I flint dagger but also some non-local type of artefacts. These consist of an early type of bone pin (type 7) mainly found in south-eastern Scandinavia, and a barbed and tanged flint arrowhead of the west-European Bell Beaker type suggesting western connections. The middle adult female (RISE 32) yielded an 87Sr/86Sr = 0.7121 and the mature to old adult male (RISE 30) yielded a 87Sr/86Sr = 0.7112. The different Sr isotope signatures of these individuals imply that they might have originated from different areas, albeit their radiocarbon dates are very similar.

The gallery grave of Marbjerg, Zealand (Fig 1), yielded 17 individuals (S1 File), and we conducted strontium isotope analyses of tooth enamel on 11 of them. The majority of the individuals were males, but females and children, too, were present. Anthropological investigations of the individuals from this site, males as well as females, indicate a relatively high life expectancy with respect to that typical for this period (S1 Table). Our radiocarbon dates revealed that this grave was in use for several hundred years from the Early Late Neolithic (2210–2030 cal BC, RISE 39) to the Late Neolithic /Early Nordic Bronze Age Period (1770–1620 cal BC, RISE 41). Despite the long-term use of this grave, 10 of the 11 individuals studied herein yielded a very narrow and overlapping range of strontium isotope values between 87Sr/86Sr = 0.7096–0.7101. Their values suggest not only that these individuals were local but that their food sources were derived from the same area over the course of several centuries. Only the tooth enamel sample of one individual, a middle to mature adult male (RISE 40), yielded a higher value of 87Sr/86Sr = 0.7117, which seems to suggest a non-local origin.

Predictions about these samples

Strontium isotope analyses only show potential movements during an individual’s lifetime, which is normally useless to assess relevant migrations if the sampling is not big enough (see more on investigating population movements). Still, if a sampling like this one shows many potentially non-local individuals from different parts of Denmark deviating from the baseline at a certain period, you can infer that something is happening within Denmark and in nearby regions.

Strontium isotope results of the 88 investigated individuals including Late Bronze Age individuals investigated previously. The grey band shows the “local” baseline.

Based on what we know now, I bet these are the most likely events in Denmark that marked the Nordic Late Neolithic with its Bell Beaker-related Dagger Period ca. 2400/2300 BC on:

  1. Sudden appearance of R1b-L23 lineages (probably R1b-U106 among them), originally from the Northern European Plain, ultimately from the Danube River Basin. R1a-M417 subclades, possibly prevalent in the previous period, disappear or appear rarely, to resurge later during the Bronze Age probably mostly as hg R1a-Z284, originally from the Battle Axe culture in Sweden, together with I1 – these resurgence events might be shifted to a later phase, though, and there might be some isolated R1a cases in the Danish LN, too.
  2. Shift of Middle Neolithic to Late Neolithic in the PCA away from the Corded Ware cluster and closer to the Bell Beaker cluster – whatever that means exactly for Danish SGC relative to Northern European Beakers, visible especially when enough samples are available.
  3. Evident sign of new incoming ancestry ultimately from Yamnaya-related populations, compared to earlier peoples of Corded Ware ancestry. Yes, even this far north, despite heavy admixture of Yamnaya-like Bell Beakers through exogamy with Corded Ware-like populations all the way to the north from the Danube Basin.

All this will support, once again, the expansion of Bell Beakers from Yamnaya settlers of Central-East Europe. That is probably what I will be reporting about the data as related to the Pre-Proto-Germanic homeland of the Northern European Plain, unless there is some big surprise, for example that R1b-U106 expanded later from Northern Germany, more clearly associated with later Barbed Wire Beakers or even Únětice movements, although I find this very unlikely at this point.

The above predictions are more or less evident to everyone, despite the current mistrust in the Yamnaya – Bell Beaker expansion route of North-West Indo-European, due to the prevalent nativist and/or reactionary trends in hobby population genomics and among academics. My main prediction is therefore about human behaviour:

(1) Seeing how the Copenhagen group started to describe recently South Scandinavian genetic and linguistic prehistory, their conclusions are predictable. From the introduction of this paper:

The 3rd millennium BC stands out as a period of migrations in western Eurasia, as pastoral steppe populations settled in temperate Europe after 2800 BC e.g. [1, 2]. This was also a period of cultural and genetic admixture e.g. [3]. From 1600 BC onwards, southern Scandinavia became more closely linked to the existing European metal trade networks (…)

See what they did there? No mention of the radical change that the Dagger Period brought to Scandinavia, in cultural or genetic terms (see e.g. here or here). Strange how the only thing that Kristiansen has changed since the 1980s – and only after the 2015 genetic papers – is his previous emphasis on the Dagger Period as the most relevant unifying cultural and population movement in Scandinavia, responsible for the formation of a common Nordic language, which is suddenly given as little weight as possible in all his publications, to support some imaginary continuity with the Corded Ware culture (see e.g. here or here).

(2) Only a few males from the Single Grave period are described in this sampling, and they are quite close to the arrival of Bell Beakers, so if someone is looking for closure about the “R1b from Corded Ware”, I bet there won’t be any. As with conspiracy theories of native Vasconic R1b-L51 hidden somewhere in Western Europe even after Olalde et al. (2018) and Olalde et al. (2019), the mythic native Nordic R1b-U106 of Corded Ware will remain hidden in some unsampled Corded Ware group in the minds of many, despite being already found in Bell Beaker-derived European EBA cultures of Bohemia and possibly of Hungary, too (ca. 2500-2200 BC, see SNP calls), apart from the Late Neolithic sample from Lilla Beddinge in Scania (ca. 2275-2032 BC, see SNP calls)

I hope that I am wrong, and that some scholar in the Danish group is capable of reporting the data as it is, even if it contradicts the theories of its leading archaeologist, Kristian Kristiansen. The apparent downplay of the increase in non-local origins of individuals during the Late Neolithic I period as they appear in this paper, as well as their summary of foreign migrations into Denmark which mysteriously stop with the arrival of “Steppe ancestry” ca. 2800 BC, make me think that a change in their narrative is not very likely. The cons of working with academic divos, I guess…


Yamnaya ancestry: mapping the Proto-Indo-European expansions


The latest papers from Ning et al. Cell (2019) and Anthony JIES (2019) have offered some interesting new data, supporting once more what could be inferred since 2015, and what was evident in population genomics since 2017: that Proto-Indo-Europeans expanded under R1b bottlenecks, and that the so-called “Steppe ancestry” referred to two different components, one – Yamnaya or Steppe_EMBA ancestry – expanding with Proto-Indo-Europeans, and the other one – Corded Ware or Steppe_MLBA ancestry – expanding with Uralic speakers.

The following maps are based on formal stats published in the papers and supplementary materials from 2015 until today, mainly on Wang et al. (2018 & 2019), Mathieson et al. (2018) and Olalde et al. (2018), and others like Lazaridis et al. (2016), Lazaridis et al. (2017), Mittnik et al. (2018), Lamnidis et al. (2018), Fernandes et al. (2018), Jeong et al. (2019), Olalde et al. (2019), etc.

NOTE. As in the Corded Ware ancestry maps, the selected reports in this case are centered on the prototypical Yamnaya ancestry vs. other simplified components, so everything else refers to simplistic ancestral components widespread across populations that do not necessarily share any recent connection, much less a language. In fact, most of the time they clearly didn’t. They can be interpreted as “EHG that is not part of the Yamnaya component”, or “CHG that is not part of the Yamnaya component”. They can’t be read as “expanding EHG people/language” or “expanding CHG people/language”, at least no more than maps of “Steppe ancestry” can be read as “expanding Steppe people/language”. Also, remember that I have left the default behaviour for color classification, so that the highest value (i.e. 1, or white colour) could mean anything from 10% to 100% depending on the specific ancestry and period; that’s what the legend is for… But, fere libenter homines id quod volunt credunt.


  1. Neolithic or the formation of Early Indo-European
  2. Eneolithic or the expansion of Middle Proto-Indo-European
  3. Chalcolithic / Early Bronze Age or the expansion of Late Proto-Indo-European
  4. European Early Bronze Age and MLBA or the expansion of Late PIE dialects

1. Neolithic

Anthony (2019) agrees with the most likely explanation of the CHG component found in Yamnaya, as derived from steppe hunter-fishers close to the lower Volga basin. The ultimate origin of this specific CHG-like component that eventually formed part of the Pre-Yamnaya ancestry is not clear, though:

The hunter-fisher camps that first appeared on the lower Volga around 6200 BC could represent the migration northward of un-admixed CHG hunter-fishers from the steppe parts of the southeastern Caucasus, a speculation that awaits confirmation from aDNA.

Natural neighbor interpolation of CHG ancestry among Neolithic populations. See full map.

The typical EHG component that formed part eventually of Pre-Yamnaya ancestry came from the Middle Volga Basin, most likely close to the Samara region, as shown by the sampled Samara hunter-gatherer (ca. 5600-5500 BC):

After 5000 BC domesticated animals appeared in these same sites in the lower Volga, and in new ones, and in grave sacrifices at Khvalynsk and Ekaterinovka. CHG genes and domesticated animals flowed north up the Volga, and EHG genes flowed south into the North Caucasus steppes, and the two components became admixed.

Natural neighbor interpolation of EHG ancestry among Neolithic populations. See full map.

To the west, in the Dnieper-Dniester area, WHG became the dominant ancestry after the Mesolithic, at the expense of EHG, revealing a likely mating network reaching to the north into the Baltic:

Like the Mesolithic and Neolithic populations here, the Eneolithic populations of Dnieper-Donets II type seem to have limited their mating network to the rich, strategic region they occupied, centered on the Rapids. The absence of CHG shows that they did not mate frequently if at all with the people of the Volga steppes (…)

Natural neighbor interpolation of WHG ancestry among Neolithic populations. See full map.

North-West Anatolia Neolithic ancestry, proper of expanding Early European farmers, is found up to border of the Dniester, as Anthony (2007) had predicted.

Natural neighbor interpolation of Anatolia Neolithic ancestry among Neolithic populations. See full map.

2. Eneolithic

From Anthony (2019):

After approximately 4500 BC the Khvalynsk archaeological culture united the lower and middle Volga archaeological sites into one variable archaeological culture that kept domesticated sheep, goats, and cattle (and possibly horses). In my estimation, Khvalynsk might represent the oldest phase of PIE.

(…) this middle Volga mating network extended down to the North Caucasian steppes, where at cemeteries such as Progress-2 and Vonyuchka, dated 4300 BC, the same Khvalynsk-type ancestry appeared, an admixture of CHG and EHG with no Anatolian Farmer ancestry, with steppe-derived Y-chromosome haplogroup R1b. These three individuals in the North Caucasus steppes had higher proportions of CHG, overlapping Yamnaya. Without any doubt, a CHG population that was not admixed with Anatolian Farmers mated with EHG populations in the Volga steppes and in the North Caucasus steppes before 4500 BC. We can refer to this admixture as pre-Yamnaya, because it makes the best currently known genetic ancestor for EHG/CHG R1b Yamnaya genomes.

From Wang et al (2019):

Three individuals from the sites of Progress 2 and Vonyuchka 1 in the North Caucasus piedmont steppe (‘Eneolithic steppe’), which harbour EHG and CHG related ancestry, are genetically very similar to Eneolithic individuals from Khvalynsk II and the Samara region. This extends the cline of dilution of EHG ancestry via CHG-related ancestry to sites immediately north of the Caucasus foothills

Natural neighbor interpolation of Pre-Yamnaya ancestry among Neolithic populations. See full map. This map corresponds roughly to the map of Khvalynsk-Novodanilovka expansion, and in particular to the expansion of horse-head pommel-scepters (read more about Khvalynsk, and specifically about horse symbolism)

NOTE. Unpublished samples from Ekaterinovka have been previously reported as within the R1b-L23 tree. Interestingly, although the Varna outlier is a female, the Balkan outlier from Smyadovo shows two positive SNP calls for hg. R1b-M269. However, its poor coverage makes its most conservative haplogroup prediction R-M343.

The formation of this Pre-Yamnaya ancestry sets this Volga-Caucasus Khvalynsk community apart from the rest of the EHG-like population of eastern Europe.

Natural neighbor interpolation of non-Pre-Yamnaya EHG ancestry among Eneolithic populations. See full map.

Anthony (2019) seems to rely on ADMIXTURE graphics when he writes that the late Sredni Stog sample from Alexandria shows “80% Khvalynsk-type steppe ancestry (CHG&EHG)”. While this seems the most logical conclusion of what might have happened after the Suvorovo-Novodanilovka expansion through the North Pontic steppes (see my post on “Steppe ancestry” step by step), formal stats have not confirmed that.

In fact, analyses published in Wang et al. (2019) rejected that Corded Ware groups are derived from this Pre-Yamnaya ancestry, a reality that had been already hinted in Narasimhan et al. (2018), when Steppe_EMBA showed a poor fit for expanding Srubna-Andronovo populations. Hence the need to consider the whole CHG component of the North Pontic area separately:

Natural neighbor interpolation of non-Pre-Yamnaya CHG ancestry among Eneolithic populations. See full map. You can read more about population movements in the late Sredni Stog and closer to the Proto-Corded Ware period.

NOTE. Fits for WHG + CHG + EHG in Neolithic and Eneolithic populations are taken in part from Mathieson et al. (2019) supplementary materials (download Excel here). Unfortunately, while data on the Ukraine_Eneolithic outlier from Alexandria abounds, I don’t have specific data on the so-called ‘outlier’ from Dereivka compared to the other two analyzed together, so these maps of CHG and EHG expansion are possibly showing a lesser distribution to the west than the real one ca. 4000-3500 BC.

Natural neighbor interpolation of WHG ancestry among Eneolithic populations. See full map.

Anatolia Neolithic ancestry clearly spread to the east into the north Pontic area through a Middle Eneolithic mating network, most likely opened after the Khvalynsk expansion:

Natural neighbor interpolation of Anatolia Neolithic ancestry among Eneolithic populations. See full map.
Natural neighbor interpolation of Iran Chl. ancestry among Eneolithic populations. See full map.

Regarding Y-chromosome haplogroups, Anthony (2019) insists on the evident association of Khvalynsk, Yamnaya, and the spread of Pre-Yamnaya and Yamnaya ancestry with the expansion of elite R1b-L754 (and some I2a2) individuals:

Y-DNA haplogroups in West Eurasia during the Early Eneolithic in the Pontic-Caspian steppes. See full map, and see culture, ADMIXTURE, Y-DNA, and mtDNA maps of the Early Eneolithic and Late Eneolithic.

3. Early Bronze Age

Data from Wang et al. (2019) show that Corded Ware-derived populations do not have good fits for Eneolithic_Steppe-like ancestry, no matter the model. In other words: Corded Ware populations show not only a higher contribution of Anatolia Neolithic ancestry (ca. 20-30% compared to the ca. 2-10% of Yamnaya); they show a different EHG + CHG combination compared to the Pre-Yamnaya one.

Supplementary Table 13. P values of rank=2 and admixture proportions in modelling Steppe ancestry populations as a three-way admixture of Eneolithic steppe Anatolian_Neolithic and WHG using 14 outgroups.
Left populations: Test, Eneolithic_steppe, Anatolian_Neolithic, WHG.
Right populations: Mbuti.DG, Ust_Ishim.DG, Kostenki14, MA1, Han.DG, Papuan.DG, Onge.DG, Villabruna, Vestonice16, ElMiron, Ethiopia_4500BP.SG, Karitiana.DG, Natufian, Iran_Ganj_Dareh_Neolithic.

Yamnaya Kalmykia and Afanasievo show the closest fits to the Eneolithic population of the North Caucasian steppes, rejecting thus sizeable contributions from Anatolia Neolithic and/or WHG, as shown by the SD values. Both probably show then a Pre-Yamnaya ancestry closest to the late Repin population.

Modelling results for the Steppe and Caucasus cluster. Admixture proportions based on (temporally and geographically) distal and proximal models, showing additional AF ancestry in Steppe groups and additional gene flow from the south in some of the Steppe groups as well as the Caucasus groups. See tables above. Modified from Wang et al. (2019). Within a blue square, Yamnaya-related groups; within a cyan square, Corded Ware-related groups. Green background behind best p-values. In red circle, SD of AF/WHG ancestry contribution in Afanasevo and Yamnaya Kalmykia, with ranges that almost include 0%.

EBA maps include data from Wang et al. (2018) supplementary materials, specifically unpublished Yamnaya samples from Hungary that appeared in analysis of the preprint, but which were taken out of the definitive paper. Their location among Yamnaya settlers from Hungary is speculative, although most uncovered kurgans in Hungary are concentrated in the Tisza-Danube interfluve.

Natural neighbor interpolation of Pre-Yamnaya ancestry among Early Bronze Age populations. See full map. This map corresponds roughly with the known expansion of late Repin/Yamnaya settlers.

The Y-chromosome bottleneck of elite males from Proto-Indo-European clans under R1b-L754 and some I2a2 subclades, already visible in the Khvalynsk sampling, became even more noticeable in the subsequent expansion of late Repin/early Yamnaya elites under R1b-L23 and I2a-L699:

Y-DNA haplogroups in West Eurasia during the Yamnaya expansion. See full map and maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Early Chalcolithic and Yamnaya Hungary.

Maps of CHG, EHG, Anatolia Neolithic, and probably WHG show the expansion of these components among Corded Ware-related groups in North Eurasia, apart from other cultures close to the Caucasus:

NOTE. For maps with actual formal stats of Corded Ware ancestry from the Early Bronze Age to the modern times, you can read the post Corded Ware ancestry in North Eurasia and the Uralic expansion.

Natural neighbor interpolation of non-Pre-Yamnaya CHG ancestry among Early Bronze Age populations. See full map.
Natural neighbor interpolation of non-Pre-Yamnaya EHG ancestry among Early Bronze Age populations. See full map.
Natural neighbor interpolation of WHG ancestry among Early Bronze Age populations. See full map.
Natural neighbor interpolation of Anatolia Neolithic ancestry among Early Bronze Age populations. See full map.
Natural neighbor interpolation of Iran Chl. ancestry among Early Bronze Age populations. See full map.

4. Middle to Late Bronze Age

The following maps show the most likely distribution of Yamnaya ancestry during the Bell Beaker-, Balkan-, and Sintashta-Potapovka-related expansions.

4.1. Bell Beakers

The amount of Yamnaya ancestry is probably overestimated among populations where Bell Beakers replaced Corded Ware. A map of Yamnaya ancestry among Bell Beakers gets trickier for the following reasons:

  • Expanding Repin peoples of Pre-Yamnaya ancestry must have had admixture through exogamy with late Sredni Stog/Proto-Corded Ware peoples during their expansion into the North Pontic area, and Sredni Stog in turn had probably some Pre-Yamnaya admixture, too (although they don’t appear in the simplistic formal stats above). This is supported by the increase of Anatolia farmer ancestry in more western Yamna samples.
  • Later, Yamnaya admixed through exogamy with Corded Ware-like populations in Central Europe during their expansion. Even samples from the Middle to Upper Danube and around the Lower Rhine will probably show increasing contributions of Steppe_MLBA, at the same time as they show an increasing proportion of EEF-related ancestry.
  • To complicate things further, the late Corded Ware Espersted family (from ca. 2500 BC or later) shows, in turn, what seems like a recent admixture with Yamnaya vanguard groups, with the sample of highest Yamnaya ancestry being the paternal uncle of other individuals (all of hg. R1a-M417), suggesting that there might have been many similar Central European mating networks from the mid-3rd millennium BC on, of (mainly) Yamnaya-like R1b elites displaying a small proportion of CW-like ancestry admixing through exogamy with Corded Ware-like peoples who already had some Yamnaya ancestry.
Natural neighbor interpolation of Yamnaya ancestry among Middle to Late Bronze Age populations (Esperstedt CWC site close to BK_DE, label is hidden by BK_DE_SAN). See full map. You can see how this map correlated with the map of Late Copper Age migrations and Yamanaya into Bell Beaker expansion.

NOTE. Terms like “exogamy”, “male-driven migration”, and “sex bias”, are not only based on the Y-chromosome bottlenecks visible in the different cultural expansions since the Palaeolithic. Despite the scarce sampling available in 2017 for analysis of “Steppe ancestry”-related populations, it appeared to show already a male sex bias in Goldberg et al. (2017), and it has been confirmed for Neolithic and Copper Age population movements in Mathieson et al. (2018) – see Supplementary Table 5. The analysis of male-biased expansion of “Steppe ancestry” in CWC Esperstedt and Bell Beaker Germany is, for the reasons stated above, not very useful to distinguish their mutual influence, though.

Based on data from Olalde et al. (2019), Bell Beakers from Germany are the closest sampled ones to expanding East Bell Beakers, and those close to the Rhine – i.e. French, Dutch, and British Beakers in particular – show a clear excess “Steppe ancestry” due to their exogamy with local Corded Ware groups:

Only one 2-way model fits the ancestry in Iberia_CA_Stp with P-value>0.05: Germany_Beaker + Iberia_CA. Finding a Bell Beaker-related group as a plausible source for the introduction of steppe ancestry into Iberia is consistent with the fact that some of the individuals in the Iberia_CA_Stp group were excavated in Bell Beaker associated contexts. Models with Iberia_CA and other Bell Beaker groups such as France_Beaker (P-value=7.31E-06), Netherlands_Beaker (P-value=1.03E-03) and England_Beaker (P-value=4.86E-02) failed, probably because they have slightly higher proportions of steppe ancestry than the true source population.


The exogamy with Corded Ware-like groups in the Lower Rhine Basin seems at this point undeniable, as is the origin of Bell Beakers around the Middle-Upper Danube Basin from Yamnaya Hungary.

To avoid this excess “Steppe ancestry” showing up in the maps, since Bell Beakers from Germany pack the most Yamnaya ancestry among East Bell Beakers outside Hungary (ca. 51.1% “Steppe ancestry”), I equated this maximum with BK_Scotland_Ach (which shows ca. 61.1% “Steppe ancestry”, highest among western Beakers), and applied a simple rule of three for “Steppe ancestry” in Dutch and British Beakers.

NOTE. Formal stats for “Steppe ancestry” in Bell Beaker groups are available in Olalde et al. (2018) supplementary materials (PDF). I didn’t apply this adjustment to Bk_FR groups because of the R1b Bell Beaker sample from the Champagne/Alsace region reported by Samantha Brunel that will pack more Yamnaya ancestry than any other sampled Beaker to date, hence probably driving the Yamnaya ancestry up in French samples.

The most likely outcome in the following years, when Yamnaya and Corded Ware ancestry are investigated separately, is that Yamnaya ancestry will be much lower the farther away from the Middle and Lower Danube region, similar to the case in Iberia, so the map above probably overestimates this component in most Beakers to the north of the Danube. Even the late Hungarian Beaker samples, who pack the highest Yamnaya ancestry (up to 75%) among Beakers, represent likely a back-migration of Moravian Beakers, and will probably show a contribution of Corded Ware ancestry due to the exogamy with local Moravian groups.

Despite this decreasing admixture as Bell Beakers spread westward, the explosive expansion of Yamnaya R1b male lineages (in words of David Reich) and the radical replacement of local ones – whether derived from Corded Ware or Neolithic groups – shows the true extent of the North-West Indo-European expansion in Europe:

Y-DNA haplogroups in West Eurasia during the Bell Beaker expansion. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Late Copper Age and of the Yamnaya-Bell Beaker transition.

4.2. Palaeo-Balkan

There is scarce data on Palaeo-Balkan movements yet, although it is known that:

  1. Yamnaya ancestry appears among Mycenaeans, with the Yamnaya Bulgaria sample being its best current ancestral fit;
  2. the emergence of steppe ancestry and R1b-M269 in the eastern Mediterranean was associated with Ancient Greeks;
  3. Thracians, Albanians, and Armenians also show R1b-M269 subclades and “Steppe ancestry”.

4.3. Sintashta-Potapovka-Filatovka

Interestingly, Potapovka is the only Corded Ware derived culture that shows good fits for Yamnaya ancestry, despite having replaced Poltavka in the region under the same Corded Ware-like (Abashevo) influence as Sintashta.

This proves that there was a period of admixture in the Pre-Proto-Indo-Iranian community between CWC-like Abashevo and Yamnaya-like Catacomb-Poltavka herders in the Sintashta-Potapovka-Filatovka community, probably more easily detectable in this group because of the specific temporal and geographic sampling available.

Supplementary Table 14. P values of rank=3 and admixture proportions in modelling Steppe ancestry populations as a four-way admixture of distal sources EHG, CHG, Anatolian_Neolithic and WHG using 14 outgroups.
Left populations: Steppe cluster, EHG, CHG, WHG, Anatolian_Neolithic
Right populations: Mbuti.DG, Ust_Ishim.DG, Kostenki14, MA1, Han.DG, Papuan.DG, Onge.DG, Villabruna, Vestonice16, ElMiron, Ethiopia_4500BP.SG, Karitiana.DG, Natufian, Iran_Ganj_Dareh_Neolithic.

Srubnaya ancestry shows a best fit with non-Pre-Yamnaya ancestry, i.e. with different CHG + EHG components – possibly because the more western Potapovka (ancestral to Proto-Srubnaya Pokrovka) also showed good fits for it. Srubnaya shows poor fits for Pre-Yamnaya ancestry probably because Corded Ware-like (Abashevo) genetic influence increased during its formation.

On the other hand, more eastern Corded Ware-derived groups like Sintashta and its more direct offshoot Andronovo show poor fits with this model, too, but their fits are still better than those including Pre-Yamnaya ancestry.

Natural neighbor interpolation of non-Pre-Yamnaya EHG ancestry among Middle to Late Bronze Age populations. See full map.
Natural neighbor interpolation of non-Pre-Yamnaya CHG ancestry among Middle to Late Bronze Age populations. See full map.
Natural neighbor interpolation of Anatolia Neolithic ancestry among Middle to Late Bronze Age populations. See full map.
Natural neighbor interpolation of Iran Chl. ancestry among Middle to Late Bronze Age populations. See full map.

NOTE For maps with actual formal stats of Corded Ware ancestry from the Early Bronze Age to the modern times, you should read the post Corded Ware ancestry in North Eurasia and the Uralic expansion instead.

The bottleneck of Proto-Indo-Iranians under R1a-Z93 was not yet complete by the time when the Sintashta-Potapovka-Filatovka community expanded with the Srubna-Andronovo horizon:

Y-DNA haplogroups in West Eurasia during the European Early Bronze Age. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Early Bronze Age.

4.4. Afanasevo

At the end of the Afanasevo culture, at least three samples show hg. Q1b (ca. 2900-2500 BC), which seemed to point to a resurgence of local lineages, despite continuity of the prototypical Pre-Yamnaya ancestry. On the other hand, Anthony (2019) makes this cryptic statement:

Yamnaya men were almost exclusively R1b, and pre-Yamnaya Eneolithic Volga-Caspian-Caucasus steppe men were principally R1b, with a significant Q1a minority.

Since the only available samples from the Khvalynsk community are R1b (x3), Q1a(x1), and R1a(x1), it seems strange that Anthony would talk about a “significant minority”, unless Q1a (potentially Q1b in the newer nomenclature) will pop up in some more individuals of those ca. 30 new to be published. Because he also mentions I2a2 as appearing in one elite burial, it seems Q1a (like R1a-M459) will not appear under elite kurgans, although it is still possible that hg. Q1a was involved in the expansion of Afanasevo to the east.

Y-DNA haplogroups in West Eurasia during the Middle Bronze Age. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Middle Bronze Age and the Late Bronze Age.

Okunevo, which replaced Afanasevo in the Altai region, shows a majority of hg. Q1b, but also some R1b-M269 samples proper of Afanasevo, suggesting partial genetic continuity.

NOTE. Other sampled Siberian populations clearly show a variety of Q subclades that likely expanded during the Palaeolithic, such as Baikal EBA samples from Ust’Ida and Shamanka with a majority of Q1b, and hg. Q reported from Elunino, Sagsai, Khövsgöl, and also among peoples of the Srubna-Andronovo horizon (the Krasnoyarsk MLBA outlier), and in Karasuk.

From Damgaard et al. Science (2018):

(…) in contrast to the lack of identifiable admixture from Yamnaya and Afanasievo in the CentralSteppe_EMBA, there is an admixture signal of 10 to 20% Yamnaya and Afanasievo in the Okunevo_EMBA samples, consistent with evidence of western steppe influence. This signal is not seen on the X chromosome (qpAdm P value for admixture on X 0.33 compared to 0.02 for autosomes), suggesting a male-derived admixture, also consistent with the fact that 1 of 10 Okunevo_EMBA males carries a R1b1a2a2 Y chromosome related to those found in western pastoralists. In contrast, there is no evidence of western steppe admixture among the more eastern Baikal region region Bronze Age (~2200 to 1800 BCE) samples.

This Yamnaya ancestry has been also recently found to be the best fit for the Iron Age population of Shirenzigou in Xinjiang – where Tocharian languages were attested centuries later – despite the haplogroup diversity acquired during their evolution, likely through an intermediate Chemurchek culture (see a recent discussion on the elusive Proto-Tocharians).

Haplogroup diversity seems to be common in Iron Age populations all over Eurasia, most likely due to the spread of different types of sociopolitical structures where alliances played a more relevant role in the expansion of peoples. A well-known example of this is the spread of Akozino warrior-traders in the whole Baltic region under a partial N1a-VL29-bottleneck associated with the emerging chiefdom-based systems under the influence of expanding steppe nomads.

Y-DNA haplogroups in West Eurasia during the Early Iron Age. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Early Iron Age and Late Iron Age.

Surprisingly, then, Proto-Tocharians from Shirenzigou pack up to 74% Yamnaya ancestry, in spite of the 2,000 years that separate them from the demise of the Afanasevo culture. They show more Yamnaya ancestry than any other population by that time, being thus a sort of Late PIE fossils not only in their archaic dialect, but also in their genetic profile:


The recent intrusion of Corded Ware-like ancestry, as well as the variable admixture with Siberian and East Asian populations, both point to the known intense Old Iranian and Old/Middle Chinese contacts. The scarce Proto-Samoyedic and Proto-Turkic loans in Tocharian suggest a rather loose, probably more distant connection with East Uralic and Altaic peoples from the forest-steppe and steppe areas to the north (read more about external influences on Tocharian).

Interestingly, both R1b samples, MO12 and M15-2 – likely of Asian R1b-PH155 branch – show a best fit for Andronovo/Srubna + Hezhen/Ulchi ancestry, suggesting a likely connection with Iranians to the east of Xinjiang, who later expanded as the Wusun and Kangju. How they might have been related to Huns and Xiongnu individuals, who also show this haplogroup, is yet unknown, although Huns also show hg. R1a-Z93 (probably most R1a-Z2124) and Steppe_MLBA ancestry, earlier associated with expanding Iranian peoples of the Srubna-Andronovo horizon.

All in all, it seems that prehistoric movements explained through the lens of genetic research fit perfectly well the linguistic reconstruction of Proto-Indo-European and Proto-Uralic.