Yamnaya ancestry: mapping the Proto-Indo-European expansions

steppe-ancestry-expansion-europe

The latest papers from Ning et al. Cell (2019) and Anthony JIES (2019) have offered some interesting new data, supporting once more what could be inferred since 2015, and what was evident in population genomics since 2017: that Proto-Indo-Europeans expanded under R1b bottlenecks, and that the so-called “Steppe ancestry” referred to two different components, one – Yamnaya or Steppe_EMBA ancestry – expanding with Proto-Indo-Europeans, and the other one – Corded Ware or Steppe_MLBA ancestry – expanding with Uralic speakers.

The following maps are based on formal stats published in the papers and supplementary materials from 2015 until today, mainly on Wang et al. (2018 & 2019), Mathieson et al. (2018) and Olalde et al. (2018), and others like Lazaridis et al. (2016), Lazaridis et al. (2017), Mittnik et al. (2018), Lamnidis et al. (2018), Fernandes et al. (2018), Jeong et al. (2019), Olalde et al. (2019), etc.

NOTE. As in the Corded Ware ancestry maps, the selected reports in this case are centered on the prototypical Yamnaya ancestry vs. other simplified components, so everything else refers to simplistic ancestral components widespread across populations that do not necessarily share any recent connection, much less a language. In fact, most of the time they clearly didn’t. They can be interpreted as “EHG that is not part of the Yamnaya component”, or “CHG that is not part of the Yamnaya component”. They can’t be read as “expanding EHG people/language” or “expanding CHG people/language”, at least no more than maps of “Steppe ancestry” can be read as “expanding Steppe people/language”. Also, remember that I have left the default behaviour for color classification, so that the highest value (i.e. 1, or white colour) could mean anything from 10% to 100% depending on the specific ancestry and period; that’s what the legend is for… But, fere libenter homines id quod volunt credunt.

Sections:

  1. Neolithic or the formation of Early Indo-European
  2. Eneolithic or the expansion of Middle Proto-Indo-European
  3. Chalcolithic / Early Bronze Age or the expansion of Late Proto-Indo-European
  4. European Early Bronze Age and MLBA or the expansion of Late PIE dialects

1. Neolithic

Anthony (2019) agrees with the most likely explanation of the CHG component found in Yamnaya, as derived from steppe hunter-fishers close to the lower Volga basin. The ultimate origin of this specific CHG-like component that eventually formed part of the Pre-Yamnaya ancestry is not clear, though:

The hunter-fisher camps that first appeared on the lower Volga around 6200 BC could represent the migration northward of un-admixed CHG hunter-fishers from the steppe parts of the southeastern Caucasus, a speculation that awaits confirmation from aDNA.

neolithic-chg-ancestry
Natural neighbor interpolation of CHG ancestry among Neolithic populations. See full map.

The typical EHG component that formed part eventually of Pre-Yamnaya ancestry came from the Middle Volga Basin, most likely close to the Samara region, as shown by the sampled Samara hunter-gatherer (ca. 5600-5500 BC):

After 5000 BC domesticated animals appeared in these same sites in the lower Volga, and in new ones, and in grave sacrifices at Khvalynsk and Ekaterinovka. CHG genes and domesticated animals flowed north up the Volga, and EHG genes flowed south into the North Caucasus steppes, and the two components became admixed.

neolithic-ehg-ancestry
Natural neighbor interpolation of EHG ancestry among Neolithic populations. See full map.

To the west, in the Dnieper-Dniester area, WHG became the dominant ancestry after the Mesolithic, at the expense of EHG, revealing a likely mating network reaching to the north into the Baltic:

Like the Mesolithic and Neolithic populations here, the Eneolithic populations of Dnieper-Donets II type seem to have limited their mating network to the rich, strategic region they occupied, centered on the Rapids. The absence of CHG shows that they did not mate frequently if at all with the people of the Volga steppes (…)

neolithic-whg-ancestry
Natural neighbor interpolation of WHG ancestry among Neolithic populations. See full map.

North-West Anatolia Neolithic ancestry, proper of expanding Early European farmers, is found up to border of the Dniester, as Anthony (2007) had predicted.

neolithic-anatolia-farmer-ancestry
Natural neighbor interpolation of Anatolia Neolithic ancestry among Neolithic populations. See full map.

2. Eneolithic

From Anthony (2019):

After approximately 4500 BC the Khvalynsk archaeological culture united the lower and middle Volga archaeological sites into one variable archaeological culture that kept domesticated sheep, goats, and cattle (and possibly horses). In my estimation, Khvalynsk might represent the oldest phase of PIE.

(…) this middle Volga mating network extended down to the North Caucasian steppes, where at cemeteries such as Progress-2 and Vonyuchka, dated 4300 BC, the same Khvalynsk-type ancestry appeared, an admixture of CHG and EHG with no Anatolian Farmer ancestry, with steppe-derived Y-chromosome haplogroup R1b. These three individuals in the North Caucasus steppes had higher proportions of CHG, overlapping Yamnaya. Without any doubt, a CHG population that was not admixed with Anatolian Farmers mated with EHG populations in the Volga steppes and in the North Caucasus steppes before 4500 BC. We can refer to this admixture as pre-Yamnaya, because it makes the best currently known genetic ancestor for EHG/CHG R1b Yamnaya genomes.

From Wang et al (2019):

Three individuals from the sites of Progress 2 and Vonyuchka 1 in the North Caucasus piedmont steppe (‘Eneolithic steppe’), which harbour EHG and CHG related ancestry, are genetically very similar to Eneolithic individuals from Khvalynsk II and the Samara region. This extends the cline of dilution of EHG ancestry via CHG-related ancestry to sites immediately north of the Caucasus foothills

eneolithic-pre-yamnaya-ancestry
Natural neighbor interpolation of Pre-Yamnaya ancestry among Neolithic populations. See full map. This map corresponds roughly to the map of Khvalynsk-Novodanilovka expansion, and in particular to the expansion of horse-head pommel-scepters (read more about Khvalynsk, and specifically about horse symbolism)

NOTE. Unpublished samples from Ekaterinovka have been previously reported as within the R1b-L23 tree. Interestingly, although the Varna outlier is a female, the Balkan outlier from Smyadovo shows two positive SNP calls for hg. R1b-M269. However, its poor coverage makes its most conservative haplogroup prediction R-M343.

The formation of this Pre-Yamnaya ancestry sets this Volga-Caucasus Khvalynsk community apart from the rest of the EHG-like population of eastern Europe.

eneolithic-ehg-ancestry
Natural neighbor interpolation of non-Pre-Yamnaya EHG ancestry among Eneolithic populations. See full map.

Anthony (2019) seems to rely on ADMIXTURE graphics when he writes that the late Sredni Stog sample from Alexandria shows “80% Khvalynsk-type steppe ancestry (CHG&EHG)”. While this seems the most logical conclusion of what might have happened after the Suvorovo-Novodanilovka expansion through the North Pontic steppes (see my post on “Steppe ancestry” step by step), formal stats have not confirmed that.

In fact, analyses published in Wang et al. (2019) rejected that Corded Ware groups are derived from this Pre-Yamnaya ancestry, a reality that had been already hinted in Narasimhan et al. (2018), when Steppe_EMBA showed a poor fit for expanding Srubna-Andronovo populations. Hence the need to consider the whole CHG component of the North Pontic area separately:

eneolithic-chg-ancestry
Natural neighbor interpolation of non-Pre-Yamnaya CHG ancestry among Eneolithic populations. See full map. You can read more about population movements in the late Sredni Stog and closer to the Proto-Corded Ware period.

NOTE. Fits for WHG + CHG + EHG in Neolithic and Eneolithic populations are taken in part from Mathieson et al. (2019) supplementary materials (download Excel here). Unfortunately, while data on the Ukraine_Eneolithic outlier from Alexandria abounds, I don’t have specific data on the so-called ‘outlier’ from Dereivka compared to the other two analyzed together, so these maps of CHG and EHG expansion are possibly showing a lesser distribution to the west than the real one ca. 4000-3500 BC.

eneolithic-whg-ancestry
Natural neighbor interpolation of WHG ancestry among Eneolithic populations. See full map.

Anatolia Neolithic ancestry clearly spread to the east into the north Pontic area through a Middle Eneolithic mating network, most likely opened after the Khvalynsk expansion:

eneolithic-anatolia-farmer-ancestry
Natural neighbor interpolation of Anatolia Neolithic ancestry among Eneolithic populations. See full map.
eneolithic-iran-chl-ancestry
Natural neighbor interpolation of Iran Chl. ancestry among Eneolithic populations. See full map.

Regarding Y-chromosome haplogroups, Anthony (2019) insists on the evident association of Khvalynsk, Yamnaya, and the spread of Pre-Yamnaya and Yamnaya ancestry with the expansion of elite R1b-L754 (and some I2a2) individuals:

eneolithic-early-y-dna
Y-DNA haplogroups in West Eurasia during the Early Eneolithic in the Pontic-Caspian steppes. See full map, and see culture, ADMIXTURE, Y-DNA, and mtDNA maps of the Early Eneolithic and Late Eneolithic.

3. Early Bronze Age

Data from Wang et al. (2019) show that Corded Ware-derived populations do not have good fits for Eneolithic_Steppe-like ancestry, no matter the model. In other words: Corded Ware populations show not only a higher contribution of Anatolia Neolithic ancestry (ca. 20-30% compared to the ca. 2-10% of Yamnaya); they show a different EHG + CHG combination compared to the Pre-Yamnaya one.

eneolithic-steppe-best-fits
Supplementary Table 13. P values of rank=2 and admixture proportions in modelling Steppe ancestry populations as a three-way admixture of Eneolithic steppe Anatolian_Neolithic and WHG using 14 outgroups.
Left populations: Test, Eneolithic_steppe, Anatolian_Neolithic, WHG.
Right populations: Mbuti.DG, Ust_Ishim.DG, Kostenki14, MA1, Han.DG, Papuan.DG, Onge.DG, Villabruna, Vestonice16, ElMiron, Ethiopia_4500BP.SG, Karitiana.DG, Natufian, Iran_Ganj_Dareh_Neolithic.

Yamnaya Kalmykia and Afanasievo show the closest fits to the Eneolithic population of the North Caucasian steppes, rejecting thus sizeable contributions from Anatolia Neolithic and/or WHG, as shown by the SD values. Both probably show then a Pre-Yamnaya ancestry closest to the late Repin population.

wang-eneolithic-steppe-caucasus-yamnaya
Modelling results for the Steppe and Caucasus cluster. Admixture proportions based on (temporally and geographically) distal and proximal models, showing additional AF ancestry in Steppe groups and additional gene flow from the south in some of the Steppe groups as well as the Caucasus groups. See tables above. Modified from Wang et al. (2019). Within a blue square, Yamnaya-related groups; within a cyan square, Corded Ware-related groups. Green background behind best p-values. In red circle, SD of AF/WHG ancestry contribution in Afanasevo and Yamnaya Kalmykia, with ranges that almost include 0%.

EBA maps include data from Wang et al. (2018) supplementary materials, specifically unpublished Yamnaya samples from Hungary that appeared in analysis of the preprint, but which were taken out of the definitive paper. Their location among Yamnaya settlers from Hungary is speculative, although most uncovered kurgans in Hungary are concentrated in the Tisza-Danube interfluve.

eba-yamnaya-ancestry
Natural neighbor interpolation of Pre-Yamnaya ancestry among Early Bronze Age populations. See full map. This map corresponds roughly with the known expansion of late Repin/Yamnaya settlers.

The Y-chromosome bottleneck of elite males from Proto-Indo-European clans under R1b-L754 and some I2a2 subclades, already visible in the Khvalynsk sampling, became even more noticeable in the subsequent expansion of late Repin/early Yamnaya elites under R1b-L23 and I2a-L699:

chalcolithic-early-y-dna
Y-DNA haplogroups in West Eurasia during the Yamnaya expansion. See full map and maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Early Chalcolithic and Yamnaya Hungary.

Maps of CHG, EHG, Anatolia Neolithic, and probably WHG show the expansion of these components among Corded Ware-related groups in North Eurasia, apart from other cultures close to the Caucasus:

NOTE. For maps with actual formal stats of Corded Ware ancestry from the Early Bronze Age to the modern times, you can read the post Corded Ware ancestry in North Eurasia and the Uralic expansion.

eba-chg-ancestry
Natural neighbor interpolation of non-Pre-Yamnaya CHG ancestry among Early Bronze Age populations. See full map.
eba-ehg-ancestry
Natural neighbor interpolation of non-Pre-Yamnaya EHG ancestry among Early Bronze Age populations. See full map.
eba-whg-ancestry
Natural neighbor interpolation of WHG ancestry among Early Bronze Age populations. See full map.
eba-anatolia-farmer-ancestry
Natural neighbor interpolation of Anatolia Neolithic ancestry among Early Bronze Age populations. See full map.
eba-iran-chl-ancestry
Natural neighbor interpolation of Iran Chl. ancestry among Early Bronze Age populations. See full map.

4. Middle to Late Bronze Age

The following maps show the most likely distribution of Yamnaya ancestry during the Bell Beaker-, Balkan-, and Sintashta-Potapovka-related expansions.

4.1. Bell Beakers

The amount of Yamnaya ancestry is probably overestimated among populations where Bell Beakers replaced Corded Ware. A map of Yamnaya ancestry among Bell Beakers gets trickier for the following reasons:

  • Expanding Repin peoples of Pre-Yamnaya ancestry must have had admixture through exogamy with late Sredni Stog/Proto-Corded Ware peoples during their expansion into the North Pontic area, and Sredni Stog in turn had probably some Pre-Yamnaya admixture, too (although they don’t appear in the simplistic formal stats above). This is supported by the increase of Anatolia farmer ancestry in more western Yamna samples.
  • Later, Yamnaya admixed through exogamy with Corded Ware-like populations in Central Europe during their expansion. Even samples from the Middle to Upper Danube and around the Lower Rhine will probably show increasing contributions of Steppe_MLBA, at the same time as they show an increasing proportion of EEF-related ancestry.
  • To complicate things further, the late Corded Ware Espersted family (from ca. 2500 BC or later) shows, in turn, what seems like a recent admixture with Yamnaya vanguard groups, with the sample of highest Yamnaya ancestry being the paternal uncle of other individuals (all of hg. R1a-M417), suggesting that there might have been many similar Central European mating networks from the mid-3rd millennium BC on, of (mainly) Yamnaya-like R1b elites displaying a small proportion of CW-like ancestry admixing through exogamy with Corded Ware-like peoples who already had some Yamnaya ancestry.
mlba-yamnaya-ancestry
Natural neighbor interpolation of Yamnaya ancestry among Middle to Late Bronze Age populations (Esperstedt CWC site close to BK_DE, label is hidden by BK_DE_SAN). See full map. You can see how this map correlated with the map of Late Copper Age migrations and Yamanaya into Bell Beaker expansion.

NOTE. Terms like “exogamy”, “male-driven migration”, and “sex bias”, are not only based on the Y-chromosome bottlenecks visible in the different cultural expansions since the Palaeolithic. Despite the scarce sampling available in 2017 for analysis of “Steppe ancestry”-related populations, it appeared to show already a male sex bias in Goldberg et al. (2017), and it has been confirmed for Neolithic and Copper Age population movements in Mathieson et al. (2018) – see Supplementary Table 5. The analysis of male-biased expansion of “Steppe ancestry” in CWC Esperstedt and Bell Beaker Germany is, for the reasons stated above, not very useful to distinguish their mutual influence, though.

Based on data from Olalde et al. (2019), Bell Beakers from Germany are the closest sampled ones to expanding East Bell Beakers, and those close to the Rhine – i.e. French, Dutch, and British Beakers in particular – show a clear excess “Steppe ancestry” due to their exogamy with local Corded Ware groups:

Only one 2-way model fits the ancestry in Iberia_CA_Stp with P-value>0.05: Germany_Beaker + Iberia_CA. Finding a Bell Beaker-related group as a plausible source for the introduction of steppe ancestry into Iberia is consistent with the fact that some of the individuals in the Iberia_CA_Stp group were excavated in Bell Beaker associated contexts. Models with Iberia_CA and other Bell Beaker groups such as France_Beaker (P-value=7.31E-06), Netherlands_Beaker (P-value=1.03E-03) and England_Beaker (P-value=4.86E-02) failed, probably because they have slightly higher proportions of steppe ancestry than the true source population.

olalde-iberia-chalcolithic

The exogamy with Corded Ware-like groups in the Lower Rhine Basin seems at this point undeniable, as is the origin of Bell Beakers around the Middle-Upper Danube Basin from Yamnaya Hungary.

To avoid this excess “Steppe ancestry” showing up in the maps, since Bell Beakers from Germany pack the most Yamnaya ancestry among East Bell Beakers outside Hungary (ca. 51.1% “Steppe ancestry”), I equated this maximum with BK_Scotland_Ach (which shows ca. 61.1% “Steppe ancestry”, highest among western Beakers), and applied a simple rule of three for “Steppe ancestry” in Dutch and British Beakers.

NOTE. Formal stats for “Steppe ancestry” in Bell Beaker groups are available in Olalde et al. (2018) supplementary materials (PDF). I didn’t apply this adjustment to Bk_FR groups because of the R1b Bell Beaker sample from the Champagne/Alsace region reported by Samantha Brunel that will pack more Yamnaya ancestry than any other sampled Beaker to date, hence probably driving the Yamnaya ancestry up in French samples.

The most likely outcome in the following years, when Yamnaya and Corded Ware ancestry are investigated separately, is that Yamnaya ancestry will be much lower the farther away from the Middle and Lower Danube region, similar to the case in Iberia, so the map above probably overestimates this component in most Beakers to the north of the Danube. Even the late Hungarian Beaker samples, who pack the highest Yamnaya ancestry (up to 75%) among Beakers, represent likely a back-migration of Moravian Beakers, and will probably show a contribution of Corded Ware ancestry due to the exogamy with local Moravian groups.

Despite this decreasing admixture as Bell Beakers spread westward, the explosive expansion of Yamnaya R1b male lineages (in words of David Reich) and the radical replacement of local ones – whether derived from Corded Ware or Neolithic groups – shows the true extent of the North-West Indo-European expansion in Europe:

chalcolithic-late-y-dna
Y-DNA haplogroups in West Eurasia during the Bell Beaker expansion. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Late Copper Age and of the Yamnaya-Bell Beaker transition.

4.2. Palaeo-Balkan

There is scarce data on Palaeo-Balkan movements yet, although it is known that:

  1. Yamnaya ancestry appears among Mycenaeans, with the Yamnaya Bulgaria sample being its best current ancestral fit;
  2. the emergence of steppe ancestry and R1b-M269 in the eastern Mediterranean was associated with Ancient Greeks;
  3. Thracians, Albanians, and Armenians also show R1b-M269 subclades and “Steppe ancestry”.

4.3. Sintashta-Potapovka-Filatovka

Interestingly, Potapovka is the only Corded Ware derived culture that shows good fits for Yamnaya ancestry, despite having replaced Poltavka in the region under the same Corded Ware-like (Abashevo) influence as Sintashta.

This proves that there was a period of admixture in the Pre-Proto-Indo-Iranian community between CWC-like Abashevo and Yamnaya-like Catacomb-Poltavka herders in the Sintashta-Potapovka-Filatovka community, probably more easily detectable in this group because of the specific temporal and geographic sampling available.

srubnaya-yamnaya-ehg-chg-ancestry
Supplementary Table 14. P values of rank=3 and admixture proportions in modelling Steppe ancestry populations as a four-way admixture of distal sources EHG, CHG, Anatolian_Neolithic and WHG using 14 outgroups.
Left populations: Steppe cluster, EHG, CHG, WHG, Anatolian_Neolithic
Right populations: Mbuti.DG, Ust_Ishim.DG, Kostenki14, MA1, Han.DG, Papuan.DG, Onge.DG, Villabruna, Vestonice16, ElMiron, Ethiopia_4500BP.SG, Karitiana.DG, Natufian, Iran_Ganj_Dareh_Neolithic.

Srubnaya ancestry shows a best fit with non-Pre-Yamnaya ancestry, i.e. with different CHG + EHG components – possibly because the more western Potapovka (ancestral to Proto-Srubnaya Pokrovka) also showed good fits for it. Srubnaya shows poor fits for Pre-Yamnaya ancestry probably because Corded Ware-like (Abashevo) genetic influence increased during its formation.

On the other hand, more eastern Corded Ware-derived groups like Sintashta and its more direct offshoot Andronovo show poor fits with this model, too, but their fits are still better than those including Pre-Yamnaya ancestry.

mlba-ehg-ancestry
Natural neighbor interpolation of non-Pre-Yamnaya EHG ancestry among Middle to Late Bronze Age populations. See full map.
mlba-chg-ancestry
Natural neighbor interpolation of non-Pre-Yamnaya CHG ancestry among Middle to Late Bronze Age populations. See full map.
mlba-anatolia-farmer-ancestry
Natural neighbor interpolation of Anatolia Neolithic ancestry among Middle to Late Bronze Age populations. See full map.
mlba-iran-chl-ancestry
Natural neighbor interpolation of Iran Chl. ancestry among Middle to Late Bronze Age populations. See full map.

NOTE For maps with actual formal stats of Corded Ware ancestry from the Early Bronze Age to the modern times, you should read the post Corded Ware ancestry in North Eurasia and the Uralic expansion instead.

The bottleneck of Proto-Indo-Iranians under R1a-Z93 was not yet complete by the time when the Sintashta-Potapovka-Filatovka community expanded with the Srubna-Andronovo horizon:

early-bronze-age-y-dna
Y-DNA haplogroups in West Eurasia during the European Early Bronze Age. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Early Bronze Age.

4.4. Afanasevo

At the end of the Afanasevo culture, at least three samples show hg. Q1b (ca. 2900-2500 BC), which seemed to point to a resurgence of local lineages, despite continuity of the prototypical Pre-Yamnaya ancestry. On the other hand, Anthony (2019) makes this cryptic statement:

Yamnaya men were almost exclusively R1b, and pre-Yamnaya Eneolithic Volga-Caspian-Caucasus steppe men were principally R1b, with a significant Q1a minority.

Since the only available samples from the Khvalynsk community are R1b (x3), Q1a(x1), and R1a(x1), it seems strange that Anthony would talk about a “significant minority”, unless Q1a (potentially Q1b in the newer nomenclature) will pop up in some more individuals of those ca. 30 new to be published. Because he also mentions I2a2 as appearing in one elite burial, it seems Q1a (like R1a-M459) will not appear under elite kurgans, although it is still possible that hg. Q1a was involved in the expansion of Afanasevo to the east.

middle-bronze-age-y-dna
Y-DNA haplogroups in West Eurasia during the Middle Bronze Age. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Middle Bronze Age and the Late Bronze Age.

Okunevo, which replaced Afanasevo in the Altai region, shows a majority of hg. Q1b, but also some R1b-M269 samples proper of Afanasevo, suggesting partial genetic continuity.

NOTE. Other sampled Siberian populations clearly show a variety of Q subclades that likely expanded during the Palaeolithic, such as Baikal EBA samples from Ust’Ida and Shamanka with a majority of Q1b, and hg. Q reported from Elunino, Sagsai, Khövsgöl, and also among peoples of the Srubna-Andronovo horizon (the Krasnoyarsk MLBA outlier), and in Karasuk.

From Damgaard et al. Science (2018):

(…) in contrast to the lack of identifiable admixture from Yamnaya and Afanasievo in the CentralSteppe_EMBA, there is an admixture signal of 10 to 20% Yamnaya and Afanasievo in the Okunevo_EMBA samples, consistent with evidence of western steppe influence. This signal is not seen on the X chromosome (qpAdm P value for admixture on X 0.33 compared to 0.02 for autosomes), suggesting a male-derived admixture, also consistent with the fact that 1 of 10 Okunevo_EMBA males carries a R1b1a2a2 Y chromosome related to those found in western pastoralists. In contrast, there is no evidence of western steppe admixture among the more eastern Baikal region region Bronze Age (~2200 to 1800 BCE) samples.

This Yamnaya ancestry has been also recently found to be the best fit for the Iron Age population of Shirenzigou in Xinjiang – where Tocharian languages were attested centuries later – despite the haplogroup diversity acquired during their evolution, likely through an intermediate Chemurchek culture (see a recent discussion on the elusive Proto-Tocharians).

Haplogroup diversity seems to be common in Iron Age populations all over Eurasia, most likely due to the spread of different types of sociopolitical structures where alliances played a more relevant role in the expansion of peoples. A well-known example of this is the spread of Akozino warrior-traders in the whole Baltic region under a partial N1a-VL29-bottleneck associated with the emerging chiefdom-based systems under the influence of expanding steppe nomads.

early-iron-age-y-dna
Y-DNA haplogroups in West Eurasia during the Early Iron Age. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Early Iron Age and Late Iron Age.

Surprisingly, then, Proto-Tocharians from Shirenzigou pack up to 74% Yamnaya ancestry, in spite of the 2,000 years that separate them from the demise of the Afanasevo culture. They show more Yamnaya ancestry than any other population by that time, being thus a sort of Late PIE fossils not only in their archaic dialect, but also in their genetic profile:

shirenzigou-afanasievo-yamnaya-andronovo-srubna-ulchi-han

The recent intrusion of Corded Ware-like ancestry, as well as the variable admixture with Siberian and East Asian populations, both point to the known intense Old Iranian and Old/Middle Chinese contacts. The scarce Proto-Samoyedic and Proto-Turkic loans in Tocharian suggest a rather loose, probably more distant connection with East Uralic and Altaic peoples from the forest-steppe and steppe areas to the north (read more about external influences on Tocharian).

Interestingly, both R1b samples, MO12 and M15-2 – likely of Asian R1b-PH155 branch – show a best fit for Andronovo/Srubna + Hezhen/Ulchi ancestry, suggesting a likely connection with Iranians to the east of Xinjiang, who later expanded as the Wusun and Kangju. How they might have been related to Huns and Xiongnu individuals, who also show this haplogroup, is yet unknown, although Huns also show hg. R1a-Z93 (probably most R1a-Z2124) and Steppe_MLBA ancestry, earlier associated with expanding Iranian peoples of the Srubna-Andronovo horizon.

All in all, it seems that prehistoric movements explained through the lens of genetic research fit perfectly well the linguistic reconstruction of Proto-Indo-European and Proto-Uralic.

Related

Volga Basin R1b-rich Proto-Indo-Europeans of (Pre-)Yamnaya ancestry

yamnaya-expansion

New paper (behind paywall) by David Anthony, Archaeology, Genetics, and Language in the Steppes: A Comment on Bomhard, complementing in a favourable way Bomhard’s Caucasian substrate hypothesis in the current issue of the JIES.

NOTE. I have tried to access this issue for some days, but it’s just not indexed in my university library online service (ProQuest) yet. This particular paper is on Academia.edu, though, as are Bomhard’s papers on this issue in his site.

Interesting excerpts (emphasis mine):

Along the banks of the lower Volga many excavated hunting-fishing camp sites are dated 6200-4500 BC. They could be the source of CHG ancestry in the steppes. At about 6200 BC, when these camps were first established at Kair Shak III and Varfolomievka (42 and 28 on Figure 2), they hunted primarily saiga antelope around Dzhangar, south of the lower Volga, and almost exclusively onagers in the drier desert-steppes at Kair-Shak, north of the lower Volga. Farther north at the lower/middle Volga ecotone, at sites such as Varfolomievka and Oroshaemoe hunter-fishers who made pottery similar to that at Kair-Shak hunted onagers and saiga antelope in the desert-steppe, horses in the steppe, and aurochs in the riverine forests. Finally, in the Volga steppes north of Saratov and near Samara, hunter-fishers who made a different kind of pottery (Samara type) and hunted wild horses and red deer definitely were EHG. A Samara hunter-gatherer of this era buried at Lebyazhinka IV, dated 5600-5500 BC, was one of the first named examples of the EHG genetic type (Haak et al. 2015). This individual, like others from the same region, had no or very little CHG ancestry. The CHG mating network had not yet reached Samara by 5500 BC.

morgunova-eneolithic-pontic-caspian
Eneolithic settlements (1–5, 7, 10–16, 20, 22–43, 48, 50), burial grounds (6, 8–9, 17–19, 21, 47, 49) and kurgans (44–46) of the steppe Ural-Volga region: 1 Ivanovka; 2 Turganik; 3 Kuzminki; 4 Mullino; 5 Davlekanovo; 6 Sjezheye (burial ground); 7 Vilovatoe; 8 Ivanovka; 9 Krivoluchye; 10–13 LebjazhinkaI-III-IV-V; 14 Gundorovka; 15–16 Bol. Rakovka I-II; 17–18 Khvalunsk I-II; 19 Lipoviy Ovrag; 20 Alekseevka; 21 Khlopkovskiy; 22 Kuznetsovo I; 23 Ozinki II; 24 Altata; 25 Monakhov I; 26 Oroshaemoe; 27 Rezvoe; 28 Varpholomeevka; 29 Vetelki; 30 Pshenichnoe; 31 Kumuska; 32 Inyasovo; 33 Shapkino VI; 34 Russkoe Truevo I; 35 Tsaritsa I-II; 36 Kamenka I; 37 Kurpezhe-Molla; 38 Istay; 39 Isekiy; 40 Koshalak; 41 Kara-Khuduk; 42 Kair-Shak VI; 43 Kombakte; 44 Berezhnovka I-II; 45 Rovnoe; 46 Politotdelskoe; 47 burial near s. Pushkino; 48 Elshanka; 49 Novoorsk; 50 Khutor Repin. Modified from Morgunova (2014).

But before 4500 BC, CHG ancestry appeared among the EHG hunter-fishers in the middle Volga steppes from Samara to Saratov, at the same time that domesticated cattle and sheep-goats appeared. The Reich lab now has whole-genome aDNA data from more than 30 individuals from three Eneolithic cemeteries in the Volga steppes between the cities of Saratov and Samara (Khlopkov Bugor, Khvalynsk, and Ekaterinovka), all dated around the middle of the fifth millennium BC. Many dates from human bone are older, even before 5000 BC, but they are affected by strong reservoir effects, derived from a diet rich in fish, making them appear too old (Shishlina et al 2009), so the dates I use here accord with published and unpublished dates from a few dated animal bones (not fish-eaters) in graves.

Only three individuals from Khvalynsk are published, and they were first published in a report that did not mention the site in the text (Mathieson et al. 2015), so they went largely unnoticed. Nevertheless, they are crucial for understanding the evolution of the Yamnaya mating network in the steppes. They were mentioned briefly in Damgaard et al (2018) but were not graphed. They were re-analyzed and their admixture components were illustrated in a bar graph in Wang et al (2018: figure 2c), but they are not the principal focus of any published study. All of the authors who examined them agreed that these three Khvalynsk individuals, dated about 4500 BC, showed EHG ancestry admixed substantially with CHG, and not a trace of Anatolian Farmer ancestry, so the CHG was a Hotu-Cave or Kotias-Cave type of un-admixed CHG. The proportion of CHG in the Wang et al. (2018) bar graphs is about 20-30% in two individuals, substantially less CHG than in Yamnaya; but the third Khvalynsk individual had more than 50% CHG, like Yamnaya. The ca. 30 additional unpublished individuals from three middle Volga Eneolithic cemeteries, including Khvalynsk, preliminarily show the same admixed EHG/CHG ancestry in varying proportions. Most of the males belonged to Y-chromosome haplogroup R1b1a, like almost all Yamnaya males, but Khvalynsk also had some minority Y-chromosome haplogroups (R1a, Q1a, J, I2a2) that do not appear or appear only rarely (I2a2) in Yamnaya graves.

eneolithic-steppes
Pontic-Caspian steppe and neighbouring groups in the Neolithic. See full map.

Wang et al. (2018) discovered that this middle Volga mating network extended down to the North Caucasian steppes, where at cemeteries such as Progress-2 and Vonyuchka, dated 4300 BC, the same Khvalynsk-type ancestry appeared, an admixture of CHG and EHG with no Anatolian Farmer ancestry, with steppe-derived Y-chromosome haplogroup R1b. These three individuals in the North Caucasus steppes had higher proportions of CHG, overlapping Yamnaya. Without any doubt, a CHG population that was not admixed with Anatolian Farmers mated with EHG populations in the Volga steppes and in the North Caucasus steppes before 4500 BC. We can refer to this admixture as pre-Yamnaya, because it makes the best currently known genetic ancestor for EHG/CHG R1b Yamnaya genomes. The Progress-2 individuals from North Caucasus steppe graves lived not far from the pre-Maikop farmers of the Belaya valley, but they did not exchange mates, according to their DNA.

The hunter-fisher camps that first appeared on the lower Volga around 6200 BC could represent the migration northward of un-admixed CHG hunter-fishers from the steppe parts of the southeastern Caucasus, a speculation that awaits confirmation from aDNA. After 5000 BC domesticated animals appeared in these same sites in the lower Volga, and in new ones, and in grave sacrifices at Khvalynsk and Ekaterinovka. CHG genes and domesticated animals flowed north up the Volga, and EHG genes flowed south into the North Caucasus steppes, and the two components became admixed. After approximately 4500 BC the Khvalynsk archaeological culture united the lower and middle Volga archaeological sites into one variable archaeological culture that kept domesticated sheep, goats, and cattle (and possibly horses). In my estimation, Khvalynsk might represent the oldest phase of PIE.

eneolithic-early-steppes
Pontic-Caspian steppe and neighbouring groups in the Early Eneolithic. See full map.

Anatolian Farmer ancestry and Yamnaya origins

The Eneolithic Volga-North Caucasus mating network (Khvalynsk/Progress-2 type) exhibited EHG/CHG admixtures and Y-chromosome haplogroups similar to Yamnaya, but without Yamnaya’s additional Anatolian Farmer ancestry. (…)

Like the Mesolithic and Neolithic populations here, the Eneolithic populations of Dnieper-Donets II type seem to have limited their mating network to the rich, strategic region they occupied, centered on the Rapids. The absence of CHG shows that they did not mate frequently if at all with the people of the Volga steppes, a surprising but undeniable discovery. Archaeologists have seen connections in ornament types and in some details of funeral ritual between Dnieper-Donets cemeteries of the Mariupol-Nikol’skoe type and cemeteries in the middle Volga steppes such as Khvalynsk and S’yez’zhe (Vasiliev 1981:122-123). Also their cranio-facial types were judged to be similar (Bogdanov and Khokhlov 2012:212). So it it surprising that their aDNA does not indicate any genetic admixture with Khvalynsk or Progress-2. Also, neither they nor the Volga steppe Eneolithic populations showed any Anatolian Farmer ancestry. (…)

All three of the steppe-admixed exceptions were from the Varna region (Mathieson et al. 2018). One of them was the famous “golden man’ at Varna (Krause et al. 2016), Grave 43, whose steppe ancestry was the most doubtful of the three. If he had steppe ancestry, it was sufficiently distant (five+ generations before him) that he was not a statistically significant outlier, but he was displaced in the steppe direction, away from the central values of the majority of typical Anatolian Farmers at Varna and elsewhere. The other two, at Varna (grave 158, a 5-7-year-old girl) and Smyadovo (grave 29, a male 20-25 years old), were statistically significant outliers who had recent steppe ancestry (consistent with grandparents or great-grandparents) of the EHG/CHG Khvalynsk/Progress-2 type, not of the Dnieper Rapids EHG/WHG type.

(…) I believe that the Suvorovo-Cernavoda I movement into the lower Danube valley and the Balkans about 4300 BC separated early PIE-speakers (pre-Anatolian) from the steppe population that stayed behind in the steppes and that later developed into late PIE and Yamnaya.

This archaeological transition marked the breakdown of the mating barrier between steppe and Anatolian Farmer mating networks. After this 4300-4200 BC event, Anatolian Farmer ancestry began to pop up in the steppes. The currently oldest sample with Anatolian Farmer ancestry in the steppes in an individual at Aleksandriya, a Sredni Stog cemetery on the Donets in eastern Ukraine. Sredni Stog has often been discussed as a possible Yamnaya ancestor in Ukraine (Anthony 2007: 239- 254). The single published grave is dated about 4000 BC (4045– 3974 calBC/ 5215±20 BP/ PSUAMS-2832) and shows 20% Anatolian Farmer ancestry and 80% Khvalynsk-type steppe ancestry (CHG&EHG). His Y-chromosome haplogroup was R1a-Z93, similar to the later Sintashta culture and to South Asian Indo-Aryans, and he is the earliest known sample to show the genetic adaptation to lactase persistence (I3910-T). Another pre-Yamnaya grave with Anatolian Farmer ancestry was analyzed from the Dnieper valley at Dereivka, dated 3600-3400 BC (grave 73, 3634–3377 calBC/ 4725±25 BP/ UCIAMS-186349). She also had 20% Anatolian Farmer ancestry, but she showed less CHG than Aleksandriya and more Dereivka-1 ancestry, not surprising for a Dnieper valley sample, but also showing that the old fifth-millennium-type EHG/WHG Dnieper ancestry survived into the fourth millennium BC in the Dnieper valley (Mathieson et al. 2018).

late-eneolithic-repin
Pontic-Caspian steppe and neighbouring groups in the Late Eneolithic. See full map.

Probably, late PIE (Yamnaya) evolved in the same part of the steppes—the Volga-Caucasus steppes between the lower Don, the lower and middle Volga, and the North Caucasus piedmont—where early PIE evolved, and where appropriate EHG/CHG admixtures and Y-chromosome haplogroups were seen already in the Eneolithic (without Anatolian Farmer). There have always been archaeologists who argued for an origin of Yamnaya in the Volga steppes, including Gimbutas (1963), Merpert (1974), and recently Morgunova (2014), who argued that this was where Repin-type ceramics, an important early Yamnaya pottery type, first appeared in dated contexts before Yamnaya, about 3600 BC. The genetic evidence is consistent with Yamnaya EHG/CHG origins in the Volga-Caucasus steppes. Also, if contact with the Maikop culture was a fundamental cause of the innovations in transport and metallurgy that defined the Yamnaya culture, then the lower Don-North Caucasus-lower Volga steppes, closest to the North Caucasus, would be where the earliest phase is expected.

I would still guess that the Darkveti-Meshoko culture and its descendant Maikop culture established the linguistic ancestor of the Northwest Caucasian languages in approximately the region where they remained. I also accept the general consensus that the appearance of the hierarchical Maikop culture about 3600 BC had profound effects on pre-Yamnaya and early Yamnaya steppe cultures. Yamnaya metallurgy borrowed from the Maikop culture two-sided molds, tanged daggers, cast shaft hole axes with a single blade, and arsenical copper. Wheeled vehicles might have entered the steppes through Maikop, revolutionizing steppe economies and making Yamnaya pastoral nomadism possible after 3300 BC.

For those who still hoped that Proto-Indo-Europeans of Yamnaya/Afanasievo ancestry from the Don-Volga region were associated with the expansion of hg. R1a-M417, in a sort of mythical “R1-rich” Indo-European society, it seems this is going to be yet another prediction based on ancestry magic that goes wrong.

Proto-Indo-Europeans were, however, associated with other subclades beyond R1b-M269, probably (as I wrote recently) R1b-V1636, I2a-L699, Q1a-M25, and R1a-YP1272, but also interestingly some J subclade, so let’s see what surprises the new study on Khvalynsk and Yamnaya settlers from the Carpathian Basin brings…

On the bright side, it is indirectly confirmed that late Sredni Stog formed part of the neighbouring Corded Ware-like populations of ca. 20-30%+ Anatolian farmer ancestry that gave Yamnaya its share (ca. 6-10%), relative to the comparatively unmixed Khvalynsk and late Repin population (as shown by Afanasevo).

In this steppe mating network that opened up after the Khvalynsk expansion, the increasing admixture of Anatolian farmer-related ancestry in Yamnaya from east (ca. 2-10%) to west (ca. 6-15%) points to an exogamy of late Repin males in their western/south-western regions with populations around the Don River basin and beyond (and endogamy within the Yamnaya community), in an evolution relevant for language expansions and language contacts during the Late Eneolithic.

NOTE. “Mating network” is my new preferred term for “ancestry”. Also great to see scholars finally talk about “Pre-Yamnaya” ancestry, which – combined with the distinction of Yamnaya from Corded Ware ancestry – will no doubt help differentiate fine-scale population movements of steppe- and forest-steppe-related populations.

north-pontic-kvityana-dereivka-repin
Modified from Rassamakin (1999), adding red color to Repin expansion. The system of the latest Eneolithic Pointic cultures and the sites of the Zhivotilovo-Volchanskoe type: 1) Volchanskoe; 2) Zhivotilovka; 3) Vishnevatoe; 4) Koisug.

The whole issue of the JIES is centered on Caucasian influences on Early PIE as an Indo-Uralic dialect, and this language contact/substrate is useful to locate the most likely candidates for the Northeast and Northwest Caucasian and the Proto-Indo-European homelands.

On the other hand, it would also be interesting to read a discussion of how this Volga homeland of Middle PIE and Don-Volga-Ural homeland of Late PIE would be reconciled with the known continuous contacts of Uralic with Middle and Late PIE (see here) to locate the most likely Proto-Uralic homeland.

Especially because Corded Ware fully replaced all sub-Neolithic groups to the north and east of Khvalynsk/Yamnaya, like Volosovo, so no other population neighbouring Middle and Late Proto-Indo-Europeans survived into the Bronze Age…

EDIT: For those new to this blog, this information on unpublished samples from the Volga River basin is yet another confirmation of Khokhlov’s report on the R1b-L23 samples from Yekaterinovka, and its confirmation by a co-author of The unique elite Khvalynsk male from a Yekaterinovskiy Cape burial, apart from more support to the newest data placing Yekaterinovka culturally and probably chronologically between Samara and Khvalynsk.

Related

The genetic and cultural barrier of the Pontic-Caspian steppe – forest-steppe ecotone

steppe-forest-steppe-biomes

We know that the Caucasus Mountains formed a persistent prehistoric barrier to cultural and population movements. Nevertheless, an even more persistent frontier to population movements in Europe, especially since the Neolithic, is the Pontic-Caspian steppe – forest-steppe ecotone.

Like the Caucasus, this barrier could certainly be crossed, and peoples and cultures could permeate in both directions, but there have been no massive migrations through it. The main connection between both regions (steppe vs. forest-steppe/forest zone) was probably through its eastern part, through the Samara region in the Middle Volga.

The chances of population expansions crossing this natural barrier anywhere else seem quite limited, with a much less porous crossing region in the west, through the Dnieper-Dniester corridor.

A Persistent ecological and cultural frontier

It is very difficult to think about any culture that transgressed this persistent ecological and cultural frontier: many prehistoric and historical steppe pastoralists did appear eventually in the neighbouring forest-steppe areas during their expansions (e.g. Yamna, Scythians, or Turks), as did forest groups who permeated to the south (e.g. Comb Ware, GAC, or Abashevo), but their respective hold in foreign biomes was mostly temporary, because their cultures had to adapt to the new ecological environment. Most if not all groups originally from a different ecological niche eventually disappeared, subjected to renewed demographic pressure from neighbouring steppe or forest populations…

The Samara region in the Middle Volga may be pointed out as the true prehistoric link between forests and steppes (see David Anthony’s remarks), something reflected in its nature as a prehistoric sink in genetics. This strong forest – forest-steppe – steppe connection was seen in the Eurasian technocomplex, during the expansion of hunter-gatherer pottery, in the expansion of Abashevo peoples to the steppes (in one of the most striking cases of population admixture in the area), with Scythians (visible in the intense contacts with Ananyino), and with Turks (Volga Turks).

steppe-forest-steppe-europe
Simplified map of the distribution of steppes and forest-steppes (Pontic and Pannonian) and xeric grasslands in Eastern Central Europe (with adjoining East European ranges) with their regionalisation as used in the review (Northern—Pannonic—Pontic). Modified from Kajtoch et al. (2016).

Before the emergence of pastoralism, the cultural contacts of the Pontic region (i.e. forest-steppes) with the Baltic were intense. In fact, the connection of the north Pontic area with the Baltic through the Dnieper-Dniester corridor and the Podolian-Volhynian region is essential to understand the spread of peoples of post-Maglemosian and post-Swiderian cultures (to the south), hunter-gatherer pottery (to the north), TRB (to the south), Late Trypillian groups (north), GAC (south), or Comb Ware (south) (see here for Eneolithic movements), and finally steppe ancestry and R1a-Z645 with Corded Ware (north). After the complex interaction of TRB, Trypillia, GAC, and CWC during the expansion of late Repin, this traditional long-range connection is lost and only emerges sporadically, such as with the expansion of East Germanic tribes.

A barrier to steppe migrations into northern Europe

One may think that this barrier was more permeable, then, in the past. However, the frontier is between steppe and forest-steppe ecological niches, and this barrier evolved during prehistory due to climate changes. The problem is, before the drought that began ca. 4000 BC and increased until the Yamna expansion, the steppe territory in the north Pontic region was much smaller, merely a strip of coastal land, compared to its greater size ca. 3300 BC and later.

This – apart from the cultural and technological changes associated with nomadic pastoralism – justifies the traditional connection of the north Pontic forest-steppes to the north, broken precisely after the expansion of Khvalynsk, as the north Pontic area became gradually a steppe region. The strips of north Pontic and Azov steppes and Crimea seem to have had stronger connections to the Northern Caucasus and Northern Caspian steppes than with the neighbouring forest-steppe areas during the Upper Palaeolithic, Mesolithic, and Neolithic.

NOTE. We still don’t know the genetic nature of Mikhailovka or Ezero, steppe-related groups possibly derived from Novodanilovka and Suvorovo close to the Black Sea (which possibly include groups from the Pannonian plains), and how they compare to neighbouring typically forest-steppe cultures of the so-called late Sredni Stog groups, like Dereivka or partly Kvityana.

steppe-forest-steppe-migration-routes
Typical migration routes through European steppes and forest-steppes. Red line represents the persistent cultural and genetic barrier, with the latest evolution in steppe region represented by the shift from dashed line to the north. Arrows show the most common population movements. Modified from Kajtoch et al. (2016).

Despite the Pontic-Caspian steppes and forest-steppes neighbouring each other for ca. 2,000 km, peoples from forested and steppe areas had an obvious advantage in their own regions, most likely due to the specialization of their subsistence economy. While this is visible already in Palaeolithic and Mesolithic hunter-gatherers, the arrival of the Neolithic package in the Pontic-Caspian region incremented the difference between groups, by spreading specialized animal domestication. The appearance of nomadic pastoralism adapted to the steppe, eventually including the use of horses and carts, made the cultural barrier based on the economic know-how even stronger.

Even though groups could still adapt and permeate a different territory (from steppe to forest-steppe/forest and vice-versa), this required an important cultural change, to the extent that it is eventually complicated to distinguish these groups from neighbouring ones (like north-west Pontic Mesolithic or Neolithic groups and their interaction with the steppes, Trypillia-Usatovo, Scythians-Thracians, etc.). In fact, this steppe – forest-steppe barrier is also seen to the east of the Urals, with the distinct expansion of Andronovo and Seima-Turbino/Andronovo-like horizons, which seem to represent completely different ethnolinguistic groups.

As a result of this cultural and genetic barrier, like that formed by the Northern Caucasus:

1) No steppe pastoralist culture (which after the emergence of Khvalynsk means almost invariably horse-riding, chariot-using nomadic herders who could easily pasture their cows in the huge grasslands without direct access to water) has ever been successful in spreading to the north or north-west into northern Europe, until the Mongols. No forest culture has ever been successful in expanding to the steppes, either (except for the infiltration of Abashevo into Sintashta-Potapovka).

2) Corded Ware was not an exception: like hunter-gatherer pottery before it (and like previous population movements of TRB, late Trypillia, GAC, Comb Ware or Lublin-Volhynia settlers) their movements between the north Pontic area and central Europe happened through forest-steppe ecological niches due to their adaptation to them. There is no reason to support a direct connection of CWC with true steppe cultures.

3) The so-called “Steppe ancestry” permeated the steppe – forest-steppe ecotone for hundreds of years during the 5th and early 4th millennium BC, due to the complex interaction of different groups, and probably to the aridization trend that expanded steppe (and probably forest-steppe) to the north. Language, culture, and paternal lineages did not cross that frontier, though.

EDIT (4 FEB 2019): Wang et al. is out in Nature Communications. They deleted the Yamna Hungary samples and related analyses, but it’s interesting to see where exactly they think the trajectory of admixture of Yamna with European MN cultures fits best. This path could also be inferred long ago from the steppe connections shown by the Yamna Hungary -> Bell Beaker evolution and by early Balkan samples:

wang-yamna-connection
Prehistoric individuals projected onto a PCA of 84 modern-day West Eurasian populations (open symbols). Dashed arrows indicate trajectories of admixture: EHG—CHG (petrol), Yamnaya—Central European MN (pink), Steppe—Caucasus (green), and Iran Neolithic—Anatolian Neolithic (brown). Modified from the original, a red circle has been added to the Yamna-Central European MN admixture.

Related

“Steppe ancestry” step by step: Khvalynsk, Sredni Stog, Repin, Yamna, Corded Ware

dzudzuana_pca-large

Wang et al. (2018) is obviously a game changer in many aspects. I have already written about the upcoming Yamna Hungary samples, about the new Steppe_Eneolithic and Caucasus Eneolithic keystones, and about the upcoming Greece Neolithic samples with steppe ancestry.

An interesting aspect of the paper, hidden among so many relevant details, is a clearer picture of how the so-called Yamnaya or steppe ancestry evolved from Samara hunter-gatherers to Yamna nomadic pastoralists, and how this ancestry appeared among Proto-Corded Ware populations.

anatolia-neolithic-steppe-eneolithic
Image modified from Wang et al. (2018). Marked are in orange: equivalent Steppe_Maykop ADMIXTURE; in red, approximate limit of Anatolia_Neolithic ancestry found in Yamna populations; in blue, Corded Ware-related groups. “Modelling results for the Steppe and Caucasus cluster. Admixture proportions based on (temporally and geographically) distal and proximal models, showing additional Anatolian farmer-related ancestry in Steppe groups as well as additional gene flow from the south in some of the Steppe groups as well as the Caucasus groups.”

Please note: arrows of “ancestry movement” in the following PCAs do not necessarily represent physical population movements, or even ethnolinguistic change. To avoid misinterpretations, I have depicted arrows with Y-DNA haplogroup migrations to represent the most likely true ethnolinguistic movements. Admixture graphics shown are from Wang et al. (2018), and also (the K12) from Mathieson et al. (2018).

1. Samara to Early Khvalynsk

The so-called steppe ancestry was born during the Khvalynsk expansion through the steppes, probably through exogamy of expanding elite clans (eventually all R1b-M269 lineages) originally of Samara_HG ancestry. The nearest group to the ANE-like ghost population with which Samara hunter-gatherers admixed is represented by the Steppe_Eneolithic / Steppe_Maykop cluster (from the Northern Caucasus Piedmont).

Steppe_Eneolithic samples, of R1b1 lineages, are probably expanded Khvalynsk peoples, showing thus a proximate ancestry of an Early Eneolithic ghost population of the Northern Caucasus. Steppe_Maykop samples represent a later replacement of this Steppe_Eneolithic population – and/or a similar population with further contribution of ANE-like ancestry – in the area some 1,000 years later.

PCA-caucasus-steppe-samara

This is what Steppe_Maykop looks like, different from Steppe_Eneolithic:

steppe-maykop-admixture

NOTE. This admixture shows how different Steppe_Maykop is from Steppe_Eneolithic, but in the different supervised ADMIXTURE graphics below Maykop_Eneolithic is roughly equivalent to Eneolithic_Steppe (see orange arrow in ADMIXTURE graphic above). This is useful for a simplified analysis, but actual differences between Khvalynsk, Sredni Stog, Afanasevo, Yamna and Corded Ware are probably underestimated in the analyses below, and will become clearer in the future when more ancestral hunter-gatherer populations are added to the analysis.

2. Early Khvalynsk expansion

We have direct data of Khvalynsk-Novodanilovka-like populations thanks to Khvalynsk and Steppe_Eneolithic samples (although I’ve used the latter above to represent the ghost Caucasus population with which Samara_HG admixed).

We also have indirect data. First, there is the PCA with outliers:

PCA-khvalynsk-steppe

Second, we have data from north Pontic Ukraine_Eneolithic samples (see next section).

Third, there is the continuity of late Repin / Afanasevo with Steppe_Eneolithic (see below).

3. Proto-Corded Ware expansion

It is unclear if R1a-M459 subclades were continuously in the steppe and resurged after the Khvalynsk expansion, or (the most likely option) they came from the forested region of the Upper Dnieper area, possibly from previous expansions there with hunter-gatherer pottery.

Supporting the latter is the millennia-long continuity of R1b-V88 and I2a2 subclades in the north Pontic Mesolithic, Neolithic, and Early Eneolithic Sredni Stog culture, until ca. 4500 BC (and even later, during the second half).

Only at the end of the Early Eneolithic with the disappearance of Novodanilovka (and beginning of the steppe ‘hiatus’ of Rassamakin) is R1a to be found in Ukraine again (after disappearing from the record some 2,000 years earlier), related to complex population movements in the north Pontic area.

NOTE. In the PCA, a tentative position of Novodanilovka closer to Anatolia_Neolithic / Dzudzuana ancestry is selected, based on the apparent cline formed by Ukraine_Eneolithic samples, and on the position and ancestry of Sredni Stog, Yamna, and Corded Ware later. A good alternative would be to place Novodanilovka still closer to the Balkan outliers (i.e. Suvorovo), and a source closer to EHG as the ancestry driven by the migration of R1a-M417.

PCA-sredni-stog-steppe

The first sample with steppe ancestry appears only after 4250 BC in the forest-steppe, centuries after the samples with steppe ancestry from the Northern Caucasus and the Balkans, which points to exogamy of expanding R1a-M417 lineages with the remnants of the Novodanilovka population.

steppe-ancestry-admixture-sredni-stog

4. Repin / Early Yamna expansion

We don’t have direct data on early Repin settlers. But we do have a very close representative: Afanasevo, a population we know comes directly from the Repin/late Khvalynsk expansion ca. 3500/3300 BC (just before the emergence of Early Yamna), and which shows fully Steppe_Eneolithic-like ancestry.

afanasevo-admixture

Compared to this eastern Repin expansion that gave Afanasevo, the late Repin expansion to the west ca. 3300 BC that gave rise to the Yamna culture was one of colonization, evidenced by the admixture with north Pontic (Sredni Stog-like) populations, no doubt through exogamy:

PCA-repin-yamna

This admixture is also found (in lesser proportion) in east Yamna groups, which supports the high mobility and exogamy practices among western and eastern Yamna clans, not only with locals:

yamnaya-admixture

5. Corded Ware

Corded Ware represents a quite homogeneous expansion of a late Sredni Stog population, compatible with the traditional location of Proto-Corded Ware peoples in the steppe-forest/forest zone of the Dnieper-Dniester region.

PCA-latvia-ln-steppe

We don’t have a comparison with Ukraine_Eneolithic or Corded Ware samples in Wang et al. (2018), but we do have proximate sources for Abashevo, when compared to the Poltavka population (with which it admixed in the Volga-Ural steppes): Sintashta, Potapovka, Srubna (with further Abashevo contribution), and Andronovo:

sintashta-poltavka-andronovo-admixture

The two CWC outliers from the Baltic show what I thought was an admixture with Yamna. However, given the previous mixture of Eneolithic_Steppe in north Pontic steppe-forest populations, this elevated “steppe ancestry” found in Baltic_LN (similar to west Yamna) seems rather an admixture of Baltic sub-Neolithic peoples with a north Pontic Eneolithic_Steppe-like population. Late Repin settlers also admixed with a similar population during its colonization of the north Pontic area, hence the Baltic_LN – west Yamna similarities.

NOTE. A direct admixture with west Yamna populations through exogamy by the ancestors of this Baltic population cannot be ruled out yet (without direct access to more samples), though, because of the contacts of Corded Ware with west Yamna settlers in the forest-steppe regions.

steppe-ancestry-admixture-latvia

A similar case is found in the Yamna outlier from Mednikarovo south of the Danube. It would be absurd to think that Yamna from the Balkans comes from Corded Ware (or vice versa), just because the former is closer in the PCA to the latter than other Yamna samples. The same error is also found e.g. in the Corded Ware → Bell Beaker theory, because of their proximity in the PCA and their shared “steppe ancestry”. All those theories have been proven already wrong.

NOTE. A similar fallacy is found in potential Sintashta→Mycenaean connections, where we should distinguish statistically that result from an East/West Yamna + Balkans_BA admixture. In fact, genetic links of Mycenaeans with west Yamna settlers prove this (there are some related analyses in Anthrogenica, but the site is down at this moment). To try to relate these two populations (separated more than 1,000 years before Sintashta) is like comparing ancient populations to modern ones, without the intermediate samples to trace the real anthropological trail of what is found…Pure numbers and wishful thinking.

Conclusion

Yamna and Corded Ware show a similar “steppe ancestry” due to convergence. I have said so many times (see e.g. here). This was clear long ago, just by looking at the Y-chromosome bottlenecks that differentiate them – and Tomenable noticed this difference in ADMIXTURE from the supplementary materials in Mathieson et al. (2017), well before Wang et al. (2018).

This different stock stems from (1) completely different ancestral populations + (2) different, long-lasting Y-chromosome bottlenecks. Their similarities come from the two neighbouring cultures admixing with similar populations.

If all this does not mean anything, and each lab was going to support some pre-selected archaeological theories from the 1960s or the 1980s, coupled with outdated linguistic models no matter what – Anthony’s model + Ringe’s glottochronological tree of the early 2000s in the Reich Lab; and worse, Kristiansen’s CWC-IE + Germano-Slavonic models of the 1940s in the Copenhagen group – , I have to repeat my question again:

What’s (so much published) ancient DNA useful for, exactly?

See also

Related

Dzudzuana, Sidelkino, and the Caucasus contribution to the Pontic-Caspian steppe

hunter-gatherer-pottery

It has been known for a long time that the Caucasus must have hosted many (at least partially) isolated populations, probably helped by geographical boundaries, setting it apart from open Eurasian areas.

David Reich writes in his book the following about India:

The genetic data told a clear story. Around a third of Indian groups experienced population bottlenecks as strong or stronger than the ones that occurred among Finns or Ashkenazi Jews. We later confirmed this finding in an even larger dataset that we collected working with Thangaraj: genetic data from more than 250 jati groups spread throughout India (…)

Rather than an invention of colonialism as Dirks suggested, long-term endogamy as embodied in India today in the institution of caste has been overwhelmingly important for millennia. (…)

The Han Chinese are truly a large population. They have been mixing freely for thousands of years. In contrast, there are few if any Indian groups that are demographically very large, and the degree of genetic differentiation among Indian jati groups living side by side in the same village is typically two to three times higher than the genetic differentiation between northern and southern Europeans. The truth is that India is composed of a large number of small populations.

There is little doubt now, based on findings spanning thousands of years, that the Mesolithic and Neolithic Caucasus hosted various very small populations, even if the ancestral components may be reduced to the few known to date (such as ANE, EHG, AME*, ENA, CHG, and other “deep” ancestral components).

NOTE. I will call the ancestral component of Dzudzuana/Anatolian hunter-gatherers Ancient Middle Easterner (AME), to give a clear idea of its likely extension during the Late Upper Palaeolithic, and to avoid using the more simplistic Dzudzuana, unless it is useful to mention these specific local samples.

dzudzuana-pca
Image modified from Lazaridis et al. (2018), including Caucasus, Don-Volga-Ural, and North Pontic Mesolithic-Neolithic populations. “Ancient West Eurasian population structure. (a) Geographical distribution of key ancient West Eurasian populations. (b) Temporal distribution of key ancient West Eurasian populations (approximate date in ky BP). (c) PCA of key ancient West Eurasians, including additional populations (shown with grey shells), in the space of outgroup f4-statistics (Methods).”

Genetic labs have a strong fixation with ancestry. I guess the use of complex statistical methods gives professionals and laymen alike the feeling of dealing with “Science”, as opposed to academic fields where you have to interpret data. I think language reveals a lot about the way people think, and the fact that ancestral components are called ‘lineages’ – while not wrong per se – is a clear symptom of the lack of interest in the true lineages: Y-DNA haplogroups.

Y-DNA bottlenecks

It has become quite clear that male-biased migrations are often the ones which can be confidently followed for actual population movements and ethnolinguistic identification, at least until the Iron Age. The frequently used Palaeolithic clusters offer a clear example of why ancestry does not represent what some people believe: They merely give a basic idea of sizeable population replacements by distant peoples.

Both concepts are important: sizeable and distant peoples. For example, during the Upper Palaeolithic in Europe there was a sizeable population replacement of the Aurignacian Goyet cluster by the Gravettian Vestonice cluster (probably from populations of far eastern Russia) coupled with the arrival of haplogroup I, although during the thousands of years that this material culture lasted, the previously expanded C1a2 lineages did not disappear, and there were probably different resurgence and admixture events.

Haplogroup I certainly expanded with the Gravettian culture to Iberia, where the Goyet ancestry did not change much – probably because of male-driven migrations -, to the extent that during the Magdalenian expansions haplogroup I expanded with an ancestry closer to Goyet, in what is called a ‘resurge’ of the Goyet cluster – even though there is a clear replacement of male lines.

The Villabruna (WHG) cluster is another good example. It probably spread with haplogroup R1b-L754, which – based on the extra ‘East Asian’ affinity of some samples and on modern samples from the Middle East – came probably from the east through a southern route, and not too long before the expansion of WHG likely from around the Black Sea, although this is still unclear. The finding of haplogroup I in samples of mostly WHG ancestry could confuse people that do not care about timing, sub-structured populations, and gene flow.

palaeolithic-expansions-reich
Image from David Reich’s Who We Are and How We Got Here. Having migrated out of Africa and the Near East, modern human pioneer populations spread throughout Eurasia (1). By at least thirty-nine thousand years ago, one group founded a lineage of European hunter-gatherers that persisted largely uninterrupted for more than twenty thousand years (2). Eventually, groups derived from an eastern branch of this founding population of European huntergatherers spread west (3), displaced previous groups, and were eventually themselves pushed out of northern Europe by the spread of glacial ice, shown at its maximum extent (top right). As the glaciers receded, western Europe was repeopled from the southwest (4) by a population that had managed to persist for tens of thousands of years and was related to an approximately thirty-five-thousand-year old individual from far western Europe. A later human migration, following the first strong warming period, had an even larger impact, with a spread from the southeast (5) that not only transformed the population of western Europe but also homogenized the populations of Europe and the Near East. At a single site—Goyet Caves in Belgium—ancient DNA from individuals spread over twenty thousand years reflects these transformations, with representatives from the Aurignacian, Gravettian, and Magdalenian periods.

NOTE. If you don’t understand why ‘clusters’ that span thousands of years don’t really matter for the many Palaeolithic population expansions that certainly happened among hunter-gatherers in Europe, just take a look at what happened with Bell Beakers expanding from Yamna into western Europe within 500 years.

If we don’t thread carefully when talking about population migrations, these terms are bound to confuse people. Just as the fixation on “steppe ancestry” – which marks the arrival in Chalcolithic Europe of peoples from the Pontic-Caspian region – has confused a lot of researchers to this day.

When I began to write about the Indo-European demic diffusion model, my concern was to find a single spot where a North-West Indo-European proto-language could have expanded from ca. 2000 BC (our most common guesstimate). Based on the 2015 papers, and in spite of their conclusions, I thought it had become clear that Corded Ware was not it, and it was rather Bell Beakers. I assumed that Uralic was spoken to the north (as was the traditional belief), and thus Corded Ware expanded from the forest zone, hence steppe ancestry would also be found there with other R1a lineages.

With the publication of Mathieson et al. (2017) and Olalde et al. (2017), I changed my mind, seeing how “steppe ancestry” did in fact appear quite late, hence it was likely to be the result of very specific population movements, probably directly from the Caucasus. Later, Mathieson published in a revision the sample from Alexandria of hg R1a-M417 (probably R1a-Z645, possibly Z93+), which further supported the idea that the migration of Corded Ware peoples started near the North Pontic forest-steppe (as I included in a the next revision).

The question remains the same I repeated recently, though: where do the extra Caucasus components (i.e. beyond EHG) of Eneolithic Ukraine/Corded Ware and Khvalynsk/Yamna come from?

Steppe ancestry: “EHG” + “CHG”?

About EHG ancestry

From Lazaridis et al. (2018):

Considering 2-way mixtures, we can model Karelia_HG as deriving 34 ± 2.8% of its ancestry from a Villabruna-related source, with the remainder mainly from ANE represented by the AfontovaGora3 (AG3) sample from Lake Baikal ~17kya.

AG3 was likely of haplogroup Q1a (as reported by YFull, see Genetiker), and probably the ANE ancestry found in Eastern Europe accompanied a Palaeolithic migration of Q1a2-M25 (formed ca. 22600 BC, TMRCA ca. 14300 BC).

NOTE. You can read more about the expansion of Q lineages during the Palaeolithic.

Combined with what we know about the Eneolithic Steppe and Caucasus populations – it is likely that ANE ancestry remained the most important component of some of the small ghost populations of the Caucasus until their emergence with the Lola culture.

pca-caucasus-dzudzuana
Image modified from Wang et al. (2018). Samples projected in PCA of 84 modern-day West Eurasian populations (open symbols). Previously known clusters have been marked and referenced. Marked and labelled are the Balkan samples referenced in this text An EHG and a Caucasus ‘clouds’ have been drawn, leaving Pontic-Caspian steppe and derived groups between them. See the original file here. To understand the drawn potential Caucasus Mesolithic cluster, see above the PCA from Lazaridis et al. (2018).

The first sample we have now attributed to the EHG cluster is Sidelkino, from the Samara region (ca. 9300 BC), mtDNA U5a2. In Damgaard et al. (Science 2018), Yamnaya could be modelled as a CHG population related to Kotias Klde (54%) and the remaining from ANE population related to Sidelkino (>46%), with the following split events:

  1. A split event, where the CHG component of Yamnaya splits from KK1. The model inferred this time at 27 kya (though we note the larger models in Sections S2.12.4 and S2.12.5 inferred a more recent split time).
  2. A split event, where the ANE component of Yamnaya splits from Sidelkino. This was inferred at about about 11 kya.
  3. A split event, where the ANE component of Yamnaya splits from Botai. We inferred this to occur 17 kya. Note that this is above the Sidelkino split time, so our model infers Yamnaya to be more closely related to the EHG Sidelkino, as expected.
  4. An ancestral split event between the CHG and ANE ancestral populations. This was inferred to occur around 40 kya.

Other samples classified as of the EHG cluster:

  • Popovo2 (ca. 6250 BC) of hg J1, mtDNA U4d – Po2 and Po4 from the same site (ca. 6550 BC) show continuity of mtDNA.
  • Karelia_HG, from Juzhnii Oleni Ostrov (ca. 6300 BC): I0211/UzOO40 (ca. 6300 BC) of hg J1(xJ1a), mtDNA U4a; and I0061/UzOO74 of hg R1a1(xR1a1a), mtDNA C1
  • UzOO77 and UzOO76 from Juzhnii Oleni Ostrov (ca. 5250 BC) of mtDNA R1b.
  • Samara_HG from Lebyanzhinka (ca. 5600 BC) of hg R1b1a, mtDNA U5a1d.

From the analysis of Lazaridis et al. (2018), we have some details about their admixture:

dzudzuana-admixture-sidelkino
Image modified from Lazaridis et al. (2018). Modeling present-day and ancient West-Eurasians. Mixture proportions computed with qpAdm (Supplementary Information section 4). The proportion of ‘Mbuti’ ancestry represents the total of ‘Deep’ ancestry from lineages that split prior to the split of Ust’Ishim, Tianyuan, and West Eurasians and can include both ‘Basal Eurasian’ and other (e.g., Sub-Saharan African) ancestry. (Left) ‘Conservative’ estimates. Each population 367 cannot be modeled with fewer admixture events than shown. (Right) ‘Speculative’ estimates. The highest number of sources (≤5) with admixture estimates within [0,1] are shown for each population. Some of the admixture proportions are not significantly different from 0 (Supplementary Information section 4).

About Anatolia_Neolithic ancestry

About the enigmatic Anatolia_Neolithic-related ancestry found in Pontic-Caspian steppe samples, this is what Wang et al. (2018) had to say:

We focused on model of mixture of proximal sources such as CHG and Anatolian Chalcolithic for all six groups of the Caucasus cluster (Eneolithic Caucasus, Maykop and Late Makyop, Maykop-Novosvobodnaya, Kura-Araxes, and Dolmen LBA), with admixture proportions on a genetic cline of 40-72% Anatolian Chalcolithic related and 28-60% CHG related (Supplementary Table 7). When we explored Romania_EN and Greece_Neolithic individuals as alternative southeast European sources (30-46% and 36-49%), the CHG proportions increased to 54-70% and 51-64%, respectively. We hypothesize that alternative models, replacing the Anatolian Chalcolithic individual with yet unsampled populations from eastern Anatolia, South Caucasus or northern Mesopotamia, would probably also provide a fit to the data from some of the tested Caucasus groups.

Also:

The first appearance of ‘Near Eastern farmer related ancestry’ in the steppe zone is evident in Steppe Maykop outliers. However, PCA results also suggest that Yamnaya and later groups of the West Eurasian steppe carry some farmer related ancestry as they are slightly shifted towards ‘European Neolithic groups’ in PC2 (Fig. 2D) compared to Eneolithic steppe. This is not the case for the preceding Eneolithic steppe individuals. The tilting cline is also confirmed by admixture f3-statistics, which provide statistically negative values for AG3 as one source and any Anatolian Neolithic related group as a second source

yamnaya-caucasus-dzudzuana
Modified image from Wang et al. (2018). In blue, Yamna-related populations. In red, Corded Ware-related populations, and two elevated Anatolia_Neolithic values in Yamna. Notice how only GAC-related admixture increases the Anatolian_N-related ancestry in the Yamna outlier from Ozero, and the late Yamna sample from Hungary, related to the homogeneous Yamna population. “Supplementary Table 14. P values of rank=3 and admixture proportions in modelling Steppe ancestry populations as a four-way admixture of distal sources EHG, CHG, Anatolian_Neolithic and WHG using 14 outgroups.Left populations: Steppe cluster, EHG, CHG, WHG, Anatolian_Neolithic. Right populations: Mbuti.DG, Ust_Ishim.DG, Kostenki14, MA1, Han.DG, Papuan.DG, Onge.DG, Villabruna, Vestonice16, ElMiron, Ethiopia_4500BP.SG, Karitiana.DG, Natufian, Iran_Ganj_Dareh_Neolithic.”

Detailed exploration via D-statistics in the form of D(EHG, steppe group; X, Mbuti) and D(Samara_Eneolithic, steppe group; X, Mbuti) show significantly negative D values for most of the steppe groups when X is a member of the Caucasus cluster or one of the Levant/Anatolia farmer-related groups (Supplementary Figs. 5 and 6). In addition, we used f- and D-statistics to explore the shared ancestry with Anatolian Neolithic as well as the reciprocal relationship between Anatolian- and Iranian farmer-related ancestry for all groups of our two main clusters and relevant adjacent regions (Supplementary Fig. 4). Here, we observe an increase in farmer-related ancestry (both Anatolian and Iranian) in our Steppe cluster, ranging from Eneolithic steppe to later groups. In Middle/Late Bronze Age groups especially to the north and east we observe a further increase of Anatolian farmer related ancestry consistent with previous studies of the Poltavka, Andronovo, Srubnaya and Sintashta groups and reflecting a different process not especially related to events in the Caucasus.

(…) Surprisingly, we found that a minimum of four streams of ancestry is needed to explain all eleven steppe ancestry groups tested, including previously published ones (Fig. 2; Supplementary Table 12). Importantly, our results show a subtle contribution of both Anatolian farmer-related ancestry and WHG-related ancestry (Fig.4; Supplementary Tables 13 and 14), which was likely contributed through Middle and Late Neolithic farming groups from adjacent regions in the West. The discovery of a quite old AME ancestry has rendered this probably unnecessary, because this admixture from an Anatolian-like ghost population could be driven even by small populations from the Caucasus.

yamna-caucasus-cwc-anatolia-neolithic
Image modified from Wang et al. (2018). Marked are: in red, approximate limit of Anatolia_Neolithic ancestry found in Yamna populations; in blue, Corded Ware-related groups. “Modelling results for the Steppe and Caucasus 1128 cluster. Admixture proportions based on (temporally and geographically) distal and proximal models, showing additional Anatolian farmer-related ancestry in Steppe groups as well as additional gene flow from the south in some of the Steppe groups as well as the Caucasus groups (see also Supplementary Tables 10, 14 and 20).”

NOTE. For a detailed account of the possibilities regarding this differential admixture in the North Pontic area in contrast to the Don-Volga-Ural region, you can read the posts Sredni Stog, Proto-Corded Ware, and their “steppe admixture”, and Corded Ware culture origins: The Final Frontier.

While it is not yet fully clear, the increased Anatolian_Neolithic-like ancestry in Ukraine_Eneolithic samples (see below) makes it unlikely that all such ancestry in Corded Ware groups comes from a GAC-related contribution. It is likely that at least part of it represents contributions from populations of the Caucasus, based on the mostly westward population movements in the steppe from ca. 4600 BC on, including the Suvorovo-Novodanilovka expansion, and especially the Kuban-Maykop expansion during the final Eneolithic into the North Pontic area.

NOTE. Since CHG-like groups from the Caucasus may have combinations of AME and ANE ancestry similar to Yamna (which may thus appear as ‘steppe ancestry’ in the North Pontic area), it is impossible to interpret with precision the following ADMIXTURE graphic:

ukraine-whg-ehg-steppe
Modified image from Mathieson et al. (2018). Supervised ADMIXTURE analysis, modelling each ancient individual (one per row) as a mixture of population clusters constrained to contain northwestern-Anatolian Neolithic (grey), Yamnaya from Samara (yellow), EHG (pink) and WHG (green) populations. Dates in parentheses indicate approximate range of individuals in each population.

North-Eastern Technocomplex

The East Asian contribution to samples from the WHG samples (like Loschbour or La Braña), as specified in Fu et al. (2016), does not seem to be related to Baikal_EN, and appears possibly (in the ADMIXTURE analysis) integrated into he Villabruna component. I guess this implies that the shared alleles with East Asians are quite early, and potentially due to the expansion of R1b-L754 from the East.

It would be interesting to know the specific material culture Sidelkino belonged to – i.e. if it was related to the expansion of the North-Eastern Technocomplex – , and its Y-DNA. The Post-Swiderian expansion into eastern Europe, probably associated with the expansion of R1b-P297 lineages (including R1b-M73, found later in Botai and in Baltic HG) is supposed to have begun during the 11th millennium BC, but migrations to the Urals and beyond are probably concentrated in the 9th millennium, so this sample is possibly slightly early for R1b.

NOTE. User Rozenfeld at Anthrogenica posted this, which I think is interesting (in case anyone wants to try a Y-SNP call):

there is something strange with Sidelkino EHG: first, its archaeological context is not described in the supplementary. Second, its sex is not listed in the supplementary tables. Third, after looking for info about this sample, I found that: “Сиделькино-3. Для снятия вопроса о половой принадлежности индивида была проведена генетическая экспертиза, выявившая принадлежность останков мужчине.”(translation: Sidelkino-3. To resolve the question about sex of the remains, the genetic analysis was conducted, which showed that remains belonged to male), source: http://static.iea.ras.ru/books/7487_Traditsii.pdf

So either they haven’t mentioned his Y-DNA in the paper for some reason, or there are more than one Sidelkino sample and the male one has not yet been published. The coverage of the Sidelkino sample from the paper is 2.9, more than enough to tell Y-DNA haplogroup.

zaliznyak-post-swiderian
The map of spreading of Post-Swiderian and Post-Krasnosillian sites in Mesolithic of Eastern Europe in the 8th millennia BC. From Zaliznyak (see here).

My speculative guess right now about specific population movements in far eastern Europe, based on the few data we have:

  • The expansion of the North-Eastern Technocomplex first around the 9th millennium BC, most likely expanded R1b-P279 ca. 11300 BC, judging by its TMRCA, with both R1b-M73 (TMRCA 5300) and R1b-M269 (TMRCA 4400 BC) info (with extra El Mirón ancestry) back, and thus Eurasiatic.
  • The expansion of haplogroup J1 to the north may have happened before or after the R1b-P279 expansion. Judging by the increase in AG3-related ancestry near Karelia compared to Baltic_HG, it is possible that it expanded just after R1b-P279 (hence possibly J1-Y6304? TMRCA 9700 BC). Its long-lasting presence in the Caucasus is supported by the Satsurblia (ca. 11300 BC) and the Dolmen BA (ca. 1300 BC) samples.
  • The expansion of R1a-M17 ca. 6600 BC is still likely to have happened from the east, based on the R1a-M17 samples found in Baikalic cultures slightly later (ca. 5300 BC). The presence of elevated Baikal_EN ancestry in Karelia HG and in Samara HG, and the finding of R1a-M417 samples in the Forest Zone after the Mesolithic suggests a connection with the expansion of Hunter-Gatherer pottery, from the Elshanka culture in the Samara region northward into the Forset Zone and westward into the North Pontic area.
  • The expansion of R1b-M73 ca. 5300 BC is likely to be associated with the emergence of a group east of the Urals (related to the later Botai culture, and potentially Pre-Yukaghir). Its presence in a Narva sample from Donkalnis (ca. 5200 BC) suggest either an early split and spread of both R1b-P297 lineages (M73 and M269) through Eastern Europe, or maybe a back-migration with hunter-gatherer pottery.
  • R1b-M269 spread successfully ca. 4400 BC (and R1b-L23 ca. 4100 BC, both based on TMRCA), and this successful expansion is probably to be associated with the Khvalynsk-Novodanilovka expansion. We already know that Samara_HG ca. 5600 was R1b1a, so it is likely that R1b-M269 appeared (or ‘resurged’) in the Volga-Ural region shortly after the expansion of R1a-M17, whose expansion through the region may be inferred by the additional AG3 and Baikal_EN ancestry. Interesting from Samara_HG compared to the previous Sidelkino sample is the introduction of more El Mirón-related ancestry, typical of WHG populations (and thus proper of Baltic groups).

NOTE. The TMRCA dates are obviously gross approximations, because a) the actual rate of mutation is unknown and b) TMRCA estimates are based on the convergence of lineages that survived. The potential finding of R1a-Z645 (possibly Z93+) in Ukraine Eneolithic (ca. 4000 BC), and the potential finding of R1b-L23 in Khvalynsk ca. 4250 BC complicates things further, in terms of dates and origins of any subclade.

The question thus remains as it was long ago: did R1b-M269 lineages expand (‘return’) from the east, near the Urals, or directly from the north? Were they already near Samara at the same time as the expansion of hunter-gatherer pottery, and were not much affected by it? Or did they ‘resurge’ from populations admixed with Caucasus-related ancestry after the expansion of R1a-M17 with this pottery (since there are different stepped expansions from the Samara region)? We could even ask, did R1a-M17 really expand from the east, i.e. are the dates on Baikalic subclades from Moussa et al. (2016) reliable? Or did R1a-M17 expand from some pockets in the Pontic-Caspian steppe, taking over the expansion of HG pottery at some point?

hunger-gatherer-pottery
Early Neolithic cultures in eastern and central Europe: 1–Yelshanian; 2–North Caspian; 3–Rakushechnyj Yar; 4–Surskian; 5–Dnieper-Donetsian; 6– Bug-Dniesterian; 7–Upper Volga; 8–Narvian; 9–Linear Pottery. White arrows: expansion of early farming; black arrows: spread of pottery-making traditions. From Dolukhanov et al. (2009).

Maglemose-related migrations

The most interesting aspect from the new paper (regarding Indo-Uralic migrations) is that Ancestral Middle Easterner ancestry will probably be a better proxy for the Anatolia_Neolithic component found in Ukraine Mesolithic to Eneolithic, and possibly also for some of the “more CHG-like” component found among Pontic-Caspian steppe populations, all likely derived from different admixture events with groups from the Caucasus.

NOTE. Even the supposed gene flow of Neolithic Iranian ancestry into the Caucasus can be put into question, since that means possibly a Dzudzuana-like population with greater “deep ancestry” proportion than the one found in CHG, which may still be found within the Caucasus.

If it was not clear already that following ‘steppe ancestry’ wherever it appears is a rather lame way of following Indo-European migrations, every single sample from the Caucasus and their admixture with Pontic-Caspian steppe populations will probably show that “steppe ancestry” is in fact formed by a variety of steppe-related ancestral components, impossible to follow coherently with a single population. Exactly what is happening already with the Siberian ancestry.

If the paper on the Dzudzuana samples has shown something, is that the expansion of an ANE-like population shook the entire Caucasus area up to the Zagros Mountains, creating this ANE – AME cline that are CHG and Iran_N, with further contributions of “deep ancestries” (probably from the south) complicating the picture further.

If this happens with few known samples, and we know of an ANE-like ghost population in the Caucasus (appearing later in the Lola culture), we can already guess that the often repeated “CHG component” found in Ukraine_Eneolithic and Khvalynsk will not be the same (except the part mediated by the Novodanilovka expansion).

This ANE-like expansion happened probably in the Late Upper Palaeolithic, and reached Northern Europe probably after the expansion of the Villabruna cluster (ca. 12000 BC), judging by the advance of AG3-like and ENA-like ancestry in later WHG samples.

The population movements during the Mesolithic and Early Neolithic in the North Pontic area are quite complicated: the extra AME ancestry is probably connected to the admixture with populations from the Caucasus, while the close similarity of Ukraine populations with Scandinavian ones (with an increase in Villabruna ancestry from Mesolithic to Neolithic samples), probably reveal population movements related to the expansion of Maglemose-related groups.

maglemose-mesolithic
Etno-cultural situation in Central and Eastern Europe in the Late Mesolithic — Early Neolithic (VI—V Mill. BC) (after Конча 2004: 201, карта 1; made after ideas by L. L. Zaliznyak). Legend: 1 — Maglemose circle in the VII Mill. BC (after Gr. Clark); 2—7 — Mesolithic cultures of the Post-Maglemose tradition, VI Mill. BC (after S. Kozłowsky, L. L. Zaliznyak): 2 — de Leyen-Wartena; 3 — Oldesloe — Godenaa; 4 — Chojnice — Peńki; 5 — Janisłavice; 6 — finds of Janisłavice artefacts outside of the main area; 7 — Donets culture; 8 — directions of the settling of Janisłavice people (after S. Kozłowsky and L. L. Zaliznyak); 9 — the south border of Mesolithic and Early Neolithic cultures of post-Swidrian and post-Arensburgian traditions; 10 — northern border of settlement of the Balkan-Danubian farmers; 11 — Bug- Dniester culture; 12 — Neolithic cultures emerged on the ethno-cultural basis of post-Maglemose: Э — Ertebölle-Ellerbeck, Н — Neman, Д — Dnieper-Donets, М — Mariupol (western variants). From Klein (2017).

These Maglemose-related groups were probably migrants from the north-west, originally from the Northern European Plains, who occupied the previous Swiderian territory, and then expanded into the North Pontic area. The overwhelming presence of I2a (likely all I2a2a1b1b) lineages in Ukraine Neolithic supports this migration.

The likely picture of Mesolithic-Neolithic migrations in the North Pontic area right now is then:

  1. Expansion of R1a-M459 from the east ca. 12000 BC – probably coupled with AG3 and also some Baikal_EN ancestry. First sample is I1819 from Vasilievka (ca. 8700 BC), another is from Dereivka ca. 6900 BC.
  2. Expansion of R1b-V88 from the Balkans in the west ca. 9700 BC, based on its TMRCA and also the Balkan hunter-gatherer population overwhemingly of this haplogroup from the 10th millennium until the Neolithic. First sample is I1734 from Vasilievka (ca. 7252 BC), which suggests that it replaced the male population there, based on their similar EHG-like adxmixture (and lack of sizeable WHG increase), and shared mtDNA U5b2, U5a2.
  3. Expansion of I2a-Y5606 probably ca. 6800 based on its TMRCA with Janislawice culture. Supporting this is the increase in WHG contribution to Neolithic samples, including the spread of U4 subclades compared to the previous period.
  4. Expansion of R1a-M17 starting probably ca. 6600 BC in the east (see above).

NOTE. The first sample of haplogroup I appears in the Mesolithic: I1763 (ca. 8100 BC) of haplogroup I2a1, probably related to an older Upper Palaeolithic expansion.

janislawice
Distribution of archeological cultures in the North Pontic Region during the Mesolithic (7th – 6th millennium BCE). Dotted, dashed and solid lines with corresponding arrows indicate alternative models of the spread of the Grebenyky culture groups. (After Bryuako IV., Samojlova TL., Eds, Drevnie kul’tury Severo-­‐Zapadnogo Prichernomor’ya, Odessa: SMIL, 2013.) Nikitin – Ivanova 2017.

Conclusion

It is becoming more and more clear with each new paper that – unless the number of very ancient samples increases – the use of Y-chromosome haplogroups remains one of the most important tools for academics; this is especially so in the steppes, in light of the diversity found in populations from the Caucasus. A clear example comes from the Yamna – Corded Ware similarities:

After the publication of the 2015 papers, it was likely that Yamna expanded with haplogroup R1b-L23, but it has only become crystal clear that Yamna expanded through the steppes into Bell Beakers, now that we have data about the strict genetic homogeneity of the whole Yamna population from west to east (including Afanasevo), in contrast with contemporary Corded Ware peoples which expanded from a different forest-steppe population.

The presence of haplogroups Q and R1a-M459 (xM17) in Khvalynsk along with a R1b1a sample, which some interpreted as being akin to modern ‘mixed’ populations in the past, is likely to point instead to a period of Khvalynsk-Novodanilovka expansion with R1b-M269, where different small populations from the steppe were being integrated into the common Khvalynsk stock, but where differences are seen in material culture surrounding their burials, as supported by the finding of R1b1 in the Kuban area already in the first half of the 5th millennium. The case would be similar to the early ‘mixed’ Icelandic population.

Only after the emergence of the Samara culture (in the second half of the 6th millennium BC), with a sample of haplogroup R1b1a, starts then the obvious connection with Early Proto-Indo-Europeans; and only after the appearance of late Sredni Stog and haplogroup R1a-M417 (ca. 4000 BC) is its connection with Uralic also clear. In previous population movements, I think more haplogroups were involved in migrations of small groups, and only some communities among them were eventually successful, expanding to be dominant, creating ever growing cultures during their expansions.

Indeed, if you think in terms of Uralic and Indo-European just as converging languages, and forget their potential genetic connection, then the genetic + linguistic picture becomes simplified, and the upper frontier of the 6th millennium BC with a division North Pontic (Mariupol) vs. Volga-Ural (Samara) is enough. However, tracing their movements backwards – with cultural expansions from west to east (with the expansion of farming), and earlier east to west (with hunter-gatherer pottery), and still earlier west to east (with the north-eastern technocomplex), offers an interesting way to prove their potential connection to macrofamilies, at least in terms of population movements.

corded-ware-uralic-qpgraph
Modified image from Tambets et al. (2018) Proportions of ancestral components in studied European and Siberian populations and the tested qpGraph model. a The qpGraph model fitting the data for the tested populations. Colour codes for the terminal nodes: pink—modern populations (‘Population X’ refers to test population) and yellow—ancient populations (aDNA samples and their pools). Nodes coloured other than pink or yellow are hypothetical intermediate populations. We putatively named nodes which we used as admixture sources using the main recipient among known populations. The colours of intermediate nodes on the qpGraph model match those on the admixture proportions panel. The NeolL (Neolithic Levant) ancestry selected in this qpGraph is likely to correspond (at least in part) to a specific Dzudzuana-like component present in the CHG-like population that admixed in the North Pontic area.

I am quite convinced right now that it would be possible to connect the expansion of R1b-L754 subclades with a speculative Nostratic (given the R1b-V88 connection with Afroasiatic, and the obvious connection of R1b-L297 with Eurasiatic). Paradoxically, the connection of an Indo-Uralic community in the steppes (after the separation of Yukaghir) with any lineage expansion (R1a-M17, R1b-M269, or even Q, I or J1) seems somehow blurrier than one year ago, possibly just because there are too many open possibilities.

David Reich says about the admixture with Neanderthals, which he helped discover:

At the conclusion of the Neanderthal genome project, I am still amazed by the surprises we encountered. Having found the first evidence of interbreeding between Neanderthals and modern humans, I continue to have nightmares that the finding is some kind of mistake. But the data are sternly consistent: the evidence for Neanderthal interbreeding turns out to be everywhere. As we continue to do genetic work, we keep encountering more and more patterns that reflect the extraordinary impact this interbreeding has had on the genomes of people living today.

I think this is a shared feeling among many of us who have made proposals about anything, to fear that we have made a gross, evident mistake, and constantly look for flaws. However, it seems to me that geneticists are more preoccupied with being wrong in their developed statistical methods, in the theoretical models they are creating, and not so much about errors in the true ancient ethnolinguistic picture human population genetics is (at least in theory) concerned about. Their publications are, after all, constantly associating genetic finds with cultures and (whenever possible) languages, so this aspect of their research should not be taken lightly.

Seeing how David Anthony or Razib Khan (among many others) have changed their previously preferred migration models as new data was published, and they continue to be respected in their own fields, I guess we can be confident that professionals with integrity are going to accept whatever new picture appears. While I don’t think that genetic finds can change what we can reconstruct with comparative grammar, I am also ready to revise guesstimates and routes of expansion of certain dialects if R1a-Z645 is shown to have accompanied Late Proto-Indo-Europeans during their expansion with Yamna, and later integrated somehow with Corded Ware.

However, taking into account the obsession of some with an ancestral, uninterrupted R1a—Indo-European association, and the lack of actual political repercussion of Neanderthal admixture, I think the most common nightmare that all genetic researchers should be worried about is to keep inflating this “Yamnaya ancestry”-based hornet’s nest, which has been constantly stirred up for the past two years, by rejecting it – or, rather, specifying it into its true complex nature.

This succession of corrections and redefinitions, coupled with the distinct Y-DNA bottleneck of each steppe population, will eventually lead to a completely different ethnolinguistic picture of the Pontic-Caspian region during the Eneolithic, which is likely to eventually piss off not only reasonable academics stubbornly attached to the CWC-IE idea, but also a part of those interested in daydreaming about their patrilineal ancestors.

Sometimes it’s better to just rip off the band-aid once and for all…

Featured image from The oldest pottery in hunter-gatherer communitiesand models of Neolithisation of Eastern Europe (2015), by Andrey Mazurkevich and Ekaterina Dolbunova.

Related

Interesting is today’s post in Ancient DNA Era: Is Male-driven Genetic Replacement always meaning Language-shift?

Corded Ware—Uralic (I): Differences and similarities with Yamna

indo-european-uralic-migrations-corded-ware

This is the first of four posts on the Corded Ware—Uralic identification:

I was reading The Bronze Age Landscape in the Russian Steppes: The Samara Valley Project (2016), and I was really surprised to find the following excerpt by David W. Anthony:

The Samara Valley links the central steppes with the western steppes and is a north-south ecotone between the pastoral steppes to the south and the forest-steppe zone to the north [see figure below]. The economic contrast between pastoral steppe subsistence, with its associated social organizations, and forest-zone hunting and fishing economies probably explains the shifting but persistent linguistic border between forest-zone Uralic languages to the north (today largely displaced by Russian) and a sequence of steppe languages to the south, recently Turkic, before that Iranian, and before that probably an eastern dialect of Proto-Indo-European (Anthony 2007). The Samara Valley represents several kinds of borders, linguistic, cultural, and ecological, and it is centrally located in the Eurasian steppes, making it a critical place to examine the development of Eurasian steppe pastoralism.

uralic-languages-forest-zone-volga
Language map of the middle Volga-Ural region. After “Geographical Distribution of the Uralic Languages” by Finno-Ugrian Society, Helsinki, 1993.

Khokhlov (translated by Anthony) further insists on the racial and ethnic divide between both populations, Abashevo to the north, and Poltavka to the south, during the formation of the Abashevo – Sintashta-Potapovka community that gave rise to Proto-Indo-Iranians:

Among all cranial series in the Volga-Ural region, the Potapovka population represents the clearest example of race mixing and probably ethnic mixing as well. The cultural advancements seen in this period might perhaps have been the result of the mixing of heterogeneous groups. Such a craniometric observation is to some extent consistent with the view of some archaeologists that the Sintashta monuments represent a combination of various cultures (principally Abashevo and Poltavka, but with other influences) and therefore do not correspond to the basic concept of an archaeological culture (Kuzmina 2003:76). Under this option, the Potapovka-Sintashta burial rite may be considered, first, a combination of traits to guarantee the afterlife of a selected part of a heterogeneous population. Second, it reflected a kind of social “caste” rather than a single population. In our view, the decisive element in shaping the ethnic structure of the Potapovka-Sintashta monuments was their extensive mobility over a fairly large geographic area. They obtained knowledge of various cultures from the populations with whom they interacted.

steppe-lmba-sintashta-potapovka-filatovka
Late Middle Bronze Age cultures with the Proto-Indo-Iranian Sintashta-Potapovka-Filatovka group (shaded). After Anthony (2007 Figure 15.5), from Anthony (2016).

Interesting is also this excerpt about the predominant population in the Abashevo – Sintashta-Potapovka admixture (which supports what Chetan said recently, although this does not seemed backed by Y-DNA haplogroups found in the richest burials), coupled with the sign of incoming “Uraloid” peoples from the east, found in both Sintashta and eastern Abashevo:

The socially dominant anthropological component was Europeoid, possibly the descendants of Yamnaya. The association of craniofacial types with archaeological cultures in this period is difficult, primarily because of the small amount of published anthropological material of the cultures of steppe and forest belt (Balanbash, Vol’sko-Lbishche) and the eastern and southern steppes (Botai-Tersek). The crania associated with late MBA western Abashevo groups in the Don-Volga forest zone were different from eastern Abashevo in the Urals, where the expression of the Old Uraloid craniological complex was increased. Old Uraloid is found also on a single skull of Vol’sko-Lbishche culture (Tamar Utkul VII, Kurgan 4). Potentially related variants, including Mongoloid features, could be found among the Seima-Turbino tribes of the forest-steppe zone, who mixed with Sintashta and Abashevo. In the Sintashta Bulanova cemetery from the western Urals, some individuals were buried with implements of Seima-Turbino type (Khalyapin 2001; Khokhlov 2009; Khokhlov and Kitov 2009). Previously, similarities were noted between some individual skulls from Potapovka I and burials of the much older Botai culture in northern Kazakhstan (Khokhlov 2000a). Botai-Tersek is, in fact, a growing contender for the source of some “eastern” cranial features.

khvalynsk-yamna-srubna-facial-reconstruction
Facial reconstructions based on skulls from (a) Khvalynsk II Grave 24, a young adult male; (b) Poludin Grave 6, Yamnaya culture, a mature male (both by A. I. Nechvaloda); and (c) Luzanovsky cemetery, Srubnaya culture (by L. T. Yablonsky). In Khokhlov (2016).

The wave of peoples associated with “eastern” features can be seen in genetics in the Sintashta outliers from Narasimhan et al. (2018), and it probably will be eventually seen in Abashevo, too. These may be related to the Seima-Turbino international network – but most likely it is directly connected to Sintashta through the starting Andronovo and Seima-Turbino horizons, by admixing of prospective groups and small-scale back-migrations.

Corded Ware – Yamna similarities?

So, if peoples of north-eastern Europe have been assumed for a long time to be Uralic speakers, what is happening with the Corded Ware = IE obsession? Is it Gimbutas’ ghost possessing old archaeologists? Probably not.

It is about certain cultural similarities evident at first sight, which have been traditionally interpreted as a sign of cultural diffusion or migration. Not dissimilar to the many Bell Beaker models available, where each archaeologist is pushing certain differences, mixing what seemed reasonable, what still might seem reasonable, and what certainly isn’t anymore after the latest ancient DNA data.

kurgan-expansion
“European dialect” expansion of Proto-Indo-European according to Gimbutas (1963)

The initial models of Gimbutas, Kristiansen, or Anthony – which are known to many today – were enunciated in the infancy of archaeological studies in the regions, during and just after the fall of the USSR, and before many radiocarbon dates that we have today were published (with radiocarbon dating being still today in need of refinement), so it is only logical that gross mistakes were made.

We have similar gross mistakes related to the origins of Bell Beakers, and studying them was certainly easier than studying eastern data.

  • Gimbutas believed – based mainly on Kurgan-like burials – that Bell Beaker formed from a combination of Yamna settlers with the Vučedol culture, so she was not that far from the truth.
  • The expansion of Corded Ware from peoples of the North Pontic forest-steppe area, proposed by Gimbutas and later supported also by Kristiansen (1989) as the main Indo-European expansion – , is probably also right about the approximate origins of the culture. Only its ‘Indo-European’ nature is in question, given the differences with Khvalynsk and Yamna evolution.
  • Anthony only claimed that Yamna migrants settled in the Balkans and along the Danube into the Hungarian steppes. He never said that Corded Ware was a Yamna offshoot until after the first genetic papers of 2015 (read about his newest proposal). He initially claimed that only certain neighbouring Corded Ware groups “adopted” Indo-European (through cultural diffusion) because of ‘patron-client’ relationships, and was never preoccupied with the fate of Corded Ware and related cultures in the east European forest zone and Finland.

So none of them was really that far from the true picture; we might say a lot people are more way off the real picture today than the picture these three researchers helped create in the 1990s and 2000s. Genetics is just putting the last nail in the coffin of Corded Ware as a Yamna offshoot, instead of – as we believed in the 2000s – to Vučedol and Bell Beaker.

So let’s revise some of these traditional links between Corded Ware and Yamna with today’s data:

Archaeology

Even more than genetics – at least until we have an adequate regional and temporary sampling – , archaeological findings lead what we have to know about both cultures.

It is essential to remember that Corded Ware, starting ca. 3000/2900 BC in east-central Europe, has been proposed to be derived from Early Yamna, which appeared suddenly in the Pontic-Caspian steppes ca. 3300 BC (probably from the late Repin expansion), and expanded to the west ca. 3000.

Early Yamna is in turn identified as the expanding Late Proto-Indo-European community, which has been confirmed with the recent data on Afanasevo, Bell Beaker, and Sintashta-Potapovka and derived cultures.

The question at hand, therefore, is if Corded Ware can be considered an offshoot of the Late PIE community, and thus whether the CWC ethnolinguistic community – proven in genetics to be quite homogeneous – spoke a Late PIE dialect, or if – alternatively – it is derived from other neighbouring cultures of the North Pontic region.

NOTE. The interpretation of an Indo-Slavonic group represented by a previous branching off of the group is untenable with today’s data, since Indo-Slavonic – for those who support it – would itself be a branch of Graeco-Aryan, and Palaeo-Balkan languages expanded most likely with West Yamna (i.e. R1b-L23, mainly R1b-Z2103) to the south.

The convoluted alternative explanation would be that Corded Ware represents an earlier, Middle PIE branch (somehow carrying R1a??) which influences expanding Late PIE dialects; this has been recently supported by Kortlandt, although this simplistic picture also fails to explain the Uralic problem.

Kurgans: The Yamna tradition was inherited from late Repin, in turn inherited from Khvalynsk-Novodanilovka proto-Kurgans. As for the CWC tradition, it is unclear if the tumuli were built as a tradition inherited from North and West Pontic cultures (in turn inherited or copied from Khvalynsk-Novodanilovka), such as late Trypillia, late Kvityana, late Dereivka, late Sredni Stog; or if they were built because of the spread of the ‘Transformation of Europe’, set in motion by the Early Yamna expansion ca. 3300-3000 BC (as found in east-central European cultures like Coţofeni, Lizevile, Șoimuș, or the Adriatic Vučedol). My guess is that it inherits an older tradition than Yamna, with an origin in east-central Europe, because of the mound-building distribution in the North Pontic area before the Yamna expansion, but we may never really know.

pit-graves-central-europe-cwc
Distribution of Pit-Grave burials west of the Black Sea likely dating to the 2nd half of the IVth millennium BC (triangles: side-crouched burials; filled circles: supine extended burials; open circles: suspected). Frînculeasa, Preda, and Heyd (2015)

Burial rite: Yamna features (with regional differences) single burials with body on its back, flexed upright knees, poor grave goods, common orientation east-west (heads to the west) inherited from Repin, in turn inherited from Khvalynsk-Novodanilovka. CWC tradition – partially connected to Złota and surrounding east-central European territories (in turn from the Khvalynsk-Novodanilovka expansion) – features single graves, body in fetal position, strict gender differentiation – men on the right, women on the left -, looking to the south, graves with standardized assemblages (objects representing affirmation of battle, hunting, and feasting). The burial rites clearly represent different ideologies.

pit-grave-burial-schemes
Left: Pit-Grave burial types expanded with Khvalynsk-Novodanilovka. Right: Pit-Grave burial types associated with the Yamna expansion and influence. Frînculeasa, Preda, and Heyd (2015)

Corded decoration: Corded ware decoration appears in the Balkans during the 5th millennium, and represents a simple technique whereby a cord is twisted, or wrapped around a stick, and then pressed directly onto the fresh surface of a vessel leaving a characteristic decoration. It appears in many groups of the 5th and 4th millennium BC, but it was Globular Amphorae the culture which popularized the drinking vessels and their corded ornamentation. It appears thus in some regional groups of Yamna, but it becomes the standard pottery only in Corded Ware (especially with the A-horizon), which shows continuity with GAC pottery.

corded-ware-first-horizon
Origins of the first Corded Ware horizon (5th millennium BC) after the Khvalynsk-Novodanilovka expansion. Corded Ware (circles) and horse-head scepters (rectangles) and other steppe elements (triangles). Image from Bulatović (2014).

Economy: Yamna expands from Repin (and Repin from Khvalynsk-Novodanilovka) as a nomadic or semi-nomadic purely pastoralist society (with occasional gathering of wild seeds), which naturally thrives in the grasslands of the Pontic-Caspian, lower Danube and Hungarian steppes. Corded Ware shows agropastoralism (as late Eneolithic forest-steppe and steppe groups of eastern Europe, such as late Trypillian, TRB, and GAC groups), inhabits territories north of the loess line, with heavy reliance of hunter-gathering depending on the specific region.

Cattle herding: Interestingly, both west Yamna and Corded Ware show more reliance on cattle herding than other pastoralist groups, which – contrasted with the previous Eneolithic herding traditions of the Pontic-Caspian steppe, where sheep-goats predominate – make them look alike. However, the cattle-herding economy of Yamna is essential for its development from late Repin and its expansion through the steppes (over western territories practising more hunter-gathering and sheep-goat herding economy), and it does not reach equally the Volga-Ural region, whose groups keep some of the old subsistence economy (read more about the late Repin expansion). Corded Ware, on the other hand, inherits its economic strategy from east European groups like TRB, GAC, and especially late Trypillian communities, showing a predominance of cattle herding within an agropastoral community in the forest-steppe and forest zones of Volhynia, Podolia, and surrounding forest-steppe and forest regions.

yamna-scheme
Scheme of interlinked socio-economic-ideological innovations forming the Yamnaya. Frînculeasa, Preda, and Heyd (2015)

Horse riding: Horse riding and horse transport is proven in Yamna (and succeeding Bell Beaker and Sintashta), assumed for late Repin (essential for cattle herding in the seas of grasslands that are the steppes, without nearby water sources), quite likely during the Khvalynsk expansion (read more here), and potentially also for Samara, where the predominant horse symbolism of early Khvalynsk starts. Corded Ware – like the north Pontic forest-steppe and forest areas during the Eneolithic – , on the other hand, does not show a strong reliance on horse riding. The high mobility and short-term settlements characteristic of Corded Ware, that are often associated with horse riding by association with Yamna, may or may not be correct, but there is no need for horses to explain their herding economy or their mobility, and the north-eastern European areas – the one which survived after Bell Beaker expansion – did certainly not rely on horses as an essential part of their economy.

NOTE: I cannot think of more supposed similarities right now. If you have more ideas, please share in the comments and I will add them here.

Genetic similarities

EHG: This is the clearest link between both communities. We thought it was related to the expansion of ANE-related ancestry to the west into WHG territory, but now it seems that it will be rather WHG expanding into ANE territory from the Pontic-Caspian region to the east (read more on recent Caucasus Neolithic, on , and on Caucasus HG).

NOTE. Given how much each paper changes what we know about the Palaeolithic, the origin and expansion of the (always developing) known ancestral components and specific subclades (see below) is not clear at all.

CHG: This is the key link between both cultures, which will delimit their interaction in terms of time and space. CHG is intermediate between EHG and Iran N (ca. 8000 BC). The ancestry is thus linked to the Caucasus south of the steppe before the emergence of North Pontic (western) and Don-Volga-Ural (eastern) communities during the Mesolithic. The real question is: when we have more samples from the steppe and the Caucasus during the Neolithic, how many CHG groups are we going to find? Will the new specific ancestral components (say CHG1, CHG2, CHG3, etc.) found in Yamna (from Khvalynsk, in the east) and Corded Ware (probably from the North Pontic forest-steppe) be the same? My guess is, most likely not, unless they are mediated by the Khvalynsk-Novodanilovka expansion (read more on CHG in the Caucasus).

yamnaya-chg-ancestry
Formation of Yamna and CHG contribution, in Damgaard et al. (Science 2018). A 10-leaf model based on combining the models in Fig. S16 and Fig. S19 and re-estimating the model parameters.

WHG/EEF: This is the obvious major difference – known today – in the formation of both communities in the steppe, and shows the different contacts that both groups had at least since the Eneolithic, i.e. since the expansion of Repin with its renewed Y-DNA bottleneck, and probably since before the early Khvalynsk expansion (read more on Yamna-Corded Ware differences contrasting with Yamna-Afanasevo, Yamna-Bell Beaker, and Yamna-Sintashta similarities).

NOTE 1. Some similarities between groups can be seen depending on the sampled region; e.g. Baltic groups show more similarities with southern Pontic-Caspian steppe populations, probably due to exogamy.

yamna-corded-ware-diff-qpgraph
Tested qpGraph model in Tambets et al. (2018). The qpGraph model fitting the data for the tested populations. “Colour codes for the terminal nodes: pink—modern populations (‘Population X’ refers to test population) and yellow—ancient populations (aDNA samples and their pools). Nodes coloured other than pink or yellow are hypothetical intermediate populations. We putatively named nodes which we used as admixture sources using the main recipient among known populations. The colours of intermediate nodes on the qpGraph model match those on the admixture proportions panel.”

NOTE 2. We have this information on the differences in “steppe ancestry” between Yamna and Corded Ware, compared to previous studies, because now we have more samples of neighbouring, roughly contemporaneous Eneolithic groups, to analyse the real admixture processes. This kind of fine scale studies is what is going to show more and more differences between Khvalynsk-Yamna and Sredni Stog-Corded Ware as more data pours in. The evolution of both communities in archaeology and in PCA (see below) is probably witness to those differences yet to be published.

R1: Even though some people try very hard to think in terms of “R1” vs. (Caucasus) J or G or any other upper clade, this is plainly wrong. It is possible, given what we know now, that Q1a2-M242 expanded ANE ancestry to the west ca. 13000 BC, while R1b-P279 expanded WHG ancestry to the east with the expansion of post-Swiderian cultures, creating EHG as a WHG:ANE cline. The role of R1a-M459 is unknown, but it might be related to any of these migrations, or others (plural) along northern Eurasia (read more on the expansion of R1b-P279, on Palaeolithic Q1a2, and on R1a-M417).

NOTE. I am inclined to believe in a speculative Mesolithic-Early Neolithic community involving Eurasiatic movements accross North Eurasia, and Indo-Uralic movements in its western part, with the last intense early Uralic-PIE contacts represented by the forming west (Mariupol culture) and east (Don-Volga-Ural cultures, including Samara) communities developing side by side. Before their known Eneolithic expansions, no large-scale Y-DNA bottleneck is going to be seen in the Pontic-Caspian steppe, with different (especially R1a and R1b subclades) mixed among them, as shown in North Pontic Neolithic, Samara HG, and Khvalynsk samples.

PCA-trypillia-greece-neolithic-outlier-anatolian
Image modified from Wang et al. (2018). Samples projected in PCA of 84 modern-day West Eurasian populations (open symbols). Previously known clusters have been marked and referenced. Marked and labelled are the Balkan samples referenced in this text An EHG and a Caucasus ‘clouds’ have been drawn, leaving Pontic-Caspian steppe and derived groups between them. See the original file here.

Corded Ware and ‘steppe ancestry’

If we take a look at the evolution of Corded Ware cultures, the expansion of Bell Beakers – dominated over most previous European cultures from west to east Europe – influenced the development of the whole European Bronze Age, up to Mierzanowice and Trzciniec in the east.

The only relevant unscathed CWC-derived groups, after the expansion of Sintashta-Potapovka as the Srubna-Andronovo horizon in the Eurasian steppes, were those of the north-eastern European forest zone: between Belarus to the west, Finland to the north, the Urals to the east, and the forest-steppe region to the south. That is, precisely the region supposed to represent Uralic speakers during the Bronze Age.

This inconsistency of steppe ancestry and its relation with Uralic (and Balto-Slavic) peoples was observed shortly after the publication of the first famous 2015 papers by Paul Heggarty, of the Max-Planck Institute for Evolutionary Anthropology (read more):

Haak et al. (2015) make much of the high Yamnaya ancestry scores for (only some!) Indo-European languages. What they do not mention is that those same results also include speakers of other languages among those with the highest of all scores for Yamnaya ancestry. Only these are languages of the Uralic family, not Indo-European at all; and their Yamnaya-ancestry signals are far higher than in many branches of Indo-European in (southern) Europe. Estonian ranks very high, while speakers of the very closely related Finnish are curiously not shown, and nor are the Saami. Hungarian is relevant less directly since this language arrived only c. 900 AD, but also high.

uralic-steppe-ancestry

These data imply that Uralic-speakers too would have been part of the Yamnaya > Corded Ware movement, which was thus not exclusively Indo-European in any case. And as well as the genetics, the geography, chronology and language contact evidence also all fit with a Yamnaya > Corded Ware movement including Uralic as well as Balto-Slavic.

Both papers fail to address properly the question of the Uralic languages. And this despite — or because? — the only Uralic speakers they report rank so high among modern populations with Yamnaya ancestry. Their linguistic ancestors also have a good claim to have been involved in the Corded Ware and Yamnaya cultures, and of course the other members of the Uralic family are scattered across European Russia up to the Urals.

NOTE. Although the author was trying to support the Anatolian hypothesis – proper of glottochronological studies often published from the Max Planck Institute – , the question remains equally valid: “if Proto-Indo-European expands with Corded Ware and steppe ancestry, what is happening with Uralic peoples?”

For my part, I claimed in my draft that ancestral components were not the only relevant data to take into account, and that Y-DNA haplogroups R1a and R1b (appearing separately in CWC and Yamna-Bell Beaker-Afanasevo), together with their calculated timeframes of formation – and therefore likely expansion – did not fit with the archaeological and linguistic description of the spread of Proto-Indo-European and its dialects.

In fact, it seemed that only one haplogroup (R1b-M269) was constantly and consistenly associated with the proposed routes of Late PIE dialectal expansions – like Anthony’s second (Afanasevo) and third (Lower Danube, Balkan) waves. What genetics shows fits seamlessly with Mallory’s association of the North-West Indo-European expansion with Bell Beakers (read here how archaeologists were right).

balanovksy-yamnaya-ancestry
Map of the much beloved steppe (or “Yamnaya”) ancestry in modern populations, by Balanovsky. Modified from Klejn (2017).

More precise inconsistencies were observed after the publication of Olalde et al. (2017) and Mathieson et al. (2017), by Volker Heyd in Kossinna’s smile (2017). Letting aside the many details enumerated (you can read a summary in my latest draft), this interesting excerpt is from the conclusion:

NOTE. An open access ealier draft version of the paper is offered for download by the author.

Simple solutions to complex problems are never the best choice, even when favoured by politicians and the media. Kossinna also offered a simple solution to a complex prehistoric problem, and failed therein. Prehistoric archaeology has been aware of this for a century, and has responded by becoming more differentiated and nuanced, working anthropologically, scientifically and across disciplines (cf. Müller 2013; Kristiansen 2014), and rejecting monocausal explanations. The two aDNA papers in Nature, powerful and promising as they are for our future understanding, also offer rather straightforward messages, heavily pulled by culture-history and the equation of people with culture. This admittedly is due partly to the restrictions of the medium that conveys them (and despite the often relevant additional detail given as supplementary information, which is unfortunately not always given full consideration).

While I have no doubt that both papers are essentially right, they do not reflect the complexity of the past. It is here that archaeology and archaeologists contributing to aDNA studies find their role; rather than simply handing over samples and advising on chronology, and instead of letting the geneticists determine the agenda and set the messages, we should teach them about complexity in past human actions and interactions. If accepted, this could be the beginning of a marriage made in heaven, with the blessing smile of Gustaf Kossinna, and no doubt Vere Gordon Childe, were they still alive, in a reconciliation of twentieth- and twenty-first-century approaches. For us as archaeologists, it could also be the starting point for the next level of a new archaeology.

heyd-yamnaya-expansion
Main distribution of Yamnaya kurgans in the Pontic-Caspian steppe of modern day Russia, Ukraine, and Kazakhstan, and its western branch in modern south-east European countries of Romania, Bulgaria, Serbia, and Hungary, with numbers of excavated kurgans and graves given. Picture: Volker Heyd (2018).

The question was made painfully clear with the publication of Olalde et al. (2018) & Mathieson et al. (2018), where the real route of Yamna expansion into Europe was now clearly set through the steppes into the Carpathian basin, later expanded as Bell Beakers.

This has been further confirmed in more recent papers, such as Narasimhan et al. (2018), Damgaard et al. (2018), or Wang et al. (2018), among others.

However, the discussion is still dominated by political agendas based on prevalent Y-DNA haplogroups in modern countries and ethnic groups.

Related

Palaeolithic Caucasus samples reveal the most important component of West Eurasians

dzudzuana-ancestry-europe

Preprint Paleolithic DNA from the Caucasus reveals core of West Eurasian ancestry, by Lazaridis et al. bioRxiv (2018).

Interesting excerpts:

We analyzed teeth from two individuals 63 recovered from Dzudzuana Cave, Southern Caucasus, from an archaeological layer previously dated to ~27-24kya (…). Both individuals had mitochondrial DNA sequences (U6 and N) that are consistent with deriving from lineages that are rare in the Caucasus or Europe today. The two individuals were genetically similar to each other, consistent with belonging to the same population and we thus analyze them jointly.

(…) our results prove that the European affinity of Neolithic Anatolians does not necessarily reflect any admixture into the Near East from Europe, as an Anatolian Neolithic-like population already existed in parts of the Near East by ~26kya. Furthermore, Dzudzuana shares more alleles with Villabruna-cluster groups than with other ESHG (Extended Data Fig. 5b), suggesting that this European affinity was specifically related to the Villabruna cluster, and indicating that the Villabruna affinity of PGNE populations from Anatolia and the Levant is not the result of a migration into the Near East from Europe. Rather, ancestry deeply related to the Villabruna cluster was present not only in Gravettian and Magdalenian-era Europeans but also in the populations of the Caucasus, by ~26kya. Neolithic Anatolians, while forming a clade with Dzudzuana with respect to ESHG, share more alleles with all other PGNE (Extended Data Fig. 5d), suggesting that PGNE share at least partially common descent to the exclusion of the much older samples from Dzudzuana.

dzudzuana-anatolia-pca
Ancient West Eurasian population structure. PCA of key ancient West Eurasians, including additional populations (shown with grey shells), in the space of outgroup f4-statistics (Methods).

Our co-modeling of Epipaleolithic Natufians and Ibero-Maurusians from Taforalt confirms that the Taforalt population was mixed, but instead of specifying gene flow from the ancestors of Natufians into the ancestors of Taforalt as originally reported, we infer gene flow in the reverse direction (into Natufians). The Neolithic population from Morocco, closely related to Taforalt is also consistent with being descended from the source of this gene flow, and appears to have no admixture from the Levantine Neolithic (Supplementary Information 166 section 3). If our model is correct, Epipaleolithic Natufians trace part of their ancestry to North Africa, consistent with morphological and archaeological studies that indicate a spread of morphological features and artifacts from North Africa into the Near East. Such a scenario would also explain the presence of Y-chromosome haplogroup E in the Natufians and Levantine farmers, a common link between the Levant and Africa.

(…) we cannot reject the hypothesis that Dzudzuana and the much later Neolithic Anatolians form a clade with respect to ESHG (P=0.286), consistent with the latter being a population largely descended from Dzudzuana-like pre-Neolithic populations whose geographical extent spanned both Anatolia and the Caucasus. Dzudzuana itself can be modeled as a 2-way mixture of Villabruna-related ancestry and a Basal Eurasian lineage.

In qpAdm modeling, a deeply divergent hunter-gatherer lineage that contributed in relatively unmixed form to the much later hunter-gatherers of the Villabruna cluster is specified as contributing to earlier hunter-gatherer groups (Gravettian Vestonice16: 35.7±11.3% and Magdalenian ElMiron: 60.6±11.3%) and to populations of the Caucasus (Dzudzuana: 199 72.5±3.7%, virtually identical to that inferred using ADMIXTUREGRAPH). In Europe, descendants of this lineage admixed with pre-existing hunter-gatherers related to Sunghir3 from Russia for the Gravettians and GoyetQ116-1 from Belgium for the Magdalenians, while in the Near East it did so with Basal Eurasians. Later Europeans prior to the arrival of agriculture were the product of re-settlement of this lineage after ~15kya in mainland Europe, while in eastern Europe they admixed with Siberian hunter-gatherers forming the WHG-ANE cline of ancestry [See PCA above]. In the Near East, the Dzudzuana-related population admixed with North African-related ancestry in the Levant and with Siberian hunter-gatherer and eastern non-African-related ancestry in Iran and the Caucasus. Thus, the highly differentiated populations at the dawn of the Neolithic were primarily descended from Villabruna Cluster and Dzudzuana-related ancestors, with varying degrees of additional input related to both North Africa and Ancient North/East Eurasia whose proximate sources may be clarified by future sampling of geographically and temporally intermediate populations.

qpgraph-dzudzuana
An admixture graph model of Paleolithic West Eurasians. An automatically generated admixture graph models fits populations (worst Z-score of the difference between estimated and fitted f-statistics is 2.7) or populations (also including South_Africa_HG, worst Z-score is 3.5). This is a simplified model assuming binary admixture events and is not a unique solution (Supplementary Information section 2). Sampled populations are shown with ovals and select labeled internal nodes with rectangles.

Interesting excerpts from the supplementary materials:

From our analysis of Supplementary Information section 3, we showed that these sources are indeed complex, and only one of these (WHG, represented by Villabruna) appears to be a contributor to all the remaining sources. This should not be understood as showing that hunter-gatherers from mainland Europe migrated to the rest of West Eurasia, but rather that the fairly homogeneous post-15kya population of mainland Europe labeled WHG appear to represent a deep strain of ancestry that seems to have contributed to West Eurasians from the Gravettian era down to the Neolithic period.

Villabruna is representative of the WHG group. We also include ElMiron, the best sample from the Magdalenian era as we noticed that within the WHG group there were individuals that could not be modeled as a simple clade with Villabruna but also had some ElMiron-related ancestry. Ddudzuana is representative of the Ice Age Caucasus population, differentiated from Villabruna by Basal Eurasian ancestry. AG3 represents ANE/Upper Paleolithic Siberian ancestry, sampled from the vicinity of Lake Baikal, while Russia_Baikal_EN related to eastern Eurasians and represents a later layer of ancestry from the same region of Siberia as AG3 Finally, Mbuti are a deeply diverged African population that is used here to represent deep strains of ancestry (including Basal Eurasian) prior to the differentiation between West Eurasians and eastern non-Africans that are otherwise not accounted for by the remaining five sources. Collectively, we refer to this as ‘Basal’ or ‘Deep’ ancestry, which should be understood as referring potentially to both Basal Eurasian and African ancestry.

It has been suggested that there is an Anatolia Neolithic-related affinity in hunter-gatherers from the Iron Gates. Our analysis confirms this by showing that this population has Dzudzuana-related ancestry as do many hunter-gatherer populations from southeastern Europe, eastern Europe and Scandinavia. These populations cannot be modeled as a simple mixture of Villabruna and AG3 but require extra Dzudzuana-related ancestry even in the conservative estimates, with a positive admixture proportion inferred for several more in the speculative ones. Thus, the distinction between European hunter-gatherers and Near Eastern populations may have been gradual in pre-Neolithic times; samples from the Aegean (intermediate between those from the Balkans and Anatolia) may reveal how gradual the transition between Dzudzuana-like Neolithic Anatolians and mostly Villabruna-like hunter-gatherers was in southeastern Europe.

ancient-modern-european-admixture
Modified image (cut, with important samples marked). Modeling present-day and ancient West-Eurasians. Mixture proportions computed with qpAdm (Supplementary Information section 4). The proportion of ‘Mbuti’ ancestry represents the total of ‘Deep’ ancestry from lineages that split prior to the 365 split of Ust’Ishim, Tianyuan, and West Eurasians and can include both ‘Basal Eurasian’ and other (e.g., Sub-Saharan African) ancestry. (a) ‘Conservative’ estimates. Each population 367 cannot be modeled with fewer admixture events than shown.

Villabruna: This type of ancestry differentiates between present-day Europeans and non-Europeans within West Eurasia, attaining a maximum of ~20% in the Baltic in accordance with previous observations and with the finding of a later persistence of significant hunter-gatherer ancestry in the region. Its proportion drops to ~0% throughout the Near East. Interestingly, a hint of such ancestry is also inferred in all North African populations west of Libya in the speculative proportions, consistent with an archaeogenetic inference of gene flow from Iberia to North Africa during the Late Neolithic.

ElMiron: This type of ancestry is absent in present-day West Eurasians. This may be because most of the Villabruna-related ancestry in Europeans traces to WHG populations that lacked it (since ElMiron-related ancestry is quite variable within European hunter-gatherers). However, ElMiron ancestry makes up only a minority component of all WHG populations sampled to date and WHG-related ancestry is a minority component of present-day Europeans. Thus, our failure to detect it in present day people may be simply be too little of it to detect with our methods.

Dzudzuana: Our analysis identifies Dzudzuana-related ancestry as the most important component of West Eurasians and the one that is found across West Eurasian-North African populations at ~46-88% levels. Thus, Dzudzuana-related ancestry can be viewed as the common core of the ancestry of West Eurasian-North African populations. Its distribution reaches its minima in northern Europe and appears to be complementary to that of Villabruna, being most strongly represented in North Africa, the Near East (including the Caucasus) and Mediterranean Europe. Our results here are expected from those of Supplementary Information section 3 in which we modeled ancient Near Eastern/North African populations (the principal ancestors of present-day people from the same regions) as deriving much of their ancestry from a Dzudzuana-related source. Migrations from the Near East/Caucasus associated with the spread of the Neolithic, but also the formation of steppe population introduced most of the Dzudzuana-related ancestry present in Europe, although (as we have seen above) some such ancestry was already present in some pre-agricultural hunter-gatherers in Europe.

AG3: Ancestry related to the AG3 sample from Siberia has a northern distribution, being strongly represented in both central-northern Europe and the north Caucasus.

Russia_Baikal_EN: Ancestry related to hunter-gatherers from Lake Baikal in Siberia (postdating AG3) appears to have affected primarily northeastern European populations which have been previously identified as having East Eurasian ancestry; some such ancestry is also identified for a Turkish population from Balıkesir, likely reflecting the Central Asian ancestry of Turkic speakers which has been recently confirmed directly in an Ottoman sample from Anatolia.

Some comments

So, to try and sum up:

  • Dzudzuana shares ancestry with ‘Common West Eurasian’ (CWE). the ancestor cluster of Villabruna.
  • Dzudzuana diverges from CWE because of a Basal Eurasian ancestry contribution [which supports that Basal Eurasian ancestry was a deep Middle Eastern lineage].
  • Dzudzuana is closest to Anatolia Neolithic, and close to Gravettian.
palaeolithic-gravettian-villabruna
Palaeolithic migrations and clusters in Europe. See more maps.

Chronologically:

  1. Aurignacian: First West Eurasians arrive ca. 36,000 BP, Goyet cluster expands probably with C1a2 lineages.
  2. After that, the early or ‘unmixed’ Villabruna cluster (‘hidden’ somewhere probably east of Europe, either North Eurasia or South Eurasia), lineages unknown (possibly IJ), contributes to:
    1. Gravettian (ca. 30,000 BP): Věstonice cluster expands, probably with IJ lineages.
    2. A (hidden) ‘Common West Eurasian’ population.
    3. In turn:

      • Dzudzuana ca. 26,000 BP derived from Common West Eurasian (curiously, haplogroup G seems to split in today’s subclades ca. 26,000 BP).
      • During the Gravettian (ca. 26,000 BP), an Anatolian Neolithic-like population exists already in the Near East. Both Věstonice and this Anatolian HG are close to Dzudzuana; in turn, Dzudzuana from CWE.

    4. Magdalenian (ca. 20,000 BP): El Mirón cluster expands, probably with more specific I lineages.
  3. Bølling-Allerød warming period (ca. 14,000 BP): ‘late’ Villabruna cluster or WHG (=CWE with greater affinity to Near Eastern populations) expands, probably spreading with R1b in mainland Europe and to the east (admixing with Siberian HG), creating the WHG — ANE ancestry cline, as reflected in Iron Gates HG, Baltic HG, etc.

[Here we have the possible “bidirectional gene flow between populations ancestral to Southeastern Europeans of the early Holocene and Anatolians of the late glacial or a dispersal of Southeastern Europeans into the Near East” inferred from Anatolian hunter-gatherers]

palaeolithic-gravettian-magdalenian-migrations
The Gravettian (30,000 to 20,000 years) is drawn in black and white; the subsequent Magdalenian (17,000 to 10,000 years) and Hamburgian (13,000-11,750 years) are in light blue and red. It is not known whether the spread of the Gravettian was a result of diffusion of people or cultures. This figure illustrates the possible monocentric origins of the Gravettian, in which the Gravettian is hypothesized to have its origin in the Middle Danube Basin, first spreading west (solid lines) and later spreading east and southeast (dashed lines). This scenario is largely based on the chronology of sites. Thus far, genome-wide data has been collected from only three of the ten< Gravettian regions indicated on the map. These regions are northern Austria (1 sample), the Czech Republic (6), southern Italy (3) and Belgium (3), indicating that they all share a genomic ancestry. However, it is unknown whether samples from the remaining regions also share a close genomic ancestry. Some skeletal remains associated with the Gravettian that could be investigated paleogenomically are from Sungir (Russia); Laghar Velho (central Portugal); Cussac Cave; Les Garennes, near Vilhonneur; and Level 2 at Abri Pataud116 (western France). Light blue and light red regions represent the approximate distributions of the Magdalenian Culture and the Hamburgian Culture (13,000-11,750 years). Figure adapted from Kozłowski. Image from Harris (2017)

The paper talks about possibilities for Common West Eurasian:

  1. Migration from mainland Europe to Near East or vice versa (not very likely);
  2. Migration from a geographically intermediate Ice Age refugium in southeast Europe, Anatolia, or the circum-Pontic region that explain post-glacial affinity of post-glacial Levantine and Anatolian populations.

It also re-states what was known:

  • EHG (ca. 8,000 BP) = between WHG — ANE (ca. 24,000 BP).
  • CHG (ca. 10,000 BP) = between EHG — Iran N.

I would say that the distinct CHG vs. Dzudzuana ancestry puts CHG probably to the south, within the Iranian Plateau, during the Gravettian, expanding probably later.

Also important, Ancestral North African probably accompanied by haplogroup E. Early expansion of North Africans into the Near East further confirms the impossibility of Afroasiatic (much younger) to be associated with these expansions, and confirms that the still unclear Green Sahara migrations are the key.

Related

Expansion of domesticated goat echoes expansion of early farmers

goat-neolithic

New paper (behind paywall) Ancient goat genomes reveal mosaic domestication in the Fertile Crescent, by Daly et al. Science (2018) 361(6397):85-88.

Interesting excerpts (emphasis mine):

Thus, our data favor a process of Near Eastern animal domestication that is dispersed in space and time, rather than radiating from a central core (3, 11). This resonates with archaeozoological evidence for disparate early management strategies from early Anatolian, Iranian, and Levantine Neolithic sites (12, 13). Interestingly, our finding of divergent goat genomes within the Neolithic echoes genetic investigation of early farmers. Northwestern Anatolian and Iranian human Neolithic genomes are also divergent (14–16), which suggests the sharing of techniques rather than large-scale migrations of populations across Southwest Asia in the period of early domestication. Several crop plants also show evidence of parallel domestication processes in the region (17).

PCA affinity (Fig. 2), supported by qpGraph and outgroup f3 analyses, suggests that modern European goats derive from a source close to the western Neolithic; Far Eastern goats derive from early eastern Neolithic domesticates; and African goats have a contribution from the Levant, but in this case with considerable admixture from the other sources (figs. S11, S16, and S17 and tables S26 and 27). The latter may be in part a result of admixture that is discernible in the same analyses extended to ancient genomes within the Fertile Crescent after the Neolithic (figs. S18 and S19 and tables S20, S27, and S31) when the spread of metallurgy and other developments likely resulted in an expansion of inter-regional trade networks and livestock movement.

goat-middle-east
Maximumlikelihood phylogeny and geographical distributions of ancient mtDNA haplogroups. (A) A phylogeny placing ancient whole mtDNA sequences in the context of known haplogroups. Symbols denoting individuals are colored by clade membership; shape indicates archaeological period (see key). Unlabeled nodes are modern bezoar and outgroup sequence (Nubian ibex) added for reference.We define haplogroup T as the sister branch to the West Caucasian tur (9). (B and C) Geographical distributions of haplogroups show early highly structured diversity in the Neolithic period (B) followed by collapse of structure in succeeding periods (C).We delineate the tiled maps at 7250 to 6950 BP, a period >bracketing both our earliest Chalcolithic sequence (24, Mianroud) and latest Neolithic (6, Aşağı Pınar). Numbered archaeological sites also include Direkli Cave (8), Abu Ghosh (9), ‘Ain Ghazal (10), and Hovk-1 Cave (11) (table S1) (9).

Our results imply a domestication process carried out by humans in dispersed, divergent, but communicating communities across the Fertile Crescent who selected animals in early millennia, including for pigmentation, the most visible of domestic traits.

Related