NOTE. The video is best viewed in HD 1080p (1920×1080) with a display that allows for this or greater video quality, and a screen big enough to see haplogroup symbols, i.e. tablet or greater. The YouTube link is here. The Facebook link is here.
Based on the results of the past 5 years or so, which have been confirming this combined picture every single time, I doubt there will be much need to change it in any radical way, as only minor details remain to be clarified.
I wanted to publish a GIS tool of my own for everyone to have an updated reference of all data I use for my books.
The most complex GIS tools consume too many resources when used online in a client-server model, so I have to keep that to myself, but there are some ways to publish low quality outputs.
The files below include the possibility to zoom some levels to be able to see more samples, and also to check each one for more information on their ID, attributed culture and label, archaeological site, source paper, subclade (and people responsible for SNP inferences if any), etc.
Some usage notes:
Files are large (ca. 20 Mb), so they still take some time to load.
For the meaning of symbols and colors (for Y-DNA haplogroups), if there is any doubt, check the video above.
Pop-ups with sample information will work on desktop browsers by clicking on them, apparently not on smartphone and related tactile OS. I have changed the settings to show pop-ups on hover, so that it now works (to some extent) on tactile OS.
The search tool can look for specific samples according to their official ID, and works by highlighting the symbol of the selected individual (turning it into a bright blue dot), and leading the layer view to the location, but it seems to work best only with some browser and OS settings – in other browsers, you need to zoom out to see where the dot is located. The specific sample with its information could paradoxically disappear in search mode, so you might need to reload and look again for the same site that was highlighted.
Latitude and longitude values have been randomly modified to avoid samples overcrowding specific sites, so they are not the original ones.
The latest papers from Ning et al. Cell (2019) and Anthony JIES (2019) have offered some interesting new data, supporting once more what could be inferred since 2015, and what was evident in population genomics since 2017: that Proto-Indo-Europeans expanded under R1b bottlenecks, and that the so-called “Steppe ancestry” referred to two different components, one – Yamnaya or Steppe_EMBA ancestry – expanding with Proto-Indo-Europeans, and the other one – Corded Ware or Steppe_MLBA ancestry – expanding with Uralic speakers.
NOTE. As in the Corded Ware ancestry maps, the selected reports in this case are centered on the prototypical Yamnaya ancestry vs. other simplified components, so everything else refers to simplistic ancestral components widespread across populations that do not necessarily share any recent connection, much less a language. In fact, most of the time they clearly didn’t. They can be interpreted as “EHG that is not part of the Yamnaya component”, or “CHG that is not part of the Yamnaya component”. They can’t be read as “expanding EHG people/language” or “expanding CHG people/language”, at least no more than maps of “Steppe ancestry” can be read as “expanding Steppe people/language”. Also, remember that I have left the default behaviour for color classification, so that the highest value (i.e. 1, or white colour) could mean anything from 10% to 100% depending on the specific ancestry and period; that’s what the legend is for… But, fere libenter homines id quod volunt credunt.
Anthony (2019) agrees with the most likely explanation of the CHG component found in Yamnaya, as derived from steppe hunter-fishers close to the lower Volga basin. The ultimate origin of this specific CHG-like component that eventually formed part of the Pre-Yamnaya ancestry is not clear, though:
The hunter-fisher camps that first appeared on the lower Volga around 6200 BC could represent the migration northward of un-admixed CHG hunter-fishers from the steppe parts of the southeastern Caucasus, a speculation that awaits confirmation from aDNA.
The typical EHG component that formed part eventually of Pre-Yamnaya ancestry came from the Middle Volga Basin, most likely close to the Samara region, as shown by the sampled Samara hunter-gatherer (ca. 5600-5500 BC):
After 5000 BC domesticated animals appeared in these same sites in the lower Volga, and in new ones, and in grave sacrifices at Khvalynsk and Ekaterinovka. CHG genes and domesticated animals flowed north up the Volga, and EHG genes flowed south into the North Caucasus steppes, and the two components became admixed.
To the west, in the Dnieper-Dniester area, WHG became the dominant ancestry after the Mesolithic, at the expense of EHG, revealing a likely mating network reaching to the north into the Baltic:
Like the Mesolithic and Neolithic populations here, the Eneolithic populations of Dnieper-Donets II type seem to have limited their mating network to the rich, strategic region they occupied, centered on the Rapids. The absence of CHG shows that they did not mate frequently if at all with the people of the Volga steppes (…)
After approximately 4500 BC the Khvalynsk archaeological culture united the lower and middle Volga archaeological sites into one variable archaeological culture that kept domesticated sheep, goats, and cattle (and possibly horses). In my estimation, Khvalynsk might represent the oldest phase of PIE.
(…) this middle Volga mating network extended down to the North Caucasian steppes, where at cemeteries such as Progress-2 and Vonyuchka, dated 4300 BC, the same Khvalynsk-type ancestry appeared, an admixture of CHG and EHG with no Anatolian Farmer ancestry, with steppe-derived Y-chromosome haplogroup R1b. These three individuals in the North Caucasus steppes had higher proportions of CHG, overlapping Yamnaya. Without any doubt, a CHG population that was not admixed with Anatolian Farmers mated with EHG populations in the Volga steppes and in the North Caucasus steppes before 4500 BC. We can refer to this admixture as pre-Yamnaya, because it makes the best currently known genetic ancestor for EHG/CHG R1b Yamnaya genomes.
Three individuals from the sites of Progress 2 and Vonyuchka 1 in the North Caucasus piedmont steppe (‘Eneolithic steppe’), which harbour EHG and CHG related ancestry, are genetically very similar to Eneolithic individuals from Khvalynsk II and the Samara region. This extends the cline of dilution of EHG ancestry via CHG-related ancestry to sites immediately north of the Caucasus foothills
NOTE. Unpublished samples from Ekaterinovka have been previously reported as within the R1b-L23 tree. Interestingly, although the Varna outlier is a female, the Balkan outlier from Smyadovo shows two positive SNP calls for hg. R1b-M269. However, its poor coverage makes its most conservative haplogroup prediction R-M343.
The formation of this Pre-Yamnaya ancestry sets this Volga-Caucasus Khvalynsk community apart from the rest of the EHG-like population of eastern Europe.
Anthony (2019) seems to rely on ADMIXTURE graphics when he writes that the late Sredni Stog sample from Alexandria shows “80% Khvalynsk-type steppe ancestry (CHG&EHG)”. While this seems the most logical conclusion of what might have happened after the Suvorovo-Novodanilovka expansion through the North Pontic steppes (see my post on “Steppe ancestry” step by step), formal stats have not confirmed that.
In fact, analyses published in Wang et al. (2019) rejected that Corded Ware groups are derived from this Pre-Yamnaya ancestry, a reality that had been already hinted in Narasimhan et al. (2018), when Steppe_EMBA showed a poor fit for expanding Srubna-Andronovo populations. Hence the need to consider the whole CHG component of the North Pontic area separately:
NOTE. Fits for WHG + CHG + EHG in Neolithic and Eneolithic populations are taken in part from Mathieson et al. (2019) supplementary materials (download Excel here). Unfortunately, while data on the Ukraine_Eneolithic outlier from Alexandria abounds, I don’t have specific data on the so-called ‘outlier’ from Dereivka compared to the other two analyzed together, so these maps of CHG and EHG expansion are possibly showing a lesser distribution to the west than the real one ca. 4000-3500 BC.
Anatolia Neolithic ancestry clearly spread to the east into the north Pontic area through a Middle Eneolithic mating network, most likely opened after the Khvalynsk expansion:
Regarding Y-chromosome haplogroups, Anthony (2019) insists on the evident association of Khvalynsk, Yamnaya, and the spread of Pre-Yamnaya and Yamnaya ancestry with the expansion of elite R1b-L754 (and some I2a2) individuals:
3. Early Bronze Age
Data from Wang et al. (2019) show that Corded Ware-derived populations do not have good fits for Eneolithic_Steppe-like ancestry, no matter the model. In other words: Corded Ware populations show not only a higher contribution of Anatolia Neolithic ancestry (ca. 20-30% compared to the ca. 2-10% of Yamnaya); they show a different EHG + CHG combination compared to the Pre-Yamnaya one.
Yamnaya Kalmykia and Afanasievo show the closest fits to the Eneolithic population of the North Caucasian steppes, rejecting thus sizeable contributions from Anatolia Neolithic and/or WHG, as shown by the SD values. Both probably show then a Pre-Yamnaya ancestry closest to the late Repin population.
EBA maps include data from Wang et al. (2018) supplementary materials, specifically unpublished Yamnaya samples from Hungary that appeared in analysis of the preprint, but which were taken out of the definitive paper. Their location among Yamnaya settlers from Hungary is speculative, although most uncovered kurgans in Hungary are concentrated in the Tisza-Danube interfluve.
The Y-chromosome bottleneck of elite males from Proto-Indo-European clans under R1b-L754 and some I2a2 subclades, already visible in the Khvalynsk sampling, became even more noticeable in the subsequent expansion of late Repin/early Yamnaya elites under R1b-L23 and I2a-L699:
Maps of CHG, EHG, Anatolia Neolithic, and probably WHG show the expansion of these components among Corded Ware-related groups in North Eurasia, apart from other cultures close to the Caucasus:
The following maps show the most likely distribution of Yamnaya ancestry during the Bell Beaker-, Balkan-, and Sintashta-Potapovka-related expansions.
4.1. Bell Beakers
The amount of Yamnaya ancestry is probably overestimated among populations where Bell Beakers replaced Corded Ware. A map of Yamnaya ancestry among Bell Beakers gets trickier for the following reasons:
Expanding Repin peoples of Pre-Yamnaya ancestry must have had admixture through exogamy with late Sredni Stog/Proto-Corded Ware peoples during their expansion into the North Pontic area, and Sredni Stog in turn had probably some Pre-Yamnaya admixture, too (although they don’t appear in the simplistic formal stats above). This is supported by the increase of Anatolia farmer ancestry in more western Yamna samples.
Later, Yamnaya admixed through exogamy with Corded Ware-like populations in Central Europe during their expansion. Even samples from the Middle to Upper Danube and around the Lower Rhine will probably show increasing contributions of Steppe_MLBA, at the same time as they show an increasing proportion of EEF-related ancestry.
To complicate things further, the late Corded Ware Espersted family (from ca. 2500 BC or later) shows, in turn, what seems like a recent admixture with Yamnaya vanguard groups, with the sample of highest Yamnaya ancestry being the paternal uncle of other individuals (all of hg. R1a-M417), suggesting that there might have been many similar Central European mating networks from the mid-3rd millennium BC on, of (mainly) Yamnaya-like R1b elites displaying a small proportion of CW-like ancestry admixing through exogamy with Corded Ware-like peoples who already had some Yamnaya ancestry.
NOTE. Terms like “exogamy”, “male-driven migration”, and “sex bias”, are not only based on the Y-chromosome bottlenecks visible in the different cultural expansions since the Palaeolithic. Despite the scarce sampling available in 2017 for analysis of “Steppe ancestry”-related populations, it appeared to show already a male sex bias in Goldberg et al. (2017), and it has been confirmed for Neolithic and Copper Age population movements in Mathieson et al. (2018) – see Supplementary Table 5. The analysis of male-biased expansion of “Steppe ancestry” in CWC Esperstedt and Bell Beaker Germany is, for the reasons stated above, not very useful to distinguish their mutual influence, though.
Based on data from Olalde et al. (2019), Bell Beakers from Germany are the closest sampled ones to expanding East Bell Beakers, and those close to the Rhine – i.e. French, Dutch, and British Beakers in particular – show a clear excess “Steppe ancestry” due to their exogamy with local Corded Ware groups:
Only one 2-way model fits the ancestry in Iberia_CA_Stp with P-value>0.05: Germany_Beaker + Iberia_CA. Finding a Bell Beaker-related group as a plausible source for the introduction of steppe ancestry into Iberia is consistent with the fact that some of the individuals in the Iberia_CA_Stp group were excavated in Bell Beaker associated contexts. Models with Iberia_CA and other Bell Beaker groups such as France_Beaker (P-value=7.31E-06), Netherlands_Beaker (P-value=1.03E-03) and England_Beaker (P-value=4.86E-02) failed, probably because they have slightly higher proportions of steppe ancestry than the true source population.
The exogamy with Corded Ware-like groups in the Lower Rhine Basin seems at this point undeniable, as is the origin of Bell Beakers around the Middle-Upper Danube Basin from Yamnaya Hungary.
To avoid this excess “Steppe ancestry” showing up in the maps, since Bell Beakers from Germany pack the most Yamnaya ancestry among East Bell Beakers outside Hungary (ca. 51.1% “Steppe ancestry”), I equated this maximum with BK_Scotland_Ach (which shows ca. 61.1% “Steppe ancestry”, highest among western Beakers), and applied a simple rule of three for “Steppe ancestry” in Dutch and British Beakers.
NOTE. Formal stats for “Steppe ancestry” in Bell Beaker groups are available in Olalde et al. (2018) supplementary materials (PDF). I didn’t apply this adjustment to Bk_FR groups because of the R1b Bell Beaker sample from the Champagne/Alsace region reported by Samantha Brunel that will pack more Yamnaya ancestry than any other sampled Beaker to date, hence probably driving the Yamnaya ancestry up in French samples.
The most likely outcome in the following years, when Yamnaya and Corded Ware ancestry are investigated separately, is that Yamnaya ancestry will be much lower the farther away from the Middle and Lower Danube region, similar to the case in Iberia, so the map above probably overestimates this component in most Beakers to the north of the Danube. Even the late Hungarian Beaker samples, who pack the highest Yamnaya ancestry (up to 75%) among Beakers, represent likely a back-migration of Moravian Beakers, and will probably show a contribution of Corded Ware ancestry due to the exogamy with local Moravian groups.
Despite this decreasing admixture as Bell Beakers spread westward, the explosive expansion of Yamnaya R1b male lineages (in words of David Reich) and the radical replacement of local ones – whether derived from Corded Ware or Neolithic groups – shows the true extent of the North-West Indo-European expansion in Europe:
There is scarce data on Palaeo-Balkan movements yet, although it is known that:
Yamnaya ancestry appears among Mycenaeans, with the Yamnaya Bulgaria sample being its best current ancestral fit;
Interestingly, Potapovka is the only Corded Ware derived culture that shows good fits for Yamnaya ancestry, despite having replaced Poltavka in the region under the same Corded Ware-like (Abashevo) influence as Sintashta.
Srubnaya ancestry shows a best fit with non-Pre-Yamnaya ancestry, i.e. with different CHG + EHG components – possibly because the more western Potapovka (ancestral to Proto-Srubnaya Pokrovka) also showed good fits for it. Srubnaya shows poor fits for Pre-Yamnaya ancestry probably because Corded Ware-like (Abashevo) genetic influence increased during its formation.
On the other hand, more eastern Corded Ware-derived groups like Sintashta and its more direct offshoot Andronovo show poor fits with this model, too, but their fits are still better than those including Pre-Yamnaya ancestry.
The bottleneck of Proto-Indo-Iranians under R1a-Z93 was not yet complete by the time when the Sintashta-Potapovka-Filatovka community expanded with the Srubna-Andronovo horizon:
At the end of the Afanasevo culture, at least three samples show hg. Q1b (ca. 2900-2500 BC), which seemed to point to a resurgence of local lineages, despite continuity of the prototypical Pre-Yamnaya ancestry. On the other hand, Anthony (2019) makes this cryptic statement:
Yamnaya men were almost exclusively R1b, and pre-Yamnaya Eneolithic Volga-Caspian-Caucasus steppe men were principally R1b, with a significant Q1a minority.
Since the only available samples from the Khvalynsk community are R1b (x3), Q1a(x1), and R1a(x1), it seems strange that Anthony would talk about a “significant minority”, unless Q1a (potentially Q1b in the newer nomenclature) will pop up in some more individuals of those ca. 30 new to be published. Because he also mentions I2a2 as appearing in one elite burial, it seems Q1a (like R1a-M459) will not appear under elite kurgans, although it is still possible that hg. Q1a was involved in the expansion of Afanasevo to the east.
Okunevo, which replaced Afanasevo in the Altai region, shows a majority of hg. Q1b, but also some R1b-M269 samples proper of Afanasevo, suggesting partial genetic continuity.
NOTE. Other sampled Siberian populations clearly show a variety of Q subclades that likely expanded during the Palaeolithic, such as Baikal EBA samples from Ust’Ida and Shamanka with a majority of Q1b, and hg. Q reported from Elunino, Sagsai, Khövsgöl, and also among peoples of the Srubna-Andronovo horizon (the Krasnoyarsk MLBA outlier), and in Karasuk.
(…) in contrast to the lack of identifiable admixture from Yamnaya and Afanasievo in the CentralSteppe_EMBA, there is an admixture signal of 10 to 20% Yamnaya and Afanasievo in the Okunevo_EMBA samples, consistent with evidence of western steppe influence. This signal is not seen on the X chromosome (qpAdm P value for admixture on X 0.33 compared to 0.02 for autosomes), suggesting a male-derived admixture, also consistent with the fact that 1 of 10 Okunevo_EMBA males carries a R1b1a2a2 Y chromosome related to those found in western pastoralists. In contrast, there is no evidence of western steppe admixture among the more eastern Baikal region region Bronze Age (~2200 to 1800 BCE) samples.
Haplogroup diversity seems to be common in Iron Age populations all over Eurasia, most likely due to the spread of different types of sociopolitical structures where alliances played a more relevant role in the expansion of peoples. A well-known example of this is the spread of Akozino warrior-traders in the whole Baltic region under a partial N1a-VL29-bottleneck associated with the emerging chiefdom-based systems under the influence of expanding steppe nomads.
Surprisingly, then, Proto-Tocharians from Shirenzigou pack up to 74% Yamnaya ancestry, in spite of the 2,000 years that separate them from the demise of the Afanasevo culture. They show more Yamnaya ancestry than any other population by that time, being thus a sort of Late PIE fossils not only in their archaic dialect, but also in their genetic profile:
The recent intrusion of Corded Ware-like ancestry, as well as the variable admixture with Siberian and East Asian populations, both point to the known intense Old Iranian and Old/Middle Chinese contacts. The scarce Proto-Samoyedic and Proto-Turkic loans in Tocharian suggest a rather loose, probably more distant connection with East Uralic and Altaic peoples from the forest-steppe and steppe areas to the north (read more about external influences on Tocharian).
Interestingly, both R1b samples, MO12 and M15-2 – likely of Asian R1b-PH155 branch – show a best fit for Andronovo/Srubna + Hezhen/Ulchi ancestry, suggesting a likely connection with Iranians to the east of Xinjiang, who later expanded as the Wusun and Kangju. How they might have been related to Huns and Xiongnu individuals, who also show this haplogroup, is yet unknown, although Huns also show hg. R1a-Z93 (probably most R1a-Z2124) and Steppe_MLBA ancestry, earlier associated with expanding Iranian peoples of the Srubna-Andronovo horizon.
All in all, it seems that prehistoric movements explained through the lens of genetic research fit perfectly well the linguistic reconstruction of Proto-Indo-European and Proto-Uralic.
We know that the Caucasus Mountains formed a persistent prehistoric barrier to cultural and population movements. Nevertheless, an even more persistent frontier to population movements in Europe, especially since the Neolithic, is the Pontic-Caspian steppe – forest-steppe ecotone.
Like the Caucasus, this barrier could certainly be crossed, and peoples and cultures could permeate in both directions, but there have been no massive migrations through it. The main connection between both regions (steppe vs. forest-steppe/forest zone) was probably through its eastern part, through the Samara region in the Middle Volga.
The chances of population expansions crossing this natural barrier anywhere else seem quite limited, with a much less porous crossing region in the west, through the Dnieper-Dniester corridor.
A Persistent ecological and cultural frontier
It is very difficult to think about any culture that transgressed this persistent ecological and cultural frontier: many prehistoric and historical steppe pastoralists did appear eventually in the neighbouring forest-steppe areas during their expansions (e.g. Yamna, Scythians, or Turks), as did forest groups who permeated to the south (e.g. Comb Ware, GAC, or Abashevo), but their respective hold in foreign biomes was mostly temporary, because their cultures had to adapt to the new ecological environment. Most if not all groups originally from a different ecological niche eventually disappeared, subjected to renewed demographic pressure from neighbouring steppe or forest populations…
Before the emergence of pastoralism, the cultural contacts of the Pontic region (i.e. forest-steppes) with the Baltic were intense. In fact, the connection of the north Pontic area with the Baltic through the Dnieper-Dniester corridor and the Podolian-Volhynian region is essential to understand the spread of peoples of post-Maglemosian and post-Swiderian cultures (to the south), hunter-gatherer pottery (to the north), TRB (to the south), Late Trypillian groups (north), GAC (south), or Comb Ware (south) (see here for Eneolithic movements), and finally steppe ancestry and R1a-Z645 with Corded Ware (north). After the complex interaction of TRB, Trypillia, GAC, and CWC during the expansion of late Repin, this traditional long-range connection is lost and only emerges sporadically, such as with the expansion of East Germanic tribes.
A barrier to steppe migrations into northern Europe
One may think that this barrier was more permeable, then, in the past. However, the frontier is between steppe and forest-steppe ecological niches, and this barrier evolved during prehistory due to climate changes. The problem is, before the drought that began ca. 4000 BC and increased until the Yamna expansion, the steppe territory in the north Pontic region was much smaller, merely a strip of coastal land, compared to its greater size ca. 3300 BC and later.
This – apart from the cultural and technological changes associated with nomadic pastoralism – justifies the traditional connection of the north Pontic forest-steppes to the north, broken precisely after the expansion of Khvalynsk, as the north Pontic area became gradually a steppe region. The strips of north Pontic and Azov steppes and Crimea seem to have had stronger connections to the Northern Caucasus and Northern Caspian steppes than with the neighbouring forest-steppe areas during the Upper Palaeolithic, Mesolithic, and Neolithic.
NOTE. We still don’t know the genetic nature of Mikhailovka or Ezero, steppe-related groups possibly derived from Novodanilovka and Suvorovo close to the Black Sea (which possibly include groups from the Pannonian plains), and how they compare to neighbouring typically forest-steppe cultures of the so-called late Sredni Stog groups, like Dereivka or partly Kvityana.
Despite the Pontic-Caspian steppes and forest-steppes neighbouring each other for ca. 2,000 km, peoples from forested and steppe areas had an obvious advantage in their own regions, most likely due to the specialization of their subsistence economy. While this is visible already in Palaeolithic and Mesolithic hunter-gatherers, the arrival of the Neolithic package in the Pontic-Caspian region incremented the difference between groups, by spreading specialized animal domestication. The appearance of nomadic pastoralism adapted to the steppe, eventually including the use of horses and carts, made the cultural barrier based on the economic know-how even stronger.
Even though groups could still adapt and permeate a different territory (from steppe to forest-steppe/forest and vice-versa), this required an important cultural change, to the extent that it is eventually complicated to distinguish these groups from neighbouring ones (like north-west Pontic Mesolithic or Neolithic groups and their interaction with the steppes, Trypillia-Usatovo, Scythians-Thracians, etc.). In fact, this steppe – forest-steppe barrier is also seen to the east of the Urals, with the distinct expansion of Andronovo and Seima-Turbino/Andronovo-like horizons, which seem to represent completely different ethnolinguistic groups.
As a result of this cultural and genetic barrier, like that formed by the Northern Caucasus:
1) No steppe pastoralist culture (which after the emergence of Khvalynsk means almost invariably horse-riding, chariot-using nomadic herders who could easily pasture their cows in the huge grasslands without direct access to water) has ever been successful in spreading to the north or north-west into northern Europe, until the Mongols. No forest culture has ever been successful in expanding to the steppes, either (except for the infiltration of Abashevo into Sintashta-Potapovka).
2) Corded Ware was not an exception: like hunter-gatherer pottery before it (and like previous population movements of TRB, late Trypillia, GAC, Comb Ware or Lublin-Volhynia settlers) their movements between the north Pontic area and central Europe happened through forest-steppe ecological niches due to their adaptation to them. There is no reason to support a direct connection of CWC with true steppe cultures.
3) The so-called “Steppe ancestry” permeated the steppe – forest-steppe ecotone for hundreds of years during the 5th and early 4th millennium BC, due to the complex interaction of different groups, and probably to the aridization trend that expanded steppe (and probably forest-steppe) to the north. Language, culture, and paternal lineages did not cross that frontier, though.
EDIT (4 FEB 2019): Wang et al. is out in Nature Communications. They deleted the Yamna Hungary samples and related analyses, but it’s interesting to see where exactly they think the trajectory of admixture of Yamna with European MN cultures fits best. This path could also be inferred long ago from the steppe connections shown by the Yamna Hungary -> Bell Beaker evolution and by early Balkan samples:
It has been known for a long time that the Caucasus must have hosted many (at least partially) isolated populations, probably helped by geographical boundaries, setting it apart from open Eurasian areas.
David Reich writes in his book the following about India:
The genetic data told a clear story. Around a third of Indian groups experienced population bottlenecks as strong or stronger than the ones that occurred among Finns or Ashkenazi Jews. We later confirmed this finding in an even larger dataset that we collected working with Thangaraj: genetic data from more than 250 jati groups spread throughout India (…)
Rather than an invention of colonialism as Dirks suggested, long-term endogamy as embodied in India today in the institution of caste has been overwhelmingly important for millennia. (…)
The Han Chinese are truly a large population. They have been mixing freely for thousands of years. In contrast, there are few if any Indian groups that are demographically very large, and the degree of genetic differentiation among Indian jati groups living side by side in the same village is typically two to three times higher than the genetic differentiation between northern and southern Europeans. The truth is that India is composed of a large number of small populations.
There is little doubt now, based on findings spanning thousands of years, that the Mesolithic and Neolithic Caucasus hosted various very small populations, even if the ancestral components may be reduced to the few known to date (such as ANE, EHG, AME*, ENA, CHG, and other “deep” ancestral components).
NOTE. I will call the ancestral component of Dzudzuana/Anatolian hunter-gatherers Ancient Middle Easterner (AME), to give a clear idea of its likely extension during the Late Upper Palaeolithic, and to avoid using the more simplistic Dzudzuana, unless it is useful to mention these specific local samples.
Genetic labs have a strong fixation with ancestry. I guess the use of complex statistical methods gives professionals and laymen alike the feeling of dealing with “Science”, as opposed to academic fields where you have to interpret data. I think language reveals a lot about the way people think, and the fact that ancestral components are called ‘lineages’ – while not wrong per se – is a clear symptom of the lack of interest in the true lineages: Y-DNA haplogroups.
It has become quite clear that male-biased migrations are often the ones which can be confidently followed for actual population movements and ethnolinguistic identification, at least until the Iron Age. The frequently used Palaeolithic clusters offer a clear example of why ancestry does not represent what some people believe: They merely give a basic idea of sizeable population replacements by distant peoples.
Both concepts are important: sizeable and distant peoples. For example, during the Upper Palaeolithic in Europe there was a sizeable population replacement of the Aurignacian Goyet cluster by the Gravettian Vestonice cluster (probably from populations of far eastern Russia) coupled with the arrival of haplogroup I, although during the thousands of years that this material culture lasted, the previously expanded C1a2 lineages did not disappear, and there were probably different resurgence and admixture events.
Haplogroup I certainly expanded with the Gravettian culture to Iberia, where the Goyet ancestry did not change much – probably because of male-driven migrations -, to the extent that during the Magdalenian expansions haplogroup I expanded with an ancestry closer to Goyet, in what is called a ‘resurge’ of the Goyet cluster – even though there is a clear replacement of male lines.
The Villabruna (WHG) cluster is another good example. It probably spread with haplogroup R1b-L754, which – based on the extra ‘East Asian’ affinity of some samples and on modern samples from the Middle East – came probably from the east through a southern route, and not too long before the expansion of WHG likely from around the Black Sea, although this is still unclear. The finding of haplogroup I in samples of mostly WHG ancestry could confuse people that do not care about timing, sub-structured populations, and gene flow.
NOTE. If you don’t understand why ‘clusters’ that span thousands of years don’t really matter for the many Palaeolithic population expansions that certainly happened among hunter-gatherers in Europe, just take a look at what happened with Bell Beakers expanding from Yamna into western Europe within 500 years.
If we don’t thread carefully when talking about population migrations, these terms are bound to confuse people. Just as the fixation on “steppe ancestry” – which marks the arrival in Chalcolithic Europe of peoples from the Pontic-Caspian region – has confused a lot of researchers to this day.
When I began to write about the Indo-European demic diffusion model, my concern was to find a single spot where a North-West Indo-European proto-language could have expanded from ca. 2000 BC (our most common guesstimate). Based on the 2015 papers, and in spite of their conclusions, I thought it had become clear that Corded Ware was not it, and it was rather Bell Beakers. I assumed that Uralic was spoken to the north (as was the traditional belief), and thus Corded Ware expanded from the forest zone, hence steppe ancestry would also be found there with other R1a lineages.
With the publication of Mathieson et al. (2017) and Olalde et al. (2017), I changed my mind, seeing how “steppe ancestry” did in fact appear quite late, hence it was likely to be the result of very specific population movements, probably directly from the Caucasus. Later, Mathieson published in a revision the sample from Alexandria of hg R1a-M417 (probably R1a-Z645, possibly Z93+), which further supported the idea that the migration of Corded Ware peoples started near the North Pontic forest-steppe (as I included in a the next revision).
The question remains the same I repeated recently, though: where do the extra Caucasus components (i.e. beyond EHG) of Eneolithic Ukraine/Corded Ware and Khvalynsk/Yamna come from?
Considering 2-way mixtures, we can model Karelia_HG as deriving 34 ± 2.8% of its ancestry from a Villabruna-related source, with the remainder mainly from ANE represented by the AfontovaGora3 (AG3) sample from Lake Baikal ~17kya.
AG3 was likely of haplogroup Q1a (as reported by YFull, see Genetiker), and probably the ANE ancestry found in Eastern Europe accompanied a Palaeolithic migration of Q1a2-M25 (formed ca. 22600 BC, TMRCA ca. 14300 BC).
Combined with what we know about the Eneolithic Steppe and Caucasus populations – it is likely that ANE ancestry remained the most important component of some of the small ghost populations of the Caucasus until their emergence with the Lola culture.
The first sample we have now attributed to the EHG cluster is Sidelkino, from the Samara region (ca. 9300 BC), mtDNA U5a2. In Damgaard et al. (Science 2018), Yamnaya could be modelled as a CHG population related to Kotias Klde (54%) and the remaining from ANE population related to Sidelkino (>46%), with the following split events:
A split event, where the CHG component of Yamnaya splits from KK1. The model inferred this time at 27 kya (though we note the larger models in Sections S2.12.4 and S2.12.5 inferred a more recent split time).
A split event, where the ANE component of Yamnaya splits from Sidelkino. This was inferred at about about 11 kya.
A split event, where the ANE component of Yamnaya splits from Botai. We inferred this to occur 17 kya. Note that this is above the Sidelkino split time, so our model infers Yamnaya to be more closely related to the EHG Sidelkino, as expected.
An ancestral split event between the CHG and ANE ancestral populations. This was inferred to occur around 40 kya.
Other samples classified as of the EHG cluster:
Popovo2 (ca. 6250 BC) of hg J1, mtDNA U4d – Po2 and Po4 from the same site (ca. 6550 BC) show continuity of mtDNA.
Karelia_HG, from Juzhnii Oleni Ostrov (ca. 6300 BC): I0211/UzOO40 (ca. 6300 BC) of hg J1(xJ1a), mtDNA U4a; and I0061/UzOO74 of hg R1a1(xR1a1a), mtDNA C1
UzOO77 and UzOO76 from Juzhnii Oleni Ostrov (ca. 5250 BC) of mtDNA R1b.
Samara_HG from Lebyanzhinka (ca. 5600 BC) of hg R1b1a, mtDNA U5a1d.
About the enigmatic Anatolia_Neolithic-related ancestry found in Pontic-Caspian steppe samples, this is what Wang et al. (2018) had to say:
We focused on model of mixture of proximal sources such as CHG and Anatolian Chalcolithic for all six groups of the Caucasus cluster (Eneolithic Caucasus, Maykop and Late Makyop, Maykop-Novosvobodnaya, Kura-Araxes, and Dolmen LBA), with admixture proportions on a genetic cline of 40-72% Anatolian Chalcolithic related and 28-60% CHG related (Supplementary Table 7). When we explored Romania_EN and Greece_Neolithic individuals as alternative southeast European sources (30-46% and 36-49%), the CHG proportions increased to 54-70% and 51-64%, respectively. We hypothesize that alternative models, replacing the Anatolian Chalcolithic individual with yet unsampled populations from eastern Anatolia, South Caucasus or northern Mesopotamia, would probably also provide a fit to the data from some of the tested Caucasus groups.
The first appearance of ‘Near Eastern farmer related ancestry’ in the steppe zone is evident in Steppe Maykop outliers. However, PCA results also suggest that Yamnaya and later groups of the West Eurasian steppe carry some farmer related ancestry as they are slightly shifted towards ‘European Neolithic groups’ in PC2 (Fig. 2D) compared to Eneolithic steppe. This is not the case for the preceding Eneolithic steppe individuals. The tilting cline is also confirmed by admixture f3-statistics, which provide statistically negative values for AG3 as one source and any Anatolian Neolithic related group as a second source
Detailed exploration via D-statistics in the form of D(EHG, steppe group; X, Mbuti) and D(Samara_Eneolithic, steppe group; X, Mbuti) show significantly negative D values for most of the steppe groups when X is a member of the Caucasus cluster or one of the Levant/Anatolia farmer-related groups (Supplementary Figs. 5 and 6). In addition, we used f- and D-statistics to explore the shared ancestry with Anatolian Neolithic as well as the reciprocal relationship between Anatolian- and Iranian farmer-related ancestry for all groups of our two main clusters and relevant adjacent regions (Supplementary Fig. 4). Here, we observe an increase in farmer-related ancestry (both Anatolian and Iranian) in our Steppe cluster, ranging from Eneolithic steppe to later groups. In Middle/Late Bronze Age groups especially to the north and east we observe a further increase of Anatolian farmer related ancestry consistent with previous studies of the Poltavka, Andronovo, Srubnaya and Sintashta groups and reflecting a different process not especially related to events in the Caucasus.
(…) Surprisingly, we found that a minimum of four streams of ancestry is needed to explain all eleven steppe ancestry groups tested, including previously published ones (Fig. 2; Supplementary Table 12). Importantly, our results show a subtle contribution of both Anatolian farmer-related ancestry and WHG-related ancestry (Fig.4; Supplementary Tables 13 and 14), which was likely contributed through Middle and Late Neolithic farming groups from adjacent regions in the West. The discovery of a quite old AME ancestry has rendered this probably unnecessary, because this admixture from an Anatolian-like ghost population could be driven even by small populations from the Caucasus.
While it is not yet fully clear, the increased Anatolian_Neolithic-like ancestry in Ukraine_Eneolithic samples (see below) makes it unlikely that all such ancestry in Corded Ware groups comes from a GAC-related contribution. It is likely that at least part of it represents contributions from populations of the Caucasus, based on the mostly westward population movements in the steppe from ca. 4600 BC on, including the Suvorovo-Novodanilovka expansion, and especially the Kuban-Maykop expansion during the final Eneolithic into the North Pontic area.
NOTE. Since CHG-like groups from the Caucasus may have combinations of AME and ANE ancestry similar to Yamna (which may thus appear as ‘steppe ancestry’ in the North Pontic area), it is impossible to interpret with precision the following ADMIXTURE graphic:
The East Asian contribution to samples from the WHG samples (like Loschbour or La Braña), as specified in Fu et al. (2016), does not seem to be related to Baikal_EN, and appears possibly (in the ADMIXTURE analysis) integrated into he Villabruna component. I guess this implies that the shared alleles with East Asians are quite early, and potentially due to the expansion of R1b-L754 from the East.
It would be interesting to know the specific material culture Sidelkino belonged to – i.e. if it was related to the expansion of the North-Eastern Technocomplex – , and its Y-DNA. The Post-Swiderian expansion into eastern Europe, probably associated with the expansion of R1b-P297 lineages (including R1b-M73, found later in Botai and in Baltic HG) is supposed to have begun during the 11th millennium BC, but migrations to the Urals and beyond are probably concentrated in the 9th millennium, so this sample is possibly slightly early for R1b.
NOTE. User Rozenfeld at Anthrogenica posted this, which I think is interesting (in case anyone wants to try a Y-SNP call):
there is something strange with Sidelkino EHG: first, its archaeological context is not described in the supplementary. Second, its sex is not listed in the supplementary tables. Third, after looking for info about this sample, I found that: “Сиделькино-3. Для снятия вопроса о половой принадлежности индивида была проведена генетическая экспертиза, выявившая принадлежность останков мужчине.”(translation: Sidelkino-3. To resolve the question about sex of the remains, the genetic analysis was conducted, which showed that remains belonged to male), source: http://static.iea.ras.ru/books/7487_Traditsii.pdf
So either they haven’t mentioned his Y-DNA in the paper for some reason, or there are more than one Sidelkino sample and the male one has not yet been published. The coverage of the Sidelkino sample from the paper is 2.9, more than enough to tell Y-DNA haplogroup.
My speculative guess right now about specific population movements in far eastern Europe, based on the few data we have:
The expansion of the North-Eastern Technocomplex first around the 9th millennium BC, most likely expanded R1b-P279 ca. 11300 BC, judging by its TMRCA, with both R1b-M73 (TMRCA 5300) and R1b-M269 (TMRCA 4400 BC) info (with extra El Mirón ancestry) back, and thus Eurasiatic.
The expansion of haplogroup J1 to the north may have happened before or after the R1b-P279 expansion. Judging by the increase in AG3-related ancestry near Karelia compared to Baltic_HG, it is possible that it expanded just after R1b-P279 (hence possibly J1-Y6304? TMRCA 9700 BC). Its long-lasting presence in the Caucasus is supported by the Satsurblia (ca. 11300 BC) and the Dolmen BA (ca. 1300 BC) samples.
The expansion of R1a-M17 ca. 6600 BC is still likely to have happened from the east, based on the R1a-M17 samples found in Baikalic cultures slightly later (ca. 5300 BC). The presence of elevated Baikal_EN ancestry in Karelia HG and in Samara HG, and the finding of R1a-M417 samples in the Forest Zone after the Mesolithic suggests a connection with the expansion of Hunter-Gatherer pottery, from the Elshanka culture in the Samara region northward into the Forset Zone and westward into the North Pontic area.
The expansion of R1b-M73 ca. 5300 BC is likely to be associated with the emergence of a group east of the Urals (related to the later Botai culture, and potentially Pre-Yukaghir). Its presence in a Narva sample from Donkalnis (ca. 5200 BC) suggest either an early split and spread of both R1b-P297 lineages (M73 and M269) through Eastern Europe, or maybe a back-migration with hunter-gatherer pottery.
R1b-M269 spread successfully ca. 4400 BC (and R1b-L23 ca. 4100 BC, both based on TMRCA), and this successful expansion is probably to be associated with the Khvalynsk-Novodanilovka expansion. We already know that Samara_HG ca. 5600 was R1b1a, so it is likely that R1b-M269 appeared (or ‘resurged’) in the Volga-Ural region shortly after the expansion of R1a-M17, whose expansion through the region may be inferred by the additional AG3 and Baikal_EN ancestry. Interesting from Samara_HG compared to the previous Sidelkino sample is the introduction of more El Mirón-related ancestry, typical of WHG populations (and thus proper of Baltic groups).
NOTE. The TMRCA dates are obviously gross approximations, because a) the actual rate of mutation is unknown and b) TMRCA estimates are based on the convergence of lineages that survived. The potential finding of R1a-Z645 (possibly Z93+) in Ukraine Eneolithic (ca. 4000 BC), and the potential finding of R1b-L23 in Khvalynsk ca. 4250 BC complicates things further, in terms of dates and origins of any subclade.
The question thus remains as it was long ago: did R1b-M269 lineages expand (‘return’) from the east, near the Urals, or directly from the north? Were they already near Samara at the same time as the expansion of hunter-gatherer pottery, and were not much affected by it? Or did they ‘resurge’ from populations admixed with Caucasus-related ancestry after the expansion of R1a-M17 with this pottery (since there are different stepped expansions from the Samara region)? We could even ask, did R1a-M17 really expand from the east, i.e. are the dates on Baikalic subclades from Moussa et al. (2016) reliable? Or did R1a-M17 expand from some pockets in the Pontic-Caspian steppe, taking over the expansion of HG pottery at some point?
The most interesting aspect from the new paper (regarding Indo-Uralic migrations) is that Ancestral Middle Easterner ancestry will probably be a better proxy for the Anatolia_Neolithic component found in Ukraine Mesolithic to Eneolithic, and possibly also for some of the “more CHG-like” component found among Pontic-Caspian steppe populations, all likely derived from different admixture events with groups from the Caucasus.
NOTE. Even the supposed gene flow of Neolithic Iranian ancestry into the Caucasus can be put into question, since that means possibly a Dzudzuana-like population with greater “deep ancestry” proportion than the one found in CHG, which may still be found within the Caucasus.
If it was not clear already that following ‘steppe ancestry’ wherever it appears is a rather lame way of following Indo-European migrations, every single sample from the Caucasus and their admixture with Pontic-Caspian steppe populations will probably show that “steppe ancestry” is in fact formed by a variety of steppe-related ancestral components, impossible to follow coherently with a single population. Exactly what is happening already with the Siberian ancestry.
If the paper on the Dzudzuana samples has shown something, is that the expansion of an ANE-like population shook the entire Caucasus area up to the Zagros Mountains, creating this ANE – AME cline that are CHG and Iran_N, with further contributions of “deep ancestries” (probably from the south) complicating the picture further.
If this happens with few known samples, and we know of an ANE-like ghost population in the Caucasus (appearing later in the Lola culture), we can already guess that the often repeated “CHG component” found in Ukraine_Eneolithic and Khvalynsk will not be the same (except the part mediated by the Novodanilovka expansion).
This ANE-like expansion happened probably in the Late Upper Palaeolithic, and reached Northern Europe probably after the expansion of the Villabruna cluster (ca. 12000 BC), judging by the advance of AG3-like and ENA-like ancestry in later WHG samples.
The population movements during the Mesolithic and Early Neolithic in the North Pontic area are quite complicated: the extra AME ancestry is probably connected to the admixture with populations from the Caucasus, while the close similarity of Ukraine populations with Scandinavian ones (with an increase in Villabruna ancestry from Mesolithic to Neolithic samples), probably reveal population movements related to the expansion of Maglemose-related groups.
These Maglemose-related groups were probably migrants from the north-west, originally from the Northern European Plains, who occupied the previous Swiderian territory, and then expanded into the North Pontic area. The overwhelming presence of I2a (likely all I2a2a1b1b) lineages in Ukraine Neolithic supports this migration.
The likely picture of Mesolithic-Neolithic migrations in the North Pontic area right now is then:
Expansion of R1a-M459 from the east ca. 12000 BC – probably coupled with AG3 and also some Baikal_EN ancestry. First sample is I1819 from Vasilievka (ca. 8700 BC), another is from Dereivka ca. 6900 BC.
Expansion of R1b-V88 from the Balkans in the west ca. 9700 BC, based on its TMRCA and also the Balkan hunter-gatherer population overwhemingly of this haplogroup from the 10th millennium until the Neolithic. First sample is I1734 from Vasilievka (ca. 7252 BC), which suggests that it replaced the male population there, based on their similar EHG-like adxmixture (and lack of sizeable WHG increase), and shared mtDNA U5b2, U5a2.
Expansion of I2a-Y5606 probably ca. 6800 based on its TMRCA with Janislawice culture. Supporting this is the increase in WHG contribution to Neolithic samples, including the spread of U4 subclades compared to the previous period.
Expansion of R1a-M17 starting probably ca. 6600 BC in the east (see above).
NOTE. The first sample of haplogroup I appears in the Mesolithic: I1763 (ca. 8100 BC) of haplogroup I2a1, probably related to an older Upper Palaeolithic expansion.
It is becoming more and more clear with each new paper that – unless the number of very ancient samples increases – the use of Y-chromosome haplogroups remains one of the most important tools for academics; this is especially so in the steppes, in light of the diversity found in populations from the Caucasus. A clear example comes from the Yamna – Corded Ware similarities:
The presence of haplogroups Q and R1a-M459 (xM17) in Khvalynsk along with a R1b1a sample, which some interpreted as being akin to modern ‘mixed’ populations in the past, is likely to point instead to a period of Khvalynsk-Novodanilovka expansion with R1b-M269, where different small populations from the steppe were being integrated into the common Khvalynsk stock, but where differences are seen in material culture surrounding their burials, as supported by the finding of R1b1 in the Kuban area already in the first half of the 5th millennium. The case would be similar to the early ‘mixed’ Icelandic population.
Only after the emergence of the Samara culture (in the second half of the 6th millennium BC), with a sample of haplogroup R1b1a, starts then the obvious connection with Early Proto-Indo-Europeans; and only after the appearance of late Sredni Stog and haplogroup R1a-M417 (ca. 4000 BC) is its connection with Uralic also clear. In previous population movements, I think more haplogroups were involved in migrations of small groups, and only some communities among them were eventually successful, expanding to be dominant, creating ever growing cultures during their expansions.
Indeed, if you think in terms of Uralic and Indo-European just as converging languages, and forget their potential genetic connection, then the genetic + linguistic picture becomes simplified, and the upper frontier of the 6th millennium BC with a division North Pontic (Mariupol) vs. Volga-Ural (Samara) is enough. However, tracing their movements backwards – with cultural expansions from west to east (with the expansion of farming), and earlier east to west (with hunter-gatherer pottery), and still earlier west to east (with the north-eastern technocomplex), offers an interesting way to prove their potential connection to macrofamilies, at least in terms of population movements.
I am quite convinced right now that it would be possible to connect the expansion of R1b-L754 subclades with a speculative Nostratic (given the R1b-V88 connection with Afroasiatic, and the obvious connection of R1b-L297 with Eurasiatic). Paradoxically, the connection of an Indo-Uralic community in the steppes (after the separation of Yukaghir) with any lineage expansion (R1a-M17, R1b-M269, or even Q, I or J1) seems somehow blurrier than one year ago, possibly just because there are too many open possibilities.
David Reich says about the admixture with Neanderthals, which he helped discover:
At the conclusion of the Neanderthal genome project, I am still amazed by the surprises we encountered. Having found the first evidence of interbreeding between Neanderthals and modern humans, I continue to have nightmares that the finding is some kind of mistake. But the data are sternly consistent: the evidence for Neanderthal interbreeding turns out to be everywhere. As we continue to do genetic work, we keep encountering more and more patterns that reflect the extraordinary impact this interbreeding has had on the genomes of people living today.
I think this is a shared feeling among many of us who have made proposals about anything, to fear that we have made a gross, evident mistake, and constantly look for flaws. However, it seems to me that geneticists are more preoccupied with being wrong in their developed statistical methods, in the theoretical models they are creating, and not so much about errors in the true ancient ethnolinguistic picture human population genetics is (at least in theory) concerned about. Their publications are, after all, constantly associating genetic finds with cultures and (whenever possible) languages, so this aspect of their research should not be taken lightly.
Seeing how David Anthony or Razib Khan (among many others) have changed their previously preferred migration models as new data was published, and they continue to be respected in their own fields, I guess we can be confident that professionals with integrity are going to accept whatever new picture appears. While I don’t think that genetic finds can change what we can reconstruct with comparative grammar, I am also ready to revise guesstimates and routes of expansion of certain dialects if R1a-Z645 is shown to have accompanied Late Proto-Indo-Europeans during their expansion with Yamna, and later integrated somehow with Corded Ware.
However, taking into account the obsession of some with an ancestral, uninterrupted R1a—Indo-European association, and the lack of actual political repercussion of Neanderthal admixture, I think the most common nightmare that all genetic researchers should be worried about is to keep inflating this “Yamnaya ancestry”-based hornet’s nest, which has been constantly stirred up for the past two years, by rejecting it – or, rather, specifying it into its true complex nature.
This succession of corrections and redefinitions, coupled with the distinct Y-DNA bottleneck of each steppe population, will eventually lead to a completely different ethnolinguistic picture of the Pontic-Caspian region during the Eneolithic, which is likely to eventually piss off not only reasonable academics stubbornly attached to the CWC-IE idea, but also a part of those interested in daydreaming about their patrilineal ancestors.
Sometimes it’s better to just rip off the band-aid once and for all…
I was reading The Bronze Age Landscape in the Russian Steppes: The Samara Valley Project (2016), and I was really surprised to find the following excerpt by David W. Anthony:
The Samara Valley links the central steppes with the western steppes and is a north-south ecotone between the pastoral steppes to the south and the forest-steppe zone to the north [see figure below]. The economic contrast between pastoral steppe subsistence, with its associated social organizations, and forest-zone hunting and fishing economies probably explains the shifting but persistent linguistic border between forest-zone Uralic languages to the north (today largely displaced by Russian) and a sequence of steppe languages to the south, recently Turkic, before that Iranian, and before that probably an eastern dialect of Proto-Indo-European (Anthony 2007). The Samara Valley represents several kinds of borders, linguistic, cultural, and ecological, and it is centrally located in the Eurasian steppes, making it a critical place to examine the development of Eurasian steppe pastoralism.
Khokhlov (translated by Anthony) further insists on the racial and ethnic divide between both populations, Abashevo to the north, and Poltavka to the south, during the formation of the Abashevo – Sintashta-Potapovka community that gave rise to Proto-Indo-Iranians:
Among all cranial series in the Volga-Ural region, the Potapovka population represents the clearest example of race mixing and probably ethnic mixing as well. The cultural advancements seen in this period might perhaps have been the result of the mixing of heterogeneous groups. Such a craniometric observation is to some extent consistent with the view of some archaeologists that the Sintashta monuments represent a combination of various cultures (principally Abashevo and Poltavka, but with other influences) and therefore do not correspond to the basic concept of an archaeological culture (Kuzmina 2003:76). Under this option, the Potapovka-Sintashta burial rite may be considered, first, a combination of traits to guarantee the afterlife of a selected part of a heterogeneous population. Second, it reflected a kind of social “caste” rather than a single population. In our view, the decisive element in shaping the ethnic structure of the Potapovka-Sintashta monuments was their extensive mobility over a fairly large geographic area. They obtained knowledge of various cultures from the populations with whom they interacted.
Interesting is also this excerpt about the predominant population in the Abashevo – Sintashta-Potapovka admixture (which supports what Chetan said recently, although this does not seemed backed by Y-DNA haplogroups found in the richest burials), coupled with the sign of incoming “Uraloid” peoples from the east, found in both Sintashta and eastern Abashevo:
The socially dominant anthropological component was Europeoid, possibly the descendants of Yamnaya. The association of craniofacial types with archaeological cultures in this period is difficult, primarily because of the small amount of published anthropological material of the cultures of steppe and forest belt (Balanbash, Vol’sko-Lbishche) and the eastern and southern steppes (Botai-Tersek). The crania associated with late MBA western Abashevo groups in the Don-Volga forest zone were different from eastern Abashevo in the Urals, where the expression of the Old Uraloid craniological complex was increased. Old Uraloid is found also on a single skull of Vol’sko-Lbishche culture (Tamar Utkul VII, Kurgan 4). Potentially related variants, including Mongoloid features, could be found among the Seima-Turbino tribes of the forest-steppe zone, who mixed with Sintashta and Abashevo. In the Sintashta Bulanova cemetery from the western Urals, some individuals were buried with implements of Seima-Turbino type (Khalyapin 2001; Khokhlov 2009; Khokhlov and Kitov 2009). Previously, similarities were noted between some individual skulls from Potapovka I and burials of the much older Botai culture in northern Kazakhstan (Khokhlov 2000a). Botai-Tersek is, in fact, a growing contender for the source of some “eastern” cranial features.
The wave of peoples associated with “eastern” features can be seen in genetics in the Sintashta outliers from Narasimhan et al. (2018), and it probably will be eventually seen in Abashevo, too. These may be related to the Seima-Turbino international network – but most likely it is directly connected to Sintashta through the starting Andronovo and Seima-Turbino horizons, by admixing of prospective groups and small-scale back-migrations.
Corded Ware – Yamna similarities?
So, if peoples of north-eastern Europe have been assumed for a long time to be Uralic speakers, what is happening with the Corded Ware = IE obsession? Is it Gimbutas’ ghost possessing old archaeologists? Probably not.
It is about certain cultural similarities evident at first sight, which have been traditionally interpreted as a sign of cultural diffusion or migration. Not dissimilar to the many Bell Beaker models available, where each archaeologist is pushing certain differences, mixing what seemed reasonable, what still might seem reasonable, and what certainly isn’t anymore after the latest ancient DNA data.
The initial models of Gimbutas, Kristiansen, or Anthony – which are known to many today – were enunciated in the infancy of archaeological studies in the regions, during and just after the fall of the USSR, and before many radiocarbon dates that we have today were published (with radiocarbon dating being still today in need of refinement), so it is only logical that gross mistakes were made.
We have similar gross mistakes related to the origins of Bell Beakers, and studying them was certainly easier than studying eastern data.
Gimbutas believed – based mainly on Kurgan-like burials – that Bell Beaker formed from a combination of Yamna settlers with the Vučedol culture, so she was not that far from the truth.
The expansion of Corded Ware from peoples of the North Pontic forest-steppe area, proposed by Gimbutas and later supported also by Kristiansen (1989) as the main Indo-European expansion – , is probably also right about the approximate origins of the culture. Only its ‘Indo-European’ nature is in question, given the differences with Khvalynsk and Yamna evolution.
Anthony only claimed that Yamna migrants settled in the Balkans and along the Danube into the Hungarian steppes. He never said that Corded Ware was a Yamna offshoot until after the first genetic papers of 2015 (read about his newest proposal). He initially claimed that only certain neighbouring Corded Ware groups “adopted” Indo-European (through cultural diffusion) because of ‘patron-client’ relationships, and was never preoccupied with the fate of Corded Ware and related cultures in the east European forest zone and Finland.
So none of them was really that far from the true picture; we might say a lot people are more way off the real picture today than the picture these three researchers helped create in the 1990s and 2000s. Genetics is just putting the last nail in the coffin of Corded Ware as a Yamna offshoot, instead of – as we believed in the 2000s – to Vučedol and Bell Beaker.
So let’s revise some of these traditional links between Corded Ware and Yamna with today’s data:
Even more than genetics – at least until we have an adequate regional and temporary sampling – , archaeological findings lead what we have to know about both cultures.
It is essential to remember that Corded Ware, starting ca. 3000/2900 BC in east-central Europe, has been proposed to be derived from Early Yamna, which appeared suddenly in the Pontic-Caspian steppes ca. 3300 BC (probably from the late Repin expansion), and expanded to the west ca. 3000.
The question at hand, therefore, is if Corded Ware can be considered an offshoot of the Late PIE community, and thus whether the CWC ethnolinguistic community – proven in genetics to be quite homogeneous – spoke a Late PIE dialect, or if – alternatively – it is derived from other neighbouring cultures of the North Pontic region.
NOTE. The interpretation of an Indo-Slavonic group represented by a previous branching off of the group is untenable with today’s data, since Indo-Slavonic – for those who support it – would itself be a branch of Graeco-Aryan, and Palaeo-Balkan languages expanded most likely with West Yamna (i.e. R1b-L23, mainly R1b-Z2103) to the south.
The convoluted alternative explanation would be that Corded Ware represents an earlier, Middle PIE branch (somehow carrying R1a??) which influences expanding Late PIE dialects; this has been recently supported by Kortlandt, although this simplistic picture also fails to explain the Uralic problem.
❔ Kurgans: The Yamna tradition was inherited from late Repin, in turn inherited from Khvalynsk-Novodanilovka proto-Kurgans. As for the CWC tradition, it is unclear if the tumuli were built as a tradition inherited from North and West Pontic cultures (in turn inherited or copied from Khvalynsk-Novodanilovka), such as late Trypillia, late Kvityana, late Dereivka, late Sredni Stog; or if they were built because of the spread of the ‘Transformation of Europe’, set in motion by the Early Yamna expansion ca. 3300-3000 BC (as found in east-central European cultures like Coţofeni, Lizevile, Șoimuș, or the Adriatic Vučedol). My guess is that it inherits an older tradition than Yamna, with an origin in east-central Europe, because of the mound-building distribution in the North Pontic area before the Yamna expansion, but we may never really know.
❌ Burial rite: Yamna features (with regional differences) single burials with body on its back, flexed upright knees, poor grave goods, common orientation east-west (heads to the west) inherited from Repin, in turn inherited from Khvalynsk-Novodanilovka. CWC tradition – partially connected to Złota and surrounding east-central European territories (in turn from the Khvalynsk-Novodanilovka expansion) – features single graves, body in fetal position, strict gender differentiation – men on the right, women on the left -, looking to the south, graves with standardized assemblages (objects representing affirmation of battle, hunting, and feasting). The burial rites clearly represent different ideologies.
❌ Corded decoration: Corded ware decoration appears in the Balkans during the 5th millennium, and represents a simple technique whereby a cord is twisted, or wrapped around a stick, and then pressed directly onto the fresh surface of a vessel leaving a characteristic decoration. It appears in many groups of the 5th and 4th millennium BC, but it was Globular Amphorae the culture which popularized the drinking vessels and their corded ornamentation. It appears thus in some regional groups of Yamna, but it becomes the standard pottery only in Corded Ware (especially with the A-horizon), which shows continuity with GAC pottery.
❌ Economy: Yamna expands from Repin (and Repin from Khvalynsk-Novodanilovka) as a nomadic or semi-nomadic purely pastoralist society (with occasional gathering of wild seeds), which naturally thrives in the grasslands of the Pontic-Caspian, lower Danube and Hungarian steppes. Corded Ware shows agropastoralism (as late Eneolithic forest-steppe and steppe groups of eastern Europe, such as late Trypillian, TRB, and GAC groups), inhabits territories north of the loess line, with heavy reliance of hunter-gathering depending on the specific region.
❌ Cattle herding: Interestingly, both west Yamna and Corded Ware show more reliance on cattle herding than other pastoralist groups, which – contrasted with the previous Eneolithic herding traditions of the Pontic-Caspian steppe, where sheep-goats predominate – make them look alike. However, the cattle-herding economy of Yamna is essential for its development from late Repin and its expansion through the steppes (over western territories practising more hunter-gathering and sheep-goat herding economy), and it does not reach equally the Volga-Ural region, whose groups keep some of the old subsistence economy (read more about the late Repin expansion). Corded Ware, on the other hand, inherits its economic strategy from east European groups like TRB, GAC, and especially late Trypillian communities, showing a predominance of cattle herding within an agropastoral community in the forest-steppe and forest zones of Volhynia, Podolia, and surrounding forest-steppe and forest regions.
❔ Horse riding: Horse riding and horse transport is proven in Yamna (and succeeding Bell Beaker and Sintashta), assumed for late Repin (essential for cattle herding in the seas of grasslands that are the steppes, without nearby water sources), quite likely during the Khvalynsk expansion (read more here), and potentially also for Samara, where the predominant horse symbolism of early Khvalynsk starts. Corded Ware – like the north Pontic forest-steppe and forest areas during the Eneolithic – , on the other hand, does not show a strong reliance on horse riding. The high mobility and short-term settlements characteristic of Corded Ware, that are often associated with horse riding by association with Yamna, may or may not be correct, but there is no need for horses to explain their herding economy or their mobility, and the north-eastern European areas – the one which survived after Bell Beaker expansion – did certainly not rely on horses as an essential part of their economy.
NOTE: I cannot think of more supposed similarities right now. If you have more ideas, please share in the comments and I will add them here.
✅ EHG: This is the clearest link between both communities. We thought it was related to the expansion of ANE-related ancestry to the west into WHG territory, but now it seems that it will be rather WHG expanding into ANE territory from the Pontic-Caspian region to the east (read more on recent Caucasus Neolithic, on , and on Caucasus HG).
NOTE. Given how much each paper changes what we know about the Palaeolithic, the origin and expansion of the (always developing) known ancestral components and specific subclades (see below) is not clear at all.
❔ CHG: This is the key link between both cultures, which will delimit their interaction in terms of time and space. CHG is intermediate between EHG and Iran N (ca. 8000 BC). The ancestry is thus linked to the Caucasus south of the steppe before the emergence of North Pontic (western) and Don-Volga-Ural (eastern) communities during the Mesolithic. The real question is: when we have more samples from the steppe and the Caucasus during the Neolithic, how many CHG groups are we going to find? Will the new specific ancestral components (say CHG1, CHG2, CHG3, etc.) found in Yamna (from Khvalynsk, in the east) and Corded Ware (probably from the North Pontic forest-steppe) be the same? My guess is, most likely not, unless they are mediated by the Khvalynsk-Novodanilovka expansion (read more on CHG in the Caucasus).
❌ WHG/EEF: This is the obvious major difference – known today – in the formation of both communities in the steppe, and shows the different contacts that both groups had at least since the Eneolithic, i.e. since the expansion of Repin with its renewed Y-DNA bottleneck, and probably since before the early Khvalynsk expansion (read more on Yamna-Corded Ware differences contrasting with Yamna-Afanasevo, Yamna-Bell Beaker, and Yamna-Sintashta similarities).
NOTE 1. Some similarities between groups can be seen depending on the sampled region; e.g. Baltic groups show more similarities with southern Pontic-Caspian steppe populations, probably due to exogamy.
NOTE 2. We have this information on the differences in “steppe ancestry” between Yamna and Corded Ware, compared to previous studies, because now we have more samples of neighbouring, roughly contemporaneous Eneolithic groups, to analyse the real admixture processes. This kind of fine scale studies is what is going to show more and more differences between Khvalynsk-Yamna and Sredni Stog-Corded Ware as more data pours in. The evolution of both communities in archaeology and in PCA (see below) is probably witness to those differences yet to be published.
❌ R1: Even though some people try very hard to think in terms of “R1” vs. (Caucasus) J or G or any other upper clade, this is plainly wrong. It is possible, given what we know now, that Q1a2-M242 expanded ANE ancestry to the west ca. 13000 BC, while R1b-P279 expanded WHG ancestry to the east with the expansion of post-Swiderian cultures, creating EHG as a WHG:ANE cline. The role of R1a-M459 is unknown, but it might be related to any of these migrations, or others (plural) along northern Eurasia (read more on the expansion of R1b-P279, on Palaeolithic Q1a2, and on R1a-M417).
NOTE. I am inclined to believe in a speculative Mesolithic-Early Neolithic community involving Eurasiatic movements accross North Eurasia, and Indo-Uralic movements in its western part, with the last intense early Uralic-PIE contacts represented by the forming west (Mariupol culture) and east (Don-Volga-Ural cultures, including Samara) communities developing side by side. Before their known Eneolithic expansions, no large-scale Y-DNA bottleneck is going to be seen in the Pontic-Caspian steppe, with different (especially R1a and R1b subclades) mixed among them, as shown in North Pontic Neolithic, Samara HG, and Khvalynsk samples.
Corded Ware and ‘steppe ancestry’
If we take a look at the evolution of Corded Ware cultures, the expansion of Bell Beakers – dominated over most previous European cultures from west to east Europe – influenced the development of the whole European Bronze Age, up to Mierzanowice and Trzciniec in the east.
The only relevant unscathed CWC-derived groups, after the expansion of Sintashta-Potapovka as the Srubna-Andronovo horizon in the Eurasian steppes, were those of the north-eastern European forest zone: between Belarus to the west, Finland to the north, the Urals to the east, and the forest-steppe region to the south. That is, precisely the region supposed to represent Uralic speakers during the Bronze Age.
This inconsistency of steppe ancestry and its relation with Uralic (and Balto-Slavic) peoples was observed shortly after the publication of the first famous 2015 papers by Paul Heggarty, of the Max-Planck Institute for Evolutionary Anthropology (read more):
Haak et al. (2015) make much of the high Yamnaya ancestry scores for (only some!) Indo-European languages. What they do not mention is that those same results also include speakers of other languages among those with the highest of all scores for Yamnaya ancestry. Only these are languages of the Uralic family, not Indo-European at all; and their Yamnaya-ancestry signals are far higher than in many branches of Indo-European in (southern) Europe. Estonian ranks very high, while speakers of the very closely related Finnish are curiously not shown, and nor are the Saami. Hungarian is relevant less directly since this language arrived only c. 900 AD, but also high.
These data imply that Uralic-speakers too would have been part of the Yamnaya > Corded Ware movement, which was thus not exclusively Indo-European in any case. And as well as the genetics, the geography, chronology and language contact evidence also all fit with a Yamnaya > Corded Ware movement including Uralic as well as Balto-Slavic.
Both papers fail to address properly the question of the Uralic languages. And this despite — or because? — the only Uralic speakers they report rank so high among modern populations with Yamnaya ancestry. Their linguistic ancestors also have a good claim to have been involved in the Corded Ware and Yamnaya cultures, and of course the other members of the Uralic family are scattered across European Russia up to the Urals.
NOTE. Although the author was trying to support the Anatolian hypothesis – proper of glottochronological studies often published from the Max Planck Institute – , the question remains equally valid: “if Proto-Indo-European expands with Corded Ware and steppe ancestry, what is happening with Uralic peoples?”
For my part, I claimed in my draft that ancestral components were not the only relevant data to take into account, and that Y-DNA haplogroups R1a and R1b (appearing separately in CWC and Yamna-Bell Beaker-Afanasevo), together with their calculated timeframes of formation – and therefore likely expansion – did not fit with the archaeological and linguistic description of the spread of Proto-Indo-European and its dialects.
In fact, it seemed that only one haplogroup (R1b-M269) was constantly and consistenly associated with the proposed routes of Late PIE dialectal expansions – like Anthony’s second (Afanasevo) and third (Lower Danube, Balkan) waves. What genetics shows fits seamlessly with Mallory’s association of the North-West Indo-European expansion with Bell Beakers (read here how archaeologists were right).
More precise inconsistencies were observed after the publication of Olalde et al. (2017) and Mathieson et al. (2017), by Volker Heyd in Kossinna’s smile (2017). Letting aside the many details enumerated (you can read a summary in my latest draft), this interesting excerpt is from the conclusion:
Simple solutions to complex problems are never the best choice, even when favoured by politicians and the media. Kossinna also offered a simple solution to a complex prehistoric problem, and failed therein. Prehistoric archaeology has been aware of this for a century, and has responded by becoming more differentiated and nuanced, working anthropologically, scientifically and across disciplines (cf. Müller 2013; Kristiansen 2014), and rejecting monocausal explanations. The two aDNA papers in Nature, powerful and promising as they are for our future understanding, also offer rather straightforward messages, heavily pulled by culture-history and the equation of people with culture. This admittedly is due partly to the restrictions of the medium that conveys them (and despite the often relevant additional detail given as supplementary information, which is unfortunately not always given full consideration).
While I have no doubt that both papers are essentially right, they do not reflect the complexity of the past. It is here that archaeology and archaeologists contributing to aDNA studies find their role; rather than simply handing over samples and advising on chronology, and instead of letting the geneticists determine the agenda and set the messages, we should teach them about complexity in past human actions and interactions. If accepted, this could be the beginning of a marriage made in heaven, with the blessing smile of Gustaf Kossinna, and no doubt Vere Gordon Childe, were they still alive, in a reconciliation of twentieth- and twenty-first-century approaches. For us as archaeologists, it could also be the starting point for the next level of a new archaeology.
The question was made painfully clear with the publication of Olalde et al. (2018) & Mathieson et al. (2018), where the real route of Yamna expansion into Europe was now clearly set through the steppes into the Carpathian basin, later expanded as Bell Beakers.
Previous research at KA-5 was carried out by A. V. Epimakhov in 1994–1995 and 2002–2003 and resulted in the excavation of three Sintashta culture barrows (kurgans) that produced 35 burial pits and a reported 100 skeletons (Epimakhov, 2002, 2005; Epimakhov et al., 2005; Razhev and Epimakhov, 2004). Seven AMS radiocarbon dates on human remains from the cemetery yielded a date range of 2040–1730 cal. BC (2 sigma), which placed the cemetery within the Sintashta phase of the regional Bronze Age (Hanks et al., 2007). Twelve recently obtained AMS radiocarbon dates, taken from short-lived wood and charcoal species recovered from the Kamennyi Ambar settlement, have provided a date range of 2050–1760 cal. BC (2 sigma). Importantly, these dates confirm the close chronological relationship between the settlement and cemetery for the Middle Bronze Age phase and discount the possibility of a freshwater reservoir effect influencing the earlier dating of the human remains from the Kamennyi Ambar 5 cemetery (Epimakhov and Krause, 2013).
Sintashta cemeteries frequently yield fewer than six barrow complexes and the number of skeletons recovered represents a fraction of the total population that would have inhabited the settlements (Judd et al., 2018; Johnson and Hanks, 2012). Scholars have suggested that only members of higher status were afforded interment in these cemeteries and that principles of social organization structured placement of individuals within central or peripheral grave pits (Fig. 2) (Koryakova and Epimakhov, 2007: 75–81). In comparison with other Sintashta cemeteries that have been excavated, KA-5 provides one of the largest skeletal inventories currently available for study.
The KA-5 (MBA), Bestamak (MBA) and Lisakovsk (LBA) datasets exhibited a wide range of δ13C and δ15N values for both humans and herbivores (Figs. 5 and 6 & Table 8). This diversity in isotopic signals may be evident for a variety of reasons. For example, the range of values may be associated with a broad spectrum of C3 and C4 plant diversity in the ancient site biome or herbivore grazing patterns that included more diverse environmental niche areas in the microregion around the sampled sites. Herders also may have chosen to graze animals in niche areas due to recognized territorial boundaries between settlements and concomitant patterns of mobility. Importantly, data from Bolshekaragansky represents humans with lower δ15N values that are more closely associated with δ15N values of the sampled domestic herbivores (Fig. 6). When the archaeological evidence from associated settlement sites is considered, Bolshekaragansky, Bestamak, Lisakovsk and KA-5 have been assumed to represent populations that shared similar forms of pastoral subsistence economies with significant dietary reliance upon domesticated herbivore meat and milk. Human diets have δ13C values closely related to those of local herbivores in terms of the slope of the trendline and range of values (Fig. 6). Comparatively, the cemetery of Bolshekaragansky (associated with the Arkaim settlement) reflects individuals with trend lines closer to those of cattle and caprines and may indicate a stronger reliance on subsistence products from these species with less use of wild riverine and terrestrial resources. The site of Čiča is significantly different with elevated human δ15N isotopic values and depleted δ13C values indicative of a subsistence regime more closely associated with the consumption of freshwater resources, such as fish. The stable isotopic data in this instance is strongly supported by zooarchaeological evidence recovered from the Čiča settlement and also is indicative of significant diachronic changes from the LBA phases through the Iron Age (Fig. 6).
(…) The isotopic results from KA-5, and recent botanical and archaeological studies from the Kamennyi Ambar settlement, have not produced any evidence for the production or use of domesticated cereals. While this does not definitively answer the question as to whether Sintashta populations engaged in agriculture and/or utilized agricultural products, it does call into serious question the ubiquity of such practices across the region and correlates well with recent archaeological, bioarchaeological, and isotopic studies of human and animal remains from the Southwestern Urals region and Samara Basin (Anthony et al., 2016; Schulting and Richards, 2016). The results substantiate a broader spectrum subsistence diet that in addition to the use of domesticated animal products also incorporated wild flora, wild fauna and fish species. These findings further demonstrate the need to draw on multiple methods and datasets for the reconstruction of late prehistoric subsistence economies in the Eurasian steppes. When possible, this should include datasets from both settlements and associated cemeteries.
Variability in subsistence practices in the central steppes region has been highlighted by other scholars and appears to be strongly correlated with local environmental conditions and adaptations. More comprehensive isotopic studies of human, animal and fish remains are of fundamental importance to achieve more robust and empirically substantiated reconstructions of local biomes and to aid the refinement of regional and micro-regional economic subsistence models. This will allow for a fuller understanding of key diachronic shifts within dietary trends and highlight regional variation of such practices. Ultimately, this will more effectively index the diverse social and environmental variables that contributed to late prehistoric lifeways and the economic strategies employed by these early steppe communities.
Social organization of Sintashta-Petrovka
Interesting to remember now the recent article by Chechushkov et al. (2018) about the social stratificaton in Sintashta-Petrovka, and how it must have caused the long-lasting, peaceful admixture process that led to the known almost full replacement of R1b-L23 (mostly R1b-Z2103) by R1a-Z645 (mostly R1a-Z93) subclades in the North Caspian steppe, coinciding with the formation of the Proto-Indo-Iranian community and language (read my thoughts on this after Damgaard et al. 2018).
Here is another relevant excerpt from Chechushkov et al. (2018), translated from Russian:
The analysis suggests that the Sintashta-Petrovka societies had a certain degree of social stratification, expressed both in selective funeral rituals and in the significant difference in lifestyle between the elite and the immediate producers of the product. The data obtained during the field study suggest that the elite lived within the fortifications, while a part of the population was outside their borders, on seasonal sites, and also in stationary non-fortified settlements. Probably, traces of winter settlements can be found near the walls, while the search for summer ones is a task of a separate study. From our point of view, the elite of the early complex societies of the Bronze Age of the Eurasian steppe originated as a response to environmental challenges that created risks for cattle farming. The need to adapt the team to the harsh and changing climatic conditions created a precedent in which the settled collectives of pastoralists – hunter-gatherers could afford the content and magnificent posthumous celebration of people and their families who were not engaged in the production or extraction of an immediate product. In turn, representatives of this social group directed their efforts to the adoption of socially significant decisions, the organization of collective labor in the construction of settlement-shelters and risked their lives, acting as military leaders and fighters.
Thus, in Bronze Age steppe societies, the formation, development and decline of social complexity are directly related to the intensity of pastoralism and the development of new territories, where collectives had to survive in part a new ecological niche. At the same time, some members of the collective took upon themselves the organization of the collective’s life, receiving in return a privileged status. As soon as the conditions of the environment and management changed, the need for such functions was virtually eliminated, as a result of which the privileged members of society dissolved into the general mass, having lost their lifetime status and the right to be allocated posthumously.
Regarding the special position of the Chicha-1 samples in the change of diet and economy during the Iron Age, it is by now well known that haplogroup N must have arrived quite late to North-East Europe, and possibly not linked with the expansion of Siberian ancestry – or linked only with some waves of Siberian ancestry in the region, but not all of them. See Lamnidis et al. (2018) for more on this.
Also, the high prevalence of haplogroup N among Fennic and Siberian (Samoyedic) peoples is not related: while the latter reflects probably the native (Palaeo-Siberian) population that acquired their Uralic branch during the MLBA expansions associated with Corded Ware groups, the former points to the expansion of Fennic peoples into Saamic territory (i.e. after the Fenno-Saamic split) as the most likely period of expansion of N1c1-L392 subclades (see known recent bottlenecks among Finns, and on Proto-Finnic dialectalization).
Probably related to these late incomers are the ancient DNA samples from the Sargat culture during the Iron Age, which show the arrival of N subclades in the region, replacing most – but not all – R1a lineages (see Pilipenko et al. (2017)). Regarding the site of Chicha-1, the following are relevant excerpts about the cultural situation that could have allowed for such stepped, diachronic admixture events in Northern Eurasia, from the paper Stages in the settlement history of Chicha-1: The Results of ceramic analysis, by Molodin et al. (2008):
The stratigraphic data allows us to make the following inference: originally, the settlement was inhabited by people bearing the Late Irmen culture. Later, the people of the Baraba trend of the Suzgun culture arrived at the site (Molodin, Chemyakina, 1984: 40–62). The Baraba-Suzgun pottery demonstrates features similar to what has been reported from the sites of the transitional Bronze to Iron Age culture in the pre-taiga and taiga zones in the Irtysh basin (Potemkina, Korochkova, Stefanov, 1995; Polevodov, 2003). The major morphological types are slightly and well-profiled pots with a short throat. (…)
During the following stage of development of the site, the Chicha population increased with people who practiced cultures others than those noted in earlier collections. The ceramic materials from layer 5 provide data on possible relationships. In addition to migrants from northwestern regions practicing the Suzgun culture, there were people bearing the Krasnoozerka culture. Available data also suggests that people from the northern taiga region with the Atlym culture visited the site.
However, people from the west and southwest represent the greatest migration to the region under study. In all likelihood they moved from the northern forest-steppe zone of modern Kazakhstan and practiced the Berlik culture. The spatial distribution analysis of the Chicha-1 site suggests that the Berlik population was rather large. The Berlik people formed a single settlement with the indigenous Late Irmen people and apparently waged certain common economic activities, but preserved their own ethnic and cultural specificity (Molodin, Parzinger, 2006: 49–55). Judging by the data on the chronological sequence of deposited artifacts, migration took place roughly synchronously, hence Chicha-1 became a real cultural and economic center.
(…) In sum, the noted distribution of ceramics over the culture-bearing horizons suggests that beginning with layer 5, traditions of ceramic manufacture described above were practiced, hence the relevant population inhabited the site. Apparently, there were two predominant traditions: the local Late Irmen cultural tradition and the Berlik tradition, which was brought by the immigrants. The Late Irmen people mostly populated the citadel, while the Berlik immigrants inhabited the areas to the east and the north of the citadel.
The stratigraphic data also suggest that the Early Sargat ceramics emerged at the site likely as a part of the Late Irmen tradition (…) Early Sargat ceramics is apparently linked with the Late Irmen tradition. Artifacts associated with the Sargat culture proper have been found in several areas of Chicha-1 (e.g., in excavation area 16). However, the Sargat people appeared at the site after it had been abandoned by its previous inhabitants, and had eventually become completely desolated. This happened no earlier than the 6th cent. BC, possibly in the 5th cent. BC (in fact, the radiocarbon dates for that horizon are close to the turn of the Christian era).
Hepatitis B virus (HBV) is a major cause of human hepatitis. There is considerable uncertainty about the timescale of its evolution and its association with humans. Here we present 12 full or partial ancient HBV genomes that are between approximately 0.8 and 4.5 thousand years old. The ancient sequences group either within or in a sister relationship with extant human or other ape HBV clades. Generally, the genome properties follow those of modern HBV. The root of the HBV tree is projected to between 8.6 and 20.9 thousand years ago, and we estimate a substitution rate of 8.04 × 10−6–1.51 × 10−5 nucleotide substitutions per site per year. In several cases, the geographical locations of the ancient genotypes do not match present-day distributions. Genotypes that today are typical of Africa and Asia, and a subgenotype from India, are shown to have an early Eurasian presence. The geographical and temporal patterns that we observe in ancient and modern HBV genotypes are compatible with well-documented human migrations during the Bronze and Iron Ages1,2. We provide evidence for the creation of HBV genotype A via recombination, and for a long-term association of modern HBV genotypes with humans, including the discovery of a human genotype that is now extinct. These data expose a complexity of HBV evolution that is not evident when considering modern sequences alone.
We find genotype A in south-western Russia by 4.3 ka (in samples RISE386 and RISE387) in individuals belonging to the Sintashta culture, and in a Hungarian sample (DA195) from the Scythian culture. The western Scythians are related to the Bronze Age cultures of western steppe populations2 and their shared ancestry suggests that the modern genotype A may descend from this ancient Eurasian diversity and not, as previously hypothesized, from African ancestors29,30. This is also consistent with the phylogeny (Fig. 2), as well as the fact that the three oldest ancient genotype A sequences (HBV-DA195, HBV-RISE386 and HBV-RISE387) lack the six-nucleotide insertion found in the youngest (HBV-DA119) and in all modern genotype A sequences. The ancestors of subgenotypes A1 and A3 could have been carried into Africa subsequently, via migration from western Eurasia31.
The ancient HBV genotype D sequences were all found in Central Asia. HBV-DA27, found in Kazakhstan and dated to 1.6 ka, falls basal to the modern subgenotype D5 sequences that today are found in the Paharia tribe from eastern India32. DA27 and the Paharia people in India are linked by their East Asian ancestry2,33.
(…)Despite the age of the samples and the imperfect diagnostic test, our dataset contained a high proportion of HBV-positive individuals. The actual ancient prevalence during the Bronze Age and thereafter might have been higher, reaching or exceeding the prevalence typically found in contemporary indigenous populations5. This clearly establishes the potential of HBV as powerful proxy tool for research into human spread and interactions. The data from ancient genomes reveal aspects of complexity in HBV evolution that are not apparent when only modern sequences are considered. They show the existence of ancient HBV genotypes in locations incongruent with their present-day distribution, contradicting previously suggested geographical or temporal origins of genotypes or sub-genotypes; evidence for the creation of genotype A via recombination and the emergence of the genotype outside Africa; at least one now-extinct human genotype; ancient genotype-level localized diversity; and demonstrate that the viral substitution rate obtained from modern heterochronously sampled sequences is probably misleading. Together, these findings suggest that the difficulty in formulating a coherent theory for the origin and spread of HBV may be due to genetic evidence of an earlier evolutionary scenario being overwritten by relatively recent alterations, as has previously been suggested in the context of recombination24
Our findings fit well with current insights from the historical linguistics of this region (Supplementary Information section 2). The steppes were probably largely Iranian-speaking in the first and second millennia bc. This is supported by the split of the Indo-Iranian linguistic branch into Iranian and Indian33, the distribution of the Iranian languages, and the preservation of Old Iranian loanwords in Tocharian34. The wide distribution of the Turkic languages from Northwest China, Mongolia and Siberia in the east to Turkey and Bulgaria in the west implies large-scale migrations out of the homeland in Mongolia since about 2,000 years ago35. The diversification within the Turkic languages suggests that several waves of migration occurred36 and, on the basis of the effect of local languages, gradual assimilation to local populations had previously been assumed37. The East Asian migration starting with the Xiongnu accords well with the hypothesis that early Turkic was the major language of Xiongnu groups38. Further migrations of East Asians westwards find a good linguistic correlate in the influence of Mongolian on Turkic and Iranian in the last millennium39. As such, the genomic history of the Eurasian steppes is the story of a gradual transition from Bronze Age pastoralists of West Eurasian ancestry towards mounted warriors of increased East Asian ancestry—a process that continued well into historical times.
This paper will need a careful reading – better in combination with Narasimhan et al. (2018), when their tables are corrected – , to assess the actual ‘Iranian’ nature of the peoples studied. Their wide and long-term dominion over the steppe could also potentially explain some early samples from Hajji Firuz with steppe ancestry.
For the moment, at first sight, it seems that, in terms of Y-DNA lineages:
R1b-Z93 (especially Z2124 subclades) dominate the steppes in the studied periods.
R1b-P312 is found in Hallstatt ca. 810 BC, which is compatible with its role in the Celtic expansion.
R1b-U106 is found in a West Germanic chieftain in Poprad (Slovakia) ca. 400 AD, during the Migration Period, hence supporting once again the expansion of Germanic tribes especially with R1b-U106 lineages.
A sample of haplogroup R1a-Z282 (Z92) dated ca. 1300 AD in the Golden Horde is probably not quite revealing, not even for the East Slavic expansion.
Also, interestingly, some R1b(xM269) lineages seem to be associated with Turkic expansions from the eastern steppe dated around 500 AD, which probably points to a wide Eurasian distribution of early R1b subclades in the Mesolithic.
NOTE. I have referenced not just the reported subclades from the paper, but also (and mainly) further Y-SNP calls studied by Open Genomes. See the spreadsheet here.
Interesting also to read in the supplementary materials the following, by Michaël Peyrot (emphasis mine):
1. Early Indo-Europeans on the steppe: Tocharians and Indo-Iranians
The Indo-European language family is spread over Eurasia and comprises such branches and languages as Greek, Latin, Germanic, Celtic, Sanskrit etc. The branches relevant for the Eurasian steppe are Indo-Aryan (= Indian) and Iranian, which together form the Indo-Iranian branch, and the extinct Tocharian branch. All Indo-European languages derive from a postulated protolanguage termed Proto-Indo-European. This language must have been spoken ca 4500–3500 BCE in the steppe of Eastern Europe21. The Tocharian languages were spoken in the Tarim Basin in present-day Northwest China, as shown by manuscripts from ca 500–1000 CE. The Indo-Aryan branch consists of Sanskrit and several languages of the Indian subcontinent, including Hindi. The Iranian branch is spread today from Kurdish in the west, through a.o. Persian and Pashto, to minority languages in western China, but was in the 2nd and 1st millennia BCE widespread also on the Eurasian steppe. Since despite their location Tocharian and Indo-Iranian show no closer relationship within Indo-European, the early Tocharians may have moved east before the Indo-Iranians. They are probably to be identified with the Afanasievo Culture of South Siberia (ca 2900 – 2500 BCE) and have possibly entered the Tarim Basin ca 2000 BCE103.
The Indo-Iranian branch is an extension of the Indo-European Yamnaya Culture (ca 3000–2400 BCE) towards the east. The rise of the Indo-Iranian language, of which no direct records exist, must be connected with the Abashevo / Sintashta Culture (ca 2100 – 1800 BCE) in the southern Urals and the subsequent rise and spread of Andronovo-related Culture (1700 – 1500 BCE). The most important linguistic evidence of the Indo-Iranian phase is formed by borrowings into Finno-Ugric languages104–106. Kuz’mina (2001) identifies the Finno-Ugrians with the Andronoid cultures in the pre-taiga zone east of the Urals107. Since some of the oldest words borrowed into Finno-Ugric are only found in Indo-Aryan, Indo-Aryan and Iranian apparently had already begun to diverge by the time of these contacts, and when both groups moved east, the Iranians followed the Indo-Aryans108. Being pushed by the expanding Iranians, the Indo-Aryans then moved south, one group surfacing in equestrian terminology of the Anatolian Mitanni kingdom, and the main group entering the Indian subcontinent from the northwest.
2. Andronovo Culture: Early Steppe Iranian
Initially, the Andronovo Culture may have encompassed speakers of Iranian as well as Indo-Aryan, but its large expansion over the Eurasian steppe is most probably to be interpreted as the spread of Iranians. Unfortunately, there is no direct linguistic evidence to prove to what extent the steppe was indeed Iranian speaking in the 2nd millennium BCE. An important piece of indirect evidence is formed by an archaic stratum of Iranian loanwords in Tocharian34,109. Since Tocharian was spoken beyond the eastern end of the steppe, this suggests that speakers of Iranian spread at least that far. In the west of the Tarim Basin the Iranian languages Khotanese and Tumshuqese were spoken. However, the Tocharian B word etswe ‘mule’, borrowed from Iranian *atswa- ‘horse’, cannot derive from these languages, since Khotanese has aśśa- ‘horse’ with śś instead of tsw. The archaic Iranian stratum in Tocharian is therefore rather to be connected with the presence of Andronovo people to the north and possibly to the east of the Tarim Basin from the middle of the 2nd millennium BCE onwards110.
Since Kristiansen and Allentoft sign the paper (and Peyrot is a colleague of Kroonen), it seems that they needed to expressly respond to the growing criticism about their recent Indo-European – Corded Ware Theory. That’s nice.
IECWT-proponents are apparently not prepared to let it go quietly, and instead of challenging the traditional Neolithic Uralic homeland in Eastern Europe with a recent paper on the subject, they selected an older one which partially fit, from Kuz’mina (2001), now shifting the Uralic homeland to the east of the Urals (when Kuz’mina asserts it was south of the Urals).
Different authors comment later in this same paper about East Uralic languages spreading quite late, so even their text is not consistent among collaborating authors.
Also interesting is the need to resort to the questionable argument of early Indo-Aryan loans – which may have evidently been Indo-Iranian instead, since there is no way to prove a difference between both stages in early Uralic borrowings from ca. 4,500-3,500 years ago…
NOTE. I don’t mind repeating it again: Uralic is one possibility (the most likely one) for the substrate language that Corded Ware migrants spread, but it could have been e.g. another Middle PIE dialect, similar to Proto-Anatolian (after the expansion of Suvorovo-Novodanilovka chiefs). I expressly stated this in the Corded Ware substrate hypothesis, since the first edition. What was clear since 2015, and should be clear to anyone now, is that Corded Ware did not spread Late PIE languages to Europe, and that some east CWC groups only spread languages to Asia after admixing with East Yamna. If they did not spread Uralic, then it was a language or group of languages phonetically similar, which has not survived to this day.
At least we won’t have the Yamna -> Corded Ware -> BBC nonsense anymore, and they expressly stated that LPIE is to be associated with Yamna, and in particular the “Indo-Iranian branch is an extension of the Indo-European Yamnaya Culture (ca 3000–2400 BCE) to the East” (which will evidently show an East Yamna / Poltavka society of R1b-L23 subclades), so that earlier Eneolithic cultures have to be excluded, and Balto-Slavic identification with East Europe is also out of the way.