“Steppe ancestry” step by step: Khvalynsk, Sredni Stog, Repin, Yamna, Corded Ware

dzudzuana_pca-large

Wang et al. (2018) is obviously a game changer in many aspects. I have already written about the upcoming Yamna Hungary samples, about the new Steppe_Eneolithic and Caucasus Eneolithic keystones, and about the upcoming Greece Neolithic samples with steppe ancestry.

An interesting aspect of the paper, hidden among so many relevant details, is a clearer picture of how the so-called Yamnaya or steppe ancestry evolved from Samara hunter-gatherers to Yamna nomadic pastoralists, and how this ancestry appeared among Proto-Corded Ware populations.

anatolia-neolithic-steppe-eneolithic
Image modified from Wang et al. (2018). Marked in orange: equivalent Steppe_Maykop ADMIXTURE; in red: approximate limit of Anatolia_Neolithic ancestry found in Yamna populations; in blue: Corded Ware-related groups. “Modelling results for the Steppe and Caucasus cluster. Admixture proportions based on (temporally and geographically) distal and proximal models, showing additional Anatolian farmer-related ancestry in Steppe groups as well as additional gene flow from the south in some of the Steppe groups as well as the Caucasus groups.”

Please note: arrows of “ancestry movement” in the following PCAs do not necessarily represent physical population movements, or even ethnolinguistic change. To avoid misinterpretations, I have depicted arrows with Y-DNA haplogroup migrations to represent the most likely true ethnolinguistic movements. Admixture graphics shown are from Wang et al. (2018), and also (the K12) from Mathieson et al. (2018).

1. Samara to Early Khvalynsk

The so-called steppe ancestry was born during the Khvalynsk expansion through the steppes, probably through exogamy of expanding elite clans (eventually all R1b-M269 lineages) originally of Samara_HG ancestry. The nearest group to the ANE-like ghost population with which Samara hunter-gatherers admixed is represented by the Steppe_Eneolithic / Steppe_Maykop cluster (from the Northern Caucasus Piedmont).

Steppe_Eneolithic samples, of R1b1 lineages, are probably expanded Khvalynsk peoples, showing thus a proximate ancestry of an Early Eneolithic ghost population of the Northern Caucasus. Steppe_Maykop samples represent a later replacement of this Steppe_Eneolithic population – and/or a similar population with further contribution of ANE-like ancestry – in the area some 1,000 years later.

PCA-caucasus-steppe-samara

This is what Steppe_Maykop looks like, different from Steppe_Eneolithic:

steppe-maykop-admixture

NOTE. This admixture shows how different Steppe_Maykop is from Steppe_Eneolithic, but in the different supervised ADMIXTURE graphics below Maykop_Eneolithic is roughly equivalent to Eneolithic_Steppe (see orange arrow in ADMIXTURE graphic above). This is useful for a simplified analysis, but actual differences between Khvalynsk, Sredni Stog, Afanasevo, Yamna and Corded Ware are probably underestimated in the analyses below, and will become clearer in the future when more ancestral hunter-gatherer populations are added to the analysis.

2. Early Khvalynsk expansion

We have direct data of Khvalynsk-Novodanilovka-like populations thanks to Khvalynsk and Steppe_Eneolithic samples (although I’ve used the latter above to represent the ghost Caucasus population with which Samara_HG admixed).

We also have indirect data. First, there is the PCA with outliers:

PCA-khvalynsk-steppe

Second, we have data from north Pontic Ukraine_Eneolithic samples (see next section).

Third, there is the continuity of late Repin / Afanasevo with Steppe_Eneolithic (see below).

3. Proto-Corded Ware expansion

It is unclear if R1a-M459 subclades were continuously in the steppe and resurged after the Khvalynsk expansion, or (the most likely option) they came from the forested region of the Upper Dnieper area, possibly from previous expansions there with hunter-gatherer pottery.

Supporting the latter is the millennia-long continuity of R1b-V88 and I2a2 subclades in the north Pontic Mesolithic, Neolithic, and Early Eneolithic Sredni Stog culture, until ca. 4500 BC (and even later, during the second half).

Only at the end of the Early Eneolithic with the disappearance of Novodanilovka (and beginning of the steppe ‘hiatus’ of Rassamakin) is R1a to be found in Ukraine again (after disappearing from the record some 2,000 years earlier), related to complex population movements in the north Pontic area.

NOTE. In the PCA, a tentative position of Novodanilovka closer to Anatolia_Neolithic / Dzudzuana ancestry is selected, based on the apparent cline formed by Ukraine_Eneolithic samples, and on the position and ancestry of Sredni Stog, Yamna, and Corded Ware later. A good alternative would be to place Novodanilovka still closer to the Balkan outliers (i.e. Suvorovo), and a source closer to EHG as the ancestry driven by the migration of R1a-M417.

PCA-sredni-stog-steppe

The first sample with steppe ancestry appears only after 4250 BC in the forest-steppe, centuries after the samples with steppe ancestry from the Northern Caucasus and the Balkans, which points to exogamy of expanding R1a-M417 lineages with the remnants of the Novodanilovka population.

steppe-ancestry-admixture-sredni-stog

4. Repin / Early Yamna expansion

We don’t have direct data on early Repin settlers. But we do have a very close representative: Afanasevo, a population we know comes directly from the Repin/late Khvalynsk expansion ca. 3500/3300 BC (just before the emergence of Early Yamna), and which shows fully Steppe_Eneolithic-like ancestry.

afanasevo-admixture

Compared to this eastern Repin expansion that gave Afanasevo, the late Repin expansion to the west ca. 3300 BC that gave rise to the Yamna culture was one of colonization, evidenced by the admixture with north Pontic (Sredni Stog-like) populations, no doubt through exogamy:

PCA-repin-yamna

This admixture is also found (in lesser proportion) in east Yamna groups, which supports the high mobility and exogamy practices among western and eastern Yamna clans, not only with locals:

yamnaya-admixture

5. Corded Ware

Corded Ware represents a quite homogeneous expansion of a late Sredni Stog population, compatible with the traditional location of Proto-Corded Ware peoples in the steppe-forest/forest zone of the Dnieper-Dniester region.

PCA-latvia-ln-steppe

We don’t have a comparison with Ukraine_Eneolithic or Corded Ware samples in Wang et al. (2018), but we do have proximate sources for Abashevo, when compared to the Poltavka population (with which it admixed in the Volga-Ural steppes): Sintashta, Potapovka, Srubna (with further Abashevo contribution), and Andronovo:

sintashta-poltavka-andronovo-admixture

The two CWC outliers from the Baltic show what I thought was an admixture with Yamna. However, given the previous mixture of Eneolithic_Steppe in north Pontic steppe-forest populations, this elevated “steppe ancestry” found in Baltic_LN (similar to west Yamna) seems rather an admixture of Baltic sub-Neolithic peoples with a north Pontic Eneolithic_Steppe-like population. Late Repin settlers also admixed with a similar population during their colonization of the north Pontic area, hence the Baltic_LN – west Yamna similarities.

NOTE. A direct admixture with west Yamna populations through exogamy by the ancestors of this Baltic population cannot be ruled out yet (without direct access to more samples), though, because of the contacts of Corded Ware with west Yamna settlers in the forest-steppe regions.

steppe-ancestry-admixture-latvia

A similar case is found in the Yamna outlier from Mednikarovo south of the Danube. It would be absurd to think that Yamna from the Balkans comes from Corded Ware (or vice versa), just because the former is closer in the PCA to the latter than other Yamna samples. The same error is also found e.g. in the Corded Ware → Bell Beaker theory, because of their proximity in the PCA and their shared “steppe ancestry”. All those theories have already been proven wrong.

NOTE. A similar fallacy is found in potential Sintashta→Mycenaean connections, where we should distinguish statistically that result from an East/West Yamna + Balkans_BA admixture. In fact, genetic links of Mycenaeans with west Yamna settlers prove this (there are some related analyses in Anthrogenica, but the site is down at this moment). To try to relate these two populations (separated more than 1,000 years before Sintashta) is like comparing ancient populations to modern ones, without the intermediate samples to trace the real anthropological trail of what is found… Pure numbers and wishful thinking.

Conclusion

Yamna and Corded Ware show a similar “steppe ancestry” due to convergence. I have said so many times. This was clear long ago, just by looking at the Y-chromosome bottlenecks that differentiate them – and Tomenable noticed this difference in ADMIXTURE from the supplementary materials in Mathieson et al. (2017), well before Wang et al. (2018).

This different stock stems from (1) completely different ancestral populations + (2) different, long-lasting Y-chromosome bottlenecks. Their similarities come from the two neighbouring cultures admixing with similar populations.

If all this does not mean anything, and each lab was going to support some pre-selected archaeological theories from the 1960s or the 1980s, coupled with outdated linguistic models no matter what – Anthony’s model + Ringe’s glottochronological tree of the early 2000s in the Reich Lab; and worse, Kristiansen’s CWC-IE + Germano-Slavonic models of the 1940s in the Copenhagen group –, I have to repeat my question again:

What’s (so much published) ancient DNA useful for, exactly?


Waves of Palaeolithic ANE ancestry driven by P subclades; new CWC-like Finnish Iron Age

New preprint The population history of northeastern Siberia since the Pleistocene, by Sikora et al. bioRxiv (2018).

Interesting excerpts (emphasis mine; most internal references removed):

ANE ancestry

The earliest, most secure archaeological evidence of human occupation of the region comes from the artefact-rich, high-latitude (~70° N) Yana RHS site dated to ~31.6 kya (…)

The Yana RHS human remains represent the earliest direct evidence of human presence in northeastern Siberia, a population we refer to as “Ancient North Siberians” (ANS). Both Yana RHS individuals were unrelated males, and belong to mitochondrial haplogroup U, predominant among ancient West Eurasian hunter-gatherers, and to Y chromosome haplogroup P1, ancestral to haplogroups Q and R, which are widespread among present-day Eurasians and Native Americans.

Symmetry tests using f4 statistics reject tree-like clade relationships with both Early West Eurasians (EWE; Sunghir) and Early East Asians (EEA; Tianyuan); however, Yana is genetically closer to EWE, despite its geographic location in northeastern Siberia

Using admixture graphs (qpGraph) and outgroup-based estimation of mixture proportions (qpAdm), we find that Yana can be modelled as EWE with ~25% contribution from EEA

Among all ancient individuals, Yana shares the most genetic drift with Mal’ta, and f4 statistics show that Mal’ta shares more alleles with Yana than with EWE (e.g. f4(Mbuti,Mal’ta;Sunghir,Yana) = 0.0019, Z = 3.99). Mal’ta and Yana also exhibit a similar pattern of genetic affinities to both EWE and EEA, consistent with previous studies. The ANE lineage can thus be considered a descendant of the ANS lineage, demonstrating that by 31.6 kya early representatives of this lineage were widespread across northern Eurasia, including far northeastern Siberia.
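
As a rough guide to reading the f4 statistic quoted above: f4(A,B;C,D) is conventionally defined as the average over SNPs of (a - b)(c - d), where a, b, c and d are allele frequencies in the four populations, and Z is the estimate divided by its standard error (usually obtained by block jackknife). The sketch below uses invented toy frequencies and hypothetical function names. It is not the pipeline used by Sikora et al. (2018), only an illustration of the definition.

```python
def f4(freqs_a, freqs_b, freqs_c, freqs_d):
    """f4(A, B; C, D): mean over SNPs of (a - b) * (c - d)."""
    products = [
        (a - b) * (c - d)
        for a, b, c, d in zip(freqs_a, freqs_b, freqs_c, freqs_d)
    ]
    return sum(products) / len(products)


def implied_standard_error(estimate, z_score):
    """Standard error implied by a reported estimate and Z (Z = estimate / SE)."""
    return estimate / z_score


# Toy allele frequencies at four SNPs (invented for illustration only).
mbuti = [0.10, 0.80, 0.50, 0.30]
malta = [0.20, 0.60, 0.70, 0.40]
sunghir = [0.30, 0.70, 0.40, 0.50]
yana = [0.35, 0.60, 0.55, 0.55]

# A positive value means that, on average, Mal'ta differs from Mbuti in the
# same direction as Yana differs from Sunghir, i.e. Mal'ta shares more
# alleles with Yana than with Sunghir.
toy_f4 = f4(mbuti, malta, sunghir, yana)

# Values reported in the excerpt: f4 = 0.0019, Z = 3.99.
reported_se = implied_standard_error(0.0019, 3.99)

print(f"toy f4 = {toy_f4:.4f}")
print(f"implied SE of the reported statistic = {reported_se:.6f}")
```

A |Z| above about 3 is the usual threshold for calling such a statistic significant, which is why the reported Z = 3.99 is read as support for the extra Mal’ta–Yana affinity.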

siberian-samples-haplogroup

Ancient Palaeosiberian

(…) the 9.8 kya Kolyma1 individual, representing a group we term “Ancient Paleosiberians” (AP). Our results indicate that AP are derived from a first major genetic shift observed in the region. Principal component analysis (PCA), outgroup f3-statistics and mtDNA and Y chromosome haplogroups (G1b and Q1a1a, respectively) demonstrate a close affinity between AP and present-day Koryaks, Itelmen and Chukchis, as well as with Native Americans.

For both AP and Native Americans, ANS ancestry appears more closely related to Mal’ta than Yana, therefore rejecting a direct contribution of Yana to later AP or Native American groups.

Lake Baikal Neolithic – Bronze Age

(…) the newly reported genomes from Ust’Belaya and recently published neighbouring Neolithic and Bronze Age sites show a succession of three distinct genetic ancestries over a ~6 ky time span. The earliest individuals show predominantly East Asian ancestry, closely related to the ancient individuals from DGC. In the early Bronze Age (BA), we observe a resurgence of AP ancestry (up to ~50% ancestry fraction), as well as influence of West Eurasian Steppe ANE ancestry represented by the early BA individuals from Afanasievo in the Altai region (~10%). This is consistent with previous reports of gene flow from an unknown ANE-related source into Lake Baikal hunter-gatherers.

Our results suggest a southward expansion of AP as a possible source, which is also consistent with the replacement of Y chromosome lineages observed at Lake Baikal, from predominantly haplogroup N in the Neolithic to haplogroup Q in the BA. Finally, the most recent individual from Ust’Belaya, dated to ~600 years ago, falls along the Neosiberian cline, similar to the ~760 year-old ‘Young Yana’ individual from northeastern Siberia, demonstrating the widespread distribution of Neosiberian ancestry in the most recent epoch.

finnish_ia_palaeosiberian
Genetic structure of ancient northeast Siberians. PCA of ancient individuals projected onto a set of modern Eurasian and American individuals. Abbreviations in group labels: UP – Upper Palaeolithic; LP – Late Palaeolithic; M – Mesolithic; EN – Early Neolithic; MN – Middle Neolithic; LN – Late Neolithic; EBA – Early Bronze Age; LBA – Late Bronze Age; IA – Iron Age; PE – Paleoeskimo; MED – Medieval

Finland Saami

At the western edge of northern Eurasia, genetic and strontium isotope data from ancient individuals at the Levänluhta site documents the presence of Saami ancestry in Southern Finland in the Late Holocene 1.5 kya. This ancestry component is currently limited to the northern fringes of the region, mirroring the pattern observed for AP ancestry in northeastern Siberia. However, while the ancient Saami individuals harbour East Asian ancestry, we find that this is better modelled by DGC rather than AP, suggesting that AP influence was likely restricted to the eastern side of the Urals. Comparison of ancient Finns and Saami with their present-day counterparts reveals additional gene flow over the past 1.6 kya, with evidence for West Eurasian admixture into modern Saami. The ancient Finn from Levänluhta shows lower Siberian ancestry than modern Finns.

EDIT (27 OCT 2018): By comparing the three, I see these are samples published already (at least two) in Lamnidis et al. (2018), but here with added (1) specific radiocarbon dates, (2) comparison with Neosiberian populations and (3) strontium isotope analyses.

Finnish_IA (ca. 350 AD) is probably a Saami-speaking individual, just like the Saami_IA with newly reported radiocarbon dates from Levänluhta ca. 400-600 AD (since Fennic peoples were then likely around the Gulf of Finland).

The conflicting strontium isotope data on marine dietary resources on certain samples from the supplementary material hint at a possible external origin of the diet of some of the previously reported (and possibly one newly reported) Saami Iron Age individuals, from some 25-30 km to the northwest through the river up to hundreds of km to the southwest of Levänluhta (i.e. the whole coast of the Bothnian Sea). It is unclear why they would prefer an origin of the dietary source in southern Baltic regions instead of some km to the west, though, unless that’s what they want to propose based on the sample’s admixture…

The coast of the Bothnian Sea (=the northern part of the Baltic Sea, between Sweden and Finland) lay only 25-30 km to the northwest, and accessible to the Iron Age people of the Levänluhta region via the Kyrönjoki river. (…) For individual JA2065/DA236, the low 87Sr/86Sr value (0.71078) would imply an exceptionally heavy reliance on Baltic Sea resources. The δ13C and δ15N values of the individual are near comparable (especially considering within-Baltic latitudinal gradients in δ13C; Torniainen et al. 2017) to the δ13C and δ15N values of a Middle Neolithic population on the Baltic island of Gotland (Eriksson, 2004) interpreted to have subsisted primarily on seals.

These new data on the samples give us some more information than what we already had, because the early date of Finnish_IA implies that there was little East Asian admixture (if any at all) in west Finland during the Roman Iron Age, which pushes still farther forward in time the expected appearance of Siberian ancestry among Saamic (first) and Fennic populations (later). It is unclear whether this East Asian ancestry found in Finnish_IA is actually related to DGC, or it is rather related to the ENA-like ancestry found already in Baltic hunter-gatherers (i.e. in some EHG samples from Karelia), for which Baikal_EN is a good proxy in Lazaridis et al. (2018).

Since Bronze Age and Iron Age samples from Estonia show more Baltic_HG drift compared to Corded Ware samples, it is likely that this supposedly DGC-related ancestry (here considered part of the ‘Siberian ancestry’) is actually an EHG-related ENA component of north-east European hunter-gatherers, with whom Finno-Saamic peoples admixed during the expansion of the Corded Ware culture into Finland.

The paper finds thus increased (probably the actual) Siberian ancestry in modern Finns compared to this Iron Age Saami individual. Coupled with the later Saami Iron Age samples, from between one to three centuries later – showing the start of Siberian ancestry influx –, we can begin to establish when the expansion of Siberian ancestry happened in central Finland, and thus quite likely when the Saami began to expand to the north and east and admix with Palaeo-Laplandic peoples.

siberian-population-expansions
Admixture modelling using qpAdm. Maps showing locations and ancestry proportions of ancient (left) and modern (right) groups.

One sample of haplogroup N1a1a1a1a4a1-M1982, Yana_MED, is found in the Arctic region (north-eastern Yakutia) ca. 1100 AD. Since it is derived from N1a1a1a1a-L392, it might be a surprise for some to find it in a clearly non-Uralic speaking environment at the same time other subclades of this haplogroup were admixing in the west with well-established Finno-Saamic, Volga-Finnic, Ugric, and Samoyedic populations…

On the growing doubts that these data – contradicting the CWC=IE theory – are creating among geneticists (from the supplementary materials):

NOTE. This paper comes from the Copenhagen group, also signed by Kristiansen, one of today’s strongest supporters of this connection

The Proto-Saami language evolved in southern Finland and Karelia in the Early Iron Age, an area now host to Finnish and the closely related Karelian, but with Saami toponyms showing that the latter two languages are intrusive here (Saarikivi 2004). Saami-speaking populations are thought to have retreated to Lapland during the Middle Iron Age (300–800 AD), where it diverged into the modern Saami dialects. Genetically, the northward retreat of the Saami language correlates with the documented decrease of Saami ancestry in Southern Finland between the Iron Age and the modern period (cf. Lamnidis et al. 2018).

On the way to Lapland, the Saami replaced at least two linguistically obscure groups. This can be inferred from 1) an influx of non-Uralic loanwords into Proto-Saami in the Finnish Lakeland area, and 2) an influx of non-Uralic, non-Germanic words into Saami dialects in Lapland (Aikio 2012). Both of these borrowing events imply contact with non-Saami-speaking groups, e.g. non-Uralic-speaking hunter-gatherers that may have left a genetic and linguistic footprint on modern Saami populations.

The linguistic prehistory of Finland thus does not allow for a straightforward interpretation of the genetic data. The detection of East Asian ancestry in the genetically Saami individual is indicative of a population movement from the east (cf. Lamnidis et al. 2018, Rootsi et al. 2007), one that given the affinities with the ~7.6 ky old individuals from the Devil’s Gate Cave may have been a western extension of the Neosiberian turnover. However, it remains unclear whether this gene flow should be associated with the arrival of Uralic speakers, thus providing further support for a Uralic homeland in Eastern Eurasia, or with an earlier immigration of pre-Uralic, so-called “Paleo-Lakelandic” groups.

I think the genetic interpretation is already straightforward, though. We had a sneak peek at how this late admixture with non-Uralians (mainly Palaeo-Lakelandic and Palaeo-Laplandic peoples from Lovozero and related asbestos ware cultures) is going to unfold among expanding Saami-speaking populations thanks to Lamnidis et al. (2018):

saamic-lovozero-pca
PCA plot of 113 Modern Eurasian populations, with individuals from this study projected on the principal components. Uralic speakers are highlighted in light purple. Image modified from Lamnidis et al. (2018)

Also, still no trace of R1a in far East Asia (reported as M17 ca. 5300 BC near Lake Baikal by Moussa et al. 2016), so I still have doubts about my previous assessment that R1a split into M17 (and thus also M417) in Siberia, among those expanding with hunter-gatherer pottery.


The Tungusic Ulchi population probably linked to haplogroup C2b1a

ulchi-marital

New paper (behind paywall) Demographic and Genetic Portraits of the Ulchi Population, by Balanovska et al. Russian Journal of Genetics (2018) 54(10):1245–1253.

Interesting excerpts (emphasis mine):

Marital structure. The intensity of interethnic marriages puts the existence of the Ulchi population at risk. The colorful ethnic composition of the Ulchi settlements is reflected in the marriage structure [see featured image]. We found that the proportion of single-ethnic marriages of the Ulchi is on average 51%. The greatest number of such marriages takes place in the village of Bulava. Marriages of Ulchi with Russians are in second place. Marriages with indigenous peoples of the Far East, Nanais, Nivkhs, Evenks, and others, are in third place. Thus, almost half of the Ulchi marriages are with representatives of other nationalities. Such a significant level of interethnic mixing makes it possible to talk about intense processes of assimilation of this indigenous people and puts to the forefront the problem of loss of the unique gene pool of the Ulchi.

Haplogroup C (its branch M48) was genotyped for its five subbranches with markers M86, B470, F13686, B93, and the marker at position 16645386 (GRCh37), which was found by our team for the first time. Variant B93 is rare in the Ulchi, and 14 samples (that is, more than a quarter of the entire gene pool of the Ulchi, Fig. 2) belong to M86 and its subvariants. Therefore, we genotyped STR markers of C-M86 carriers for the Ulchi and neighboring Amur populations and analyzed the relationships of detected haplotypes on the phylogenetic network (Fig. 3, STR haplotypes are available from authors upon request).

(…) On the network, different clusters are associated with different populations: most Mongols belong to F13686, all Evenks of the Amur River region with this haplogroup form a subcluster within F13686, and part of Upper Nanais is the basis of cluster B470.

ulchi-y-chromosome
Frequencies of haplogroups of Y chromosome in the Ulchi population. The nomenclature of haplogroups is given according to [9]. Markers that are not in bold type were not typed, but are ancestral for these nodes.

An estimate of the age of the entire haplogroup C-F12355 obtained from the data of genome-wide sequencing of seven specimens is 2400 ± 500 years (O.P. Balanovsky, unpublished data). That is, the common ancestor of all the studied representatives of various peoples with this haplogroup lived not so long ago, the first millennium BC. The formation time of cluster F13686 is somewhat later: 1990 ± 600 years.
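
To make the arithmetic behind the quoted ages explicit, they can be converted to approximate calendar years. This is only a sketch: it assumes the ages are counted back from 2018 (the publication year; the excerpt does not state the reference point) and treats the ± value as a simple symmetric range. The function names are hypothetical.

```python
def to_calendar_label(years_ago, reference_year=2018):
    """Convert an age in years before `reference_year` to a BC/AD label.

    reference_year=2018 is an assumption (publication year of the paper).
    There is no year 0, so astronomical year 0 maps to 1 BC.
    """
    year = reference_year - years_ago
    if year <= 0:
        return f"{1 - year} BC"
    return f"AD {year}"


def age_range(age, uncertainty, reference_year=2018):
    """Return (oldest, central, youngest) calendar labels for age ± uncertainty."""
    return (
        to_calendar_label(age + uncertainty, reference_year),
        to_calendar_label(age, reference_year),
        to_calendar_label(age - uncertainty, reference_year),
    )


# Ages quoted in the excerpt.
c_f12355 = age_range(2400, 500)  # whole haplogroup C-F12355
f13686 = age_range(1990, 600)    # cluster F13686

print("C-F12355:", c_f12355)
print("F13686:  ", f13686)
```

Under these assumptions the central estimate for C-F12355 falls in the first millennium BC, as the authors state, while the central estimate for F13686 lands about four centuries later, with the two ranges overlapping widely.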

(…) obvious traces of the interaction of the gene pool of the Ulchi with neighboring and remote peoples of the Far East and Central Asia in the time range of the last one to three thousand years were revealed. This shows that the results of work [4] on the similarity of the gene pool of the ancient (age of 7500 years) Neolithic genomes of the Amur River region to the Ulchi probably indicate not the uniqueness of the Ulchi, but the fact that this ancient gene pool was preserved in a vast circle of populations of the Far East interwoven with gene flows both with each other and, to a lesser extent, with populations of Central Asia.

The expansion of C2b1a2a-M86 (among many basal C2-M217 samples) is thus possibly associated with the spread of Tungusic, which puts C2b1a at the root of the Micro-Altaic expansion, with a formation date ca. 12700 BC, TMRCA 12500 BC (and not only Mongolian). This shows that Micro-Altaic is connected with a local population which shows a clear continuity since at least 3500 BC. This, however, tells us little about the origin of the language.

See also the recent ISBA presentation on the Houtaomuga site, Neolithic transition in Northeast Asia; and also Bronze Age population dynamics and rise of dairy pastoralism in Mongolia, Impact of colonization in north-eastern Siberia

That leaves the ancestral N lineages found among Far East Asians as Palaeo-Siberian in origin, and their late expansions to the west not particularly linked with any of the known Palaeo-Siberian ethnolinguistic groups, let alone a supposed “Uralo-Altaic” language…


Dzudzuana, Sidelkino, and the Caucasus contribution to the Pontic-Caspian steppe

hunter-gatherer-pottery

It has been known for a long time that the Caucasus must have hosted many (at least partially) isolated populations, probably helped by geographical boundaries, setting it apart from open Eurasian areas.

David Reich writes in his book the following about India:

The genetic data told a clear story. Around a third of Indian groups experienced population bottlenecks as strong or stronger than the ones that occurred among Finns or Ashkenazi Jews. We later confirmed this finding in an even larger dataset that we collected working with Thangaraj: genetic data from more than 250 jati groups spread throughout India (…)

Rather than an invention of colonialism as Dirks suggested, long-term endogamy as embodied in India today in the institution of caste has been overwhelmingly important for millennia. (…)

The Han Chinese are truly a large population. They have been mixing freely for thousands of years. In contrast, there are few if any Indian groups that are demographically very large, and the degree of genetic differentiation among Indian jati groups living side by side in the same village is typically two to three times higher than the genetic differentiation between northern and southern Europeans. The truth is that India is composed of a large number of small populations.

There is little doubt now, based on findings spanning thousands of years, that the Mesolithic and Neolithic Caucasus hosted various very small populations, even if the ancestral components may be reduced to the few known to date (such as ANE, EHG, AME*, ENA, CHG, and other “deep” ancestral components).

NOTE. I will call the ancestral component of Dzudzuana/Anatolian hunter-gatherers Ancient Middle Easterner (AME), to give a clear idea of its likely extension during the Late Upper Palaeolithic, and to avoid using the more simplistic Dzudzuana, unless it is useful to mention these specific local samples.

dzudzuana-pca
Image modified from Lazaridis et al. (2018), including Caucasus, Don-Volga-Ural, and North Pontic Mesolithic-Neolithic populations. “Ancient West Eurasian population structure. (a) Geographical distribution of key ancient West Eurasian populations. (b) Temporal distribution of key ancient West Eurasian populations (approximate date in ky BP). (c) PCA of key ancient West Eurasians, including additional populations (shown with grey shells), in the space of outgroup f4-statistics (Methods).”

Genetic labs have a strong fixation with ancestry. I guess the use of complex statistical methods gives professionals and laymen alike the feeling of dealing with “Science”, as opposed to academic fields where you have to interpret data. I think language reveals a lot about the way people think, and the fact that ancestral components are called ‘lineages’ – while not wrong per se – is a clear symptom of the lack of interest in the true lineages: Y-DNA haplogroups.

Y-DNA bottlenecks

It has become quite clear that male-biased migrations are often the ones which can be confidently followed for actual population movements and ethnolinguistic identification, at least until the Iron Age. The frequently used Palaeolithic clusters offer a clear example of why ancestry does not represent what some people believe: They merely give a basic idea of sizeable population replacements by distant peoples.

Both concepts are important: sizeable and distant peoples. For example, during the Upper Palaeolithic in Europe there was a sizeable population replacement of the Aurignacian Goyet cluster by the Gravettian Vestonice cluster (probably from populations of far eastern Russia) coupled with the arrival of haplogroup I, although during the thousands of years that this material culture lasted, the previously expanded C1a2 lineages did not disappear, and there were probably different resurgence and admixture events.

Haplogroup I certainly expanded with the Gravettian culture to Iberia, where the Goyet ancestry did not change much – probably because of male-driven migrations –, to the extent that during the Magdalenian expansions haplogroup I expanded with an ancestry closer to Goyet, in what is called a ‘resurgence’ of the Goyet cluster – even though there is a clear replacement of male lines.

The Villabruna (WHG) cluster is another good example. It probably spread with haplogroup R1b-L754, which – based on the extra ‘East Asian’ affinity of some samples and on modern samples from the Middle East – came probably from the east through a southern route, and not too long before the expansion of WHG likely from around the Black Sea, although this is still unclear. The finding of haplogroup I in samples of mostly WHG ancestry could confuse people that do not care about timing, sub-structured populations, and gene flow.

palaeolithic-expansions-reich
Image from David Reich’s Who We Are and How We Got Here. Having migrated out of Africa and the Near East, modern human pioneer populations spread throughout Eurasia (1). By at least thirty-nine thousand years ago, one group founded a lineage of European hunter-gatherers that persisted largely uninterrupted for more than twenty thousand years (2). Eventually, groups derived from an eastern branch of this founding population of European hunter-gatherers spread west (3), displaced previous groups, and were eventually themselves pushed out of northern Europe by the spread of glacial ice, shown at its maximum extent (top right). As the glaciers receded, western Europe was repeopled from the southwest (4) by a population that had managed to persist for tens of thousands of years and was related to an approximately thirty-five-thousand-year old individual from far western Europe. A later human migration, following the first strong warming period, had an even larger impact, with a spread from the southeast (5) that not only transformed the population of western Europe but also homogenized the populations of Europe and the Near East. At a single site – Goyet Caves in Belgium – ancient DNA from individuals spread over twenty thousand years reflects these transformations, with representatives from the Aurignacian, Gravettian, and Magdalenian periods.

NOTE. If you don’t understand why ‘clusters’ that span thousands of years don’t really matter for the many Palaeolithic population expansions that certainly happened among hunter-gatherers in Europe, just take a look at what happened with Bell Beakers expanding from Yamna into western Europe within 500 years.

If we don’t tread carefully when talking about population migrations, these terms are bound to confuse people. Just as the fixation on “steppe ancestry” – which marks the arrival in Chalcolithic Europe of peoples from the Pontic-Caspian region – has confused a lot of researchers to this day.

When I began to write about the Indo-European demic diffusion model, my concern was to find a single spot where a North-West Indo-European proto-language could have expanded from ca. 2000 BC (our most common guesstimate). Based on the 2015 papers, and in spite of their conclusions, I thought it had become clear that Corded Ware was not it, and it was rather Bell Beakers. I assumed that Uralic was spoken to the north (as was the traditional belief), and thus Corded Ware expanded from the forest zone, hence steppe ancestry would also be found there with other R1a lineages.

With the publication of Mathieson et al. (2017) and Olalde et al. (2017), I changed my mind, seeing how “steppe ancestry” did in fact appear quite late, hence it was likely to be the result of very specific population movements, probably directly from the Caucasus. Later, Mathieson published in a revision the sample from Alexandria of hg R1a-M417 (probably R1a-Z645, possibly Z93+), which further supported the idea that the migration of Corded Ware peoples started near the North Pontic forest-steppe (as I included in the next revision).

The question remains the same one I repeated recently, though: where do the extra Caucasus components (i.e. beyond EHG) of Eneolithic Ukraine/Corded Ware and Khvalynsk/Yamna come from?

Steppe ancestry: “EHG” + “CHG”?

About EHG ancestry

From Lazaridis et al. (2018):

Considering 2-way mixtures, we can model Karelia_HG as deriving 34 ± 2.8% of its ancestry from a Villabruna-related source, with the remainder mainly from ANE represented by the AfontovaGora3 (AG3) sample from Lake Baikal ~17kya.

AG3 was likely of haplogroup Q1a (as reported by YFull, see Genetiker), and probably the ANE ancestry found in Eastern Europe accompanied a Palaeolithic migration of Q1a2-M25 (formed ca. 22600 BC, TMRCA ca. 14300 BC).

NOTE. You can read more about the expansion of Q lineages during the Palaeolithic.

Combined with what we know about the Eneolithic Steppe and Caucasus populations, it is likely that ANE ancestry remained the most important component of some of the small ghost populations of the Caucasus until their emergence with the Lola culture.

pca-caucasus-dzudzuana
Image modified from Wang et al. (2018). Samples projected in PCA of 84 modern-day West Eurasian populations (open symbols). Previously known clusters have been marked and referenced. Marked and labelled are the Balkan samples referenced in this text. EHG and Caucasus ‘clouds’ have been drawn, leaving Pontic-Caspian steppe and derived groups between them. See the original file here. To understand the drawn potential Caucasus Mesolithic cluster, see above the PCA from Lazaridis et al. (2018).

The first sample we have now attributed to the EHG cluster is Sidelkino, from the Samara region (ca. 9300 BC), mtDNA U5a2. In Damgaard et al. (Science 2018), Yamnaya could be modelled as deriving from a CHG population related to Kotias Klde (54%) and the remainder from an ANE population related to Sidelkino (>46%), with the following split events:

  1. A split event, where the CHG component of Yamnaya splits from KK1. The model inferred this time at 27 kya (though we note the larger models in Sections S2.12.4 and S2.12.5 inferred a more recent split time).
  2. A split event, where the ANE component of Yamnaya splits from Sidelkino. This was inferred at about 11 kya.
  3. A split event, where the ANE component of Yamnaya splits from Botai. We inferred this to occur 17 kya. Note that this is above the Sidelkino split time, so our model infers Yamnaya to be more closely related to the EHG Sidelkino, as expected.
  4. An ancestral split event between the CHG and ANE ancestral populations. This was inferred to occur around 40 kya.
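
The reasoning in point 3 (a more recent split means a closer relationship) can be sketched in a few lines. This is only an illustration of the quoted figures, not the model from Damgaard et al.; the dictionary labels and function names are my own shorthand.

```python
# Split times quoted above from Damgaard et al. (2018), in thousands of years ago (kya).
# The labels are illustrative shorthand, not identifiers used in the paper.
SPLITS_KYA = {
    "CHG component of Yamnaya vs KK1": 27,
    "ANE component of Yamnaya vs Sidelkino": 11,
    "ANE component of Yamnaya vs Botai": 17,
    "CHG vs ANE ancestral populations": 40,
}


def ordered_splits(splits):
    """Return (label, kya) pairs from the most recent split to the oldest."""
    return sorted(splits.items(), key=lambda item: item[1])


def closer_relative(splits, label_a, label_b):
    """Return the label with the more recent split, i.e. the closer relative."""
    return label_a if splits[label_a] < splits[label_b] else label_b


for label, kya in ordered_splits(SPLITS_KYA):
    print(f"{kya:>3} kya  {label}")
```

Comparing the Sidelkino and Botai entries with `closer_relative` returns the Sidelkino one, which is the same conclusion the quoted text draws.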

Other samples classified as of the EHG cluster:

  • Popovo2 (ca. 6250 BC) of hg J1, mtDNA U4d – Po2 and Po4 from the same site (ca. 6550 BC) show continuity of mtDNA.
  • Karelia_HG, from Juzhnii Oleni Ostrov (ca. 6300 BC): I0211/UzOO40 (ca. 6300 BC) of hg J1(xJ1a), mtDNA U4a; and I0061/UzOO74 of hg R1a1(xR1a1a), mtDNA C1
  • UzOO77 and UzOO76 from Juzhnii Oleni Ostrov (ca. 5250 BC) of mtDNA R1b.
  • Samara_HG from Lebyanzhinka (ca. 5600 BC) of hg R1b1a, mtDNA U5a1d.

From the analysis of Lazaridis et al. (2018), we have some details about their admixture:

dzudzuana-admixture-sidelkino
Image modified from Lazaridis et al. (2018). Modeling present-day and ancient West-Eurasians. Mixture proportions computed with qpAdm (Supplementary Information section 4). The proportion of ‘Mbuti’ ancestry represents the total of ‘Deep’ ancestry from lineages that split prior to the split of Ust’Ishim, Tianyuan, and West Eurasians and can include both ‘Basal Eurasian’ and other (e.g., Sub-Saharan African) ancestry. (Left) ‘Conservative’ estimates. Each population cannot be modeled with fewer admixture events than shown. (Right) ‘Speculative’ estimates. The highest number of sources (≤5) with admixture estimates within [0,1] are shown for each population. Some of the admixture proportions are not significantly different from 0 (Supplementary Information section 4).

About Anatolia_Neolithic ancestry

About the enigmatic Anatolia_Neolithic-related ancestry found in Pontic-Caspian steppe samples, this is what Wang et al. (2018) had to say:

We focused on model of mixture of proximal sources such as CHG and Anatolian Chalcolithic for all six groups of the Caucasus cluster (Eneolithic Caucasus, Maykop and Late Maykop, Maykop-Novosvobodnaya, Kura-Araxes, and Dolmen LBA), with admixture proportions on a genetic cline of 40-72% Anatolian Chalcolithic related and 28-60% CHG related (Supplementary Table 7). When we explored Romania_EN and Greece_Neolithic individuals as alternative southeast European sources (30-46% and 36-49%), the CHG proportions increased to 54-70% and 51-64%, respectively. We hypothesize that alternative models, replacing the Anatolian Chalcolithic individual with yet unsampled populations from eastern Anatolia, South Caucasus or northern Mesopotamia, would probably also provide a fit to the data from some of the tested Caucasus groups.
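
Since these are two-way models, the two proportions of each group must sum to 100%, so the quoted CHG ranges are simply the complements of the farmer-related ranges. A minimal check of that arithmetic, using only the figures quoted above (the function and variable names are hypothetical):

```python
def complement_range(low_pct, high_pct):
    """Given the range of one source in a two-way model, return the other's range."""
    return (100 - high_pct, 100 - low_pct)


# (farmer-related range, CHG range) as quoted from Wang et al. (2018)
QUOTED_RANGES = {
    "Anatolian Chalcolithic": ((40, 72), (28, 60)),
    "Romania_EN": ((30, 46), (54, 70)),
    "Greece_Neolithic": ((36, 49), (51, 64)),
}

for source, (farmer_range, chg_range) in QUOTED_RANGES.items():
    consistent = complement_range(*farmer_range) == chg_range
    print(f"{source}: farmer {farmer_range}, CHG {chg_range}, consistent={consistent}")
```

All three pairs are consistent, which shows that swapping the farmer source shifts the whole cline towards CHG rather than adding a third component.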

Also:

The first appearance of ‘Near Eastern farmer related ancestry’ in the steppe zone is evident in Steppe Maykop outliers. However, PCA results also suggest that Yamnaya and later groups of the West Eurasian steppe carry some farmer related ancestry as they are slightly shifted towards ‘European Neolithic groups’ in PC2 (Fig. 2D) compared to Eneolithic steppe. This is not the case for the preceding Eneolithic steppe individuals. The tilting cline is also confirmed by admixture f3-statistics, which provide statistically negative values for AG3 as one source and any Anatolian Neolithic related group as a second source
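
For readers unfamiliar with the admixture f3-statistic mentioned in the quote, here is a rough sketch of the idea: f3(Target; A, B) averages (t − a)(t − b) over SNPs, and a clearly negative value indicates that the target is intermediate between, and hence likely admixed from, populations related to A and B. This simplified version omits the finite-sample bias correction and the block jackknife that real software applies, and the allele frequencies are invented toy values, not data from any paper.

```python
def f3(target, source_a, source_b):
    """Naive f3(target; source_a, source_b) from per-SNP allele frequencies.

    Simplified: no finite-sample bias correction, no standard error.
    """
    if not target or not (len(target) == len(source_a) == len(source_b)):
        raise ValueError("need equally long, non-empty frequency lists")
    total = sum((t - a) * (t - b) for t, a, b in zip(target, source_a, source_b))
    return total / len(target)


# Toy allele frequencies (hypothetical values, for illustration only)
source_a = [0.1, 0.8, 0.3]
source_b = [0.9, 0.2, 0.7]
# A target built as an equal mixture of the two sources
mixed = [(a + b) / 2 for a, b in zip(source_a, source_b)]

print(f3(mixed, source_a, source_b))     # negative: target lies between the sources
print(f3(source_a, source_a, source_b))  # zero: target identical to one source
```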

yamnaya-caucasus-dzudzuana
Modified image from Wang et al. (2018). In blue, Yamna-related populations. In red, Corded Ware-related populations, and two elevated Anatolia_Neolithic values in Yamna. Notice how only GAC-related admixture increases the Anatolian_N-related ancestry in the Yamna outlier from Ozero, and the late Yamna sample from Hungary, related to the homogeneous Yamna population. “Supplementary Table 14. P values of rank=3 and admixture proportions in modelling Steppe ancestry populations as a four-way admixture of distal sources EHG, CHG, Anatolian_Neolithic and WHG using 14 outgroups. Left populations: Steppe cluster, EHG, CHG, WHG, Anatolian_Neolithic. Right populations: Mbuti.DG, Ust_Ishim.DG, Kostenki14, MA1, Han.DG, Papuan.DG, Onge.DG, Villabruna, Vestonice16, ElMiron, Ethiopia_4500BP.SG, Karitiana.DG, Natufian, Iran_Ganj_Dareh_Neolithic.”

Detailed exploration via D-statistics in the form of D(EHG, steppe group; X, Mbuti) and D(Samara_Eneolithic, steppe group; X, Mbuti) show significantly negative D values for most of the steppe groups when X is a member of the Caucasus cluster or one of the Levant/Anatolia farmer-related groups (Supplementary Figs. 5 and 6). In addition, we used f- and D-statistics to explore the shared ancestry with Anatolian Neolithic as well as the reciprocal relationship between Anatolian- and Iranian farmer-related ancestry for all groups of our two main clusters and relevant adjacent regions (Supplementary Fig. 4). Here, we observe an increase in farmer-related ancestry (both Anatolian and Iranian) in our Steppe cluster, ranging from Eneolithic steppe to later groups. In Middle/Late Bronze Age groups especially to the north and east we observe a further increase of Anatolian farmer related ancestry consistent with previous studies of the Poltavka, Andronovo, Srubnaya and Sintashta groups and reflecting a different process not especially related to events in the Caucasus.
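
To make the sign convention of D(EHG, steppe group; X, Mbuti) concrete, the following sketch computes a D-statistic from allele frequencies using the usual frequency-based form: the sum of (p1 − p2)(p3 − p4) divided by the sum of (p1 + p2 − 2·p1·p2)(p3 + p4 − 2·p3·p4). A negative value means the second population shares more alleles with X than the first does. The frequencies are invented toy values and no significance test (Z-score via block jackknife) is included.

```python
def d_statistic(p1, p2, p3, p4):
    """Frequency-based D(P1, P2; P3, P4) summed over SNPs (no standard error)."""
    if not p1 or not (len(p1) == len(p2) == len(p3) == len(p4)):
        raise ValueError("need four equally long, non-empty frequency lists")
    numerator = 0.0
    denominator = 0.0
    for w, x, y, z in zip(p1, p2, p3, p4):
        numerator += (w - x) * (y - z)
        denominator += (w + x - 2 * w * x) * (y + z - 2 * y * z)
    if denominator == 0:
        raise ValueError("no informative sites")
    return numerator / denominator


# Toy allele frequencies (hypothetical values, for illustration only)
ehg_like = [0.2, 0.6, 0.4]
source_x = [0.8, 0.1, 0.9]
outgroup = [0.5, 0.5, 0.5]
# A 'steppe-like' group built as 70% EHG-like plus 30% X-like
steppe_like = [0.7 * e + 0.3 * x for e, x in zip(ehg_like, source_x)]

print(d_statistic(ehg_like, steppe_like, source_x, outgroup))  # negative
```

Swapping the first two populations flips the sign, which is why the order of the arguments matters when reading the supplementary figures.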

(…) Surprisingly, we found that a minimum of four streams of ancestry is needed to explain all eleven steppe ancestry groups tested, including previously published ones (Fig. 2; Supplementary Table 12). Importantly, our results show a subtle contribution of both Anatolian farmer-related ancestry and WHG-related ancestry (Fig.4; Supplementary Tables 13 and 14), which was likely contributed through Middle and Late Neolithic farming groups from adjacent regions in the West. The discovery of a quite old AME ancestry has rendered this probably unnecessary, because this admixture from an Anatolian-like ghost population could be driven even by small populations from the Caucasus.

yamna-caucasus-cwc-anatolia-neolithic
Image modified from Wang et al. (2018). Marked are: in red, approximate limit of Anatolia_Neolithic ancestry found in Yamna populations; in blue, Corded Ware-related groups. “Modelling results for the Steppe and Caucasus cluster. Admixture proportions based on (temporally and geographically) distal and proximal models, showing additional Anatolian farmer-related ancestry in Steppe groups as well as additional gene flow from the south in some of the Steppe groups as well as the Caucasus groups (see also Supplementary Tables 10, 14 and 20).”

NOTE. For a detailed account of the possibilities regarding this differential admixture in the North Pontic area in contrast to the Don-Volga-Ural region, you can read the posts Sredni Stog, Proto-Corded Ware, and their “steppe admixture”, and Corded Ware culture origins: The Final Frontier.

While it is not yet fully clear, the increased Anatolian_Neolithic-like ancestry in Ukraine_Eneolithic samples (see below) makes it unlikely that all such ancestry in Corded Ware groups comes from a GAC-related contribution. It is likely that at least part of it represents contributions from populations of the Caucasus, based on the mostly westward population movements in the steppe from ca. 4600 BC on, including the Suvorovo-Novodanilovka expansion, and especially the Kuban-Maykop expansion during the final Eneolithic into the North Pontic area.

NOTE. Since CHG-like groups from the Caucasus may have combinations of AME and ANE ancestry similar to Yamna (which may thus appear as ‘steppe ancestry’ in the North Pontic area), it is impossible to interpret with precision the following ADMIXTURE graphic:

ukraine-whg-ehg-steppe
Modified image from Mathieson et al. (2018). Supervised ADMIXTURE analysis, modelling each ancient individual (one per row) as a mixture of population clusters constrained to contain northwestern-Anatolian Neolithic (grey), Yamnaya from Samara (yellow), EHG (pink) and WHG (green) populations. Dates in parentheses indicate approximate range of individuals in each population.

North-Eastern Technocomplex

The East Asian contribution to the WHG samples (like Loschbour or La Braña), as specified in Fu et al. (2016), does not seem to be related to Baikal_EN, and appears possibly (in the ADMIXTURE analysis) integrated into the Villabruna component. I guess this implies that the shared alleles with East Asians are quite early, and potentially due to the expansion of R1b-L754 from the East.

It would be interesting to know the specific material culture Sidelkino belonged to (i.e. whether it was related to the expansion of the North-Eastern Technocomplex), and its Y-DNA. The Post-Swiderian expansion into eastern Europe, probably associated with the expansion of R1b-P297 lineages (including R1b-M73, found later in Botai and in Baltic HG), is supposed to have begun during the 11th millennium BC, but migrations to the Urals and beyond are probably concentrated in the 9th millennium, so this sample is possibly slightly early for R1b.

NOTE. User Rozenfeld at Anthrogenica posted this, which I think is interesting (in case anyone wants to try a Y-SNP call):

there is something strange with Sidelkino EHG: first, its archaeological context is not described in the supplementary. Second, its sex is not listed in the supplementary tables. Third, after looking for info about this sample, I found that: “Сиделькино-3. Для снятия вопроса о половой принадлежности индивида была проведена генетическая экспертиза, выявившая принадлежность останков мужчине.”(translation: Sidelkino-3. To resolve the question about sex of the remains, the genetic analysis was conducted, which showed that remains belonged to male), source: http://static.iea.ras.ru/books/7487_Traditsii.pdf

So either they haven’t mentioned his Y-DNA in the paper for some reason, or there are more than one Sidelkino sample and the male one has not yet been published. The coverage of the Sidelkino sample from the paper is 2.9, more than enough to tell Y-DNA haplogroup.

zaliznyak-post-swiderian
The map of spreading of Post-Swiderian and Post-Krasnosillian sites in Mesolithic of Eastern Europe in the 8th millennium BC. From Zaliznyak (see here).

My speculative guess right now about specific population movements in far eastern Europe, based on the few data we have:

  • The expansion of the North-Eastern Technocomplex first around the 9th millennium BC, most likely expanded R1b-P297 ca. 11300 BC, judging by its TMRCA, with both R1b-M73 (TMRCA 5300 BC) and R1b-M269 (TMRCA 4400 BC) info (with extra El Mirón ancestry) back, and thus Eurasiatic.
  • The expansion of haplogroup J1 to the north may have happened before or after the R1b-P297 expansion. Judging by the increase in AG3-related ancestry near Karelia compared to Baltic_HG, it is possible that it expanded just after R1b-P297 (hence possibly J1-Y6304? TMRCA 9700 BC). Its long-lasting presence in the Caucasus is supported by the Satsurblia (ca. 11300 BC) and the Dolmen BA (ca. 1300 BC) samples.
  • The expansion of R1a-M17 ca. 6600 BC is still likely to have happened from the east, based on the R1a-M17 samples found in Baikalic cultures slightly later (ca. 5300 BC). The presence of elevated Baikal_EN ancestry in Karelia HG and in Samara HG, and the finding of R1a-M417 samples in the Forest Zone after the Mesolithic suggest a connection with the expansion of Hunter-Gatherer pottery, from the Elshanka culture in the Samara region northward into the Forest Zone and westward into the North Pontic area.
  • The expansion of R1b-M73 ca. 5300 BC is likely to be associated with the emergence of a group east of the Urals (related to the later Botai culture, and potentially Pre-Yukaghir). Its presence in a Narva sample from Donkalnis (ca. 5200 BC) suggests either an early split and spread of both R1b-P297 lineages (M73 and M269) through Eastern Europe, or maybe a back-migration with hunter-gatherer pottery.
  • R1b-M269 spread successfully ca. 4400 BC (and R1b-L23 ca. 4100 BC, both based on TMRCA), and this successful expansion is probably to be associated with the Khvalynsk-Novodanilovka expansion. We already know that Samara_HG ca. 5600 BC was R1b1a, so it is likely that R1b-M269 appeared (or ‘resurged’) in the Volga-Ural region shortly after the expansion of R1a-M17, whose expansion through the region may be inferred from the additional AG3 and Baikal_EN ancestry. Interesting in Samara_HG, compared to the previous Sidelkino sample, is the introduction of more El Mirón-related ancestry, typical of WHG populations (and thus characteristic of Baltic groups).
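
To keep the sequence of these guesstimates straight, the dates in the list above can be put on a single timeline. This is just a bookkeeping sketch over the approximate dates quoted in this post (the J1-Y6304 entry is as tentative here as it is in the text), with hypothetical function names.

```python
# Approximate dates in years BC, as quoted in the list above.
EXPANSIONS_BC = {
    "R1b-P297": 11300,
    "J1-Y6304": 9700,   # tentative subclade assignment
    "R1a-M17": 6600,
    "R1b-M73": 5300,
    "R1b-M269": 4400,
    "R1b-L23": 4100,
}


def timeline(dates_bc):
    """Return (label, year_bc) pairs ordered from oldest to most recent."""
    return sorted(dates_bc.items(), key=lambda item: item[1], reverse=True)


def gap_years(dates_bc, older, younger):
    """Years between two BC dates (positive when `older` is the earlier one)."""
    return dates_bc[older] - dates_bc[younger]


for label, year_bc in timeline(EXPANSIONS_BC):
    print(f"ca. {year_bc:>5} BC  {label}")
```

For instance, `gap_years(EXPANSIONS_BC, "R1a-M17", "R1b-M269")` gives the roughly two millennia separating the proposed R1a-M17 expansion from the R1b-M269 one.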

NOTE. The TMRCA dates are obviously gross approximations, because a) the actual rate of mutation is unknown and b) TMRCA estimates are based on the convergence of lineages that survived. The potential finding of R1a-Z645 (possibly Z93+) in Ukraine Eneolithic (ca. 4000 BC), and the potential finding of R1b-L23 in Khvalynsk ca. 4250 BC complicate things further, in terms of dates and origins of any subclade.

The question thus remains as it was long ago: did R1b-M269 lineages expand (‘return’) from the east, near the Urals, or directly from the north? Were they already near Samara at the same time as the expansion of hunter-gatherer pottery, and were not much affected by it? Or did they ‘resurge’ from populations admixed with Caucasus-related ancestry after the expansion of R1a-M17 with this pottery (since there are different stepped expansions from the Samara region)? We could even ask, did R1a-M17 really expand from the east, i.e. are the dates on Baikalic subclades from Moussa et al. (2016) reliable? Or did R1a-M17 expand from some pockets in the Pontic-Caspian steppe, taking over the expansion of HG pottery at some point?

hunger-gatherer-pottery
Early Neolithic cultures in eastern and central Europe: 1–Yelshanian; 2–North Caspian; 3–Rakushechnyj Yar; 4–Surskian; 5–Dnieper-Donetsian; 6– Bug-Dniesterian; 7–Upper Volga; 8–Narvian; 9–Linear Pottery. White arrows: expansion of early farming; black arrows: spread of pottery-making traditions. From Dolukhanov et al. (2009).

Maglemose-related migrations

The most interesting aspect from the new paper (regarding Indo-Uralic migrations) is that Ancestral Middle Easterner ancestry will probably be a better proxy for the Anatolia_Neolithic component found in Ukraine Mesolithic to Eneolithic, and possibly also for some of the “more CHG-like” component found among Pontic-Caspian steppe populations, all likely derived from different admixture events with groups from the Caucasus.

NOTE. Even the supposed gene flow of Neolithic Iranian ancestry into the Caucasus can be put into question, since that means possibly a Dzudzuana-like population with greater “deep ancestry” proportion than the one found in CHG, which may still be found within the Caucasus.

If it was not clear already that following ‘steppe ancestry’ wherever it appears is a rather lame way of following Indo-European migrations, every single sample from the Caucasus and their admixture with Pontic-Caspian steppe populations will probably show that “steppe ancestry” is in fact formed by a variety of steppe-related ancestral components, impossible to follow coherently with a single population. Exactly what is happening already with the Siberian ancestry.

If the paper on the Dzudzuana samples has shown something, it is that the expansion of an ANE-like population shook the entire Caucasus area up to the Zagros Mountains, creating the ANE – AME cline formed by CHG and Iran_N, with further contributions of “deep ancestries” (probably from the south) complicating the picture further.

If this happens with few known samples, and we know of an ANE-like ghost population in the Caucasus (appearing later in the Lola culture), we can already guess that the often repeated “CHG component” found in Ukraine_Eneolithic and Khvalynsk will not be the same (except the part mediated by the Novodanilovka expansion).

This ANE-like expansion happened probably in the Late Upper Palaeolithic, and reached Northern Europe probably after the expansion of the Villabruna cluster (ca. 12000 BC), judging by the advance of AG3-like and ENA-like ancestry in later WHG samples.

The population movements during the Mesolithic and Early Neolithic in the North Pontic area are quite complicated: the extra AME ancestry is probably connected to the admixture with populations from the Caucasus, while the close similarity of Ukraine populations with Scandinavian ones (with an increase in Villabruna ancestry from Mesolithic to Neolithic samples) probably reveals population movements related to the expansion of Maglemose-related groups.

maglemose-mesolithic
Etno-cultural situation in Central and Eastern Europe in the Late Mesolithic — Early Neolithic (VI—V Mill. BC) (after Конча 2004: 201, карта 1; made after ideas by L. L. Zaliznyak). Legend: 1 — Maglemose circle in the VII Mill. BC (after Gr. Clark); 2—7 — Mesolithic cultures of the Post-Maglemose tradition, VI Mill. BC (after S. Kozłowsky, L. L. Zaliznyak): 2 — de Leyen-Wartena; 3 — Oldesloe — Godenaa; 4 — Chojnice — Peńki; 5 — Janisłavice; 6 — finds of Janisłavice artefacts outside of the main area; 7 — Donets culture; 8 — directions of the settling of Janisłavice people (after S. Kozłowsky and L. L. Zaliznyak); 9 — the south border of Mesolithic and Early Neolithic cultures of post-Swidrian and post-Arensburgian traditions; 10 — northern border of settlement of the Balkan-Danubian farmers; 11 — Bug- Dniester culture; 12 — Neolithic cultures emerged on the ethno-cultural basis of post-Maglemose: Э — Ertebölle-Ellerbeck, Н — Neman, Д — Dnieper-Donets, М — Mariupol (western variants). From Klein (2017).

These Maglemose-related groups were probably migrants from the north-west, originally from the Northern European Plains, who occupied the previous Swiderian territory, and then expanded into the North Pontic area. The overwhelming presence of I2a (likely all I2a2a1b1b) lineages in Ukraine Neolithic supports this migration.

The likely picture of Mesolithic-Neolithic migrations in the North Pontic area right now is then:

  1. Expansion of R1a-M459 from the east ca. 12000 BC – probably coupled with AG3 and also some Baikal_EN ancestry. First sample is I1819 from Vasilievka (ca. 8700 BC), another is from Dereivka ca. 6900 BC.
  2. Expansion of R1b-V88 from the Balkans in the west ca. 9700 BC, based on its TMRCA and also the Balkan hunter-gatherer population overwhelmingly of this haplogroup from the 10th millennium until the Neolithic. First sample is I1734 from Vasilievka (ca. 7252 BC), which suggests that it replaced the male population there, based on their similar EHG-like admixture (and lack of sizeable WHG increase), and shared mtDNA U5b2, U5a2.
  3. Expansion of I2a-Y5606 probably ca. 6800 BC, based on its TMRCA, with the Janislawice culture. Supporting this is the increase in WHG contribution to Neolithic samples, including the spread of U4 subclades compared to the previous period.
  4. Expansion of R1a-M17 starting probably ca. 6600 BC in the east (see above).

NOTE. The first sample of haplogroup I appears in the Mesolithic: I1763 (ca. 8100 BC) of haplogroup I2a1, probably related to an older Upper Palaeolithic expansion.

janislawice
Distribution of archeological cultures in the North Pontic Region during the Mesolithic (7th – 6th millennium BCE). Dotted, dashed and solid lines with corresponding arrows indicate alternative models of the spread of the Grebenyky culture groups. (After Bryuako IV., Samojlova TL., Eds, Drevnie kul’tury Severo-Zapadnogo Prichernomor’ya, Odessa: SMIL, 2013.) Nikitin – Ivanova 2017.

Conclusion

It is becoming more and more clear with each new paper that – unless the number of very ancient samples increases – the use of Y-chromosome haplogroups remains one of the most important tools for academics; this is especially so in the steppes, in light of the diversity found in populations from the Caucasus. A clear example comes from the Yamna – Corded Ware similarities:

After the publication of the 2015 papers, it was likely that Yamna expanded with haplogroup R1b-L23, but it has only become crystal clear that Yamna expanded through the steppes into Bell Beakers, now that we have data about the strict genetic homogeneity of the whole Yamna population from west to east (including Afanasevo), in contrast with contemporary Corded Ware peoples which expanded from a different forest-steppe population.

The presence of haplogroups Q and R1a-M459 (xM17) in Khvalynsk along with a R1b1a sample, which some interpreted as being akin to modern ‘mixed’ populations in the past, is likely to point instead to a period of Khvalynsk-Novodanilovka expansion with R1b-M269, where different small populations from the steppe were being integrated into the common Khvalynsk stock, but where differences are seen in material culture surrounding their burials, as supported by the finding of R1b1 in the Kuban area already in the first half of the 5th millennium. The case would be similar to the early ‘mixed’ Icelandic population.

Only after the emergence of the Samara culture (in the second half of the 6th millennium BC), with a sample of haplogroup R1b1a, does the obvious connection with Early Proto-Indo-Europeans start; and only after the appearance of late Sredni Stog and haplogroup R1a-M417 (ca. 4000 BC) is its connection with Uralic also clear. In previous population movements, I think more haplogroups were involved in migrations of small groups, and only some communities among them were eventually successful, expanding to be dominant, creating ever growing cultures during their expansions.

Indeed, if you think in terms of Uralic and Indo-European just as converging languages, and forget their potential genetic connection, then the genetic + linguistic picture becomes simplified, and the upper frontier of the 6th millennium BC with a division North Pontic (Mariupol) vs. Volga-Ural (Samara) is enough. However, tracing their movements backwards – with cultural expansions from west to east (with the expansion of farming), and earlier east to west (with hunter-gatherer pottery), and still earlier west to east (with the north-eastern technocomplex), offers an interesting way to prove their potential connection to macrofamilies, at least in terms of population movements.

corded-ware-uralic-qpgraph
Modified image from Tambets et al. (2018) Proportions of ancestral components in studied European and Siberian populations and the tested qpGraph model. a The qpGraph model fitting the data for the tested populations. Colour codes for the terminal nodes: pink—modern populations (‘Population X’ refers to test population) and yellow—ancient populations (aDNA samples and their pools). Nodes coloured other than pink or yellow are hypothetical intermediate populations. We putatively named nodes which we used as admixture sources using the main recipient among known populations. The colours of intermediate nodes on the qpGraph model match those on the admixture proportions panel. The NeolL (Neolithic Levant) ancestry selected in this qpGraph is likely to correspond (at least in part) to a specific Dzudzuana-like component present in the CHG-like population that admixed in the North Pontic area.

I am quite convinced right now that it would be possible to connect the expansion of R1b-L754 subclades with a speculative Nostratic (given the R1b-V88 connection with Afroasiatic, and the obvious connection of R1b-P297 with Eurasiatic). Paradoxically, the connection of an Indo-Uralic community in the steppes (after the separation of Yukaghir) with any lineage expansion (R1a-M17, R1b-M269, or even Q, I or J1) seems somehow blurrier than one year ago, possibly just because there are too many open possibilities.

David Reich says about the admixture with Neanderthals, which he helped discover:

At the conclusion of the Neanderthal genome project, I am still amazed by the surprises we encountered. Having found the first evidence of interbreeding between Neanderthals and modern humans, I continue to have nightmares that the finding is some kind of mistake. But the data are sternly consistent: the evidence for Neanderthal interbreeding turns out to be everywhere. As we continue to do genetic work, we keep encountering more and more patterns that reflect the extraordinary impact this interbreeding has had on the genomes of people living today.

I think this is a shared feeling among many of us who have made proposals about anything, to fear that we have made a gross, evident mistake, and constantly look for flaws. However, it seems to me that geneticists are more preoccupied with being wrong in their developed statistical methods, in the theoretical models they are creating, and not so much about errors in the true ancient ethnolinguistic picture human population genetics is (at least in theory) concerned about. Their publications are, after all, constantly associating genetic finds with cultures and (whenever possible) languages, so this aspect of their research should not be taken lightly.

Seeing how David Anthony or Razib Khan (among many others) have changed their previously preferred migration models as new data was published, and they continue to be respected in their own fields, I guess we can be confident that professionals with integrity are going to accept whatever new picture appears. While I don’t think that genetic finds can change what we can reconstruct with comparative grammar, I am also ready to revise guesstimates and routes of expansion of certain dialects if R1a-Z645 is shown to have accompanied Late Proto-Indo-Europeans during their expansion with Yamna, and later integrated somehow with Corded Ware.

However, taking into account the obsession of some with an ancestral, uninterrupted R1a—Indo-European association, and the lack of actual political repercussion of Neanderthal admixture, I think the most common nightmare that all genetic researchers should be worried about is to keep inflating this “Yamnaya ancestry”-based hornet’s nest, which has been constantly stirred up for the past two years, by rejecting it – or, rather, specifying it into its true complex nature.

This succession of corrections and redefinitions, coupled with the distinct Y-DNA bottleneck of each steppe population, will eventually lead to a completely different ethnolinguistic picture of the Pontic-Caspian region during the Eneolithic, which is likely to eventually piss off not only reasonable academics stubbornly attached to the CWC-IE idea, but also a part of those interested in daydreaming about their patrilineal ancestors.

Sometimes it’s better to just rip off the band-aid once and for all…

Featured image from The oldest pottery in hunter-gatherer communities and models of Neolithisation of Eastern Europe (2015), by Andrey Mazurkevich and Ekaterina Dolbunova.

Related

Interesting is today’s post in Ancient DNA Era: Is Male-driven Genetic Replacement always meaning Language-shift?

R1a-Z280 lineages in Srubna; and first Palaeo-Balkan R1b-Z2103?

herodotus-world-map

Scythian samples from the North Pontic area are far more complex than what could be seen at first glance. From the new Y-SNP calls we have now thanks to the publications at Molgen (see the spreadsheet) and in Anthrogenica threads, I think this is the basis to work with:

NOTE. I understand that writing a paper requires a lot of work, and probably statistical methods are the main interest of authors, editors, and reviewers. But it is difficult to comprehend how any user of open source tools can instantly offer a more complex assessment of the samples’ Y-SNP calls than professionals working on these samples for months. I think that, by now, it should be clear to everyone that Y-DNA is often as important as (sometimes even more important than) statistical tools to infer certain population movements, since admixture can change within few generations of male-biased migrations, whereas haplogroups can’t…

Srubna

Srubna-Andronovo samples are as homogeneous as they always were, dominated by R1a-Z645 subclades and CWC-related (steppe_MLBA) ancestry.

The appearance of one (possibly two) R-Z280 lineages in this mixed Srubna-Alakul region of the southern Urals and this early (1880-1690 BC, hence rather Pokrovka-Alakul) points to the admixture of R1a-Z93 and R1a-Z280 already in Abashevo, which also explains the wide distribution of both subclades in the forest zones of Central Asia.

If Abashevo is the cornerstone of the Indo-Iranian / Uralic community, as it seems, the genetic admixture would initially be quite similar, undergoing in the steppes a reduction to haplogroup R1a-Z93 (obviously not complete), at the same time as it expanded to the west with Pokrovka and Srubna, and to the east with Petrovka and Andronovo. To the north, similar reductions will probably be seen following the Seima-Turbino phenomenon.

NOTE. Another R1a-Z280 has been found in the recent sample from Bronze Age Poland (see spreadsheet). As it appears right now in ancient and modern DNA, there seems to be a different distribution between subclades:

  • R1a-Z280 (formed ca. 2900 BC, TMRCA ca. 2600 BC) appears mainly distributed today to the east, in the forest and steppe regions, with the most ‘successful’ expansions possibly related to the spread of Abashevo- and Battle Axe-related cultures (Indo-Iranian and Uralic alike).
  • R1a-M458 (formed ca. 2700 BC, TMRCA ca. 2700 BC) appears mainly distributed to the north, from central Europe to the east – but not in the steppe in aDNA, with the most ‘successful’ expansions to the west.

M458 lineages thus seem to have expanded in the steppe in sizeable numbers only after the Iranian expansions (see a map of modern R1a distributions), i.e. possibly with the expansion of Slavs. This supports the model whereby cultures from central-east Europe (like Trzciniec and Lusatian), accompanied mainly by M458 lineages, were responsible for the expansion of Proto-Balto-Slavic (and later Proto-Slavic).

The finding of haplogroup R1a-Z93, among them one Z2123, is no surprise at this point after other similar Srubna samples. As I said, the early Srubna expansion is most likely responsible for the Szólád Bronze Age sample (ca. 2100-1700 BC), and for the Balkans BA sample (ca. 1750-1625 BC) from Merichleri, due to incursions along the central-east European steppe.

cheek-pieces
Map of decorated bone/antler bridle cheek-pieces and whip handle equivalents. They are often local translations that remained faithful to the originals (from data in Piggott, 1965; Kristiansen & Larsson, 2005; David, 2007). Image from Vandkilde (2014).

Cimmerians

Cimmerian samples from the west show signs of continuity with R1a-Z93 lineages. Nevertheless, the sample of haplogroup Q1a-Y558, together with the ‘Pre-Scythian’ sample of haplogroup N (of the Mezőcsát Culture) in Hungary ca. 980-830 BC, as well as their PCA, seem to depict an origin of these Pre-Scythian peoples in populations related to the eastern Central Asian steppes, too.

NOTE. I will write more on different movements (unrelated to Uralic expansions) from Central and East Asia to the west, accompanied by Siberian ancestry and haplogroup N, in the post on Ugric-Samoyedic expansions.

Scythians

The Scythian of Z2123 lineage ca. 375-203 BC from the Volga (in Mathieson et al. 2015), together with the sample scy193 from Glinoe (probably also R1a-Z2123), without a date, as well as their common Steppe_MLBA cluster, suggest that Scythians, too, were at first probably quite homogeneous as is common among pastoralist nomads, and came thus from the Central Asian steppes.

The reduction in haplogroup variability among East Iranian peoples seems supported by the three new Late Sarmatian samples of haplogroup R1a-Z2124.

Approximate location of Glinoe and Glinoe Sad (with Starosilya to the south, in Ukrainian territory):

This initial expansion of Scythians does not mean that one can dismiss the western samples as non-Scythians, though, because ‘Scythian’ is a cultural attribution, based on materials. Confirming the diversity among western Scythians, a session at the recent ISBA 8:

Genetic continuity in the western Eurasian Steppe broken not due to Scythian dominance, but rather at the transition to the Chernyakhov culture (Ostrogoths), by Järve et al.

The long-held archaeological view sees the Early Iron Age nomadic Scythians expanding west from their Altai region homeland across the Eurasian Steppe until they reached the Ponto-Caspian region north of the Black and Caspian Seas by around 2,900 BP. However, the migration theory has not found support from ancient DNA evidence, and it is still unclear how much of the Scythian dominance in the Eurasian Steppe was due to movements of people and how much reflected cultural diffusion and elite dominance. We present new whole-genome results of 31 ancient Western and Eastern Scythians as well as samples pre- and postdating them that allow us to set the Scythians in a temporal context by comparing the Western Scythians to samples before and after within the Ponto-Caspian region. We detect no significant contribution of the Scythians to the Early Iron Age Ponto-Caspian gene pool, inferring instead a genetic continuity in the western Eurasian Steppe that persisted from at least 4,800–4,400 cal BP to 2,700–2,100 cal BP (based on our radiocarbon dated samples), i.e. from the Yamnaya through the Scythian period.

(…) Our results (…) support the hypothesis that the Scythian dominance was cultural rather than achieved through population replacement.
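The abstract gives its dates in (cal) BP, while the rest of this post uses BC. A minimal sketch of the conversion, assuming the usual convention that “present” in BP dating is fixed at 1950 CE and that there is no year zero; the function name `bp_to_calendar` is just illustrative:

```python
# "Present" in BP ("before present") dating is conventionally 1950 CE.
BP_REFERENCE_YEAR = 1950


def bp_to_calendar(bp: int) -> str:
    """Convert a (calibrated) BP date to a BCE/CE calendar year label."""
    year = BP_REFERENCE_YEAR - bp  # astronomical year numbering (0 = 1 BCE)
    if year > 0:
        return f"{year} CE"
    # There is no year zero in the BCE/CE calendar, hence the shift by one.
    return f"{1 - year} BCE"


# Dates quoted in the abstract above.
for bp in (4800, 4400, 2900, 2700, 2100):
    print(f"{bp} BP is roughly {bp_to_calendar(bp)}")
```

Under these assumptions, the continuity window of 4,800–4,400 to 2,700–2,100 cal BP corresponds roughly to 2850–2450 BC through 750–150 BC.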

Detail of the slide with admixture of Scythian groups in Ukraine:

scythians-admixture

The findings of those 31 samples seem to support what Krzewińska et al. (2018) found in a tiny region of Moldavia-south-western Ukraine (Glinoe, Glinoe Sad, and Starosilya).

The question, then, is as follows: if Scythian dominance was “cultural rather than achieved through population replacement”, where do the R1b-Z2103 lineages come from? One possibility, as I said in the previous post, is that they represent pockets of Iranian R1b lineages in the steppes descended from eastern Yamna, given that this haplogroup appears in modern populations from a wide region surrounding the steppes.

The other possibility, which is what some have proposed since the publication of the paper, is that they are related to Thracians, and thus to Palaeo-Balkan populations. About the previously published Thracian individuals in Sikora et al. (2014):

thracian-samples
Geographic origin of ancient samples and ADMIXTURE results. (A) Map of Europe indicating the discovery sites for each of the ancient samples used in this study. (B) Ancestral population clusters inferred using ADMIXTURE on the HGDP dataset, for k = 6 ancestral clusters. The width of the bars of the ancient samples was increased to aid visualization. https://doi.org/10.1371/journal.pgen.1004353.g001

For the Thracian individuals from Bulgaria, no clear pattern emerges. While P192-1 still shows the highest proportion of Sardinian ancestry, K8 more resembles the HG individuals, with a high fraction of Russian ancestry.

Despite their different geographic origins, both the Swedish farmer gok4 and the Thracian P192-1 closely resemble the Iceman in their relationship with Sardinians, making it unlikely that all three individuals were recent migrants from Sardinia. Furthermore, P192-1 is an Iron Age individual from well after the arrival of the first farmers in Southeastern Europe (more than 2,000 years after the Iceman and gok4), perhaps indicating genetic continuity with the early farmers in this region. The only non-HG individual not following this pattern is K8 from Bulgaria. Interestingly, this individual was excavated from an aristocratic inhumation burial containing rich grave goods, indicating a high social standing, as opposed to the other individual, who was found in a pit.

pca-thracians

The following are excerpts from A Companion to Ancient Thrace (2015), by Valeva, Nankov, and Graninger (emphasis mine):

Thracian settlements from the 6th c. BC on:

(…) urban centers were established in northeastern Thrace, whose development was linked to the growth of road and communication networks along with related economic and distributive functions. The early establishment of markets/emporia along the Danube took place toward the middle of the first millennium BCE (Irimia 2006, 250–253; Stoyanov in press). The abundant data for intensive trade discovered at the Getic village in Satu Nou on the right bank of the Danube provides another example of an emporion that developed along the main artery of communication toward the interior of Thrace (Conovici 2000, 75–76).

Undoubtedly the most prominent manifestation of centralization processes and stratification in the settlement system of Thrace arrives with the emergence of political capitals – the leading urban centers of various Thracian political formations.

getic-thracian
Image from Volf at Vol_Vlad LiveJournal.

Their relationships with Scythians and Greeks

The Scythian presence south of the Danube must be balanced with a Thracian presence north of the river. We have observed Getae there in Alexander’s day, settled and raising grain. For Strabo the coastlands from the Danube delta north as far as the river and Greek city of Tyras were the Desert of the Getae (7.3.14), notable for its poverty and tracklessness beyond the great river. He seems to suggest also that it was here that Lysimachus was taken alive by Dromichaetes, king of the Getae, whose famous homily on poverty and imperialism only makes sense on the steppe beyond the river (7.3.8; cf. Diod. 21.12; further on Getic possessions above the Danube, Paus. 1.9 with Delev 2000, 393, who seems rather too skeptical; on poverty, cf. Ballesteros Pastor 2003). This was the kind of discourse more familiarly found among Scythians, proud and blunt in the strength of their poverty. However, as Herodotus makes clear, simple pastoralism was not the whole story as one advanced round into Scythia. For he observes the agriculture practiced north and west of Olbia. These were the lands of the Alizones and the people he calls the Scythian Ploughmen, not least to distinguish them from the Royal Scythians east of Olbia, in whose outlook, he says, these agriculturalist Scythians were their inferiors, their slaves (Hdt. 4.20). The key point here is that, as we began to see with the Getan grain-fields of Alexander’s day, there was scope for Thracian agriculturalists to maintain their lifestyles if they moved north of the Danube, the steppe notwithstanding. It is true that it is movement in the other direction that tends to catch the eye, but there are indications in the literary tradition and, especially, in the archaeological record that there was also significant movement northward from Thrace across the Danube and the Desert of the Getae beyond it.

Greek literary sources were not much concerned with Thracian migration into Scythia, but we should observe the occasional indications of that process in very different texts and contexts. At the level of myth, it is to be remembered that Amazons were regularly considered to be of Thracian ethnicity from Archaic times onward and so are often depicted in Thracian dress in Greek art (Bothmer 1957; cf. Sparkes 1997): while they are most familiar on the south coast of the Black Sea, east of Sinope, they were also located on the north coast, especially east of the Don (the ancient Tanais). Herodotus reports an origin-story of the Sauromatians there, according to which this people had been created by the union of some Scythian warriors with Amazons captured on the south coast and then washed up on the coast of Scythia (4.110). While the story is unhistorical, it is not without importance. First, it reminds us that passage north from the Danube was not the only way that Thracians, Thracian influence, and Thracian culture might find their way into Scythia. There were many more and less circuitous routes, especially by sea, that could bring Thrace into Scythia. Secondly, the myth offered some ideological basis for the Sauromatian settlement in Thrace that Strabo records, for Sauromatians might claim a Thracian origin through their Amazon forebears. Finally, rather as we saw that Heracles could bring together some of the peoples of the region, we should also observe that Ares, whose earthly home was located in Thrace by a strong Greek and Roman tradition, seems also to have been a deity of special significance and special cult among the Scythians. So much was appropriate, especially from a Classical perspective, in associations between these two peoples, whose fame resided especially in their capacity for war.

skythen
Scythians: cultures and findings (ca. 7th-4th/3rd c. BC). Greek colonies marked with concentric circles.

This broad picture of cultural contact, interaction, and osmosis, beyond simple conflict, provides the context for a range of archaeological discoveries, which – if examined separately – may seem to offer no more than a scatter of peculiarities. Here we must acknowledge especially the pioneering work of Melyukova, who has done most to develop thinking on Thracian–Scythian interaction. As she pointed out, we have a good example of Thracian–Scythian osmosis as early as the mid-seventh century bce at Tsarev Brod in northeastern Bulgaria, where a warrior’s burial combines elements of Scythian and Thracian culture (Melyukova 1965). For, while the manner of his burial and many of the grave goods find parallels in Scythia and not Thrace, there are also goods which would be odd in a Scythian burial and more at home in a Thracian one of this period (notably a Hallstatt vessel, an iron knife, and a gold diadem). Also interesting in this regard are several stone figures found in the Dobrudja which resemble very closely figures of this kind (baby) known from Scythia (Melyukova 1965, 37–38). They range in date from perhaps the sixth to the third centuries bce, and presumably were used there – as in Scythia – to mark the burials of leading Scythians deposited in the area. Is this cultural osmosis? We should probably expect osmosis to occur in tandem with the movement of artefacts, so that only good contexts can really answer such questions from case to case. However, the broad pattern is indicated by a range of factors. Particularly notable in this regard is the observable development of a Thraco-Scythian form of what is more familiar as “Scythian animal style,” a term which – it must be understood – already embraces a range of types as we examine the different examples of the style across the great expanse from Siberia to the western Ukraine. 
As Melyukova observes, Thrace shows both items made in this style among Scythians and, more numerous and more interesting, a Thracian tendency to adapt that style to local tastes, with observable regional distinctions within Thrace itself. Among the Getae and Odrysians the adaptation seems to have been at its height from the later fifth century to the mid-third century (Melyukova 1965, 38; 1979).

The absence of local animal style in Bulgaria before the fifth century bce confirms that we have cultural influences and osmosis at work here, though that is not to say that Scythian tradition somehow dominated its Thracian counterpart, as has been claimed (pace Melyukova 1965, 39; contrast Kitov 1980 and 1984). Of particular interest here is the horse-gear (forehead-covers, cheek-pieces, bridle fittings, and so on) which is found extensively in Romania and Bulgaria as well as in Scythia, both in hoarded deposits and in burials. This exemplifies the development of a regional animal style, not least in silver and bronze, which problematizes the whole issue of the place(s) of its production. Accordingly, the regular designation as “Thracian” of horse-gear from the rich fourth century Scythian burial of Oguz in the Ukraine becomes at least awkward and questionable (further, Fialko 1995). And let us be clear that this is no minor matter, nor even part of a broader debate about the shared development of toreutics among Thracians and Scythians (e.g., Kitov 1980 and 1984). A finely equipped horse of fine quality was a strong statement and striking display of wealth and the power it implied

(…) while Thracian pottery appears at Olbia, Scythian pottery among Thracians is largely confined to the eastern limits of what should probably be regarded as Getic territory, namely the area close to the west of the Dniester, from the sixth century bce. Rather exceptional then is the Scythian pottery noted at Istros, which has been explained as a consequence of the Scythian pursuit of the withdrawing army of Darius and, possibly, a continued Scythian grip on the southern Danube in its aftermath (Melyukova 1965, 34). The archaeology seems to show us, therefore, that the elite Thracians and Scythians were more open to adaptation and acculturation than were their lesser brethren.

palaeo-balkan-languages
Paleo-Balkan languages in Eastern Europe between 5th and 1st century BC. From Wikipedia.

Conclusion

(…) we see distinct peoples and organizations, for example as Sitalces’ forces line up against the Scythians. Much more striking, however, against that general background, are the various ways in which the two peoples and their elites are seen to interact, connect, and share a cultural interface. We see also in Scyles’ story how the Greek cities on the coast of Thrace and Scythia played a significant role in the workings of relationships between the two peoples. It is not simply that these cities straddled the Danube, but also that they could collaborate – witness the honors for Autocles, ca. 300 bce (SEG 49.1051; Ochotnikov 2006) – and were implicated with the interactions of the much greater non-Greek powers around them. At the same time, we have seen the limited reality of familiar distinctions between settled Thracians and nomadic Scythians and the limited role of the Danube too in dividing Thrace and Scythia. The interactions of the two were not simply matters of dynastic politics and the occasional shared taste for artefacts like horse-gear, but were more profoundly rooted in the economic matrix across the region, so that “Scythian” nomadism might flourish in the Dobrudja and “Thracian-style” agriculture and settlement can be traced from Thrace across the Danube as far as Olbia. All of that offers scant justification for the Greek tendency to run together Thracians and Scythians as much the same phenomenon, not least as irrational, ferocious, and rather vulgar barbarians (e.g., Plato, Rep. 435b), because such notions were the result of ignorance and chauvinism. However, Herodotus did not share those faults to any degree, so that we may take his ready movement from Scythians to Thracians to be an indication of the importance of interaction between the two peoples whom he had encountered not only as slaves in the Aegean world, but as powerful forces in their own lands (e.g., Hdt. 4.74, where Thracian usage is suddenly brought into his account of Scythian hemp). 
Similarly, Thucydides, who quite without need breaks off his disquisition on the Odrysians to remark upon political disunity among the Scythians (Thuc. 2.97, a favorite theme: cf. Hdt. 4.81; Xen., Cyr. 1.1.4). As we have seen throughout this discussion, there were many reasons why Thracians might turn the thoughts of serious writers to Scythians and vice versa.

It seems, following Sikora et al. (2014), that Thracian ‘common’ populations would have more Anatolian Neolithic ancestry compared to more ‘steppe-like’ samples. But there were important differences even between the two nearby samples published from Bulgaria, which may account for the close interaction between Scythians and Thracians we see in Krzewińska et al. (2018), potentially reflected in the differences between the Central, Southern and the South-Central clusters (possibly related to different periods rather than peoples?).

If these R1b-Z2103 were descended from Thracian elites, this would be the first proof of Palaeo-Balkan populations showing mainly R1b-Z2103, as I expect. Their appearance together with haplogroup I2a2a1b1 (also found in Ukraine Neolithic and in the Yamna outlier from Bulgaria) seems to support this regional continuity, and thus a long-lasting cultural and ethnic border roughly around the Danube, similar to the one found in the northern Caucasus.

However, since these samples are some 2,500 years younger than the Yamna expansion to the south, and they are archaeologically Scythians, it is impossible to say. In any case, it would seem that the main expansion of R1a-Z645 lineages to the south of the Danube – and therefore those found among modern Greeks – was mediated by the Slavic expansions centuries later.

krzewinska-scythians-pca
Modified image from Krzewińska et al. (2018), with added Y-DNA haplogroups to each defined Scythian cluster and Sarmatians. Principal component analysis (PCA) plot visualizing 35 Bronze Age and Iron Age individuals presented in this study and in published ancient individuals in relation to modern reference panel from the Human Origins data set. See image with population references.

On the Northern cluster there is a sample of haplogroup R1b-P312 which, given its position on the PCA (apparently even more ‘modern Celtic’-like than the Hallstatt_Bylany sample from Damgaard et al. 2018), could be the product of the previous eastward Hallstatt expansion, although potentially also of a more recent one:

Especially important in the archaeology of this interior is the large settlement at Nemirov in the wooded steppe of the western Ukraine, where there has been considerable excavation. This settlement’s origins evidently owe nothing significant to Greek influence, though the early east Greek pottery there (from ca. 650 bce onward: Vakhtina 2007) and what seems to be a Greek graffito hint at its connections with the Greeks of the coast, especially at Olbia, which lay at the estuary of the River Bug on whose middle course the site was located (Braund 2008). The main interest of the site for the present discussion, however, is its demonstrable participation in the broader Hallstatt culture to its west and south (especially Smirnova 2001). Once we consider Nemirov and the forest steppe in connection with Olbia and the other locations across the forest steppe and coastal zone, together with the less obvious movements across the steppe itself, we have a large picture of multiple connectivities in which Thrace bulks large.

scythian-peoples-balkans
Early Iron Age cultures of the Carpathian basin ca. 7-6th century BC, including steppe-related groups. Ďurkovič et al. (2018).

While the above description of clear-cut R1a-Steppe and R1b-Balkans is attractive (and probably more reliable than admixture found in scattered samples of unclear dates), the true ancient genetic picture is more complicated than that:

  • There is nothing in the material culture of the published western Scythians to distinguish the supposed Thracian elites.
  • We have the sample I0575, an Early Sarmatian from the southern Urals (one of the few available) of haplogroup R1b-Z2106, which supports the presence of R1b-Z2103 lineages among Eastern Iranian-speaking peoples.
  • We also have DA30, a Sarmatian of I2b lineage from the central steppes in Kazakhstan (ca. 47 BC – 24 AD).
  • Other Sarmatian samples of haplogroup R remain undefined.
  • There is R1a-Z93 in a late Sarmatian-Hun sample, which complicates the picture of late pastoralist nomads further.

Therefore, the possibility of hidden pockets of Iranian peoples of R1b-Z2103 (maybe also R1b-P312) lineages remains the best explanation, and should not be discarded simply because of the prevalent haplogroups among modern populations, or because of the different clusters found, or else we risk obvious circular reasoning: “this sample is not (autosomally or in prevalent haplogroups) like those we already had from the steppe, ergo it is not from this or that steppe culture.” Hopefully, the upcoming paper by Järve et al. will help develop a clearer genetic transect of Iranian populations from the steppes.

All in all, the diversity among western Scythians probably represents one of the earliest difficult cases of acculturation to be studied with ancient DNA (obviously not the only one). Scythians combine unclear archaeological data and limited, conflicting proto-historical accounts (which are also difficult to contrast given the wide confidence intervals of radiocarbon dates) with different evolving clusters and haplogroups, especially in border regions with strong and continued interactions of cultures and peoples.

With emerging complex cases like these during the Iron Age, I am happy to see that at least earlier expansions show clearer Y-DNA bottlenecks, or else genetics would only add more data to argue about potential cultural diffusion events, instead of solving questions about proto-language expansions once and for all…

The genetic makings of South Asia – IVC as Proto-Dravidian

south-asian-language-families

Review (behind paywall) The genetic makings of South Asia, by Metspalu, Mondal, and Chaubey, Current Opinion in Genetics & Development (2018) 53:128-133.

Interesting excerpts (emphasis mine):

(…) the spread of agriculture in Europe was a result of the demic diffusion of early Anatolian farmers, it was discovered that the spread of agriculture to South Asia was mediated by a genetically completely different farmer population in the Zagros mountains in contemporary Iran (IF). The ANI-ASI cline itself was interpreted as a mixture of three components genetically related to Iranian agriculturalists, Onge and Early and Middle Bronze Age Steppe populations (Steppe_EMBA).

The first ever autosomal aDNA from South Asia comes from Northern Pakistan (Swat Valley, early Iron Age). This study presented altogether 362 aDNA samples from the broad South and Central Asia and contributes substantially to our understanding of the evolutionary past of South and Central Asia. The study redefines the three genetic strata that form the basis of the Indian Cline. The Indus Periphery (IP) component is composed of (varying proportions of): first, IF, second, Ancient Ancestral South Asians (AASI), which represents an ancient branch of human genetic variation in Asia arising from a population split contemporaneous with the splits of East Asian, Onge and Australian Aboriginal ancestors and third, West_Siberian Hunter gatherers (WS_HG).

The authors argue that IP could have formed the genetic base of the Indus Valley Civilization (IVC). Upon the collapse of the IVC IP contributes to the formation of both ASI and ANI. ASI is formed as IP admixes further with AASI. ANI in turn forms when IP admixes with the incoming Middle and Late Bronze Age Steppe (Steppe_MLBA) component, (rather than the Steppe_EMBA groups suggested earlier)

ane-whg-ehg-chg-wshg-steppe
A sketch of the peopling history of South Asia. Depicting the full complexity of available reconstructions is not attempted. Placing of population labels does not indicate precise geographic location or range of the population in question. Rather we aim to highlight the essentials of the recent advancements in the field. We divide the scenario into three time horizons: Panels (a) before 10 000 BCE (pre agriculture era.); (b) 10 000 BCE to 3000 BCE (agriculture era) and (c) 3000 BCE to prehistoric era/modern era. (iron age).

Dating of the arrival of the Austro-Asiatic speakers in South Asia-based on Y chromosome haplogroup O2a1-M95 expansion estimates yielded dates between 3000 and 2000 BCE [30]. However, admixture LD decay-based approach on genome-wide data suggests the admixture between South Asian and incoming Austro-Asiatic speakers occurred slightly later between 1800 and 0 BCE (Tätte et al. submitted). It is interesting that while the mtDNA variants of the Mundas are completely South Asian, the Y chromosome variation is dominated at >60% by haplogroup O2a which is phylogeographically nested in East Asian-specific paternal lineages.

In India, the speakers of Tibeto-Burman (TB) languages live in the Seven Sisters States in Northeast India and in the very north of the country. Genetically they show a clear East Asian origin and around 20% of subsequent admixture with South Asians within the last 1000 years. The genetic flavour of East Asia in TB is different from that in Munda speakers as the best surrogates for the East Asian admixing component are contemporary Han Chinese.

I found the simplistic migration maps especially interesting to illustrate ancient population movements. The emergence of EHG is supposed to involve a WHG:ANE cline, though, and this isn’t clear from the map. Also, there is new information on what may be at the origin of WHG and Anatolian hunter-gatherers.

From Reich’s recent session on South Asia at ISBA 8:

ani-asi-steppe-cline
– Tale of three clines, with clear indication that “Indus Periphery” samples drawn from an already-cosmopolitan and heterogeneous world of variable ASI & Iranian ancestry. (I know how some people like to pore over these pictures – so note red dots = just dummy data for illustration.)
– Some more certainty about primary window of steppe ancestry injection into S. Asia: 2000-1500 BC
Alexander M. Kim

Featured image: map of South Asian languages from http://llmap.org.

Resurge of local populations in the final Corded Ware culture period from Poland

poland-kujawy

Open access A genomic Neolithic time transect of hunter-farmer admixture in central Poland, by Fernandes et al. Scientific Reports (2018).

Interesting excerpts (emphasis mine, stylistic changes):

Most mtDNA lineages found are characteristic of the early Neolithic farmers in south-eastern and central Europe of the Starčevo-Kőrös-Criş and LBK cultures. Haplogroups N1a, T2, J, K, and V, which are found in the Neolithic BKG, TRB, GAC and Early Bronze Age samples, are part of the mitochondrial ‘Neolithic package’ (which also includes haplogroups HV, V, and W) that was introduced to Europe with farmers migrating from Anatolia at the onset of the Neolithic [17, 31].

A noteworthy proportion of Mesolithic haplogroup U5 is also found among the individuals of the current study. The proportion of haplogroup U5 already present in the earliest of the analysed Neolithic groups from the examined area differs from the expected pattern of diversity of mtDNA lineages based on a previous archaeological view and on the aDNA findings from the neighbouring regions which were settled by post-Linear farmers similar to BKG at that time. A large proportion of Mesolithic haplogroups in late-Danubian farmers in Kuyavia was also shown in previous studies concerning BKG samples based on mtDNA only, although these frequencies were derived on the basis of very small sample sizes.

y-dna-poland

A significant genetic influence of HG populations persisted in this region at least until the Eneolithic/Early Bronze Age period, when steppe migrants arrived to central Europe. The presence of two outliers from the middle and late phases of the BKG in Kuyavia associated with typical Neolithic burial contexts provides evidence that hunter-farmer contacts were not restricted to the final period of this culture and were marked by various episodes of interaction between two societies with distinct cultural and subsistence differences.

The identification of both mitochondrial and Y-chromosome haplogroup lineages of Mesolithic provenance (U5 and I, respectively) in the BKG support the theory that both male and female hunter-gatherers became part of these Neolithic agricultural societies, as has been reported for similar cases from the Carpathian Basin, and the Balkans. The identification of an individual with WHG affinity, dated to ca. 4300 BCE, in a Middle Neolithic context within a BKG settlement, provides direct evidence for the regional existence of HG enclaves that persisted and coexisted at least for over 1000 years, from the arrival of the LBK farmers ca. 5400 BCE until ca. 4300 BCE, in proximity with Neolithic settlements, but without admixing with their inhabitants.

poland-pca
Principal component analysis with modern populations greyed out on the background (top), ADMIXTURE results with K = 10 with samples from this study amplified (bottom).

The analysis of two Late Neolithic cultures, the GAC and CWC, shows that steppe ancestry was present only among the CWC individuals analysed, and that the single GAC individual had more WHG ancestry than previous local Neolithic individuals. (…) The CWC’s affinity to WHG, however, contrasts with results from published CWC individuals that identified steppe ancestry related to Yamnaya as the major contributor to the CWC genomes, while here we report also substantial contributions from WHG that could relate to the late persistence of pockets of WHG populations, as supported by the admixture results of N42 and the finding of the 4300-year-old N22 HG individual. These results agree with archaeological theories that suggest that the CWC interaction with incoming steppe cultures was complex and that it varied by region.

Some comments

About the analyzed CWC samples, it is remarkable that, even though they are somewhat related to each other, they do not form a tight cluster. Also noteworthy are their Y-DNA (I2a) and this result:

When compared to previously published CWC data, our CWC group (not individuals) is genetically significantly closer to WHG than to steppe individuals (Z = −4.898), a result which is in contrast with those for CWC from Germany (Z = 2.336), Estonia (Z = 0.555), and Latvia (Z = 1.553).
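To make the contrast between these four Z-scores easier to read, here is a minimal sketch that converts each into a two-sided normal p-value and flags it against a cutoff. The |Z| ≥ 3 threshold is an assumption (a common convention for f- and D-statistics, not necessarily the one used in the paper), and the sign convention (negative means closer to WHG) is inferred from the quote.

```python
import math

# Z-scores quoted above for CWC groups (WHG vs. steppe affinity).
z_scores = {
    "CWC Poland (this study)": -4.898,
    "CWC Germany": 2.336,
    "CWC Estonia": 0.555,
    "CWC Latvia": 1.553,
}

# Assumption: |Z| >= 3 counts as significant; the paper may use another cutoff.
Z_THRESHOLD = 3.0


def two_sided_p(z):
    """Two-sided p-value of a Z-score under a standard normal distribution."""
    return math.erfc(abs(z) / math.sqrt(2))


def summarize(scores, threshold=Z_THRESHOLD):
    """Return (group, z, p-value, significant?) for every group."""
    return [(g, z, two_sided_p(z), abs(z) >= threshold) for g, z in scores.items()]


for group, z, p, significant in summarize(z_scores):
    flag = "significant" if significant else "not significant"
    print(f"{group}: Z = {z:+.3f}, p = {p:.2g} ({flag})")
```

Under that assumed cutoff, only the Polish CWC group from this study deviates significantly, and it does so toward WHG.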

ancestry-proportions-poland
Ancestry proportions based on qpAdm. Visual representation of the main results presented in Supplementary Table S5. Populations from this study marked with an asterisk. Values and populations in brackets show the nested model results marked in green in Supplementary Table S5.

Włodarczak (2017) talks about the CWC period in Poland after ca. 2600 BC as a time of emergence of an allochthonous population, marked by the rare graves of this area, showing infiltrations initially mainly from Lesser Poland, and later (after 2500 BC) from the western Baltic zone.

Since forest sub-Neolithic populations would have probably given more EHG to the typical CWC population, these samples support the resurgence of ‘local’ pockets of GAC- or TRB-like groups with more WHG (and also Levant_Neolithic) ancestry.

The known presence of I2a2a1b lineages in GAC groups in Poland also supports this interpretation, and the persistence of such pockets of pre-steppe-like populations is also seen with the same or similar lineages appearing in comparable ‘resurgence’ events in Central Europe, e.g. in samples from the Únětice and Tumulus cultures.

About the Bronze Age sample, we have at last official confirmation of haplogroup R1a1a (sadly no subclade*) at the very beginning of the Trzciniec period, in a region between western (Iwno) and eastern (Strzyżów) groups related to Mierzanowice, which has to be put in relation with the samples from the final Trzciniec period in the Baltic published in Mittnik et al. (2018).

EDIT (8 OCT 2018): More specific subclades have been published, including a R1a-Z280 lineage for the Bronze Age sample (see spreadsheet).

This confirms the early resurgence of R1a-Z645 (probably R1a-Z282) lineages at the core of the developing East European Bronze Age, a province of the European Bronze Age that emerged from evolving Bell Beaker groups in Poland.

bell-beakers-poland-kujawy
Arrival of Bell Beakers in Poland after ca. 2400 BC, and their origin in other BBC centres (Czebreszuk and Szmyt 2011).

I don’t have any hope that the Balto-Slavic evolution through BBC Poland → Mierzanowice/Iwno → Trzciniec → Lusatian cultures is going to be confirmed any time soon, until we have a complete trail of samples to follow all the way to historic Slavs of the Prague culture. However, I do think that the current data on central-east Europe – and the recent data we are receiving from north-east Europe and the Iranian steppes, at odds with the Indo-Slavonic alternative – supports this model.

I guess that, in the end, similar to how the Yamna vs. Corded Ware question is being solved, the real route of expansion of Proto-Balto-Slavic (supposedly spoken ca. 1500-1000 BC) is probably going to be decided by the expansion of either R1a-M458 (from the west) or R1a-Z280 lineages (from the east), because the limited precision of genetic data and analyses available today is going to show ‘modern Slavic’-like populations from the whole eastern half of Europe for the past 4,000 years…

Early Iranian steppe nomadic pastoralists also show Y-DNA bottlenecks and R1b-L23

New paper (behind paywall) Ancient genomes suggest the eastern Pontic-Caspian steppe as the source of western Iron Age nomads, by Krzewińska et al. Science Advances (2018) 4(10):eaat4457.

Interesting excerpts (emphasis mine, some links to images and tables deleted for clarity):

Late Bronze Age (LBA) Srubnaya-Alakulskaya individuals carried mtDNA haplogroups associated with Europeans or West Eurasians (17) including H, J1, K1, T2, U2, U4, and U5 (table S3). In contrast, the Iron Age nomads (Cimmerians, Scythians, and Sarmatians) additionally carried mtDNA haplogroups associated with Central Asia and the Far East (A, C, D, and M). The absence of East Asian mitochondrial lineages in the more eastern and older Srubnaya-Alakulskaya population suggests that the appearance of East Asian haplogroups in the steppe populations might be associated with the Iron Age nomads, starting with the Cimmerians.

scythian-cimmerian-sarmatian-y-dna-mtdna

UPDATE (5 OCT 2018): Some Y-SNP calls have been published in a Molgen thread, with:

  • Srubna samples have possibly two R1a-Z280, three R1a-Z93.
  • Cimmerians may not have R1b: cim357 is reported as R1a.
  • Some Scythians have low coverage to the point where it is difficult to assign even a reliable haplogroup (they report hg I2 for scy301, or E for scy197, probably based on some shared SNPs?), but those which can be reliably assigned seem R1b-Z2103 [hence probably the use of question marks and asterisks in the table, and the assumption of the paper that all Scythians are R1b-L23]:
    • The most recent subclade is found in scy305: R1b-Z2103>Z2106 (Z2106+, Y12538/Z8131+)
    • scy304: R1b-Z2103 (M12149/Y4371/Z8128+).
    • scy009: R1b-P312>U152>L2 (P312+, U152?, L2+)?
  • Sarmatians are apparently all R1a-Z93 (including tem002 and tem003).
  • You can read here the Excel file with (some probably as speculative as the paper’s own) results.
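The calls above can be tallied per group with a short sketch like the following. It only includes sample IDs that the list names explicitly, so the unnamed Srubna samples and the remaining Sarmatians are left out. The `major_clade` and `tally` helpers are hypothetical names introduced here, and a trailing "?" marks calls that the thread itself treats as uncertain.

```python
from collections import Counter

# Y-SNP calls as reported in the list above; "?" = flagged as uncertain there.
calls = {
    "cim357": "R1a",
    "scy305": "R1b-Z2103>Z2106",
    "scy304": "R1b-Z2103",
    "scy009": "R1b-P312>U152>L2?",
    "scy301": "I2?",
    "scy197": "E?",
    "tem002": "R1a-Z93",
    "tem003": "R1a-Z93",
}


def major_clade(call):
    """Collapse a call such as 'R1b-Z2103>Z2106' to its leading label ('R1b')."""
    return call.rstrip("?").split("-")[0].split(">")[0]


def tally(sample_calls, prefix):
    """Count leading clade labels among samples whose ID starts with prefix."""
    return Counter(
        major_clade(call)
        for sample, call in sample_calls.items()
        if sample.startswith(prefix)
    )


uncertain = sorted(s for s, c in calls.items() if c.endswith("?"))

print("Scythian-context samples:", dict(tally(calls, "scy")))
print("Sarmatian samples:", dict(tally(calls, "tem")))
print("Flagged as uncertain:", uncertain)
```

Keeping the question marks in the data rather than dropping them makes it easy to rerun the tally with only the reliable calls.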

    About the PCA

    1. Srubnaya-Alakulskaya individuals exhibited genetic affinity to northern and northeastern present-day Europeans, and these results were also consistent with outgroup f3 statistics.
    2. The Cimmerian individuals, representing the time period of transition from Bronze to Iron Age, were not homogeneous regarding their genetic similarities to present-day populations according to the PCA. F3 statistics confirmed the heterogeneity of these individuals in comparison with present-day populations.
    3. The Scythians reported in this study, from the core Scythian territory in the North Pontic steppe, showed high intragroup diversity. In the PCA, they are positioned as four visually distinct groups compared to the gradient of present-day populations:
      1. A group of three individuals (scy009, scy010, and scy303) showed genetic affinity to north European populations (…).
      2. A group of four individuals (scy192, scy197, scy300, and scy305) showed genetic similarities to southern European populations (…).
      3. A group of three individuals (scy006, scy011, and scy193) located between the genetic variation of Mordovians and populations of the North Caucasus (…). In addition, one Srubnaya-Alakulskaya individual (kzb004), the most recent Cimmerian (cim357), and all Sarmatians fell within this cluster. In contrast to the Scythians, and despite being from opposite ends of the Pontic-Caspian steppe, the five Sarmatians grouped close together in this cluster.
      4. A group of three Scythians (scy301, scy304, and scy311) formed a discrete group between the SC and SE and had genetic affinities to present-day Bulgarian, Greek, Croatian, and Turkish populations (…).
      5. Finally, one individual from a Scythian cultural context (scy332) is positioned outside of the modern West Eurasian genetic variation (Fig. 1C) but shared genetic drift with East Asian populations.
    scythian-cimmerian-pca
    Radiocarbon ages and geographical locations of the ancient samples used in this study. Figure panels presented (Left) Bar plot visualizing approximate timeline of presented and previously published individuals. (Right) Principal component analysis (PCA) plot visualizing 35 Bronze Age and Iron Age individuals presented in this study and in published ancient individuals (table S5) in relation to modern reference panel from the Human Origins data set (41).
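As a quick consistency check on the Scythian clusters listed above, the sample IDs can be put into a small lookup table. This is only a sketch: the cluster labels are shorthand invented here (the paper does not name them this way), and only Scythian-context samples are included, not the Srubnaya-Alakulskaya, Cimmerian and Sarmatian individuals that also fall in the third cluster.

```python
# Scythian sample IDs per PCA cluster, copied from the numbered list above.
# Labels are illustrative shorthand, not names used in the paper.
pca_groups = {
    "north European": ["scy009", "scy010", "scy303"],
    "southern European": ["scy192", "scy197", "scy300", "scy305"],
    "Mordovian / North Caucasus": ["scy006", "scy011", "scy193"],
    "Bulgarian / Greek / Croatian / Turkish": ["scy301", "scy304", "scy311"],
    "outside West Eurasian variation": ["scy332"],
}


def invert(groups):
    """Map each sample ID to its cluster, failing on any duplicate assignment."""
    sample_to_group = {}
    for label, samples in groups.items():
        for sample in samples:
            if sample in sample_to_group:
                raise ValueError(f"{sample} is listed in two clusters")
            sample_to_group[sample] = label
    return sample_to_group


sample_to_group = invert(pca_groups)
sizes = {label: len(samples) for label, samples in pca_groups.items()}

for label, n in sizes.items():
    print(f"{label}: {n}")
print("Scythian-context individuals listed:", len(sample_to_group))
```

The same table can be used to look up where a sample with a given Y-DNA call sits in the PCA, e.g. `sample_to_group["scy305"]`.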

    Cimmerians

    The presence of an SA component (as well as finding of metals imported from Tien Shan Mountains in Muradym 8) could therefore reflect a connection to the complex networks of the nomadic transmigration patterns characteristic of seasonal steppe population movements. These movements, although dictated by the needs of the nomads and their animals, shaped the economic and social networks linking the outskirts of the steppe and facilitated the flow of goods between settled, semi-nomadic, and nomadic peoples. In contrast, all Cimmerians carried the Siberian genetic component. Both the PCA and f4 statistics supported their closer affinities to the Bronze Age western Siberian populations (including Karasuk) than to Srubnaya. It is noteworthy that the oldest of the Cimmerians studied here (cim357) carried almost equal proportions of Asian and West Eurasian components, resembling the Pazyryks, Aldy-Bel, and Iron Age individuals from Russia and Kazakhstan (12). The second oldest Cimmerian (cim358) was also the only one with both uniparental markers pointing toward East Asia. The Q1* Y chromosome sublineage of Q-M242 is widespread among Asians and Native Americans and is thought to have originated in the Altai Mountains (24)

    Scythians

    In contrast to the eastern steppe Scythians (Pazyryks and Aldy-Bel) that were closely related to Yamnaya, the western North Pontic Scythians were instead more closely related to individuals from Afanasievo and Andronovo groups. Some of the Scythians of the western Pontic-Caspian steppe lacked the SA and the East Eurasian components altogether and instead were more similar to a Montenegro Iron Age individual (3), possibly indicating assimilation of the earlier local groups by the Scythians.

    Toward the end of the Scythian period (fourth century CE), a possible direct influx from the southern Ural steppe zone took place, as indicated by scy332. However, it is possible that this individual might have originated in a different nomadic group despite being found in a Scythian cultural context.

    scythian-alakul-variation
    Genetic diversity and ancestral components of Srubnaya-Alakulskaya population (here called “Srubnaya”). (Left) Mean f3 statistics for Srubnaya and other Bronze Age populations. Srubnaya group was color-coded the same as with PCA. (Right) Pairwise mismatch estimates for Bronze Age populations.

    Comments

    I am surprised to find this new R1b-L23-based bottleneck in Eastern Iranian expansions so late, but admittedly – based on data from later times in the Pontic-Caspian steppe near the Caucasus – it was always a possibility. The fact that pockets of R1b-L23 lineages remained somehow ‘hidden’ in early Indo-Iranian communities was clear already since Narasimhan et al. (2018), as I predicted could happen, and is compatible with the limited archaeological data on Sintashta-Potapovka populations outside fortified settlements. I already said that Corded Ware was out of Indo-European migrations then; this further supports it.

    Even with all these data coming just from a north-west Pontic steppe region (west of the Dnieper), these ‘Cimmerians’ – or rather the ‘Proto-Scythian’ nomadic cultures appearing before ca. 800 BC in the Pontic-Caspian steppes – are shown to be probably formed by diverse peoples from Central Asia who brought about the first waves of Siberian ancestry (and Asian lineages) seen in the western steppes. You can read about a Cimmerian-related culture, Ananino, key for the evolution of Finno-Permic peoples.

    Also interesting about the Y-DNA bottleneck seen here is the rejection of the supposed continuous western expansions of R1a-Z645 subclades with steppe tribes since the Bronze Age, and thus a clearer link of the Hungarian Árpád dynasty (of R1a-Z2123 lineage) to either the early Srubna-related expansions or – much more likely – to the actual expansions of Hungarian tribes near the Urals in historic times.

    NOTE. I will add the information of this paper to the upcoming post on Ugric and Samoyedic expansions, and the late introduction of Siberian ancestry to these peoples.

    A few interesting lessons to be learned:

    • Remember the fantasy story about that supposed steppe nomadic pastoralist society sharing different Y-DNA lineages? You know, that Yamna culture expanding with R1b from Khvalynsk-Repin into the whole Pontic-Caspian steppes and beyond, developing R1b-dominated Afanasevo, Bell Beaker, and Poltavka, but suddenly appearing (in the middle of those expansions through the steppes) as a different culture, Corded Ware, to the north (in the east-central European forest zone) and dominated by R1a? Well, it hasn’t happened with any other steppe migration, so…maybe Proto-Indo-Europeans were that kind of especially friendly language-teaching neighbours?
    • Remember that ‘pure-R1a’ Indo-Slavonic society emerged from Sintashta ca. 2100 BC? (or even Graeco-Aryan??) Hmmmm… Another good fantasy story that didn’t happen; just like a central-east European Bronze Age Balto-Slavic R1a continuity didn’t happen, either. So, given that cultures from around Estonia are those showing the closest thing to R1a continuity in Europe until the Iron Age, I assume we have to get ready for the Gulf of Finland Balto-Slavic soon.
    • Remember that ‘pure-R1a’ expansion of Indo-Europeans based on the Tarim Basin samples? This paper means ipso facto an end to the Tarim Basin – Tocharian artificial controversy. The Pre-Tocharian expansion is represented by Afanasevo, and whether or not (Andronovo-related) groups of R1a-Z645 lineages replaced part or eventually all of its population before, during, or after the Tocharian expansion into the Tarim Basin, this does not change the origin of the language split and expansion from Yamna to Central Asia; just like this paper does not change the fact that these steppe groups were Proto-Iranian (Srubna) and Eastern Iranian (Scythian) speakers, regardless of their dominant haplogroup.
    • And, best of all, remember the Copenhagen group’s recent R1a-based “Indo-Germanic” dialect revival vs. the R1b-Tocharo-Italo-Celtic? Yep, they made that proposal, in 2018, based on the obvious Yamna—R1b-L23 association, and the desire to support Kristiansen’s model of Corded Ware – Indo-European expansion. Pepperidge Farm remembers. This new data on Early Iranians means another big NO to that imaginary R1a-based PIE society. But good try to go back to Gimbutas’ times, though.
    olander-classificatoin
    Olander’s (2018) tree of Indo-European languages. Presented at Languages and migrations in pre-historic Europe (7-12 Aug 2018)

    Do you smell that fresher air? It’s the Central and East European post-Communist populist and ethnonationalist bullshit (viz. pure blond R1a-based Pan-Nordicism / pro-Russian Pan-Slavism / Pan-Eurasianism, as well as Pan-Turanism and similar crap from the 19th century) going down the toilet with each new paper.

    EDIT (5 OCT 2018): It seems I was too quick to rant about the consequences of the paper without taking into account the complexity of the data presented. It is not the first time this impulsivity happens; I guess it depends on my mood and on the time I have to write a post on the specific work day…

    While the data on Srubna, Cimmerians, and Sarmatians shows clearer Y-DNA bottlenecks (of R1a-Z645 subclades) with the new data, the Scythian samples remain controversial, because of the many doubts about the haplogroups (although the most certain cases are R1b-Z2103), their actual date, and cultural attribution. However, I doubt they belong to other peoples, given the expansionist trends of steppe nomads before, during, and after Scythians (as shown in statistical analyses), so most likely they are Scythian or ‘Para-Scythian’ nomadic groups that probably came from the east, whether or not they incorporated Balkan populations. This is further supported by the remaining R1b-P312 and R1b-Z2103 populations in and around the modern Eurasian steppe region.

    scythian-peoples-balkans
    Early Iron Age cultures of the Carpathian basin ca. 7-6th century BC, including steppe groups Basarabi and Scythians. Ďurkovič et al. (2018).

    You can find an interesting and detailed take on the data published (in Russian) at Vol-Vlad’s LiveJournal (you can read an automatic translation from Google). I think that post is maybe too detailed in debunking all information associated with the supposed Scythians, to the point where just a single sample seems to be an actual Scythian (?!), but it is nevertheless interesting to read about the potential pitfalls of the study.
