Samoyedic shows Yeniseic substrate; both influenced Tocharian

chalcolithic-late-tocharian

Open access paper “The deviant typological profile of the Tocharian branch of Indo-European may be due to Uralic substrate influence”, by Peyrot, Indo-European Linguistics (2019).

NOTE. This seems to be part of the master’s thesis by Abel Warries, but the paper is authored only by Peyrot.

Interesting excerpts (emphasis mine):

1. The stop system

The loss in Tocharian of the Proto-Indo-European obstruent distinctions conventionally noted as voice and aspiration is a very strong indication of foreign influence. Since Proto-Indo-European roots mostly have at least one stop, and often two, the merger of all three stop series into one must have led to massive homonymy and subsequently to heavy restructuring of the lexicon. It is difficult to see how these changes could be motivated language-internally.

tocharian-consonants

It is this innovative typological feature of Tocharian that is the strongest indication of Uralic influence (cf. e.g. Bednarczuk 2015:56). A single stop series as found in Tocharian is reconstructed for Proto-Uralic as well as for Proto-Samoyedic, while other possibly relevant languages all show a system with a contrast between voiced and unvoiced stops, i.e. Proto-Yeniseian, Old Iranian and Yukaghir, or, in Proto-Turkic, a contrast between strong and weak obstruents (see also below).

For Proto-Uralic, Janhunen (1982:23) reconstructs the following obstruents: *k, *c, *t, *p; *δ, *δ´; and *ś, *s. With the development of *s to *t, *ś to *s, *δ to *r and *δ´ to *j, the Proto-Samoyedic obstruent system had become: *k, *c, *t, *p, *s (a secondary *ś arose later). The Tocharian obstruent system is much closer to both these reconstructed obstruent systems than to the Proto-Indo-European system that is commonly assumed.
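As a rough illustration of the changes just listed, the sketch below applies the four shifts simultaneously to Janhunen's Proto-Uralic obstruent inventory. The names used (`PU_OBSTRUENTS`, `SHIFTS`, `SONORANT_OUTCOMES`, `derive_psam_obstruents`) are my own, and treating *r and *j as leaving the obstruent class is an assumption made so that the result can be compared with the five-member set given above.

```python
# Proto-Uralic obstruents as reconstructed by Janhunen (1982:23).
PU_OBSTRUENTS = ["k", "c", "t", "p", "δ", "δ´", "ś", "s"]

# Shifts on the way to Proto-Samoyedic, as listed in the text.
SHIFTS = {"s": "t", "ś": "s", "δ": "r", "δ´": "j"}

# Assumption: the outcomes *r and *j count as sonorants, not obstruents.
SONORANT_OUTCOMES = {"r", "j"}


def derive_psam_obstruents(inventory, shifts):
    """Apply all shifts at once (not chained) and drop non-obstruent outcomes."""
    result = []
    for sound in inventory:
        outcome = shifts.get(sound, sound)  # each sound is looked up exactly once
        if outcome in SONORANT_OUTCOMES:
            continue
        if outcome not in result:  # old *s merges with the existing *t
            result.append(outcome)
    return result


print(derive_psam_obstruents(PU_OBSTRUENTS, SHIFTS))
# ['k', 'c', 't', 'p', 's']
```

Looking each sound up only once matters here: chaining the rules one after another in the wrong order would send *ś to *s and then on to *t.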

tocharian-uralic-consonants

Interestingly, from the perspective of a two-velar series reconstructed for the parent Late Proto-Indo-European, Tocharian thus shows a satemization trend and Uralic influence similar to (but qualitatively different from) the one seen in Balto-Slavic and Indo-Iranian, probably due to the less marked population replacement evidenced by the continuity of Afanasievo-related ancestry among Iron Age Common Tocharians.

2. The vowel system

(…) the development of the Tocharian vowel system can be understood very well in light of a South Siberian vowel system today represented by the Yeniseian language Ket. This South Siberian vowel system is different from both the Proto-Tocharian and the Proto-Uralic and Proto-Samoyedic vowel systems. However, a successful comparison is possible when intermediate phases are taken into account: a Pre-Proto-Tocharian phase between Proto-Indo-European and Proto-Tocharian; and a Pre-Proto-Samoyedic phase between Proto-Uralic and Proto-Samoyedic. For a Pre-Proto-Tocharian phase, a vowel system identical to that of Ket can be reconstructed. For Proto-Samoyedic, several different reconstructions of the vowel system have been proposed. Depending on which reconstruction turns out to be correct, a Pre-Proto-Samoyedic vowel system can be reconstructed that is close to the Ket system or perhaps even identical to it.

The basic vowel changes from Proto-Indo-European to Proto-Tocharian are the following (Ringe 1996; Hackstein 2017):

tocharian-vowel-system

It is the seven-vowel system of Pre-Proto-Tocharian stage 5 above that is structurally identical to the South Siberian system represented by Ket. According to Vajda (2004:5), Ket ɨ and ə are further back than IPA central [ɨ] and [ə], but not as far back as the unrounded back vowels [ɯ] and [ɤ] of IPA. The allophonic variation in the mid vowels e, ə, o is correlated with tone: they are pronounced as high-mid [e, ə, o] with high-even tone, and as low-mid [ɛ, ʌ, ɔ] elsewhere (Vajda l.c.).
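The tone-conditioned allophony of the Ket mid vowels described in this excerpt can be written down as a small lookup. This is only a sketch of the rule as quoted: the function name `realize` is mine, and any tone label other than "high-even" (such as "falling" below) is a hypothetical placeholder that simply stands for "elsewhere".

```python
# Mid vowels and their (high-mid, low-mid) realizations, as quoted from Vajda.
MID_VOWEL_ALLOPHONES = {
    "e": ("e", "ɛ"),
    "ə": ("ə", "ʌ"),
    "o": ("o", "ɔ"),
}


def realize(vowel, tone):
    """Return the surface quality of a Ket vowel under the quoted rule."""
    if vowel not in MID_VOWEL_ALLOPHONES:
        return vowel  # the rule only concerns the mid vowels e, ə, o
    high_mid, low_mid = MID_VOWEL_ALLOPHONES[vowel]
    # High-mid only with high-even tone; low-mid everywhere else.
    return high_mid if tone == "high-even" else low_mid


print(realize("o", "high-even"))  # o
print(realize("o", "falling"))    # ɔ  ("falling" is a placeholder tone label)
```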

Obviously, this parallel with Ket can only be meaningful for Tocharian linguistic prehistory if the same vowel system can be reconstructed for earlier stages. Indeed, Vajda assumes an original Pre-Proto-Yeniseian five-vowel system with i, a, ʌ, o, u that was in Common Yeniseian enlarged with *e and *ɨ (2010:78–79).

tocharian-ket-siberian-vowel

Pre-Proto-Samoyedic

Of the eleven vowels reconstructed for Proto-Samoyedic by Janhunen and Sammallahti, the following arose in the course of Pre-Proto-Samoyedic:

  • *ö is rare and was clearly added at a late stage;
  • *ü arose secondarily, amongst others from PU *i, while PU *ü changed to PSam. *i;
  • *ä arose secondarily, while PU *ä changed to PSam. *e;
  • *ə in first syllables, or back *ə̑ and front *ə̈, arose secondarily from *u and *i.

Since these four vowels arose secondarily, the following seven-vowel system can be assumed for a very early stage of Pre-Proto-Samoyedic. This system is structurally identical to the system of Ket and to that reconstructed for Pre-Proto-Tocharian:

pre-proto-samoyedic-tocharian-vowels

The vowel system of Ket, which has also been reconstructed for Pre-Proto-Tocharian, and which may possibly be reconstructed for Pre-Proto-Samoyedic as well, has a further parallel in Siberia: it is very close to that reconstructed for Proto-Yukaghir by Nikolaeva (2006:57).

It is attractive to think that the imbalances of the Yukaghir vowel system and vowel harmony reflect the adaptation of an original system with front rounded *ü and *ö to a system very similar to that seen in Yeniseian, Pre-Proto-Samoyedic and Pre-Proto-Tocharian.

ket-substrate-river-names
Location of contemporary speakers of Ket (shown in black) and of Yeniseian groups in 1600 as well as Yeniseian substrate river names (marked by labels such as -ses)

3. Agglutinative case marking and case functions

Although other Indo-European languages also occasionally show agglutinative case markers, one of the most striking typological characteristics of Tocharian are the agglutinative so-called “secondary” cases. It is obvious that for such a major shift in language type substrate influence must be considered as a serious option.

The key to identifying the model of the Tocharian case system is to be found in the functions of the cases. On the functional level, the Tocharian case system shows the following non-Indo-European peculiarities: it lacks a dative, whose functions are fulfilled by the genitive; and it has a local case termed “perlative” which denotes movement along, through or over something, as well as a comitative case denoting accompaniment.

Another interesting functional phenomenon is the lack of a dative in Tocharian. Here the best match is offered by Uralic, where nominative, accusative and genitive are generally analysed as being the “grammatical cases,” while the remaining cases are the “local cases.”

Tocharian, in spite of its comitative, agrees better with the Samoyedic case system than with the more elaborate sets of e.g. Finnish and Hungarian: there is no inessive : adessive or ablative : elative contrast. The Ket system, too, is more elaborate than the Tocharian set.

Evaluation and interpretation of the parallels

I consider the evidence from the stop system (§ 2.1), the vowel system (§ 2.2) and the agglutinative case system (§ 2.3) as the strongest indications of language contact. The Tocharian stop system with only voiceless stops is the best evidence for Uralic influence. The vowel system shows neat parallels with Yeniseian and Pre-Proto-Samoyedic. Taken together, this suggests that the Uralic variety with which Tocharian was in contact was a form of Pre-Proto-Samoyedic. Agglutinative case systems are widely found in Siberia and Eastern Central Asia, but the case functions, in particular the Tocharian perlative, best match Uralic and comparable systems in South Siberia.

The perlative is the strongest indication of Siberian, and most probably Uralic or Pre-Proto-Samoyedic influence. A similar local case is widely found across Uralic and in Samoyedic, and also in Yukaghir and Ket, but not in Turkic.
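To make the logic of this evaluation easier to follow, the sketch below tabulates two of the features as they are stated in the excerpts above and checks which candidate contact languages agree with Tocharian on both. It is a simplification under stated assumptions: proto-language statements (e.g. Proto-Uralic, Proto-Turkic, Proto-Yeniseian) are folded into family labels, the dictionary keys are my own, and `None` marks a value that the excerpts do not state.

```python
# Values transcribed from the excerpts above; None means "not stated there".
FEATURES = {
    "Uralic":          {"single_stop_series": True,  "perlative_like_case": True},
    "Samoyedic":       {"single_stop_series": True,  "perlative_like_case": True},
    "Yeniseian (Ket)": {"single_stop_series": False, "perlative_like_case": True},
    "Yukaghir":        {"single_stop_series": False, "perlative_like_case": True},
    "Turkic":          {"single_stop_series": False, "perlative_like_case": False},
    "Old Iranian":     {"single_stop_series": False, "perlative_like_case": None},
}


def matches_tocharian(features):
    """Names of the groups that share every listed feature with Tocharian."""
    return sorted(
        name
        for name, values in features.items()
        if all(value is True for value in values.values())
    )


print(matches_tocharian(FEATURES))
# ['Samoyedic', 'Uralic']
```

The table says nothing about the vowel system, which in the paper is what narrows Uralic down to Pre-Proto-Samoyedic.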

bronze-age-early-chemurchek

bronze-age-early-tocharian

Conclusion

The author ends by trying to fit the relative chronology of a Samoyedic and Tocharian spread from the Cis-Urals to the ideas set forth (mainly) by the Copenhagen group, with which he has collaborated in the past by interpreting their results from a linguistic perspective. Hence the difficulty in finding settings that might fit the proposed contacts.

I think it is self-evident that the push of the Abashevo-related, Seima-Turbino-mediated Andronovo-like cultural horizon perfectly explains the expansion of Pre-Proto-Samoyedic into the previous Afanasievo territory, strongly influencing the Chemurchek and related populations that most likely represent the evolving Tocharian-speaking community.

My recent video of the expansion of Indo-Europeans illustrates this linguistic evolution quite accurately (although, admittedly, I didn’t put much effort into making the maps follow Palaeosiberian languages).

bronze-age-middle-chemurchek

bronze-age-middle-tocharian

The finding of an Afanasievo-like population in Iron Age Tian Shan, including haplogroup replacement (among them hg. R1a-M417, typical of Disintegrating Uralians), as well as the finding of R1a-Z645 subclades up to the Deer Stone-Khirigsuur Complex in northern Mongolia, confirms this setting of a sudden expansion of (originally) Uralic-speaking populations through northern Eurasia up to Lake Baikal, disrupting the Afanasievo-like Tocharian-speaking community.

Similarly, the highly divergent genetic make-up of the Samoyedic population relative to other Uralic groups is consistent with the dilution of their typically Uralic Corded Ware ancestry among Siberian populations, on top of the multiple acculturation events of traditionally multilingual North Siberian populations (especially among Northern Samoyeds, similar to other Circum-Arctic groups).

This paper is not the first, and certainly not the last, to confirm strong language contacts of Uralic and Indo-European dialects with the previous native speakers of Siberia, such as Palaeosiberians and Altaic peoples, causing the aberrant (but seemingly closely related) traits of Samoyedic and Tocharian, typical of European languages introduced into an area foreign to Indo-Uralic languages.

Related

“Steppe ancestry” step by step (2019): Mesolithic to Early Bronze Age Eurasia

yamnaya-gac-maykop-corded-ware-bell-beaker

The recent update on the Indo-Anatolian homeland in the Middle Volga region and its evolution as the Indo-Tocharian homeland in the Don–Volga area as described in Anthony (2019) has, at last, a strong scientific foundation, as it relies on previous linguistic and archaeological theories, now coupled with ancient phylogeography and genomic ancestry.

There are still some inconsistencies in the interpretation of the so-called “Steppe ancestry”, though, despite the one and a half years that have passed since we first had access to the closest Pontic–Caspian steppe source populations. Even my post “Steppe ancestry” step by step from a year ago is already outdated.

Admixture

The population selection process for models shown below included (1) plausibility of potential influences in the particular geographic and archaeological context; (2) looking for their clusters or particular samples in the PCA; and (3) testing with qpAdm for potential source populations that might have been involved in their development.

The results and graphics posted are therefore intended to simplistically show potential admixture events between populations potentially close to the actual sources of the target samples, whenever such mating networks could be supported by archaeology.

NOTE. This is an informal post and I am not a geneticist, so I am turning this flexibility to my advantage. If any reader is – for some strange reason – looking for a strict hypothesis testing, for the use of a full set of formal stats (as used e.g. in Ning et al. 2019 for Proto-Tocharians), and correctly redacted and peer-reviewed text, this is not the right place to find them.

spatial-pedigree-geographic-admixture
An example pedigree (a) of a focal individual sampled in the modern day, placed in its geographic context to make the spatial pedigree (b). Dashed lines denote matings, and solid lines denote parentage, with red hues for the maternal ancestors and blue hues for the paternal ancestors. In the spatial pedigree, each plane represents a sampled region in a discrete (nonoverlapping) generation, and each dot shows the birth location of an individual. The pedigree of the focal individual is highlighted back through time and across space. Image modified from Bradburd and Ralph (2019).

Despite the natural impulse to draw straight mixture trajectories (see e.g. Wang et al. 2019), simply adding or subtracting samples used for a PCA shows how the plot is affected by different variables (see e.g. what happens by including more South Asian samples to the PCA below), hence the need to draw curved arrows – not necessarily representing a sizable drift; at least not in recent prehistoric admixture events for which we have a reasonable chronological transect.

reich-arrows-admixture-neolithic-bronze-age
Representation of mixture events between European prehistoric peoples in the PCA. Image modified from David Reich’s Who We Are and How We Got Here (2018).

Ethnolinguistic identification is a risky business that brings back memories of an evil use of cultural history and its consequences (at least in Western Europe, where this tradition was discontinued after WWII), but it seems necessary for those of us who want to find some confirmation of proposed dialectal schemes and language contacts.

Eneolithic Steppe vs. Steppe Maykop

First things first: I tested Bronze Age Eurasian peoples for the only two true steppe populations sampled to date, as potential sources of their “Steppe ancestry” – conventionally described as an EHG:CHG admixture, similar to that found in the first sampled Yamnaya individuals. I used the rightpops of Wang et al. (2018), but with a catch: since the authors used WHG as a leftpop and Villabruna as a rightpop, which I find a little inconsistent*, I preferred the strategy in Ning et al. (2019), contrasting as outgroup Eneolithic_Steppe (ca. 4300 BC) vs. Steppe_Maykop (ca. 3500 BC) when testing for WHG as a source population.

*WHG usually includes samples from a ‘western’ cluster (Loschbour and La Braña) and an ‘eastern’ cluster (Villabruna and Koros), see Lipson et al. (2017). Therefore, it doesn’t make much sense to include the same (or a very similar) population as a source AND an outgroup.

NOTE. For all other qpAdm analyses below, where WHG was not used as leftpop, I have used Villabruna as rightpop following Wang et al. (2019).
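For readers unfamiliar with what a two-source admixture model estimates, here is a toy sketch of the underlying idea: the target's statistics against a set of outgroups (rightpops) are approximated as a weighted average of the statistics of the two sources (leftpops). This is not qpAdm itself, which also uses the covariance of the statistics and reports a p-value for the fit of each model; the sketch is plain unweighted least squares, and all numbers and names in it are invented for illustration.

```python
def two_source_alpha(target, source_a, source_b):
    """Least-squares alpha for target ≈ alpha*source_a + (1 - alpha)*source_b."""
    numerator = sum((t - b) * (a - b) for t, a, b in zip(target, source_a, source_b))
    denominator = sum((a - b) ** 2 for a, b in zip(source_a, source_b))
    if denominator == 0:
        raise ValueError("the two sources cannot be told apart on these statistics")
    return numerator / denominator


def squared_residual(target, source_a, source_b, alpha):
    """Sum of squared differences between the target and the fitted mixture."""
    return sum(
        (t - (alpha * a + (1 - alpha) * b)) ** 2
        for t, a, b in zip(target, source_a, source_b)
    )


# Hypothetical f4-style statistics against three outgroups (invented numbers).
steppe_like = [0.010, 0.004, -0.002]
farmer_like = [0.002, 0.008, 0.006]
# A target built as an exact 75:25 mix, so the estimate should recover 0.75.
target = [0.75 * a + 0.25 * b for a, b in zip(steppe_like, farmer_like)]

alpha = two_source_alpha(target, steppe_like, farmer_like)
print(round(alpha, 3))  # 0.75
```

A large residual for one candidate source and a small one for another is, very loosely, what the "good fits" and "bad fits" below correspond to, although the real software judges fit with formal statistics, not with a raw residual.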

greater-caucasus-steppe-ancestry
Map of samples and sites mentioned in Wang et al. (2019), modified from the original to include labels of Eneolithic_Steppe and Steppe_Maykop samples. See PCA and ADMIXTURE graphic for the identification of specific samples.

Results are not much different from what has been reported. In general, Yamnaya and related groups such as Bell Beakers and Steppe-related Chalcolithic/Bronze Age populations show good fits for Eneolithic_Steppe as their closest source for Steppe ancestry, and bad fits for Steppe_Maykop, whereas Corded Ware groups show the opposite, supporting their known differences.

This trend seems to be tempered in some groups, though, most likely due to the influence of Samara_LN-like admixture in Circum-Baltic Late Neolithic and Eastern Corded Ware groups, and the influence of Anatolia_N/EEF-like admixture in Balkan and late European CWC or BBC groups. In fact, the more EEF-related ancestry in a population, the less reliable these generic models (and even specific ones) seem to become when distinguishing the Steppe-related source.

NOTE. For more on this, see the discussion on Circum-Baltic Corded Ware peoples, and the discussion on Mycenaeans and their potential source populations.

These are just broad strokes of what might have happened around the Pontic–Caspian steppes before and during the Early Bronze Age expansions. The most relevant quest right now for Indo-European studies is to ascertain the chain of admixture events that led to the development and expansion of Indo-Uralic and its offshoots, Indo-European and Uralic.

mesolithic-eastern-europe-post-swiderian
Eastern European Mesolithic with the expansion of Post-Swiderian cultures. See full map.

A history of Steppe ancestry

This post is divided into (more or less accurate) chronological developments as follows:

  1. Hunter-gatherer pottery and the steppes
  2. Khvalynsk and Sredni Stog
  3. Post-Stog and Proto-Corded Ware
  4. Yamnaya and Afanasievo

1. Hunter-gatherer pottery and the steppes

I laid out in the ASOSAH book series the general idea – based on attempts to reconstruct the linguistic ancestor of Indo-Uralic – that Eurasiatic speakers might have expanded with the North-Eastern Techno-Complex that spread through north-eastern Europe during the warm period represented by the transition of the Palaeolithic to the Mesolithic.

If one were to trust the traditional migrationist view, a post-Swiderian population expanded from central-eastern Europe (potentially related originally to Epi-Gravettian peoples, represented by WHG ancestry) into north-eastern Europe, and then further east into the Trans-Urals, to then reappear in eastern Europe as a back-migration represented by the spread of hunter-gatherer pottery.

The marked shift from WHG-like towards EHG-related ancestry from Baltic Mesolithic (ca. 30%) to Combed Ware cultures (ca. 65%-100%) supports this continuous westward expansion, that is possibly best represented in the currently available sampling by the ‘south-eastern’ shift (CHG:ANE-related) of the hunter-gatherer from Lebyazhinka IV (5600 BC) relative to the older one from Sidelkino (9300 BC), both from the Samara region in the Middle Volga:

Mesolithic-Neolithic transition ca. 7000-6000 BC, with hunter-gatherer pottery groups spreading westwards. See full map.

From Anthony (2019):

Along the banks of the lower Volga many excavated hunting-fishing camp sites are dated 6200-4500 BC. They could be the source of CHG ancestry in the steppes. At about 6200 BC, when these camps were first established at Kair-Shak III and Varfolomievka, they hunted primarily saiga antelope around Dzhangar, south of the lower Volga, and almost exclusively onagers in the drier desert-steppes at Kair-Shak, north of the lower Volga. Farther north at the lower/middle Volga ecotone, at sites such as Varfolomievka and Oroshaemoe hunter-fishers who made pottery similar to that at Kair-Shak hunted onagers and saiga antelope in the desert-steppe, horses in the steppe, and aurochs in the riverine forests. Finally, in the Volga steppes north of Saratov and near Samara, hunter-fishers who made a different kind of pottery (Samara type) and hunted wild horses and red deer definitely were EHG. A Samara hunter-gatherer of this era buried at Lebyazhinka IV, dated 5600-5500 BC, was one of the first named examples of the EHG genetic type (Haak et al. 2015). This individual, like others from the same region, had no or very little CHG ancestry. The CHG mating network had not yet reached Samara by 5500 BC.

Given the lack of a proper geographical and chronological transect of ancient DNA from eastern European groups, and the discontinuous appearance of both R1b-M73 and R1b-M269 lineages on both sides of the Urals within the WHG:ANE cline, where EHG appears to have formed, it is impossible at this point to assert anything with a sufficient degree of certainty. For simplicity’s sake, though, I ventured to tentatively associate the expansion of hg. R1b-M73 in West Siberia with Micro-Altaic, and the expansion of hg. R1b-M269 with the spread of Indo-Uralic on both sides of the Urals.

NOTE. For incrementally speculative associations of languages with prehistoric cultures and their potential link to ancestry ± haplogroup expansions, you can check sections on Early Indo-Europeans and Uralians, Indo-Uralians, Altaic peoples, Eurasians, or Nostratians. I explained why I made these simplistic choices here.

While this identification of the Indo-Uralic expansion with hg. R1b is more or less straightforward for the Cis-Urals, given the available ancient DNA samples, it will be very difficult (if at all possible) to trace the migration of these originally R1b-M269-rich populations into Trans-Uralian groups that could eventually be linked to Yukaghir speakers. The sheer number of potential admixture events and bottlenecks in Siberian forest, taiga, and tundra regions since the Mesolithic until Yukaghirs were first attested is guaranteed to give more than one headache in upcoming years…

neolithic-steppes-samara-mariupol
Spread of hunter-gatherer pottery in eastern Europe ca. 6000-5000 BC. See full map.

The slight increase in WHG-related ancestry in Ukraine Neolithic groups relative to Mesolithic ones questions the arrival of this eastern influence in the north Pontic area, or at least its relevance in genomic terms, although the cluster formed is similar to the previous one and to Combed Ware groups – despite the Central European and Baltic influences in the north Pontic region – with some samples showing 0% change relative to Mesolithic groups.

ukraine-samara-mesolithic-neolithic-evolution
Structure and change in hunter-gatherer-related populations, from Mathieson et al. (2018). Inferred ancestry proportions for populations modelled as a mixture of WHG, EHG and CHG. Dashed lines show populations from the same geographic region. Percentages indicate proportion of WHG + EHG ancestry. Standard errors range from 1.5 to 8.3%.

NOTE. For more on Indo-Uralic and its reconstruction from a linguistic point of view, check out its dedicated section on ASOSAH, or the recently published (behind paywall) The Precursors of Proto-Indo-European, edited by Kloekhorst and Pronk, Brill (2019). Authors of specific chapters have posted their contributions to Academia.edu, where they can be downloaded for free.

2. Khvalynsk and Sredni Stog

The cluster formed by the three available samples of the Khvalynsk culture (early 5th millennium BC) might be described, as expected from its position in the PCA, as a mixture of EHG-like populations of the Middle Volga with CHG-like ancestry close to that represented by samples from Progress-2 and Vonyuchka, in the North Caucasus Piedmont (ca. 4300 BC):

This variable CHG-like admixture shown in the wide cluster formed by the available Khvalynsk-related samples supports the interpretation of a recently created CHG mating network in Anthony (2019):

After 5000 BC domesticated animals appeared in these same sites in the lower Volga, and in new ones, and in grave sacrifices at Khvalynsk and Ekaterinovka. CHG genes and domesticated animals flowed north up the Volga, and EHG genes flowed south into the North Caucasus steppes, and the two components became admixed. After approximately 4500 BC the Khvalynsk archaeological culture united the lower and middle Volga archaeological sites into one variable archaeological culture that kept domesticated sheep, goats, and cattle (and possibly horses). In my estimation, Khvalynsk might represent the oldest phase of PIE.

steppe-ancestry-pca-neolithic-khvalynsk
Detail of the PCA of Eurasian samples, including Neolithic clusters with the hypothesized gene flows related to (1) the formation and (2) expansion of Khvalynsk and the (3) emergence of late Sredni Stog. See full image.

The richest copper assemblage found in all Khvalynsk burials belongs to an individual of hg. R1b-V1636 and intermediate Samara_HG:Eneolithic_Steppe ancestry, while full Eneolithic_Steppe-like admixture in the Middle Volga is represented by the commoner of Khvalynsk II, of hg. Q1. The finding of hg. R1b-V1636 in the North Caucasus Piedmont, and of R1b-P297 in the Samara region (probably including Yekaterinovka), raises the question of the origin of hg. R1b-V1636 in the Khvalynsk community. Based on its absence in ancient samples from the forest zone, it is tempting to assign it to steppe hunter-gatherers down the Lower Volga and possibly to the east of it, who infiltrated the Samara region precisely during these population movements described by Anthony (2019).

Suvorovo-related samples from the Balkans, including the Varna and Smyadovo outliers of Steppe ancestry, are closely related to the Khvalynsk expansion:

Similarly, the ancestry of late Sredni Stog samples from Dereivka seems to be directly related to the expansion of Mariupol-like individuals over populations of Suvorovo-Novodanilovka-like admixture, as suggested by the resurgence of typical Ukraine Neolithic haplogroups, the shift in the PCA, and the models of Eneolithic_Steppe vs. Steppe_Maykop above:

#EDIT (11 Nov 2019): In fact, the position of the unpublished Greece_Neolithic outlier that appeared in the Wang et al. (2018) preprint (see full PCA and ADMIXTURE) shows that the expanding Suvorovo chiefs from the Balkans formed a tight cluster close to the two published outliers with Steppe ancestry from Bulgaria.

The Ukraine_Neolithic outlier, possibly a Novodanilovka-related sample, suggests, based on its position in the PCA close to the late Trypillian outlier of Steppe-related ancestry, that Ukraine_Eneolithic samples from Dereivka are a mixture of Ukraine_Neolithic and a Novodanilovka-like community similar to Suvorovo.

The Trypillian_Eneolithic-like admixture found among Proto-Corded Ware peoples (see below) would then feature potentially a small Steppe_Eneolithic-like component already present in the north Pontic area, too.

pca-suvorovo-novodanilovka-khvalynsk-trypillia-greece-ukraine-neolithic-outlier
Image modified from Wang et al. (2018). Samples projected in PCA of 84 modern-day West Eurasian populations (open symbols). Previously known clusters have been marked and referenced. Marked and labelled are the Balkan samples referenced in this text. EHG and Caucasus ‘clouds’ have been drawn, leaving Pontic-Caspian steppe and derived groups between them. See the original file here.

Furthermore, whereas Anthony (2019) mentions a long-lasting predominance of hg. R1b in elite graves of the Eneolithic Volga basin, not a single sample of hg. R1a is mentioned supporting the community formed by the Alexandria individual, supposedly belonging to late Sredni Stog groups, but with a Corded Ware-like genetic profile (suggesting yet again that it is possibly a wrongly dated sample).

NOTE. A lack of first-hand information rather than an absence of R1a-M417 samples in the north Pontic forest-steppes would not be surprising, since Anthony is involved in the archaeology of the Middle Volga, but not in that of the north Pontic area.

eneolithic-pontic-caspian-steppe-khvalynsk-novodanilovka-suvorovo
Khvalynsk expansion through the Pontic–Caspian steppes in the early 5th millennium BC. See full map.

3. Post-Stog and Proto-Corded Ware

The origin of the Pre-Corded Ware ancestry is still a mystery, because of the heterogeneity of the sampled groups to date, and because the only ancestral sample that had a compatible genetic profile – I6561 from Alexandria – shows some details that make its radiocarbon date rather unlikely.

The most likely explanation for the closest source population of Corded Ware groups, found in the three core samples of Steppe_Maykop and in Trypillian Eneolithic samples from the first half of the 4th millennium BC, is still that a population of north Pontic forest-steppe hunter-gatherers hijacked this kind of ancestry, which was foreign to the north Pontic region before the Late Eneolithic period, later expanding east and west through the Podolian–Volhynian upland, due to the complex population movements of the Late Eneolithic.

NOTE. The idea of Trypillia influencing the formation of the Steppe_MLBA ancestry typical of Uralic peoples has been around for quite some time already, since the publication of Narasimhan et al. (2018) (see here or here).

steppe-ancestry-pca-corded-ware-bronze-age
Detail of the PCA of Eurasian samples, including Corded Ware groups and related clusters, as well as outliers, with hypothesized gene flows related to the (1) formation and (2) initial expansion of Pre-Corded Ware ancestry, as well as (3) later regional admixture events. See full image.

The specifics of how the Proto-Corded Ware community emerged remain unclear at this point, despite the simplistic description by Rassamakin (1999) of the Late Eneolithic north Pontic population movements as a two-stage migration of (1) late Trypillian groups (Usatovo) west → east, and (2) Late Maykop–Novosvobodnaya east → west. So, for example, Manzura (2016) on the Zhivotilovka “cultural-historical horizon” (emphasis mine):

Indeed, the very complex combination of different cultural traits in the burial sites of the Zhivotilovka type is able to generate certain problems in the search for the origins of this phenomenon. The only really consistent attribute is the burial rite in contracted position on the left or right side. Yu. Rassamakin is correct in asserting that this position of the deceased can be considered as new in the North Pontic region (Rassamakin 1999, 97). However, this opinion can be accepted only partially for the territory between Dniester and Lower Don. This position is well known in the Usatovo culture in the Northwest Pontic region, although skeletons on the right side are evidenced there only in double burials, whereas single burials contain the deceased only in a contracted position on the left side. On the other hand, the southern and western orientation of the deceased, which is one of the main burial traits of the Zhivotilovka type, is not characteristic of the Usatovo culture. Nevertheless, it is possible to suppose that at least part of the Usatovo population could have played a part in the formation of the cultural type under consideration here. One aspect of this cultural tradition, for instance, could be represented by skeletons on the left side and oriented in north-eastern and eastern directions.

Especially close ties can be traced between the Zhivotilovka and Maykop-Novosvobodnaya traditions, as exemplified by similar burial customs and various grave goods. It is beyond any doubt that the Maykop-Novosvobodnaya population was actively involved in the spread of the main Zhivotilovka cultural traits. The influence of North Caucasian traditions can be well observed, at least as far as the Dnieper Basin, but farther west influence is not manifested pronouncedly. The role of cultural units situated between the Dniester and Don rivers in the process of emergence of the Zhivotilovka type looks somewhat vague. Now, it can be quite confidently asserted that at the end of the 4th millennium BC this territory was settled by migrants from the North Caucasus and Carpathian-Dniester region. This event in theory had to stimulate cultural transformations in the Azov-Black Sea steppes and, thus, bearers of local cultural traditions perhaps could have participated in forming the culture under consideration. In any event, the Zhivotilovka type can be regarded as a complex phenomenon that emerged within the regime of intensive cultural dialogue and that it absorbed totally different cultural traditions. The spread of the Zhivotilovka graves across the Pontic steppes from the Carpathians to the Lower Don or even to the Kuban Basin clearly signalizes a rapid dissolution of former cultural borders and the beginning of active movements of people, things and ideas over vast territories.

zhivotilovka-horizon-north-pontic-area

What were the factors or reasons that could have provoked this event? In the beginning of the second half of the 4th millennium BC two advanced cultural centers emerged in the south of Eastern Europe. These were the Maykop-Novosvobodnaya and Usatovo cultures, which in spite of their separation by great distances were structurally very alike. This is expressed in similar monumental burial architecture, complex burial rites, even the composition of grave goods, developed bronze metallurgy, high standards of material culture, etc. Both cultures in a completely formed state exemplify prosperous societies with a high level of economic and social organization, which can correspond to the type of ranked or early complex societies. Normally, the social elite in such polities tends to rigidly control basic domains social, economic and spiritual life using different mechanisms, even open compulsion (Earle 1987, 294-297). To some extent similar social entities can be found at this moment in the forest-steppe zone of the Carpathian-Dniester region, as reflected by the well organized settlement of Brânzeni III and the Vykhatitsy cemetery (Маркевич 1981; Дергачев 1978). In spite of their complex character, such societies represent rather friable structures, which could rapidly disintegrate due to unfavourable inner or external factors.

The societies in question emerged and existed during a time of favourable natural climatic conditions, which is considered to be a transitional period from the Atlantic to the Subboreal period, lasting approximately from 3600 to 3300 cal BC, or a climatic optimum for the steppe zone (Иванова и др. 2011, 108; Спиридонова, Алешинская 1999, 30-31). These conditions to a large degree could guarantee a stable exploitation of basic resources and support existing social hierarchies. However, after 3300 cal BC significant climatic changes occurred, accompanied by an increasing aridization and fall in temperature. This event is usually termed the “Piora oscillation” or “Rapid Climatic Event”, and is regarded as having been of global character (Magny, Haas 2004). These rapid changes could have seriously disturbed existing economic and social relations and finally provoked a similar rapid disintegration of complex social structures. In this case the sites of the Zhivotilovka type could represent mere fragments of former prosperous societies, which under conditions of the absence of centralized social control and stable cultural borders tried to recombine social and economic ties. However, the population possessed the necessary social experience and important technological resources, such as developed stock-breeding based on the breeding of small cattle and wheeled transport, so they were ready for opening new territories in their search for a better life.

maykop-trypillia-intrusion-steppes
Disintegration, migration, and imports of the Azov–Black Sea region. First migration event (solid arrows): Gordineşti–Maikop expansion (groups: I – Bursuchensk; II – Zhyvotylivka; III – Vovchans’k; IV – Crimean; V – Lower Don; VI – pre-Kuban). Second migration event (hollow arrows): Repin expansion. After Rassamakin (1999), Demchenko (2016).

For more on chronology and the potentially larger, longer-lasting Zhivotilovka–Volchansk–Gordineşti cultural horizon and its expansion through the Podolian–Volhynian upland, read e.g. on the Yampil Complex in the latest volume 22 of Baltic-Pontic Studies (2017):

In the forest-steppe zone of the North-West Pontic area, important data concerning the chronological position of the Zhivotilovka-Volchansk group have been produced by the exploration of the Bursuceni kurgan, which is still awaiting full publication [Yarovoy 1978; cf. also Demcenko 2016; Manzura 2016]. Burials linked with the mentioned group were stratigraphically the eldest in the kurgan, and pre-dated a burial in the extended position and [Yamnaya culture] graves. Two of these burials (features 20 and 21) produced radiocarbon dates falling around 3350-3100 BC [Petrenko, Kovaliukh 2003: 108, Tab. 7]. Similar absolute age determinations were obtained for Podolia kurgans at Prydnistryanske [Goslar et al. 2015]. These dates, falling within the Late Eneolithic, mark the currently oldest horizon of kurgan burials in the forest-steppe zone of the North-West Pontic area. The Podolia graves linked with other, older traditions of the steppe Eneolithic seem to represent a slightly later horizon dated to the transition between the Late Eneolithic and Early Bronze Age.

The presence on the left bank of the Dniester River of kurgans associated with the Eneolithic tradition, which at the same time reveals connections with the Gordineşti-Kasperovce-Horodiştea complex, raises questions about the western range of the new trend in funerary rituals, and its potential connection with the expansion of the late Trypilia culture to the West Podolia and West Volhynia Regions. The data potentially suggesting the attribution of kurgans from the upper Dniester basin to this period is patchy and difficult to verify [e.g. Liczkowce – see Sulimirski 1968: 173]. In this context, the discovery of vessels in the Gordineşti style in a kurgan at Zawisznia near Sokal is inspiring [Antoniewicz 1925].

zhivotilovka-volchansk-burial-podolia
Burials representing funerary traditions of Zhivotilovka-Volchansk group in Podolie kurgans: 1 – Porohy, grave 3A/7, 2 – Kuzmin, grave 2/2 [after Klochko et al. 2015b, Bubulich, Khakhey 2001]

Another interesting aspect of potential source populations, in combination with those above for Eneolithic_Steppe vs. Steppe_Maykop, are groups with worse fits for Steppe_Maykop_core, which include Potapovka and Srubnaya, as reported by Wang et al. (2018), but also Sintashta_MLBA (although not Andronovo). This is compatible with the long-term admixture of Abashevo chiefs dominating over a majority of Poltavka-like herders in the Don-Volga-Ural steppes during the formation of the Sintashta-Potapovka-Filatovka community, also visible in the typical Yamnaya lineages and Yamnaya-like ancestry still appearing in the region centuries after the change in power structures had occurred.

NOTE. If you feel tempted to test for mixtures of Khvalynsk_EN, Eneolithic_Steppe, Yamnaya, etc. as a source population for Corded Ware, go for it, but it’s almost certain to give similar ‘good’ fits – whatever the model – in some Corded Ware groups and not in others. It is still unclear, as far as I know, how to formally distinguish a mixture of Corded Ware-related from a Yamnaya-related source in the same model, and the results obtained with a combination of Steppe_Maykop-related + Eneolithic_Steppe-related sources will probably artificially select either one or the other source, as it probably happened in Ning et al. (2019) with Proto-Tocharian samples (see qpAdm values) that most likely had a contribution of both, based on their known intense interactions in the Tarim Basin.

eneolithic-pontic-caspian-steppes-east-europe
Expansion of north Pontic cultures and related groups during the Late Eneolithic. See full map.

#EDIT (22 NOV 2019): New preprint Gene-flow from steppe individuals into Cucuteni-Trypillia associated populations indicates long-standing contacts and gradual admixture, by Immel et al. bioRxiv (2019), on Gordinești samples from Moldova ca. 3500-3100 BC. Relevant excerpts (emphasis mine):

A principal component analysis of the four Moldova females together with previously published data sets of ancient Eurasians showed that Gordinești, Pocrovca 1 and Pocrovca 3 grouped with later dating Bell Beakers from Germany and Hungary close to the four CTC males from Verteba, while Pocrovca 2 fell into the LBK cluster next to Neolithic farmers from Anatolia and Starčevo individual.

When looking at various proxies for steppe-related ancestry (Yamnaya Samara, Ukraine Mesolithic, Caucasian hunter-gatherer (CHG), Eastern hunter gatherer (EHG)), we did not observe any significant difference in genetic influx from either Yamnaya Samara, EHG or Ukraine Mesolithic. However, relative to CHG, we detected a substantial shift towards Yamnaya Samara steppe-related ancestry. Consequently, Yamnaya Samara, Ukraine Mesolithic and EHG appear to be equally suitable proxies for steppe-related ancestry in the Moldovan CTC individuals.

We did not obtain feasible models when running qpAdm on the X-chromosome in order to test for male-biased admixture from hunter-gatherers or individuals with steppe-related ancestry.

It is not surprising that Gordinești, Pocrovca 1 and Pocrovca 3 showed genetic affinities with later dating Bronze Age or Bell Beaker individuals. The common link among them is the considerable steppe-related ancestry, which each group likely received independently from different parental populations.

pca-trypillia-verteba-pocrovka-gordinesti
Principal component analysis of the CTC individuals from Moldova (Gordinești, Pocrovca 1, Pocrovca 2, Pocrovca 3) in red and the CTC individuals from Verteba Cave (I1926, I2110, I2111, I3151) in blue together with 23 selected ancient populations/individuals projected onto a basemap of 58 modern-day West Eurasian populations (not shown). HG=hunter-gatherer, LBK=Linearbandkeramik, PU=Proto-Unetice, TRB=Trichterbecher (Funnel Beaker Culture, FBC). PC1 is shown on the x-axis and PC2 on the y-axis.

4. Yamnaya and Afanasievo

I don’t think it makes much sense to test for GAC (or Iberia_CA, for that matter) as Wang et al. (2019) did, given the implausibility of them taking part in the formation of late Repin during the mid-4th millennium BC around the Don-Volga interfluve (represented by its offshoots Yamnaya and Afanasievo), whether these or other EEF-related populations show ‘better’ fits or not. Therefore, I only tested for more or less straightforward potential source populations:

steppe-ancestry-pca-yamnaya-hungary-bulgaria-vucedol
Detail of the PCA of Eurasian samples, including Yamnaya groups and related clusters, as well as outliers, with hypothesized gene flows related to its (1) formation and (2) expansion. Also included is the inferred position of the admixed sample Yamnaya_Hungary_EBA1. See full image.

Quite unexpectedly – for me, at least – it appears that Afanasievo and Yamnaya invariably prefer Khvalynsk_EN as the closest source rather than a combination including Eneolithic_Steppe directly. In other words, late Repin shows largely genetic continuity with the Steppe ancestry already shown by the three sampled individuals from the Khvalynsk II cemetery, in line with the known strong bottlenecks of Khvalynsk-related groups under R1b lineages, visible also later in Afanasievo and Yamnaya and derived Indo-European-speaking groups under R1b-L23 subclades.

NOTE. This explains better the reported bad fits of models using directly Eneolithic_Steppe instead of Khvalynsk_EN for Afanasievo and Yamnaya Kalmykia, as is readily evident from the results above, instead of a rejection of an additional contribution to an Eneolithic_Steppe-like population, as I interpreted it, based on Anthony (2019).

repin-zhivotilovka-north-pontic-steppe
Map of major sites of the Zhivotilovka-Volchansk group (A) and Repin culture (B), by Rassamakin (see 1994 and 2013). (A) 1 – Primorskoye; 2 – Vasilevka; 3 – Aleksandrovka; 4 – Boguslav; 5 – Pavlograd; 6 – Zhivotilovka; 7 – Podgorodnoye; 8 – Novomoskovsk; 9 – Sokolovo; 10 – Dneprelstan; 11 – Razumovka; 12 – Pologi; 13 – Vinogradnoye; 14 – Novo-Filipovka; 15 – Volchansk; 16 – Yuryevka; 17 – Davydovka; 18 – Novovorontsovka; 19 – Ust-Kamenka; 20 – Staroselye; 21 – Velikaya Aleksandrovka; 22 – Kovalevka; 23 – Tiraspol; 24 – Cura-Bykuluy; 25 – Roshkany; 26 – Tarakliya; 27 – Kazakliya; 28 – Bolgrad; 29 – Sarateny; 30 – Bursucheny; 31 – Novye Duruitory; 32 – Kosteshty. (B) 1 – Podgorovka; 2 – Aleksandria; 3 – Volonterovka; 4 – Zamozhnoye; 5 – Kremenevka; 6 – Ogorodnoye; 7 – Boguslav; 8 – Aleksandrovka; 9 – Verkhnaya Mayevka; 10 – Duma Skela; 11 – Zamozhnoye; 12 – Mikhailovka II.

This might suggest that the Steppe ancestry visible in samples from Progress-2 and Vonyuchka, sharing the same cluster with the Khvalynsk II cemetery commoner of hg. Q1, most likely represents North Caspian or Black Sea–Caspian steppe hunter-gatherer ancestry that increased as Khvalynsk settlers expanded to the south-west towards the Greater Caucasus, probably through female exogamy. That would mean that Steppe_Maykop potentially represents the ‘original’ ancestry of steppe hunter-gatherers of the North Caucasus steppes, which is also weakly supported by the available similar admixture of the Lola culture. The chronology, geographical location and admixture of both clusters seemed to indicate the opposite.

eneolithic-steppe-maykop-ehg-chg-ag2
Modelling results for the Steppe and Caucasus cluster. Additional ‘eastern’ AG-Siberian gene flow in Steppe Maykop relative to Eneolithic Steppe. From Wang et al. (2019).

Due to the limitations of the currently available sampling and statistical tools, and barring the dubious Alexandria outlier, it is unclear how much of the late Trypillian-related admixture of late Repin (as reflected in Yamnaya and Afanasievo) corresponds to late Trypillian, Post-Stog, or Proto-Corded Ware groups from the north Pontic area. A mutual exchange suggestive of a common mating network (also supported by the mixed results obtained when including Khvalynsk_EN as source for early Corded Ware groups) seems to be the strongest proof to date of the Late Proto-Indo-European – Uralic contacts reflected in the period when post-laryngeal vocabulary was borrowed (with some samples predating the merged laryngeal loss), before the period of intense borrowing from Pre- and Proto-Indo-Iranian.

Between-group differences of Yamnaya samples are caused – like those between Corded Ware groups – by the admixture of a rapidly expanding society through exogamy with regional populations, evidenced by the inconstant affinities of western or southern outliers for previous local populations of the west Pontic or Caucasus area. This explanation for the gradual increase in local admixture is also supported by the strong, long-term patrilineal system and female exogamy practiced among expanding Proto-Indo-Europeans.

chalcolithic-early-bronze-yamnaya-corded-ware-vucedol
Groups of the Yamnaya culture and its western expansion after ca. 3100 BC, and Corded Ware after ca. 2900 BC. See full map.

Bell Beakers and Mycenaeans

This Eneolithic_Steppe ancestry is also found among Bell Beaker groups (see above). More specifically, all Bell Beaker groups prefer a source closest to a combination of Yamnaya from the Don and Baden LCA individuals from Hungary, rather than with Corded Ware and GAC, despite the quite likely admixture of western Yamnaya settlers with (1) south-eastern European (west Pontic, Balkan) Chalcolithic populations during their expansion through the Lower Danube and with (2) late Corded Ware groups (already admixed with GAC-like populations) during their expansion as East Bell Beakers:

Similarly, Mycenaeans show good fits for a source close to the Yamnaya outlier from Bulgaria:

steppe-ancestry-pca-bell-beakers-mycenaeans
Detail of the PCA of Eurasian samples, including Bell Beaker and Balkan EBA groups and related clusters, as well as outliers, including ancestral Yamnaya samples from Hungary (position inferred) and Bulgaria. Also marked are Minoans, Mycenaeans and Armenian BA samples. See full image.

You can read more on Yamnaya-related admixture of Bell Beakers and Mycenaeans, and on Afanasievo-related admixture of Iron Age Proto-Tocharians.

Conclusion

The use of the concept of “Yamnaya ancestry”, then “Steppe ancestry” (and now even “Yamnaya Steppe ancestry”?) has already permeated the ongoing research of all labs working with human population genomics. Somehow, the conventional use of Yamnaya_Samara samples opposed to a combination of other ancient samples – alternatively selected among WHG, EHG, CHG/Iran_N, Anatolia_N, or ANE – has spread and is now unquestionably accepted as one of the “three quite distinct” ancestral groups that admixed to form the ancestry of modern Europeans, which is a rather odd, simplistic and anachronistic description of prehistory…

It has now become evident that authors involved with the Proto-Indo-European homeland question – and the tightly intertwined one of the Proto-Uralic homeland – are going to dedicate a great part of the discussion of many future papers to correcting or outright rejecting the conclusions of previous publications, instead of simply going forward with new data.

The most striking argument to mistrust the current use of “Steppe ancestry” (as an alternative name for Yamnaya_Samara, and not as ancestry proper of steppe hunter-gatherers) is not the apparent difference in direct Eneolithic sources of Steppe ancestry for Corded Ware and Yamnaya-related peoples – closer to the available samples classified as Steppe_Maykop and Eneolithic_Steppe, respectively – or their different evolution under marked Y-DNA bottlenecks.

It is not even the lack of information about the distant origin of these Pontic–Caspian steppe hunter-gatherers of the 5th and 4th millennium BC, with their shared ancestral component potentially separated during the warmer Palaeolithic-Mesolithic transition, when the steppes were settled, without necessarily sharing any meaningful recent history before the formation of the Proto-Indo-Uralic community.

NOTE. I have raised this question multiple times since 2017 (see e.g. here or here).

The most striking paradox about simplistically misinterpreting “Steppe ancestry” as representative of Indo-European expansions is that those sub-Neolithic Pontic–Caspian steppe hunter-gatherers that had this ancestry in the 6th millennium BC were probably non-Indo-European-speaking communities, most likely related to the North(West) Caucasian language family, based on the substrate of Indo-Anatolian that sets it apart from Uralic within the Indo-Uralic trunk, and on later contacts of Indo-Tocharian with North-West Caucasian and Kartvelian, the former probably represented by Maykop and its contact with the Repin and early Yamnaya cultures.

NOTE. For more on this, see Allan Bomhard’s recent paper on the Caucasian substrate hypothesis and its ongoing supplement Additional Proto-Indo-European/Northwest Caucasian Lexical Parallels.

steppe-ancestry-racimo
“Spatiotemporal kriging of YAM steppe ancestry during the Holocene, using 5000 spatial grid points. The colors represent the predicted ancestry proportion at each point in the grid.” Image with evolution from ca. 2800 BC until the present day, modified from Racimo et al. (2019). The Copenhagen group considers the expansion of this component as representative of expanding Indo-Europeans…

This kind of error happens because we all – hence also authors, peer reviewers, and especially journal editors – love far-fetched conclusions and sensational titles, forgetting what a paper actually shows and – always more importantly in scientific reports – what it doesn’t show. This is particularly true when more than one field is involved and when extraordinary claims involve aspects foreign to the journal’s (and usually the authors’ own) main interests. One would have thought that the glottochronological fiasco published in Science in 2012 (open access in PMC) should have taught an important lesson to everyone involved. It didn’t, because apparently no one has felt the responsibility or the shame to retract that paper yet, even in the age of population genomics.

If anything, the excesses of mathematical linguistics – using computational methods to try and reconstruct phylogenetic trees – have perpetuated a form of misunderstood Scientism which blindly relies on a simple promise made by authors in the Materials and Methods section (rarely if ever kept beyond it) to use statistics rather than resorting to the harder, well-informed, comprehensive reasoning that is needed in the comparative method. After all, why should anyone invest hundreds of hours in (or simply show an interest in) learning about historical linguistics, about ancient Indo-European or Uralic languages, carefully arguing and discussing each and every detail of the reconstruction, when one can simply rely on one’s own gut to decide what is Science and what isn’t? When one can trust a promise that formulas have been used?

The conservative, null hypothesis when studying prehistoric Eurasian samples related to evolving cultures was universally understood as no migration, or “pots not people” (as most western archaeologists chose to believe until recently), whereas the alternative one should have been that there were in fact migration events, some of them potentially related to the expansion of Eurasian languages ancestral to the historically attested ones. Beyond this migrationist view there were obviously dozens of thorough theories concerning potential linguistic expansions associated with specific prehistoric cultures, and a myriad of less developed alternatives, all of which deserved to be evaluated after the null hypothesis had been rejected.

Despite the shortcomings of the 2015 papers and their lack of testing or discussion of different language expansion models, the spread of the so-called “Yamnaya ancestry” – an admixture especially prevalent (after the demise of the Yamnaya) among the most likely ancient Uralic-speaking groups as well as among modern Uralic speakers and recently acculturated groups from Eastern Europe – has been nevertheless invariably concluded by each lab to support the theories of their leading archaeologist, often combined with pre-aDNA theories of geneticists based on modern haplogroup distributions. This is as evident a case of confirmation bias, circular reasoning, and jumping to conclusions as it gets.

Why many researchers of other labs have chosen to follow such conclusions instead of challenging or simply ignoring them is difficult to understand.

Related

Proto-Tocharians: From Afanasievo to the Tarim Basin through the Tian Shan

tocharians-early-eneolithic

A reader commented recently that there is little information about Indo-Europeans from Central and East Asia in this blog. Regardless of the scarce archaeological data compared to European prehistory, I think it is premature to write anything detailed about population movements of Indo-Iranians in Asia, especially now that we are awaiting the updates of Narasimhan et al. (2018).

Furthermore, there was little hope that Tocharians would be different than neighbouring Andronovo-like populations (see a recent post on my predicted varied admixture of Common Tocharians), so the history of both unrelated Late PIE languages would have had to be explained by the admixture of Afanasievo-related groups with peoples of Andronovo descent and their acculturation.

However, data reported recently by Ning, Wang et al. Current Biology (2019) confirmed that peoples of mainly Afanasievo ancestry – as opposed to those of Corded Ware-related ancestry expanding with the Srubna-Andronovo horizon – spread the Tocharian branch of Proto-Indo-European from the Altai into the Tian Shan area, surviving essentially unadmixed into the Early Iron Age.

This genetic continuity of Tocharians will no doubt help us disentangle a great part of the ethnolinguistic history of speakers of the Tocharian branch of Proto-Indo-European, from Pre-Proto-Tocharians of Afanasievo to Common Tocharians of the Late Bronze Age/Iron Age eastern Tian Shan.

NOTE. Tocharian’s isolation from the rest of Late PIE dialects and its early and intense language contacts have always been the key to support an early migration and physical separation of the group, hence the traditional association with Afanasievo, a late Repin/early Yamna offshoot. Even with the current incomplete archaeological and genetic picture, there is no other option left for the expansion of Tocharian.

It is not possible to use the currently available ancestry data to map the evolution of Afanasievo ancestry, lacking a proper geographical and temporal transect of Central and East Asian groups. In spite of this, Ning, Wang, et al. (2019) is a huge leap forward, discarding some archaeological models, and leaving only a few potential routes by which Tocharians may have spread southward from the Altai.

NOTE. I have updated the maps of prehistoric cultures accordingly, with colours – as always – reflecting the language/ancestry evolution of the different groups, even though the archaeological data of some groups of Xinjiang remains scarce, so their ethnolinguistic attribution – and the colours picked for them – remain tentative.

xinjiang-andronovo-xiaohe-horizon-bronze-iron-age
A rough timeline of related archaeological sites from North Eurasia. Image modified from Yang (2019).

Tocharians

The recent book Ancient China and its Eurasian Neighbors. Artifacts, Identity and Death in the Frontier, 3000–700 BCE, by Linduff, Sun, Cao, and Liu, Cambridge University Press (2017) offers an interesting summary of the introduction of metalworking into western China.

Here are some relevant excerpts (emphasis mine):

Although [the Xinjiang] route is not uniformly agreed upon (Shelach-Lavi 2009: 134–46), this western transmission has been thought to have passed through eastern Kazakhstan, especially as it is manifest in Semireiche, with Yamnaya, Afanasievo (copper) and Andronovo (tin bronze) peoples (Mei 2000: Fig. 3). From Xinjiang this knowledge has been thought to have traveled through the Gansu Corridor via the Qijia peoples (Bagley 1999) and then into territories controlled by dynastic China. The dating of this process is still a problem, as the sites and their contents in Xinjiang are consistently later than those in Gansu, suggesting that the point of contact was in Gansu and that the knowledge then spread from there westward.

1. Eneolithic Altai

tocharians-chalcolithic-eneolithic
Afanasievo expansion ca. 3300-2600 BC. See full culture and ancient DNA maps.

The Afanasievo sites, as they are identified in Mongolia, for instance, make up an Eneolithic culture analogous to that of southern Siberia (3100/2500–2000 BCE) in the Upper Yenissei Valley that is characterized by copper tools and an economy reliant on horse, sheep and cattle breeding as well as hunting. (…) The Afanasievo is best known through study of its burials, which typically include groups of round barrows (kurgans), each up to 12 m in diameter with a stone kerb and covering a central pit grave containing multiple inhumations. In their Siberian context, burial pottery types and styles have suggested contacts with the slightly earlier Kelteminar culture of the Aral and Caspian Sea area.

The Afanasievo culture monuments, located in the northern Altai and in the Minusinsk Basin (the western Sayan), have been seen as analogous evidence for cross-Eurasian exchange. These complexes contain small collections of metal, and many of the items are made of brass, although golden, silver and iron ornaments were also identified. A mere one-fourth of these objects are tools and ornaments, while the rest consist of unshaped remains and semi-manufactured objects. Its metallurgical tradition has recently been dated by Chernykh to as early as 3100 to 2700 BCE (1992), making it more compatible chronologically with the early brass-using sites in Shaanxi mentioned above. Kovalev and Erdenebaatar have excavated barrows in Bayan-Ulgii, Mongolia, that have been carbon-dated to the first half of the third millennium BCE and associated by ceramic types and styles and burial patterns with the Afanasievo (Kovalev and Erdenebaatar 2009: 357–58). These mounded kurgans were covered with stone and housed rectangular, wooden-faced tombs that included Afanasievo-type bronze awls, plates and small “leaf-shaped” knife blades (Kovalev and Erdenebaatar 2009: Figs. 6 and 7).

They also excavated sites belonging to the more recently identified Chemurchek archaeological culture, located in the foothills of the Mongolian Altai (Kovalev 2014, 2015) (Fig. 2.6). These sites are carbon-dated to the same period as the Afanasievo burials or to c. 3100/2500–1800 BCE (six barrows in Khovd aimag and four in Bayan-Ulgo aimag). In the rectangular stone kerbed Chemurchek slab burials (Ulaaanhus sum, Bayan-ul’gi aimag and so forth), bronze items included awls; and at Khovd aimag, Bulgan sum, in addition to stone sculptures, three lead and one bronze ring were excavated (Kovalev and Erdenebaatar 2009: Figs. 2 and 3; Fig. 2.6). Although we will not know if they were produced locally until much further investigation is undertaken, these discoveries do document knowledge of various uses and types of metal objects in western and south central Mongolia. The types of metal items thus far recovered are simple tools (awls) and rings (ornamental?) not unlike those associated with Andronovo archaeological cultures as well.

This is a complex circumstance where archaeological evidence is not complete, but raises very important questions about transmission of metallurgical knowledge to and from areas in present-day China. In the 1970s some Afanasievo mounds were excavated in Central Mongolia by a Soviet–Mongolian expedition led by V. V. Volkov and E. A. Novgorodova (Novgorodova 1989: 81–85). Unfortunately, these mounds did not yield metal objects, only ceramics, but they show that the Afanasievo culture with the Eneolithic metallurgical tradition of manufacturing pure copper items had already moved east at least as far as central Mongolia. In 2004, Kovalev and Erdenebaatar investigated a large Afanasievo mound, Kulala ula, in the extreme northwest of Mongolia, near the Russian border (Kovalev and Erdenebaatar 2009). There they found a copper knife and awl (Fig. 2.5). There are five C14 dates on wood, coal and human bones from this mound, which belong to the period 2890–2570 BCE. This shows that the Afanasievo culture were carriers of technology and produced artifacts in the first half of the third millennium BCE and that they also moved south along the foothills of the Mongolian Altai. Afanasievo culture in Altai and the Minusinsk basin is dated by C14 to 3600–2500 BCE (Svyatko et al. 2009; Polyakov 2010). In the north of Xinjiang in the Altai district, several typical egg-shaped vessels and two censers of Afanasievo types were found. Some of these have been obtained from the stone boxes (chambers of megalithic graves of the Chemurchek culture) (Kovalev 2011). Thus, the Afanasievo tradition of pure copper metallurgy must have spread to the northern foothills of the Tienshan Mountains no later than the mid-third millennium BCE. The links with Afanasievo and local cultures adjacent to and south of the mountains into present-day China can now be assumed.

tocharians-chalcolithic-late
Afanasievo – Chemurchek evolution ca. 2600-2200 BC. See full culture and ancient DNA maps.

2. Bronze Age Altai

Kovalev and Erdenebaatar (2014a) and later Tishkin, Grushin, Kovalev and Munkhbayar (2015) in Western Mongolia conducted large-scale excavations of megalithic barrows of the Chemurchek culture (dated about 2600–1800 BCE). This peculiar culture appeared in Dzungaria and the Mongolian Altai in the second quarter of the third millennium BCE and for some time existed together with the late Afanasievo culture, as evidenced by the findings of Afanasievo ceramics in Chemurchek graves, in the stone boxes. Unfortunately, in China we do not yet know of any metal object related, without doubt, to the Chemurchek culture. Kovalev, Erdenebaatar, Tishkin and Grushin found several leaden ear rings and one ring of tin bronze in three excavated Chemurchek stone boxes (Kovalev and Erdenebaatar 2014a; Tishkin et al. 2015). Such lead rings are typical for Elunino culture, which occupied the entire West Altai after 2400–2300 BCE (Tishkin et al. 2015). This culture had developed a tradition of bronze metallurgy with various dopants, primarily tin. Thus, the tradition of bronze metallurgy as early as this time could have penetrated the Mongolian Altai far to the south. In addition, in the Hadat ovoo Chemurchek stone box, Kovalev and Erdenebaatar discovered stone vessels refurbished with the help of copper “patches,” indicating the presence there of metallurgical production (Fig. 2.7) (Kovalev and Erdenebaatar 2014a). In one of the secondary Chemurchek graves unearthed by Kovalev and Erdenebaatar in Bayan-Ulgi (2400–2220 BCE), a bronze awl was found (Kovalev and Erdenebaatar 2009). Kovalev and Erdenebaatar also discovered a new culture in the territory of Mongolia (Map 2.3), one that begins immediately after Chemurchek – Munkh-Khairkhan culture (Kovalev and Erdenebaatar 2009, 2014b). To date, about 17 mounds of this culture have been excavated in Khovd, Zavkhan, Khovsgol, Bulgan aimag of Mongolia. This culture dates from about 1800 to 1500 BCE, that is, contemporary with the Andronovo culture. Therefore, the Andronovo culture does not extend far into the territory of Mongolia. Three knives without dedicated handles or stems and five awls have been found in the Munkh-Khairkhan culture mounds (Fig. 2.8). All these products are made of tin bronze. (…) Additionally, eight Late Bronze Age burials (c. 1400–1100 BCE) were unearthed in the Bulgan sum of Khovd aimag and belong to another previously unknown culture called Baitag. And in the Gobi Altai, a new group of “Tevsh” sites dating to the Late Bronze Age were defined in Bayankhongor and South Gobi aimags (Miyamoto and Obata 2016: 42–50). From these Tevsh and Baitag sites, we see the expansion of burial goods to include beads of semiprecious stones (carnelian), bronze beads, buttons and rings and even the famous elaborate golden hair ornaments (Tevsh uul; Bogd sum; Uverkhanagia aimag) from the Baitag barrows (Kovalev and Erdenebaatar 2009: Fig. 5; Miyamoto and Obata 2016).

2.1. Chemurchek

About the Chemurchek culture, from A re-analysis of the Qiemu’erqieke (Shamirshak) cemeteries, Xinjiang, China, by Jia and Betts, JIES (2010) 38(4):

The major characteristics of Qiemu’erqieke Phase I include:

  1. Burials with two orientations of approximately 20° or 345°.
  2. Rectangular enclosures built using large stone slabs. The size of the enclosure varies from a maximum of 28 x 30 m.* to a minimum of 10.5 x 4.4 m. (Figure 8, Table 2).
     *The stone enclosure located near Hayinar is the largest one at approximately 30 x 40 m. based on pacing of the site during a visit by the authors in 2008.
  3. Almost life-sized anthropomorphic stone stelae erected along one side of the stone enclosures (Lin Yun 2008).
  4. Single enclosures tend to contain one or more than one burial, all or some with stone cist coffins.
  5. The cist coffin is usually constructed using five large stone slabs, four for the sides and one on top, leaving bare earth at the base (Zhang Yuzhong 2007). Sometimes the insides of the slabs have simple painted designs (Zhang Yuzhong 2005).
  6. Primary and secondary burials occur in the same grave.
  7. Some decapitated bodies (up to 20) may be associated with the main burial in one cist.
  8. Bodies are commonly placed on the back or side with the legs drawn up.
  9. Grave goods include stone and bronze arrowheads, handmade gray or brown round-bottomed ovoid jars, and small numbers of flat-bottomed jars (Fig. 7).
  10. Clay lamps appear to occur together with round-bottomed jars.
  11. Complex incised decoration on ceramics is common but some vessels are undecorated.
  12. The stone vessels are distinctive for the high quality of manufacture.
  13. Stone moulds indicate relatively sophisticated metallurgical expertise.
  14. Artefacts made from pure copper occur.
  15. Sheep knucklebones (astragali) imply a tradition (as in historical and modern times) of keeping knucklebones for ritual or other purposes. They also indicate the herding of domestic sheep as part of the subsistence economy.
tocharians-bronze-age-early
Chemurchek culture ca. 2200-1750 BC. See full culture and ancient DNA maps.

Chemurchek dating

Available evidence suggests that the date range for Qiemu’erqieke Phase I should fall from the later third into the early second millennium BC. There are several reasons to suggest that the time span is around the early second millennium BC. Lin Yun (2008) (…) maintains that the bronze artefacts found in Phase I show a greater sophistication in the level of copper alloy technology than that of the pure copper artefacts common to the Afanasievo tradition. On this basis it might be suggested that the Afanasievo could be considered to be Chalcolithic with a time span across much of the third millennium BC (Gorsdorf et al. 2004: 86, Fig. 1). Qiemu’erqieke Phase I, however, should more properly be considered as Bronze Age.

Lin Yun also used the bronze arrowhead from burial M17 to narrow down the date of Qiemu’erqieke Phase I. Two arrowheads were found in this burial, one of them leaf shaped with a single barb on the back (Fig. 7:4). A similar arrowhead, together with its casting mould, has been found at the Huoshaogou site of Siba tradition (Li Shuicheng 2005, Sun Shuyun and Han Rufen 1997), in Gansu province, northwest China, dated around 2000-1800 BC (Li Shuicheng and Shui Tao 2000). This supports a date in the early second millennium BC for the Qiemu’erqieke arrowhead. The painted, round-bottomed jar from the Tianshanbeilu cemetery (Jia Weiming, Betts and Wu Xinhua 2008: Fig. 7, bottom left) has been considered as a hybrid between the Upper Yellow River Bronze Age cultures of Siba in northwest China and the steppe tradition of Qiemu’erqieke in west Siberia (Li Shuicheng 1999). If this assumption is correct, the date of Tianshanbeilu, around 2000 BC, can be used as a reference for Qiemu’erqieke Phase I (Jia Weiming, Betts and Wu Xinhua 2008, Lin Yun 2008, Li Shuicheng 1999). Stone arrowheads found in Qiemu’erqieke Phase I also imply that the date is likely to fall within the earlier part of the Bronze Age as no such stone arrowheads have yet been found elsewhere in sites of the Bronze Age in Xinjiang dated after the beginning of the second millennium BC.*
*For example Chawuhu and Xiaohe cemeteries (Xinjiang Institute of Archaeology 1999, 2003).

pottery-afanasevo-chemurchek
Pottery of Afanasevo and East European traits from the Chemurchek complex. Image modified from Kovalev (2017).

(…) Pottery “oil burners” (goblet-like ceramic vessels, possibly lamps) have been found in three traditions: Afanasievo (Gryaznov and Krizhevskaya 1986:21), Okunevo and Qiemu’erqieke. It is believed that this oil-burner found in Siberia and the Altai is a heritage from the Yamnaya and Catacomb cultures (Sulimirski 1970: 225, 425; Shishlina 2008:46) in the Caspian steppe further to the west, but does not seem to exist in known Andronovo cultures. The oil-burner tends to disappear after around 2300 BC during the mid-Okunevo period. It is, however, possible that the tradition continues longer in the Qiemu’erqieke sites.

The construction of the stone enclosures also reveals a close connection between Qiemu’erqieke Phase I and the mid and late Okunevo tradition (Sokolova 2007). Slab built stone enclosures emerged in both the Okunevo and Afanasievo traditions (Gryaznov and Krizhevskaya 1986:15-23, Kovalev 2008, Sokolova 2007, Anthony 2007:310, Koryakova and Epimakhov 2007). In the early Afanasievo the enclosure is circular with no cist coffin (Anthony 2007:310, Gryaznov and Krizhevskaya 1986:20), but in the early stage of the Okunevo square stone enclosures with a single cist burial are dominant. Square or rectangular stone enclosures are a marked feature of Qiemu’erqieke Phase I, suggesting temporal relationships between Qiemu’erqieke Phase I and the Okunevo. In Okunevo chronological group II, possibly with influence from the Afanasievo, circular stone enclosures appeared in combination with rectangular enclosures within individual cemeteries, referred to by Sokolova (2007: table 2) as hybrid examples. By Okunevo chronological group III, rectangular stone slab enclosures with multi-burials emerged again. This is the dominant form in Qiemu’erqieke Phase I. Okunevo burial traditions changed again to single cist burials in the late stage around chronological group V (Sokolova 2007). A specific mortuary rite of decapitated burials exists in both the Qiemu’erqieke and Okunevo traditions (Sokolova 2007, Chen Kwang-tzuu and Hiebert 1995), as does the occasional occurrence of painted designs on the interior of the slabs forming the cists (e.g., Khavrin 1997: 70, fig. 4; 77: tab. IV.5). Based on these comparisons, the date of Qiemu’erqieke Phase I may well parallel that of the Okunevo from at least chronological group II around 2400 BC (Gorsdorf et al. 2004: fig. 1).

khuh-udzuur-barrow
Khuh Udzuuriin I-1 elite barrow (ca. 2470-2190 BC). Image modified from Kovalev (2014).

In addition to the pottery making tradition, the anthropomorphic stone stelae may also have earlier antecedents. In the Okunevo assemblage there are anthropomorphic stelae that are longer, thinner and more abstract than those of Qiemu’erqieke. There is no indication of such stelae in the Afanasievo tradition (Gryaznov and Krizhevskaya 1986:15-23). However, further to the west, anthropomorphic stone stelae are associated with the Kemi-Oba and Yamnaya cultures around the third millennium BC (Telegin and Mallory 1994; Figure 13). Some major characteristics of these stelae such as the icons on the front face of the stelae (Telegin and Mallory 1994:8-9) also appear on stelae found in Qiemu’erqieke Phase I. Recalling the oil burners that may have been inherited from the Yamnaya culture and which are found in the Afanasievo, Okunevo and Qiemu’erqieke Phase I, it might be possible to speculate that Qiemu’erqieke Phase I has its origins even earlier than the first half of the third millennium BC. This idea has also been suggested by Kovalev (1999).

Despite the affinities with the Okunevo cultural tradition, Qiemu’erqieke Phase I appears to be a discrete regional variant. The ceramic assemblage shows traits unique to this cluster of sites, while the anthropomorphic stelae are also distinctive markers of this tradition.

khuh-udzuur-stela
Khuh Udzuur anthropomorphic stone stela, oriented toward the south – south-east. Image modified from Kovalev (2014).

3. Bronze Age Xinjiang

I recently reported on this blog the description of the Xiaohe and Gumugou cemeteries from the interesting Master’s thesis Shifting Memories: Burial Practices and Cultural Interaction in Bronze Age China: A study of the Xiaohe-Gumugou cemeteries in the Tarim Basin, by Yunyun Yang, Uppsala University, Department of Archaeology and Ancient History (2019).

It also offered a full summary of findings from prehistoric sites of Xinjiang related to the arrival of a cultural package from the Altai region, ultimately connected to Afanasievo. Relevant excerpts include the following (emphasis mine):

In Bronze Age Xinjiang, burials were diverse but also show some common features between different geographic sections. The main three mountains, including Kunlun Mountains, Tian Shan (mountains) and Altai Mountains, enclose the Tarim Basin, and the Dzungaria Basin, but leave the eastern part of the Tarim Basin and the south-eastern part of the Dzungaria Basin open (with easy access to the surroundings). The Hami Basin is located at the transitional area, connecting the two basins. Burials are mainly spread along the edge of the mountain ranges.

xinjiang-afanasievo-andronovo-bmac-tian-shan
An assumption of the spreading/expansion routes of stone burial constructions.

3.1. The Lop Nur region

In the Lop Nur region, the Xiaohe cemetery (2000-1450 BCE) and the Gumugou cemetery (1900-1800 BCE) had many common features shared, and so is the Keliyahe northern cemetery:

  • Cemeteries were located in sandy areas;
  • Rectangular/boat-shaped wooden coffins with monuments of wooden planks or poles;
  • Coffins had no bottoms;
  • The dead were placed lying straight on the back;
  • The dead were commonly buried in single graves.

The Gumugou cemetery contained six special sun-radiating-spokes burial pattern in addition to the normal burials, which were similar to the wooden coffin graves of the Xiaohe cemetery.

NOTE. For more on Xiaohe and Gumugou, see the recent post on Proto-Tocharians. See other papers on the Andronovo horizon for other Early to Middle Bronze Age cultural groups less clearly associated with the Xiaohe horizon, like Hazandu, Xintala, or the Chust culture.

From Shuicheng (2006):

An assemblage of early bronzes had been recovered from northwestern Xinjiang and the periphery of Dzungaria 准噶尔 Basin. It comprises a variety of utilitarian tools and weapons, and a small number of apparels. These artifacts bear the stamps of Andronovo Culture in form, artifact type and decorative pattern. The metallographic analysis on selected artifacts indicates that they comprise mainly of tin-bronzes that contain 2–10% of tin. Moreover, the chemical compositions of these artifacts are similar to that of the Andronovo Culture. Latter date (first half of the 1st millennium BC) artifacts of the assemblage include a small number of arsenic bronzes. In all, during the period between the mid-2nd and mid-1st millennium BC, copper and bronze artifacts coexisted in this region, albeit tin-bronze comprised the majority. The composition of alloy did not show significant change over time. Some colleagues pointed out that the Nulasai 奴拉赛 site at Nileke 尼勒克 County in the Yili 伊犁 River basin of Xinjiang was the pioneer in the use of “sulphuric ore–ice copper–copper” technology. It is also the only early smelting site in Euro-Asia that arsenic ore was added to deliberately produce an alloy.

tocharians-bronze-age-middle
Prehistoric cultures of Xinjiang during the Middle Bronze Age. See full culture and ancient DNA maps.

3.2. The Hami Basin-the Balikun Grassland

From Yang (2019):

The Hami Basin-the Balikun Grassland area is located at the eastern part of Tian Shan. The area is divided in a northern basin and a southern basin by the east-west stretch of the Tian Shan. In the Hami Basin-the Balikun Grassland area, the main type of burials were earth-pit graves in the early Bronze Age, and burials of stone-pit with barrows became more common in the late Bronze Age. The Hami-Tianshan-Beilu cemetery is a representative of the earth-pit graves. The features of the Hami-Tianshan-Beilu cemetery (2000-1500 bce) here were:

  • Rectangular earth pit graves;
  • The dead were often in a hocker position lying on one side;
  • Commonly a single dead in one grave.
balikun-grassland
The Balikun grassland today (source).

The Hami-Wubu cemetery (earlier than 1000 bce) and the Yanbulake cemetery (1200-600 bce) are representatives of another common earth-pit graves. Common features here were:

  • Rectangular earth pits, with two storeys and/or roofed with wooden boards;
  • The dead was placed in a hocker position lying on one side;
  • Mostly a single dead in one grave.

Later there appeared more stone-pit graves in this area, and the features can be summarized as:

  • Round burial mounds, commonly constructed by stones or a mix of stones and earth;
  • Burial mounds with a sunken top or a normal (dome) top;
  • The diameter of the burial mounds varied between 3 and 25.4 m (but not necessarily limited in this scope);
  • Circular or rectangular stone kerbs;
  • Rectangular stone pits, constructed by earth, or stones, or a mix of earth and stones;
  • Rectangular stone pits contained wooden coffins (represented by the Yiwu Baiqi’er cemetery).
hami-basin-balikun-grassland-iron-age-burials
Some representatives of stone burials in the Hami Basin – the Balikun Grassland in the Iron Age (Adapted from: Xinjiang 2011, 29-41). Image modified from Yang (2019).

In the Hami Basin, the Bronze Age cemeteries show common burial features like earth pits and hocker position of the dead. With similar pottery styles in the Hami-Tianshan-Beilu cemetery to those in the Machang and Siba cultures (Xinjiang 2011: 17), it suggests possible cultural influence or people’s migrating from the Hexi Corridor in the east.

In the Balikun Grassland, burials in an earlier time contained mostly earth-pit graves but also a small number of stone-pit graves. The pebbles were imbedded in the floors and the walls of the graves in a rectangular shape, e.g. the Balikun-Nanwan cemetery (1600-1000 bce). In a later time, there appeared huge burial mounds with a sunken top, and with the diameters of the burial mounds varying from 3 to 25.4 m, e.g. the Balikun-Dongheigou cemetery and the Balikun-Heigouliang cemetery. The Yiwu-Bai’erqi and the Yiwu-Kuola cemeteries contained either round stone burial mounds or circular stone kerbs on the ground surface. Considering the three burial elements including burial mounds, stone pits and circular kerbs, the later period cemeteries in the Balikun Grassland were actually similar to cemeteries from the southern edge of the Altai Mountain area.

From Shuicheng (2006):

The Nanwan 南湾 cemetery site at Kuisu 奎苏 Town, Balikun 巴里坤 (1600–1100 BC) also yielded an assemblage of early bronzes. The style of its early phase artifacts is similar to that of the burials distributed in the North Tianshan Route. Some sorts of cultural connection should have existed between the two.

The dates of Yanbulake 焉不拉克 Culture (1300–700 BC) are comparatively late. Its metallurgy was a continuation of the western China tradition. Artifact types include a variety of utilitarian tools, weapons and apparels.

tocharians-bronze-age-late
Prehistoric cultures of Xinjiang during the Late Bronze Age. See full culture and ancient DNA maps.

3.3. The Turpan Basin-the middle part of Tian Shan

From Yang (2019):

Turpan Basin is located at the western part of the Hami Basin, and lies at the southern edge of the eastern Tian Shan. In the Turpan Basin-the middle part of Tian Shan area, the main representative of the Bronze Age cemeteries is the Yanghai Nr.1 cemetery. The features here were:

  • Elliptic earth pit graves, commonly covered by round logs on the top;
  • Some graves contained burial beds made of round logs or reeds;
  • The dead were mainly placed lying straight on the back;
  • Mostly a single dead in one grave.

In Iron Age, the stone burials became dominant, but the stone burials varied in different regions of the Turpan Basin-the middle part of Tian Shan area. Graves containing burial mounds, stone pit, and circular stone kerbs are represented by the Shanshan-Ertanggou cemetery, the Tuokexun-Alagou cemetery, the Urumqi-Chaiwobu cemetery and the Urumqi-Yizihu-Sayi cemetery, etc. The stone funeral construction features here are similar to those contemporary cemeteries in the Hami Basin-the Balikun Grassland area.

3.4. The southern edge of the western and middle part of Tian Shan

In the southern edge of the western and middle part of Tian Shan area, the main representatives of the late Bronze Age cemeteries are the Hejing-Chawuhu Nr.4 cemetery (around 1000-500 bce), the Hejing-Xiaoshankou cemetery, the Baicheng-cemetery, etc. The main burial features of the late Bronze Age and the early Iron Age cemeteries (see Fig.12) here were:

  • Burial mounds, constructed by stones or a mix of stones and earth;
  • Irregular circular or rectangular stone kerbs;
  • Stone pit graves in a bell-shape or a rectangular shape;
  • Stone pit graves constructed by imbedding pebbles or stone slabs in walls and floors;
  • The dead were often placed lying on their back with bent legs;
  • The dead were commonly reburied a second time with multiple burials.

From the late Bronze Age to the early Iron Age in this area, the burial traditions tended to be in a more varied way. In the stone burials with stone kerbs, there is a mixture of stone pit and earth pit graves. The burial features of the Iron Age cemeteries in this section were similar to those contemporary both in the Hami Basin-the Balikun Grassland area and in the Turpan Basin-the middle part of Tian Shan area.

From Shuicheng (2006):

The Chawuhu 察吾呼 Culture (1100–500 BC) distributes on the foothills between the middle section of the Tianshan Mountain Ranges and Tarim River. Its bronze assemblage comprises a variety of weapons, utilitarian tools and small apparels. They show no apparent temporal change in form and type through the four cultural phases. In addition, bronzes bearing the Chawuhu characteristics were found in Hejing 和静, Baicheng 拜城 and Luntai 轮台 (Bügür). Yet, sites distributed along the Tarim River, such as Heshuo 和硕, Kuqa 库车 and Aksu 阿克苏, yielded remains of a bronze culture different from that of Chawuhu. Bronzes recovered include double-eared socketed axe, arrowheads, awls, knives, needles and bracelets. Their absolute dates have been estimated to be earlier than that of Chawuhu.

tocharians-iron-age-early
Prehistoric cultures of Xinjiang during the Early Iron Age. See full culture and ancient DNA maps.

3.5. The Pamir Plateau

From Yang (2019):

A typical Bronze Age cemetery from the Pamir Plateau area is the Tashenku’ergan-Xiabandi cemetery (around 1000-500 bce). The burial features here were:

  • Mainly inhumations, but also a few cremations;
  • Burial mounds, constructed of stones;
  • Irregular circular or rectangular stone kerbs;
  • Mostly a single dead in one grave;
  • The dead was placed in a hocker position lying on one side.

The adoption of burial customs from the east supports the migration of Afanasievo-related peoples from the Tian Shan up to the Pamir Plateau, strongly influencing the findings of the Xiabandi cemetery, which has been dated from an early Bronze Age phase (ca. 1500-300 BC) to a late date up to ca. 600 BC.

While it is still unclear how far Afanasievo admixture reached into western Xinjiang, it seems that the Pamir Plateau remained culturally connected to neighbouring Andronovo-related cultures in pottery and metallurgical innovations; hence their language probably belonged – during most of the Bronze and Iron Ages – to the Indo-Iranian branch, even though specific dialects might have changed with each new attested group.

In particular, it is possible that the early Andronovo groups related to the Xiaohe Horizon spoke Indo-Aryan or West Iranian dialects, while Saka-related groups replaced them – or an intermediate Tocharian-speaking group – with East Iranian dialects. A close interaction with West Iranian would justify the known ancient borrowings of Tocharian, although they could also be explained by contacts with Chust-related groups farther west. For more on this, see Gerd Carling’s work on the different layers of Iranian loans.

Xinjiang BA/IA Summary

From Yang (2019):

In the early Bronze Age, there are distinct regional differences in the burial customs in and surrounding the Tarim Basin. At the southern edge of the Altai Mountains area, the burial customs included stone burial mounds, stone pit graves, circular or rectangular stone kerbs and stone human sculptures; the dead were placed lying straight on the back. In the Hami Basin-the Balikun Grassland area, the burial customs included earth pit graves; the dead were placed in a hocker position lying on one side. In the Turpan Basin-the middle part of Tian Shan area, the burial customs included earth pit graves; the dead were placed lying straight on the back. In the Lop Nur region, the burial customs included wooden coffins buried in sand; the dead were placed lying straight on the back.

But from the late Bronze Age to the early Iron Age, there was a common shift in burial customs from earth pit graves to stone burials in the Hami Basin-the Balikun Grassland area and in the Turpan Basin-the middle part of Tian Shan area. The main features of the stone burials include stone burial mounds, circular or rectangular stone kerbs, and the stone pit graves in the cemeteries. Similar stone burial customs commonly appeared at the southern edge of the western and middle part of Tian Shan area and the Pamir Plateau area in Iron Age. The burial features in most areas are in a mixture of both the earth pit graves and stone pit graves, especially in the Hami Basin-the Balikun Grassland area and the Turpan Basin-the middle part of Tian Shan area.

xinjiang-bronze-age-iron-age

From Shuicheng (2006):

Historians of metallurgy conducted metallographic analyses on a sample of 234 metal specimens recovered from 16 localities in eastern Xinjiang. They concluded that the metallurgic industry in eastern Xinjiang could be roughly partitioned into three developmental phases. The early phase is represented by the burials distributed in the North Tianshan Route. The majority of the metal assemblage was tin-bronzes; however, copper and arsenic-bronzes maintained considerable proportions. The middle phase is represented by the burials at Yanbulake. During this phase, tin-bronze still maintained the majority; the proportion of arsenic-bronze increased, and some of them were high arsenic-bronzes. The late phase is represented by the burials at Heigouliang 黑沟梁. The composition of lead increased in the bronze alloy in the expense of arsenic. In addition, this phase witnessed the appearance of high tin-bronze that composed up to 16% of tin and the appearance of brass, that is, an alloy of copper and zinc. The bronze alloy consistently contained significant amount of impurities regardless of temporal difference. Casting and forging technologies coexisted throughout the three phases. The early bronzes (2000–500 BC) of eastern Xinjiang, in general, contained arsenic; however, the composition of arsenic was usually under 8%, but a few artifacts contained more than 20% arsenic. In all, arsenic had long been used in the alloy-forming of the early bronzes in eastern Xinjiang. Consequently, arsenic-bronzes were widely found in the prehistoric archaeology of the region. The artifact types, chemical compositions and manufacture techniques of the bronze assemblage of the burials of the North Tianshan Route are similar to those of Siba Culture, indicating that eastern Xinjiang had played a significant role in the East-West interactions.
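The alloy categories named in this passage (copper, tin-bronze, arsenic-bronze, brass) can be illustrated with a minimal Python sketch. The 2% cut-off for treating an element as a deliberate alloying component, the function name and the example compositions are all assumptions made for illustration; they are not taken from Shuicheng (2006) or from the metallographic study it summarises.

```python
from typing import Dict

# Assumed threshold (weight %) above which an element counts as an alloying
# component rather than an impurity. Hypothetical; not from the cited study.
DELIBERATE_ALLOY_PCT = 2.0

# Alloy names used in the quoted passage, keyed by the alloying element.
ALLOY_NAMES = {"Sn": "tin-bronze", "As": "arsenic-bronze", "Zn": "brass"}


def classify_alloy(composition: Dict[str, float]) -> str:
    """Label a copper-based object by its dominant alloying element.

    `composition` maps element symbols to weight percent,
    e.g. {"Sn": 6.0, "As": 0.5}. Copper itself is implied.
    """
    candidates = [
        (pct, element)
        for element, pct in composition.items()
        if element in ALLOY_NAMES and pct >= DELIBERATE_ALLOY_PCT
    ]
    if not candidates:
        return "copper"
    # The element with the highest percentage decides the label.
    _, dominant = max(candidates)
    return ALLOY_NAMES[dominant]


# Hypothetical specimens, for illustration only.
print(classify_alloy({"Sn": 6.0, "As": 0.5}))  # tin-bronze
print(classify_alloy({"As": 3.0}))             # arsenic-bronze
print(classify_alloy({"Zn": 12.0}))            # brass
print(classify_alloy({"Sn": 0.3}))             # copper
```

A single threshold is the simplest possible rule; a real typology would also need to handle mixed alloys, such as the lead-bearing bronzes of the late phase.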

tocharians-iron-age-late
Prehistoric cultures of Xinjiang during the Late Iron Age. See full culture and ancient DNA maps.

Tocharians in population genomics

Prehistoric population movements between the Altai and the Tian Shan are difficult to pinpoint, not least because of the division of these territories among three different countries and their archaeological teams, only recently (more) open to international scholarship.

The available schematic archaeological picture, where migrations could only be roughly inferred, has been recently updated to a great extent by Ning, Wang et al. (2019), whose genetic analysis of the samples is as thorough as anyone could have asked for, with a level of detail which matches the complex genetic picture of the region by the Iron Age.

As a summary, here is what they described about the samples from Shirenzigou (ca. 400-200 BC), corresponding to the Iron Age populations of the Hami Basin-the Balikun Grassland area, and closely related to the preceding Yanbulake Culture:

As shown in Figure S3, the Steppe_MLBA populations including Srubnaya, Andronovo, and Sintashta were shifted toward farming populations compared with Yamnaya groups and the Shirenzigou samples. This observation is consistent with ADMIXTURE analysis that Steppe_MLBA populations have an Anatolian and European farmer-related component that Yamnaya groups and the Shirenzigou individuals do not seem to have. The analysis consistently suggested Yamnaya-related Steppe populations were the better source in modeling the West Eurasian ancestry in Shirenzigou.

biplot-yamnaya-tocharians-shirenzigou
Biplot of f3-outgroup tests illustrating the Kostenki14 and Anatolia_N like ancestries in Shirenzigou individuals. Most Shirenzigou individuals were on a cline with Yamnaya and European hunter-gatherer groups, lacking the European farmer ancestry as compared to the Steppe_MLBA populations such as Andronovo, Srubnaya and Sintashta [S1-S5]. Horizontal and vertical bars represent ± 3 standard errors, corresponding to form of outgroup f3 tests on the x axis and y axis respectively.

We continued to use qpAdm to estimate the admixture proportions in the Shirenzigou samples by using different pairs of source populations, such as Yamnaya_Samara, Afanasievo, Srubnaya, Andronovo, BMAC culture (Bustan_BA and Sappali_Tepe_BA) and Tianshan_Hun as the West Eurasian source and Han, Ulchi, Hezhen, Shamanka_EN as the East Eurasian source. In all cases, Yamnaya, Afanasievo, or Tianshan_Hun always provide the best model fit for the Shirenzigou individuals, while Srubnaya, Andronovo, Bustan_BA and Sappali_Tepe_BA only work in some cases.

p-values-shirenzigou-samples-han-chinese
Table S2. P values in modelling a two-way (P=rank 1) admixture in Shirenzigou samples using each of the four populations (Bustan_BA, Sappali_Tepe_BA, Andronovo.SG, Srubnaya) together with Han Chinese as two sources [S6], Related to Figure 2. We used the following set of outgroups populations: Dinka, Ust_Ishim, Kostenki14, Onge, Papuan, Australian, Iran_N, EHG, LBK_EN.

shirenzigou-afanasievo-yamnaya-andronovo-srubna-ulchi-han

In the PCA, ADMIXTURE, outgroup f3 statistics [see Figure S4], as well as f4 statistics (Table S3), we observed the Shirenzigou individuals were closer to the present day Tungusic and Mongolic-speaking populations in northern Asia than to the populations in central and southern China, suggesting the northern populations might contribute more to the Shirenzigou individuals. Based on this, we then modeled Shirenzigou as a three-way admixture of Yamnaya_Samara, Ulchi (or Hezhen) and Han to infer the source from the East Eurasia side that contributed to Shirenzigou. We found the Ulchi or Hezhen and Han-related ancestry had a complicated and unevenly distribution in the Shirenzigou samples. The most Shirenzigou individuals derived the majority of their East Eurasian ancestry from Ulchi or Hezhen-related populations, while the following two individuals M820 and M15-2 have more Han related than Ulchi/Hezhen-related ancestry.
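For readers unfamiliar with the outgroup f3 statistics mentioned in the excerpts above, the following Python sketch shows the basic idea: f3(O; A, B) averages (o - a)(o - b) over SNPs, where o, a and b are allele frequencies in the outgroup and in the two populations compared. Higher values mean more drift shared by A and B since they split from the outgroup. This is a simplified illustration and not the pipeline used by Ning, Wang et al. (2019): it omits the finite-sample bias correction and the block-jackknife standard errors that tools such as ADMIXTOOLS compute, and every frequency below is made up.

```python
from typing import Sequence


def outgroup_f3(
    outgroup: Sequence[float],
    pop_a: Sequence[float],
    pop_b: Sequence[float],
) -> float:
    """Naive outgroup f3(O; A, B): mean over SNPs of (o - a) * (o - b).

    Each argument holds per-SNP allele frequencies in the same SNP order.
    No bias correction or standard error is computed here.
    """
    if not (len(outgroup) == len(pop_a) == len(pop_b)) or len(outgroup) == 0:
        raise ValueError("need equal-length, non-empty frequency vectors")
    total = 0.0
    for o, a, b in zip(outgroup, pop_a, pop_b):
        total += (o - a) * (o - b)
    return total / len(outgroup)


# Hypothetical allele frequencies at four SNPs (illustration only).
outgroup = [0.0, 1.0, 0.0, 1.0]
target = [0.5, 0.5, 0.5, 0.5]
close_ref = [0.5, 0.5, 0.5, 0.5]  # drifted away from the outgroup like the target
far_ref = [0.1, 0.9, 0.1, 0.9]    # stayed close to the outgroup

# The reference sharing more drift with the target gets the higher f3.
print(outgroup_f3(outgroup, target, close_ref) > outgroup_f3(outgroup, target, far_ref))
```

In a biplot like the one above, each axis is one such statistic computed against a different reference population, which is why individuals fall along clines between the reference groups.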

It is unclear whether the Chemurchek population will show a sizeable local contribution from neighbouring groups. The fact that Okunevo shows 20% Yamnaya-related ancestry strongly supports the nature of neighbouring stone-grave-building peoples of the Altai and the northern Tian Shan as mostly Afanasievo-like, and the apparent lack of contributions of Srubna/Andronovo-like ancestry in the early Hami-Balikun stone burial builders also speaks for radical population replacement events reaching the areas south of Tian Shan, at least initially.

While ancestry cannot settle linguistic questions, it seems that nomads of the Gansu and Qinghai grasslands retained an ancestry close to Andronovo, whereas nomads of the Hami Basin-Balikun grasslands and related populations of Xinjiang remained closely related to Afanasievo. This doesn’t preclude that the ancestors of the Yuezhi became acculturated under the influence of peoples from eastern Xinjiang, but all data combined suggest an isolation of both populations – relative to other groups and to each other – and it is therefore more likely that they spoke Indo-Iranian-related languages rather than a language of the Tocharian branch.

Haplogroups

In an interesting twist of events, despite the initially reported hg. R1b and Q, Tocharians from Shirenzigou actually show a haplogroup diversity comparable to that attested in other late Iron Age populations: a similar diversity is seen, for example, among Germanic, Baltic, and Balto-Finnic peoples of the Baltic region; among East Germanic or Scythians of the north Pontic region; or among Mediterranean peoples sampled to date. Iron Age peoples show thus a complex sociopolitical setting that overcame the previous patrilineal homogeneity of Bronze Age expansions.

tocharians-pca
PCA and ADMIXTURE for Shirenzigou samples. Modified from the paper to include labels of modern populations and a dotted line marking the cline formed by Shirenzigou, from (Yamnaya-like) Afanasievo to Central and East Asian-like populations. In black squares, samples related to Yamnaya. In red circles, samples with best fit for Andronovo-like ancestry. In green circles, samples with Han-related admixture.

M15-2 (with Han-related ancestry) is of the rare haplogroup Q1a-M120, while the samples with highest Steppe_MLBA-related ancestry are of hg. R1b-PH155, which points to their recent origin among Yuezhi, or to Hun-related populations showing an admixture related to the proto-historic nomads of the Gansu and Qinghai grasslands.

The expansion of Chemurchek-related peoples was probably associated more with hg. Q1a (dubious if it’s a Pre-ISOGG 2017 nomenclature, hence possibly Q1b), a haplogroup that might be found in Khvalynsk as a “significant minority” according to Anthony (2019), and it might also be attested in sampled individuals from Afanasievo in its late phase. This might be, therefore, a case similar to the early expansion of Indo-Europeans with R1b-V1636 lineages through the Volga – North Caucasus region, and of the later expansion with I2a-L699 lineages into the Balkans.

Haplogroup Q1a2-M25 is found in individual X3, whose Steppe ancestry is likely a combination of Afanasievo plus Andronovo-like ancestry heavily admixed with Hezhen/Ulchi-like populations, in line with the expected recent contacts with the neighbouring Xiongnu, Yuezhi, and other population movements affecting eastern Xinjiang.

Sample M4, which packs the most Afanasievo-like ancestry, is of hg. R1a-Z645, which – like sample M8R1 of hg. O – is most likely related to haplogroup resurgence events of local populations, which left the predominant Afanasievo-like admixture brought by builders of stone burials essentially intact, evidenced by the almost 100% of R1a found in the Xiaohe cemetery – and in most of the early Andronovo horizon – and among expanding Kangju and Wusun, as well as by the prevalence of hg. O among sampled East Asian populations.

A question that will only be answered with more samples is how and when the prevalent R1b-L23 and Q1b lineages among Afanasievo-related peoples began to be replaced to reach the high variability seen in Shirenzigou. Given the pastoralist nature of peoples around Tian Shan, the succeeding expansions of Proto-Tocharians, and the late isolation of different Common Tocharian groups, it is more than likely that this variability represents a late and local phenomenon within Xinjiang itself.

tocharians-antiquity
Peoples of Xinjiang during Antiquity. See full culture and ancient DNA maps.

Conclusion

Tocharians are one of the main pillars that confirm the Late Proto-Indo-European homeland of the R1b-rich populations of the Don-Volga region. There is already:

Just like the East Bell Beaker expansion from Yamnaya Hungary has confirmed that Corded Ware peoples did not partake in spreading Indo-European languages (spreading Uralic languages instead), data on the expansion of Tocharian speakers from Afanasievo to the Tian Shan was always there; population genomics is merely helping to connect the dots.

In summary, genetic research is supporting the expected linguistic expansions of the Neolithic and Bronze Age step by step, slowly but surely.

Related

Yamnaya ancestry: mapping the Proto-Indo-European expansions

steppe-ancestry-expansion-europe

The latest papers from Ning et al. Cell (2019) and Anthony JIES (2019) have offered some interesting new data, supporting once more what could be inferred since 2015, and what was evident in population genomics since 2017: that Proto-Indo-Europeans expanded under R1b bottlenecks, and that the so-called “Steppe ancestry” referred to two different components, one – Yamnaya or Steppe_EMBA ancestry – expanding with Proto-Indo-Europeans, and the other one – Corded Ware or Steppe_MLBA ancestry – expanding with Uralic speakers.

The following maps are based on formal stats published in the papers and supplementary materials from 2015 until today, mainly on Wang et al. (2018 & 2019), Mathieson et al. (2018) and Olalde et al. (2018), and others like Lazaridis et al. (2016), Lazaridis et al. (2017), Mittnik et al. (2018), Lamnidis et al. (2018), Fernandes et al. (2018), Jeong et al. (2019), Olalde et al. (2019), etc.

NOTE. As in the Corded Ware ancestry maps, the selected reports in this case are centered on the prototypical Yamnaya ancestry vs. other simplified components, so everything else refers to simplistic ancestral components widespread across populations that do not necessarily share any recent connection, much less a language. In fact, most of the time they clearly didn’t. They can be interpreted as “EHG that is not part of the Yamnaya component”, or “CHG that is not part of the Yamnaya component”. They can’t be read as “expanding EHG people/language” or “expanding CHG people/language”, at least no more than maps of “Steppe ancestry” can be read as “expanding Steppe people/language”. Also, remember that I have left the default behaviour for color classification, so that the highest value (i.e. 1, or white colour) could mean anything from 10% to 100% depending on the specific ancestry and period; that’s what the legend is for… But, fere libenter homines id quod volunt credunt.
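To see why the legend matters, here is a minimal sketch of that kind of per-map colour stretch. It assumes that the default classification rescales each layer between its own minimum and maximum; the function name `stretch` and the sample percentages are hypothetical and are not taken from the maps.

```python
def stretch(values):
    """Rescale a layer to the 0-1 range using its own minimum and maximum.

    Assumption: this mimics a default min/max colour stretch, where 1 is
    drawn in the lightest colour regardless of the absolute percentage.
    """
    lo, hi = min(values), max(values)
    if hi == lo:
        # A flat layer has no contrast to stretch.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical ancestry percentages for two different maps.
low_ancestry_map = [0.0, 5.0, 10.0]     # peaks at 10%
high_ancestry_map = [0.0, 50.0, 100.0]  # peaks at 100%

low_scaled = stretch(low_ancestry_map)
high_scaled = stretch(high_ancestry_map)
```

Both hypothetical layers end up on an identical 0-1 colour scale, so the same colour stands for 10% in one map and 100% in the other; only the legend tells them apart.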

Sections:

  1. Neolithic or the formation of Early Indo-European
  2. Eneolithic or the expansion of Middle Proto-Indo-European
  3. Chalcolithic / Early Bronze Age or the expansion of Late Proto-Indo-European
  4. European Early Bronze Age and MLBA or the expansion of Late PIE dialects

1. Neolithic

Anthony (2019) agrees with the most likely explanation of the CHG component found in Yamnaya, as derived from steppe hunter-fishers close to the lower Volga basin. The ultimate origin of this specific CHG-like component that eventually formed part of the Pre-Yamnaya ancestry is not clear, though:

The hunter-fisher camps that first appeared on the lower Volga around 6200 BC could represent the migration northward of un-admixed CHG hunter-fishers from the steppe parts of the southeastern Caucasus, a speculation that awaits confirmation from aDNA.

neolithic-chg-ancestry
Natural neighbor interpolation of CHG ancestry among Neolithic populations. See full map.

The typical EHG component that formed part eventually of Pre-Yamnaya ancestry came from the Middle Volga Basin, most likely close to the Samara region, as shown by the sampled Samara hunter-gatherer (ca. 5600-5500 BC):

After 5000 BC domesticated animals appeared in these same sites in the lower Volga, and in new ones, and in grave sacrifices at Khvalynsk and Ekaterinovka. CHG genes and domesticated animals flowed north up the Volga, and EHG genes flowed south into the North Caucasus steppes, and the two components became admixed.

neolithic-ehg-ancestry
Natural neighbor interpolation of EHG ancestry among Neolithic populations. See full map.

To the west, in the Dnieper-Dniester area, WHG became the dominant ancestry after the Mesolithic, at the expense of EHG, revealing a likely mating network reaching to the north into the Baltic:

Like the Mesolithic and Neolithic populations here, the Eneolithic populations of Dnieper-Donets II type seem to have limited their mating network to the rich, strategic region they occupied, centered on the Rapids. The absence of CHG shows that they did not mate frequently if at all with the people of the Volga steppes (…)

neolithic-whg-ancestry
Natural neighbor interpolation of WHG ancestry among Neolithic populations. See full map.

North-West Anatolia Neolithic ancestry, proper of expanding Early European farmers, is found up to the border of the Dniester, as Anthony (2007) had predicted.

neolithic-anatolia-farmer-ancestry
Natural neighbor interpolation of Anatolia Neolithic ancestry among Neolithic populations. See full map.

2. Eneolithic

From Anthony (2019):

After approximately 4500 BC the Khvalynsk archaeological culture united the lower and middle Volga archaeological sites into one variable archaeological culture that kept domesticated sheep, goats, and cattle (and possibly horses). In my estimation, Khvalynsk might represent the oldest phase of PIE.

(…) this middle Volga mating network extended down to the North Caucasian steppes, where at cemeteries such as Progress-2 and Vonyuchka, dated 4300 BC, the same Khvalynsk-type ancestry appeared, an admixture of CHG and EHG with no Anatolian Farmer ancestry, with steppe-derived Y-chromosome haplogroup R1b. These three individuals in the North Caucasus steppes had higher proportions of CHG, overlapping Yamnaya. Without any doubt, a CHG population that was not admixed with Anatolian Farmers mated with EHG populations in the Volga steppes and in the North Caucasus steppes before 4500 BC. We can refer to this admixture as pre-Yamnaya, because it makes the best currently known genetic ancestor for EHG/CHG R1b Yamnaya genomes.

From Wang et al. (2019):

Three individuals from the sites of Progress 2 and Vonyuchka 1 in the North Caucasus piedmont steppe (‘Eneolithic steppe’), which harbour EHG and CHG related ancestry, are genetically very similar to Eneolithic individuals from Khvalynsk II and the Samara region. This extends the cline of dilution of EHG ancestry via CHG-related ancestry to sites immediately north of the Caucasus foothills

eneolithic-pre-yamnaya-ancestry
Natural neighbor interpolation of Pre-Yamnaya ancestry among Eneolithic populations. See full map. This map corresponds roughly to the map of Khvalynsk-Novodanilovka expansion, and in particular to the expansion of horse-head pommel-scepters (read more about Khvalynsk, and specifically about horse symbolism).

NOTE. Unpublished samples from Ekaterinovka have been previously reported as within the R1b-L23 tree. Interestingly, although the Varna outlier is a female, the Balkan outlier from Smyadovo shows two positive SNP calls for hg. R1b-M269. However, its poor coverage makes its most conservative haplogroup prediction R-M343.

The formation of this Pre-Yamnaya ancestry sets this Volga-Caucasus Khvalynsk community apart from the rest of the EHG-like population of eastern Europe.

eneolithic-ehg-ancestry
Natural neighbor interpolation of non-Pre-Yamnaya EHG ancestry among Eneolithic populations. See full map.

Anthony (2019) seems to rely on ADMIXTURE graphics when he writes that the late Sredni Stog sample from Alexandria shows “80% Khvalynsk-type steppe ancestry (CHG&EHG)”. While this seems the most logical conclusion of what might have happened after the Suvorovo-Novodanilovka expansion through the North Pontic steppes (see my post on “Steppe ancestry” step by step), formal stats have not confirmed that.

In fact, analyses published in Wang et al. (2019) rejected that Corded Ware groups are derived from this Pre-Yamnaya ancestry, a reality that had been already hinted in Narasimhan et al. (2018), when Steppe_EMBA showed a poor fit for expanding Srubna-Andronovo populations. Hence the need to consider the whole CHG component of the North Pontic area separately:

eneolithic-chg-ancestry
Natural neighbor interpolation of non-Pre-Yamnaya CHG ancestry among Eneolithic populations. See full map. You can read more about population movements in the late Sredni Stog and closer to the Proto-Corded Ware period.

NOTE. Fits for WHG + CHG + EHG in Neolithic and Eneolithic populations are taken in part from Mathieson et al. (2019) supplementary materials (download Excel here). Unfortunately, while data on the Ukraine_Eneolithic outlier from Alexandria abounds, I don’t have specific data on the so-called ‘outlier’ from Dereivka compared to the other two analyzed together, so these maps of CHG and EHG expansion are possibly showing a lesser distribution to the west than the real one ca. 4000-3500 BC.

eneolithic-whg-ancestry
Natural neighbor interpolation of WHG ancestry among Eneolithic populations. See full map.

Anatolia Neolithic ancestry clearly spread to the east into the north Pontic area through a Middle Eneolithic mating network, most likely opened after the Khvalynsk expansion:

eneolithic-anatolia-farmer-ancestry
Natural neighbor interpolation of Anatolia Neolithic ancestry among Eneolithic populations. See full map.
eneolithic-iran-chl-ancestry
Natural neighbor interpolation of Iran Chl. ancestry among Eneolithic populations. See full map.

Regarding Y-chromosome haplogroups, Anthony (2019) insists on the evident association of Khvalynsk, Yamnaya, and the spread of Pre-Yamnaya and Yamnaya ancestry with the expansion of elite R1b-L754 (and some I2a2) individuals:

eneolithic-early-y-dna
Y-DNA haplogroups in West Eurasia during the Early Eneolithic in the Pontic-Caspian steppes. See full map, and see culture, ADMIXTURE, Y-DNA, and mtDNA maps of the Early Eneolithic and Late Eneolithic.

3. Early Bronze Age

Data from Wang et al. (2019) show that Corded Ware-derived populations do not have good fits for Eneolithic_Steppe-like ancestry, no matter the model. In other words: Corded Ware populations show not only a higher contribution of Anatolia Neolithic ancestry (ca. 20-30% compared to the ca. 2-10% of Yamnaya); they show a different EHG + CHG combination compared to the Pre-Yamnaya one.

eneolithic-steppe-best-fits
Supplementary Table 13. P values of rank=2 and admixture proportions in modelling Steppe ancestry populations as a three-way admixture of Eneolithic steppe Anatolian_Neolithic and WHG using 14 outgroups.
Left populations: Test, Eneolithic_steppe, Anatolian_Neolithic, WHG.
Right populations: Mbuti.DG, Ust_Ishim.DG, Kostenki14, MA1, Han.DG, Papuan.DG, Onge.DG, Villabruna, Vestonice16, ElMiron, Ethiopia_4500BP.SG, Karitiana.DG, Natufian, Iran_Ganj_Dareh_Neolithic.

Yamnaya Kalmykia and Afanasievo show the closest fits to the Eneolithic population of the North Caucasian steppes, rejecting thus sizeable contributions from Anatolia Neolithic and/or WHG, as shown by the SD values. Both probably show then a Pre-Yamnaya ancestry closest to the late Repin population.

wang-eneolithic-steppe-caucasus-yamnaya
Modelling results for the Steppe and Caucasus cluster. Admixture proportions based on (temporally and geographically) distal and proximal models, showing additional AF ancestry in Steppe groups and additional gene flow from the south in some of the Steppe groups as well as the Caucasus groups. See tables above. Modified from Wang et al. (2019). Within a blue square, Yamnaya-related groups; within a cyan square, Corded Ware-related groups. Green background behind best p-values. In red circle, SD of AF/WHG ancestry contribution in Afanasevo and Yamnaya Kalmykia, with ranges that almost include 0%.

EBA maps include data from Wang et al. (2018) supplementary materials, specifically unpublished Yamnaya samples from Hungary that appeared in the analyses of the preprint, but which were taken out of the definitive paper. Their location among Yamnaya settlers from Hungary is speculative, although most uncovered kurgans in Hungary are concentrated in the Tisza-Danube interfluve.

eba-yamnaya-ancestry
Natural neighbor interpolation of Pre-Yamnaya ancestry among Early Bronze Age populations. See full map. This map corresponds roughly with the known expansion of late Repin/Yamnaya settlers.

The Y-chromosome bottleneck of elite males from Proto-Indo-European clans under R1b-L754 and some I2a2 subclades, already visible in the Khvalynsk sampling, became even more noticeable in the subsequent expansion of late Repin/early Yamnaya elites under R1b-L23 and I2a-L699:

chalcolithic-early-y-dna
Y-DNA haplogroups in West Eurasia during the Yamnaya expansion. See full map and maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Early Chalcolithic and Yamnaya Hungary.

Maps of CHG, EHG, Anatolia Neolithic, and probably WHG show the expansion of these components among Corded Ware-related groups in North Eurasia, apart from other cultures close to the Caucasus:

NOTE. For maps with actual formal stats of Corded Ware ancestry from the Early Bronze Age to the modern times, you can read the post Corded Ware ancestry in North Eurasia and the Uralic expansion.

eba-chg-ancestry
Natural neighbor interpolation of non-Pre-Yamnaya CHG ancestry among Early Bronze Age populations. See full map.
eba-ehg-ancestry
Natural neighbor interpolation of non-Pre-Yamnaya EHG ancestry among Early Bronze Age populations. See full map.
eba-whg-ancestry
Natural neighbor interpolation of WHG ancestry among Early Bronze Age populations. See full map.
eba-anatolia-farmer-ancestry
Natural neighbor interpolation of Anatolia Neolithic ancestry among Early Bronze Age populations. See full map.
eba-iran-chl-ancestry
Natural neighbor interpolation of Iran Chl. ancestry among Early Bronze Age populations. See full map.

4. Middle to Late Bronze Age

The following maps show the most likely distribution of Yamnaya ancestry during the Bell Beaker-, Balkan-, and Sintashta-Potapovka-related expansions.

4.1. Bell Beakers

The amount of Yamnaya ancestry is probably overestimated among populations where Bell Beakers replaced Corded Ware. A map of Yamnaya ancestry among Bell Beakers gets trickier for the following reasons:

  • Expanding Repin peoples of Pre-Yamnaya ancestry must have had admixture through exogamy with late Sredni Stog/Proto-Corded Ware peoples during their expansion into the North Pontic area, and Sredni Stog in turn had probably some Pre-Yamnaya admixture, too (although they don’t appear in the simplistic formal stats above). This is supported by the increase of Anatolia farmer ancestry in more western Yamna samples.
  • Later, Yamnaya admixed through exogamy with Corded Ware-like populations in Central Europe during their expansion. Even samples from the Middle to Upper Danube and around the Lower Rhine will probably show increasing contributions of Steppe_MLBA, at the same time as they show an increasing proportion of EEF-related ancestry.
  • To complicate things further, the late Corded Ware Esperstedt family (from ca. 2500 BC or later) shows, in turn, what seems like a recent admixture with Yamnaya vanguard groups, with the sample of highest Yamnaya ancestry being the paternal uncle of other individuals (all of hg. R1a-M417), suggesting that there might have been many similar Central European mating networks from the mid-3rd millennium BC on, of (mainly) Yamnaya-like R1b elites displaying a small proportion of CW-like ancestry admixing through exogamy with Corded Ware-like peoples who already had some Yamnaya ancestry.
mlba-yamnaya-ancestry
Natural neighbor interpolation of Yamnaya ancestry among Middle to Late Bronze Age populations (Esperstedt CWC site close to BK_DE, label is hidden by BK_DE_SAN). See full map. You can see how this map correlates with the map of Late Copper Age migrations and Yamnaya into Bell Beaker expansion.

NOTE. Terms like “exogamy”, “male-driven migration”, and “sex bias”, are not only based on the Y-chromosome bottlenecks visible in the different cultural expansions since the Palaeolithic. Despite the scarce sampling available in 2017 for analysis of “Steppe ancestry”-related populations, it appeared to show already a male sex bias in Goldberg et al. (2017), and it has been confirmed for Neolithic and Copper Age population movements in Mathieson et al. (2018) – see Supplementary Table 5. The analysis of male-biased expansion of “Steppe ancestry” in CWC Esperstedt and Bell Beaker Germany is, for the reasons stated above, not very useful to distinguish their mutual influence, though.

Based on data from Olalde et al. (2019), Bell Beakers from Germany are the closest sampled ones to expanding East Bell Beakers, and those close to the Rhine – i.e. French, Dutch, and British Beakers in particular – show a clear excess “Steppe ancestry” due to their exogamy with local Corded Ware groups:

Only one 2-way model fits the ancestry in Iberia_CA_Stp with P-value>0.05: Germany_Beaker + Iberia_CA. Finding a Bell Beaker-related group as a plausible source for the introduction of steppe ancestry into Iberia is consistent with the fact that some of the individuals in the Iberia_CA_Stp group were excavated in Bell Beaker associated contexts. Models with Iberia_CA and other Bell Beaker groups such as France_Beaker (P-value=7.31E-06), Netherlands_Beaker (P-value=1.03E-03) and England_Beaker (P-value=4.86E-02) failed, probably because they have slightly higher proportions of steppe ancestry than the true source population.

olalde-iberia-chalcolithic

The exogamy with Corded Ware-like groups in the Lower Rhine Basin seems at this point undeniable, as is the origin of Bell Beakers around the Middle-Upper Danube Basin from Yamnaya Hungary.

To avoid this excess “Steppe ancestry” showing up in the maps, since Bell Beakers from Germany pack the most Yamnaya ancestry among East Bell Beakers outside Hungary (ca. 51.1% “Steppe ancestry”), I equated this maximum with BK_Scotland_Ach (which shows ca. 61.1% “Steppe ancestry”, highest among western Beakers), and applied a simple rule of three for “Steppe ancestry” in Dutch and British Beakers.

NOTE. Formal stats for “Steppe ancestry” in Bell Beaker groups are available in Olalde et al. (2018) supplementary materials (PDF). I didn’t apply this adjustment to Bk_FR groups because of the R1b Bell Beaker sample from the Champagne/Alsace region reported by Samantha Brunel that will pack more Yamnaya ancestry than any other sampled Beaker to date, hence probably driving the Yamnaya ancestry up in French samples.
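The rule of three described above can be sketched as follows. The two reference percentages are the ones quoted in the text; the function name `rescale_steppe` and the 50.0% input are hypothetical, used only to illustrate the arithmetic, and the exact procedure used for the maps may differ in detail.

```python
# Reference values quoted in the text (approximate percentages).
GERMANY_BEAKER_MAX = 51.1   # "Steppe ancestry" in Bell Beakers from Germany
SCOTLAND_ACH_MAX = 61.1     # "Steppe ancestry" in BK_Scotland_Ach


def rescale_steppe(observed_pct,
                   reference_max=GERMANY_BEAKER_MAX,
                   observed_max=SCOTLAND_ACH_MAX):
    """Rule of three: map the western maximum onto the German maximum.

    observed_max corresponds to reference_max, so any observed value is
    scaled by the same ratio.
    """
    return observed_pct * reference_max / observed_max


# The western maximum is brought down to the German value by construction.
scotland_adjusted = rescale_steppe(SCOTLAND_ACH_MAX)

# Hypothetical Dutch or British Beaker group with 50.0% "Steppe ancestry".
hypothetical_adjusted = rescale_steppe(50.0)
```

Under these assumptions every Dutch or British value is multiplied by the same factor (51.1 / 61.1, about 0.84), so the ranking among those groups is preserved while the excess attributed to Corded Ware-like admixture is removed from the map.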

The most likely outcome in the following years, when Yamnaya and Corded Ware ancestry are investigated separately, is that Yamnaya ancestry will be much lower the farther away from the Middle and Lower Danube region, similar to the case in Iberia, so the map above probably overestimates this component in most Beakers to the north of the Danube. Even the late Hungarian Beaker samples, who pack the highest Yamnaya ancestry (up to 75%) among Beakers, represent likely a back-migration of Moravian Beakers, and will probably show a contribution of Corded Ware ancestry due to the exogamy with local Moravian groups.

Despite this decreasing admixture as Bell Beakers spread westward, the explosive expansion of Yamnaya R1b male lineages (in the words of David Reich) and the radical replacement of local ones – whether derived from Corded Ware or Neolithic groups – shows the true extent of the North-West Indo-European expansion in Europe:

chalcolithic-late-y-dna
Y-DNA haplogroups in West Eurasia during the Bell Beaker expansion. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Late Copper Age and of the Yamnaya-Bell Beaker transition.

4.2. Palaeo-Balkan

There is scarce data on Palaeo-Balkan movements yet, although it is known that:

  1. Yamnaya ancestry appears among Mycenaeans, with the Yamnaya Bulgaria sample being its best current ancestral fit;
  2. the emergence of steppe ancestry and R1b-M269 in the eastern Mediterranean was associated with Ancient Greeks;
  3. Thracians, Albanians, and Armenians also show R1b-M269 subclades and “Steppe ancestry”.

4.3. Sintashta-Potapovka-Filatovka

Interestingly, Potapovka is the only Corded Ware derived culture that shows good fits for Yamnaya ancestry, despite having replaced Poltavka in the region under the same Corded Ware-like (Abashevo) influence as Sintashta.

This proves that there was a period of admixture in the Pre-Proto-Indo-Iranian community between CWC-like Abashevo and Yamnaya-like Catacomb-Poltavka herders in the Sintashta-Potapovka-Filatovka community, probably more easily detectable in this group because of the specific temporal and geographic sampling available.

srubnaya-yamnaya-ehg-chg-ancestry
Supplementary Table 14. P values of rank=3 and admixture proportions in modelling Steppe ancestry populations as a four-way admixture of distal sources EHG, CHG, Anatolian_Neolithic and WHG using 14 outgroups.
Left populations: Steppe cluster, EHG, CHG, WHG, Anatolian_Neolithic
Right populations: Mbuti.DG, Ust_Ishim.DG, Kostenki14, MA1, Han.DG, Papuan.DG, Onge.DG, Villabruna, Vestonice16, ElMiron, Ethiopia_4500BP.SG, Karitiana.DG, Natufian, Iran_Ganj_Dareh_Neolithic.

Srubnaya ancestry shows a best fit with non-Pre-Yamnaya ancestry, i.e. with different CHG + EHG components – possibly because the more western Potapovka (ancestral to Proto-Srubnaya Pokrovka) also showed good fits for it. Srubnaya shows poor fits for Pre-Yamnaya ancestry probably because Corded Ware-like (Abashevo) genetic influence increased during its formation.

On the other hand, more eastern Corded Ware-derived groups like Sintashta and its more direct offshoot Andronovo show poor fits with this model, too, but their fits are still better than those including Pre-Yamnaya ancestry.

mlba-ehg-ancestry
Natural neighbor interpolation of non-Pre-Yamnaya EHG ancestry among Middle to Late Bronze Age populations. See full map.
mlba-chg-ancestry
Natural neighbor interpolation of non-Pre-Yamnaya CHG ancestry among Middle to Late Bronze Age populations. See full map.
mlba-anatolia-farmer-ancestry
Natural neighbor interpolation of Anatolia Neolithic ancestry among Middle to Late Bronze Age populations. See full map.
mlba-iran-chl-ancestry
Natural neighbor interpolation of Iran Chl. ancestry among Middle to Late Bronze Age populations. See full map.

NOTE. For maps with actual formal stats of Corded Ware ancestry from the Early Bronze Age to the modern times, you should read the post Corded Ware ancestry in North Eurasia and the Uralic expansion instead.

The bottleneck of Proto-Indo-Iranians under R1a-Z93 was not yet complete by the time when the Sintashta-Potapovka-Filatovka community expanded with the Srubna-Andronovo horizon:

early-bronze-age-y-dna
Y-DNA haplogroups in West Eurasia during the European Early Bronze Age. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Early Bronze Age.

4.4. Afanasevo

At the end of the Afanasevo culture, at least three samples show hg. Q1b (ca. 2900-2500 BC), which seemed to point to a resurgence of local lineages, despite continuity of the prototypical Pre-Yamnaya ancestry. On the other hand, Anthony (2019) makes this cryptic statement:

Yamnaya men were almost exclusively R1b, and pre-Yamnaya Eneolithic Volga-Caspian-Caucasus steppe men were principally R1b, with a significant Q1a minority.

Since the only available samples from the Khvalynsk community are R1b (x3), Q1a (x1), and R1a (x1), it seems strange that Anthony would talk about a “significant minority”, unless Q1a (potentially Q1b in the newer nomenclature) pops up in some more individuals among the ca. 30 new samples to be published. Because he also mentions I2a2 as appearing in one elite burial, it seems Q1a (like R1a-M459) will not appear under elite kurgans, although it is still possible that hg. Q1a was involved in the expansion of Afanasevo to the east.
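As a rough illustration of how little five samples can say about a “significant minority”, the sketch below computes the haplogroup shares from the counts given above and a Wilson score interval for the Q1a share. The interval is an addition made here for illustration only (it is not something Anthony or the cited papers report), and the function name `wilson_interval` is hypothetical.

```python
import math


def wilson_interval(successes, n, z=1.96):
    """Approximate 95% Wilson score interval for a proportion."""
    p = successes / n
    denom = 1 + z ** 2 / n
    centre = (p + z ** 2 / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z ** 2 / (4 * n ** 2)) / denom
    return centre - half, centre + half


# Published Khvalynsk counts as listed in the text.
counts = {"R1b": 3, "Q1a": 1, "R1a": 1}
total = sum(counts.values())

# Raw share of each haplogroup among the five samples.
shares = {hg: c / total for hg, c in counts.items()}

# Plausible range for the true Q1a share given one hit in five.
q1a_low, q1a_high = wilson_interval(counts["Q1a"], total)
```

With one Q1a in five samples the raw share is 20%, but the interval runs from roughly 4% to 62%, so the published sampling alone can neither support nor contradict the statement.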

middle-bronze-age-y-dna
Y-DNA haplogroups in West Eurasia during the Middle Bronze Age. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Middle Bronze Age and the Late Bronze Age.

Okunevo, which replaced Afanasevo in the Altai region, shows a majority of hg. Q1b, but also some R1b-M269 samples proper of Afanasevo, suggesting partial genetic continuity.

NOTE. Other sampled Siberian populations clearly show a variety of Q subclades that likely expanded during the Palaeolithic, such as Baikal EBA samples from Ust’Ida and Shamanka with a majority of Q1b, and hg. Q reported from Elunino, Sagsai, Khövsgöl, and also among peoples of the Srubna-Andronovo horizon (the Krasnoyarsk MLBA outlier), and in Karasuk.

From Damgaard et al. Science (2018):

(…) in contrast to the lack of identifiable admixture from Yamnaya and Afanasievo in the CentralSteppe_EMBA, there is an admixture signal of 10 to 20% Yamnaya and Afanasievo in the Okunevo_EMBA samples, consistent with evidence of western steppe influence. This signal is not seen on the X chromosome (qpAdm P value for admixture on X 0.33 compared to 0.02 for autosomes), suggesting a male-derived admixture, also consistent with the fact that 1 of 10 Okunevo_EMBA males carries a R1b1a2a2 Y chromosome related to those found in western pastoralists. In contrast, there is no evidence of western steppe admixture among the more eastern Baikal region Bronze Age (~2200 to 1800 BCE) samples.

This Yamnaya ancestry has been also recently found to be the best fit for the Iron Age population of Shirenzigou in Xinjiang – where Tocharian languages were attested centuries later – despite the haplogroup diversity acquired during their evolution, likely through an intermediate Chemurchek culture (see a recent discussion on the elusive Proto-Tocharians).

Haplogroup diversity seems to be common in Iron Age populations all over Eurasia, most likely due to the spread of different types of sociopolitical structures where alliances played a more relevant role in the expansion of peoples. A well-known example of this is the spread of Akozino warrior-traders in the whole Baltic region under a partial N1a-VL29-bottleneck associated with the emerging chiefdom-based systems under the influence of expanding steppe nomads.

early-iron-age-y-dna
Y-DNA haplogroups in West Eurasia during the Early Iron Age. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Early Iron Age and Late Iron Age.

Surprisingly, then, Proto-Tocharians from Shirenzigou pack up to 74% Yamnaya ancestry, in spite of the 2,000 years that separate them from the demise of the Afanasevo culture. They show more Yamnaya ancestry than any other population by that time, being thus a sort of Late PIE fossils not only in their archaic dialect, but also in their genetic profile:

shirenzigou-afanasievo-yamnaya-andronovo-srubna-ulchi-han

The recent intrusion of Corded Ware-like ancestry, as well as the variable admixture with Siberian and East Asian populations, both point to the known intense Old Iranian and Old/Middle Chinese contacts. The scarce Proto-Samoyedic and Proto-Turkic loans in Tocharian suggest a rather loose, probably more distant connection with East Uralic and Altaic peoples from the forest-steppe and steppe areas to the north (read more about external influences on Tocharian).

Interestingly, both R1b samples, MO12 and M15-2 – likely of Asian R1b-PH155 branch – show a best fit for Andronovo/Srubna + Hezhen/Ulchi ancestry, suggesting a likely connection with Iranians to the east of Xinjiang, who later expanded as the Wusun and Kangju. How they might have been related to Huns and Xiongnu individuals, who also show this haplogroup, is yet unknown, although Huns also show hg. R1a-Z93 (probably most R1a-Z2124) and Steppe_MLBA ancestry, earlier associated with expanding Iranian peoples of the Srubna-Andronovo horizon.

All in all, it seems that prehistoric movements explained through the lens of genetic research fit perfectly well the linguistic reconstruction of Proto-Indo-European and Proto-Uralic.

Related

Corded Ware ancestry in North Eurasia and the Uralic expansion

uralic-clines-nganasan

Now that it has become evident that Late Repin (i.e. Yamnaya/Afanasevo) ancestry was associated with the migration of R1b-L23-rich Late Proto-Indo-Europeans from the steppe in the second half of the 4th millennium BC, there’s still the question of how R1a-rich Uralic speakers of Corded Ware ancestry expanded, and how they spread their languages throughout North Eurasia.

Modern North Eurasians

I have been collecting information from the supplementary data of the latest papers on modern and ancient North Eurasian peoples, including Jeong et al. (2019), Saag et al. (2019), Sikora et al. (2018), or Flegontov et al. (2019), and I have tried to add up their information on ancestral components and their modern and historical distributions.

Fortunately, the current obsession with simplifying ancestry components into three or four general, atemporal groups, and the common use of the same ones across labs, make it very simple to merge data and map them.

Corded Ware ancestry

There is no doubt about the prevalent ancestry among Uralic-speaking peoples. A map isn’t needed to realize that, because ancient and modern data – like those recently summarized in Jeong et al. (2019) – prove it. But maps sure help visualize their intricate relationship better:

natural-modern-srubnaya-ancestry
Natural neighbor interpolation of Srubnaya ancestry among modern populations. See full map.
kriging-modern-srubnaya-ancestry
Kriging interpolation of Srubnaya ancestry among modern populations. See full map.

Interestingly, the regions with higher Corded Ware-related ancestry are in great part coincident with (pre)historical Finno-Ugric-speaking territories:

uralic-languages-modern
Modern distribution of Uralic languages, with ancient territory (in the Common Era) labelled and delimited by a red line. For more information on the ancient territory see here.

Edit (29/7/2019): Here is the full Steppe_MLBA ancestry map, including Steppe_MLBA (vs. Indus Periphery vs. Onge) in modern South Asian populations from Narasimhan et al. (2018), apart from the ‘Srubnaya component’ in North Eurasian populations. ‘Dummy’ variables (with 0% ancestry) have been included to the south and east of the map to avoid weird interpolations of Steppe_MLBA into Africa and East Asia.

modern-steppe-mlba-ancestry2
Natural neighbor interpolation of Steppe MLBA-like ancestry among modern populations. See full map.
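
The effect of the ‘dummy’ variables mentioned in the edit above can be illustrated with a minimal sketch. It uses inverse distance weighting (IDW) instead of the natural neighbor or kriging interpolation behind the maps, simply because IDW fits in a few lines. All coordinates and ancestry values are invented for illustration, and distances are computed naively on longitude/latitude degrees.

```python
import math

def idw(points, query, power=2.0):
    """Inverse distance weighted estimate at `query` from (lon, lat, value) points."""
    num = 0.0
    den = 0.0
    for lon, lat, value in points:
        dist = math.hypot(lon - query[0], lat - query[1])
        if dist == 0.0:
            return value  # query coincides with a sampled point
        weight = 1.0 / dist ** power
        num += weight * value
        den += weight
    return num / den

# Hypothetical sampled populations: (longitude, latitude, ancestry fraction).
samples = [(25.0, 60.0, 0.45), (50.0, 57.0, 0.40), (75.0, 30.0, 0.20)]

# Hypothetical 'dummy' anchors with 0% ancestry, placed south and east of the data.
dummies = [(20.0, 0.0, 0.0), (120.0, 30.0, 0.0)]

query = (22.0, 5.0)  # a location far south of any real sample
without_dummies = idw(samples, query)
with_dummies = idw(samples + dummies, query)
print(f"without dummies: {without_dummies:.3f}, with dummies: {with_dummies:.3f}")
```

Without the anchors, the estimate far from the data is just a blend of the distant real samples; with them, it is pulled toward zero, which is the behaviour the dummy points are meant to enforce at the edges of the map.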

Anatolia Neolithic ancestry

Also interesting are the patterns of non-CWC-related ancestry, in particular the apparent wedge created by expanding East Slavs, which seems to reflect the intrusion of central(-eastern) European ancestry into Finno-Permic territory.

NOTE. Read more on Balto-Slavic hydrotoponymy, on the cradle of Russians as a Finno-Permic hotspot, and about Pre-Slavic languages in North-West Russia.

natural-modern-lbk-en-ancestry
Natural neighbor interpolation of LBK EN ancestry among modern populations. See full map.
kriging-modern-lbk-en-ancestry
Kriging interpolation of LBK EN ancestry among modern populations. See full map.

WHG ancestry

The cline(s) between WHG, EHG, ANE, Nganasan, and Baikal HG are also simplified when some of them are excluded, in this case EHG, which is thus represented in part by WHG, and in part by more eastern ancestries (see below).

modern-whg-ancestry
Natural neighbor interpolation of WHG ancestry among modern populations. See full map.
kriging-modern-whg-ancestry
Kriging interpolation of WHG ancestry among modern populations. See full map.

Arctic, Tundra or Forest-steppe?

Data on Nganasan-related vs. ANE vs. Baikal HG/Ulchi-related ancestry is difficult to map properly, because these ancestry components are usually reported as mutually exclusive, when they are in fact clearly related in an ancestral cline formed by different ancient North Eurasian populations from Siberia.

When it comes to ascertaining the origin of the multiple CWC-related clines among Uralic-speaking peoples, the question is thus how to properly distinguish the proportions of WHG-, EHG-, Nganasan-, ANE-, or Baikal HG-related ancestral components in North Eurasia, i.e. how each dialectal group admixed with the regional groups which formed part of these clines east and west of the Urals.

The truth is, one ought to test specific ancient samples for each “Siberian” ancestry found in the different Uralic dialectal groups, but the simplistic “Siberian” label somehow gets a pass in many papers (see a recent example).

Below are qpAdm results with best fits for Ulchi ancestry, Afontova Gora 3 ancestry, and Nganasan ancestry. Some populations show good fits for more than one of them with similar proportions, so selecting one necessarily simplifies the distribution of the others.

Ulchi ancestry

modern-ulchi-ancestry
Natural neighbor interpolation of Ulchi ancestry among modern populations. See full map.
kriging-modern-ulchi-ancestry
Kriging interpolation of Ulchi ancestry among modern populations. See full map.

ANE ancestry

natural-modern-ane-ancestry
Natural neighbor interpolation of ANE ancestry among modern populations. See full map.
kriging-modern-ane-ancestry
Kriging interpolation of ANE ancestry among modern populations. See full map.

Nganasan ancestry

modern-nganasan-ancestry
Natural neighbor interpolation of Nganasan ancestry among modern populations. See full map.
kriging-modern-nganasan-ancestry
Kriging interpolation of Nganasan ancestry among modern populations. See full map.

Iran Chalcolithic

A simplistic Iran Chalcolithic-related ancestry is also seen in the Altaic cline(s) which (like Corded Ware ancestry) expanded from Central Asia into Europe – apart from its historical distribution south of the Caucasus:

modern-iran-chal-ancestry
Natural neighbor interpolation of Iran Chalcolithic ancestry among modern populations. See full map.
kriging-modern-iran-neolithic-ancestry
Kriging interpolation of Iran Chalcolithic ancestry among modern populations. See full map.

Other models

The first question I imagine some would like to ask is: what about other models? Do they show the same results? Here is the simplistic combination of ancestry components published in Damgaard et al. (2018) for the same or similar populations:

NOTE. As you can see, their selection of EHG vs. WHG vs. Nganasan vs. Natufian vs. Clovis is of little use, but it corroborates the results from other papers, and shows some interesting patterns in combination with those above.

EHG

damgaard-modern-ehg-ancestry
Natural neighbor interpolation of EHG ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-ehg-ancestry
Kriging interpolation of EHG ancestry among modern populations. See full map.

Natufian ancestry

damgaard-modern-natufian-ancestry
Natural neighbor interpolation of Natufian ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-natufian-ancestry
Kriging interpolation of Natufian ancestry among modern populations. See full map.

WHG ancestry

damgaard-modern-whg-ancestry
Natural neighbor interpolation of WHG ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-whg-ancestry
Kriging interpolation of WHG ancestry among modern populations. See full map.

Baikal HG ancestry

damgaard-modern-baikalhg-ancestry
Natural neighbor interpolation of Baikal hunter-gatherer ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-baikal-hg-ancestry
Kriging interpolation of Baikal HG ancestry among modern populations. See full map.

Ancient North Eurasians

Once the modern situation is clear, relevant questions are, for example, whether EHG-, WHG-, ANE-, Nganasan-, and/or Baikal HG-related meta-populations expanded or became integrated into Uralic-speaking territories.

When did these admixture/migration events happen?

How did the ancient distribution or expansion of Palaeo-Arctic, Baikalic, and/or Altaic peoples affect the current distribution of the so-called “Siberian” ancestry, and of hg. N1a, in each specific population?

NOTE. A little excursus is necessary, because the calculated repetition of a hypothetical opposition “N1a vs. R1a” doesn’t make this dichotomy real:

  1. There was not a single ethnolinguistic community represented by hg. R1a after the initial expansion of Eastern Corded Ware groups, or by hg. N1a-L392 after its initial expansion in Siberia.
  2. Different subclades became incorporated in different ways into Bronze Age and Iron Age communities, most of them without an ethnolinguistic change. For example, N1a subclades became incorporated into North Eurasian populations of different languages, reaching Uralic- and Indo-European-speaking territories of north-eastern Europe during the late Iron Age, at a time when their ancestral origin or language in Siberia was impossible to ascertain. Just like the mix found among Proto-Germanic peoples (R1b, R1a, and I1)* or among Slavic peoples (I2a, E1b, R1a)*, the mix of many Uralic groups showing specific percentages of R1a, N1a, or Q subclades* reflects more or less recent admixture or acculturation events with little impact on their languages.

*other typically northern and eastern European haplogroups are also represented in early Germanic (N1a, I2, E1b, J, G2), Slavic (I1, G2, J) and Finno-Permic (I1, R1b, J) peoples.

ananino-culture-new
Map of archaeological cultures in north-eastern Europe ca. 8th-3rd centuries BC. [The Mid-Volga Akozino group not depicted] Shaded area represents the Ananino cultural-historical society. Fading purple arrows represent likely stepped movements of subclades of haplogroup N for centuries (e.g. Siberian → Ananino → Akozino → Fennoscandia [N-VL29]; Circum-Arctic → forest-steppe [N1, N2]; etc.). Blue arrows represent eventual expansions of Uralic peoples to the north. Modified image from Vasilyev (2002).

The problem with mapping the ancestry of the available sampling of ancient populations is that we lack proper temporal and regional transects. The maps that follow include cultures roughly divided into either “Bronze Age” or “Iron Age” groups, although the difference between samples may span up to 2,000 years.

NOTE. Rough estimates for more external groups (viz. Sweden Battle Axe/Gotland_A for the NW, Srubna from the North Pontic area for the SW, Arctic/Nganasan for the NE, and Baikal EBA/“Ulchi-like” for the SE) have been included to offer a wider interpolated area using data already known.

Bronze Age

Similar to modern populations, the selection of best fit “Siberian” ancestry between Baikal HG vs. Nganasan, both potentially ± ANE (AG3), is an oversimplification that needs to be addressed in future papers.

Corded Ware ancestry

bronze-age-corded-ware-ancestry
Natural neighbor interpolation of Srubnaya ancestry among Bronze Age populations. See full map.

Nganasan-like ancestry

bronze-age-nganasan-like-ancestry
Natural neighbor interpolation of Nganasan-like ancestry among Bronze Age populations. See full map.

Baikal HG ancestry

bronze-age-baikal-hg-ancestry
Natural neighbor interpolation of Baikal Hunter-Gatherer ancestry among Bronze Age populations. See full map.

Afontova Gora 3 ancestry

bronze-age-afontova-gora-ancestry
Natural neighbor interpolation of Afontova Gora 3 ancestry among Bronze Age populations. See full map.

Iron Age

Corded Ware ancestry

Interestingly, the moderate expansion of Corded Ware-related ancestry from the south during the Iron Age may be related to the expansion of hg. N1a-VL29 into the chiefdom-based system of north-eastern Europe, including Ananyino/Akozino and later expanding Akozino warrior-traders around the Baltic Sea.

NOTE. The samples from Levänluhta are centuries older than those from Estonia (and Ingria), and those from Chalmny Varre are modern ones, so this region has to be read as a south-west to north-east distribution from the Iron Age to modern times.

iron-age-corded-ware-ancestry
Natural neighbor interpolation of Srubnaya ancestry among Iron Age populations. See full map.

Baikal HG-like ancestry

The fact that this Baltic N1a-VL29 branch belongs in a group together with typically Avar N1a-B197 supports the Altaic origin of the parent group, which is possibly related to the expansion of Baikalic ancestry and Iron Age nomads:

iron-age-baikal-ancestry
Natural neighbor interpolation of Baikal HG ancestry among Iron Age populations. See full map.

Nganasan-like ancestry

The dilution of Nganasan-like ancestry in an Arctic region featuring “Siberian” ancestry and hg. N1a-L392 at least since the Bronze Age supports the integration of hg. N1a-Z1934, sister clade of Ugric N1a-Z1936, into populations west and east of the Urals with the expansion of Uralic languages to the north into the Tundra region (see here).

The integration of N1a-Z1934 lineages into Finnic-speaking peoples after their migration to the north and east, and the displacement or acculturation of Saami from their ancestral homeland, coinciding with known genetic bottlenecks among Finns, is yet another proof of this evolution:

iron-age-nganasan-ancestry
Natural neighbor interpolation of Nganasan ancestry among Iron Age populations. See full map.

WHG ancestry

Similarly, WHG ancestry doesn’t seem to be related to important population movements throughout the Bronze Age. This excludes the multiple North Eurasian populations that will be found along the clines formed by WHG, EHG, ANE, Nganasan, and Baikal HG ancestry from forming part of the Uralic ethnogenesis, although they may be relevant for following later regional movements of specific populations.

iron-age-whg-ancestry
Natural neighbor interpolation of WHG ancestry among Iron Age populations. See full map.

Conclusion

It seems natural that people used to look at maps of haplogroup distribution from the 2000s, coupled with modern language distributions, and would try to interpret them in a certain way, reaching thus the wrong conclusions whose consequences are especially visible today when ancient DNA keeps contradicting them.

In hindsight, though, assuming that Balto-Slavs expanded with Corded Ware and hg. R1a, or that Uralians expanded with “Siberian” ancestry and hg. N1a, was as absurd as looking at maps of ancestry and haplogroup distribution of ancient and modern Native Americans, trying to divide them into “Germanic” or “Iberian”…

The evolution of each specific region and cultural group of North Eurasia is far from being clear. However, the general trend speaks clearly in favour of an ancient, Bronze Age distribution of North Eurasian ancestry and haplogroups that have decreased, diluted, or become incorporated into expanding Uralians of Corded Ware ancestry, occasionally spreading with inter-regional expansions of local groups.

Given the relatively recent push of Altaic and Indo-European languages into ancestral Uralic-speaking territories, only the ancient Corded Ware expansion remains compatible with the spread of Uralic languages into their historical distribution.

Related

A Song of Sheep and Horses, revised edition, now available as printed books

cover-song-sheep-and-horses

As I said 6 months ago, 2019 is a tough year to write a blog, because this was going to be a complex regional election year and therefore a time of political promises, hence tenure offers too. Now the preliminary offers have been made, elections have passed, but the timing has slightly shifted toward 2020. So I may have the time, but not really any benefit of dedicating too much effort to the blog, and a lot of potential benefit of dedicating any time to evaluable scientific work.

On the other hand, I saw some potential benefit for publishing texts with ISBNs, hence the updates to the text and the preparation of these printed copies of the books, just in case. While Spain’s accreditation agency has some hard rules for becoming a tenured professor, especially for medical associates (whose years of professional experience are almost worthless compared to published peer-reviewed papers), it is quite flexible in assessing one’s merits.

However, regional and/or autonomous entities are not, and need an official identifier and preferably printed versions to evaluate publications, such as an ISBN for books. I took thus some time about a month ago to update the texts and supplementary materials, to publish a printed copy of the books with Amazon. The first copies have arrived, and they look good.

series-song-sheep-horses-cover

Corrections and Additions

Titles
I have changed the names and order of the books, as I intended for the first publication – as some of you may have noticed when the linguistic book was referred to as the third volume in some parts. In the first concept I just wanted to emphasize that the linguistic work had priority over the rest. Now the whole series and the linguistic volume don’t share the same name, and I hope this added clarity is for the better, despite the linguistic volume being the third one.

Uralic dialects
I have changed the nomenclature for Uralic dialects, as I said recently. I haven’t really modified anything deeper than that, because – unlike adding new information from population genomics – this would require me to do thorough research of the most recent publications on Uralic comparative grammar, and I just can’t begin with that right now.

Anyway, the use of terms like Finno-Ugric or Finno-Samic is as correct now for the reconstructed forms as it was before the change in nomenclature.

west-east-uralic-schema

Mediterranean
The most interesting recent genetic data has come from Iberia and the Mediterranean. Lacking direct data from the Italian Peninsula (and thus from the emergence of the Etruscan and Rhaetian ethnolinguistic community), it is becoming clearer how some quite early waves of Indo-Europeans and non-Indo-Europeans expanded and shrank – at least in West Iberia, West Mediterranean, and France.

Finno-Ugric
Some of the main updates to the text have been made to the sections on Finno-Ugric populations, because some interesting new genetic data (especially Y-DNA) have been published in the past months. This is especially true for Baltic Finns and for Ugric populations.

ananino-culture-new

Balto-Slavic
Consequently, and somehow unsurprisingly, the Balto-Slavic section has been affected by this; e.g. by the identification of Early Slavs likely with central-eastern populations dominated by (at least some subclades of) hg. I2a-L621 and E1b-V13.

Maps
I have updated some cultural borders in the prehistoric maps, and the maps with Y-DNA and mtDNA. I have also added one new version of the Early Bronze Age map, to better reflect the most likely location of Indo-European languages in the Early European Bronze Age.

As those in software programming will understand, major changes in the files that are used for maps and graphics come with an increasing risk of additional errors, so I would not be surprised if some major ones were found (I already spotted three of them). Feel free to communicate these errors in any way you see fit.

bronze-age-early-indo-european
European Early Bronze Age: tentative language map based on linguistics, archaeology, and genetics.

SNPs
I have selected more conservative SNPs in certain controversial cases.

I have also deleted most SNP-related footnotes and replaced them with the marking of each individual tentative SNP, leaving only those footnotes that give important specific information, because:

  • My way of referencing tentative SNP authors did not make it clear which samples were tentative, if there were more than one.
  • It was probably not necessary to see four names repeated 100 times over.
  • Often I don’t really know if the person I have listed as author of the SNP call is the true author – unless I saw the full SNP data posted directly – or just someone who reposted the results.
  • Sometimes there is more than one author of SNPs for a certain sample, but I might have added just one for all.
ancient-dna-all
More than 6000 ancient DNA samples compiled to date.

For a centralized file to host the names of those responsible for the unofficial/tentative SNPs used in the text – and to correct them if necessary – readers will eventually be able to use Phylogeographer’s tool for ancient Y-DNA, for which they use (partly) the same data I compiled, adding Y-Full’s nomenclature and references. You can see another map tool in ArcGIS.

NOTE. As I say in the text, if the final working map tool does not deliver the names, I will publish another supplementary table to the text, listing all tentative SNPs with their respective author(s).

If you are interested in ancient Y-DNA and you want to help develop comprehensive and precise maps of ancient Y-DNA and mtDNA haplogroups, you can contact Hunter Provyn at Phylogeographer.com. You can also find more about phylogeography projects at Iain McDonald’s website.

Graphics
I have also added more samples to both the “Asian” and the “European” PCAs, and to the ADMIXTURE analyses, too.

I previously used certain samples prepared by amateurs from BAM files (like Botai, Okunevo, or Hittites), and the results were obviously less than satisfactory – hence my criticism of the lack of publication of prepared files by the most famous labs, especially the Copenhagen group.

Fortunately for all of us, most published datasets are free, so we don’t have to reinvent the wheel. I criticized genetic labs for not releasing all data, so now it is time for praise, at least for one of them: thank you to all responsible at the Reich Lab for this great merged dataset, which includes samples from other labs.

NOTE. I would like to make my tiny contribution here, for beginners interested in working with these files, so I will update – whenever I have time – the “How To” sections of this blog for PCAs, PCA3d, and ADMIXTURE.

-iron-age-europe-romans
Detail of the PCA of European Iron Age populations. See full versions.

ADMIXTURE
For unsupervised ADMIXTURE in the maps, a K=5 is selected based on the CV, giving a kind of visual WHG : NWAN : CHG/IN : EHG : ENA, but with Steppe ancestry “in between”. Higher K gave worse CV, which I guess depends on the many ancient and modern samples selected (and on the fact that many samples are repeated from different sources in my files, because I did not have time to filter them all individually).
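
As a minimal sketch of what selecting K “based on the CV” amounts to: ADMIXTURE run with the `--cv` flag reports one cross-validation error per K, and the K with the lowest error is kept. The log lines below follow the format ADMIXTURE prints, but the error values are invented for illustration and are not the ones obtained for the maps.

```python
import re

# Hypothetical log lines in the format ADMIXTURE prints when run with --cv;
# the error values are invented for illustration.
log_lines = [
    "CV error (K=3): 0.5412",
    "CV error (K=4): 0.5288",
    "CV error (K=5): 0.5231",
    "CV error (K=6): 0.5260",
    "CV error (K=7): 0.5304",
]

CV_PATTERN = re.compile(r"CV error \(K=(\d+)\): ([0-9.]+)")

def parse_cv_errors(lines):
    """Return {K: cv_error} for every line matching the CV pattern."""
    errors = {}
    for line in lines:
        match = CV_PATTERN.search(line)
        if match:
            errors[int(match.group(1))] = float(match.group(2))
    return errors

def best_k(errors):
    """Pick the K with the lowest cross-validation error."""
    return min(errors, key=errors.get)

cv_errors = parse_cv_errors(log_lines)
chosen_k = best_k(cv_errors)
print(chosen_k)
```

With these made-up numbers the error bottoms out at K=5 and rises again for higher K, mirroring the pattern described above.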

I found some interesting component shared by Central European populations in K=7 to K=9 (from CEU Bell Beakers to Denmark LN to Hungarian EBA to Iberia BA, in a sort of “CEU BBC ancestry” potentially related to North-West Indo-Europeans), but still, I prefer to go for a theoretically more correct visualization instead of cherry-picking the ‘best-looking’ results.

Since I made fun of the search for “Siberian ancestry” in coloured components in Tambets et al. 2018, I have to be consistent and prefer to avoid doing the same here…

qpAdm
In the first publication (in January) and subsequent minor revisions until March, I trusted analyses and ancestry estimates reported by amateurs in 2018, which I used for the text adding my own interpretations. Most of them have been refuted in papers from 2019, as you probably know if you have followed this blog (see very recent examples here, here, or here), compelling me to delete or change them again, and again, and again. I don’t have experience from previous years, although the current pattern must have been evidently repeated many times over, or else we would be still talking about such previous analyses as being confirmed today…

I wanted to be one step ahead of peer-reviewed publications in the books, but I prefer now to go for something safe in the book series, rather than having one potentially interesting prediction – which may or may not be right – and ten huge mistakes that I would have helped to endlessly redistribute among my readers (online and now in print) based on some cherry-picked pairwise comparisons. This is especially true when predictions of “Steppe”- and/or “Siberian”-related ancestry have been published, which, for some reason, seem to go horribly wrong most of the time.

I am sure whole books can be written about why and how this happened (and how this is going to keep happening), based on psychology and sociology, but the reasons are irrelevant, and that would be a futile effort; like writing books about glottochronology and its intermittent popularity due to misunderstood scientific trends. The most efficient way to deal with this problem is to avoid such information altogether, because – as you can see in the current revised text – it wouldn’t really add anything essential to the content of these books, anyway.

Official site of the book series:
A Song of Sheep and Horses: eurafrasia nostratica, eurasia indouralica

Uralic speakers formed clines of Corded Ware ancestry with WHG:ANE populations

steppe-forest-tundra-biomes-uralic

The preprint by Jeong et al. (2018) has been published: The genetic history of admixture across inner Eurasia, Nature Ecol. Evol. (2019).

Interesting excerpts, referring mainly to Uralic peoples (emphasis mine):

A model-based clustering analysis using ADMIXTURE shows a similar pattern (Fig. 2b and Supplementary Fig. 3). Overall, the proportions of ancestry components associated with Eastern or Western Eurasians are well correlated with longitude in inner Eurasians (Fig. 3). Notable outliers include known historical migrants such as Kalmyks, Nogais and Dungans. The Uralic- and Yeniseian-speaking populations, as well as Russians from multiple locations, derive most of their Eastern Eurasian ancestry from a component most enriched in Nganasans, while Turkic/Mongolic speakers have this component together with another component most enriched in populations from the Russian Far East, such as Ulchi and Nivkh (Supplementary Fig. 3). Turkic/Mongolic speakers comprising the bottom-most cline have a distinct Western Eurasian ancestry profile: they have a high proportion of a component most enriched in Mesolithic Caucasus hunter-gatherers and Neolithic Iranians and frequently harbour another component enriched in present-day South Asians (Supplementary Fig. 4). Based on the PCA and ADMIXTURE results, we heuristically assigned inner Eurasians to three clines: the ‘forest-tundra’ cline includes Russians and all Uralic and Yeniseian speakers; the ‘steppe-forest’ cline includes Turkic- and Mongolic-speaking populations from the Volga and Altai–Sayan regions and Southern Siberia; and the ‘southern steppe’ cline includes the rest of the populations.

eurasian-clines-uralic-altaic
The first two PCs summarizing the genetic structure within 2,077 Eurasian individuals. The two PCs generally mirror geography. PC1 separates western and eastern Eurasian populations, with many inner Eurasians in the middle. PC2 separates eastern Eurasians along the north–south cline and also separates Europeans from West Asians. Ancient individuals (color-filled shapes), including two Botai individuals, are projected onto PCs calculated from present-day individuals.

For the forest-tundra populations, the Nganasan + Srubnaya model is adequate only for the two Volga region populations, Udmurts and Besermyans (Fig. 5 and Supplementary Table 8).

For the other populations west of the Urals, six from the northeastern corner of Europe are modelled with additional Mesolithic Western European hunter-gatherer (WHG) contribution (8.2–11.4%; Supplementary Table 8), while the rest need both WHG and early Neolithic European farmers (LBK_EN; Supplementary Table 2). Nganasan-related ancestry substantially contributes to their gene pools and cannot be removed from the model without a significant decrease in the model fit (4.1–29.0% contribution; χ² P ≤ 1.68 × 10⁻⁵; Supplementary Table 8).

west-urals-finno-ugrians-qpadm
Supplementary Table 8. QpAdm-based admixture modeling of the forest-tundra cline populations. For the 13 populations west of the Urals, we present a four-way admixture model, Nganasan+Srubnaya+WHG+LBK_EN, or its minimal adequate subset. Modified from the article, to include colors for cultures, and underlined best models for Corded Ware ancestry among Uralians.

NOTE. It doesn’t seem like Hungarians can be easily modelled with Nganasan ancestry, though…

For the 4 populations east of the Urals (Enets, Selkups, Kets and Mansi), for which the above models are not adequate, Nganasan + Srubnaya + AG3 provides a good fit (χ² P ≥ 0.018; Fig. 5 and Supplementary Table 8). Using early Bronze Age populations from the Baikal Lake region (‘Baikal_EBA’; Supplementary Table 2) as a reference instead of Nganasan, the two-way model of Baikal_EBA + Srubnaya provides a reasonable fit (χ² P ≥ 0.016; Supplementary Table 8) and the three-way model of Baikal_EBA + Srubnaya + AG3 is adequate but with negative AG3 contribution for Enets and Mansi (χ² P ≥ 0.460; Supplementary Table 8).

east-urals-ugric-samoyedic-qpadm
Supplementary Table 8. QpAdm-based admixture modeling of the forest-tundra cline populations. For the four populations east of the Urals, we present three admixture models: Baikal_EBA+Srubnaya, Baikal_EBA+Srubnaya+AG3 and Nganasan+Srubnaya+AG3. For each model, we present qpAdm p-value, admixture coefficient estimates and associated 5 cM jackknife standard errors (estimate ± SE). Modified from the article, to include colors for cultures, and underlined best models for Corded Ware ancestry among Uralians.

Bronze/Iron Age populations from Southern Siberia also show a similar ancestry composition with high ANE affinity (Supplementary Table 9). The additional ANE contribution beyond the Nganasan + Srubnaya model suggests a legacy from ANE-ancestry-rich clines before the Late Bronze Age.

bronze-age-iron-age-karasuk-mezhovska-tagar-qpadm
Supplementary Table 9. QpAdm-based admixture modeling of Bronze and Iron Age populations of southern Siberia. For ancient individuals associated with Karasuk and Tagar cultures, Nganasan+Srubnaya model is insufficient. For all five groups, adding AG3 as the third ancestry or substituting Nganasan with Baikal_EBA with higher ANE affinity provides an adequate model. For each model, we present qpAdm p-value, admixture coefficient estimates and associated 5 cM jackknife standard errors (estimate ± SE). Models with p-value ≥ 0.05 are highlighted in bold face. Modified from the article, to include colors for cultures, and underlined best models for Corded Ware ancestry among Uralians.
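
The way such tables are read can be sketched roughly as follows. This is not the qpAdm program itself, only a hypothetical filter over already-computed results. The 0.05 threshold is the one from the caption of Supplementary Table 9 (the excerpts above also accept fits with lower p-values, hence the parameter). Treating a negative coefficient as disqualifying, and preferring the model with the fewest sources (the “minimal adequate subset”), are assumptions of this sketch. All numbers are invented.

```python
from dataclasses import dataclass

@dataclass
class QpAdmModel:
    sources: tuple       # names of the source populations
    p_value: float       # qpAdm model p-value
    coefficients: tuple  # admixture coefficient estimates, one per source

def is_adequate(model, p_threshold=0.05):
    """A model passes if its p-value reaches the threshold and no coefficient is negative."""
    return model.p_value >= p_threshold and all(c >= 0 for c in model.coefficients)

# Hypothetical results for one target population; the numbers are invented.
models = [
    QpAdmModel(("Nganasan", "Srubnaya"), 0.003, (0.30, 0.70)),
    QpAdmModel(("Nganasan", "Srubnaya", "AG3"), 0.21, (0.25, 0.60, 0.15)),
    QpAdmModel(("Baikal_EBA", "Srubnaya", "AG3"), 0.46, (0.35, 0.70, -0.05)),
]

adequate = [m for m in models if is_adequate(m)]
# Prefer the model with the fewest sources, then the highest p-value.
adequate.sort(key=lambda m: (len(m.sources), -m.p_value))
best = adequate[0] if adequate else None
print(best.sources if best else "no adequate model")
```

In this toy case the two-way model fails on its p-value and the Baikal_EBA three-way model fails on its negative AG3 coefficient, despite having the highest p-value, so only the Nganasan + Srubnaya + AG3 model remains.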

Lara M. Cassidy comments on the results of the study in A steppe in the right direction (you can read it here):

Even among the earliest available inner Eurasian genomes, east–west connectivity is evident. These, too, form a longitudinal cline, characterized by the easterly increase of a distinct ancestry, labelled Ancient North Eurasian (ANE), lowest in western European hunter-gatherers (WHG) and highest in Palaeolithic Siberians from the Baikal region. Flow-through from this ANE cline is seen in steppe populations until at least the Bronze Age, including the world’s earliest known horse herders — the Botai. However, this is eroded over time by migration from west and east, following agricultural adoption on the continental peripheries (Fig. 1b,c).

Strikingly, Jeong et al. model the modern upper steppe cline as a simple two-way mixture between western Late Bronze Age herders and Northeast Asians (Fig. 1c), with no detectable residue from the older ANE cline. They propose modern steppe peoples were established mainly through migrations post-dating the Bronze Age, a sequence for which has been recently outlined using ancient genomes. In contrast, they confirm a substantial ANE legacy in modern Siberians of the northernmost cline, a pattern mirrored in excesses of WHG ancestry west of the Urals (Fig. 1b). This marks the inhospitable biome as a reservoir for older lineages, an indication that longstanding barriers to latitudinal movement may indeed be at work, reducing the penetrance of gene flows further south along the steppe.

eurasian-clines-uralic-turkic-mongol-altaic
The genomic formation of inner Eurasians. b–d, Depiction of the three main clines of ancestry identified among Inner Eurasians. Sources of admixture for each cline are represented using proxy ancient populations, both sampled and hypothesised, based on the study’s modelling results. The major eastern and western ancestries used to model each cline are shown in bold; the peripheral admixtures that gave rise to these are also shown. Additional contributions to subsections of each cline are marked with dashed lines. b, The northernmost cline, illustrating the legacy of WHG and ANE-related populations. c,d, The upper (c) and lower (d) steppe clines are shown, both of which have substantial eastern contributions related to modern Tungusic speakers. The authors propose these populations are themselves the result of an admixture between groups related to the Nganasan, whose ancestors potentially occupied a wider range, and hunter-gatherers (HGs) from the Amur River Basin. While the upper steppe cline in c can be described as a mixture between this eastern ancestry and western steppe herders, the current model for the southern steppe cline as shown in d is not adequate and is likely confounded by interactions with diverse bordering ancestries. Credit: Ecoregions 2017, Resolve https://ecoregions2017.appspot.com/

Given the findings as reported in the paper, I think it should be much easier to describe different subclines in the “northernmost cline” than in the much more recent “Turkic/Mongolic cline”, which is nevertheless subdivided in this paper in two clines. As an example, there are at least two obvious clines with “Nganasan-related meta-populations” among Uralians, which converge in a common Steppe MLBA (i.e. Corded Ware) ancestry – one with Palaeo-Laplandic peoples, and another one with different Palaeo-Siberian populations:

siberian-clines-uralic-altaic
PCA of ancient and modern Eurasian samples. Ancient Palaeo-Laplandic, Palaeosiberian, and Altai clines drawn, with modern populations labelled. See a version with higher resolution.

The inclusion of certain Eurasian groups (or lack thereof) in the PCA doesn’t help to distinguish these subclines visually, and I guess the tiny “Nganasan-related” ancestral components found in some western populations (e.g. the famous ~5% among Estonians) probably don’t lend themselves easily to further subdivisions. Notice, nevertheless, the different components of the Eastern Eurasian source populations among Finno-Ugrians:

uralic-admixture-qpadm
Characterization of the Western and Eastern Eurasian source ancestries in inner Eurasian populations. [Modified from the paper, includes only Uralic populations]. a, Admixture f3 values are compared for different Eastern Eurasian (Mixe, Nganasan and Ulchi; green) and Western Eurasian references (Srubnaya and Chalcolithic Iranians (Iran_ChL); red). For each target group, darker shades mark more negative f3 values. b, Weights of donor populations in two sources characterizing the main admixture signal (date 1 and PC1) in the GLOBETROTTER analysis. We merged 167 donor populations into 12 groups (top right). Target populations were split into five groups (from top to bottom): Aleuts; the forest-tundra cline populations; the steppe-forest cline populations; the southern steppe cline populations; and ‘others’.
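To make the "more negative f3 values" in panel a easier to read, here is a minimal sketch of the admixture f3 statistic, f3(target; A, B), computed as the average over SNPs of (c − a)(c − b), where c, a and b are allele frequencies in the target and in the two reference populations. This is a naive estimator under stated assumptions: it omits the finite-sample correction and the block-jackknife standard errors that real analyses rely on, and the function name and all frequencies below are invented for illustration, not taken from the paper.

```python
def f3(target, source_a, source_b):
    """Naive f3(target; source_a, source_b) from per-SNP allele frequencies.

    Hypothetical helper: no sample-size correction, no standard error.
    """
    if not (len(target) == len(source_a) == len(source_b)) or not target:
        raise ValueError("need equal-length, non-empty frequency lists")
    products = [(c - a) * (c - b) for c, a, b in zip(target, source_a, source_b)]
    return sum(products) / len(products)


# Invented allele frequencies at two SNPs for two diverged reference groups.
eastern_ref = [0.1, 0.9]
western_ref = [0.9, 0.1]

# A target whose frequencies lie between both references (as after admixture).
admixed_target = [0.5, 0.5]
# A target drifted beyond one reference, not intermediate between them.
drifted_target = [1.0, 0.0]

f3_admixed = f3(admixed_target, eastern_ref, western_ref)
f3_drifted = f3(drifted_target, eastern_ref, western_ref)
```

In this toy case the intermediate target yields a negative value and the drifted one a positive value, which is why a clearly negative f3 is read as evidence that the target is a mixture of populations related to the two references.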

Also remarkable is the lack of comparison of Uralic populations with other neighbouring ones, since the described Uralic-like ancestry of Russians was already known, and is most likely due to the recent acculturation of Uralic-speaking peoples in the cradle of Russians, right before their eastward expansions.

west-eurasian-east-eurasian-ancestry
Supplementary Fig. 4. ADMIXTURE results qualitatively support PCA-based grouping of inner Eurasians into three clines. (A) Most southern steppe cline populations derive a higher proportion of their total Western Eurasian ancestry from a source related to Caucasus, Iran and South Asian populations. (B) Turkic- and Mongolic-speaking populations tend to derive their Eastern Eurasian ancestry more from the Devil’s Gate related one than from Nganasan-related one, while the opposite is true for Uralic- and Yeniseian-speakers. To estimate overall western Eurasian ancestry proportion, we sum up four components in our ADMIXTURE results (K=14), which are the dominant components in Neolithic Anatolians (“Anatolia_N”), Mesolithic western European hunter-gatherers (“WHG”), early Holocene Caucasus hunter-gatherers (“CHG”) and Mala from southern India, respectively. The “West / South Asian ancestry” is a fraction of it, calculated by summing up the last two components. To estimate overall Eastern Eurasian ancestry proportion, we sum up six components, most prevalent in Surui, Chipewyan, Itelmen, Nganasan, Atayal and early Neolithic Russian Far East individuals (“Devil’s Gate”).
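The summation described in the caption can be sketched as follows. This is only an illustration of the arithmetic, assuming each individual's ADMIXTURE output is available as a mapping from component label to proportion. The component keys, the `summarize` helper and the example values are hypothetical, and expressing the panel quantities as shares of the western or eastern total is my reading of the caption, not the authors' code.

```python
# Component labels follow the caption; the dictionary keys are hypothetical.
WEST = ["Anatolia_N", "WHG", "CHG", "Mala"]
WEST_SOUTH_ASIAN = ["CHG", "Mala"]  # the "last two" western components
EAST = ["Surui", "Chipewyan", "Itelmen", "Nganasan", "Atayal", "DevilsGate"]


def summarize(components):
    """Sum ADMIXTURE components into the totals described in the caption."""
    west = sum(components.get(k, 0.0) for k in WEST)
    west_sa = sum(components.get(k, 0.0) for k in WEST_SOUTH_ASIAN)
    east = sum(components.get(k, 0.0) for k in EAST)
    return {
        "west_total": west,
        "east_total": east,
        # Panel A: share of western ancestry that is West / South Asian.
        "west_south_asian_share": west_sa / west if west > 0 else 0.0,
        # Panel B: shares of eastern ancestry from the two sources compared.
        "devils_gate_share": components.get("DevilsGate", 0.0) / east if east > 0 else 0.0,
        "nganasan_share": components.get("Nganasan", 0.0) / east if east > 0 else 0.0,
    }


# Invented proportions for one individual (not data from the paper).
# "Other" stands in for the K=14 components outside both totals.
example = {
    "Anatolia_N": 0.10, "WHG": 0.15, "CHG": 0.05, "Mala": 0.0,
    "Nganasan": 0.40, "DevilsGate": 0.10, "Itelmen": 0.05,
    "Other": 0.15,
}
result = summarize(example)
```

With these invented values the Nganasan-related share of the eastern total is larger than the Devil’s Gate one, which is the pattern the caption reports for Uralic and Yeniseian speakers.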

A comparison of Estonians and Finns with Balts, Scandinavians, and Eastern Europeans would have been more informative for the division of the different so-called “Nganasan-like meta-populations”, and to ascertain which one of these ancestral peoples along the ancient WHG:ANE cline could actually be connected (if at all) to the Cis-Urals.

Because, after all, based on linguistics and archaeology, geneticists are not supposed to be looking for populations from the North Asian Arctic region, for “Siberian ancestry”, or for haplogroup N1c (despite previous works by their peers), but for the Bronze Age Volga-Kama region…


N1c-L392 associated with expanding Turkic lineages in Siberia

haplogroup-n1c-tat

After the Uralic link preferred in Europe, the second most popular explanation for the expansion of haplogroup N1a-L392 (ca. 4400 BC) is, apparently, its association with Turkic, and by extension with Micro-Altaic, at least among certain eastern researchers.

New paper in a recently created journal, by the main author of the group that proposed that Scythians of hg. N1c were Turkic speakers: On the origins of the Sakhas’ paternal lineages: Reconciliation of population genetic / ancient DNA data, archaeological findings and historical narratives, by Tikhonov, Gurkan, Demirdov, and Beyoglu, Siberian Research (2019).

Interesting excerpts:

According to the views of a number of authoritative researchers, the Yakut ethnos was formed in the territory of Yakutia as a result of the mixing of people from the south and the autochthonous population [34].

These three major Sakha paternal lineages may have also arrived in Yakutia at different times and/ or from different places and/or with a difference in several generations instead, or perhaps Y-chromosomal STR mutations may have taken place in situ in Yakutia. Nevertheless, the immediate common ancestor(s) from the Asian Steppe of these three most prevalent Sakha Y-chromosomal STR haplotypes possibly lived during the prominence of the Turkic Khaganates, hence the near-perfect matches observed across a wide range of Eurasian geography, including as far as from Cyprus in the West to Liaoning, China in the East, then Middle Lena in the North and Afghanistan in the South (Table 3 and Figure 5). There may also be haplotypes closely-related to ‘the dominant Elley line’ among Karakalpaks, Uzbeks and Tajiks, however, limitations in the loci coverage for the available dataset (only eight Y-chromosomal STR loci) precludes further conclusions on this matter [25].

yakutia-haplogroup-n1c
17-loci median-joining network analysis of the original/dominant Elley, Unknown and Omogoy Y-chromosomal STR haplotypes with the YHRD matches from outside Yakutia populations.

According to the results presented here, very similar Y-STR haplotypes to that of the original Elley line were found in the west: Afghanistan and northern Cyprus, and in the east: Liaoning Province, China and Ulaanbaator, Northern Mongolia. In the case of the dominant Omogoy line, very closely matching haplotypes differing by a single mutational step were found in the city of Chifen of the Jirin Province, China. The widest range of similar haplotypes was found for the Yakut haplotype Unknown: In Mongolia, China and South Korea. For instance, haplotypes differing by a single step mutation were found in Northern Mongolia (Khalk, Darhad, Uryankhai populations), Ulaanbaator (Khalk) and in the province of Jirin, China (Han population).

n1c-uralic-altaic-siberia
14-loci median-joining network analysis for the original/dominant Elley (Ell), Unknown Clan (Vil), Omogoy (Omo), Eurasian (Eur) and Xiongnu (Xuo) Y-chromosomal STR haplotypes and that for a representative ancient DNA sample (Ch0 or DSQ04) from the Upper Xiajiadian Culture recovered from the Inner Mongolia Autonomous Region, China.

Notably, Tat-C-bearing Y-chromosomes were also observed in ancient DNA samples from the 2700-3000 years-old Upper Xiajiadian culture in Inner Mongolia, as well as those from the Serteya II site at the Upper Dvina region in Russia and the ‘Devichyi gory’ culture of long barrow burials at the Nevel’sky district of Pskovsky region in Russia. A 14-loci Y-chromosomal STR median-joining network of the most prevalent Sakha haplotypes and a Tat-C-bearing haplotype from one of the ancient DNA samples recovered from the Upper Xiajiadian culture in Inner Mongolia (DSQ04) revealed that the contemporary Sakha haplotype ‘Xuo’ (Table 2, Haplotype ID “Xuo”) classified as that of ‘the Xiongnu clan’ in our current study, was the closest to the ancient Xiongnu haplotype (Figure 6). TMRCA estimate for this 14-loci Y-chromosomal STR network was 4357 ± 1038 years or 2341 ± 1038 BCE, which correlated well with the Upper Xiajiadian culture that was dated to the Late Bronze Age (700-1000 BCE).

eurasian-n-subclades
Geographical location of ancient samples belonging to major clade N of the Y-chromosome.

NOTE. Also interesting in the paper is the frequency of E1b1b among admixed Russian populations, which is similar to that of R1a or I2a(xI2a1).

It is tempting to associate the prevalent presence of N1c-L392 in ancient Siberian populations with the expansion of Altaic, by simplistically linking the findings (in chronological order) near Lake Baikal (Damgaard et al. 2018), Upper Xiajiadian (Cui et al. 2013), among Khövsgöl (Jeong et al. 2018), in Huns (Damgaard et al. 2018), and in Mongolic-speaking Avars (Csáky et al. 2019).

However, its presence among Palaeo-Laplandic peoples in the Kola Peninsula ca. 1500 BC (Lamnidis et al. 2018) and among Palaeo-Siberian populations near the Yana River ca. AD 1200 (Sikora et al. 2018) should be enough to accept the hypothesis of ancestral waves of expansion of the haplogroup over northern Eurasia, with acculturation and further expansions in the different regions since the Iron Age (see more on its potential expansion waves).

Also, a simple look at the TMRCA and modern distribution was enough to hypothesize long ago the lack of connection of N1c-L392 with Altaic or Uralic peoples. From Ilumäe et al. (2016):

Previous research has shown that Y chromosomes of the Turkic-speaking Yakuts (Sakha) belong overwhelmingly to hg N3 (formerly N1c1). We found that nearly all of the more than 150 genotyped Yakut N3 Y chromosomes belong to the N3a2-M2118 clade, just as in the Turkic-speaking Dolgans and the linguistically distant Tungusic-speaking Evenks and Evens living in Yakutia (Table S2). Hence, the N3a2 patrilineage is a prime example of a male population of broad central Siberian ancestry that is not intrinsic to any linguistically defined group of people. Moreover, the deepest branch of hg N3a2 is represented by a Lebanese and a Chinese sample. This finding agrees with the sequence data from Hallast et al., where one Turkish Y chromosome was also assigned to the same sub-clade. Interestingly, N3a2 was also found in one Bhutan individual who represents a separate sub-lineage in the clade. These findings show that although N3a2 reflects a recent strong founder effect primarily in central Siberia (Yakutia, Sakha), the sub-clade has a much wider distribution area with incidental occurrences in the Near East and South Asia.

haplogroup-n1a-M2118
Frequency-Distribution Maps of Individual Sub-clades of hg N3a2, by Ilumäe et al. (2016).

The most striking aspect of the phylogeography of hg N is the spread of the N3a3’6-CTS6967 lineages. Considering the three geographically most distant populations in our study—Chukchi, Buryats, and Lithuanians—it is remarkable to find that about half of the Y chromosome pool of each consists of hg N3 and that they share the same sub-clade N3a3’6. The fractionation of N3a3’6 into the four sub-clades that cover such an extraordinarily wide area occurred in the mid-Holocene, about 5.0 kya (95% CI = 4.4–5.7 kya). It is hard to pinpoint the precise region where the split of these lineages occurred. It could have happened somewhere in the middle of their geographic spread around the Urals or further east in West Siberia, where current regional diversity of hg N sub-lineages is the highest (Figure 1B). Yet, it is evident that the spread of the newly arisen sub-clades of N3a3’6 in opposing directions happened very quickly. Today, it unites the East Baltic, East Fennoscandia, Buryatia, Mongolia, and Chukotka-Kamchatka (Beringian) Eurasian regions, which are separated from each other by approximately 5,000–6,700 km by air. N3a3’6 has high frequencies in the patrilineal pools of populations belonging to the Altaic, Uralic, several Indo-European, and Chukotko-Kamchatkan language families. There is no generally agreed, time-resolved linguistic tree that unites these linguistic phyla. Yet, their split is almost certainly at least several millennia older than the rather recent expansion signal of the N3a3’6 sub-clade, suggesting that its spread had little to do with linguistic affinities of men carrying the N3a3’6 lineages.

haplogroup_n3a3
Frequency-Distribution Maps of Individual Subclade N3a3 / N1a1a1a1a1a-CTS2929/VL29.

It was thus clear long ago that N1c-L392 lineages must have expanded explosively in the 5th millennium BC through Northern Eurasia, probably from a region to the north of Lake Baikal, and that this expansion – and succeeding ones through Northern Eurasia – may not be associated with any known language group until well into the Common Era.
