Bell Beakers and Mycenaeans from Yamnaya; Corded Ware from the forest steppe

eba-yamnaya-ancestry-hungary

I have recently written about the spread of Pre-Yamnaya or Yamnaya ancestry and Corded Ware-related ancestry throughout Eurasia, using exclusively analyses published by professional geneticists, and filling in the gaps and contradictory data with the most reasonable interpretations. I did so consciously, to avoid any suspicion that I was interspersing my own data or cherry picking results.

Now I’m finished recapitulating the known public data, and the only way forward is the assessment of these populations using the available datasets and free tools.

Understanding the complexities of qpAdm is fairly difficult without a proper genetic and statistical background, which I won’t pretend to have, so its tweaking to get strictly correct results would require an unending game of trial and error. I have sadly little time for this, even taking my tendency to procrastination into account… so I have used a simple model akin to those published before – in particular, the outgroup selection by Ning, Wang et al. (2019), who seem to be part of the only group interested in distinguishing Yamnaya-related from Corded Ware-related ancestry, probably the most relevant question discussed today in population genomics regarding the Proto-Indo-European and Proto-Uralic homelands.

eneolithic-steppe-best-fits
Supplementary Table 13. P values of rank=2 and admixture proportions in modelling Steppe ancestry populations as a three-way admixture of Eneolithic steppe Anatolian_Neolithic and WHG using 14 outgroups.
Left populations: Test, Eneolithic_steppe, Anatolian_Neolithic, WHG.
Right populations: Mbuti.DG, Ust_Ishim.DG, Kostenki14, MA1, Han.DG, Papuan.DG, Onge.DG, Villabruna, Vestonice16, ElMiron, Ethiopia_4500BP.SG, Karitiana.DG, Natufian, Iran_Ganj_Dareh_Neolithic.

I have used for all analyses below a merged dataset including the curated one of the Reich Lab, the latest on Central and South Asia by Narasimhan, Patterson et al. (2019), on Iberia by Olalde et al. (2019), and on the East Baltic by Saag et al. (2019), as well as datasets including samples from Wang et al. (2019) and Lamnidis et al. (2018). I used (and intend to use) the same merged dataset in all cases, despite its huge size, to avoid adding one more uncontrolled variable to the analyses, so that all results obtained can be compared.

I try to prepare in advance a bunch of relevant files with left pops and right pops for each model:

  1. It seems a priori more reasonable to use geographically and chronologically closer proxy populations (say, Trypillia or GAC for Steppe-related peoples) than hypothetic combinations of ancestral ones (viz. Anatolian farmer, WHG, and EHG).
  2. This also means using subgroups closer to the most likely source population, such as (Don-Volga interfluve) Yamnaya_Kalmykia rather than (Middle Volga) Yamnaya_Samara for the western expansion of late Repin/early Yamnaya, or the early Germany_Corded_Ware.SG or Czech_Corded Ware for the group closest to the Proto-Corded Ware population (see below), likely neighbouring the Upper Vistula region.
  3. I usually test two source populations for different targets, which seems like a much more efficient way of using computer resources, whenever I know what I want to test, since I need my PC back for its normal use; whenever I don’t know exactly what to test, I use three-way admixture models and look for subsets to try and improve the results.

I have probably left out some more complex models by individualizing the most relevant groups, but for the time being this would have to do. Also, no other formal stats have been used in any case, which is an evident shortcoming, ruling out an interpretation drawn directly and only from the results below.

Full qpAdm results for each batch of samples are presented in a Google Spreadsheet, with each tab (bottom of the page) showing a different combination of sources, usually in order of formally ‘best’ (first to the left) to ‘worst’ (last to the right) fits, although the order is difficult to select in highly heterogeneous target groups, as will be readily visible.

maykop-trypillia-intrusion-steppes
Disintegration, migration, and imports of the Azov–Black Sea region. First migration event (solid arrows): Gordineşti–Maikop expansion (groups: I – Bursuchensk; II – Zhyvotylivka; III – Vovchans’k; IV – Crimean; V – Lower Don; VI – pre-Kuban). Second migration event (hollow arrows): Repin expansion. After Rassamakin (1999), Demchenko (2016).

Corded Ware origins

The latest publications on the Yampil barrow complex have not improved much our understanding of the complexity of Corded Ware origins from an archaeological point of view, involving multiple cultural (hence likely population) influences. This bit is from Ivanova et al., Baltic-Pontic Studies (2015) 20:1, and most hypotheses of the paper remain unanswered (except maybe for the relevance of the Złota group):

In the light of the above outline therefore one should argue that the ‘architecture of barrows’ associated in the ‘Yampil landscape’ of the Middle Dniester Area with the Eneolithic (specifically, mainly with the TC), precedes the development of a similar phenomenon that can be observed from 2900/2800 BC in the Upper Dniester Area and drainage basin of the Upper Vistula, associated with the CWC [Goslar et al. 2015; Włodarczak 2006; 2007; 2008; Jarosz, Włodarczak 2007]. The most consuming research question therefore is whether ritual customs making use of Eneolithic (Tripolye) ‘barrow architecture’ could have penetrated northwards along the Dniester route, where GAC communities functioned. One could also ask what role the rituals played among the autochthons [Kośko 2000; Włodarczak 2008; 2014: 335; Ivanova, Toshchev 2015b].

This issue has already been discussed with a resulting tentative systemic taxonomy in the studies of Włodarczak, arguing for the Złota culture (ZC) in the Vistula region as an illustration of one of the (Małopolska) reception centres of civilization inspirations from the oldest Pontic ‘barrow culture’ circle associated with the Eneolithic and Early Bronze Age [Włodarczak 2008]. Notably, it is in the ZC that one can notice a set of cultural traits (catacomb grave construction, burial details, forms and decoration of vessels) analogous to those shared by the north-western Black Sea Coast groups of the forest-steppe Eneolithic (chiefly Zhyvotilovka-Volchansk) and the Late Tripolye circle (chiefly Usatovo-Gordinești-Horodiștea-Kasperovtsy).

gac-trypillia-usatovo-corded-ware
Globular Amphorae culture „exodus” to the Danube Delta: a – Globular Amphorae culture; b – GAC (1), Gorodsk (2), Vykhvatintsy (3) and Usatovo (4) groups of Trypillia culture; c – Coţofeni culture; d – northern border of the late phase of Baden culture;red arrows – direction of Globular Amphora culture expansion; blue arrow – direction of „reflux” of Globular Amphora culture (apud Włodarczak, 2008, with changes).

Taking into account that I6561 might be wrongly dated, we cannot include the Corded Ware-like sample of the end-5th millennium BC in the analysis of Corded Ware origins. That uncertainty in the chronology of the appearance of “Steppe ancestry” in Proto-Corded Ware peoples complicates the selection of any potential source population from the CHG cline.

Nevertheless, the lack of hg. R1a-M417 and sizeable Pre-Yamnaya-related ancestry in the sampled Pontic forest-steppe Eneolithic populations (represented exclusively by two samples from Dereivka ca. 3600-3400 BC) would leave open the interesting possibility that a similar ancestry got to the forest-steppe region between modern Poland and Ukraine during the known complex population movements of the Late Eneolithic.

It is known that Corded Ware-derived groups and Steppe Maykop show bad fits for Pre-Yamnaya/Yamnaya ancestry, and also that Steppe Maykop is a potential source of “Steppe-related ancestry” within the Eneolithic CHG mating network of the Pontic-Caspian steppes and forest-steppes. Testing Corded Ware for recent Trypillia and Maykop influences, proper of Late Trypillia and Late Maykop groups in the North Pontic area (such as Zhyvotylivka–Vovchans’k and Gordineşti) side by side with potential Pre-Yamnaya and Yamnaya sources makes thus sense:

Now, the main obvious difference between Khvalynsk-Yamnaya and Corded Ware is the long-lasting, pervasive Y-chromosome bottlenecks under R1b lineages in the former, compared to the haplogroup variability and late bottleneck under R1a-M417 in the latter, which speaks in favour – on top of everything else – of a different community of sub-Neolithic hunter-gatherers including hg. R1a-M417 hijacking the expansion of Steppe_Maykop-related ancestry around the Volhynian-Podolian Upland.

Akin to how Yamnaya patrilineal descendants hijacked regional EEF (±CWC) ancestry components mainly through exogamy, dragging them into the different expanding Bell Beaker groups (see below), but kept their Indo-European languages, these hunter-gatherers that admixed with peoples of “Steppe ancestry” were the most likely vector of expansion of Uralic languages in Eastern Europe.

corded-ware-from-trypillia-maykop
PCA of ancient Eurasian samples. Marked likely Proto-Corded Ware samples and potential origin of its PCA cluster based on qpAdm results. See full PCA and more related files.

Baltic Corded Ware

One of the most interesting aspects of the results above is the surprising heterogeneity of the different regional groups, which is also reflected in the Y-DNA variability of early Corded Ware samples.

Seeing how Baltic CWC groups, especially the early Latvia_LN sample, show particularly bad fits with the models above, it seems necessary to test how this population might have come to be. My first impression in 2017 was that they could represent early Corded Ware groups admixed with Yamnaya settlers through their interactions along the Dnieper-Dniester corridor.

However, I recently predicted that the most likely admixture leading to their ancestry and PCA cluster would involve a Corded Ware-like group and a group related to sub-Neolithic cultures of eastern Europe, whose best proxy to date are EHG-like Khvalynsk samples (i.e. excluding the outlier with Pre-Yamnaya ancestry, I0434):

corded-ware-pca-sub-neolithic-europe
Detail of the PCA of the Corded Ware expansion. See full PCA and more related files.

Late Corded Ware + Yamnaya vanguard

Relevant are also the mixtures of Corded Ware from Esperstedt, and particularly those of the sample I0104, which I have repeated many times in this blog I suspected to be influenced by vanguard Yamnaya settlers:

The infeasible models of CWC + Yamnaya_Kalmykia ± Hungary_Baden (see below for Bell Beakers) and the potential cluster formed with other samples from the Baltic suggest that it could represent a more complex set of mixtures with sub-Neolithic populations. On the other hand, its location in Germany, late date (ca. 2500 BC or later), and position in the PCA, together with the good fits obtained for Germany_Beaker as a source, suggest that the increase in Steppe-related ancestry + EEF makes it impossible for the model (as I set it) to directly include Yamnaya_Kalmykia, despite this excess Steppe-related ancestry actually coming from Yamnaya vanguard groups.

I think it is very likely that the future publication of EEF-admixed Yamnaya_Hungary samples (or maybe even Yamnaya vanguard samples) will improve the fits of this model.

These results confirm at least the need to distrust the common interpretation of mixtures including late Corded Ware samples from Esperstedt (giving rise to the “up to 75% Yamnaya ancestry of CWC” in the 2015 papers) as representative of the Corded Ware culture as a whole, and to keep always in mind that an admixture of European BA groups including Corded Ware Esperstedt as a source also includes East BBC-like ancestry, unless proven otherwise.

yamnaya-vanguard-corded-ware-chalcolithic-early
Yamnaya vanguard groups in Corded Ware territory before the expansion of Bell Beakers (ca. 2500 BC). See full map.

Bell Beaker expansion

A hotly (re)debated topic in the past 6 months or so, and for all the wrong reasons, is the origin of the Bell Beaker folk. Archaeology, linguistics, and different Y-chromosome bottlenecks clearly indicate that Bell Beakers were at the origin of the North-West Indo-European expansion in Europe, while the survival of Corded Ware-related groups in north-eastern Europe is clearly related to the expansion of Uralic languages.

NOTE. For the interesting case of Proto-Indo-Iranians expanding with Corded Ware-like ancestry, see more on the formation of Sintashta-Potapovka-Filatovka from East Uralic-speaking Abashevo and Pre-Proto-Indo-Iranian-speaking Poltavka herders. See also more on R1a in Indo-Iranians and on the social complexity of Sintashta.

Nevertheless, every single discarded theory out there seems to keep coming back to life from time to time, and a new wave of interest in “Bell Beaker from the Single Grave culture” somehow got revived in the process, too, because this obsession – unlike the “Bell Beakers from Iberia Chalcolithic” – is apparently acceptable in certain circles, for some reason.

We know that Iberian Beakers, British Beakers, or Sicilian EBA – representing the most likely closest source population of speakers of Proto-Galaico-Lusitanian, Pre-Celtic Indo-European, and Proto-Elymian, respectively – have already been successfully tested for a direct origin among Western European Beakers in Olalde et al. (2018), Olalde et al. (2019), and Fernandes et al. (2019).

This success in ascertaining a closer Beaker source is probably due to the physical isolation of the specific groups (related to Germany_Beaker, Netherlands_Beaker, and NE_Mediterranean_Beaker samples, respectively) after their migration into regions dominated by peoples without Steppe-related ancestry. Furthermore, Celtic-speaking populations expanding with Urnfield south of the Pyrenees also show a good fit with a source close to France_Beaker.

So I decided to test sampled Bell Beaker populations, to see if it could shed light to the most likely source population of individual Beaker groups and the direction of migration within Central Europe, i.e. roughly eastwards or westwards. As it was to be expected for closely related populations (see the relevant discussion here), an attempt to offer a simplistic analysis of direction based on formal stats does not make any sense, because most of the alternative hypotheses cannot be rejected:

Not only because of the similar values obtained, but because it is absurd to take p-values as a measure of anything, especially when most of these conflicting groups with slightly ‘better’ or ‘worse’ p-values represent multiple different mixtures of the type (Yamnaya + EEF) + (Corded Ware + EEF ± Yamnaya), impossible to distinguish without selecting proper, direct ancestral populations…

A further example of how explosive the Bell Beaker expansion was into different territories, and of their extensive local admixture, is shown by the unsuccessful attempt by Olalde et al. (2018) to obtain an origin of the EEF source for all Beaker groups (excluding Iberian Beakers):

bell-beaker-local-population-iberia
Investigating the genetic makeup of Beaker-complex-associated individuals. Testing different populations as a source for the Neolithic ancestry component in Beaker-complex-associated individuals. The table shows P values (* indicates values > 0.05) for the fit of the model: ‘Steppe_EBA + Neolithic/Copper Age’ source population.
burials-yamnaya-hungary
Map of attested Yamnaya pit-grave burials in the Hungarian plains; superimposed in shades of blue are common areas covered by floods before the extensive controls imposed in the 19th century; in orange, cumulative thickness of sand, unfavourable loamy sand layer. Marked are settlements/findings of Boleráz (ca. 3500 BC on), Baden (until ca. 2800 BC), Kostolac (precise dates unknown), and Yamna kurgans (from ca. 3100/3000 BC on).

Now, there is a simpler way to understand what kind of Steppe-related ancestry is proper of Bell Beakers. I tested two simple models for some Beaker groups: Yamnaya + Hungary Baden vs. Corded Ware + GAC Poland. After all, the Bell Beaker folk should prefer a source more closely related to either Yamnaya Hungary or Central European Corded Ware:

Interestingly, models including Yamnaya + Baden show good fits for the most important groups related to North-West Indo-Europeans, including Bell Beakers from Germany, the Netherlands, Italy, and Poland, representing the most likely closest source populations of speakers of Pre-Proto-Celtic, Pre-Proto-Germanic, Proto-Italo-Venetic, and Pre-Proto-Balto-Slavic, respectively.

The admixed Yamnaya samples from Hungary that will hopefully be published soon by the Jena Lab will most likely further improve these fits, especially in combination with intermediate Chalcolithic populations of the Middle and Upper Danube and its tributaries, to a point where there will be an absolute chronological and geographical genomic trail from the fully Yamnaya-like Yamnaya settlers from Hungary to all North-West Indo-European-speaking groups of the Early Bronze Age.

The only difference between groups will be the gradual admixture events of their source Beaker group with local populations on their expansion paths, including peoples of mainly EEF, CWC+EEF, or CWC+EEF+Yamnaya related ancestry. There is ample evidence beyond ancestry models to support this, in particular continued Y-DNA bottlenecks under typical Yamnaya paternal lineages, mainly represented by R1b-L51 subclades.

east-bell-beaker-group-expansion
Distribution of the Bell Beaker East Group, with its regional provinces, as of c. 2400 cal BC (after Heyd et al. 2004, modified). See full maps.

European Early Bronze Age

European EBA groups that might show conflicting results due to multiple admixture events with Corded Ware-related populations are the Únětice culture and the Nordic Late Neolithic.

The results for Únětice groups seem to be in line with what is expected of a Central European EBA population derived from Bell Beakers admixed with surrounding poulations of East Bell Beaker and/or late (Epi-)Corded Ware descent.

Potential models of mixture for Nordic Late Neolithic samples – despite the bad fits due to the lack of direct ancestral CWC and BBC groups from Denmark – seem to be impossible to justify as derived exclusively from Single Grave or (even less) from Battle Axe peoples, supporting immigration waves of Bell Beakers from the south and further admixture events with local groups through maritime domination.

PCA of ancient European samples. Marked are Bronze Age clusters. See full PCAs.

Balkans Bronze Age

The potential origin of the typical Corded Ware Steppe-related ancestry in the social upheaval and population movements of the Dnieper-Dniester forest-steppe corridor during the 4th millennium BC raises the question: how much do Balkan Bronze Age groups owe their ancestry to a population different than the spread of Pre-Yamnaya-like Suvorovo-Novodanilovka chieftains? Furthermore, which Bronze Age groups seem to be more likely derived exclusively from Pre-Yamnaya groups, and which are more likely to be derived from a mixture of Yamnaya and Pre-Yamnaya? Do the formal stats obtained correspond to the expected results for each group?

Since the expansion of hg. I2a-L699 (TMRCA ca. 5500 BC) need not be associated with Yamnaya, some of these values – together with the assessment of each individual archaeological culture – may question their origin in a Yamnaya-related expansion rather than in a Khvalynsk-related one.

NOTE. These are the last ones I was able to test yesterday, and I have not thought these models through, so feel free to propose other source and target groups. In particular, complex movements through the North Pontic area during the Late Eneolithic would suggest that there might have been different Steppe-ancestry-related vs. EEF-related interactions in the north-west and west Pontic area before and during the expansion of Yamnaya.

Mycenaeans

One of the key Indo-European populations that should be derived from Yamnaya to confirm the Steppe hypothesis, together with North-West Indo-Europeans, are Proto-Greeks, who will in turn improve our understanding of the preceding Palaeo-Balkan community. Unfortunately, we only have Mycenaean samples from the Aegean, with slight contributions of Steppe-related ancestry.

Still, analyses with potential source populations for this Steppe ancestry show that the Yamnaya outlier from Bulgaria is a good fit:

The comparison of all results makes it quite evident the why of the good fits from (Srubnaya-related) Bulgaria_MLBA I2163 or of Sintashta_MLBA relative to the only a priori reasonable Yamnaya and Catacomb sources: it is not about some hypothetical shared ancestor in Graeco-Aryan-speaking East Yamnaya– or even Catacomb-Poltavka-related groups, because all available Yamnaya-related peoples are almost indistinguishable from each other (at least with the sampling available today). These results reflect a sizeable contribution of similar EEF-related populations from around the Carpathians in both Steppe-related groups: Corded Ware and Yamnaya settlers from the Balkans.

mycenaeans-minyan-ware-greece-minoan
Cultural groups in and around the Balkans during the Early Bronze Age. See full maps.

qpAdm magic

In hobby ancestry magic, as in magic in general, it is not about getting dubious results out of thin air: misdirection is the key. A magician needs to draw the audience attention to ‘remarkable’ ancestry percentages coupled with ‘great’ (?) p-values that purportedly “prove” what the audience expects to see, distracting everyone from the true interesting aspects, like statistical design, the data used (and its shortcomings), other opposing models, a comparison of values, a proper interpretation…you name it.

I reckon – based on the examples above – that the following problems lie at the core of bad uses of qpAdm:

  1. In the formal aspect, the poor understanding of what p-values and other formal stats obtained actually mean, and – more importantly – what they don’t mean. The simplistic trend to accept results of a few analyses at face value is necessarily wrong, in so far as there is often no proper reasoning of what is being assessed and how, and there is never a previous opinion about what could be expected if the alternative hypotheses were true.
  2. In the interpretation aspect, the poor judgement of accompanying any results with simplistic, superficial, irrelevant, and often plainly wrong archaeological or linguistic data selected a posteriori; the inclusion of some racial or sociopolitical overtones in the mixture to set a propitious mood in the target audience; and a sort of ritualistic theatrics with the main theme of ‘winning’, that is best completed with ad hominems.

If you get rid of all this, the most reasonable interpretation of the output of a model proposed and tested should be similar to Nick Patterson’s words in his explanation of qpWave and qpAdm use:

Here we see that, at least in this analysis there are reasonable models with CordedWareNeolithic is a mix of either WHG or LBKNeolithic and YamnayaEBA. (…) The point of this note is not to give a serious phylogenetic analysis but the results here certainly support a major Steppe contribution to the Corded Ware population, which is entirely concordant with the archaeology [?].

Very far, as you can see, from the childish “Eureka! I proved the source!”-kind of thinking common among hobbyists.

The Mycenaean case is an illustrative example: if the Yamnaya outlier from Bulgaria were not available, and if one were not careful when designing and assessing those mixture models, the interpretation would range from erroneous (viz. a Graeco-Aryan substrate, as I initially thought) to impossible (say, inventing migration waves of Sintashta or Srubnaya peoples into Crete). The models presented above show that a contribution of Yamnaya to Mycenaeans couldn’t be rejected, and this alone should have been enough to accept Yamnaya as the most likely source population of “Steppe ancestry” in Proto-Greeks, pending intermediate samples from the Balkans. In other words, one could actually find that ‘the best’ p-values for source populations of Mycenaeans is a combination of modern Poles + Turks, despite the impracticality of such a model…

I haven’t been able to reproduce results which supposedly showed that Corded Ware is more likely to be derived from (Pre-)Yamnaya than other source population, or that Corded Ware is better suited as the ancestral population of Bell Beakers. The analyses above show values in line with what has been published in recent scientific papers, and what should be expected based on linguistics and archaeology. So I’ll go out on a limb here and say that it’s only through a careful selection of outgroups and samples tested, and of as few compared models as possible, that you could eventually get this kind of results and interpretation, if at all.

Whether that kind of special care for outgroups and samples is about (a) an acceptable fine-tuning of the analyses, (b) a simplistic selection dragged from the first papers published and applied indiscriminately to all models, or (c) cherry picking analyses until results fit the expected outcome, is a question that will become mostly irrelevant when future publications continue to support an origin of the expansion of ancient Indo-European languages in Khvalynsk- and Yamnaya-related migrations.

Feel free to suggest (reasonable) modifications to correct some of these models in the comments. Also, be sure to check out other values such as proportions, SD or SNPs of the different results that I might have not taken into account when assessing ‘good’ or ‘bad’ fits.

Related

The Lusatian culture, the most likely vector of Balto-Slavic expansions

early-bronze-age-languages-europe

New archaeological paper (behind paywall) New evidence on the southeast Baltic Late Bronze Age agrarian intensification and the earliest AMS dates of Lens culinaris and Vicia faba, by Minkevičius et al. Vegetation History and Archaeobotany (2019).

Interesting excerpts (emphasis mine):

Arrival of farming in the south-east Baltic

The current state of research reveals no firm evidence of crop cultivation in the region before the LBA (Piličiauskas et al. 2017b; Grikpėdis and Motuzaitė-Matuzevičiūtė 2018). Current archaeobotanical data firmly suggest the adoption of farming during the EBA to LBA transition. (…) By comparison, in other parts of N Europe subsistence economy of CWC groups was characterized by strong emphasis on animal husbandry, however crop cultivation was also used (Kirleis 2019; Vanhanen et al. 2019). CWC sites from the Netherlands, Denmark, Sweden and Germany reveal evidence of the cultivation of H. vulgare var. nudum, T. dicoccum, Linum usitatissimum (flax) (Oudemans and Kubiak-Martens 2014; Beckerman 2015; Kubiak- Martens et al. 2015).

It is (…) striking that earliest evidence of farming in the SE Baltic only appears in the deposits dating over 4,000 years later.

The environmental conditions of the SE Baltic presented a significant barrier and numerous genetic adaptations were required before farming could successfully spread into the region (Motuzaitė-Matuzevičiūtė 2018). Adaptations through seasonality changes usually play a major role in adapting to new environments (Sherratt 1980). These include establishing genetic controls on seasonality, especially flowering times and length of growing season (Fuller and Lucas 2017). Therefore, it could be argued that farming was only firmly established in the region around the LBA after several crop species, primarily barley, became adapted to the local environment and the risk of crop failure was reduced (Motuzaitė-Matuzevičiūtė 2018). The transition to farming was further aided by the climate warming which started around 750 cal bc (Gaigalas 2004; Sillasoo et al. 2009). In such a case the fragmented evidence from earlier periods is a likely illustration of the early attempts that have failed.

south-east-baltic-agrarian-communities
Map of sites mentioned in the text: 1 Duba and Palesa Lakes, 2 Šventoji, 3 Šarnelė, 4 Iru, 5 Kvietiniai, 6 Kreiči, 7 Turlojiškė, 8 Narkūnai, 9 Luokesa 1, 10 Mūkakalns, 11 Kivutkalns, 12 Asva, 13 Kukuliškiai

Social change

The LBA agrarian intensification of the SE Baltic was most likely not an isolated case but rather a part of broader social, economic and technological developments sweeping across northern Europe.

Evidence from sites across the Baltic Sea shows that the end of the EBA (ca. 1200 bc onward, after Gustafsson 1998) was marked by intensification of agriculture and changes in landscape management. This coincides with the agricultural developments observed on the SE fringes of the Baltic Sea and provides a context for the eventual arrival of farming, followed shortly by the rapid agrarian intensification of the region. Looking just south from the study region, we see that data from northern Poland reveal a sharp increase in both scale and intensity of agricultural activities during the EBA to LBA transition. Pollen records show significant environmental changes starting around 1400/1300 bc (Wacnik 2005, 2009; Wacnik et al. 2012). These were mostly a result of development of a production economy based on plant cultivation and animal raising. Even more significant changes during this period are visible in southern Scandinavia. Pollen records from S Sweden present evidence for an opening up of the forested landscape and the creation of extensive grasslands (Berglund 1991; Gustafsson 1998). Major changes are also apparent in archaeobotanical assemblages.

In general, during the end of the EBA northern Europe underwent a massive transformation of the farming system moving towards a more intensified agriculture aimed at surplus production. However, this should not be regarded as an isolated occurrence, but rather as a radical change of the whole society which took place throughout Europe (Gustafsson 1998). Intensification of contacts across northern Europe have integrated previously isolated regions into a wider network (Kristiansen and Larsson 2005; Wehlin 2013; Earle et al. 2015). It is therefore likely that farming spread into the SE fringes of the Baltic Sea alongside other innovations including malleable technologies and developments of social structure.

bronze-age-late-baltic
Late Bronze Age cultures in the Baltic. See full map.

The presence and scale of intensifying connections is well illustrated by SE Baltic archaeological material.

Firstly, the appearance of stone ship graves has served as a basis for locating the Nordic communication zones. Construction of such graves was limited to the coastal regions of Kurzeme, Saaremaa Island and the Northern Estonian coast near Tallinn and Kaliningrad (Graudonis 1967; Okulicz 1976; Lang 2007) and is generally regarded as a foreign burial custom which was common in Gotland and along the Scandinavian coast. This is also supported by the Staldzene and Tehumardi hoards (Vasks and Vijups 2004; Sperling 2013), which contained artefacts typical of Nordic culture.

Secondly, studies of early metallurgy and its products, both imported and created in the SE Baltic, have concluded that metal consumption in the LBA had more than doubled compared to the EBA (Sidrys and Luchtanas 1999). The SE Baltic region lacks any metal artefact types exclusive to the region and metal objects are dominated by artefact types originating from Nordic and Lusatian cultures (Sidrys and Luchtanas 1999; Lang 2007; Čivilytė 2014). This indicates that even after metal crafting reached the region, the technology remained exclusively of foreign origin. Rarely identifiable negatives of clay casting moulds were also made for artefacts of Nordic influence, such as Mälar type axes or Härnevi type pins (Čivilytė 2014; Sperling 2014).

Lastly, emerging social diversification was accompanied by the establishment of the first identifiable settlement pattern. Settlement locations were strategically chosen alongside economically significant routes, primarily on the coast and near the Daugava River. Hilltop areas were prioritized over the lowlands, and excavations on these sites have often revealed several stages of enclosure construction (Graudonis 1989). This has also been explained as a reflection of intensifying communication networks between Nordic and Lusatian cultures, and the indigenous communities of the SE Baltic.

Proto-Balto-Slavic

One of the aspects of my description of Balto-Slavic I am least convinced about is my acceptance of Kortlandt’s dialectal classification into Proto-East Baltic, Proto-West Baltic, and Proto-Slavic, due to its strong reliance on his own controversial theory of late laryngeal loss.

Kortlandt’s position regarding Balto-Slavic is that it is in fact simply ‘Proto-Baltic’, a language that would stem thus from an Indo-Baltic branch, which would be originally represented by Corded Ware, and which would have split suddenly in its three dialects without any common development between branches, including some intermediate hypothetic “Centum” Temematic substrate that would explain everything his model can’t…

As more genetic and archaeological data on northern Europe appears, his ideas about Balto-Slavic are becoming even less credible, fully at odds with his predicted population and cultural movements, in particular because of the evident shaping of Indo-European-speaking Europe through the expansion of the Bell Beaker culture from the Yamnaya of the Carpathian Basin, and of the shaping of Uralic-speaking Europe through the expansion of the Corded Ware culture.

bronze-age-middle-northern-europe
Middle Bronze Age cultures close to the Baltic ca. 1750-1250 BC. See full map.

The site of Turlojiškė in southern Lithuania (ca. 908-485 BC) – which Mittnik et al. (2018) classified as “Bronze Age, Trzciniec culture?” – can be more reasonably considered a settlement of incoming intensive agrarian communities under the influence of the Lusatian culture, like the Narkūnai hilltop settlement in eastern Lithuania (ca. 800–550 BC), or the enclosed hilltop settlement of Kukuliškiai in western Lithuania (ca. 887-506 BC), just 300 m east of the Baltic Sea, also referred to in the paper.

While the dates of sampled individuals include a huge span (ca. 2100-600 BC), those with confirmed radiocarbon dates are more precisely dated to the LBA-EIA transition. More specifically, the first clearly western influence is seen in the early outlier Turlojiškė1932 (ca. 1230-920 BC), while later samples and samples from Kivutkalns, in Latvia, show major genetic continuity with indigenous populations, compatible with the new chiefdom-based systems of the Baltic and the known lack of massive migrations to the region.

Contacts with western groups of the Nordic Bronze Age and Lusatian cultures intensified – based on existing archaeological and archaeobotanical evidence – in the LBA, especially from ca. 1100/1000 BC on, and Baltic languages seem to have thus little to do with the disappearing Trzciniec culture, and more with the incoming Lusatian influence.

Both facts – more simple dialectalization scheme, and more recent Indo-European expansion to the east – support the spread of Proto-Baltic into the south-east Baltic area precisely around this time, and is also compatible with an internal separation from Proto-Slavic during the expansion of the Lusatian culture.

pca-late-bronze-age-balto-slavic-finnic
Top Left:Likely Baltic, Slavic, and Balto-Finnic-speaking territories (asynchronous), overlaid over Late Bronze Age cultures. Balto-Slavic in green: West(-East?) Baltic (B1), unattested early Baltic (B2), and Slavic (S). Late Balto-Finnic (F) in cyan. In red, Tollense and Turlojiškė sampling. Dashed black line: Balto-Slavic/West Uralic hydrotoponymy border until ca. 1000 AD. Top right: PCA of groups from the Early Bronze Age to the Late Bronze Age. Marked are Iwno/Pre-Trzciniec of Gustorzyn (see below), Late Trzciniec/Iron Age samples from Turlojiškė, and in dashed line approximate extent of Tollense cluster; Y-DNA haplogroups during the Late Bronze Age (Bottom left) and during the Early Iron Age (Bottom right). Notice a majority non-R1a lineages among sampled Early Slavs. See full maps and PCAs.

Even though comparative grammar is traditionally known to be wary of resorting to language contamination or language contact, the truth is that – very much like population genomics – trying to draw a ‘pure’ phylogenetic tree for Balto-Slavic has never worked very well, and the most likely culprit is the Slavic expansion to the south-east into territories which underwent different and complex genetic and linguistic influences for centuries (see here and here).

The close interaction of Nordic BA and Lusatian cultures (and their cultural predominance over) indigenous eastern Baltic peoples from ca. 1100 BC fits (part of) the known intense lexical borrowings of Balto-Finnic from Palaeo-Germanic and from early Proto-Baltic, as well as (part of) the known Germanic–Balto-Slavic contacts, whereas the evident Balto-Finnic-like substrate of Balto-Slavic, and especially of Baltic, must stem from the acculturation of those indigenous East Baltic peoples.

The relative chronology of hydrotoponymy in the East Baltic shows that essentially all ancestral layers to the north of the Daugava must have been Uralic, while roughly south of the Daugava they seem to be mostly Indo-European. The question remains, though, when did this Indo-European layer start?

Despite the many centuries that could separate the attestation of southern place- and river-names from northern ones, Old European is also defined by linguistic traits, which would imply that the same language inferred from Western and Southern European hydrotoponymy is that found in the Baltic, hence all from North-West Indo-European-speaking Bell Beakers and derived Early European Bronze Age groups.

Interestingly, though, it is well known that some modern Baltic toponyms can’t be easily distinguished from the Old European layers – unlike those of Iberia or the British Isles, which show some attested language change in the proto-historical and historical period – which may imply both (a) continuity of Baltic languages since the EBA, but also that (b) the Baltic naming system is a confounding factor in assessing the ancestral expansion of Old European. The latter is becoming more and more likely with each new linguistic, archaeological, and genetic paper.

up-river
Hydronyms in up-. One among many examples of scarcely attested appellatives that appear inflated in the Baltic due to modern use.

In summary, a survival of a hypothetical late Trzciniec language in Lithuania or as part of the expanding Lusatian community is not the most economic explanation for what is seen in genetics and archaeology. On the other hand, the cluster formed by the Tollense samples (a site corresponding to the Nordic Bronze Age), the Turlojiškė outlier, and the early Slavs from Bohemia all depict an eastward expansion of Balto-Slavic languages from Central Europe, at the same time as Celtic expanded to the west with the Urnfield culture.

NOTE. Another, more complicated question, though, is if this expanding Proto-Baltic language accompanying agriculture represents the extinct
early Proto-Baltic dialect from which Balto-Finnic borrowed words, hence Proto-Baltic proper expanded later, or if this early Baltic branch could have been part of the Trzciniec expansion. Again, the answer in archaeological and genetic terms seems to be the former. For a more detailed discussion of this and more, see European hydrotoponymy (IV): tug of war between Balto-Slavic and West Uralic.

As I said recently, the slight increase in Corded Ware-like ancestry among Iron Age Estonians, if it were statistically relevant and representative of an incoming population – and not just the product of “usual” admixture with immediate neighbours – need not be from south-eastern Corded Ware groups, because the Akozino-Malär cultural exchange seems to have happened as an interaction in both directions, and not just as an eastward migration imagined by Carpelan and Parpola.

Archaeology and genetics could actually suggest then (at least in part) an admixture with displaced indigenous West Uralic-speaking peoples from the south-west, to the south of the Daugava River, at the same time as the Indo-European – Uralic language frontier must have shifted to its traditional location, precisely during the LBA / EIA transition around 1000 BC.

NOTE. For more on this, see the supplementary materials of Saag et al. (2019).

fortified-settlements-lba-ia
Distribution of fortified settlements (filled circles) and other hilltop sites (empty circles) of the Late Bronze Age and Pre-Roman Iron Ages in the East Baltic region. Tentative area of most intensive contacts between Baltic and Balto-Finnic communities marked with a dashed line. Image modified from (Lang 2016).

The tight relationship of the three communities also accounts for the homogeneous distribution of expanding haplogroup N1c-VL29 (possibly associated with Akozino warrior-traders) in the whole Baltic Sea area, such as those appearing in the Estonian Iron Age samples, which have no clearly defined route(s) of expansion.

It is even possible that they emerged first in the south, linked to marriage alliances of Akozino chieftains with Baltic- and Germanic-speaking chiefdoms around the Baltic Sea (see N1c in Germanic Iron Age), because the expansion of (some) N1c lineages with Gulf of Finland Finnic to the north was more clearly associated with their known bottleneck ca. 2,000 years ago.

Related

Yamnaya ancestry: mapping the Proto-Indo-European expansions

steppe-ancestry-expansion-europe

The latest papers from Ning et al. Cell (2019) and Anthony JIES (2019) have offered some interesting new data, supporting once more what could be inferred since 2015, and what was evident in population genomics since 2017: that Proto-Indo-Europeans expanded under R1b bottlenecks, and that the so-called “Steppe ancestry” referred to two different components, one – Yamnaya or Steppe_EMBA ancestry – expanding with Proto-Indo-Europeans, and the other one – Corded Ware or Steppe_MLBA ancestry – expanding with Uralic speakers.

The following maps are based on formal stats published in the papers and supplementary materials from 2015 until today, mainly on Wang et al. (2018 & 2019), Mathieson et al. (2018) and Olalde et al. (2018), and others like Lazaridis et al. (2016), Lazaridis et al. (2017), Mittnik et al. (2018), Lamnidis et al. (2018), Fernandes et al. (2018), Jeong et al. (2019), Olalde et al. (2019), etc.

NOTE. As in the Corded Ware ancestry maps, the selected reports in this case are centered on the prototypical Yamnaya ancestry vs. other simplified components, so everything else refers to simplistic ancestral components widespread across populations that do not necessarily share any recent connection, much less a language. In fact, most of the time they clearly didn’t. They can be interpreted as “EHG that is not part of the Yamnaya component”, or “CHG that is not part of the Yamnaya component”. They can’t be read as “expanding EHG people/language” or “expanding CHG people/language”, at least no more than maps of “Steppe ancestry” can be read as “expanding Steppe people/language”. Also, remember that I have left the default behaviour for color classification, so that the highest value (i.e. 1, or white colour) could mean anything from 10% to 100% depending on the specific ancestry and period; that’s what the legend is for… But, fere libenter homines id quod volunt credunt.

Sections:

  1. Neolithic or the formation of Early Indo-European
  2. Eneolithic or the expansion of Middle Proto-Indo-European
  3. Chalcolithic / Early Bronze Age or the expansion of Late Proto-Indo-European
  4. European Early Bronze Age and MLBA or the expansion of Late PIE dialects

1. Neolithic

Anthony (2019) agrees with the most likely explanation of the CHG component found in Yamnaya, as derived from steppe hunter-fishers close to the lower Volga basin. The ultimate origin of this specific CHG-like component that eventually formed part of the Pre-Yamnaya ancestry is not clear, though:

The hunter-fisher camps that first appeared on the lower Volga around 6200 BC could represent the migration northward of un-admixed CHG hunter-fishers from the steppe parts of the southeastern Caucasus, a speculation that awaits confirmation from aDNA.

neolithic-chg-ancestry
Natural neighbor interpolation of CHG ancestry among Neolithic populations. See full map.

The typical EHG component that formed part eventually of Pre-Yamnaya ancestry came from the Middle Volga Basin, most likely close to the Samara region, as shown by the sampled Samara hunter-gatherer (ca. 5600-5500 BC):

After 5000 BC domesticated animals appeared in these same sites in the lower Volga, and in new ones, and in grave sacrifices at Khvalynsk and Ekaterinovka. CHG genes and domesticated animals flowed north up the Volga, and EHG genes flowed south into the North Caucasus steppes, and the two components became admixed.

neolithic-ehg-ancestry
Natural neighbor interpolation of EHG ancestry among Neolithic populations. See full map.

To the west, in the Dnieper-Dniester area, WHG became the dominant ancestry after the Mesolithic, at the expense of EHG, revealing a likely mating network reaching to the north into the Baltic:

Like the Mesolithic and Neolithic populations here, the Eneolithic populations of Dnieper-Donets II type seem to have limited their mating network to the rich, strategic region they occupied, centered on the Rapids. The absence of CHG shows that they did not mate frequently if at all with the people of the Volga steppes (…)

neolithic-whg-ancestry
Natural neighbor interpolation of WHG ancestry among Neolithic populations. See full map.

North-West Anatolia Neolithic ancestry, proper of expanding Early European farmers, is found up to border of the Dniester, as Anthony (2007) had predicted.

neolithic-anatolia-farmer-ancestry
Natural neighbor interpolation of Anatolia Neolithic ancestry among Neolithic populations. See full map.

2. Eneolithic

From Anthony (2019):

After approximately 4500 BC the Khvalynsk archaeological culture united the lower and middle Volga archaeological sites into one variable archaeological culture that kept domesticated sheep, goats, and cattle (and possibly horses). In my estimation, Khvalynsk might represent the oldest phase of PIE.

(…) this middle Volga mating network extended down to the North Caucasian steppes, where at cemeteries such as Progress-2 and Vonyuchka, dated 4300 BC, the same Khvalynsk-type ancestry appeared, an admixture of CHG and EHG with no Anatolian Farmer ancestry, with steppe-derived Y-chromosome haplogroup R1b. These three individuals in the North Caucasus steppes had higher proportions of CHG, overlapping Yamnaya. Without any doubt, a CHG population that was not admixed with Anatolian Farmers mated with EHG populations in the Volga steppes and in the North Caucasus steppes before 4500 BC. We can refer to this admixture as pre-Yamnaya, because it makes the best currently known genetic ancestor for EHG/CHG R1b Yamnaya genomes.

From Wang et al (2019):

Three individuals from the sites of Progress 2 and Vonyuchka 1 in the North Caucasus piedmont steppe (‘Eneolithic steppe’), which harbour EHG and CHG related ancestry, are genetically very similar to Eneolithic individuals from Khvalynsk II and the Samara region. This extends the cline of dilution of EHG ancestry via CHG-related ancestry to sites immediately north of the Caucasus foothills

eneolithic-pre-yamnaya-ancestry
Natural neighbor interpolation of Pre-Yamnaya ancestry among Neolithic populations. See full map. This map corresponds roughly to the map of Khvalynsk-Novodanilovka expansion, and in particular to the expansion of horse-head pommel-scepters (read more about Khvalynsk, and specifically about horse symbolism)

NOTE. Unpublished samples from Ekaterinovka have been previously reported as within the R1b-L23 tree. Interestingly, although the Varna outlier is a female, the Balkan outlier from Smyadovo shows two positive SNP calls for hg. R1b-M269. However, its poor coverage makes its most conservative haplogroup prediction R-M343.

The formation of this Pre-Yamnaya ancestry sets this Volga-Caucasus Khvalynsk community apart from the rest of the EHG-like population of eastern Europe.

eneolithic-ehg-ancestry
Natural neighbor interpolation of non-Pre-Yamnaya EHG ancestry among Eneolithic populations. See full map.

Anthony (2019) seems to rely on ADMIXTURE graphics when he writes that the late Sredni Stog sample from Alexandria shows “80% Khvalynsk-type steppe ancestry (CHG&EHG)”. While this seems the most logical conclusion of what might have happened after the Suvorovo-Novodanilovka expansion through the North Pontic steppes (see my post on “Steppe ancestry” step by step), formal stats have not confirmed that.

In fact, analyses published in Wang et al. (2019) rejected that Corded Ware groups are derived from this Pre-Yamnaya ancestry, a reality that had been already hinted in Narasimhan et al. (2018), when Steppe_EMBA showed a poor fit for expanding Srubna-Andronovo populations. Hence the need to consider the whole CHG component of the North Pontic area separately:

eneolithic-chg-ancestry
Natural neighbor interpolation of non-Pre-Yamnaya CHG ancestry among Eneolithic populations. See full map. You can read more about population movements in the late Sredni Stog and closer to the Proto-Corded Ware period.

NOTE. Fits for WHG + CHG + EHG in Neolithic and Eneolithic populations are taken in part from Mathieson et al. (2019) supplementary materials (download Excel here). Unfortunately, while data on the Ukraine_Eneolithic outlier from Alexandria abounds, I don’t have specific data on the so-called ‘outlier’ from Dereivka compared to the other two analyzed together, so these maps of CHG and EHG expansion are possibly showing a lesser distribution to the west than the real one ca. 4000-3500 BC.

eneolithic-whg-ancestry
Natural neighbor interpolation of WHG ancestry among Eneolithic populations. See full map.

Anatolia Neolithic ancestry clearly spread to the east into the north Pontic area through a Middle Eneolithic mating network, most likely opened after the Khvalynsk expansion:

eneolithic-anatolia-farmer-ancestry
Natural neighbor interpolation of Anatolia Neolithic ancestry among Eneolithic populations. See full map.
eneolithic-iran-chl-ancestry
Natural neighbor interpolation of Iran Chl. ancestry among Eneolithic populations. See full map.

Regarding Y-chromosome haplogroups, Anthony (2019) insists on the evident association of Khvalynsk, Yamnaya, and the spread of Pre-Yamnaya and Yamnaya ancestry with the expansion of elite R1b-L754 (and some I2a2) individuals:

eneolithic-early-y-dna
Y-DNA haplogroups in West Eurasia during the Early Eneolithic in the Pontic-Caspian steppes. See full map, and see culture, ADMIXTURE, Y-DNA, and mtDNA maps of the Early Eneolithic and Late Eneolithic.

3. Early Bronze Age

Data from Wang et al. (2019) show that Corded Ware-derived populations do not have good fits for Eneolithic_Steppe-like ancestry, no matter the model. In other words: Corded Ware populations show not only a higher contribution of Anatolia Neolithic ancestry (ca. 20-30% compared to the ca. 2-10% of Yamnaya); they show a different EHG + CHG combination compared to the Pre-Yamnaya one.

eneolithic-steppe-best-fits
Supplementary Table 13. P values of rank=2 and admixture proportions in modelling Steppe ancestry populations as a three-way admixture of Eneolithic steppe Anatolian_Neolithic and WHG using 14 outgroups.
Left populations: Test, Eneolithic_steppe, Anatolian_Neolithic, WHG.
Right populations: Mbuti.DG, Ust_Ishim.DG, Kostenki14, MA1, Han.DG, Papuan.DG, Onge.DG, Villabruna, Vestonice16, ElMiron, Ethiopia_4500BP.SG, Karitiana.DG, Natufian, Iran_Ganj_Dareh_Neolithic.

Yamnaya Kalmykia and Afanasievo show the closest fits to the Eneolithic population of the North Caucasian steppes, rejecting thus sizeable contributions from Anatolia Neolithic and/or WHG, as shown by the SD values. Both probably show then a Pre-Yamnaya ancestry closest to the late Repin population.

wang-eneolithic-steppe-caucasus-yamnaya
Modelling results for the Steppe and Caucasus cluster. Admixture proportions based on (temporally and geographically) distal and proximal models, showing additional AF ancestry in Steppe groups and additional gene flow from the south in some of the Steppe groups as well as the Caucasus groups. See tables above. Modified from Wang et al. (2019). Within a blue square, Yamnaya-related groups; within a cyan square, Corded Ware-related groups. Green background behind best p-values. In red circle, SD of AF/WHG ancestry contribution in Afanasevo and Yamnaya Kalmykia, with ranges that almost include 0%.

EBA maps include data from Wang et al. (2018) supplementary materials, specifically unpublished Yamnaya samples from Hungary that appeared in analysis of the preprint, but which were taken out of the definitive paper. Their location among Yamnaya settlers from Hungary is speculative, although most uncovered kurgans in Hungary are concentrated in the Tisza-Danube interfluve.

eba-yamnaya-ancestry
Natural neighbor interpolation of Pre-Yamnaya ancestry among Early Bronze Age populations. See full map. This map corresponds roughly with the known expansion of late Repin/Yamnaya settlers.

The Y-chromosome bottleneck of elite males from Proto-Indo-European clans under R1b-L754 and some I2a2 subclades, already visible in the Khvalynsk sampling, became even more noticeable in the subsequent expansion of late Repin/early Yamnaya elites under R1b-L23 and I2a-L699:

chalcolithic-early-y-dna
Y-DNA haplogroups in West Eurasia during the Yamnaya expansion. See full map and maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Early Chalcolithic and Yamnaya Hungary.

Maps of CHG, EHG, Anatolia Neolithic, and probably WHG show the expansion of these components among Corded Ware-related groups in North Eurasia, apart from other cultures close to the Caucasus:

NOTE. For maps with actual formal stats of Corded Ware ancestry from the Early Bronze Age to the modern times, you can read the post Corded Ware ancestry in North Eurasia and the Uralic expansion.

eba-chg-ancestry
Natural neighbor interpolation of non-Pre-Yamnaya CHG ancestry among Early Bronze Age populations. See full map.
eba-ehg-ancestry
Natural neighbor interpolation of non-Pre-Yamnaya EHG ancestry among Early Bronze Age populations. See full map.
eba-whg-ancestry
Natural neighbor interpolation of WHG ancestry among Early Bronze Age populations. See full map.
eba-anatolia-farmer-ancestry
Natural neighbor interpolation of Anatolia Neolithic ancestry among Early Bronze Age populations. See full map.
eba-iran-chl-ancestry
Natural neighbor interpolation of Iran Chl. ancestry among Early Bronze Age populations. See full map.

4. Middle to Late Bronze Age

The following maps show the most likely distribution of Yamnaya ancestry during the Bell Beaker-, Balkan-, and Sintashta-Potapovka-related expansions.

4.1. Bell Beakers

The amount of Yamnaya ancestry is probably overestimated among populations where Bell Beakers replaced Corded Ware. A map of Yamnaya ancestry among Bell Beakers gets trickier for the following reasons:

  • Expanding Repin peoples of Pre-Yamnaya ancestry must have had admixture through exogamy with late Sredni Stog/Proto-Corded Ware peoples during their expansion into the North Pontic area, and Sredni Stog in turn had probably some Pre-Yamnaya admixture, too (although they don’t appear in the simplistic formal stats above). This is supported by the increase of Anatolia farmer ancestry in more western Yamna samples.
  • Later, Yamnaya admixed through exogamy with Corded Ware-like populations in Central Europe during their expansion. Even samples from the Middle to Upper Danube and around the Lower Rhine will probably show increasing contributions of Steppe_MLBA, at the same time as they show an increasing proportion of EEF-related ancestry.
  • To complicate things further, the late Corded Ware Espersted family (from ca. 2500 BC or later) shows, in turn, what seems like a recent admixture with Yamnaya vanguard groups, with the sample of highest Yamnaya ancestry being the paternal uncle of other individuals (all of hg. R1a-M417), suggesting that there might have been many similar Central European mating networks from the mid-3rd millennium BC on, of (mainly) Yamnaya-like R1b elites displaying a small proportion of CW-like ancestry admixing through exogamy with Corded Ware-like peoples who already had some Yamnaya ancestry.
mlba-yamnaya-ancestry
Natural neighbor interpolation of Yamnaya ancestry among Middle to Late Bronze Age populations (Esperstedt CWC site close to BK_DE, label is hidden by BK_DE_SAN). See full map. You can see how this map correlated with the map of Late Copper Age migrations and Yamanaya into Bell Beaker expansion.

NOTE. Terms like “exogamy”, “male-driven migration”, and “sex bias”, are not only based on the Y-chromosome bottlenecks visible in the different cultural expansions since the Palaeolithic. Despite the scarce sampling available in 2017 for analysis of “Steppe ancestry”-related populations, it appeared to show already a male sex bias in Goldberg et al. (2017), and it has been confirmed for Neolithic and Copper Age population movements in Mathieson et al. (2018) – see Supplementary Table 5. The analysis of male-biased expansion of “Steppe ancestry” in CWC Esperstedt and Bell Beaker Germany is, for the reasons stated above, not very useful to distinguish their mutual influence, though.

Based on data from Olalde et al. (2019), Bell Beakers from Germany are the closest sampled ones to expanding East Bell Beakers, and those close to the Rhine – i.e. French, Dutch, and British Beakers in particular – show a clear excess “Steppe ancestry” due to their exogamy with local Corded Ware groups:

Only one 2-way model fits the ancestry in Iberia_CA_Stp with P-value>0.05: Germany_Beaker + Iberia_CA. Finding a Bell Beaker-related group as a plausible source for the introduction of steppe ancestry into Iberia is consistent with the fact that some of the individuals in the Iberia_CA_Stp group were excavated in Bell Beaker associated contexts. Models with Iberia_CA and other Bell Beaker groups such as France_Beaker (P-value=7.31E-06), Netherlands_Beaker (P-value=1.03E-03) and England_Beaker (P-value=4.86E-02) failed, probably because they have slightly higher proportions of steppe ancestry than the true source population.

olalde-iberia-chalcolithic

The exogamy with Corded Ware-like groups in the Lower Rhine Basin seems at this point undeniable, as is the origin of Bell Beakers around the Middle-Upper Danube Basin from Yamnaya Hungary.

To avoid this excess “Steppe ancestry” showing up in the maps, since Bell Beakers from Germany pack the most Yamnaya ancestry among East Bell Beakers outside Hungary (ca. 51.1% “Steppe ancestry”), I equated this maximum with BK_Scotland_Ach (which shows ca. 61.1% “Steppe ancestry”, highest among western Beakers), and applied a simple rule of three for “Steppe ancestry” in Dutch and British Beakers.

NOTE. Formal stats for “Steppe ancestry” in Bell Beaker groups are available in Olalde et al. (2018) supplementary materials (PDF). I didn’t apply this adjustment to Bk_FR groups because of the R1b Bell Beaker sample from the Champagne/Alsace region reported by Samantha Brunel that will pack more Yamnaya ancestry than any other sampled Beaker to date, hence probably driving the Yamnaya ancestry up in French samples.

The most likely outcome in the following years, when Yamnaya and Corded Ware ancestry are investigated separately, is that Yamnaya ancestry will be much lower the farther away from the Middle and Lower Danube region, similar to the case in Iberia, so the map above probably overestimates this component in most Beakers to the north of the Danube. Even the late Hungarian Beaker samples, who pack the highest Yamnaya ancestry (up to 75%) among Beakers, represent likely a back-migration of Moravian Beakers, and will probably show a contribution of Corded Ware ancestry due to the exogamy with local Moravian groups.

Despite this decreasing admixture as Bell Beakers spread westward, the explosive expansion of Yamnaya R1b male lineages (in words of David Reich) and the radical replacement of local ones – whether derived from Corded Ware or Neolithic groups – shows the true extent of the North-West Indo-European expansion in Europe:

chalcolithic-late-y-dna
Y-DNA haplogroups in West Eurasia during the Bell Beaker expansion. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Late Copper Age and of the Yamnaya-Bell Beaker transition.

4.2. Palaeo-Balkan

There is scarce data on Palaeo-Balkan movements yet, although it is known that:

  1. Yamnaya ancestry appears among Mycenaeans, with the Yamnaya Bulgaria sample being its best current ancestral fit;
  2. the emergence of steppe ancestry and R1b-M269 in the eastern Mediterranean was associated with Ancient Greeks;
  3. Thracians, Albanians, and Armenians also show R1b-M269 subclades and “Steppe ancestry”.

4.3. Sintashta-Potapovka-Filatovka

Interestingly, Potapovka is the only Corded Ware derived culture that shows good fits for Yamnaya ancestry, despite having replaced Poltavka in the region under the same Corded Ware-like (Abashevo) influence as Sintashta.

This proves that there was a period of admixture in the Pre-Proto-Indo-Iranian community between CWC-like Abashevo and Yamnaya-like Catacomb-Poltavka herders in the Sintashta-Potapovka-Filatovka community, probably more easily detectable in this group because of the specific temporal and geographic sampling available.

srubnaya-yamnaya-ehg-chg-ancestry
Supplementary Table 14. P values of rank=3 and admixture proportions in modelling Steppe ancestry populations as a four-way admixture of distal sources EHG, CHG, Anatolian_Neolithic and WHG using 14 outgroups.
Left populations: Steppe cluster, EHG, CHG, WHG, Anatolian_Neolithic
Right populations: Mbuti.DG, Ust_Ishim.DG, Kostenki14, MA1, Han.DG, Papuan.DG, Onge.DG, Villabruna, Vestonice16, ElMiron, Ethiopia_4500BP.SG, Karitiana.DG, Natufian, Iran_Ganj_Dareh_Neolithic.

Srubnaya ancestry shows a best fit with non-Pre-Yamnaya ancestry, i.e. with different CHG + EHG components – possibly because the more western Potapovka (ancestral to Proto-Srubnaya Pokrovka) also showed good fits for it. Srubnaya shows poor fits for Pre-Yamnaya ancestry probably because Corded Ware-like (Abashevo) genetic influence increased during its formation.

On the other hand, more eastern Corded Ware-derived groups like Sintashta and its more direct offshoot Andronovo show poor fits with this model, too, but their fits are still better than those including Pre-Yamnaya ancestry.

mlba-ehg-ancestry
Natural neighbor interpolation of non-Pre-Yamnaya EHG ancestry among Middle to Late Bronze Age populations. See full map.
mlba-chg-ancestry
Natural neighbor interpolation of non-Pre-Yamnaya CHG ancestry among Middle to Late Bronze Age populations. See full map.
mlba-anatolia-farmer-ancestry
Natural neighbor interpolation of Anatolia Neolithic ancestry among Middle to Late Bronze Age populations. See full map.
mlba-iran-chl-ancestry
Natural neighbor interpolation of Iran Chl. ancestry among Middle to Late Bronze Age populations. See full map.

NOTE For maps with actual formal stats of Corded Ware ancestry from the Early Bronze Age to the modern times, you should read the post Corded Ware ancestry in North Eurasia and the Uralic expansion instead.

The bottleneck of Proto-Indo-Iranians under R1a-Z93 was not yet complete by the time when the Sintashta-Potapovka-Filatovka community expanded with the Srubna-Andronovo horizon:

early-bronze-age-y-dna
Y-DNA haplogroups in West Eurasia during the European Early Bronze Age. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Early Bronze Age.

4.4. Afanasevo

At the end of the Afanasevo culture, at least three samples show hg. Q1b (ca. 2900-2500 BC), which seemed to point to a resurgence of local lineages, despite continuity of the prototypical Pre-Yamnaya ancestry. On the other hand, Anthony (2019) makes this cryptic statement:

Yamnaya men were almost exclusively R1b, and pre-Yamnaya Eneolithic Volga-Caspian-Caucasus steppe men were principally R1b, with a significant Q1a minority.

Since the only available samples from the Khvalynsk community are R1b (x3), Q1a(x1), and R1a(x1), it seems strange that Anthony would talk about a “significant minority”, unless Q1a (potentially Q1b in the newer nomenclature) will pop up in some more individuals of those ca. 30 new to be published. Because he also mentions I2a2 as appearing in one elite burial, it seems Q1a (like R1a-M459) will not appear under elite kurgans, although it is still possible that hg. Q1a was involved in the expansion of Afanasevo to the east.

middle-bronze-age-y-dna
Y-DNA haplogroups in West Eurasia during the Middle Bronze Age. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Middle Bronze Age and the Late Bronze Age.

Okunevo, which replaced Afanasevo in the Altai region, shows a majority of hg. Q1b, but also some R1b-M269 samples proper of Afanasevo, suggesting partial genetic continuity.

NOTE. Other sampled Siberian populations clearly show a variety of Q subclades that likely expanded during the Palaeolithic, such as Baikal EBA samples from Ust’Ida and Shamanka with a majority of Q1b, and hg. Q reported from Elunino, Sagsai, Khövsgöl, and also among peoples of the Srubna-Andronovo horizon (the Krasnoyarsk MLBA outlier), and in Karasuk.

From Damgaard et al. Science (2018):

(…) in contrast to the lack of identifiable admixture from Yamnaya and Afanasievo in the CentralSteppe_EMBA, there is an admixture signal of 10 to 20% Yamnaya and Afanasievo in the Okunevo_EMBA samples, consistent with evidence of western steppe influence. This signal is not seen on the X chromosome (qpAdm P value for admixture on X 0.33 compared to 0.02 for autosomes), suggesting a male-derived admixture, also consistent with the fact that 1 of 10 Okunevo_EMBA males carries a R1b1a2a2 Y chromosome related to those found in western pastoralists. In contrast, there is no evidence of western steppe admixture among the more eastern Baikal region region Bronze Age (~2200 to 1800 BCE) samples.

This Yamnaya ancestry has been also recently found to be the best fit for the Iron Age population of Shirenzigou in Xinjiang – where Tocharian languages were attested centuries later – despite the haplogroup diversity acquired during their evolution, likely through an intermediate Chemurchek culture (see a recent discussion on the elusive Proto-Tocharians).

Haplogroup diversity seems to be common in Iron Age populations all over Eurasia, most likely due to the spread of different types of sociopolitical structures where alliances played a more relevant role in the expansion of peoples. A well-known example of this is the spread of Akozino warrior-traders in the whole Baltic region under a partial N1a-VL29-bottleneck associated with the emerging chiefdom-based systems under the influence of expanding steppe nomads.

early-iron-age-y-dna
Y-DNA haplogroups in West Eurasia during the Early Iron Age. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Early Iron Age and Late Iron Age.

Surprisingly, then, Proto-Tocharians from Shirenzigou pack up to 74% Yamnaya ancestry, in spite of the 2,000 years that separate them from the demise of the Afanasevo culture. They show more Yamnaya ancestry than any other population by that time, being thus a sort of Late PIE fossils not only in their archaic dialect, but also in their genetic profile:

shirenzigou-afanasievo-yamnaya-andronovo-srubna-ulchi-han

The recent intrusion of Corded Ware-like ancestry, as well as the variable admixture with Siberian and East Asian populations, both point to the known intense Old Iranian and Old/Middle Chinese contacts. The scarce Proto-Samoyedic and Proto-Turkic loans in Tocharian suggest a rather loose, probably more distant connection with East Uralic and Altaic peoples from the forest-steppe and steppe areas to the north (read more about external influences on Tocharian).

Interestingly, both R1b samples, MO12 and M15-2 – likely of Asian R1b-PH155 branch – show a best fit for Andronovo/Srubna + Hezhen/Ulchi ancestry, suggesting a likely connection with Iranians to the east of Xinjiang, who later expanded as the Wusun and Kangju. How they might have been related to Huns and Xiongnu individuals, who also show this haplogroup, is yet unknown, although Huns also show hg. R1a-Z93 (probably most R1a-Z2124) and Steppe_MLBA ancestry, earlier associated with expanding Iranian peoples of the Srubna-Andronovo horizon.

All in all, it seems that prehistoric movements explained through the lens of genetic research fit perfectly well the linguistic reconstruction of Proto-Indo-European and Proto-Uralic.

Related

Volga Basin R1b-rich Proto-Indo-Europeans of (Pre-)Yamnaya ancestry

yamnaya-expansion

New paper (behind paywall) by David Anthony, Archaeology, Genetics, and Language in the Steppes: A Comment on Bomhard, complementing in a favourable way Bomhard’s Caucasian substrate hypothesis in the current issue of the JIES.

NOTE. I have tried to access this issue for some days, but it’s just not indexed in my university library online service (ProQuest) yet. This particular paper is on Academia.edu, though, as are Bomhard’s papers on this issue in his site.

Interesting excerpts (emphasis mine):

Along the banks of the lower Volga many excavated hunting-fishing camp sites are dated 6200-4500 BC. They could be the source of CHG ancestry in the steppes. At about 6200 BC, when these camps were first established at Kair Shak III and Varfolomievka (42 and 28 on Figure 2), they hunted primarily saiga antelope around Dzhangar, south of the lower Volga, and almost exclusively onagers in the drier desert-steppes at Kair-Shak, north of the lower Volga. Farther north at the lower/middle Volga ecotone, at sites such as Varfolomievka and Oroshaemoe hunter-fishers who made pottery similar to that at Kair-Shak hunted onagers and saiga antelope in the desert-steppe, horses in the steppe, and aurochs in the riverine forests. Finally, in the Volga steppes north of Saratov and near Samara, hunter-fishers who made a different kind of pottery (Samara type) and hunted wild horses and red deer definitely were EHG. A Samara hunter-gatherer of this era buried at Lebyazhinka IV, dated 5600-5500 BC, was one of the first named examples of the EHG genetic type (Haak et al. 2015). This individual, like others from the same region, had no or very little CHG ancestry. The CHG mating network had not yet reached Samara by 5500 BC.

morgunova-eneolithic-pontic-caspian
Eneolithic settlements (1–5, 7, 10–16, 20, 22–43, 48, 50), burial grounds (6, 8–9, 17–19, 21, 47, 49) and kurgans (44–46) of the steppe Ural-Volga region: 1 Ivanovka; 2 Turganik; 3 Kuzminki; 4 Mullino; 5 Davlekanovo; 6 Sjezheye (burial ground); 7 Vilovatoe; 8 Ivanovka; 9 Krivoluchye; 10–13 LebjazhinkaI-III-IV-V; 14 Gundorovka; 15–16 Bol. Rakovka I-II; 17–18 Khvalunsk I-II; 19 Lipoviy Ovrag; 20 Alekseevka; 21 Khlopkovskiy; 22 Kuznetsovo I; 23 Ozinki II; 24 Altata; 25 Monakhov I; 26 Oroshaemoe; 27 Rezvoe; 28 Varpholomeevka; 29 Vetelki; 30 Pshenichnoe; 31 Kumuska; 32 Inyasovo; 33 Shapkino VI; 34 Russkoe Truevo I; 35 Tsaritsa I-II; 36 Kamenka I; 37 Kurpezhe-Molla; 38 Istay; 39 Isekiy; 40 Koshalak; 41 Kara-Khuduk; 42 Kair-Shak VI; 43 Kombakte; 44 Berezhnovka I-II; 45 Rovnoe; 46 Politotdelskoe; 47 burial near s. Pushkino; 48 Elshanka; 49 Novoorsk; 50 Khutor Repin. Modified from Morgunova (2014).

But before 4500 BC, CHG ancestry appeared among the EHG hunter-fishers in the middle Volga steppes from Samara to Saratov, at the same time that domesticated cattle and sheep-goats appeared. The Reich lab now has whole-genome aDNA data from more than 30 individuals from three Eneolithic cemeteries in the Volga steppes between the cities of Saratov and Samara (Khlopkov Bugor, Khvalynsk, and Ekaterinovka), all dated around the middle of the fifth millennium BC. Many dates from human bone are older, even before 5000 BC, but they are affected by strong reservoir effects, derived from a diet rich in fish, making them appear too old (Shishlina et al 2009), so the dates I use here accord with published and unpublished dates from a few dated animal bones (not fish-eaters) in graves.

Only three individuals from Khvalynsk are published, and they were first published in a report that did not mention the site in the text (Mathieson et al. 2015), so they went largely unnoticed. Nevertheless, they are crucial for understanding the evolution of the Yamnaya mating network in the steppes. They were mentioned briefly in Damgaard et al (2018) but were not graphed. They were re-analyzed and their admixture components were illustrated in a bar graph in Wang et al (2018: figure 2c), but they are not the principal focus of any published study. All of the authors who examined them agreed that these three Khvalynsk individuals, dated about 4500 BC, showed EHG ancestry admixed substantially with CHG, and not a trace of Anatolian Farmer ancestry, so the CHG was a Hotu-Cave or Kotias-Cave type of un-admixed CHG. The proportion of CHG in the Wang et al. (2018) bar graphs is about 20-30% in two individuals, substantially less CHG than in Yamnaya; but the third Khvalynsk individual had more than 50% CHG, like Yamnaya. The ca. 30 additional unpublished individuals from three middle Volga Eneolithic cemeteries, including Khvalynsk, preliminarily show the same admixed EHG/CHG ancestry in varying proportions. Most of the males belonged to Y-chromosome haplogroup R1b1a, like almost all Yamnaya males, but Khvalynsk also had some minority Y-chromosome haplogroups (R1a, Q1a, J, I2a2) that do not appear or appear only rarely (I2a2) in Yamnaya graves.

eneolithic-steppes
Pontic-Caspian steppe and neighbouring groups in the Neolithic. See full map.

Wang et al. (2018) discovered that this middle Volga mating network extended down to the North Caucasian steppes, where at cemeteries such as Progress-2 and Vonyuchka, dated 4300 BC, the same Khvalynsk-type ancestry appeared, an admixture of CHG and EHG with no Anatolian Farmer ancestry, with steppe-derived Y-chromosome haplogroup R1b. These three individuals in the North Caucasus steppes had higher proportions of CHG, overlapping Yamnaya. Without any doubt, a CHG population that was not admixed with Anatolian Farmers mated with EHG populations in the Volga steppes and in the North Caucasus steppes before 4500 BC. We can refer to this admixture as pre-Yamnaya, because it makes the best currently known genetic ancestor for EHG/CHG R1b Yamnaya genomes. The Progress-2 individuals from North Caucasus steppe graves lived not far from the pre-Maikop farmers of the Belaya valley, but they did not exchange mates, according to their DNA.

The hunter-fisher camps that first appeared on the lower Volga around 6200 BC could represent the migration northward of un-admixed CHG hunter-fishers from the steppe parts of the southeastern Caucasus, a speculation that awaits confirmation from aDNA. After 5000 BC domesticated animals appeared in these same sites in the lower Volga, and in new ones, and in grave sacrifices at Khvalynsk and Ekaterinovka. CHG genes and domesticated animals flowed north up the Volga, and EHG genes flowed south into the North Caucasus steppes, and the two components became admixed. After approximately 4500 BC the Khvalynsk archaeological culture united the lower and middle Volga archaeological sites into one variable archaeological culture that kept domesticated sheep, goats, and cattle (and possibly horses). In my estimation, Khvalynsk might represent the oldest phase of PIE.

eneolithic-early-steppes
Pontic-Caspian steppe and neighbouring groups in the Early Eneolithic. See full map.

Anatolian Farmer ancestry and Yamnaya origins

The Eneolithic Volga-North Caucasus mating network (Khvalynsk/Progress-2 type) exhibited EHG/CHG admixtures and Y-chromosome haplogroups similar to Yamnaya, but without Yamnaya’s additional Anatolian Farmer ancestry. (…)

Like the Mesolithic and Neolithic populations here, the Eneolithic populations of Dnieper-Donets II type seem to have limited their mating network to the rich, strategic region they occupied, centered on the Rapids. The absence of CHG shows that they did not mate frequently if at all with the people of the Volga steppes, a surprising but undeniable discovery. Archaeologists have seen connections in ornament types and in some details of funeral ritual between Dnieper-Donets cemeteries of the Mariupol-Nikol’skoe type and cemeteries in the middle Volga steppes such as Khvalynsk and S’yez’zhe (Vasiliev 1981:122-123). Also their cranio-facial types were judged to be similar (Bogdanov and Khokhlov 2012:212). So it it surprising that their aDNA does not indicate any genetic admixture with Khvalynsk or Progress-2. Also, neither they nor the Volga steppe Eneolithic populations showed any Anatolian Farmer ancestry. (…)

All three of the steppe-admixed exceptions were from the Varna region (Mathieson et al. 2018). One of them was the famous “golden man’ at Varna (Krause et al. 2016), Grave 43, whose steppe ancestry was the most doubtful of the three. If he had steppe ancestry, it was sufficiently distant (five+ generations before him) that he was not a statistically significant outlier, but he was displaced in the steppe direction, away from the central values of the majority of typical Anatolian Farmers at Varna and elsewhere. The other two, at Varna (grave 158, a 5-7-year-old girl) and Smyadovo (grave 29, a male 20-25 years old), were statistically significant outliers who had recent steppe ancestry (consistent with grandparents or great-grandparents) of the EHG/CHG Khvalynsk/Progress-2 type, not of the Dnieper Rapids EHG/WHG type.

(…) I believe that the Suvorovo-Cernavoda I movement into the lower Danube valley and the Balkans about 4300 BC separated early PIE-speakers (pre-Anatolian) from the steppe population that stayed behind in the steppes and that later developed into late PIE and Yamnaya.

This archaeological transition marked the breakdown of the mating barrier between steppe and Anatolian Farmer mating networks. After this 4300-4200 BC event, Anatolian Farmer ancestry began to pop up in the steppes. The currently oldest sample with Anatolian Farmer ancestry in the steppes in an individual at Aleksandriya, a Sredni Stog cemetery on the Donets in eastern Ukraine. Sredni Stog has often been discussed as a possible Yamnaya ancestor in Ukraine (Anthony 2007: 239- 254). The single published grave is dated about 4000 BC (4045– 3974 calBC/ 5215±20 BP/ PSUAMS-2832) and shows 20% Anatolian Farmer ancestry and 80% Khvalynsk-type steppe ancestry (CHG&EHG). His Y-chromosome haplogroup was R1a-Z93, similar to the later Sintashta culture and to South Asian Indo-Aryans, and he is the earliest known sample to show the genetic adaptation to lactase persistence (I3910-T). Another pre-Yamnaya grave with Anatolian Farmer ancestry was analyzed from the Dnieper valley at Dereivka, dated 3600-3400 BC (grave 73, 3634–3377 calBC/ 4725±25 BP/ UCIAMS-186349). She also had 20% Anatolian Farmer ancestry, but she showed less CHG than Aleksandriya and more Dereivka-1 ancestry, not surprising for a Dnieper valley sample, but also showing that the old fifth-millennium-type EHG/WHG Dnieper ancestry survived into the fourth millennium BC in the Dnieper valley (Mathieson et al. 2018).

late-eneolithic-repin
Pontic-Caspian steppe and neighbouring groups in the Late Eneolithic. See full map.

Probably, late PIE (Yamnaya) evolved in the same part of the steppes—the Volga-Caucasus steppes between the lower Don, the lower and middle Volga, and the North Caucasus piedmont—where early PIE evolved, and where appropriate EHG/CHG admixtures and Y-chromosome haplogroups were seen already in the Eneolithic (without Anatolian Farmer). There have always been archaeologists who argued for an origin of Yamnaya in the Volga steppes, including Gimbutas (1963), Merpert (1974), and recently Morgunova (2014), who argued that this was where Repin-type ceramics, an important early Yamnaya pottery type, first appeared in dated contexts before Yamnaya, about 3600 BC. The genetic evidence is consistent with Yamnaya EHG/CHG origins in the Volga-Caucasus steppes. Also, if contact with the Maikop culture was a fundamental cause of the innovations in transport and metallurgy that defined the Yamnaya culture, then the lower Don-North Caucasus-lower Volga steppes, closest to the North Caucasus, would be where the earliest phase is expected.

I would still guess that the Darkveti-Meshoko culture and its descendant Maikop culture established the linguistic ancestor of the Northwest Caucasian languages in approximately the region where they remained. I also accept the general consensus that the appearance of the hierarchical Maikop culture about 3600 BC had profound effects on pre-Yamnaya and early Yamnaya steppe cultures. Yamnaya metallurgy borrowed from the Maikop culture two-sided molds, tanged daggers, cast shaft hole axes with a single blade, and arsenical copper. Wheeled vehicles might have entered the steppes through Maikop, revolutionizing steppe economies and making Yamnaya pastoral nomadism possible after 3300 BC.

For those who still hoped that Proto-Indo-Europeans of Yamnaya/Afanasievo ancestry from the Don-Volga region were associated with the expansion of hg. R1a-M417, in a sort of mythical “R1-rich” Indo-European society, it seems this is going to be yet another prediction based on ancestry magic that goes wrong.

Proto-Indo-Europeans were, however, associated with other subclades beyond R1b-M269, probably (as I wrote recently) R1b-V1636, I2a-L699, Q1a-M25, and R1a-YP1272, but also interestingly some J subclade, so let’s see what surprises the new study on Khvalynsk and Yamnaya settlers from the Carpathian Basin brings…

On the bright side, it is indirectly confirmed that late Sredni Stog formed part of the neighbouring Corded Ware-like populations of ca. 20-30%+ Anatolian farmer ancestry that gave Yamnaya its share (ca. 6-10%), relative to the comparatively unmixed Khvalynsk and late Repin population (as shown by Afanasevo).

In this steppe mating network that opened up after the Khvalynsk expansion, the increasing admixture of Anatolian farmer-related ancestry in Yamnaya from east (ca. 2-10%) to west (ca. 6-15%) points to an exogamy of late Repin males in their western/south-western regions with populations around the Don River basin and beyond (and endogamy within the Yamnaya community), in an evolution relevant for language expansions and language contacts during the Late Eneolithic.

NOTE. “Mating network” is my new preferred term for “ancestry”. Also great to see scholars finally talk about “Pre-Yamnaya” ancestry, which – combined with the distinction of Yamnaya from Corded Ware ancestry – will no doubt help differentiate fine-scale population movements of steppe- and forest-steppe-related populations.

north-pontic-kvityana-dereivka-repin
Modified from Rassamakin (1999), adding red color to Repin expansion. The system of the latest Eneolithic Pointic cultures and the sites of the Zhivotilovo-Volchanskoe type: 1) Volchanskoe; 2) Zhivotilovka; 3) Vishnevatoe; 4) Koisug.

The whole issue of the JIES is centered on Caucasian influences on Early PIE as an Indo-Uralic dialect, and this language contact/substrate is useful to locate the most likely candidates for the Northeast and Northwest Caucasian and the Proto-Indo-European homelands.

On the other hand, it would also be interesting to read a discussion of how this Volga homeland of Middle PIE and Don-Volga-Ural homeland of Late PIE would be reconciled with the known continuous contacts of Uralic with Middle and Late PIE (see here) to locate the most likely Proto-Uralic homeland.

Especially because Corded Ware fully replaced all sub-Neolithic groups to the north and east of Khvalynsk/Yamnaya, like Volosovo, so no other population neighbouring Middle and Late Proto-Indo-Europeans survived into the Bronze Age…

EDIT: For those new to this blog, this information on unpublished samples from the Volga River basin is yet another confirmation of Khokhlov’s report on the R1b-L23 samples from Yekaterinovka, and its confirmation by a co-author of The unique elite Khvalynsk male from a Yekaterinovskiy Cape burial, apart from more support to the newest data placing Yekaterinovka culturally and probably chronologically between Samara and Khvalynsk.

Related

Corded Ware ancestry in North Eurasia and the Uralic expansion

uralic-clines-nganasan

Now that it has become evident that Late Repin (i.e. Yamnaya/Afanasevo) ancestry was associated with the migration of R1b-L23-rich Late Proto-Indo-Europeans from the steppe in the second half of the the 4th millennium BC, there’s still the question of how R1a-rich Uralic speakers of Corded Ware ancestry expanded , and how they spread their languages throughout North Eurasia.

Modern North Eurasians

I have been collecting information from the supplementary data of the latest papers on modern and ancient North Eurasian peoples, including Jeong et al. (2019), Saag et al. (2019), Sikora et al. (2018), or Flegontov et al. (2019), and I have tried to add up their information on ancestral components and their modern and historical distributions.

Fortunately, the current obsession with simplifying ancestry components into three or four general, atemporal groups, and the common use of the same ones across labs, make it very simple to merge data and map them.

Corded Ware ancestry

There is no doubt about the prevalent ancestry among Uralic-speaking peoples. A map isn’t needed to realize that, because ancient and modern data – like those recently summarized in Jeong et al. (2019) – prove it. But maps sure help visualize their intricate relationship better:

natural-modern-srubnaya-ancestry
Natural neighbor interpolation of Srubnaya ancestry among modern populations. See full map.
kriging-modern-srubnaya-ancestry
Kriging interpolation of Srubnaya ancestry among modern populations. See full map

Interestingly, the regions with higher Corded Ware-related ancestry are in great part coincident with (pre)historical Finno-Ugric-speaking territories:

uralic-languages-modern
Modern distribution of Uralic languages, with ancient territory (in the Common Era) labelled and delimited by a red line. For more information on the ancient territory see here.

Edit (29/7/2019): Here is the full Steppe_MLBA ancestry map, including Steppe_MLBA (vs. Indus Periphery vs. Onge) in modern South Asian populations from Narasimhan et al. (2018), apart from the ‘Srubnaya component’ in North Eurasian populations. ‘Dummy’ variables (with 0% ancestry) have been included to the south and east of the map to avoid weird interpolations of Steppe_MLBA into Africa and East Asia.

modern-steppe-mlba-ancestry2
Natural neighbor interpolation of Steppe MLBA-like ancestry among modern populations. See full map.

Anatolia Neolithic ancestry

Also interesting are the patterns of non-CWC-related ancestry, in particular the apparent wedge created by expanding East Slavs, which seems to reflect the intrusion of central(-eastern) European ancestry into Finno-Permic territory.

NOTE. Read more on Balto-Slavic hydrotoponymy, on the cradle of Russians as a Finno-Permic hotspot, and about Pre-Slavic languages in North-West Russia.

natural-modern-lbk-en-ancestry
Natural neighbor interpolation of LBK EN ancestry among modern populations. See full map.
kriging-modern-lbk-en-ancestry
Kriging interpolation of LBK EN ancestry among modern populations. See full map

WHG ancestry

The cline(s) between WHG, EHG, ANE, Nganasan, and Baikal HG are also simplified when some of them excluded, in this case EHG, represented thus in part by WHG, and in part by more eastern ancestries (see below).

modern-whg-ancestry
Natural neighbor interpolation of WHG ancestry among modern populations. See full map.
kriging-modern-whg-ancestry
Kriging interpolation of WHG ancestry among modern populations. See full map.

Arctic, Tundra or Forest-steppe?

Data on Nganasan-related vs. ANE vs. Baikal HG/Ulchi-related ancestry is difficult to map properly, because both ancestry components are usually reported as mutually exclusive, when they are in fact clearly related in an ancestral cline formed by different ancient North Eurasian populations from Siberia.

When it comes to ascertaining the origin of the multiple CWC-related clines among Uralic-speaking peoples, the question is thus how to properly distinguish the proportions of WHG-, EHG-, Nganasan-, ANE or BaikalHG-related ancestral components in North Eurasia, i.e. how did each dialectal group admix with regional groups which formed part of these clines east and west of the Urals.

The truth is, one ought to test specific ancient samples for each “Siberian” ancestry found in the different Uralic dialectal groups, but the simplistic “Siberian” label somehow gets a pass in many papers (see a recent example).

Below qpAdm results with best fits for Ulchi ancestry, Afontova Gora 3 ancestry, and Nganasan ancestry, but some populations show good fits for both and with similar proportions, so selecting one necessarily simplifies the distribution of both.

Ulchi ancestry

modern-ulchi-ancestry
Natural neighbor interpolation of Ulchi ancestry among modern populations. See full map.
kriging-modern-ulchi-ancestry
Kriging interpolation of Ulchi ancestry among modern populations. See full map.

ANE ancestry

natural-modern-ane-ancestry
Natural neighbor interpolation of ANE ancestry among modern populations. See full map.
kriging-modern-ane-ancestry
Kriging interpolation of ANE ancestry among modern populations. See full map.

Nganasan ancestry

modern-nganasan-ancestry
Natural neighbor interpolation of Nganasan ancestry among modern populations. See full map.
kriging-modern-nganasan-ancestry
Kriging interpolation of Nganasan ancestry among modern populations. See full map.

Iran Chalcolithic

A simplistic Iran Chalcolithic-related ancestry is also seen in the Altaic cline(s) which (like Corded Ware ancestry) expanded from Central Asia into Europe – apart from its historical distribution south of the Caucasus:

modern-iran-chal-ancestry
Natural neighbor interpolation of Iran Neolithic ancestry among modern populations. See full map.
kriging-modern-iran-neolithic-ancestry
Kriging interpolation of Iran Chalcolithic ancestry among modern populations. See full map.

Other models

The first question I imagine some would like to know is: what about other models? Do they show the same results? Here is the simplistic combination of ancestry components published in Damgaard et al. (2018) for the same or similar populations:

NOTE. As you can see, their selection of EHG vs. WHG vs. Nganasan vs. Natufian vs. Clovis of is of little use, but corroborate the results from other papers, and show some interesting patterns in combination with those above.

EHG

damgaard-modern-ehg-ancestry
Natural neighbor interpolation of EHG ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-ehg-ancestry
Kriging interpolation of EHG ancestry among modern populations. See full map.

Natufian ancestry

damgaard-modern-natufian-ancestry
Natural neighbor interpolation of Natufian ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-natufian-ancestry
Kriging interpolation of Natufian ancestry among modern populations. See full map.

WHG ancestry

damgaard-modern-whg-ancestry
Natural neighbor interpolation of WHG ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-whg-ancestry
Kriging interpolation of WHG ancestry among modern populations. See full map.

Baikal HG ancestry

damgaard-modern-baikalhg-ancestry
Natural neighbor interpolation of Baikal hunter-gatherer ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-baikal-hg-ancestry
Kriging interpolation of Baikal HG ancestry among modern populations. See full map.

Ancient North Eurasians

Once the modern situation is clear, relevant questions are, for example, whether EHG-, WHG-, ANE, Nganasan-, and/or Baikal HG-related meta-populations expanded or became integrated into Uralic-speaking territories.

When did these admixture/migration events happen?

How did the ancient distribution or expansion of Palaeo-Arctic, Baikalic, and/or Altaic peoples affect the current distribution of the so-called “Siberian” ancestry, and of hg. N1a, in each specific population?

NOTE. A little excursus is necessary, because the calculated repetition of a hypothetic opposition “N1a vs. R1a” doesn’t make this dichotomy real:

  1. There was not a single ethnolinguistic community represented by hg. R1a after the initial expansion of Eastern Corded Ware groups, or by hg. N1a-L392 after its initial expansion in Siberia:
  2. Different subclades became incorporated in different ways into Bronze Age and Iron Age communities, most of which without an ethnolinguistic change. For example, N1a subclades became incorporated into North Eurasian populations of different languages, reaching Uralic- and Indo-European-speaking territories of north-eastern Europe during the late Iron Age, at a time when their ancestral origin or language in Siberia was impossible to ascertain. Just like the mix found among Proto-Germanic peoples (R1b, R1a, and I1)* or among Slavic peoples (I2a, E1b, R1a)*, the mix of many Uralic groups showing specific percentages of R1a, N1a, or Q subclades* reflect more or less recent admixture or acculturation events with little impact on their languages.

*other typically northern and eastern European haplogroups are also represented in early Germanic (N1a, I2, E1b, J, G2), Slavic (I1, G2, J) and Finno-Permic (I1, R1b, J) peoples.

ananino-culture-new
Map of archaeological cultures in north-eastern Europe ca. 8th-3rd centuries BC. [The Mid-Volga Akozino group not depicted] Shaded area represents the Ananino cultural-historical society. Fading purple arrows represent likely stepped movements of subclades of haplogroup N for centuries (e.g. Siberian → Ananino → Akozino → Fennoscandia [N-VL29]; Circum-Arctic → forest-steppe [N1, N2]; etc.). Blue arrows represent eventual expansions of Uralic peoples to the north. Modified image from Vasilyev (2002).

The problem with mapping the ancestry of the available sampling of ancient populations is that we lack proper temporal and regional transects. The maps that follow include cultures roughly divided into either “Bronze Age” or “Iron Age” groups, although the difference between samples may span up to 2,000 years.

NOTE. Rough estimates for more external groups (viz. Sweden Battle Axe/Gotland_A for the NW, Srubna from the North Pontic area for the SW, Arctic/Nganasan for the NE, and Baikal EBA/”Ulchi-like” for the SE) have been included to offer a wider interpolated area using data already known.

Bronze Age

Similar to modern populations, the selection of best fit “Siberian” ancestry between Baikal HG vs. Nganasan, both potentially ± ANE (AG3), is an oversimplification that needs to be addressed in future papers.

Corded Ware ancestry

bronze-age-corded-ware-ancestry
Natural neighbor interpolation of Srubnaya ancestry among Bronze Age populations. See full map.

Nganasan-like ancestry

bronze-age-nganasan-like-ancestry
Natural neighbor interpolation of Nganasan-like ancestry among Bronze Age populations. See full map.

Baikal HG ancestry

bronze-age-baikal-hg-ancestry
Natural neighbor interpolation of Baikal Hunter-Gatherer ancestry among Bronze Age populations. See full map.

Afontova Gora 3 ancestry

bronze-age-afontova-gora-ancestry
Natural neighbor interpolation of Afontova Gora 3 ancestry among Bronze Age populations. See full map.

Iron Age

Corded Ware ancestry

Interestingly, the moderate expansion of Corded Ware-related ancestry from the south during the Iron Age may be related to the expansion of hg. N1a-VL29 into the chiefdom-based system of north-eastern Europe, including Ananyino/Akozino and later expanding Akozino warrior-traders around the Baltic Sea.

NOTE. The samples from Levänluhta are centuries older than those from Estonia (and Ingria), and those from Chalmny Varre are modern ones, so this region has to be read as a south-west to north-east distribution from the Iron Age to modern times.

iron-age-corded-ware-ancestry
Natural neighbor interpolation of Srubnaya ancestry among Iron Age populations. See full map.

Baikal HG-like ancestry

The fact that this Baltic N1a-VL29 branch belongs in a group together with typically Avar N1a-B197 supports the Altaic origin of the parent group, which is possibly related to the expansion of Baikalic ancestry and Iron Age nomads:

iron-age-baikal-ancestry
Natural neighbor interpolation of Baikal HG ancestry among Iron Age populations. See full map.

Nganasan-like ancestry

The dilution of Nganasan-like ancestry in an Arctic region featuring “Siberian” ancestry and hg. N1a-L392 at least since the Bronze Age supports the integration of hg. N1a-Z1934, sister clade of Ugric N1a-Z1936, into populations west and east of the Urals with the expansion of Uralic languages to the north into the Tundra region (see here).

The integration of N1a-Z1934 lineages into Finnic-speaking peoples after their migration to the north and east, and the displacement or acculturation of Saami from their ancestral homeland, coinciding with known genetic bottlenecks among Finns, is yet another proof of this evolution:

iron-age-nganasan-ancestry
Natural neighbor interpolation of Nganasan ancestry among Iron Age populations. See full map.

WHG ancestry

Similarly, WHG ancestry doesn’t seem to be related to important population movements throughout the Bronze Age, which excludes the multiple North Eurasian populations that will be found along the clines formed by WHG, EHG, ANE, Nganasan, Baikal HG ancestry as forming part of the Uralic ethnogenesis, although they may be relevant to follow later regional movements of specific populations.

iron-age-whg-ancestry
Natural neighbor interpolation of WHG ancestry among Iron Age populations. See full map.

Conclusion

It seems natural that people used to look at maps of haplogroup distribution from the 2000s, coupled with modern language distributions, and would try to interpret them in a certain way, reaching thus the wrong conclusions whose consequences are especially visible today when ancient DNA keeps contradicting them.

In hindsight, though, assuming that Balto-Slavs expanded with Corded Ware and hg. R1a, or that Uralians expanded with “Siberian” ancestry and hg. N1a, was as absurd as looking at maps of ancestry and haplogroup distribution of ancient and modern Native Americans, trying to divide them into “Germanic” or “Iberian”…

The evolution of each specific region and cultural group of North Eurasia is far from being clear. However, the general trend speaks clearly in favour of an ancient, Bronze Age distribution of North Eurasian ancestry and haplogroups that have decreased, diluted, or become incorporated into expanding Uralians of Corded Ware ancestry, occasionally spreading with inter-regional expansions of local groups.

Given the relatively recent push of Altaic and Indo-European languages into ancestral Uralic-speaking territories, only the ancient Corded Ware expansion remains compatible with the spread of Uralic languages into their historical distribution.

Related

Iron Age Tocharians of Yamnaya ancestry from Afanasevo show hg. R1b-M269 and Q1a1

New open access Ancient Genomes Reveal Yamnaya-Related Ancestry and a Potential Source of Indo-European Speakers in Iron Age Tianshan, by Ning et al. Current Biology (2019).

Interesting excerpts (emphasis mine, changes for clarity):

Here, we report the first genome-wide data of 10 ancient individuals from northeastern Xinjiang. They are dated to around 2,200 years ago and were found at the Iron Age Shirenzigou site. We find them to be already genetically admixed between Eastern and Western Eurasians. We also find that the majority of the East Eurasian ancestry in the Shirenzigou individuals is related to northeastern Asian populations, while the West Eurasian ancestry is best presented by ∼20% to 80% Yamnaya-like ancestry. Our data thus suggest a Western Eurasian steppe origin for at least part of the ancient Xinjiang population. Our findings furthermore support a Yamnaya-related origin for the now extinct Tocharian languages in the Tarim Basin, in southern Xinjiang.

Haplogroups

The dominant mtDNA lineages of the Shirenzigou people are commonly found in modern and ancient West Eurasian populations, such as U4, U5, and H, while they also have East Eurasian-specific haplogroups A, D4, and G3, preliminarily documenting admixed ancestry from eastern and western Eurasia.

The admixture profile is also shown on the paternal Y chromosome side that 4 out of 6 males in Shirenzigou (Figure S2) belong to the West Eurasian-specific haplogroup R1b (n = 2) and East Eurasian-specific haplogroup Q1a (n = 2), the former is predominant in ancient Yamnaya and nearly 100% in Afanasievo, different from the Middle and Late Bronze Age Steppe groups (Steppe_MLBA) such as Andronovo, [Potapovka], Srubnaya, and Sintashta whose Y chromosomal haplogroup is mainly R1a.

tocharians-y-dna-mtdna

Autosomal

We first carried out principal component analysis (PCA) to assess the genetic affinities of the ancient individuals qualitatively by projecting them onto present-day Eurasian variation (Figure 2). We observed a distinct separation between East and West Eurasians. Our ancient Shirenzigou samples and present-day populations from Central Asia and northwestern China form a genetic cline from East to West in the first PC. The distribution of Shirenzigou samples on the cline is relatively scattered with two major clusters, one being closer to modern-day Uygurs and Kazakhs and the other being closer to recently published ancient Saka and Huns from the Tianshan in Kazakhstan (…).

We applied a formal admixture test using f3 statistics in the form of f3 (Shirenzigou; X, Y) where X and Y are worldwide populations that might be the genetic sources for the Shirenzigou individuals. We observed the most significant signals of admixture in the Shirenzigou samples when using Yamnaya_Samara or Srubnaya as the West Eurasian source and some Northern Asians or Koreans as the East Eurasian source (Table S1). We also plotted the outgroup f3 statistics in the form of f3 (Mbuti; X, Anatolia_Neolithic) and f3 (Mbuti; X, Kostenki14) to visualize the allele sharing between population X and Anatolian farmers. As shown in Figure S3, the Steppe_MLBA populations including Srubnaya, Andronovo, and Sintashta were shifted toward farming populations compared with Yamnaya groups and the Shirenzigou samples. This observation is consistent with ADMIXTURE analysis that Steppe_MLBA populations have an Anatolian and European farmer-related component that Yamnaya groups and the Shirenzigou individuals do not seem to have. The analysis consistently suggested Yamnaya-related Steppe populations were the better source in modeling the West Eurasian ancestry in Shirenzigou.

tocharians-pca-admixture
PCA and ADMIXTURE for Shirenzigou Samples. Modified from the original to include in black squares samples related to Yamnaya.

Genetic Composition of Iron Age Shirenzigou Individuals

We continued to use qpAdm to estimate the admixture proportions in the Shirenzigou samples by using different pairs of source populations, such as Yamnaya_Samara, Afanasievo, Srubnaya, Andronovo, BMAC culture (Bustan_BA and Sappali_Tepe_BA) and Tianshan_Hun as the West Eurasian source and Han, Ulchi, Hezhen, Shamanka_EN as the East Eurasian source. In all cases, Yamnaya, Afanasievo, or Tianshan_Hun always provide the best model fit for the Shirenzigou individuals, while Srubnaya, Andronovo, Bustan_BA and Sappali_Tepe_BA only work in some cases. The Yamnaya_Samara or Afanasievo-related ancestry ranges from ∼20% to 80% in different Shirenzigou individuals, consistent with the scattered distribution on the East-West cline in the PCA

ancestry-tocharians

(…) we then modeled Shirenzigou as a three-way admixture of Yamnaya_Samara, Ulchi (or Hezhen) and Han to infer the source from the East Eurasia side that contributed to Shirenzigou. We found the Ulchi or Hezhen and Han-related ancestry had a complicated and unevenly distribution in the Shirenzigou samples. The most Shirenzigou individuals derived the majority of their East Eurasian ancestry from Ulchi or Hezhen-related populations, while the following two individuals M820 and M15-2 have more Han related than Ulchi/Hezhen-related ancestry.

One important question remains, though: how and when did these Proto-Tocharian speakers migrate from the Afanasevo culture in the Altai into the Tarim Basin? The traditional answer, now more likely than ever, is through the Chemurchek culture. See e.g. A re-analysis of the Qiemu’erqieke (Shamirshak) cemeteries, Xinjiang, China, by Jia and Betts JIES (2010) 38(4).

Also, given the apparent lack of (extra farmer ancestry that characterizes) Corded Ware ancestry, if the results were already suspicious before, how likely are now the published R1a(xZ93) and/or radiocarbon dates of the Xiaohe mummies from Li et al. (2010, 2015)? Because, after all, one should have expected in such a late date a generalized admixture with neighbouring Srubna/Andronovo-like populations.

Related

“Dinaric I2a” and the expansion of Common Slavs from East-Central Europe

late-iron-age-eastern-europe

A recently published abstract for an upcoming chapter about Early Slavs shows the generalized view among modern researchers that Common Slavs did not spread explosively from the east, an idea proper of 19th-century Romantic views about ancestral tribes of pure peoples showing continuity since time immemorial.

Migrations and language shifts as components of the Slavic spread, by Lindstedt and Salmela, In: Language contact and the early Slavs, Eds. Tomáš Klír, Vít Boček, Universitätsverlag Winter (2019):

The rapid spread of the Proto-Slavic language in the second half of the first millennium CE was long explained by the migration of its speakers out of their small primary habitat in all directions. Starting from the 1980s, alternative theories have been proposed that present language shift as the main scenario of the Slavic spread, emphasizing the presumed role of Slavic as the lingua franca of the Avar Khaganate. Both the migration and the language shift scenarios in their extreme forms suffer from factual and chronological inaccuracy. On the basis of some key facts about human population genetics (the relatively recent common ancestry of the East European populations), palaeoclimatology (the Late Antique Little Ice Age from 536 to around 660 CE), and historical epidemiology (the Justinianic Plague), we propose a scenario that includes a primary rapid demographic spread of the Slavs followed by population mixing and language shifts to and from Slavic in different regions of Europe. There was no single reason for the Slavic spread that would apply to all of the area that became Slavic-speaking. The northern West Slavic area, the East Slavic area, and the Avar sphere and South-Eastern Europe exhibit different kinds of spread: mainly migration to a sparsely populated area in the northwest, migration and language shift in the east, and a more complicated scenario in the southeast. The remarkable homogeneity of Slavic up to the jer shift was not attributable to a lingua-franca function in a great area, as is often surmised. It was a founder effect: Proto-Slavic was originally a small Baltic dialect with little internal variation, and it took time for the individual Slavic languages to develop in different directions.

While I would need to read the whole chapter, in principle it seems easier to agree with this summary than with Curta’s (sort of diffuse) Danubian origin of Common Slavic, based on the likely origin of the Balto-Slavic expansion with the Trzciniec and/or Lusatian culture, close to the Baltic.

A multi-ethnic Chernyakhov culture

In a sneak peek to the expected Järve et al. (2019) paper in review, there are three Chernyakhov samples (ca. calAD 350-550) with different ancestry probably corresponding to the different regions where they stem from (see image below), which supports the idea that Iron Age eastern Europe was a true melting pot where the eventual language of the different cultures depended on many different factors:

chernyakhov-samples-region
Map of the samples from Järve et al. (2019).

From the paper:

The Chernyakhiv culture was likely an ethnically heterogeneous mix based on Goths (Germanic tribes) but also including Sarmatians, Alans, Slavs, late Scythians and Dacians – the entire ancient population of the northern coast of the Black Sea.

Contacts with neighbouring regions were active, and the Chernyakhiv culture is associated with a number of historical events that took place in Europe at that time. In particular, during the Scythian or Gothic wars of the 230s and 270s, barbarians living in the territory of the Chernyakhiv culture (Goths, Ferules, Carps, Bastarns, etc.) carried out regular raids across the Danube Limes of the Roman Empire. However, from the end of the 3rd century the relations of the barbarians with the Roman Empire gained a certain stability. From the reign of Constantine I the Goths, who were part of the Chernyakhiv culture, became federates (military allies) of the Empire.

The Goths also interacted with the inhabitants of the East European forest zone. The Roman historian Jordanes described the military campaigns of the Gothic king Ermanaric against northern peoples (the ancestors of Vends, Slavs, etc., and the inhabitants of the northern Volga region).

NOTE. As it has become traditional in writings about eastern Europe, ‘Slavs’ are assumed – for no particular reason – to be part of the ‘northern peoples of the forest’ since who knows when exactly, and thus appear mentioned in this very text simultaneously as part of Chernyakhov, but also part of peoples to the north of Chernyakhov warring against them…

admixture-chernyakhov
Proportions of Eastern Hunter-Gatherer (EHG, blue), Natufian (red) and Altaian (green) ancestries in Scythian/Sarmatian groups and groups pre- and postdating them inferred using the a) qpAdm and b) ChromoPainter/NNLS method. c–e Correlation of qpAdm and CP/NNLS proportions for the three putative sources evaluated. Steppe populations predating the Scythians: Yamnaya_Ukraine [26], Yamnaya_Kalmykia [15], Ukr_BA (this study). Scythians and Sarmatians: Nomad_IA [15], Scythian_East and Sarmatian_SU [3], Hungarian Scythian, Sarmatian, Central Saka, Tian Shan Saka and Tagar [1], Scy_Ukr, ScySar_SU and Scy_Kaz (this study). Population postdating the Scythians: Chern (this study). See also Table S3.

Genetic variation

(…) the Chernyakhiv samples overlapped with modern Europeans, representing the most ‘western’ range of variation among the groups of this study.

After the end of the Scythian period in the western Eurasian Steppe, the Chernyakhiv culture samples have higher Near Eastern affinity compared to the Scythians preceding them, agreeing with the Gothic component in the multi-ethnic mix of the Chernyakhiv culture.

The higher proportion Near Eastern and (according to CP/NNLS) lower proportion of eastern ancestry in the Chernyakhiv culture samples were mirrored by f4 analyses where Chern showed lower affinity to Han (Z score –3.097) and EHG (Z score –3.643) than Ukrainian Scythian and Bronze Age samples, respectively, as well as higher Near Eastern (Levant_N and Anatolia_N) affinity than Ukrainian Scythians (Z scores 4.696 and 3.933, respectively). It is plausible to assume that this excess Near Eastern ancestry in Chern is related to European populations whose Near Eastern proportion has exceeded that in the steppe populations since the Neolithic expansion of early farmers. While the Chernyakhiv culture was likely ethnically heterogeneous, the three samples in our Chern group appear to represent its Gothic component.

chernyakhov-goths-uralic-clines
PCA obtained by projecting the ancient samples of this study together with published Scythian/Sarmatian and related samples onto a plot based on 537,802 autosomal SNPs in 1,422 modern Eurasians. To improve readability, the modern populations have been plotted as population medians (after outlier removal). Image modified from the paper, including Sredni Stog, Corded Ware/Uralic (with Srubna outliers) and Chernyakhov clusters.Notice the two new Late Yamna and Catacomb samples from Ukraine clustering with other published samples, despite being from the same region as Sredni Stog individuals.

Early Slavs of hg. I2-L621

A post in Anthrogenica shows some subclades of the varied haplogroups that are expected from medieval Poland:

KO_55, Kowalewko (100-300 AD), I1a3a1a1-Y6626
KO_45, Kowalewko (100-300 AD), I2a2a1b2a-L801
KO_22, Kowalewko (100-300 AD), G2a2b-L30
KO_57, Kowalewko (100-300 AD), G2a2b-L30

ME_7, Markowice (1000-1200 AD), I1a2a2a5-Y5384
NA_13, Niemcza, (900-1000 AD), I2a1b2-L621
NA_18, Niemcza, (900-1000 AD), J2a1a-L26

Just because of these samples among Early Slavs, and looking again more carefully at the modern distribution of I2a-L621 subclades, I think now I was wrong in assuming that I2a-L621 in early Hungarian Conquerors would mean they would appear around the Urals as a lineage integrated in Eastern Corded Ware groups. It seems rather a haplogroup with an origin in Central Europe. Whether it was part of a Baltic community that expanded south, or was incorporated during the expansions to the south is unclear. Like hg. E-V13, it doesn’t seem to have been incorporated precisely along the Danube, but closer to the north-east Carpathians.

Especially interesting is the finding of I2a-L621 among Early Slavs from Silesia, a zone of close interaction among early West Slavs. From Curta (2019):

On Common Slavs

In Poland, settlement discontinuity was postulated, to make room for the new, Prague culture introduced gradually from the southeast (from neighboring Ukraine). However, there is increasing evidence of 6th-century settlements in Lower Silesia (western Poland and the lands along the Middle Oder) that have nothing to do with the Prague culture. Nor is it clear how and when did the Prague culture spread over the entire territory of Poland.

On Great Moravia

Svatopluk’s remarkably strong position was immediately recognized by Pope John VIII, who ordered the immediate release of Methodius from his monastic prison in order to place him in 873 under Svatopluk’s protection. One year later (874), Louis the German himself was forced to recognize Svatopluk’s independence through the peace of Forchheim. By that time, the power of Svatopluk had extended into the upper Vistula Basin, over Bohemia, the lands between the Saale and the Elbe rivers, as well as the northern and northeastern parts of the Carpathian Basin.* The Czech prince Bořivoj, a member of the Přemyslid family which would unify and rule Bohemia in the following century, is believed to have been baptized in 874 by Methodius in Moravia together with his wife Ludmila (St. Wenceslas’s grandmother).

*Brather, Archäologie, p. 71. The expansion into the region of the Upper Vistula (Little Poland) results from one of St. Methodius’ prophecies, for which see the Life of Methodius 11, p. 72; Poleski, “Contacts between the Great Moravian empire and the tribes”; Poleski, “Contacts between the tribes in the basins.” Despite an early recognition of the Moravian influences on the material culture in 9th-century southern Poland and Silesia (e.g., Dostál, “Das Vordringen”), the question of Svatopluk’s expansion has triggered in the 1990s a fierce debate among Polish archaeologists. See Wachowski, “Problem”; Abłamowicz, “Górny Śląsk”; Wachowski, “Północny zasięg ekspansji”; Szydłowski, “Czy ślad”; Jaworski, “Elemente.”

On Piast Poland

Mieszko agreed to marry Oda, the daughter of the margrave of the North March, for his first wife had died in 977. The marriage signaled a change in the relations with the Empire, for Mieszko sent troops to help Otto II against the Slavic rebels of 983. He also attacked Bohemia and incorporated Silesia and Lesser Poland into the Piast realm, which prompted Bohemians to ally themselves with the Slavic rebels against whom Emperor Otto was now fighting. By 980, therefore, Mieszko was part of a broader configuration of power, and his political stature was recognized in Scandinavia as well. His daughter, Swietoslawa married first Erik Segersäll of Sweden (ca. 970–ca. 995) and then Sweyn Forkbeard of Denmark (986–1014).26 In the early 990s, together with his wife and children, Mieszko offered his state (called “civitas Schinesghe,” the state of Gniezno) to the pope as a fief, as attested by a unique document known as Dagome iudex and preserved in a late 11th-century summary. The document describes the inner boundaries of the state and peripheral provinces, as if Gniezno were a civitas (city) in Italy, with its surrounding territory. Regional centers, however, did indeed come into being shortly before AD 1000 in Lesser Poland (Cracow, Sandomierz), Pomerania (Gdańsk), and Silesia (Wrocław). Such regional centers came to be distinguished from other strongholds by virtue of the presence within their walls of some of the earliest churches built in stone. Mieszko got his own, probably missionary bishop.

In light of this recent find, which complements the Early Slav of the High Middle Ages from Sunghir (ca. AD 1100-1200), probably from the Vladimir-Suzdalian Rus’, we can assume now less speculatively that I2a-CTS10228 most likely expanded with Common Slavs, because alternative explanations for its emergence in the Carpathian Basin, among Early West Slavs, and among Early East Slavs within this short period of time requires too many unacceptable assumptions.

dinaric-i2a-distribution
Modern distribution of “Dinaric” I2a. Modified from Balanovsky et al. (2008)

Hungarian Conquerors

Knowing that R1a-Z280 was an Eastern Corded Ware lineage, found from Baltic Finns to Finno-Ugric populations of the Trans-Urals, we can probably assign expanding Magyars to at least R1a-Z280, R1a-Z93, and N1c-L392 (xB197) lineages.

From Curta (2019):

Earlier Latin sources, especially those of the first half of the 10th century, refer to Magyars as Huns or Avars. They most likely called themselves Magyars, a word indicating that the language they spoke was not Turkic, but Finno-Ugrian, related to a number of languages spoken in Western Siberia and the southern Ural region. The modern word—Hungarian—derives from the Slavic word for those people, U(n)gri, which is another indication of Ugric roots. This has encouraged the search for the origin of the Hungarian people in the lands to the east from the Ural Mountains, in western Siberia, where the Hungarian language is believed to have emerged between 1000 and 500 BC.

In looking for the Magyar primordial homeland, they draw comparisons with the assemblages found in Hungary that have been dated to the 10th century and attributed to the Magyars. Some of those comparisons had extraordinary results. For example, the excavation of the burial mound cemetery recently discovered near Lake Uelgi, in the Cheliabinsk region of Russia, has produced rosette-shaped harness mounts and silver objects ornamented with palmette and floral designs arranged in reticulated patterns, which are very similar to those of Hungary. But Uelgi is not dated to prehistory, and many finds from that site coincided in time with those found in burial assemblages in Hungary. In other words, although there can be no doubt about the relations between Uelgi and the sites in Hungary attributed to the first generations of Magyars, those relations indicate a migration directly from the Trans-Ural lands, and not gradually, with several other stops in the forest-steppe and steppe zones of Eastern Europe. In the lands west of the Ural Mountains, the Magyars are now associated with the Kushnarenkovo (6th to 8th century) and Karaiakupovo (8th to 10th century) cultures, and with such burial sites as Sterlitamak (near Ufa, Bashkortostan) and Bol’shie Tigany (near Chistopol, Tatarstan).14 However, the same problem with chronology makes it difficult to draw the model of a migration from the lands along the Middle Volga. Many parallels for the so typically Magyar sabretache plates found in Hungary are from that region. They have traditionally been dated to the 9th century, but more recent studies point to the coincidence in time between specimens found in Eastern Europe and those from Hungary.

Adding J2a and I1a samples to the Early Slavic stock, based on medieval samples from Poland – with G2a and E-V13 lineages probably shared with Goths from Wielbark/Chernyakhov, or becoming acculturated in the Carpathian Basin – one is left to wonder which of these lineages actually took part in Common Slavic migrations/acculturation events, whenever and wherever those actually happened.

I have tentatively re-assigned lineages of Hungarian conquerors according to their likely origins in a simplistic way – similar to how the paper classifies them – , now (I think) less speculatively, assuming that Early Slavs likely formed eventually part of them:

hungarian-conquerors-y-dna-slavs
Image modified from the paper, with drawn red square around lineages of likely East Slavic origin, and blue squares around R1a-Z93, R1a-Z283, N1a-Z1936, and N1a-M2004 samples, of likely Ugric origin Y-Hg-s determined from 46 males grouped according to sample age, cemetery and Hg. Hg designations are given according to ISOGG Tree 2019. Grey shading designate distinguished individuals with rich grave goods, color shadings denote geographic origin of Hg-s according to Fig. 1. For samples K3/1 and K3/3 the innermost Hg defining marker U106* was not covered, but had been determined previously.

NOTE. The ancestral origin of lineages is meaningless for an ethnolinguistic identification. The only reasonable assumption is that all the individuals sampled formed part of the Magyar polity, shared Magyar culture, and likely spoke Hungarian, unless there is a clear reason to deny this: which I guess should include at least a clearly ‘foreign’ ancestry (showing a distant cluster compared to the group formed by all other samples), ‘foreign’ isotopic data (showing that he was born and/or raised outside of the Carpathian Basin), and particularly ‘foreign’ cultural assemblage of the burial, if one really wants to risk assuming that the individual didn’t speak Hungarian as his mother tongue.

“Dinaric” or Slavic I2a?

I don’t like the use of “Dinaric I2a”, because it is reminiscent of the use of “Iberian R1b-DF27”, or “Germanic R1b-U106”, when ancient DNA has shown that this terminology is most often wrong, and turns out to be misleading. As misleading as “Slavic R1a”. Recently, a Spanish reader wrote me emails wondering how could I possibly say that R1b-DF27 came from Central Europe, because modern distribution maps (see below) made it evident that the haplogroup expanded from Iberia…

DF27-iberia-france-m167
Contour maps of the derived allele frequencies of the SNPs analyzed in Solé-Morata et al. (2017).

The obvious answer is, these maps show modern distributions, not ancient ones. In the case of R1b-DF27, different Iberian lineages are not even related to the same expansion. At least R1b-M167/SRY2627 lineages seem to have expanded from Central Europe into Iberia much more recently than other DF27 subclades associated with Bell Beakers. What’s more, if R1b-M167/SRY2627 appear densest in north-east Spain it is not because of the impact of Celts or Iberians before the arrival of Romans, but because of the impact of medieval expansions during the Reconquista from northern kingdoms expanding south in the Middle Ages:

iberian-medieval-kingdoms-expansion-population-genomics
Genetic differentiation and the footprints of historical migrations in the Iberian Peninsula. Image modified from Bycroft et al. (2018).

Similarly, the term “Dinaric I2a”, based on the higher density in the Western Balkans, is misleading because it is probably the result of later bottlenecks. Just like the density of different R1a subclades among Modern Slavs is most likely the result of acculturation of different groups, especially to the east and north-east, where language shift is known to have happened in historical times, with the cradle of Russians in particular being a Finno-Volgaic hotspot, later expanding with hg. R1a-Z280 and N1c-L392 lineages.

Now, one may think that maybe Slavs expanded with ALL of these different lineages. Since we are talking about late Iron Age / medieval expansions, there might be confederations of different peoples expanding with a single lingua franca… But no, not really. Not likely in linguistics, not likely in archaeology, and apparently not in population genomics, either.

How many ancient peoples from the Iron Age and Early Middle Ages expanded with so many different lineages? We see bottlenecks in expansions even in recent times: say, in Visigoths under E-V13 (probably recently incorporated during their migrations); in Moors (mostly Berbers) with E-M81 and J; in medieval Iberians under different DF27 bottlenecks during the Reconquista (including huge bottlenecks among Basques); similarly, huge bottlenecks are found in Finnic expansions under N1c…How likely is it that Proto-Slavs (and Common Slavs) expanded with all those attested lineages to date among Early Slavs (E-V13, I2a-L621, R1a-M458, I1, J2a) AND also with other R1a subclades prevalent today, but almost absent in sampled Early Slavs?

To sum up, I am not so sure anymore about the possibility of simplistically assigning R1a-M458 to expanding Common Slavs. R1a-M458 may well have been the prevalent R1a subclade in Central Europe among early Balto-Slavic – and possibly also neighbouring Northern Indo-European-speaking – peoples (let’s see what subclades Tollense and Unetice samples bring), but it is more and more likely that most of the density we see in modern R1a-M458 distribution maps is actually the effect of medieval bottlenecks of West Slavs, similar to the case of Iberia.

r1a-m458-underhill-2015
Modern distribution of R1a-M458, after Underhill et al. (2015).

Related

Baltic Finns in the Bronze Age, of hg. R1a-Z283 and Corded Ware ancestry

estonian-bronze-age-dna

Open access The Arrival of Siberian Ancestry Connecting the Eastern Baltic to Uralic Speakers further East, by Saag et al. Current Biology (2019).

Interesting excerpts:

In this study, we present new genomic data from Estonian Late Bronze Age stone-cist graves (1200–400 BC) (EstBA) and Pre-Roman Iron Age tarand cemeteries (800/500 BC–50 AD) (EstIA). The cultural background of stone-cist graves indicates strong connections both to the west and the east [20, 21]. The Iron Age (IA) tarands have been proposed to mirror “houses of the dead” found among Uralic peoples of the Volga-Kama region [22].

(…) The 33 individuals included 15 from EstBA, 6 from EstIA, 5 from Pre-Roman to Roman Iron Age Ingria (500 BC–450 AD) (IngIA), and 7 from Middle Age Estonia (1200–1600 AD) (EstMA) and yielded endogenous DNA ∼4%–88%, average genomic coverages ∼0.017–0.734×, and contamination estimates <4% (Table S1). We analyzed the data in the context of modern and other ancient individuals, including from Neolithic Estonia [13].

estonian-y-dna-bronze-iron-age
Archaeological Information, Genetic Sex, mtDNA and Y Chromosome Haplogroups, and Average Coverage of the Individuals of This Study. Modified from the paper to mark distinct Y-DNA haplogroups in the LBA and IA.

We identified chrY hgs for 30 male individuals (Tables 1 and S2; STAR Methods). All 16 successfully haplogrouped EstBA males belonged to hg R1a, showing no change from the CWC period, when this was also the only chrY lineage detected in the Eastern Baltic [11, 13, 30, 31]. Three EstIA and two IngIA individuals also belonged to hg R1a, but three EstIA males belonged to hg N3a, the earliest so far observed in the Eastern Baltic. Three EstMA individuals belonged to hg N3a, two to hg R1a, and one to hg J2b. ChrY lineages found in the Baltic Sea region before the CWC belong to hgs I, R1b, R1a5, and Q [10, 11, 12, 13, 17, 32]. Thus, it appears that these lineages were substantially replaced in the Eastern Baltic by hg R1a [10, 11, 12, 13], most likely through steppe migrations from the east [30, 31]. (…) Our results enable us to conclude that, although the expansion time for R1a1 and N3a3′5 in Eastern Europe is similar [25], hg N3a likely reached Estonia or at least became comparably frequent to modern Estonia [1] only during the BA-IA transition.

A clear shift toward West Eurasian hunter-gatherers is visible between European LN and BA (including Baltic CWC) and EstBA individuals, the latter clustering together with Latvian and Lithuanian BA individuals [11]. EstIA, IngIA, and EstMA individuals project between BA individuals and modern Estonians, partially overlapping with both.

(…) EstBA individuals are clearly distinguishable from Estonian CWC individuals as the former have more of the blue component most frequent in WHGs and less of the brown and yellow components maximized in Caucasus hunter-gatherers and modern Khanty, respectively. The individuals of EstBA, EstIA, IngIA, EstMA, and modern Estonia are quite similar to each other on average, indicating that the relatively high proportion of WHG ancestry in modern Eastern Baltic populations compared to other present-day Europeans [15] traces back to the BA.

estonian-pca-published
Detail of the PCA, modified from the paper to label populations. Estonian Bronze Age and Iron Age samples cluster close to Early Corded Ware from the Baltic.. Principal-component analysis results of modern West Eurasians with ancient individuals projected onto the first two components (PC1 and PC2). BA, Bronze Age; EF, early farmers; HG, hunter-gatherers; IA, Iron Age; IMA, Iron/Middle Ages; LN, Late Neolithic; LNBA, Late Neolithic/Bronze Age; MA, Middle Ages

When comparing Estonian CWC and EstBA using autosomal outgroup f3 and Patterson’s D statistics (Table S3), the latter is more similar to other Baltic BA populations, to Baltic IA and Middle Age (MA) populations, and also to populations similar to WHGs and Scandinavian hunter-gatherers (SHGs), but not to Estonian CCC (Figures 2A and S2A; Data S1). The increase in WHG or SHG ancestry could be connected to western influences seen in material culture [20, 21] and facilitated by a decline in local population after the CCC-CWC period [20]. A slight trend of bigger similarity of Estonian CWC to forest or steppe zone populations and of EstBA to European early farmer populations can also be seen.

(…) When comparing to modern populations, Estonian CWC is slightly more similar to Caucasus individuals but EstBA to Baltic populations and Finnic speakers (Figure 2B; Data S1). Outgroup f3 and D statistics do not reveal apparent differences when comparing EstBA to EstIA, EstIA to IngIA, and EstIA to EstMA (Data S1).

estonian-ba-ia-ancestry
qpAdm results. Error bars indicate one SE. Central MN, Central European Middle Neolithic; EstBA, Estonian Bronze Age; EstIA, Estonian Iron Age; IngIA, Ingrian Iron Age; EstMA, Estonian Middle Ages; WHG, western hunter-gatherers.

These results highlight how uniparental and autosomal data can lead to different demographic inferences—the genetic change between CWC and BA not seen in uniparental lineages is clear in autosomal data and the appearance of chrY hg N in the IA is not matched by a clear shift in autosomal profiles.

EstBA individuals have no Nganasan-related ancestry and EstIA, IngIA, and EstMA individuals on average have 2% or 4% (Figure 3; Data S1). The differentiation remains when using BA or IA Fennoscandian populations [26] instead of Nganasans (Data S1). Notably, the proportion of Nganasan-related ancestry varies between 0% and 12% among sampled EstIA, IngIA, and EstMA individuals (Data S1), which may suggest its relatively recent admixture into the target population. Moreover, two individuals from Kunda (0LS10 and V10) have the highest proportions of Nganasan ancestry among EstIA (6% and 8%), one of them has chrY hg N3a, and isotopic analysis suggests neither individual being born in Kunda [34].

About these two males from Tarand-graves, ‘foreign’ to Kunda:

0LS10: Male from tarand III (burial 9; TÜ 1325: L777), age 17–25 years [34]. He had a fragment of a sheep/goat bone and ceramics as grave goods. This burial has two radiocarbon dates: 2430 ± 35 BP (Poz-10801; 760–400 cal BC) and 2530 ± 41 BP (UBA-26114; 800–530 cal BC) [34]. According to the isotopic analysis, the person was not born in the vicinity of Kunda; his place of birth is still unknown (but south-western Finland and Sweden are excluded) [34]. Sampled tooth r P1.

V10: Male from tarand XI (burial 24; TÜ 1325: L1925), age 25–35 years [34], date 2484 ± 40 BP (UBA-26115; 790–430 cal BC) [34]. He had a few potsherds near the skull. Likewise, this person was not locally born [34]. Sampled tooth l P1.

estonia-bronze-iron-age-steppe-siberian
Autosomal Analyses’ Results for Gyvakarai1 as the closest available Corded Ware source for Balto-Finnic populations.

The paper shows thus:

  • Major continuity of ancestry from Corded Ware to modern Estonians, with only slight changes in different periods. In fact, one of the best fits for the Late Bronze Age ancestry is Gyvakarai1, one of the Corded Ware “outliers” described as “closer to Yamna”, which I already said may be closer to Sredni Stog/EHG populations instead. Another interesting take is that the change from Bronze Age to Iron Age corresponds to an increase in Baltic Corded Ware-related ancestry, rather than being driven by Siberian ancestry.
  • pca-mittnik-gyvakarai
    File modified by me from Mittnik et al. (2018) to include the approximate position of the most common ancestral components, and an identification of potential outliers. Zoomed-in version of the European Late Neolithic and Bronze Age samples. “Principal components analysis of 1012 present-day West Eurasians (grey points, modern Baltic populations in dark grey) with 294 projected published ancient and 38 ancient North European samples introduced in this study (marked with a red outline). From Mittnik et al. (2018).
  • A Volosovo-related migration of hg. N1c with Netted Ware into the area seems to be discarded, based on the full replacement of paternal lines and continuity of R1a-Z283. It is only during the Tarand-grave period when a system of chiefdoms (spread from Ananyino/Akozino) brings haplogroup N1c to the Gulf of Finland. During the Iron Age, the proportion of paternal lineages is still clearly in favour of R1a (50% in the coast, 100% in Ostrobothnia), which indicates a gradual replacement led by elites, likely because of the incorporation of Akozino warrior-traders spreading all over the Baltic, bringing the described shared Mordvinic traits in Fennic.
  • finno-ugric-haplogroup-n
    Map of archaeological cultures in north-eastern Europe ca. 8th-3rd centuries BC. [The Mid-Volga Akozino group not depicted] Shaded area represents the Ananino cultural-historical society. Fading purple arrows represent likely stepped movements of subclades of haplogroup N for centuries (e.g. Siberian → Ananino → Akozino → Fennoscandia [N-VL29]; Circum-Arctic → forest-steppe [N1, N2]; etc.). Blue arrows represent eventual expansions of Uralic peoples to the north. Modified image from Vasilyev (2002).
  • The arrival of Akozino warrior-traders (bringing N1c and R1a lineages) was probably linked to this minimal “Nganasan-like” ancestry of some samples in the transition to the Iron Age. This arrival is supported by samples 0LS10 (the earliest hg. N1c) and V10 (of hg. R1a), both dated to ca. 800-400 BC, with V10 showing the highest “Nganasan-like” ancestry with 4.8%, both of them neighbouring samples showing 0%. This variable admixture among local and foreign paternal lineages might support the described social system of family alliances with intermarriages. In fact, a medieval sample, 0LS03_1 (hg. R1a) also shows a recent “Nganasan-like” ancestry, which probably points to the integration of different Arctic-related ancestry components among Modern Estonians, in this case related to Finnish expansions and thus integration of Levänluhta-related ancestry, as per the supplementary data.
  • NOTE. Such minimal proportions of “Nganasan-like” ancestry evidence the process of admixture of Volga Finns in Akozino territory through their close interactions with Permians of Ananyino, who in turn acquired this Palaeo-Arctic admixture most likely during the expansion of the linguistic community to hunter-gatherer territories, to the north of the Cis-Urals. This process of stepped infiltration and expansion without language change is not dissimilar to the one seen among Indo-Iranians and Balto-Slavs of hg. R1b, or Vasconic speakers of hg. I2a, although in the case of Baltic Finns of hg. R1a the process of infiltration and expansion of hg. N1c is much less dramatic, with no radical replacement anywhere before the huge bottlenecks observable in Finns.

  • The expansion of haplogroup N1c among Finnic populations, as we are going to see in samples from the Middle Ages such as Luistari, is the consequence of late founder effects after huge bottlenecks expected based on the analysis of modern populations. The expansion of N1c-VL29 is different in origin from that of N1c-Z1936 among Samic (later integrated into Finnish populations), most likely from the east and originally associated with Lovozero Ware.
haplogroup_n3a3
Frequency-Distribution Maps of Individual Subclade N3a3 / N1a1a1a1a1a-CTS2929/VL29, probably initially with Akozino warrior-traders. Map from Ilumäe et al. (2016).

In spite of all this, the conclusion of the paper is (surprise!) that Siberian ancestry and hg. N heralded the arrival of Finnic to the Gulf of Finland in the Iron Age… However, this conclusion is supposedly* supported, not by their previous papers, but by a recent phylogenetic study by Honkola et al. (2013), which doesn’t actually argue for such a late ‘arrival’: it argues for the split of Balto-Finnic around 1500 BC.

NOTE. I say ‘supposedly’ because Kristiina Tambets, for example, has been following the link of Uralic with haplogroup N since the 2000s, so this is not some conclusion they just happened to misread from some random paper they Googled. In those initial assessments, she argued that the “ancient homeland” of the Tat C mutation suggested that Finno-Ugrians were in Fennoscandia before Indo-Europeans. Apparently, since haplogroup N appears later and from the east, it is now more important to follow this haplogroup than what is established in archaeology and linguistics.

Even in the referred paper, this split is considered an in situ development, since the phylogenetic study takes the information – among others – 1) from Parpola and Carpelan, who consider Netted Ware, a culture derived from Fatyanovo/Abashevo and Volosovo, as the culprit of the Finno-Ugric expansion; and 2) from Kallio (2006), who clearly states that Proto-Balto-Finnic (like Proto-Finno-Samic) was spoken around the Gulf of Finland during the Bronze Age. Both of them set the terminus ante quem of the language presence in the Baltic ca. 1900 BC.

Anyways, as a consequence of geneticists keeping these untenable pre-ancient DNA haplogroup-based arguments today, I expect to see this “Finnic” language expansion also described for the Western Baltic, Scandinavia or northern Europe, when this same proportion of hg. N1c and “Nganasan” ancestry is observed in Iron Age samples around the Baltic Sea. The nativist trends that this domination of “Finns” all over Northern Europe 2,500 years ago will create will be even more fun to read than the current ones…

EDIT (10 May 2019) How I see the reaction of many to ancient DNA, in keeping their old theories:

Related