Yamnaya replaced Europeans, but admixed heavily as they spread to Asia

narasimhan-spread-yamnaya-ancestry

Recent papers The formation of human populations in South and Central Asia, by Narasimhan, Patterson et al. Science (2019) and An Ancient Harappan Genome Lacks Ancestry from Steppe Pastoralists or Iranian Farmers, by Shinde et al. Cell (2019).

NOTE. For direct access to Narasimhan, Patterson et al. (2019), visit this link courtesy of the first author and the Reich Lab.

I am currently not on holidays anymore, and the information in the paper is huge, with many complex issues raised by the new samples and analyses rather than solved, so I will stick to the Indo-European question, especially to some details that have changed since the publication of the preprint. For a summary of its previous findings, see the book series A Song of Sheep and Horses, in particular the sections from A Clash of Chiefs where I discuss languages and regions related to Central and South Asia.

I have updated the maps of the Preshistory Atlas, and included the most recently reported mtDNA and Y-DNA subclades. I will try to update the Eurasian PCA and related graphics, too.

NOTE. Many subclades from this paper have been reported by Kolgeh (download), Pribislav and Principe at Anthrogenica on this thread. I have checked some out for comparison, but even if it contradicted their analyses mine would be the wrong ones. I will upload my spreadsheets and link to them from this page whenever I find the time.

caucasus-cline-narasimhan
Ancestry clines (1) before and (2) after the advent of farming. Colour modified from the original to emphasize the CHG cline: notice the apparent relevance of forest-steppe groups in the formation of this CHG mating network from which Pre-Yamnaya peoples emerged.

Indo-Europeans

I think the Narasimhan, Patterson et al. (2019) paper is well-balanced, and unexpectedly centered – as it should – on the spread of Yamnaya-related ancestry (now Western_Steppe_EMBA) as the marker of Proto-Indo-European migrations, which stretched ca. 3000 BC “from Hungary in the west to the Altai mountains in the east”, spreading later Indo-European dialects after admixing with local groups, from the Atlantic to South Asia.

I. Afanasievo

I.1. East or West PIE?

I expected Afanasievo to show (1) R1b-L23(xZ2103, xL51) and (2) R1b-L51 lineages, apart from (3) the known R1b-Z2103 ones, pointing thus to an ancestral PIE community before the typical Yamnaya bottlenecks, and with R1b-L51 supporting a connection with North-West Indo-European. The presence of some samples of hg. Q pointed in this direction, too.

However, Afanasievo samples show overwhelmingly R1b-Z2103 subclades (all except for those with low coverage), all apparently under R1b-Z2108 (formed ca. 3500 BC, TMRCA ca. 3500 BC), like most samples from East Yamnaya.

This necessarily shifts the split and spread of R1b-L23 lineages to Khvalynsk/early Repin-related expansions, in line with what TMRCA suggested, and what advances by Anthony (2019) and Khokhlov (2018) on future samples from the Reich Lab suggest.

Given the almost indistinguishable ancestry between Afanasievo and Early Yamnaya, there seems to be as of yet little potential information to support in population genomics that Pre-Tocharians were more closely related to North-West Indo-Europeans than to Graeco-Aryans, as it is proposed in linguistics based on the few shared traits between them, and the lack of innovations proper of the Graeco-Aryan community.

NOTE. A new issue of Wekʷos contains an abstract from a relevant paper by Blažek on vocabulary for ‘word’, including the common NWIE *wrdʰo-/wordʰo-, but also a new (for me, at least) Northern Indo-European one: *rēki-/*rēkoi̯-, shared by Slavic and Tocharian.

The fact that bottlenecks happened around the time of the late Repin expansion suggests that we might be able to see different clans based on the predominant lineages developing around the Don-Volga area in the 4th millennium BC. The finding of Pre-R1b-L51 in Lopatino (see below), and of a Catacomb sample of hg. R1b-Z2103(Z2105-) in the North Caucasus steppe near Novoaleksandrovskij also support a star-like phylogeny of R1b-L23 stemming from the Don-Volga area.

NOTE. Interestingly, a dismissal of a common trunk between Tocharian and North-West Indo-European would mean that shared similarities between such disparate groups could be traced back to a Common Late PIE trunk, and not to a shared (western) Repin community. For an example of such a ‘pure’ East-West dialectal division, see the diagram of Adams & Mallory (2007) at the end of the post. It would thus mean a fatal blow to Kortlandt’s Indo-Slavonic group among other hypothetical groupings (remade versions of the ancient Centum-Satem division), as well as to certain assumptions about laryngeal survival or tritectalism that usually accompany them. Still, I don’t think this is the case, so the question will remain a linguistic one, and maybe some similarities will be found with enough number of samples that differentiate Northern Indo-Europeans from the East Yamna/Catacomb-Poltavka-Balkan_EBA group.

afanasievo-y-dna
Y-chromosome haplogroups of Afanasievo samples and neighbouring groups. See full maps.

I.2. Expansion or resurgence of hg. Q1b?

Haplogroup Q1b-Y6802(xY6798) seems to be the main lineage that expanded with Afanasievo, or resurged in their territory. It’s difficult to tell, because the three available samples are family, and belong to a later period.

NOTE. I have finally put some order to the chaos of Q1a vs. Q1b subclades in my spreadsheet and in the maps. The change of ISOGG 2016 to 2017 has caused that many samples reported as of Q1 subclades from papers prepared during the 2017-2018 period, and which did not provide specific SNP calls, were impossible to define with certainty. By checking some of them I could determine the specific standard used.

In favour of the presence of this haplogroup in the Pre-Yamnaya community are:

  • The statement by Anthony (2019) that Q1a [hence maybe Q1b in the new ISOGG nomenclature] represented a significant minority among an R1b-rich community.
  • The sample found in a Sintastha WSHG outlier (see below), of hg. Q1b-Y6798, and the sample from Lola, of hg. Q1b-L717, are thus from other lineage(s) separated thousands of years from the Afanasievo subclade, but might be related to the Khvalynsk expansion, like R1b-V1636 and R1b-M269 are.

These are the data that suggest multiple resurgence events in Afanasievo, rather than expanding Q1b lineages with late Repin:

  • Overwhelming presence of R1b in early Yamnaya and Afanasievo samples; one Q1(xQ1b) sample reported in Khvalynsk.
  • The three Q1b samples appear only later, although wide CI for radiocarbon dates, different sites, and indistinguishable ancestry may preclude a proper interpretation of the only available family.
    • Nevertheless, ancestry seems unimportant in the case of Afanasievo, since the same ancestry is found up to the Iron Age in a community of varied haplogroups.
  • Another sample of hg. Q1b-Y6802(xY6798) is found in Aigyrzhal_BA (ca. 2120 BC), with Central_Steppe_EMBA (WSHG-related) ancestry; however, this clade formed and expanded ca. 14000 BC.
  • The whole Altai – Baikal area seems to be a Q1b-L54 hotspot, although admittedly many subclades separated very early from each other, so they might be found throughout North Eurasia during the Neolithic.
  • One Afanasievo sample is reported as of hg. C in Shin (2017), and the same haplogroup is reported by Hollard (2014) for the only available sample of early Chemurchek to date, from Kulala ula, North Altai (ca. 2400 BC).
afanasievo-chemurchek-y-dna
Y-chromosome haplogroups of late Afanasievo – early Chemurchek samples and neighbouring groups. See full maps.

I.3. Agricultural substrate

Evidence of continuous contacts of Central_Steppe_MLBA populations with BMAC from ca. 2100 BC on – visible in the appearance of Steppe ancestry among BMAC samples and BMAC ancestry among Steppe pastoralists – supports the close interaction between Indo-Iranian pastoralists and BMAC agriculturalists as the origin of the Asian agricultural substrate found in Proto-Indo-Iranian, hence likely related to the language of the Oxus Civilization.

Similar to the European agricultural substrate adopted by West Yamnaya settlers (both NWIE and Palaeo-Balkan speakers), Tocharian shows a few substrate terms in common with Indo-Iranian, which can be explained by contacts in different dialectal stages through phonetic reconstruction alone.

The recent Hermes et al. (2019) supports the early integration of pastoralism and millet cultivation in Central Asia (ca. 2700 BC or earlier), with the spread of agriculture to the north – through the Inner Asian Mountain Corridor – being thus unrelated to the Indo-Iranian expansions, which might support independent loans.

However, compared to the huge number of parallel shared loans between NWIE and Palaeo-Balkan languages in the European substratum, Indo-Iranians seem to have been the first borrowers of vocabulary from Asian agriculturalists, while Proto-Tocharian shows just one certain related word, with phonetic similarities that warrant an adoption from late Indo-Iranian dialects.

chemurchek-sintashta-bmac
Y-chromosome haplogroups of Sintashta, Central Asia, and neighbouring groups in the Early Bronze Age. See full maps.

The finding of hg. (pre-)R1b-PH155 in a BMAC sample from Dzharkutan (to the west of Xinjiang) together with hg. R1b in a sample from Central Mongolia previously reported by Shin (2017) support the widespread presence of this lineage to the east and west of Xinjiang, which means it might have become incorporated to Indo-Iranian migrants into the Xiaohe horizon, to the Afanasievo-Chemurchek-derived groups, or the later from the former. In other words, the Island Biogeography Theory with its explanation of founder effects might be, after all, applicable to the whole Xinjiang area, not only during the Chemurchek – Tianshan-Beilu – Xiaohe interaction.

Of course, there is no need for too complicated models of haplogroup resurgence events in Central and South Asia, seeing how the total amount of hg. R1a-L657 (today prevalent among Indo-Aryan speakers from South Asia) among ancient Western/Central_Steppe_MLBA-related samples amounts to a total of 0, and that many different lineages survived in the region. Similar cases of haplogroup resurgence and Y-DNA bottleneck events are also found in the Central and Eastern Mediterranean, and in North-Eastern Europe. From the paper:

[It] could reflect stronger ecological or cultural barriers to the spread of people in South Asia than in Europe, allowing the previously established groups more time to adapt and mix with incoming groups. A second difference is the smaller proportion of Steppe pastoralist– related ancestry in South Asia compared with Europe, its later arrival by ~500 to 1000 years, and a lower (albeit still significant) male sex bias in the admixture (…).

Y-chromosome haplogroups of samples from the Srubna-Andronovo and Andronovo-related horizon, Xiaohe, late BMAC, and neighbouring groups. See full maps.

II. R1b-Beakers replaced R1a-CWC peoples

II.1. R1a-M417-rich Corded Ware

Newly reported Corded Ware samples from Radovesice show hg. R1a-M417, at least some of them xZ645, ‘archaic’ lineages shared with the early Bergrheinfeld sample (ca. 2650 BC) and with the coeval Esperstedt family, hence supporting that it eventually became the typical Western Corded Ware lineage(s), probably dominating over the so-called A-horizon and the Single Grave culture in particular. On the other hand, R1a-Z645 was typical of bottlenecks among expanding Eastern Corded Ware groups.

Interestingly, it is supported once again that known bottlenecks under hg. R1a-M417 happened during the Corded Ware expansion, evidenced also by the remarkable high variability of male lineages among early Corded Ware samples. Similarly, these Corded Ware samples from Bohemia form part of the typical ‘Central European’ cluster in the PCA, which excludes once again not only the ‘official’ Espersted outlier I1540, but also the known outlier with Yamnaya ancestry.

NOTE. The fact that Esperstedt is closely related geographically and in terms of ancestry to later Únětice samples further complicates the assumption that Únětice is a mixture of Bell Beakers and Corded Ware, being rather an admixture of incoming Bell Beakers with post-Yamnaya vanguard settlers who admixed with Corded Ware (see more on the expansion of Yamnaya ancestry). In other words, Únětice is rather an admixture of Yamnaya+EEF with Yamnaya+(CWC+EEF).

Y-chromosome haplogroups of samples from Catacomb, Poltavka, Balkan EBA, and Bell Beaker, as well as neighbouring groups. See full maps.

On Ukraine_Eneolithic I6561

If the bottlenecks are as straightforward as they appear, with a star-like phylogeny of R1a-M417 starting with the Pre-Corded Ware expansion, then what is happening with the Alexandria sample, so precisely radiocarbon dated to ca. 4045-3974 BC? The reported hg. R1a-M417 was fully compatible, while R1a-Z645 could be compatible with its date, but the few positive SNPs I got in my analysis point indeed to a potential subclade of R1a-Z94, and I trust more experienced hobbyists in this ‘art’ of ascertaining the SNPs of ancient samples, and they report hg. R1a-Z93 (Z95+, Y26+, Y2-).

Seeing how Y-DNA bottlenecks worked in Yamnaya-Afanasievo and in Corded Ware and related groups, and if this sample really is so deep within R1a-Z93 in a region that should be more strongly affected by the known Neolithic Y-chromosome bottlenecks and forest-steppe ecotone, someone from the lab responsible for this sample should check its date once again, before more people keep chasing their tails with an individual that (based on its derived SNPs’ TMRCA) might actually be dated to the Bronze Age, where it could make much more sense in terms of ancestry and position in the PCA.

EDIT (14 SEP 2019): … and with the fact that he is the first individual to show the genetic adaptation for lactase persistence (I3910-T), which is only found later among Bell Beakers, and much later in Sintashta and related Steppe_MLBA peoples (see comments below).

This is also evidenced by the other Ukraine_Eneolithic (likely a late Yamnaya) sample of hg. R1b-Z2103 from Dereivka (ca. 2800 BC) and who – despite being in a similar territory 1,000 years later – shows a wholly diluted Yamnaya ancestry under typically European HG ancestry, even more so than other late Sredni Stog samples from Dereivka of ca. 3600-3400 BC, suggesting a decrease in Steppe ancestry rather than an increase – which is supposedly what should be expected based on the ancestry from Alexandria…

Like the reported Chalcolithic individual of Hajji Firuz who showed an apparently incompatible subclade and Yamnaya ancestry at least some 1,000 years before it should, and turned out to be from the Iron Age (see below), this may be another case of wrong radiocarbon dating.

NOTE. It would be interesting, if this turns out to be another Hajji Firuz-like error, to check how well different ancestry models worked in whose hands exactly, and if anyone actually pointed out that this sample was derived, and not ancestral, to many different samples that were used in combination with it. It would also be a great control to check if those still supporting a Sredni Stog origin for PIE would shift their preference even more to the north or west, depending on where the first “true” R1a-M417 samples popped up. Such a finding now could be thus a great tool to discover whether haplogroup-based bias plays a role in ancestry magic as related to the Indo-European question, i.e. if it really is about “pure statistics”, or there is something else to it…

II.1. R1b-L51-rich Bell Beakers

The overwhelming majority of R1b-L51 lineages in Radovesice during the Bell Beaker period, just after the sampled Corded Ware individuals from the same site, further strengthen the hypothesis of an almost full replacement of R1a-M417 lineages from Central Europe up to southern Scandinavia after the arrival of Bell Beakers.

Yet another R1b-L151* sample has popped up in Central Europe, in the individual classified as Bilina_BA (ca. 2200-800 BC), which clusters with Bell Beakers from Bohemia, with the outlier from Turlojiškė, and with Early Slavs, suggesting once again that a group of central-east European Beakers represented the Pre-Proto-Balto-Slavic community before their spread and admixture events to the east.

The available ancient distribution of R1b-L51*, R1b-L52* or R1b-L151* is getting thus closer to the most likely origin of R1b-L51 in the expansion of East Bell Beakers, who trace their paternal ancestors to Yamnaya settlers from the Carpathian Basin:

NOTE. Some of these are from other sources, and some are samples I have checked in a hurry, so I may have missed some derived SNPs. If you send me a corrected SNP call to dismiss one of these, or more ‘archaic’ samples, I’ll correct the map accordingly. See also maps of modern distributionof R1b-M269 subclades.

r1b-l51-ancient-europe
Distribution of ‘archaic’ R1b-L51 subclades in ancient samples, overlaid over a map of Yamnaya and Bell Beaker migrations. In blue, Yamnaya Pre-L51 from Lopatino (not shown) and R1b-L52* from BBC Augsburg. In violet, R1b-L51 (xP312,xU106) from BBC Prague and Poland. In maroon, hg. R1b-L151* from BBC Hungary, BA Bohemia, and (not shown) a potential sample from BBC at Mondelange, which is certainly xU106, maybe xP312. Interestingly, the earliest sample of hg. R1b-U106 (a lineage more proper of northern Europe) has been found in a Bell Beaker from Radovesice (ca. 2350 BC), between two of these ‘archaic’ R1b-L51 samples; and a sample possibly of hg. R1b-ZZ11+ (ancestral to DF27 and U152) was found in a Bell Beaker from Quedlinburg, Germany (ca. 2290 BC), to the north-west of Bohemia. The oldest R1b-U152 are logically from Central Europe, too.

III. Proto-Indo-Iranian

Before the emergence of Proto-Indo-Iranian, it seems that Pre-Proto-Indo-Iranian-speaking Poltavka groups were subjected to pressure from Central_Steppe_EMBA-related peoples coming from the (south-?)east, such as those found sampled from Mereke_BA. Their ‘kurgan’ culture was dated correctly to approximately the same date as Poltavka materials, but their ancestry and hg. N2(pre-N2a) – also found in a previous sample from Botai – point to their intrusive nature, and thus to difficulties in the Pre-Proto-Indo-Iranian community to keep control over the previous East Yamnaya territory in the Don-Volga-Ural steppes.

We know that the region does not show genetic continuity with a previous period (or was not under this ‘eastern’ pressure) because of an Eastern Yamnaya sample from the same site (ca. 3100 BC) showing typical Yamnaya ancestry. Before Yamnaya, it is likely that Pre-Yamnaya ancestry formed through admixture of EHG-like Khvalynsk with a North Caspian steppe population similar to the Steppe_Eneolithic samples from the North Caucasus Piedmont (see Anthony 2019), so we can also rule out some intermittent presence of a Botai/Kelteminar-like population in the region during the Khvalynsk period.

It is very likely, then, that this competition for the same territory – coupled with the known harsher climate of the late 3rd millennium BC – led Poltavka herders to their known joint venture with Abashevo chiefs in the formation of the Sintashta-Potapovka-Filatovka community of fortified settlements. Supporting these intense contacts of Poltavka herders with Central Asian populations, late ‘outliers’ from the Volga-Ural region show admixture with typical Central_Steppe_MLBA populations: one in Potapovka (ca. 2220 BC), of hg. R1b-Z2103; and four in the Sintashta_MLBA_o1 cluster (ca. 2050-1650 BC), with two samples of hg. R1b-L23 (one R1b-Z2109), one Q1b-L56(xL53), one Q1b-Y6798.

central-steppe-pastoralists
Outlier analysis reveals ancient contacts between sites. We plot the average of principal component 1 (x axis) and principal component 2 (y axis) for the West Eurasian and All Eurasian PCA plots (…). In the Middle to Late Bronze Age Steppe, we observe, in addition to the Western_Steppe_MLBA and Central_Steppe_MLBA clusters (indistinguishable in this projection), outliers admixed with other ancestries. The BMAC-related admixture in Kazakhstan documents northward gene flow onto the Steppe and confirms the Inner Asian Mountain Corridor as a conduit for movement of people.

Similar to how the Sintashta_MLBA_o2 cluster shows an admixture with central steppe populations and hg. R1a-Z645, the WSHG ancestry in those outliers from the o1 cluster of typically (or potentially) Yamnaya lineages show that Poltavka-like herders survived well after centuries of Abashevo-Poltavka coexistence and admixture events, supporting the formation of a Proto-Indo-Iranian community from the local language as pronounced by the incomers, who dominated as elites over the fortified settlements.

The Proto-Indo-Iranian community likely formed thus in situ in the Don-Volga-Ural region, from the admixture of locals of Yamnaya ancestry with incomers of Corded Ware ancestry – represented by the ca. 67% Yamnaya-like ancestry and ca. 33% ancestry from the European cline. Their community formed thus ca. 1,000 years later than the expansion of Late PIE ca. 3500 BC, and expanded (some 500 years after that) a full-fledged Proto-Indo-Iranian language with the Srubna-Andronovo horizon, further admixing with ca. 9% of Central_Steppe_EMBA (WSHG-related) ancestry in their migration through Central Asia, as reported in the paper.

IV. Armenian

The sample from Hajji Firuz, of hg. R1b-Z2103 (xPF331), has been – as expected – re-dated to the Iron Age (ca. 1193-1019 BC), hence it may offer – together with the samples from the Levant and their Aegean-like ancestry rapidly diluted among local populations – yet another proof of how the Late Bronze Age upheaval in Europe was the cause of the Armenian migration to the Armenoid homeland, where they thrived under the strong influence from Hurro-Urartian.

middle-east-armenia-y-dna
Y-chromosome haplogroups of the Middle East and neighbouring groups during the Late Bronze Age / Iron Age. See full maps.

Indus Valley Civilization and Dravidian

A surprise came from the analysis reported by Shinde et al. (2019) of an Iran_N-related IVC ancestry which may have split earlier than 10000 BC from a source common to Iran hunter-gatherers of the Belt Cave.

For the controversial Elamo-Dravidian hypothesis of the Muscovite school, this difference in ancestry between both groups (IVC and Iran Neolithic) seems to be a death blow, if population genomics was even needed for that. Nevertheless, I guess that a full rejection of a recent connection will come down to more recent and subtle population movements in the area.

EDIT (12 SEP): Apparently, Iosif Lazaridis is not so sure about this deep splitting of ‘lineages’ as shown in the paper, so we may be talking about different contributions of AME+ANE/ENA, which means the Elamo-Dravidian game is afoot; at least in genomics:

I shared the idea that the Indus Valley Civilization was linked to the Proto-Dravidian community, so I’m inclined to support this statement by Narasimhan, Patterson, et al. (2019), even if based only on modern samples and a few ancient ones:

The strong correlation between ASI ancestry and present-day Dravidian languages suggests that the ASI, which we have shown formed as groups with ancestry typical of the Indus Periphery Cline moved south and east after the decline of the IVC to mix with groups with more AASI ancestry, most likely spoke an early Dravidian language.

india-steppe-indus-valley-andamanese-ancestry
Natural neighbour interpolation of qpAdm results – Maximum A Posteriori Estimate from the Hierarchical Model (estimates used in the Narasimhan, Patterson et al. 2019 figures) for Central_Steppe_MLBA-related (left), Indus_Periphery_West-related (center) and Andamanese_Hunter-Gatherer-related ancestry (right) among sampled modern Indian populations. In blue, peoples of IE language; in red, Dravidian; in pink, Tibeto-Burman; in black, unclassified. See full image.

I am wary of this sort of simplistic correlation with modern speakers, because we have seen what happened with the wrong assumptions about modern Balto-Slavic and Finno-Ugric speakers and their genetic profile (see e.g. here or here). In fact, I just can’t differentiate as well as those with deep knowledge in South Asian history the social stratification of the different tribal groups – with their endogamous rules under the varna and jati systems – in the ancestry maps of modern India. The pattern of ancestry and language distribution combined with the findings of ancient populations seem in principle straightforward, though.

Conclusion

The message to take home from Shinde et al. (2019) is that genomic data is fully at odds with the Anatolian homeland hypothesis – including the latest model by Heggarty (2014)* – whose relevance is still overvalued today, probably due in part to the shift of OIT proponents to more reasonable Out-of-Iran models, apparently more fashionable as a vector of Indo-Aryan languages than Eurasian steppe pastoralists?
*The authors listed this model erroneously as Heggarty (2019).

The paper seems to play with the occasional reference to Corded Ware as a vector of expansion of Indo-European languages, even after accepting the role of Yamnaya as the most evident population expanding Late PIE to western Europe – and the different ancestry that spread with Indo-Iranian to South Asia 1,000 years later. However, the most cringe-worthy aspect is the sole citation of the debunked, pseudoscientific glottochronological method used by Ringe, Warnow, and Taylor (2002) to support the so-called “steppe homeland”, a paper and dialectal scheme which keeps being referenced in papers of the Reich Lab, probably as a consequence of its use in Anthony (2007).

On the other hand, these are the equivalent simplistic comments in Narasimhan, Patterson et al. (2019):

The Steppe ancestry in South Asia has the same profile as that in Bronze Age Eastern Europe, tracking a movement of people that affected both regions and that likely spread the unique features shared between Indo-Iranian and Balto-Slavic languages. (…), which despite their vast geographic separation share the “satem” innovation and “ruki” sound laws.

mallory-adams-tree
Indo-European dialectal relationships, from Mallory and Adams (2006).

The only academic closely related to linguistics from the list of authors, as far as I know, is James P. Mallory, who has supported a North-West Indo-European dialect (including Balto-Slavic) for a long time – recently associating its expansion with Bell Beakers – opposed thus to a Graeco-Aryan group which shared certain innovations, “Satemization” not being one of them. Not that anyone needs to be a linguist to dismiss any similarities between Balto-Slavic and Indo-Iranian beyond this phonetic trend, mind you.

Even Anthony (2019) supports now R1b-rich Pre-Yamnaya and Yamnaya communities from the Don-Volga region expanding Middle and Late Proto-Indo-European dialects.

So how does the underlying Corded Ware ancestry of eastern Europe (where Pre-Balto-Slavs eventually spread to from Bell Beaker-derived groups) and of the highly admixed (“cosmopolitan”, according to the authors) Sintashta-Potapovka-Filatovka in the east relate to the similar-but-different phonetic trends of two unrelated IE dialects?

If only there was a language substrate that could (as Shinde et al. put it) “elegantly” explain this similar phonetic evolution, solving at the same time the question of the expansion of Uralic languages and their strong linguistic contacts with steppe peoples. Say, Eneolithic populations of mainly hunter-fisher-gatherers from the North Pontic forest-steppes with a stronger connection to metalworking

Related

A Song of Sheep and Horses, revised edition, now available as printed books

cover-song-sheep-and-horses

As I said 6 months ago, 2019 is a tough year to write a blog, because this was going to be a complex regional election year and therefore a time of political promises, hence tenure offers too. Now the preliminary offers have been made, elections have passed, but the timing has slightly shifted toward 2020. So I may have the time, but not really any benefit of dedicating too much effort to the blog, and a lot of potential benefit of dedicating any time to evaluable scientific work.

On the other hand, I saw some potential benefit for publishing texts with ISBNs, hence the updates to the text and the preparation of these printed copies of the books, just in case. While Spain’s accreditation agency has some hard rules for becoming a tenured professor, especially for medical associates (whose years of professional experience are almost worthless compared to published peer-reviewed papers), it is quite flexible in assessing one’s merits.

However, regional and/or autonomous entities are not, and need an official identifier and preferably printed versions to evaluate publications, such as an ISBN for books. I took thus some time about a month ago to update the texts and supplementary materials, to publish a printed copy of the books with Amazon. The first copies have arrived, and they look good.

series-song-sheep-horses-cover

Corrections and Additions

Titles
I have changed the names and order of the books, as I intended for the first publication – as some of you may have noticed when the linguistic book was referred to as the third volume in some parts. In the first concept I just wanted to emphasize that the linguistic work had priority over the rest. Now the whole series and the linguistic volume don’t share the same name, and I hope this added clarity is for the better, despite the linguistic volume being the third one.

Uralic dialects
I have changed the nomenclature for Uralic dialects, as I said recently. I haven’t really modified anything deeper than that, because – unlike adding new information from population genomics – this would require for me to do a thorough research of the most recent publications of Uralic comparative grammar, and I just can’t begin with that right now.

Anyway, the use of terms like Finno-Ugric or Finno-Samic is as correct now for the reconstructed forms as it was before the change in nomenclature.

west-east-uralic-schema

Mediterranean
The most interesting recent genetic data has come from Iberia and the Mediterranean. Lacking direct data from the Italian Peninsula (and thus from the emergence of the Etruscan and Rhaetian ethnolinguistic community), it is becoming clearer how some quite early waves of Indo-Europeans and non-Indo-Europeans expanded and shrank – at least in West Iberia, West Mediterranean, and France.

Finno-Ugric
Some of the main updates to the text have been made to the sections on Finno-Ugric populations, because some interesting new genetic data (especially Y-DNA) have been published in the past months. This is especially true for Baltic Finns and for Ugric populations.

ananino-culture-new

Balto-Slavic
Consequently, and somehow unsurprisingly, the Balto-Slavic section has been affected by this; e.g. by the identification of Early Slavs likely with central-eastern populations dominated by (at least some subclades of) hg. I2a-L621 and E1b-V13.

Maps
I have updated some cultural borders in the prehistoric maps, and the maps with Y-DNA and mtDNA. I have also added one new version of the Early Bronze age map, to better reflect the most likely location of Indo-European languages in the Early European Bronze Age.

As those in software programming will understand, major changes in the files that are used for maps and graphics come with an increasing risk of additional errors, so I would not be surprised if some major ones would be found (I already spotted three of them). Feel free to communicate these errors in any way you see fit.

bronze-age-early-indo-european
European Early Bronze Age: tentative langage map based on linguistics, archaeology, and genetics.

SNPs
I have selected more conservative SNPs in certain controversial cases.

I have also deleted most SNP-related footnotes and replaced them with the marking of each individual tentative SNP, leaving only those footnotes that give important specific information, because:

  • My way of referencing tentative SNP authors did not make it clear which samples were tentative, if there were more than one.
  • It was probably not necessary to see four names repeated 100 times over.
  • Often I don’t really know if the person I have listed as author of the SNP call is the true author – unless I saw the full SNP data posted directly – or just someone who reposted the results.
  • Sometimes there are more than one author of SNPs for a certain sample, but I might have added just one for all.
ancient-dna-all
More than 6000 ancient DNA samples compiled to date.

For a centralized file to host the names of those responsible for the unofficial/tentative SNPs used in the text – and to correct them if necessary -, readers will be eventually able to use Phylogeographer‘s tool for ancient Y-DNA, for which they use (partly) the same data I compiled, adding Y-Full‘s nomenclature and references. You can see another map tool in ArcGIS.

NOTE. As I say in the text, if the final working map tool does not deliver the names, I will publish another supplementary table to the text, listing all tentative SNPs with their respective author(s).

If you are interested in ancient Y-DNA and you want to help develop comprehensive and precise maps of ancient Y-DNA and mtDNA haplogroups, you can contact Hunter Provyn at Phylogeographer.com. You can also find more about phylogeography projects at Iain McDonald’s website.

Graphics
I have also added more samples to both the “Asian” and the “European” PCAs, and to the ADMIXTURE analyses, too.

I previously used certain samples prepared by amateurs from BAM files (like Botai, Okunevo, or Hittites), and the results were obviously less than satisfactory – hence my criticism of the lack of publication of prepared files by the most famous labs, especially the Copenhagen group.

Fortunately for all of us, most published datasets are free, so we don’t have to reinvent the wheel. I criticized genetic labs for not releasing all data, so now it is time for praise, at least for one of them: thank you to all responsible at the Reich Lab for this great merged dataset, which includes samples from other labs.

NOTE. I would like to make my tiny contribution here, for beginners interested in working with these files, so I will update – whenever I have time – the “How To” sections of this blog for PCAs, PCA3d, and ADMIXTURE.

-iron-age-europe-romans
Detail of the PCA of European Iron Age populations. See full versions.

ADMIXTURE
For unsupervised ADMIXTURE in the maps, a K=5 is selected based on the CV, giving a kind of visual WHG : NWAN : CHG/IN : EHG : ENA, but with Steppe ancestry “in between”. Higher K gave worse CV, which I guess depends on the many ancient and modern samples selected (and on the fact that many samples are repeated from different sources in my files, because I did not have time to filter them all individually).

I found some interesting component shared by Central European populations in K=7 to K=9 (from CEU Bell Beakers to Denmark LN to Hungarian EBA to Iberia BA, in a sort of “CEU BBC ancestry” potentially related to North-West Indo-Europeans), but still, I prefer to go for a theoretically more correct visualization instead of cherry-picking the ‘best-looking’ results.

Since I made fun of the search for “Siberian ancestry” in coloured components in Tambets et al. 2018, I have to be consistent and preferred to avoid doing the same here…

qpAdm
In the first publication (in January) and subsequent minor revisions until March, I trusted analyses and ancestry estimates reported by amateurs in 2018, which I used for the text adding my own interpretations. Most of them have been refuted in papers from 2019, as you probably know if you have followed this blog (see very recent examples here, here, or here), compelling me to delete or change them again, and again, and again. I don’t have experience from previous years, although the current pattern must have been evidently repeated many times over, or else we would be still talking about such previous analyses as being confirmed today…

I wanted to be one step ahead of peer-reviewed publications in the books, but I prefer now to go for something safe in the book series, rather than having one potentially interesting prediction – which may or may not be right – and ten huge mistakes that I would have helped to endlessly redistribute among my readers (online and now in print) based on some cherry-picked pairwise comparisons. This is especially true when predictions of “Steppe“- and/or “Siberian“-related ancestry have been published, which, for some reason, seem to go horribly wrong most of the time.

I am sure whole books can be written about why and how this happened (and how this is going to keep happening), based on psychology and sociology, but the reasons are irrelevant, and that would be a futile effort; like writing books about glottochronology and its intermittent popularity due to misunderstood scientist trends. The most efficient way to deal with this problem is to avoid such information altogether, because – as you can see in the current revised text – they wouldn’t really add anything essential to the content of these books, anyway.

Continue reading

Official site of the book series:
A Song of Sheep and Horses: eurafrasia nostratica, eurasia indouralica

Consequences of O&M 2018 (I): The latest West Yamna “outlier”

This is the first of a series of posts analyzing the findings of the recent Nature papers Olalde et al.(2018) and Mathieson et al.(2018) (abbreviated O&M 2018).

As expected, the first Y-DNA haplogroup of a sample from the North Pontic region (apart from an indigenous European I2 subclade) during its domination by the Yamna culture is of haplogroup R1b-L23, and it is dated ca. 2890-2696 BC. More specifically, it is of Z2103 subclade, the main lineage found to date in Yamna samples. The site in question is Dereivka, “in the southern part of the middle Dnieper, at the boundary between the forest-steppe and the steppe zones”.

NOTE: A bit of history for those lost here, which appear to be many: the classical Yamna culture – from previous late Khvalynsk, and (probably) Repin groupsspread west of the Don ca. 3300 BC creating a cultural-historical community – and also an early offshoot into Asia – , with mass migrations following some centuries later along the Danube to the Carpathian Basin, but also south into the Balkans, and north along the Prut. There is thus a very short time frame to find Yamna peoples shaping these massive migrations – the most likely speakers of Late Proto-Indo-European dialects – in Ukraine, compared to their most stable historical settlements east of the Don River.

There is no data on this individual in the supplementary material – since Eneolithic Dereivka samples come from stored dental remains – , but the radiocarbon date (if correct) is unequivocal: the Yamna cultural-historical community dominated over that region at that precise time. Why would the authors name it just “Ukraine_Eneolithic”? They surely took the assessment of archaeologists, and there is no data on it, so I agree this is the safest name to use for a serious paper. This would not be the first sample apparently too early for a certain culture (e.g. Catacomb in this case) which ends up being nevertheless classified as such. And it is also not impossible that it represents another close Ukraine Eneolithic culture, since ancestral cultural groups did not have borders…

NOTE. Why, on the other hand, was the sample from Zvejnieki – classified as of Latvia_LN – assumed to correspond to “Corded Ware” (like the recent samples from Plinkaigalis242 or Gyvakarai1), when we don’t have data on their cultures either? No conspiracy here, just taking assessments from different archaeologists in charge of these samples: those attributed to “Corded Ware” have been equally judged solely by radiocarbon date, but, combining the known archaeological signs of herding in the region arriving around this time with the old belief (similar to the “Iberia is the origin of Bell Beaker peoples” meme) that “only the Corded Ware culture signals the arrival of herding in the Baltic”. This assumption has been contested recently by Furholt, in an anthropological model that is now mainstream, upheld also by Anthony.

We already know that, out of three previous West Yamna samples, one shows Anatolian Neolithic ancestry, the so-called “Yamna outlier”. We also know that one sample from Yamna in Bulgaria also shows Anatolian Neolithic ancestry, with a distinct ‘southern’ drift, clustering closely to East Bell Beaker samples, as we can still see in Mathieson et al. (2018), see below. So, two “outliers” (relative to East Yamna samples) out of four samples… Now a new, fifth sample from Ukraine is another “outlier”, coinciding with (and possibly somehow late to be a part of) the massive migration waves into Central Europe and the Balkans predicted long ago by academics and now confirmed with Genomics.

I think there are two good explanations right now for its ancestral components and position in PCA:

pca-mathieson2018-yamna
Modified image from Mathieson et al. (2018), including also approximate location of groups from Mittnik et al. (2018), and group (transparent shape outlined by dots) formed by new Bell Beaker samples from Olalde et al. (2018). “Principal components analysis of ancient individuals. Points for 486 ancient individuals are projected onto principal components defined by 777 present-day west Eurasian individuals (grey points). Present-day individuals are shown.”

a) The most obvious one, that the Dnieper-Dniester territory must have been a melting pot, as I suggested, a region which historically connected steppe, forest steppe, and forest zone with the Baltic, as we have seen with early Baltic Neolithic samples (showing likely earlier admixture in the opposite direction). The Yamna population, a rapidly expanding “elite group of patrilineally-related families” (words from the famous 2015 genetic papers, not mine), whose only common genetic trait is therefore Y-DNA haplogroup R1b-L23, must have necessarily acquired other ancestral components of Eneolithic Ukraine during the migrations and settlements west of the Don River.

How many generations are needed for ancestral components and PCA clusters to change to that extent, in regions where only some patrilocal chiefs but indigenous populations remain, and the population probably admixed due to exogamy, back-migrations, and “resurge” events? Not many, obviously, as we see from the differences among the many Bell Beaker samples of R1b-L23 subclades from Olalde et al. (2018)

b) That this sample shows the first genetic sign of the precise population that contributed to the formation of the Catacomb culture. Since it is a hotly debated topic where and how this culture actually formed to gradually replace the Yamna culture in the central region of the Pontic-Caspian steppe, this sample would be a good hint of how its population came to be.

See e.g. for free articles on the Catacomb culture its article on the Encyclopedia of Indo-European Culture, Catacomb culture wagons of the Eurasian steppes, or The Warfare of the Northern Pontic Steppe – Forest-Steppe Pastoral Societies: 2750 – 2000 B.C. There are also many freely available Russian and Ukrainian papers on anthropometry (a discipline I don’t especially like) which clearly show early radiocarbon dates for different remains.

This could then be not ‘just another West Yamna outlier’, but would actually show meaningful ‘resurge’ of Neolithic Ukraine ancestry in the Catacomb culture.

It could be meaningul to derive hypotheses, in the same way that the late Central European CWC sample from Esperstedt (of R1a-M417 subclade) shows recent exogamy directly from the (now more probably eastern part of the) steppe or steppe-forest, and thus implies great mobility among distant CWC groups. Although, given the BB samples with elevated steppe ancestry and close PCA cluster from Olalde et al. (2018), it could also just mean exogamy from a near-by region, around the Carpathian Basin where Yamna migrants settled…

If this was the case, it would then potentially mean a “continuity” break in the steppe, in the region that some looked for as a Balto-Slavic homeland, and which would have been only later replaced by Srubna peoples with steppe ancestry (and probably R1a-Z93 subclades). We would then be more obviously left with only two options: a hypothetic ‘Indo-Slavonic’ North Caspian group to the east (supported by Kortlandt), or a Central-East European homeland near Únětice, as one of the offshoots from the North-West Indo-European group (supported by mainstream Indo-Europeanists).

How to know which is the case? We have to wait for more samples in the region. For the moment, the date seems too early for the known radiocarbon dating of most archaeological remains of the Catacomb Culture.

chalcolithic_late-yamna-catacomb
Diachronic map of Late Copper Age migrations including steppe groups ca. 2600-2250 BC

An important consequence of the addition of these “Yamna outliers” for the future of research on Indo-European migrations is that, especially if confirmed as just another West Yamna sample – with more, similar samples – , early Palaeo-Balkan peoples migrating south of the Danube and later through Anatolia may need to be judged not only in terms of ancestral components or PCA (as in the paper on Minoans and Mycenaeans), but also and more decisively using phylogeography, especially with the earliest samples potentially connected with such migrations.

NOTE. Regarding the controversy (that some R1b European autochthonous continuists want to create) over the origin of the R1b-L151 lineages, we cannot state its presence for sure in Yamna territory right now, but we already have R1b-M269 in the eastern Pontic-Caspian steppe during the Neolithic-Chalcolithic transition, then R1b-L23 and subclades (mainly R1b-Z2013, but also one xZ2103, xL51 which suggests its expansion) in the region before and during the Yamna expansion, and now we have L51 subclades with elevated steppe ancestry in early East Bell Beakers, which most likely descended from Yamna settlers in the Carpathian Basin (yet to be sampled).

Even without express confirmation of its presence in the steppe, the alternative model of a Balkan origin seems unlikely, given the almost certain continuity of expanding Yamna clans as East Bell Beaker ones, in this clearly massive and relatively quick expansion that did not leave much time for founder effects. But, of course, it is not impossible to think about a previously hidden R1b-L151 community in the Carpathian Basin yet to be discovered, adopting North-West Indo-European (by some sort of founder effect) brought there by Yamna peoples of exclusively R1b-Z2103 lineages. As it is not impossible to think about a hidden and ‘magically’ isolated community of haplogroup R1a-M417 in Yamna waiting to be discovered…Just not very likely, either option.

As to why this sample or the other Bell Beaker samples “solve” the question of R1a-Z645 subclades (typical of Corded Ware migrants) not expanding with Yamna, it’s very simple: it doesn’t. What should have settled that question – in previous papers, at least since 2015 – is the absence of this subclade in elite chiefs of clans expanded from Khvalynsk, Yamna, or their only known offshoots Afanasevo and Bell Beaker. Now we only have still more proof, and no single ‘outlier’ in that respect.

No haplogroup R1a among hundreds of samples from a regionally extensive sampling of the only cultures mainstream archaeologists had thoroughly described as potentially representing Indo-European-speakers should mean, for any reasonable person (i.e. without a personal or professional involvement in an alternative hypothesis), that Corded Ware migrants (as expected) did not stem from Yamna, and thus did not spread Late Indo-European dialects.

This haplogroup’s hegemonic presence in North-Eastern Europe – and the lack of N1c lineages until after the Bronze Age – coinciding with dates when Uralicists have guesstimated Uralic dialectal expansion accross this wide region makes the question of the language spread with CWC still clearer. The only surprise would have been to find a hidden and isolated community of R1a-Z645 lineages clearly associated with the Yamna culture.

NOTE. A funny (however predictable) consequence for R1a autochthonous continuists of Northern or Eastern European ancestry: forum commentators are judging if this sample was of the Yamna culture or spoke Indo-European based on steppe component and PCA cluster of the few eastern Yamna samples which define now (you know, with the infallible ‘Yamnaya ancestral component’) the “steppe people” who spoke the “steppe language”™ – including, of course, North-Eastern European Late Neolithic

Not that radiocarbon dates or the actual origin of this sample cannot be wrong, mind you, it just strikes me how twisted such biased reasonings may be, depending on the specific sample at hand… Denial, anger, and bargaining, including shameless circular reasoning – we know the drill: we have seen it a hundred times already, with all kinds of supremacists autochthonous continuists who still today manage to place an oudated mythical symbolism on expanding Proto-Indo-Europeans, or on regional ethnolinguistic continuity…

More detailed posts on the new samples from O&M 2018 and their consequences for the Indo-European demic diffusion to come, indeed…

See also:

Something is very wrong with models based on the so-called ‘Yamnaya admixture’ – and archaeologists are catching up (II)

A new article by Leo S. Klejn tries to improve the Northern Mesolithic Proto-Indo-European homeland model of the Russian school of thought: The Steppe hypothesis of Indo-European origins remains to be proven, Acta Archaeologica, 88:1, 193–204.

Abstract:

Recent genetic studies have claimed to reveal a massive migration of the bearers of the Yamnaya culture (Pit-grave culture) to the Central and Northern Europe. This migration has supposedly lead to the formation of the Corded Ware cultures and thereby to the dispersal of Indo-European languages in Europe. The article is a summary presentation of available archaeological, linguistic, genetic and cultural data that demonstrates many discrepancies in the suggested scenario for the transformations caused by the Yamnaya “invasion” some 5000 years ago.

Excerpts:

Both teams [Reich/Anthony, and Willerslev/Kristiansen] interpreted this resemblance in the same way: as evidence of mass migration of the Yamnaya culture from the steppes into the Central and Northern Europe, resulting in the formation of the Corded Ware cultures, and these are universally recognised as Indo-European. Since earlier in this part of Europe existed a different pool of genomes, geneticists presumed that the Yamnaya migration alone had brought the Indo-European languages into Europe. It is difficult to say to what extent the pre-convictions of the involved archaeologists influenced these conclusions, or whether the results of the genetic studies attracted archaeologists with such beliefs.

Mismatch of cultural manifestations

First, we might question the idea of the Yamnaya culture as a unity rather than a loose conglomerate of cultures. Merpert (1974) divided it into nine local groups but did not recognise them as separate cultures. However, in 1975 I suggested that Nerushay (Budzhak) monuments should be recognised as a distinct culture (Klejn 1975), although still as a part of the same broader steppe community.

This was accepted by other specialists (Ivanova 2012; 2013; 2014). Generally, in the western branch of this community, a mixture of the eastern rites of interment with local, Balkan ceramics can be observed. It should be noted that hitherto all genetic samples were taken from eastern material (in the vicinity of Samara in the Volga basin and Kalmykia), while the central thesis concerns the intrusion of the western branch of this community (Budzhak culture) into Europe.

yamnaya-corded-ware.connection
The spread of cultural-historical communities of the Yamnaya culture and the location of the Budzhak culture. GAC – Globular Amphora culture; CWC – Corded Ware culture. After Ivanova 2013.

Simultaneity of cultures

The Yamnaya culture (Chernykh & Orlovskaya 2004a; Heyd 2011; Frȋnculeasa et al. 2015) appears not to be the predecessor of the Corded Ware cultures but is contemporary with them. The Corded Ware cultures appeared also around the turn between the fourth and third millennium BC (Stöckli 2001; Furholt 2003). Their derivation from the Yamnaya seems, therefore, to be less probable. This is evidenced by the fact that the corded beakers or amphorae found in the Budzhak culture are not the prototypes of the corded beakers or amphorae found in more northern territories, but seem instead to be an outcome of contemporaneous contacts (Ivanova 2014; Klejn 2017c).

Discrepancies across the haplogroups

Even more remarkable is the variation in the distribution of types of Y chromosome. In the Yamnaya population, R1b is not just a single occurrence (there are about seven known occurrences) while in the Corded Ware population a different clade of R1b is found and R1a is predominant (several instances). Thus the postulate of unbroken succession finds no support!

yamna-into-corded-ware
Distribution of artefacts and customs of the Yamnaya culture in the area of the Corded Ware cultures. After Bátora 2006.

Paradoxical gradient

In the tables presented in the article by Reichs’ team (Haak et al. 2015) the genetic pool connecting the Yamnaya culture with the Corded Ware people is shown to be more intense in Northern Europe (Norway and Sweden) and decreases gradually from the North to the South (Fig. 6). It is weakest around the Danube, in Hungary, i. e. areas neighbouring the western branch of the Yamnaya culture! This is the reverse image to what the proposed hypothesis by the geneticists would lead us to expect. It is true that this gradient is traced back from the contemporary materials, but it was already present during the Bronze Age (Klejn 2015a).

The author also uses questionable interpretations from selected articles to advance his (as of today) untenable positions regarding a Mesolithic origin of the reconstructible Proto-Indo-European language.

1. Glottochronology, for a PIE origin:

If based on the data of glottochronology (taking into account all disputes) the period of initial dispersal is to be dated to the 7th-5th millennium BC.

2. Doubts on the origin of R1b-L51 subclades expressed in Genetic differentiation between upland and lowland populations shapes the Y-chromosomal landscape of West Asia, by Balanovsky et al. (2017), Human Genetics 136, 4. 437-450:

The currently available dataset does not contradict the hypothesis that R-GG400 marks a link between the East European steppe dwellers and West Asians, though the route and even direction of this migration is disputable. It does, however, demonstrate that present-day West European R1b chromosomes do not originate from the Yamnaya populations analyzed in (Haak et al. 2015; Mathieson et al. 2015) and raises the question of their origin. A Bronze Age origin is more likely than a Neolithic one (Balaresque et al. 2010), but further ancient DNA studies may be necessary to identify this source.

Just yesterday I read the post The retraction paradox: Once you retract, you implicitly have to defend all the many things you haven’t yet retracted, by Andrew Gelman. While – in my opinion – the post does not live up to its title, it poses an interesting question, as to how ad logicam (fallacy fallacy) is often used today in research: One author proposes something that is later demonstrated to be wrong, so everything they wrote or write can be said ipso facto to be wrong…especially if they accept that it was wrong.

This is usual with amateur geneticists (those who don’t publish, and are therefore not subjected to criticism): if anyone is wrong (whether in Archaeology or Genetics), then they are wrong in everything else. It seems to me that Klejn’s theses against recent genetic results rest on the same assumption: The Yamna -> Corded Ware migration model is wrong, ergo the Yamna homeland model is wrong.

I guess this same fallacy is what a lot of angered geneticists (whether professional or amateurs) are going to use to dismiss Klejn’s criticism, trying to focus on what he clearly does not grasp – about genomic data of Yamna peoples and their expansion – to disregard his doubts on genetic interpretations entirely.

I have warned many times about how simplistic interpretations of genetic data would cause a general mistrust in the field, and that archaeologists won’t take the discipline seriously, no matter how many articles get published in famous research tabloids like Nature or Science…

Those who dismiss this warning lightly seem to forget the fate of other recent “scientific breakthroughs” which were initially so promising that Humanities appeared to matter no more, like glottochronology for Linguistics and, to some extent, that of radiocarbon analysis for Archaeology.
EDIT: see here a recent example of discusion on discrepancies between archaeological and 14C-based chronologies, whereby ‘scientific data’ obviously needs archaeological context for a meaningful interpretation

Featured image: The direction of the supposed migration of the bearers of the Yamnaya culture into the area of the Corded Ware cultures. After Haak et al. 2015.

NOTE: I obviously don’t agree with Klejn’s main model: he criticises the Proto-Indo-European steppe homeland, and more specifically the expansion of Yamna peoples with R1b-L23 subclades, which I support. But, probably because of his “pre-convictions” (as he puts it when describing proponents of the steppe hypotheses) about the Proto-Indo-European homeland in Northern Europe during the Mesolithic, he was one of the first renown archaeologists to criticise the obvious inconsistencies in the genetic model of migrations based exclusively on the “Yamnaya ancestral component” concept, and to provoke the necessary reaction from (until then) overconfident geneticists, and he deserves credit for that.

In my opinion, the Russian school’s “Northern European Mesolithic” homeland model – as I have said before – could be based on the appearance of EHG ancestry, or maybe on the expansion of haplogroup R1b with post-Swiderian cultures, but the timeframe proposed is too early for any reconstructible parent proto-language, even for Indo-Uralic.

Related:

The new “Indo-European Corded Ware Theory” of David Anthony

allentoft-yamna-corded-ware

I recently wrote about the Indo-European Corded Ware Theory of Kristian Kristiansen and his workgroup, a sort of “Danish school”, whose aim is to prove a direct, long-lasting interaction between the North Pontic steppe and east European cultures during the Late Neolithic, which supposedly gave rise to a Late Indo-European-speaking Corded Ware culture. That is, a sort of renewed Kurgan model; or, more exactly, Kurgan models, since there is no single one preferred right now.

David Anthony had remained more or less in the background after the controversial assessment of the so-called Yamnaya ancestral component by recent genetic papers, which posited that there was a genetic flow in the Late Neolithic suggesting a migration model that could be hypothetically simplified to Yamna -> Corded Ware -> Bell Beaker.

With his previous publications, especially The Horse, the Wheel, and Language: How Bronze-Age Riders from the Eurasian Steppes Shaped the Modern World (and its revisions), Anthony had set up an impressive revised steppe theory that overcame some of the errors of Gimbutas’ Kurgan model.

Whereas Indo-European-speaking Corded Ware cultures (CWC) were still featured prominently in his model, the different languages supposedly spoken by these groups were explained through multiple cultural diffusion events, and actual migrations from Yamna peoples were only described into Early Bronze Age cultures of the Balkans, and into the Afanasevo culture.

More recently, he has offered (in collaboration with his wife, Dorcas R. Brown) a tentative original connection Yamna -> Corded Ware in the Lesser Poland region, in their paper Molecular archaeology and Indo-European linguistics: Impressions from new data. It seemed to be based merely on recent genetic finds, and on the fact that Corded Ware remains appear to be oldest in that region, according to radiocarbon analysis.

Now he seems to be more and more supportive of this hypothesis in his new essay, Archaeology and Language: Why Archaeologists Care About the Indo-European Problem, in European Archaeology as Anthropology: Essays in Memory of Bernard Wailes ed by P.J. Crabtree and P. Bogucki (2017).

The chapter is interesting to read, as always. Nevertheless, it commits to previous errors – driven by the wrong interpretation of recent genetic papers -, and deepens thus this untenable archaeological-linguistic model of migrations from the steppe, in a weird vicious circle of wrong feedback between archaeologists and geneticists that diregards what archaeologists have been saying in the last decade.

Instead of waiting for the current storm of genetic papers (and their misinformation) to pass, and see what remains, Anthony is now supporting a different model than the one that made him popular, risking the good name he has earned in Archaeology and in popular science texts – in spite of initial setbacks due to the prevailing criticisms of Indo-European migration models.

Some excerpts (emphasis mine):

indo-european-corded-ware-bell-beaker
Central and Eastern Europe ca. 3000–2500 BCE showing the early Yamnaya culture area 3300–2700 BCE and the Yamnaya migration up the Danube Valley with related/offshoot Makó and Vučedol sites; also the distribution of Corded Ware sites in northern Europe; and site areas sampled for aDNA in Haak et al. (2015). The oldest Corded Ware radiocarbon dates are from southern and central Poland. The Yamnaya cemeteries in the Danube Valley are after Heyd (2011), the shaded Globular Amphorae site area is after Harrison and Heyd (2007); the Corded Ware and Globular Amphorae sites in southern Poland are after Machnik (1999); and the blue dots were all Corded Ware sites with radiocarbon dates as of Furholt (2003).

A Yamnaya migration from the steppes up the Danube valley as far as Hungary was already accepted by many archaeologists (Fig. 2.2). Hundreds of Yamnaya-type kurgans and dozens of cemeteries have been recognized by archaeologists in the lower Danube valley, in Bulgaria and Romania; and in the middle Danube valley, in eastern Hungary, with radiocarbon dates that began about 3000–2800 BCE and extended to about 2700–2600 BCE (Ecsedy 1979; Sherratt 1986; Boyadziev 1995; Harrison and Heyd 2007; Heyd 2012; Frînculeasa et al. 2015). The migration stream that created these intrusive cemeteries now can be seen to have continued from eastern Hungary across the Carpathians into southern Poland, where the earliest material traits of the Corded Ware horizon appeared (Furholt 2003). Corded Ware sites appeared in Denmark by 2800–2700 BCE, probably within 100–200 years after the first Yamnaya migrants entered the lower Danube valley. This surprisingly rapid migration introduced genetic traits such as the R1a and R1b Y-chromosome haplogroups and a substantial element of ANE (Ancient North Eurasian) ancestry that remain characteristic of most northern and western Europeans today.

(…)

The oldest radiocarbon dates from Corded Ware sites occur in southern Poland (upper Vistula) and north-central Poland (Kujavia), and this was seen as the region where the early networking of amphorae styles from Globular Amphorae and axe types from Scandinavia began. The genetic evidence shows a somewhat different picture: the Corded Ware people were largely immigrants whose ancestors came from the steppes (probably immediately from eastern Hungary), but they quickly adopted local material traits in amphorae and axe types that obscured their foreign origins. Middle Neolithic northern European populations composed of admixed WHG/EEF survived but were largely excluded from Corded Ware cemeteries, and from marriage into the Corded Ware population. Even centuries after the initial migration the Corded Ware population at Esperstedt, dated 2500–2400 BCE, still exhibited 70–80% Yamnaya genes, although individual variations in the extent of local admixture were apparent. Intermarriage with the surviving local population was more frequent during the ensuing Bell Beaker period. However, the resurgence is more visible in mtDNA than in Y-DNA (Szécsényi-Nagy et al. 2015), suggesting that men of the older EEF heritage were disadvantaged more than women.

(…)

Settlements were more permanent before the Corded Ware migration, and remained so among the Globular Amphorae people, who continued to create more localized site-and-cemetery groups in the same landscape with the more mobile immigrants. Afterward, during the Bell Beaker period, when local genetic ancestry rebounded and the population became more admixed, settlements again were more permanent. The Corded Ware culture introduced both a large, steppe-derived population and an unusually mobile form of pastoral economy that was a regional economic anomaly, but nevertheless survived in varying forms for centuries before the regional economic pattern was re-established. A steppe language certainly accompanied this demographic and economic shift. As we have seen above, there are good independent reasons (loans with Uralic and South Caucasian) to think that PIE was spoken in the steppes. It is likely that the steppe language introduced between 3000–2500 BCE was a late (post-Anatolian) form of PIE and survived and evolved into the later northern IE languages.

So, to sum up the new developments of Anthony’s preferred model:

  1. Abandonment of the multiple cultural diffusion models from Yamna into Corded Ware, i.e. Pre-Germanic (in the Usatovo culture) and Pre-Balto-Slavic (in the Middle Dnieper culture).
  2. The only potential Yamna connection with Corded Ware in Archaeology must come from Yamna migrants in the Carpathian basin. Therefore, R1a must come from Hungarian settlements.
  3. Corded Ware cultures from Northern Europe, from roughly 2800 BC, must come from Yamna settlers of the Carpathian basin.
  4. Esperstedt is a great example of Yamnaya genes, and of the mobility (and lack of intermarriage) of Corded Ware peoples centuries, after their migration from Yamna settlers in Hungary.

My answers (obvious for anyone reading this blog, or my demic diffusion model):

  1. It is a pitty that cultural diffusion models are abandoned. They were the last hope to keep these IE-CWC/Kurgan hypotheses alive.
  2. The Carpathian Basin is obviously the only potential early connection between Corded Ware and Yamna. But no single R1a has been found in western migrants, and admixture (including ancestral components and PCA) from Early Yamna, West Yamna, Balkan EBA, and early Bell Beaker samples from Hungary make it very unlikely that such a connection existed.
  3. Corded Ware peoples formed and began their migration much earlier than Yamna settlers arrived in the Carpathian Basin. Compare e.g. the Late Neolithic sample from Latvia (dated ca. 2885 BC) with steppe ancestry attributed to Corded Ware, or the early appearance of east European cultures like Fatyanovo-Balanovo or Abashevo. Also, known Yamna migration routes don’t include these proposed population expansions.
  4. I have already written about the Esperstedt outlier, and why its definition as an outlier should have been clearly made to avoid this kind of misinterpretations…
yamna-bell-beaker
Yamna – East Bell Beaker migration ca. 3000-2300 BC, according to Heyd (2007)

With each new genetic paper it is less and less likely that many individuals of Y-DNA haplogroup R1a, and especially R1a-Z645 (if any at all), will appear associated with Yamna, either in the Pontic-Caspian steppe or in western settlements (at least clearly belonging to Yamna, Balkan EBA, or Bell Beaker cultures), which will make the life of this new Indo-European Corded Ware Theory model still shorter than could be a priori expected for any archaeological model.

Also, it seems that the Bell Beaker preprint paper by Olalde et al. (2017) will be published in Nature soon with more samples, so a swift rejection of this theory may be near. On the other hand, the first paper on this model by Anthony and Brown (like the first paper of Kristiansen and his workgroup) appeared just before Olalde et al. (2017) and Mathieson et al. (2017), and yet all samples against their pet theories have not deterred any of them to continue supporting them…

I would say it is a shame that some geneticists are misleading good archaeologists into so many different wrong models, but I guess it is only fair to blame authors for what they write, not whom or what they trusted to write…

I think there is much more to be said about the interaction among Neolithic cultures from the steppe (viz. Sredni Stog and Khvalynsk), than about the Yamna migration, and Anthony was in a better position to judge this. Right now, it seems that other researchers like Rassamakin or Ivanova are taking the lead in the research of Neolithic cultures from the steppe, while Heyd or Prescott are taking the lead in the explanation of Yamna -> Bell Beaker migrations and their connection with the expansion of Late Indo-European languages.

#EDIT (December 18 2017): Just to be clear, Anthony’s new Indo-European Corded Ware Theory model in Archaeology would be compatible with the development and expansion of a North-West Indo-European dialect of Late Indo-European in Linguistics (which is my main source of disagreement with other recent models). In fact, Anthony’s new model could explain the different nature of Balto-Slavic, being adopted by peoples of mainly R1a-Z645 subclades of Lesser Poland – from Yamna migrants of R1b-L23 subclades – , and later influencing Pre-Germanic brought by Bell Beakers to Scandinavia, so in that sense it could offer some light to certain controversial linguistic aspects. See Corded Ware Substrate Theory for more on Germanic and Balto-Slavic similarities based on a common, intermediate substrate.

What I am criticising with this post is that the model seems to rely heavily (in fact, almost solely) on what some geneticists (and especially amateurs, fanboys of specific haplogroups and/or admixture components) are selling about the ‘Yamnaya component’ (and thus the assumption of a common migration of peoples of R1a-Z645 and R1b-L23 subclades), something which is – to say the least – highly controversial today. Instead of departing from Archaeology (his field) to try and make sense of what others are saying, he seems to be abandoning his own migration models and adopting one compatible with genetic studies of 2015-2016 made by laymen in Indo-European studies, who based their conclusions on their own new methods, applied to a few scattered samples. These new IECWT proponents are thus in turn giving still more reasons for these geneticists to support wrong assumptions in future studies, by relying on any of these new potential archaeological scenarios. And so on and on it goes…

Related:

Indo-European demic diffusion model, 3rd edition

pca-yamna-corded-ware

I have just uploaded the working draft of the third version of the Indo-European demic diffusion model. Unlike the previous two versions, which were published as essays (fully developed papers), this new version adds more information on human admixture, and probably needs important corrections before a definitive edition can be published.

The third version is available right now on ResearchGate and Academia.edu. I will post the PDF at Academia Prisca, as soon as possible:

pca-map-yamna-corded-ware-bell-beaker
Map overlaid by PCA including Yamna, Corded Ware, Bell Beaker, and other samples

Feel free to comment on the paper here, or (preferably) in our forum.

A working version (needing some corrections) divided by sections, illustrated with up-to-date, high resolution maps, can be found (as always) at the official collaborative Wiki website indo-european.info.

Marija Gimbutas and the expansion of the “Kurgan people” based on tumulus-building cultures

kurgan-expansion

An interesting article that I keep stumbling upon, The tumulus in European prehistory: covering the body, housing the soul, by Anthony Harding (2011):

Finally, in Kurgan IV she saw “continuous waves of expansion or raids[that] touched all of northern Europe, the Aegean area, and the east Mediterranean areas possibly as far south as Egypt”. This was the period of the Catacomb Graves, but also the Early Bronze Age rock-cut tombs of the Mediterranean, Vučedol, Bell Beakers in Hungary, the Single Grave culture of the Nordic region. The Kurgan Culture reached Ireland, she remarked in a paper of 1978 “as early as 3500 B.C.” – by which she presumably referred to megalithic mounds covering passage tombs.

(…)

According to Gimbutas, the “Kurgan people” are evidenced by single graves in deep shafts, often in wooden chests (coffins) or stone cists marked by low earth or stone barrows; the dead lay on their backs with legs contracted; they were buried with flint points or arrowheads, figurines depicting horses’ heads, boars tusk ornaments and animal tooth pendants. Human sacrifice was allegedly performed during the funeral ceremonies,and sometimes ritual graves of cattle and other animals were added. This is said to contrast with what Gimbutas called the culture of Old Europe (i.e. the earlier Neolithic of the Balkans), who “betray a concern for the deification of the dead and the construction of monumental works of architecture visible in mortuary houses,grave markings, tumuli, stone rings or stone stelae, and in the large quantity of weapons found in the graves”.

(…)

single-graves-kurgan
The varying burial traditions of the Early Bronze Age in Central and Eastern Europe (Häusler 1977, fig. 1). Circles: tumuli with the “mound edge principle”. Semicircles: tumuli. Stippling: cremation; other symbols represent inhumation graves, divided according to orientation and sex

Can we really associate the practice of mound-building with a specific people, and assume that the spread of the practice indicates the spread of the people? That is one of the “big questions” of European archaeology, and one which a number of papers in the volume address. My own position is that the practice of tumulus building seems so widespread in time and space that it seems hard to associate it with one particular ethnic group – though I can understand how, in the melting pot that was Early Europe, people could believe this to be the case. There are, however, major arguments against the idea, on archaeological grounds alone – which Häusler’s map indicates very clearly. Burial mode and grave form in Copper and Bronze Age Europe was far too variable for any such simplistic correlation. In any case, what are we to make of the appearance of tumuli in such far-flung places as Japan or North America, where tumuli are very common? It was always unlikely that the megalithic tombs of western Europe were to be associated with movements from the steppe 1000 or 2000 years earlier, and nothing that has happened since Gimbutas was writing has changed that situation

Research has corrected Gimbutas’ opinion on the time of spread of Indo-Europeans, on the role of the horse (see e.g. Anthony 2007) in their expansion, and the unrelatedness of the two main central European Chalcolithic archaeological packages: the Corded Ware package that expanded from the Balkans into north-eastern Europe, and the Yamna package (together with the proto-Beaker package) that evolved into the East Bell Beaker culture.

yamna-migration
Extent of migration of the “Yamna package“, from Heyd 2007

However, the shadow of the “Kurgan people” remains in the outdated body of innumerable writings. It was revived with the first attempts at disentangling Europe’s genetic past (based on the role of R1a in expanding Proto-Indo-European).

Particularly strong in that sense is the model set forth by Kristiansen, who was nevertheless aware since his first proposal of the differences between the ‘Kurgan people’ of the steppe and those of the Corded Ware culture, selecting thus an alternative framework of long-lasting human and economic interactions between the “Kurgan people”, the Globular Amphora and Baden cultures with an origin of the culture in the natural region formed between the Upper Dnieper and Vistula rivers.

This idea is continued today, and has been recently linked with the Agricultural Substrate Hypothesis. Originally proposed by Kroonen and linked to the spread of Middle Eastern “R1b1b2” with agriculture, it is now (in Kristiansen et al. 2017 and more recently in Iversen and Kroonen 2017) linked with the expansion of the Corded Ware culture, thus proposing that Pre-Germanic is a branch separated some 6,000 years ago from other branches…

kurgans-corded-ware
Kristiansen’s (1989) schematic presentation of basic principles of burial positions in the Late Neolithic / EBA cultures in northern Eurasia, following to some extent Häusler (1983)

The linguistic proposal is obviously compatible with mainstream archaeological models – which suggest the introduction of Pre-Germanic in Scandinavia with Bell Beaker peoples -, but since the linguistic proposal alone would probably not make such a fuss without the accompanying genetics, I guess this is the right way to publicise it. I doubt linguists really care about genetics, and I really doubt amateur geneticists will read the linguistic proposal, but who cares.

Kristiansen’s traditional model is obviously in contrast with contemporaneous anthropological writings by Anthony, Heyd, or (Gimbutas’ pupil) Mallory, but is nevertheless becoming a resilient tradition in the interpretation of results in studies of human ancestry in Europe.

I doubt that Gimbutas, who was not very fond of tradition, would be proud of this kind of legacy, though…

Featured image: “European dialect” expansion of Proto-Indo-European according to The Indo-Europeans: Archeological Problems, Gimbutas (1963). Observe the similarities of the western European expansion to the recently proposed expansion of R1b lineages with western Yamna and Bell Beaker.

Related: