Volga Basin R1b-rich Proto-Indo-Europeans of (Pre-)Yamnaya ancestry


New paper (behind paywall) by David Anthony, Archaeology, Genetics, and Language in the Steppes: A Comment on Bomhard, complementing in a favourable way Bomhard’s Caucasian substrate hypothesis in the current issue of the JIES.

NOTE. I have tried to access this issue for some days, but it’s just not indexed in my university library online service (ProQuest) yet. This particular paper is on Academia.edu, though, as are Bomhard’s papers on this issue in his site.

Interesting excerpts (emphasis mine):

Along the banks of the lower Volga many excavated hunting-fishing camp sites are dated 6200-4500 BC. They could be the source of CHG ancestry in the steppes. At about 6200 BC, when these camps were first established at Kair Shak III and Varfolomievka (42 and 28 on Figure 2), they hunted primarily saiga antelope around Dzhangar, south of the lower Volga, and almost exclusively onagers in the drier desert-steppes at Kair-Shak, north of the lower Volga. Farther north at the lower/middle Volga ecotone, at sites such as Varfolomievka and Oroshaemoe hunter-fishers who made pottery similar to that at Kair-Shak hunted onagers and saiga antelope in the desert-steppe, horses in the steppe, and aurochs in the riverine forests. Finally, in the Volga steppes north of Saratov and near Samara, hunter-fishers who made a different kind of pottery (Samara type) and hunted wild horses and red deer definitely were EHG. A Samara hunter-gatherer of this era buried at Lebyazhinka IV, dated 5600-5500 BC, was one of the first named examples of the EHG genetic type (Haak et al. 2015). This individual, like others from the same region, had no or very little CHG ancestry. The CHG mating network had not yet reached Samara by 5500 BC.

Eneolithic settlements (1–5, 7, 10–16, 20, 22–43, 48, 50), burial grounds (6, 8–9, 17–19, 21, 47, 49) and kurgans (44–46) of the steppe Ural-Volga region: 1 Ivanovka; 2 Turganik; 3 Kuzminki; 4 Mullino; 5 Davlekanovo; 6 Sjezheye (burial ground); 7 Vilovatoe; 8 Ivanovka; 9 Krivoluchye; 10–13 LebjazhinkaI-III-IV-V; 14 Gundorovka; 15–16 Bol. Rakovka I-II; 17–18 Khvalunsk I-II; 19 Lipoviy Ovrag; 20 Alekseevka; 21 Khlopkovskiy; 22 Kuznetsovo I; 23 Ozinki II; 24 Altata; 25 Monakhov I; 26 Oroshaemoe; 27 Rezvoe; 28 Varpholomeevka; 29 Vetelki; 30 Pshenichnoe; 31 Kumuska; 32 Inyasovo; 33 Shapkino VI; 34 Russkoe Truevo I; 35 Tsaritsa I-II; 36 Kamenka I; 37 Kurpezhe-Molla; 38 Istay; 39 Isekiy; 40 Koshalak; 41 Kara-Khuduk; 42 Kair-Shak VI; 43 Kombakte; 44 Berezhnovka I-II; 45 Rovnoe; 46 Politotdelskoe; 47 burial near s. Pushkino; 48 Elshanka; 49 Novoorsk; 50 Khutor Repin. Modified from Morgunova (2014).

But before 4500 BC, CHG ancestry appeared among the EHG hunter-fishers in the middle Volga steppes from Samara to Saratov, at the same time that domesticated cattle and sheep-goats appeared. The Reich lab now has whole-genome aDNA data from more than 30 individuals from three Eneolithic cemeteries in the Volga steppes between the cities of Saratov and Samara (Khlopkov Bugor, Khvalynsk, and Ekaterinovka), all dated around the middle of the fifth millennium BC. Many dates from human bone are older, even before 5000 BC, but they are affected by strong reservoir effects, derived from a diet rich in fish, making them appear too old (Shishlina et al 2009), so the dates I use here accord with published and unpublished dates from a few dated animal bones (not fish-eaters) in graves.

Only three individuals from Khvalynsk are published, and they were first published in a report that did not mention the site in the text (Mathieson et al. 2015), so they went largely unnoticed. Nevertheless, they are crucial for understanding the evolution of the Yamnaya mating network in the steppes. They were mentioned briefly in Damgaard et al (2018) but were not graphed. They were re-analyzed and their admixture components were illustrated in a bar graph in Wang et al (2018: figure 2c), but they are not the principal focus of any published study. All of the authors who examined them agreed that these three Khvalynsk individuals, dated about 4500 BC, showed EHG ancestry admixed substantially with CHG, and not a trace of Anatolian Farmer ancestry, so the CHG was a Hotu-Cave or Kotias-Cave type of un-admixed CHG. The proportion of CHG in the Wang et al. (2018) bar graphs is about 20-30% in two individuals, substantially less CHG than in Yamnaya; but the third Khvalynsk individual had more than 50% CHG, like Yamnaya. The ca. 30 additional unpublished individuals from three middle Volga Eneolithic cemeteries, including Khvalynsk, preliminarily show the same admixed EHG/CHG ancestry in varying proportions. Most of the males belonged to Y-chromosome haplogroup R1b1a, like almost all Yamnaya males, but Khvalynsk also had some minority Y-chromosome haplogroups (R1a, Q1a, J, I2a2) that do not appear or appear only rarely (I2a2) in Yamnaya graves.

Pontic-Caspian steppe and neighbouring groups in the Neolithic. See full map.

Wang et al. (2018) discovered that this middle Volga mating network extended down to the North Caucasian steppes, where at cemeteries such as Progress-2 and Vonyuchka, dated 4300 BC, the same Khvalynsk-type ancestry appeared, an admixture of CHG and EHG with no Anatolian Farmer ancestry, with steppe-derived Y-chromosome haplogroup R1b. These three individuals in the North Caucasus steppes had higher proportions of CHG, overlapping Yamnaya. Without any doubt, a CHG population that was not admixed with Anatolian Farmers mated with EHG populations in the Volga steppes and in the North Caucasus steppes before 4500 BC. We can refer to this admixture as pre-Yamnaya, because it makes the best currently known genetic ancestor for EHG/CHG R1b Yamnaya genomes. The Progress-2 individuals from North Caucasus steppe graves lived not far from the pre-Maikop farmers of the Belaya valley, but they did not exchange mates, according to their DNA.

The hunter-fisher camps that first appeared on the lower Volga around 6200 BC could represent the migration northward of un-admixed CHG hunter-fishers from the steppe parts of the southeastern Caucasus, a speculation that awaits confirmation from aDNA. After 5000 BC domesticated animals appeared in these same sites in the lower Volga, and in new ones, and in grave sacrifices at Khvalynsk and Ekaterinovka. CHG genes and domesticated animals flowed north up the Volga, and EHG genes flowed south into the North Caucasus steppes, and the two components became admixed. After approximately 4500 BC the Khvalynsk archaeological culture united the lower and middle Volga archaeological sites into one variable archaeological culture that kept domesticated sheep, goats, and cattle (and possibly horses). In my estimation, Khvalynsk might represent the oldest phase of PIE.

Pontic-Caspian steppe and neighbouring groups in the Early Eneolithic. See full map.

Anatolian Farmer ancestry and Yamnaya origins

The Eneolithic Volga-North Caucasus mating network (Khvalynsk/Progress-2 type) exhibited EHG/CHG admixtures and Y-chromosome haplogroups similar to Yamnaya, but without Yamnaya’s additional Anatolian Farmer ancestry. (…)

Like the Mesolithic and Neolithic populations here, the Eneolithic populations of Dnieper-Donets II type seem to have limited their mating network to the rich, strategic region they occupied, centered on the Rapids. The absence of CHG shows that they did not mate frequently if at all with the people of the Volga steppes, a surprising but undeniable discovery. Archaeologists have seen connections in ornament types and in some details of funeral ritual between Dnieper-Donets cemeteries of the Mariupol-Nikol’skoe type and cemeteries in the middle Volga steppes such as Khvalynsk and S’yez’zhe (Vasiliev 1981:122-123). Also their cranio-facial types were judged to be similar (Bogdanov and Khokhlov 2012:212). So it it surprising that their aDNA does not indicate any genetic admixture with Khvalynsk or Progress-2. Also, neither they nor the Volga steppe Eneolithic populations showed any Anatolian Farmer ancestry. (…)

All three of the steppe-admixed exceptions were from the Varna region (Mathieson et al. 2018). One of them was the famous “golden man’ at Varna (Krause et al. 2016), Grave 43, whose steppe ancestry was the most doubtful of the three. If he had steppe ancestry, it was sufficiently distant (five+ generations before him) that he was not a statistically significant outlier, but he was displaced in the steppe direction, away from the central values of the majority of typical Anatolian Farmers at Varna and elsewhere. The other two, at Varna (grave 158, a 5-7-year-old girl) and Smyadovo (grave 29, a male 20-25 years old), were statistically significant outliers who had recent steppe ancestry (consistent with grandparents or great-grandparents) of the EHG/CHG Khvalynsk/Progress-2 type, not of the Dnieper Rapids EHG/WHG type.

(…) I believe that the Suvorovo-Cernavoda I movement into the lower Danube valley and the Balkans about 4300 BC separated early PIE-speakers (pre-Anatolian) from the steppe population that stayed behind in the steppes and that later developed into late PIE and Yamnaya.

This archaeological transition marked the breakdown of the mating barrier between steppe and Anatolian Farmer mating networks. After this 4300-4200 BC event, Anatolian Farmer ancestry began to pop up in the steppes. The currently oldest sample with Anatolian Farmer ancestry in the steppes in an individual at Aleksandriya, a Sredni Stog cemetery on the Donets in eastern Ukraine. Sredni Stog has often been discussed as a possible Yamnaya ancestor in Ukraine (Anthony 2007: 239- 254). The single published grave is dated about 4000 BC (4045– 3974 calBC/ 5215±20 BP/ PSUAMS-2832) and shows 20% Anatolian Farmer ancestry and 80% Khvalynsk-type steppe ancestry (CHG&EHG). His Y-chromosome haplogroup was R1a-Z93, similar to the later Sintashta culture and to South Asian Indo-Aryans, and he is the earliest known sample to show the genetic adaptation to lactase persistence (I3910-T). Another pre-Yamnaya grave with Anatolian Farmer ancestry was analyzed from the Dnieper valley at Dereivka, dated 3600-3400 BC (grave 73, 3634–3377 calBC/ 4725±25 BP/ UCIAMS-186349). She also had 20% Anatolian Farmer ancestry, but she showed less CHG than Aleksandriya and more Dereivka-1 ancestry, not surprising for a Dnieper valley sample, but also showing that the old fifth-millennium-type EHG/WHG Dnieper ancestry survived into the fourth millennium BC in the Dnieper valley (Mathieson et al. 2018).

Pontic-Caspian steppe and neighbouring groups in the Late Eneolithic. See full map.

Probably, late PIE (Yamnaya) evolved in the same part of the steppes—the Volga-Caucasus steppes between the lower Don, the lower and middle Volga, and the North Caucasus piedmont—where early PIE evolved, and where appropriate EHG/CHG admixtures and Y-chromosome haplogroups were seen already in the Eneolithic (without Anatolian Farmer). There have always been archaeologists who argued for an origin of Yamnaya in the Volga steppes, including Gimbutas (1963), Merpert (1974), and recently Morgunova (2014), who argued that this was where Repin-type ceramics, an important early Yamnaya pottery type, first appeared in dated contexts before Yamnaya, about 3600 BC. The genetic evidence is consistent with Yamnaya EHG/CHG origins in the Volga-Caucasus steppes. Also, if contact with the Maikop culture was a fundamental cause of the innovations in transport and metallurgy that defined the Yamnaya culture, then the lower Don-North Caucasus-lower Volga steppes, closest to the North Caucasus, would be where the earliest phase is expected.

I would still guess that the Darkveti-Meshoko culture and its descendant Maikop culture established the linguistic ancestor of the Northwest Caucasian languages in approximately the region where they remained. I also accept the general consensus that the appearance of the hierarchical Maikop culture about 3600 BC had profound effects on pre-Yamnaya and early Yamnaya steppe cultures. Yamnaya metallurgy borrowed from the Maikop culture two-sided molds, tanged daggers, cast shaft hole axes with a single blade, and arsenical copper. Wheeled vehicles might have entered the steppes through Maikop, revolutionizing steppe economies and making Yamnaya pastoral nomadism possible after 3300 BC.

For those who still hoped that Proto-Indo-Europeans of Yamnaya/Afanasievo ancestry from the Don-Volga region were associated with the expansion of hg. R1a-M417, in a sort of mythical “R1-rich” Indo-European society, it seems this is going to be yet another prediction based on ancestry magic that goes wrong.

Proto-Indo-Europeans were, however, associated with other subclades beyond R1b-M269, probably (as I wrote recently) R1b-V1636, I2a-L699, Q1a-M25, and R1a-YP1272, but also interestingly some J subclade, so let’s see what surprises the new study on Khvalynsk and Yamnaya settlers from the Carpathian Basin brings…

On the bright side, it is indirectly confirmed that late Sredni Stog formed part of the neighbouring Corded Ware-like populations of ca. 20-30%+ Anatolian farmer ancestry that gave Yamnaya its share (ca. 6-10%), relative to the comparatively unmixed Khvalynsk and late Repin population (as shown by Afanasevo).

In this steppe mating network that opened up after the Khvalynsk expansion, the increasing admixture of Anatolian farmer-related ancestry in Yamnaya from east (ca. 2-10%) to west (ca. 6-15%) points to an exogamy of late Repin males in their western/south-western regions with populations around the Don River basin and beyond (and endogamy within the Yamnaya community), in an evolution relevant for language expansions and language contacts during the Late Eneolithic.

NOTE. “Mating network” is my new preferred term for “ancestry”. Also great to see scholars finally talk about “Pre-Yamnaya” ancestry, which – combined with the distinction of Yamnaya from Corded Ware ancestry – will no doubt help differentiate fine-scale population movements of steppe- and forest-steppe-related populations.

Modified from Rassamakin (1999), adding red color to Repin expansion. The system of the latest Eneolithic Pointic cultures and the sites of the Zhivotilovo-Volchanskoe type: 1) Volchanskoe; 2) Zhivotilovka; 3) Vishnevatoe; 4) Koisug.

The whole issue of the JIES is centered on Caucasian influences on Early PIE as an Indo-Uralic dialect, and this language contact/substrate is useful to locate the most likely candidates for the Northeast and Northwest Caucasian and the Proto-Indo-European homelands.

On the other hand, it would also be interesting to read a discussion of how this Volga homeland of Middle PIE and Don-Volga-Ural homeland of Late PIE would be reconciled with the known continuous contacts of Uralic with Middle and Late PIE (see here) to locate the most likely Proto-Uralic homeland.

Especially because Corded Ware fully replaced all sub-Neolithic groups to the north and east of Khvalynsk/Yamnaya, like Volosovo, so no other population neighbouring Middle and Late Proto-Indo-Europeans survived into the Bronze Age…

EDIT: For those new to this blog, this information on unpublished samples from the Volga River basin is yet another confirmation of Khokhlov’s report on the R1b-L23 samples from Yekaterinovka, and its confirmation by a co-author of The unique elite Khvalynsk male from a Yekaterinovskiy Cape burial, apart from more support to the newest data placing Yekaterinovka culturally and probably chronologically between Samara and Khvalynsk.


ASoSaH Reread (I): Y-DNA haplogroups among Indo-Europeans (apart from R1b-L23)


Given my reduced free time in these months, I have decided to keep updating the text on Indo-European and Uralic migrations and/or this blog, simultaneously or alternatively, to make the most out of the time I can dedicate to this. I will add the different ‘A Song of Sheep and Horses (ASoSaH) reread’ posts to the original post announcing the books. I would be especially interested in comments and corrections to the book chapters rather than the posts, but any comments are welcome (including in the forum, where comments are more likely to stick).

This is mainly a reread of iv.2. Indo-Anatolians and vi.1. Disintegrating Indo-Europeans.

Indo-Anatolians and Late Indo-Europeans

I have often written about R1b-L23 as the majority haplogroup among Late Proto-Indo-Europeans (see my predictions for 2018 and my summary of 2018), but always expected other haplogroups to pop up somewhere along the way, in Khvalynsk, in Repin, in Yamna, and in Bell Beakers (see e.g. the post on common fallacies of R1a/IE-fans).

Luckily enough – for those of us who want precise answers to our previous infinite models of Indo-European language expansions (viz. GAC-associated expansion, IE-speaking Old Europe, Anatolian homeland, Iran homeland, Maykop as Proto-Anatolian, Palaeolithic Continuity Theory, Celtic in the Atlantic façade, etc.) – the situation has been more clear-cut than expected: it turns out that, especially during population expansions, acute Y-chromosome bottlenecks were very common in the past, at least until the Iron Age.

Khvalynsk and Repin-Yamna expansions were no different, and that seems quite natural in hindsight, given the strong familial ties and aversion to foreigners proper of the Late Proto-Indo-European society and culture – probably not really that different from other contemporary societies, like the neighbouring Late Proto-Uralic or Trypillian ones.

Y-DNA samples from Khvalynsk and neighbouring cultures. See full version here.

Y-DNA haplogroups

During the expansion of early Khvalynsk, the most likely Indo-Anatolian culture, the society of the Don-Volga area was probably made up of different lineages including R1b-V1636, R1b-M269, R1a-YP1272, Q1a-M25, and I2a-L699 (and possibly some R1b-V88?), a variability possibly greater than that of the contemporary north Pontic area, probably a sign of this region being a sink of different east and west migrations from steppe and forest areas.

During its expansion, the Khvalynsk society saw its haplogroup variability reduced, as evidenced by the succeeding expansive Repin culture:

Afanasevo, representing Pre-Tocharian (the earliest Late PIE dialect to branch off), expanded with R1b-L23 – especially R1b-Z2103 – lineages, while early Yamna expanded with R1b-L23 and I2a-L699 lineages, which suggests that these are the main haplogroups that survived the Y-DNA bottleneck undergone during the Khvalynsk expansion, and especially later during the late Repin expansion. Nevertheless, other old haplogroups might still pop up during the Repin and early Yamna period, such as the R1b-V1636 sample from Yamna in the Northern Caucasus.

It is still unclear if R1b-L23 sister clade R1b-PF7562 (formed ca. 4400 BC, TMRCA ca. 3400 BC), prevalent among modern Albanians, expanded with Yamna migrants, or if it was part of an earlier expansion of R1b-M269 into the Balkans, and represent thus Indo-Anatolian speakers who later hitchhiked the expansion of the Late PIE language from the north or west Pontic area. The early TMRCA seems to suggest an association with Repin (and therefore Yamna), rather than later movements in the Balkans.

Y-DNA samples from Yamnaya and neighbouring cultures. See full version here.

‘Yamnaya’ or ‘steppe’ ancestry?

After the early years when population genetics relied mainly on modern Y-DNA haplogroups, geneticists and amateurs have been recently playing around with testing “ancestry percentages”, based on newly developed free statistical tools, which offer obviously just one among many types of data to achieve a proper interpretation of the past.

Today we have quite a lot Y-DNA haplogroups reported for ancient samples of more recent prehistoric periods, and they seem to offer (at least since the 2015 papers, but more evidently since the 2018 papers on Bell Beakers and Europeans, Corded Ware, or Fennoscandia among others) the most straightforward interpretation of all results published in population genomics research.

NOTE. The finding of a specific type of ancestry in one isolated 40,000-year-old sample from Tianyuan can offer very interesting information on potential population movements to the region. However, the identification of ethnolinguistic communities and their migrations among neighbouring groups in Neolithic or Bronze Age groups is evidently not that simple.

Yamnaya (Indo-European peoples) and their evolution in the steppes, together with North Pontic (eventually Uralic) peoples.Notice how little Indo-European ancestry changes from Khvalynsk (Indo-Anatolian) to Yamna Hungary (North-West Indo-Europeans) Image modified from Wang et al. (2018). See more on the evolution of “steppe ancestry”.

It is becoming more and more clear with each paper that the true “Yamnaya ancestry” – not the originally described one – was in fact associated with Indo-Europeans (see more on the very Yamnaya-like Yamna Hungary and early East Bell Beaker R1b samples, all of quite similar ancestry and PCA cluster before their further admixture with EEF- and CWC-like groups).

The so-called “steppe ancestry”, on the other hand, reflects the contribution of a Northern Caucasus-related ancestry to expanding Khvalynsk settlers, who spread through the steppes more than a thousand years before the expansion of Late Proto-Indo-Europeans with late Repin, and can thus be found among different groups related to the Pontic-Caspian steppes (see more on the emergence and evolution of “steppe ancestry”).

In fact, after the Yamna/Indo-European and Corded Ware/Uralic expansions, it is more likely to find “steppe ancestry” to the north and east in territories traditionally associated with Uralic languages, whereas to the south and west – i.e. in territories traditionally associated with Indo-European languages – it is more likely to find “EEF ancestry” with diminished “steppe ancestry”, among peoples patrilineally descended from Yamna settlers.

Y-DNA haplogroups, the only uniparental markers (see exceptions in mtDNA inheritance) – unlike ancestry percentages based on the comparison of a few samples and flawed study designs – do not admix, do not change, and therefore they do not lend themselves to infinite pet theories (see e.g. what David Reich has to say about R1b-P312 in Iberia directly derived from Yamna migrants in spite of their predominant EEF ancestry): their cultural continuity can only be challenged with carefully threaded linguistic, archaeological, and genetic data.


Trypillia and Greece Neolithic outliers: the smoking gun of Proto-Anatolian migrations?


(Continued from the post Corded Ware culture origins: The Final Frontier).

Looking at the PCA of Wang et al. (2018), I realized that Sredni Stog / Corded Ware peoples seem to lie somewhere between:

  • the eastern steppe (i.e. Khvalynsk-Yamna); and
  • Lower Danube and Balkan cultures affected by Anatolian- and steppe-related (i.e. Khvalynsk-Novodanilovka) migrations.

This multiethnic interaction of the western steppe fits therefore the complex archaeological description of events in the North Pontic, Lower Danube, and Dnieper-Dniester regions. Here are some interesting samples related to those long-lasting contacts:

1. I3719 (mtDNA H1, Y-DNA I2a2a) Ukraine Neolithic sample from Dereivka ca. 4949–4799 BC, described in Mathieson et al. (2018) as of “entirely northwestern-Anatolian-Neolithic-related ancestry”.

2. ANI163 from Varna I ca. 4711–4550 BC (mtDNA H7a1), and I2181 from Balkans Chalcolithic (Smyadovo, in Bulgaria) ca. 4500 BC (mtDNA HV15, Y-DNA R) show the first steppe ancestry in regions known to be affected by the expansion of Suvorovo chiefs.

3. The Yamna Bulgaria outlier (Y-DNA I2a2a1b1b), 3012-2900 calBCE, shows apparently an admixture with cultures of that region (but 1,500 years later).

Image modified from Wang et al. (2018). Samples projected in PCA of 84 modern-day West Eurasian populations (open symbols). Previously known clusters have been marked and referenced. Marked and labelled are the Balkan samples referenced in this text An EHG and a Caucasus ‘clouds’ have been drawn, leaving Pontic-Caspian steppe and derived groups between them. See the original file here.

Trypillia and Corded Ware

4. There is one ‘Trypillia outlier’ among five samples from the Verteba cave in Wang et al. (2018): I1927 (Y-DNA G2a2b2a1a1b1a1a1, mtDNA H1b), ca. 3619-2936 BC, a sample published previously in Nikitin et al. (2017) and Mathieson et al. (2017). We were very quick to dismiss Trypillia (three samples of haplogroup G2a, one sample E) and GAC as a source of Corded Ware admixture, but archaeology clearly shows important population movements at the end of the fourth millennium between late Trypillia groups, GAC, and post-Sredni Stog populations, and genetics is showing that in both cultures, too.

I am not a fan of the ‘lack of samples’ argument, but (similar to Old Hittite samples related to all Anatolian speakers) one site is not enough to describe a culture that spanned millennia and many different early and late groups. One among five Trypillian samples (from a single site), showing a late date (ca. 3228 BC) compared to the other samples (ca. 3700 BC), and quite close to the only three Ukraine Eneolithic samples we have may mean much more than what we may a priori think, i.e. some simplistic unidirectional punctual ‘intrusion’ of steppe ancestry, and instead hint at the known close contacts of late Trypillian groups and North Pontic cultures, including also the Caucasus.

NOTE. The big difference in PCA among GAC-like Hungary LCA – EBA samples (see above two star symbols close to Ukraine Neolithic outlier in the PCA, in contrast with the other three at the bottom) may also be significant, although we don’t have any data about their culture, sites, or the relationship between them.

Location of Verteba Cave in relation to different stages and neighbouring groups of the Cucuteni-Trypillia culture. Image from the paper A Subterranean Sanctuary of the Cucuteni-Trypillia Culture in Western Ukraine, by Kadrow and Pokutta (2016).

Greece Neolithic outlier: Proto-Anatolians?

5. Especially interesting is I6423, one of the Greece Neolithic samples referred to in Wang et al. (2018), which is obviously an outlier among the three used in the paper. It does not seem to correspond to any of the ancient DNA samples published to date; it is not in Hofmanova et al. (2016), in Lazaridis et al. (2017), or in Mathieson et al. (2018).

Since the Neolithic in Greece could mean any period from ca. 6500 BC to ca. 3200 BC, I guess we are talking here about some migration related to the expansion of Khvalynsk-Novodanilovka chieftains after ca. 4500 BC, because it appears on the PCA precisely on the same spot as Varna and Smyadovo outliers, and its ADMIXTURE shows similar components

Image modified from Wang et al. (2018). “ADMIXTURE results of relevant prehistoric individuals mentioned in the text (filled symbols)”. ‘Outlier’ samples referred to in this post have been marked in red. See the original file here.

So, this may be the smoking gun of Proto-Anatolian (or maybe early Common Anatolian) expansion with steppe migrants up to the border of Western Anatolia, and we may be able to get rid of those unfounded doubts about Anatolian origins once and for all…

NOTE. Also interesting seems another Greece Neolithic sample, I6420, in ADMIXTURE, although its position in the PCA (near Minoans and Mycenaeans) does not necessarily point to potential steppe influence, but rather to the extra ‘eastern (Caucasus/Iran-related) ancestry’ contribution found in Minoans and in Mycenaeans (and Anatolia Chalcolithic) compared to previous samples of the region. The third Greece Nelithic sample, I5427 (mtDNA K1a24), from Diros, Alepotrypa Cave, is dated 6005-5879 BC (mean 5892 BC), and appeared first in Mathieson et al. (2018).

Modified from Wang et al. (2018) (Greece_Neolithic in red). Supplementary Table 6. P values of rank=1 in modelling the two-way admixture in the Caucasus cluster. Right populations: Mbuti.DG, Ust_Ishim.DG, Kostenki14, MA1, Han.DG, Papuan.DG, Onge.DG, Villabruna, Vestonice16, ElMiron, Ethiopia_4500BP.SG, Karitiana.DG, Natufian, Iran_Ganj_Dareh_Neolithic. Source 2 populations in bold print are used as examples in modelling the Caucasus cluster groups (See Supplementary Table 7).
Modified from Wang et al. (2018) (Greece_Neolithic in red). Supplementary Table 10. P values of rank=2 and ancestry proportions in modelling a three-way admixture in the Caucasus cluster testing additional contribution from Iran_ChL. Here, we used an extended set of outgroup populations populations to constrain standard errors: Right populations: Mbuti.DG, Ust_Ishim.DG, Kostenki14, MA1, Han.DG, Papuan.DG, Onge.DG, Villabruna, Vestonice16, ElMiron, Ethiopia_4500BP.SG, Karitiana.DG, Natufian, Iran_Ganj_Dareh_Neolithic, EHG, WHG, Levant_N.

If this Greece Neolithic sample is not related to Yamna migrations – and its use for statistical analysis of Caucasus samples from Wang et al. (2018) suggests that it is not – , it may have important consequences:

If it is located near the Western Anatolian coast – especially near Troy – there won’t be much to add about the potential site of entry of Common Anatolian languages into Anatolia… I have read some comments about how ‘impossible’ it was for steppe migrants and their language to ‘invade’ the more advanced cultures of Anatolia from the west, but it seems as ‘impossible’ as it was for Barbarians to invade the Roman Empire and impose their language as elites in certain regions. (And yes, we have at least one important weak political period among Middle Eastern cultures in the early 3rd millennium BC, similar to the period of the Fall of the Western Roman Empire).

Most likely Proto-Anatolian expansion in the North Pontic and Balkan area with Khvalynsk-Novodanilovka chieftains, including ADMIXTURE data from Wang et al. (2018).

If it is located somewhere more ‘central’ in the Greek Peninsula, then it could also be used to support the Anatolian nature of the controversial Pre-Greek (‘Pelasgian’) substrate. While we know that Greek (at least since Mycenaean) shows a strong Pre-Greek cultural and linguistic heritage (also reflected in its genetic continuity), the nature of that language is usually believed to be non-Indo-European, and Anatolian contacts are rather few and coincident with the Mycenaean period. I don’t think this sample can tell much about the Pre-Greek language, though, because – if it is really Neolithic, and comparing it with later Minoan and Mycenaean samples – it seems a clear outlier.

Heyd (2016): The Southeast European distribution of graves of the Suvorovo-Novodanilovka group and such unequipped ones mentioned in the text which can be attributed by burial custom and stratigraphic position in the barrow, plus zoomorphic and abstract animal head sceptres as well as specific maceheads with knobs as from Decea Maresului (mid-5th millennium until around 4000 BC). The site in the south-west Balkans is Suvodol-Šuplevec, Northern Macedonia (FYROM).

If it is, however, related to later Yamna migrations after ca. 3000 BC (and, like the ‘Ukraine Eneolithic’ sample that is likely from Catacomb, it is classified as Neolithic just because it cannot be attributed to precise Helladic periods), then we may be in front of the first obvious Yamna migrants in Greece. If that is the case (which I doubt), the sample wouldn’t be so informative for PIE dialectal expansions, because by now it is evident that we will find steppe ancestry and R1b-Z2103 subclades accompanying Yamna migrants in the southern Balkans, and probably well into Mycenaean Greece.

NOTE. Whatever the case, I am sure that for those fond of absurd autochthonous continuity theories, as well as for anti-steppe conspirationists, this sample will be just another good way of arguing for anything, ranging from a rejection of the Middle PIE – Late PIE division, to a support for some mythic ancient autochtonous Proto-Graeco-Anatolian group, or maybe some ancient Graeco–Indo-Slavonic split, or whatever new dialectal stage one can invent to support the own genealogical fantasies…

So, if it actually is a Neolithic sample, let’s hope that it shows a clear R1b-M269 (xL23 or early L23) subclade distinct from those (likely Z2103) expanded later with Late PIE-speaking Yamna (and probably to be found among Mycenaeans), so that there can be no more place for ethnic fantasies.

EDIT (28 JUL 2018): Added information on Greece Neolithic and Trypillia samples


Kortlandt: West Indo-Europeans along the Danube, Germanic and Balto-Slavic share a Corded Ware substrate


New paper (behind paywall) The Expansion of the Indo-European Languages, by Frederik Kortlandt, JIES (2018) 46(1 & 2):219-231.


When considering the way the Indo-Europeans took to the west, it is important to realize that mountains, forests and marshlands were prohibitive impediments. Moreover, people need fresh water, all the more so when traveling with horses. The natural way from the Russian steppe to the west is therefore along the northern bank of the river Danube. This leads to the hypothesis that the western Indo-Europeans represent successive waves of migration along the Danube and its tributaries. The Celts evidently followed the Danube all the way to southern Germany. The ancestors of the Italic tribes, including the Veneti, may have followed the river Sava towards northern Italy. The ancestors of Germanic speakers apparently moved into Moravia and Bohemia and followed the Elbe into Saxony. A part of the Veneti may have followed them into Moravia and moved along the Oder through the Moravian Gate into Silesia. The hypothetical speakers of Temematic probably moved through Slovakia along the river Orava into western Galicia. The ancestors of speakers of Balkan languages crossed the lower Danube and moved to the south. This scenario is in agreement with the generally accepted view of the earliest relations between these branches of Indo-European.

The western Indo-European vocabulary in Baltic and Slavic is the result of an Indo-European substratum which contained an older non-Indo-European layer and was part of the Corded Ware horizon. The numbers show that a considerable part of the vocabulary was borrowed after the split between Baltic and Slavic, which came about when their speakers moved westwards north and south of the Pripet marshes. These events are older than the westward movement of the Slavs which brought them into contact with Temematic speakers. One may conjecture that the Venedi occupied the Oder basin and then expanded eastwards over the larger part of present-day Poland before the western Balts came down the river Niemen and moved onwards to the lower Vistula. We may then identify the Venedic expansion with the spread of the Corded Ware horizon and the westward migration of the Balts and the Slavs with their integration into the larger cultural complex. The theory that the Venedi separated from the Veneti in the upper Sava region and moved through Moravia and Silesia to the Baltic Sea explains the “im Namenmaterial auffällige Übereinstimmung zwischen dem Baltikum und den Gebieten um den Nordteil der Adria” (Udolph 1981: 61). The Balts probably moved in two stages because the differences between West and East Baltic are considerable.

Instead of reinterpreting his views in light of the recent genetic finds, Kortlandt tries to mix in this paper his own old theories (see his paper Baltic, Slavic, Germanic) with the recent interpretations of genetic papers, using also dubious secondary sources – e.g. Iversen and Kroonen (2017) or Klejn (2017) [see here, and here] – which, in my opinion, creates a potentially dangerous circular reasoning.

For example, even though he criticizes the general stance of recent genetic papers with regard to Proto-Indo-European dialectalization and expansion as too early, and he supports the Danube expansion route, he nevertheless follows their interpretations in accepting that Corded Ware was Indo-European (following the newest model proposed by Anthony):

The [Yamnaya] penetrated central and northern Europe from the lower Danube through the Carpathian basin, not from the east. The Carpathian basis was evidently the cradle of the Corded Ware cultures, where the descendants of the Yamnaya mixed with the local early farmers before proceeding to the north. The development has a clear parallel in the Middle Ages, when the Hungarians mixed with the local Slavic populations in the same territory (cf. Kushniarevich & al. 2015).

He still follows his good old Indo-Slavonic group in the east, but at the same time maintains Kallio’s view that there were no early Uralic loanwords in Balto-Slavic, and also Kallio’s (and the general) view that there were close contacts with PIE and Pre-Proto-Indo-Iranian…

NOTE. The latest paper on Eurasian migrations by Damgaard et al. (Nature 2018), which shows mainly Proto-Iranians dominating over East Europe after the Early Bronze Age, have left still fewer space for a Proto-Balto-Slavic group emerging from the east.

Also, he asserts the following, which is a rather weird interpretation of events:

It appears that the Corded Ware horizon spread to southern Scandinavia (cf. Iversen & Kroonen 2017) but not to the Baltic region during the Neolithic.

“However, we also find indications of genetic impact from exogenous populations during the Neolithic, most likely from northern Eurasia and the Pontic Steppe. These influences are distinct from the Anatolian-farmer-related gene flow found in Central Europe during this period.”

It follows that the Indo-Europeans did not reach the Baltic region before the Late Neolithic. The influx of non-local people from northern Eurasia may be identified with the expansion of the Finno-Ugrians, who came into contact with the Indo-Europeans as a result of the eastward expansion of the latter in the fourth millennium. This was long before the split between Balto-Slavic and Indo-Iranian.

In the Late Neolithic there was “a further population movement into the regions surrounding the Baltic Sea” that was “accompanied by the first evidence of extensive animal husbandry in the Eastern Baltic”, which “suggests import of the new economy by an incoming steppe-like population independent of the agricultural societies that were already established to the south and west of the Baltic Sea.” (Mittnik & al. 2018). These may have been the ancestors of Balto-Slavic speakers. At a later stage, the Corded Ware horizon spread eastward, giving rise to farming ancestry in Eastern Baltic individuals and to a female gene-flow from the Eastern Baltic into Central Europe (ibidem).

Late Copper Age migrations in Asia ca. 2800-2300 BC.

He is a strong Indo-Uralic supporter, and supports a parallel Indo-European – Uralic development in Eastern Europe, and (as you can read) he misunderstands the description of population movements in the Baltic region, and thus misplaces Finno-Ugric speakers as Eurasian migrants arriving in the Baltic from the east during the Late Neolithic, before the Corded Ware expansion, which is not what the cited papers implied.

NOTE. Such an identification of westward Neolithic migrations with Uralic speakers is furthermore to be rejected following the most recent paper on Fennoscandian samples.

He had previously asserted that the substrate common to Germanic and Balto-Slavic is Indo-European with non-Indo-European substrate influence, so I guess that Corded Ware influencing as a substrate both Germanic and Balto-Slavic is the best way he could put everything together, if one assumes the widespread interpretations of genetic papers:

Thus, I think that the western Indo-European vocabulary in Baltic and Slavic is the result of an Indo-European substratum which contained an older non-Indo-European layer and was part of the Corded Ware horizon. The numbers show that a considerable part of the vocabulary was borrowed after the split between Baltic and Slavic, (…)

NOTE. It is very likely that this paper was sent in late 2017. That’s the main problem with traditional publications including the most recent genetic investigation: by the time something gets eventually published, the text is already outdated.

I obviously share his opinion on precedence of disciplines in Indo-European studies:

The methodological point to be emphasized here is that the linguistic evidence takes precedence over archaeological and genetic data, which give no information about the languages spoken and can only support the linguistic evidence. The relative chronology of developments must be established on the basis of the comparative method and internal reconstruction. The location of a reconstructed language can only be established on the basis of lexical and onomastic material. On the other hand, archaeological or genetic data may supply the corresponding absolute chronology. It is therefore incorrect to attribute cultural influences in southern Scandinavia and the Baltic region in the third millennium to Germanic or Baltic speakers because these languages did not yet exist. While the Italo-Celtic branch may have separated from its Indo-European neighbors in the first half of the third millennium, Proto-Balto-Slavic and Proto-Indo-Iranian can be dated to the second millennium and Proto-Germanic to the end of the first millennium BC (cf. Kortlandt 2010: 173f., 197f., 249f.). The Indo-Europeans who moved to southern Scandinavia as part of the Corded Ware horizon were not the ancestors of Germanic speakers, who lived farther to the south, but belonged to an unknown branch that was eventually replaced by Germanic.

I hope we can see more and more anthropological papers like this, using traditional linguistics coupled with archaeology and the most recent genetic investigations.

EDIT (4 JUL 2018): Some errors corrected.


Yamna/Afanasevo elite males dominated by R1b-L23, Okunevo brings ancient Siberian/Asian population


Open access paper New genetic evidence of affinities and discontinuities between bronze age Siberian populations, by Hollard et al., Am J Phys Anthropol. (2018) 00:1–11.

NOTE. This seems to be a peer-reviewed paper based on a more precise re-examination of the samples from Hollard’s PhD thesis, Peuplement du sud de la Sibérie et de l’Altaï à l’âge du Bronze : apport de la paléogénétique (2014).

Interesting excerpts:

Afanasevo and Yamna

The Afanasievo culture is the earliest known archaeological culture of southern Siberia, occupying the Minusinsk-Altai region during the Eneolithic era 3600/3300 BC to 2500 BC (Svyatko et al., 2009; Vadetskaya et al., 2014). Archeological data showed that the Afanasievo culture had strong affinities with the Yamnaya and pre-Yamnaya Eneolithic cultures in the West (Grushin et al., 2009). This suggests a Yamnaya migration into western Altai and into Afanasievo. Note that, in most current publications, “the Yamnaya culture” combines the so-called “classical Yamnaya culture” of the Early Bronze Age and archeological sites of the preceding Repin culture in the middle reaches of the Don and Volga rivers. In the present article we conventionally use the term Yamnaya in the same sense, in which case the beginning of the “Yamnaya culture” can be dated after the middle of the 4th millennium BC, when the Afanasievo culture appeared in the Altai.

Because of numerous traits attributed to early Indo-Europeans and cultural relations with Kurgan steppe cultures, members of the Afanasievo culture are believed to have been Indo-European speakers (Mallory and Mair, 2000). In a recent whole-genome sequencing study, Allentoft et al. (2015) concluded that Eastern Yamnaya individuals and Afanasievo individuals were genetically indistinguishable. Moreover, this study and one published concurrently by Haak et al. (2015) analyzed 11 Eastern Yamnaya males and showed that all of them belonged to the R1b1a1a (formerly R1b1a) (…)

Early Chalcolithic migrations ca. 3300-2600 BC.

Published works indicate that R1b was a predominant haplogroup from the late Neolithic to the early Bronze Age, notably in the Bell Beaker and Yamnaya cultures (Allentoft et al., 2015; Haak et al., 2015; Lee et al., 2012; Mathieson et al., 2015). Nearly 100% of the Afanasievo men we typed belonged to the R1b1a1a subhaplogroup and, for at least three of them, more precisely to the L23 (xM412) subclade. (…)

(…) our results therefore support the hypothesis of a genetic link between Afanasievo and Yamnaya. This also suggests that R1b was indeed dominant in the early Bronze Age Siberian steppe, at least in individuals that were buried in kurgans (possibly an elite part of the population). The geographical and temporal distribution of subhaplogroup R1b1a1a supports the hypothesis of population expansion from West to East in the Eurasian steppe during this period. It should however be noted that the Yamnaya burials from which the samples for DNA analysis were obtained (Allentoft et al., 2015; Haak et al., 2015; Mathieson et al., 2015) were dated within the limits of the Afanasievo period. Ancestors of both East Yamnaya and Afanasievo populations must therefore be sought in the context of earlier Eneolithic cultures in Eastern Europe. Sufficient Y-chromosomal data from such Eneolithic populations is, unfortunately, not yet available.

Mitochondrial- (A) and Y- (B) haplogroup distribution in studied populations

Okunevo and paternal lineage shift in South Siberia

Results obtained in the current study, from more than a dozen Okunevo individuals belonging to the earliest stage of Okunevo culture, that is the Uibat period (2500–2200 BC) (Lazaretov, 1997), suggest a discontinuity in the genetic pool between Afanasievo and Okunevo cultures. Although Y-chromosomal data obtained for bearers of the Okunevo culture showed that one individual carried haplogroup R1b, most Okunevo Y-haplogroups are representative of an Asian component represented by paternal lineages Q and NO1.

Okunevo carrier of Y-haplogroup Q1b1a-L54, which also supports this hypothesis (L54 being a marker of the lineage from which M3, the main Ameridian lineage, arose). Okunevo people could therefore be a remnant paleo-Siberian population with possible Afanasievo input, as suggested by the presence of the R1b1a1a2a subhaplogroup in one individual.

Late Chalcolithic migrations ca. 2600-2250 BC.

Replacement of Asian Indo-European elite lineages by R1a

Published genetic data from the late Bronze Age Andronovo culture from the Minusinsk Basin (Keyser et al., 2009), the Sintashta culture from Russia (Allentoft et al., 2015) and the Srubnaya culture from the region of Samara (Mathieson et al., 2015), show that males did not belong to Y-haplogroup R1b but mostly to R1a clades: there appears to have been a change in the dominant Y-chromosomal haplogroup between the early and the late Bronze Age in these regions. Moreover, as described in Allentoft et al. (2015), the Andronovo and Sintashta peoples were closely related to each other but clearly distinct from both Yamnaya and Afanasievo. Although these results do not imply that Y-haplogroup R1b was entirely absent in these later populations, they could correspond to a replacement of the elite between these two main periods and therefore a difference in the haplogroups of the men that were preferentially buried.

Early Bronze Age migrations ca. 2250-1750 BC.

Afanasevo and the Tarim Basin

The discovery, in the Tarim Basin, of well-preserved mummies from the Bronze Age allows for the construction of two hypotheses regarding the peopling of the Xinjiang province at this period. The “steppe hypothesis,” argues for a link with nomadic steppe herders (Hemphill and Mallory, 2004), possibly represented in this case by Afanasievo populations and their descendants (Mallory and Mair, 2000). However, newly published cultural data from the burial grounds of Gumugou (Wang, 2014) and Xiaohe (Xinjiang, 2003, 2007) shows material culture and burial rites incompatible with the Afanasievo culture. The earliest 14C date for Tarim Basin burials would place them at the turn of the 2nd millenium BC (Wang, 2013), 500 years after the Afanasievo period.

Instead, early Gumugou and Xiaohe burial grounds were contemporary with the start of the Andronovo period. Likewise, the Bronze Age population of the Xinjiang at Gumugou/Qäwrighul is not phenotypically closest to Afanasievo but to the Andronovo (Fedorovo) group of northeastern Kazakhstan and western Altai (Kozintsev, 2009). Our investigations demonstrate that Y-chromosomal lineage composition is also compatible with the notion that the ancient Tarim population was genetically distinct from the Afanasievo population. The only Y-haplogroup found by Li et al. (2010) in the Bronze Age Tarim Basin population was Y-haplogroup R1a, which suggests a proximity of this population with Andronovo groups rather than Afanasievo groups.

I don’t think these finds are much of a surprise based on what we already know, or need much explanation…

I would add that, once again, we have more proof that the movement of Okunevo and related ancient Siberian migrants from Central or North Asia will not be able to explain the presence of Uralic languages spread over North-East Europe and Scandinavia already during the Bronze Age.

Also interesting is to read in more peer-reviewed papers the idea of Late Indo-European speakers clearly linked to the expansion of patrilineally-related elite males marked by haplogroup R1b-L23, most likely since Eneolithic Khvalynsk/Repin cultures.


Consequences of Damgaard et al. 2018 (I): EHG ancestry in Maykop samples, and the potential Anatolian expansion routes


This is part I of two posts on the most recent data concerning the earliest known Indo-European migrations.

Anatolian in Armi

I am reading in forums about “Kroonen’s proposal” of Anatolian in the 3rd millennium. That is false. The Copenhagen group (in particular the authors of the linguistic supplement, Kroonen, Barjamovic, and Peyrot) are merely referencing Archi (2011. “In Search of Armi”. Journal of Cuneiform Studies 63: 5–34) in turn using transcriptions from Bonechi (1990. “Aleppo in età arcaica; a proposito di un’opera recente”. Studi Epigrafici e Linguistici sul Vicino Oriente Antico 7: 15–37.), who asserted the potential Anatolian origin of the terms. This is what Archi had to say about this:

Most of these personal names belong to a name-giving tradition different from that of Ebla; Arra-ti/tulu(m) is attested also at Dulu, a neighbouring city-state (Bonechi 1990b: 22–25).28 We must, therefore, deduce that Armi belonged to a marginal, partially Semitized linguistic area different from the ethno-linguistic region dominated by Ebla. Typical are masculine personal names ending in -a-du: A-la/li-wa-du/da, A-li/lu-wa-du, Ba-mi-a-du, La-wadu, Mi-mi-a-du, Mu-lu-wa-du. This reminds one of the suffix -(a)nda, -(a)ndu, very productive in the Anatolian branch of Indo-European (Laroche 1966: 329). Elements such as ali-, alali-, lawadu-, memi-, mula/i- are attested in Anatolian personal names of the Old Assyrian period (Laroche 1966: 26–27, 106, 118, 120).

Ebla’ first kingdom at its height c. 2340 BC. Hipothetical location of Armi depicted. The first Eblaite kingdom extended from Urshu in the north,1 to Damascus area in the south.2 And from Phoenicia and the coastal mountains in the west,3 4 to Tuttul,5 and Haddu in the east.6 The eastern kingdom of Nagar controlled most of the Khabur basin from the river junction with the Euphrates to the northwestern part at Nabada.7 Page 101. From Wikipedia.

This was used by Archi to speculatively locate the state of Armi, in or near Ebla territory, which could correspond with the region of modern north-western Syria:

The onomastic tradition of Armi, so different from that of Ebla and her allies (§ 5), obliges us to locate this city on the edges of the Semitized area and, thus, necessarily north of the line running through Hassuwan – Ursaum – Irritum – Harran. If Armi were to be found at Banat-Bazi, it would have represented an anomaly within an otherwise homogenous linguistic scenario.34

Taken as a whole, the available information suggests that Armi was a regional state, which enjoyed a privileged relationship with Ebla: the exchange of goods between the two cities was comparable only to that between Ebla and Mari. No other state sent so many people to Ebla, especially merchants, lú-kar. It is only a hypothesis that Armi was the go-between for Ebla and for the areas where silver and copper were extracted.

This proposal is similar to the one used to support Indo-Aryan terminology in Mittanni (ca. 16th-14th c. BC), so the scarce material should not pose a problem to those previously arguing about the ‘oldest’ nature of Indo-Aryan.

NOTE. On the other hand, the theory connecting ‘mariannu‘, a term dated to 1761 BC (referenced also in the linguistic supplement), and put in relation with PIIr. *arya, seems too hypothetical for the moment, although there is a clear expansion of Aryan-related terms in the Middle East that could support one or more relevant eastern migration waves of Indo-Aryans from Asia.

Potential routes of Anatolian migration

Once we have accepted that Anatolian is not Late PIE – and that only needed a study of Anatolian archaisms, not the terminology from Armi – , we can move on to explore the potential routes of expansion.

On the Balkan route

A current sketch of the dots connecting Khvalynsk with Anatolia is as follows.

1—39 — sceptre bearers of the type Giurgiuleşti and Suvorovo; 40—60 — Gumelniţa-Varna-Bolgrad-Aldeni cultural sphere; 61 — Fălciu; 62 — Cainari; 63 — Giurgiuleşti; 64 — Suvorovo; 65 — Casimcea; 66 — Kjulevča; 67 — Reka Devnja; 68 — Drama; 69 — Gonova Mogila; 70 — Reževo.

First, we have the early expansion of Suvorovo chieftains spreading from ca. 4400-4000 BC in the lower Danube region, related to Novodanilovka chiefs of the North Pontic region, and both in turn related to Khvalynsk horse riders (read a a recent detailed post on this question).

Then we have Cernavoda I (ca. 3850-3550 BC), a culture potentially derived from the earlier expansion of Suvorovo chiefs, as shown in cultural similarities with preceding cultures and Yamna, and also in the contacts with the North Pontic steppe cultures (read a a recent detailed post on this question).

We also have proof of genetic inflow from the steppe into populations of cultures near those suggested to be heirs of those dominated by Suvorovo chiefs, from the 5th millennium BC (in Varna I ca. 4630 BC, and Smyadovo ca. 4500 BC, see image below).

If these neighbouring Balkan peoples of ca. 4500 BC are taken as proxies for Proto-Anatolians, then it becomes quite clear why Old Hittite samples dated 3,000 years after this migration event of elite chiefs could show no or almost no ancestry from Europe (for this question, read my revision of Lazaridis’ preprint).

NOTE. A full account of the crisis in the lower Danube, as well as the Suvorovo-Novodanilovka intrusion, is available in Anthony (2007).

Modified image, including PCA and supervised ADMIXTURE data from Mathieson et al. (2018). Blue arrow represents incoming ancestry from Suvorovo chiefs, red line represents distance from the majority of the neighbouring Balkan population in this period studied to date. Northwestern-Anatolian Neolithic (grey), Yamnaya from Samara (yellow), EHG (pink) and WHG (green).

The southern Balkans and Anatolia

The later connection of Cernavoda II-III and related cultures (and potentially Ezero) with Troy, on the other hand, is still blurry. But, even if a massive migration of Common Anatolian is found to happen from the Balkans into Anatolia in the late 4th / beginning of the 3rd millennium, the people responsible for this expansion could show a minimal trace of European ancestry.

A new paper has appeared recently (in Russian), Dubene and Troy: Gold and Prosperity in the Third Millennium Cal. BCE in Eurasia. Stratum Plus, 2 (2018), by L. Nikolova, showing commercial contacts between Troy and cultures from Bulgaria:

Earlier third millennium cal BCE is the period of development of interconnected Early Bronze Age societies in Eurasia, which economic and social structures expressed variants of pre-state political structures, named in the specialized literature tribes and chiefdoms. In this work new arguments will be added to the chiefdom model of third millennium cal BC societies of Yunatsite culture in the Central Balkans from the perspectives of the interrelations between Dubene (south central Bulgaria) and Troy (northwest Turkey) wealth expression.

Possible explanations of the similarity in the wealth expression between Troy and Yunatsite chiefdoms is the direct interaction between the political elite. However, the golden and silver objects in the third millennium cal BCE in the Eastern Mediterranean are most of all an expression of economic wealth. This is the biggest difference between the early state and chiefdoms in the third millennium cal BCE in Eurasia and Africa. The literacy and the wealth expression in the early states was politically centralized, while the absence of literacy and wider distribution of the wealth expression in the chiefdoms of the eastern Mediterranean are indicators, that wider distribution of wealth and the existed stable subsistence layers prevented the formation of states and the need to regulate the political systems through literacy.

The only way to link Common Anatolians to their Proto-Anatolian (linguistic) ancestors would therefore be to study preceding cultures and their expansions, until a proper connecting route is found, as I said recently.

These late commercial contacts in the south-eastern Balkans (Nikolova also offers a simplified presentation of data, in English) are yet another proof of how Common Anatolian languages may have further expanded into Anatolia.

NOTE. One should also take into account the distribution of modern R1b-M269* and L23* subclades (i.e. those not belonging to the most common subclades expanding with Yamna), which seem to peak around the Balkans. While those may just belong to founder effects of populations preceding Suvorovo or related to Yamna migrants, the Balkans is a region known to have retained Y-DNA haplogroup diversity, in contrast with other European regions.

On a purely linguistic aspect, there are strong Hattic and Hurrian influences on Anatolian languages, representing a unique layer that clearly differentiates them from LPIE languages, pointing also to different substrates behind each attested Common Anatolian branch or individual language:

  • Phonetic changes, like the appearance of /f/ and /v/.
  • Split ergativity: Hurrian is ergative, Hattic probably too.
  • Increasing use of enclitic pronoun and particle chains after first stressed word: in Hattic after verb, in Hurrian after nominal forms.
  • Almost obligatory use of clause initial and enclitic connectors: e.g. semantic and syntactic identity of Hattic pala/bala and Hittite nu.

NOTE. For a superficial discussion of this, see e.g. An Indo-European Linguistic Area and its Characteristics: Ancient Anatolia. Areal Diffusion as a Challenge to the Comparative Method?, by Calvert Watkins. You can also search for any of the mentioned shared isoglosses between Middle Eastern languages and Anatolian if you want more details.

On the Caucasus route

It seems that the Danish group is now taking a stance in favour of a Maykop route (from the linguistic supplement):

The period of Proto-Anatolian linguistic unity can now be placed in the 4th millennium BCE and may have been contemporaneous with e.g. the Maykop culture (3700–3000 BCE), which influenced the formation and apparent westward migration of the Yamnaya and maintained commercial and cultural contact with the Anatolian highlands (Kristiansen et al. 2018).

In fact, they have data to support this:

The EHG ancestry detected in individuals associated with both Yamnaya (3000–2400 BCE) and the Maykop culture (3700–3000 BCE) (in prep.) is absent from our Anatolian specimens, suggesting that neither archaeological horizon constitutes a suitable candidate for a “homeland” or “stepping stone” for the origin or spread of Anatolian Indo- European speakers to Anatolia. However, with the archaeological and genetic data presented here, we cannot reject a continuous small-scale influx of mixed groups from the direction of the Caucasus during the Chalcolithic period of the 4th millennium BCE.

While it is difficult to speak about the consequences of this find without having access to this paper in preparation or its samples, we already knew that Maykop had obvious cultural contacts with the steppe.

It will not be surprising to find not only EHG, but also R1b-L23 subclades there. In my opinion, though, the most likely source of EHG ancestry in Maykop (given the different culture shown in other steppe groups) is exogamy.

The question will still remain: was this a Proto-Anatolian-speaking group?

Diachronic map of Eneolithic migrations ca. 4000-3100 BC

My opinion in this regard – again, without access to the study – is that you would still need to propose:

  • A break-up of Anatolian ca. 4500 BC represented by some early group migrating into the Northern Caucasus area.
  • For this group – who were closely related linguistically and culturally to early Khvalynsk – to remain isolated in or around the Northern Caucasus, i.e. somehow ‘hidden’ from the evolving LPIE speakers in late Khvalynsk/early Yamna peoples.
  • Then, they would need to have migrated from Maykop to Anatolian territory only after ca. 3700 BC – while having close commercial contacts with Khvalynsk and the North Pontic cultures in the period 3700-3000 BC -, in some migration wave that has not showed up in the archaeological records to date.
  • Then appear as Old Hittites without showing EHG ancestry (even though they show it in the period 3700-3000 BC), near the region of the Armi state, where Anatolian was supposedly spoken already in the mid-3rd millennium.

Not a very convincing picture, right now, but indeed possible.

Also, we have R1b-Z2103 lineages and clear steppe ancestry in the region probably ca. 2500 BC with Hajji Firuz, which is most likely the product of the late Khvalynsk migration waves that we are seeing in the recent papers.

These migrations are then related to early LPIE-speaking migrants spreading after ca. 3300 BC – that also caused the formation of early Yamna and the expansion of Tocharian-related migrants – , which leaves almost no space for an Anatolian expansion, unless one supports that the former drove the latter.

NOTE. In any case, if the Caucasus route turned out to be the actual Anatolian route, I guess this would be a way as good as any other to finally kill their Indo-European – Corded Ware theory, for obvious reasons.

On the North Iranian homeland

A few thoughts for those equating CHG ancestry in IE speakers (and especially now in Old Hittites) with an origin in North Iran, due to a recent comment by David Reich:

In the paper it is clearly stated that there is no Neolithic Iranian ancestry in the Old Hittite samples.

Ancestry is not people, and it is certainly not language. The addition of CHG ancestry to the Eneolithic steppe need not mean a population or linguistic replacement. Although it could have been. But this has to be demonstrated with solid anthropological models.

NOTE. On the other hand, if you find people who considered (at least until de Barros Damgaard et al. 2018) steppe (ancestry/PCA) = Indo-European, then you should probably confront them about why CHG in Hittites and the arrival of CHG in steppe groups is now not to be considered the same, i.e why CHG / Iran_N ≠ PIE.

Since there has been no serious North Iranian homeland proposal made for a while, it is difficult to delineate a modern sketch, and I won’t spend the time with that unless there is some real anthropological model and genetic proof of it. I guess the Armenian homeland hypothesis proposed by Gamkrelidze and Ivanov (1995) would do, but since it relies on outdated data (some of which appears also in Gimbutas’ writings), it would need a full revision.

NOTE. Their theory of glottalic consonants (or ejectives) relied on the ‘archaism’ of Hittite, Germanic, and Armenian. As you can see (unless you live in the mid-20th century) this is not very reasonable, since Hittite is attested quite late and after heavy admixture with Middle Eastern peoples, and Germanic and Armenian are some of the latest attested (and more admixed, phonetically changed) languages.

This would be a proper answer, indeed, for those who would accept this homeland due to the reconstruction of ‘ejectives’ for these languages. Evidently, there is no need to posit a homeland near Armenia to propose a glottalic theory. Kortlandt is a proponent of a late and small expansion of Late PIE from the steppe, and still proposes a reconstruction of ejectives for PIE. But, this was the main reason of Gamkrelidze and Ivanov to propose that homeland, and in that sense it is obviously flawed.

Those claiming a relationship of the North Iranian homeland with such EHG ancestry in Maykop, or with the hypothetic Proto-Euphratic or Gutian, are obviously not understanding the implications of finding steppe ancestry coupled with (likely) early Late PIE migrants in the region in the mid-4th millennium.