Fast life history as adaptive regional response to less hospitable and unstable Early Indo-Iranian territory


Another interesting paper, Life in the fast lane: Settled pastoralism in the Central Eurasian Steppe during the Middle Bronze Age, by Judd et al. (2017).

Abstract (emphasis mine):

We tested the hypothesis that the purported unstable climate in the South Urals region during the Middle Bronze Age (MBA) resulted in health instability and social stress as evidenced by skeletal response. The skeletal sample (n = 99) derived from Kamennyi Ambar 5 (KA-5), a MBA kurgan cemetery (2040-1730 cal. BCE, 2 sigma) associated with the Sintashta culture. Skeletal stress indicators assessed included cribra orbitalia, porotic hyperostosis, dental enamel hypoplasia, and tibia periosteal new bone growth. Dental disease (caries, abscess, calculus, and periodontitis) and trauma were scored. Results were compared to regional data from the nearby Samara Valley, spanning the Early to Late Bronze Age (EBA, LBA). Lesions were minimal for the KA-5 and MBA-LBA groups except for periodontitis and dental calculus. No unambiguous weapon injuries or injuries associated with violence were observed for the KA-5 group; few injuries occurred at other sites. Subadults (<18 years) formed the majority of each sample. At KA-5, subadults accounted for 75% of the sample with 10% (n = 10) estimated to be 14-18 years of age. Skeletal stress markers and injuries were uncommon among the KA-5 and regional groups, but a MBA-LBA high subadult mortality indicates elevated frailty levels and inability to survive acute illnesses. Following an optimal weaning program, subadults were at risk for physiological insult and many succumbed. Only a small number of individuals attained biological maturity during the MBA, suggesting that a fast life history was an adaptive regional response to a less hospitable and perhaps unstable environment.

Interesting excerpt:

The low frequencies of violence-related trauma contrast sharply to the epidemic of skeletal violence observed during the Iron Age (8th-2nd centuries BC) at other regional sites, notably Aymyrlyg (Murphy, 2003). The paucity of weapon-related injuries among the Bronze Age groups may be the outcome of many factors. While weapons and chariots did exist, they could have had multi-functional contexts aside from warfare. Individuals killed in warfare may not be present if bodies were abandoned on battlefields or disposed of where the individual died. Alternatively, warfare may have involved the capture of humans in addition to material resources, such as herds or weapons, leaving no skeletal trace of physical violence (Martin, Harrod, & Fields, 2010; Wilkinson, 1997). Trauma analysis is further complicated by the lack of soft tissue, which is the target for those attempting to kill or immobilize their opponent (Judd, 2008; Judd & Redfern, 2012), and it is possible that violence-related injuries or burns sustained from metallurgy were absent because only the soft tissue was affected. The skeletal evidence for trauma is minimal at KA-5 and its contemporary sites, which may be partially attributed to the less than desirable preservation of the collections. Based on the skeletal material available, internal or external social tensions resulting in altercations are not supported.

The lack of material or skeletal evidence for warfare has encouraged a more optimistic interpretation of Steppe community relations living with environmental instability. Herding camps, such as that at Peschanyi Dol, provided evidence for assorted groups utilizing the site based on the clay sources of ceramic sherds found in the camp’s trash pit (Anthony, Brown, Kuznetsov, & Mochalov, 2016a). Anthony et al. (2016a, 2016b) suggested that herders shifted according to a schedule that permitted several settlements to use prime camp sites. They proposed a cooperative region-wide organization of groups that worked together in three key activities: mining, summer herding, and winter wolf-dog rituals (Anthony et al., 2016a). A similar regional social arrangement may have existed in the KA-5 vicinity and accords with the livestock management models proposed by Stobbe and colleagues (2016).

Demographic distribution of KA-5 and Samara Valley samples

Using the available sampling, and based on the absence of skeletal stress markers (in combination with the high subadult mortality among Sintashta samples), the study concludes that the available data cannot support the traditional view that the MBA was a period of social strife.

Since other Samara Valley samples do not follow a trend similar to that of Sintashta, a homogeneous, long-term relationship with the environment is suggested for this culture, independent of climatic shift or unpredictability.

We already know that R1a-Z645 subclades, which expanded with the Corded Ware culture, appeared in a Poltavka cemetery rather early, which, coupled with the incomplete replacement that we see in Early Indo-Iranian communities, suggests a gradual expansion of these subclades.

My limited, speculative proposal of how this lineage replacement took place was based precisely on this traditional description of partially isolated, warring communities:

The process by which this cultural assimilation happened in the Sintashta-Petrovka region, given the presupposed warring nature of their contacts, remains unclear. It is conceivable, in a region of highly fortified settlements, to think about alliances of different groups against each other, akin to the situation found in Bronze Age Europe: a minority of Abashevo chiefs and their families would dominate over certain fortified settlements and wage war against other, neighbouring tribes.

After a certain number of generations, the most successful settlements would have replaced the paternal lineages of the region – with only a slight drift to steppe admixture observed in PCA compared to Corded Ware –, while the majority of the population in these settlements – including females, commoners and slaves – retained the original Poltavka culture. R1b1a1a2a2-Z2103 lineages were mostly replaced in the region by haplogroup R1a1a1b2-Z93, as demonstrated by the later expansion of its subclades with Andronovo and Srubna cultures, and by present-day distribution of R1a1a1b2-Z93 lineages in Eurasia.

Now we see more proof for a likely bottleneck in a more peaceful (or, rather, cooperative) region, as recently described by Anthony. In fact, if you take a look at the sampling of the paper (which is obviously not randomised), Potapovka – coeval with Sintashta, but genetically more similar to the earlier Yamna and Poltavka – follows a less steep demographic distribution than Sintashta, with succeeding Srubna (which shows a marked shift toward the Corded Ware cluster) maintaining a similar demographic pattern…

I guess the answer is probably between both positions, war and environment; the main issue is which one was the most important contributing factor. If we judged the whole picture solely by the samples studied in this paper, the answer would be the environment.

In any case, even though we like to see every single paternal lineage substitution in a territory as necessarily linked with a meaningful migration coupled with ethnolinguistic change, sometimes this is not the case, as with the replacement of R1b-L23 lineages in Proto-Balto-Slavic and Proto-Indo-Iranian communities by R1a-Z645 subclades; the replacement of R1a-Z645 by N1c-L392 subclades in Uralic-speaking territories; or the replacement of native lineages by R1b-L51 subclades among Basques.


On Proto-Finnic language guesstimates, and its western homeland

A recent chapter: The Indo-Europeans and the Non-Indo-Europeans in Prehistoric Northern Europe, by Petri Kallio, in Language and Prehistory of the Indo-European Peoples: A Cross-Disciplinary Perspective, Copenhagen (2017).

Interesting excerpts (emphasis mine), especially when read in combination with the most recent papers on Early Indo-Iranian and Fennoscandian genomes:

Like the Indo-Europeanists, also the Uralicists suffer from their “school who wants it large and wants it early”. This time, however, the desired homeland is even larger and earlier, covering the whole northern half of Europe already at the end of the Ice Age (Wiik 2002). As a Finn, admittedly, I find such an idea very flattering indeed. As a historical linguist, however, I also find it absurd for the same reasons which I already gave above in the case of the Indo-Europeans.

True, linguistic palaeontology is less helpful in the case of the Uralians, even though especially Common Uralic *pata ‘clay pot’ and *wäśkä ‘copper’ indicate that Proto-Uralic was not spoken before the Subneolithic period, which in the East-Baltic area is dated about 5300–3200 BC. However, the most valuable evidence comes from the earliest Indo-European loanwords in the Uralic languages, which show that Proto-Uralic cannot have been spoken much earlier than Proto-Indo-European dated about 3500 BC (Koivulehto 2001: 235, 257).

As the same loanword evidence naturally also shows that the Uralic and Indo-European homelands were not located far from one another, the Uralic homeland can most likely be located in the Middle and Upper Volga region, right north of the Indo-European homeland*. From the beginning of the Subneolithic period about 5900 BC onwards, this region was an important innovation centre, from where several cultural waves spread to the Finnish Gulf area, such as the Sperrings Ware wave about 4900 BC, the Combed Ware wave about 3900 BC, and the Netted Ware wave about 1900 BC (Carpelan & Parpola 2001: 78–90).

* Interestingly, the only Uralicists who generally reject the Central Russian homeland are the Russian ones who prefer the Siberian homeland instead. Some Russians even advocate that the Central Russian homeland is only due to Finnish nationalism or, as one of them put it a bit more tactfully, “the political and ideological situation in Finland in the first decades of the 20th century” (Napolskikh 1995: 4). Still, some Finns (and especially those who also belong to the “school who wants it large and wants it early”) simultaneously advocate that exactly the same Central Russian homeland is due to Finnlandisierung (Wiik 2001: 466). Fortunately, I do not even need to resort to playing the politics card myself, because there is enough convincing evidence for the Central Russian homeland anyway.

Remarkably, the loanword evidence furthermore suggests that the ancestors of Finnic and Saamic had at least phonologically remained very close to Proto-Uralic as late as the Bronze Age (ca. 1700–500 BC). In particular, certain loanwords, whose Baltic and Germanic sources point to the first millennium BC, after all go back to the Finno-Saamic proto-stage, which is phonologically almost identical to the Uralic proto-stage (see especially the table in Sammallahti 1998: 198–202). This being the case, Dahl’s wave model could perhaps have some use in Uralic linguistics, too.

Even though Bronze-Age Finnic and Saamic were still two dialects rather than two languages, it does not mean that they would still have been spoken in a geographically limited area. On the contrary, their Indo-European loanwords dating to this period indicate that their speech areas were already geographically separate. The fact that at that time both Baltic and Germanic influenced Finnic much more strongly than Saamic must be considered a crucial piece of information when we are trying to locate the Finnic and Saamic homelands.

Iron Age migrations in Europe.

(…) the fact that Palaeo-Germanic loanwords are much more numerous in Finnic than in Saamic must lead to the same conclusion. As I noted above, the most likely Palaeo-Germanic speaking carriers of the Nordic Bronze culture (ca. 1700–500 BC) spread from Scandinavia to the Finnish and Estonian coastal areas. As they never spread any further to the east than as far as the bottom of the Finnish Gulf, the idea that the Finnic homeland included neither Finland nor Estonia completely fails to explain the very existence of Palaeo-Germanic loanwords, whose quantity and quality in Finnic presuppose a superstrate rather than an adstrate.

(…) as the Nordic Bronze culture influenced coastal Finland much more strongly than it did coastal Estonia, the idea that the Finnic homeland did not include Finland but Estonia alone similarly fails to explain the very strength of the Bronze-Age Palaeo-Germanic superstrate in Finnic, which can indeed be compared with the Medieval French superstrate in English, for instance (Kallio 2000: 96–97). From a Germanicist point of view, therefore, Itkonen’s theory concerning the Finnic homeland does not only seem to be the best but also the only alternative (Koivulehto 1984: 198–200).

As the same can now also be said about the Indo-Europeanization of the Baltic speech area, the fact that Baltic and Finnic are the most conservative branches of their language families and that they have relatively few substrate words may really be due to exactly the same reason, namely that before their arrival the East-Baltic region was still very sparsely populated by Subneolithic hunter-fisher-gatherers, whose linguistic influence on the newcomers was therefore rather limited. On the other hand, as these language shifts already took place millennia ago, there has been a lot of time for the Baltic and Finnic speakers to replace most of their old substrate words by all kinds of new lexical innovations.

Speaking of loanword evidence, the Aikios and especially Saarikivi (2004b) have furthermore argued that the Indo-Iranian loanwords occurring in Finnic and/or Saamic alone force us to locate the Finnic and Saamic homelands further to the east (e.g. near the White Lake). Still, I fail to see why the Indo-Iranian loanwords counted in dozens should be more relevant in locating these two homelands than the Germanic loanwords counted in hundreds. Besides, the Indo-Iranian loanwords mainly consist of cultural borrowings which do not necessarily presuppose a superstrate but only an adstrate. Moreover, they must be dated so much earlier than Vedic Sanskrit (ca. 1500–1000 BC) and Gathic Avestan (ca. 1000–800 BC) anyway that their spread can very well be connected with the abovementioned Netted Ware wave about 1900 BC.

An interesting read, where the author expressly refers to the many political (nationalist) and xenophobic overtones (including his own) that arise in ethnolinguistic identifications of prehistoric cultures.

We are seeing how the newest dialectalisation trends want it ‘late and small’, and ‘late’ corresponds smoothly with the most recent genomic findings involving Chalcolithic and Bronze Age expansions.

In the Uralic case, in North-Eastern Europe only Corded Ware migrants are known to have expanded within a suitable time frame into the region, and their patrilineal descendants show a widespread distribution in the region during the Bronze Age.

Also, if Proto-Finnic is coeval with Proto-Germanic, and expanded from the western part of North-East Europe (necessarily including the Gulf of Finland), well… You know the drill.

Of course, regarding Proto-Indo-European and Uralic, there are also a lot of people who still want it ‘large and early’ – and the most recent research won’t deter them from such proposals.


Consequences of O&M 2018 (III): The Balto-Slavic conundrum in Linguistics, Archaeology, and Genetics

This is part of a series of posts analyzing the findings of the recent Nature papers Olalde et al. (2018) and Mathieson et al. (2018) (abbreviated O&M 2018).

The recent publication of Narasimhan et al. (2018) has made the draft of this post a bit outdated, and at the same time still more interesting.

While we wait for the publication of the dataset (and the actual Y-DNA haplogroups and precise subclades with the revision of the paper), and as we watch the wrath of Hindu nationalists vented against the West (as if the steppe was in Western Europe) and science itself, we have already seen confirmation from the Reich Lab of their new approach to Late Proto-Indo-European migrations.

Yamna/Steppe EMBA, previously identified as the direct source of “steppe” ancestry (AKA ‘Yamnaya’ ancestry) and Late Indo-European migrations in Asia – through Corded Ware, it is to be understood – has been officially changed. In the case of Indo-Iranian migrations it is the “Steppe MLBA cloud”, after a direct contribution to it of Yamna/Steppe EMBA, which expanded Indo-Iranian, as I predicted ancient DNA could support.

On Twitter, the main author gave the following response when asked about this change regarding the origin of steppe ancestry in Asian migrants (emphasis mine):

Our reasons are:

  1. The Turan samples show no elevated steppe ancestry till 2000BC.
  2. MLBA is R1a
  3. Indus periphery doesn’t have steppe ancestry but Swat does, and EMBA doesn’t work both in terms of time or genetic ancestry to explain the difference.

Image modified from Narasimhan et al. (2018), including the most likely proto-language identification of different groups. Original description “Modeling results including Admixture events, with clines or 2-way mixtures shown in rectangles, and clouds or 3-way mixtures shown in ellipses”. Yes, this map is the latest official view on migrations from the Reich Lab now. See the original full image here.

I am glad to see it finally recognized that Y-DNA haplogroups and time have to be taken into account, and happy also to see an end to the by now obsolete ‘ADMIXTURE/PCA-only relevance’ in Human Ancestry. The timing of archaeological migrations, the cultural attribution of each sample, and the role of Y-DNA variability reduction and expansion have finally been recognized as equally important to assess potential migrations, as I requested.

This change was already in the making some months ago, when David Anthony – who has worked with the group for this paper and others before it – changed his official view on Corded Ware, abandoning his previous support of the 2015 model. His latest theory, which linked Yamna settlements in Hungary with a potential mixed society of migrants (of R1b-L23 and R1a-Z645 lineages) from West Yamna, is most likely wrong, too, but it was clearly a brave step forward in the right direction.

The only reasonable model now is that Yamna expanded Late Proto-Indo-European languages with steppe ancestry + R1b-L23 subclades.

You can either accept this change, or you can deny it and wait until one sample of R1a-Z645 appears in West Yamna or central Europe, or one sample of R1b-L23 appears in Corded Ware (as obviously could happen), and keep spreading the wrong ideas for some more years while the rest of the world moves on. Mallory, Anthony, and other archaeologists co-authoring the latest paper (probably part of the stronger partnership with academics that we were going to see), who had formally put forward complex, detailed theories, investing their time and name in them, have rejected their previous migration models to develop new ones based on the most recent findings. If they can do that, I am sure any amateur geneticist out there can, too.

Modified image, from Narasimhan et al. (2018). Anthony’s new model of a Yamna Hungary -> Corded Ware (Małopolska) migration arrow in red. Notice also how they keep the arrow from West Yamna to the north (in black), due probably to the Baltic Late Neolithic samples (see below).

The Balto-Slavic dialect and its homeland

An interesting question in Linguistics and Archaeology, now that Corded Ware cannot be identified as “Indo-Slavonic” or any other imaginary ancient group (like Indo-Slavo-Germanic), remains thus mostly unchanged since before the famous 2015 genetic papers:

  • Was Balto-Slavic a dialect of the expanding North-West Indo-European language, a Northern LPIE dialect, as we support, based on morphological and lexical isoglosses?
  • Or was it part of an Indo-Slavonic group in East Yamna, i.e. a Graeco-Aryan dialect, based mainly on the traditional Satem-Centum phonological division?

I am a strong supporter of Balto-Slavic being a member of a North-West Indo-European group. That’s probably because I educated myself first with the main Spanish books* on Proto-Indo-European reconstruction, and its authors kept repeating this consistent idea, but I have found no relevant data to reject it in the past 15 years.

* Today two of the three volumes are available in English, although they are from the early 1990s, hence a bit outdated. They also maintain certain peculiarities from Adrados’ own personal theories, such as multiple (coloured) laryngeals, 5 cases – with a common ancestral oblique case – for Middle PIE, etc. But they have lots of detailed discussions on the different aspects of the reconstruction. They are not an easy introductory manual to the field, though; for that you already have many famous short handbooks out there, like those of Fortson (N.American), Beekes (Leiden), or Meier-Brügger (Germany).

Fernando and I have always maintained that North-West Indo-European must have formed a very recent community, probably connected well into the early 2nd millennium BC for certain recent isoglosses to spread among its early dialects, based on our guesstimates*, and on our belief that it formed at some point not just a dialect continuum, but probably a common language, so we estimated that the expansion was associated with the pan-European influence of Únětice and close early Bronze Age European contacts.

NOTE. I know, you must be thinking “linguistic guesstimates? Bollocks, that’s not Science”. Right? Wrong. When you learn a dozen languages from different branches, half a dozen ancient ones, and then still study some reconstructed proto-languages from them, you begin to make your own assumptions about how the language changes you perceive could have developed according to your mental time frames. If you just learned a second language and some Latin in school, and try to make assumptions as to how language changes, or you believe you can judge it with this limited background, you have evidently the wrong idea of what a guesstimate is. I accept criticism to this concept from a scientist used only to statistical methods, since it comes from pure ignorance of what it means. And I accept alternative guesstimates from linguists whose language backgrounds may differ (and thus their perception of language change). However, I would not accept a glottochronological or otherwise (supposedly) statistical model instead (or a religious model, for that matter), so we have no alternatives to guesstimates for the moment.

In fact, guesstimates and dialectalization have paved the way to the steppe hypothesis, first with the kurgan hypothesis by Marija Gimbutas, then complemented further in the past 60 years by linguists and archaeologists into a detailed Khvalynsk -> Yamna -> Afanasevo/Bell Beaker/Sintashta-Andronovo expansion model, now confirmed with genomics. So either you trust us (or any other polyglot who deals with Indo-European matters, like Adrados, Lehmann, Beekes, Kloekhorst, Kortlandt, etc.), or you begin learning ancient languages and obtaining your own guesstimates, whichever way you prefer. The easy way of numbers + computer science does not exist yet, and is quite far from happening – until we can understand how our brains summarize and select important details involved in obtaining estimates –, no matter what you might have been reading recently (even in Nature or Science).

Proto-Indo-European dialectal expansion according to Adrados (1998).

Data from the 2015 papers changed my understanding of the original NWIE-speaking community, and I have since shifted my preferred anthropological model (from a Northern dialect in Yamna spreading into a loose NWIE-speaking Corded Ware -> Únětice) to a quite close group formed by late Yamna settlers in the Carpathian Basin, expanded as East Bell Beakers, and later continuing with close contacts through Central European EBA.

NOTE. As you can read, we initially rejected Gimbutas’ and Anthony’s (2007) notion of a Late PIE splitting suddenly into all known dialects (viz. Italo-Celtic with Vučedol/Bell Beaker), and looked thus for a common NWIE spread with Corded Ware migrants, with help from inferences of modern haplogroup distribution (as was common in the early 2000s). Language reconstruction was the foundation of that model, and it was right in its own way. It probably gave the wrong idea to geneticists and archaeologists, who quite easily accepted some results from the 2015 papers as supporting this model. But it also helped us develop a new model and predict what would happen in future papers, as demonstrated in O&M 2018. Any alternative linguistic and archaeological model could explain what is seen today in genomics, but our model of North-West Indo-European reconstruction is obviously at present the best fit for it.

Map of Chalcolithic migrations (A Grammar of Modern Indo-European, 2nd ed. 2008): Corded Ware as the vector of Indo-European languages.

Nevertheless, one of the most important Balticists and Slavicists alive, Frederik Kortlandt, posits that there was in fact an Indo-Slavonic group, so one has to take that possibility into account. Not that his ideas are flawless, of course: he defends the glottalic theory – which is still held today by just a handful of researchers – , and I strongly oppose his description of Balto-Slavic and Germanic oblique cases in *-m- (against other LPIE *-bh-) as an ancestral remnant related to Anatolian (an ending which few scholars would agree corresponds to what he claims), since that would probably represent an older split than warranted in our model. I believe genetics is proving that the dialectalization of Late PIE happened as Fernando López-Menchero and I described.

NOTE. The idea with these examples of how he has been wrong in LPIE and MPIE reconstruction is not to observe the common ad hominem arguments used by amateur geneticists to dismiss academic proposals (“he said that and was wrong, ergo he is wrong now”). It is to call attention to the fact that the argument from authority is important for the academic community insofar as it creates a common ground, i.e. especially when there are many relevant scholars agreeing on the same subject. But, indeed, any model can and should be challenged, and all authorities are capable of being wrong, and in fact they often are.

The most common explanation today for the dialectal development *-m- is an innovation (not an archaism), whether morphological (viz. Ita. and Gk. them. pl *-i) or phonological (as I defend); and the most commonly repeated model for the satemization trend (even for those supporting a three-dorsal theory for PIE) is areal contact, whether driven by a previous (most likely Uralic) substratum, or not. Hence, if Kortlandt’s main divergent phonological and morphological assessments of the parent language are flawed, and they are the basis for his dialectal scheme, then that scheme should be revised.

The ‘atomic bomb’ that Indo-Slavonic proponents launched, in my opinion, was Holzer’s Temematic (born roughly at the same time as the renewed Old European concept in Oettinger’s North-West Indo-European model) – and indeed Kortlandt’s acceptance of it. It seems to me like the linguistic equivalent of the archaeological “patron-client relationship” proposed by Anthony for a cultural diffusion of Late PIE into different Corded Ware regions: almost impossible to be fully rejected, if the Indo-Slavonic superstrate is proposed for a relatively early time.

In my opinion, the shared morphological layer with North-West Indo-European is obviously older than Iranian influence on Slavic, and I think this is communis opinio today. But how could we disentangle the dialectalization of Balto-Slavic, if there is (as it seems) an ancestral substrate layer (most likely Uralic) common to both Balto-Slavic and Indo-Iranian? It seems a very difficult task.

Diachronic map of migrations in Europe ca. 2250-1750 BC

The expansion of Balto-Slavic

In any case, there are two, and only two mainstream choices right now.

NOTE. Mainstream, as in representing trends current today among Indo-Europeanists, so that many programs around the world would explain these alternative models to their students, or they would easily appear in most handbooks. Not like the word “mainstream” you read in any comment out there by anyone who has never been interested in Indo-European studies, and uses any text from any author, written who knows how long ago, merely to justify their ethnic preconceptions coupled with certain genomic finds.

You can agree with:

A) The Spanish and German schools of thought, together with many American and British scholars, as well as archaeologists like Heyd, Mallory, or Prescott, and now Anthony, too: the language ancestral to Balto-Slavic, Germanic, and Italo-Celtic accompanied expanding West Yamna/East Bell Beakers into Europe, and then their speakers – like the rest of peoples everywhere in Europe – admixed later in the different regions.

B) Frederik Kortlandt and other Indo-Slavicists. The ‘original’ Balto-Slavic would have spread with Srubna (and likely Potapovka before it), as a product of the admixture of East Yamna’s Indo-Slavonic with incoming Corded Ware migrants (this would correspond to my description of Indo-Iranian). ‘True’ Balto-Slavic speakers would have then absorbed the Temematic-speaking migrants (equivalent to early Balto-Slavic migrants as described in the demic diffusion model) spreading from the west, most likely in the steppe. Later developments from the steppe would have then brought Baltic to the north, and Slavic to the west.

Therefore, in both cases the language spoken by early R1a-Z645 lineages in Únětice or Mierzanowice/Nitra EBA cultures would have been an eastern North-West Indo-European dialect associated with expanding Bell Beakers, and closely related to Germanic and Italo-Celtic. In the second case, the ancient samples we see genetically closer to modern West Slavs could thus be identified with those speaking the Temematic substrate absorbed later by Balto-Slavic, or maybe by Balts migrating northward, and Slavs spreading west- and southward.

NOTE. In any case, we know that R1a-Z645 subclades resurged in Central-East Europe after the expansion of Bell Beakers, potentially showing an ancient link with the prevalent R1a subclades in the region today. We know that some ancient Central European populations cluster near modern West Slavs, but in other interesting regions (like the British Isles, Central Europe, Scandinavia, or Iberia) we also see close clusters, and nevertheless observe historically documented radical ethnolinguistic changes, as well as many different subsequent genetic inflows and founder effects, that have significantly altered the anthropological picture in these regions, so it could very well be that the lineages we find in ancient samples do not correspond to modern West Slavic lineages, or even similar ancient and modern lineages could show a radical cultural discontinuity (as is likely the case in this to-and-from-the-steppe migration scheme).

Diachronic map of migrations in Europe ca. 1250-750 BC.

Since we are going to see signs of both – west and east admixture – in early Slavic communities near the steppe, and the distribution from South, West, and East Slavs will include a wide “cloud” connecting Central, East, and South-East Europe, as it is evident already from early Germanic samples, it may be interesting to shift our attention to the Tollense valley and Lusatian samples, and their predominant Y-DNA haplogroups. Once again, tracking male-driven migrations from Central Europe to the Baltic region and the steppe, and back again to much of Central and South Europe, will determine which groups expanded this eastern NWIE dialect initially and in later times.

Since Baltic and Slavic languages are attested quite late, genetics is likely to help us select among the different available models for Balto-Slavic, although (it is worth repeating it) these lineages may not be the same that later expanded each dialect.

NOTE. Bronze and Iron Age samples might begin to depict the true Balto-Slavic migration map. Apart from the strong differences in the satemization processes seen among Baltic, Slavic, and Indo-Iranian, from an archaeological point of view the geographic location of the earliest attested Baltic languages and the prehistoric developments of the region seem to me almost incompatible with a homeland in the steppe. Anyway, in the worst-case scenario – for those of us who work with Balto-Slavic to reconstruct North-West Indo-European – there is consensus that there must be an eastern North-West Indo-European language (which some would call Temematic), whose common traits with Germanic and Italo-Celtic we use to reconstruct their parent language. The question remains thus mostly theoretical, of limited pragmatic use for the reconstruction.

The third way: Baltic Late Neolithic

I have referred to Kristiansen and his group‘s position regarding Corded Ware as Indo-European as flawed before. While their latest interpretation (and language identification) was wrong, Kristiansen’s original idea of long-lasting contacts in the Dnieper-Dniester region with the area occupied by late Trypillia developing a Proto-Corded Ware culture was probably right, as we are seeing now.

New data in Mittnik et al. 2018 show some interesting early Late Neolithic samples from the Baltic region – Zvejnieki, Gyvakarai1 (R1a-Z645) and Plinkaigalis242 – , proving what I predicted: that elevated steppe ancestry and R1a-Z645 subclades would be found in the Dnieper-Dniester region unrelated to the Yamna expansion, and, it seems, to migrants of the Corded Ware A-horizon.

Funnily enough, this shows that there were probably ancient interactions in the region, as originally asserted by Kristiansen, and probably following some of Victor Klochko‘s proposed exchange paths, but earlier than predicted by him.

Nevertheless, linguist Guus Kroonen (from Kristiansen’s workgroup) issued a quick response to O&M 2018 in yet another twist of his agricultural substrate theory, changing Corded Ware from the vector to a vector of expansion of Late Proto-Indo-European languages (thus following again strictly Gimbutas’ outdated model), which fails thus to tackle the main inconsistencies of their previous models, as shown now with the latest paper on South Asian migrations. As I said, they were always one step behind Anthony, and they still are.

Funny also how Anthony, too – like Kristiansen – , may have been right all along since 2007, in proposing that Corded Ware (the nuclear Corded Ware migrants) stemmed from the Dnieper-Dniester region roughly at the same time as Yamna migrants expanded west, and that they did not have any direct genetic connection (in terms of migrations) with each other.

Most likely Pre-Proto-Anatolian migration with Suvorovo-Novodanilovka chiefs in the North Pontic steppe and the Balkans.

Both researchers, who collaborated with the latest genomic research, remade their models, and have to revise now their most recent proposals with the new data, influencing each new paper published with their pressure to be right in their previous models, and with new genomic data compelling them to change their theories under the pressure not to be too wrong again, in this strange vicious circle. Had they remained silent and committed to their archaeological theories, they could have been right all along, each one in their own way.

NOTE. BTW, in case you see ad hominem here too, I feel compelled to say that only thanks to their commitment to disentangling the truth about ancient migrations, and their readiness to collaborate with genetic research – unlike many others in their field – we know today what we know. If they have been wrong many times, it is because they have tried to connect the genetic dots as they were told. For their readiness to explore their science further alone, they should be praised by all. But, again, that does not mean that they cannot be wrong in their models…

Thanks to Anthony’s latest change of mind, we don’t have to hear the “cultural diffusion” argument anymore, and I consider this a great advance for the field.

NOTE. Not that there could not be prehistoric cultural diffusion events of language (i.e. not accompanied by genetic admixture), of course, but such theories, almost impossible to disprove, probably need much more than a simple “patron-client relationship” proposal and anthropometry to justify them, in a time when we will be able to see almost every meaningful personal exchange in Genomics…

Today – since the finding of Ukraine_Eneolithic sample I6561, of haplogroup R1a-Z93, dated ca. 4200 BC, and likely from the Sredni Stog culture – it seems more likely than ever that the expansion of R1a-Z645 subclades was in fact associated with the spread of steppe admixture probably near the North Pontic forest-steppe region, most likely from the Dnieper-Dniester or Upper Dniester region.

The appearance of a ‘late’ Z93 subclade already at such an early date, with steppe admixture, makes it still more likely that the Proto-Corded Ware culture, from where Corded Ware migrants of R1a-Z645 lineages later spread, was probably associated with this wide region.

In a parallel but unrelated migration, as it is now clear, steppe admixture also expanded with Yamna settlers of R1b-L23 lineages into the North Pontic steppe – from the North Caspian steppe, where it had developed previously as the Khvalynsk and (likely) Repin cultures – , roughly at the same time as Proto-Corded Ware expanded to the north, ca. 3300-3000 BC, and then expanded to the west into the Balkans (contributing to the formation of Balkan EBA cultures, and to the East Bell Beaker group).

NOTE. A migration of Yamna settlers northward along the Prut dated ca. 3000 BC or later could have justified the appearance of steppe admixture in the Dnieper-Dniester region, as I proposed for the Zvejnieki sample, although dates from Baltic samples are likely too early for that. For this to be corroborated, migrants should be accompanied up to a certain region by R1b-L23 lineages, and this could mean in turn a revival of Anthony’s original model of cultural diffusion of 2007. The most likely scenario, however, as predicted by Heyd, given the early appearance of steppe admixture and R1a-Z93 subclades in the forest-steppe during the 5th millennium, is that the admixture happened much earlier than that, fully unrelated to Late PIE migrations.

Diachronic map of Copper Age migrations in Europe ca. 3100-2600 BC

The modern Baltic and Slavic conundrum

As for some people of Northern European ancestry previously supporting a bulletproof Yamna (R1a/R1b) -> Corded Ware migration that was obviously wrong; now supporting different Sredni Stog -> Corded Ware groups representing Indo-Slavonic (and Germanic??) in a model that is clearly wrong: how are these attempts different from Western Europeans supporting the autochthonous continuity of R1b-P312 lineages against all recent data, from Indians supporting the autochthonous continuity of R1a-M417 lineages no matter what, and from the more recent trend of autochthonous continuity theories for N1c lineages and Uralic in Eastern Europe?

Modern Germanic-speaking peoples can trace their common language to Nordic Iron Age Proto-Germanic, Celts to La Tène’s expansion of Proto-Celtic, and Romance speakers to the Roman expansion (and to an earlier Proto-Italic), all three dating approximately to the Iron Age. Proto-Slavic is dated much later than that, and probably Proto-Baltic too (or maybe earlier depending on the dialectal proposal), with Balto-Slavic being possibly coeval with Pre-Proto-Germanic and Italo-Celtic, but probably slightly later than that. Also, the language ancestral to Slavic may be (like a theoretical Proto-Romance language) impossible to reconstruct with precision, due to multiple substrate (or superstrate?) influences on the wide territory where Proto-Slavic formed and expanded from, in close alliance with steppe communities of different ethnolinguistic backgrounds.

We know that proto-historic Germanic, Celtic, and Italic peoples spread from relatively small regions, and had almost nothing to do with historic groups speaking their daughter languages, let alone modern speakers. Baltic and Slavic are not different.

NOTE. We have read that Weltzin samples clustered closely to Central Europeans (especially Austrians), and at a certain distance from modern Poles. That’s the conclusion of Sell’s PhD thesis, and it may be right, if you take only modern samples for comparison. However, if you have read or thought that they represented some kind of “ancestral Germanic vs. Slavic” battle, please imagine Trump’s voice for my opinion: Wrroonng, wrroonng, wrroonng. They cluster closely with Bell Beaker migrants, Poland BA, and Únětice (in this order), which we now know thanks to the data from O&M 2018 and Mittnik et al. 2018. And we also know who they don’t cluster close to: Corded Ware and Trzciniec samples. Therefore, people from the region near the most likely homelands of Pre-Proto-Germanic and Proto-Balto-Slavic are – as expected – likely descendants from Bell Beaker migrants in Central Europe. The genetic relationship of those ancient samples to modern inhabitants of Central-East Europe? Not obvious – at all.

PCA of samples from Tollense Valley battlefield and some ancient and modern samples.

We also know (and have known for a long time, well before these recent papers) that the oldest attested Indo-European languages – Mycenaean, early Anatolian languages, and Indo-Aryan (through certain words in Mitanni inscriptions) – do not show continuity from the places where they were first attested to the Late and Middle Proto-Indo-European (steppe) homeland either. There should be no problem then in accepting that there is no linguistic, archaeological, or common sense reason to support that Balto-Slavic is older or shows more regional continuity than other IE languages from Europe.

NOTE. Oh yes, Balts saying “Baltic is the most similar language to PIE” I hear you thinking? Uh-huh, sure. And according to some Greeks (supported e.g. by the conclusions from Lazaridis et al. 2017) Mycenaeans were ‘autochthonous’, and Proto-Greek the most similar to PIE. For many Hindus, Vedic Sanskrit is in fact PIE, and the latest paper by Narasimhan et al. (2018) only reinforces this idea (don’t ask me why). Also, Caucasian scholar Gamkrelidze (with Ivanov) supported the origin of the language precisely in the Caucasus, with Armenian being thus the purest language. For Italian fans of Virgil and the Roman Empire, Latin (like Aeneas) comes from Anatolian linguistically and genetically, hence it must be the ‘oldest’ IE dialect alive… No, wait, Danish scholars Kroonen and Iversen quite recently asserted that Germanic is the oldest to branch off, then it should thus be nearest to PIE! I think you can see a pattern here… And don’t forget about the new Vasconic-Uralic hypotheses going on now, with Vasconic fans of R1b changing from Palaeolithic to Mesolithic, and now to European Neolithic and whatnot, or Uralic fans of N1c changing now from Mesolithic EHG to Siberia (for ancestry) or Central Asia (for N1c subclades), or whatever is necessary to believe in ‘continuity’ of their people following the newest genetic papers… Just pick whatever theory you want, call it “mainstream”, and that’s it.

So, if there is no reliable archaeological model connecting Bronze or Iron Age cultures to Eastern European cultures which are supposed to represent the Proto-Slavic and Proto-Baltic homelands…why on earth would any reasonable amateur (not to speak about scholars) dare propose any sort of genetic or linguistic continuity for thousands of years from PIE to early Slavs, a people whose first blurry appearance in historical records happened during the Middle Ages in rather turbulent and genetically admixed regions? It does not make any sense, and it had all odds against it. Blond hair, blue eyes, lactase persistence? Sure, and ABO group, brachycephaly, anthropometry… All very scientifish.

Diachronic map of migrations during Classical Antiquity in Europe 250 BC – 250 AD.
Where’s Proto-Slavic Wally?


Human ancestry can only help refine solid academic theories, it cannot create one. Every new pet theory used to satisfy modern cultural pre- and misconceptions has failed, and it will fail again, and again, and again…

To have one’s own anthropological model of prehistoric migration requires time and study. It is not enough to play with software and to misuse traditional academic disciplines just to ‘prove’ some completely irrelevant, meaningless, and false continuity.


Proto-Indo-European homeland south of the Caucasus?

User Camulogène Rix at Anthrogenica posted an interesting excerpt of Reich’s new book in a thread on ancient DNA studies in the news (emphasis mine):

Ancient DNA available from this time in Anatolia shows no evidence of steppe ancestry similar to that in the Yamnaya (although the evidence here is circumstantial as no ancient DNA from the Hittites themselves has yet been published). This suggests to me that the most likely location of the population that first spoke an Indo-European language was south of the Caucasus Mountains, perhaps in present-day Iran or Armenia, because ancient DNA from people who lived there matches what we would expect for a source population both for the Yamnaya and for ancient Anatolians. If this scenario is right the population sent one branch up into the steppe-mixing with steppe hunter-gatherers in a one-to-one ratio to become the Yamnaya as described earlier- and another to Anatolia to found the ancestors of people there who spoke languages such as Hittite.

The thread has since logically become a trolling hell, and it seems not to be working right for hours now.

Reich’s proposal based on ancestral components to explain the formation of a people and language is a continuation of their emphasis on ancestry to explain cultures and languages. It seems quite interesting to see this happen again, given their current trend to surreptitiously modify their previous ‘Yamnaya ancestry’ concept and Yamnaya millennia-long R1a-R1b community (that supposedly explains a Yamna -> Corded Ware -> Bell Beaker migration) to a more general ‘steppe people’ sharing a ‘steppe ancestry’ who spoke a ‘steppe language’.

Interesting arrows of dispersal of steppe ancestry, from Yamna -> Corded Ware -> Bell Beaker, from David Reich’s new book (yes, from 2018, number one bestseller in

This new idea based on ancestral components suffers thus from the same essential methodological problems, which equate it – yet again – to pure speculation:

  1. It is a conclusion based on the genomic analysis of few individuals from distant regions and different periods, and – maybe more disturbingly – on the lack of steppe ancestry in the few samples at hand.
  2. Wait, what? Steppe ancestry? So they are trying to derive potential genetic connections among specific prehistoric cultures with a poorly depicted genetic sketch, based on previous flawed concepts (instead of on anthropological disciplines), which seems a rather long stretch for any scientist, whether they are content with seeing themselves as barbaric scientific conquerors of academic disciplines or not. In other words, statistics is also science (in fact, the main one to assert anything in almost any scientific field), and you cannot overcome essential errors (design, sampling, hypothesis testing) merely by using a priori correct statistical methods. Results obtained this way constitute a statistical fallacy.

  3. Even if the sampling and hypothesis testing were fine, to derive anthropological models from genomic investigation is completely wrong. Ancestral component ≠ population.
  4. To include not only potential migrations, but also languages spoken by these potential migrants? It’s sad that we have a need to repeat it, but if ancestral component ≠ population, how could ancestral component = language?

The Proto-Indo-European-speaking community

This is what we know about the formation of a Proto-Indo-European community (i.e. a community speaking a reconstructible Proto-Indo-European language) in the Pontic-Caspian steppe, which is based on linguistic reconstruction and guesstimates, tracing archaeological cultures backwards from cultures known to have spoken ancient (proto-)languages, and helping both disciplines with anthropological models (for which ancient genomics is only helping select certain details) of migration or – rarely – cultural diffusion:

NOTE. The following dates are obviously simplified. Read here a more detailed linguistic assessment based on phonology.

Most likely Pre-Proto-Anatolian migration with Suvorovo-Novodanilovka chiefs in the North Pontic steppe and the Balkans.
  • ca. 5000 BC. Early Proto-Indo-European (or Indo-Uralic) spoken probably during the formation and development of a loose Early Khvalynsk – Sredni Stog I cultural-historical community over the Pontic-Caspian steppe region, whose indigenous population probably had mainly Caucasus hunter-gatherer ancestry.
  • ca. 4500 BC. Khvalynsk probably speaking Middle Proto-Indo-European expands, most likely including Suvorovo-Novodanilovka chiefs into the North Pontic steppe, and probably expanding R1b-M269 lineages for the first time.
  • ca. 4000 BC. Separated communities develop, including North Pontic cultures probably gradually dominated by R1a-Z645 (potentially speaking Proto-Uralic); and Khvalynsk (and Repin) cultures probably dominated by R1b-L23 lineages, most likely developing a Late Proto-Indo-European already separated from Proto-Anatolian.
  • ca. 3500 BC. A Proto-Corded Ware population dominated by R1a-Z645 expands to the north, and slightly later an early Yamna community develops from Late Khvalynsk and Repin, expanding to the west of the Don River, and to the east into Afanasevo. This is most likely the period of reduction of variability and expansion of subclades of R1a-Z645 and R1b-L23 that we expect to see with more samples.
  • ca. 3000 BC. Expansion of Corded Ware migrants in northern Europe, and Yamna migrants along the Danube and into the Balkans, with further reduction and expansion of certain subclades.
  • ca. 2500 BC. Expansion of Bell Beaker migrants dominated by R1b-L51 subclades in Europe, and late Corded Ware migrants in east Yamna expanding R1a-Z93 subclades.

All these events are compatible with language reconstruction in mainstream European schools since at least the 1980s, supported by traditional archaeological research of the past 20 years, and are being confirmed with Genomics.

For those willingly lost in a myriad of new dreams boosted by the shallow comment contained in David Reich’s paragraph on CHG ancestry, even he does not doubt that the origin of Late Proto-Indo-European lies in Yamna, to the north of the Caucasus, based on Anthony’s (2007) account:

Both images from the book, posted by Twitter user Jasper at

NOTE: By the way, David Anthony, one of the main sources of information for Reich’s group, never considered Corded Ware to have received Yamna migrants, and although he changed his model due to the conclusions of the 2015 papers, he has recently changed his model again to adapt it to the inconsistencies found in phylogeography.

CHG ancestry and PIE homeland south of the Caucasus

As for the potential origins of CHG ancestry in early Proto-Indo-European speakers, I already stated clearly my opinion quite recently. They may be attributed to:

Just to be clear, an expansion of Proto-Anatolian to the south, through the Caucasus, cannot be discarded today. It will remain a possibility until Maykop and more Balkan Chalcolithic and Anatolian-speaking samples are published.

However, an original Early Proto-Indo-European community south of the Caucasus seems to me highly unlikely, based on anthropological data, which should drive any conclusion. From what I could read, here are the rather simplistic arguments used:

  • Gimbutas and Maykop: Maykop was thought to be (in Gimbutas’ times) a rather late archaeological culture, directly connected to a Transcaucasian Copper Age culture ca. 2400-2300 BC. It has been demonstrated in recent years that this culture is substantially older, and even then language guesstimates for a Late PIE / Proto-Anatolian would not fit a migration to the north. While our ignorance may certainly be used to derive far-fetched conclusions about potential migrations from and to it, using Gimbutas (or any archaeological theory until the 1990s) today does not make any sense. Still less if we think that she favoured a steppe homeland.

NOTE. It seems that the Reich Lab may have already access to Maykop samples, so this suggested Proto-Indo-European – Maykop connection may have some real foundation. Regardless, we already know that intense contacts happened, so there will be no surprise (unless Y-DNA shows some sort of direct continuity from one to the other).

  • Gamkrelidze & Ivanov: they argued for an Armenian homeland (and are thus at the origin of yet another autochthonous continuity theory), but they did so to support their glottalic theory, i.e. merely to support what they saw as favouring their linguistic model (with Armenian being the most archaic dialect). The glottalic theory is supported today – as far as I know – mainly by Kortlandt, Jagodziński, or (Nostraticist) Bomhard, but even they most likely would not need to argue for an Armenian homeland. In fact, their support of a Graeco-Aryan group (also supported by Gamkrelidze & Ivanov) would be against this, at least in archaeological terms.
  • Colin Renfrew and the Anatolian homeland: This conceptual umbrella of language spreading with farming everywhere has changed so much and so many times in the past 20 years, with so many glottochronological and archaeological estimates circulating, that you can support anything by now using them. Mostly used today for abstract models of long-lasting language contacts, cultural diffusion, and constellation analogies. Anyway, he strives to keep up-to-date information to revise the model, that much is certain:
Science Magazine
“A first line of evidence comes from linguistic analysis based on quantitative lexical data, which returned a tree compatible with the Anatolian hypothesis
  • Glottochronology, phylogenetic trees, Swadesh list analysis, statistical estimates, psychics, pyramid power, and healing crystals: no, please, no.

In principle, unlike many other recent autochthonous continuity theories, I doubt there can be much racial-based opposition anywhere in the world to an origin of Proto-Indo-European in the Middle East, where the oldest civilizations appeared – apart, obviously, from modern Northeast and Northwest Caucasian, Kartvelian, or Semitic speakers, who may in turn have to revisit their autochthonous continuity theories radically…

Nevertheless, it is obvious that prehistoric (and many historic) migrations are signalled by the reduction in variability and expansion of certain Y-DNA haplogroups, and not just by ancestral components. That is generally accepted, although the reasons for this almost universal phenomenon are not always clear.

In fact, Proto-Anatolian and Common Anatolian speakers need not share any ancestral component, PCA cluster, or any other statistical parameter related to steppe populations, not even the same Y-DNA haplogroups, given that approximately three thousand years might have passed between their split from an Indo-Hittite community and the first attested Anatolian-speaking communities… We must carefully follow their tracks from Anatolia ca. 1500 BC to the steppe ca. 4500 BC, otherwise we risk creating another mess like the Corded Ware one.

In my opinion, the substantial contribution of EHG ancestry and R1a-M417 lineages to the Pontic-Caspian steppe (probably ca. 6500 BC) from Central or East Eurasia is the most recent sizeable genomic event in the region, and thus the best candidate for the community that expanded a language ancestral to Proto-Indo-European – whether you call it Pre-Proto-Indo-European, Pre-Indo-Uralic, or Eurasiatic, depending on your preferences.

An early (and substantial) contribution of CHG ancestry in Khvalynsk relative to North Pontic cultures, if it is found with new samples, may actually be a further proof of the Caucasian substrate of Proto-Indo-European proposed by Kortlandt (or Bomhard) as contributing to the differentiation of Middle PIE from Uralic. Genomics could thus help support, again, traditional disciplines in accepting or rejecting academic controversial theories.


In the case of an Early PIE (or Indo-Uralic) homeland, genomic data is scarce. But all traditional anthropological disciplines point to the Pontic-Caspian steppe, so we should stick to it, regardless of the informal suggestion written by a renowned geneticist in one paragraph of a book conceived as an introduction to the field.

It seems we are not learning much from the hundreds of peer-reviewed, statistically (superficially, at least) sound genetic papers whose anthropological conclusions have been proven wrong by now. A lot of people should be spending their time learning about the complex, endless methods at hand in this kind of research – not just bioinformatics – , instead of fruitlessly speculating about wild unsubstantiated proposals.

As a final note, I would like to remind some in the discussion, who seem to dismiss the identification of CHG with Proto-Indo-European by supporting a “R1a-R1b” community for PIE, of their previous commitment to ancestral components in identifying peoples and languages, and thus their support for Reich’s (and his group’s) fundamental premises.

You cannot have it both ways. At least David Reich is being consistent.


The uneasy relationship between Archaeology and Ancient Genomics


News feature Divided by DNA: The uneasy relationship between archaeology and ancient genomics, Two fields in the midst of a technological revolution are struggling to reconcile their views of the past, by Ewen Callaway, Nature (2018) 555:573-576.

Interesting excerpts (emphasis mine):

In duelling 2015 Nature papers, the teams arrived at broadly similar conclusions: an influx of herders from the grassland steppes of present-day Russia and Ukraine – linked to Yamnaya cultural artefacts and practices such as pit burial mounds – had replaced much of the gene pool of central and Western Europe around 4,500–5,000 years ago. This was coincident with the disappearance of Neolithic pottery, burial styles and other cultural expressions and the emergence of Corded Ware cultural artefacts, which are distributed throughout northern and central Europe. “These results were a shock to the archaeological community,” Kristiansen says.


Still, not everyone was satisfied. In an essay titled ‘Kossinna’s Smile’, archaeologist Volker Heyd at the University of Bristol, UK, disagreed, not with the conclusion that people moved west from the steppe, but with how their genetic signatures were conflated with complex cultural expressions. Corded Ware and Yamnaya burials are more different than they are similar, and there is evidence of cultural exchange, at least, between the Russian steppe and regions west that predate Yamnaya culture, he says. None of these facts negates the conclusions of the genetics papers, but they underscore the insufficiency of the articles in addressing the questions that archaeologists are interested in, he argued. “While I have no doubt they are basically right, it is the complexity of the past that is not reflected,” Heyd wrote, before issuing a call to arms. “Instead of letting geneticists determine the agenda and set the message, we should teach them about complexity in past human actions.”

Many archaeologists are also trying to understand and engage with the inconvenient findings from genetics. (…)
[Carlin:] “I would characterize a lot of these papers as ‘map and describe’. They’re looking at the movement of genetic signatures, but in terms of how or why that’s happening, those things aren’t being explored,” says Carlin, who is no longer disturbed by the disconnect. “I am increasingly reconciling myself to the view that archaeology and ancient DNA are telling different stories.” The changes in cultural and social practices that he studies might coincide with the population shifts that Reich and his team are uncovering, but they don’t necessarily have to. And such biological insights will never fully explain the human experiences captured in the archaeological record.

Reich agrees that his field is in a “map-making phase”, and that genetics is only sketching out the rough contours of the past. Sweeping conclusions, such as those put forth in the 2015 steppe migration papers, will give way to regionally focused studies with more subtlety.

This is already starting to happen. Although the Bell Beaker study found a profound shift in the genetic make-up of Britain, it rejected the notion that the cultural phenomenon was associated with a single population. In Iberia, individuals buried with Bell Beaker goods were closely related to earlier local populations and shared little ancestry with Beaker-associated individuals from northern Europe (who were related to steppe groups such as the Yamnaya). The pots did the moving, not the people.

This final paragraph apparently sums up a view that Reich has of this field, since he repeats it:

Reich concedes that his field hasn’t always handled the past with the nuance or accuracy that archaeologists and historians would like. But he hopes they will eventually be swayed by the insights his field can bring. “We’re barbarians coming late to the study of the human past,” Reich says. “But it’s dangerous to ignore barbarians.”

I would say that the true barbarians didn’t have a habit or possibility to learn from the higher civilizations they attacked or invaded. Geneticists, on the other hand, only have to do what they expect archaeologists to do: study.

EDIT (30 MAR 2018): A new interesting editorial of Nature, On the use and abuse of ancient DNA.


Statistical methods fashionable again in Linguistics: Reconstructing Proto-Australian dialects

Reconstructing remote relationships – Proto-Australian noun class prefixation, by Mark Harvey & Robert Mailhammer, Diachronica (2017) 34(4): 470–515


Evaluation of hypotheses on genetic relationships depends on two factors: database size and criteria on correspondence quality. For hypotheses on remote relationships, databases are often small. Therefore, detailed consideration of criteria on correspondence quality is important. Hypotheses on remote relationships commonly involve greater geographical and temporal ranges. Consequently, we propose that there are two factors which are likely to play a greater role in comparing hypotheses of chance, contact and inheritance for remote relationships: (i) spatial distribution of corresponding forms; and (ii) language specific unpredictability in related paradigms. Concentrated spatial distributions disfavour hypotheses of chance, and discontinuous distributions disfavour contact hypotheses, whereas hypotheses of inheritance may accommodate both. Higher levels of language-specific unpredictability favour remote over recent transmission. We consider a remote relationship hypothesis, the Proto-Australian hypothesis. We take noun class prefixation as a test dataset for evaluating this hypothesis against these two criteria, and we show that inheritance is favoured over chance and contact.

I was redirected to this work by my wife – who discovered it reading BBC News – , suspicious of its potential glottochronological content. However, I must say – speaking from my absolute ignorance of the main language family investigated – , that it seemed in general an interesting read, with some thorough discussion and attention to detail.

The statistical analyses, however, seem to disrupt the content, and – in my opinion – do not help support its conclusions.

Map of Non-Pama-Nyungan languages.

Computer Science and Linguistics

We are evidently on alert to tackle dubious research, because of the revival of pseudoscientific methods in linguistic investigation, promoted (yet again) by Nature.

It seems that journals with the highest impact factor, in their search for groundbreaking conclusions supported by any methods involving numbers, are setting a still lower level of standards for academic disciplines.

NOTE. If you think about it – if glottochronology has survived the disgrace it fell into in the 2000s, to come back again now to the top of the publishing industry… How can we expect the “Yamnaya ancestry” concept to be overcome? I guess we will still see certain Eastern Europeans in 2030 arguing for elevated steppe ancestry here and there to support the conclusions of the 2015 papers, no matter what…

I am sure that worse times lie ahead for traditional comparative grammar. For example, it seems that there will be more publications on Proto-Indo-European using novel computer methods: a group led by Janhunen and Pyysalo, from the Department of Languages at the University of Helsinki, promises – under an ever-growing bubble of mystery (or so it seems from their Twitter and Facebook accounts) – a machine-implemented reconstruction (with the generative etymological PIE lexicon project) that will once and for all solve all our previous ‘inconsistencies’…

Spoiler alert for their publications: whether they choose to rely mainly on computer-implemented methods, or use them to support more traditional results, their conclusions will confirm (surprise!) their authors’ previous reactionary theses, such as a renewed support for the traditional monolaryngealism, and a rejection of Kortlandt’s or Kloekhorst’s (i.e. the Leiden School’s) theories on Proto-Indo-European phonology, and thus of a PIE relationship to Proto-Uralic, probably stressing yet again an independent origin for both proto-languages.

See also:

Y-DNA haplogroup R1b-Z2103 in Proto-Indo-Iranians?


We already know that the Sintashta -> Andronovo migrants will probably be dominated by Y-DNA R1a-Z93 lineages. However, I doubt it will be the only Y-DNA haplogroup found.

I said in my predictions for this year that there could not be much new genetic data to ascertain how Pre-Indo-Iranian survived the invasion, gradual replacement and founder effects that happened in terms of male haplogroups after the arrival of late Corded Ware migrants, and that we should probably have to rely on anthropological explanations for language continuity despite genetic replacement, as in the Basque case.

Nevertheless, since we have very few samples, I think we could still see a clear genetic contribution from Yamna to Corded Ware immigrants in the North Caspian region (from Abashevo, in turn a mix of Fatyanovo/Balanovo and Catacomb/Poltavka cultures) in terms of:

  • Ancestral components and PCA in new Sintashta-Petrovka, Andronovo, and/or later samples – similar to the ‘steppe’ drift seen in Potapovka relative to Sintashta samples, both formed by incoming Corded Ware migrants – ; and
  • R1b-L23 subclades, either appearing scattered during the Sintashta melting pot (of Abashevo/R1a-Z645 and East Yamna-Poltavka/R1b-Z2103 peoples), or resurging after this period, as we have seen in Pre-Balto-Slavic territory.

This contribution could better explain the obvious language continuity in the region, beautifully complementing the complex anthropological model we have now of archaeological continuity of Sintashta and Potapovka with the previous Poltavka, seen in a similar material and symbolic culture that survived the arrival of newcomers.

A lot of people seem to be looking like crazy since O&M 2018 for some sort of connection between Corded Ware and Yamna migrants in Eastern and Central Europe (whether in SNP calls of samples published, or among almost forgotten academic papers), either to support the ideas of the 2015 papers – for those who relied on their conclusions and built (even if only mentally) far-fetched migration models around them – , or just because of some sort of absurd continuity theory involving modern R1a-Z645 subclades:

NOTE. The situation we have seen with the hundreds of samples from O&M 2018, and with the recent additional Eastern European samples, depicts an unexpected, absolutely clear-cut distinction in Y-DNA haplogroups between Corded Ware and Yamna/Bell Beaker: I really can’t see how the situation could be more obvious for everyone, so I doubt any further samples will make certain people change their minds. Their hope is, I guess, that just one sample may give some more oxygen to infinite pet theories, as we are still surprisingly seeing even with reactionary R1b autochthonous continuists in Western Europe…

However, looking into the most likely future for the field, what we should be expecting right now is continuity of Yamna ancestry and lineages in early Proto-Indo-Iranian territory. Since we only have a few samples from Sintashta-Petrovka, Potapovka, and Andronovo, I think there might be a sizeable number of R1b-Z2103 subclades in the territory inhabited by those who – no doubt – spread the language into Central Asia.

Modern Y-DNA haplogroup R1b distribution, by Maulucioni at Wikipedia

While full population replacement by R1a-Z93 lineages in the North Caspian region ca. 2000 BC is not impossible, I don’t think it is very likely, since we already know that there are R1b-Z2103 lineages widely distributed in Indo-Iranian-speaking territory, and Z93 is now known to be an older subclade than YFull’s mean formation date suggested (due to the Ukraine_Eneolithic I6561 sample’s SNP call), so what we can now infer actually happened in Sintashta -> Andronovo is not exactly the spread of haplogroup Z93 during its formation, but rather a regional reduction in its variability coupled with the expansion of some of its subclades.

The main question, after the South Asia paper is finally published, will then be:

  1. Given that Yamna peoples were an elite group of patrilineally-related families mainly of R1b-L23 subclades:
  2. Accepting that PCA, ADMIXTURE, and other statistical methods are not relevant (alone) for ethnolinguistic identification: e.g. Yamna ‘outliers’ and East Bell Beaker migrants of R1b-L23 lineages without steppe ancestry; N1c1a1a-L392 lineages and Siberian ancestry unrelated to Uralic speakers; R1a-Z645 and steppe ancestry in North-East Europe related to Uralic-speaking cultures
  3. If we find now, as I expect, genetic continuity of east Yamna in Sintashta -> Andronovo (relative to other late Corded Ware peoples), probably including haplogroup R1b-Z2103 mixed with R1a-Z93 before its further reduction of subclades (e.g. to L657) and expansion during its subsequent spread southward…

Diachronic map of migrations in Asia ca. 2250-1750 BC

Why exactly do we need Corded Ware to explain migrations of Late Indo-European speakers?

In other words: if we had had in 2015 the data we have today, would we have needed Corded Ware to explain Indo-European migrations from the steppe? Are some people so blinded by their will to (appear to) be right in their past interpretations that they can’t just let go?

NOTE. On a side note, wouldn’t it be nice for this paper to publish some other R1b-L23 (x2103) sample – maybe even R1b-L51 – in Yamna, Andronovo, or Afanasevo territory, to end both autochthonous continuity theories (of North-Eastern and Western Europe) at the same time?

I really hope someone in David Reich’s team understands this matter, or else they will still identify Corded Ware as the (now probably ‘a’ instead) vector of expansion of Indo-European languages, and some of us will still have fun for another 2 or 3 years with such conclusions, until someone in the lab realizes that ancestry ≠ population ≠ ethnic identification ≠ language.

NOTE. It seems rather dull to read how people are discussing in the Twitterverse conventional constructs like ‘human race’ as found in Reich’s op-ed in The New York Times, as if such grandiose semantic discussions had any practical meaning, when basic anthropological questions actually relevant for Genomics, like the essential ancestral component ≠ people tenet, seem not to be of interest to anyone in the field…

Since our Indo-European demic diffusion model (and its consequences for our reconstruction of North-West Indo-European) and this blog are becoming more and more popular each day – judging by the constant growth in visits in the past 6 months or so – , I guess the simplemindedness and predictability of certain geneticists is benefitting traditional anthropology directly, driving more and more amateur geneticists to look for sound academic models to answer the growing inconsistencies of genetic research.

NOTE. I am not saying the rejection of Corded Ware as spreading Indo-European is definitive. Maybe more samples within some years will depict a clear ancient expansion of Early or Middle Proto-Indo-Europeans from Khvalynsk to the forest-steppe and forest zone, and later with certain Corded Ware migrants into Central Europe, over whose territory a Late Indo-European dialect from Bell Beakers became the superstrate, as some have proposed in the past – e.g. to explain Krahe’s Old European hydronymy. I really doubt you could demonstrate such an old ethnolinguistic identification with a clear, unbroken archaeological trail, though, and we know now that this old hydronymy is probably of Late Indo-European nature (possibly even more recent).

What I am saying is: with the data we have now, it does not make any sense to keep the anthropological models invented by geneticists ex nihilo in 2015, and the hundred different alternative Late Indo-European migration models that are born with each new paper.

These Yamna -> Corded Ware migration models haven’t made any sense to me since early 2016, but now after O&M 2017, and especially O&M 2018, I don’t think any geneticist with a little knowledge in Linguistics or Archaeology (if they are decent about their quest for truth in describing ancient European migrations) would buy them, if not for some sort of created ‘tradition’. So let’s ditch Corded Ware as Late Indo-European-speaking, let’s accept that late Corded Ware migrants should most likely be identified as early Uralic speakers, and then future data will tell if we are – again – wrong.

Please, don’t let Genomics become another pseudoscience based solely on Bioinformatics like glottochronology: let anthropologists (preferably mainstream archaeologists, but also the true Indo-Europeanists, linguists) help you interpret your raw data. Don’t deceive yourselves thinking that you have read enough about the Indo-European question, or that you know enough Indo-Europeanists (say what?) to derive your own conclusions.

Use the South Asia paper to begin expressly retracting the Corded Ware mess.

Please pretty please with sugar on top?


For commenters: this post concerns an anthropological question, and deals with the expansion of Late Proto-Indo-European speakers from Yamna, and the controversy surrounding the role of Corded Ware migrants that a handful of academics propose spread from it, based on a renewed model of Gimbutas’ outdated Kurgan theory and on the so-called ‘Yamnaya’ ancestry.

It so happens that the discussion has turned lately mainly to ancient Y-DNA haplogroups, because they help confirm previous mainstream anthropological models of cultural diffusion and migration. It is obviously not reasonable to judge prehistoric ethnolinguistic migrations from ca. 5,000 years ago based on historical nation-states and ethnic or religious concepts invented since the Middle Ages, coupled with “your” people’s main modern (or your own) paternal lineage.

EDIT (27 MAR 2018): Minor corrections and post made shorter.

Oldest N1c1a1a-L392 samples and Siberian ancestry in Bronze Age Fennoscandia

Open access preprint at bioRxiv, Ancient Fennoscandian genomes reveal origin and spread of Siberian ancestry in Europe, by Lamnidis et al. (2018).

Abstract (emphasis mine):

European history has been shaped by migrations of people, and their subsequent admixture. Recently, evidence from ancient DNA has brought new insights into migration events that could be linked to the advent of agriculture, and possibly to the spread of Indo-European languages. However, little is known so far about the ancient population history of north-eastern Europe, in particular about populations speaking Uralic languages, such as Finns and Saami. Here we analyse ancient genomic data from 11 individuals from Finland and Northwest Russia. We show that the specific genetic makeup of northern Europe traces back to migrations from Siberia that began at least 3,500 years ago. This ancestry was subsequently admixed into many modern populations in the region, in particular populations speaking Uralic languages today. In addition, we show that ancestors of modern Saami inhabited a larger territory during the Iron Age than today, which adds to historical and linguistic evidence for the population history of Finland.

Interesting excerpts (edited):

While the Siberian genetic component described here was previously described in modern-day populations from the region, we gain further insights into its temporal depth. Our data suggest that this fourth genetic component found in modern-day north-eastern Europeans arrived in the area around 4,000 years ago at the latest, as illustrated by ALDER dating using the ancient genome-wide data from Bolshoy Oleni Ostrov. The upper bound for the introduction of this component is harder to estimate. The component is absent in the Karelian hunter-gatherers (EHG) dated to 8,300-7,200 yBP as well as Mesolithic and Neolithic populations from the Baltics from 8,300 yBP and 7,100-5,000 yBP respectively. While this suggests an upper bound of 5,000 yBP for the arrival of Siberian ancestry, we cannot exclude the possibility of its presence even earlier, yet restricted to more northern regions, as suggested by its absence in populations in the Baltic during the Bronze Age. Our study also presents the earliest occurrence of the Y-chromosomal haplogroup N1c in Fennoscandia. N1c is common among modern Uralic speakers, and has also been detected in Hungarian individuals dating to the 10th century, yet it is absent in all published Mesolithic genomes from Karelia and the Baltics.

The large Siberian component in the Bolshoy individuals from the Kola Peninsula provides the earliest direct genetic evidence for an eastern migration into this region. Such contact is well documented in archaeology, with the introduction of asbestos-mixed Lovozero ceramics during the second millenium BC, and the spread of even-based arrowheads in Lapland from 1,900 BCE. Additionally, the nearest counterparts of Vardøy ceramics, appearing in the area around 1,600-1,300 BCE, can be found on the Taymyr peninsula, much further to the east. Finally, the Imiyakhtakhskaya culture from Yakutia spread to the Kola Peninsula during the same period. Contacts between Siberia and Europe are also recognised in linguistics. The fact that the Siberian genetic component is consistently shared among Uralic-speaking populations, with the exceptions of Hungarians and the non-Uralic speaking Russians, would make it tempting to equate this component with the spread of Uralic languages in the area. However, such a model may be overly simplistic. First, the presence of the Siberian component on the Kola Peninsula at ca. 4000 yBP predates most linguistic estimates of the spread of Uralic languages to the area. Second, as shown in our analyses, the admixture patterns found in historic and modern Uralic speakers are complex and in fact inconsistent with a single admixture event. Therefore, even if the Siberian genetic component partly spread alongside Uralic languages, it likely presented only an addition to populations carrying this component from earlier.

Plot of ADMIXTURE (K=3) results containing West Eurasian populations and the Nganasan. Ancient individuals from this study are represented by thicker bars.

The novel genome-wide data here presented from ancient individuals from Finland opens new insights into Finnish population history. Two of the three higher coverage individuals and all six low coverage individuals from Levänluhta showed low genetic affinity to modern-day Finnish speakers of the area. Instead, an increased affinity was observed to modern-day Saami speakers, now mostly residing in the north of the Scandinavian Peninsula. These results suggest that the geographic range of the Saami extended further south in the past, and hints at a genetic shift at least in the western Finnish region during the Iron Age. The findings are in concordance with the noted linguistic shift from Saami languages to early Finnish. Further ancient DNA from Finland is needed to conclude to what extent these signals of migration and admixture are representative of Finland as a whole.

PCA plot of 113 Modern Eurasian populations, with individuals from this study projected on the principal components. Uralic speakers are highlighted in light purple.

The two samples of haplogroup N1c1a1a-L392/L1026, dated ca. 1500 BC, come from the site Bolshoy Oleniy Ostrov, in the Kola Peninsula.

Bolshoy Oleniy Ostrov (Great Reindeer Island), situated in the Kola Bay of the Barents Sea and separated from the mainland by Yekarerininsky Island and two straits, harbors the ancient cemetery of an unknown Early Metal Age culture. The preservation of artifacts made from bone and antler, wooden structures, as well as human remains is remarkable for the location and age this site represents. Altogether 19 skeletons of adults and children have been recognized from both single and collective burials of the site, together with more than 250 artifacts. (…) Apart from these excavations, approximately 25 burials were revealed in 1934 during the construction of fortifications. (…) Radiocarbon dates are provided by Moiseyev and Khartanovich in their 2012 study, placing the site in middle to the late 2nd millennium BC (…)

After seeing how Late Indo-European languages spread with Yamna and (mainly) R1b-L23 lineages, we are now obtaining proof of how Siberian ancestry – likely accompanying N1c-L392 lineages – was probably related to an early archaeological Siberian influence in the easternmost region of North-East Europe, seen also probably in linguistics.

NOTE. Whereas I proposed – based mainly on common guesstimates – that R1a-M417 and EHG ancestry might have signaled the arrival of an early Yukaghir substratum to NE Europe, later acquired by Uralic spreading over this territory, while N1c1a1a lineages with the Seima-Turbino phenomenon might have given Uralic its later Altaic traits, it is indeed possible – and more likely with the findings in this paper – that N1c1a1a lineages may have in fact spread Yukaghir languages, especially if (like the Leiden school) one supports an Indo-Uralic community.

The linguistic effect of this migration may depend on one’s preferred model for Proto-Uralic and its strata, and especially on one’s position in the Proto-Uralic vs. Proto-Uralo-Yukaghir controversy. Although I really didn’t have a strong opinion on this matter, it is clear from my texts that (unlike Kortlandt) I didn’t consider Yukaghir to share a common ancestor with Uralic languages. What genomics is showing right now seems to me directly translatable to a linguistic model, and we should therefore reject an original Proto-Uralo-Yukaghir community.

Also, it seems that the Finnish population peak which expanded today’s prevalent N1c-L392 lineages – after the Iron Age bottleneck which likely reduced its haplogroup diversity – may have been associated with the event that displaced the Saami population from Finland after ca. 1000 AD.

I think it is becoming still clearer where Uralic languages came from.


On the potential origin of Caucasus hunter-gatherer ancestry in Eneolithic steppe cultures

An interesting open genomic question is the origin and spread of Caucasus hunter-gatherer (CHG) ancestry in steppe populations during the Eneolithic.

My broad theory regarding the appearance of this ancestral component is based on:

Two recently published papers investigating the Don Region may shed some light on this issue:

Plant food subsistence in the human diet of the Bronze Age Caspian and Low Don steppe pastoralists: archaeobotanical, isotope and 14C data, by Shishlina, Bobrov, Simakova, et al. Veget Hist Archaeobot (2018).

EDIT (16/3/2018): You can now read or download the paper at


The paper presents the result of analysis of charred food on the interior part of the vessels from the graves of the East Manych and West Manych Catacomb archaeological cultures (2500–2350 cal bc). The phytolith and pollen analyses identified pollen of wild steppe plants and phytoliths of domesticated gramineous plants determined as barley phytoliths. Direct 14С dating of one of the samples demonstrates that barley spikelets and stems were used in funeral rites by local steppe communities. However, there are no data suggesting that steppe inhabitants of the Lower Don Region were engaged in agriculture in the mid-3000 bc. Supposedly, barley could have reached the steppes through seasonal migrations of mobile pastoralists to the south, use of North Caucasus grasslands in the economic system of seasonal moves and exchange with local people. Nevertheless, presence of carbonized barley seeds in the occupation layers at North Caucasus settlements of 4000–3000 bc requires confirmation by direct 14С dating of such samples.

Location of sites. 1: Ulan IV; 2: Peschany IV and V; 3: Shakhaevskaya 1; 4: Zunda-Tolga 2; 5: Lesnoye; 6: Chidgom; 7: Meshoko; 8: Chishkho; 9: Svobodnoye

Dynamics of Chemical and Microbiological Soil Properties in the Desert–Steppe Zone of the Southeast Russian Plain during the Second Part of the Holocene (4000 BC–XIII century AC), by Kashirskaya, Khomutova, Kuznetsova, et al. Arid Ecosyst (2018) 8(1):38-46.


The results of studies of the chemical and microbiological properties of the soils buried under the barrows of the Eneolithic, Bronze, and Middle Ages periods of the southeast of the Russian Plain are presented. It was shown that the climate of the region in the Eneolithic period (4200–4100 BC) and in the Middle Ages (700 years ago) was more humid in comparison to the present time. The third millennium BC was characterized by a gradual increase of the climate aridity. Its peak was at the end of the III millennium BC. The number and biomass of microbial cells was maximal in soils buried in periods of high atmospheric humidity (4200–4100 and 3000–2800 BC) and sharply decreased during the aridization period in the second half of the III millennium BC. In general, the variability of indicators of microbocenosis conditions of desert–steppe buried soils of all ages from the burial mounds correlated with the centuries-old dynamics of the climate.

Number of microbial cells in buried soils of different ages and modern background soil.

It is well known that access to more food – as in favorable crops and cattle feeding – may cause demographic explosions, and the second article – together with recent genomic data – may be yet another proof of that.

Until now, pastoralism seemed to be the main subsistence economy for most steppe groups. It seems, though, that earlier Eneolithic contacts of certain steppe groups with settlements of the Northern Caucasus might have served not just to obtain prestige goods but – if proper radiocarbon dating confirms it – also essential goods, and may have involved more stable seasonal exchange systems.

Such stable economic exchanges might have therefore included bidirectional exogamy practices, justifying the sizeable genomic contribution from the Caucasus.

At this point this is just another good theory to take into account.


Consequences of O&M 2018 (II): The unsolved nature of Suvorovo-Novodanilovka chiefs, and the route of Proto-Anatolian expansion


This is part of a series of posts analyzing the findings of the recent Nature papers Olalde et al. (2018) and Mathieson et al. (2018) (abbreviated O&M 2018).

I already expressed my predictions for 2018. One of the most interesting questions among them is the identification of the early Anatolian offshoot, and this is – I believe – where Genomics has the most to say in Indo-European migrations.

Linguistics and Archaeology already had a mainstream account from Late PIE/Yamna onwards, and it has been proven right in Genomic investigation. There is, however, no consensus on Indo-Hittite.


Apart from the Anatolian homeland hypothesis and its westward migration (as referenced e.g. by Lazaridis et al. 2017), the other possibility including the most likely steppe homeland is that Proto-Anatolian spread through the Balkans, and must have separated from Khvalynsk and travelled first westward through the North Pontic region, and then southward to Ezero.

EDIT (10 MAR 2018): The Anatolian westward route within the steppe homeland model refers to the possibility that Proto-Anatolian spread south through the Caucasus, and then westward through Anatolia, as suggested e.g. originally by Marija Gimbutas for Maykop, as a link in the Caucasus.

We all know that this Khvalynsk -> Novodanilovka-Suvorovo -> Cernavoda -> Ezero -> Troy migration model proposed by Anthony shows no conspicuous chain in Archaeology, but obvious contacts (including Genomics) are seen among some of these neighbouring cultures in different times.

We know that remains of Suvorovo-Novodanilovka culture of chiefs emerged around 4400-4200 BC among ordinary local Sredni Stog settlements:

  • the Novodanilovka rich burials in the steppes, near the Dnieper,
  • and the Suvorovo group in the Danube delta, roughly coinciding with the massive abandonment of old tell settlements in the area.

One of the strongest cultural connections between Khvalynsk and Suvorovo-Novodanilovka chiefs is the similar polished stone mace-heads shaped like horse heads found in both cultures, a typical steppe prestige object going back to the east Pontic-Caspian steppe beginning ca. 5000-4800 BC.

Its finding in the Danube valley may have signalled the expansion of horse riding, which is compatible with the finding of ancient domesticated horses in the region. Horses were not important in Old European cultures, and it seems that they weren’t in Sredni Stog or Kvitjana either.

Steppe and Danubian sites at the time of the Suvorovo-Novodanilovka intrusion, about 4200-3900 BC. David W. Anthony (2007).

NOTE. Telegin, the main source of knowledge on Ukrainian prehistoric cultures for Anthony, was eventually convinced that Suvorovo-Novodanilovka was a separate culture. However, for Anthony (using Telegin’s first impressions), it may have been a wealthy elite among Sredni Stog peoples. Anthony considers Sredni Stog to have been also influenced by Khvalynsk, and thus potentially related to the Suvorovo-Novodanilovka chiefs.

Nevertheless, he obviously cannot link North Pontic Eneolithic cultures to Khvalynsk nor to horse riding – whilst he clearly assumes horse riding for Novodanilovka-Suvorovo chiefs – , and he does not link North Pontic cultures to later expansions of Late Proto-Indo-Europeans from late Khvalynsk and Yamna, either.

The question here for Anthony (as with further Proto-Anatolian expansions described in his 2007 book), in my opinion, was to offer a plausible string of connections between Khvalynsk and Anatolia, and the simplest connection one can make among steppe cultures is a general, broad community between North Pontic and North Caspian cultures. That way, the knot tying Khvalynsk to the Danube seems stronger, whatever the origin of Suvorovo-Novodanilovka chiefs.

If, however, a direct genetic connection is made between Suvorovo-Novodanilovka chiefs and Khvalynsk – as in its association with R1b-M269 and R1b-L23 lineages – , there will be little need to include Sredni Stog or any other intermediate culture in the equation.

We have already seen a movement of steppe ancestry into mainland Greece, and I would not be surprised if a parallel movement could be seen from Ezero to Troy (or a neighbouring North-West Anatolian region), so that the final migration of Common Anatolian had in fact been triggered by the massive steppe migrations during the Chalcolithic.

NOTE. Whereas we are certain to find R1b-L23 subclades in the direct Balkan migrations from Yamna, the link of steppe->Anatolia migrations may be a little trickier: even if we find out that the Suvorovo-Novodanilovka expansion was associated with an expansion and reduction of haplogroup variability (to haplogroups R1b-M269 and R1b-L23), we don’t know yet if the ca. 1,500 years that passed (and the various cultural and population changes that occurred) between Proto-Anatolian and Common Anatolian migrations may have impacted the main haplogroup composition of both communities.

O&M 2018

A probably unsurprising – because of its previously known admixture and PCA – , but nevertheless disappointing finding came from the Y-SNP call of the haplogroup R1 found in Varna (R1b-V88, given first by Genetiker), leaving us with no new haplogroup data standing out for this period.

This sample’s lack of obvious genetic links with the steppe and early date didn’t deter me from believing it could show subclade M269, and thus a sign of incoming Suvorovo chiefs in the region. After all, R1b-P297 subclades seemed to have almost disappeared from the Balkans by that time, and we know that assessments based only on ancestral components and PCA clusters are not infallible – we are seeing that in many, many samples already.

1—39 — sceptre bearers of the type Giurgiuleşti and Suvorovo; 40—60 — Gumelniţa-Varna-Bolgrad-Aldeni cultural sphere; 61 — Fălciu; 62 — Cainari; 63 — Giurgiuleşti; 64 — Suvorovo; 65 — Casimcea; 66 — Kjulevča; 67 — Reka Devnja; 68 — Drama; 69 — Gonova Mogila; 70 — Reževo. Țerna S., Govedarica B. (2016)

NOTE. In fact, the first time I checked Mathieson et al. (2018) supplementary tables I thought that the ‘Ukraine_Eneolithic’ sample of R1b-L23 subclade was ‘it’: the first clear proof in ancient samples of incoming Suvorovo chiefs from Khvalynsk I was looking for… Until I realized its date, and that it was more likely a Late Yamna (or Catacomb) sample.

Steppe ancestry is found in the Varna and Smyadovo outliers, though, and these samples cluster closely to Ukraine Eneolithic samples (which are among Khvalynsk, Ukraine Neolithic, and Anatolia Neolithic clusters), so some population movement must have happened around or before that time in the region, and it is obvious that it happened from east to west.

It remains to be seen, therefore:

a) If the incoming Suvorovo-Novodanilovka chiefs (most likely originally from Khvalynsk) dominating over North Pontic and Danube regions show – as I bet – R1b-M269, and possibly also early R1b-L23* subclades,

b) Or else they still show mixed lineages, reflecting an older admixed population of the Pontic-Caspian steppe – as the early Khvalynsk and Ukraine Eneolithic samples we have now.

NOTE. Even though my preferred model of migration is through the Balkans – due to the many east-west migrations seen from the steppe into Europe – , there is no general consensus here because of the lack of solid anthropological models, and there are cultural links found also between the steppe and Anatolia through the Caucasus, so the question remains open.