On the origin of haplogroup R1b-L51 in late Repin / early Yamna settlers

steppe-eneolithic-migrations

A recent comment on the hypothetical Central European origin of PIE helped me remember that, when news appeared that R1b-L51 had been found in Khvalynsk ca. 4250-4000 BC, I began to think about alternative scenarios for the expansion of this haplogroup, with one of them including Central Europe.

Because, if YFull‘s (and Iain McDonald‘s) estimation of the split of R1b-L23 in L51 and Z2103 (ca. 4100 BC, TMRCA ca. 3700 BC) was wrong, by as much as the R1a-Z645 estimates proved wrong, and both subclades were older than expected, then maybe R1b-L51 was not part of the Yamna expansion, but rather part of an earlier expansion with Suvorovo-Novodanilovka into central Europe.

That is, R1b-L51 and R1b-Z2103 would have expanded wih Khvalynsk-Novodanilovka migrants, and they would have either disappeared among local populations, or settled and expanded with successful lineages in certain regions. I think this may give rise to two potential models.

A hidden group in the European east-central steppes?

Here is what Heyd (2011), for example, has to say about the effect of the Khvalynsk-Novodanilovka expansion in the 4th millennium BC, with the first Kurgan wave that shuttered the social, economic, and cultural foundations of south-eastern Europe (before the expansion of west Yamna migrants in the region):

indo-european-anatolian-uralic-migrations
Proto-Anatolian migrations with Khvalynsk-Novodanilovka expansion, including ADMIXTURE data from Wang et al. (2018).

As the Boleraz and Baden tumuli cases in Serbia and Hungary demonstrate, there are earlier, 4th millennium cal. B.C. round tumuli in the Carpathian basin. There are also earlier north-Pontic steppe populations who infiltrated similar environments west of the Black Sea prior to the rise of the Yamnaya culture. This situation can be traced back to the 2nd half of the 5th millennium cal. B.C. to a group of distinct burials, zoomorphic maceheads, long flint blades, triangular flint points, etc., summarized under the term Suvurovo-Novodanilovka (Govedarica 2004; Rassamakin 2004; Anthony 2007; Heyd forthcoming 2011). They also erected round personalized tumuli, though smaller in size and height, above inhumations of single individuals. Suvorovo and Casimcea are the key examples in the lower Danube region of Romania. In northeast Bulgaria, the primary grave of Polska Kosovo (ochre-stained supine extended body position: information communicated by S. Alexandrov) can also be seen as such, as should the Targovishte-“Gonova mogila” primary grave 1 in the Thracian plain with a burial arranged in a supine position with flexed legs, southeast-northwest orientated, and strewed with ochre (Kanchev 1991 , p. 56- 57; Ivanova Gaydarska 2007). In addition to the many copper and shell beads, the 17.4cm long obsidian blade is exceptional, which links this grave to the Csongrád-“Kettoshalom” grave in the south Hungarian plain (Ecsedy 1979). It also yielded an obsidian blade ( 13.2cm long) and copper, shell and limestone beads.

suvorovo-novodanilovka-expansion-europe
The Southeast European distribution of graves of the Suvorovo-Novodanilovka group and such unequipped ones mentioned in the text which can be attributed by burial custom and stratigraphic position in the barrow, plus zoomorphic and abstract animal head sceptres as well as specific maceheads with knobs as from Decea Maresului (mid-5th millennium until around 4000 BC). Heyd (2016).

However, no traces of a tumulus have been recorded above the Kettoshalom tomb. Conventionally, it is dated to the Bodrogkeresztur-period in east Hungary, shortly after 4000 cal. B.C., which would correspond very well with the suggested Cernavodă I (or its less known cultural equivalent in the Thracian plain) attribution for the “Gonova mogila” grave, a cultural background to which the Csongrád grave should have also belonged. Bodrogkeresztur and Cernavodă I periods are not the only examples of 4th millennium cal. B.C. tumuli and burials displaying this steppe connection. Indeed we can find this early steppe impact throughout the 4th millennium cal. B.C. These include adscriptions to the Horodiștea II (Corlateni-Dealul Stadole, grave I: Burtanescu l 998, p. 37; Holbocai, grave 34: Coma 1998, p. 16); to Gordinești-Cernavodă 11 (Liești-Movila Arbănașu, grave 22: Brudiu 2000); to Gorodsk-Usatovo (Corlăteni Dealul Cetăţii, grave I: Comșa 1998, p. 17- 18, in Romania; Durankulak, grave 982: Vajsov 2002, in Bulgaria); and to Cernavodă III(Golyama Detelina, tum. 4: Leshtakov, Borisov 1995), and early (end of 4th millennium cal. B.C.) Ezero in Ovchartsi, primary grave (Kalchev 1994, p. 134-138) and Golyama Detelina, tum. 2 (Kanchev 1991) in Bulgaria. Also the Boleráz and Baden tumuli of Banjevac-Tolisavac and Mokrin in the south Carpathian basin account for this, since one should perhaps take into account primary grave 12 of the Sárrédtudavari-Orhalom tumulus in the Hungarian Alfold: a left-sided crouched juvenile ( 15- 17 y) individual in an oval, NW-SE orientated grave pit 14C dated to 3350-3100 cal. B.C. at 2 sigma (Dani, Ncpper 2006). Neither the burial custom (no ochre strewing or depositing a lump of ochre has been recorded), nor date account for its ascription to the Yamnaya!

All of these tumuli and burials demonstrate, though, that there is already a constant but perhaps low-level 4th millennium cal. B.C. steppe interaction, linking the regions of the north of the Black Sea with those of the west, and reaching deep into the Carpathian basin. This has to be acknowledged. even if these populations remain small, bounded to their steppe habitat with an economy adapted to this special environment, and are not always visible in the record. Indirect hints may help in seeing them, such as the frequent occurrence of horse bones, regarded as deriving from domesticated horses, in Hungarian Baden settlements (Bokonyi 1978; Benecke 1998), and in those of the south German Cham Culture (Matuschik 1999, p. 80-82) and the east German Bernburg Culture (Becker 1999; Benecke 1999). These occur, however, always in low numbers, perhaps not enough to maintain and regenerate a herd. Does this point us towards otherwise archaeologically hidden horsebreeders in the Carpathian basin, before the Yamnaya? In any case, I hope to make one case clear: these are by no means Yamnaya burials in the strict definition! Attribution to the Yamnaya in its strict definition applies.

pit-graves-central-europe
Distribution of Pit-Grave burials west of the Black Sea likely dating to the 2nd half of the 4th millennium BC (triangles: side-crouched burials; filled circles: supine extended burials; open circles: suspected). In Alin Frînculeasa, Bianca Preda, Volker Heyd, Pit-Graves, Yamnaya and Kurgans along the Lower Danube.

Also, about the expansion of Yamna settlers along the steppes:

However, it should have been made clear by the distribution map of the Western Yamnaya that they were confining themselves solely to their own, well-known, steppe habitat and therefore not occupying, or pushing away and expelling, the locally settled farming societies. Also, living solely in the steppes requires another lifestyle, and quite different economic and social bases, most likely very different to the established farming societies. Although surely regarded as incoming strangers, they may therefore not have been seen as direct competitors. This argument can be further enforced when remembering that the lowlands and the steppes in the southeast of Europe had already been populated throughout the 4th millennium cal. B.C., as demonstrated above, by societies with a similar north-Pontic steppe origin and tradition, albeit in lower numbers. It is only for these groups that the Yamnaya may have become a threat, but their common origin and perhaps a similar economic/ social background with comparable lifestyles would surely have assisted to allow rapid assimilation. More important, though, is that farming societies in this region may therefore have been accustomed to dealing and interacting with different people and ethnic strangers for a long time. (…)

When assessing farming and steppe societies’ interaction from a general point of view, attitudes can diverge in three main directions:

  1. the violent one; with raids, fights, struggles, warfare, suppression and finally the superiority and exploitation of the one over the other;
  2. the peaceful one; with a continuous exchange of gifts, goods, work, information and genes in a balanced reciprocal system, leading eventually to the merging of the two societies and creation of a new identity;
  3. the neutral one; with the two societies ignoring each other for a long time.

What we see from trying to understand the record of the Yamnaya, based on their tumuli and burials, and the local and neighbouring contemporary societies, based on their settlements, hoards, and graves, is likely a mixture of all three scenarios, with the balance perhaps more towards exchange in a highly dynamic system with alterations over time. However, violence and raids cannot be ruled out; they would be difficult to see in the archaeological record; or only indirectly, such as the building of hill forts, particularly the defence-like chain of Vucedol hillforts along the south shore of the Danube on the Serbian/Croatian border zone (Tasic 1995a), and the retreat of people into them (Falkenstein 1998, p. 261-262), with other interpretations also possible. And finally, we are dealing here with very different local and neighbouring societies, as well as with more distant contemporary ones, looking, in reality, rather like a chequer board of societies and archaeological cultures (see Parzinger 1993 for the overview). These display different regional backgrounds and traditions leading to different social and settlement organizations, different economic bases and material cultures in the wide areas between Prut and Maritza rivers, and Black Sea and Tisza river. They surely found their individual way of responding to the incoming and settling Yamnaya people.

yamna-tumuli-west-carpathians
Yamnaya tumuli signalling the expansion of West Yamna from ca. 3100 BC (especially after ca. 2950 BC). Heyd (2011).

The best data we have about this potential non-Yamna origin of R1b-L51 – and thus in favour of its admixture in the Carpathian basin – lies in:

  1. The majority of R1a-Z2103 subclades found to date among Yamna samples.
  2. The presence of R1b-Z2103 in the Catacomb culture – in the Northern Caucasus and in Ukraine.
  3. The limited presence of (ancient and modern) R1b-L51 in eastern Europe and India, whose isolated finds are commonly (and simplistically) attributed to ‘late migrations’.
  4. The presence of R1b-L51 (xZ2103) in cultures related to the ‘Yamna package’, but supposedly not to Yamna settlers. So for example I7043, of haplogroup R1b-L151(xU106,xP312), ca. 2500-2200 BC from Szigetszentmiklós-Üdülősor, probably from the Bell Beaker (Csepel group), but maybe from the early Nagýrev culture.
  5. The expansion of its subclades apparently only from a single region, around the Carpathian basin, in contrast to R1b-Z2103.
  6. The already ‘diluted’ steppe admixture found in the earliest samples with respect to Yamna, which points to the appearance after the Yamna admixture with the local population.
  7. Ukrainian archaeologists (in contrast to their Russian colleagues) point to the relevance of North Pontic cultures like Kvitjana and Lower Mikhailovka in the development of Early Yamna in the west, and some eastern European researchers also believe in this similarity.
  8. If R1b-Z2103 and R1b-L51 had expanded with Suvorovo-Novodanilovka migrants to the west, and had admixed later as Hungary_LCA-LBA-like peoples with Yamna migrants during the long-term contacts with other ‘kurganized cultures’ ca. 2900-2500 BC in the Great Hungarian Plains, it could explain some peculiar linguistic traits of North-West Indo-European, and also why R1b-Z2103 appears in cultures associated with this earlier ‘steppe influence’ (i.e. not directly related to Yamna) such as Vučedol (with a R1b-Z2103 sample, see below). That could also explain the presence of R1b-L151(xP312, xU106) in similar Balkan cultures, possibly not directly related to Yamna.
PCA-r1b-l51
Image modified from Wang et al. (2018). PCA of ancient and modern samples. Red circle in dashed line around Varna, Greece Neolithic, and (approximate position of) Smyadovo outliers, part of Khvalynsk-Novodanilovka settlers.

A hidden group among north or west Pontic Eneolithic steppe cultures?

The expansion of Khvalynsk as Novodanilovka into the North Pontic area happened through the south across the steppe, near the coast, with the forest-steppe region working as a clear natural border for this culture of likely horse-riding chieftains, whose economy was probably based on some rudimentary form of mobile pastoralism.

Although archaeologists are divided as to the origin of each individual Middle Eneolithic group near the Black Sea after the end of the Khvalynsk-Novodanilovka period, it seems more or less clear that steppe cultures like Cernavodă, Lower Mikhailovka, or Kvitjana are closer (or “more archaic”) in their steppe features, which connects them to Volga–Ural and Northern Caucasus cultures, like Northern Caucasus, Repin or Khvalynsk.

On the other hand, forest-steppe cultures like Dereivka (including Alexandria) show innovative traits and contacts with para- or sub-Neolithic cultures to the north, like Comb-Pit Ware groups, apart from corded decoration influenced by Trypillian groups to the west, especially in their later (‘Proto-Corded Ware‘) stage after ca. 3500 BC.

If Ukrainian researchers like Rassamakin are right, Early Yamna expanded not only from Repin settlers, but also from local steppe cultures adopting Repin traits to develop an Early Yamna culture, similar to how eastern (Volga–Ural groups) seem to have synchronously adopted Early Yamna without massive affluence of Repin settlements.

Furthermore, local traits develop in southern groups, like anthropomorphic stelae (shared with Kemi-Oba, direct heir of Lower Mikhailovka), and rich burials featuring wagons. These traits are seen in west Yamna settlers.

north-pontic-kvityana-dereivka-repin
Modified from Rassamakin (1999), adding red color to Repin expansion. The system of the latest Eneolithic Pointic cultures and the sites of the Zhivotilovo-Volchanskoe type: 1) Volchanskoe; 2) Zhivotilovka; 3) Vishnevatoe; 4) Koisug.

Problems of this model include:

  1. On the North Pontic area – in contrast to the Volga–Ural region – , there was a clear “colonization” wave of Repin settlers, also supported by Ukrainian researchers, based on the number of new settlements and burials, and on the progressive retreat of Dereivka, Kvitjana, as well as (more recent) Maykop- and Trypillia-related groups from the North Pontic area ca. 3350/3300 BC. It seems unlikely that these expansionist, semi-nomadic, cattle-breeding, patrilineally-related steppe clans that were driving all native populations out of their territories suddenly decided, at some point during their spread into the North Pontic area ca. 3300-3100 BC, to join forces with some foreign male lineages from the area, and then continue their expansion to the west…
  2. Similar to the fate of R1b-P297 subclades in the Baltic after the expansion of Corded Ware migrants, previous haplogropus of the North Pontic region – such as R1a, R1b-V88, and I2 subclades basically disappeared from the ancient DNA record after the expansion of Khvalynsk-Novodanilovka, and then after the expansion of Yamna, as is clear from Yamna, Afanasevo, and Bell Beaker samples obtained to date. This, in combination with what we know about Y-chromosome bottlenecks in post-Neolithic expansions, leaves little space to think that a big enough territorial group with a majority of “native” haplogroups could survive later expansions (be it R1b-L51 or R1a-Z645).
  3. Supporting an expansion of the same male (and partly female) population, the Yamna admixture from east to west is quite homogeneous, with the only difference found in (non-significant) EEF-like proportion which becomes elevated in distant areas [apart from significant ‘southern’ contribution to certain outlier samples]. Based on the also homogeneous Y-DNA picture, the heterogeneity must come, in general, from the female exogamy practiced by expanding groups.
  4. There is a short period, spanning some centuries (approximately 3300-2700 BC), in which the North Pontic area – especially the forest-steppe territories to the west of the Dnieper, i.e. the Upper Dniester, Boh, and Prut-Siret areas – are a chaos of incoming and emigrating, expanding and shrinking groups of different cultures, such as late Trypillian groups, Maykop-related traits, TRB, GAC, (Proto-)Corded Ware, and Early Yamna settlements. No natural geographic frontier can be delimited between these groups, which probably interacted in different ways. Nevertheless, based on their cultural traits, admixture, and especially on their Y-DNA, it seems that they never incorporated foreign male lineages, beyond those they probably had during their initial expansion trends.
  5. The further expansionist waves of Early Yamna seen ca. 3100 BC, from the Danube Delta to the west, give an overall image of continuously expanding patrilineal clans of R1b-M269 subclades since the Khvalynsk-Novodanilovka migration, in different periodic steps, mostly from eastern Pontic-Caspian nuclei, usually overriding all encountered cultures and (especially male) populations, rather than showing long-term collaboration and interaction. Such interaction is seen only in exceptional cases, e.g. the long-term admixture between Abashevo and Poltavka, as seen in Proto-Indo-Iranian peoples and their language.
PCA-Ukraine-r1b-l51
Image modified from Wang et al. (2018). PCA of ancient and modern samples. Arrows depicting Khvalynsk -> Yamna drift (blue), and hypothetic approximate Ukraine Eneolithic -> Yamna drift accompanying R1b-L51 (red).

Consequences

We are living right now an exemplary ego-, (ethno-)nationalism-, and/or supremacy-deflating moment, for some individuals of eastern and northern European descent who believed that R1a or ‘steppe ancestry proportions’ meant something special. The same can be said about those who had interiorized some social or ethnolinguistic meaning for the origin of R1b in western Europe, N1c in north-eastern Europe, as well as Greeks, Iranians, Armenians, or Mediterranean peoples in general of ‘Near Eastern’ ancestry or haplogroups, or peoples of Near Eastern origin and/or language.

These people had linked their haplogroups or ancestry with some fantasy continuity of ‘their’ ancestral populations to ‘their’ territories or languages (or both), and all are being proven wrong.

Apart from teaching such people a lesson about what simplistic views are useful for – whether it is based on ABO or RH group, white skin, blond hair, blue eyes, lactase persistence, or on the own ancestry or Y-DNA haplogroup -, it teaches the rest of us what can happen in the near future among western Europeans. Because, until recently, most western Europeans were comfortably settled thinking that our ancestors were some remnant population from an older, Palaeolithic or Mesolithic population, who acquired Indo-European languages by way of cultural diffusion in different periods, including only minor migrations.

Judging by what we can see now among some individuals of Northern and Eastern European descent, the only thing that can worsen the air of superiority among western Europeans is when they realize (within a few years, when all these stupid battles to control the narrative fade) that not only are they the cultural ‘heirs’ of the Graeco-Roman tradition that began with the Roman Empire, but that most of them are the direct patrilineal descendants of Khvalynsk, Yamna, Bell Beaker, and European Bronze Age peoples, and thus direct descendants of Middle PIE, Late PIE, and NWIE speakers.

steppe-chalcolithic-migrations
Steppe-related migrations ca. 3100-2600 BC with tentative linguistic identification.

The finding of R1b-L51 and R1b-Z2103 among expanding Suvorovo-Novodanilovka chieftains, with pockets of R1b-L51 remaining in steppe-like societies of the Balkans and the Carpathian Basin, would have beautifully complemented what we know about the East Yamna admixture with R1a-Z93 subclades (Uralic speakers) ca. 2600-2100 BC to form Proto-Indo-Iranian, and about the regional admixtures seen in the Balkans, e.g. in Proto-Greeks, with the prevalent J subclades of the region.

It would have meant an end to any modern culture or nation identifying themselves with the ‘true’ Late PIE and Yamna heirs, because these would be exclusively associated with the expansion of R1b-Z2103 subclades with late Repin, and later as the full-fledged Late PIE with Yamna settlers to south-east and central Europe, and to the southern Urals. The language would have had then obviously undergone different language changes in all these territories through long-lasting admixture with other populations. In that sense, it would have ended with the ideas of supremacy in western Europe before they even begin.

The most likely future

However limited the evidence, it seems that R1b-L51 expanded with Yamna, though, based on the estimates for the haplogroups involved, and on marginal hints at the variability of L23 subclades within Yamna and neighbouring populations. If R1b-L51 expanded with West Repin / Early Yamna settlers, this is why they have not yet been found among Yamna samples:

steppe-eneolithic-migrations
Simplified map of Repin expansions from ca. 3500/3400 BC.
  • The subclade division of Yamna settlers needs not be 50:50 for L51:Z2103, either in time or in space. I think this is the simplistic view underlying many thoughts on this matter. Many different expanding patrilineal clans of L23 subclades may have been more or less successful in different areas, and non-Z2103 may have been on the minority, or more isolated relative to Z2103-clans among expanding peoples on the steppe, especially on the east. In fact, we usually talk in terms of “Z2103 vs. L51” as if
    1. these two were the only L23 subclades; and
    2. both had split and succeeded (expanding) synchronously;

    that is, as if there had not been multiple subclades of both haplogroups, and as if there had not been different expansion waves for hundreds of years stemming from different evolving nuclei, involving each time only limited (successful) clans. Many different subclades of haplogroups L23 (xZ2103, xL51), Z2103, and L51 must have been unsuccessful during the ca. 1,500 years of late Khvalynsk and late Repin-Early Yamna expansions in which they must have participated (for approximately 60-75 generations, based on a mean 20-25 years).

  • If we want to imagine a pocket of ‘hidden’ L51 for some region of the North Pontic or Carpathian region, the same can be imagined – and much more likely – for any unsampled territory of expanding late Repin/Early Yamna settlers from the Lower Don – Lower Volga region (probably already a mixed society of L51 and Z2103 subclades since their beginning, as the early Repin culture, ca. 3800 BC), with L51 clans being probably successful to the west.
  • The Repin culture expanded only in small, mobile settlements from the Lower Don – Lower Volga to the north, east, and south, starting ca. 3500/3400 BC, in the waves that eventually gave a rather early distant offshoot in the Altai region, i.e. Afanasevo. Starting ca. 3300 BC in the archaeological record, the majority of R1b-Z2103 subclades found to date in Afanasevo also supports either
    • a mixed Repin society, with Z2103-clans predominating among eastern settlers; or
    • a Repin society marked by haplogroup L51, and thus a cultural diffusion of late Repin/Early Yamna traits among neighbouring (Khvalynsk, Samara, etc.) groups of essentially the same (early Khvalynsk-Novodanilovka) genetic stock in the Volga–Ural region.

    Both options could justify a majority of Z2103 in the Lower Volga–Ural region, with the latter being supported by the scattered archaeological remains of late Repin in the region before the synchronous emergence of Early Yamna findings in the whole Pontic-Caspian steppe.

  • Most Z2103 from Yamna samples to date are from around 3100 BC (in average) onward, and from the right bank of the Lower Don to the east, particularly from the Lower Volga–Ural area (especially the Samara region), which – based on the center of expansion of late Repin settlers – may be depicting an artificially high Z2103-distribution of the whole Yamna community.
repin-expansion-khvalynsk-cultures
Repin expansion into the Volga–Ural region from ca. 3500/3400 BC. Map made by me based on maps and data from Morgunova (2014, 2016). Lopatino is marked with number 64.
  • Yamna sample I0443, R1b-L23 (Y410+, L51-), ca. 3300-2700 BCE from Lopatino II, points to an intermediate subclade between L23 and L51, near one of the supposed late Repin sites (based on kurgan burials with late Repin cultural traits) in the Samara region.
  • Other Balkan cultures potentially unrelated to the Yamna expansion also show Z2103 (and not only L51) subclades, like I3499 (ca. 2884-2666 calBC), of the Vučedol culture, from Beli Manastir-Popova zemlja, which points to the infiltration of Yamna peoples in other cultures. In any case, the appearance of R1b-L23 subclades in the region happens only after the Yamna expansion ca. 3100 BC, probably through intrusions into different neighbouring regions, if these Balkan cultures are not directly derived from Yamna settlements (which is probably the case of the Csepel Bell Beaker or early Nagýrev sample, see above).
  • The diversity of haplogroups found in or around the Carpathian Basin in Late Chalcolithic / Early Bronze Age samples, including L151(xP312, xU106), P312, U106, Z2103, makes it the most likely sink of Yamna settlers, who spread thus with expanding family clans of different R1b-L23 subclades.
  • Even though some Yamna vanguard groups are known to have expanded up to Saxony-Anhalt before ca. 2700 BC, haplogroup Z2103 seems to be restricted to more eastern regions, which suggests that R1b-L51 was already successful among expanding West Yamna clans in Hungary, which gave rise only later to expanding East Bell Beakers (overwhelmingly of L151 subclades). The source of R1b-L51 and L151 expansion over Z2103 must lie therefore in the West Yamna period, and not in the Bell Beaker expansion.
indo-european-uralic-migrations-yamna-gac
Yamna migrants ca. 3300-2600. Most likely site of admixture with GAC circled in red.
  • The R1b-Z2103 found in Poltavka, Catacomb, and to the south point to a late migration displacing the western R1b-L51, only after the late Repin expansion. This is also seen in the steppe ancestry and R1b-Z2103 south of the Caucasus, in Hajji Firuz, which points to this route as a potential source of the supposed “Earliest Proto-Indo-Iranian” (the mariannu term) of the Near East. A similar replacement event happened some centuries later with expanding R1a-Z93 subclades from the east wiping out haplogroup R1b-Z2103 from the Pontic-Caspian steppe.
  • Many ancient samples from Khvalynsk, Northern Caucasus, Yamna, or later ones are reported simply as R1b-M269 or L23, without a clear subclade, so the simplistic ‘Yamna–Z2103’ picture is not real: if one takes into account that Z2103 might have been successful quite early in the eastern region, it is more likely to obtain a successful Y-SNP call of a Z2103 subclade in the Volga–Ural region than a xZ2103 one.
  • There are some modern samples of R1b-L51 in eastern Europe and Asia, whose common simplistic attribution to “late expansions” is usually not substantiated; and also ancient R1b-L51 samples might be confirmed soon for Asia.
  • ‘Western’ features described by archaeologists for West Yamna settlers, associated with Kemi Oba and southern Yamna groups in the North Pontic area – like rich burials with anthropomorphic stelae and wagons – are actually absent in burials from settlers beyond Bulgaria, which does not support their affiliation with these local steppe groups of the Black Sea. Also, a mix with local traditions is seen accross all Early Yamna groups of the Pontic-Caspian steppe, and still genetics and common cultural traits point to their homogeneization under the same patrilineal clans expanding continuously for centuries. The maintenance of local traditions (as evidenced by East Bell Beakers in Iberia related to Iberian Proto-Beakers) is often not a useful argument in genetics, especially when the female population is not replaced.
yamna-settlers-hungary
Yamna settlers in the Great Pannonian Plain, showing only kurgans of Hungary ca. 2950-2500 BC. Yamna Hungary was one of the biggest West Yamna provinces. From Hórvath et al. (2013).

Conclusion

This is what we know, using linguistics, archaeology, and genetics:

  • Middle Proto-Indo-European expanded with Khvalynsk-Novodanilovka after ca. 4800 BC, with the first Suvorovo settlements dated ca. 4600 BC.
  • Archaic Late Proto-Indo-European expanded with late Repin (or Volga–Ural settlers related to Khvalynsk, influenced by the Repin expansion) into Afanasevo ca. 3500/3400 BC.
  • Late Proto-Indo-European expanded with Early Yamna settlers to the west into central Europe and the Balkans ca. 3100 BC; and also to the east (as Pre-Proto-Indo-Iranian) into the southern Urals ca. 2600 BC.
  • North-West Indo-European expanded with Yamna Hungary -> East Bell Beakers, from ca. 2500 BC.
  • Proto-Indo-Iranian expanded with Sintashta, Potapovka, and later Andronovo and Srubna from ca. 2100 BC.

It seems that the subclades from Khvalynsk ca. 4250-4000 BC were wrongly reported – like those of Narasimhan et al. (2018). However, even if they are real and YFull estimates have to be revised, and even if the split had happened before the expansion of Suvorovo-Novodanilovka, the most likely origin of R1b-L51 among Bell Beakers will still be the expansion of late Repin / Early Yamna settlers, and that is what ancient DNA samples will most likely show, whatever the social or political consequences.

The only relevance of the finding of R1b-L51 in one place or another – especially if it is found to be a remnant of a Middle PIE expansion coupled with centuries of admixture and interaction in the Carpathian Basin – is the potential influence of an archaic PIE (or non-IE) layer on the development of North-West Indo-European in Yamna Hungary -> East Bell Beaker. That is, more or less like the Uralic influence related to the appearance of R1a-Z93 among Proto-Indo-Iranians, of R1a-Z284 among Pre-Germanic peoples, and of R1a-Z282 among Balto-Slavic peoples.

I think there is little that ancient DNA samples from West Yamna could add to what we know in general terms of archaeology or linguistics at this point regarding Late PIE migrations, beyond many interesting details. I am sure that those who have not attributed some random 6,000-year-old paternal ancestor any magical (ethnic or nationalist) meaning are just having fun, enjoying more and more the precise data we have now on European prehistoric populations.

As for those who believe in magical consequences of genetic studies, I don’t think there is anything for them to this quest beyond the artificially created grand-daddy issues. And, funnily enough, those who played (and play) the ‘neutrality’ card to feel superior in front of others – the “I only care about the truth”-type of lie, while secretly longing for grandpa’s ethnolinguistic continuity – are suffering the hardest fall.

Related

Y-DNA haplogroup R1b-Z2103 in Proto-Indo-Iranians?

chalcolithic_early-asia

We already know that the Sintashta -> Andronovo migrants will probably be dominated by Y-DNA R1a-Z93 lineages. However, I doubt it will be the only Y-DNA haplogroup found.

I said in my predictions for this year that there could not be much new genetic data to ascertain how Pre-Indo-Iranian survived the invasion, gradual replacement and founder effects that happened in terms of male haplogroups after the arrival of late Corded Ware migrants, and that we should probably have to rely on anthropological explanations for language continuity despite genetic replacement, as in the Basque case.

Nevertheless, since we have very few samples, I think we could still see a clear genetic contribution from Yamna to Corded Ware immigrants in the North Caspian region (from Abashevo, in turn a mix of Fatyanovo/Balanovo and Catacomb/Poltavka cultures) in terms of:

  • Ancestral components and PCA in new Sintashta-Petrovka, Andronovo, and/or later samples – similar the ‘steppe’ drift seen in Potapovka relative to Sintashta samples, both formed by incoming Corded Ware migrants – ; and
  • R1b-L23 subclades, either appearing scattered during the Sintashta melting pot (of Abashevo/R1a-Z645 and East Yamna-Poltavka/R1b-Z2103 peoples), or resurging after this period, as we have seen in Pre-Balto-Slavic territory.

This contribution could better explain the obvious language continuity in the region, beautifully complementing the complex anthropological model we have now of archaeological continuity of Sintashta and Potapovka with the previous Poltavka, seen in a similar material and symbolic culture that survived the arrival of newcomers.

A lot of people seem to be looking like crazy since O&M 2018 for some sort of connection between Corded Ware and Yamna migrants in Eastern and Central Europe (wheter in SNP calls of samples published, or among almost forgotten academic papers), either to support the ideas of the 2015 papers – for those who relied on their conclusions and built (even if only mentally) far-fetched migration models around it – , or just because of some sort of absurd continuity theory involving modern R1a-Z645 subclades:

NOTE. The situation we have seen with the hundreds of samples from O&M 2018, and with the recent additional Eastern European samples, depict an unexpected absolutely clear-cut distinction in Y-DNA haplogroups between Corded Ware and Yamna/Bell Beaker: I really can’t see how the situation could be more obvious for everyone, so I doubt any further samples will make certain people change their minds. Their hope is, I guess, that just one sample may give some more oxygen to infinite pet theories, as we are still surprisingly seeing even with reactionary R1b autochthonous continuists in Western Europe…

However, looking into the most likely future for the field, what we should be expecting right now is continuity of Yamna ancestry and lineages in early Proto-Indo-Iranian territory. Since we only have a few samples from Sintashta-Petrovka, Potapovka, and Andronovo, I think there might be a sizeable number of R1b-Z2103 subclades in the territory inhabited by those who – no doubt – spread the language into Central Asia.

Haplogroup_R1b_(Y-DNA)
Modern Y-DNA haplogroup R1b distribution, by Maulucioni at Wikipedia

While full population replacement by R1a-Z93 lineages in the North Caspian region ca. 2000 BC is not impossible, I don’t think it is very likely, since we already know that there are R1b-Z2103 lineages widely distributed in Indo-Iranian-speaking territory, and Z93 is now known to be an older subclade than YFull’s mean formation date suggested (due to the Ukraine_Eneolithic I6561 sample‘s SNP call), so what we can infer now that actually happened in Sintashta -> Andronovo is not exactly the spread of haplogroup Z93 during its formation, but rather a regional reduction in its variability coupled with the expansion of some of its subclades.

The main question, after the South Asia paper is finally published, will then be:

  1. Given that Yamna peoples were an elite group of patrilineally-related families mainly of R1b-L23 subclades:
  2. Accepting that PCA, ADMIXTURE, and other statistical methods are not relevant (alone) for ethnolinguistic identification: e.g. Yamna ‘outliers’ and East Bell Beaker migrants of R1b-L23 lineages without steppe ancestry; N1c1a1a-L392 lineages and Siberian ancestry unrelated to Uralic speakers; R1a-Z645 and steppe ancestry in North-East Europe related to Uralic-speaking cultures
  3. If we find now, as I expect, genetic continuity of east Yamna in Sintashta -> Andronovo (relative to other late Corded Ware peoples), probably including haplogroup R1b-Z2103 mixed with R1a-Z93 before its further reduction of subclades (e.g. to L657) and expansion during its subsequent spread southward…

bronze_age_early_Asia-andronovo
Diachronic map of migrations in Asia ca. 2250-1750 BC

Why exactly do we need Corded Ware to explain migrations of Late Indo-European speakers?

In other words: if we had the data we have today in 2015, would we have a need for Corded Ware to explain Indo-European migrations from the steppe? Are some people so blinded by their will to (appear to) be right in their past interpretations that they can’t just let go?

NOTE. On a side note, wouldn’t it be nice for this paper to publish some other R1b-L23 (x2103) sample – maybe even R1b-L51 – in Yamna, Andronovo, or Afanasevo territory, to end both autochthonous continuity theories (of North-Eastern and Western Europe) at the same time?

I really hope someone in David Reich’s team understands this matter, or else they will still identify Corded Ware as the (now probably ‘a’ instead) vector of expansion of Indo-European languages, and some of us will still have fun for another 2 or 3 years with such conclusions, until someone in the lab realizes that ancestry ≠ population ≠ ethnic identification ≠ language.

NOTE. It seems rather dull to read how people are discussing in the Twitterverse conventional constructs like ‘human race‘ as found in Reich’s op-ed in The New York Times, as if such grandiose semantic discussions had any practical meaning, when basic anthropological questions actually relevant for Genomics, like the essential ancestral component ≠ people tenet seem not to be of interest for anyone in the field….

Since our Indo-European demic difusion model (and its consequences for our reconstruction of North-West Indo-European) and this blog are becoming more and more popular each day – judging by the constant growth in visits in the past 6 months or so – , I guess the simplemindedness and predictability of certain geneticists is benefitting traditional anthropology directly, driving more and more amateur geneticists to look for sound academic models to answer the growing inconsistencies of genetic research.

NOTE. I am not saying the rejection of Corded Ware as spreading Indo-European is definitive. Maybe more samples within some years will depict a clear ancient expansion of Early or Middle Proto-Indo-Europeans from Khvalynsk to the forest-steppe and forest zone, and later with certain Corded Ware migrants into Central Europe, over whose territory a Late Indo-European dialect from Bell Beakers became the superstrate, as some have proposed in the past – e.g. to explain Krahe’s Old European hydronymy. I really doubt you could demonstrate such an old ethnolinguistic identification with a clear, unbroken archaeological trail, though, and we know now that this old hydronymy is probably of Late Indo-European nature (possibly even more recent).

What I am saying is: with the data we have now, it does not make any sense to keep the anthropological models invented by geneticists ex nihilo in 2015, and the hundred different alternative Late Indo-European migration models that arebornwitheachnewpaper.

These Yamna -> Corded Ware migration models didn’t have any sense for me since early 2016, but now after O&M 2017, and especially O&M 2018, I don’t think any geneticist with a little knowledge in Linguistics or Archaeology (if they are decent about their quest for truth in describing ancient European migrations) would buy them, if not for some sort of created ‘tradition’. So let’s ditch Corded Ware as Late Indo-European-speaking, let’s accept that late Corded Ware migrants should most likely be identified as early Uralic speakers, and then future data will tell if we are – again – wrong.

Please, don’t let Genomics become another pseudoscience based solely on Bioinformatics like glottochronology: let anthropologists (preferably mainstream archaeologists, but also the true Indo-Europeanists, linguists) help you interpret your raw data. Don’t deceive yourselves thinking that you have read enough about the Indo-European question, or that you know enough Indo-Europeanists (say what?) to derive your own conclusions.

Use the South Asia paper to begin expressly retracting the Corded Ware mess.

Please pretty please with sugar on top?

Related:

For commenters: this post concerns an anthropological question, and deals with the expansion of Late Proto-Indo-European speakers from Yamna, and the controversy surrounding the role of Corded Ware migrants that a handful of academics propose spread from it, based on a renewed model of Gimbutas’ outdated Kurgan theory and on the so-called ‘Yamnaya’ ancestry.

It happens so that the discussion has turned lately mainly to ancient Y-DNA haplogroups, because they help confirm previous mainstream anthropological models of cultural diffusion and migration. It is obviously not reasonable to judge prehistoric ethnolinguistic migrations from ca. 5,000 years ago based on historical nation-states and ethnic or religious concepts invented since the Middle Ages, coupled with “your” people’s main modern (or your own) paternal lineage.

EDIT (27 MAR 2018): Minor corrections and post made shorter.

Uralic as a Corded Ware substrate of Indo-Iranian, and loanwords in Finno-Ugric

bronze_age_early-sejma-turbino

Asko Parpola has recently published a new paper, Finnish vatsa ~ Sanskrit vatsá and the formation of Indo-Iranian and Uralic languages.

Abstract:

Finnish vatsa ‘stomach’ < PFU *vaćća < Proto-Indo-Aryan *vatsá- ‘calf’ < PIE *vet-(e)s-ó- ‘yearling’ contrasts with Finnish vasa- ‘calf’ < Proto-Iranian *vasa- ‘calf’. Indo-Aryan -ts- versus Iranian -s- refl ects the divergent development of PIE *-tst- in the Iranian branch (> *-st-, with Greek and Balto-Slavic) and in the Indo-Aryan branch ( > *-tt-, probably due to Uralic substratum). The split of Indo-Iranian can be traced in the archaeological record to the differentiation of the Yamnaya culture in the North Pontic and Volga steppes respectively during the third millennium BCE, due to the use of separate sources of metal: the Iranian branch was dependent on the North Caucasus, while the Indo-Aryan branch was oriented towards the Urals. It is argued that the Abashevo culture of the Mid-Volga-Kama-Belaya basins and the Sejma-Turbino trade network (2200–1900 BCE) were bilingual in Proto-Indo-Aryan and PFU, and introduced the PFU as the basis of West Uralic (Volga-Finnic) into the Netted Ware Culture of the Upper Volga-Oka (1900–200 BCE).

He updates thus his quite recent model from On the emergence, contacts and dispersal of Proto-Indo-European, Proto-Uralic and Proto-Aryan in an archaeological perspective (2017).

In it he supported a North-West Indo-European expansion with Corded Ware, and a Neolithic Proto-Uralic community in East Europe (associated with the Comb Ware culture), as I did before the famous 2015 papers.

In fact, he supports that the satemization trend of Proto-Indo-Iranian is due to a Proto-Finno-Ugric substratum in its population in the Volga-Ural region, similar to the model I propose (with the Corded Ware substratum hypothesis).

NOTE. While for Parpola the ‘satemizing’ substratum of Balto-Slavic (a NWIE dialect) may not come exactly from the same Finno-Ugric population as for Indo-Iranian, but from a different Uralic dialect (as I explain in my hypothesis), for the few extant supporters of an Indo-Slavonic group there should not be any problem identifying the same ancient substrate as for the Proto-Indo-Iranian population…

Now that North-West Indo-European is clearly associated with the Yamna -> Bell Beaker expansion, I understand that his previous model is obsolete and needs a revision.

I find it especially difficult to understand (in light of his previous theory) why he compares Indo-Aryan *vatsa– and Iranian *vasa– to assert that the former is the origin of the loanword in Finno-Ugric, when the Proto-Indo-Iranian form is essentially the same as the Indo-Aryan one, with respect to the *w– evolution into *v– in both PII and late FU dialects…

NOTE: I wrote him yesterday asking for this issue, I will post here his answer.

EDIT (20 MAR 2018): The summary of his answer regarding his selection of Indo-Aryan *vatsa– vs. Iranian *vasa– (instead of just PII *watsa-/vatsa-) is one based on Archaeology (and likley guesstimates), since he understands the split into Iranian and Indo-Aryan to have happened early within the Yamna culture, so that the cultural admixture of Abashevo must have happened after the separation.

netted-ware-parpola
Potential spread of Finnic. “Distribution of the Netted Ware according to Carpelan (2002: 198). A: Emergence of the Netted Ware on the Upper Volga c. 1900 calBC. B: Spread of Netted Ware by c. 1800 calBC. C: Early Iron Age spread of Netted Ware. (After Carpelan 2002: 198 > Parpola 2012a: 151.)

His effort to link the actual expansion of Finno-Ugric to Corded Ware territory, linking it also partially to population movements from the Seima-Turbino phenomenon – probably associated with the initial expansion of N1c lineages – is another good example of convergence of the different anthropological theories thanks to recent Genomic studies.

Related:

First Iberian R1b-DF27 sample, probably from incoming East Bell Beakers

bronze_age_iberia

I had some more time to read the paper by Valdiosera et al. (2018) and its supplementary material.

One of the main issues since the publication of Olalde et al. (2018) (and its hundreds of Bell Beaker samples) was the lack of a clear Y-DNA R1b-DF27 subclades among East Bell Beaker migrants, which left us wondering when the subclade entered the Iberian Peninsula, since it could have (theoretically) happened from the Chalcolithic to the Iron Age.

My prediction was that this lineage found today widespread among the Iberian population crossed the Pyrenees quite early, during the Chalcolithic, with migrating East Bell Beakers expanding North-West Indo-European dialects, and that it spread slowly afterwards.

The first ancient sample clearly identified as of R1b-DF27 subclade is found in this paper, at the Late Bronze Age site Cueva de los Lagos. Although it is unidentified and has no radiocarbon date, the site as a whole is associated with the Cogotas culture and its Bouquique ceramic decoration.

iberia-y-dna-mtdna
Y-DNA and mtDNA haplogroups, from the paper. Sequencing statistics and contamination rates for newly generated sequence data.

It was found in the northern part of the Cogotas culture territory (which lies mainly between Castille and Aragon, in North-Central Spain), shows evident steppe admixture, and it has become obvious with the latest papers (including this one) that R1b-M269 lineages intruded south of the Pyrenees associated with East Bell Beaker migrations.

The Proto-Cogotas culture is associated with a Bell Beaker substrate influenced by either El Argar or Atlantic Bronze, and the specific type of ceramics found at this Cogotas culture site are probably from the mid-2nd millennium, which is too early for the Celtic expansion.

iberia-steppe-admixture
Supervised ADMIXTURE results.

Nevertheless, due to the quite likely late date of the sample (in the centuries around 1500 BC), there is still a possibility that incoming R1b-DF27 lineages were not among the early R1b-M269 lineages found in the Iberian Chalcolithic, and were associated with later migrations from Central Europe, potentially linked to the expansion of the Urnfield culture, and thus nearer to an Italo-Celtic community.

bronze-age-tollense
Diachronic map of migrations in Europe ca. 1250-750 BC.

In any of these scenarios, a Pre-Celtic expansion of North-West Indo-European in Iberia (possibly associated with Lusitanian) is still the best explanation for the origin and expansion of (at least some) modern Iberian R1b-DF27 lineages, including those found among the Basque-speaking population.

This implies that the ‘indigenous’ Neolithic lineages of Iberia (like I2 and G2a2) were replaced with subsequent internal gene flows and founder effects, such as those that evidently happened (probably quite recently) among Basques, even though indigenous languages show an obvious continuity.

I would say this is the last nail in the coffin for autochthonous Y-DNA continuity theories for Spain and France (i.e. for the traditional Vasconic-Uralic hypothesis), but we know that data is never enough for any die hard continuist…so let’s just say another nail in the coffin for endless autochthonous continuity theories.

EDIT (18 & 26 MAR 2018): Genetiker has published Y-SNP calls for both R1b samples, showing this one is R1b1a1a2a1a2a-BY15964 (see modern members of this subclade in ytree), and that the other one is R1b1a1a2a~L23.

Related:

Olalde et al. and Mathieson et al. (Nature 2018): R1b-L23 dominates Bell Beaker and Yamna, R1a-M417 resurges in East-Central Europe during the Bronze Age

The official papers Olalde et al. (Nature 2018) and Mathieson et al. (Nature 2018) have appeared. They are based on the 2017 preprints at BioRxiv The Beaker Phenomenon And The Genomic Transformation Of Northwest Europe and The Genomic History Of Southeastern Europe respectively, but with a sizeable number of new samples.

Papers are behind a paywall, but here are the authors’ shareable links to read the papers and supplementary materials: Olalde et al. (2018), Mathieson et al. (2018).

NOTE: The corresponding datasets have been added to the Reich Lab website. Remember you can use my drafts on DIY Human Ancestry analysis (viz. Plink/Eigensoft, PCA, or ADMIXTURE) to investigate the data further in your own computer.

olalde_pca
Image modified by me, from Olalde et al (2018). PCA of 999 Eurasian individuals. Marked is the late CWC outlier sample from Esperstedt, showing how early East Bell Beaker samples are the closest to Yamna samples.

I don’t have time to analyze the samples in detail right now, but in short they seem to convey the same information as before: in Olalde et al. (2018) the pattern of Y-DNA haplogroup and steppe ancestry distribution is overwhelming, with an all-R1b-L23 Bell Beaker people accompanying steppe ancestry into western Europe.

EDIT: In Mathieson et al. (2018), a sample classified as of Ukraine_Eneolithic from Dereivka ca. 2890-2696 BC is of R1b1a1a2a2-Z2103 subclade, so Western Yamna during the migrations also of R1b-L23 subclades, in contrast with the previous R1a lineages in Ukraine. In Olalde et al. (2018), it is clearly stated that of the four BB individuals with higher steppe ancestry, the two with higher coverage could be classified as of R1b-S116/P312 subclades.

This is compatible with the expansion of Indo-European-speaking Yamna migrants (also mainly of R1b-L23 subclades) into the East Bell Beaker group, as described with detail in Archaeology (and with the population movement we are seeing having been predicted) first by Volker Heyd in 2007.

yamna-bell-beaker
Yamna – East Bell Beaker migration 3000-2300 BC. Adapted from Harrison and Heyd (2007), Heyd (2007)

Also, the resurge of R1a-Z645 subclades in Czech and Polish lands (from previous Corded Ware migrants) accompanying other lineages indigenous to the region – seems to have happened only after the Bell Beaker expansion into these territories, during the Bronze Age, probably leading to the formation of the Balto-Slavic community, as I predicted based on previous papers. The fact that a sample of R1b-U106 subclade pops up in this territory is interesting from the point of view of a shared substrate with Germanic, as is the earlier BB sample of R1b-Z2103 for its connection with Graeco-Aryan dialects.

All this suggests that a North-West Indo-European dialect – ancestor of Italo-Celtic, Germanic, and Balto-Slavic -, supported in Linguistics by most modern Indo-European schools of thought, expanded roughly along the Danube, and later to northern, eastern, and western Europe with the Bell Beaker expansion, as supported in Anthropology by Mallory (in Celtic from the West 2, 2013), and by Prescott for the development of a Nordic or Pre-Germanic language in Scandinavia since 1995.

copper-age-late-bell-beaker
Diachronic map of Late Copper Age migrations including Classical Bell Beaker (east group) expansion from central Europe ca. 2600-2250 BC

Maybe more importantly, the fact that only Indo-Iranian-speaking Sintashta-Petrovka (and later Andronovo) cultures were clearly associated with R1a-Z645 subclades, and rather late – after mixing with early Chalcolithic North Caspian steppe groups (mainly East Yamna and Poltavka herders of R1b-L23 subclades) – gives support to the theory that Corded Ware (and probably the earlier Sredni Stog) groups did not speak or spread Indo-European languages with their migration, but most likely Uralic – as seen in recent papers on the much later arrival of haplogroup N1c – (compatible with the Corded Ware substrate hypothesis), adopting Indo-Iranian by way of cultural diffusion or founder effect events.

As Sheldon Cooper would say,

Under normal circumstances I’d say I told you so. But, as I have told you so with such vehemence and frequency already the phrase has lost all meaning. Therefore, I will be replacing it with the phrase, I informed you thusly

I informed you thusly:

The Indo-European demic diffusion model, and the “R1b – Indo-European” association

yamna_bell_beaker_cut

Beginning with the new year, I wanted to commit myself to some predictions, as I did last year, even though they constantly change with new data.

I recently read Proto-Indo-European homelands – ancient genetic clues at last?, by Edward Pegler, which is a good summary of the current state of the art in the Indo-European question for many geneticists – and thus a great example of how well Genetics can influence Indo-European studies, and how badly it can be used to interpret actual cultural events – although more time is necessary for some to realize it. Notice for example the distribution of ‘Yamnaya’ in 3000 BC, all the way to Latvia (based on the initial findings of Mathieson et al. 2017), and the map of 2000 BC with ‘Corded Ware’, both suggesting communities linked by admixture and unrelated to actual cultures.

Some people – especially those interested in keeping a simplistic picture of Europe, either divided into admixture groups or simplistic R1b-Vasconic / R1a-Indo-European / N1c-Uralic (or any combination thereof) – want (others) to believe that I am linking ‘Indo-Europeans’ with haplogroup R1b. That is simply not true. In fact, my model dismisses such simplistic identifications of the reconstructible proto-languages with any modern peoples, admixtures, or haplogroups.

vasconic-uralic
Simplistic Vasconic/R1b-Uralic/N1c distribution, and intruding Indo-European/R1a, according to Wiik.

The beauty of the model lies, therefore, precisely in that if you take any modern group speaking Indo-European languages, none can trace back their combination of language, admixture, and/or haplogroup to a common Indo-European-speaking people. All our ancestral lines have no doubt changed language families (and indeed cultures), they have admixed, and our European regions’ paternal lines have changed, so that any dreams of ‘purity’ or linguistic/cultural/regional continuity become absurd.

That conclusion, which should be obvious to all, has been denied for a long time in blogs and forums alike, and is behind the effort of many of those involved in amateur genetics.

Main linguistic aim

The main consequence of the model, as the title of the paper suggests, is that reconstructible Indo-European proto-languages expanded with people, i.e. with actual communities, which is what we can assert with the help of Genomics. From a personal (or ethnic, or political) point of view genomics is useless, but from an anthropological (and thus linguistic) point of view, genomics can be a very useful tool to decide between alternative models of language diffusion, which has given lots of headaches to those of us involved in Indo-European studies.

The demic diffusion theory for the three main stages of the proto-language expansion was originally, therefore, a dismissal of impossible-to-prove cultural diffusion models for the proto-language – e.g. the adoption of Late Proto-Indo-European by Corded Ware groups due to a patron-client relationship (as proposed by Anthony), or a long-lasting connection between cultures (as proposed by Kristiansen, and favoured by “constellation analogy” proponents like Clackson, who negated the existence of common proto-languages). It also means the acceptance of the easiest anthropological model for language change: migration and – consequently – replacement.

By the time of the famous 2015 papers, I had been dealing for some time with the idea that the shared features between Indo-Iranian and Balto-Slavic may have been due to a common substrate, and must have therefore had some reflection in genomic finds. The data on these papers, and the addition of a weak connection between Pre-Germanic and Balto-Slavic communities, together with their clearest genetic link – R1a-M417 subclades (especially European Z283) – made it still easier to propose a Corded Ware substrate, partially common to the three.

Allentoft Corded Ware
Allentoft et al. “Arrows indicate migrations — those from the Corded Ware reflect the evidence that people of this archaeological culture (or their relatives) were responsible for the spreading of Indo-European languages. All coloured boundaries are approximate.”

Before the famous 2015 papers (and even after them, if we followed their interpretation), we were left to wonder why the supposed vector of expansion of Indo-European languages, Corded Ware migrants – represented by R1a-Z645 subclades, and supposedly continued unchanged into modern populations in its ‘original’ ancestral territories, Balto-Slavic and Indo-Iranian – , were precisely the (phonetically) most divergent Indo-European languages – relative to the parent Late Indo-European proto-language.

My paper implied therefore the dismissal of an unlikely Indo-Slavonic group, as proposed by Kortlandt, and of a still less factible Germano-Slavonic, or Germano-Indo-Slavonic (?) group, as loosely implied by some in the past, and maybe supported in certain archaeological models (viz. Kristiansen or partially Anthony), and presently by some geneticists since their simplistic 2015 papers on “massive migrations from the steppe“, and amateur genetic fans with infinite pet theories, indeed.

A common Corded Ware substrate to Balto-Slavic and Indo-Iranian, and common also partially between Balto-Slavic and Germanic (as supported by Kortlandt, too, albeit with different linguistic connotations), would explain their common features. The Corded Ware culture (and Uralic, tentatively proposed by me as the group’s main language family) is a strong potential connection between them, further supported by phylogeography, too.

Other consequences

Interpretations in my paper help thus dismiss the simplistic Yamna -> Corded Ware -> Bell Beaker migration model implied with phylogeography in the 2000s, and revived again by geneticists and Kristiansen’s workgroup based on the famous 2015 papers, whereby – due to the “Yamnaya ancestral component” – the Yamna culture would have been composed of communities of R1a-M417 and R1b-M269 lineages which remained against all odds ‘related but separated’ for more than two thousand years, sharing a common unitary language (why? and how?), and which expanded from Yamna (mainly R1b-L23) into Corded Ware (mainly R1a-M417) and then into Bell Beaker (mainly R1b-L51), in imaginary migration waves whose traces Archaeology has not found, or Anthropology described, before.

While phylogeography (especially the distribution of ancient samples of certain R1b and R1a subclades) was the main genetic aspect I used in combination with Archaeology and Anthropology to challenge the reliability of the “Yamnaya ancestral component” in assessing migrations – and thus Kristiansen’s now-popular-again modified Kurgan model – , my main aim was to prove a recent expansion of Late Proto-Indo-European from the steppe, and a still more recent expansion of a common group of speakers of North-West Indo-European, the language ancestral to Italo-Celtic, Germanic, and probably Balto-Slavic (or ‘Temematic’, the NWIE substrate of Balto-Slavic, according to some linguists).

My arguments serve for this purpose, and modern distributions of haplogroups or admixture are fully irrelevant: I am ready to change my view at any time, regarding the role of any haplogroup, or ancestral component, archaeological data, or anthropological migration model, to the extent that it supports the soundest linguistic model.

proto-indo-european-stages
Stages of Proto-Indo-European evolution. IU: Indo-Uralic; PU: Proto-Uralic; PAn: Pre-Anatolian; PToch: Pre-Tocharian; Fin-Ugr: Finno-Ugric. The period between Balkan IE and Proto-Greek could be divided in two periods: an older one, called Proto-Greek (close to the time when NWIE was spoken), probably including Macedonian, and spoken somewhere in the Balkans; and a more recent one, called Mello-Greek, coinciding with the classically reconstructed Proto-Greek, already spoken in the Greek peninsula (West 2007). Similarly, the period between Northern Indo-European and North-West Indo-European could be divided, after the split of Pre-Tocharian, into a North-West Indo-European proper, during the expansion of Yamna to the west, and an Old European period, coinciding with the formation and expansion of the East Bell Beaker group.

Gimbutas’ old theory of sudden and recent expansion served well to support a real community of Proto-Indo-European speakers, as did later the Yamna -> Corded Ware -> Bell Beaker theory that circulated in the 2000s based on modern phylogeography, and as did later partially Anthony’s updated steppe theory (2007). On the other hand, Kristiansen’s long-lasting connections among north-west Pontic steppe cultures and Globular Amphorae and Trypillian cultures, did not fit well with a close community expanding rapidly – although recent genetic data on Trypillia and Globular Amphorae might be compelling him to improve his migration theory.

So, if data turns out to be not as I expect now, I will reflect that in future versions of the paper. I have no problem saying I am wrong. I have been wrong many times before, and something I am certain is that I am wrong now in many details, and I am going to be in the future.

If, for example, R1b-L23(xZ2105) is demonstrated to come from Hungary and not the steppe (as supported by Balanovsky) or R1a-M417 samples are proved to have expanded with West Yamna settlers (as recently proposed by Anthony, see below the Balto-Slavic question), I would support the same model from a linguistic point of view, but modified to reflect these facts. Or if a direct migration link is found in Archaeology from Yamna to Corded Ware, and from Corded Ware to Bell Beaker (as proposed in the 2015 papers), I will revise that too (again, see the image below). Or, if – as Lazaridis et al. (2017) paper on Minoans and Mycenaeans suggested – the Anatolian hypothesis (that is, one of the multiple ones proposed) turns out to be somehow right, I will support it.

calcolithic-expansion
My map of Late Proto-Indo-European expansion (A Grammar of Modern Indo-European, 2006), following Gimbutas and Mallory.

Haplogroups are the least important aspect of the whole model, they are just another data that has to be taken into account for a throrough explanation of migrations. It has become essential today because of the apparent lack of vision on the part of geneticists, who failed to use them to adjust their findings of admixture with findings of haplogroup expansions, favouring thus a marginal theory of long-lasting steppe expansion instead of the mainstream anthropological models.

Since many of these alternative scenarios seem less and less likely with each new paper, it is probably more efficient to talk about which developments are most likely to challenge my model.

Main points

My main predictions – based mostly on language guesstimates, archaeological cultures, and anthropological models of migration -, even with the scarce genomic data we had, have been proven right until know with new samples from Mathieson et al. (2017) and Olalde et al. (2017), among other papers of this past year. These were my original assumptions:

(1) A Middle Proto-Indo-European expansion defined by the appearance of steppe ancestry + reduction in haplogroup diversity and expansion of (mainly) R1b-M269 and R1b-L23 lineages;

(2) A Late Proto-Indo-European expansion defined by steppe ancestry + reduction in haplogroup diversity and expansion of (mainly) R1b-L23 subclades; and

(3) A North-West Indo-European expansion defined by steppe ancestry + reduction in haplogroup diversity and expansion of (mainly) R1b-L51 subclades.

The expansion of Corded Ware peoples, associated with steppe ancestry + reduction in haplogroup diversity and expansion of (mainly) R1a-Z645 subclades, represents thus a different migration, which is compatible with the different nature of the Corded Ware culture, unrelated to Yamna and without migration waves from one to the other (although there were certainly contacts in neighbouring regions).

As you can see, neither of the 3+1 expansion models imply that no other haplogroup can be found in the culture or regions involved (others have in fact been found, and still the models remain valid): these migrations imply a reduction of haplogroup diversity, and the expansion of certain subclades as is common in population expansions throughout history. While we all accept this general idea, some people have difficulties accepting just those cases not compatible with their dreams of autochthonous continuity.

Nevertheless, there are still voids in genetic investigation.

Controversial aspects

In my humble opinion, these are potential conflict periods and the most likely areas of change for the future of the theory:

1. When and how did R1b-M269 lineages become “chiefs” in the steppe?

Based on scarce data from Khvalynsk, it seems that during the Neolithic there were many haplogroups in the North Pontic and North Caspian steppes. A reduction to R1b-M269 subclades must have happened either just before or (as I support) during (the migrations that caused) the Suvorovo-Novodanilovka expansion among Sredni Stog, probably coinciding also with the expansion (or one of the expansions) of CHG ancestry (and thus the appearance of ‘Steppe component’ in the steppe). My theory was based initially on Anthony’s account and TMRCA of haplogroups of modern populations (both ca. 4200-4000 BC), but recent samples of the Balkans (R1b-M269 and steppe ancestry) seem to trace the population expansion some centuries back.

If my assessment is correct, then modern populations of haplogroup R1b-M269* and R1b-L23* in the Balkans probably reflect that ancient expansion, and samples related to Proto-Anatolian cultures in the Balkans will most likely be of R1b-M269 subclades and R1b-L23*. After admixture in the Balkans, posterior migrations of Anatolian languages into Anatolia might be associated with a different admixture component and haplogroups, we don’t have enough data yet.

If the haplogroup reduction and expansion in Khvalynsk happened later than the Suvorovo-Novodanilovka expansion, then we might find the expansion of Pre- or Proto-Anatolian associated with many different haplogroups, such as R1b (xM269), R1a, I, J, or G2, and more or less associated with steppe ancestry in the Balkans.

Another reason for finding such variety of haplogroups in ancient samples from the Balkans would be that this Khvalynsk group of “chiefs” traversed – and mixed with – the Sredni Stog population. Nevertheless, if we suppose homogeneity in haplogroups in Khvalynsk during the expansion, a high proportion of different haplogroups explained by admixture with the local population of Sredni Stog would challenge the whole “chief domination” explanation by Anthony, and we would have to return to the “different culture” theory by Rassamakin and potentially an older migration from Khvalynsk. In any case, both researchers show clear links of the Suvorovo-Novodanilovka phenomenon to Khvalynsk, and a differentiation with the surrounding Sredni Stog culture.

A less likely model would support the identification of the whole Eneolithic Pontic-Caspian steppe as a loose Indo-Hittite-speaking community, which would be in my opinion too big a territory and too loose a cultural bond to justify such a long-lasting close linguistic connection. This will probably be the refuge of certain people looking desperately for R1a-IE connections. However, the nature of the western steppe will remain distinct from Late Proto-Indo-European, which must have developed in the Yamna culture, so autochthonous continuity is not on the table anymore, in any case…

suvorovo-novodanilovka-region
Coexistence of the Varna-Gumelniţa culture and the Suvorovo phase of the sceptre-bearer communities. 1 — Fălciu; 2 — Fundeni-Lungoţi; 3 — Novoselskaja; 4 — Suvorovo; 5 — Casimcea; 6 — Kjulevča; 7 — Reka Devnja; 8 — Drama; 9 — Gonova mogila; 10 — Reževo; 11 — geographically separate Decea variant of the sceptre bearer group (after Govedarica, Manzura 2011: Abb. 5, adapted).

2. How did R1a-M417 (and especially R1a-Z645) haplogroups came to dominate over the Corded Ware cultures?

If I am right (again, based on TMRCA of modern populations), then it is precisely at the time of the potential expansion of Proto-Corded Ware from the Dnieper-Dniester forest, forest-steppe, and steppe regions, ca 3300-3000. Furholt’s recent radiocarbon analysis and suggestions of a Lesser Poland origin of the third or A-horizon, on which disparate archaeologists such as Anthony or Klejn rely now, seem to suggest also that Corded Ware was a cultural complex rather than a compact culture reflecting a migration of peoples – similar thus to the Bell Beaker complex.

This cultural complex interpretation of Corded Ware contrasts with the quite homogeneous late samples we have, suggesting clear migration waves in northern Europe, at least at some point in time, so Genomics will be a great tool to ascertain when and from where approximately did Corded Ware peoples expand. Right now, it seems that Eneolithic Ukraine populations are the closest to its origin, so the traditional interpretation of its regional origin by Kristiansen or Anthony remains valid.

3. How was Indo-Iranian adopted by Corded Ware invaders?

This is rather an anthropological question. We need reasonable models of founder effect/cultural diffusion necessary for that to happen – similar to the ones necessary to explain the arrival of N1c subclades into north-east Europe, or the arrival of R1b subclades in Basque/Iberian-speaking regions in south-west Europe. My description of potential events in the eastern steppe – based partially on Anthony – is merely a short sketch. Genomic data is unlikely to offer more than it does today (replacement of haplogroups, and gradually of some steppe component, by late Corded Ware groups in the steppe), but let’s see what new samples can contribute.

As for what some Indians – and other people willing to confront them – are looking for, regarding R1a-M417 and/or Indo-European origins in India, I don’t see the point, we already know a) that the origin of the expansion is in the steppe and b) that Hindu nationalist biggots will not accept results from research that oppose their views. I don’t expect huge surprises there, just more fruitless discussions (fomented by those who live from trolling or conspiracies)…

4. Yamna settlers from Hungary

Anthony’s new theory – and the nature of Balto-Slavic – hinges on the presence of R1a-M417 subclades (associated with later Corded Ware samples) in Yamna settlers of Hungary, potentially originally from the North Pontic area, where the oldest sample has been found.

My ‘modified’ version of Anthony’s new model (the only I deem just remotely factible) includes the expansion of a Proto-Corded Ware from Lesser Poland, but (given the overwhelming R1b found in East Bell Beaker), with R1a-M417 being associated with the region. How to explain this language change with objective data? Well, we have Bell Beaker expanding to these areas at a later time, so we would need to find R1b-L23 settlers in Lesser Poland, and then a resurge of R1a-M417 haplogroup. If not, resorting yet again to cultural diffusion Yamna “patrons” to Corded Ware “clients” of Lesser Poland would bring us to square one, now with the ‘steppe ancestry’ controversy included…

Since some Eastern Europeans are (for no obvious reason whatsoever) putting their hopes on that IE-R1a-CWC association, let’s hope some samples of R1a-M417 in Yamna or Hungary give them a break, so that they can begin accepting something closer to mainstream anthropological models. We could then work from there a Yamna-> Bell Beaker / North-West Indo-European association truce, and from there keep accepting that no single haplogroup from Yamna settlers is linked with modern languages, cultures or ethnic groups.

yamna-region
localization of Central-European funerary monuments with elements of the Pit Grave culture (after Bátora 2006);

5. How and when was Balto-Slavic associated with haplogroup R1a?

If we accept the Southern or Graeco-Aryan nature of Balto-Slavic with influence from an absorbed North-West Indo-European dialect, “Temematic” (as Kortlandt does), then Indo-Slavonic adopted in the steppe from Potapovka by Sintashta and Poltavka populations divided ca. 2000 BC into Indo-Iranian (migrating to the east with Andronovo), and Balto-Slavic (migrating westward with the Srubna culture). History from there is not straightforward, and it should follow Srubna, Thraco-Cimmerian, or other late expansions from cultures of the steppe.

On the other hand, if it is a Northern dialect related closely to Germanic and Italo-Celtic (in a North-West Indo-European group), then its origin has to be found in the initial expansion of East Bell Beakers, and its development into either the Únětice culture (of Balkan and thus potentially “Southern IE” influence), or the Mierzanowice-Nitra culture (of Corded Ware and thus potentially Uralic influence), or maybe from both, given the intermediate substrate found in Germanic and Balto-Slavic.

It is my opinion that the association of Balto-Slavic with haplogroup R1a is quite early after the East Bell Beaker expansion, probably initially with the subclade typically associated with West Slavic, R1a-M458. I have not much data to support this (apart from the most common linguistic model), just modern haplogroup distribution maps and common TMRCA, and highly hypothetical archaeological-anthropological models. Genetics will hopefully bring more data.

Let’s see also what information on ancient haplogroups we can obtain from the Tollense valley (already showing a close cluster with modern West Slavic populations) and steppe regions.

6. How did Germanic, Celtic, and Italic expand?

Germanic is probably the most interesting one. Following the expansion of R1b-L51 subclades (especially R1b-U106) and steppe ancestry (a confounding factor, with the previous expansion of R1a-Z284 subclades) in Scandinavia is going to be fascinating. Anthropological models already point to a linguistic and archaeological expansion of Pre-Germanic with Bell Beaker peoples.

The expansion of Celtic seems to be associated with chiefdoms, untraceable today in terms of haplogroups, and it seems thus different from previous expansions. New studies might tell how that happened, if it was actually in successive ways, as proposed, or maybe we don’t have enough data yet to reach conclusions.

We don’t know either how Italic expanded into the Italian Peninsula, or whether Latin expanded with peoples from Italy, if at all, or it was mostly a cultural diffusion event, as it seems.

Regarding Etruscan, while I think it is a controversy initiated based on fantastic accounts, and ignited with few finds of Middle Eastern ancestry (that seem logical from the point of view of regional contacts), it will be important for Italian linguists and archaeologists, also to accept the most likely scenario.

As for Palaeo-Hispanic languages, while steppe ancestry is found quite reduced in R1b-L51 subclades (after so many different expansions and admixture events since the departure from the steppe), their distribution from the Chalcolithic onwards and the resurgence of native haplogroups may serve to ascertain which Pre-Roman tribes were associated with the oldest regions where these subclades dominated. For that aim, a closer look at the developments in Aquitania and other pre-Roman Vasconic- and Iberian-speaking regions may shed some light on how founder effects might develop to leave the native language intact (in a case similar to the adoption of Indo-Iranian by post-Corded Ware Sinthastha and Potapovka in the eastern Pontic-Caspian steppe).

NOTE: Although mostly unrelated, linguistic questions may also be somehow altered with a change of migration models. For example, our current Corded Ware Substrate Hypothesis – strongly contested by Kortlandt and others – implies that Uralic was potentially the language spoken by Eneolithic Ukraine / Proto-Corded Ware peoples, therefore early Uralic languages were spoken by Corded Ware peoples, as a substrate for Germanic and Balto-Slavic, and Balto-Slavic and Indo-Iranian. If an Indo-Hittite branch different from Late PIE is accepted for Eneolithic Ukraine (thus suggesting a millennia-long cultural-historical community in the steppe), then the model still stands (e.g. Ger. and BSl. *-mos/-mus, as stated by Kortlandt, would correspond to the oldest morphological IE layer). As you can read in the different versions of our model, the different possibilities for the common substrate are stated, and the most likely one selected. But the most likely a priori option sometimes turns out to be wrong…

NOTE 2: You can comment whatever you want here, but I opened a specific thread in our forum if you want serious comments on the model to stuck and be further discussed.

Featured images: from the book Interactions, changes and meanings. Essays in honour of Igor Manzura on the occasion of his 60th birthday. Țerna S., Govedarica B. (eds.). 2016. Kishinev: Stratum Plus.

See also:

Our monograph on North-West Indo-European (first draft) is out

I wrote yesterday about the recently updated Indo-European demic diffusion model.

Fernando López-Menchero and I have published our first draft on the North-West Indo-European proto-language. Our contribution concerns mainly phonetics, and namely two of its most controversial aspects: a common process of laryngeal loss and two series of velars for PIE.

There is also an updated linguistic model for the Corded Ware substrate hypothesis, which seeks to explain certain similarities between Germanic and Balto-Slavic, and between Balto-Slavic and Indo-Iranian, and potential isoglosses between the three.

Available links:

As you probably know, our interest is (and has been for the past 15 years or so, even before our common project) the reconstruction of a North-West Indo-European proto-language, the ancestor of Italo-Celtic, Germanic, and Balto-Slavic. At least since Krahe’s proposal of an Alteuropäische substrate to European hydronymy, some 70 years ago, Indo-Europeanists have been supporting an Old European branch of Proto-Indo-European.

Old_European_hydronymic_map_for_the_root__Sal-,__Salm-
Root *sal-, *salm in European river names. Krahe (1949). From Wikipedia.

However, dialectal divisions were tentative. Since Oettinger, some 30 years ago, we have a clearer picture of a group of closely related dialects, namely Italo-Celtic, Germanic, and Balto-Slavic. Although the nature of Balto-Slavic is somehow contended (for the few scholars who support an Indo-Slavonic group), the minimalist view holds that at least the substrate language of Baltic and Slavic, Holzer‘s Temematic, was part of the North-West Indo-European group.

A North-West Indo-European (NWIE) proto-language not only solved the controversial question of Pan-European IE hydronymy (clearly of Late Indo-European nature), but also – and more elegantly – the question on the origin of the many fragmentary languages attested in Western Europe, usually attributed to a “Pre-Celtic” or “Pre-Italic” nature depending on their surrounding languages (Venetic has even said to be related to Germanic…).

proto-indo-european-stages
Stages of Proto-Indo-European evolution. IU: Indo-Uralic; PU: Proto-Uralic; PAn: Pre-Anatolian; PToch: Pre-Tocharian; Fin-Ugr: Finno-Ugric. The period between Balkan IE and Proto-Greek could be divided in two periods: an older one, called Proto-Greek (close to the time when NWIE was spoken), probably including Macedonian, and spoken somewhere in the Balkans; and a more recent one, called Mello-Greek, coinciding with the classically reconstructed Proto-Greek, already spoken in the Greek peninsula (West 2007). Similarly, the period between Northern Indo-European and North-West Indo-European could be divided, after the split of Pre-Tocharian, into a North-West Indo-European proper, during the expansion of Yamna to the west, and an Old European period, coinciding with the formation and expansion of the East Bell Beaker group.

Described first mainly in terms of lexical isoglosses, the concept of a NWIE language was then gradually and strongly founded in common grammatical features, contributed to mainly by the German, North American, and Spanish schools (as you know, the British or French schools are quite divided on the nature of Proto-Indo-European itself…). Recent archaeological models pioneered by Harrison and Heyd (2007) showed how this might have happened, with Yamna migrants that evolved as the East Bell Beaker group, and their subsequent expansion into most of Europe.

Genetics is now clearly supporting such a closely related group, too.

yamna-bell-beaker
Yamna – East Bell Beaker migration 3000-2300 BC according to Heyd in Harrison and Heyd (2007).

The work of Prescott and Walderhaug (1995) on the Pre-Germanic homeland, and the more precise archaeological migration model developed by Prescott clearly established the advent of Bell Beakers in Scandinavia as the key factor for the development of a unitary Pre-Germanic language in Scandinavia during the Dagger Period of the Nordic Late Neolithic.

The nature of Únětice and Mierzanowice/Nitra cultures as of Bell Beaker absorption of preceding Corded Ware cultures made the identification of the Balto-Slavic homeland in the Lusatian culture as quite likely – and this is now being confirmed with the study of Bronze Age samples, like those of the Tollense battlefield, which cluster closely to West Slavic and East German samples.

At the time of Marija Gimbutas’ breakthrough model of the “kurgan peoples” a common dialect from this Old European branch was deemed to be ‘Northern European‘ (or ‘Germano-Balto-Slavic’), which greatly influenced her work, supporting an identification of different burial types as stemming from the same source. This model, rejected already some years after Gimbutas’ proposal, has sadly survived to this day because of tradition (due e.g. to the work and influence of Kristiansen, and to some extent Anthony), and for some years (until the advent of ancient DNA) because of the modern distribution of haplogroup R1a in Europe and its relation to the ancient distribution of the Corded Ware culture.

This traditional model of a ‘Corded Ware -> Bell Beaker expansion of NWIE’ which we also followed until recently, never fit well with the known migrations paths from Yamna (into Balkan Early Bronze Age cultures), with the geographic distribution of Old European hydronymy, or with the guesstimates for Late Indo-European and North-West Indo-European. This compelled us to support a break-up of the proto-language further back in time than warranted by models of language change, and it needed certain unlikely cultural diffusion events over huge areas (because no such migration from Yamna to northern Europe has been attested): along the steppe/forest-steppe zone first, for a diffusion from Yamna into Corded Ware cultures, and along the Danube or the Rhine later, for a diffusion of Corded Ware into Bell Beaker. These models were also based on the wrong interpretation of the first radiocarbon dates of Beakers – placing an origin of the Bell Beaker people in Iberia (which has been rejected in Archaeology, and now also in Genetics).

Such a ‘Germano-Balto-Slavic’ group faded in Linguistics long ago, with most Indo-Europeanists preferring to talk about late contacts (viz. Celto-Germanic or Italo-Germanic contacts), and for some there is – if any subgroup at all – a core West Indo-European or Italo-Celto-Germanic group, which may be supported by recent genetic research on Bell Beaker peoples, with the Beaker group of the Netherlands being the key. Our research on the potential language spoken by Corded Ware peoples – most likely related to Uralic, from an Indo-Uralic community from the Pontic-Caspian steppe – can elegantly explain the isoglosses that both European dialects share.

copper-age-late-bell-beaker
Diachronic map of Late Copper Age migrations including Classical Bell Beaker (east group) expansion from central Europe ca. 2600-2250 BC

Read also: Schleicher’s Fable in Proto-Indo-European – pitch and stress accent

Related:

The Tollense Valley battlefield: the North European ‘Trojan war’ that hints to western Balto-Slavic origins

bronze-age-tollense-battle

It was reported long ago that genetic studies were being made on remains of a surprisingly big battle that happened in the Tollense valley in north-eastern Germany, at the confluence between Nordic, Tumulus/Urnfield, and Proto-Lusatian/Lusatian territories, ca. 1200 BC.

At least 130 bodies and 5 horses have been identified from the bones found. Taking into account that this is a small percentage of the potential battlefield, around 750 bodies are expected to be buried in the riverbank, so an estimated 4,000-strong army fought there, accounting for one in five participants killed and left on the battlefield.

Tollense riverbank
The river Tollense near the village Weltzin in the district Demmin (Mecklenburg-Vorpommern, Germany). From Wikipedia

Body armour, shields, helmet, and corselet used may have needed training and specialised groups of warriors, with their organisation being a display of military force. According to Kristiansen , this battle is therefore unlike any other known conflict of this period north of the Alps – circumscribed to raids by small groups of young men –, and may have heralded a radical change in the north, from individual farmsteads and a low population density to heavily fortified settlements.

The Urnfield culture (ca. 1300-750 BC) is associated with the rise of a new warrior elite, and the formation of new farming settlements and their urnfields. In some areas there is continuity from Tumulus to Urnfield culture, with narrowing and concentration of settlements along the river valleys, but there is also wide-ranging migrations. These migrations are similar to those seen later in the La Tène culture. This period is also coincident with the time of the mythical battle of Troy, with the collapse of the Mycenaean civilisation, and with the raids of Sea People in Egypt, and the marauders of the Hittites.

bronze-age-tollense
Diachronic map of migrations in Europe ca. 1250-750 BC, with the site of the Tollense valley marked.

Chemical traces already suggested that warriors fighting in Tollense came from far away, with only a few showing values typical of the northern European plain. A recently published PhD dissertation, Addressing challenges of ancient DNA sequence data obtained with next generation methods, by Christian Sell (2017) has not confirmed this:

The majority of sampled individuals fall within the variation of contemporary northern central European samples (including Nordic Late Neolithic and Bronze Age and Únětice samples); however, there are also some outliers closer to Neolithic LBK and modern Basques, suggesting that central and western European cultures were still at that time closely interconnected, continuing thus the connections created during the Bell Beaker expansion a thousand years earlier. The genetic similarity of most samples to modern western Slavic populations (as well as Austrians and Scots) gives support to the origin of Balto-Slavic in Bronze Age north-central Europe, and more specifically in the Lusatian culture.

tollense-welzin
PCA of samples from Tollense Valley battlefield. Welzin samples cluster closely to East German and Polish samples.

The Indo-European demic diffusion model supports the origin of Pre-Balto-Slavic in north-central Europe, with Únětice and Mierzanowice/Nitra groups as its potential homeland, from a common North-West Indo-European parent language (expanded through East Bell Beaker). Proto-Lusatian is therefore the best candidate for its initial development, and Lusatian for its eastern expansion, before its separation into its two main dialects (or maybe three, if Baltic is to be divided in two branches).

In fact, scarce aDNA from late Urnfield populations from its north-eastern territories, in Saxony – near the Lusatian culture –, already show a mixture of lineages, which suggest genetic continuity with older cultures (or more likely a resurge) after the Bell Beaker expansions: R1a1a1b1a-Z282 lineage was found in Halberstadt (ca. 1085 BC), and of the eight males studied from the Lichtenstein cave (ca. 1000 BC), five were of haplogroup I2a2b-L38, two of haplogroup R1a1-M459, and one of haplogroup R1b-M343.

Regarding modern populations, the eastern and western peaks in R1a1a1b1a1-M458 lineages might support a west-east migration, as well as an east-west migration, and indeed both in different periods, which is expected to be found if Lusatian is linked to the initial eastward expansion of Balto-Slavic during the Bronze and Iron Ages, and later younger subclades are linked to the West Slavic expansion to the west during Antiquity.

R-M458_frequency_distribution
Map rendered in pseudocolours for R-M458 frequencies, data derived from Underhill et al. (2014). Positions of boundaries (NE,NW,C,etc) are approximate. Variation of N and S. Caucasus region of Russia rendered as stripes showing range of variation in the region. From Wikipedia.

Now, if this is so, then we have to accept that these territories of north-central Europe (between East Germany and Poland), occupied earlier by Corded Ware cultures, adopted Balto-Slavic only after the Bell Beaker expansion; therefore, models arguing for Balto-Slavic origins in east European late Corded Ware groups (or heir cultures), like Trzciniec, Chornoles, Bilozerska, or Milograd (see e.g. the article on Wikipedia) have to be rejected. We also know that Pre-Germanic could have only formed in the Nordic Late Neolithic, after the cultural unification of the Dagger Period, heraled by the arrival of Bell Beakers; and that Indo-Iranian was the language of the Sintashta-Petrovka culture, which had absorbed the previous (Yamna-related) Poltavka culture.

chalcolithic-bell-beaker-europe
Diachronic map of Late Copper Age migrations including Classical Bell Beaker (east group) expansion from central Europe ca. 2600-2250 BC

But, if Indo-European was only spoken at both ends of territories previously occupied by Corded Ware cultures – stretching from Scandinavia to the Urals, including the Baltic region… what language did Corded Ware peoples actually speak? The most likely one? Uralic, indeed.

Related: