Złota a GAC-CWC transitional group…but not the origin of Corded Ware peoples

koszyce-gac-zlota-cwc

Open access Unraveling ancestry, kinship, and violence in a Late Neolithic mass grave, by Schroeder et al. PNAS (2019).

Interesting excerpts of the paper and supplementary materials, about the Złota group variant of Globular Amphora (emphasis mine):

A special case is the so-called Złota group, which emerged around 2,900 BCE in the northern part of the Małopolska Upland and existed until 2,600-2,500 BCE. Originally defined as a separate archaeological “culture” (15), this group is mainly defined by the rather local introduction of a distinct form of burial in the area mentioned. Distinct Złota settlements have not yet been identified. Nonetheless, because of the character of its burial practices and material culture, which both retain many elements of the GAC and yet point forward to the Corded Ware tradition, and because of its geographical location, the Złota group has attracted significant archaeological attention (15, 16).

The Złota group buried their dead in a new, distinct type of funerary structure; so-called niche graves (also called catacomb graves). These structures featured an entrance shaft or pit and, below that, a more or less extensive niche, sometimes connected to the entrance area by a narrow corridor. Local limestone was used to seal off the entrance shaft and to pave the floor of the niche, on which the dead were usually placed along with grave goods. This specific and relatively sophisticated form of burial probably reflects contacts between the northern Małopolska Upland and the steppe and forest-steppe communities further to the east, who also buried their dead in a form of catacomb graves. Individual cases of the use of ochre and of deformation of skulls in Złota burials provide further indications of such a connection (15). At the same time, the Złota niche grave practice also retains central elements of the GAC funerary tradition, such as the frequent practice of multiple burials in one grave, often entailing redeposition and violation of the anatomical order of corpses, and thus differs from the catacomb grave customs found on the steppes which are strongly dominated by single graves. Nonetheless, at Złota group cemeteries single burial graves appear, and even in multiple burial graves the identity of each individual is increasingly emphasized, e.g. by careful deposition of the body and through the personal nature of grave goods (16).

globular-amphorae-corded-ware-zlota-amphorae
Correspondence analysis of amphorae from the Złota-graveyards reveals that there is no typological break between Globular Amphorae and Corded Ware Amphorae, including ‘Strichbündelamphorae’ (after Furholt 2008)

Just like its burial practices, the material culture and grave goods of the Złota group combine elements of the GAC, such as amber ornaments and central parts of the ceramic inventory, with elements also found in the Corded Ware tradition, such as copper ornaments, stone shaft-hole axes, bone and shell ornaments, and other stylistic features of the ceramic inventory. In particular, Złota group ceramic styles have been seen as a clear transitional phenomenon between classical GAC styles and the subsequent Corded Ware ceramics, probably playing a key role in the development of the typical cord decoration patterns that came to define the latter (17).

As briefly summarized above, the Złota group displays a distinct funerary tradition and combination of material culture traits, which give the clear impression of a cultural “transitional situation”. While the group also appears to have had long-distance contacts directed elsewhere (e.g. to Baden communities to the south), it is the combination of Globular Amphora traits, on the one hand, and traits found among late Yamnaya or Catacomb Grave groups to the east as well as the closely related Corded Ware groups that emerged around 2,800 BCE, on the other hand, that is such a striking feature of the Złota group and which makes it interesting when attempting to understand cultural and demographic dynamics in Central and Eastern Europe during the early 3rd millennium BCE.

catacomb-grave-ksiaznice
Catacomb grave no. 2a/06 from Książnice, Złota culture (acc. to Wilk 2013). Image from Włodarczak (2017)

Książnice (site 2, grave 3ZC), Świętokrzyskie province. This burial, a so-called niche grave of the Złota type (with a vertical entrance shaft and perpendicularly situated niche), was excavated in 2006 and contained the remains of 8 individuals, osteologically identified as three adult females and five children, positioned on limestone pavement in the niche part of the grave. Radiocarbon dating of the human remains indicates that the grave dates to 2900-2630 BCE, 95.4% probability (Dataset S1). The grave had an oval entrance shaft with a diameter of 60 cm and depth of 130 cm; the depth of the niche reached to 170 cm (both measured from the modern surface), and it also contained a few animal bones, a few flint artefacts and four ceramic vessels typical of the Złota group. Książnice is located in the western part of the Małopolska Upland, which only has a few Złota group sites but a stronger presence of other, contemporary groups (including variants of the Baden culture).

Wilczyce (site 90, grave 10), Świętokrzyskie province. A rescue excavation in 2001 uncovered a niche grave of the Złota type, which had a round entrance shaft measuring 90 cm in diameter. The grave was some 60-65 cm deep below the modern surface and the bottom of the niche was paved with thin limestone plates, on which remains of three individuals had been placed; two adults, one female and one male, and one child. Four ceramic vessels of Złota group type were deposited in the niche along with the bodies. Wilczyce is located in the Sandomierz Upland, an area with substantial presence of both the Globular Amphora culture and Złota group, as well as the Corded Ware culture from 2800 BCE.

zlota-gac-cwc
Genetic affinities of the Koszyce individuals and other GAC groups (here including Złota) analyzed in this study. (A) Principal component analysis of previously published and newly sequenced ancient individuals. Ancient genomes were projected onto modern reference populations, shown in gray. (B) Ancestry proportions based on supervised ADMIXTURE analysis (K = 3), specifying Western hunter-gatherers, Anatolian Neolithic farmers, and early Bronze Age steppe populations as ancestral source populations. LP, Late Paleolithic; M, Mesolithic; EN, Early Neolithic; MN, Middle Neolithic; LN, Late Neolithic; EBA, Early Bronze Age; PWC, Pitted Ware culture; TRB, Trichterbecherkultur/Funnelbeaker culture; LBK, Linearbandkeramik/Linear Pottery culture; GAC, Globular Amphora culture; Złota, Złota culture. Image modified to outline in red GAC and Złota groups.

To further investigate the ancestry of the Globular Amphora individuals, we performed a supervised ADMIXTURE (6) analysis, specifying typical western European hunter-gatherers (Loschbour), early Neolithic Anatolian farmers (Barcın), and early Bronze Age steppe populations (Yamnaya) as ancestral source populations (Fig. 2B). The results indicate that the Globular Amphora/Złota group individuals harbor ca. 30% western hunter-gatherer and 70% Neolithic farmer ancestry, but lack steppe ancestry. To formally test different admixture models and estimate mixture proportions, we then used qpAdm (7) and find that the Polish Globular Amphora/Złota group individuals can be modeled as a mix of western European hunter-gatherer (17%) and Anatolian Neolithic farmer (83%) ancestry (SI Appendix, Table S2), mirroring the results of previous studies.

zlota-steppe-ancestry-cwc
Table S2. qpADM results. The ancestry of most Globular Amphora/Złota group individuals
can be modelled as a two-way mixture of Mesolithic western hunter-gatherers (WHG), and early Anatolian Neolithic farmers (Barcın). The five individuals from Książnice (Złota group) show evidence for additional gene flow, most likely from an eastern source.

The lack of a direct genetic connection of Corded Ware peoples with the Złota group despite their common “steppe-like traits” – shared with Yamna – reveals, once more, how the few “Yamna-like” traits of Corded Ware do not support a direct connection with Indo-Europeans, and are the result of the expansion of the so-called steppe package all over Europe, and particularly among cultures closely related to the Khvalynsk expansion, and later under the influence of expanding Yamna peoples.

The results from Książnice may support that early Corded Ware peoples were in close contact with GAC peoples in Lesser Poland during the complex period of GAC-Trypillia-CWC interactions, and especially close to the Złota group at the beginning of the 3rd millennium BC. Nevertheless, patrilineal clans of Złota apparently correspond to Globular Amphorae populations, with the only male sample available yet being within haplogroup I2a-L801, prevalent in GAC.

NOTE. The ADMIXTURE of Złota samples in common with GAC samples (and in contrast with the shared Sredni Stog – Corded Ware “steppe ancestry”) makes the possibility of R1a-M417 popping up in the Złota group from now on highly unlikely. If it happened, that would complicate further the available picture of unusually diverse patrilineal clans found among Uralic speakers expanding with early Corded Ware groups, in contrast with the strict patrilineal and patrilocal culture of Indo-Europeans as found in Repin, Yamna and Bell Beakers.

Once again the traditional links between groups hypothesized by archaeologists – like Gimbutas and Kristiansen in this case – are wrong, as is the still fashionable trend in descriptive archaeology, of supporting 1) wide cultural relationships in spite of clear-cut inter-cultural differences (and intra-cultural uniformity kept over long distances by genetically-related groups), 2) peaceful interactions among groups based on few common traits, and 3) regional population continuities despite cultural change. These generalized ideas made some propose a steppe language shared between Pontic-Caspian groups, most of which have been proven to be radically different in culture and genetics.

gimbutas-kurgan-indo-european
The background shading indicates the tree migratory waves proposed by Marija Gimbutas, and personally checked by her in 1995. Image from Tassi et al. (2017).

Furthermore, paternal lines show once again marked bottlenecks in expanding Neolithic cultures, supporting their relevance to follow the ethnolinguistic identity of different cultural groups. The steppe- or EHG-related ancestry (if it is in fact from early Corded Ware peoples) in Książnice was thus probably, as in the case of Trypillia, in the form of exogamy with females of neighbouring groups:

The presence of unrelated females and related males in the grave is interesting because it suggests that the community at Koszyce was organized along patrilineal lines of descent, adding to the mounting evidence that this was the dominant form of social organization among Late Neolithic communities in Central Europe. Usually, patrilineal forms of social organization go hand in hand with female exogamy (i.e., the practice of women marrying outside their social group). Indeed, several studies (11, 12) have shown that patrilocal residence patterns and female exogamy prevailed in several parts of Central Europe during the Late Neolithic. (…) the high diversity of mtDNA lineages, combined with the presence of only a single Y chromosome lineage, is certainly consistent with a patrilocal residence system.

funnelbeaker-trypillia-corded-ware
Map of territorial ranges of Funnel Beaker Culture (and its settlement concentrations in Lesser Poland), local Tripolyan groups and Corded Ware Culture settlements (■) at the turn of the 4th/3rd millennia BC.

Since ancient and modern Uralians show predominantly Corded Ware ancestry, and Proto-Uralic must have been in close contact with Proto-Indo-European for a very long time – given the different layers of influence that can be distinguished between them -, it follows as logical consequence that the North Pontic forest-steppes (immediately to the west of the PIE homeland in the Don-Volga-Ural steppes) is the most likely candidate for the expansion of Proto-Uralic, accompanying the spread of Sredni Stog ancestry and a bottleneck under R1a-M417 lineages.

The early TMRCAs in the 4th millennium BC for R1a-M417 and R1a-Z645 support this interpretation, like the R1a-M417 sample found in Sredni Stog. On the other hand, the resurgence of typical GAC-like ancestry in late Corded Ware groups, with GAC lineages showing late TMRCAs in the 3rd millennium BC, proves the disintegration of Corded Ware all over Europe (except in Textile Ceramics- and Abashevo-related groups) as the culture lost its cohesion and different local patrilineal clans used the opportunity to seize power – similar to how eventually I2a-L621 infiltrated eastern (Finno-Ugrian) groups.

Related

Yekaterinovsky Cape, a link between the Samara culture and early Khvalynsk

ekaterinovsky-cape

We already had conflicting information about the elite individual from the Yekaterinovsky Cape and the materials of his grave, which seemed quite old:

For the burial of 45 in the laboratory of the University of Pennsylvania, a 14C date was obtained: PSUAMS-2880 (Sample ID 16068)> 30 kDa gelatin Russia. 12, Ekaterinovka Grave 45 14C age (BP) 6325 ± 25 δ 13C (‰) –23.6 δ15 N (‰) 14.5. The results of dating suggest chronological proximity with typologically close materials from Yasinovatsky and Nikolsky burial grounds (Telegini et al. 2001: 126). The date obtained also precedes the existing dates for the Khvalynsk culture (Morgunova 2009: 14–15), which, given the dominance of Mariupol traits of the burial rite and inventory, confirms its validity. However, the date obtained for human bones does not exclude the possibility of a “reservoir effect” when the age can increase three or more centuries (Shishlin et al. 2006: 135–140).

Now the same date is being confirmed by the latest study published on the site, by Korolev, Kochkina, and Stachenkov (2019) and it seems it is really going to be old. Abstract (in part the official one, in part newly translated for clarity):

For the first time, pottery of the Early Eneolithic burial ground Ekaterinovsky Cape is published. Ceramics were predominantly located on the sacrificial sites in the form of compact clusters of fragments. As a rule, such clusters were located above the burials, sometimes over the burials, some were sprinkled with ocher. The authors have identified more than 70 vessels, some of which have been partially reconstructed. Ceramic was made with inclusion of the crushed shell into molding mass. The rims of vessels had the thickened «collar»; the bottoms had a rounded shape. The ornament was located on the rims and the upper part of the potteries. Fully decorated vessels are rare. The vessels are ornamented with prints of comb and rope stamps, with small pits. A particularity of ceramics ornamentation is presented by the imprints of soft stamps (leather?) or traces of leather form for the making of vessels. The ornamentation, made up of «walking comb» and incised lines, was used rarely as well as the belts of pits made decoration under «collar» of a rim. Some features of the ceramics decoration under study relate it with ceramics of the Khvalynsk culture. The ceramics of Ekaterinovsky Cape burial ground is attributed by the authors to the Samara culture. The ceramic complex under study has proximity to the ceramics from Syezzhe burial ground and the ceramics of the second phase of Samara culture. The chronological position is determined by the authors as a later period than the ceramics from the Syezzhe burial ground, and earlier than the chronological position of ceramics of the Ivanovka stage of the Samara culture and the Khvalynsk culture.

ekaterinovsky-cape-pottery
Ceramics from Ekaterinovsky Cape burial ground. 1–2, 4–5, 7–11 – ceramics from aggregations; 3, 6 – ceramics from the cultural layer.

More specifically:

Based on ceramic fragments from a large vessel from a cluster of sq.m. 14, the date received was: SPb-2251–5673 ± 120 BP. The second date was obtained in fragments from the aggregation [see picture above] from the cluster of sq.m. 45–46: SPb-2252–6372 ± 100 BP. The difference in dating indicates that the process of determining the chronology of the burial ground is far from complete, although we note that the earlier date almost coincided with the date obtained from the human bone from individual 45 (Korolev, Kochkina, Stashenkov, 2018, p. 300).

Therefore, the ceramics of the burial ground Ekaterinovsky Cape possess an originality that determines the chronological position of the burial ground between the earliest materials of the burial type in Syezzhe and the Khvalynsk culture. Techno-typological features of dishes make it possible to attribute it to the Samara culture at the stage preceding the appearance of Ivanovska-Khvalynsk ceramics.

It seems that this site showed cultural influences from the upstream region near the Kama-Vyatka interfluve, too, according to Korolev, Kochkina, Stashenkov, and Khokhlov (2018):

In 2017, excavation of burial ground Ekaterinovsky Cape were continued, located in the area of the confl uence of the Bezenchuk River in the Volga River. During the new excavations, 14 burials were studied. The skeleton of the buried were in a position elongated on the back, less often – crooked on the back with knees bent at the knees. In one burial (No. 90), a special position of the skeleton was recorded. In the burial number 90 in the anatomical order, parts of the male skeleton. This gave grounds for the reconstruction of his original position in a semi-sitting position with the support of elbows on the bottom of the pit. Noteworthy inventory: on the pelvic bones on the left lay a bone spoon, near the right humerus, the pommel of a cruciform club was found. A conclusion is made about the high social status of the buried. The results of the analysis of the burial allow us to outline the closest circle of analogies in the materials of Khvalynsky I and Murzikhinsky burial grounds.

Important sites mentioned in both papers and in this text:

To sum up, it seems that the relative dates we have used until now have to be corrected: older Khvalynsk I Khvalynsk II individuals, supposedly dated ca. 5200-4000 BC (most likely after 4700 BC), and younger Yekaterinovsky individuals, supposedly of the fourth quarter of the 5th millennium (ca. 4250-4000 BC), are possibly to be considered, in fact, roughly reversed, if not chronologically, at least culturally speaking.

Interestingly, this gives a new perspective to the presence of a rare fish- or reptile-headed pommel-scepter, which would be natural in a variable period of expansion of the horse and horse-related symbolism, a cultural trait rooted in the Samara culture attested in Syezzhe before the unification of the symbol of power under the ubiquitous Khvalynsk-Suvorovo horse-headed scepters and related materials.

ekaterinovsky-cape-pommel-mace
Ekaterinovsky Cape Burial Ground. Inventory of the burial no 90: 1, 2 – stone pommel of the mace; 3, 4 – bone article.

The Khvalynsk chieftain

If the reported lineages from Yekaterinovsky Cape are within the R1b-P297 tree, but without further clades, as Yleaf comparisons may suggest, there is not much change to what we have, and R1b-M269 could actually represent a part of the local population, but also incomers from the south (e.g. the north Caspian steppe hunter-gatherers like Kairshak), the east (with hunter-gatherer pottery), or the west near the Don River (in contact with Mariupol-related cultures, as the authors inferred initially from material culture).

Just like R1a-M417 became incorporated into the Sredni Stog groups after the Novodanilovka-Suvorovo expansion, probably as incoming hunter-gatherer pottery groups from the north admixing with peoples of “Steppe ancestry”, R1b-M269 lineages might have expanded explosively only during the Repin expansion, and maybe (like R1b-L51 later) they formed just a tiny part of the clans that dominated the steppe during the Khvalynsk-Novodanilovka community.

On the other hand, the potential finding of various R1b-M269/L23 samples in Yekaterinovsky Cape (including an elite individual) would suggest now, as it was supported in the original report by Mathieson et al. (2015), that these ancient R1b lineages found in the Volga – Ural region are in fact most likely all R1b-M269 without enough coverage to obtain proper SNP calls, which would simplify the picture of Neolithic expansions (yet again). From the supplementary materials:

10122 / SVP35 (grave 12). Male (confirmed genetically), age 20-30, positioned on his back with raised knees, with 293 copper artifacts, mostly beads, amounting to 80% of the copper objects in the combined cemeteries of Khvalynsk I and II. Probably a high-status individual, his Y-chromosome haplotype, R1b1, also characterized the high-status individuals buried under kurgans in later Yamnaya graves in this region, so he could be regarded as a founder of an elite group of patrilineally related families. His MtDNA haplotype H2a1 is unique in the Samara series.

khvalynsk-cemetery
Khvalynsk cemetery and grave gifts. Grave 90 contained copper beads and rings, a harpoon, flint blades, and a bird-bone tube. Both graves (90 and 91) were partly covered by Sacrificial Deposit 4 with the bones from a horse, a sheep, and a cow. Center: grave goods from the Khvalynsk cemetery-copper rings and bracelets, polished stone mace heads, polished stone bracelet, Cardium shell ornaments, boars tusk chest ornaments, flint blades, and bifiacial projectile points. Bottom: shell-tempered pottery from the Khvalynsk cemetery. After Agapov, Vasiliev, and Pestrikova 1990; and Ryndina 1998, Figure 31. Modified from Anthony (2007).

This remarkable Khvalynsk chieftain, whose rich assemblage may correspond to the period of domination of the culture all over the Pontic-Caspian steppes, has been consistently reported as of hg. R1b-L754 in all publications, including Wang et al. (2018/2019) tentative SNP calls in the supplementary materials (obtained with Yleaf, as the infamous Narasimhan et al. 2018 samples), but has been variously reported by amateurs as within the R1b-M73, R1b-V88, or (lately) R1b-V1636 trees, which makes it unlikely that quality of the sample is allowing for a proper SNP call.

The fact that Mathieson et al. (2015) considered it a member of the R1b-M269 clans appearing later in Yamna seems on point right now, especially if samples from Yekaterinovka are all within this tree. The relevance of R1b-L23 in the expansion of Repin and Yamna is reminiscent of the influence of successful clans among Yamna offshoots, such as Bell Beakers, and among Bell Beaker offshoots during the Bronze Age all over Europe.

Taking these younger expansions as example, it seems quite likely based on cultural links that (at least part of) the main clans of Khvalynsk were of R1b-M269 lineage, stemming from a R1b-dominated Samara culture, in line with the known succeeding expansions and the expected strictly patriarcal and patrilineal society of Proto-Indo-Europeans, which would have exacerbated the usual reduction in Y-chromosome haplogroup variability that happens during population expansions, and the aversion towards foreign groups while the culture lasted.

pontic-steppe-neolithic
Cultures of the Pontic-Caspian steppes and forest-steppes and surrounding areas during the Neolithic.

The finding of R1b-L23 in Yekaterinovka, associated with the Samara culture, before or during the Khvalynsk expansion, and close to the Khvalynsk site, would make this Khvalynsk chieftain most likely a member of the M269 tree (paradoxically, the only R1b-L754 branch amateurs have not yet reported for it). Similarly, the sample of a “Samara hunter-gatherer” of Lebyazhinka, of hg. R1b-P297, could also be under this tree, just like most R1b-M269 from Yamna are downstream from R1b-L23, and most reported R1b-M269 or R1b-L23 from Bell Beakers are under R1b-L151.

On the other hand, we know of the shortcomings of attributing a haplogroup expansion to the best known rulers, such as the famous lineages previously wrongly attributed to Niall of the Nine Hostages or Genghis Khan. The known presence of R1b-V1636 up to modern Greeks would be in line with an ancient steppe expansion that we know will show up during the Neolithic, although it could also be a sign of a more recent migration from the Caucasus. The presence of a sister clade of R1b-L23, R1b-PF7562, among modern Balkan populations, may also be attributed to a pre-Yamna steppe expansion.

y-dna-khvalynsk
Y-DNA samples from Khvalynsk and neighbouring cultures. See full version here.

On SNP calls

I reckon that even informal reports on SNP calls, like any other analyses, should be offered in full: not only with a personal or automatic estimation of the result, but with a detailed explanation of the good, dubious, and bad calls, alternatives to that SNP estimation, and a motivated reasoning of why one branch should be preferred over others. Downloading a sample and giving an instruction using a free software tool is never enough, as it became crystal clear recently for the hilariously biased and flawed qpAdm reports on Dutch Bell Beakers as the ‘missing link’ between Corded Ware and Bell Beakers…

Another example I can recall is the report of a R1a-Z93 subclade in the R1a-M417 sample ca. 4000 BC from Alexandria, which seems rather unlikely, seeing how this subclade must have split and expanded explosively with R1a-Z645 to the east with eastern Corded Ware groups, i.e. 1,000 years later, just like Z282 lineages expanded mainly to the north-east. But then again, as with the Khvalynsk chieftain, I have only seen indirect reports of that supposed SNP (including Y26+!), so we should just stick with its officially reported R1a-M417 lineage. This upstream haplogroup was, in fact, repeated with Yleaf’s tentative estimates in Wang et al. (2019) supplementary materials…

The combination of inexperienced, biased, or simply careless design, analyses, and reports, including SNP calls and qpAdm analyses (whether in forums or publications), however well-intentioned (or not) they might be, are hindering a proper analysis of data, adding to the difficulties we already have due to the scarcity of samples, their limited coverage, and the lack of proper context.

Some people like to repeat ad nauseam that archaeology and/or linguistics are ‘not science’ whenever they don’t fit their beliefs and myths based on haplogroup and/or ancestry. But it’s becoming harder and harder to rely on certain genetic data, too, and on their infinite changing interpretations, much more than it is to rely on linguistic and archaeological research, including data, assessments, and discussions that are open for anyone to review…if one is truly interested in them.

Aquitanians and Iberians of haplogroup R1b are exactly like Indo-Iranians and Balto-Slavs of haplogroup R1a

eba-indo-iranian-balto-slavs

The final paper on Indo-Iranian peoples, by Narasimhan and Patterson (see preprint), is soon to be published, according to the first author’s Twitter account.

One of the interesting details of the development of Bronze Age Iberian ethnolinguistic landscape was the making of Proto-Iberian and Proto-Basque communities, which we already knew were going to show R1b-P312 lineages, a haplogroup clearly associated during the Bell Beaker period with expanding North-West Indo-Europeans:

From the Bronze Age (~2200–900 BCE), we increase the available dataset from 7 to 60 individuals and show how ancestry from the Pontic-Caspian steppe (Steppe ancestry) appeared throughout Iberia in this period, albeit with less impact in the south. The earliest evidence is in 14 individuals dated to ~2500–2000 BCE who coexisted with local people without Steppe ancestry. These groups lived in close proximity and admixed to form the Bronze Age population after 2000 BCE with ~40% ancestry from incoming groups. Y-chromosome turnover was even more pronounced, as the lineages common in Copper Age Iberia (I2, G2, and H) were almost completely replaced by one lineage, R1b-M269.

iberia-admixture-y-dna
Proportion of ancestry derived from central European Beaker/Bronze Age populations in Iberians from the Middle Neolithic to the Iron Age (table S15). Colors indicate the Y-chromosome haplogroup for each male. Red lines represent period of admixture. Modified from Olalde et al. (2019).

The arrival of East Bell Beakers speaking Indo-European languages involved, nevertheless, the survival of the two non-IE communities isolated from each other – likely stemming from south-western France and south-eastern Iberia – thanks to a long-lasting process of migration and admixture. There are some common misconceptions about ancient languages in Iberia which may have caused some wrong interpretations of the data in the paper and elsewhere:

NOTE. A simple reading of Iberian prehistory would be enough to correct these. Two recent books on this subject are Villar’s Indoeuropeos, iberos, vascos y otros parientes and Vascos, celtas e indoeuropeos. Genes y lenguas.

Iberian languages were spoken at least in the Mediterranean and the south (ca. “1/3 of Iberia“) during the Bronze Age.

Nope, we only know the approximate location of Iberian culture and inscriptions from the Late Iron Age, and they occupy the south-eastern and eastern coastal areas, but before that it is unclear where they were spoken. In fact, it seems evident now that the arrival of Urnfield groups from the north marks the arrival of Celtic-speaking peoples, as we can infer from the increase in Central European admixture, while the expansion of anthropomorphic stelae from the north-west must have marked the expansion of Lusitanian.

Vasconic was spoken in both sides of the Pyrenees, as it was in the Middle Ages.

Wrong. One of the worst mistakes I am seeing in many comments since the paper was published, although admittedly the paper goes around this problem talking about “Modern Basques”. Vasconic toponyms appear south of the Pyrenees only after the Roman conquests, and tribes of the south-western Pyrenees and Cantabrian regions were likely Celtic-speaking peoples. Aquitanians (north of the western Pyrenees) are the only known ancient Vasconic-speaking population in proto-historic times, ergo the arrival of Bell Beakers in Iberia was most likely accompanied by Indo-European languages which were later replaced by Celtic expanding from Central Europe, and Iberian expanding from south-east Iberia, and only later with Latin and Vasconic.

Ligurian is non-Indo-European, and Lusitanian is Celtic-like, so Iberia must have been mostly non-Indo-European-speaking.

The fragmentary material available on Ligurian is enough to show that phonetically it is a NWIE dialect of non-Celtic, non-Italic nature, much like Lusitanian; that is, unless you follow laryngeals up to Celtic or Italic, in which case you can argue anything about this or any other IE language, as people who reconstruct laryngeals for Baltic in the common era do.

EDIT (19 Mar 2019): It was not clear enough from this paragraph, because Ligurian-like languages in NE Iberia is just a hypothesis based on the archaeological connection of the whole southern France Bell Beaker region. My aim was to repeat the idea that Old European topo-hydronymy is older in NE Iberia (as almost anywhere in Iberia) than Iberian toponymy, so the initial hypothesis is that:

  1. a Palaeo-European language (as Villar puts it) expanded into most regions of Iberia in ancient times (he considered at some point the Mesolithic, but that is obviously wrong, as we know now); then
  2. Celts expanded at least to the Ebro River Basin; then
  3. Iberians expanded to the north and replaced these in NE Iberia; and only then
  4. after the Roman invasion, around the start of the Common Era, appear Vasconic toponyms south of the Pyrenees.

Lusitanian obviously does not qualify as Celtic, lacking the most essential traits that define Celticness…Unless you define “(Para-)Celtic” as Pre-Proto-Celtic-like, or anything of the sort to support some Atlantic continuity, in which case you can also argue that Pre-Italic or Pre-Germanic are Celtic, because you would be essentially describing North-West Indo-European

If Basques have R1b, it’s because of a culture of “matrilocality” as opposed to the “patrilocality” of Indo-Europeans

So wrong it hurts my eyes every time I read this. Not only does matrilocality in a regional group have few known effects in genetics, but there are many well-documented cases of population replacement (with either ancestry or Y-DNA haplogroups, or both) without language replacement, without a need to resort to “matrilineality” or “matrilocality” or any other cultural difference in any of these cases.

In fact, it seems quite likely now that isolated ancient peoples north of the Pyrenees will show a gradual replacement of surviving I2a lineages by neighbouring R1b, while early Iberian R1b-DF27 lineages are associated with Lusitanians, and later incoming R1b-DF27 lineages (apart from other haplogroups) are most likely associated with incoming Celts, which must have remained in north-central and central-east European groups.

NOTE. Notice how R1a is fully absent from all known early Indo-European peoples to date, whether Iberian IE, British IE, Italic, or Greek. The absence of R1a in Iberia after the arrival of Celts is even more telling of the origin of expanding Celts in Central Europe.

I haven’t had enough time to add Iberian samples to my spreadsheet, and hence neither to the ASoSaH texts nor maps/PCAs (and I don’t plan to, because it’s more efficient for me to add both, Asian and Iberian samples, at the same time), but luckily Maciamo has summed it up on Eupedia. Or, graphically depicted in the paper for the southeast:

iberia-haplogroups
Y chromosome haplogroup composition of individuals from southeast Iberia during the past 2000 years. The general Iberian Bronze and Iron Age population is included for comparison. Modified from Olalde et al. (2019).

Does this continued influx of Y-DNA haplogroups in Iberia with different cultures represent permanent changes in language? Are, therefore, modern Iberian languages derived from Lusitanian, Sorothaptic/Celtic, Greek, Phoenician, East or West Germanic, Hebrew, Berber, or Arabic languages? Obviously not. Same with Italy (see the recent preprint on modern Italians by Raveane et al. 2018), with France, with Germany, or with Greece.

If that happens in European regions with a known ancient history, why would the recent expansions and bottlenecks of R1b in modern Basques (or N1c around the Baltic, or R1a in Slavs) in the Middle Ages represent an ancestral language surviving into modern times?

Indo-Iranians

If something is clear from Narasimhan, Patterson, et al. (2018), is that we know finally the timing of the introduction and expansion of R1a-Z645 lineages among Indo-Iranians.

We could already propose since 2015 that a slow admixture happened in the steppes, based on archaeological finds, due to settlement elites dominating over common peoples, coupled with the known Uralic linguistic traits of Indo-Iranian (and known Indo-Iranian influence on Finno-Ugric) – as I did in the first version of the Indo-European demic diffusion model.

The new huge sampling of Sintashta – combined with that of Catacomb, Poltavka, Potapovka, Andronovo, and Srubna – shows quite clearly how this long-term admixture process between Uralic peoples and Indo-Iranians happened between forest-steppe CWC (mainly Abashevo) and steppe groups. The situation is not different from that of Iberia ca. 2500-2000 BC; from Narasimhan, Patterson, et al. (2018):

We combined the newly reported data from Kamennyi Ambar 5 with previously reported data from the Sintashta 5 individuals (10). We observed a main cluster of Sintashta individuals that was similar to Srubnaya, Potapovka, and Andronovo in being well modeled as a mixture of Yamnaya-related and Anatolian Neolithic (European agriculturalist-related) ancestry.

Even with such few words referring to one of the most important data in the paper about what happened in the steppes, Wang et al. (2018) help us understand what really happened with this simplistic concept of “steppe ancestry” regarding Yamna vs. Corded Ware differences:

anatolia-neolithic-steppe-eneolithic
Image modified from Wang et al. (2018). Marked are: in red, approximate limit of Anatolia_Neolithic ancestry found in Yamna populations; in blue, Corded Ware-related groups. “Modelling results for the Steppe and Caucasus 1128 cluster. Admixture proportions based on (temporally and geographically) distal and proximal models, showing additional Anatolian farmer-related ancestry in Steppe groups as well as additional gene flow from the south in some of the Steppe groups as well as the Caucasus groups (see also Supplementary Tables 10, 14 and 20).”

As with Iberia (or any prehistoric region), the details of how exactly this language change happened are not evident, but we only need a plausible explanation coupled with archaeology and linguistics. Poltavka, Potapovka, and Sintashta samples – like the few available Iberian ones ca. 2500-2000 BC – offer a good picture of the cohabitation of R1b-L23 (mainly Z2103) and R1a-Z645 (mainly Z93+): a glimpse at the likely presence of R1a-Z93 within settlements – which must have evolved as the dominant elites – in a society where the majority of the population was initially formed by nomad herders (probably most R1b-Z2103), who were usually buried outside of the main settlements.

Will the upcoming Narasimhan, Patterson et al. (2019) deal with this problem of how R1a-M417 replaced R1b-M269, and how the so-called “Steppe_MLBA” (i.e. Corded Ware) ancestry admixed with “Steppe_EMBA” (i.e. Yamnaya) ancestry in the steppes, and which one of their languages survived in the region (that is, the same the Reich Lab has done with Iberia)? Not likely. The ‘genetic wars’ in Iberia deal with haplogroup R1b-P312, and how it was neither ‘native’ nor associated with Basques and non-Indo-European peoples in general. The ‘genetic wars’ in South Asia are concerned with the steppe origin of R1a, to prove that it is not a ‘native’ haplogroup to India, and thus neither are Indo-Aryan languages. To each region a politically correct account of genetic finds, with enough care not to fully dismiss national myths, it seems.

NOTE. Funnily enough, these ‘genetic wars’ are the making of geneticists since the 1990s and 2000s, so we are still in the midst of mostly internal wars caused by what they write. Just as genetic papers of the 2020s will most likely be a reaction to what they are writing right now about “steppe ancestry” and R1a. You won’t find much change to the linguistic reconstruction in this whole period, except for the most multicolored glottochronological proposals…

The first author of the paper has engaged, as far as I could see in Twitter, in dialogue with Hindu nationalists who try to dismiss the arrival of steppe ancestry and R1a into South Asia as inconclusive (to support the potential origin of Sanskrit millennia ago in the Indus Valley Civilization). How can geneticists deal with the real problem here (the original ethnolinguistic group expanding with Corded Ware), when they have to fend off anti-steppists from Europe and Asia? How can they do it, when they themselves are part of the same societies that demand a politically correct presentation of data?

This is how the data on the most likely Indo-Iranian-speaking region should be presented in an ideal world, where – as in the Iberia paper – geneticists would look closely to the Volga-Ural region to discover what happened with Proto-Indo-Iranians from their earliest to their latest stage, instead of constantly looking for sites close to the Indus Valley to demonstrate who knows what about modern Indian culture:

indo-iranian-admixture-similar-iberians
Tentative map of the Late PIE and Indo-Iranian community in the Volga-Ural steppes since the Eneolithic. Proportion of ancestry derived from central European Corded Ware peoples. Colors indicate the Y-chromosome haplogroup for each male. Red lines represent period of admixture. Modified from Olalde et al. (2019).

Now try and tell Hindu nationalists that Sanskrit expanded from an Early Bronze Age steppe community of R1b-rich nomadic herders that spoke Pre-Indo-Iranian, which was dominated and eventually (genetically) mostly replaced by elite Uralic-speaking R1a peoples from the Russian forest, hence the known phonetic (and some morphological) traits that remained. Good luck with the Europhobic shitstorm ahead..

Balto-Slavic

Iberian cultures, already with a majority of R1b lineages, show a clear northward expansion over previously Urnfield-like groups of north-east Iberia and Mediterranean France (which we now know probably represent the migration of Celts from central Europe). Similarly, Eastern Balts already under a majority of R1a lineages expanded likely into the Baltic region at the same time as the outlier from Turlojiškė (ca. 1075 BC), which represents the first obvious contacts of central-east Europe with the Baltic.

Iberia shows a more recent influx of central and eastern Mediterranean peoples, one of which eventually succeeded in imposing their language in Western Europe: Romans were possibly associated mainly with R1b-U152, apart from many other lineages. Proto-Slavs probably expanded later than Celts, too, connected to the disintegration of the Lusatian culture, and they were at some point associated with R1a-M458 and R1a-Z280(xZ92) lineages, apart from others already found in Early Slavs.

pca-balto-slavs-tollense-valley
PCA of central-eastern European groups which may have formed the Balto-Slavic-speaking community derived from Bell Beaker, evident from the position ‘westwards’ of CWC in the PCA, and surrounding cultures. Left: Early Bronze Age. Right: Tollense Valley samples.

This parallel between Iberia and eastern Europe is no coincidence: as Europe entered the Bronze Age, chiefdom-based systems became common, and thus the connection of ancestry or haplogroups with ethnolinguistic groups became weaker.

What happened earlier (and who may represent the Pre-Balto-Slavic community) will be clearer when we have enough eastern European samples, but basically we will be able to depict this admixture of NWIE-speaking BBC-derived peoples with Uralic-speaking CWC-derived groups (since Uralic is known to have strongly influenced Balto-Slavic), similar to the admixture found in Indo-Iranians, more or less like this:

iberian-admixture-balto-slavic
Tentative map of the North-West Indo-European and Balto-Slavic community in central-eastern Europe since the East Bell Beaker expansion. Proportion of ancestry derived from Corded Ware peoples. Colors indicate the Y-chromosome haplogroup for each male. Red lines represent period of admixture. Modified from Olalde et al. (2019).

The Early Scythian period marked a still stronger chiefdom-based system which promoted the creation of alliances and federation-like groups, with an earlier representation of the system expanding from north-eastern Europe around the Baltic Sea, precisely during the spread of Akozino warrior-traders (in turn related to the Scythian influence in the forest-steppes), who are the most likely ancestors of most N1c-V29 lineages among modern Germanic, Balto-Slavic, and Volga-Finnic peoples.

Modern haplogroup+language = ancient ones?

It is not difficult to realize, then, that the complex modern genetic picture in Eastern Europe and around the Urals, and also in South Asia (like that of the Aegean or Anatolia) is similar to the Iron Age / medieval Iberian one, and that following modern R1a as an Indo-European marker just because some modern Indo-European-speaking groups showed it was always a flawed methodology; as flawed as following R1b for ancient Vasconic groups, or N1c for ancient Uralic groups.

Why people would argue that haplogroups mean continuity (e.g. R1b with Basques, N1c with Finns, R1a with Slavs, etc.) may be understood, if one lives still in the 2000s. Just like why one would argue that Corded Ware is Indo-European, because of Gimbutas’ huge influence since the 1960s with her myth of “Kurgan peoples”. Not many denied these haplogroup associations, because there was no reason to do it, and those who did usually aligned with a defense of descriptive archaeology.

However, it is a growing paradox that some people interested in genetics today would now, after the Iberian paper, need to:

  • accept that ancient Iberians and probably Aquitanians (each from different regions, and probably from different “Basque-Iberian dialects” in the Chalcolithic, if both were actually related) show eventually expansions with R1b-L23, the haplogroup most obviously associated with expanding Indo-Europeans;
  • acknowledge that modern Iberians have many different lineages derived from prehistoric or historic peoples (Celts, Phoenicians, Greeks, Romans, Jews, Goths, Berbers, Arabs), which have undergone different bottlenecks, the last ones during the Reconquista, but none of their languages have survived;
  • realize that a similar picture is to be found everywhere in central and western Europe since the first proto-historic records, with language replacement in spite of genetic continuity, such as the British Isles (and R1b-L21 continuity) after the arrival of Celts, Romans, Anglo-Saxons, Vikings, or Normans;
  • but, at the same time, continue blindly asserting that haplogroup R1a + “steppe ancestry” represent some kind of supernatural combination which must show continuity with their modern Indo-Iranian or Balto-Slavic language from time immemorial.
sintashta-y-dna
Replacement of R1b-L23 lineages during the Early Bronze Age in eastern Europe and in the Eurasian steppes: emergence of R1a in previous Yamnaya and Bell Beaker territories. Modified from EBA Y-DNA map.

Behave, pretty please

The ‘conservative’ message espoused by some geneticists and amateur genealogists here is basically as follows:

  • Let’s not rush to new theories that contradict the 2000s, lest some people get offended by granddaddy not being these pure whatever wherever as they believed, and let’s wait some 5, 10, or 20 years, as long as necessary – to see if some corner of the Yamna culture shows R1a, or some region in north-eastern Europe shows N1c, or some Atlantic Chalcolithic sample shows R1b – to challenge our preferred theories, if we actually need to challenge anything at all, because it hurts too much.
  • Just don’t let many of these genetic genealogists or academics of our time be unhappy, pretty please with sugar on top, and let them slowly adapt to reality with more and more pet theories to fit everything together (past theories + present data), so maybe when all of them are gone, within 50 or 70 years, society can smoothly begin to move on and propose something closer to reality, but always as politically correct as possible for the next generations.
  • For starters, let’s discuss now (yet again) that Bell Beakers may not have been Indo-European at all, despite showing (unlike Corded Ware) clearly Yamna male lineages and ancestry, because then Corded Ware and R1a could not have been Indo-European and that’s terrible, so maybe Bell Beakers are too brachycephalic to speak Indo-European or something, or they were stopped by the Fearsome Tisza River, or they are not pure Dutch Single Grave in The South hence not Indo-European, or whatever, and that’s why Iron Age Iberians or Etruscans show non-Indo-European languages. That’s not disrespectful to the history of certain peoples, of course not, but talking about the evident R1a-Uralic connection is, because this is The South, not The North, and respect works differently there.
  • Just don’t talk about how Slavs and Balts enter history more than 1,500 years later than Indo-European peoples in Western and Southern Europe, including Iberia, and assume a heroic continuity of Balts and Slavs as pure R1a ‘steppe-like’ peoples dominating over thousands of kms. in the Baltic, Fennoscandia, eastern Europe, and northern Asia for 5,000 years, with multiple Balto-Slavs-over-Balto-Slavs migrations, because these absolute units of Indo-European peoples were a trip and a half. They are the Asterix and Obelix of white Indo-European prehistory.
  • Perhaps in the meantime we can also invent some new glottochronological dialectal scheme that fits the expansion of Sredni Stog/Corded Ware with (Germano-?)Indo-Slavonic separated earlier than any other Late PIE dialect; and Finno-Volgaic later than any other Uralic dialect, in the Middle Ages, with N1c.
balto-slavic-pca
Genetic structure of the Balto-Slavic populations within a European context according to the three genetic systems, from Kushniarevich et al. (2015). Pure Balto-Slavs from…hmm…yeah this…ancient…region…or people…cluster…Whatever, very very steppe-like peoples, the True Indo-Europeans™, so close to Yamna…almost as close as Finno-Ugrians.

To sum up: Iberia, Italy, France, the British Isles, central Europe, the Balkans, the Aegean, or Anatolia, all these territories can have a complex history of periodic admixture and language replacement everywhere, but some peoples appearing later than all others in the historical record (viz. Basques or Slavs) apparently cannot, because that would be shameful for their national or ethnic myths, and these should be respected.

Ignorance of the own past as a blank canvas to be filled in with stupid ethnolinguistic continuity, turned into something valuable that should not be challenged. Ethnonationalist-like reasoning proper of the 19th century. How can our times be called ‘modern’ when this kind of magical thinking is still prevalent, even among supposedly well-educated people?

Related

Ahead of the (Indo-European – Uralic) game: in theory and in numbers

yamnaya-expansion-bell-beaker

There is a good reason for hope, for those who look for a happy ending to the revolution of population genomics that is quickly turning into an involution led by beliefs and personal interests. This blog is apparently one of the the most read sites on Indo-European peoples, if not the most read one, and now on Uralic peoples, too.

I’ve been checking the analytics of our sites, and judging by the numbers of the English blog, Indo-European.eu (without the other languages) is quickly turning into the most visited one from Academia Prisca‘s sites on Indo-European languages, beyond Indo-European.info (and its parent sites in other languages), which host many popular files for download.

If we take into account file downloads (like images or PDFs), and not only what Google Analytics can record, Indo-European.eu has not more users than all other websites of Academia Prisca, but at this pace it will soon reach half the total visits, possibly before the end of 2019.

Overall, we have evolved from some 10,000 users/year in 2006 to ~300,000 active users/year and >1,000,000 page+file views/year in 2018 (impossible to say exactly without spending too much time on this task). Nothing out of the ordinary, I guess, and obviously numbers are not a quality index, but rather a hint at increasing popularity of the subject and of our work.

NOTE. The mean reading time is ~2:40 m, which I guess fits the length of most posts, and most visitors read a mean of ~2+ pages before leaving, with increasing reader fidelity over time.

indo-european-eu-analytics
Number of active users of indo-european.eu, according to Google Analytics since before the start of the new blog. Notice the peaks corresponding to the posts below (except the last one, corresponding to the publication of A Song of Sheep and Horses).

The most read posts of 2018, now that we can compare those from the last quarter, are as follows:

  1. – The series on the Corded Ware-Uralic theory, with a marked increase in readers, especially with the last three posts:
    1. Finno-Permic and the expansion of N-L392/Siberian ancestry,
    2. “Siberian ancestry” and Ugric-Samoyedic expansions, and
    3. Haplogroups R1a and N in Finno-Ugric and Samoyedic
  2. Haplogroup is not language, but R1b-L23 expansion was associated with Proto-Indo-Europeans
  3. The history of the simplistic ‘haplogroup R1a — Indo-European’ association
  4. On the origin of haplogroup R1b-L51 in late Repin / early Yamna settlers
  5. On the origin and spread of haplogroup R1a-Z645 from eastern Europe
  6. The Caucasus a genetic and cultural barrier; Yamna dominated by R1b-M269; Yamna settlers in Hungary cluster with Yamna
  7. Something is very wrong with models based on the so-called ‘Yamnaya admixture’ – and archaeologists are catching up (II)
  8. Olalde et al. and Mathieson et al. (Nature 2018): R1b-L23 dominates Bell Beaker and Yamna, R1a-M417 resurges in East-Central Europe during the Bronze Age
  9. Early Indo-Iranian formed mainly by R1b-Z2103 and R1a-Z93, Corded Ware out of Late PIE-speaking migrations
  10. “Steppe ancestry” step by step: Khvalynsk, Sredni Stog, Repin, Yamna, Corded Ware

NOTE. Of course, the most recent posts are the most visited ones right now, but that’s because of the constant increase in the number of visitors.

I think it is obvious what the greatest interest of readers has been in the past two years. You can see the pattern by looking at the most popular posts of 2017, when the blog took off again:

  1. Germanic–Balto-Slavic and Satem (‘Indo-Slavonic’) dialect revisionism by amateur geneticists, or why R1a lineages *must* have spoken Proto-Indo-European
  2. The renewed ‘Kurgan model’ of Kristian Kristiansen and the Danish school: “The Indo-European Corded Ware Theory”
  3. The new “Indo-European Corded Ware Theory” of David Anthony
  4. Correlation does not mean causation: the damage of the ‘Yamnaya ancestral component’, and the ‘Future American’ hypothesis
  5. The Aryan migration debate, the Out of India models, and the modern “indigenous Indo-Aryan” sectarianism

The most likely reason for the radical increase in this blog’s readership is very simple, then: people want to know what is really happening with the research on ancestral Indo-Europeans and Uralians, and other blogs and forums are not keeping up with that demand, being content with repeating the same ideas again and again (R1a-CWC-IE, R1b-BBC-Vasconic, and N-Comb Ware-Uralic), despite the growing contradictions. As you can imagine, once you have seen the Yamna -> Bell Beaker migration model of North-West Indo-European, with Corded Ware obviously representing Uralic, you can’t unsee it.

The online bullying, personal attacks, and similar childish attempts to silence those who want to talk about this theory elsewhere (while fringe theories like R1a/CHG-OIT, R1b-Vasconic, or the Anatolian/Armenian-CHG hypotheses, to name just a few, are openly discussed) has had, as could be expected, the opposite effect to what was intended. I guess you can say this blog and our projects have profited from the first relevant Streisand effect of population genomics, big time.

If this trend continues this year (and other bloggers’ or forum users’ faith in miracles is not likely to change), I suppose that after the Yamna Hungary samples are published (with the expected results) this blog is going to be the most read in 2020 by a great margin… I can only infer that this tension is also helping raise the interest in (and politicization of) the question, hence probably the overall number of active users and their participation in other blogs and forums is going to increase everywhere in 2019, too, as this debate becomes more and more heated.

So, what I infer from the most popular posts and the numbers is that people want criticism and controversy, and if you want blood you’ve got it. Here it is, my latest addition to the successful series criticizing the “Corded Ware/R1a–Indo-European” pet theories, a post I wrote two-three months ago, slightly updated with the newest comedy, and a sure success for 2019 (already added to the static pages of the menu):

The “Indo-European Corded Ware theory” doesn’t hold water

This is how I feel when I see spikes in visits with more and more returning users linked to my controversial posts 😉

Are you not entertained?! Are you not entertained?! Is this not why you are here?!

ASoSaH Reread (II): Y-DNA haplogroups among Uralians (apart from R1a-M417)

corded-ware-yamna-ancestry

This is mainly a reread of from Book Two: A Game of Clans of the series A Song of Sheep and Horses: chapters iii.5. Early Indo-Europeans and Uralians, iv.3. Early Uralians, v.6. Late Uralians and vi.3. Disintegrating Uralians.

“Sredni Stog”

While the true source of R1a-M417 – the main haplogroup eventually associated with Corded Ware, and thus Uralic speakers – is still not known with precision, due to the lack of R1a-M198 in ancient samples, we already know that the Pontic-Caspian steppes were probably not it.

We have many samples from the north Pontic area since the Mesolithic compared to the Volga-Ural territory, and there is a clear prevalence of I2a-M223 lineages in the forest-steppe area, mixed with R1b-V88 (possibly a back-migration from south-eastern Europe).

R1a-M459 (xR1a-M198) lineages appear from the Mesolithic to the Chalcolithic scattered from the Baltic to the Caucasus, from the Dniester to Samara, in a situation similar to haplogroups Q1a-M25 and R1b-L754, which supports the idea that R1a, Q1a, and R1b expanded with ANE ancestry, possibly in different waves since the Epipalaeolithic, and formed the known ANE:EHG:WHG cline.

y-dna-khvalynsk
Y-DNA samples from Khvalynsk and neighbouring cultures. See full version.

The first confirmed R1a-M417 sample comes from Alexandria, roughly coinciding with the so-called steppe hiatus. Its emergence in the area of the previous “early Sredni Stog” groups (see the mess of the traditional interpretation of the north Pontic groups as “Sredni Stog”) and its later expansion with Corded Ware supports Kristiansen’s interpretation that Corded Ware emerged from the Dnieper-Dniester corridor, although samples from the area up to ca. 4000 BC, including the few Middle Eneolithic samples available, show continuity of hg. I2a-M223 and typical Ukraine Neolithic ancestry.

NOTE. The further subclade R1a-Z93 (Y26) reported for the sample from Alexandria seems too early, given the confidence interval for its formation (ca. 3500-2500 BC); even R1a-Z645 could be too early. Like the attribution of the R1b-L754 from Khvalynsk to R1b-V1636 (after being previously classifed as of Pre-V88 and M73 subclade), it seems reasonable to take these SNP calls with a pinch of salt: especially because Yleaf (designed to look for the furthest subclade possible) does not confirm for them any subclade beyond R1a-M417 and R1b-L754, respectively.

The sudden appearance of “steppe ancestry” in the region, with the high variability shown by Ukraine_Eneolithic samples, suggests that this is due to recent admixture of incoming foreign peoples (of Ukraine Neolithic / Comb Ware ancestry) with Novodanilovka settlers.

The most likely origin of this population, taking into account the most common population movements in the area since the Neolithic, is the infiltration of (mainly) hunter-gatherers from the forest areas. That would confirm the traditional interpretation of the origin of Uralic speakers in the forest zone, although the nature of Pontic-Caspian settlers as hunter-gatherers rather than herders make this identification today fully unnecessary (see here).

EDIT (3 FEB 2019): As for the most common guesstimates for Proto-Uralic, roughly coinciding with the expansion of this late Sredni Stog community (ca. 4000 BC), you can read the recent post by J. Pystynen in Freelance Reconstruction, Probing the roots of Samoyedic.

eneolithic-ukraine-corded-ware
Late Sredni Stog admixture shows variability proper of recent admixture of forest-steppe peoples with steppe-like population. See full version here.

NOTE. Although my initial simplistic interpretation (of early 2017) of Comb Ware peoples – traditionally identified as Uralic speakers – potentially showing steppe ancestry was probably wrong, it seems that peoples from the forest zone – related to Comb Ware or neighbouring groups like Lublyn-Volhynia – reached forest-steppe areas to the south and eventually expanded steppe ancestry into east-central Europe through the Volhynian Upland to the Polish Upland, during the late Trypillian disintegration (see a full account of the complex interactions of the Final Eneolithic).

The most interesting aspect of ascertaining the origin of R1a-M417, given its prevalence among Uralic speakers, is to precisely locate the origin of contacts between Late Proto-Indo-European and Proto-Uralic. Traditionally considered as the consequence of contacts between Middle and Upper Volga regions, the most recent archaeological research and data from ancient DNA samples has made it clear that it is Corded Ware the most likely vector of expansion of Uralic languages, hence these contacts of Indo-Europeans of the Volga-Ural region with Uralians have to be looked for in neighbours of the north Pontic area.

sredni-stog-repin-contacts
Sredni Stog – Repin contacts representing Uralic – Late Indo-European contacts were probably concentrated around the Don River.

My bet – rather obvious today – is that the Don River area is the source of the earliest borrowings of Late Uralic from Late Indo-European (i.e. post-Indo-Anatolian). The borrowing of the Late PIE word for ‘horse’ is particularly interesting in this regard. Later contacts (after the loss of the initial laryngeal) may be attributed to the traditionally depicted Corded Ware – Yamna contact zone in the Dnieper-Dniester area.

NOTE. While the finding of R1a-M417 populations neighbouring R1b-L23 in the Don-Volga interfluve would be great to confirm these contacts, I don’t know if the current pace of more and more published samples will continue. The information we have right now, in my opinion, suffices to support close contacts of neighbouring Indo-Europeans and Uralians in the Pontic-Caspian area during the Late Eneolithic.

Classical Corded Ware

After some complex movements of TRB, late Trypillia and GAC peoples, Corded Ware apparently emerged in central-east Europe, under the influence of different cultures and from a population that probably (at least partially) stemmed from the north Pontic forest-steppe area.

Single Grave and central Corded Ware groups – showing some of the earliest available dates (emerging likely ca. 3000/2900 BC) – are as varied in their haplogroups as it is expected from a sink (which does not in the least resemble the Volga-Ural population):

Interesting is the presence of R1b-L754 in Obłaczkowo, potentially of R1b-V88 subclade, as previously found in two Central European individuals from Blätterhole MN (ca. 3650 and 3200 BC), and in the Iron Gates and north Pontic areas.

Haplogroups I2a and G have also been reported in early samples, all potentially related to the supposed Corded Ware central-east European homeland, likely in southern Poland, a region naturally connected to the north Pontic forest-steppe area and to the expansion of Neolithic groups.

corded-ware-haplogroups
Y-DNA samples from early Corded Ware groups and neighbouring cultures. See full version.

The true bottlenecks under haplogroup R1a-Z645 seem to have happened only during the migration of Corded Ware to the east: to the north into the Battle Axe culture, mainly under R1a-Z282, and to the south into Middle Dnieper – Fatyanovo-Balanovo – Abashevo, probably eventually under R1a-Z93.

This separation is in line with their reported TMRCA, and supports the split of Finno-Permic from an eastern Uralic group (Ugric and Samoyedic), although still in contact through the Russian forest zone to allow for the spread of Indo-Iranian loans.

This bottleneck also supports in archaeology the expansion of a sort of unifying “Corded Ware A-horizon” spreading with people (disputed by Furholt), the disintegrating Uralians, and thus a source of further loanwords shared by all surviving Uralic languages.

Confirming this ‘concentrated’ Uralic expansion to the east is the presence of R1a-M417 (xR1a-Z645) lineages among early and late Single Grave groups in the west – which essentially disappeared after the Bell Beaker expansion – , as well as the presence of these subclades in modern Central and Western Europeans. Central European groups became thus integrated in post-Bell Beaker European EBA cultures, and their Uralic dialect likely disappeared without a trace.

NOTE. The fate of R1b-L51 lineages – linked to North-West Indo-Europeans undergoing a bottleneck in the Yamna Hungary -> Bell Beaker migration to the west – is thus similar to haplogroup R1a-Z645 – linked to the expansion of Late Uralians to the east – , hence proving the traditional interpretation of the language expansions as male-driven migrations. These are two of the most interesting genetic data we have to date to confirm previous language expansions and dialectal classifications.

It will be also interesting to see if known GAC and Corded Ware I2a-Y6098 subclades formed eventually part of the ancient Uralic groups in the east, apart from lineages which will no doubt appear among asbestos ware groups and probably hunter-gatherers from north-eastern Europe (see the recent study by Tambets et al. 2018).

Corded Ware ancestry marked the expansion of Uralians

Sadly, some brilliant minds decided in 2015 that the so-called “Yamnaya ancestry” (now more appropriately called “steppe ancestry”) should be associated to ‘Indo-Europeans’. This is causing the development of various new pet theories on the go, as more and more data contradicts this interpretation.

There is a clear long-lasting cultural, populational, and natural barrier between Yamna and Corded Ware: they are derived from different ancestral populations, which show clearly different ancestry and ancestry evolution (although they did converge to some extent), as well as different Y-DNA bottlenecks; they show different cultures, including those of preceding and succeeding groups, and evolved in different ecological niches. The only true steppe pastoralists who managed to dominate over grasslands extending from the Upper Danube to the Altai were Yamna peoples and their cultural successors.

corded-ware-yamna-pca
Corded Ware admixture proper of expanding late Sredni Stog-like populations from the forest-steppe. See full version here.

NOTE. You can also read two recent posts by FrankN in the blog aDNA era, with detailed information on the Pontic-Caspian cultures and the formation of “steppe ancestry” during the Palaeolithic, Mesolithic and Neolithic: How did CHG get into Steppe_EMBA? Part 1: LGM to Early Holocene and How did CHG get into Steppe_EMBA? Part 2: The Pottery Neolithic. Unlike your typical amateur blogger on genetics using few statistical comparisons coupled with ‘archaeolinguoracial mumbo jumbo’ to reach unscientific conclusions, these are obviously carefully redacted texts which deserve to be read.

I will not enter into the discussion of “steppe ancestry” and the mythical “Siberian ancestry” for this post, though. I will just repost the opinion of Volker Heyd – an archaeologist specialized in Yamna Hungary and Bell Beakers who is working with actual geneticists – on the early conclusions based on “steppe ancestry”:

[A]rchaeologist Volker Heyd at the University of Bristol, UK, disagreed, not with the conclusion that people moved west from the steppe, but with how their genetic signatures were conflated with complex cultural expressions. Corded Ware and Yamnaya burials are more different than they are similar, and there is evidence of cultural exchange, at least, between the Russian steppe and regions west that predate Yamnaya culture, he says. None of these facts negates the conclusions of the genetics papers, but they underscore the insufficiency of the articles in addressing the questions that archaeologists are interested in, he argued. “While I have no doubt they are basically right, it is the complexity of the past that is not reflected,” Heyd wrote, before issuing a call to arms. “Instead of letting geneticists determine the agenda and set the message, we should teach them about complexity in past human actions.

Related

A very “Yamnaya-like” East Bell Beaker from France, probably R1b-L151

bell-beaker-expansion

Interesting report by Bernard Sécher on Anthrogenica, about the Ph.D. thesis of Samantha Brunel from Institut Jacques Monod, Paris, Paléogénomique des dynamiques des populations humaines sur le territoire Français entre 7000 et 2000 (2018).

NOTE. You can visit Bernard Sécher’s blog on genetic genealogy.

A summary from user Jool, who was there, translated into English by Sécher (slight changes to translation, and emphasis mine):

They have a good hundred samples from the North, Alsace and the Mediterranean coast, from the Mesolithic to the Iron Age.

There is no major surprise compared to the rest of Europe. On the PCA plot, the Mesolithic are with the WHG, the early Neolithics with the first farmers close to the Anatolians. Then there is a small resurgence of hunter-gatherers that moves the Middle Neolithics a little closer to the WHGs.

From the Bronze Age, they have 5 samples with autosomal DNA, all in Bell Beaker archaeological context, which are very spread on the PCA. A sample very high, close to the Yamnaya, a little above the Corded Ware, two samples right in the Central European Bell Beakers, a fairly low just above the Neolithic package, and one last full in the package. The most salient point was that the Y chromosomes of their 12 Bronze Age samples (all Bell Beakers) are all R1b, whereas there was no R1b in the Neolithic samples.

Finally they have samples of the Iron Age that are collected on the PCA plot close to the Bronze Age samples. They could not determine if there is continuity with the Bronze Age, or a partial replacement by a genetically close population.

PCA-caucasus-yamna
Image modified from Wang et al. (2018). Samples projected in PCA of 84 modern-day West Eurasian populations (open symbols). Previously known clusters have been marked and referenced. Marked and labelled are interesting samples; In red, likely position of late Yamna Hungary / early East Bell Beakers An EHG and a Caucasus ‘clouds’ have been drawn, leaving Pontic-Caspian steppe and derived groups between them. See the original file here. To understand the drawn potential Caucasus Mesolithic cluster, see above the PCA from Lazaridis et al. (2018).

The sample with likely high “steppe ancestry“, clustering closely to Yamna (more than Corded Ware samples) is then probably an early East Bell Beaker individual, probably from Alsace, or maybe close to the Rhine Delta in the north, rather than from the south, since we already have samples from southern France from Olalde et al. (2018) with high Neolithic ancestry, and samples from the Rhine with elevated steppe ancestry, but not that much.

This specific sample, if confirmed as one of those reported as R1b (then likely R1b-L151), as it seems from the wording of the summary, is key because it would finally link Yamna to East Bell Beaker through Yamna Hungary, all of them very “Yamnaya-like”, and therefore R1b-L151 (hence also R1b-L51) directly to the steppe, and not only to the Carpathian Basin (that is, until we have samples from late Repin or West Yamna…)

NOTE. The only alternative explanation for such elevated steppe ancestry would be an admixture between a ‘less Yamnaya-like’ East Bell Beaker + a Central European Corded Ware sample like the Esperstedt outlier + drift, but I don’t think that alternative is the best explanation of its position in the PCA closer to Yamna in any of the infinite parallel universes, so… Also, the sample from Esperstedt is clearly a late outlier likely influenced by Yamna vanguard settlers from Hungary, not the other way round…

Unexpectedly, then, fully Yamnaya-like individuals are found not only in Yamna Hungary ca. 3000-2500 BC, but also among expanding East Bell Beakers later than 2500 BC. This leaves us with unexplained, not-at-all-Yamnaya-like early Corded Ware samples from ca. 2900 BC on. An explanation based on admixture with locals seems unlikely, seeing how Corded Ware peoples continue a north Pontic cluster, being thus different from Yamna and their ancestors since the Neolithic; and how they remained that way for a long time, up to Sintashta, Srubna, Andronovo, and even later samples… A different, non-Indo-European community it is, then.

olalde_pca2
Image modified from Olalde et al. (2018). PCA of 999 Eurasian individuals. Marked is the Espersted Outlier with the approximate position of Yamna Hungary, probably the source of its admixture. Different Bell Beaker clines have been drawn, to represent approximate source of expansions from Central European sources into the different regions. In red, likely zone of Yamna Hungary and reported early East Bell Beaker individual from France.

Let’s wait and see the Ph.D. thesis, when it’s published, and keep observing in the meantime the absurd reactions of denial, anger, bargaining, and depression (stages of grief) among BBC/R1b=Vasconic and CWC/R1a=Indo-European fans, as if they had lost something (?). Maybe one of these reactions is actually the key to changing reality and going back to the 2000s, who knows…

Featured image: initial expansion of the East Bell Beaker Group, by Volker Heyd (2013).

Related

Genetic landscape and past admixture of modern Slovenians

slovenes-snp

Open access Genetic Landscape of Slovenians: Past Admixture and Natural Selection Pattern, by Maisano Delser et al. Front. Genet. (2018).

Interesting excerpts (emphasis mine):

Samples

Overall, 96 samples ranging from Slovenian littoral to Lower Styria were genotyped for 713,599 markers using the OmniExpress 24-V1 BeadChips (Figure 1), genetic data were obtained from Esko et al. (2013). After removing related individuals, 92 samples were left. The Slovenian dataset has been subsequently merged with the Human Origin dataset (Lazaridis et al., 2016) for a total of 2163 individuals.

Y chromosome

First, Y chromosome genetic diversity was assessed. A total of 52 Y chromosomes were analyzed for 195 SNPs. The majority of individuals (25, 48.1%) belong to the haplogroup R1a1a1a (R-M417) while the second major haplogroup is represented by R1b (R-M343) including 15 individuals (28.8%). Twelve samples are assigned to haplogroup I (I M170): five and two samples belong to haplogroup I2a (I L460) and I1 (I M253), respectively, while the remaining five samples did not have enough information to be further assigned.

pca-slovenes
PCA of Slovenian samples with European populations (Slovenian_HO_EU dataset). For details regarding the populations included, see Supplementary Table 1.

PCA

Considering the unbalanced sample size of the Slovenian population compared to the other populations included in the dataset, a subset of 20 Slovenian individuals randomly sampled was used.

All Slovenian samples group together with Hungarians, Czechs, and some Croatians (“Central-Eastern European” cluster) as also suggested by the PCA. All Basque individuals with few French and Spanish cluster together (“Basque” cluster) while a “Northern-European” cluster is made of the majority of French, English, Icelanders, Norwegians, and Orcadians. Five populations contributed to the “Eastern-European” cluster including Belarusians, Estonians, Lithuanians, Mordovians, and Russians. Western and South Europe is split into two cluster: the first (“Western European” cluster) includes all Spanish individuals, few French, and some Italians (North Italy) while the second (“Southern-European” cluster) groups Sicilians, Greeks, some Croatians, Romanians, and some Italians (North Italy).

Admixture Pattern and Migration

admixture-slovenians
Modified image, from the paper (Central-East Europeans marked). Unsupervised admixture analysis of Slovenians. Results for K = 5 are showed as it represents the lowest cross-validation error. Slovenian samples show an admixture pattern similar to the neighboring populations such as Croatians and Hungarians. The major ancestral components are: the blue one which is shared with Lithuanians and Russians, followed by the dark green one that is mostly present in Greek samples and the light blue which characterizes Orcadians and English. For population acronyms see Supplementary Table 1.

All Slovenian individuals share common pattern of genetic ancestry, as revealed by ADMIXTURE analysis. The three major ancestry components are the North East and North West European ones (light blue and dark blue, respectively, Figure 3), followed by a South European one (dark green, Figure 3). Contribution from the Sardinians and Basque are present in negligible amount. The admixture pattern of Slovenians mimics the one suggested by the neighboring Eastern European populations, but it is different from the pattern suggested by North Italian populations even though they are geographically close.

Using ALDER, the most significant admixture event was obtained with Russians and Sardinians as source populations and it happened 135 ± 9.31 generations ago (Z-score = 11.54). (…) When tested for multiple admixture events (MALDER), we obtained evidence for one admixture event 165.391 ± 17.1918 generations ago corresponding to ∼2620 BCE (CI: 3101–2139) considering a generation time of 28 years (Figure 4), with Kalmyk and Sardinians as sources.

We then modeled the Slovenian population as target of admixture of ancient individuals from Haak et al. (2015) while computing the f3(Ancient 1, Ancient 2, Slovenian) statistic. The most significant signal was obtained with Yamnaya and HungaryGamba_EN (Z-score = -10.66), followed by MA1 with LBK_EN (Z-score -9.7) and Yamnaya with Stuttgart (Z-score = -8.6) used as possible source populations (Supplementary Figure 5).

We found a significant signal of admixture by using both pairs as ancient sources. Specifically, for the pair Yamnaya and Hungary_EN the admixture event is dated at 134.38 ± 23.69 generations ago (Z-score = 5.26, p-value of 1.5e-07) while for Yamnaya and LBK_EN at 153.65 ± 22.19 generations ago (Z-score = 6.92, p-value 4.4e-12). Outgroup f3 with Yamnaya put Slovenian population close to Hungarians, Czechs, and English, indicating a similar shared drift between these population with the Steppe populations (Supplementary Figure 6).

admixture-events-slovenes
Admixture events identified with ALDER and MALDER. The gray dots represent significant admixture events detected with ALDER using Slovenians as target, the solid line represents the single admixture event detected using MALDER, dashed lines represent the confidence interval. Only the significant results after multiple testing correction are plotted. For ALDER results see Supplementary Table 5.

Not that any of this would come as a surprise, but:

  • R1a-M458 and some R1a-Z280 (xR1a-Z92) lineages (found among Slovenes) were associated with the Slavic expansion, likely with the Prague-Korchak culture, originally stemming probably from peoples of the Lusatian culture. Other R1a-Z280 lineages remained associated with Uralic peoples, and some became Slavicized only recently.
  • PCA keeps supporting the common cluster of certain West, South, and East Slavs in a “Central-Eastern European” cluster, distinct from the “North-Eastern European” cluster formed by modern Finno-Ugrians, as well as ancient Finno-Ugrians of north-eastern Europe who were only recently Slavicized.
  • Admixture supports the same ancient ‘western’ (a core West+South+East Slavic) cluster, and the admixture event with Yamna + Hungary_EN is logically a proxy for Yamna Hungary being at the core of ancestral Central-East population movements related to Bell Beakers in the mid- to late 3rd millennium.

The theory that East Slavs are at the core of the Slavic expansion makes no sense, in terms of archaeology (see Florin Curta’s dismissal of those recent eastern ‘Slavic’ finds, his commentary on 19th century Pan-Slavic crap, or his book on Slavic migrations), in terms of ancient DNA (the earliest Slavs sampled cluster with modern West Slavs, distant from the steppe cluster, unlike Finno-Ugrians), or in terms of modern DNA.

I don’t know where exactly this impulse for the theory of Russia being the cradle of Slavs comes from today (although there are some obvious political trends to revive 19th c. ideas), but it was always clear for everyone, including Russians, that East Slavs had migrated to the east and north and assimilated indigenous Finno-Ugrians, apart from Turkic-, Iranian-, and Caucasian-speaking peoples to the east. Genetics is only confirming what was clear from other disciplines long ago.

Related

Corded Ware—Uralic (IV): Hg R1a and N in Finno-Ugric and Samoyedic expansions

haplogroup-uralians

This is the fourth of four posts on the Corded Ware—Uralic identification:

Let me begin this final post on the Corded Ware—Uralic connection with an assertion that should be obvious to everyone involved in ethnolinguistic identification of prehistoric populations but, for one reason or another, is usually forgotten. In the words of David Reich, in Who We Are and How We Got Here (2018):

Human history is full of dead ends, and we should not expect the people who lived in any one place in the past to be the direct ancestors of those who live there today.

Haplogroup N

Another recurrent argument – apart from “Siberian ancestry” – for the location of the Uralic homeland is “haplogroup N”. This is as serious as saying “haplogroup R1” to refer to Indo-European migrations, but let’s explore this possibility anyway:

Ancient haplogroups

We have now a better idea of how many ancient migrations (previously hypothesized to be associated with westward Uralic migrations) look like in genetic terms. From Damgaard et al. (Science 2018):

These serial changes in the Baikal populations are reflected in Y-chromosome lineages (Fig. SA; figs. S24 to S27, and tables S13 and SI4). MAI carries the R haplogroup, whereas the majority of Baikal_EN males belong to N lineages, which were widely distributed across Northern Eurasia (29), and the Baikal_LNBA males all carry Q haplogroups, as do most of the Okunevo_EMBA as well as some present-day Central Asians and Siberians.

The only N1c1 sample comes from Ust’Ida Late Neolithic, 180km to the north of Lake Baikal, which – together with the Bronze Age sample from the Kola peninsula, and the medieval sample from Ust’Ida – gives a good idea of the overall expansion of N subclades and Siberian ancestry among the Circum-Arctic peoples of Eurasia, speakers of Palaeo-Siberian languages.

eurasian-n-subclades
Geographical location of ancient samples belonging to major clade N of the Y-chromosome.

Modern haplogroups

What we should expect from Uralic peoples expanding with haplogroup N – seeing how Yamna expands with R1b-L23, and Corded Ware expands with R1a-Z645 – is to find a common subclade spreading with Uralic populations. Let’s see if it works like that for any N-X subclade, in data from Ilumäe et al. (2016):

haplogroup_n1
Geographic-Distribution Map of hg N3 / N1c / N1a.

Within the Eurasian circum-Arctic spread zone, N3 and N2a reveal a well-structured spread pattern where individual sub-clades show very different distributions:

N1a1-M46 (or N-TAT), formed ca. 13900 BC, TMRCA 9800 BC

   N1a1a2-B187, formed ca. 9800 BC, TMRCA 1050 AD:

The sub-clade N3b-B187 is specific to southern Siberia and Mongolia, whereas N3a-L708 is spread widely in other regions of northern Eurasia.

     N1a1a1a-L708, formed ca. 6800 BC, TMRCA 5400 BC.

       N1a1a1a2-B211/Y9022, formed ca. 5400 BC, TMRCA 1900 BC:

The deepest clade within N3a is N3a1-B211, mostly present in the Volga-Uralic region and western Siberian Khanty and Mansi populations.

         N1a1a1a1a-L392/L1026), formed ca. 4400 BC, TMRCA 2800 BC:

The neighbor clade, N3a3’6-CTS6967, spreads from eastern Siberia to the eastern part of Fennoscandia and the Baltic States

haplogroup_n3a3
Frequency-Distribution Maps of Individual Subclade N3a3 / N1a1a1a1a1a-CTS2929/VL29, probably initially with Akozino warrior-traders.

           N1a1a1a1a1a-CTS2929/VL29, formed ca. 2100 BC, TMRCA 1600 BC:

In Europe, the clade N3a3-VL29 encompasses over a third of the present-day male Estonians, Latvians, and Lithuanians but is also present among Saami, Karelians, and Finns (Table S2 and Figure 3). Among the Slavic-speaking Belarusians, Ukrainians, and Russians, about three-fourths of their hg N3 Y chromosomes belong to hg N3a3.

In the post on Finno-Permic expansions, I depicted what seems to me the most likely way of infiltration of N1c-L392 lineages with Akozino warrior-traders into the western Finno-Ugric populations, with an origin around the Barents sea.

This includes the potential spread of (a minority of) N1c-B211 subclades due to contacts with Anonino on both sides of the Urals, through a northern route of forest and forest-steppe regions (equivalent to the distribution of Cherkaskul compared to Andronovo), given the spread of certain subclades in Ugric populations.

NOTE. An alternative possibility is the association of certain B211 subclades with a southern route of expansion with Pre-Scythian and Scythian populations, under whose influence the Ananino culture emerged -which would imply a very quick infiltration of certain groups of haplogroup N everywhere among Finno-Ugrics on both sides of the Urals – , and also the expansion of some subclades with Turkic-speaking peoples, who apparently expanded with alliances of different peoples. Both (Scythian and Turkic) populations expanded from East Asia, where haplogroup N (including N1c) was present since the Neolithic. I find this a worse model of expansion for upper clades, but – given the YFull estimates and the presence of this haplogroup among Turkic peoples – it is a possibility for many subclades.

           N1a1a1a1a2-Z1936, formed ca. 2800 BC, TMRCA 2400 BC:

The only notable exception from the pattern are Russians from northern regions of European Russia, where, in turn, about two-thirds of the hg N3 Y chromosomes belong to the hg N3a4-Z1936—the second west Eurasian clade. Thus, according to the frequency distribution of this clade, these Northern Russians fit better among other non-Slavic populations from northeastern Europe. N3a4 tends to increase in frequency toward the northeastern European regions but is also somewhat unexpectedly a dominant hg N3 lineage among most Turcic-speaking Volga Tatars and South-Ural Bashkirs.

haplogroup_n3a4
Frequency-Distribution Maps of Individual Subclade N3a4 / N1a1a1a1a2-Z1936, probably with the Samic (first) and Fennic (later) expansions into Paleo-Lakelandic and Palaeo-Laplandic territories.

The expansion of N1a-Z1936 in Fennoscandia is most likely associated with the expansion of Saami into asbestos ware-related territory (like the Lovozero culture) during the Late Iron Age – and mixture with its population – , and with the later Fennic expansion to the east and north, replacing their language, as well as with Arctic and forest populations assimilated during Permic, Ugric, and Samoyedic expansions to the north.

           N1a1a1a1a4-M2019 (previously N3a2), formed ca. 4400 BC, TMRCA 1700 BC:

Sub-hg N3a2-M2118 is one of the two main bifurcating branches in the nested cladistic structure of N3a2’6-M2110. It is predominantly found in populations inhabiting present-day Yakutia (Republic of Sakha) in central Siberia and at lower frequencies in the Khanty and Mansi populations, which exhibit a distinct Y-STR pattern (Table S7) potentially intrinsic to an additional clade inside the sub-hg N3a2

The second widespread sub-clade of hg N is N2a. (…):

   N1a2b-P43 (B523/FGC10846/Y3184), formed ca. 6800 BC, TMRCA ca. 2700 BC:

The absolute majority of N2a individuals belong to the second sub-clade, N2a1-B523, which diversified about 4.7 kya (95% CI = 4.0–5.5 kya). Its distribution covers the western and southern parts of Siberia, the Taimyr Peninsula, and the Volga-Uralic region with frequencies ranging from from 10% to 30% and does not extend to eastern Siberia (…)

haplogroup_n2
Geographic-Distribution Map of hg N2a1 / N1a2b-P43

The “European” branch suggested earlier from Y-STR patterns turned out to consist of two clades

     N1a2b2a-Y3185/FGC10847, formed ca. 2200 BC, TMRCA 800 BC:

N2a1-L1419, spread mainly in the northern part of that region.

     N1a2b2b1-B528/Y24382, formed ca. 900 BC, TMRCA ca. 900 BC:

N2a1-B528, spread in the southern Volga-Uralic region.

Haplogroup R1a

We also have a good idea of the distribution of haplogroup R1a-Z645 in ancient samples. Its subclades were associated with the Corded Ware expansion, and some of them fit quite well the early expansion of Finno-Permic, Ugric, and Samoyedic peoples to the east.

r1a-z282-z280-z2125-distribution
Modified image, from Underhill et al. (2015). Spatial frequency distributions of Z282 (green) and Z93 (blue) affiliated haplogroups.. Notice the potential Finno-Ugric-associated distribution of Z282 (especially R1a-M558, a Z280 subclade), the expansion of R1a-Z2123 subclades with Central Asian forest-steppe groups.

This is how the modern distribution of R1a among Uralians looks like, from the latest report in Tambets et al. (2018):

  • Among Fennic populations, Estonians and Karelians (ca. 1.1 million) have not suffered the greatest bottleneck of Finns (ca. 6-7 million), and show thus a greater proportion of R1a-Z280 than N1c subclades, which points to the original situation of Fennic peoples before their expansion. To trust Finnish Y-DNA to derive conclusions about the Uralic populations is as useful as relying on the Basque Y-DNA for the language spread by R1b-P312
  • Among Volga-Finnic populations, Mordovians (the closest to the original Uralic cluster, see above) show a majority of R1a lineages (27%).
  • Hungarians (ca. 13-15 million) represent the majority of Ugric (and Finno-Ugric) peoples. They are mainly R1a-Z280, also R1a-Z2123, have little N1c, and lack Siberian ancestry, and represent thus the most likely original situation of Ugric peoples in 4th century AD (read more on Avars and Hungarians).
  • Among Samoyedic peoples, the Selkup, the southernmost ones and latest to expand – that is, those not heavily admixed with Siberian populations – , also have a majority of R1a-Z2123 lineages (see also here for the original Samoyedic haplogroups to the south).

To understand the relevance of Hungarians for Ugric peoples, as well as Estonians, Karelians, and Mordovians (and northern Russians, Finno-Ugric peoples recently Russified) for Finno-Permic peoples, as opposed to the Circum-Arctic and East Siberian populations, one has to put demographics in perspective. Even a modern map can show the relevance of certain territories in the past:

population-density
Population density (people per km2) map of the world in 1994. From Wikipedia.

Summary of ancestry + haplogroups

Fennic and Samic populations seem to be clearly influenced by Palaeo-Laplandic peoples, whereas Volga-Finnic and especially Permic populations may have received gene flow from both, but essentially Palaeo-Siberian influence from the north and east.

The fact that modern Mansis and Khantys offer the highest variation in N1a subclades, and some of the highest “Siberian ancestry” among non-Nganasans, should have raised a red flag long ago. The fact that Hungarians – supposedly stemming from a source population similar to Mansis – do not offer the same amount of N subclades or Siberian ancestry (not even close), and offer instead more R1a, in common with Estonians (among Finno-Samic peoples) and Mordvins (among Volga-Finnic peoples) should have raised a still bigger red flag. The fact that Nganasans – the model for Siberian ancestry – show completely different N1a2b-P43 lineages should have been a huge genetic red line (on top of the anthropological one) to regard them as the Uralian-type population.

We know now that ethnolinguistic groups have usually expanded with massive (usually male-biased) migrations, and that neighbouring locals often ‘resurge’ later without changing the language. That is seen in Europe after the spread of Bell Beakers, with the increase of previous ancestry and lineages in Scandinavia during the formation of the Nordic ethnolinguistic community; in Central-West Europe, with the resurgence of Neolithic ancestry (and lineages) during the Bronze Age over steppe ancestry; and in Central-East Europe (with Unetice or East European Bronze Age groups like Mierzanowice, Trzciniec, or Lusatian) showing an increase in steppe ancestry (and resurge of R1a subclades); none of them represented a radical ethnolinguistic change.

finno-ugric-haplogroup-n
Map of archaeological cultures in north-eastern Europe ca. 8th-3rd centuries BC. [The Mid-Volga Akozino group not depicted] Shaded area represents the Ananino cultural-historical society. Fading purple arrows represent likely stepped movements of subclades of haplogroup N for centuries (e.g. Siberian → Ananino → Akozino → Fennoscandia [N-VL29]; Circum-Arctic → forest-steppe [N1, N2]; etc.). Blue arrows represent eventual expansions of Uralic peoples to the north. Modified image from Vasilyev (2002).

It is not hard to model the stepped arrival, infiltration, and/or resurge of N subclades and “Siberian ancestries”, as well as their gradual expansion in certain regions, associated with certain migrations first – such as the expansions to the Circum-Arctic region, and later the Scythian- and Turkic-related movements – , as well as limited regional developments, like the known bottleneck in Finns, or the clear late expansion of Ugric and Samoyedic languages to the north among nomadic Palaeo-Siberians due to traditions of exogamy and multilingualism. This fits quite well with the different arrival of N (N1c and xN1c) lineages to the different Uralic-speaking groups, and to the stepped appearance of “Siberian ancestry” in the different regions.

The aternative

It is evident that a lot of people were too attached to the idea of Palaeolithic R1b lineages ‘native’ to western Europe speaking Basque languages; of R1a lineages speaking Indo-European and spreading with Yamna; and N lineages ‘native’ to north-eastern Europe and speaking Uralic, and this is causing widespread weeping and gnashing of teeth (instead of the joy of discovering where one’s true patrilineal ancestors come from, and what language they spoke in each given period, which is the supposed objective of genetic genealogy…)

Since an Indo-Germanic branch (as revived now by some in the Copenhaguen group to fit Kristiansen’s theory of the 1980s with recent genetic data) does not make any sense in linguistics, the finding of R1a in Yamna would not have led where some think it would have, because North-West Indo-European would still be the main Late PIE branch in Europe. Don’t take my word for it; take James P. Mallory’s (2013).

mallory-adams-tree
The levels of Indo-European reconstruction, from Mallory & Adams (2006).

If an (unlikely) Indo-Slavonic group were posited, though, such a group would still be bound (with Indo-Iranian) to the steppes with East Yamna/Poltavka (admixing with Abashevo migrants, but retaining its language), developing Sintashta/Potapovka → Srubna/Andronovo, and R1a lineages would have equally undergone the known bottlenecks of the steppes where they replaced R1b-Z2103 – which this eastern group shares with Balkan languages, a haplogroup that links therefore together the Graeco-Aryan group.

As far as I know – and there might be many other similar pet theories out there – there have been proposals of “modern Balto-Slavic-like” populations (in an obvious circular reasoning based on modern populations) in some Scythian clusters of the Iron Age.

NOTE. I will not enter into “Balto-Slavic-like R1a” of the Late Bronze Age or earlier because no one can seriously believe at this point of development of Population Genetics that autosomal similarity predating 1,500+ years the appearance of Slavs equates to their (ethnolinguistic) ancestral population, without a clear intermediate cultural and genetic trail – something we lack today in the Slavic case even for the late Roman period…

finno-saamic-palaeo-germanic-substratum
The Finnic and Saamic separation looks shallower than it actually is. Invisible convergence can be ‘triangulated’ with the help of Germanic layers of mutual loanwords (Häkkinen 2012).

We also know of R1a-Z280 lineages in Srubna, probably expanding to the west. With that in mind, and knowing that Palaeo-Germanic was in close contact with Finno-Samic while both were already separated but still in contact, and that Palaeo-Germanic was also in contact and closely related to a ‘Temematic’ distinct from Balto-Slavic (and also that early Proto-Baltic and Proto-Slavic from the Roman Iron Age and later were in contact with western Uralic) this will be the linguistic map of the Iron Age if R1a is considered to expand Indo-European from some kind of “patron-client” relationship with west Yamna:

palaeo-germanic-italo-celtic
Eastern European language map during the Late Bronze Age / Iron Age, if R1a spread Indo-European languages and Eastern Yamna spoke Indo-Slavonic. Palaeo-Germanic (i.e. Pre- to Proto-Germanic) needs to be in contact with both the Samic Lovozero population and the Fennic west Circum-Arctic one. Italic and Celtic in contact with Pre-Germanic. Germanic in contact with Temematic. Balto-Slavic in contact with Iranian, and near Fennic to allow for later loanwords. For Germanic and Temematic, see Kortlandt (2018).

You might think I have some personal or political reason against this kind of proposals. I haven’t. We have been proposing Indo-European to be the language of the European Union for more than 10 years, so to support R1b-Italo-Celtic in the whole Western Europe, R1a-Germanic in Central and Eastern Europe, and R1a-Indo-Slavonic in the steppes (as the Danish group seems to be doing) has nothing inherently bad (or good) for me. If anything, it gives more reason to support the revival of North-West Indo-European in Europe.

My problem with this proposal is that it is obviously beholden to the notion of the uninterrupted cultural, historic and ethnic continuity in certain territories. This bias is common in historiography (von Falkenhausen 1993), but it extends even more easily into the lesser known prehistory of any territory, and now more than ever some people feel the need to corrupt (pre)history based on their own haplogroups (or the majority haplogroups of their modern countries). However, more than on philosophical grounds, my rejection is based on facts: this picture is not what the combination of linguistic, archaeological, and genetic data shows. Period.

Nevertheless, if Yamna + Corded Ware represented the “big and early expansion” of Germanic and Italo-Celtic peoples proper of the dream Nazi’s Lebensraum and Fascist’s spazio vitale proposals; Uralians were Siberian hunter-gatherers that controlled the whole eastern and northern Russia, and miraculously managed to push (ethnolinguistically) Neolithic agropastoralists to the west during and after the Iron Age, with gradual (and often minimal) genetic impact; and Balto-Slavic peoples were represented by horse riders from Pokrovka/Srubna, hiding then somewhere around the forest-steppe until after the Scythian expansion, and then spreading their language (without much genetic impact) during the early Middle Ages…so be it.

See also

Related