Bell Beakers and Mycenaeans from Yamnaya; Corded Ware from the forest steppe


I have recently written about the spread of Pre-Yamnaya or Yamnaya ancestry and Corded Ware-related ancestry throughout Eurasia, using exclusively analyses published by professional geneticists, and filling in the gaps and contradictory data with the most reasonable interpretations. I did so consciously, to avoid any suspicion that I was interspersing my own data or cherry picking results.

Now I’m finished recapitulating the known public data, and the only way forward is the assessment of these populations using the available datasets and free tools.

Understanding the complexities of qpAdm is fairly difficult without a proper genetic and statistical background, which I won’t pretend to have, so its tweaking to get strictly correct results would require an unending game of trial and error. I have sadly little time for this, even taking my tendency to procrastination into account… so I have used a simple model akin to those published before – in particular, the outgroup selection by Ning, Wang et al. (2019), who seem to be part of the only group interested in distinguishing Yamnaya-related from Corded Ware-related ancestry, probably the most relevant question discussed today in population genomics regarding the Proto-Indo-European and Proto-Uralic homelands.

Supplementary Table 13. P values of rank=2 and admixture proportions in modelling Steppe ancestry populations as a three-way admixture of Eneolithic steppe Anatolian_Neolithic and WHG using 14 outgroups.
Left populations: Test, Eneolithic_steppe, Anatolian_Neolithic, WHG.
Right populations: Mbuti.DG, Ust_Ishim.DG, Kostenki14, MA1, Han.DG, Papuan.DG, Onge.DG, Villabruna, Vestonice16, ElMiron, Ethiopia_4500BP.SG, Karitiana.DG, Natufian, Iran_Ganj_Dareh_Neolithic.

I have used for all analyses below a merged dataset including the curated one of the Reich Lab, the latest on Central and South Asia by Narasimhan, Patterson et al. (2019), on Iberia by Olalde et al. (2019), and on the East Baltic by Saag et al. (2019), as well as datasets including samples from Wang et al. (2019) and Lamnidis et al. (2018). I used (and intend to use) the same merged dataset in all cases, despite its huge size, to avoid adding one more uncontrolled variable to the analyses, so that all results obtained can be compared.

I try to prepare in advance a bunch of relevant files with left pops and right pops for each model:

  1. It seems a priori more reasonable to use geographically and chronologically closer proxy populations (say, Trypillia or GAC for Steppe-related peoples) than hypothetic combinations of ancestral ones (viz. Anatolian farmer, WHG, and EHG).
  2. This also means using subgroups closer to the most likely source population, such as (Don-Volga interfluve) Yamnaya_Kalmykia rather than (Middle Volga) Yamnaya_Samara for the western expansion of late Repin/early Yamnaya, or the early Germany_Corded_Ware.SG or Czech_Corded Ware for the group closest to the Proto-Corded Ware population (see below), likely neighbouring the Upper Vistula region.
  3. I usually test two source populations for different targets, which seems like a much more efficient way of using computer resources, whenever I know what I want to test, since I need my PC back for its normal use; whenever I don’t know exactly what to test, I use three-way admixture models and look for subsets to try and improve the results.

I have probably left out some more complex models by individualizing the most relevant groups, but for the time being this would have to do. Also, no other formal stats have been used in any case, which is an evident shortcoming, ruling out an interpretation drawn directly and only from the results below.

Full qpAdm results for each batch of samples are presented in a Google Spreadsheet, with each tab (bottom of the page) showing a different combination of sources, usually in order of formally ‘best’ (first to the left) to ‘worst’ (last to the right) fits, although the order is difficult to select in highly heterogeneous target groups, as will be readily visible.

Disintegration, migration, and imports of the Azov–Black Sea region. First migration event (solid arrows): Gordineşti–Maikop expansion (groups: I – Bursuchensk; II – Zhyvotylivka; III – Vovchans’k; IV – Crimean; V – Lower Don; VI – pre-Kuban). Second migration event (hollow arrows): Repin expansion. After Rassamakin (1999), Demchenko (2016).

Corded Ware origins

The latest publications on the Yampil barrow complex have not improved much our understanding of the complexity of Corded Ware origins from an archaeological point of view, involving multiple cultural (hence likely population) influences. This bit is from Ivanova et al., Baltic-Pontic Studies (2015) 20:1, and most hypotheses of the paper remain unanswered (except maybe for the relevance of the Złota group):

In the light of the above outline therefore one should argue that the ‘architecture of barrows’ associated in the ‘Yampil landscape’ of the Middle Dniester Area with the Eneolithic (specifically, mainly with the TC), precedes the development of a similar phenomenon that can be observed from 2900/2800 BC in the Upper Dniester Area and drainage basin of the Upper Vistula, associated with the CWC [Goslar et al. 2015; Włodarczak 2006; 2007; 2008; Jarosz, Włodarczak 2007]. The most consuming research question therefore is whether ritual customs making use of Eneolithic (Tripolye) ‘barrow architecture’ could have penetrated northwards along the Dniester route, where GAC communities functioned. One could also ask what role the rituals played among the autochthons [Kośko 2000; Włodarczak 2008; 2014: 335; Ivanova, Toshchev 2015b].

This issue has already been discussed with a resulting tentative systemic taxonomy in the studies of Włodarczak, arguing for the Złota culture (ZC) in the Vistula region as an illustration of one of the (Małopolska) reception centres of civilization inspirations from the oldest Pontic ‘barrow culture’ circle associated with the Eneolithic and Early Bronze Age [Włodarczak 2008]. Notably, it is in the ZC that one can notice a set of cultural traits (catacomb grave construction, burial details, forms and decoration of vessels) analogous to those shared by the north-western Black Sea Coast groups of the forest-steppe Eneolithic (chiefly Zhyvotilovka-Volchansk) and the Late Tripolye circle (chiefly Usatovo-Gordinești-Horodiștea-Kasperovtsy).

Globular Amphorae culture „exodus” to the Danube Delta: a – Globular Amphorae culture; b – GAC (1), Gorodsk (2), Vykhvatintsy (3) and Usatovo (4) groups of Trypillia culture; c – Coţofeni culture; d – northern border of the late phase of Baden culture;red arrows – direction of Globular Amphora culture expansion; blue arrow – direction of „reflux” of Globular Amphora culture (apud Włodarczak, 2008, with changes).

Taking into account that I6561 might be wrongly dated, we cannot include the Corded Ware-like sample of the end-5th millennium BC in the analysis of Corded Ware origins. That uncertainty in the chronology of the appearance of “Steppe ancestry” in Proto-Corded Ware peoples complicates the selection of any potential source population from the CHG cline.

Nevertheless, the lack of hg. R1a-M417 and sizeable Pre-Yamnaya-related ancestry in the sampled Pontic forest-steppe Eneolithic populations (represented exclusively by two samples from Dereivka ca. 3600-3400 BC) would leave open the interesting possibility that a similar ancestry got to the forest-steppe region between modern Poland and Ukraine during the known complex population movements of the Late Eneolithic.

It is known that Corded Ware-derived groups and Steppe Maykop show bad fits for Pre-Yamnaya/Yamnaya ancestry, and also that Steppe Maykop is a potential source of “Steppe-related ancestry” within the Eneolithic CHG mating network of the Pontic-Caspian steppes and forest-steppes. Testing Corded Ware for recent Trypillia and Maykop influences, proper of Late Trypillia and Late Maykop groups in the North Pontic area (such as Zhyvotylivka–Vovchans’k and Gordineşti) side by side with potential Pre-Yamnaya and Yamnaya sources makes thus sense:

Now, the main obvious difference between Khvalynsk-Yamnaya and Corded Ware is the long-lasting, pervasive Y-chromosome bottlenecks under R1b lineages in the former, compared to the haplogroup variability and late bottleneck under R1a-M417 in the latter, which speaks in favour – on top of everything else – of a different community of sub-Neolithic hunter-gatherers including hg. R1a-M417 hijacking the expansion of Steppe_Maykop-related ancestry around the Volhynian-Podolian Upland.

Akin to how Yamnaya patrilineal descendants hijacked regional EEF (±CWC) ancestry components mainly through exogamy, dragging them into the different expanding Bell Beaker groups (see below), but kept their Indo-European languages, these hunter-gatherers that admixed with peoples of “Steppe ancestry” were the most likely vector of expansion of Uralic languages in Eastern Europe.

PCA of ancient Eurasian samples. Marked likely Proto-Corded Ware samples and potential origin of its PCA cluster based on qpAdm results. See full PCA and more related files.

Baltic Corded Ware

One of the most interesting aspects of the results above is the surprising heterogeneity of the different regional groups, which is also reflected in the Y-DNA variability of early Corded Ware samples.

Seeing how Baltic CWC groups, especially the early Latvia_LN sample, show particularly bad fits with the models above, it seems necessary to test how this population might have come to be. My first impression in 2017 was that they could represent early Corded Ware groups admixed with Yamnaya settlers through their interactions along the Dnieper-Dniester corridor.

However, I recently predicted that the most likely admixture leading to their ancestry and PCA cluster would involve a Corded Ware-like group and a group related to sub-Neolithic cultures of eastern Europe, whose best proxy to date are EHG-like Khvalynsk samples (i.e. excluding the outlier with Pre-Yamnaya ancestry, I0434):

Detail of the PCA of the Corded Ware expansion. See full PCA and more related files.

Late Corded Ware + Yamnaya vanguard

Relevant are also the mixtures of Corded Ware from Esperstedt, and particularly those of the sample I0104, which I have repeated many times in this blog I suspected to be influenced by vanguard Yamnaya settlers:

The infeasible models of CWC + Yamnaya_Kalmykia ± Hungary_Baden (see below for Bell Beakers) and the potential cluster formed with other samples from the Baltic suggest that it could represent a more complex set of mixtures with sub-Neolithic populations. On the other hand, its location in Germany, late date (ca. 2500 BC or later), and position in the PCA, together with the good fits obtained for Germany_Beaker as a source, suggest that the increase in Steppe-related ancestry + EEF makes it impossible for the model (as I set it) to directly include Yamnaya_Kalmykia, despite this excess Steppe-related ancestry actually coming from Yamnaya vanguard groups.

I think it is very likely that the future publication of EEF-admixed Yamnaya_Hungary samples (or maybe even Yamnaya vanguard samples) will improve the fits of this model.

These results confirm at least the need to distrust the common interpretation of mixtures including late Corded Ware samples from Esperstedt (giving rise to the “up to 75% Yamnaya ancestry of CWC” in the 2015 papers) as representative of the Corded Ware culture as a whole, and to keep always in mind that an admixture of European BA groups including Corded Ware Esperstedt as a source also includes East BBC-like ancestry, unless proven otherwise.

Yamnaya vanguard groups in Corded Ware territory before the expansion of Bell Beakers (ca. 2500 BC). See full map.

Bell Beaker expansion

A hotly (re)debated topic in the past 6 months or so, and for all the wrong reasons, is the origin of the Bell Beaker folk. Archaeology, linguistics, and different Y-chromosome bottlenecks clearly indicate that Bell Beakers were at the origin of the North-West Indo-European expansion in Europe, while the survival of Corded Ware-related groups in north-eastern Europe is clearly related to the expansion of Uralic languages.

NOTE. For the interesting case of Proto-Indo-Iranians expanding with Corded Ware-like ancestry, see more on the formation of Sintashta-Potapovka-Filatovka from East Uralic-speaking Abashevo and Pre-Proto-Indo-Iranian-speaking Poltavka herders. See also more on R1a in Indo-Iranians and on the social complexity of Sintashta.

Nevertheless, every single discarded theory out there seems to keep coming back to life from time to time, and a new wave of interest in “Bell Beaker from the Single Grave culture” somehow got revived in the process, too, because this obsession – unlike the “Bell Beakers from Iberia Chalcolithic” – is apparently acceptable in certain circles, for some reason.

We know that Iberian Beakers, British Beakers, or Sicilian EBA – representing the most likely closest source population of speakers of Proto-Galaico-Lusitanian, Pre-Celtic Indo-European, and Proto-Elymian, respectively – have already been successfully tested for a direct origin among Western European Beakers in Olalde et al. (2018), Olalde et al. (2019), and Fernandes et al. (2019).

This success in ascertaining a closer Beaker source is probably due to the physical isolation of the specific groups (related to Germany_Beaker, Netherlands_Beaker, and NE_Mediterranean_Beaker samples, respectively) after their migration into regions dominated by peoples without Steppe-related ancestry. Furthermore, Celtic-speaking populations expanding with Urnfield south of the Pyrenees also show a good fit with a source close to France_Beaker.

So I decided to test sampled Bell Beaker populations, to see if it could shed light to the most likely source population of individual Beaker groups and the direction of migration within Central Europe, i.e. roughly eastwards or westwards. As it was to be expected for closely related populations (see the relevant discussion here), an attempt to offer a simplistic analysis of direction based on formal stats does not make any sense, because most of the alternative hypotheses cannot be rejected:

Not only because of the similar values obtained, but because it is absurd to take p-values as a measure of anything, especially when most of these conflicting groups with slightly ‘better’ or ‘worse’ p-values represent multiple different mixtures of the type (Yamnaya + EEF) + (Corded Ware + EEF ± Yamnaya), impossible to distinguish without selecting proper, direct ancestral populations…

A further example of how explosive the Bell Beaker expansion was into different territories, and of their extensive local admixture, is shown by the unsuccessful attempt by Olalde et al. (2018) to obtain an origin of the EEF source for all Beaker groups (excluding Iberian Beakers):

Investigating the genetic makeup of Beaker-complex-associated individuals. Testing different populations as a source for the Neolithic ancestry component in Beaker-complex-associated individuals. The table shows P values (* indicates values > 0.05) for the fit of the model: ‘Steppe_EBA + Neolithic/Copper Age’ source population.
Map of attested Yamnaya pit-grave burials in the Hungarian plains; superimposed in shades of blue are common areas covered by floods before the extensive controls imposed in the 19th century; in orange, cumulative thickness of sand, unfavourable loamy sand layer. Marked are settlements/findings of Boleráz (ca. 3500 BC on), Baden (until ca. 2800 BC), Kostolac (precise dates unknown), and Yamna kurgans (from ca. 3100/3000 BC on).

Now, there is a simpler way to understand what kind of Steppe-related ancestry is proper of Bell Beakers. I tested two simple models for some Beaker groups: Yamnaya + Hungary Baden vs. Corded Ware + GAC Poland. After all, the Bell Beaker folk should prefer a source more closely related to either Yamnaya Hungary or Central European Corded Ware:

Interestingly, models including Yamnaya + Baden show good fits for the most important groups related to North-West Indo-Europeans, including Bell Beakers from Germany, the Netherlands, Italy, and Poland, representing the most likely closest source populations of speakers of Pre-Proto-Celtic, Pre-Proto-Germanic, Proto-Italo-Venetic, and Pre-Proto-Balto-Slavic, respectively.

The admixed Yamnaya samples from Hungary that will hopefully be published soon by the Jena Lab will most likely further improve these fits, especially in combination with intermediate Chalcolithic populations of the Middle and Upper Danube and its tributaries, to a point where there will be an absolute chronological and geographical genomic trail from the fully Yamnaya-like Yamnaya settlers from Hungary to all North-West Indo-European-speaking groups of the Early Bronze Age.

The only difference between groups will be the gradual admixture events of their source Beaker group with local populations on their expansion paths, including peoples of mainly EEF, CWC+EEF, or CWC+EEF+Yamnaya related ancestry. There is ample evidence beyond ancestry models to support this, in particular continued Y-DNA bottlenecks under typical Yamnaya paternal lineages, mainly represented by R1b-L51 subclades.

Distribution of the Bell Beaker East Group, with its regional provinces, as of c. 2400 cal BC (after Heyd et al. 2004, modified). See full maps.

European Early Bronze Age

European EBA groups that might show conflicting results due to multiple admixture events with Corded Ware-related populations are the Únětice culture and the Nordic Late Neolithic.

The results for Únětice groups seem to be in line with what is expected of a Central European EBA population derived from Bell Beakers admixed with surrounding poulations of East Bell Beaker and/or late (Epi-)Corded Ware descent.

Potential models of mixture for Nordic Late Neolithic samples – despite the bad fits due to the lack of direct ancestral CWC and BBC groups from Denmark – seem to be impossible to justify as derived exclusively from Single Grave or (even less) from Battle Axe peoples, supporting immigration waves of Bell Beakers from the south and further admixture events with local groups through maritime domination.

PCA of ancient European samples. Marked are Bronze Age clusters. See full PCAs.

Balkans Bronze Age

The potential origin of the typical Corded Ware Steppe-related ancestry in the social upheaval and population movements of the Dnieper-Dniester forest-steppe corridor during the 4th millennium BC raises the question: how much do Balkan Bronze Age groups owe their ancestry to a population different than the spread of Pre-Yamnaya-like Suvorovo-Novodanilovka chieftains? Furthermore, which Bronze Age groups seem to be more likely derived exclusively from Pre-Yamnaya groups, and which are more likely to be derived from a mixture of Yamnaya and Pre-Yamnaya? Do the formal stats obtained correspond to the expected results for each group?

Since the expansion of hg. I2a-L699 (TMRCA ca. 5500 BC) need not be associated with Yamnaya, some of these values – together with the assessment of each individual archaeological culture – may question their origin in a Yamnaya-related expansion rather than in a Khvalynsk-related one.

NOTE. These are the last ones I was able to test yesterday, and I have not thought these models through, so feel free to propose other source and target groups. In particular, complex movements through the North Pontic area during the Late Eneolithic would suggest that there might have been different Steppe-ancestry-related vs. EEF-related interactions in the north-west and west Pontic area before and during the expansion of Yamnaya.


One of the key Indo-European populations that should be derived from Yamnaya to confirm the Steppe hypothesis, together with North-West Indo-Europeans, are Proto-Greeks, who will in turn improve our understanding of the preceding Palaeo-Balkan community. Unfortunately, we only have Mycenaean samples from the Aegean, with slight contributions of Steppe-related ancestry.

Still, analyses with potential source populations for this Steppe ancestry show that the Yamnaya outlier from Bulgaria is a good fit:

The comparison of all results makes it quite evident the why of the good fits from (Srubnaya-related) Bulgaria_MLBA I2163 or of Sintashta_MLBA relative to the only a priori reasonable Yamnaya and Catacomb sources: it is not about some hypothetical shared ancestor in Graeco-Aryan-speaking East Yamnaya– or even Catacomb-Poltavka-related groups, because all available Yamnaya-related peoples are almost indistinguishable from each other (at least with the sampling available today). These results reflect a sizeable contribution of similar EEF-related populations from around the Carpathians in both Steppe-related groups: Corded Ware and Yamnaya settlers from the Balkans.

Cultural groups in and around the Balkans during the Early Bronze Age. See full maps.

qpAdm magic

In hobby ancestry magic, as in magic in general, it is not about getting dubious results out of thin air: misdirection is the key. A magician needs to draw the audience attention to ‘remarkable’ ancestry percentages coupled with ‘great’ (?) p-values that purportedly “prove” what the audience expects to see, distracting everyone from the true interesting aspects, like statistical design, the data used (and its shortcomings), other opposing models, a comparison of values, a proper interpretation…you name it.

I reckon – based on the examples above – that the following problems lie at the core of bad uses of qpAdm:

  1. In the formal aspect, the poor understanding of what p-values and other formal stats obtained actually mean, and – more importantly – what they don’t mean. The simplistic trend to accept results of a few analyses at face value is necessarily wrong, in so far as there is often no proper reasoning of what is being assessed and how, and there is never a previous opinion about what could be expected if the alternative hypotheses were true.
  2. In the interpretation aspect, the poor judgement of accompanying any results with simplistic, superficial, irrelevant, and often plainly wrong archaeological or linguistic data selected a posteriori; the inclusion of some racial or sociopolitical overtones in the mixture to set a propitious mood in the target audience; and a sort of ritualistic theatrics with the main theme of ‘winning’, that is best completed with ad hominems.

If you get rid of all this, the most reasonable interpretation of the output of a model proposed and tested should be similar to Nick Patterson’s words in his explanation of qpWave and qpAdm use:

Here we see that, at least in this analysis there are reasonable models with CordedWareNeolithic is a mix of either WHG or LBKNeolithic and YamnayaEBA. (…) The point of this note is not to give a serious phylogenetic analysis but the results here certainly support a major Steppe contribution to the Corded Ware population, which is entirely concordant with the archaeology [?].

Very far, as you can see, from the childish “Eureka! I proved the source!”-kind of thinking common among hobbyists.

The Mycenaean case is an illustrative example: if the Yamnaya outlier from Bulgaria were not available, and if one were not careful when designing and assessing those mixture models, the interpretation would range from erroneous (viz. a Graeco-Aryan substrate, as I initially thought) to impossible (say, inventing migration waves of Sintashta or Srubnaya peoples into Crete). The models presented above show that a contribution of Yamnaya to Mycenaeans couldn’t be rejected, and this alone should have been enough to accept Yamnaya as the most likely source population of “Steppe ancestry” in Proto-Greeks, pending intermediate samples from the Balkans. In other words, one could actually find that ‘the best’ p-values for source populations of Mycenaeans is a combination of modern Poles + Turks, despite the impracticality of such a model…

I haven’t been able to reproduce results which supposedly showed that Corded Ware is more likely to be derived from (Pre-)Yamnaya than other source population, or that Corded Ware is better suited as the ancestral population of Bell Beakers. The analyses above show values in line with what has been published in recent scientific papers, and what should be expected based on linguistics and archaeology. So I’ll go out on a limb here and say that it’s only through a careful selection of outgroups and samples tested, and of as few compared models as possible, that you could eventually get this kind of results and interpretation, if at all.

Whether that kind of special care for outgroups and samples is about (a) an acceptable fine-tuning of the analyses, (b) a simplistic selection dragged from the first papers published and applied indiscriminately to all models, or (c) cherry picking analyses until results fit the expected outcome, is a question that will become mostly irrelevant when future publications continue to support an origin of the expansion of ancient Indo-European languages in Khvalynsk- and Yamnaya-related migrations.

Feel free to suggest (reasonable) modifications to correct some of these models in the comments. Also, be sure to check out other values such as proportions, SD or SNPs of the different results that I might have not taken into account when assessing ‘good’ or ‘bad’ fits.


Vikings, Vikings, Vikings! “eastern” ancestry in the whole Baltic Iron Age


Open access Population genomics of the Viking world, by Margaryan et al. bioRxiv (2019), with a huge new sampling from the Viking Age.

Interesting excerpts (emphasis mine, modified for clarity):

To understand the genetic structure and influence of the Viking expansion, we sequenced the genomes of 442 ancient humans from across Europe and Greenland ranging from the Bronze Age (c. 2400 BC) to the early Modern period (c. 1600 CE), with particular emphasis on the Viking Age. We find that the period preceding the Viking Age was accompanied by foreign gene flow into Scandinavia from the south and east: spreading from Denmark and eastern Sweden to the rest of Scandinavia. Despite the close linguistic similarities of modern Scandinavian languages, we observe genetic structure within Scandinavia, suggesting that regional population differences were already present 1,000 years ago.

Maps illustrating the following texts have been made based on data from this and other papers:

  • Maps showing ancestry include only data from this preprint (which also includes some samples from Sigtuna).
  • Maps showing haplogroup density include Vikings from other publications, such as those from Sigtuna in Krzewinska et al. (2018), and from Iceland in Ebenesersdóttir et al. (2018).
  • Maps showing haplogroups of ancient DNA samples based on their age include data from all published papers, but with slightly modified locations to avoid overcrowding (randomized distance approx. ± 0.1 long. and lat.).

Y-DNA haplogroups in Europe during the Viking expansions (full map). See other maps from the Middle Ages.

We find that the transition from the BA to the IA is accompanied by a reduction in Neolithic farmer ancestry, with a corresponding increase in both Steppe-like ancestry and hunter-gatherer ancestry. While most groups show a slight recovery of farmer ancestry during the VA, there is considerable variation in ancestry across Scandinavia. In particular, we observe a wide range of ancestry compositions among individuals from Sweden, with some groups in southern Sweden showing some of the highest farmer ancestry proportions (40% or more in individuals from Malmö, Kärda or Öland).

Ancestry proportions in Norway and Denmark on the other hand appear more uniform. Finally we detect an influx of low levels of “eastern” ancestry starting in the early VA, mostly constrained among groups from eastern and central Sweden as well as some Norwegian groups. Testing of putative source groups for this “eastern” ancestry revealed differing patterns among the Viking Age target groups, with contributions of either East Asian- or Caucasus-related ancestry.

Ancestry proportions of four-way models including additional putative source groups for target groups for which three-way fit was rejected (p ≤ 0.01);

Overall, our findings suggest that the genetic makeup of VA Scandinavia derives from mixtures of three earlier sources: Mesolithic hunter-gatherers, Neolithic farmers, and Bronze Age pastoralists. Intriguingly, our results also indicate ongoing gene flow from the south and east into Iron Age Scandinavia. Thus, these observations are consistent with archaeological claims of wide-ranging demographic turmoil in the aftermath of the Roman Empire with consequences for the Scandinavian populations during the late Iron Age.

Genetic structure within Viking-Age Scandinavia

We find that VA Scandinavians on average cluster into three groups according to their geographic origin, shifted towards their respective present-day counterparts in Denmark, Sweden and Norway. Closer inspection of the distributions for the different groups reveals additional complexity in their genetic structure.

Natural neighbor interpolation of “Danish ancestry” among Vikings.

We find that the ‘Norwegian’ cluster includes Norwegian IA individuals, who are distinct from both Swedish and Danish IA individuals which cluster together with the majority of central and eastern Swedish VA individuals. Many individuals from southwestern Sweden (e.g. Skara) cluster with Danish present-day individuals from the eastern islands (Funen, Zealand), skewing towards the ‘Swedish’ cluster with respect to early and more western Danish VA individuals (Jutland).

Some individuals have strong affinity with Eastern Europeans, particularly those from the island of Gotland in eastern Sweden. The latter likely reflects individuals with Baltic ancestry, as clustering with Baltic BA individuals is evident in the IBS-UMAP analysis and through f4-statistics.

Natural neighbor interpolation of “Norwegian ancestry” among Vikings.

For more on this influx of “eastern” ancestry see my previous posts (including Viking samples from Sigtuna) on Genetic and linguistic continuity in the East Baltic, and on the Pre-Proto-Germanic homeland based on hydrotoponymy.

Baltic ancestry in Gotland

Genetic clustering using IBS-UMAP suggested genetic affinities of some Viking Age individuals with Bronze Age individuals from the Baltic. To further test these, we quantified excess allele sharing of Viking Age individuals with Baltic BA compared to early Viking Age individuals from Salme using f4 statistics. We find that many individuals from the island of Gotland share a significant excess of alleles with Baltic BA, consistent with other evidence of this site being a trading post with contacts across the Baltic Sea.

Natural neighbor interpolation of “Finnish ancestry” among Vikings.

The earliest N1a-VL29 sample available comes from Iron Age Gotland (VK579) ca. AD 200-400 (see Iron Age Y-DNA maps), which also proves its presence in the western Baltic before the Viking expansion. The distribution of N1a-VL29 and R1a-Z280 (compared to R1a in general) among Vikings also supports a likely expansion of both lineages in succeeding waves from the east with Akozino warrior-traders, at the same time as they expanded into the Gulf of Finland.

Density of haplogroup R1a-Z280 (samples in pink) overlaid over other R1a samples (in green, with R1a-Z284 in cyan) among Vikings.

Vikings in Estonia

(…) only one Viking raiding or diplomatic expedition has left direct archaeological traces, at Salme in Estonia, where 41 Swedish Vikings who died violently were buried in two boats accompanied by high-status weaponry. Importantly, the Salme boat-burial predates the first textually documented raid (in Lindisfarne in 793) by nearly half a century. Comparing the genomes of 34 individuals from the Salme burial using kinship analyses, we find that these elite warriors included four brothers buried side by side and a 3rd degree relative of one of the four brothers. In addition, members of the Salme group had very similar ancestry profiles, in comparison to the profiles of other Viking burials. This suggests that this raid was conducted by genetically homogeneous people of high status, including close kin. Isotope analyses indicate that the crew descended from the Mälaren area in Eastern Sweden thus confirming that the Baltic-Mid-Swedish interaction took place early in the VA.

Natural neighbor interpolation of “Swedish ancestry” among Vikings.

Viking samples from Estonia show thus ancient Swedes from the Mälaren area, which proves once again that hg. N1a-VL29 (especially subclade N1a-L550) and tiny proportions of so-called “Siberian ancestry” expanded during the Early Iron Age into the whole Baltic Sea area, not only into Estonia, and evidently not spreading with Balto-Finnic languages (since the language influence is in the opposite direction, east-west, Germanic > Finno-Samic, during the Bronze Age).

N1a-VL29 lineages spread again later eastwards with Varangians, from Sweden into north-eastern Europe, most likely including the ancestors of the Rurikid dynasty. Unsurprisingly, the arrival of Vikings with Swedish ancestry into the East Baltic and their dispersal through the forest zone didn’t cause a language shift of Balto-Finnic, Mordvinic, or East Slavic speakers to Old Norse, either…

NOTE. For N1a-Y4339 – N1a-L550 subclade of Swedish origin – as main haplogroup of modern descendants of Rurikid princes, see Volkov & Seslavin (2019) – full text in comments below. Data from ancient samples show varied paternal lineages even among early rulers traditionally linked to Rurik’s line, which explains some of the discrepancies found among modern descendants:

  • A sample from Chernihiv (VK542) potentially belonging to Gleb Svyatoslavich, the 11th century prince of Tmutarakan/Novgorod, belongs to hg. I2a-Y3120 (a subclade of early Slavic I2a-CTS10228) and has 71% “Modern Polish” ancestry (see below).
  • Izyaslav Ingvarevych, the 13th century prince of Dorogobuzh, Principality of Volhynia/Galicia, is probably behind a sample from Lutsk (VK541), and belongs to hg. R1a-L1029 (a subclade of R1a-M458), showing ca. 95% of “Modern Polish” ancestry.
  • Yaroslav Osmomysl, the 12th century Prince of Halych (now in Western Ukraine), was probably of hg. E1b-V13, yet another clearly early Slavic haplogroup.

Density of haplogroup N1a-VL29, N1a-L550 (samples in pink, most not visible) among Vikings. Samples of hg. R1b in blue, hg. R1a in green, hg. I in orange.

Finnish ancestry

Firstly, modern Finnish individuals are not like ancient Finnish individuals, modern individuals have ancestry of a population not in the reference; most likely Steppe/Russian ancestry, as Chinese are in the reference and do not share this direction. Ancient Swedes and Norwegians are more extreme than modern individuals in PC2 and 4. Ancient UK individuals were more extreme than Modern UK individuals in PC3 and 4. Ancient Danish individuals look rather similar to modern individuals from all over Scandinavia. By using a supervised ancient panel, we have removed recent drift from the signal, which would have affected modern Scandinavians and Finnish populations especially. This is in general a desirable feature but it is important to check that it has not affected inference.

PCA of the ancient and modern samples using the ancient palette, showing different PCs. Modern individuals are grey and the K=7 ancient panel surrogate populations are shown in strong colors, whilst the remaining M-K=7 ancient populations are shown in faded colors.

The story for Modern-vs-ancient Finnish ancestry is consistent, with ancient Finns looking much less extreme than the moderns. Conversely, ancient Norwegians look like less-drifted modern Norwegians; the Danish admixture seen through the use of ancient DNA is hard to detect because of the extreme drift within Norway that has occurred since the admixture event. PC4 vs PC5 is the most important plot for the ancient DNA story: Sweden and the UK (along with Poland, Italy and to an extent also Norway) are visibly extremes of a distribution the same “genes-mirror-geography” that was seen in the Ancient-palette analysis. PC1 vs PC2 tells the same story – and stronger, since this is a high variance-explained PC – for the UK, Poland and Italy.

Uniform manifold approximation and projection (UMAP) analysis of the VA and other ancient samples.

Evidence for Pictish Genomes

The four ancient genomes of Orkney individuals with little Scandinavian ancestry may be the first ones of Pictish people published to date. Yet a similar (>80% “UK ancestry) individual was found in Ireland (VK545) and five in Scandinavia, implying that Pictish populations were integrated into Scandinavian culture by the Viking Age.

Our interpretation for the Orkney samples can be summarised as follows. Firstly, they represent “native British” ancestry, rather than an unusual type of Scandinavian ancestry. Secondly, that this “British” ancestry was found in Britain before the Anglo-Saxon migrations. Finally, that in Orkney, these individuals would have descended from Pictish populations.

Natural neighbor interpolation of “British ancestry” among Vikings.

(…) ‘UK’ represents a group from which modern British and Irish people all receive an ancestry component. This information together implies that within the sampling frame of our data, they are proxying the ‘Briton’ component in UK ancestry; that is, a pre-Roman genetic component present across the UK. Given they were found in Orkney, this makes it very likely that they were descended from a Pictish population.

Modern genetic variation within the UK sees variation between ‘native Briton’ populations Wales, Scotland, Cornwall and Ireland as large compared to that within the more ‘Anglo-Saxon’ English. This is despite subsequent gene flow into those populations from English-like populations. We have not attempted to disentangle modern genetic drift from historically distinct populations. Roman-era period people in England, Wales, Ireland and Scotland may not have been genetically close to these Orkney individuals, but our results show that they have a shared genetic component as they represent the same direction of variation.

Density of haplogroup R1b-L21 (samples in red), overlaid over all samples of hg. R1b among Vikings (R1b-U106 in green, other R1b-L151 in deep red). To these samples one may add the one from Janakkala in south-western Finland (AD ca. 1300), of hg. R1b-L21, possibly related to these population movements.

For more on Gaelic ancestry and lineages likely representing slaves among early Icelanders, see Ebenesersdóttir et al. (2018).


As in the case of mitochondrial DNA, the overall distribution profile of the Y chromosomal haplogroups in the Viking Age samples was similar to that of the modern North European populations. The most frequently encountered male lineages were the haplogroups I1, R1b and R1a.

Haplogroup I (I1, I2)

The distribution of I1 in southern Scandinavia, including a sample from Sealand (VK532) ca. AD 100 (see Iron Age Y-DNA maps) proves that it had become integrated into the West Germanic population already before their expansions, something that we already suspected thanks to the sampling of Germanic tribes.

Density of haplogroup I (samples in orange) among Vikings. Samples of hg. R1b in blue, hg. R1a in green, N1a in pink.
Density of haplogroup I1 (samples in red) overlaid over all samples of hg. I among Vikings.

Haplogroup R1b (M269, U106, P312)

Especially interesting is the finding of R1b-L151 widely distributed in the historical Nordic Bronze Age region, which is in line with the estimated TMRCA for R1b-P312 subclades found in Scandinavia, despite the known bottleneck among Germanic peoples under U106. Particularly telling in this regard is the finding of rare haplogroups R1b-DF19, R1b-L238, or R1b-S1194. All of that points to the impact of Bell Beaker-derived peoples during the Dagger period, when Pre-Proto-Germanic expanded into Scandinavia.

Also interesting is the finding of hg. R1b-P297 in Troms, Norway (VK531) ca. 2400 BC. R1b-P297 subclades might have expanded to the north through Finland with post-Swiderian Mesolithic groups (read more about Scandinavian hunter-gatherers), and the ancestry of this sample points to that origin.

However, it is also known that ancestry might change within a few generations of admixture, and that the transformation brought about by Bell Beakers with the Dagger Period probably reached Troms, so this could also be a R1b-M269 subclade. In fact, the few available data from this sample show that it comes from the natural harbour Skarsvågen at the NW end of the island Senja, and that its archaeologist thought it was from the Viking period or slightly earlier, based on the grave form. From Prescott (2017):

In 1995, Prescott and Walderhaug tentatively argued that a dramatic transformation took place in Norway around the Late Neolithic (2350 BCE), and that the swift nature of this transition was tied to the initial Indo-Europeanization of southern and coastal Norway, at least to Trøndelag and perhaps as far north as Troms. (…)

The Bell Beaker/early Late Neolithic, however, represents a source and beginning of these institution and practices, exhibits continuity to the following metal age periods and integrated most of Northern Europe’s Nordic region into a set of interaction fields. This happened around 2400 BCE, at the MNB to LN transition.

NOTE. This particular sample is not included in the maps of Viking haplogroups.

Density of haplogroup R1b (samples in blue) among Vikings. Samples of hg. I in orange, hg. R1a in green, N1a in pink.
Density of haplogroup R1b-U106 (samples in green) overlaid over all samples of hg. R1b (other R1b-L23 samples in red) among Vikings.
Density of R1b-L151 (xR1b-U106) (samples in deep red) overlaid over all samples of hg. R1b (R1b-U106 in green, other R1b-M269 in blue) among Vikings.

Haplogroup R1a (M417, Z284)

The distribution of hg. R1a-M417, in combination with data on West Germanic peoples, shows that it was mostly limited to Scandinavia, similar to the distribution of I1. In fact, taking into account the distribution of R1a-Z284 in particular, it seems even more isolated, which is compatible with the limited impact of Corded Ware in Denmark or the Northern European Plain, and the likely origin of R1a-Z284 in the expansion with Battle Axe from the Gulf of Finland. The distribution of R1a-Z280 (see map above) is particularly telling, with a distribution around the Baltic Sea mostly coincident with that of N1a.

Density of haplogroup R1a (samples in green) among Vikings. Samples of hg. R1b in blue, of hg. I in orange, N1a in pink.
Density of haplogroup R1a-Z284 (samples in cyan) overlaid over all samples of hg. R1a (in green, with R1a-Z280 in pink) among Vikings.

Other haplogroups

Among the ancient samples, two individuals were derived haplogroups were identified as E1b1b1-M35.1, which are frequently encountered in modern southern Europe, Middle East and North Africa. Interestingly, the individuals carrying these haplogroups had much less Scandinavian ancestry compared to the most samples inferred from haplotype based analysis. A similar pattern was also observed for less frequent haplogroups in our ancient dataset, such as G (n=3), J (n=3) and T (n=2), indicating a possible non-Scandinavian male genetic component in the Viking Age Northern Europe. Interestingly, individuals carrying these haplogroups were from the later Viking Age (10th century and younger), which might indicate some male gene influx into the Viking population during the Viking period.

Natural neighbor interpolation of “Italian ancestry” among Vikings.

As the paper says, the small sample size of rare haplogroups cannot distinguish if these differences are statistically relevant. Nevertheless, both E1b samples have substantial Modern Polish-like ancestry: one sample from Gotland (VK474), of hg. E1b-L791, has ca. 99% “Polish” ancestry, while the other one from Denmark (VK362), of hg. E1b-V13, has ca. 35% “Polish”, ca. 35% “Italian”, as well as some “Danish” (14%) and minor “British” and “Finnish” ancestry.

Given the E1b-V13 samples of likely Central-East European origin among Lombards, Visigoths, and especially among Early Slavs, and the distribution of “Polish” ancestry among Viking samples, VK362 is probably a close description of the typical ancestry of early Slavs. The peak of Modern Polish-like ancestry around the Upper Pripyat during the (late) Viking Age suggests that Poles (like East Slavs) have probably mixed since the 10th century with more eastern peoples close to north-eastern Europeans, derived from ancient Finno-Ugrians:

Natural neighbor interpolation of “Polish ancestry” among Vikings.

Similarly, the finding of R1a-M458 among Vikings in Funen, Denmark (VK139), in Lutsk, Poland (VK541), and in Kurevanikha, Russia (VK160), apart from the early Slav from Usedom, may attest to the origin of the spread of this haplogroup in the western Baltic after the Bell Beaker expansion, once integrated in both Germanic and Balto-Slavic populations, as well as intermediate Bronze Age peoples that were eventually absorbed by their expansions. This contradicts, again, my simplistic initial assessment of R1a-M458 expansion as linked exclusively (or even mainly) to Balto-Slavs.

Y-DNA haplogroups in Europe during Antiquity (full map). See other maps of cultures and ancient DNA from Antiquity.


European hydrotoponymy (IV): tug of war between Balto-Slavic and West Uralic


In his recent paper on Late Proto-Indo-European migrations, when citing Udolph to support his model, Frederik Kortlandt failed to mention that the Old European hydrotoponymy in northern Central-East Europe evolved into Baltic and Slavic layers, and both take part in some Northern European (i.e. Germanic – Balto-Slavic) commonalities.


From Expansion slavischer Stämme aus namenkundlicher und bodenkundlicher sicht, by Udolph, Onomastica (2016), translated into English (emphasis mine):

NOTE. An archived version is available here. The DOI references for Onomastica do not work.

(…) there is a clear center of Slavic names in the area north of the Carpathians. Among them are root words of the Slavic languages such as reka / rzeka, potok u. a. m.

Even more important than this mapping is the question of how the dispersion of ancient Slavic names happened. What is meant by ancient Slavic names? I elaborated on this in this journal years ago (Udolph, 1997):

(1)Ancient suffixes that are no longer productive today.

This clearly includes Slavic *-(j)ava as in Vir-ava, Vod-ava, Il-ava, Glin-iawa, Breg-ava, Ljut-ava, Mor-ava, Orl-java among others. It has clear links to the ancient common Indo-European language (Lupawa, Morava-March-Moravia, Orava, Widawa). They have a center north of the Carpathians.


(2) Unproductive appellatives (water words), which have disappeared from the language, are certain witnesses of ancient Slavic settlements. A nice example of this is Ukr. bahno, Pol. bagno ‘swamp, bog, morass’ etc. The word has long been missing in South Slavic, although it appears in South Slavic names, but only in very specific areas (see Udolph, 1979, pp. 324-336).

(3) Names that go back to different sound shifts. [Examples:]

  • (…) the Slavic clan around Old Sorbian brna ‘feces, earth’, Bulgarian OCS brьnije ‘feces, loam’, OCS brъna ‘feces’, Slovenian brn, ‘river mud’, etc. is solved with the inclusion of onomastic materials (Udolph, 1979, p. 499-514). (…) Toponymic mapping shows important details.
  • bryn-slavic
    Karte 4. brъn < *brŭn und bryn- < *brūn- in slavischen Namen
  • (…)We also have an ablauting *krŭn-:*krūn- in front of us. Map 5 shows the distribution of both variants in Slavic names.
  • The next case is quite similar. It concerns Russ. appellative grjaz’ ‘dirt, feces, mud’, (…) for which an Old Slavic form *gręz exists. Slavic also knows the ablauting variant *grǫz.

    These maps (see Map 6, p. 222) show that a homeland of Slavic tribes can only be inferred north of the Carpathians.

    (4) Place-names formed by Slavic suffixes of Pre-Slavic nature, i.e. derived from Old European hydronyms.

    (a) The largest river in Poland, the Wisła, German Vistula, bears a clearly Pre-Slavic name, no matter how one explains it (Babik, 2001, pp. 311-315; Bijak, 2013, p. 34, Udolph, 1990 , Pp. 303-311).

    (b) With the same suffix are formed Sanok, place on the southwest of Przemyśl; Sanoka, a no longer known waters name, 1448 as fluvium Szanoka, near the place Sanoka and with a diminutive suffix -ok- a tributary of the Sanok, which is called Sanoczek (for details see Udolph, 1990, pp. 264-270; Rymut / Majtan, 1998, p. 222). The San also has a single-language name, but that does not change anything about the right etymology. The suffix variant -očь also includes Liwocz and Liwoczka, river names near Cracow; also a mountain range of the Beskydy is mentioned at Długosz as Lywocz.

    According to the opinion of the “Słownik prasłowiański” (Sławski (red.), 1974, p. 92), the suffix -ok- represents a Proto-Slavic archaism. It appears, for example, in sъvědokъ, snubokъ, vidokъ, edok, igrok, inok among others, but its antiquity also shows, among other things, that it started at archaic athematic tribes.

    Mapping of older and younger East Slavic place-names and translation into settlement evolution.

    Slavonic Urheimat

    If we apply this to the loess distribution in western Ukraine and south-eastern Poland, it is very noticeable that the center of the Old Slavic place names lies in the area where loess dispersal is gradually “frayed out”, i.e. for example, in the area west of Kiev between Krakow in the west and Winnycja and Moldavia in the east. In short, the distribution of good soils coincides with ancient Slavic names. If that is correct, we can expect a homeland in the Pre-Carpathian region, or better, a core landscape of Slavic settlement.

    The existence of Pre-Slavic Indo-European place names and water names whose structure indicates that they originated from an Indo-European basis, but then also developed Slavic peculiarities, can now – as stated above – only be understood to mean that the language group that we call today Slavic emerged in a century-long process from an Indo-European dialectal area.

    Loess areas between Poland and Ukraine. Image from Jary et al. (2018).

    From a genetic point of view, the scarce data published to date show a clear shift of central-east populations from more Corded Ware-like groups in the EBA towards more BBC-derived ancestry in the common era, to the point where ancient DNA samples from East Germany, Poland and Lithuania evolve from clustering between Corded Ware and Sub-Neolithic peoples to clustering close to Bell Beaker-derived groups, such as West Germanic peoples, Tollense samples, etc. (see below)

    Furthermore, sampled Early Slavs show bottlenecks under “Dinaric” I2a-L621 and central-eastern E1b-V13, which – in combination with the known phylogeography of Únětice and Urnfield – is compatible with its late expansion from a central-east European Slavonic homeland, such as the Pomeranian culture, in turn likely derived from Lusatian culture groups.

    This doesn’t preclude a more immediate expansion of Common Slavic in Antiquity closer to the northern Carpathians, which is also supported by the available Early Slavic sampling, apart from samples from the Avar and Hungarian polities.

    Likely Baltic (yellow-green) and Slavic (orange) groups ca. 500 AD on, with Finnic (cyan) and Mordvinic (blue) groups roughly divided through hydrotoponymy line ca. 1000 AD Top Left: Late Iron Age cultures. Top right: PCA of groups from the Iron Age to the Middle Ages. Y-DNA haplogroups during the Germanic migrations (Bottom left) and during the Middle Ages (Bottom right). Notice a majority non-R1a lineages among sampled Early Slavs. See full maps and PCAs.

    Proto-Baltic / Proto-Slavic

    Northern European hydronymy

    From Alteuropäische Hydronymie und urslavische Gewässernamen, by Udolph, Onomastica (1997), translated into English (emphasis mine):

    NOTE. An HTML version is available at Jurgen Udolph’s personal site.

    Because of the already striking similarities as the well-known “-m-case”, the number-words for ‘1000’, ’11’ and ’12’ and so on, J. Grimm had already assumed a close relationship between Germanic and Baltic and Slavic. (…)

    In my own search, I approached this trinity from the nomenclature side. In doing so, I noticed some name groups that can speak for a certain common context:

    1.* bhelgh-, *bholgh-.

    Map 10, p. 64, shows that a root * bhelgh- occurs in the name material of a region from which later Germanic, Baltic and Slavic originated. The Balkans play no role in this.


    2. *dhelbh-, *dholbh-, *dhl̥bh-

    The proof of the three ablauting * dhelbh, * dholbh, * dhl̥bh- within a limited area shows the close relationship that this root has with the Indo-European basis. Again it is significant in which area the names meet (…)


    3. An Indo-European root extension *per-s- with the meaning ‘spray, splash, dust, drop’ is detectable in several languages (…). From a Baltic-Slavic-Germanic peculiarity cannot therefore be spoken from the toponymic point of view. The picture changes, however, if one includes the derived water names.

    4. The root extension *pel-t-, *pol-t-, *pl̥-t- of a tribe widely spread in the Indo-European languages around *pel-, pol- ‘pour, flow, etc.’, whose reflexes are found Armenian through Baltic and Slavic to the Celtic area, is found in the Baltic toponymy, cf. Latv. palts, palte ‘puddle, pool’.

    The dynamics of stylistic changes of the form of the “Trzciniec pot” in the lowland regions of Central Europe, and spreading routes of the Trzciniec package in Central Europe. A good proxy for contacts through the Northern European Plain during the Early Bronze Age. Modified from Czebreszuk (1998).

    Early Balto-Finnic

    In order to properly delimit (geographically and chonologically) the Proto-Baltic and Proto-Slavic expansions, it is necessary to understand where the late Balto-Finnic homeland was located during the Bronze Age. The following are excerpts from the comprehensive hydrotoponymic study by Pauli Rahkonen (2013):

    In any case, Finnic probably had its origin somewhere around the Gulf of Finland. Names of large and central rivers such as Vuoksi (< Finnic vuo ‘stream’) and Neva (< Finnic neva ‘marsh, river’) must be very old and might represent Proto-Finnic hydronyms. In the southern coastal area of Finland, the names Kymi and Nietoo < *Niet|oja (id. later Porvoonjoki) may also be of Finnic origin and derive from, respectively, kymi ‘stream’ (see SSA I s.v. *kymi; see however SPK s.v. Kemijärvi; Rahkonen 2013: 24) and nieto(s) ‘heap of snow’ (SSA II s.v. nietos), in hydronyms probably ‘high (snowy?) banks of a river’. Mustion|joki is clearly a Finnish name < *must|oja ‘black river’. The river name Vantaa remains somewhat obscure, although Nissilä (see SPK s.v. Vantaanjoki) has derived it from the Finnic word vana ‘water route’. In western Finland the names of large rivers, such as Aura and Eura, are supposedly of Germanic origin (Koivulehto 1987).

    In Estonia the names of many of the most important rivers might be of Finnic origin: e.g. Ema|jõgi Est. ema ‘mother’ [Tartu district] (?? cf. the Lake Piiga|ndi < Est. piiga ‘maiden’), Pärnu [Pärnu district] < Est. pärn ‘linden’, Valge|jõgi [Loksa district] < Est. valge ‘white’, Must|jõgi [Võru district] < Est. must ‘black’. It is possible that Emajogi and especially Piigandi are the result of later folk etymologizing of a name with some unknown origin. However, as a naming motif there exist in Finland numerous toponyms with the stems Finnic *emä (e.g. 3 Emäjoki), *neit(V)- ‘maiden’ (e.g. Neitijärvi, Neittävänjoki, Neittävänjärvi) and Saami stems that can be derived from Proto Saami *nejte̮ ‘id’ (GT2000; NA).

    The historical southern boundary of Finnic hydronyms, excluding hydronyms produced by the Karelian refugees of the 17th century.

    These seemingly very old names of relatively large rivers in southern Finland, modern Leningrad oblast and Estonia support the hypothesis that Proto-Finnic was spoken for a long time on both sides of the Gulf of Finland and it thus basically corresponds to the hypothesis of Terho Itkonen (see below). In the Novgorod, Tver or Vologda oblasts of Russia, Finnic names for large rivers cannot be found (Rahkonen 2011: 229). For this reason, it is likely that the Late Proto-Finnic homeland was the area around the Gulf of Finland.

    Beyond the southeastern boundary of the modern or historically known Finnic-speaking area, there exists a toponymic layer belonging to the supposedly non-Finnic Novgorodian Čudes (see Rahkonen 2011). In theory it is possible that Proto-Finnic and Proto-Čudian separated from each other at an early stage or it is even possible that Proto-Čudian was identical with Proto-Finnic. However, this cannot be proven, because there is not enough material available describing what Novgorodian Čudic was like exactly.

    Yakhr-, -khra, yedr-, -dra and yer-/yar, -er(o), -or(o) names of lakes in Central and North Russia and the possible boundary of the proto-language words *jäkra/ä and *järka/ä. Rahkonen (2013)

    A summary of the data is then:

    • The Daugava River and the Gulf of Livonia formed the most stable south-western Balto-Finnic border (up until ca. 1000 AD): the Daugava shows a likely Indo-European etymology, while some of its tributaries are best explained as derived from Uralic.
    • The first layer of “Early Baltic” loans in Early Balto-Finnic are of a non-attested Baltic dialect closest to Proto-Balto-Slavic (read more about this early layer).
    • The latest samples of the Trzciniec culture (or derived Iron Age group) from its easternmost group in Turlojiškė (ca. 1000-800 BC?) show a western shift towards Bell Beaker, although they show a majority of hg. R1a-Z280; while the earliest sample from Gustorzyn (ca. 1900 BC), likely from Trzciniec/Iwno, from the westernmost area of the culture, shows a Corded Ware-like ancestry (and hg. R1a-Z280, likely S24902+) among a BA sampling from Poland clearly derived from Bell Beaker groups.

    One can therefore infer that the expansion of the Trzciniec culture – as the earliest expansion of central-west European peoples into the Baltic after the Bell Beaker period – represented either the whole disintegrating Balto-Slavic community, or at least an Early Baltic-speaking community expanding from the West Baltic area to the east.

    The similarity of Early Slavs and the Trzciniec outlier with the Czech BA cluster, formed by samples from Bohemia (ca. 2200–1700 BC), and the varied haplogroups found among Early Slavs – reminiscent of the variability of the Unetice/Urnfield sampling – may help tentatively connect the early Proto-Slavic homeland more strongly with a Proto-Lusatian community immediately to the south-west of the Iwno/Proto-Trzciniec core.

    Top Left:Likely Baltic, Slavic, and Balto-Finnic-speaking territories (asynchronous), overlaid over Late Bronze Age cultures. Balto-Slavic in green: West(-East?) Baltic (B1), unattested early Baltic (B2), and Slavic (S). Late Balto-Finnic (F) in cyan. In red, Tollense and Turlojiškė sampling. Dashed black line: Balto-Slavic/West Uralic hydrotoponymy border until ca. 1000 AD. Top right: PCA of groups from the Early Bronze Age to the Late Bronze Age. Marked are Iwno/Pre-Trzciniec of Gustorzyn (see below), Late Trzciniec/Iron Age samples from Turlojiškė, and in dashed line approximate extent of Tollense cluster; Y-DNA haplogroups during the Late Bronze Age (Bottom left) and during the Early Iron Age (Bottom right). Notice a majority non-R1a lineages among sampled Early Slavs. See full maps and PCAs.

    Proto-Balto-Slavic homeland

    Disconnected western border: Germanic

    The common Balto-Slavic – Germanic community must necessarily be traced back to the West Baltic. From Udolph’s Namenkundliche Studien zum Germanenproblem, de Gruyter (1994), translated from German (emphasis mine):

    My work [Namenkundliche Studien zum Germanenproblem] has shown how strong the Germanic toponymy is related to the East, less to Slavic, much more to Baltic. It confirms the recent thesis by W.P. Schmid on the special relationship Germanic and Baltic, according to which “the formation of the typical Germanic linguistic characteristics…must have taken place in the neighborhood of Baltic“.

    If one starts from a Germanic core area whose eastern boundary is to be set on the middle Elbe between the Erzgebirge and Altmark, there are little more than 400 km. to the undoubtedly Baltic settlement area east of the Vistula. Stretching the Baltic area westwards over the Vistula (as far as the much-cited Persante), the distance is reduced to less than 300 km. Assuming further that Indo-European tribes between the developing Germanic and the Baltic groups represent the connection between the two language groups, so can one understand well the special relationship proposed by W.P. Schmid between Germanic and Baltic. In an earlier period shared Slavic evidently the same similarities (Baltic-Slavic-Germanic peculiarities).

    Top: Palaeo-Germanic (G2, blue area), Proto-Balto-Slavic/Pre-Baltic (PBSL, green area) and Early Proto-Balto-Finnic (PBF, cyan area) homelands superimposed over Early Bronze Age cultures. Persante hydronym and Gustorzyn ancient DNA sample location marked. Y-DNA haplogroups during the Early Bronze Age (Bottom left) and during the Middle Bronze Age (Bottom right). Notice a mix of R1b-L151 samples from the west and the process of integration of R1a-Z645 lineages from the the north-east. See full maps and PCAs.

    Substrate and immediate eastern border: Early Balto-Finnic

    While Balto-Finnic shows a late Balto-Slavic adstrate, Balto-Slavic has a Balto-Finnic(-like) substrate, also found later in Baltic and Slavic, which implies that Balto-Slavic (and later Baltic and Slavic) replaced the language of peoples who spoke Balto-Finnic(-like) languages, influencing at the same time the language of neighbouring peoples, who still spoke Balto-Finnic (or were directly connected to the Balto-Finnic community).

    For more on this relative chronology in Balto-Slavic – Balto-Finnic contacts, see e.g. the recent posts on Kallio (2003), Olander (2019), or a summary of this substrate.

    While Rahkonen (2013) entertains Parpola’s theory of a West-Uralic-speaking Netted Ware area (ca. 1900-500 BC), due to the Uralic-like hydrotoponymy of its territory, he also supports Itkonen’s idea of the ancient presence of almost exclusively Balto-Finnic place and river names in the Eastern Baltic and the Gulf of Finland since at least the Corded Ware period, due to the lack of Indo-European layers there:

    NOTE. This idea was also recently repeated by Kallio (2015), who can’t find a non-Uralic layer of hydrotoponymy in Balto-Finnic-speaking areas.

    It should be observed that the territory between the historical Finnic and Mordvin-speaking areas matches quite well with the area of the so-called Textile Ceramics [circa 1900–800 BC] (cf. Parpola 2012: 288). The culture of Textile Ceramics could function as a bridge between these two extreme points. Languages that were spoken later in this vast territory between Finland–Estonia and Mordovia seem to derive from Western Uralic (WU) as well. I have called those languages Meryan-Muroma, Eastern and Western Čudian and an unknown “x” language spoken in inland Finland, Karelia and the Lake Region of the Russian North (Rahkonen 2011; 241; 2012a: 19–27; 2013: 5– 43). This might mean that the territory of the Early Textile Ceramics reflects to some extent the area of late Western Uralic.

    The archaeologically problematic area is Estonia, Livonia and Coastal Finland – the area traditionally assumed to have been populated by the late Proto-Finns. The Textile Ceramics culture was absent there. It is very difficult to believe that the Textile Ware population in inland Finland migrated or was even the main factor bringing the Pre- or Early Proto-Finnic language to Estonia or Livonia. There are no archaeological or toponymic signs of it. Therefore, I am forced to believe that Textile Ceramics did not bring Uralic-speaking people to those regions. This makes it possible, but not absolutely proven, to assume that some type of Uralic language was spoken in the region of the Gulf of Finland already before Textile Ceramics spread to the northwest (circa 1900 BC).

    Top Left: Corded Ware culture expansion. Top right: PCA of Corded Ware and Sub-Neolithic groups. Y-DNA haplogroups during the Corded Ware expansion (Bottom left) and during the subsequent Bell Beaker expansion (Bottom right). Notice the rapid population replacement of typical Corded Ware R1a-Z645 lineages by expanding Bell Beakers of hg. R1b-L23 in central-east Europe, while they show continuity in the described ancestral Fennoscandian West-Uralic-speaking territory. See full maps and PCAs.

    The Corded Ware population in Finland is thought to have been NW Indo-European by many scholars (e.g. Koivulehto 2006: 154–155; Carpelan & Parpola 2001: 84). At least, it is probable that the Corded Ware culture was brought to Finland by waves of migration, because the representatives of the former Late Comb Ceramics partially lived at the same time side by side with the Corded Ware population. However, it is possible that the immigrants were a population that spoke Proto-Uralic, who had adopted the Corded Ware culture from their Indo-European neighbors, possibly from the population of the Fatjanovo culture, e.g. in the Valdai region. This was suggested by Terho Itkonen (1997: 251) as well. In that case the population of the Typical and Late Comb Ceramics may have spoken some Paleo European language (see Saarikivi 2004a). In the Early Bronze Age, the Baltic Pre-Finnic language that I have suggested must have been very close to late WU and therefore no substantial linguistic differences existed between the Baltic Pre-Finns and the population of Textile Ceramics in inland Finland. I admit that this model is difficult to prove, but I have presented it primarily in order to offer new models of thinking.16 At least, there is no archaeological or linguistic reason against this idea.

    This dubitative attribution of Proto-Uralic to the expansion of Corded Ware groups in eastern Europe, which is what hydrotoponymic data suggests in combination with archaeology, has to be understood as a consequence of how striking Rahkonen finds the results of his research, despite Itkonen’s previous proposal, in the context of an overwhelming majority of Indo-Europeanists who, until very recently, simplistically associated Corded Ware with the Indo-European expansion.


    Even Kortlandt accepts at this point the identification of expanding East Bell Beakers from the Carpathian Basin as those who left the Alteuropäische layer reaching up to the Baltic. However, he identified Udolph’s data solely with West Indo-European, forgetting to mention the commonly agreed upon western Proto-Balto-Slavic homeland, most likely because it contradicts two of his main tenets:

    1. that Balto-Slavic split from a hypothetical Indo-Slavonic (i.e. Satem) group expanding from the east; and
    2. that laryngeals can be reconstructed for Balto-Slavic – unlike for North-West Indo-European.
    Indo-European hydrotoponymy in Europe and the Middle East (scarce Central Asian data). Baltic data compensated, statistical method RBF: intermediate regions devoid of Indo-European toponyms are inferred to have them; it compensates thus e.g. for the scarce Indo-European hydrotoponyms in Poland by assuming ‘soft’ continuity from West Germany to the Baltic.

    A hypothetic “Pre-Indo-Slavonic” laryngeal Indo-European layer reaching Fennoscandia and the Forest Zone with Corded Ware is fully at odds with all known data:

    • in comparative grammar, since the one feature that characterizes Graeco-Aryan is precisely its set of innovations relative to Northern Indo-European, which presupposes a longer contact (and further laryngeal loss) once Tocharian and North-West Indo-European had separated – hence probably represented by Palaeo-BalkanCatacomb-Poltavka contacts once Afanasevo and Yamna settlers from the Carpathian Basin / East Bell Beakers had become isolated;
    • in hydrotoponymy, because of the prehistoric linguistic areas that can be inferred from (1) the distribution of Old European hydrotoponymy; (2) Udolph’s work on Germanic and the likely non-Indo-European substrate in Scandinavia and land contacts with Balto-Finnic; (3) from the Northern European traits in the Northern European Plain; or (4) from the decreasing proportion of Indo-European place and river names from central Europe towards the east and north.
    • NOTE. An alternative explanation of Old European/Balto-Slavic layers, e.g. by a ‘Centum’ Temematic – even if one obviates the general academic rejection to Holzer’s proposal – couldn’t account for the absolute lack of an ancestral layer of Indo-European hydrotoponymy in North-Eastern Europe (i.e. the longest-lasting Corded Ware territory), in sharp contrast with Western Europe, South-Eastern Europe, and South Asia. All of that contradicts an Eastern Indo-European community, even without a need to recall that the oldest hydrotoponymic layers common to Fennoscandia and the Forest Zone are of Uralic nature.

    • in archaeology, because cultural expansions of the Eastern European Early Bronze Age province since the Bell Beaker period (viz. Mierzanowice, Trzciniec, Lusatian, Pomeranian, West Baltic Culture of Cairns) suggest once and again west-east movements, most (if not all) of which – based on the presence of Indo-European speakers during the common era – were likely associated with Indo-European-speaking communities replacing or displacing previous ones.
    • in palaeogenomics, because of the late and different association of Corded Ware ancestry and haplogroups among Balto-Slavic and Indo-Iranian communities, in turn corresponding to the different satemization processes found in both dialects, which may have actually been related to the Uralic substrate that is found in both (read more on Uralic influences on Balto-Slavic and on Indo-Iranian).

    On the other hand, a careful combination of Uralic and Indo-European comparative grammar, hydrotoponymic data, and population genomics fits perfectly well Itkonen’s and Rahkonen’s association of Corded Ware in Eastern Europe with Uralic languages, as well as the traditional mainstream view of Uralic before Indo-European in Fennoscandia and in the Forest Zone, as I explained in a recent post about genetic continuity in the East Baltic area.

    Population genomics is not the main reason to reject the Indo-European Corded Ware theory – or any other prehistoric ethnolinguistic identification, for that matter. It can’t be. This new field offers just the occasional confirmation of a well-founded theory or, alternatively, another nail in the coffin of fringe theories that were actually never that likely, but seemed impossible to fully dismiss on purely theoretical grounds.

    The problem with Corded Ware was that we couldn’t see how unlikely its association with Indo-European languages was until we had ancient DNA to corroborate archaeological models, because few (if any) Indo-Europeanists really cared about the linguistic prehistory of eastern and northern Europe, or about Uralic languages in general (contrary to the general trend among Uralicists to be well-versed in Indo-European studies). Now they will.


    Genetic continuity among Uralic-speaking cultures in north-eastern Europe


    The recent study of Estonian Late Bronze Age/Iron Age samples has shown, as expected, large genetic continuity of Corded Ware populations in the East Baltic area, where West Uralic is known to have been spoken since at least the Early Bronze Age.

    The most interesting news was that, unexpectedly for many, the impact of “Siberian ancestry” (whatever that actually means) was small, slow, and gradual, with slight increases found up to the Middle Ages, compatible with multiple contact events in north-eastern Europe. Haplogroup N became prevalent among Finnic populations only through late bottlenecks, as research of modern populations have long suggested, and as ancient DNA research hinted since at least 2015.

    I risked to correlate the arrival of chiefs from the south-west with the infiltration of N1c-VL29 subclades during the transition to the Iron Age, coupled with that minimal “Siberian” ancestry (see e.g. here and here). Now we know that the penetration of this non-CW ancestry started, as predicted, in the Iron Age; that it was highly variable in the few samples where it appeared, with ca. 1-4%, while most Iron Age individuals show 0%; and that it was not especially linked to individuals of N1c-Vl29 lineages.

    It is also basically confirmed, based on the (ancient and Modern Swedish) N1c-L550 subclades found among Iron Age Estonians, that N1c-VL29 lineages and the so-called “Siberian” ancestry will be found simultaneously around the Baltic coastal areas, and that different lineages must have suffered later founder effects among Finns, which suggests that these alliances through exogamy brought exactly as much language change in Sweden, Lithuania, or Poland, as they did in the East Baltic region…

    On the other hand, the paper has also shown a potential movement of Corded Ware-derived peoples, if the change from LBA to IA samples is meaningful; in fact, even more Corded Ware-like than Baltic and Estonian BA populations. The exact origin of that movement is difficult to pinpoint, and it may not be related to the arrival of Akozino warrior-traders from the south-east, since theirs seems to be a minor impact proper of elites in a chiefdom system around the Baltic.

    Distribution of fortified settlements (filled circles) and other hilltop sites (empty circles) of the Late Bronze Age and Pre-Roman Iron Ages in the East Baltic region. Tentative area of most intensive contacts between Baltic and Balto-Finnic communities marked with a dashed line. Image modified from (Lang 2016).

    Also suggesting a potential movement is the ‘southern’ shift observed in the West and East Baltic areas, likely showing the arrival of Proto-East Baltic speakers (such as the Trzciniec outlier), as we have already discussed in this blog. The unexpected increase in Corded Ware-like ancestry in the Eastern Baltic, coupled with the expected large continuity of hg. R1a-Z283 in the homeland of Balto-Finnic expansions, gives even more support to the known complex system of exogamy along the Baltic coasts, and offers another potential reason for the rise of Baltic-speaking territories in the West Baltic: elite domination.

    It is nevertheless important to understand that, even among the most “genetic continuous” regions like Estonia, not a single population in Europe is heir of some ancestral, immutable people. Not in terms of haplogroups, and not in terms of admixture. Balto-Finnic speakers, however continuous they might seem (e.g. in Southern Estonians) aren’t an exception.

    After all, this blog was (re)born to fight the currently prevalent sheer stupidity surrounding the simplistic “R1a/steppe ancestry=Indo-European” association, so I wouldn’t like to see it replaced with some other stupid continuity or purity ideas within 10 to 20 years…

    Late Uralic stems from East Corded Ware groups

    With the currently available tools – linguistics, archaeology, and now genetics -, I don’t think there is any argument to date to question the direct connection of the Late Proto-Uralic expansion with all Eastern Corded Ware groups (i.e. Battle Axe, Fatyanovo-Balanovo, and Abashevo), and thus at least with the unifying A-horizon of Corded Ware and the bottlenecks under R1a-Z645.

    NOTE. The only out-group among Corded Ware cultures is the Single Grave culture. It appears to be an early Corded Ware offshoot, reflected in their non-unitary cultural traits (distinct from later unifying waves), in their varied patrilineal clans, and in the short-lasting cultural effect in northern Europe before their complete demise under pressure of expanding Yamna/Bell Beaker peoples from the Danube. The culture’s minimal (if any) effects on succeeding peoples might be seen mostly in the (mainly phonetic) Uralic substrate found in Balto-Slavic – although this may also stem from a more eastern influence, close to the Baltic – and in the contacts of Celtic with Uralic. The huge time depth between this early hypothetic Uralic layer in northern Europe and the emergence of peoples inhabiting these territories in recorded history have no doubt been erroneously interpreted as a lack of Uralic presence in the area.

    1) That connection was evident in the Yamna – CWC differences in archaeology, and especially later, with at least Fatyanovo-Balanovo and Abashevo representing the obvious replacement of the Volosovo culture before further expansions of CWC-related groups west and east of the Urals.

    The mythical millennia-long continuity of Volosovo hunter-gatherers, including centuries among Corded Ware peoples, as expected lately by the Copenhagen group (and anyone who doesn’t want to question the 1960s association of Indo-European with CWC) must be rejected today in population genomics, as the recent studies of ancient and modern populations show, and as ancient DNA from the region will confirm.

    2) In linguistics, the survival of Volosovo as The Uralic-speaking culture was also hardly believable. From Kallio (2015):

    While we can say at least something about Uralic substrates in Northeastern Europe, non-Uralic substrates cannot at all easily be identified, because of multiple language shifts, viz. first from non-Uralic to Uralic and then from Uralic to Russian. Yet the Soviet Uralicist Boris Serebrennikov (1956, 1959) argued that there are some non-Uralic substrate toponyms in the Volga-Oka region, but his idea was never taken seriously in the west (cf. Sauvageot 1958), and it pretty soon also sank into oblivion in Russia, even though it can still occasionally pop up there in non-onomastic circles (cf. Napolskikh 1995: 18–19). However, not all the hypotheses on non-Uralic substrates in Northeastern Europe should be rejected (see e.g. Helimski 2001b).

    Tentative map of the distribution of known languages in Eastern Europe during the Early Bronze Age. See full map.

    Helimski (2001) argues for a non-Uralic topo-hydronomy in Northern Russia, whose population may have kept their languages up to the Common Era despite the Corded Ware expansion, which is in line with the survival of some non-Indo-European languages everywhere in Europe after the expansion of Yamna and its offshoots:

    It should be borne in mind that these [Uralic] hydronyms reached us mainly through Northern Russian and, accordingly, with a tendency to phonetic-morphological adaptation and unification (for river names it is “natural” to be, like the word ‘river’ itself, feminine and to end in -a). Taking into account this circumstance, it may turn out to be non-useless for etymological identification of at least some of the hydronyms on the Finno-Ugric basis.

    On the other hand, I wouldn’t exclude the possibility that some parts of this large geographical area were never (completely) Finno-Ugric. The population that created the most important part of the hydronymy of the Russian North could be finally pushed aside or assimilated only at the end of the 1st – beginning of the 2nd millennium AD, during the Russian colonization, retaining the memory of the White-Eyed Chude in its own memory.

    NOTE. For more on this non-IE substrate in (especially West) Uralic, see e.g. Zhivlov (2015),

    The same non-Uralic substrate is most likely behind most of the shared traits by Mordvinic and Balto-Finnic (see below).

    3) In genetics, I don’t think the picture could get any clearer. I don’t know what “Steppe ancestry = Indo-European” proponents expected from 2019, if they expected anything at all (I haven’t seen any coherent model, proposal, or prediction for a long time now), but I doubt the recent results are compatible with any of their implied expectations.

    Detail of the PCA of the Corded Ware expansion. See full PCA and more related files.

    Notice, from the PCA above, how this Baltic Late Neolithic group shows actually a shift from Sredni Stog (see PCA with Sredni Stog) towards typical Khvalynsk-Urals-related ancestry, i.e. populations from eastern European forested regions, derived from hunter-gatherer pottery groups, as I have proposed for a very long time, since the first time a Baltic LN “outlier” appeared. It’s amazing how some amateurs can find 0.1% of any Siberian outlier’s ancestry among Uralians 4,000 years later, but fail to see the direct connection here. The esoteric uses of qpAdm, I guess…

    Especially noticeable is the extra WHG-like ancestry and corresponding shift, seen especially marked in late Polish CWC samples, but also in Baltic CWC and especially in one Sweden Battle Axe sample, all of them shifting apparently closer to Pitted Ware and SHG. While that may have been interpreted as an in situ admixture in Scandinavia before, the late Polish CWC samples show likely a resurgence of local populations, so we can assume that both shifts (to SHG- and EHG-like populations) of available CWC samples around the Baltic are clearly part of the WHG:EHG continuum that will be found in the eastern European sub-Neolithic cultures, from Narva to Volosovo.

    This WHG-related ancestry is clearly predominant in groups with which Battle Axe peoples admixed, based on the shift towards Pitted Ware, which – I can only guess based on modern Volga Finns – is different from the shift we will see in Netted Ware, more towards the Khvalynsk-Urals cluster. This is in line with the expansion of Battle Axe eastward through coastal areas (West to East Baltic and Finland into Sweden), while Fatyanovo peoples probably emerged from a slightly different route, but also a northern one, if one is to follow archaological similarities and their chronology.

    Detail of the PCA of European Bronze Age populations. See full PCA and more related files.

    During the Iron Age, the only peoples that probably shifted strongly (based on modern populations) are West Baltic ones, getting closer to the available Late Trzciniec samples, and even closer to the Trzciniec outlier, i.e. away from the earlier Eastern Corded Ware cluster, and towards Central European groups like Czech EBA or Poland EBA, both of them clearly derived from Bell Beakers, but also admixed with (and thus shifted toward) CW-like populations.

    If one looks carefully at the previous PCA on Bronze Age populations, and the next one on Iron Age clusters, it is evident that adding the Swedish LN outlier to East Baltic BA (both strongly related to Battle Axe populations) essentially gives us the continuity of East Baltic BA into the Iron Age. This cluster is continued also in two outliers from Sigtuna, a Viking town close to the Gulf of Finland, known to be an important trading site, 1,500 years later. Not much of a change around the Gulf of Finland, then:

    Detail of the PCA of East and North European Iron Age populations. See full PCA and more related files.

    Based on the two simplistic Uralic clines one might see described (among the many that certainly existed, from Corded Ware to different Eurasian populations), and just like BOO was for some months fashionable as “Samic”, some may be tempted to say that certain Sintashta or Srubna outliers close to the Urals mark the True Uralic™ peoples. Because, of course they do. Ghost haplogroup N and stuff. And Corded Ware never ever Uralic. Because Gimbutas, and my IE R1a grandfather.

    NOTE. Funny thing here: there might be Corded Ware, Iranian, Slavic, Germanic, etc… outliers or out-groups, and they might form the widest genetic clusters ever seen, but they are all of one language, because archaeology and linguistics; however, one “outlier” (also, put your own definition of “outlier” here, let’s say 1% of whatever, and strontium isotope potentially from 100 km away) ca. 600 BC in the Baltic who (surprise!) happens to show hg. N, and he signals the first incoming True Uralic™ speaker from wherever… It won’t be the first or the last time some people resort to “the complexity of Uralic-speaking peoples” in ancestry, just to look for “hg. N = Uralic” like crazy. You only need common sense to understand that this is not how this works. Amateur genomics can’t get more embarrassing than the current “let’s look for ‘Siberian ancestry’ in every individual of haplogroup N” trend. Or maybe it can, and it will, but I can’t see it yet.

    If one were to insist on looking for ‘foreign’ contributions among Iron Age Estonians, though, I think one should also check out first archaeology, and then the PC3 (or, more graphically, a 3D plot), to understand what might be happening with the many Uralic clines derived from Corded Ware, before starting to play around with bioinformatic tools to discover a teeny tiny 1% admixture of the wrong population, and rushing to build far-fetched narratives. Apparently, one of the different clines formed roughly between southern (steppe – forest-steppe) and northern (tundra-taiga) populations in Uralians is also seen in some Iron Age Estonian individuals – especially in some late samples from Ingria…This is not my main interest, so I will leave this here for others to keep wasting their time chasing the white whale of the 0.5% of True Uralic™ ancestry in ancient Baltic samples of hg. N.

    Still images of the 3D plot of Eurasian samples. Typical PC1 vs. PC2 visualization to the left, and shift of the view to PC3 on the right image. See full PCA and more related files.

    An exclusive Volga-Kama homeland for Disintegrating Uralic?

    Since I don’t believe in macro-regions of largely continuous ethnolinguistic communities, as I have often said about Slavic (naively associated with prehistoric tribes of Eastern Europe) or Germanic (absurdly considered to be represented by Battle Axe), it is difficult for me to believe that Battle Axe-derived cultures remained of the same Finno-Samic dialects since the Corded Ware expansion…unless we live in Westeros, where everything happens “for thousands of years”.

    I have to admit, then, that the now prevalent identification among Uralicists has become quite attractive:

    • Fatyanovo-Balanovo as Finno-Permic:
      • Fatyanovo/Netted Ware with West Uralic (also called Finno-Mordvinic).
      • Balanovo/Chirkovo-Kazan with Central Uralic (Mari-Permic).
    • Abashevo, into the Andronovo-like Horizon through the Seima-Turbino phenomenon, with East Uralic (also Ugro-Samoyedic).

    Exactly like the identification of Yamna Hungary – Bell Beaker transition as the North-West Indo-European homeland, it gives us simplicity and small and late ethnolinguistic communities, away from the traditionally overused big and early language territories.

    This late homeland would be supported, among others, by:

    • The presence of Indo-Iranian loanwords in Finno-Permic and Ugric (probably also in Samoyedic, either lost, or – much more likely – underresearched), compatible with the immediate contact between Abashevo – Sintashta-Potapovka-Filatovka and Fatyanovo-Balanovo.
    • The supposed expansion of Netted Ware from Fatyanovo to the north-west, which may be explained as the split and expansion of Balto-Finnic and Samic ca. 1900 BC.
    • A longer-lasting Finno-Permic (West+Central Uralic) community contrasting with the early separation of East Uralic.
    • The compatibility of this late expansion with the late expansion of Pre-Germanic from Denmark with the Dagger Period, and of Balto-Slavic with Trzciniec, which puts all three dialects reaching the Baltic Sea in the EBA.

    NOTE. I meant to update the linguistic text to include the most recently favoured phylogenetic tree of Uralic languages after Häkkinen (2007, 2009, 2014), which has very quickly become the new normal among Uralicists, but I don’t think I will have enough time to review the necessary papers for that. I am rushing to publish a printed edition, so the text will wind up being a mixture of “traditional” (meaning, basically, pre-2010s) description of Uralic dialects but using modern divisions; say, “West Uralic” instead of “Finno-Samic”. By the way, I am still amazed that none of my reader-haters (or any online user discussing Uralic migrations, for that matter) have come up with the questions that the new division pose, and it supports my suspicion about the complete lack of interest in linguistics of most (a)DNA fans, except for the occasional use of old and free PDFs Googled to support new narratives invented expressly for some qpAdm results…

    Textile ceramic styles and influence of Bronze Age cultures divided in clusters.

    Problems with this Parpola-Carpelan’s (2012-2018) interpretation include:

    • The differentiation between Fennoscandian Textile Ceramics vs. Netted Ware, which is not warranted in archaeology. The assumption that Netted Ware expanded to the Baltic Sea (as Kallio does, following the traditional view) is thus weak, and it was probably a question of cultural contacts coupled with short-distance population movements/exchange in both directions (from the Baltic to the Volga and vice versa). In fact, the culture division relies on some fairly common and technically simple ornamentation patterns, widespread all over northern Europe, even before the Corded Ware expansion, and it is very difficult to separate certain neighboring Textile Ceramics from Netted Ware groups in southern Finland (i.e. Sarsa-Tomitsa groups).
    • The strict and radical direction described for the Netted Ware by Carpelan, as an eastward and northward expansion, within a very short time frame (ca. 1900-1800 BC), based on few radiocarbon dates, which seems to me like a very risky assumption. We know how this kind of descriptions of direction of culture expansion based on radiocarbon dates has turned out in much more complex “packages”, like the Bell Beaker culture… In fact, the earliest dates for Textile Ware are from the East Baltic, earlier than those of Netted Ware.
    • The assumption that Balto-Finnic traits shared with Mordvinic are a) late and b) meaningful for dialectalization of two closely related dialects, when it is clear that both dialects separated quite early. Phonologically Finnic is more conservative, morphologically less so, and the shared traits include a handful of non-Uralic substrate words which can’t be traced to a single common source, hence they were adopted when both languages had already separated… All in all, Finnic – Mordvinic correspondances are not even close to Italo-Celtic ones, which is clearly fully incompatible with a proposal of a Finnic separation from Mordvinic coinciding with the LBA-IA transition.

    Especially problematic for Parpola’s model is the lack of genetic impact in Bronze Age or Iron Age Estonians, not reaching a significant level under any possible statistical threshold – which I am sure was quite disappointing for some of my readers -, but is in line with major archaeological continuity of groups the from region, only disturbed in cultural (and Y-chromosome) terms by the expansion of Akozino warrior-traders all over the Baltic Sea. Any proposed population movement will be very difficult to support in genetics, given the Corded Ware-derived populations that we will see in both regions, and the continued Baltic-Volga contacts since the Corded Ware expansion.

    Problems with an interpretation of such a small impact in population genomics includes the similarly weak impacts and haplogroup infiltrations that can be seen among populations basically everywhere in Eurasia, during any given period, and much greater genetic impacts that are supposed to be (or that were certainly) followed by ethnolinguistic continuity.

    Distribution of the Akozino-Mälar axes according to Sergej V. Kuz’minykh (1996: 8, Abb. 2).

    The Battle Axe question

    From Kallio (2015), about choosing a tentative homeland for Proto-Uralic:

    (…) linguistically uniform Proto-Uralic would have been spoken in the Volga-Oka region until the mid-third millennium BC when the Proto-Uralic-speaking area would have expanded to the Volga-Kama region as well. By the end of the same millennium, this expansion would have led to the earliest dialectal splits within Uralic into Finno-Mordvin, Mari-Permic, and Ugro-Samoyed. The splitting up of these three soon followed during the early second millennium BC when the Uralic-speaking area finally stretched from the Baltic Sea in the west to the Altai mountains in the east. Indeed, no matter where Proto-Uralic was spoken, the branching into the nine well-attested subgroups (viz. Finnic, Saami, Mordvin, Mari, Permic, Hungarian, Mansi, Khanty, and Samoyed) must have taken less than a millennium, because their shared phonological and morphosyntactic isoglosses are rather limited (see Salminen 2002). The traditional view that all this branching would have taken several millennia violates everything linguistic typology teaches us about the rate of language change.

    The basic problem of this identification of Fatyanovo-Balanovo as West-Central Uralic and Abashevo as East Uralic is the nature of the Battle Axe culture, including the Bronze Age East Baltic and Gulf of Finland area. Even if it is accepted that Fatyanovo-Balanovo represented all Western groups, Battle Axe must have represented West Uralic-like dialects.

    The ethnolinguistic identification of Battle Axe depends ultimately on the nature of contacts of Fatyanovo/Netted Ware with Battle Axe/Textile Ceramics. If both groups were close and interacted profusely, as it seems, it doesn’t seem granted that we will be able to distinguish a close Para-West Uralic dialect of Scandinavia from the actual expanding Balto-Finnic and Samic dialects, if they were actually linked to the Netted Ware expansion. Also from Kallio (2015):

    No doubt the most convincing substrate theory has recently been put forward by the Saami Uralicist Ante Aikio (2004), who has not only rehabilitated but also improved the old idea of a non-Uralic substrate in Saami. His study shows that there were still non-Uralic languages spoken in Northern Fennoscandia as recently as the first millennium AD. Most of all, they were not only genetically non-Uralic but also typologically non-Uralic-looking, bearing a closer resemblance to the so-called Palaeo-European substrates (for which see e.g. Schrijver 2001; Vennemann 2003).

    In comparison, the case of Finnic is much more difficult. The fact that Proto-Uralic was not spoken in the East Baltic region means that this area must have originally been non-Uralic-speaking, but so far the evidence for a non-Uralic substrate in Finnic has consisted of appellatives and proper names with no etymology (cf. Ariste 1971; Saarikivi 2004a). Contrary to the proposed substrate words in Saami, those in Finnic show no structural non-Uralisms, as if they had indeed been borrowed from some genetically related or at least typologically similar languages, as I suggested above. Also none of them is more recent than the Middle Proto-Finnic stage, which makes them at least two millennia old. All this agrees with archaeological evidence discussed earlier that the Uralicization of the East Baltic region occurred during the Bronze Age (ca. 1900–500 BC).

    The discussion of the paper continues with an unsuccessful attempt to find a hypothetical ancient Indo-European substrate that Kallio believes must be associated with the expansion of Corded Ware, in line with the traditional belief. For example, the often mentioned – almost folk etymology-like, unsurprisingly popular among amateurs – ‘Neva’ as derived from IE “young” is logically rejected…Unlike Parpola, Kallio’s view seems to be confident that Netted Ware (as Textile Ware) expanded into the East Baltic, on both sides of the Gulf of Finland, already during the Bronze Age.

    As it has become apparent in population genomics, none of them was right, and Textile Ceramics will essentially show – like Netted Ware – a large genetic continuity of Corded Ware peoples in the whole north-eastern European forest zone – despite small regional population movements, obviously -, which necessarily implies that the whole Corded Ware culture – and not only Fatyanovo-Balanovo and Abashevo – were Uralic-speaking territories.

    The similarities in terms of culture and Y-DNA bottlenecks between Battle Axe and Fatyanovo-Balanovo also imply that the linguistic differences between these groups were probably not many, and became strongly divided only after their territorial division. Continued contacts between Battle Axe- and Fatyanovo-derived groups can explain the proposed contacts (Finnic with Samic, Finnic with Mordvinic) after their linguistic-but-not-physical separation.

    East European movement directions (arrows) of the representatives of the Central European Corded Ware Culture (according to I.I. Artemenko).

    Battle Axe spoke “Para-Balto-Finnic”?

    The Balto-Finnic-speaking nature of Battle Axe is thus supported by:

    • The lack of non-Uralic substrates in Balto-Finnic territory (Kallio 2015).
    • The early separation of Samic and Finnic from Mordvinic, and the virtual identity of Proto-West-Uralic and Proto-Uralic, which suggests that Proto-Uralic spread fast (Parpola 2012).
    • The scarce non-Uralic topo-hydronymy in the East Baltic and around the Gulf of Finland (Saarikivi 2004), comparable to that on the Upper Volga region.
    • The strong influence of a Balto-Finnic-like substrate on Pre-Germanic (or, in Kallio’s opinion, the same Scandinavian substrate influencing both Germanic and Balto-Finnic at the same time), and the continued influence of Balto-Finnic on Proto-Baltic and Proto-Slavic.
    • The continued influence of Corded Ware-derived groups in central-east Sweden in Finland and the East Baltic in terms of agricultural innovations appearing in the LBA, compatible with Schrijver’s proposal of intermediate Germanic-shifted Balto-Finnic groups and Balto-Finnic groups influenced by their pronunciation.
    • The intense Palaeo-Germanic and late Balto-Slavic / early Proto-Baltic superstrate on Balto-Finnic, which place all three dialects around the Baltic Sea since the Early Bronze Age.
    • The easy replacement of a hypothetic Para-Balto-Finnic dialect by incoming Proto-Balto-Finnic-speaking peoples (say, with textile ceramics), without much linguistic impact.

    In fact, the continuous contacts of the East Baltic with the Volga, and especially the close interaction with Akozino warrior-traders just before the Tarand-grave period, could be the actual origin of the recent (if any) Finnic-Mordvinic connections that need to be traced back to the LBA-IA (maybe here the number ‘ten’), since most of them can be related to a Pit-Comb Ware culture substrate and earlier contacts through the forest zone, which Samic (due to its early split and presence to the north of the Gulf of Finland during the BA) does not share. In fact, some of them can be traced back to Balto-Finnic first

    These are the most often mentioned, in order of descending relevance for a shared ancient community:

    • Noun paradigms and the form and function of individual cases.
    • The geminate *mm (foreign to Proto-Uralic before the development of Fennic under Germanic influence) and other non-Uralic consonant clusters.
    • The change of numeral *luka ‘ten’ with (non-Uralic) *kümmen.
    • The presence of loanwords of non-Uralic origin, related to farming and trees, potentially Palaeo-European in nature.

    It’s not only a question of quantity. Are these shared Mordvinic – Balto-Finnic traits really more relevant than, say, those between Italo-Celtic, which are supposed to have formed a community for a very short period at the end of the 3rd millennium around the Alps? Are these traits even sufficient to propose a common early Mordvinic-Finnic group within West Uralic, rather than loose Mordvinic – Balto-Finnic contacts, i.e. contacts between East Baltic (Textile Ceramics) and Volga-Kama (Netted Ware)?

    Based on the alternative (Kallio’s) view of continued contacts between Textile Ceramics groups, even without knowing anything about linguistics, you can guess that Parpola is spinning very thin when assuming that these changes suggest that Balto-Finnic may have expanded with Akozino warrior-traders, separating thus ca. 800 BC from Mordvinic…

    Genetic findings now clearly help dismiss any meaningful population impact in the LBA-IA transition, although any linguist can obviously argue for linguistic change in spite of major genetic continuity. But then we are stuck in the pre-ancient DNA era, so what’s ancient DNA for.

    Middle Bronze Age cultures of Eastern Europe.

    Genetic continuity = language continuity?

    In the end, it’s very difficult to say how much language continuity there is around Estonia since the arrival of Corded Ware peoples. Looking at Modern Estonians, they have been clearly influenced by recent contacts with Baltic- and Germanic-speaking peoples clustering to the south-west in the PCA. They seem to have also received contacts from north(-east)ern peoples, likely from Finland, evidenced by their shifts toward the modern Estonian cluster during and after the Middle Ages, with a slight increase in Siberian ancestry and N1c subclades associated with Lovozero Ware. How much language change did these contacts bring? Maybe an expansion of Gulf of Finland Finnic (Northern Estonian) over Inland Finnic (Southern Estonian) and Gulf of Riga Finnic (Livonian)? Difficult to know, exactly, but, in the traditional view of Balto-Finnic dialectal distribution among Uralicists like Kallio, possibly no change at all.

    So, if the obvious changes in the Estonia_MA cluster relative to Estonia_IA cluster and Estonia_Modern relative to Estonia_MA do not represent radical language change…Why would Estonia_IA represent a change relative to Estonia_BA, when it is statistically basically the same? Or Estonia_BA relative to CWC_Baltic? Because of the infiltration of haplogroup N1c around the whole Baltic? Because of the occasional 1% “Siberian” ancestry in some non-locals of varied haplogroups across the whole Baltic area?

    In spite of all this, the amount of special pleading we are seeing among openly Nordicist amateurs when discussing the Uralic homeland relative to the Indo-European question in genetics has become a matter of plain willful ignorance. Like the living corpses of the Anatolian homeland, the Armenian homeland, the OIT proponents, or the nativist Basque R1b association, the personal involvement in the revival of “R1a=Indo-European” and “N=Uralic” trends is just painful to watch.

    [Next post in this line, if I manage to make time for it: “Genetic (dis)continuity in Central Europe“. Let’s see if early Balts and early Slavs, as well as Germanic peoples, show a cluster closer to Danubian EBA (viz. Maros), Hungary-Balkans BA, and Urnfield-related samples than their predecessors in their areas, i.e. away from East Corded Ware groups… If you want, you can enjoy for the moment the new PCAs I could get done and the tentative map of languages in the Early Bronze Age, that will probably give you the right idea about early Indo-European and Uralic population movements]

    European Early Bronze Age: tentative language map based on linguistics, archaeology, and genetics. See full map.


    Baltic Finns in the Bronze Age, of hg. R1a-Z283 and Corded Ware ancestry


    Open access The Arrival of Siberian Ancestry Connecting the Eastern Baltic to Uralic Speakers further East, by Saag et al. Current Biology (2019).

    Interesting excerpts:

    In this study, we present new genomic data from Estonian Late Bronze Age stone-cist graves (1200–400 BC) (EstBA) and Pre-Roman Iron Age tarand cemeteries (800/500 BC–50 AD) (EstIA). The cultural background of stone-cist graves indicates strong connections both to the west and the east [20, 21]. The Iron Age (IA) tarands have been proposed to mirror “houses of the dead” found among Uralic peoples of the Volga-Kama region [22].

    (…) The 33 individuals included 15 from EstBA, 6 from EstIA, 5 from Pre-Roman to Roman Iron Age Ingria (500 BC–450 AD) (IngIA), and 7 from Middle Age Estonia (1200–1600 AD) (EstMA) and yielded endogenous DNA ∼4%–88%, average genomic coverages ∼0.017–0.734×, and contamination estimates <4% (Table S1). We analyzed the data in the context of modern and other ancient individuals, including from Neolithic Estonia [13].

    Archaeological Information, Genetic Sex, mtDNA and Y Chromosome Haplogroups, and Average Coverage of the Individuals of This Study. Modified from the paper to mark distinct Y-DNA haplogroups in the LBA and IA.

    We identified chrY hgs for 30 male individuals (Tables 1 and S2; STAR Methods). All 16 successfully haplogrouped EstBA males belonged to hg R1a, showing no change from the CWC period, when this was also the only chrY lineage detected in the Eastern Baltic [11, 13, 30, 31]. Three EstIA and two IngIA individuals also belonged to hg R1a, but three EstIA males belonged to hg N3a, the earliest so far observed in the Eastern Baltic. Three EstMA individuals belonged to hg N3a, two to hg R1a, and one to hg J2b. ChrY lineages found in the Baltic Sea region before the CWC belong to hgs I, R1b, R1a5, and Q [10, 11, 12, 13, 17, 32]. Thus, it appears that these lineages were substantially replaced in the Eastern Baltic by hg R1a [10, 11, 12, 13], most likely through steppe migrations from the east [30, 31]. (…) Our results enable us to conclude that, although the expansion time for R1a1 and N3a3′5 in Eastern Europe is similar [25], hg N3a likely reached Estonia or at least became comparably frequent to modern Estonia [1] only during the BA-IA transition.

    A clear shift toward West Eurasian hunter-gatherers is visible between European LN and BA (including Baltic CWC) and EstBA individuals, the latter clustering together with Latvian and Lithuanian BA individuals [11]. EstIA, IngIA, and EstMA individuals project between BA individuals and modern Estonians, partially overlapping with both.

    (…) EstBA individuals are clearly distinguishable from Estonian CWC individuals as the former have more of the blue component most frequent in WHGs and less of the brown and yellow components maximized in Caucasus hunter-gatherers and modern Khanty, respectively. The individuals of EstBA, EstIA, IngIA, EstMA, and modern Estonia are quite similar to each other on average, indicating that the relatively high proportion of WHG ancestry in modern Eastern Baltic populations compared to other present-day Europeans [15] traces back to the BA.

    Detail of the PCA, modified from the paper to label populations. Estonian Bronze Age and Iron Age samples cluster close to Early Corded Ware from the Baltic.. Principal-component analysis results of modern West Eurasians with ancient individuals projected onto the first two components (PC1 and PC2). BA, Bronze Age; EF, early farmers; HG, hunter-gatherers; IA, Iron Age; IMA, Iron/Middle Ages; LN, Late Neolithic; LNBA, Late Neolithic/Bronze Age; MA, Middle Ages

    When comparing Estonian CWC and EstBA using autosomal outgroup f3 and Patterson’s D statistics (Table S3), the latter is more similar to other Baltic BA populations, to Baltic IA and Middle Age (MA) populations, and also to populations similar to WHGs and Scandinavian hunter-gatherers (SHGs), but not to Estonian CCC (Figures 2A and S2A; Data S1). The increase in WHG or SHG ancestry could be connected to western influences seen in material culture [20, 21] and facilitated by a decline in local population after the CCC-CWC period [20]. A slight trend of bigger similarity of Estonian CWC to forest or steppe zone populations and of EstBA to European early farmer populations can also be seen.

    (…) When comparing to modern populations, Estonian CWC is slightly more similar to Caucasus individuals but EstBA to Baltic populations and Finnic speakers (Figure 2B; Data S1). Outgroup f3 and D statistics do not reveal apparent differences when comparing EstBA to EstIA, EstIA to IngIA, and EstIA to EstMA (Data S1).

    qpAdm results. Error bars indicate one SE. Central MN, Central European Middle Neolithic; EstBA, Estonian Bronze Age; EstIA, Estonian Iron Age; IngIA, Ingrian Iron Age; EstMA, Estonian Middle Ages; WHG, western hunter-gatherers.

    These results highlight how uniparental and autosomal data can lead to different demographic inferences—the genetic change between CWC and BA not seen in uniparental lineages is clear in autosomal data and the appearance of chrY hg N in the IA is not matched by a clear shift in autosomal profiles.

    EstBA individuals have no Nganasan-related ancestry and EstIA, IngIA, and EstMA individuals on average have 2% or 4% (Figure 3; Data S1). The differentiation remains when using BA or IA Fennoscandian populations [26] instead of Nganasans (Data S1). Notably, the proportion of Nganasan-related ancestry varies between 0% and 12% among sampled EstIA, IngIA, and EstMA individuals (Data S1), which may suggest its relatively recent admixture into the target population. Moreover, two individuals from Kunda (0LS10 and V10) have the highest proportions of Nganasan ancestry among EstIA (6% and 8%), one of them has chrY hg N3a, and isotopic analysis suggests neither individual being born in Kunda [34].

    About these two males from Tarand-graves, ‘foreign’ to Kunda:

    0LS10: Male from tarand III (burial 9; TÜ 1325: L777), age 17–25 years [34]. He had a fragment of a sheep/goat bone and ceramics as grave goods. This burial has two radiocarbon dates: 2430 ± 35 BP (Poz-10801; 760–400 cal BC) and 2530 ± 41 BP (UBA-26114; 800–530 cal BC) [34]. According to the isotopic analysis, the person was not born in the vicinity of Kunda; his place of birth is still unknown (but south-western Finland and Sweden are excluded) [34]. Sampled tooth r P1.

    V10: Male from tarand XI (burial 24; TÜ 1325: L1925), age 25–35 years [34], date 2484 ± 40 BP (UBA-26115; 790–430 cal BC) [34]. He had a few potsherds near the skull. Likewise, this person was not locally born [34]. Sampled tooth l P1.

    Autosomal Analyses’ Results for Gyvakarai1 as the closest available Corded Ware source for Balto-Finnic populations.

    The paper shows thus:

    • Major continuity of ancestry from Corded Ware to modern Estonians, with only slight changes in different periods. In fact, one of the best fits for the Late Bronze Age ancestry is Gyvakarai1, one of the Corded Ware “outliers” described as “closer to Yamna”, which I already said may be closer to Sredni Stog/EHG populations instead. Another interesting take is that the change from Bronze Age to Iron Age corresponds to an increase in Baltic Corded Ware-related ancestry, rather than being driven by Siberian ancestry.
    • pca-mittnik-gyvakarai
      File modified by me from Mittnik et al. (2018) to include the approximate position of the most common ancestral components, and an identification of potential outliers. Zoomed-in version of the European Late Neolithic and Bronze Age samples. “Principal components analysis of 1012 present-day West Eurasians (grey points, modern Baltic populations in dark grey) with 294 projected published ancient and 38 ancient North European samples introduced in this study (marked with a red outline). From Mittnik et al. (2018).
    • A Volosovo-related migration of hg. N1c with Netted Ware into the area seems to be discarded, based on the full replacement of paternal lines and continuity of R1a-Z283. It is only during the Tarand-grave period when a system of chiefdoms (spread from Ananyino/Akozino) brings haplogroup N1c to the Gulf of Finland. During the Iron Age, the proportion of paternal lineages is still clearly in favour of R1a (50% in the coast, 100% in Ostrobothnia), which indicates a gradual replacement led by elites, likely because of the incorporation of Akozino warrior-traders spreading all over the Baltic, bringing the described shared Mordvinic traits in Fennic.
    • finno-ugric-haplogroup-n
      Map of archaeological cultures in north-eastern Europe ca. 8th-3rd centuries BC. [The Mid-Volga Akozino group not depicted] Shaded area represents the Ananino cultural-historical society. Fading purple arrows represent likely stepped movements of subclades of haplogroup N for centuries (e.g. Siberian → Ananino → Akozino → Fennoscandia [N-VL29]; Circum-Arctic → forest-steppe [N1, N2]; etc.). Blue arrows represent eventual expansions of Uralic peoples to the north. Modified image from Vasilyev (2002).
    • The arrival of Akozino warrior-traders (bringing N1c and R1a lineages) was probably linked to this minimal “Nganasan-like” ancestry of some samples in the transition to the Iron Age. This arrival is supported by samples 0LS10 (the earliest hg. N1c) and V10 (of hg. R1a), both dated to ca. 800-400 BC, with V10 showing the highest “Nganasan-like” ancestry with 4.8%, both of them neighbouring samples showing 0%. This variable admixture among local and foreign paternal lineages might support the described social system of family alliances with intermarriages. In fact, a medieval sample, 0LS03_1 (hg. R1a) also shows a recent “Nganasan-like” ancestry, which probably points to the integration of different Arctic-related ancestry components among Modern Estonians, in this case related to Finnish expansions and thus integration of Levänluhta-related ancestry, as per the supplementary data.
    • NOTE. Such minimal proportions of “Nganasan-like” ancestry evidence the process of admixture of Volga Finns in Akozino territory through their close interactions with Permians of Ananyino, who in turn acquired this Palaeo-Arctic admixture most likely during the expansion of the linguistic community to hunter-gatherer territories, to the north of the Cis-Urals. This process of stepped infiltration and expansion without language change is not dissimilar to the one seen among Indo-Iranians and Balto-Slavs of hg. R1b, or Vasconic speakers of hg. I2a, although in the case of Baltic Finns of hg. R1a the process of infiltration and expansion of hg. N1c is much less dramatic, with no radical replacement anywhere before the huge bottlenecks observable in Finns.

    • The expansion of haplogroup N1c among Finnic populations, as we are going to see in samples from the Middle Ages such as Luistari, is the consequence of late founder effects after huge bottlenecks expected based on the analysis of modern populations. The expansion of N1c-VL29 is different in origin from that of N1c-Z1936 among Samic (later integrated into Finnish populations), most likely from the east and originally associated with Lovozero Ware.
    Frequency-Distribution Maps of Individual Subclade N3a3 / N1a1a1a1a1a-CTS2929/VL29, probably initially with Akozino warrior-traders. Map from Ilumäe et al. (2016).

    In spite of all this, the conclusion of the paper is (surprise!) that Siberian ancestry and hg. N heralded the arrival of Finnic to the Gulf of Finland in the Iron Age… However, this conclusion is supposedly* supported, not by their previous papers, but by a recent phylogenetic study by Honkola et al. (2013), which doesn’t actually argue for such a late ‘arrival’: it argues for the split of Balto-Finnic around 1500 BC.

    NOTE. I say ‘supposedly’ because Kristiina Tambets, for example, has been following the link of Uralic with haplogroup N since the 2000s, so this is not some conclusion they just happened to misread from some random paper they Googled. In those initial assessments, she argued that the “ancient homeland” of the Tat C mutation suggested that Finno-Ugrians were in Fennoscandia before Indo-Europeans. Apparently, since haplogroup N appears later and from the east, it is now more important to follow this haplogroup than what is established in archaeology and linguistics.

    Even in the referred paper, this split is considered an in situ development, since the phylogenetic study takes the information – among others – 1) from Parpola and Carpelan, who consider Netted Ware, a culture derived from Fatyanovo/Abashevo and Volosovo, as the culprit of the Finno-Ugric expansion; and 2) from Kallio (2006), who clearly states that Proto-Balto-Finnic (like Proto-Finno-Samic) was spoken around the Gulf of Finland during the Bronze Age. Both of them set the terminus ante quem of the language presence in the Baltic ca. 1900 BC.

    Anyways, as a consequence of geneticists keeping these untenable pre-ancient DNA haplogroup-based arguments today, I expect to see this “Finnic” language expansion also described for the Western Baltic, Scandinavia or northern Europe, when this same proportion of hg. N1c and “Nganasan” ancestry is observed in Iron Age samples around the Baltic Sea. The nativist trends that this domination of “Finns” all over Northern Europe 2,500 years ago will create will be even more fun to read than the current ones…

    EDIT (10 May 2019) How I see the reaction of many to ancient DNA, in keeping their old theories:


    Corded Ware—Uralic (II): Finno-Permic and the expansion of N-L392/Siberian ancestry


    This is the second of four posts on the Corded Ware—Uralic identification:

    I read from time to time that “we have not sampled Uralic speakers yet”, and “we are waiting to see when Uralic-speaking peoples are sampled”. Are we, though?

    Proto-language homelands are based on linguistic data, such as guesstimates for dialectal evolution, loanwords and phonetic changes for language contacts, toponymy for ancient territories, etc. depending on the available information. The trace is then followed back, using available archaeological data, from the known historic speakers and territory to the appropriate potential prehistoric cultures. Only then can genetic analyses help us clarify the precise prehistoric population movements that better fit the models.

    The traditional family tree of the Uralic branches. Kallio (2014)

    The linguistic homeland

    We thought – using linguistic guesstimates and fitting prehistoric cultures and their expansion – that Yamna was the Late Proto-Indo-European culture, so when Yamna was sampled, we had Late Proto-Indo-Europeans sampled. Simple deduction.

    We thought that north-eastern Europe was a Uralic-speaking area during the Neolithic:

    • For those supporting a western continuity (and assuming CWC was Indo-European), the language was present at least since the Comb Ware culture, potentially since the Mesolithic.
    • For those supporting a late introduction into Finland, Uralic expanded the latest with Abashevo-related movements after its incorporation of Volosovo and related hunter-gatherers.

    The expansion to the east must have happened through progressive infiltrations with Seima-Turbino / Andronovo-related expansions.

    Some datings for the traditional proto-stages from Uralic to Finnic. Kallio (2014).

    Finding the linguistic homeland going backwards can be described today as follows:

    I. Proto-Fennic homeland

    Based on the number of Baltic loanwords, not attested in the more eastern Uralic branches (and reaching only partially Mordvinic), the following can be said about western Finno-Permic languages (Junttila 2014):

    The Volga-Kama Basin lies still too far east to be included in a list of possible contact locations. Instead, we could look for the contact area somewhere between Estonia in the west and the surroundings of Moscow in the east, a zone with evidence of Uralic settlement in the north and Baltic on the south side.

    The only linguistically well-grounded version of the Stone Age continuation theory was presented by Mikko Korhonen in 1976. Its validity, however, became heavily threatened when Koivulehto 1983a-b proved the existence of a Late Proto-Indo-European or Pre-Baltic loanword layer in Saami, Finnic, and Mordvinic. Since this layer must precede the Baltic one and it was presumably acquired in the Baltic Sea region, Koivulehto posited it on the horizon of the Battle Axe period. This forces a later dating for the Baltic–Finnic contacts.

    Today the Battle Axe culture is dated at 3200 to 3000 BC, a period far too remote to correspond linguistically with Proto-Baltic (Kallio 1998a).

    Since the Baltic contacts began at a very initial phase of Proto-Finnic, the language must have been relatively uniform at that time. Hence, if we consider that the layer of Baltic loanwords may have spread over the Gulf of Finland at that time, we could also insist that the whole of the Proto-Finnic language did so.

    Prehistoric Balts as the southern neighbours of Proto-Finnic speakers. 1 = The approximated area of Proto-Uralic. 2 = The approximated area of Finnic during the Iron Age. 3 = The area of ancient Baltic hydronyms. 4 = The area of Baltic languages in about 1200 AD. 5 = The problem: When did Uralic expand westwards and when did it meet Baltic? Junntila (2012).

    II. Proto-Finno-Saamic homeland

    The evidence of continued Palaeo-Germanic loanwords (from Pre- to Proto-Germanic stages) is certainly the most important data to locate the Finno-Saamic homeland, and from there backwards into the true Uralic homeland. Following Kallio (2017):

    (…) the loanword evidence furthermore suggests that the ancestors of Finnic and Saamic had at least phonologically remained very close to Proto-Uralic as late as the Bronze Age (ca. 1700–500 BC). In particular, certain loanwords, whose Baltic and Germanic sources point to the first millennium BC, after all go back to the Finno-Saamic proto-stage, which is phonologically almost identical to the Uralic proto-stage (see especially the table in Sammallahti 1998: 198–202). This being the case, Dahl’s wave model could perhaps have some use in Uralic linguistics, too.

    The presence of Pre-Germanic loanwords points rather to the centuries around the turn of the 2nd – 1st millennium BC or earlier. Proto-Germanic words must have been borrowed before the end of Germanic influence in the eastern Baltic at the beginning of the Iron Age, which sets a clear terminus ante quem ca. 800 BC.

    The arrival of Bell Beaker peoples in Scandinavia ca. 2350 BC, heralding the formation of the Dagger Period, as well as the development of Pre-Germanic in common with Finnic-like populations point to the late 3rd / early 2nd millennium BC as the first time of close interaction through the Baltic region.

    III. Proto-Uralic homeland

    (…) the earliest Indo-European loanwords in the Uralic languages (…) show that Proto-Uralic cannot have been spoken much earlier than Proto-Indo-European dated about 3500 BC (Koivulehto 2001: 235, 257). As the same loanword evidence naturally also shows that the Uralic and Indo-European homelands were not located far from one another, the Uralic homeland can most likely be located in the Middle and Upper Volga region, right north of the Indo-European homeland*. From the beginning of the Subneolithic period about 5900 BC onwards, this region was an important innovation centre, from where several cultural waves spread to the Finnish Gulf area, such as the Sperrings Ware wave about 4900 BC, the Combed Ware wave about 3900 BC, and the Netted Ware wave about 1900 BC (Carpelan & Parpola 2001: 78–90).

    The mainstream position is nowadays trying to hold together the traditional views of Corded Ware as Indo-European, and a Uralic Fennoscandia during the Bronze Age.

    The following is an example of how this “Volosovo/Forest Zone hunter-gatherer theory” of Uralic origins looks like, as a ‘mixture’ of cultures and languages that benefits from the lack of genetic data for certain regions and periods (taken from Parpola 2018):

    The extent of Typical Comb Ware (TCW), Asbestos- and Organic-tempered Wares (AOW) and Volosovo and Garino-Bor cultures; areas with deposits of native copper in Karelia and copperbearing sandstone in Volga-Kama-area are marked dark gray (after Zhuravlev 1977; Krajnov 1987; Nagovitsyn 1987; Chernykh 1992; Carpelan 1999; Zhul´nikov 1999). From Nordqvist et al. (2012).

    The Corded Ware (or Battle Axe) culture intruded into the Eastern Baltic and coastal Finland already around 3100 BCE. The continuity hypothesis maintains that the early Proto-Finnic speakers of the coastal regions, who had come to Finland in the 4th millennium BCE with the Comb-Pitted Ware, coexisted with the Corded Ware newcomers, gradually adopting their pastoral culture and with it a number of NW-IE loanwords, but assimilating the immigrants linguistically.

    The fusion of the Corded Ware and the local Comb-Pitted Ware culture resulted into the formation of the Kiukais culture (c. 2300–1500) of southwestern Finland, which around 2300 received some cultural impulses from Estonia, manifested in the appearance of the Western Textile Ceramic (which is different from the more easterly Textile Ceramic or Netted Ware, and which is first attested in Estonia c. 2700 BCE, cf. Kriiska & Tvauri 2007: 88), and supposed to have been accompanied by an influx of loanwords coming from Proto-Baltic. At the same time, the Kiukais culture is supposed to have spread the custom of burying chiefs in stone cairns to Estonia.

    The coming of the Corded Ware people and their assimilation created a cultural and supposedly also a linguistic split in Finland, which the continuity hypothesis has interpreted to mean dividing Proto-Saami-Finnic unity into its two branches. Baltic Finnic, or simply Finnic, would have emerged in the coastal regions of Finland and in the northern East Baltic, while preforms of Saami would have been spoken in the inland parts of Finland.

    The Nordic Bronze Age culture, correlated above with early Proto-Germanic, exerted a strong influence upon coastal Finland and Estonia 1600–700 BCE. Due to this, the Kiukais culture was transformed into the culture of Paimio ceramics (c. 1600–700 BCE), later continued by Morby ceramics (c. 700 BCE – 200 CE). The assumption is that clear cultural continuity was accompanied by linguistic continuity. Having assimilated the language of the Germanic traders and relatively few settlers of the Bronze Age, the language of coastal Finland is assumed to have reached the stage of Proto-Finnish at the beginning of the Christian era. In Estonia, the Paimio ceramics have a close counterpart in the contemporaneous Asva ceramics.

    Eastern homelands?

    I will not comment on Siberian or Central Asian homeland proposals, because they are obviously not mainstream, still less today when we know that Uralic was certainly in contact with Proto-Indo-European, and then with Pre- and Proto-Indo-Iranian, as supported even by the Copenhagen group in Damgaard et al. (2018).

    This is what Kallio (2017) has to say about the agendas behind such proposals:

    Interestingly, the only Uralicists who generally reject the Central Russian homeland are the Russian ones who prefer the Siberian homeland instead. Some Russians even advocate that the Central Russian homeland is only due to Finnish nationalism or, as one of them put it a bit more tactfully, “the political and ideological situation in Finland in the first decades of the 20th century” (Napolskikh 1995: 4).

    Still, some Finns (and especially those who also belong to the “school who wants it large and wants it early”) simultaneously advocate that exactly the same Central Russian homeland is due to Finnlandisierung (Wiik 2001: 466).

    Hence, for those of you willing to learn about fringe theories not related to North-Eastern Europe, you also have then the large and early version of the Uralic homeland, with Wiik’s Palaeolithic continuity of Uralic peoples spread over all of eastern and central Europe (hence EHG and R1a included):

    Palaeolithic boat peoples and Finno-Ugric. Source

    These fringe Finnish theories look a lot like the Corded Ware expansion… Better not go the Russian or Finnish nationalist ways? Agreed then, let’s discuss only rational proposals based on current data.

    The archaeological homeland

    For a detailed account of the Corded Ware expansion with Battle Axe, Fatyanovo-Balanovo, and Abashevo groups into the area, you can read my recent post on the origin of R1a-Z645.

    1. Textile ceramics

    During the 2nd millennium BC, textile impressions appear in pottery as a feature across a wide region, from the Baltic area through the Volga to the Urals, in communities that evolve from late Corded Ware groups without much external influence.

    While it has been held that this style represents a north-west expansion from the Volga region (with the “Netted Ware” expansion), there are actually at least two original textile styles, one (earlier) in the Gulf of Finland, common in the Kiukainen pottery, which evolves into the Textile ware culture proper, and another which seems to have an origin in the Middle Volga region to the south-east.

    The Netted ware culture is the one that apparently expands into inner Finland – a region not densely occupied by Corded Ware groups until then. There are, however, no clear boundaries between groups of both styles; textile impressions can be easily copied without much interaction or population movement; and the oldest textile ornamentation appeared on the Gulf of Finland. Hence the tradition of naming all as groups of Textile ceramics.

    Maximum distribution of Textile ceramics during the Bronze Age (ca. 2000-800 BC). Asbestos-tempered ware lies to the north (and is also continued in western Fennoscandia).

    The fact that different adjacent groups from the Gulf of Finland and Forest Zone share similar patterns making it very difficult to differentiate between ‘Netted Ware’ or ‘Textile Ware’ groups points to:

    • close cultural connections that are maintained through the Gulf of Finland and the Forest Zone after the evolution of late Corded Ware groups; and
    • no gross population movements in the original Battle Axe / Fatyanovo regions, except for the expansion of Netted Ware to inner Finland, Karelia, and the east, where the scattered Battle Axe finds and worsening climatic conditions suggest most CWC settlements disappeared at the end of the 3rd millennium BC and recovered only later.

    NOTE. This lack of population movement – or at least significant replacement by external, non-CWC groups – is confirmed in genetic investigation by continuity of CWC-related lineages (see below).

    The technology present in Textile ceramics is in clear contrast to local traditions of sub-Neolithic Lovozero and Pasvik cultures of asbestos-tempered pottery to the north and east, which point to a different tradition of knowledge and learning network – showing partial continuity with previous asbestos ware, since these territories host the main sources of asbestos. We have to assume that these cultures of northern and eastern Fennoscandia represent Palaeo-European (eventually also Palaeo-Siberian) groups clearly differentiated from the south.

    The Chirkovo culture (ca. 1800-700 BC) forms on the middle Volga – at roughly the same time as Netted Ware formed to the west – from the fusion of Abashevo and Balanovo elites on Volosovo territory, and is also related (like Abashevo) to materials of the Seima-Turbino phenomenon.

    Bronze Age ethnolinguistic groups

    In the Gulf of Finland, Kiukainen evolves into the Paimio ceramics (in Finland) — Asva Ware (in Estonia) culture, which lasts from ca. 1600 to ca. 700 BC, probably representing an evolving Finno-Saamic community, while the Netted Ware from inner Finland (the Sarsa and Tomitsa groups) and the groups from the Forest Zone possibly represent a Volga-Finnic community.

    NOTE. Nevertheless, the boundaries between Textile ceramic groups are far from clear, and inner Finland Netted Ware groups seem to follow a history different from Netted Ware groups from the Middle and Upper Volga, hence they could possibly be identified as an evolving Pre-Saamic community.

    Based on language contacts, with Early Baltic – Early Finnic contacts starting during the Iron Age (ca. 500 BC onwards), this is a potential picture of the situation at the end of this period, when Germanic influence on the coast starts to fade, and Lusatian culture influence is stronger:

    The linguistic situation in Lapland and the northern Baltic Sea Area in the Early Iron Age prior to the expansion of Saami languages; the locations of the language groups are schematic. The black line indicates the distribution of Saami languages in the 19th century, and the gray line their approximate maximal distribution before the expansion of Finnic. Aikio (2012)

    The whole Finno-Permic community remains thus in close contact, allowing for the complicated picture that Kallio mentions as potentially showing Dahl’s wave model for Uralic languages.

    Genetic data shows a uniform picture of these communities, with exclusively CWC-derived ancestry and haplogroups. So in Mittnik et al. (2018) all Baltic samples show R1a-Z645 subclades, while the recent session on Estonian populations in ISBA 8 (see programme in PDF) clearly states that:

    [Of the 24 Bronze Age samples from stone-cist graves] all 18 Bronze Age males belong to R1a.

    Regarding non-Uralic substrates found in Saami, supposedly absorbed during the expansion to the north (and thus representing languages spoken in northern Fennoscandia during the Bronze Age) this is what Aikio (2012) has to say:

    The Saami substrate in the Finnish dialects thus reveals that also Lakeland Saami languages had a large number of vocabulary items of obscure origin. Most likely many of these words were substrate in Lakeland Saami, too, and ultimately derive from languages spoken in the region before Saami. In some cases the loan origin of these words is obvious due to their secondary Proto-Saami vowel combinations such as *ā–ë in *kāvë ‘bend; small bay’ and *šāpšë ‘whitefish’. This substrate can be called ‘Palaeo-Lakelandic’, in contrast to the ‘Palaeo-Laplandic’ substrate that is prominent in the lexicon of Lapland Saami. As the Lakeland Saami languages became extinct and only fragments of their lexicon can be reconstructed via elements preserved in Finnish place-names and dialectal vocabulary, we are not in a position to actually study the features of this Palaeo-Lakelandic substrate. Its existence, however, appears evident from the material above.

    If we wanted to speculate further, based on the data we have now, it is very likely that two opposing groups will be found in the region:

    A) The central Finnish group, in this hypothesis the Palaeo-Lakelandic group, made up of the descendants of the Mesolithic pioneers of the Komsa and Suomusjärvi cultures, and thus mainly Baltic HG / Scandinavian HG ancestry and haplogroups I / R1b(xM269) (see more on Scandinavian HG).

    Frequency map of the so-called ‘Siberian’ component. From Tambets et al. (2018).

    B) Lapland and Kola were probably also inhabited by similar Mesolithic populations, until it was eventually assimilated by expanding Siberian groups (of Siberian ancestry and N1c-L392 lineages) from the east – entering the region likely through the Kola peninsula – , forming the Palaeo-Laplandic group, which was in turn later replaced by expanding Proto-Saamic groups.

    Siberian ancestry appears first in Fennoscandia at Bolshoy Oleni Ostrov ca. 1520 BC, with haplogroup N1c-L392 (2 samples, BOO002 and BOO004), and with Siberian ancestry. This is their likely movement in north-eastern Europe, from Lamnidis et al (2018):

    The large Siberian component in the Bolshoy individuals from the Kola Peninsula provides the earliest direct genetic evidence for an eastern migration into this region. Such contact is well documented in archaeology, with the introduction of asbestos-mixed Lovozero ceramics during the second millenium BC, and the spread of even-based arrowheads in Lapland from 1,900 BCE. Additionally, the nearest counterparts of Vardøy ceramics, appearing in the area around 1,600-1,300 BCE, can be found on the Taymyr peninsula, much further to the east. Finally, the Imiyakhtakhskaya culture from Yakutia spread to the Kola Peninsula during the same period.

    PCA plot of 113 Modern Eurasian populations, with individuals from this study projected on the principal components. Uralic speakers are highlighted in light purple. Image modified from Lamnidis et al. (2018)

    Obviously, these groups of asbestos-tempered ware are not connected to the Uralic expansion. From the same paper:

    The fact that the Siberian genetic component is consistently shared among Uralic-speaking populations, with the exceptions of Hungarians and the non-Uralic speaking Russians, would make it tempting to equate this component with the spread of Uralic languages in the area. However, such a model may be overly simplistic. First, the presence of the Siberian component on the Kola Peninsula at ca. 4000 yBP predates most linguistic estimates of the spread of Uralic languages to the area. Second, as shown in our analyses, the admixture patterns found in historic and modern Uralic speakers are complex and in fact inconsistent with a single admixture event. Therefore, even if the Siberian genetic component partly spread alongside Uralic languages, it likely presented only an addition to populations carrying this component from earlier.

    2. The Early Iron Age

    The Ananino culture appears in the Vyatka-Kama area, famed for its metallurgy, with traditions similar to the North Pontic area, by this time developing Pre-Sauromatian traditions. It expanded to the north in the first half of the first millennium BC, remaining in contact with the steppes, as shown by the ‘Scythian’ nature of its material culture.

    NOTE. The Ananino culture can be later followed through its zoomorphic styles into Iron Age Pjanoborskoi and Gljadenovskoi cultures, later to Ural-Siberian Middle Age cultures – Itkuska, Ust’-Poluiska, Kulaiska cultures –, which in turn can be related as prototypes of medieval Permian styles.

    Territory of (early and maximum) Ananino material culture. Vasilyev (2002).

    At the same time as the Ananino culture begins to expand ca. 1000 BC, the Netted Ware tradition from the middle Oka expanded eastwards into the Oka-Vyatka interfluve of the middle Volga region, until then occupied by the Chirkovo culture. Eventually the Akozino or Akhmylovo group (ca. 800-300 BC) emerged from the area, showing a strong cultural influence from the Ananino culture, by that time already expanding into the Cis-Urals region.

    The Akozino culture remains nevertheless linked to the western Forest Zone traditions, with long-ranging influences from as far as the Lusatian culture in Poland (in metallurgical techniques), which at this point is also closely related with cultures from Scandinavia (read more on genetics of the Tollense Valley).

    Mälar celts and molds for casting (a) and the main distribution area (в) of Mälar-type celts of the Mälar type in the Volga-Kama region (according to Kuzminykh 1983: figure 92) and Scandinavia (according to Baudou 1960: Karte 10); Ananino celts and molds for casting (б) and the main distribution area (г) of the distribution of the celts of the Ananino type in the Volga-Kama area (according to Kuzminykh 1983: figure 9); dagger of Ananino type (д).Map from (Yushkova 2010)

    Different materials from Akozino reach Fennoscandia late, at the end of the Bronze Age and beginning of the Early Iron Age, precisely when the influence of the Nordic Bronze Age culture on the Gulf of Finland was declining.

    This is a period when Textile ceramic cultures in north-eastern Europe evolve into well-armed chiefdom-based groups, with each chiefdom including thousands or tens of thousands, with the main settlements being hill forts, and those in Fennoscandia starting ca. 1000-400 BC.

    Mälar-type celts and Ananino-type celts appear simultaneously in Fennoscandia and the Forest Zone, with higher concentrations in south-eastern Sweden (Mälaren) and the Volga-Kama region, supporting the existence of a revived international trade network.

    Distribution of the Akozino-Mälar axes according to Sergej V. Kuz’minykh (1996: 8, Abb. 2).

    The Paimio—Asva Ware culture evolves (ca. 700-200 BC) into the Morby (in Finland) — Ilmandu syle (in Estonia, Latvia, and Mälaren) culture. The old Paimio—Asva tradition continues side by side with the new one, showing a clear technical continuity with it, but with ornamentation compared to the Early Iron Age cultures of the Upper Volga area. This new south-eastern influence is seen especially in:

    • Akozino-Mälar axes (ca. 800-500 BC): introduced into the Baltic area in so great numbers – especially south-western Finland, the Åland islands, and the Mälaren area of eastern Sweden – that it is believed to be accompanied by a movement of warrior-traders of the Akozino-Akhmylovo culture, following the waterways that Vikings used more than a thousand years later. Rather than imports, they represent a copy made with local iron sources.
    • Tarand graves (ca. 500 BC – AD 400): these ‘mortuary houses’ appear in the coastal areas of northern and western Estonia and the islands, at the same time as similar graves in south-western Finland, eastern Sweden, northern Latvia and Courland. Similar burials are found in Akozino-Akhmylovo, with grave goods also from the upper and middle Volga region, while grave goods show continuity with Textile ware.

    The use of asbestos increases in mainland Finnish wares with Kjelmøy Ware (ca. 700 BC – AD 300), which replaced the Lovozero Ware; and in the east in inner Finland and Karelia with the Luukonsaari and Sirnihta wares (ca. 700-500 BC – AD 200), where they replaced the previous Sarsa-Tomitsa ceramics.

    The Gorodets culture appears during the Scythian period in the forest-steppe zone north and west of the Volga, shows fortified settlements, and there are documented incursions of Gorodets iron makers into the Samara valley, evidenced by deposits of their typical pottery and a bloom or iron in the region.

    Iron Age ethnolinguistic groups

    According to (Koryakova and Epimakhov 2007):

    It is commonly accepted by archaeology, ethnography, and linguistics that the ancestors of the Permian peoples (the Udmurts, Komi-Permians, and Komi-Zyryans) left the sites of Ananyino cultural intercommunity.

    NOTE. For more information on the Late Metal Ages and Early Medieval situation of Finno-Ugric languages, see e.g. South-eastern contact area of Finnic languages in the light of onomastics (Rahkonen 2013).

    Yakhr-, -khra, yedr-, -dra and yer-/yar, -er(o), -or(o) names of lakes in Central and North Russia and the possible boundary of the proto-language words *jäkra/ä and *järka/ä. Rahkonen (2011)

    Certain innovations shared between Proto-Fennic (identified with the Gulf of Finland) and Proto-Mordvinic (from the Gorodets culture) point to their close contact before the Proto-Fennic expansion, and thus to the identification of Gorodets as Proto-Mordvinic, hence Akozino as Volgaic (Parpola 2018):

    • the noun paradigms and the form and function of individual cases,
    • the geminate *mm (foreign to Proto-Uralic before the development of Fennic under Germanic influence) and other non-Uralic consonant clusters.
    • the change of numeral *luka ‘ten’ with *kümmen.
    • The presence of loanwords of non-Uralic origin, related to farming and trees, potentially Palaeo-European in nature (hence possibly from Siberian influence in north-eastern Europe).
    Map of archaeological cultures in north-eastern Europe ca. 8th-3rd centuries BC. [The Mid-Volga Akozino group not depicted] Shaded area represents the Ananino cultural-historical society. Purple area show likely zones of predominant Siberian ancestry and N1c-L392 lineages. Blue areas likely zones of predominant CWC ancestry and R1a-Z645 lineages. Fading purple arrows represent likely stepped movements of haplogroup N1c-L392 for centuries (Siberian → Ananino → Akozino → Fennoscandia), found eventually in tarand graves. Blue arrows represent eventual expansions of Fennic and (partially displaced) Saamic. Modified image from Vasilyev (2002).

    The introduction of a strongly hierarchical chiefdom system can quickly change the pre-existing social order and lead to a major genetic shift within generations, without a radical change in languages, as shown in Sintashta-Potapovka compared to the preceding Poltavka society (read more about Sintashta).

    Fortified settlements in the region represented in part visiting warrior-traders settled through matrimonial relationships with local chiefs, eager to get access to coveted goods and become members of a distribution network that could guarantee them even military assistance. Such a system is also seen synchronously in other cultures of the region, like the Nordic Bronze Age and Lusatian cultures (Parpola 2013).

    The most likely situation is that N1c subclades were incorporated from the Circum-Artic region during the Anonino (Permic) expansion to the north, later emerged during the formation of the Akozino group (Volgaic, under Anonino influence), and these subclades in turn infiltrated among the warrior traders that spread all over Fennoscandia and the eastern Baltic (mainly among Fennic, Saamic, Germanic, and Balto-Slavic peoples), during the age of hill forts, creating alliances partially based on exogamy strategies (Parpola 2013).

    Over the course of these events, no language change is necessary in any of the cultures involved, since the centre of gravity is on the expanding culture incorporating new lineages:

    • first on the Middle Volga, when Ananino expands to the north, incorporatinig N1c lineages from the Circum-Artic region.
    • then with the expansion of the Akozino-Akhmylovo culture into Ananino territory, admixing with part of its population;
    • then on the Baltic region, when materials are imported from Akozino into Fennoscandia and the eastern Baltic (and vice versa), with local cultures being infiltrated by foreign (Akozino) warrior-traders and their materials;
    • and later with the different population movements that led eventually to a greater or lesser relevance of N1c in modern Finno-Permic populations.

    To argue that this infiltration and later expansion of lineages changed the language in one culture in one of these events seems unlikely. To use this argument of “opposite movement of ethnic and language change” for different successive events, and only on selected regions and cultures (and not those where the greatest genetic and cultural impact is seen, like e.g. Sweden for Akozino materials) is illogical.

    NOTE. Notice how I write here about “infiltration” and “lineages”, not “migration” or “populations”. To understand that, see below the next section on autosomal studies to compare Bronze Age, Iron Age, Medieval and Modern Estonians, and see how little the population of Estonia (homeland of Proto-Fennic and partially of Proto-Finno-Saamic) has changed since the Corded Ware migrations, suggesting genetic continuity and thus mostly close inter-regional and intra-regional contacts in the Forest Zone, hence a very limited impact of the absorbed N1c lineages (originally at some point incorporated from the Circum-Artic region). You can also check on the most recent assessment of R1a vs. N1c in modern Uralic populations.

    Iron Age and later populations

    From the session on Estonian samples on ISBA 8, by Tambets et al.:

    [Of the 13 samples from the Iron Age tarand-graves] We found that the Iron Age individuals do in fact carry chrY hg N3 (…) Furthermore, based on their autosomal data, all of the studied individuals appear closer to hunter-gatherers and modern Estonians than Estonian CWC individuals do.

    EDIT (16 OCT) A recent abstract with Saag as main author (Tambets second) cites 3 out of 5 sampled Iron Age individuals as having haplogroup N3.

    EDIT (28 OCT): Notice also the appearance of N1a1a1a1a1a1a1-L1025 in Lithuania (ca. 300 AD), from Damgaard (Nature 2018); the N1c sample of the Krivichi Pskov Long Barrows culture (ca. 8th-10th c. AD), and N1a1a1a1a1a1a7-Y4341 among late Vikings from Sigtuna (ca. 10th-12th c. AD) in Krzewinska (2018).

    PCA of Estonian samples from the Bronze Age, Iron Age and Medieval times. Tambets et al. (2018, upcoming).

    Looking at the plot, the genetic inflow marking the change from the Bronze Age to the Iron Age looks like an obvious expansion of nearby peoples with CWC-related ancestry, i.e. likely from the south-east, near the Middle Volga, where influence of steppe peoples is greater (hence likely Akozino) into a Proto-Fennic population already admixed (since the arrival of Corded Ware groups) with Comb Ware-like populations.

    All of these groups were probably R1a-Z645 (likely R1a-Z283) since the expansion of Corded Ware peoples, with an introduction of some N1c lineages precisely during this Iron Age period. This infiltration of N1c-L392 with Akozino is obviously not directly related to Siberian cultures, given what we know about the autosomal description of Estonian samples.

    Rather, N1c-L392 lineages were likely part of the incoming (Volgaic) Akozino warrior-traders, who settled among developing chiefdoms based on hill fort settlements of cultures all over the Baltic area, and began to appear thus in some of the new tarand graves associated with the Iron Age in north-eastern Europe.f

    A good way to look at this is to realize that no new cluster appears compared to the data we already have from Baltic LN and BA samples from Mittnik et al. (2018), so the Estonian BA and IA clusters must be located (in a proper PCA) in the cline from Pit-Comb Ware culture through Baltic BA to Corded Ware groups:

    PCA and ADMIXTURE analysis reflecting three time periods in Northern European prehistory. a Principal components analysis of 1012 present-day West Eurasians (grey points, modern Baltic populations in dark grey) with 294 projected published ancient and 38 ancient North European samples introduced in this study (marked with a red outline). Population labels of modern West Eurasians are given in Supplementary Fig. 7 and a zoomed-in version of the European Late Neolithic and Bronze Age samples is provided in Supplementary Fig. 8. b Ancestral components in ancient individuals estimated by ADMIXTURE (k = 11)

    This genetic continuity from Corded Ware (the most likely Proto-Uralic homeland) to the Proto-Fennic and Proto-Saamic communities in the Gulf of Finland correlates very well with the known conservatism of Finno-Saamic phonology, quite similar to Finno-Ugric, and both to Proto-Uralic (Kallio 2017): The most isolated region after the expansion of Corded Ware peoples, the Gulf of Finland, shielded against migrations for almost 1,500 years, is then the most conservative – until the arrival of Akozino influence.

    NOTE. This has its parallel in the phonetic conservatism of Celtic or Italic compared to Finno-Ugric-influenced Germanic, Balto-Slavic, or Indo-Iranian.

    Only later would certain regions (like Finland or Lappland) suffer Y-DNA bottlenecks and further admixture events associated with population displacements and expansions, such as the spread of Fennic peoples from their Estonian homeland (evidenced by the earlier separation of South Estonian) to the north and east:

    The Finnic family tree. Kallio (2014).

    The initial Proto-Fennic expansion was probably coupled with the expansion of Proto-Saami to the north, with the Kjelmøy Ware absorbing the Siberian population of Lovozero Ware, and potentially in inner Finland and Karelia with the Luukonsaari and Sirnihta wares (Carpelan and Parpola 2017).

    This Proto-Saami population expansion from the mainland to the north, admixing with Lovozero-related peoples, is clearly reflected in the late Iron Age Saamic samples from Levänluhta (ca. 400-800 AD), as a shift (of 2 out of 3 samples) to Siberian-like ancestry from their original CWC_Baltic-like situation (see PCA from Lamnidis et al. 2018 above).

    Also, Volgaic and Permic populations from inner Finland and the Forest Zone to the Cis-Urals and Circum-Artic regions probably incorporate Siberian ancestry and N1c-L392 lineages during these and later population movements, while the westernmost populations – Estonian, Mordvinic – remain less admixed (see PCA from Tambets et al. 2018 below).

    We also have data of N1c-L392 in Nordic territory in the Middle Ages, proving its likely strong presence in the Mälaren area since the Iron Age, with the arrival of Akozino warrior traders. Similarly, it is found among Balto-Slavic groups along the eastern Baltic area. Obviously, no language change is seen in Nordic Bronze Age and Lusatian territory, and none is expected in Estonian or Finnish territory, either.

    Therefore, no “N1c-L392 + Siberian ancestry” can be seen expanding Finno-Ugric dialects, but rather different infiltrations and population movements with limited effects on ancestry and Y-DNA composition, depending on the specific period and region.

    Selection of the PCA, with the group of Estonians, Mordovians, and Hungarians selected. See Tambets et al. (2018) for more information.

    An issue never resolved

    Because N1c-L392 subclades & Siberian ancestry, which appear in different proportions and with different origins among some modern Uralic peoples, do not appear in cultures supposed to host Uralic-speaking populations until the Iron Age, people keep looking into any direction to find the ‘true’ homeland of those ‘Uralic N1c peoples’? Kind of a full circular reasoning, anyone? The same is valid for R1a & steppe ancestry being followed for ‘Indo-Europeans’, or R1b-P312 & Neolithic farmer ancestry being traced for ‘Basques’, because of their distribution in modern populations.

    I understand the caution of many pointing to the need to wait and see how samples after 2000 BC are like, in every single period, from the middle and upper Volga, Kama, southern Finland, and the Forest Zone between Fennoscandia and the steppe. It’s like waiting to see how people from Western Yamna and the Carpathian Basin after 3000 BC look like, to fill in what is lacking between East Yamna and Bell Beakers, and then between them and every single Late PIE dialect.

    But the answer for Yamna-Bell Beaker-Poltavka peoples during the Late PIE expansion is always going to be “R1b-L23, but with R1a-Z645 nearby” (we already have a pretty good idea about that); and the answer for the Forest Zone and northern Cis- and Trans-Urals area – during the time when Uralic languages are known to have already been spoken there – is always going to be “R1a-Z645, but with haplogroup N nearby”, as is already clear from the data on the eastern Baltic region.

    So, without a previously proposed model as to where those amateurs expressing concern about ‘not having enough data’ expect to find those ‘Uralic peoples’, all this waiting for the right data looks more like a waiting for N1c and Siberian ancestry to pop up somewhere in the historic Uralic-speaking area, to be able to say “There! A Uralic-speaking male!”. Not a very reasonable framework to deal with prehistoric peoples and their languages, I should think.

    But, for those who want to do that, let me break the news to you already:

    First N1c – Finno-Ugric person arrives in Estonia to teach Finno-Saamic to Balto-Slavic peoples.

    And here it is, an appropriate fantasy description of the ethnolinguistic groups from the region. You are welcome:

    • During the Bronze Age, late Corded Ware groups evolve as the western Textile ware Fennic Balto-Slavic group in the Gulf of Finland; the Netted Ware Saamic Balto-Slavic group of inner Finland; the south Netted Ware / Akozino Volgaic Balto-Slavic groups of the Middle Volga; and the Anonino Permic Balto-Slavic group in the north-eastern Forest Zone; all developing still in close contact with each other, allowing for common traits to permeate dialects.
    • These Balto-Slavic groups would then incorporate west of the Urals during and after the Iron Age (ca. 800-500 BC first, and also later during their expansion to the north) limited ancestry and lineages from eastern European hunter-gatherer groups of Palaeo-European Fennic and Palaeo-Siberian Volgaic and Permic languages from the Circum-Artic region, but they adopted nevertheless the language of the newcomers in every single infiltration of N1c lineages and/or admixture with Siberian ancestry. Oh and don’t forget the Saamic peoples from central Sweden, of course, the famous N1c-L392 ‘Rurikid’ lineages expanding Saamic to the north and replacing Proto-Germanic…

    The current model for those obsessed with modern Y-DNA is, therefore, that expanding Neolithic, Bronze Age and Iron Age cultures from north-eastern Europe adopted the languages of certain lineages originally from sub-Neolithic (Scandinavian and Siberian) hunter-gatherer populations of the Circum-Artic region; lineages that these cultures incorporated unevenly during their expansions. Hmmmm… Sounds like an inverse Western movie, where expanding Americans end up speaking Apache, and the eastern coast speaks Spanish until Italian migrants arrive and make everyone speak English… or something. A logic, no-nonsense approach to ethnolinguistic identification.

    I kid you not, this is the kind of models we are going to see very soon. In 2018 and 2019, with ancient DNA able to confirm or reject archaeological hypotheses based on linguistic data, people will keep instead creating new pet theories to support preconceived ideas based on the Y-DNA prevalent among modern populations. That is, information available in the 2000s.

    So what’s (so much published) ancient DNA useful for, exactly?

    [Next post on the subject: Corded Ware—Uralic (III): Seima-Turbino and the Ugric and Samoyedic expansion]

    See also


    More evidence on the recent arrival of haplogroup N and gradual replacement of R1a lineages in North-Eastern Europe


    A new article (in Russian), Kinship Analysis of Human Remains from the Sargat Mounds, Baraba forest-steppe, Western Siberia, by Pilipenko et al. Археология, этнография и антропология Евразии Том 45 № 4 2017, downloadable at ResearchGate.


    We present the results of a paleogenetic analysis of nine individuals from two Early Iron Age mounds in the Baraba forest -teppe, associated with the Sargat culture (fi ve from Pogorelka-2 mound 8, and four from Vengerovo-6 mound 1). Four systems of genetic markers were analyzed: mitochondrial DNA, the polymorphic part of the amelogenin gene, autosomal STR-loci, and those of the Y-chromosome. Complete or partial data, obtained for eight of the nine individuals, were subjected to kinship analysis. No direct relatives of the “parent-child” type were detected. However, the data indicate close paternal and maternal kinship among certain individuals. This was evidently one of the reasons why certain individuals were buried under a single mound. Paternal kinship appears to have been of greater importance. The diversity of mtDNA and Y-chromosome lineages among individuals from one and the same mound suggests that kinship was not the only motive behind burying the deceased people jointly. The presence of very similar, though not identical, variants of the Y chromosome in different burial grounds may indicate the existence of groups such as clans, consisting of paternally related males. Our conclusions need further confi rmation and detailed elaboration. Keywords: Paleogenetics, ancient DNA, kinship analysis, mitochondrial DNA, uniparental genetic markers, STR-loci, Y-chromosome, Baraba forest-steppe, Sargat culture, Early Iron Age.

    From the older study of the same region (Baraba, numbered 4) “Location of ancient human groups with a high frequency of mtDNA haplogroups U5, U4 and U2e lineages. The area of Northern Eurasian anthropological formation is marked by yellow region on the map (References: 1. Bramanti et al., 2009; 2. Malmstrom et
    al., 2009; 3. Krause et al., 2010; 4. this study)”

    Chronological time scale of Bronze Age Cultures from the Baraba region
    This is the same team that brought an ancient mtDNA study of different cultures within the Baraba steppe-forest region (from the Open Access book Population Dynamics in Prehistory and Early History).

    The Baraba steppe-forest is a region between the Ob and Irtysh rivers (about 800 km from west to east), stretching over 200 km from the taiga zone in the north to the steppes in the south.

    The new study brings a more recent picture of the region, from the Iron Age Sargat culture, ca. 500 BC – 500 AD, with five samples of haplogroup N and two samples of haplogroup R1a.

    R1a lineages in the region probably derive from the previous expansion of Andronovo and related cultures, which had absorbed North Caspian steppe populations and their Late Indo-European culture.

    N subclades prevalent in certain modern Eurasian populations are probably derived from the expansion of the Seima-Turbino phenomenon.

    While samples are scarce, Y-DNA data keeps showing the same picture I have spoken about more than once:

    N subclades (potentially originally speaking Proto-Yukaghir languages) gradually replacing haplogroup R1a (originally probably speaking Uralic languages), probably through successive founder effects (such as the bottlenecks found in Finland), which left their Uralic culture and ethnolinguistic identification intact.

    Therefore, late Corded Ware groups of North-Eastern Europe (in the Forest Zone and the Baltic), mainly of R1a-Z645 subclades, probably never adopted Late Indo-European languages.