Corded Ware ancestry in North Eurasia and the Uralic expansion

uralic-clines-nganasan

Now that it has become evident that Late Repin (i.e. Yamnaya/Afanasevo) ancestry was associated with the migration of R1b-L23-rich Late Proto-Indo-Europeans from the steppe in the second half of the the 4th millennium BC, there’s still the question of how R1a-rich Uralic speakers of Corded Ware ancestry expanded , and how they spread their languages throughout North Eurasia.

Modern North Eurasians

I have been collecting information from the supplementary data of the latest papers on modern and ancient North Eurasian peoples, including Jeong et al. (2019), Saag et al. (2019), Sikora et al. (2018), or Flegontov et al. (2019), and I have tried to add up their information on ancestral components and their modern and historical distributions.

Fortunately, the current obsession with simplifying ancestry components into three or four general, atemporal groups, and the common use of the same ones across labs, make it very simple to merge data and map them.

Corded Ware ancestry

There is no doubt about the prevalent ancestry among Uralic-speaking peoples. A map isn’t needed to realize that, because ancient and modern data – like those recently summarized in Jeong et al. (2019) – prove it. But maps sure help visualize their intricate relationship better:

natural-modern-srubnaya-ancestry
Natural neighbor interpolation of Srubnaya ancestry among modern populations. See full map.
kriging-modern-srubnaya-ancestry
Kriging interpolation of Srubnaya ancestry among modern populations. See full map

Interestingly, the regions with higher Corded Ware-related ancestry are in great part coincident with (pre)historical Finno-Ugric-speaking territories:

uralic-languages-modern
Modern distribution of Uralic languages, with ancient territory (in the Common Era) labelled and delimited by a red line. For more information on the ancient territory see here.

Edit (29/7/2019): Here is the full Steppe_MLBA ancestry map, including Steppe_MLBA (vs. Indus Periphery vs. Onge) in modern South Asian populations from Narasimhan et al. (2018), apart from the ‘Srubnaya component’ in North Eurasian populations. ‘Dummy’ variables (with 0% ancestry) have been included to the south and east of the map to avoid weird interpolations of Steppe_MLBA into Africa and East Asia.

modern-steppe-mlba-ancestry2
Natural neighbor interpolation of Steppe MLBA-like ancestry among modern populations. See full map.

Anatolia Neolithic ancestry

Also interesting are the patterns of non-CWC-related ancestry, in particular the apparent wedge created by expanding East Slavs, which seems to reflect the intrusion of central(-eastern) European ancestry into Finno-Permic territory.

NOTE. Read more on Balto-Slavic hydrotoponymy, on the cradle of Russians as a Finno-Permic hotspot, and about Pre-Slavic languages in North-West Russia.

natural-modern-lbk-en-ancestry
Natural neighbor interpolation of LBK EN ancestry among modern populations. See full map.
kriging-modern-lbk-en-ancestry
Kriging interpolation of LBK EN ancestry among modern populations. See full map

WHG ancestry

The cline(s) between WHG, EHG, ANE, Nganasan, and Baikal HG are also simplified when some of them excluded, in this case EHG, represented thus in part by WHG, and in part by more eastern ancestries (see below).

modern-whg-ancestry
Natural neighbor interpolation of WHG ancestry among modern populations. See full map.
kriging-modern-whg-ancestry
Kriging interpolation of WHG ancestry among modern populations. See full map.

Arctic, Tundra or Forest-steppe?

Data on Nganasan-related vs. ANE vs. Baikal HG/Ulchi-related ancestry is difficult to map properly, because both ancestry components are usually reported as mutually exclusive, when they are in fact clearly related in an ancestral cline formed by different ancient North Eurasian populations from Siberia.

When it comes to ascertaining the origin of the multiple CWC-related clines among Uralic-speaking peoples, the question is thus how to properly distinguish the proportions of WHG-, EHG-, Nganasan-, ANE or BaikalHG-related ancestral components in North Eurasia, i.e. how did each dialectal group admix with regional groups which formed part of these clines east and west of the Urals.

The truth is, one ought to test specific ancient samples for each “Siberian” ancestry found in the different Uralic dialectal groups, but the simplistic “Siberian” label somehow gets a pass in many papers (see a recent example).

Below qpAdm results with best fits for Ulchi ancestry, Afontova Gora 3 ancestry, and Nganasan ancestry, but some populations show good fits for both and with similar proportions, so selecting one necessarily simplifies the distribution of both.

Ulchi ancestry

modern-ulchi-ancestry
Natural neighbor interpolation of Ulchi ancestry among modern populations. See full map.
kriging-modern-ulchi-ancestry
Kriging interpolation of Ulchi ancestry among modern populations. See full map.

ANE ancestry

natural-modern-ane-ancestry
Natural neighbor interpolation of ANE ancestry among modern populations. See full map.
kriging-modern-ane-ancestry
Kriging interpolation of ANE ancestry among modern populations. See full map.

Nganasan ancestry

modern-nganasan-ancestry
Natural neighbor interpolation of Nganasan ancestry among modern populations. See full map.
kriging-modern-nganasan-ancestry
Kriging interpolation of Nganasan ancestry among modern populations. See full map.

Iran Chalcolithic

A simplistic Iran Chalcolithic-related ancestry is also seen in the Altaic cline(s) which (like Corded Ware ancestry) expanded from Central Asia into Europe – apart from its historical distribution south of the Caucasus:

modern-iran-chal-ancestry
Natural neighbor interpolation of Iran Neolithic ancestry among modern populations. See full map.
kriging-modern-iran-neolithic-ancestry
Kriging interpolation of Iran Chalcolithic ancestry among modern populations. See full map.

Other models

The first question I imagine some would like to know is: what about other models? Do they show the same results? Here is the simplistic combination of ancestry components published in Damgaard et al. (2018) for the same or similar populations:

NOTE. As you can see, their selection of EHG vs. WHG vs. Nganasan vs. Natufian vs. Clovis of is of little use, but corroborate the results from other papers, and show some interesting patterns in combination with those above.

EHG

damgaard-modern-ehg-ancestry
Natural neighbor interpolation of EHG ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-ehg-ancestry
Kriging interpolation of EHG ancestry among modern populations. See full map.

Natufian ancestry

damgaard-modern-natufian-ancestry
Natural neighbor interpolation of Natufian ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-natufian-ancestry
Kriging interpolation of Natufian ancestry among modern populations. See full map.

WHG ancestry

damgaard-modern-whg-ancestry
Natural neighbor interpolation of WHG ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-whg-ancestry
Kriging interpolation of WHG ancestry among modern populations. See full map.

Baikal HG ancestry

damgaard-modern-baikalhg-ancestry
Natural neighbor interpolation of Baikal hunter-gatherer ancestry among modern populations, data from Damgaard et al. (2018). See full map.
damgaard-kriging-baikal-hg-ancestry
Kriging interpolation of Baikal HG ancestry among modern populations. See full map.

Ancient North Eurasians

Once the modern situation is clear, relevant questions are, for example, whether EHG-, WHG-, ANE, Nganasan-, and/or Baikal HG-related meta-populations expanded or became integrated into Uralic-speaking territories.

When did these admixture/migration events happen?

How did the ancient distribution or expansion of Palaeo-Arctic, Baikalic, and/or Altaic peoples affect the current distribution of the so-called “Siberian” ancestry, and of hg. N1a, in each specific population?

NOTE. A little excursus is necessary, because the calculated repetition of a hypothetic opposition “N1a vs. R1a” doesn’t make this dichotomy real:

  1. There was not a single ethnolinguistic community represented by hg. R1a after the initial expansion of Eastern Corded Ware groups, or by hg. N1a-L392 after its initial expansion in Siberia:
  2. Different subclades became incorporated in different ways into Bronze Age and Iron Age communities, most of which without an ethnolinguistic change. For example, N1a subclades became incorporated into North Eurasian populations of different languages, reaching Uralic- and Indo-European-speaking territories of north-eastern Europe during the late Iron Age, at a time when their ancestral origin or language in Siberia was impossible to ascertain. Just like the mix found among Proto-Germanic peoples (R1b, R1a, and I1)* or among Slavic peoples (I2a, E1b, R1a)*, the mix of many Uralic groups showing specific percentages of R1a, N1a, or Q subclades* reflect more or less recent admixture or acculturation events with little impact on their languages.

*other typically northern and eastern European haplogroups are also represented in early Germanic (N1a, I2, E1b, J, G2), Slavic (I1, G2, J) and Finno-Permic (I1, R1b, J) peoples.

ananino-culture-new
Map of archaeological cultures in north-eastern Europe ca. 8th-3rd centuries BC. [The Mid-Volga Akozino group not depicted] Shaded area represents the Ananino cultural-historical society. Fading purple arrows represent likely stepped movements of subclades of haplogroup N for centuries (e.g. Siberian → Ananino → Akozino → Fennoscandia [N-VL29]; Circum-Arctic → forest-steppe [N1, N2]; etc.). Blue arrows represent eventual expansions of Uralic peoples to the north. Modified image from Vasilyev (2002).

The problem with mapping the ancestry of the available sampling of ancient populations is that we lack proper temporal and regional transects. The maps that follow include cultures roughly divided into either “Bronze Age” or “Iron Age” groups, although the difference between samples may span up to 2,000 years.

NOTE. Rough estimates for more external groups (viz. Sweden Battle Axe/Gotland_A for the NW, Srubna from the North Pontic area for the SW, Arctic/Nganasan for the NE, and Baikal EBA/”Ulchi-like” for the SE) have been included to offer a wider interpolated area using data already known.

Bronze Age

Similar to modern populations, the selection of best fit “Siberian” ancestry between Baikal HG vs. Nganasan, both potentially ± ANE (AG3), is an oversimplification that needs to be addressed in future papers.

Corded Ware ancestry

bronze-age-corded-ware-ancestry
Natural neighbor interpolation of Srubnaya ancestry among Bronze Age populations. See full map.

Nganasan-like ancestry

bronze-age-nganasan-like-ancestry
Natural neighbor interpolation of Nganasan-like ancestry among Bronze Age populations. See full map.

Baikal HG ancestry

bronze-age-baikal-hg-ancestry
Natural neighbor interpolation of Baikal Hunter-Gatherer ancestry among Bronze Age populations. See full map.

Afontova Gora 3 ancestry

bronze-age-afontova-gora-ancestry
Natural neighbor interpolation of Afontova Gora 3 ancestry among Bronze Age populations. See full map.

Iron Age

Corded Ware ancestry

Interestingly, the moderate expansion of Corded Ware-related ancestry from the south during the Iron Age may be related to the expansion of hg. N1a-VL29 into the chiefdom-based system of north-eastern Europe, including Ananyino/Akozino and later expanding Akozino warrior-traders around the Baltic Sea.

NOTE. The samples from Levänluhta are centuries older than those from Estonia (and Ingria), and those from Chalmny Varre are modern ones, so this region has to be read as a south-west to north-east distribution from the Iron Age to modern times.

iron-age-corded-ware-ancestry
Natural neighbor interpolation of Srubnaya ancestry among Iron Age populations. See full map.

Baikal HG-like ancestry

The fact that this Baltic N1a-VL29 branch belongs in a group together with typically Avar N1a-B197 supports the Altaic origin of the parent group, which is possibly related to the expansion of Baikalic ancestry and Iron Age nomads:

iron-age-baikal-ancestry
Natural neighbor interpolation of Baikal HG ancestry among Iron Age populations. See full map.

Nganasan-like ancestry

The dilution of Nganasan-like ancestry in an Arctic region featuring “Siberian” ancestry and hg. N1a-L392 at least since the Bronze Age supports the integration of hg. N1a-Z1934, sister clade of Ugric N1a-Z1936, into populations west and east of the Urals with the expansion of Uralic languages to the north into the Tundra region (see here).

The integration of N1a-Z1934 lineages into Finnic-speaking peoples after their migration to the north and east, and the displacement or acculturation of Saami from their ancestral homeland, coinciding with known genetic bottlenecks among Finns, is yet another proof of this evolution:

iron-age-nganasan-ancestry
Natural neighbor interpolation of Nganasan ancestry among Iron Age populations. See full map.

WHG ancestry

Similarly, WHG ancestry doesn’t seem to be related to important population movements throughout the Bronze Age, which excludes the multiple North Eurasian populations that will be found along the clines formed by WHG, EHG, ANE, Nganasan, Baikal HG ancestry as forming part of the Uralic ethnogenesis, although they may be relevant to follow later regional movements of specific populations.

iron-age-whg-ancestry
Natural neighbor interpolation of WHG ancestry among Iron Age populations. See full map.

Conclusion

It seems natural that people used to look at maps of haplogroup distribution from the 2000s, coupled with modern language distributions, and would try to interpret them in a certain way, reaching thus the wrong conclusions whose consequences are especially visible today when ancient DNA keeps contradicting them.

In hindsight, though, assuming that Balto-Slavs expanded with Corded Ware and hg. R1a, or that Uralians expanded with “Siberian” ancestry and hg. N1a, was as absurd as looking at maps of ancestry and haplogroup distribution of ancient and modern Native Americans, trying to divide them into “Germanic” or “Iberian”…

The evolution of each specific region and cultural group of North Eurasia is far from being clear. However, the general trend speaks clearly in favour of an ancient, Bronze Age distribution of North Eurasian ancestry and haplogroups that have decreased, diluted, or become incorporated into expanding Uralians of Corded Ware ancestry, occasionally spreading with inter-regional expansions of local groups.

Given the relatively recent push of Altaic and Indo-European languages into ancestral Uralic-speaking territories, only the ancient Corded Ware expansion remains compatible with the spread of Uralic languages into their historical distribution.

Related

Vikings, Vikings, Vikings! “eastern” ancestry in the whole Baltic Iron Age

vikings-middle-age

Open access Population genomics of the Viking world, by Margaryan et al. bioRxiv (2019), with a huge new sampling from the Viking Age.

Interesting excerpts (emphasis mine, modified for clarity):

To understand the genetic structure and influence of the Viking expansion, we sequenced the genomes of 442 ancient humans from across Europe and Greenland ranging from the Bronze Age (c. 2400 BC) to the early Modern period (c. 1600 CE), with particular emphasis on the Viking Age. We find that the period preceding the Viking Age was accompanied by foreign gene flow into Scandinavia from the south and east: spreading from Denmark and eastern Sweden to the rest of Scandinavia. Despite the close linguistic similarities of modern Scandinavian languages, we observe genetic structure within Scandinavia, suggesting that regional population differences were already present 1,000 years ago.

Maps illustrating the following texts have been made based on data from this and other papers:

  • Maps showing ancestry include only data from this preprint (which also includes some samples from Sigtuna).
  • Maps showing haplogroup density include Vikings from other publications, such as those from Sigtuna in Krzewinska et al. (2018), and from Iceland in Ebenesersdóttir et al. (2018).
  • Maps showing haplogroups of ancient DNA samples based on their age include data from all published papers, but with slightly modified locations to avoid overcrowding (randomized distance approx. ± 0.1 long. and lat.).

middle-ages-europe-y-dna
Y-DNA haplogroups in Europe during the Viking expansions (full map). See other maps from the Middle Ages.

We find that the transition from the BA to the IA is accompanied by a reduction in Neolithic farmer ancestry, with a corresponding increase in both Steppe-like ancestry and hunter-gatherer ancestry. While most groups show a slight recovery of farmer ancestry during the VA, there is considerable variation in ancestry across Scandinavia. In particular, we observe a wide range of ancestry compositions among individuals from Sweden, with some groups in southern Sweden showing some of the highest farmer ancestry proportions (40% or more in individuals from Malmö, Kärda or Öland).

Ancestry proportions in Norway and Denmark on the other hand appear more uniform. Finally we detect an influx of low levels of “eastern” ancestry starting in the early VA, mostly constrained among groups from eastern and central Sweden as well as some Norwegian groups. Testing of putative source groups for this “eastern” ancestry revealed differing patterns among the Viking Age target groups, with contributions of either East Asian- or Caucasus-related ancestry.

saami-ancestry-vikings
Ancestry proportions of four-way models including additional putative source groups for target groups for which three-way fit was rejected (p ≤ 0.01);

Overall, our findings suggest that the genetic makeup of VA Scandinavia derives from mixtures of three earlier sources: Mesolithic hunter-gatherers, Neolithic farmers, and Bronze Age pastoralists. Intriguingly, our results also indicate ongoing gene flow from the south and east into Iron Age Scandinavia. Thus, these observations are consistent with archaeological claims of wide-ranging demographic turmoil in the aftermath of the Roman Empire with consequences for the Scandinavian populations during the late Iron Age.

Genetic structure within Viking-Age Scandinavia

We find that VA Scandinavians on average cluster into three groups according to their geographic origin, shifted towards their respective present-day counterparts in Denmark, Sweden and Norway. Closer inspection of the distributions for the different groups reveals additional complexity in their genetic structure.

vikings-danish-ancestry
Natural neighbor interpolation of “Danish ancestry” among Vikings.

We find that the ‘Norwegian’ cluster includes Norwegian IA individuals, who are distinct from both Swedish and Danish IA individuals which cluster together with the majority of central and eastern Swedish VA individuals. Many individuals from southwestern Sweden (e.g. Skara) cluster with Danish present-day individuals from the eastern islands (Funen, Zealand), skewing towards the ‘Swedish’ cluster with respect to early and more western Danish VA individuals (Jutland).

Some individuals have strong affinity with Eastern Europeans, particularly those from the island of Gotland in eastern Sweden. The latter likely reflects individuals with Baltic ancestry, as clustering with Baltic BA individuals is evident in the IBS-UMAP analysis and through f4-statistics.

vikings-norwegian-ancestry
Natural neighbor interpolation of “Norwegian ancestry” among Vikings.

For more on this influx of “eastern” ancestry see my previous posts (including Viking samples from Sigtuna) on Genetic and linguistic continuity in the East Baltic, and on the Pre-Proto-Germanic homeland based on hydrotoponymy.

Baltic ancestry in Gotland

Genetic clustering using IBS-UMAP suggested genetic affinities of some Viking Age individuals with Bronze Age individuals from the Baltic. To further test these, we quantified excess allele sharing of Viking Age individuals with Baltic BA compared to early Viking Age individuals from Salme using f4 statistics. We find that many individuals from the island of Gotland share a significant excess of alleles with Baltic BA, consistent with other evidence of this site being a trading post with contacts across the Baltic Sea.

vikings-finnish-ancestry
Natural neighbor interpolation of “Finnish ancestry” among Vikings.

The earliest N1a-VL29 sample available comes from Iron Age Gotland (VK579) ca. AD 200-400 (see Iron Age Y-DNA maps), which also proves its presence in the western Baltic before the Viking expansion. The distribution of N1a-VL29 and R1a-Z280 (compared to R1a in general) among Vikings also supports a likely expansion of both lineages in succeeding waves from the east with Akozino warrior-traders, at the same time as they expanded into the Gulf of Finland.

vikings-y-dna-haplogroup-r1a-z280-over-r1a
Density of haplogroup R1a-Z280 (samples in pink) overlaid over other R1a samples (in green, with R1a-Z284 in cyan) among Vikings.

Vikings in Estonia

(…) only one Viking raiding or diplomatic expedition has left direct archaeological traces, at Salme in Estonia, where 41 Swedish Vikings who died violently were buried in two boats accompanied by high-status weaponry. Importantly, the Salme boat-burial predates the first textually documented raid (in Lindisfarne in 793) by nearly half a century. Comparing the genomes of 34 individuals from the Salme burial using kinship analyses, we find that these elite warriors included four brothers buried side by side and a 3rd degree relative of one of the four brothers. In addition, members of the Salme group had very similar ancestry profiles, in comparison to the profiles of other Viking burials. This suggests that this raid was conducted by genetically homogeneous people of high status, including close kin. Isotope analyses indicate that the crew descended from the Mälaren area in Eastern Sweden thus confirming that the Baltic-Mid-Swedish interaction took place early in the VA.

vikings-swedish-ancestry
Natural neighbor interpolation of “Swedish ancestry” among Vikings.

Viking samples from Estonia show thus ancient Swedes from the Mälaren area, which proves once again that hg. N1a-VL29 (especially subclade N1a-L550) and tiny proportions of so-called “Siberian ancestry” expanded during the Early Iron Age into the whole Baltic Sea area, not only into Estonia, and evidently not spreading with Balto-Finnic languages (since the language influence is in the opposite direction, east-west, Germanic > Finno-Samic, during the Bronze Age).

N1a-VL29 lineages spread again later eastwards with Varangians, from Sweden into north-eastern Europe, most likely including the ancestors of the Rurikid dynasty. Unsurprisingly, the arrival of Vikings with Swedish ancestry into the East Baltic and their dispersal through the forest zone didn’t cause a language shift of Balto-Finnic, Mordvinic, or East Slavic speakers to Old Norse, either…

NOTE. For N1a-Y4339 – N1a-L550 subclade of Swedish origin – as main haplogroup of modern descendants of Rurikid princes, see Volkov & Seslavin (2019) – full text in comments below. Data from ancient samples show varied paternal lineages even among early rulers traditionally linked to Rurik’s line, which explains some of the discrepancies found among modern descendants:

  • A sample from Chernihiv (VK542) potentially belonging to Gleb Svyatoslavich, the 11th century prince of Tmutarakan/Novgorod, belongs to hg. I2a-Y3120 (a subclade of early Slavic I2a-CTS10228) and has 71% “Modern Polish” ancestry (see below).
  • Izyaslav Ingvarevych, the 13th century prince of Dorogobuzh, Principality of Volhynia/Galicia, is probably behind a sample from Lutsk (VK541), and belongs to hg. R1a-L1029 (a subclade of R1a-M458), showing ca. 95% of “Modern Polish” ancestry.
  • Yaroslav Osmomysl, the 12th century Prince of Halych (now in Western Ukraine), was probably of hg. E1b-V13, yet another clearly early Slavic haplogroup.

vikings-y-dna-haplogroup-n1a
Density of haplogroup N1a-VL29, N1a-L550 (samples in pink, most not visible) among Vikings. Samples of hg. R1b in blue, hg. R1a in green, hg. I in orange.

Finnish ancestry

Firstly, modern Finnish individuals are not like ancient Finnish individuals, modern individuals have ancestry of a population not in the reference; most likely Steppe/Russian ancestry, as Chinese are in the reference and do not share this direction. Ancient Swedes and Norwegians are more extreme than modern individuals in PC2 and 4. Ancient UK individuals were more extreme than Modern UK individuals in PC3 and 4. Ancient Danish individuals look rather similar to modern individuals from all over Scandinavia. By using a supervised ancient panel, we have removed recent drift from the signal, which would have affected modern Scandinavians and Finnish populations especially. This is in general a desirable feature but it is important to check that it has not affected inference.

ancient-modern-finns-steppe
PCA of the ancient and modern samples using the ancient palette, showing different PCs. Modern individuals are grey and the K=7 ancient panel surrogate populations are shown in strong colors, whilst the remaining M-K=7 ancient populations are shown in faded colors.

The story for Modern-vs-ancient Finnish ancestry is consistent, with ancient Finns looking much less extreme than the moderns. Conversely, ancient Norwegians look like less-drifted modern Norwegians; the Danish admixture seen through the use of ancient DNA is hard to detect because of the extreme drift within Norway that has occurred since the admixture event. PC4 vs PC5 is the most important plot for the ancient DNA story: Sweden and the UK (along with Poland, Italy and to an extent also Norway) are visibly extremes of a distribution the same “genes-mirror-geography” that was seen in the Ancient-palette analysis. PC1 vs PC2 tells the same story – and stronger, since this is a high variance-explained PC – for the UK, Poland and Italy.

Uniform manifold approximation and projection (UMAP) analysis of the VA and other ancient samples.

Evidence for Pictish Genomes

The four ancient genomes of Orkney individuals with little Scandinavian ancestry may be the first ones of Pictish people published to date. Yet a similar (>80% “UK ancestry) individual was found in Ireland (VK545) and five in Scandinavia, implying that Pictish populations were integrated into Scandinavian culture by the Viking Age.

Our interpretation for the Orkney samples can be summarised as follows. Firstly, they represent “native British” ancestry, rather than an unusual type of Scandinavian ancestry. Secondly, that this “British” ancestry was found in Britain before the Anglo-Saxon migrations. Finally, that in Orkney, these individuals would have descended from Pictish populations.

vikings-british-ancestry
Natural neighbor interpolation of “British ancestry” among Vikings.

(…) ‘UK’ represents a group from which modern British and Irish people all receive an ancestry component. This information together implies that within the sampling frame of our data, they are proxying the ‘Briton’ component in UK ancestry; that is, a pre-Roman genetic component present across the UK. Given they were found in Orkney, this makes it very likely that they were descended from a Pictish population.

Modern genetic variation within the UK sees variation between ‘native Briton’ populations Wales, Scotland, Cornwall and Ireland as large compared to that within the more ‘Anglo-Saxon’ English. This is despite subsequent gene flow into those populations from English-like populations. We have not attempted to disentangle modern genetic drift from historically distinct populations. Roman-era period people in England, Wales, Ireland and Scotland may not have been genetically close to these Orkney individuals, but our results show that they have a shared genetic component as they represent the same direction of variation.

Density of haplogroup R1b-L21 (samples in red), overlaid over all samples of hg. R1b among Vikings (R1b-U106 in green, other R1b-L151 in deep red). To these samples one may add the one from Janakkala in south-western Finland (AD ca. 1300), of hg. R1b-L21, possibly related to these population movements.

For more on Gaelic ancestry and lineages likely representing slaves among early Icelanders, see Ebenesersdóttir et al. (2018).

Y-DNA

As in the case of mitochondrial DNA, the overall distribution profile of the Y chromosomal haplogroups in the Viking Age samples was similar to that of the modern North European populations. The most frequently encountered male lineages were the haplogroups I1, R1b and R1a.

Haplogroup I (I1, I2)

The distribution of I1 in southern Scandinavia, including a sample from Sealand (VK532) ca. AD 100 (see Iron Age Y-DNA maps) proves that it had become integrated into the West Germanic population already before their expansions, something that we already suspected thanks to the sampling of Germanic tribes.

vikings-y-dna-haplogroup-i
Density of haplogroup I (samples in orange) among Vikings. Samples of hg. R1b in blue, hg. R1a in green, N1a in pink.
vikings-y-dna-haplogroup-i1-over-i
Density of haplogroup I1 (samples in red) overlaid over all samples of hg. I among Vikings.

Haplogroup R1b (M269, U106, P312)

Especially interesting is the finding of R1b-L151 widely distributed in the historical Nordic Bronze Age region, which is in line with the estimated TMRCA for R1b-P312 subclades found in Scandinavia, despite the known bottleneck among Germanic peoples under U106. Particularly telling in this regard is the finding of rare haplogroups R1b-DF19, R1b-L238, or R1b-S1194. All of that points to the impact of Bell Beaker-derived peoples during the Dagger period, when Pre-Proto-Germanic expanded into Scandinavia.

Also interesting is the finding of hg. R1b-P297 in Troms, Norway (VK531) ca. 2400 BC. R1b-P297 subclades might have expanded to the north through Finland with post-Swiderian Mesolithic groups (read more about Scandinavian hunter-gatherers), and the ancestry of this sample points to that origin.

However, it is also known that ancestry might change within a few generations of admixture, and that the transformation brought about by Bell Beakers with the Dagger Period probably reached Troms, so this could also be a R1b-M269 subclade. In fact, the few available data from this sample show that it comes from the natural harbour Skarsvågen at the NW end of the island Senja, and that its archaeologist thought it was from the Viking period or slightly earlier, based on the grave form. From Prescott (2017):

In 1995, Prescott and Walderhaug tentatively argued that a dramatic transformation took place in Norway around the Late Neolithic (2350 BCE), and that the swift nature of this transition was tied to the initial Indo-Europeanization of southern and coastal Norway, at least to Trøndelag and perhaps as far north as Troms. (…)

The Bell Beaker/early Late Neolithic, however, represents a source and beginning of these institution and practices, exhibits continuity to the following metal age periods and integrated most of Northern Europe’s Nordic region into a set of interaction fields. This happened around 2400 BCE, at the MNB to LN transition.

NOTE. This particular sample is not included in the maps of Viking haplogroups.

vikings-y-dna-haplogroup-r1b
Density of haplogroup R1b (samples in blue) among Vikings. Samples of hg. I in orange, hg. R1a in green, N1a in pink.
vikings-y-dna-haplogroup-r1b-U106-over-r1b
Density of haplogroup R1b-U106 (samples in green) overlaid over all samples of hg. R1b (other R1b-L23 samples in red) among Vikings.
vikings-y-dna-haplogroup-r1b-P312-over-r1b
Density of R1b-L151 (xR1b-U106) (samples in deep red) overlaid over all samples of hg. R1b (R1b-U106 in green, other R1b-M269 in blue) among Vikings.

Haplogroup R1a (M417, Z284)

The distribution of hg. R1a-M417, in combination with data on West Germanic peoples, shows that it was mostly limited to Scandinavia, similar to the distribution of I1. In fact, taking into account the distribution of R1a-Z284 in particular, it seems even more isolated, which is compatible with the limited impact of Corded Ware in Denmark or the Northern European Plain, and the likely origin of R1a-Z284 in the expansion with Battle Axe from the Gulf of Finland. The distribution of R1a-Z280 (see map above) is particularly telling, with a distribution around the Baltic Sea mostly coincident with that of N1a.

vikings-y-dna-haplogroup-r1a
Density of haplogroup R1a (samples in green) among Vikings. Samples of hg. R1b in blue, of hg. I in orange, N1a in pink.
vikings-y-dna-haplogroup-r1a-z284-over-r1a
Density of haplogroup R1a-Z284 (samples in cyan) overlaid over all samples of hg. R1a (in green, with R1a-Z280 in pink) among Vikings.

Other haplogroups

Among the ancient samples, two individuals were derived haplogroups were identified as E1b1b1-M35.1, which are frequently encountered in modern southern Europe, Middle East and North Africa. Interestingly, the individuals carrying these haplogroups had much less Scandinavian ancestry compared to the most samples inferred from haplotype based analysis. A similar pattern was also observed for less frequent haplogroups in our ancient dataset, such as G (n=3), J (n=3) and T (n=2), indicating a possible non-Scandinavian male genetic component in the Viking Age Northern Europe. Interestingly, individuals carrying these haplogroups were from the later Viking Age (10th century and younger), which might indicate some male gene influx into the Viking population during the Viking period.

vikings-italian-ancestry
Natural neighbor interpolation of “Italian ancestry” among Vikings.

As the paper says, the small sample size of rare haplogroups cannot distinguish if these differences are statistically relevant. Nevertheless, both E1b samples have substantial Modern Polish-like ancestry: one sample from Gotland (VK474), of hg. E1b-L791, has ca. 99% “Polish” ancestry, while the other one from Denmark (VK362), of hg. E1b-V13, has ca. 35% “Polish”, ca. 35% “Italian”, as well as some “Danish” (14%) and minor “British” and “Finnish” ancestry.

Given the E1b-V13 samples of likely Central-East European origin among Lombards, Visigoths, and especially among Early Slavs, and the distribution of “Polish” ancestry among Viking samples, VK362 is probably a close description of the typical ancestry of early Slavs. The peak of Modern Polish-like ancestry around the Upper Pripyat during the (late) Viking Age suggests that Poles (like East Slavs) have probably mixed since the 10th century with more eastern peoples close to north-eastern Europeans, derived from ancient Finno-Ugrians:

vikings-polish-ancestry
Natural neighbor interpolation of “Polish ancestry” among Vikings.

Similarly, the finding of R1a-M458 among Vikings in Funen, Denmark (VK139), in Lutsk, Poland (VK541), and in Kurevanikha, Russia (VK160), apart from the early Slav from Usedom, may attest to the origin of the spread of this haplogroup in the western Baltic after the Bell Beaker expansion, once integrated in both Germanic and Balto-Slavic populations, as well as intermediate Bronze Age peoples that were eventually absorbed by their expansions. This contradicts, again, my simplistic initial assessment of R1a-M458 expansion as linked exclusively (or even mainly) to Balto-Slavs.

antiquity-europe-y-dna
Y-DNA haplogroups in Europe during Antiquity (full map). See other maps of cultures and ancient DNA from Antiquity.

Related

Uralic speakers formed clines of Corded Ware ancestry with WHG:ANE populations

steppe-forest-tundra-biomes-uralic

The preprint by Jeong et al. (2018) has been published: The genetic history of admixture across inner Eurasia Nature Ecol. Evol. (2019).

Interesting excerpts, referring mainly to Uralic peoples (emphasis mine):

A model-based clustering analysis using ADMIXTURE shows a similar pattern (Fig. 2b and Supplementary Fig. 3). Overall, the proportions of ancestry components associated with Eastern or Western Eurasians are well correlated with longitude in inner Eurasians (Fig. 3). Notable outliers include known historical migrants such as Kalmyks, Nogais and Dungans. The Uralic- and Yeniseian-speaking populations, as well as Russians from multiple locations, derive most of their Eastern Eurasian ancestry from a component most enriched in Nganasans, while Turkic/Mongolic speakers have this component together with another component most enriched in populations from the Russian Far East, such as Ulchi and Nivkh (Supplementary Fig. 3). Turkic/Mongolic speakers comprising the bottom-most cline have a distinct Western Eurasian ancestry profile: they have a high proportion of a component most enriched in Mesolithic Caucasus hunter-gatherers and Neolithic Iranians and frequently harbour another component enriched in present-day South Asians (Supplementary Fig. 4). Based on the PCA and ADMIXTURE results, we heuristically assigned inner Eurasians to three clines: the ‘forest-tundra’ cline includes Russians and all Uralic and Yeniseian speakers; the ‘steppe-forest’ cline includes Turkic- and Mongolic-speaking populations from the Volga and Altai–Sayan regions and Southern Siberia; and the ‘southern steppe’ cline includes the rest of the populations.

eurasian-clines-uralic-altaic
The first two PCs summarizing the genetic structure within 2,077 Eurasian individuals. The two PCs generally mirror geography. PC1 separates western and eastern Eurasian populations, with many inner Eurasians in the middle. PC2 separates eastern Eurasians along the northsouth cline and also separates Europeans from West Asians. Ancient individuals (color-filled shapes), including two Botai individuals, are projected onto PCs calculated from present-day individuals.

For the forest-tundra populations, the Nganasan + Srubnaya model is adequate only for the two Volga region populations, Udmurts and Besermyans (Fig. 5 and Supplementary Table 8).

For the other populations west of the Urals, six from the northeastern corner of Europe are modelled with additional Mesolithic Western European hunter-gatherer (WHG) contribution (8.2–11.4%; Supplementary Table 8), while the rest need both WHG and early Neolithic European farmers (LBK_EN; Supplementary Table 2). Nganasan-related ancestry substantially contributes to their gene pools and cannot be removed from the model without a significant decrease in the model fit (4.1–29.0% contribution; χ2 P ≤ 1.68 × 10−5; Supplementary Table 8).

west-urals-finno-ugrians-qpadm
Supplementary Table 8. QpAdm-based admixture modeling of the forest-tundra cline populations. For the 13 populations west of the Urals, we present a four-way admixture model, Nganasan+Srubnaya+WHG+LBK_EN, or its minimal adequate subset. Modified from the article, to include colors for cultures, and underlined best models for Corded Ware ancestry among Uralians.

NOTE. It doesn’t seem like Hungarians can be easily modelled with Nganasan ancestry, though…

For the 4 populations east of the Urals (Enets, Selkups, Kets and Mansi), for which the above models are not adequate, Nganasan + Srubnaya + AG3 provides a good fit (χ2 P ≥ 0.018; Fig. 5 and Supplementary Table 8). Using early Bronze Age populations from the Baikal Lake region (‘Baikal_EBA’; Supplementary Table 2) as a reference instead of Nganasan, the two-way model of Baikal_EBA + Srubnaya provides a reasonable fit (χ2 P ≥ 0.016; Supplementary Table 8) and the three-way model of Baikal_EBA + Srubnaya + AG3 is adequate but with negative AG3 contribution for Enets and Mansi (χ2 P ≥ 0.460; Supplementary Table 8).

east-urals-ugric-samoyedic-qpadm
Supplementary Table 8. QpAdm-based admixture modeling of the forest-tundra cline populations. For the four populations east of the Urals, we present three admixture models: Baikal_EBA+Srubnaya, Baikal_EBA+Srubnaya+AG3 and Nganasan+Srubnaya+AG3. For each model, we present qpAdm p-value, admixture coefficient estimates and associated 5 cM jackknife standard errors (estimate ± SE). Modified from the article, to include colors for cultures, and underlined best models for Corded Ware ancestry among Uralians.

Bronze/Iron Age populations from Southern Siberia also show a similar ancestry composition with high ANE affinity (Supplementary Table 9). The additional ANE contribution beyond the Nganasan + Srubnaya model suggests a legacy from ANE-ancestry-rich clines before the Late Bronze Age.

bronze-age-iron-age-karasuk-mezhovska-tagar-qpadm
Supplementary Table 9. QpAdm-based admixture modeling of Bronze and Iron Age populations of southern Siberia. For ancieint individuals associated with Karasuk and Tagar cultures, Nganasan+Srubnaya model is insufficient. For all five groups, adding AG3 as the third ancestry or substituting Nganasan with Baikal_EBA with higher ANE affinity provides an adequate model. For each model, we present qpAdm p-value, admixture coefficient estimates and associated 5 cM jackknife standard errors (estimate ± SE). Models with p-value ≥ 0.05 are highlighted in bold face. Modified from the article, to include colors for cultures, and underlined best models for Corded Ware ancestry among Uralians.

Lara M. Cassidy comments the results of the study in A steppe in the right direction (you can read it here):

Even among the earliest available inner Eurasian genomes, east–west connectivity is evident. These, too, form a longitudinal cline, characterized by the easterly increase of a distinct ancestry, labelled Ancient North Eurasian (ANE), lowest in western European hunter-gatherers (WHG) and highest in Palaeolithic Siberians from the Baikal region. Flow-through from this ANE cline is seen in steppe populations until at least the Bronze Age, including the world’s earliest known horse herders — the Botai. However, this is eroded over time by migration from west and east, following agricultural adoption on the continental peripheries (Fig. 1b,c).

Strikingly, Jeong et al. model the modern upper steppe cline as a simple two-way mixture between western Late Bronze Age herders and Northeast Asians (Fig. 1c), with no detectable residue from the older ANE cline. They propose modern steppe peoples were established mainly through migrations post-dating the Bronze Age, a sequence for which has been recently outlined using ancient genomes. In contrast, they confirm a substantial ANE legacy in modern Siberians of the northernmost cline, a pattern mirrored in excesses of WHG ancestry west of the Urals (Fig. 1b). This marks the inhospitable biome as a reservoir for older lineages, an indication that longstanding barriers to latitudinal movement may indeed be at work, reducing the penetrance of gene flows further south along the steppe.

eurasian-clines-uralic-turkic-mongol-altaic
The genomic formation of inner Eurasians. b–d, Depiction of the three main clines of ancestry identified among Inner Eurasians. Sources of admixture for each cline are represented using proxy ancient populations, both sampled and hypothesised, based on the study’s modelling results. The major eastern and western ancestries used to model each cline are shown in bold; the peripheral admixtures that gave rise to these are also shown. Additional contributions to subsections of each cline are marked with dashed lines. b, The northernmost cline, illustrating the legacy of WHG and ANE-related populations. c,d, The upper (c) and lower (d) steppe clines are shown, both of which have substantial eastern contributions related to modern Tungusic speakers. The authors propose these populations are themselves the result of an admixture between groups related to the Nganasan, whose ancestors potentially occupied a wider range, and hunter-gatherers (HGs) from the Amur River Basin. While the upper steppe cline in c can be described as a mixture between this eastern ancestry and western steppe herders, the current model for the southern steppe cline as shown in d is not adequate and is likely confounded by interactions with diverse bordering ancestries. Credit: Ecoregions 2017, Resolve https://ecoregions2017.appspot.com/

Given the findings as reported in the paper, I think it should be much easier to describe different subclines in the “northernmost cline” than in the much more recent “Turkic/Mongolic cline”, which is nevertheless subdivided in this paper in two clines. As an example, there are at least two obvious clines with “Nganasan-related meta-populations” among Uralians, which converge in a common Steppe MLBA (i.e. Corded Ware) ancestry – one with Palaeo-Laplandic peoples, and another one with different Palaeo-Siberian populations:

siberian-clines-uralic-altaic
PCA of ancient and modern Eurasian samples. Ancient Palaeo-Laplandic, Palaeosiberian, and Altai clines drawn, with modern populations labelled. See a version with higher resolution.

The inclusion of certain Eurasian groups (or lack thereof) in the PCA doesn’t help to distinguish these subclines visually, and I guess the tiny “Naganasan-related” ancestral components found in some western populations (e.g. the famous ~5% among Estonians) probably don’t lend themselves easily to further subdivisions. Notice, nevertheless, the different components of the Eastern Eurasian source populations among Finno-Ugrians:

uralic-admixture-qpadm
Characterization of the Western and Eastern Eurasian source ancestries in inner Eurasian populations. [Modified from the paper, includes only Uralic populations]. a, Admixture f3 values are compared for different Eastern Eurasian (Mixe, Nganasan and Ulchi; green) and Western Eurasian references (Srubnaya and Chalcolithic Iranians (Iran_ChL); red). For each target group, darker shades mark more negative f3 values. b, Weights of donor populations in two sources characterizing the main admixture signal (date 1 and PC1) in the GLOBETROTTER analysis. We merged 167 donor populations into 12 groups (top right). Target populations were split into five groups (from top to bottom): Aleuts; the forest-tundra cline populations; the steppe-forest cline populations; the southern steppe cline populations; and ‘others’.

Also remarkable is the lack of comparison of Uralic populations with other neighbouring ones, since the described Uralic-like ancestry of Russians was already known, and is most likely due to the recent acculturation of Uralic-speaking peoples in the cradle of Russians, right before their eastward expansions.

west-eurasian-east-eurasian-ancestry
Supplementary Fig. 4. ADMIXTURE results qualitatively support PCA-based grouping of inner Eurasians into three clines. (A) Most southern steppe cline populations derive a higher proportion of their total Western Eurasian ancestry from a source related to Caucasus, Iran and South Asian populations. (B) Turkic- and Mongolic-speaking populations tend to derive their Eastern Eurasian ancestry more from the Devil’s Gate related one than from Nganasan-related one, while the opposite is true for Uralic- and Yeiseian-speakers. To estimate overall western Eurasian ancestry proportion, we sum up four components in our ADMIXTURE results (K=14), which are the dominant components in Neolithic Anatolians (“Anatolia_N”), Mesolithic western European hunter-gatherers (“WHG”), early Holocene Caucasus hunter-gatherers (“CHG”) and Mala from southern India, respectively. The “West / South Asian ancestry” is a fraction of it, calculated by summing up the last two components. To estimate overall Eastern Eurasian ancestry proportion, we sum up six components, most prevalent in Surui, Chipewyan, Itelmen, Nganasan, Atayal and early Neolithic Russian Far East individuals (“Devil’s Gate”). Eurasians into three clines. (A) Most southern steppe cline populations derive a higher proportion of their total Western Eurasian ancestry from a source related to Caucasus, Iran and South Asian populations. (B) Turkic- and Mongolic-speaking populations tend to derive their Eastern Eurasian ancestry more from the Devil’s Gate related one than from Nganasan-related one, while the opposite is true for Uralic- and Yeiseian-speakers. To estimate overall western Eurasian ancestry proportion, we sum up four components in our ADMIXTURE results (K=14), which are the dominant components in Neolithic Anatolians (“Anatolia_N”), Mesolithic western European hunter-gatherers (“WHG”), early Holocene Caucasus hunter-gatherers (“CHG”) and Mala from southern India, respectively. The “West / South Asian ancestry” is a fraction of it, calculated by summing up the last two components. To estimate overall Eastern Eurasian ancestry proportion, we sum up six components, most prevalent in Surui, Chipewyan, Itelmen, Nganasan, Atayal and early Neolithic Russian Far East individuals (“Devil’s Gate”).

A comparison of Estonians and Finns with Balts, Scandinavians, and Eastern Europeans would have been more informative for the division of the different so-called “Nganasan-like meta-populations”, and to ascertain which one of these ancestral peoples along the ancient WHG:ANE cline could actually be connected (if at all) to the Cis-Urals.

Because, after all, based on linguistics and archaeology, geneticists are not supposed to be looking for populations from the North Asian Arctic region, for “Siberian ancestry”, or for haplogroup N1c – despite previous works by their peers – , but for the Bronze Age Volga-Kama region…

Related

Iron Age bottleneck of the Proto-Fennic population in Estonia

tarand-graves-estonia-early-late

Demographic data and figures derived from Estonian Iron Age graves, by Raili Allmäe, Papers on Anthropology (2018) 27(2).

Interesting excerpts (emphasis mine):

Introduction

Inhumation and cremation burials were both common in Iron Age Estonia; however, the pattern which burials were prevalent has regional as well temporal peculiarities. In Estonia, cremation burials appear in the Late Bronze Age (1100–500 BC), for example, in stone-cist graves and ship graves, although inhumation is still characteristic of the period [28, 18]. Cremation burials have occasionally been found beneath the Late Bronze Age cists and the Early Iron Age (500 BC–450 AD) tarand graves [30, 28]. In south-eastern Estonia, including Setumaa, the tradition to bury cremated human remains in pit graves also appeared in the Bronze Age and lasted during the Pre-Roman period (500 BC–50 AD) and the Roman Iron Age (50–450 AD), and even up to the medieval times [30, 23, 33, 9]. During the Early Iron Age, cremations appear in cairn graves and have occasionally been found in many Pre-Roman early tarand graves where they appear with inhumations [27, 28, 19, 20, 21, 22, 24]. In Roman Iron Age tarand graves, cremation as well inhumation were practiced [28, 36, 37]; however, cremation was the prevailing burial practice during the Roman Iron Age, for example, in tarand graves in south-eastern Estonia [30, 28]. Roman Iron Age (50 AD–450 AD) burial sites have not been found in continental west Estonia [28, 38]). At the beginning of the Middle Iron Age (450–800 AD), burial sites, for example stone graves without a formal structure, like Maidla I, Lihula and Ehmja ‘Varetemägi’, appear in Läänemaa, west Estonia; in these graves cremations as well inhumations have been found [39, 48]. Like underground cremation burial, the stone grave without a formal structure was the most common grave type during the Late Iron Age (800– 1200 AD) in west Estonia [39, 35, 48]. In south-eastern and eastern Estonia, sand barrows with cremation burials appeared at the beginning of the Middle Iron Age. Cremation barrows are attributed to the Culture of Long Barrows and are most numerous in the villages Laossina and Rõsna in northern Setomaa, on the western shore of Lake Peipsi [8, 48]. Apparently during the Iron Age, the practiced burial customs varied in Estonia.

cist-grave-tarand
Typical prehistoric Estonian graves. Top: Cist-graves common during the Bronze Age, by Terker (GNU FDL 1.2). Bottom: Tarand graves of the Iron Age, by Marika Mägi (2017)

Abstract:

Three Iron Age cremation graves from south-eastern Estonia and four graves including cremations as well inhumations from western Estonia were analysed by osteological and palaeodemographic methods in order to estimate the age and sex composition of burial sites, and to propose some possible demographic figures and models for living communities.

The crude birth/death rate estimated on the basis of juvenility indices varied between 55.1‰ and 60.0‰ (58.5‰ on average) at Rõsna village in south-eastern Estonia in the Middle Iron Age. The birth/death rates based on juvenility indices for south eastern graves varied to a greater degree. The estimated crude birth/death rate was somewhat lower (38.9‰) at Maidla in the Late Iron Age and extremely high (92.1‰) at Maidla in the Middle Iron Age, which indicates an unsustainable community. High crude birth/death rates are also characteristic of Poanse tarand graves from the Pre-Roman Iron Age – 92.3‰ for the 1st grave and 69.6‰for the 2nd grave. Expectedly, newborn life expectancies are extremely low in both communities – 10.8 years at Poanse I and 14.4 years at Poanse II. Most likely, both Maidla I and Poanse I were unsustainable communities.

tarand-graves-estonia
Locations of the investigated Estonian Iron Age graves. Map by R. Allmäe

According to the main model where the given period of grave usage is 150 years, the burial grounds were most likely exploited by communities of 3–14 people. In most cases, this corresponds to one family or household. In comparison with other graves, Maidla II stone grave in western Estonia and Rõsna-Saare I barrow cemetery in south-eastern Estonia could have been used by a somewhat larger community, which may mean an extended family, a larger household or usage by two nuclear families.

More papers on the same subject by the author – who participated in the recent Mittnik et al. (2018) paper – include Observations On Estonian Iron Age Cremations (2013), and The demography of Iron Age graves in Estonia (2014).

Fast life history in Iron Age Estonia

While the demographic situation in the Gulf of Finland during the Iron Age is not well known – and demography is always difficult to estimate based on burials, especially when cremation is prevalent – , there is a clear genetic bottleneck in Finns, which has been estimated precisely to this period, coincident with the expansion of Proto-Fennic.

estonian-pca
PCA of Estonian samples from the Bronze Age, Iron Age and Medieval times. Tambets et al. (2018, upcoming).

The infiltration of N1c lineages in Estonia – the homeland of Proto-Fennic – happened during the Iron Age – as of yet found in 3 out of 5 sampled Tarand graves – , while the previous period was dominated by 100% R1a and Corded Ware + Baltic HG ancestry. With the Iron Age, a slight shift towards Corded Ware ancestry can be seen, which probably signals the arrival of warrior-traders from the Alanino culture, close to the steppe. They became integrated through alliances and intermarriages in a culture of chiefdoms dominated by hill forts. Their origin in the Mid-Volga area is witnessed by their material culture, such as Tarand-like graves (read here a full account of events).

This new political structure, reminiscent of the chiefdom system in Sintashta (with a similar fast life history causing a bottleneck of R1a-Z645 lineages), coupled with the expansion of Fennic (and displaced Saamic) peoples to the north, probably caused the spread of N1c-L392 among Finnic peoples. The linguistic influence of these early Iron Age trading movements from the Middle Volga region can be seen in similarities between Fennic and Mordvinic, which proves that the Fenno-Saamic community was by then not only separated linguistically, but also physically (unlike the period of long-term Palaeo-Germanic influence, where loanwords could diffuse from one to the other).

NOTE. Either this, or the alternative version: an increase in Corded Ware ancestry in Estonia during the Iron Age marks the arrival of the first Fennic speakers ca. 800 BC or later, splitting from Mordvinic? A ‘Mordvin-Fennic’ group in the Volga, of mainly Corded Ware ancestry…?? Which comes in turn from a ‘Volga-Saamic’ population of Siberian ancestry in the Artic region??? And, of course, Palaeo-Germanic widely distributed in North-Eastern Europe with R1a during the Bronze Age! Whichever model you find more logical.

Related

Corded Ware—Uralic (II): Finno-Permic and the expansion of N-L392/Siberian ancestry

finno-ugric-samoyedic

This is the second of four posts on the Corded Ware—Uralic identification:

I read from time to time that “we have not sampled Uralic speakers yet”, and “we are waiting to see when Uralic-speaking peoples are sampled”. Are we, though?

Proto-language homelands are based on linguistic data, such as guesstimates for dialectal evolution, loanwords and phonetic changes for language contacts, toponymy for ancient territories, etc. depending on the available information. The trace is then followed back, using available archaeological data, from the known historic speakers and territory to the appropriate potential prehistoric cultures. Only then can genetic analyses help us clarify the precise prehistoric population movements that better fit the models.

uralic-language-family
The traditional family tree of the Uralic branches. Kallio (2014)

The linguistic homeland

We thought – using linguistic guesstimates and fitting prehistoric cultures and their expansion – that Yamna was the Late Proto-Indo-European culture, so when Yamna was sampled, we had Late Proto-Indo-Europeans sampled. Simple deduction.

We thought that north-eastern Europe was a Uralic-speaking area during the Neolithic:

  • For those supporting a western continuity (and assuming CWC was Indo-European), the language was present at least since the Comb Ware culture, potentially since the Mesolithic.
  • For those supporting a late introduction into Finland, Uralic expanded the latest with Abashevo-related movements after its incorporation of Volosovo and related hunter-gatherers.

The expansion to the east must have happened through progressive infiltrations with Seima-Turbino / Andronovo-related expansions.

uralic-time-space
Some datings for the traditional proto-stages from Uralic to Finnic. Kallio (2014).

Finding the linguistic homeland going backwards can be described today as follows:

I. Proto-Fennic homeland

Based on the number of Baltic loanwords, not attested in the more eastern Uralic branches (and reaching only partially Mordvinic), the following can be said about western Finno-Permic languages (Junttila 2014):

The Volga-Kama Basin lies still too far east to be included in a list of possible contact locations. Instead, we could look for the contact area somewhere between Estonia in the west and the surroundings of Moscow in the east, a zone with evidence of Uralic settlement in the north and Baltic on the south side.

The only linguistically well-grounded version of the Stone Age continuation theory was presented by Mikko Korhonen in 1976. Its validity, however, became heavily threatened when Koivulehto 1983a-b proved the existence of a Late Proto-Indo-European or Pre-Baltic loanword layer in Saami, Finnic, and Mordvinic. Since this layer must precede the Baltic one and it was presumably acquired in the Baltic Sea region, Koivulehto posited it on the horizon of the Battle Axe period. This forces a later dating for the Baltic–Finnic contacts.

Today the Battle Axe culture is dated at 3200 to 3000 BC, a period far too remote to correspond linguistically with Proto-Baltic (Kallio 1998a).

Since the Baltic contacts began at a very initial phase of Proto-Finnic, the language must have been relatively uniform at that time. Hence, if we consider that the layer of Baltic loanwords may have spread over the Gulf of Finland at that time, we could also insist that the whole of the Proto-Finnic language did so.

migration-theory
Prehistoric Balts as the southern neighbours of Proto-Finnic speakers. 1 = The approximated area of Proto-Uralic. 2 = The approximated area of Finnic during the Iron Age. 3 = The area of ancient Baltic hydronyms. 4 = The area of Baltic languages in about 1200 AD. 5 = The problem: When did Uralic expand westwards and when did it meet Baltic? Junntila (2012).

II. Proto-Finno-Saamic homeland

The evidence of continued Palaeo-Germanic loanwords (from Pre- to Proto-Germanic stages) is certainly the most important data to locate the Finno-Saamic homeland, and from there backwards into the true Uralic homeland. Following Kallio (2017):

(…) the loanword evidence furthermore suggests that the ancestors of Finnic and Saamic had at least phonologically remained very close to Proto-Uralic as late as the Bronze Age (ca. 1700–500 BC). In particular, certain loanwords, whose Baltic and Germanic sources point to the first millennium BC, after all go back to the Finno-Saamic proto-stage, which is phonologically almost identical to the Uralic proto-stage (see especially the table in Sammallahti 1998: 198–202). This being the case, Dahl’s wave model could perhaps have some use in Uralic linguistics, too.

The presence of Pre-Germanic loanwords points rather to the centuries around the turn of the 2nd – 1st millennium BC or earlier. Proto-Germanic words must have been borrowed before the end of Germanic influence in the eastern Baltic at the beginning of the Iron Age, which sets a clear terminus ante quem ca. 800 BC.

The arrival of Bell Beaker peoples in Scandinavia ca. 2350 BC, heralding the formation of the Dagger Period, as well as the development of Pre-Germanic in common with Finnic-like populations point to the late 3rd / early 2nd millennium BC as the first time of close interaction through the Baltic region.

III. Proto-Uralic homeland

(…) the earliest Indo-European loanwords in the Uralic languages (…) show that Proto-Uralic cannot have been spoken much earlier than Proto-Indo-European dated about 3500 BC (Koivulehto 2001: 235, 257). As the same loanword evidence naturally also shows that the Uralic and Indo-European homelands were not located far from one another, the Uralic homeland can most likely be located in the Middle and Upper Volga region, right north of the Indo-European homeland*. From the beginning of the Subneolithic period about 5900 BC onwards, this region was an important innovation centre, from where several cultural waves spread to the Finnish Gulf area, such as the Sperrings Ware wave about 4900 BC, the Combed Ware wave about 3900 BC, and the Netted Ware wave about 1900 BC (Carpelan & Parpola 2001: 78–90).

The mainstream position is nowadays trying to hold together the traditional views of Corded Ware as Indo-European, and a Uralic Fennoscandia during the Bronze Age.

The following is an example of how this “Volosovo/Forest Zone hunter-gatherer theory” of Uralic origins looks like, as a ‘mixture’ of cultures and languages that benefits from the lack of genetic data for certain regions and periods (taken from Parpola 2018):

asbestos-ware
The extent of Typical Comb Ware (TCW), Asbestos- and Organic-tempered Wares (AOW) and Volosovo and Garino-Bor cultures; areas with deposits of native copper in Karelia and copperbearing sandstone in Volga-Kama-area are marked dark gray (after Zhuravlev 1977; Krajnov 1987; Nagovitsyn 1987; Chernykh 1992; Carpelan 1999; Zhul´nikov 1999). From Nordqvist et al. (2012).

The Corded Ware (or Battle Axe) culture intruded into the Eastern Baltic and coastal Finland already around 3100 BCE. The continuity hypothesis maintains that the early Proto-Finnic speakers of the coastal regions, who had come to Finland in the 4th millennium BCE with the Comb-Pitted Ware, coexisted with the Corded Ware newcomers, gradually adopting their pastoral culture and with it a number of NW-IE loanwords, but assimilating the immigrants linguistically.

The fusion of the Corded Ware and the local Comb-Pitted Ware culture resulted into the formation of the Kiukais culture (c. 2300–1500) of southwestern Finland, which around 2300 received some cultural impulses from Estonia, manifested in the appearance of the Western Textile Ceramic (which is different from the more easterly Textile Ceramic or Netted Ware, and which is first attested in Estonia c. 2700 BCE, cf. Kriiska & Tvauri 2007: 88), and supposed to have been accompanied by an influx of loanwords coming from Proto-Baltic. At the same time, the Kiukais culture is supposed to have spread the custom of burying chiefs in stone cairns to Estonia.

The coming of the Corded Ware people and their assimilation created a cultural and supposedly also a linguistic split in Finland, which the continuity hypothesis has interpreted to mean dividing Proto-Saami-Finnic unity into its two branches. Baltic Finnic, or simply Finnic, would have emerged in the coastal regions of Finland and in the northern East Baltic, while preforms of Saami would have been spoken in the inland parts of Finland.

The Nordic Bronze Age culture, correlated above with early Proto-Germanic, exerted a strong influence upon coastal Finland and Estonia 1600–700 BCE. Due to this, the Kiukais culture was transformed into the culture of Paimio ceramics (c. 1600–700 BCE), later continued by Morby ceramics (c. 700 BCE – 200 CE). The assumption is that clear cultural continuity was accompanied by linguistic continuity. Having assimilated the language of the Germanic traders and relatively few settlers of the Bronze Age, the language of coastal Finland is assumed to have reached the stage of Proto-Finnish at the beginning of the Christian era. In Estonia, the Paimio ceramics have a close counterpart in the contemporaneous Asva ceramics.

Eastern homelands?

I will not comment on Siberian or Central Asian homeland proposals, because they are obviously not mainstream, still less today when we know that Uralic was certainly in contact with Proto-Indo-European, and then with Pre- and Proto-Indo-Iranian, as supported even by the Copenhagen group in Damgaard et al. (2018).

This is what Kallio (2017) has to say about the agendas behind such proposals:

Interestingly, the only Uralicists who generally reject the Central Russian homeland are the Russian ones who prefer the Siberian homeland instead. Some Russians even advocate that the Central Russian homeland is only due to Finnish nationalism or, as one of them put it a bit more tactfully, “the political and ideological situation in Finland in the first decades of the 20th century” (Napolskikh 1995: 4).

Still, some Finns (and especially those who also belong to the “school who wants it large and wants it early”) simultaneously advocate that exactly the same Central Russian homeland is due to Finnlandisierung (Wiik 2001: 466).

Hence, for those of you willing to learn about fringe theories not related to North-Eastern Europe, you also have then the large and early version of the Uralic homeland, with Wiik’s Palaeolithic continuity of Uralic peoples spread over all of eastern and central Europe (hence EHG and R1a included):

atlantic-finnic-theory
Palaeolithic boat peoples and Finno-Ugric. Source

These fringe Finnish theories look a lot like the Corded Ware expansion… Better not go the Russian or Finnish nationalist ways? Agreed then, let’s discuss only rational proposals based on current data.

The archaeological homeland

For a detailed account of the Corded Ware expansion with Battle Axe, Fatyanovo-Balanovo, and Abashevo groups into the area, you can read my recent post on the origin of R1a-Z645.

1. Textile ceramics

During the 2nd millennium BC, textile impressions appear in pottery as a feature across a wide region, from the Baltic area through the Volga to the Urals, in communities that evolve from late Corded Ware groups without much external influence.

While it has been held that this style represents a north-west expansion from the Volga region (with the “Netted Ware” expansion), there are actually at least two original textile styles, one (earlier) in the Gulf of Finland, common in the Kiukainen pottery, which evolves into the Textile ware culture proper, and another which seems to have an origin in the Middle Volga region to the south-east.

The Netted ware culture is the one that apparently expands into inner Finland – a region not densely occupied by Corded Ware groups until then. There are, however, no clear boundaries between groups of both styles; textile impressions can be easily copied without much interaction or population movement; and the oldest textile ornamentation appeared on the Gulf of Finland. Hence the tradition of naming all as groups of Textile ceramics.

textile-ware-cultures
Maximum distribution of Textile ceramics during the Bronze Age (ca. 2000-800 BC). Asbestos-tempered ware lies to the north (and is also continued in western Fennoscandia).

The fact that different adjacent groups from the Gulf of Finland and Forest Zone share similar patterns making it very difficult to differentiate between ‘Netted Ware’ or ‘Textile Ware’ groups points to:

  • close cultural connections that are maintained through the Gulf of Finland and the Forest Zone after the evolution of late Corded Ware groups; and
  • no gross population movements in the original Battle Axe / Fatyanovo regions, except for the expansion of Netted Ware to inner Finland, Karelia, and the east, where the scattered Battle Axe finds and worsening climatic conditions suggest most CWC settlements disappeared at the end of the 3rd millennium BC and recovered only later.

NOTE. This lack of population movement – or at least significant replacement by external, non-CWC groups – is confirmed in genetic investigation by continuity of CWC-related lineages (see below).

The technology present in Textile ceramics is in clear contrast to local traditions of sub-Neolithic Lovozero and Pasvik cultures of asbestos-tempered pottery to the north and east, which point to a different tradition of knowledge and learning network – showing partial continuity with previous asbestos ware, since these territories host the main sources of asbestos. We have to assume that these cultures of northern and eastern Fennoscandia represent Palaeo-European (eventually also Palaeo-Siberian) groups clearly differentiated from the south.

The Chirkovo culture (ca. 1800-700 BC) forms on the middle Volga – at roughly the same time as Netted Ware formed to the west – from the fusion of Abashevo and Balanovo elites on Volosovo territory, and is also related (like Abashevo) to materials of the Seima-Turbino phenomenon.

Bronze Age ethnolinguistic groups

In the Gulf of Finland, Kiukainen evolves into the Paimio ceramics (in Finland) — Asva Ware (in Estonia) culture, which lasts from ca. 1600 to ca. 700 BC, probably representing an evolving Finno-Saamic community, while the Netted Ware from inner Finland (the Sarsa and Tomitsa groups) and the groups from the Forest Zone possibly represent a Volga-Finnic community.

NOTE. Nevertheless, the boundaries between Textile ceramic groups are far from clear, and inner Finland Netted Ware groups seem to follow a history different from Netted Ware groups from the Middle and Upper Volga, hence they could possibly be identified as an evolving Pre-Saamic community.

Based on language contacts, with Early Baltic – Early Finnic contacts starting during the Iron Age (ca. 500 BC onwards), this is a potential picture of the situation at the end of this period, when Germanic influence on the coast starts to fade, and Lusatian culture influence is stronger:

aikio-finnic-saamic
The linguistic situation in Lapland and the northern Baltic Sea Area in the Early Iron Age prior to the expansion of Saami languages; the locations of the language groups are schematic. The black line indicates the distribution of Saami languages in the 19th century, and the gray line their approximate maximal distribution before the expansion of Finnic. Aikio (2012)

The whole Finno-Permic community remains thus in close contact, allowing for the complicated picture that Kallio mentions as potentially showing Dahl’s wave model for Uralic languages.

Genetic data shows a uniform picture of these communities, with exclusively CWC-derived ancestry and haplogroups. So in Mittnik et al. (2018) all Baltic samples show R1a-Z645 subclades, while the recent session on Estonian populations in ISBA 8 (see programme in PDF) clearly states that:

[Of the 24 Bronze Age samples from stone-cist graves] all 18 Bronze Age males belong to R1a.

Regarding non-Uralic substrates found in Saami, supposedly absorbed during the expansion to the north (and thus representing languages spoken in northern Fennoscandia during the Bronze Age) this is what Aikio (2012) has to say:

The Saami substrate in the Finnish dialects thus reveals that also Lakeland Saami languages had a large number of vocabulary items of obscure origin. Most likely many of these words were substrate in Lakeland Saami, too, and ultimately derive from languages spoken in the region before Saami. In some cases the loan origin of these words is obvious due to their secondary Proto-Saami vowel combinations such as *ā–ë in *kāvë ‘bend; small bay’ and *šāpšë ‘whitefish’. This substrate can be called ‘Palaeo-Lakelandic’, in contrast to the ‘Palaeo-Laplandic’ substrate that is prominent in the lexicon of Lapland Saami. As the Lakeland Saami languages became extinct and only fragments of their lexicon can be reconstructed via elements preserved in Finnish place-names and dialectal vocabulary, we are not in a position to actually study the features of this Palaeo-Lakelandic substrate. Its existence, however, appears evident from the material above.

If we wanted to speculate further, based on the data we have now, it is very likely that two opposing groups will be found in the region:

A) The central Finnish group, in this hypothesis the Palaeo-Lakelandic group, made up of the descendants of the Mesolithic pioneers of the Komsa and Suomusjärvi cultures, and thus mainly Baltic HG / Scandinavian HG ancestry and haplogroups I / R1b(xM269) (see more on Scandinavian HG).

siberian-ancestry-map
Frequency map of the so-called ‘Siberian’ component. From Tambets et al. (2018).

B) Lapland and Kola were probably also inhabited by similar Mesolithic populations, until it was eventually assimilated by expanding Siberian groups (of Siberian ancestry and N1c-L392 lineages) from the east – entering the region likely through the Kola peninsula – , forming the Palaeo-Laplandic group, which was in turn later replaced by expanding Proto-Saamic groups.

Siberian ancestry appears first in Fennoscandia at Bolshoy Oleni Ostrov ca. 1520 BC, with haplogroup N1c-L392 (2 samples, BOO002 and BOO004), and with Siberian ancestry. This is their likely movement in north-eastern Europe, from Lamnidis et al (2018):

The large Siberian component in the Bolshoy individuals from the Kola Peninsula provides the earliest direct genetic evidence for an eastern migration into this region. Such contact is well documented in archaeology, with the introduction of asbestos-mixed Lovozero ceramics during the second millenium BC, and the spread of even-based arrowheads in Lapland from 1,900 BCE. Additionally, the nearest counterparts of Vardøy ceramics, appearing in the area around 1,600-1,300 BCE, can be found on the Taymyr peninsula, much further to the east. Finally, the Imiyakhtakhskaya culture from Yakutia spread to the Kola Peninsula during the same period.

saamic-lovozero-pca
PCA plot of 113 Modern Eurasian populations, with individuals from this study projected on the principal components. Uralic speakers are highlighted in light purple. Image modified from Lamnidis et al. (2018)

Obviously, these groups of asbestos-tempered ware are not connected to the Uralic expansion. From the same paper:

The fact that the Siberian genetic component is consistently shared among Uralic-speaking populations, with the exceptions of Hungarians and the non-Uralic speaking Russians, would make it tempting to equate this component with the spread of Uralic languages in the area. However, such a model may be overly simplistic. First, the presence of the Siberian component on the Kola Peninsula at ca. 4000 yBP predates most linguistic estimates of the spread of Uralic languages to the area. Second, as shown in our analyses, the admixture patterns found in historic and modern Uralic speakers are complex and in fact inconsistent with a single admixture event. Therefore, even if the Siberian genetic component partly spread alongside Uralic languages, it likely presented only an addition to populations carrying this component from earlier.

2. The Early Iron Age

The Ananino culture appears in the Vyatka-Kama area, famed for its metallurgy, with traditions similar to the North Pontic area, by this time developing Pre-Sauromatian traditions. It expanded to the north in the first half of the first millennium BC, remaining in contact with the steppes, as shown by the ‘Scythian’ nature of its material culture.

NOTE. The Ananino culture can be later followed through its zoomorphic styles into Iron Age Pjanoborskoi and Gljadenovskoi cultures, later to Ural-Siberian Middle Age cultures – Itkuska, Ust’-Poluiska, Kulaiska cultures –, which in turn can be related as prototypes of medieval Permian styles.

ananino-culture-homeland
Territory of (early and maximum) Ananino material culture. Vasilyev (2002).

At the same time as the Ananino culture begins to expand ca. 1000 BC, the Netted Ware tradition from the middle Oka expanded eastwards into the Oka-Vyatka interfluve of the middle Volga region, until then occupied by the Chirkovo culture. Eventually the Akozino or Akhmylovo group (ca. 800-300 BC) emerged from the area, showing a strong cultural influence from the Ananino culture, by that time already expanding into the Cis-Urals region.

The Akozino culture remains nevertheless linked to the western Forest Zone traditions, with long-ranging influences from as far as the Lusatian culture in Poland (in metallurgical techniques), which at this point is also closely related with cultures from Scandinavia (read more on genetics of the Tollense Valley).

malar-celts-ananino
Mälar celts and molds for casting (a) and the main distribution area (в) of Mälar-type celts of the Mälar type in the Volga-Kama region (according to Kuzminykh 1983: figure 92) and Scandinavia (according to Baudou 1960: Karte 10); Ananino celts and molds for casting (б) and the main distribution area (г) of the distribution of the celts of the Ananino type in the Volga-Kama area (according to Kuzminykh 1983: figure 9); dagger of Ananino type (д).Map from (Yushkova 2010)

Different materials from Akozino reach Fennoscandia late, at the end of the Bronze Age and beginning of the Early Iron Age, precisely when the influence of the Nordic Bronze Age culture on the Gulf of Finland was declining.

This is a period when Textile ceramic cultures in north-eastern Europe evolve into well-armed chiefdom-based groups, with each chiefdom including thousands or tens of thousands, with the main settlements being hill forts, and those in Fennoscandia starting ca. 1000-400 BC.

Mälar-type celts and Ananino-type celts appear simultaneously in Fennoscandia and the Forest Zone, with higher concentrations in south-eastern Sweden (Mälaren) and the Volga-Kama region, supporting the existence of a revived international trade network.

akozino-malar-axes-fennoscandia
Distribution of the Akozino-Mälar axes according to Sergej V. Kuz’minykh (1996: 8, Abb. 2).

The Paimio—Asva Ware culture evolves (ca. 700-200 BC) into the Morby (in Finland) — Ilmandu syle (in Estonia, Latvia, and Mälaren) culture. The old Paimio—Asva tradition continues side by side with the new one, showing a clear technical continuity with it, but with ornamentation compared to the Early Iron Age cultures of the Upper Volga area. This new south-eastern influence is seen especially in:

  • Akozino-Mälar axes (ca. 800-500 BC): introduced into the Baltic area in so great numbers – especially south-western Finland, the Åland islands, and the Mälaren area of eastern Sweden – that it is believed to be accompanied by a movement of warrior-traders of the Akozino-Akhmylovo culture, following the waterways that Vikings used more than a thousand years later. Rather than imports, they represent a copy made with local iron sources.
  • Tarand graves (ca. 500 BC – AD 400): these ‘mortuary houses’ appear in the coastal areas of northern and western Estonia and the islands, at the same time as similar graves in south-western Finland, eastern Sweden, northern Latvia and Courland. Similar burials are found in Akozino-Akhmylovo, with grave goods also from the upper and middle Volga region, while grave goods show continuity with Textile ware.

The use of asbestos increases in mainland Finnish wares with Kjelmøy Ware (ca. 700 BC – AD 300), which replaced the Lovozero Ware; and in the east in inner Finland and Karelia with the Luukonsaari and Sirnihta wares (ca. 700-500 BC – AD 200), where they replaced the previous Sarsa-Tomitsa ceramics.

The Gorodets culture appears during the Scythian period in the forest-steppe zone north and west of the Volga, shows fortified settlements, and there are documented incursions of Gorodets iron makers into the Samara valley, evidenced by deposits of their typical pottery and a bloom or iron in the region.

Iron Age ethnolinguistic groups

According to (Koryakova and Epimakhov 2007):

It is commonly accepted by archaeology, ethnography, and linguistics that the ancestors of the Permian peoples (the Udmurts, Komi-Permians, and Komi-Zyryans) left the sites of Ananyino cultural intercommunity.

NOTE. For more information on the Late Metal Ages and Early Medieval situation of Finno-Ugric languages, see e.g. South-eastern contact area of Finnic languages in the light of onomastics (Rahkonen 2013).

finno-saamic-mordvin
Yakhr-, -khra, yedr-, -dra and yer-/yar, -er(o), -or(o) names of lakes in Central and North Russia and the possible boundary of the proto-language words *jäkra/ä and *järka/ä. Rahkonen (2011)

Certain innovations shared between Proto-Fennic (identified with the Gulf of Finland) and Proto-Mordvinic (from the Gorodets culture) point to their close contact before the Proto-Fennic expansion, and thus to the identification of Gorodets as Proto-Mordvinic, hence Akozino as Volgaic (Parpola 2018):

  • the noun paradigms and the form and function of individual cases,
  • the geminate *mm (foreign to Proto-Uralic before the development of Fennic under Germanic influence) and other non-Uralic consonant clusters.
  • the change of numeral *luka ‘ten’ with *kümmen.
  • The presence of loanwords of non-Uralic origin, related to farming and trees, potentially Palaeo-European in nature (hence possibly from Siberian influence in north-eastern Europe).
ananino-textile-ware-cultures
Map of archaeological cultures in north-eastern Europe ca. 8th-3rd centuries BC. [The Mid-Volga Akozino group not depicted] Shaded area represents the Ananino cultural-historical society. Purple area show likely zones of predominant Siberian ancestry and N1c-L392 lineages. Blue areas likely zones of predominant CWC ancestry and R1a-Z645 lineages. Fading purple arrows represent likely stepped movements of haplogroup N1c-L392 for centuries (Siberian → Ananino → Akozino → Fennoscandia), found eventually in tarand graves. Blue arrows represent eventual expansions of Fennic and (partially displaced) Saamic. Modified image from Vasilyev (2002).

The introduction of a strongly hierarchical chiefdom system can quickly change the pre-existing social order and lead to a major genetic shift within generations, without a radical change in languages, as shown in Sintashta-Potapovka compared to the preceding Poltavka society (read more about Sintashta).

Fortified settlements in the region represented in part visiting warrior-traders settled through matrimonial relationships with local chiefs, eager to get access to coveted goods and become members of a distribution network that could guarantee them even military assistance. Such a system is also seen synchronously in other cultures of the region, like the Nordic Bronze Age and Lusatian cultures (Parpola 2013).

The most likely situation is that N1c subclades were incorporated from the Circum-Artic region during the Anonino (Permic) expansion to the north, later emerged during the formation of the Akozino group (Volgaic, under Anonino influence), and these subclades in turn infiltrated among the warrior traders that spread all over Fennoscandia and the eastern Baltic (mainly among Fennic, Saamic, Germanic, and Balto-Slavic peoples), during the age of hill forts, creating alliances partially based on exogamy strategies (Parpola 2013).

Over the course of these events, no language change is necessary in any of the cultures involved, since the centre of gravity is on the expanding culture incorporating new lineages:

  • first on the Middle Volga, when Ananino expands to the north, incorporatinig N1c lineages from the Circum-Artic region.
  • then with the expansion of the Akozino-Akhmylovo culture into Ananino territory, admixing with part of its population;
  • then on the Baltic region, when materials are imported from Akozino into Fennoscandia and the eastern Baltic (and vice versa), with local cultures being infiltrated by foreign (Akozino) warrior-traders and their materials;
  • and later with the different population movements that led eventually to a greater or lesser relevance of N1c in modern Finno-Permic populations.

To argue that this infiltration and later expansion of lineages changed the language in one culture in one of these events seems unlikely. To use this argument of “opposite movement of ethnic and language change” for different successive events, and only on selected regions and cultures (and not those where the greatest genetic and cultural impact is seen, like e.g. Sweden for Akozino materials) is illogical.

NOTE. Notice how I write here about “infiltration” and “lineages”, not “migration” or “populations”. To understand that, see below the next section on autosomal studies to compare Bronze Age, Iron Age, Medieval and Modern Estonians, and see how little the population of Estonia (homeland of Proto-Fennic and partially of Proto-Finno-Saamic) has changed since the Corded Ware migrations, suggesting genetic continuity and thus mostly close inter-regional and intra-regional contacts in the Forest Zone, hence a very limited impact of the absorbed N1c lineages (originally at some point incorporated from the Circum-Artic region). You can also check on the most recent assessment of R1a vs. N1c in modern Uralic populations.

Iron Age and later populations

From the session on Estonian samples on ISBA 8, by Tambets et al.:

[Of the 13 samples from the Iron Age tarand-graves] We found that the Iron Age individuals do in fact carry chrY hg N3 (…) Furthermore, based on their autosomal data, all of the studied individuals appear closer to hunter-gatherers and modern Estonians than Estonian CWC individuals do.

EDIT (16 OCT) A recent abstract with Saag as main author (Tambets second) cites 3 out of 5 sampled Iron Age individuals as having haplogroup N3.

EDIT (28 OCT): Notice also the appearance of N1a1a1a1a1a1a1-L1025 in Lithuania (ca. 300 AD), from Damgaard (Nature 2018); the N1c sample of the Krivichi Pskov Long Barrows culture (ca. 8th-10th c. AD), and N1a1a1a1a1a1a7-Y4341 among late Vikings from Sigtuna (ca. 10th-12th c. AD) in Krzewinska (2018).

estonian-pca
PCA of Estonian samples from the Bronze Age, Iron Age and Medieval times. Tambets et al. (2018, upcoming).

Looking at the plot, the genetic inflow marking the change from the Bronze Age to the Iron Age looks like an obvious expansion of nearby peoples with CWC-related ancestry, i.e. likely from the south-east, near the Middle Volga, where influence of steppe peoples is greater (hence likely Akozino) into a Proto-Fennic population already admixed (since the arrival of Corded Ware groups) with Comb Ware-like populations.

All of these groups were probably R1a-Z645 (likely R1a-Z283) since the expansion of Corded Ware peoples, with an introduction of some N1c lineages precisely during this Iron Age period. This infiltration of N1c-L392 with Akozino is obviously not directly related to Siberian cultures, given what we know about the autosomal description of Estonian samples.

Rather, N1c-L392 lineages were likely part of the incoming (Volgaic) Akozino warrior-traders, who settled among developing chiefdoms based on hill fort settlements of cultures all over the Baltic area, and began to appear thus in some of the new tarand graves associated with the Iron Age in north-eastern Europe.f

A good way to look at this is to realize that no new cluster appears compared to the data we already have from Baltic LN and BA samples from Mittnik et al. (2018), so the Estonian BA and IA clusters must be located (in a proper PCA) in the cline from Pit-Comb Ware culture through Baltic BA to Corded Ware groups:

baltic-samples
PCA and ADMIXTURE analysis reflecting three time periods in Northern European prehistory. a Principal components analysis of 1012 present-day West Eurasians (grey points, modern Baltic populations in dark grey) with 294 projected published ancient and 38 ancient North European samples introduced in this study (marked with a red outline). Population labels of modern West Eurasians are given in Supplementary Fig. 7 and a zoomed-in version of the European Late Neolithic and Bronze Age samples is provided in Supplementary Fig. 8. b Ancestral components in ancient individuals estimated by ADMIXTURE (k = 11)

This genetic continuity from Corded Ware (the most likely Proto-Uralic homeland) to the Proto-Fennic and Proto-Saamic communities in the Gulf of Finland correlates very well with the known conservatism of Finno-Saamic phonology, quite similar to Finno-Ugric, and both to Proto-Uralic (Kallio 2017): The most isolated region after the expansion of Corded Ware peoples, the Gulf of Finland, shielded against migrations for almost 1,500 years, is then the most conservative – until the arrival of Akozino influence.

NOTE. This has its parallel in the phonetic conservatism of Celtic or Italic compared to Finno-Ugric-influenced Germanic, Balto-Slavic, or Indo-Iranian.

Only later would certain regions (like Finland or Lappland) suffer Y-DNA bottlenecks and further admixture events associated with population displacements and expansions, such as the spread of Fennic peoples from their Estonian homeland (evidenced by the earlier separation of South Estonian) to the north and east:

diversification-finnic
The Finnic family tree. Kallio (2014).

The initial Proto-Fennic expansion was probably coupled with the expansion of Proto-Saami to the north, with the Kjelmøy Ware absorbing the Siberian population of Lovozero Ware, and potentially in inner Finland and Karelia with the Luukonsaari and Sirnihta wares (Carpelan and Parpola 2017).

This Proto-Saami population expansion from the mainland to the north, admixing with Lovozero-related peoples, is clearly reflected in the late Iron Age Saamic samples from Levänluhta (ca. 400-800 AD), as a shift (of 2 out of 3 samples) to Siberian-like ancestry from their original CWC_Baltic-like situation (see PCA from Lamnidis et al. 2018 above).

Also, Volgaic and Permic populations from inner Finland and the Forest Zone to the Cis-Urals and Circum-Artic regions probably incorporate Siberian ancestry and N1c-L392 lineages during these and later population movements, while the westernmost populations – Estonian, Mordvinic – remain less admixed (see PCA from Tambets et al. 2018 below).

We also have data of N1c-L392 in Nordic territory in the Middle Ages, proving its likely strong presence in the Mälaren area since the Iron Age, with the arrival of Akozino warrior traders. Similarly, it is found among Balto-Slavic groups along the eastern Baltic area. Obviously, no language change is seen in Nordic Bronze Age and Lusatian territory, and none is expected in Estonian or Finnish territory, either.

Therefore, no “N1c-L392 + Siberian ancestry” can be seen expanding Finno-Ugric dialects, but rather different infiltrations and population movements with limited effects on ancestry and Y-DNA composition, depending on the specific period and region.

estonians-hungarians-mordvinian
Selection of the PCA, with the group of Estonians, Mordovians, and Hungarians selected. See Tambets et al. (2018) for more information.

An issue never resolved

Because N1c-L392 subclades & Siberian ancestry, which appear in different proportions and with different origins among some modern Uralic peoples, do not appear in cultures supposed to host Uralic-speaking populations until the Iron Age, people keep looking into any direction to find the ‘true’ homeland of those ‘Uralic N1c peoples’? Kind of a full circular reasoning, anyone? The same is valid for R1a & steppe ancestry being followed for ‘Indo-Europeans’, or R1b-P312 & Neolithic farmer ancestry being traced for ‘Basques’, because of their distribution in modern populations.

I understand the caution of many pointing to the need to wait and see how samples after 2000 BC are like, in every single period, from the middle and upper Volga, Kama, southern Finland, and the Forest Zone between Fennoscandia and the steppe. It’s like waiting to see how people from Western Yamna and the Carpathian Basin after 3000 BC look like, to fill in what is lacking between East Yamna and Bell Beakers, and then between them and every single Late PIE dialect.

But the answer for Yamna-Bell Beaker-Poltavka peoples during the Late PIE expansion is always going to be “R1b-L23, but with R1a-Z645 nearby” (we already have a pretty good idea about that); and the answer for the Forest Zone and northern Cis- and Trans-Urals area – during the time when Uralic languages are known to have already been spoken there – is always going to be “R1a-Z645, but with haplogroup N nearby”, as is already clear from the data on the eastern Baltic region.

So, without a previously proposed model as to where those amateurs expressing concern about ‘not having enough data’ expect to find those ‘Uralic peoples’, all this waiting for the right data looks more like a waiting for N1c and Siberian ancestry to pop up somewhere in the historic Uralic-speaking area, to be able to say “There! A Uralic-speaking male!”. Not a very reasonable framework to deal with prehistoric peoples and their languages, I should think.

But, for those who want to do that, let me break the news to you already:

ananino-culture-balto-slavic
First N1c – Finno-Ugric person arrives in Estonia to teach Finno-Saamic to Balto-Slavic peoples.

And here it is, an appropriate fantasy description of the ethnolinguistic groups from the region. You are welcome:

  • During the Bronze Age, late Corded Ware groups evolve as the western Textile ware Fennic Balto-Slavic group in the Gulf of Finland; the Netted Ware Saamic Balto-Slavic group of inner Finland; the south Netted Ware / Akozino Volgaic Balto-Slavic groups of the Middle Volga; and the Anonino Permic Balto-Slavic group in the north-eastern Forest Zone; all developing still in close contact with each other, allowing for common traits to permeate dialects.
  • These Balto-Slavic groups would then incorporate west of the Urals during and after the Iron Age (ca. 800-500 BC first, and also later during their expansion to the north) limited ancestry and lineages from eastern European hunter-gatherer groups of Palaeo-European Fennic and Palaeo-Siberian Volgaic and Permic languages from the Circum-Artic region, but they adopted nevertheless the language of the newcomers in every single infiltration of N1c lineages and/or admixture with Siberian ancestry. Oh and don’t forget the Saamic peoples from central Sweden, of course, the famous N1c-L392 ‘Rurikid’ lineages expanding Saamic to the north and replacing Proto-Germanic…

The current model for those obsessed with modern Y-DNA is, therefore, that expanding Neolithic, Bronze Age and Iron Age cultures from north-eastern Europe adopted the languages of certain lineages originally from sub-Neolithic (Scandinavian and Siberian) hunter-gatherer populations of the Circum-Artic region; lineages that these cultures incorporated unevenly during their expansions. Hmmmm… Sounds like an inverse Western movie, where expanding Americans end up speaking Apache, and the eastern coast speaks Spanish until Italian migrants arrive and make everyone speak English… or something. A logic, no-nonsense approach to ethnolinguistic identification.

I kid you not, this is the kind of models we are going to see very soon. In 2018 and 2019, with ancient DNA able to confirm or reject archaeological hypotheses based on linguistic data, people will keep instead creating new pet theories to support preconceived ideas based on the Y-DNA prevalent among modern populations. That is, information available in the 2000s.

So what’s (so much published) ancient DNA useful for, exactly?

[Next post on the subject: Corded Ware—Uralic (III): Seima-Turbino and the Ugric and Samoyedic expansion]

See also

Related

Haplogroup R1a and CWC ancestry predominate in Fennic, Ugric, and Samoyedic groups

uralic-languages

Open access Genes reveal traces of common recent demographic history for most of the Uralic-speaking populations, by Tambets et al. Genome Biology (2018).

Interesting excerpts (emphasis mine):

Methods

A total of 286 samples of Uralic-speaking individuals, of those 121 genotyped in this study, were analysed in the context of 1514 Eurasian samples (including 14 samples published for the first time) based on whole genome single nucleotide polymorphisms (SNPs) (Additional file 1: Table S1). All these samples, together with the larger sample set of Uralic speakers, were characterized for mtDNA and chrY markers.

The question as which material cultures may have co-spread together with proto-Uralic and Uralic languages depends on the time estimates of the splits in the Uralic language tree. Deeper age estimates (6,000 BP) of the Uralic language tree suggest a connection between the spread of FU languages from the Volga River basin towards the Baltic Sea either with the expansion of the Neolithic culture of Combed Ware, e.g. [6, 7, 17, 26] or with the Neolithic Volosovo culture [7]. Younger age estimates support a link between the westward dispersion of Proto-Finno-Saamic and eastward dispersion of Proto-Samoyedic with a BA Sejma-Turbino (ST) cultural complex [14, 18, 27, 28] that mediated the diffusion of specific metal tools and weapons from the Altai Mountains over the Urals to Northern Europe or with the Netted Ware culture [23], which succeeded Volosovo culture in the west. It has been suggested that Proto-Uralic may have even served as the lingua franca of the merchants involved in the ST phenomenon [18]. All these scenarios imply that material culture of the Baltic Sea area in Europe was influenced by cultures spreading westward from the periphery of Europe and/or Siberia. Whether these dispersals involved the spread of both languages and people remains so far largely unknown.

The population structure of Uralic speakers

To contextualize the autosomal genetic diversity of Uralic speakers among other Eurasian populations (Additional file 1: Table S1), we first ran the principal component (PC) analysis (Fig. 2a, Additional file 3: Figure S1). The first two PCs (Fig. 2a, Additional file 3: Figure S1A) sketch the geography of the Eurasian populations along the East-West and North-South axes, respectively. The Uralic speakers, along with other populations speaking Slavic and Turkic languages, are scattered along the first PC axis in agreement with their geographic distribution (Figs. 1 and 2a) suggesting that geography is the main predictor of genetic affinity among the groups in the given area. Secondly, in support of this, we find that FST-distances between populations (Additional file 3: Figure S2) decay in correlation with geographical distance (Pearson’s r = 0.77, p < 0.0001). On the UPGMA tree based on these FST-distances (Fig. 2b), the Uralic speakers cluster into several different groups close to their geographic neighbours.

uralic-pca
Principal component analysis (PCA) and genetic distances of Uralic-speaking populations. a PCA (PC1 vs PC2) of the Uralic-speaking populations.

We next used ADMIXTURE [48], which presents the individuals as composed of inferred genetic components in proportions that maximize Hardy-Weinberg and linkage equilibrium in the overall sample (see the ‘Methods’ section for choice of presented K). Overall, and specifically at lower values of K, the genetic makeup of Uralic speakers resembles that of their geographic neighbours. The Saami and (a subset of) the Mansi serve as exceptions to that pattern being more similar to geographically more distant populations (Fig. 3a, Additional file 3: S3). However, starting from K = 9, ADMIXTURE identifies a genetic component (k9, magenta in Fig. 3a, Additional file 3: S3), which is predominantly, although not exclusively, found in Uralic speakers. This component is also well visible on K = 10, which has the best cross-validation index among all tests (Additional file 3: S3B). The spatial distribution of this component (Fig. 3b) shows a frequency peak among Ob-Ugric and Samoyed speakers as well as among neighbouring Kets (Fig. 3a). The proportion of k9 decreases rapidly from West Siberia towards east, south and west, constituting on average 40% of the genetic ancestry of FU speakers in Volga-Ural region (VUR) and 20% in their Turkic-speaking neighbours (Bashkirs, Tatars, Chuvashes; Fig. 3a). The proportion of this component among the Saami in Northern Scandinavia is again similar to that of the VUR FU speakers, which is exceptional in the geographic context. It is also notable that North Russians, sampled from near the White Sea, differ from other Russians by sporting higher proportions of k9 (10–15%), which is similar to the values we observe in their Finnic-speaking neighbours. Notably, Estonians and Hungarians, who are geographically the westernmost Uralic speakers, virtually lack the k9 cluster membership.

siberian-ancestry
Population structure of Uralic-speaking populations inferred from ADMIXTURE analysis on autosomal SNPs in Eurasian context. a Individual ancestry estimates for populations of interest for selected number of assumed ancestral populations (K3, K6, K9, K11). Ancestry components discussed in a main text (k2, k3, k5, k6, k9, k11) are indicated and have the same colours throughout. The names of the Uralic-speaking populations are indicated with blue (Finno-Ugric) or orange (Samoyedic). The full bar plot is presented in Additional file 3: Figure S3. b Frequency map of component k9

We also tested the different demographic histories of female and male lineages by comparing outgroup f3 results for autosomal and X chromosome (chrX) data for pairs of populations (Estonians, Udmurts or Khanty vs others) with high versus low probability to share their patrilineal ancestry in chrY hg N (see the ‘Methods’ section, Additional file 3: Figure S13). We found a minor but significant excess of autosomal affinity relative to chrX for pairs of populations that showed a higher than 10% chance of two randomly sampled males across the two groups sharing their chrY ancestry in hg N3-M178, compared to pairs of populations where such probability is lower than 5% (Additional file 3: Figure S13).

In sum, these results suggest that most of the Uralic speakers may indeed share some level of genetic continuity via k9, which, however, also extends to the geographically close Turkic speakers.

uralic-modern-europe

Identity-by-descent

We found that it is the admixture with the Siberians that makes the Western Uralic speakers different from the tested European populations (Additional file 3: Figure S4A-F, H, J, L). Differentiating between Estonians and Finns, the Siberians share more derived alleles with Finns, while the geographic neighbours of Estonians (and Finns) share more alleles with Estonians (Additional file 3: Figure S4M). Importantly, Estonians do not share more derived alleles with other Finnic, Saami, VUR FU or Ob-Ugric-speaking populations than Latvians (Additional file 3: Figure S4O). The difference between Estonians and Latvians is instead manifested through significantly higher levels of shared drift between Estonians and Siberians on the one hand and Latvians and their immediate geographic neighbours on the other hand. None of the Uralic speakers, including linguistically close Khanty and Mansi, show significantly closer affinities to the Hungarians than any non-FU population from NE Europe (Additional file 3: Figure S4R).

ibd-uralic-genetics
Share of ~ 1–2 cM identity-by-descent (IBD) segments within and between regional groups of Uralic speakers. For each Uralic-speaking population representing lines in this matrix, we performed permutation test to estimate if it shows higher IBD segment sharing with other population (listed in columns) as compared to their geographic control group. Empty rectangles indicate no excess IBD sharing, rectangles filled in blue indicate comparisons when statistically significant excess IBD sharing was detected between one Uralic-speaking population with another Uralic-speaking population (listed in columns), rectangles filled in green mark the comparisons when a Uralic-speaking population shows excess IBD sharing with a non-Uralic-speaking population. For each tested Uralic speaker (matrix rows) populations in the control group that were used to generate permuted samples are indicated using small circles. For example, the rectangle filled in blue for Vepsians and Komis (A) implies that the Uralic-speaking Vepsians share more IBD segments with the Uralic-speaking Komis than the geographic control group for Vepsians, i.e. populations indicated with small circles (Central and North Russians, Swedes, Latvians and Lithuanians). The rectangle filled in green for Vepsians and Dolgans shows that the Uralic-speaking Vepsians share more IBD segments with the non-Uralic-speaking Dolgans than the geographic control group

Time of Siberian admixture

The time depth of the Globetrotter (Fig. 5b) inferred admixture events is relatively recent—500–1900 AD (see also complementary ALDER results, in Additional file 13: Table S12 and Additional file 3: Figure S7)—and agrees broadly with the results reported in Busby et al. [55]. A more detailed examination of the ALDER dates, however, reveals an interesting pattern. The admixture events detected in the Baltic Sea region and VUR Uralic speakers are the oldest (800–900 AD or older) followed by those in VUR Turkic speakers (∼1200–1300 AD), while the admixture dates for most of the Siberian populations (>1500 AD) are the most recent (Additional file 3: Figure S7). The West Eurasian influx into West Siberia seen in modern genomes was thus very recent, while the East Eurasian influx into NE Europe seems to have taken place within the first millennium AD (Fig. 5b, Additional file 3: Figure S7).

Affinities of the Uralic speakers with ancient Eurasians

We next calculated outgroup f3-statistics [48] to estimate the extent of shared genetic drift between modern and ancient Eurasians (Additional file 14: Table S13, Additional file 3: Figures S8-S9). Consistent with previous reports [45, 50], we find that the NE European populations including the Uralic speakers share more drift with any European Mesolithic hunter-gatherer group than Central or Western Europeans (Additional file 3: Figure S9A-C). Contrasting the genetic contribution of western hunter-gatherers (WHG) and eastern hunter-gatherers (EHG), we find that VUR Uralic speakers and the Saami share more drift with EHG. Conversely, WHG shares more drift with the Finnic and West European populations (Additional file 3: Figure S9A). Interestingly, we see a similar pattern of excess of shared drift between VUR and EHG if we substitute WHG with the aDNA sample from the Yamnaya culture (Additional file 3: Figure S9D). As reported before [2, 45], the genetic contribution of European early farmers decreases along an axis from Southern Europe towards the Ural Mountains (Fig. 6, Additional file 3: Figure S9E-F).

yamna-cwc-qpgraph-admixture-uralic
Proportions of ancestral components in studied European and Siberian populations and the tested qpGraph model. a The qpGraph model fitting the data for the tested populations. Colour codes for the terminal nodes: pink—modern populations (‘Population X’ refers to test population) and yellow—ancient populations (aDNA samples and their pools). Nodes coloured other than pink or yellow are hypothetical intermediate populations. We putatively named nodes which we used as admixture sources using the main recipient among known populations. The colours of intermediate nodes on the qpGraph model match those on the admixture proportions panel. b Admixture proportions (%) of ancestral components. We calculated the admixture proportions summing up the relative shares of a set of intermediate populations to explain the full spectrum of admixture components in the test population. We further did the same for the intermediate node CWC’ and present the proportions of the mixing three components in the stacked column bar of CWC’. Colour codes for ancestral components are as follows: dark green—Western hunter gatherer (WHG’); light green—Eastern hunter gatherer (EHG’); grey—European early farmer (LBK’); dark blue—carriers of Corded Ware culture (CWC’); and dark grey—Siberian. CWC’ consists of three sub-components: blue—Caucasian hunter-gatherer in Yamnaya (CHGinY’); light blue—Eastern hunter-gatherer in Yamnaya (EHGinY’); and light grey—Neolithic Levant (NeolL’)

We then used the qpGraph software [48] to test alternative demographic scenarios by trying to fit the genetic diversity observed in a range of the extant Finno-Ugric populations through a model involving the four basic European ancestral components: WHG, EHG, early farmers (LBK), steppe people of Yamnaya/Corded Ware culture (CWC) and a Siberian component (Fig. 6, Additional file 3: Figure S10). We chose the modern Nganasans to serve as a proxy for the latter component because we see least evidence for Western Eurasian admixture (Additional file 3: Figure S3) among them. We also tested the Khantys for that proxy but the model did not fit (yielding f2-statistics, Z-score > 3). The only Uralic-speaking population that did not fit into the tested model with five ancestral components were Hungarians. The qpGraph estimates of the contributions from the Siberian component show that it is the main ancestry component in the West Siberian Uralic speakers and constitutes up to one third of the genomes of modern VUR and the Saami (Fig. 6). It drops, however, to less than 10% in most of NE Europe, to 5% in Estonians and close to zero in Latvians and Lithuanians.

Discussion

uralic-groups-haplogroup-r1a
Additional file 6: Table S5. Y chromosome haplogroup frequencies in Eurasia. Modified by me: in bold haplogroup N1c and R1a from Uralic-speaking populations, with those in red showing where R1a is the major haplogroup. Observe that all Uralic subgroups – Finno-Permic, Ugric, and Samoyedic – have some populations with a majority of R1a lineages.

One of the notable observations that stands out in the fineSTRUCTURE analysis is that neither Hungarians nor Estonians or Mordovians form genetic clusters with other Uralic speakers but instead do so with a broad spectrum of geographically adjacent samples. Despite the documented history of the migration of Magyars [63] and their linguistic affinity to Khantys and Mansis, who today live east of the Ural Mountains, there is nothing in the present-day gene pool of the sampled Hungarians that we could tie specifically to other Uralic speakers.

Perhaps even more surprisingly, we found that Estonians, who show close affinities in IBD analysis to neighbouring Finnic speakers and Saami, do not share an excess of IBD segments with the VUR or Siberian Uralic speakers. This is eIn this context, it is important to remind that the limited (5%, Fig. 6) East Eurasian impact in the autosomal gene pool of modern Estonians contrasts with the fact that more than 30% of Estonian (but not Hungarian) men carry chrY N3 that has an East Eurasian origin and is very frequent among NE European Uralic speakers [36]. However, the spread of chrY hg N3 is not language group specific as it shows similar frequencies in Baltic-speaking Latvians and Lithuanians, and in North Russians, who in all our analyses are very similar to Finnic-speakers. The latter, however, are believed to have either significantly admixed with their Uralic-speaking neighbours or have undergone a language shift from Uralic to Indo-European [38].ven more striking considering that the immediate neighbours—Finns, Vepsians and Karelians—do.

With some exceptions such as Estonians, Hungarians and Mordovians, both IBD sharing and Globetrotter results suggest that there are detectable inter-regional haplotype sharing ties between Uralic speakers from West Siberia and VUR, and between NE European Uralic speakers and VUR. In other words, there is a fragmented pattern of haplotype sharing between populations but no unifying signal of sharing that unite all the studied Uralic speakers.

Comments

The paper is obviously trying to find a “N1c/Siberian ancestry = Uralic” link, but it shows (as previous papers using ancient DNA) that this identification is impossible, because it is not possible to identify “N1c=Siberian ancestry”, “N1c=Uralic”, or “Siberian ancestry = Uralic”. In fact, the arrival of N subclades and Siberian ancestry are late, both events (probably multiple stepped events) are unrelated to each other, and represent east-west demic diffusion waves (as well as founder effects) that probably coincide in part with the Scythian and Turkic (or associated) expansions, i.e. too late for any model of Proto-Uralic or Proto-Finno-Ugric expansion.

On the other hand, it shows interesting data regarding ancestry of populations that show increased Siberian influence, such as those easternmost groups admixed with Yeniseian-like populations (Samoyedic), those showing strong founder effects (Finnic), or those isolated in the Circum-Artic region with neighbouring Siberian peoples in Kola (Saami). All in all, Hungarians, Estonians and Mordovians seem to show the original situation better than the other groups, which is also reflected in part in Y-DNA, conserved as a majority of R1a lineages precisely in these groups. Just another reminder that CWC-related ancestry is found in every single Uralic group, and that it represents the main ancestral component in all non-Samoyedic groups.

estonians-hungarians-mordvinian
Selection of the PCA, with the group of Estonians, Mordovians, and Hungarians selected.

The qpGraph shows the ancestor of Yamna (likely Khvalynsk) and Corded Ware stemming as different populations from a common (likely Neolithic) node – whose difference is based on the proportion of Anatolian-related ancestry – , that is, probably before the Indo-Hittite expansion; and ends with CWC groups forming the base for all Uralic peoples. Below is a detail of the qpGraph on the left, and my old guess (2017) on the right, for comparison:

yamna-corded-ware-qpgraph

#EDIT (22 sep 2018): I enjoyed re-reading it, and found this particular paragraph funny:

Despite the documented history of the migration of Magyars [63] and their linguistic affinity to Khantys and Mansis, who today live east of the Ural Mountains, there is nothing in the present-day gene pool of the sampled Hungarians that we could tie specifically to other Uralic speakers.

They are so obsessed with finding a link to Siberian ancestry and N1c, and so convinced of Kristiansen’s idea of CWC=Indo-European, that they forgot to examine their own data from a critical point of view, and see the clear link between all Uralic peoples with Corded Ware ancestry and R1a-Z645 subclades… Here is a reminder about Hungarians and R1a-Z282, and about the expansion of R1a-Z645 with Uralic peoples.

Related

Bayesian estimation of partial population continuity by using ancient DNA and spatially explicit simulations

europe-palaeolithic-neolithic

Open access Bayesian estimation of partial population continuity by using ancient DNA and spatially explicit simulations, by Silva et al., Evolutionary Applications (2018).

Abstract (emphasis mine):

The retrieval of ancient DNA from osteological material provides direct evidence of human genetic diversity in the past. Ancient DNA samples are often used to investigate whether there was population continuity in the settlement history of an area. Methods based on the serial coalescent algorithm have been developed to test whether the population continuity hypothesis can be statistically rejected by analysing DNA samples from the same region but of different ages. Rejection of this hypothesis is indicative of a large genetic shift, possibly due to immigration occurring between two sampling times. However, this approach is only able to reject a model of full continuity model (a total absence of genetic input from outside), but admixture between local and immigrant populations may lead to partial continuity. We have recently developed a method to test for population continuity that explicitly considers the spatial and temporal dynamics of populations. Here we extended this approach to estimate the proportion of genetic continuity between two populations, by using ancient genetic samples. We applied our original approach to the question of the Neolithic transition in Central Europe. Our results confirmed the rejection of full continuity, but our approach represents an important step forward by estimating the relative contribution of immigrant farmers and of local hunter‐gatherers to the final Central European Neolithic genetic pool. Furthermore, we show that a substantial proportion of genes brought by the farmers in this region were assimilated from other hunter‐gatherer populations along the way from Anatolia, which was not detectable by previous continuity tests. Our approach is also able to jointly estimate demographic parameters, as we show here by finding both low density and low migration rate for pre‐Neolithic hunter‐gatherers. It provides a useful tool for the analysis of the numerous aDNA datasets that are currently being produced for many different species.

central-european-neolithic
A) Different zones defined for computing proportions of ancestry in Central Europeans 4,500 BP. B) Schematic representation of various population contributions. C) Mean proportions of ancestry from the various PHG zones (A+B+C+D) in Central European populations from zone A at the end of the Neolithic transition 4,500 BP, computed for autosomal and mitochondrial markers.

Relevant excerpts:

Our results are in general accordance with two distinct ancestry components that have previously been detected at the continental scale by Lazaridis, Patterson et al. (2014): the “early European farmer” (EEF), which corresponds here to the NFA from Anatolia (zone C in Figure 3), and the “West European hunter-gatherer” (WHG), which corresponds here to the PHG from zones A and B in Figure 3. Notably, the contribution of an Ancient North Eurasians (ANE) component is not included in our model as we did not consider potential post-Neolithic immigration waves, which could have contributed to the modern European genetic pool, such as the wave that came from the Pontic steppes and was associated with the Yamnaya culture (Haak, Lazaridis et al. 2015). Without considering the ANE ancestry component, our estimate of the autosomal genetic contribution of Early farmers to the gene pool of Central European populations (25%) tends to be lower than the EEF ancestry estimated in most modern Western European populations, but is of the same order than the estimations in modern Estonians and in the ancient Late Neolithic genome “Karsdorf” from Germany (Lazaridis, Patterson et al. 2014, Haak, Lazaridis et al. 2015). Note that the contribution of hunter-gatherers to Neolithic communities appears to be variable in different regions of Europe (Skoglund, Malmstrom et al. 2012, Brandt, Haak et al. 2013, Lazaridis, Patterson et al. 2014), while we computed an average value for Central Europe. Moreover, we computed the ancestry of the two groups at the end of the Neolithic period while previous studies estimated it in modern times. Finally, previous studies used molecular information to directly estimate admixture proportions, while we use molecular information to estimate the model parameters and, then, we computed the expected genetic contributions of both groups using the best parameters, without using molecular information during this second step. Model assumptions may thus influence the inferences on the relative genetic contribution of both groups. In particular, we made the assumption of a uniform expansion of NFA with constant and similar assimilation of PHG over the whole continent but spatio-temporally heterogeneous environment, variable assimilation rate and long distance dispersal may have played an important role. The effects of those factors should be investigated in future studies.

On Proto-Finnic language guesstimates, and its western homeland

bronze_age_early-sejma-turbino

Recent chapter The Indo-Europeans and the Non-Indo-Europeans in Prehistoric Northern Europe, by Petri Kallio, In: Language and Prehistory of the Indo- European Peoples: A Cross-Disciplinary Perspective, Copenhagen (2017).

Interesting excerpts (emphasis mine), especially when read in combination with the most recent papers on Early Indo-Iranian, Corded Ware, and Fennoscandian genomes:

Like the Indo-Europeanists, also the Uralicists suffer from their “school who wants it large and wants it early”. This time, however, the desired homeland is even larger and earlier, covering the whole northern half of Europe already at the end of the Ice Age (Wiik 2002). As a Finn, admittedly, I find such an idea very flattering indeed. As a historical linguist, however, I also find it absurd for the same reasons which I already gave above in the case of the Indo-Europeans.

True, linguistic palaeontology is less helpful in the case of the Uralians, even though especially Common Uralic *pata ‘clay pot’ and *wäśkä ‘copper’ indicate that Proto-Uralic was not spoken before the Subneolithic period, which in the East-Baltic area is dated about 5300–3200 BC. However, the most valuable evidence comes from the earliest Indo-European loanwords in the Uralic languages, which show that Proto-Uralic cannot have been spoken much earlier than Proto-Indo-European dated about 3500 BC (Koivulehto 2001: 235, 257).

As the same loanword evidence naturally also shows that the Uralic and Indo-European homelands were not located far from one another, the Uralic homeland can most likely be located in the Middle and Upper Volga region, right north of the Indo-European homeland*. From the beginning of the Subneolithic period about 5900 BC onwards, this region was an important innovation centre, from where several cultural waves spread to the Finnish Gulf area, such as the Sperrings Ware wave about 4900 BC, the Combed Ware wave about 3900 BC, and the Netted Ware wave about 1900 BC (Carpelan & Parpola 2001: 78–90).

* Interestingly, the only Uralicists who generally reject the Central Russian homeland are the Russian ones who prefer the Siberian homeland instead. Some Russians even advocate that the Central Russian homeland is only due to Finnish nationalism or, as one of them put it a bit more tactfully, “the political and ideological situation in Finland in the first decades of the 20th century” (Napolskikh 1995: 4). Still, some Finns (and especially those who also belong to the “school who wants it large and wants it early”) simultaneously advocate that exactly the same Central Russian homeland is due to Finnlandisierung (Wiik 2001: 466). Fortunately, I do not even need to resort to playing the politics card myself, because there is enough convincing evidence for the Central Russian homeland anyway.

Remarkably, the loanword evidence furthermore suggests that the ancestors of Finnic and Saamic had at least phonologically remained very close to Proto-Uralic as late as the Bronze Age (ca. 1700–500 BC). In particular, certain loanwords, whose Baltic and Germanic sources point to the first millennium BC, after all go back to the Finno-Saamic proto-stage, which is phonologically almost identical to the Uralic proto-stage (see especially the table in Sammallahti 1998: 198–202). This being the case, Dahl’s wave model could perhaps have some use in Uralic linguistics, too.

Even though Bronze-Age Finnic and Saamic were still two dialects rather than two languages, it does not mean that they would still have been spoken in a geographically limited area. On the contrary, their Indo-European loanwords dating to this period indicate that their speech areas were already geographically separate. The fact that at that time both Baltic and Germanic influenced Finnic much more strongly than Saamic must be considered a crucial piece of information when we are trying to locate the Finnic and Saamic homelands.

Iron Age migrations in Europe.

(…)the fact that Palaeo-Germanic loanwords are much more numerous in Finnic than in Saamic must lead to the same conclusion. As I noted above, the most likely Palaeo-Germanic speaking carriers of the Nordic Bronze culture (ca. 1700–500 BC) spread from Scandinavia to the Finnish and Estonian coastal areas. As they never spread any further to the east than as far as the bottom of the Finnish Gulf, the idea that the Finnic homeland included neither Finland nor Estonia completely fails to explain the very existence of Palaeo-Germanic loanwords, whose quantity and quality in Finnic presuppose a superstrate rather than an adstrate.

(…) as the Nordic Bronze culture influenced coastal Finland much more strongly than it did coastal Estonia, the idea that the Finnic homeland did not include Finland but Estonia alone similarly fails to explain the very strength of the Bronze-Age Palaeo-Germanic superstrate in Finnic, which can indeed be compared with the Medieval French superstrate in English, for instance (Kallio 2000: 96–97). From a Germanicist point of view, therefore, Itkonen’s theory concerning the Finnic homeland does not only seem to be the best but also the only alternative (Koivulehto 1984: 198–200).

As the same can now also be said about the Indo-Europeanization of the Baltic speech area, the fact that Baltic and Finnic are the most conservative branches of their language families and that they have relatively few substrate words may really be due to exactly the same reason, namely that before their arrival the East-Baltic region was still very sparsely populated by Subneolithic hunter-fisher-gatherers, whose linguistic influence on the newcomers was therefore rather limited. On the other hand, as these language shifts already took place millennia ago, there has been a lot of time for the Baltic and Finnic speakers to replace most of their old substrate words by all kinds of new lexical innovations.

Speaking of loanword evidence, the Aikios and especially Saarikivi (2004b) have furthermore argued that the Indo-Iranian loanwords occurring in Finnic and/or Saamic alone force us to locate the Finnic and Saamic homelands further to the east (e.g. near the White Lake). Still, I fail to see why the Indo-Iranian loanwords counted in dozens should be more relevant in locating these two homelands than the Germanic loanwords counted in hundreds. Besides, the Indo-Iranian loanwords mainly consist of cultural borrowings which do not necessarily presuppose a superstrate but only an adstrate. Moreover, they must be dated so much earlier than Vedic Sanskrit (ca. 1500–1000 BC) and Gathic Avestan (ca. 1000–800 BC) anyway that their spread can very well be connected with the abovementioned Netted Ware wave about 1900 BC.

An interesting read, where the author expressly refers to the many political (nationalist) and xenophobic overtones (including his own) that arise in ethnolinguistic identifications of prehistoric cultures.

We are seeing how the newest dialectalisation trends want it ‘late and small’, and ‘late’ corresponds smoothly with the most recent genomic findings involving Chalcolithic and Bronze Age expansions.

In the Uralic case, in North-Eastern Europe only Corded Ware migrants are known to have expanded within a suitable time frame into the region, and their patrilineal descendants show a widespread distribution in the region during the Bronze Age.

Also, if Proto-Finnic is coeval with Pre-Proto-Germanic, and expanded from the western part of North-East Europe (necessarily including the Gulf of Finland), well… You know the drill.

Of course, regarding Proto-Indo-European and Uralic, there are also a lot of people who still want itlarge and early‘ – and the most recent research won’t deter them from such proposals.

Related: