Sorry for the last few weeks of silence; I have been rather busy lately. I have more projects going on, and (because of that) I also wanted to finish a project I have been working on for many months already.
I have therefore decided to publish a provisional version of the text, in the hope that it will be useful in the following months, when I won’t be able to update it as often as I would like to:
Don’t forget to check out the maps included in the supplementary materials (I have added Y-DNA, mtDNA, and ADMIXTURE data using GIS software).
NOTE. Right now the files are only on my server. I will try to upload them to Academia.edu and ResearchGate when I have time, in case the websites are too slow.
I would have preferred to wait for a thorough revision of the section on archaeology and the linguistic sections on Uralic, but I doubt I will have time when the reviews come, so it was either now or maybe next December…
I say so in the introduction, but it is evident that certain aspects of the book are tentative to say the least: the farther back we go from Late Proto-Indo-European, the less clear are many aspects. Also, linguistically I am not convinced about Eurasiatic or Nostratic, although they do have a certain interest when we try to offer a comprehensive view of the past, including ethnolinguistic identities.
I cannot be an expert in everything, and these books cover a lot. I am bound to publish many corrections as new information appears and more reviews are sent. For example, just days ago (before SNP calls of Wang et al. 2018 were published) some paragraphs implied that AME might have expanded Nostratic from the Middle East. Now it does not seem so, and I changed them just before uploading the text. That’s how tentative certain routes are, and how much all of this may change. And that only if we accept a Nostratic phylum…
NOTE. Since the first book I wrote was the linguistic one, and I have spent the last months updating the archaeology + genetics part, now many of you will probably understand 1) why I am so convinced about certain language relationships and 2) how I used many posts to clarify certain ideas and receive comments. Many posts probably offer a good timeline of what I worked with, and when.
I did not add this section to the books, because they are still not ready for print, but I think this is due somewhere now. It is impossible to reference all who have directly or indirectly contributed to this, so this is a list of those I feel have played an important role.
I am indebted to the following people (which does not mean that they share my views, obviously):
First and foremost, to Fernando López-Menchero, for having the patience to review in detail many parts on Indo-European linguistics, knowing that I won’t accept many of his comments anyway. The additional information he offers is invaluable, but I didn’t want to turn this into a huge linguistic encyclopaedia with unending discussions of tiny details of each reconstructed word. I think it is already too big as it is.
Professor Kortlandt is still to review the text, but he contributed to both previous essays in some very interesting ways, so I hope he can help me improve the parts on Uralic, and maybe alternative accounts of expansion for Balto-Slavic, depending on the time depth that he would consider warranted according to the Temematic hypothesis.
I would not have thought about doing this if it were not for the interest of Wekwos (Xavier Delamarre) in publishing a full book about the Indo-European demic diffusion model (in the second half of 2017, I think). It was they who suggested that I extend the content, when all I had done until then was write an essay and draw some maps in my free time between depositing the PhD thesis and defending it.
Sadly, as much as I would like to publish a book with a professional publisher, I don’t think ancient DNA lends itself to the traditional format, so my requests (mainly to have free licenses and to be able to revise the text at will, as new genetic papers are published) were logically not acceptable. Also, the main aim of all volumes, especially the linguistic one, is the teaching of essentials of Late Proto-Indo-European and related languages, and this objective would be thwarted by selling each volume for $50-70 and only in printed format. I prefer a wider distribution.
At first I didn’t think much of this proposal, because I do not benefit from this kind of publication in my scientific field, but with time my interest in writing a whole, comprehensive book on the subject grew to the point where it was already an ongoing project, probably by the start of 2018.
I would not have been in contact with Wekwos if it were not for user Camulogène Rix at Anthrogenica, so thanks for that and for the interest in this work.
I would not have thought of writing this either if not for the spontaneous support (with an unexpected phone call!) of a professor of the Complutense University of Madrid, Ángel Gómez Moreno, who is interested in this subject – as is his wife, a professor of Classics more closely associated with Indo-European studies, and who helped me with a search for Indo-Europeanists.
EDIT (1 JAN 2019): I remembered that Karin Bojs sent me her book after reading the demic diffusion model. I may have also thought about writing a whole book back then, but mid-2017 is probably too early for the project.
The maps are evidently (for those who are interested in genetics) in part the result of the effort of the late Jean Manco: As you can see from the maps including Y-DNA and mtDNA samples, I have benefitted from her way of organising data and publishing it. Similarly, the work of Iain McDonald in assessing the potential migration routes of R1b and R1a in Europe with the help of detailed maps was behind my idea for the first maps, and consequently behind these, too.
Readers of this blog with interesting comments have also been essential for the improvement of the texts. You can probably see some of your many contributions there. I may not answer many comments, because I am always busy (and sometimes I just don’t have anything interesting to say), but I try to read all of them.
Users of other sites, like Anthrogenica, whose particular points of view and deep knowledge of some very specific aspects are sometimes very useful. In particular, user Anglesqueville helped me to fix some issues with the merging of datasets to obtain the PCAs and ADMIXTURE, and prepared some individual samples to merge them.
Even without posting anything, Google Analytics keeps sending me messages about increasing user fidelity (returning users), and stats haven’t really changed (which probably means more people are reading old posts), so thank you for that.
To understand the population history and context of dairy pastoralism in the eastern Eurasian steppe, we applied genomic and proteomic analyses to individuals buried in Late Bronze Age (LBA) burial mounds associated with the Deer Stone-Khirigsuur Complex (DSKC) in northern Mongolia. To date, DSKC sites contain the clearest and most direct evidence for animal pastoralism in the Eastern steppe before ca. 1200 BCE.
Most LBA Khövsgöls are projected on top of modern Tuvinians or Altaians, who reside in neighboring regions. In comparison with other ancient individuals, they are also close to but slightly displaced from temporally earlier Neolithic and Early Bronze Age (EBA) populations from the Shamanka II cemetery (Shamanka_EN and Shamanka_EBA, respectively) from the Lake Baikal region. However, when Native Americans are added to PC calculation, we observe that LBA Khövsgöls are displaced from modern neighbors toward Native Americans along PC2, occupying a space not overlapping with any contemporary population. Such an upward shift on PC2 is also observed in the ancient Baikal populations from the Neolithic to EBA and in the Bronze Age individuals from the Altai associated with Okunevo and Karasuk cultures.
(…) two individuals fall on the PC space markedly separated from the others: ARS017 is placed close to ancient and modern northeast Asians, such as early Neolithic individuals from the Devil’s Gate archaeological site (22) and present-day Nivhs from the Russian far east, while ARS026 falls midway between the main cluster and western Eurasians.
Upper Paleolithic Siberians from nearby Afontova Gora and Mal’ta archaeological sites (AG3 and MA-1, respectively) (25, 26) have the highest extra affinity with the main cluster compared with other groups, including the eastern outlier ARS017, the early Neolithic Shamanka_EN, and present-day Nganasans and Tuvinians (Z > 6.7 SE for AG3). Main cluster Khövsgöl individuals mostly belong to Siberian mitochondrial (A, B, C, D, and G) and Y (all Q1a but one N1c1a) haplogroups.
Previous studies show a close genetic relationship between WSH populations and ANE ancestry, as Yamnaya and Afanasievo are modeled as a roughly equal mixture of early Holocene Iranian/ Caucasus ancestry (IRC) and Mesolithic Eastern European hunter-gatherers, the latter of which derive a large fraction of their ancestry from ANE. It is therefore important to pinpoint the source of ANE-related ancestry in the Khövsgöl gene pool: that is, whether it derives from a pre-Bronze Age ANE population (such as the one represented by AG3) or from a Bronze Age WSH population that has both ANE and IRC ancestry.
The amount of WSH contribution remains small (e.g., 6.4 ± 1.0% from Sintashta). Assuming that the early Neolithic populations of the Khövsgöl region resembled those of the nearby Baikal region, we conclude that the Khövsgöl main cluster obtained ∼11% of their ancestry from an ANE source during the Neolithic period and a much smaller contribution of WSH ancestry (4–7%) beginning in the early Bronze Age.
Apparently, then, the first individual with substantial WSH ancestry in the Khövsgöl population (ARS026, of haplogroup R1a-Z2123), directly dated to 1130–900 BC, is consistent with the first appearance of admixed forest-steppe-related populations like Karasuk (ca. 1200-800 BC) in the Altai. Interestingly, haplogroup N1a1a-M178 pops up (with mtDNA U5a2d1) among the earlier Khövsgöl samples.
I will repeat what I wrote recently here: Samoyedic arrived in the Altai with Karasuk and hg R1a-Z645 + Steppe_MLBA-like ancestry, admixed with Altai populations, clustering thus within an Ancient Altai cline. Only later did N1a1a subclades infiltrate Samoyedic (and Ugric) populations, bringing them closer to their modern Palaeo-Siberian cline. The shared mtDNA may support an ancestral EHG-“Siberian” cline, or else a more recent Afanasevo-related origin.
Also interesting are the Q1a2 subclades and ANE ancestry making their appearance everywhere among ancestral Eurasian peoples, as Chetan recently pointed out.
Even though proposals of an Eastern Uralic (or Ugro-Samoyedic) group are in the minority – and those who support it tend to search for an origin of Uralic in Central Asia – , there is nothing wrong in supporting this from the point of view of a western homeland, because the eastward migration of both Proto-Ugric and Pre-Samoyedic peoples may have been coupled with each other at an early stage. It’s like Indo-Slavonic: it just doesn’t fit the linguistic data as well as the alternative, i.e. the expansion of Samoyedic first, different from a Finno-Ugric trunk. But, in case you are wondering about this possibility, here is Häkkinen’s (2012) phonological argument:
The case of Samoyedic is quite similar to that of Hungarian, although the earliest Palaeo-Siberian contact languages have been lost. There were contacts at least with Tocharian (Kallio 2004), Yukaghir (Rédei 1999) and Turkic (Janhunen 1998). Samoyedic also:
a) has moved far from the related languages and has been exposed to strong foreign influence
b) shares a small number of common words with other branches (from Sammallahti 1988: only 123 ‘Uralic’ words, versus 390 ‘Uralic’ + ‘Finno-Ugric’ words found in other branches than Samoyedic = 31.5%)
c) derives phonologically from the East Uralic dialect.
The phonological level is taxonomically more reliable, since it lacks the distortion caused by invisible convergence and false divergence at the lexical level. Thus we can conclude that the traditional taxonomic model, according to which Samoyedic was the first branch to split off from the Proto-Uralic unity, is just as incorrect as the view that Hungarian was the first branch to split off.
Late Uralic can be traced back to metallurgical cultures thanks to terms like PU *wäśka ‘copper/bronze’ (borrowed from Proto-Samoyedic *wesä into Tocharian); PU *äsa and *olna/*olni, ‘lead’ or ‘tin’, found in *äsa-wäśka ‘tin-bronze’; and e.g. *weŋći ‘knife’, borrowed into Indo-Iranian (through the stage of vocalization of nasals), appearing later as Proto-Indo-Aryan *wāćī ‘knife, awl, axe’.
It is known that the southern regions of the Abashevo culture developed Proto-Indo-Iranian-speaking Sintashta-Petrovka and Pokrovka (Early Srubna). To the north, however, Abashevo kept its Uralic nature, with continuous contacts allowing for the spread of lexicon – mainly into Finno-Ugric – , and phonetic influence – mainly Uralisms into Proto-Indo-Iranian phonology (read more here).
The northern part of Abashevo (just like the south) was mainly a metallurgical society, with Abashevo metal prospectors found also side by side with Sintashta pioneers in the Zeravshan Valley, near BMAC, in search of metal ores. About the Seima-Turbino phenomenon, from Parpola (2013):
From the Urals to the east, the chain of cultures associated with this network consisted principally of the following: the Abashevo culture (extending from the Upper Don to the Mid- and South Trans-Urals, including the important cemeteries of Sejma and Turbino), the Sintashta culture (in the southeast Urals), the Petrovka culture (in the Tobol-Ishim steppe), the Taskovo-Loginovo cultures (on the Mid- and Lower Tobol and the Mid-Irtysh), the Samus’ culture (on the Upper Ob, with the important cemetery of Rostovka), the Krotovo culture (from the forest steppe of the Mid-Irtysh to the Baraba steppe on the Upper Ob, with the important cemetery of Sopka 2), the Elunino culture (on the Upper Ob just west of the Altai mountains) and the Okunevo culture (on the Mid-Yenissei, in the Minusinsk plain, Khakassia and northern Tuva). The Okunevo culture belongs wholly to the Early Bronze Age (c. 2250–1900 BCE), but most of the other cultures apparently to its latter part, being currently dated to the pre-Andronovo horizon of c. 2100–1800 BCE (cf. Parzinger 2006: 244–312 and 336; Koryakova & Epimakhov 2007: 104–105).
The majority of the Sejma-Turbino objects are of the better quality tin-bronze, and while tin is absent in the Urals, the Altai and Sayan mountains are an important source of both copper and tin. Tin is also available in southern Central Asia. Chernykh & Kuz’minykh have accordingly suggested an eastern origin for the Sejma-Turbino network, backing this hypothesis also by the depiction on the Sejma-Turbino knives of mountain sheep and horses characteristic of that area. However, Christian Carpelan has emphasized that the local Afanas’evo and Okunevo metallurgy of the Sayan-Altai area was initially rather primitive, and could not possibly have achieved the advanced and difficult technology of casting socketed spearheads as one piece around a blank. Carpelan points out that the first spearheads of this type appear in the Middle Bronze Age Caucasia c. 2000 BCE, diffusing early on to the Mid-Volga-Kama-southern Urals area, where “it was the experienced Abashevo craftsmen who were able to take up the new techniques and develop and distribute new types of spearheads” (Carpelan & Parpola 2001: 106, cf. 99–106, 110). The animal argument is countered by reference to a dagger from Sejma on the Oka river depicting an elk’s head, with earlier north European prototypes (Carpelan & Parpola 2001: 106–109). Also the metal analysis speaks for the Abashevo origin of the Sejma-Turbino network. Out of 353 artefacts analyzed, 47% were of tin-bronze, 36% of arsenical bronze, and 8.5% of pure copper. Both the arsenical bronze and pure copper are very clearly associated with the Abashevo metallurgy.
The Abashevo metal production was based on the Volga-Kama-Belaya area sandstone ores of pure copper and on the more easterly Urals deposits of arsenical copper (Figure 9). The Abashevo people, expanding from the Don and Mid-Volga to the Urals, first reached the westerly sandstone deposits of pure copper in the Volga and Kama basins, and started developing their metallurgy in this area, before moving on to the eastern side of the Urals to produce harder weapons and tools of arsenical copper. Eventually they moved even further south, to the area richest in copper in the whole Urals region, founding there the very strong and innovative Sintashta culture.
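As a quick arithmetic check on the metal analysis quoted above, the percentages can be turned into approximate artefact counts. This is only a minimal sketch: the total (353) and the three percentages come from the quoted passage, while the variable names, the rounding, and the "not itemised" remainder are my own assumptions about how to read those figures.

```python
# Figures taken from the quoted passage; names and rounding are illustrative.
TOTAL_ARTEFACTS = 353
SHARES_PERCENT = {
    "tin-bronze": 47.0,
    "arsenical bronze": 36.0,
    "pure copper": 8.5,
}

def approximate_counts(total, shares_percent):
    """Convert percentage shares to rounded artefact counts."""
    return {name: round(total * pct / 100) for name, pct in shares_percent.items()}

counts = approximate_counts(TOTAL_ARTEFACTS, SHARES_PERCENT)

# Share of artefacts that the quoted percentages do not account for.
unspecified_percent = 100.0 - sum(SHARES_PERCENT.values())

for name, n in counts.items():
    print(f"{name}: ~{n} artefacts")
print(f"not itemised in the quote: {unspecified_percent:.1f}%")
```

Read this way, the two alloys that the quote ties to Abashevo metallurgy (arsenical bronze and pure copper) add up to 44.5% of the analysed artefacts, which is close to the tin-bronze share on its own.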
Regarding the most likely expansion of Eastern Uralic peoples:
Nataliya L’vovna Chlenova (1929–2009; cf. Korenyako & Kuz’minykh 2011) published in 1981 a detailed study of the Cherkaskul’ pottery. In her carefully prepared maps of 1981 and 1984 (Figure 10), she plotted Cherkaskul’ monuments not only in Bashkiria and the Trans-Urals, but also in thick concentrations on the Upper Irtysh, Upper Ob and Upper Yenissei, close to the Altai and Sayan mountains, precisely where the best experts suppose the homeland of Proto-Samoyed to be.
The Cherkaskul’ culture was transformed into the genetically related Mezhovka culture (c. 1500–1000 BCE), which occupied approximately the same area from the Mid-Kama and Belaya rivers to the Tobol river in western Siberia (cf. Parzinger 2006: 444–448; Koryakova & Epimakhov 2007: 170–175). The Mezhovka culture was in close contact with the neighbouring and probably Proto-Iranian speaking Alekseevka alias Sargary culture (c. 1500–900 BCE) of northern Kazakhstan (Figure 4 no. 8) that had a Fëdorovo and Cherkaskul’ substratum and a roller pottery superstratum (cf. Parzinger 2006: 443–448; Koryakova & Epimakhov 2007: 161–170). Both the Cherkaskul’ and the Mezhovka cultures are thought to have been Proto-Ugric linguistically, on the basis of the agreement of their area with that of Mansi and Khanty speakers, who moreover in their Fëdorovo-like ornamentation have preserved evidence of continuity in material culture (cf. Chlenova 1984; Koryakova & Epimakhov 2007: 159, 175).
The Mezhovka culture was succeeded by the genetically related Gamayun culture (c. 1000–700 BCE) (cf. Parzinger 2006: 446; 542–545).
From the Gamayun culture descend Trans-Urals cultures in close contact with Finno-Permic populations of the Cis-Ural region:
[Proto-Mansi] Itkul’ culture (c. 700–200 BCE) distributed along the eastern slope of the Ural Mountains (cf. Parzinger 2006: 552–556). Known from its walled forts, it constituted the principal Trans-Uralian centre of metallurgy in the Iron Age, and was in contact with both the Anan’ino and Akhmylovo cultures (the metallurgical centres of the Mid-Volga and Kama-Belaya region) and the neighbouring Gorokhovo culture.
[Proto-Hungarian] via the Vorob’evo Group (c. 700–550 BCE) (cf. Parzinger 2006: 546–549), to the Gorokhovo culture (c. 550–400 BCE) of the Trans-Uralian forest steppe (cf. Parzinger 2006: 549–552). For various reasons the local Gorokhovo people started mobile pastoral herding and became part of the multicomponent pastoralist Sargat culture (c. 500 BCE to 300 CE), which in a broader sense comprised all cultural groups between the Tobol and Irtysh rivers, succeeding here the Sargary culture. The Sargat intercommunity was dominated by steppe nomads belonging to the Iranian-speaking Saka confederation, who in the summer migrated northwards to the forest steppe.
[Proto-Khanty] Late Bronze Age and Early Iron Age cultures related to the Gamayunskoe and Itkul’ cultures that extended up to the Ob: the Nosilovo, Baitovo, Late Irmen’, and Krasnoozero cultures (c. 900–500 BCE). Some were in contact with the Akhmylovo on the Mid-Volga.
Parpola (2012) connects the expansion of Samoyedic with the Cherkaskul variant of Andronovo. As we know, Andronovo was genetically diverse, which speaks in favour of different groups developing similar material cultures in Central Asia.
Juha Janhunen, author of the etymological dictionary of the Samoyed languages (1977), places the homeland of Proto-Samoyedic in the Minusinsk basin on the Upper Yenissei (cf. Janhunen 2009: 72). Mainly on the basis of Bulghar Turkic loanwords, Janhunen (2007: 224; 2009: 63) dates Proto-Samoyedic to the last centuries BCE. Janhunen thinks that the language of the Tagar culture (c. 800–100 BCE) ought to have been Proto-Samoyedic (cf. Janhunen 1983: 117– 118; 2009: 72; Parzinger 2001: 80 and 2006: 619–631 dates the Tagar culture c. 1000–200 BCE; Svyatko et al. 2009: 256, based on human bone samples, c. 900 BCE to 50 CE). The Tagar culture largely continues the traditions of the Karasuk culture (c. 1400–900 BCE), (…)
The use of a map of “Siberian ancestry” peaking in the arctic to show a supposedly late Uralic population movement (starting in the Iron Age!) seems to be the latest trend in population genomics:
I guess that would make this map of Neolithic farmer ancestry represent an expansion of Indo-European from the south, because Anatolia, Greece, Italy, southern France, and Iberia – where this ancestry peaks in modern populations – are among the oldest territories where Indo-European languages were recorded:
Probably not the right interpretation of this kind of simplistic data about modern populations, though…
Overall, and specifically at lower values of K, the genetic makeup of Uralic speakers resembles that of their geographic neighbours. The Saami and (a subset of) the Mansi serve as exceptions to that pattern being more similar to geographically more distant populations (Fig. 3a, Additional file 3: S3). However, starting from K = 9, ADMIXTURE identifies a genetic component (k9, magenta in Fig. 3a, Additional file 3: S3), which is predominantly, although not exclusively, found in Uralic speakers. This component is also well visible on K = 10, which has the best cross-validation index among all tests (Additional file 3: S3B). The spatial distribution of this component (Fig. 3b) shows a frequency peak among Ob-Ugric and Samoyed speakers as well as among neighbouring Kets (Fig. 3a). The proportion of k9 decreases rapidly from West Siberia towards east, south and west, constituting on average 40% of the genetic ancestry of FU speakers in Volga-Ural region (VUR) and 20% in their Turkic-speaking neighbours (Bashkirs, Tatars, Chuvashes; Fig. 3a).
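The step of picking K by cross-validation, mentioned in the quote, can be sketched as follows. This assumes log output in which ADMIXTURE, run with cross-validation enabled, prints lines of the form `CV error (K=10): 0.52`; the helper names `parse_cv_errors` and `best_k` are hypothetical, and the sample values are made up for illustration, not taken from the paper.

```python
import re

# Pattern for lines such as "CV error (K=10): 0.52" (assumed log format).
CV_LINE = re.compile(r"CV error \(K=(\d+)\):\s*([0-9.]+)")

def parse_cv_errors(log_text):
    """Return {K: cv_error} for every CV line found in the text."""
    return {int(k): float(err) for k, err in CV_LINE.findall(log_text)}

def best_k(cv_errors):
    """Return the K with the lowest cross-validation error."""
    if not cv_errors:
        raise ValueError("no CV error lines found")
    return min(cv_errors, key=cv_errors.get)

# Made-up values for illustration only; not from any published run.
example_log = """\
CV error (K=8): 0.5310
CV error (K=9): 0.5302
CV error (K=10): 0.5297
CV error (K=11): 0.5305
"""

errors = parse_cv_errors(example_log)
print(best_k(errors))
```

Note that the lowest cross-validation error only says which K fits the genotype data best; it says nothing by itself about whether a component such as k9 tracks a language family.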
However, this ‘something’ that some people occasionally find in some Uralic populations is also common to other modern and ancient groups, and not so common in some other Uralic peoples. Simply put:
I already said this in the recent publication of Siberian samples, where a renamed and radiocarbon dated Finnish_IA clearly shows that Late Iron Age Saami (ca. 400 AD) had little “Siberian ancestry”, if any at all, representing the most likely Fennic (and Samic) ancestral components before their expansion into central and northern Finland, where they admixed with circum-polar peoples of asbestos ware cultures.
I will say that again and again, any time they report the so-called “Siberian ancestry” in Uralic samples, no matter how it is defined each time: it does not seem to be that special something people are looking for, but rather (at least in a great part) a quite old ancestral component forming an evident cline with EHG, whose best proximate source is Baikal_EN (and/or Devil’s Gate) at this moment, and thus also East European hunter-gatherers for Western Uralic peoples:
So either Samara_HG, Karelia_HG, and many other groups from eastern Europe all spoke Uralic according to this ADMIXTURE graphic (and the formation of steppe ancestry in the Volga-Ural region brought the Proto-Indo-European language to the steppes through the CHG/ANE expansion), or a great part of this “Siberian ancestry” found in modern Uralic-speaking populations is not what some people would like to think it is…
PCA clines can be looked for to represent expansions of ancient populations. Most recently, Flegontov et al. (2018) are attempting to do this with Asian populations:
For some Turkic groups in the Urals and the Altai regions and in the Volga basin, a different admixture model fits the data: the same West Eurasian source + Uralic- or Yeniseian-speaking Siberians. Thus, we have revealed an admixture cline between Scythians and the Iranian farmer genetic cluster, and two further clines connecting the former cline to distinct ancestry sources in Siberia. Interestingly, few Wusun-period individuals harbor substantial Uralic/Yeniseian-related Siberian ancestry, in contrast to preceding Scythians and later Turkic groups characterized by the Tungusic/Mongolic-related ancestry. It remains to be elucidated whether this genetic influx reflects contacts with the Xiongnu confederacy. We are currently assembling a collection of samples across the Eurasian steppe for a detailed genetic investigation of the Hunnic confederacies.
There are potential errors with this approach:
The main one is practical – does a modern cline represent an ancestral language? The answer is: sometimes. It depends on the anthropological context that we have, and especially on the precision of the PCA:
The ‘Europe’, ‘Middle East’, etc. clines of the above PCA do not represent one language, but many. For starters, the PCA includes too many (and modern) populations, so its precision is useless for ethnolinguistic groups. Which is the right level? Again, it depends.
The other error is one of detail of the clines drawn (which, in turn, depends on the precision of the PCA). For example, we can draw two parallel lines (or even one line, as in Flegontov et al. above) in one PCA graphic, but we still don’t have the direction of expansion. How do we know if this supposed “Uralic-speaking cline” goes from one region to the other? For that level of detail, we should examine closely modern Uralic-speaking peoples and Circum-Arctic populations:
The real ancient Uralic cluster (drawn above in blue) is thus probably from a North-East European source (probably formed by Battle Axe / Fatyanovo-Balanovo / Abashevo) to the east into Siberian populations, and to the north into Laplandic populations (see below also on Mezhovska ancestry for the drawn ‘European cline’, which some may a priori wrongly assume to be quite late).
The fact that the three formed clines point to an admixture of CWC-related populations from North-Eastern Europe, and that variation is greater at the Palaeo-Laplandic and Palaeo-Siberian extremities compared to the CWC-related one, also supports this as the correct interpretation.
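The variation argument can be made concrete with a small sketch: given PC1/PC2 coordinates for the samples at each end of a cline, compare how widely each group scatters around its own centroid. Everything below is an assumption for illustration: the group labels, the coordinates (toy numbers, not taken from any dataset), and the choice of mean squared distance to the centroid as the measure of spread.

```python
from statistics import fmean

def dispersion(points):
    """Mean squared Euclidean distance of 2-D points to their centroid."""
    cx = fmean(p[0] for p in points)
    cy = fmean(p[1] for p in points)
    return fmean((x - cx) ** 2 + (y - cy) ** 2 for x, y in points)

# Toy PC1/PC2 coordinates, invented purely for illustration.
groups = {
    "CWC-related end": [(0.00, 0.00), (0.01, 0.00), (0.00, 0.01), (0.01, 0.01)],
    "Palaeo-Laplandic end": [(0.10, 0.20), (0.14, 0.26), (0.08, 0.30), (0.16, 0.22)],
    "Palaeo-Siberian end": [(0.30, 0.05), (0.38, 0.02), (0.33, 0.12), (0.41, 0.09)],
}

spread = {name: dispersion(pts) for name, pts in groups.items()}

# Print the groups from tightest to most scattered.
for name, value in sorted(spread.items(), key=lambda kv: kv[1]):
    print(f"{name}: {value:.5f}")
```

The toy numbers were chosen to reproduce the pattern described in the text, so the output demonstrates the calculation and not the result; with real data, the coordinates would come from the PCA itself.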
However, judging by the two main clines formed, one could be alternatively inclined to interpret that Palaeo-Laplandic and Palaeo-Siberian populations formed a huge ancestral “Uralic” ghost cluster in Siberia (spanning from the Palaeo-Laplandic to the Palaeo-Siberian one), and from there expanded Finno-Samic on one hand, and “Volga-Ugro-Samoyed” on the other. That poses different problems: an obvious linguistic and archaeological one – which I assume a lot of people do not really care about – , and a not-so-obvious genetic one (see below for ancient samples and for the expansion of haplogroup N).
Unlike this PCA with ancient samples, where Bell Beaker clines could be a rough approximation to the real sources for each population, and where a cluster spanning all three depicted Early Bronze Age clusters could give a rough proximate source of European Bell Beakers in Hungary (and where one can even distinguish the Y-DNA bottlenecks in the L23 trunk created by each cline), the PCA of modern Uralic populations is probably not suitable for a good estimate of the ancient situation, which may be found shifted up or down of the drawn “Uralic” cluster along East European groups.
After all, we already know that the Siberian cline shows probably as much an ancient admixture event – from the original Uralic expansion to the east with Corded Ware ancestry – as another more recent one – a westward migration of Siberian ancestry (or even more than one). While we know with more or less exactitude what happened with the Palaeo-Laplandic admixture by expanding Proto-Finno-Samic populations (see here), the Proto-Ugric and Pre-Samoyedic populations formed probably more than one cline during the different ancient migrations through central Asia.
Apparently, the Corded Ware expansion to the east was not marked by a huge change in ancestry. While the final version of Narasimhan et al. (2018) may show a little more detail about other forest-steppe Seima-Turbino/Andronovo-related migrations (and thus also Eastern Uralic peoples), we have already had enough information for quite some time to get a good idea.
Mezhovska’s position is similar to the later Pre-Scythian and Scythian populations. There are some interesting details: apart from haplogroup R1a-Z280 (CTS1211+), there is one R1b-M269 (PF6494+), probably Z2103, and an outlier (out of three) in a similar position to the recently described central/southern Scythian clusters.
NOTE. The finding of R1b-M269 in the forest-steppe is probably either 1) from an Afanasevo-Okunevo origin, or 2) from an admixture with neighbouring Andronovo-related populations, such as Sargary. A third, maybe less likely option is that this haplogroup admixed with Abashevo directly (as it happened in Sintashta, Potapovka, or Pokrovka) and formed part of early Uralic migrations. In any case, since Mezhovska is a Bronze Age society from the Urals region, its association with R1b-Z2103 – like the association of R1b-Z2103 in Scythian clusters – cannot be attributed to “Thracian peoples”, a link which is (as I already said) too simplistic.
The drawn “European cline” of Hungarians (see above), leading from ‘west-like’ Mansi to Hungarian populations – and hosting also Finnic and Estonian samples – , cannot therefore be attributed simply to late “Slavic/Balkan-like” admixture.
Karasuk – located further to the east – is basically also Corded Ware peoples showing clearly a recent admixture with local ANE / Baikal_EN-like populations. In terms of haplogroups it shows haplogroup Q, R1a-Z2124, and R1a-Z2123, later found among early Hungarians, and present also in ancient Samoyedic populations now acculturated.
The most interesting aspect of both Mezhovska and Karasuk is that they seem to diverge from a point close to Ukraine_Eneolithic, which is the supposed ancestral source of Corded Ware peoples (read more about the formation of “steppe ancestry”). This means that Eastern Uralians derive from a source closer to Middle Dnieper/Abashevo populations, rather than Battle Axe (shifted to Latvian Neolithic), which is more likely the source prevalent in Finno-Permic peoples.
Their initial admixture with (Palaeo-)Siberian populations is thus seen already starting by this time in Mezhovska and especially in Karasuk, but this process (compared to modern populations) is incomplete:
We know now that Samic peoples expanded during the Late Iron Age into Palaeo-Laplandic populations, admixing with them and creating this modern cline. Finns expanded later to the north (in one of their known genetic bottlenecks), admixing with (and displacing) the Saami in Finland, especially replacing their male lines.
So how did Ugric and Samoyedic peoples admix with Palaeo-Siberian populations further, to obtain their modern cline? The answer is, logically, with East Asian migrations related to forest-steppe populations of Central Asia after the Mezhovska and Karasuk periods, i.e. during the Iron Age and later. Other groups from the forest-steppe in Central Asia show similar East Asian (“Siberian”) admixture. We know this from Narasimhan et al. (2018):
(…) we observe samples from multiple sites dated to 1700-1500 BCE (Maitan, Kairan, Oy_Dzhaylau and Zevakinsikiy) that derive up to ~25% of their ancestry from a source related to present-day East Asians and the remainder from Steppe_MLBA. A similar ancestry profile became widespread in the region by the Late Bronze Age, as documented by our time transect from Zevakinsikiy and samples from many sites dating to 1500-1000 BCE, and was ubiquitous by the Scytho-Sarmatian period in the Iron Age.
Flegontov: Present-day Turkic speakers fall into two clusters of admixture patterns (Uralic/Yeniseian and Tungusic/Mongolic) based on genomic data, with ancient Turks belonging almost exclusively to the first cluster. #ISBA8
The Ugric-speaking Sargat culture in Western Siberia shows the expected mixture of haplogroups (ca. 500 BC – 500 AD), with 5 samples of hg N and 2 of hg R1a1, in Pilipenko et al. (2017). Although radiocarbon dates and subclades are lacking, N lineages probably spread late, because of the late and gradual admixture of Siberian cultures into the Sargat melting pot.
The observed reduction in the genetic distance between the Middle Tagar population and other Scythian-like populations of Southern Siberia (Fig 5; S4 Table), in our opinion, is primarily associated with an increase in the role of East Eurasian mtDNA lineages in the gene pool (up to nearly half of the gene pool) and a substantial increase in the joint frequency of haplogroups C and D (from 8.7% in the Early Tagar series to 37.5% in the Middle Tagar series). These features are characteristic of many ancient and modern populations of Southern Siberia and adjacent regions of Central Asia, including the Pazyryk population of the Altai Mountains.
Before the Iron Age, the Karasuk and Mezhovska populations were probably already somewhat ‘to the north’ within the ancient Steppe-Altai cline (see image below) created by expanding Seima-Turbino- and Andronovo-related populations. During the Iron Age, further Siberian contributions with Iranian expansions must have placed Uralians of the Central Asian forest-steppe areas much closer to today’s Palaeo-Siberian cline.
However, the modern genetic picture was probably fully developed only in historic times, when Samoyedic and Ugric languages expanded to the north, only in part admixing further with Palaeo-Siberian-speaking nomads from the Circum-Arctic region (see here for a recent history of Samoyedic Enets), which justifies their more recent radical ‘northern shift’.
This late acquisition of the language by Palaeo-Siberian nomads (without much population replacement) also justifies the wide PCA clusters of very small Siberian populations. See for example in the PCA from Tambets et al. (2018):
For their relationship with modern Mansi, we have information on Hungarian conqueror populations from Neparáczki et al. (2018):
Moreover, Y, B and N1a1a1a1a Hg-s have not been detected in Finno-Ugric populations [80–84], implying that the east Eurasian component of the Conquerors and Finno-Ugric people are probably not directly related. The same inference can be drawn from phylogenetic data, as only two Mansi samples appeared in our phylogenetic trees on the side branches (S1 Fig, Networks; 1, 4) suggesting that ancestors of the Mansis separated from Asian ancestors of the Conquerors a long time ago. This inference is also supported by genomic Admixture analysis of Siberian and Northeastern European populations, which revealed that Mansis received their eastern Siberian genetic component approximately 5–7 thousand years ago from ancestors of modern Even and Evenki people. Most likely the same explanation applies to the Y-chromosome N-Tat marker which originated from China [86,87] and its subclades are now widespread between various language groups of North Asia and Eastern Europe.
The genetic picture of Hungarians (the cline they form with Mansi, and their haplogroups) may be quite useful for inferring the admixture originally found in Mansi peoples at the beginning of the Iron Age. By now it is clear even from modern populations that Steppe_MLBA ancestry accompanied the Uralic expansion to the east (roughly approximated in the graphic with Afanasievo_EBA + Bichon_LP EasternHG_M):
A total of 286 samples of Uralic-speaking individuals, of those 121 genotyped in this study, were analysed in the context of 1514 Eurasian samples (including 14 samples published for the first time) based on whole genome single nucleotide polymorphisms (SNPs) (Additional file 1: Table S1). All these samples, together with the larger sample set of Uralic speakers, were characterized for mtDNA and chrY markers.
The question as which material cultures may have co-spread together with proto-Uralic and Uralic languages depends on the time estimates of the splits in the Uralic language tree. Deeper age estimates (6,000 BP) of the Uralic language tree suggest a connection between the spread of FU languages from the Volga River basin towards the Baltic Sea either with the expansion of the Neolithic culture of Combed Ware, e.g. [6, 7, 17, 26] or with the Neolithic Volosovo culture. Younger age estimates support a link between the westward dispersion of Proto-Finno-Saamic and eastward dispersion of Proto-Samoyedic with a BA Sejma-Turbino (ST) cultural complex [14, 18, 27, 28] that mediated the diffusion of specific metal tools and weapons from the Altai Mountains over the Urals to Northern Europe or with the Netted Ware culture, which succeeded Volosovo culture in the west. It has been suggested that Proto-Uralic may have even served as the lingua franca of the merchants involved in the ST phenomenon. All these scenarios imply that material culture of the Baltic Sea area in Europe was influenced by cultures spreading westward from the periphery of Europe and/or Siberia. Whether these dispersals involved the spread of both languages and people remains so far largely unknown.
The population structure of Uralic speakers
To contextualize the autosomal genetic diversity of Uralic speakers among other Eurasian populations (Additional file 1: Table S1), we first ran the principal component (PC) analysis (Fig. 2a, Additional file 3: Figure S1). The first two PCs (Fig. 2a, Additional file 3: Figure S1A) sketch the geography of the Eurasian populations along the East-West and North-South axes, respectively. The Uralic speakers, along with other populations speaking Slavic and Turkic languages, are scattered along the first PC axis in agreement with their geographic distribution (Figs. 1 and 2a) suggesting that geography is the main predictor of genetic affinity among the groups in the given area. Secondly, in support of this, we find that FST-distances between populations (Additional file 3: Figure S2) decay in correlation with geographical distance (Pearson’s r = 0.77, p < 0.0001). On the UPGMA tree based on these FST-distances (Fig. 2b), the Uralic speakers cluster into several different groups close to their geographic neighbours.
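To make the isolation-by-distance argument concrete, here is a minimal sketch of how a Pearson correlation between pairwise FST and geographic distance can be computed. The population names, coordinates and FST values below are invented placeholders, not data from Tambets et al. (2018), and using great-circle (haversine) distance is my assumption; the excerpt does not say how geographic distance was measured.

```python
import math
from itertools import combinations


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two points given in degrees."""
    earth_radius_km = 6371.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2
    return 2 * earth_radius_km * math.asin(math.sqrt(a))


def pearson_r(xs, ys):
    """Plain Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical sampling locations (latitude, longitude); placeholders only.
toy_coords = {
    "pop_A": (59.0, 25.0),
    "pop_B": (57.0, 53.0),
    "pop_C": (61.0, 69.0),
    "pop_D": (70.0, 90.0),
}

# Hypothetical pairwise FST values; placeholders only.
toy_fst = {
    ("pop_A", "pop_B"): 0.010,
    ("pop_A", "pop_C"): 0.030,
    ("pop_A", "pop_D"): 0.060,
    ("pop_B", "pop_C"): 0.015,
    ("pop_B", "pop_D"): 0.045,
    ("pop_C", "pop_D"): 0.020,
}

pairs = list(combinations(sorted(toy_coords), 2))
geo_km = [haversine_km(*toy_coords[a], *toy_coords[b]) for a, b in pairs]
fst = [toy_fst[pair] for pair in pairs]

r = pearson_r(geo_km, fst)
print(f"Pearson r between geographic distance and FST: {r:.2f}")
```

A positive r on real data is what the quoted passage reports; the sketch only shows the mechanics and says nothing about significance testing.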
We next used ADMIXTURE, which presents the individuals as composed of inferred genetic components in proportions that maximize Hardy-Weinberg and linkage equilibrium in the overall sample (see the ‘Methods’ section for choice of presented K). Overall, and specifically at lower values of K, the genetic makeup of Uralic speakers resembles that of their geographic neighbours. The Saami and (a subset of) the Mansi serve as exceptions to that pattern being more similar to geographically more distant populations (Fig. 3a, Additional file 3: S3). However, starting from K = 9, ADMIXTURE identifies a genetic component (k9, magenta in Fig. 3a, Additional file 3: S3), which is predominantly, although not exclusively, found in Uralic speakers. This component is also well visible on K = 10, which has the best cross-validation index among all tests (Additional file 3: S3B). The spatial distribution of this component (Fig. 3b) shows a frequency peak among Ob-Ugric and Samoyed speakers as well as among neighbouring Kets (Fig. 3a). The proportion of k9 decreases rapidly from West Siberia towards east, south and west, constituting on average 40% of the genetic ancestry of FU speakers in Volga-Ural region (VUR) and 20% in their Turkic-speaking neighbours (Bashkirs, Tatars, Chuvashes; Fig. 3a). The proportion of this component among the Saami in Northern Scandinavia is again similar to that of the VUR FU speakers, which is exceptional in the geographic context. It is also notable that North Russians, sampled from near the White Sea, differ from other Russians by sporting higher proportions of k9 (10–15%), which is similar to the values we observe in their Finnic-speaking neighbours. Notably, Estonians and Hungarians, who are geographically the westernmost Uralic speakers, virtually lack the k9 cluster membership.
We also tested the different demographic histories of female and male lineages by comparing outgroup f3 results for autosomal and X chromosome (chrX) data for pairs of populations (Estonians, Udmurts or Khanty vs others) with high versus low probability to share their patrilineal ancestry in chrY hg N (see the ‘Methods’ section, Additional file 3: Figure S13). We found a minor but significant excess of autosomal affinity relative to chrX for pairs of populations that showed a higher than 10% chance of two randomly sampled males across the two groups sharing their chrY ancestry in hg N3-M178, compared to pairs of populations where such probability is lower than 5% (Additional file 3: Figure S13).
In sum, these results suggest that most of the Uralic speakers may indeed share some level of genetic continuity via k9, which, however, also extends to the geographically close Turkic speakers.
We found that it is the admixture with the Siberians that makes the Western Uralic speakers different from the tested European populations (Additional file 3: Figure S4A-F, H, J, L). Differentiating between Estonians and Finns, the Siberians share more derived alleles with Finns, while the geographic neighbours of Estonians (and Finns) share more alleles with Estonians (Additional file 3: Figure S4M). Importantly, Estonians do not share more derived alleles with other Finnic, Saami, VUR FU or Ob-Ugric-speaking populations than Latvians (Additional file 3: Figure S4O). The difference between Estonians and Latvians is instead manifested through significantly higher levels of shared drift between Estonians and Siberians on the one hand and Latvians and their immediate geographic neighbours on the other hand. None of the Uralic speakers, including linguistically close Khanty and Mansi, show significantly closer affinities to the Hungarians than any non-FU population from NE Europe (Additional file 3: Figure S4R).
Time of Siberian admixture
The time depth of the Globetrotter (Fig. 5b) inferred admixture events is relatively recent, 500–1900 AD (see also complementary ALDER results, in Additional file 13: Table S12 and Additional file 3: Figure S7), and agrees broadly with the results reported in Busby et al. A more detailed examination of the ALDER dates, however, reveals an interesting pattern. The admixture events detected in the Baltic Sea region and VUR Uralic speakers are the oldest (800–900 AD or older) followed by those in VUR Turkic speakers (∼1200–1300 AD), while the admixture dates for most of the Siberian populations (>1500 AD) are the most recent (Additional file 3: Figure S7). The West Eurasian influx into West Siberia seen in modern genomes was thus very recent, while the East Eurasian influx into NE Europe seems to have taken place within the first millennium AD (Fig. 5b, Additional file 3: Figure S7).
Affinities of the Uralic speakers with ancient Eurasians
We next calculated outgroup f3-statistics to estimate the extent of shared genetic drift between modern and ancient Eurasians (Additional file 14: Table S13, Additional file 3: Figures S8-S9). Consistent with previous reports [45, 50], we find that the NE European populations including the Uralic speakers share more drift with any European Mesolithic hunter-gatherer group than Central or Western Europeans (Additional file 3: Figure S9A-C). Contrasting the genetic contribution of western hunter-gatherers (WHG) and eastern hunter-gatherers (EHG), we find that VUR Uralic speakers and the Saami share more drift with EHG. Conversely, WHG shares more drift with the Finnic and West European populations (Additional file 3: Figure S9A). Interestingly, we see a similar pattern of excess of shared drift between VUR and EHG if we substitute WHG with the aDNA sample from the Yamnaya culture (Additional file 3: Figure S9D). As reported before [2, 45], the genetic contribution of European early farmers decreases along an axis from Southern Europe towards the Ural Mountains (Fig. 6, Additional file 3: Figure S9E-F).
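For readers unfamiliar with the statistic, here is a minimal sketch of what an outgroup f3 measures: the average product of allele-frequency differences between an outgroup and two test populations, which grows with the drift the two populations share after splitting from the outgroup. The function name and the toy frequencies are mine, not from the paper, and this naive estimator leaves out the finite-sample correction and block-jackknife standard errors that dedicated software such as ADMIXTOOLS applies.

```python
def outgroup_f3(outgroup, pop_a, pop_b):
    """Naive f3(Outgroup; A, B): mean over SNPs of (o - a) * (o - b).

    Each argument is a list of allele frequencies (0..1), one per SNP,
    all for the same allele and in the same SNP order.
    """
    if not outgroup or not (len(outgroup) == len(pop_a) == len(pop_b)):
        raise ValueError("need equal-length, non-empty frequency lists")
    total = sum((o - a) * (o - b) for o, a, b in zip(outgroup, pop_a, pop_b))
    return total / len(outgroup)


# Hypothetical allele frequencies at four SNPs; illustration only.
toy_outgroup = [0.50, 0.50, 0.50, 0.50]
toy_a = [0.60, 0.40, 0.70, 0.30]
toy_b = [0.70, 0.30, 0.60, 0.45]  # deviates from the outgroup in the same direction as A
toy_c = [0.55, 0.45, 0.45, 0.50]  # deviations only partly aligned with A

shared_ab = outgroup_f3(toy_outgroup, toy_a, toy_b)
shared_ac = outgroup_f3(toy_outgroup, toy_a, toy_c)
print(f"f3(O; A, B) = {shared_ab:.4f}")
print(f"f3(O; A, C) = {shared_ac:.4f}")
```

In this toy setup A shares more drift with B than with C, which is the kind of comparison the quoted passage makes between modern populations and WHG, EHG or Yamnaya samples.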
We then used the qpGraph software to test alternative demographic scenarios by trying to fit the genetic diversity observed in a range of the extant Finno-Ugric populations through a model involving the four basic European ancestral components: WHG, EHG, early farmers (LBK), steppe people of Yamnaya/Corded Ware culture (CWC) and a Siberian component (Fig. 6, Additional file 3: Figure S10). We chose the modern Nganasans to serve as a proxy for the latter component because we see least evidence for Western Eurasian admixture (Additional file 3: Figure S3) among them. We also tested the Khantys for that proxy but the model did not fit (yielding f2-statistics, Z-score > 3). The only Uralic-speaking population that did not fit into the tested model with five ancestral components were Hungarians. The qpGraph estimates of the contributions from the Siberian component show that it is the main ancestry component in the West Siberian Uralic speakers and constitutes up to one third of the genomes of modern VUR and the Saami (Fig. 6). It drops, however, to less than 10% in most of NE Europe, to 5% in Estonians and close to zero in Latvians and Lithuanians.
One of the notable observations that stands out in the fineSTRUCTURE analysis is that neither Hungarians nor Estonians or Mordovians form genetic clusters with other Uralic speakers but instead do so with a broad spectrum of geographically adjacent samples. Despite the documented history of the migration of Magyars and their linguistic affinity to Khantys and Mansis, who today live east of the Ural Mountains, there is nothing in the present-day gene pool of the sampled Hungarians that we could tie specifically to other Uralic speakers.
Perhaps even more surprisingly, we found that Estonians, who show close affinities in IBD analysis to neighbouring Finnic speakers and Saami, do not share an excess of IBD segments with the VUR or Siberian Uralic speakers. This is even more striking considering that the immediate neighbours (Finns, Vepsians and Karelians) do.
In this context, it is important to remind that the limited (5%, Fig. 6) East Eurasian impact in the autosomal gene pool of modern Estonians contrasts with the fact that more than 30% of Estonian (but not Hungarian) men carry chrY N3 that has an East Eurasian origin and is very frequent among NE European Uralic speakers. However, the spread of chrY hg N3 is not language group specific as it shows similar frequencies in Baltic-speaking Latvians and Lithuanians, and in North Russians, who in all our analyses are very similar to Finnic-speakers. The latter, however, are believed to have either significantly admixed with their Uralic-speaking neighbours or have undergone a language shift from Uralic to Indo-European.
With some exceptions such as Estonians, Hungarians and Mordovians, both IBD sharing and Globetrotter results suggest that there are detectable inter-regional haplotype sharing ties between Uralic speakers from West Siberia and VUR, and between NE European Uralic speakers and VUR. In other words, there is a fragmented pattern of haplotype sharing between populations but no unifying signal of sharing that unite all the studied Uralic speakers.
The paper is obviously trying to find a “N1c/Siberian ancestry = Uralic” link, but it shows (like previous papers using ancient DNA) that this identification is impossible, because it is not possible to identify “N1c=Siberian ancestry”, “N1c=Uralic”, or “Siberian ancestry = Uralic”. In fact, the arrivals of N subclades and of Siberian ancestry are both late; the two events (probably multiple stepped events) are unrelated to each other, and represent east-west demic diffusion waves (as well as founder effects) that probably coincide in part with the Scythian and Turkic (or associated) expansions, i.e. too late for any model of Proto-Uralic or Proto-Finno-Ugric expansion.
On the other hand, it shows interesting data regarding ancestry of populations that show increased Siberian influence, such as those easternmost groups admixed with Yeniseian-like populations (Samoyedic), those showing strong founder effects (Finnic), or those isolated in the Circum-Arctic region with neighbouring Siberian peoples in Kola (Saami). All in all, Hungarians, Estonians and Mordovians seem to show the original situation better than the other groups, which is also reflected in part in Y-DNA, conserved as a majority of R1a lineages precisely in these groups. Just another reminder that CWC-related ancestry is found in every single Uralic group, and that it represents the main ancestral component in all non-Samoyedic groups.
The qpGraph shows the ancestor of Yamna (likely Khvalynsk) and Corded Ware stemming as different populations from a common (likely Neolithic) node, their difference being based on the proportion of Anatolian-related ancestry, that is, probably before the Indo-Hittite expansion; it ends with CWC groups forming the base for all Uralic peoples. Below is a detail of the qpGraph on the left, and my old guess (2017) on the right, for comparison:
#EDIT (22 sep 2018): I enjoyed re-reading it, and found this particular paragraph funny:
Despite the documented history of the migration of Magyars and their linguistic affinity to Khantys and Mansis, who today live east of the Ural Mountains, there is nothing in the present-day gene pool of the sampled Hungarians that we could tie specifically to other Uralic speakers.
The positions of non-Tagar Iron Age groups in the MDS plot were correlated with their geographic position within the Eurasian steppe belt and with frequencies of Western and Eastern Eurasian mtDNA lineages in their gene pools. Series from chronological Tagar stages (similar to the overall Tagar series) were located within the genetic variability (in terms of mtDNA) of Scythian World nomadic groups (Figs 5 and 6; S4 and S6 Tables). Specifically, the Early Tagar series was more similar to western nomads (North Pontic Scythians), while the Middle Tagar was more similar to the Southern Siberian populations of the Scythian period. The Late Tagar group (Tes` culture) belonging to the Early Xiongnu period had the “western-most” location on the MDS plot with the maximal genetic difference from Xiongnu and other eastern nomadic groups (but see Discussion concerning the low sample size for the Tes` series).
In a comparison of our Tagar series with modern populations in Eurasia, we detected similarity between the Tagar group and some modern Turkic-speaking populations (with the exception of the Indo-Iranian Tajik population) (Fig 7; S2 Table). Among the modern Turkic-speaking groups, populations from the western part of the Eurasian steppe belt, such as Bashkirs from the Volga-Ural region and Siberian Tatars from the West Siberian forest-steppe zone, were more similar to the Tagar group than modern Turkic-speaking populations of the Altay-Sayan mountain system (including the Khakassians from the Minusinsk basin) (Fig 7).
Mitochondrial DNA diversity and genetic relationships of the Tagar population
Our results are not inconsistent with the assumption of a probable role of gene flow due to the migration from Western Eurasia to the Minusinsk basin in the Bronze Age in the formation of the genetic composition of the Tagar population. Particularly, we detected many mtDNA lineages/clusters with probable West Eurasian origin that were dominant in modern populations of different parts of Europe, Caucasus, and the Near East (such as K and HV6) in our Tagar series based on a phylogeographic analysis.
We detected relatively low genetic distances between our Tagar population and two Bronze Age populations from the Minusinsk basin—the Okunevo culture population (pre-Andronovo Bronze Age) and Andronovo culture population, followed by Afanasievo population from the Minusinsk Basin and Middle Bronze Age population from the Mongolian Altai Mountains (the region adjacent to the Minusinsk basin) (Figs 3 and 6; S3 and S5 Tables). Among West Eurasian part of our Tagar series we also observed haplogroups/sub-haplogroups and haplotypes shared with Early and Middle Bronze Age populations from Minusinsk Basin and western part of Eurasian steppe belt (Fig 4; S5 Table). Thus, our results suggested a potentially significant role of the genetic components, introduced by migrants from Western Eurasia during the Bronze Age, in the formation of the genetic composition of the Tagar population. It is necessary to note the relatively small size of available mtDNA samples from the Bronze Age populations of Minusinsk basin; accordingly, additional mtDNA data for these populations are required to further confirm our inference.
Another substantial part of the mtDNA pool of the Tagar and other eastern populations of the Scythian World is typical of populations in Southern Siberia and adjacent regions of Central Asia (autochthonous Central Asian mtDNA clusters). Most of these components belong to the East Eurasian cluster of mtDNA haplogroups. Moreover, the role of each of these components in the formation of the genetic composition of subsequent (to the present) populations in South Siberia and Central Asia could be very different. In this regard, cluster C4a2a (and its subcluster C4a2a1), and haplogroup A8 are of particular interest.
Genetic features of successive Tagar groups
We compared successive Tagar groups (Early, Middle, and Late Tagar) with each other and with other Iron Age nomadic populations to evaluate changes in the mtDNA pool structure. Despite the genetic similarity between the Early and Middle Tagar series and Scythian World nomadic groups (Figs 5 and 6; S4 and S6 Tables), there were some peculiarities. For example, the Early Tagar series was more similar to North Pontic Classic Scythians, while the Middle Tagar samples were more similar to the Southern Siberian populations of the Scythian period (i.e., completely synchronous populations of regions neighboring the Minusinsk basin, such as the Pazyryk population from the Altay Mountains and Aldy-Bel population from Tuva).
We observed differences in the mtDNA pool structure between the Early and the Middle chronological stages of the Tagar culture population, as evidenced by the change in the ratio of Western to Eastern Eurasian mtDNA components. The contribution of Eastern Eurasian lineages increased from about one-third (34.8%) in the Early Tagar group to almost one-half (45.8%) in the Middle Tagar group.
At the level of mtDNA haplogroups, we detected a decrease in the diversity of phylogenetic clusters during the transition from the Early Tagar to the Middle Tagar. This decline in diversity equally affected the West Eurasian and East Eurasian components of the Tagar mtDNA pool. It should be noted that this decrease can be partially explained by the smaller number of Middle Tagar than Early Tagar samples. Under a simple binomial approximation the mtDNA clusters, observed at frequencies of 6.3% and 11.7%, could be lost by chance in our Early (N = 46) and Middle (N = 24) Tagar samples, respectively. However, the simultaneous lack of several such clusters, with a total frequency in the gene pool of the Early group of 34.8%, is unlikely.
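The binomial argument in this paragraph is easy to check. A cluster at frequency f is absent from a sample of N individuals with probability (1 - f)^N, so the smallest frequency that is still missed with a given probability is f = 1 - p^(1/N). The sketch below assumes a 5% miss probability; that threshold is my assumption, not stated in the excerpt, although it does reproduce the quoted 6.3% and 11.7%. The function names are mine.

```python
def prob_missing(freq, n_samples):
    """Probability that a lineage at population frequency `freq`
    is absent from a random sample of `n_samples` individuals."""
    return (1.0 - freq) ** n_samples


def max_frequency_missed(n_samples, miss_prob=0.05):
    """Highest population frequency that would still go unsampled with
    probability `miss_prob` (assumed 5% here) in `n_samples` individuals."""
    return 1.0 - miss_prob ** (1.0 / n_samples)


for label, n in (("Early Tagar", 46), ("Middle Tagar", 24)):
    threshold = max_frequency_missed(n)
    print(f"{label} (N = {n}): clusters below {100 * threshold:.1f}% may be lost by chance")

# Simplification of the authors' last point: treat the clusters summing to
# 34.8% in the Early group as a single class and ask how likely it is that
# none of them appears among 24 Middle Tagar samples.
joint_absence = prob_missing(0.348, 24)
print(f"Chance of missing a 34.8% class in 24 samples: {joint_absence:.2e}")
```

Under these assumptions the sample-size effect can explain the loss of individual rare clusters, but not the simultaneous absence of clusters that together made up about a third of the earlier gene pool, which is the point the authors make.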
The observed reduction in the genetic distance between the Middle Tagar population and other Scythian-like populations of Southern Siberia (Fig 5; S4 Table), in our opinion, is primarily associated with an increase in the role of East Eurasian mtDNA lineages in the gene pool (up to nearly half of the gene pool) and a substantial increase in the joint frequency of haplogroups C and D (from 8.7% in the Early Tagar series to 37.5% in the Middle Tagar series). These features are characteristic of many ancient and modern populations of Southern Siberia and adjacent regions of Central Asia, including the Pazyryk population of the Altai Mountains. We did not obtain strong evidence for an intensification of genetic contact between the population of the Minusinsk basin and the Altai Mountains in the Middle Tagar period compared with the Early Tagar period, although several archaeologists have found evidence for the intensification of contact at the level of material culture, namely, a cultural influence of the population of the Altai Mountains (represented by the Pazyryk population) on the population of the Minusinsk basin (the Saragash Tagar group) [6, 71, 72].
Another important issue is the change in the genetic structure of the Tagar population during the transition from the Middle (Saragash) to the Late (Tes`) stage. The Late Tagar stage refers to the Xiongnu period. Many archaeologists suggest that the formation of the Tes` stage involved the direct cultural influence of the Xiongnu and/or related groups of nomads from more eastern regions of Central Asia [71, 73]. Some archaeologists have even suggested renaming the Tes` stage as the Tes` culture, emphasizing the role of new eastern cultural elements. If this influence also existed at the genetic level, then we would expect to observe new genetic elements in the Tes` gene pool, particularly those of East Eurasian origin.
Just a reminder of the recent session in ISBA 8 on expanding Scythians (and also Mongolians and Turks) spreading Siberian ancestry, usually (wrongly) identified as “Uralic-Yeniseian” based on modern populations (similar to how steppe ancestry is wrongly identified as “Indo-European”), see the following graphic including the Tagar population:
And also the poster by Alexander M. Kim et al. Yeniseian hypotheses in light of genome-wide ancient DNA from historical Siberia:
The relevance of ancient DNA data to debates in historical linguistics is an emphatic strand in much recent work on the archaeogenetics of Eurasia, where the discussion has focused heavily on Indo-European (Haak et al. 2015; Narasimhan et al. 2018; de Barros Damgaard et al. 2018a,b). We present new genome-wide ancient DNA data from a historical Siberian individual in relation to Yeniseian, an isolated language “microfamily” (Vajda 2014) that nonetheless sits at the center of numerous controversial proposals in historical linguistics and cultural interaction. Yeniseian’s sole surviving representative is Ket, a critically endangered language fluently spoken by only a few dozen individuals near the Middle Yenisei River of Central Siberia.
In strong contrast to the present-day picture, river names and argued substrate influences and loanwords in languages outside the current range of Yeniseian, as well as direct records from the Russian colonial period, indicate that speakers of extinct Yeniseian languages had a formerly much broader presence in the taiga of Central Siberia as well as further south in the mountainous Altai-Sayan region – and perhaps even further afield in Inner Asia (Vajda 2010; Gorbachov 2017; Blažek 2016). The consilience of these proposals with genetic data is not straightforward (Flegontov et al. 2015, 2017) and faces a major obstacle in the lack of genetic information from verifiable speakers of Yeniseian languages other than the Kets, who have had complex ongoing interactions with speakers of non-Yeniseian languages such as the Samoyedic Selkups. We attempt to remedy this with new historical Siberian aDNA data, orienting our search for common denominators and systematic difference in a broader landscape of concordance, discordance, and uncertainty at the interface of diachronic linguistics and genetics.
Exploring the genomic impact of colonization in north-eastern Siberia, by Seguin-Orlando et al.
Yakutia is the coldest region in the northern hemisphere, with winter record temperatures below minus 70°C. The ability of Yakut people to adapt both culturally and biologically to extremely cold temperatures has been key to their subsistence. They are believed to descend from an ancestral population, which left its original homeland in the Lake Baykal area following the Mongol expansion between the 13th and 15th centuries AD. They originally developed a semi-nomadic lifestyle, based on horse and cattle breeding, providing transportation, primary clothing material, meat, and milk. The early colonization by Russians in the first half of the 17th century AD, and their further expansion, have massively impacted indigenous populations. It led not only to massive epidemiological outbreaks, but also to an important dietary shift increasingly relying on carbohydrate-rich resources, and a profound lifestyle transition with the gradual conversion from Shamanism to Christianity and the establishment of new marriage customs. Leveraging an exceptional archaeological collection of more than a hundred of bodies excavated by MAFSO (Mission Archéologique Française en Sibérie Orientale) over the last 15 years and naturally kept frozen by the extreme cold temperatures of Yakutia, we have started to characterize the (epi)genome of indigenous individuals who lived from the 16th to the 20th century AD. Current data include the genome sequence of approximately 50 individuals that lived prior to and after Russian contact, at a coverage from 2 to 40 fold. Combined with data from archaeology and physical anthropology, as well as microbial DNA preserved in the specimens, our unique dataset is aimed at assessing the biological consequences of the social and biological changes undergone by the Yakut people following their neolithisation by Russian colons.
Clio Der Sarkissian: Age, sex, geography and parental relatedness are not factors which influence oral microbial diversity in 124 individuals from 17thC Siberia #ISBA8
preliminary conclusions: no detectable impact of Russian colonization on Yakut oral microbiome diversity despite dietary and other societal changes (but perhaps calculus not adequately sensitive) pic.twitter.com/oO2OjqIHKg
Ancient DNA from a Medieval trading centre in Northern Finland
Using ancient DNA to identify the ancestry of individuals from a Medieval trading centre in Northern Finland, by Simoes et al.
Analyzing genomic information from archaeological human remains has proved to be a powerful approach to understand human history. For the archaeological site of Ii Hamina, ancient DNA can be used to infer the ancestries of individuals buried there. Situated approximately 30 km from Oulu, in Northern Finland, Ii Hamina was an important trade place since Medieval times. The historical context indicates that the site could have been a melting pot for different cultures and people of diversified genetic backgrounds. Archaeological and osteological evidence from different individuals suggest a rich diversity. For example, stable isotope analyses indicate that freshwater and marine fish was the dominant protein source for this population. However, one individual proved to be an outlier, with a diet containing relatively more terrestrial meat or vegetables. The variety of artefacts that was found associated with several human remains also points to potential differences in religious beliefs or social status. In this study, we aimed to investigate if such variation could be attributed to different genetic ancestries. Ten of the individuals buried in Ii Hamina’s churchyard, dating to between the 15th and 17th century AD, were screened for presence of authentic ancient DNA. We retrieved genome-wide data for six of the individuals and performed downstream analysis. Data authenticity was confirmed by DNA damage patterns and low estimates of mitochondrial contamination. The relatively recent age of these human remains allows for a direct comparison to modern populations. A combination of population genetics methods was undertaken to characterize their genetic structure, and identify potential familial relationships. We found a high diversity of mitochondrial lineages at the site. In spite of the putatively distant origin of some of the artifacts, most individuals shared a higher affinity to the present-day Finnish or Late Settlement Finnish populations.
Interestingly, different methods consistently suggested that the individual with outlier isotopic values had a different genetic origin, being more closely related to reindeer herding Saami. Here we show how data from different sources, such as stable isotopes, can be intersected with ancient DNA in order to get a more comprehensive understanding of the human past.
A closer look at the bottom left corner of the poster (the left columns are probably the new samples):
Plant resources processed in HG pottery from the Upper Volga
Multiple criteria for the detection of plant resources processed in hunter-gatherer pottery vessels from the Upper Volga, Russia, by Bondetti et al.
In Northern Eurasia, the Neolithic is marked by the adoption of pottery by hunter-gatherer communities. The degree to which this is related to wider social and lifestyle changes is subject to ongoing debate and the focus of a new research programme. The use and function of early pottery by pre-agricultural societies during the 7th-5th millennia BC is of central interest to this debate. Organic residue analysis provides important information about pottery use. This approach relies on the identification and isotopic characteristics of lipid biomarkers, absorbed into the pores of the ceramic or charred deposits adhering to pottery vessel surfaces, using a combined methodology, namely GC-MS, GC-c-IRMS and EA-IRMS. However, while animal products (e.g., marine, freshwater, ruminant, porcine) have the benefit of being lipid-rich and well-characterised at the molecular and isotopic level, the identification of plant resources still suffers from a lack of specific criteria for identification. In hunter-gatherer contexts this problem is exacerbated by the wide range of wild, foraged plant resources that may have been potentially exploited. Here we evaluate approaches for the characterisation of terrestrial plant food in pottery through the study of pottery assemblages from Zamostje 2 and Sakhtysh 2a, two hunter-gatherer settlements located in the Upper Volga region of Russia.
GC-MS analysis of the lipids, extracted from the ceramics and charred residues by acidified methanol, suggests that pottery use was primarily oriented towards terrestrial and aquatic animal products. However, while many of the Early Neolithic vessels contain lipids distinctive of freshwater resources, triterpenoids are also present in high abundance suggesting mixing with plant products. When considering the isotopic criteria, we suggest that plants were a major commodity processed in pottery at this time. This is supported by the microscopic identification of Viburnum (Viburnum opulus L.) berries in the charred deposits on several vessels from Zamostje.
The study of Upper Volga pottery demonstrated the importance of using a multidisciplinary approach to determine the presence of plant resources in vessels. Furthermore, this informs the selection of samples, often subject to freshwater reservoir effects, for 14C dating.
Bronze Age population dynamics and the rise of dairy pastoralism on the eastern Eurasian steppe
Bronze Age population dynamics and the rise of dairy pastoralism on the eastern Eurasian steppe, by Warinner et al.
Recent paleogenomic studies have shown that migrations of Western steppe herders (WSH), beginning in the Eneolithic (ca. 3300-2700 BCE), profoundly transformed the genes and cultures of Europe and Central Asia. Compared to Europe, the eastern extent of this WSH expansion is not well defined. Here we present genomic and proteomic data from 22 directly dated Bronze Age khirigsuur burials from Khövsgöl, Mongolia (ca. 1380-975 BCE). Only one individual showed evidence of WSH ancestry, despite the presence of WSH populations in the nearby Altai-Sayan region for more than a millennium. At the same time, LC-MS/MS analysis of dental calculus provides direct protein evidence of milk consumption from Western domesticated livestock in 7 of 9 individuals. Our results show that dairy pastoralism was adopted by Bronze Age Mongolians despite minimal genetic exchange with Western steppe herders.
Comments on ancestry of the Deer Stone-Khirigsuur ancestry; one “eastern” outlier and a (late) “western” outlier – but in the main only low (2-7%) levels of western admixture (of “Sintashta” and not “Afanasievo” type) pic.twitter.com/9E3jCQKTlm
Tracing the origin and expansion of the Turkic and Hunnic confederations, by Flegontov et al.
Turkic-speaking populations, now spread over a vast area in Asia, are highly heterogeneous genetically. The first confederation unequivocally attributed to them was established by the Göktürks in the 6th c. CE. Notwithstanding written resources from neighboring sedentary societies such as Chinese, Persian, Indian and Eastern Roman, earlier history of the Turkic speakers remains debatable, including their potential connections to the Xiongnu and Huns, which dominated the Eurasian steppe in the first half of the 1st millennium CE. To answer these questions, we co-analyzed newly generated human genome-wide data from Central Asia (the 1240K panel), spanning the period from ca. 3000 to 500 YBP, and the data published by de Barros Damgaard et al. (137 ancient human genomes from across the Eurasian steppes, Nature, 2018). Firstly, we generated a PCA projection to understand genetic affinities of ancient individuals with respect to present-day Tungusic, Mongolic, Turkic, Uralic, and Yeniseian-speaking groups. Secondly, we modeled hundreds of present-day and few ancient Turkic individuals using the qpAdm tool, testing various modern/ancient Siberian and ancient West Eurasian proxies for ancestry sources.
A majority of Turkic speakers in Central Asia, Siberia and further to the west share the same ancestry profile, being a mixture of Tungusic or Mongolic speakers and genetically West Eurasian populations of Central Asia in the early 1st millennium CE. The latter are themselves modelled as a mixture of Iron Age nomads (western Scythians or Sarmatians) and ancient Caucasians or Iranian farmers. For some Turkic groups in the Urals and the Altai regions and in the Volga basin, a different admixture model fits the data: the same West Eurasian source + Uralic- or Yeniseian-speaking Siberians. Thus, we have revealed an admixture cline between Scythians and the Iranian farmer genetic cluster, and two further clines connecting the former cline to distinct ancestry sources in Siberia. Interestingly, few Wusun-period individuals harbor substantial Uralic/Yeniseian-related Siberian ancestry, in contrast to preceding Scythians and later Turkic groups characterized by the Tungusic/Mongolic-related ancestry. It remains to be elucidated whether this genetic influx reflects contacts with the Xiongnu confederacy. We are currently assembling a collection of samples across the Eurasian steppe for a detailed genetic investigation of the Hunnic confederacies.
Flegontov: Present day Turkic speakers fall into two clusters of admixture patterns (Uralic/Yeniseian and Tungusic/Mongolic) based on genomic data with ancient Turks belonging almost exclusively to the first cluster. #ISBA8
New, interesting information on the gradual arrival of the “Uralic-Yeniseian” (Siberian) ancestry in eastern Europe with Iranian and Turkic-speaking peoples. We already knew that Siberian ancestry shows no original relationship with Uralic-speaking peoples, so finding yet more groups that spread this ancestry westwards across North Eurasia should come as no surprise to anyone at this point.
Central Asia and Indo-Iranian
The session The Genomic Formation of South and Central Asia, by David Reich, on the recent paper by Narasimhan et al. (2018).
Ancient DNA and the peopling of the British Isles – pattern and process of the Neolithic transition, by Brace et al.
Over recent years, DNA projects on ancient humans have flourished and large genomic-scale datasets have been generated from across the globe. Here, the focus will be on the British Isles and applying aDNA to address the relative roles of migration, admixture and acculturation, with a specific focus on the transition from a Mesolithic hunter-gatherer society to the Neolithic and farming. Neolithic cultures first appear in Britain ca. 6000 years ago (kBP), a millennium after they appear in adjacent areas of northwestern continental Europe. However, in Britain, at the margins of the expansion the pattern and process of the British Neolithic transition remains unclear. To examine this we present genome-wide data from British Mesolithic and Neolithic individuals spanning the Neolithic transition. These data indicate population continuity through the British Mesolithic but discontinuity after the Neolithic transition, c.6000 BP. These results provide overwhelming support for agriculture being introduced to Britain primarily by incoming continental farmers, with surprisingly little evidence for local admixture. We find genetic affinity between British and Iberian Neolithic populations indicating that British Neolithic people derived much of their ancestry from Anatolian farmers who originally followed the Mediterranean route of dispersal and likely entered Britain from northwestern mainland Europe.
MN Atlantic / Megalithic cultures
Genomics of Middle Neolithic farmers at the fringe of Europe, by Sánchez Quinto et al.
Agriculture emerged in the Fertile Crescent around 11,000 years before present (BP) and then spread, reaching central Europe some 7,500 years ago (ya.) and eventually Scandinavia by 6,000 ya. Recent paleogenomic studies have shown that the spread of agriculture from the Fertile Crescent into Europe was due mainly to a demic process. Such event reshaped the genetic makeup of European populations since incoming farmers displaced and admixed with local hunter-gatherers. The Middle Neolithic period in Europe is characterized by such interaction, and this is a time where a resurgence of hunter-gatherer ancestry has been documented. While most research has been focused on the genetic origin and admixture dynamics with hunter-gatherers of farmers from Central Europe, the Iberian Peninsula, and Anatolia, data from farmers at the North-Western edges of Europe remains scarce. Here, we investigate genetic data from the Middle Neolithic from Ireland, Scotland, and Scandinavia and compare it to genomic data from hunter-gatherers, Early and Middle Neolithic farmers across Europe. We note affinities between the British Isles and Iberia, confirming previous reports. However, we add on to this subject by suggesting a regional origin for the Iberian farmers that putatively migrated to the British Isles. Moreover, we note some indications of particular interactions between Middle Neolithic Farmers of the British Isles and Scandinavia. Finally, our data together with that of previous publications allow us to achieve a better understanding of the interactions between farmers and hunter-gatherers at the northwestern fringe of Europe.
Central European Bronze Age
Ancient genomes from the Lech Valley, Bavaria, suggest socially stratified households in the European Bronze Age, by Mittnik et al.
Archaeogenetic research has so far focused on supra-regional and long-term genetic developments in Central Europe, especially during the third millennium BC. However, detailed high-resolution studies of population dynamics in a microregional context can provide valuable insights into the social structure of prehistoric societies and the modes of cultural transition.
Here, we present the genomic analysis of 102 individuals from the Lech valley in southern Bavaria, Germany, which offers ideal conditions for such a study. Several burial sites containing rich archaeological material were directly dated to the second half of the 3rd and first half of the 2nd millennium BCE and were associated with the Final Neolithic Bell Beaker Complex and the Early and Middle Bronze Age. Strontium isotope data show that the inhabitants followed a strictly patrilocal residential system. We demonstrate the impact of the population movement that originated in the Pontic-Caspian steppe in the 3rd millennium BCE and subsequent local developments. Utilising relatedness inference methods developed for low-coverage modern DNA we reconstruct farmstead related pedigrees and find a strong association between relatedness and grave goods suggesting that social status is passed down within families. The co-presence of biologically related and unrelated individuals in every farmstead implies a socially stratified complex household in the Central European Bronze Age.
Alissa Mittnik of @MPI_SHH with a talk that heralds a new era of studying archaeological sites: using high resolution ancient DNA to reconstruct relatedness patterns—her results reveal patrilocality in Late Neolithic and Bronze Age Central Europe #isba8
Gene geography of the Russian Far East populations – faces, genome-wide profiles, and Y-chromosomes, by Balanovsky et al.
Russian Far East is not only a remote area of Eurasia but also a link of the chain of Pacific coast regions, spanning from East Asia to Americas, and many prehistoric migrations are known along this chain. The Russian Far East is populated by numerous indigenous groups, speaking Tungusic, Turkic, Chukotko-Kamchatka, Eskimo-Aleut, and isolated languages. This linguistic and geographic variation opens question about the patterns of genetic variation in the region, which was significantly undersampled and received minor attention in the genetic literature to date. To fill in this gap we sampled Aleuts, Evenks, Evens, Itelmens, Kamchadals, Koryaks, Nanais, Negidals, Nivkhs, Orochi, Udegeis, Ulchi, and Yakuts. We also collected the demographic information of local populations, took physical anthropological photos, and measured the skin color. The photos resulted in the “synthetic portraits” of many studied groups, visualizing the main features of their faces.
Finland AD 5th-8th c.
Sadly, no information will be shared on the session A 1400-year transect of ancient DNA reveals recent genetic changes in the Finnish population, by Salmela et al. We will have to stick to the abstract:
Objectives: Our objective was to use aDNA to study the population history of Finland. For this aim, we sampled and sequenced 35 individuals from ten archaeological sites across southern Finland, representing a time transect from 5th to 18th century.
Methods: Following genomic DNA extraction and preparation of indexed libraries, the samples were enriched for 1.2 million genome-wide SNPs using in-solution capture and sequenced on an Illumina HiSeq 4000 instrument. The sequence data were then compared to other ancient populations as well as modern Finns, their geographical neighbors and worldwide populations. Authenticity testing of the data as well as population history inference were based on standard computational methods for aDNA, such as principal component analysis and F statistics.
Results: Despite the relatively limited temporal depth of our sample set, we are able to see major genetic changes in the area, from the earliest sampled individuals – who closely resemble the present-day Saami population residing markedly further north – to the more recent ancient individuals who show increased affinity to the neighboring Circum-Baltic populations. Furthermore, the transition to the present-day population seems to involve yet another perturbation of the gene pool.
So, in my opinion (although Y-DNA will possibly not be reported), Finns in the Classical Antiquity period were most likely mostly R1a with secondary N1c in the Circum-Baltic region (similar to modern Estonians, as I wrote recently), while Saami in southern Finland were probably mostly a mix of R1a-Z282 and I1. That is what the first transition after the 5th c. probably reflects: the spread of Finns (with mainly N1c lineages) to the north. The more recent transition probably shows the introduction of North Germanic ancestry (and thus also R1b-U106, R1a-Z284, and I1 lineages) in the west.
Dairying in ancient Mongolia
The History of Dairying in ancient Mongolia, by Wilkin et al.
The use of mass spectrometry based proteomics presents a novel method for investigating human dietary intake and subsistence strategies from archaeological materials. Studies of ancient proteins extracted from dental calculus, as well as other archaeological material, have robustly identified both animal and plant-based dietary components. Here we present a recent case study using shotgun proteomics to explore the range and diversity of dairying in the ancient eastern Eurasian steppe. Contemporary and prehistoric Mongolian populations are highly mobile and the ephemerality of temporarily occupied sites, combined with the severe wind deflation common across the steppes, means detecting evidence of subsistence can be challenging. To examine the time depth and geographic range of dairy use in Mongolia, proteins were extracted from ancient dental calculus from 32 individuals spanning burial sites across the country between the Neolithic and Mongol Empire. Our results provide direct evidence of early ruminant milk consumption across multiple time periods, as well as a dramatic increase in the consumption of horse milk in the late Bronze Age. These data provide evidence that dairy foods from multiple species were a key part of subsistence strategies in prehistoric Mongolia and add to our understanding of the importance of early pastoralism across the steppe.
Hypothesis: dairy pastoralism extends into Late #BronzeAge – calculus samples from 31 individuals 3000 BC – AD 1400 – shotgun proteomics; liquid chromatography–mass spectrometry – BLG peptides differentiate ruminant and equine milk, caprine-specific markers
The confirmation of the date 3000-2700 BC for dairying in the eastern steppe further supports what was already known from archaeological remains: that the pastoralist subsistence economy was first brought to the Altai region by expanding late Khvalynsk/Repin – Early Yamna pastoralists, who gave rise to the Afanasevo culture.
Neolithic transition in Northeast Asia
Genomic insight into the Neolithic transition peopling of Northeast Asia, by C. Ning
East Asia, representing a large geographic region where around one fifth of the world's population lives, has been an interesting place for population genetic studies. In contrast to Western Eurasia, East Asia has so far received little attention, despite agriculture here having evolved differently from elsewhere around the globe. To date, only very limited genomic studies from East Asia have been published, and the genetic history of East Asia is still largely unknown. In this study, we shotgun sequenced six hunter-gatherer individuals from Houtaomuga site in Jilin, Northeast China, dated from 12000 to 2300 BP, and 3 farming individuals from Banlashan site in Liaoning, Northeast China, dated around 5300 BP. We find a high level of genetic continuity within northeast Asia Amur River Basin as far back as 12000 BP, a region where populations are speaking Tungusic languages. We also find that, compared with Houtaomuga hunter-gatherers, the Neolithic farming population harbors a larger proportion of ancestry from Houtaomuga related hunter-gatherers as well as genetic ancestry from central or perhaps southern China. Our finding further suggests that farming technology was probably introduced into Northeast Asia through demic diffusion.
“Genomic insight into the peopling of Northeast China” – Chao Ning @MPI_SHH #ISBA8. Amazing genomic time transect 12000–2300BP from Houtaomuga, Jilin, PRC with #aDNA evidence for genetic continuity of #Tungusic-like groups in #Amur region even deeper than Chertovy Vorota (5700BC) pic.twitter.com/DGqibs52IE
A detail of the reported haplogroups of the Houtaomuga site:
Y-DNA in Northeast Asia thus shows haplogroup N1b1 ~5000 BC, probably representative of the Baikal region, with a change to C2b-448del lineages before the Xiongnu period, which were later expanded by Mongols.
The Keriyan, Lopnur and Dolan peoples are isolated populations with sparse numbers living in the western border desert of our country. By sequencing and typing the complete Y-chromosome of 179 individuals in these three isolated populations, all mutations and SNPs in the Y-chromosome and their corresponding haplotypes were obtained. Types and frequencies of each haplotype were analyzed to investigate genetic diversity and genetic structure in the three isolated populations. The results showed that 12 haplogroups were detected in the Keriyan with high frequencies of the J2a1b1 (25.64%), R1a1a1b2a (20.51%), R2a (17.95%) and R1a1a1b2a2 (15.38%) groups. Sixteen haplogroups were noted in the Lopnur with the following frequencies: J2a1 (43.75%), J2a2 (14.06%), R2 (9.38%) and L1c (7.81%). Forty haplogroups were found in the Dolan, noting the following frequencies: R1b1a1a1 (9.21%), R1a1a1b2a1a (7.89%), R1a1a1b2a2b (6.58%) and C3c1 (6.58%). These data show that these three isolated populations have a closer genetic relationship with the Uygur, Mongolian and Sala peoples. In particular, there are no significant differences in haplotype and frequency between the three isolated populations and Uygur (f=0.833, p=0.367). In addition, the genetic haplotypes and frequencies in the three isolated populations showed marked Eurasian mixing illustrating typical characteristics of Central Asian populations.
My knowledge of written Chinese is almost zero, so here are some excerpts with the help of Google Translate:
The source of 179 blood samples used in the study is shown in Figure 1. The Keriyan blood samples were collected from Dali Yabuyi Township, Yutian County (39 samples). The blood samples of the Lopnur people were collected from Kaerqu Township, Yuli County (64 cases); the blood samples of the Dolan people were collected from the town of Uluru, Awati County (76).
The composition and frequency of the Keriyan people’s haplogroup are closest to those of the Uighurs, and both Principal Component Analysis and Phylogenetic Tree Analysis show that their kinship is recent. We initially infer that the Keriyan are local desert indigenous people. They have a connection with the source of the Uighurs. Chen et al. studied the patriarchal and maternal genetic analysis of the Keriyan people and found that they are not descendants of the Tibetan ethnic group in the West. The Keriyan people are a mixed group of Eastern and Western Europeans, which may originate from the local Vil group. Duan Ranhui and other studies have shown that the nucleotide variability and average nucleotide differences in the Keriyan population are between the reported Eastern and Western populations. The phylogenetic tree also shows that the populations in Central Asia are between the continental lineage of the eastern population and the European lineage of the western population, and the genetic distance between the Keriyan and the Uighurs is the closest, indicating that they have a close relationship.
Regarding the origin of the Lopnur people, Purzhevski judged that it was a mixture of Mongolians and Aryans according to the physical characteristics of the Lopnur people. In 1934, the Sino-Swiss delegation discovered the famous burials of the ancient tombs in the Peacock River. After research, they were the indigenous people before the Loulan period; the researcher Yang Lan, a researcher at the Institute of Cultural Relics of the Chinese Academy of Social Sciences, said that the Lopnur people were descendants of the ancient “Landan survivors”. However, the Loulan people speaking an Indo-European language, and the Lopnur people speaking Uyghur languages contradict this; the historical materials of the Western Regions, “The Geography of the Western Regions” and “The Western Regions of the Ming Dynasty” record the Uighurs who lived in Cao Cao in the late 17th and early 18th centuries. Because of the occupation of the land by the Junggar nobles and their oppression, they fled. Some of them were forced to move to the Lop Nur area. There are many similar archaeological discoveries and historical records. We have no way to determine their accuracy, but they are at different times, and there is a great difference in what is heard in the same region. (…) The genetic characteristics of modern Lopnur people are the result of the long-term ethnic integration of Uyghurs, Mongols, and Europeans. This is also consistent with the similarity of the genetic structure of the Y chromosome of Lopnur in this study with the Uighurs and Mongolians. For example, the frequency of J haplogroup is as high as 59.37%, while J and its downstream sub-haplogroup are mainly distributed in western Europe, West Asia and Central Asia; the frequency of O, R haplogroup is close to that of Mongolians.
According to Ming History·Western Biography, the Mongolians originated from the Mobei Plateau and later ruled Asia and Eastern Europe. Mongolia was established, and large areas of southern Xinjiang and Central Asia were included. Later, due to the Mongolian king’s struggle for power, it fell into a long-term conflict. People of the land fled to avoid the war, and the uninhabited plain of the lower reaches of the Yarkant River naturally became a good place to live. People from all over the world gathered together and called themselves “Dura” and changed to “Dang Lang”. The long-term local Uyghur exchanges that entered the southern Mongolian monks and “Dura” were gradually assimilated. According to the report, locals wore Mongolian clothes, especially women who still maintained a Mongolian face. In 1976, the robes and waistbands found in the ancient time of the Daolang people in Awati County were very similar to those of the ancients. Dalang Muqam is an important part of Daolang culture. It is also a part of the Uyghur Twelve Muqam, and it retains the ancient Western culture, but it also contains a larger Mongolian culture and relics. The above historical records show that the Daolang people should appear in the Chagatai Khanate and be formed by the integration of Mongolian and Uighur ethnic groups. Through our research, we also found that the paternal haplotype of the Daolang people is contained in both Uygur and Mongolian, and the main haplogroups are the same, whereas the frequencies are different (see Table 3). The principal component analysis and the NJ analysis are also the same. It is very close to the Uyghur and the Mongolian people, which establishes new evidence for the “mixed theory” in molecular genetics.
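As a rough sanity check on the figures quoted above, the reported percentages can be converted back into approximate sample counts using the sample sizes from the translated excerpt (39 Keriyan, 64 Lopnur, 76 Dolan). The following is only a minimal sketch: the function and variable names are my own, and the counts are inferred by rounding, not reported in the paper.

```python
# Sample sizes per population, as given in the translated excerpt.
SAMPLE_SIZES = {"Keriyan": 39, "Lopnur": 64, "Dolan": 76}

# Haplogroup frequencies (%) quoted in the abstract (top four per group).
FREQUENCIES = {
    "Keriyan": {"J2a1b1": 25.64, "R1a1a1b2a": 20.51,
                "R2a": 17.95, "R1a1a1b2a2": 15.38},
    "Lopnur": {"J2a1": 43.75, "J2a2": 14.06, "R2": 9.38, "L1c": 7.81},
    "Dolan": {"R1b1a1a1": 9.21, "R1a1a1b2a1a": 7.89,
              "R1a1a1b2a2b": 6.58, "C3c1": 6.58},
}


def implied_count(percent, n):
    """Return the whole number of samples closest to `percent` % of `n`."""
    return round(percent / 100 * n)


def implied_counts(frequencies, sizes):
    """Convert every quoted percentage into an inferred sample count."""
    return {
        population: {
            haplogroup: implied_count(percent, sizes[population])
            for haplogroup, percent in haplogroups.items()
        }
        for population, haplogroups in frequencies.items()
    }


if __name__ == "__main__":
    # The three sample sizes should add up to the 179 individuals reported.
    print("Total samples:", sum(SAMPLE_SIZES.values()))
    for population, counts in implied_counts(FREQUENCIES, SAMPLE_SIZES).items():
        print(population, counts)
    # Overall haplogroup J frequency quoted for the Lopnur (59.37%).
    print("Lopnur, haplogroup J:", implied_count(59.37, SAMPLE_SIZES["Lopnur"]))
```

For instance, 25.64% of the 39 Keriyan samples corresponds to 10 individuals, and the 59.37% quoted for haplogroup J among the 64 Lopnur corresponds to 38 individuals. With samples this small, each individual shifts a frequency by 1.3 to 2.6 percentage points, which is worth keeping in mind when comparing these groups.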
If the nomenclature follows a recent ISOGG standard, it appears that:
The presence of exclusively R1a-Z93 subclades and the lack of R1b-M269 samples are compatible with the expansion of R1a-Z93 into the area with Proto-Tocharians, at the turn of the 3rd-2nd millennium BC, as suggested by the Xiaohe samples, supposedly R1a(xZ93).
Lacking proper assessment of ancient DNA from Proto-Tocharians, this potential early Y-DNA replacement is still speculative*. However, if that is the case, I wonder what the Copenhagen group will say when supporting this while at the same time rejecting the more obvious Y-DNA replacement in East Yamna / Poltavka in the mid-3rd millennium by incoming Corded Ware-related peoples. I guess the invention of an Indo-Tocharian group may be near…
*NOTE. The presence of R1b-M269 among Proto-Tocharians, as well as among Tarim Basin peoples in modern and ancient times, cannot yet be fully ruled out. The prevalence of R1a-Z93 may also be a sign of a more recent replacement by Iranian peoples, before the Mongolian and Turkic expansions that probably brought R1b(xM269).
Also, the presence of R1b(xM269) samples in East Asia strengthens the hypothesis of a back-migration of R1b-P297 subclades from Northern Europe eastwards into the Lake Baikal area during the Early Mesolithic, as found in the Botai samples and later also in Turkic populations – which are the most likely source of these subclades (and probably also of Q1a2 and N1c) in the region.