N1c-L392 associated with expanding Turkic lineages in Siberia


Second in popularity for the expansion of haplogroup N1a-L392 (ca. 4400 BC) is, apparently, the association with Turkic, and by extension with Micro-Altaic, after the Uralic link preferred in Europe; at least among certain eastern researchers.

New paper in a recently created journal, by the same main author of the group proposing that Scythians of hg. N1c were Turkic speakers: On the origins of the Sakhas’ paternal lineages: Reconciliation of population genetic / ancient DNA data, archaeological findings and historical narratives, by Tikhonov, Gurkan, Demirdov, and Beyoglu, Siberian Research (2019).

Interesting excerpts:

According to the views of a number of authoritative researchers, the Yakut ethnos was formed in the territory of Yakutia as a result of the mixing of people from the south and the autochthonous population [34].

These three major Sakha paternal lineages may have also arrived in Yakutia at different times and/ or from different places and/or with a difference in several generations instead, or perhaps Y-chromosomal STR mutations may have taken place in situ in Yakutia. Nevertheless, the immediate common ancestor(s) from the Asian Steppe of these three most prevalent Sakha Y-chromosomal STR haplotypes possibly lived during the prominence of the Turkic Khaganates, hence the near-perfect matches observed across a wide range of Eurasian geography, including as far as from Cyprus in the West to Liaoning, China in the East, then Middle Lena in the North and Afghanistan in the South (Table 3 and Figure 5). There may also be haplotypes closely-related to ‘the dominant Elley line’ among Karakalpaks, Uzbeks and Tajiks, however, limitations in the loci coverage for the available dataset (only eight Y-chromosomal STR loci) precludes further conclusions on this matter [25].

17-loci median-joining network analysis of the original/dominant Elley, Unknown and Omogoy Y-chromosomal STR haplotypes with the YHRD matches from outside Yakutia populations.

According to the results presented here, very similar Y-STR haplotypes to that of the original Elley line were found in the west: Afghanistan and northern Cyprus, and in the east: Liaoning Province, China and Ulaanbaator, Northern Mongolia. In the case of the dominant Omogoy line, very closely matching haplotypes differing by a single mutational step were found in the city of Chifen of the Jirin Province, China. The widest range of similar haplotypes was found for the Yakut haplotype Unknown: In Mongolia, China and South Korea. For instance, haplotypes differing by a single step mutation were found in Northern Mongolia (Khalk, Darhad, Uryankhai populations), Ulaanbaator (Khalk) and in the province of Jirin, China (Han population).

14-loci median-joining network analysis for the original/dominant Elley (Ell), Unknown Clan
(Vil), Omogoy (Omo), Eurasian (Eur) and Xiongnu (Xuo) Y-chromosomal STR haplotypes and that for a representative ancient DNA sample (Ch0 or DSQ04) from the Upper Xiajiadian Culture
recovered from the Inner Mongolia Autonomous Region, China.

Notably, Tat-C-bearing Y-chromosomes were also observed in ancient DNA samples from the 2700-3000 years-old Upper Xiajiadian culture in Inner Mongolia, as well as those from the Serteya II site at the Upper Dvina region in Russia and the ‘Devichyi gory’ culture of long barrow burials at the Nevel’sky district of Pskovsky region in Russia. A 14-loci Y-chromosomal STR median-joining network of the most prevalent Sakha haplotypes and a Tat-C-bearing haplotype from one of the ancient DNA samples recovered from the Upper Xiajiadian culture in Inner Mongolia (DSQ04) revealed that the contemporary Sakha haplotype ‘Xuo’ (Table 2, Haplotype ID “Xuo”) classified as that of ‘the Xiongnu clan’ in our current study, was the closest to the ancient Xiongnu haplotype (Figure 6). TMRCA estimate for this 14-loci Y-chromosomal STR network was 4357 ± 1038 years or 2341 ± 1038 BCE, which correlated well with the Upper Xiajiadian culture that was dated to the Late Bronze Age (700-1000 BCE).

Geographical location of ancient samples belonging to major clade N of the Y-chromosome.

NOTE. Also interesting from the paper seems to be the proportion of E1b1b among admixed Russian populations, in a proportion similar to R1a or I2a(xI2a1).

It is tempting to associate the prevalent presence of N1c-L392 in ancient Siberian populations with the expansion of Altaic, by simplistically linking the findings (in chronological order) near Lake Baikal (Damgaard et al. 2018), Upper Xiajiadian (Cui et al. 2013), among Khövsgöl (Jeong et al. 2018), in Huns (Damgaard et al. 2018), and in Mongolic-speaking Avars (Csáky et al. 2019).

However, its finding among Palaeo-Laplandic peoples in the Kola peninsula ca. 1500 BC (Lamnidis et al. 2018) and among Palaeo-Siberian populations near the Yana River (Sikora et al. 2018) ca. AD 1200 should be enough to accept the hypothesis of ancestral waves of expansion of the haplogroup over northern Eurasia, with acculturation and further expansions in the different regions since the Iron Age (see more on its potential expansion waves).

Also, a simple look at the TMRCA and modern distribution was enough to hypothesize long ago the lack of connection of N1c-L392 with Altaic or Uralic peoples. From Ilumäe et al. (2016):

Previous research has shown that Y chromosomes of the Turkic-speaking Yakuts (Sakha) belong overwhelmingly to hg N3 (formerly N1c1). We found that nearly all of the more than 150 genotyped Yakut N3 Y chromosomes belong to the N3a2-M2118 clade, just as in the Turkic-speaking Dolgans and the linguistically distant Tungusic-speaking Evenks and Evens living in Yakutia (Table S2). Hence, the N3a2 patrilineage is a prime example of a male population of broad central Siberian ancestry that is not intrinsic to any linguistically defined group of people. Moreover, the deepest branch of hg N3a2 is represented by a Lebanese and a Chinese sample. This finding agrees with the sequence data from Hallast et al., where one Turkish Y chromosome was also assigned to the same sub-clade. Interestingly, N3a2 was also found in one Bhutan individual who represents a separate sub-lineage in the clade. These findings show that although N3a2 reflects a recent strong founder effect primarily in central Siberia (Yakutia, Sakha), the sub-clade has a much wider distribution area with incidental occurrences in the Near East and South Asia.

Frequency-Distribution Maps of Individual Sub-clades of hg N3a2, by Ilumäe et al. (2016).

The most striking aspect of the phylogeography of hg N is the spread of the N3a3’6-CTS6967 lineages. Considering the three geographically most distant populations in our study—Chukchi, Buryats, and Lithuanians—it is remarkable to find that about half of the Y chromosome pool of each consists of hg N3 and that they share the same sub-clade N3a3’6. The fractionation of N3a3’6 into the four sub-clades that cover such an extraordinarily wide area occurred in the mid-Holocene, about 5.0 kya (95% CI = 4.4–5.7 kya). It is hard to pinpoint the precise region where the split of these lineages occurred. It could have happened somewhere in the middle of their geographic spread around the Urals or further east in West Siberia, where current regional diversity of hg N sub-lineages is the highest (Figure 1B). Yet, it is evident that the spread of the newly arisen sub-clades of N3a3’6 in opposing directions happened very quickly. Today, it unites the East Baltic, East Fennoscandia, Buryatia, Mongolia, and Chukotka-Kamchatka (Beringian) Eurasian regions, which are separated from each other by approximately 5,000–6,700 km by air. N3a3’6 has high frequencies in the patrilineal pools of populations belonging to the Altaic, Uralic, several Indo-European, and Chukotko-Kamchatkan language families. There is no generally agreed, time-resolved linguistic tree that unites these linguistic phyla. Yet, their split is almost certainly at least several millennia older than the rather recent expansion signal of the N3a3’6 sub-clade, suggesting that its spread had little to do with linguistic affinities of men carrying the N3a3’6 lineages.

Frequency-Distribution Maps of Individual Subclade N3a3 / N1a1a1a1a1a-CTS2929/VL29.

It was thus clear long ago that N1c-L392 lineages must have expanded explosively in the 5th millennium through Northern Eurasia, probably from a region to the north of Lake Baikal, and that this expansion – and succeeding ones through Northern Eurasia – may not be associated to any known language group until well into the common era.


How the genocidal Yamnaya men loved to switch cultures


After some really interesting fantasy full of arrows, it seems Kristiansen & friends are coming back to their most original idea from 2015, now in New Scientist’s recent clickbait Story of most murderous people of all time revealed in ancient DNA (2019):

Teams led by David Reich at Harvard Medical School and Eske Willerslev at the University of Copenhagen in Denmark announced, independently, that occupants of Corded Ware graves in Germany could trace about three-quarters of their genetic ancestry to the Yamnaya. It seemed that Corded Ware people weren’t simply copying the Yamnaya; to a large degree they actually were Yamnayan in origin.

If you think you have seen that movie, it’s because you have. They are at it again, Corded Ware from Yamna, and more “steppe ancestry” = “more Indo-European. It seems we haven’t learnt anything about “Steppe ancestry” since 2015. But there’s more:

Genocidal peoples who “switch cultures”

Burial practices shifted dramatically, a warrior class appeared, and there seems to have been a sharp upsurge in lethal violence. “I’ve become increasingly convinced there must have been a kind of genocide,” says Kristian Kristiansen at the University of Gothenburg, Sweden.

The collaboration revealed that the origin and initial spread of Bell Beaker culture had little to do – at least genetically – with the expansion of the Yamnaya or Corded Ware people into central Europe. “It started in It is in that region that the earliest Bell Beaker objects – including arrowheads, copper daggers and distinctive Bell-shaped pots – have been found, in archaeological sites carbon-dated to 4700 years ago. Then, Bell Beaker culture began to spread east, although the people more or less stayed put. By about 4600 years ago, it reached the most westerly Corded Ware people around where the Netherlands now lies. For reasons still unclear, the Corded Ware people fully embraced it. “They simply take on part of the Bell Beaker package and become Beaker people,” says Kristiansen.

The fact that the genetic analysis showed the Britons then all-but disappeared within a couple of generations might be significant. It suggests the capacity for violence that emerged when the Yamnaya lived on the Eurasia steppe remained even as these people moved into Europe, switched identity from Yamnaya to Corded Ware, and then switched again from Corded Ware to Bell Beaker.

Notice what Kristiansen did there? Yamnaya men “switched identities” into Corded Ware, then “switched identities” into Bell Beakers…So, the most aggresive peoples who have ever existed, exterminating all other Europeans, were actually not so violent when embracing wholly different cultures whose main connection is that they built kurgans (yes, Gimbutas lives on).

NOTE. By the way, just so we are clear, only Indo-Europeans are “genocidal”. Not like Neolithic farmers, or Palaeolithic or Mesolithic populations, or more recent Bronze Age or Iron Age peoples, who also replaced Y-DNA from many regions…


In fact, there is much stronger evidence that these Yamnaya Beakers were ruthless. By about 4500 years ago, they had pushed westwards into the Iberian Peninsula, where the Bell Beaker culture originated a few centuries earlier. Within a few generations, about 40 per cent of the DNA of people in the region could be traced back to the incoming Yamnaya Beakers, according to research by a large team including Reich that was published this month. More strikingly, the ancient DNA analysis reveals that essentially all the men have Y chromosomes characteristic of the Yamnaya, suggesting only Yamnaya men had children.

“The collision of these two populations was not a friendly one, not an equal one, but one where the males from outside were displacing local males and did so almost completely,” Reich told New Scientist Live in September. This supports Kristiansen’s view of the Yamnaya and their descendants as an almost unimaginably violent people. Indeed, he is about to publish a paper in which he argues that they were responsible for the genocide of Neolithic Europe’s men. “It’s the only way to explain that no male Neolithic lines survived,” he says.

So these unimaginably violent Yamnaya men had children exclusively with their Y chromosomes…but not Dutch Single Grave peoples. These great great steppe-like northerners switched culture, cephalic index…and Y-chromosome from R1a (and others) to R1b-L151 to expand Italo-Celtic From The West™.

It’s hilarious how (exactly like their latest funny episode of PIE from south of the Caucasus) this new visionary idea copied by Copenhagen from amateur friends (or was it the other way around?) had been already rejected before this article came out, in Olalde et al. (2019), and that “Corded Ware=Indo-European” fans have become a parody of themselves.

What’s not to love about 2019 with all this back-and-forth hopping between old and new pet theories?

NOTE. I would complain (again) that the obsessive idea of the Danes is that Denmark CWC is (surprise!) the Pre-Germanic community, so it has nothing to do with “steppe ancestry = Indo-European” (or even with “Corded Ware = Indo-European”, for that matter), but then again you have Koch still arguing for Celtic from the West, Kortlandt still arguing for Balto-Slavic from the east, and – no doubt worst of all – “R1a=IE / R1b=Vasconic / N1c=Uralic” ethnonationalists arguing for whatever is necessary right now, in spite of genetic research.

So prepare for the next episode in the nativist and haplogroup fetishist comedy, now with western and eastern Europeans hand in hand: Samara -> Khvalynsk -> Yamnaya -> Bell Beaker spoke Vasconic-Tyrsenian, because R1b. Wait for it…

Vanguard Yamnaya groups

On a serious note, interesting comment by Heyd in the article:

A striking example of this distinction is a discovery made near the town of Valencina de la Concepción in southern Spain. Archaeologists working there found a Yamnaya-like kurgan, below which was the body of a man buried with a dagger and Yamnaya-like sandals, and decorated with red pigment just as Yamnaya dead were. But the burial is 4875 years old and genetic information suggests Yamnaya-related people didn’t reach that far west until perhaps 4500 years ago. “Genetically, I’m pretty sure this burial has nothing to do with the Yamnaya or the Corded Ware,” says Heyd. “But culturally – identity-wise – there is an aspect that can be clearly linked with them.” It would appear that the ideology, lifestyle and death rituals of the Yamnaya could sometimes run far ahead of the migrants.

NOTE. I have been trying to find which kurgan is this, reviewing this text on the archaeological site, but didn’t find anything beyond occasional ochre and votive sandals, which are usual. Does some reader know which one is it?

Yamna expansion and succeeding East Bell Beaker expansion, without color on Bell Beaker territories. Notice vanguard Yamna groups in blue where East Bell Beakers later emerge. See original image with Bell Beaker territories.

Notice how, if you add all those vanguard Yamna findings of Central and Western Europe, including this one from southern Spain, you begin to get a good idea of the territories occupied by East Bell Beakers expanding later. More or less like vanguard Abashevo and Sintashta finds in the Zeravshan valley heralded the steppe-related Srubna-Andronovo expansions in Turan…

It doesn’t seem like Proto-Beaker and Yamna just “crossed paths” at some precise time around the Lower Danube, and Yamna men “switched cultures”. It seems that many Yamna vanguard groups, probably still in long-distance contact with Yamna settlers from the Carpathian Basin, were already settled in different European regions in the first half of the 3rd millennium BC, before the explosive expansion of East Bell Beakers ca. 2500 BC. As Heyd says, there are potentially many Yamna settlements along the Middle and Lower Danube and tributaries not yet found, connecting the Carpathian Basin to Western and Northern Europe.

These vanguard groups would have more easily transformed their weakened eastern Yamna connections with the fashionable Proto-Beaker package expanding from the west (and surrounding all of these loosely connected settlements), just like the Yamna materials from Seville probably represent a close cultural contact of Chalcolithic Iberia with a Yamna settlement (the closest known site with Yamna traits is near Alsace, where high Yamna ancestry is probably going to be found in a Bell Beaker R1b-L151 individual).

This does not mean that there wasn’t a secondary full-scale migration from the Carpathian Basin and nearby settlements, just like Corded Ware shows a secondary (A-horizon?) migration to the east with R1a-Z645. It just means that there was a complex picture of contacts between Yamna and European Chalcolithic groups before the expansion of Bell Beakers. Doesn’t seem genocidal enough for a popular movie, tho.


Updates to ASoSaH: new maps, updated PCA, and added newest research papers


The title says it all. I have used some free time to update the series A Song of Sheep and Horses:

I basically added information from the latest papers published, which (luckily enough for me) haven’t been too many, and I have added images to illustrate certain sections.

I have updated the PCAs by including North Caucasus samples from Wang et al. (2018), whose position I could only infer for older versions from previously published PCA graphs.

PCA of ancient and modern Eurasian samples. Early Eneolithic admixture events in the steppe drawn.

I have also added to the supplementary materials the “Tip of the Iceberg” R1b tree by Mike Walsh from the FTDNA R1b group, with permission, because some relevant genetic sections are centered on the evolution of R1b lineages, and the reader can get easily lost with so many subclades.

I have also updated maps, including some of the Y-DNA ones, and managed to finish two new maps I was working on, and I added them to the supplementary materials and to the menu above:

One on Yamna kurgans in Hungary, coupled with contemporaneous sites of Baden-Boleráz or Kostolac cultures:

Map of attested Yamnaya pit-grave burials in the Hungarian plains; superimposed in shades of blue are common areas covered by floods before the extensive controls imposed in the 19th century; in orange, cumulative thickness of sand, unfavourable loamy sand layer. Marked are settlements/findings of Boleráz (ca. 3500 BC on), Baden (until ca. 2800 BC), Kostolac (precise dates unknown), and Yamna kurgans (from ca. 3100/3000 BC on).

Another one on Steppe ancestry expansion, with a tentative distribution of “steppe ancestry” divided into that of Sredni Stog/Corded Ware origin vs. that of Repin/Yamna origin, a difference that has been known for quite some time already.

It is tentative because there hasn’t been any professional study or amateur attempt to date to differentiate both “steppe ancestries” in Yamna, and especially in Bell Beakers. So much for the call of professional geneticists since 2018 (see here and here) and archaeologists since 2017 (see e.g. here and here) to distinguish fine-scale population structure to be able to follow neighbouring populations which expanded with different archaeological (and thus ethnolinguistic) groups.

Tentative map of fine-scale population structure during steppe-related expansions (ca. 3500–2000 BC), including Repin–Yamna–Bell Beaker/Balkans and Sredni Stog–Corded Ware groups. Data based on published samples and pairwise comparisons tested to date. Notice that the potential admixture of expanding Repin/Early Yamna settlers in the North Pontic area with the late Sredni Stog population (and thus Sredni Stog-related ancestry in Yamna) has been omitted for simplicity purposes, assuming thus a homogeneous Yamna vs. Corded Ware ancestry.

I think both maps are especially important today, given the current Nordicist reactionary trends arguing (yet again) for an origin of Indo-Europeans in The North™, now based on the Fearsome Tisza River hypothesis, on cephalic index values, and a few pairwise comparisons – i.e. an absolutely no-nonsense approach to the Indo-European question (LOL). At least I get to relax and sit this year out just observing how other people bury themselves and their beloved “steppe ancestry=IE” under so many new pet theories…

NOTE. Not that there is anything wrong with a northern origin of North-West Indo-European from a linguistic point of view, as I commented recently – after all, a Corded Ware origin would roughly fit the linguistic guesstimates, unlike the proposed ancestral origins in Anatolia or India. The problem is that, like many other fringe theories, it is today just based on tradition, or (even worse) ethnic, political, or personal desires, and it doesn’t make sense when all findings from disciplines involved in the Indo-European and Uralic questions are combined.

Simple ancestry percentages in modern populations. Recent image by Iain Mathieson 2019 (min. 5.57). A simplistic “Steppe ancestry” defining Indo-European speakers…? Sure.

Within 20 or 30 years, when genetic genealogists (or amateur geneticists, or however you want to call them) ask why we had the opportunity since 2015 to sample as many Hungarian Yamnaya individuals as possible and we didn’t, when it is clear that the number of unscathed kurgans is diminishing every year (from an estimated 4,000 in the 20th century, of the original tens of thousands, to less than 1,500 today) the answer will not be “because this or that archaeologist or linguist was a dilettante or a charlatan‘, as they usually describe academics they dislike.

It will be precisely because the very same genetic genealogists – supposedly interested today in the origin of R1b-L151 and/or genetic marker associated with North-West Indo-Europeans – are obsessed with finding them anywhere else but for Hungary, and prefer to use their money and time to play with a few statistical tools within a biased framework of flawed assumptions and study designs, obtaining absurd results and accepting far-fetched interpretations of them, to be told exactly what they want to hear: be it the Franco-Cantabrian homeland, the Dutch or Moravian Beaker from CWC homeland, the Maykop homeland, or the Moon homeland.

Poetic justice this heritage destruction, whose indirect causes will remain written in Internet archives for everyone to see, as a good lesson for future generations.

ASoSaH Reread (I): Y-DNA haplogroups among Indo-Europeans (apart from R1b-L23)


Given my reduced free time in these months, I have decided to keep updating the text on Indo-European and Uralic migrations and/or this blog, simultaneously or alternatively, to make the most out of the time I can dedicate to this. I will add the different ‘A Song of Sheep and Horses (ASoSaH) reread’ posts to the original post announcing the books. I would be especially interested in comments and corrections to the book chapters rather than the posts, but any comments are welcome (including in the forum, where comments are more likely to stick).

This is mainly a reread of iv.2. Indo-Anatolians and vi.1. Disintegrating Indo-Europeans.

Indo-Anatolians and Late Indo-Europeans

I have often written about R1b-L23 as the majority haplogroup among Late Proto-Indo-Europeans (see my predictions for 2018 and my summary of 2018), but always expected other haplogroups to pop up somewhere along the way, in Khvalynsk, in Repin, in Yamna, and in Bell Beakers (see e.g. the post on common fallacies of R1a/IE-fans).

Luckily enough – for those of us who want precise answers to our previous infinite models of Indo-European language expansions (viz. GAC-associated expansion, IE-speaking Old Europe, Anatolian homeland, Iran homeland, Maykop as Proto-Anatolian, Palaeolithic Continuity Theory, Celtic in the Atlantic façade, etc.) – the situation has been more clear-cut than expected: it turns out that, especially during population expansions, acute Y-chromosome bottlenecks were very common in the past, at least until the Iron Age.

Khvalynsk and Repin-Yamna expansions were no different, and that seems quite natural in hindsight, given the strong familial ties and aversion to foreigners proper of the Late Proto-Indo-European society and culture – probably not really that different from other contemporary societies, like the neighbouring Late Proto-Uralic or Trypillian ones.

Y-DNA samples from Khvalynsk and neighbouring cultures. See full version here.

Y-DNA haplogroups

During the expansion of early Khvalynsk, the most likely Indo-Anatolian culture, the society of the Don-Volga area was probably made up of different lineages including R1b-V1636, R1b-M269, R1a-YP1272, Q1a-M25, and I2a-L699 (and possibly some R1b-V88?), a variability possibly greater than that of the contemporary north Pontic area, probably a sign of this region being a sink of different east and west migrations from steppe and forest areas.

During its expansion, the Khvalynsk society saw its haplogroup variability reduced, as evidenced by the succeeding expansive Repin culture:

Afanasevo, representing Pre-Tocharian (the earliest Late PIE dialect to branch off), expanded with R1b-L23 – especially R1b-Z2103 – lineages, while early Yamna expanded with R1b-L23 and I2a-L699 lineages, which suggests that these are the main haplogroups that survived the Y-DNA bottleneck undergone during the Khvalynsk expansion, and especially later during the late Repin expansion. Nevertheless, other old haplogroups might still pop up during the Repin and early Yamna period, such as the R1b-V1636 sample from Yamna in the Northern Caucasus.

It is still unclear if R1b-L23 sister clade R1b-PF7562 (formed ca. 4400 BC, TMRCA ca. 3400 BC), prevalent among modern Albanians, expanded with Yamna migrants, or if it was part of an earlier expansion of R1b-M269 into the Balkans, and represent thus Indo-Anatolian speakers who later hitchhiked the expansion of the Late PIE language from the north or west Pontic area. The early TMRCA seems to suggest an association with Repin (and therefore Yamna), rather than later movements in the Balkans.

Y-DNA samples from Yamnaya and neighbouring cultures. See full version here.

‘Yamnaya’ or ‘steppe’ ancestry?

After the early years when population genetics relied mainly on modern Y-DNA haplogroups, geneticists and amateurs have been recently playing around with testing “ancestry percentages”, based on newly developed free statistical tools, which offer obviously just one among many types of data to achieve a proper interpretation of the past.

Today we have quite a lot Y-DNA haplogroups reported for ancient samples of more recent prehistoric periods, and they seem to offer (at least since the 2015 papers, but more evidently since the 2018 papers on Bell Beakers and Europeans, Corded Ware, or Fennoscandia among others) the most straightforward interpretation of all results published in population genomics research.

NOTE. The finding of a specific type of ancestry in one isolated 40,000-year-old sample from Tianyuan can offer very interesting information on potential population movements to the region. However, the identification of ethnolinguistic communities and their migrations among neighbouring groups in Neolithic or Bronze Age groups is evidently not that simple.

Yamnaya (Indo-European peoples) and their evolution in the steppes, together with North Pontic (eventually Uralic) peoples.Notice how little Indo-European ancestry changes from Khvalynsk (Indo-Anatolian) to Yamna Hungary (North-West Indo-Europeans) Image modified from Wang et al. (2018). See more on the evolution of “steppe ancestry”.

It is becoming more and more clear with each paper that the true “Yamnaya ancestry” – not the originally described one – was in fact associated with Indo-Europeans (see more on the very Yamnaya-like Yamna Hungary and early East Bell Beaker R1b samples, all of quite similar ancestry and PCA cluster before their further admixture with EEF- and CWC-like groups).

The so-called “steppe ancestry”, on the other hand, reflects the contribution of a Northern Caucasus-related ancestry to expanding Khvalynsk settlers, who spread through the steppes more than a thousand years before the expansion of Late Proto-Indo-Europeans with late Repin, and can thus be found among different groups related to the Pontic-Caspian steppes (see more on the emergence and evolution of “steppe ancestry”).

In fact, after the Yamna/Indo-European and Corded Ware/Uralic expansions, it is more likely to find “steppe ancestry” to the north and east in territories traditionally associated with Uralic languages, whereas to the south and west – i.e. in territories traditionally associated with Indo-European languages – it is more likely to find “EEF ancestry” with diminished “steppe ancestry”, among peoples patrilineally descended from Yamna settlers.

Y-DNA haplogroups, the only uniparental markers (see exceptions in mtDNA inheritance) – unlike ancestry percentages based on the comparison of a few samples and flawed study designs – do not admix, do not change, and therefore they do not lend themselves to infinite pet theories (see e.g. what David Reich has to say about R1b-P312 in Iberia directly derived from Yamna migrants in spite of their predominant EEF ancestry): their cultural continuity can only be challenged with carefully threaded linguistic, archaeological, and genetic data.


Mongolian tribes cluster with East Asians, closely related to the Japanese


New paper behind paywall Whole-genome sequencing of 175 Mongolians uncovers population-specific genetic architecture and gene flow throughout North and East Asia, by Bai et al Nature Genetics (2018).

Interesting excerpts (emphasis mine):

Genome sequencing, variant calling, and construction of the Mongolian reference panel. We collected peripheral blood with informed consent from 175 Mongolian individuals representing six distinct tribes/regions in northern China and Mongolia, including the Abaga, Khalkha, Oirat, Buryat, Sonid, and Horchin tribes.

Population genetic structure. a, PCA of Mongolian individuals and 1000G samples. Mongolians fill a large, less characterized gap between Admixed/Native Americans and other East Asians in the 1000G project. b, PCA of Mongolians and East Asians of 1000G. The abbreviations of EAS populations were used from reference 11.

The fixation index (FST) was used to estimate pairwise genetic differentiation among our Mongolian samples and 26 modern human populations selected from 1000G (…) the Mongolian tribes cluster with East Asian groups. The Mongolian populations show the smallest differentiation from the CHB, and FST values increase relative to the magnitude of geographical separation. The Buryat are the most differentiated tribe compared with other East Asians (1.82–2.97%), while the Horchin are the least (0.25–1.35%). All tribes are closer to the Japanese (JPT) than the CHS with the exception of the Horchin. Among the tribes, the Abaga, Khalkha, Oirat, and Sonid show the least differentiation from one another (FST < 0.15%)

A PCA places the Mongolians in close genetic proximity to a group of North Asian Siberians, including Altaians, Tuvinians, Evenki, and Yakut, indicating that the Mongolian whole-genome variation panel could be a better proxy for these groups than any populations currently in the 1000G panel

The most common Y-chromosome haplogroups are from the C3 sublineage (41.67%), including C3c (29.17%) and C3b (12.50%), followed by haplogroup O (23.61%), and haplogroup N (18.06%) (…) While haplogroups C and O are primarily restricted to Asia, haplogroup N is present at high frequency in Finns (60.5%), at low frequency in non-Mongolian East Asians (< 1%), and virtually absent throughout the remainder of European and African samples in 1000G

Comparison with Finns

Distribution of D-values from D-test under the model of [EAS, Mongolians, X, chimpanzee], where X represents the test population and chimpanzee serves as an outgroup. The positive D-value (Z > 3) indicates that the test population (X) is closer to Mongolians than to EAS. The whiskers correspond to range, and the dots to individual data points, box limits are the upper and lower quartiles. The n in each boxplot is 30. All abbreviations of populations in the figure were used from reference 11.

Of the populations included in our study, Mongolians share the second-highest level of IBD with the Finnish people (FIN), behind only Northern Han Chinese (CHB). While Mongolians share more IBD with Europeans (EUR) as a whole compared with other non-EAS people (Fig. 4b), removal of Finns from the Europeans drops the level of sharing to as low as that with South Asians (SAS) or Admixed American (AMR).

There is considerable geographic separation between modern-day Mongolians and Europe. The positive D-statistic that reveal gene flow between Mongolians and Europeans (Fig. 4c), and the high degree of IBD sharing with Finnish people in particular suggest that complex admixture may have occurred throughout northeastern Europe and Siberia. To see whether Mongolians represent the ethnic group in East Asia with the highest level of gene flow with Finnish people, we calculated a D-statistic for each set of populations [Mongolians, X, FIN, Yoruba (YRI)], where X represents a population from Siberia or Northern Canada. Most of the populations reveal an imbalance in allele frequencies that suggests gene flow with Finns (D >0, Z >3), but the greatest imbalance is observed between Siberians/Northern Canadians and Finnish, rather than between Mongolians and Finns. This pattern indicates that northern Asian populations interacted across large geographic ranges.

6 migration events, from the supplementary materials.

I guess the 1000G does not have northern Eurasian groups, because the IBD map and values would be lightening up with Palaeo-Siberian peoples


“Steppe ancestry” step by step: Khvalynsk, Sredni Stog, Repin, Yamna, Corded Ware


Wang et al. (2018) is obviously a game changer in many aspects. I have already written about the upcoming Yamna Hungary samples, about the new Steppe_Eneolithic and Caucasus Eneolithic keystones, and about the upcoming Greece Neolithic samples with steppe ancestry.

An interesting aspect of the paper, hidden among so many relevant details, is a clearer picture of how the so-called Yamnaya or steppe ancestry evolved from Samara hunter-gatherers to Yamna nomadic pastoralists, and how this ancestry appeared among Proto-Corded Ware populations.

Image modified from Wang et al. (2018). Marked are in orange: equivalent Steppe_Maykop ADMIXTURE; in red, approximate limit of Anatolia_Neolithic ancestry found in Yamna populations; in blue, Corded Ware-related groups. “Modelling results for the Steppe and Caucasus cluster. Admixture proportions based on (temporally and geographically) distal and proximal models, showing additional Anatolian farmer-related ancestry in Steppe groups as well as additional gene flow from the south in some of the Steppe groups as well as the Caucasus groups.”

Please note: arrows of “ancestry movement” in the following PCAs do not necessarily represent physical population movements, or even ethnolinguistic change. To avoid misinterpretations, I have depicted arrows with Y-DNA haplogroup migrations to represent the most likely true ethnolinguistic movements. Admixture graphics shown are from Wang et al. (2018), and also (the K12) from Mathieson et al. (2018).

1. Samara to Early Khvalynsk

The so-called steppe ancestry was born during the Khvalynsk expansion through the steppes, probably through exogamy of expanding elite clans (eventually all R1b-M269 lineages) originally of Samara_HG ancestry. The nearest group to the ANE-like ghost population with which Samara hunter-gatherers admixed is represented by the Steppe_Eneolithic / Steppe_Maykop cluster (from the Northern Caucasus Piedmont).

Steppe_Eneolithic samples, of R1b1 lineages, are probably expanded Khvalynsk peoples, showing thus a proximate ancestry of an Early Eneolithic ghost population of the Northern Caucasus. Steppe_Maykop samples represent a later replacement of this Steppe_Eneolithic population – and/or a similar population with further contribution of ANE-like ancestry – in the area some 1,000 years later.


This is what Steppe_Maykop looks like, different from Steppe_Eneolithic:


NOTE. This admixture shows how different Steppe_Maykop is from Steppe_Eneolithic, but in the different supervised ADMIXTURE graphics below Maykop_Eneolithic is roughly equivalent to Eneolithic_Steppe (see orange arrow in ADMIXTURE graphic above). This is useful for a simplified analysis, but actual differences between Khvalynsk, Sredni Stog, Afanasevo, Yamna and Corded Ware are probably underestimated in the analyses below, and will become clearer in the future when more ancestral hunter-gatherer populations are added to the analysis.

2. Early Khvalynsk expansion

We have direct data of Khvalynsk-Novodanilovka-like populations thanks to Khvalynsk and Steppe_Eneolithic samples (although I’ve used the latter above to represent the ghost Caucasus population with which Samara_HG admixed).

We also have indirect data. First, there is the PCA with outliers:


Second, we have data from north Pontic Ukraine_Eneolithic samples (see next section).

Third, there is the continuity of late Repin / Afanasevo with Steppe_Eneolithic (see below).

3. Proto-Corded Ware expansion

It is unclear if R1a-M459 subclades were continuously in the steppe and resurged after the Khvalynsk expansion, or (the most likely option) they came from the forested region of the Upper Dnieper area, possibly from previous expansions there with hunter-gatherer pottery.

Supporting the latter is the millennia-long continuity of R1b-V88 and I2a2 subclades in the north Pontic Mesolithic, Neolithic, and Early Eneolithic Sredni Stog culture, until ca. 4500 BC (and even later, during the second half).

Only at the end of the Early Eneolithic with the disappearance of Novodanilovka (and beginning of the steppe ‘hiatus’ of Rassamakin) is R1a to be found in Ukraine again (after disappearing from the record some 2,000 years earlier), related to complex population movements in the north Pontic area.

NOTE. In the PCA, a tentative position of Novodanilovka closer to Anatolia_Neolithic / Dzudzuana ancestry is selected, based on the apparent cline formed by Ukraine_Eneolithic samples, and on the position and ancestry of Sredni Stog, Yamna, and Corded Ware later. A good alternative would be to place Novodanilovka still closer to the Balkan outliers (i.e. Suvorovo), and a source closer to EHG as the ancestry driven by the migration of R1a-M417.


The first sample with steppe ancestry appears only after 4250 BC in the forest-steppe, centuries after the samples with steppe ancestry from the Northern Caucasus and the Balkans, which points to exogamy of expanding R1a-M417 lineages with the remnants of the Novodanilovka population.


4. Repin / Early Yamna expansion

We don’t have direct data on early Repin settlers. But we do have a very close representative: Afanasevo, a population we know comes directly from the Repin/late Khvalynsk expansion ca. 3500/3300 BC (just before the emergence of Early Yamna), and which shows fully Steppe_Eneolithic-like ancestry.


Compared to this eastern Repin expansion that gave Afanasevo, the late Repin expansion to the west ca. 3300 BC that gave rise to the Yamna culture was one of colonization, evidenced by the admixture with north Pontic (Sredni Stog-like) populations, no doubt through exogamy:


This admixture is also found (in lesser proportion) in east Yamna groups, which supports the high mobility and exogamy practices among western and eastern Yamna clans, not only with locals:


5. Corded Ware

Corded Ware represents a quite homogeneous expansion of a late Sredni Stog population, compatible with the traditional location of Proto-Corded Ware peoples in the steppe-forest/forest zone of the Dnieper-Dniester region.


We don’t have a comparison with Ukraine_Eneolithic or Corded Ware samples in Wang et al. (2018), but we do have proximate sources for Abashevo, when compared to the Poltavka population (with which it admixed in the Volga-Ural steppes): Sintashta, Potapovka, Srubna (with further Abashevo contribution), and Andronovo:


The two CWC outliers from the Baltic show what I thought was an admixture with Yamna. However, given the previous mixture of Eneolithic_Steppe in north Pontic steppe-forest populations, this elevated “steppe ancestry” found in Baltic_LN (similar to west Yamna) seems rather an admixture of Baltic sub-Neolithic peoples with a north Pontic Eneolithic_Steppe-like population. Late Repin settlers also admixed with a similar population during its colonization of the north Pontic area, hence the Baltic_LN – west Yamna similarities.

NOTE. A direct admixture with west Yamna populations through exogamy by the ancestors of this Baltic population cannot be ruled out yet (without direct access to more samples), though, because of the contacts of Corded Ware with west Yamna settlers in the forest-steppe regions.


A similar case is found in the Yamna outlier from Mednikarovo south of the Danube. It would be absurd to think that Yamna from the Balkans comes from Corded Ware (or vice versa), just because the former is closer in the PCA to the latter than other Yamna samples. The same error is also found e.g. in the Corded Ware → Bell Beaker theory, because of their proximity in the PCA and their shared “steppe ancestry”. All those theories have been proven already wrong.

NOTE. A similar fallacy is found in potential Sintashta→Mycenaean connections, where we should distinguish statistically that result from an East/West Yamna + Balkans_BA admixture. In fact, genetic links of Mycenaeans with west Yamna settlers prove this (there are some related analyses in Anthrogenica, but the site is down at this moment). To try to relate these two populations (separated more than 1,000 years before Sintashta) is like comparing ancient populations to modern ones, without the intermediate samples to trace the real anthropological trail of what is found…Pure numbers and wishful thinking.


Yamna and Corded Ware show a similar “steppe ancestry” due to convergence. I have said so many times (see e.g. here). This was clear long ago, just by looking at the Y-chromosome bottlenecks that differentiate them – and Tomenable noticed this difference in ADMIXTURE from the supplementary materials in Mathieson et al. (2017), well before Wang et al. (2018).

This different stock stems from (1) completely different ancestral populations + (2) different, long-lasting Y-chromosome bottlenecks. Their similarities come from the two neighbouring cultures admixing with similar populations.

If all this does not mean anything, and each lab was going to support some pre-selected archaeological theories from the 1960s or the 1980s, coupled with outdated linguistic models no matter what – Anthony’s model + Ringe’s glottochronological tree of the early 2000s in the Reich Lab; and worse, Kristiansen’s CWC-IE + Germano-Slavonic models of the 1940s in the Copenhagen group – , I have to repeat my question again:

What’s (so much published) ancient DNA useful for, exactly?


Waves of Palaeolithic ANE ancestry driven by P subclades; new CWC-like Finnish Iron Age

New preprint The population history of northeastern Siberia since the Pleistocene, by Sikora et al. bioRxiv (2018).

Interesting excerpts (emphasis mine; most internal references removed):

ANE ancestry

The earliest, most secure archaeological evidence of human occupation of the region comes from the artefact-rich, high-latitude (~70° N) Yana RHS site dated to ~31.6 kya (…)

The Yana RHS human remains represent the earliest direct evidence of human presence in northeastern Siberia, a population we refer to as “Ancient North Siberians” (ANS). Both Yana RHS individuals were unrelated males, and belong to mitochondrial haplogroup U, predominant among ancient West Eurasian hunter-gatherers, and to Y chromosome haplogroup P1, ancestral to haplogroups Q and R, which are widespread among present-day Eurasians and Native Americans.

Symmetry tests using f4 statistics reject tree-like clade relationships with both Early West Eurasians (EWE; Sunghir) and Early East Asians (EEA; Tianyuan); however, Yana is genetically closer to EWE, despite its geographic location in northeastern Siberia

Using admixture graphs (qpGraph) and outgroup-based estimation of mixture proportions (qpAdm), we find that Yana can be modelled as EWE with ~25% contribution from EEA

Among all ancient individuals, Yana shares the most genetic drift with Mal’ta, and f4 statistics show that Mal’ta shares more alleles with Yana than with EWE (e.g. f4(Mbuti,Mal’ta;Sunghir,Yana) = 0.0019, Z = 3.99). Mal’ta and Yana also exhibit a similar pattern of genetic affinities to both EWE and EEA, consistent with previous studies.The ANE lineage can thus be considered a descendant of the ANS lineage, demonstrating that by 31.6 kya early representatives of this lineage were widespread across northern Eurasia, including far northeastern Siberia.


Ancient Palaeosiberian

(…) the 9.8 kya Kolyma1 individual, representing a group we term “Ancient Paleosiberians” (AP). Our results indicate that AP are derived from a first major genetic shift observed in the region. Principal component analysis (PCA), outgroup f3-statistics and mtDNA and Y chromosome haplogroups (G1b and Q1a1a, respectively) demonstrate a close affinity between AP and present-day Koryaks, Itelmen and Chukchis, as well as with Native Americans.

For both AP and Native Americans, ANS ancestry appears more closely related to Mal’ta than Yana, therefore rejecting a direct contribution of Yana to later AP or Native American groups.

Lake Baikal Neolithic – Bronze Age

(…) the newly reported genomes from Ust’Belaya and recently published neighbouring Neolithic and Bronze Age sites show a succession of three distinct genetic ancestries over a ~6 ky time span. The earliest individuals show predominantly East Asian ancestry, closely related to the ancient individuals from DGC. In the early Bronze Age (BA), we observe a resurgence of AP ancestry (up to ~50% ancestry fraction), as well as influence of West Eurasian Steppe ANE ancestry represented by the early BA individuals from Afanasievo in the Altai region (~10%) This is consistent with previous reports of gene flow from an unknown ANE-related source into Lake Baikal hunter-gatherers.

Our results suggest a southward expansion of AP as a possible source, which is also consistent with the replacement of Y chromosome lineages observed at Lake Baikal, from predominantly haplogroup N in the Neolithic to haplogroup Q in the BA. Finally, the most recent individual from Ust’Belaya, dated to ~600 years ago, falls along the Neosiberian cline, similar to the ~760 year-old ‘Young Yana’ individual from northeastern Siberia, demonstrating the widespread distribution of Neosiberian ancestry in the most recent epoch.

Genetic structure of ancient northeast Siberians. PCA of ancient individuals projected onto a set of modern Eurasian and American individuals. Abbreviations in group labels: UP – Upper Palaeolithic; LP – Late Palaeolithic; M – Mesolithic; EN – Early Neolithic; MN – Middle Neolithic; LN – Late Neolithic; EBA – Early Bronze Age; LBA – Late Bronze Age; IA – Iron Age; PE – Paleoeskimo; MED – Medieval

Finland Saami

At the western edge of northern Eurasia, genetic and strontium isotope data from ancient individuals at the Levänluhta site documents the presence of Saami ancestry in Southern Finland in the Late Holocene 1.5 kya. This ancestry component is currently limited to the northern fringes of the region, mirroring the pattern observed for AP ancestry in northeastern Siberia. However, while the ancient Saami individuals harbour East Asian ancestry, we find that this is better modelled by DGC rather than AP, suggesting that AP influence was likely restricted to the eastern side of the Urals. Comparison of ancient Finns and Saami with their present-day counterparts reveals additional gene flow over the past 1.6 kya, with evidence for West Eurasian admixture into modern Saami. The ancient Finn from Levänluhta shows lower Siberian ancestry than modern Finns .

EDIT (27 OCT 2018): By comparing the three, I see these are samples published already (at least two) in Lamnidis et al. (2018), but here with added (1) specific radiocarbon dates, (2) comparison with Neosiberian populations and (3) strontium isotope analyses.

Finnish_IA (ca. 350 AD) is probably a Saami-speaking individual, just like the Saami_IA with newly reported radiocarbon dates from Levänluhta ca. 400-600 AD (since Fennic peoples were then likely around the Gulf of Finland).

The conflicting strontium isotope data on marine dietary resources on certain samples from the supplementary material hint at possible external origin of the diet of some of the previously reported (and possibly one newly reported) Saami Iron Age individuals, from some 25-30 km. to the northwest through the river up to hundreds of km. to the southwest of Levänluhta (i.e. the whole coast of the Bothnian Sea). It is unclear why they would prefer an origin of the dietary source in southern Baltic regions instead of some km. to the west, though, unless that’s what they want to propose based on the sample’s admixture…

The coast of the Bothnian Sea (=the northern part of the Baltic Sea, between Sweden and Finland) lay only 25-30 km to the northwest, and accessible to the Iron Age people of the Levänluhta region via the Kyrönjoki river. (…) For individual JA2065/DA236, the low 87Sr/86Sr value (0.71078) would imply an exceptionally heavy reliance on Baltic Sea resources. The δ13C and δ15N values of the individual are near comparable (especially considering within-Baltic latitudinal gradients in δ13C; Torniainen et al. 2017) to the δ13C and δ15N values of a Middle Neolithic population on the Baltic island of Gotland (Eriksson, 2004) interpreted to have subsisted primarily on seals.

These new data on the samples give us some more information than what we already had, because the early date of Finnish_IA implies that there was few East Asian admixture (if any at all) in west Finland during the Roman Iron Age, which pushes still farther forward in time the expected appearance of Siberian ancestry among Saamic (first) and Fennic populations (later). It is unclear whether this East Asian ancestry found in Finnish_IA is actually related to DGC, or it is rather related to the ENA-like ancestry found already in Baltic hunter-gatherers (i.e. in some EHG samples from Karelia), for which Baikal_EN is a good proxy in Lazaridis et al. (2018).

Since Bronze Age and Iron Age samples from Estonia show more Baltic_HG drift compared to Corded Ware samples, it is likely that this supposedly DGC-related ancestry (here considered part of the ‘Siberian ancestry’) is actually an EHG-related ENA component of north-east European hunter-gatherers, with whom Finno-Saamic peoples admixed during the expansion of the Corded Ware culture into Finland.

The paper finds thus increased (probably the actual) Siberian ancestry in modern Finns compared to this Iron Age Saami individual. Coupled with the later Saami Iron Age samples, from between one to three centuries later – showing the start of Siberian ancestry influx – , we can begin to establish when the expansion of Siberian ancestry happened in central Finland, and thus quite likely when the Saami began to expand to the north and east and admix with Palaeo-Laplandic peoples.

Admixture modelling using qpAdm. Maps showing locations and ancestry proportions of ancient (left) and modern (right) groups.

One sample of haplogroup N1a1a1a1a4a1-M1982, Yana_MED, is found in the Arctic region (north-eastern Yakutia) ca. 1100 AD. Since it is derived from N1a1a1a1a-L392, it might be a surprise for some to find it in a clearly non-Uralic speaking environment at the same time other subclades of this haplogroup were admixing in the west with well-established Finno-Saamic, Volga-Finnic, Ugric, and Samoyedic populations…

On the growing doubts that these data – contradicting the CWC=IE theory – are creating among geneticists (from the supplementary materials):

NOTE. This paper comes from the Copenhagen group, also signed by Kristiansen, one of today’s strongest supporters of this connection

The Proto-Saami language evolved in southern Finland and Karelia in the Early Iron Age, an area now host to Finnish and the closely related Karelian, but with Saami toponyms showing that the latter two languages are intrusive here (Saarikivi 2004). Saami-speaking populations are thought to have retreated to Lapland during the Middle Iron Age (300–800 AD), where it diverged into the modern Saami dialects. Genetically, the northward retreat of the Saami language correlates with the documented decrease of Saami ancestry in Southern Finland between the Iron Age and the modern period (cf. Lamnidis et al. 2018).

On the way to Lapland, the Saami replaced at least two linguistically obscure groups. This can be inferred from 1) an influx of non-Uralic loanwords into Proto-Saami in the Finnish Lakeland area, and 2) an influx of non-Uralic, non-Germanic words into Saami dialects in Lapland (Aikio 2012). Both of these borrowing events imply contact with non-Saami-speaking groups, e.g. non-Uralic-speaking hunter-gatherers that may have left a genetic and linguistic footprint on modern Saami populations.

The linguistic prehistory of Finland thus does not allow for a straightforward interpretation of the genetic data. The detection of East Asian ancestry in the genetically Saami individual is indicative of a population movement from the east (cf. Lamnidis et al. 2018, Rootsi et al. 2007), one that given the affinities with the ~7.6 ky old individuals from the Devil’s Gate Cave may have been a western extension of the Neosiberian turnover. However, it remains unclear whether this gene flow should be associated with the arrival of Uralic speakers, thus providing further support for a Uralic homeland in Eastern Eurasia, or with an earlier immigration of pre-Uralic, so-called “Paleo-Lakelandic” groups.

I think the genetic interpretation is already straightforward, though. We had a sneak peek at how this late admixture with non-Uralians (mainly Palaeo-Lakelandic and Palaeo-Laplandic peoples from Lovozero and related asbestos ware cultures) is going to unfold among expanding Saami-speaking populations thanks to Lamnidis et al. (2018):

PCA plot of 113 Modern Eurasian populations, with individuals from this study projected on the principal components. Uralic speakers are highlighted in light purple. Image modified from Lamnidis et al. (2018)

Also, still no trace of R1a in far East Asia (reported as M17 ca. 5300 BC near Lake Baikal by Moussa et al. 2016), so I still have doubts about my previous assessment that R1a split into M17 (and thus also M417) in Siberia, with those expanding hunter-gatherer pottery.


The Tungusic Ulchi population probably linked to haplogroup C2b1a


New paper (behind paywall) Demographic and Genetic Portraits of the Ulchi Population, by Balanovska et al. Russian Journal of Genetics (2018) 54(10):1245–1253.

Interesting excerpts (emphasis mine):

Marital structure. The intensity of interethnic marriages puts the existence of the Ulchi population at risk. The colorful ethnic composition of the Ulchi settlements is reflected in the marriage structure [see featured image]. We found that the proportion of single-ethnic marriages of the Ulchi is on average 51%. The greatest number of such marriages takes place in the village of Bulava. Marriages of Ulchi with Russians are in second place. Marriages with indigenous peoples of the Far East, Nanais, Nivkhs, Evenks, and others, are in third place. Thus, almost half of the Ulchi marriages are with representatives of other nationalities. Such a significant level of interethnic mixing makes it possible to talk about intense processes of assimilation of this indigenous people and puts to the forefront the problem of loss of the unique gene pool of the Ulchi.

Haplogroup C (its branch M48) was genotyped for its five subbranches with markers M86, B470, F13686, B93, and the marker at position 16645386 (GRCh37), which was found by our team for the first time. Variant B93 is rare in the Ulchi, and 14 samples (that is, more than a quarter of the entire gene pool of the Ulchi, Fig. 2) belong to M86 and its subvariants. Therefore, we genotyped STR markers of C-M86 carriers for the Ulchi and neighboring Amur populations and analyzed the relationships of detected haplotypes on the phylogenetic network (Fig. 3, STR haplotypes are available from authors upon request).

(…) On the network, different clusters are associated with different populations: most Mongols belong to F13686, all Evenks of the Amur River region with this haplogroup form a subcluster within F13686, and part of Upper Nanais is the basis of cluster B470.

Frequencies of haplogroups of Y chromosome in the Ulchi population. The nomenclature of haplogroups is given according to [9]. Markers that are not in bold type were not typed, but are ancestral for these nodes.

An estimate of the age of the entire haplogroup C-F12355 obtained from the data of genome-wide sequencing of seven specimens is 2400 ± 500 years (O.P. Balanovsky, unpublished data). That is, the common ancestor of all the studied representatives of various peoples with this haplogroup lived not so long ago, the first millennium BC. The formation time of cluster F13686 is somewhat later: 1990 ± 600 years.

(…) obvious traces of the interaction of the gene pool of the Ulchi with neighboring and remote peoples of the Far East and Central Asia in the time range of the last one to three thousand years were revealed. This shows that the results of work [4] on the similarity of the gene pool of the ancient (age of 7500 years) Neolithic genomes of the Amur River region to the Ulchi probably indicate not the uniqueness of the Ulchi, but the fact that this ancient gene pool was preserved in a vast circle of populations of the Far East interwoven with gene flows both with each other and, to a lesser extent, with populations of Central Asia.

The expansion of C2b1a2a-M86 (among many basal C2-M217 samples) is thus possibly associated with the spread of Tungusic, which puts C2b1a at the root of the Micro-Altaic expansion, with a formation date ca. 12700 BC, TMRCA 12500 BC (and not only Mongolian). This shows that Micro-Altaic is connected with a local population which shows a clear continuity since at least 3500 BC. This, however, tells us little about the origin of the language.

See also the recent ISBA presentation on the Houtaomuga site, Neolithic transition in Northeast Asia; and also Bronze Age population dynamics and rise of dairy pastoralism in Mongolia, Impact of colonization in north-eastern Siberia

That leaves the ancestral N lineages found among Far East Asians as Palaeo-Siberian in origin, and their late expansions to the west not particularly linked with any of the known Palaeo-Siberian ethnolinguistic groups, let alone a supposed “Uralo-Altaic” language…