Of particular interest to the current study are the archaeogenetic investigations associated with the exemplary mound 1 from the Ak-Alakha-1 site on the Ukok Plateau in the Altai Republic (Polosmak 1994a; Pilipenko et al. 2015). This typical Pazyryk “frozen grave” was dated around 2268±39 years before present (Bln-4977) (Gersdorff and Parzinger 2000). Initial anthropological findings suggested an undisturbed dual inhumation comprising “a middle-aged European-type man” and “a young European-type woman”, both of whom presumably had a high social status among the Pazyryk elite (Polosmak 1994a). In contrast, recent archaeogenetic investigations revealed somewhat contradicting results since analyses at both the amelogenin gene and Y-chromosome short tandem repeat (Y-STR) loci clearly established that both Scythians were actually males and had paternal and maternal lineages that are typically associated with eastern Eurasians (Pilipenko et al. 2015). Through the use of mitochondrial, autosomal and Y-chromosomal DNA typing systems, it was possible to not only investigate the potential relationships between the two ancient Scythians but also to gather initial phylogenetic and phylogeographic information on their paternal and maternal lineages (Pilipenko et al. 2015).
Based on the Y-STR data available, the two Ak-Alakha-1 Scythians had an in silico haplogroup assignment of N, which first appeared in southeastern Asia and then expanded in southern Siberia (Rootsi et al. 2007; Pilipenko et al. 2015).
The current study aims to investigate the geographical distributions of the ancient and contemporary matches and close genetic variants of the maternal and paternal lineages observed in the two Scythians from the exemplary Ak-Alakha-1 kurgan.
In response to aggressive Xiongnu expansion into the Altai region around the 2nd century BCE, some members of the Pazyryk culture may have started moving up North, and eventually reached the Vilyuy River at the beginning of 1st century CE. Notably, there is clear population continuity between the Uralic people such as Khants, Mansis and Nganasans, Paleo-Siberian people such as Yukaghirs and Chuvantsi, and the Pazyryk people even when considering just the two mtDNA and Y-STR haplotypes from the Ak-Alakha-1 mound 1 kurgan (Tables 1a, b, Table 2, Fig. 1). These concepts are also in agreement with the famous Yakut ethnographer Ksenofontov, who suggested that technologies associated with ferrous metallurgy were brought to the Vilyuy Valley at around 1st century CE by the first (proto)Turkic-speaking pioneers (Ksenofontov 1992). Yakut ethnogenesis per se possibly involved two major stages, the first being the proto-Turkic epoch through the arrival of Scytho-Siberian culture originating from Southern Siberia, such as that associated with the Pazyryk culture and the second being the proper Turkic epoch.
Nomadic peoples from the Central Asian steppes are East Iranian speakers whenever they are of haplogroup R1a, but “Uralic-Altaic” speakers whenever they are of haplogroup N. True story.
Anyway, based on the multi-ethnic federations created during this time, and on the ancestral components visible in the different groups (see a post on Karasuk by Chad Rohlfsen), the Pazyryk culture’s language is unknown, and it could be, as a matter of fact (apart from the obvious East Iranian connection):
Uralic: based on the presence of other Uralic-speaking groups nearby in the Siberian forest-steppes, and on their Karasuk-like admixture in common with Eastern Uralians. In fact, we already have a Pazyryk sample of haplogroup R1a-Z2124 from Berel’ in the Altai region (ca. 4th-3rd c. BC) from Unterländer et al. (2017), which may correspond to Eastern Uralic peoples (as the Bronze Age expansion of R1a-Z645 up to the eastern steppes shows). The appearance of haplogroup N in elite individuals would be quite representative of the infiltration process that must have happened among Ugrians and Samoyeds, and among Finno-Permians in the west.
We also know that haplogroup N and Siberian ancestry expanded into cultures of Northern Eurasia precisely with the creation of the new social paradigm of chiefdoms and alliances, roughly at the same time as Scythians expanded, with the first sample of haplogroup N in Hungary appearing with Cimmerians.
While the study of modern populations is interesting, the problem I have with the paper is the reasoning of “language of ancient haplogroups based on modern populations”, and especially with the concept of “Uralic-Altaic”, and the highly hypothetical “Proto-Turkic” nomadic steppe pastoralists before “Hunnic Turkic” (which is itself questionable), before the “real Turkic” layer (the authors apparently being Turkic themselves), and the supposed “continuity” of Eastern Uralic and Turkic groups in Asia since the Out of Africa migration. The combination of all of this in the same text is just disturbing.
If you look at it from the bright side, at least these samples were not of haplogroup R1a-Z280, or we would be talking about great Slavonic Scythians showing continuity from Russia with love, as the paper threatened to do in its introduction…
If you are enjoying the comeback of this retro 2000s comedy in 2019 (based on the classic nativist “R1a=IE”, “R1b=Basque”, and “N=Uralic” combo) it’s because you – like me – are putting yourself in this guy’s shoes every time a new episode of funny self-destruction appears:
Consistent with their origin, Mongolic-speaking Buryats demonstrate genetic similarity with Mongols, and Turkic-speaking Altai-Kizhi and Teleuts are drawn close to CAS groups. The Tungusic-speaking Evenks collected in central and eastern Siberia cluster together and overlap with Yukagirs. Dolgans are widely scattered in the plot, justifying their recent origin from one Evenk clan, Yakuts, and Russian peasants in the 18th century (Popov, 1964). Uralic-speaking populations comprise a very wide cluster with Komi drawn to Europe, and Khants showing a closer affinity with Selkups, Tundra and Forest Nentsi. Yenisey-speaking Kets are intermingled with Selkups. Interestingly, Samoyedic-speaking Nganasans from the Taymyr Peninsula form a separate tight cluster closer to Evenks, Yukagirs, and Koryaks.
ADMIXTURE and the “Siberian component”
Among Siberians, the Komi are primarily Europeans, while Nganasans, Evenks, Yukagirs, and Koryaks are nearly 100% East Asians. At K = 4 finer scale subcontinental structure can be distinguished with the emergence of a “Siberian” component. This component is highly pronounced in the Nganasans. Outside Siberia, this component is present in Germany and in CAS at low frequency. Within ancient cultures, this component has the highest frequency in three BA Karasuk samples. It is also found in Mal’ta, ENE Afanasievo and BA Andronovo, but not in Ust’-Ishim and BA Okunevo. At K = 5, the “Siberian” component is roughly subdivided into two components with different geographic distributions. The “Nganasan” component is frequent in nearly all Siberian populations, except the Komi, Kets and Selkups. The newly derived “Selkup-Ket” component is found at high frequencies in western Siberian populations. It is observed in BA Karasuk and in Mal’ta. At K = 6, the western Siberian “Nentsi-Khant” ancestry component was developed in Forest and Tundra Nentsi, Khants. This component is also present at low levels in EUR, CAS, Tibet, and southern Siberia.
The Dolgans share more segments with the Nganasans than within themselves (54.13 vs 41.72, Mann-Whitney test, P = .000000000001562546). The result is not surprising as the demographic data showed that the Nganasans were subjected to intense assimilation by the Dolgans in the second half of the 20th century (Goltsova, Osipova, Zhadanov, & Villems, 2005). Tundra Nentsi share more IBD with Forest Nentsi than within themselves (83.96 vs 50.3, P = .000055) possibly due to the common origin and long-term gene flow. The Ket and Selkup populations allocate significantly more IBD blocks between populations than with individuals from their own population (121.2 cM vs 85.9 cM for Kets, P = .000008, and 121.2 cM vs 114.9 cM for Selkups, P = .043).
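The between-versus-within comparisons above rest on a Mann-Whitney test. As a minimal sketch of the statistic behind it, the snippet below computes U for two samples in plain Python. The per-pair IBD totals are hypothetical values invented for illustration (they are not the paper's data), and no p-value is computed.

```python
def mann_whitney_u(xs, ys):
    """U statistic for xs: number of (x, y) pairs with x > y, ties counted as 0.5."""
    u = 0.0
    for x in xs:
        for y in ys:
            if x > y:
                u += 1.0
            elif x == y:
                u += 0.5
    return u


# Hypothetical per-pair IBD totals in cM (illustrative only, not from the paper).
between_population = [60.0, 52.5, 55.0, 49.0]
within_population = [40.0, 45.5, 38.0, 50.0]

u_stat = mann_whitney_u(between_population, within_population)
# Share of all pairs in which the between-population value is the larger one.
share = u_stat / (len(between_population) * len(within_population))
print(u_stat, share)
```

A share well above 0.5 is the pattern the excerpt describes, where pairs drawn from two different populations tend to share more IBD than pairs from the same population.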
Haplogroup N in Siberia
Although Siberia exhibits 42 haplogroups, the vast majority of Siberian Y-chromosomes belong only to 4 of the 18 major clades (N = 46.2%; C = 20.9%; Q = 14.4%; and R = 15.2%). The Y-chromosome haplogroup N is widely spread across Siberia and Eastern Europe (Ilumae et al., 2016; Karafet et al., 2002; Wong et al., 2016) and reaches its maximum frequency among Siberian populations such as Nganasans (94.1%) and Yakuts (91.9%). Within Siberia, two sister subclades N-P43 and N-L708 show different geographic distributions. N-P43 and derived haplogroups N-P63 and N-P362 (phylogenetically identical to N-B478* and N-B170, respectively) (Ilumae et al., 2016) are extremely rare in other major geographic regions. Likely originating in western Siberia, they are limited almost entirely to northwest Siberia, the Volga-Uralic regions, and the Taymyr Peninsula (ie, do not extend to eastern Siberia). Conversely, clade N-L708 is frequent in all Siberian populations except the Kets and Selkups, reaching its highest frequency in the Yakuts (91.9%).
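To make "the vast majority" concrete, the four quoted clade frequencies can simply be added up. This is a small arithmetic sketch using only the percentages given in the excerpt; the variable and function names are my own.

```python
# Frequencies (percent) of the four major Y-chromosome clades quoted above.
major_clades = {"N": 46.2, "C": 20.9, "Q": 14.4, "R": 15.2}


def combined_share(freqs):
    """Summed frequency (percent) of the listed clades, rounded to one decimal."""
    return round(sum(freqs.values()), 1)


total = combined_share(major_clades)
remainder = round(100.0 - total, 1)
print(f"N + C + Q + R = {total}%; all other clades = {remainder}%")
```

So the four clades account for 96.7% of the sampled Siberian Y-chromosomes, with haplogroup N alone making up almost half.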
Surprisingly, not a single sign of the proposed reindeer pastoralist horde led by Nganasans into north-eastern Europe. This is strange because “Siberian” migrants hypothetically imposed their language over Indo-Europeans quite recently, apparently after the Iron Age…
Interesting comparisons among Siberian groups, though.
To understand the population history and context of dairy pastoralism in the eastern Eurasian steppe, we applied genomic and proteomic analyses to individuals buried in Late Bronze Age (LBA) burial mounds associated with the Deer Stone-Khirigsuur Complex (DSKC) in northern Mongolia. To date, DSKC sites contain the clearest and most direct evidence for animal pastoralism in the Eastern steppe before ca. 1200 BCE.
Most LBA Khövsgöls are projected on top of modern Tuvinians or Altaians, who reside in neighboring regions. In comparison with other ancient individuals, they are also close to but slightly displaced from temporally earlier Neolithic and Early Bronze Age (EBA) populations from the Shamanka II cemetery (Shamanka_EN and Shamanka_EBA, respectively) from the Lake Baikal region. However, when Native Americans are added to PC calculation, we observe that LBA Khövsgöls are displaced from modern neighbors toward Native Americans along PC2, occupying a space not overlapping with any contemporary population. Such an upward shift on PC2 is also observed in the ancient Baikal populations from the Neolithic to EBA and in the Bronze Age individuals from the Altai associated with Okunevo and Karasuk cultures.
(…) two individuals fall on the PC space markedly separated from the others: ARS017 is placed close to ancient and modern northeast Asians, such as early Neolithic individuals from the Devil’s Gate archaeological site (22) and present-day Nivhs from the Russian far east, while ARS026 falls midway between the main cluster and western Eurasians.
Upper Paleolithic Siberians from nearby Afontova Gora and Mal’ta archaeological sites (AG3 and MA-1, respectively) (25, 26) have the highest extra affinity with the main cluster compared with other groups, including the eastern outlier ARS017, the early Neolithic Shamanka_EN, and present-day Nganasans and Tuvinians (Z > 6.7 SE for AG3). Main cluster Khövsgöl individuals mostly belong to Siberian mitochondrial (A, B, C, D, and G) and Y (all Q1a but one N1c1a) haplogroups.
Previous studies show a close genetic relationship between WSH populations and ANE ancestry, as Yamnaya and Afanasievo are modeled as a roughly equal mixture of early Holocene Iranian/ Caucasus ancestry (IRC) and Mesolithic Eastern European hunter-gatherers, the latter of which derive a large fraction of their ancestry from ANE. It is therefore important to pinpoint the source of ANE-related ancestry in the Khövsgöl gene pool: that is, whether it derives from a pre-Bronze Age ANE population (such as the one represented by AG3) or from a Bronze Age WSH population that has both ANE and IRC ancestry.
The amount of WSH contribution remains small (e.g., 6.4 ± 1.0% from Sintashta). Assuming that the early Neolithic populations of the Khövsgöl region resembled those of the nearby Baikal region, we conclude that the Khövsgöl main cluster obtained ∼11% of their ancestry from an ANE source during the Neolithic period and a much smaller contribution of WSH ancestry (4–7%) beginning in the early Bronze Age.
Apparently, then, the first individual with substantial WSH ancestry in the Khövsgöl population (ARS026, of haplogroup R1a-Z2123), directly dated to 1130–900 BC, is consistent with the first appearance of admixed forest-steppe-related populations like Karasuk (ca. 1200-800 BC) in the Altai. Interestingly, haplogroup N1a1a-M178 pops up (with mtDNA U5a2d1) among the earlier Khövsgöl samples.
I will repeat what I wrote recently here: Samoyedic arrived in the Altai with Karasuk and hg R1a-Z645 + Steppe_MLBA-like ancestry, admixed with Altai populations, clustering thus within an Ancient Altai cline. Only later did N1a1a subclades infiltrate Samoyedic (and Ugric) populations, bringing them closer to their modern Palaeo-Siberian cline. The shared mtDNA may support an ancestral EHG-“Siberian” cline, or else a more recent Afanasevo-related origin.
Also interesting: Q1a2 subclades and ANE ancestry are making their appearance everywhere among ancestral Eurasian peoples, as Chetan recently pointed out.
The Nganasans have been eastern neighbours of the Enets for at least several centuries, or even longer, as indicated in Figures 2 and 3. They often dwelled on the same grounds and had common households with the Enets. Nganasans and Enets could intermarry (Dolgikh 1962a), while the Nganasans did not marry representatives of any other ethnic groups. As a result, it was not unusual for Enets and Nganasans to live in the same tent and/or to have common relatives. Such close contact must clearly have favoured acquisition of Nganasan by Enets children and of Enets by Nganasan children from an early age.
The Nenets have been close neighbours of all the Enets groups more recently (Figures 2 and 3). In the seventeenth century, there were only warlike contacts between the Nenets and the Enets, while in the eighteenth century the Nenets started to live on the traditional Enets lands, on the western bank of the Yenisey river, with more peaceful interactions reported. (…) Since then the same situation of intermarriages and common households has been attested for these western Enets neighbours as with the Nganasans (Dolgikh 1962a), and this has also created conditions favouring early acquisition of both languages by children.
As for the Evenkis and the Selkups, the Enets had regular contact with these peoples (Figures 2 and 3), though they were not their close neighbours: in fact, geographically, the Selkups were not neighbours at all by the end of the nineteenth century. The Evenkis had always been direct south-eastern neighbours (…) Contacts with Selkups could be trade based, or they could simply be occasional encounters on adjacent lands. (…) [With Evenkis] some sporadic contacts were similar in nature to those with the Selkups, however many other contacts were war-like. Traditionally, the Enets considered the Evenkis to have a martial spirit, and the Evenkis were known as being accustomed to stealing Enets women. A number of stories in Dolgikh (1961) concern Evenkis stealing Enets women and Enets men going to Evenki lands to find and return them. It is clear, therefore, that if Evenki or Selkup were acquired by the Enets, this happened later in life, and this acquisition required particular conditions for it, i.e. it was not readily acquired through regular or harmonious contact (as with Nganasan).
In a pattern similar to the situation with Nganasan, in the second half of the twentieth century most Enets elders could speak Nenets (Vasil’jev 1963; Eugen Helimski p.c., the lead author’s fieldwork experience).
At the start of the period studied, in the 1850s, the Enets linguistic community could be characterized as multilingual in the following five languages: Enets, Nganasan, Nenets, Evenki, and Russian (Figure 4). The number of Enets individuals who were able to converse in each of the other four languages differed and generally was a property of the individuals who had regular social contact with speakers of the other four languages. (…) Note that in all cases of interethnic communication there could well be a lack of perfect proficiency in a language for which the multilingualism is ascribed to the Enets community or Enets individuals: as Braunmüller and Ferraresi (2003: 3) put it: “Nobody would ever have expected to know other languages ‘perfectly’ (whatever that may mean in detail). This expectation seems to be a quite modern idea when discussing issues of bilingualism or multilingualism in general”.
The complex interactions of Siberian populations during the 17th-19th centuries offer a reasonably good picture of the life in the centuries before these accounts, when Samoyedic peoples migrated northwards, and Palaeo-Siberian and Tungusic populations were gradually assimilated into their Uralic culture and language, through intermarriage and close contacts among naturally nomadic populations.
You can read more about the origin of Nganasans – and other modern Samoyedic-speaking peoples – as Palaeo-Siberian populations (hence probably speaking Palaeo-Siberian languages more or less related to each other) who adopted Samoyedic languages in Wikipedia, which offers a summary of Boris Dolgikh’s On the Origin of the Nganasans (1962). Dolgikh is one of the main sources of information for these Siberian groups, as is reflected in this paper, too.
Why are some geneticists using Nganasans – in fact the latest Palaeo-Siberians to learn Samoyedic, already during historic times – as a model for the expansion of Uralic? I have never understood that. Among the many cases of circular reasoning based on modern populations that have been created since the start of population genomics, the use of Nganasans as a model of ‘true Uralians’ is probably the one most clearly and frontally opposed to what was well known in anthropology before geneticists started this new field.
If Kallio is right, most “eastern homeland” proposals are due to the interest of Russian nationalism, which is sadly quite likely to be influencing genetic research, too. It’s like letting Hindu nationalists influence publications on steppe-related migrations. As David Reich puts it in his book:
The tensest twenty-four hours of my scientific career came in October 2008, when my collaborator Nick Patterson and I traveled to Hyderabad to discuss these initial results with Singh and Thangaraj.
Our meeting on October 28 was challenging. Singh and Thangaraj seemed to be threatening to nix the whole project. Prior to the meeting, we had shown them a summary of our findings, which were that Indians today descend from a mixture of two highly divergent ancestral populations, one being “West Eurasians.” Singh and Thangaraj objected to this formulation because, they argued, it implied that West Eurasian people migrated en masse into India. They correctly pointed out that our data provided no direct evidence for this conclusion. They even reasoned that there could have been a migration in the other direction, of Indians to the Near East and Europe. (…) They also implied that the suggestion of a migration from West Eurasia would be politically explosive. They did not explicitly say this, but it had obvious overtones of the idea that migration from outside India had a transformative effect on the subcontinent.
If you add the nation-building myths in Eastern Europe (like the Russian Euro-Asian movements) to the now prevalent Indo-European—CWC idea, and a Siberian ancestry peaking in the Arctic, with little demographic or political relevance of modern Uralic-speaking peoples, you have clearly an explosive sociopolitical mix (based on a mythical Pan-Eurasian Indo-Slavonic) in the making…
Interesting excerpts (emphasis mine; most internal references removed):
The earliest, most secure archaeological evidence of human occupation of the region comes from the artefact-rich, high-latitude (~70° N) Yana RHS site dated to ~31.6 kya (…)
The Yana RHS human remains represent the earliest direct evidence of human presence in northeastern Siberia, a population we refer to as “Ancient North Siberians” (ANS). Both Yana RHS individuals were unrelated males, and belong to mitochondrial haplogroup U, predominant among ancient West Eurasian hunter-gatherers, and to Y chromosome haplogroup P1, ancestral to haplogroups Q and R, which are widespread among present-day Eurasians and Native Americans.
Symmetry tests using f4 statistics reject tree-like clade relationships with both Early West Eurasians (EWE; Sunghir) and Early East Asians (EEA; Tianyuan); however, Yana is genetically closer to EWE, despite its geographic location in northeastern Siberia
Using admixture graphs (qpGraph) and outgroup-based estimation of mixture proportions (qpAdm), we find that Yana can be modelled as EWE with ~25% contribution from EEA
Among all ancient individuals, Yana shares the most genetic drift with Mal’ta, and f4 statistics show that Mal’ta shares more alleles with Yana than with EWE (e.g. f4(Mbuti,Mal’ta;Sunghir,Yana) = 0.0019, Z = 3.99). Mal’ta and Yana also exhibit a similar pattern of genetic affinities to both EWE and EEA, consistent with previous studies. The ANE lineage can thus be considered a descendant of the ANS lineage, demonstrating that by 31.6 kya early representatives of this lineage were widespread across northern Eurasia, including far northeastern Siberia.
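For readers unfamiliar with the notation, here is a minimal sketch of what an f4 statistic measures, assuming the usual definition as the average over SNPs of (pA - pB)(pC - pD), with p the allele frequency in each population. The allele frequencies below are toy values of my own, and real analyses typically obtain the standard error from a block jackknife over the genome, which this sketch does not implement. The last lines only back out the standard error implied by the quoted estimate and Z-score.

```python
def f4(p_a, p_b, p_c, p_d):
    """f4(A,B;C,D): mean over SNPs of (pA - pB) * (pC - pD)."""
    if not (len(p_a) == len(p_b) == len(p_c) == len(p_d)):
        raise ValueError("all populations need the same number of SNPs")
    products = [(a - b) * (c - d) for a, b, c, d in zip(p_a, p_b, p_c, p_d)]
    return sum(products) / len(products)


def implied_standard_error(estimate, z_score):
    """Since Z = estimate / SE, the implied SE is estimate / Z."""
    return estimate / z_score


# Toy allele frequencies at two SNPs (hypothetical, for illustration only).
toy = f4([0.0, 0.5], [0.5, 0.5], [0.0, 1.0], [1.0, 0.0])

# Values quoted in the excerpt: f4(Mbuti,Mal'ta;Sunghir,Yana) = 0.0019, Z = 3.99.
se = implied_standard_error(0.0019, 3.99)
print(toy, round(se, 6))
```

A value near zero would be compatible with Sunghir and Yana being symmetrically related to Mal’ta; the positive value reported here, almost four standard errors from zero, is what supports the extra allele sharing between Mal’ta and Yana.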
(…) the 9.8 kya Kolyma1 individual, representing a group we term “Ancient Paleosiberians” (AP). Our results indicate that AP are derived from a first major genetic shift observed in the region. Principal component analysis (PCA), outgroup f3-statistics and mtDNA and Y chromosome haplogroups (G1b and Q1a1a, respectively) demonstrate a close affinity between AP and present-day Koryaks, Itelmen and Chukchis, as well as with Native Americans.
For both AP and Native Americans, ANS ancestry appears more closely related to Mal’ta than Yana, therefore rejecting a direct contribution of Yana to later AP or Native American groups.
Lake Baikal Neolithic – Bronze Age
(…) the newly reported genomes from Ust’Belaya and recently published neighbouring Neolithic and Bronze Age sites show a succession of three distinct genetic ancestries over a ~6 ky time span. The earliest individuals show predominantly East Asian ancestry, closely related to the ancient individuals from DGC. In the early Bronze Age (BA), we observe a resurgence of AP ancestry (up to ~50% ancestry fraction), as well as influence of West Eurasian Steppe ANE ancestry represented by the early BA individuals from Afanasievo in the Altai region (~10%). This is consistent with previous reports of gene flow from an unknown ANE-related source into Lake Baikal hunter-gatherers.
Our results suggest a southward expansion of AP as a possible source, which is also consistent with the replacement of Y chromosome lineages observed at Lake Baikal, from predominantly haplogroup N in the Neolithic to haplogroup Q in the BA. Finally, the most recent individual from Ust’Belaya, dated to ~600 years ago, falls along the Neosiberian cline, similar to the ~760 year-old ‘Young Yana’ individual from northeastern Siberia, demonstrating the widespread distribution of Neosiberian ancestry in the most recent epoch.
At the western edge of northern Eurasia, genetic and strontium isotope data from ancient individuals at the Levänluhta site documents the presence of Saami ancestry in Southern Finland in the Late Holocene 1.5 kya. This ancestry component is currently limited to the northern fringes of the region, mirroring the pattern observed for AP ancestry in northeastern Siberia. However, while the ancient Saami individuals harbour East Asian ancestry, we find that this is better modelled by DGC rather than AP, suggesting that AP influence was likely restricted to the eastern side of the Urals. Comparison of ancient Finns and Saami with their present-day counterparts reveals additional gene flow over the past 1.6 kya, with evidence for West Eurasian admixture into modern Saami. The ancient Finn from Levänluhta shows lower Siberian ancestry than modern Finns.
EDIT (27 OCT 2018): By comparing the three, I see these are samples published already (at least two) in Lamnidis et al. (2018), but here with added (1) specific radiocarbon dates, (2) comparison with Neosiberian populations and (3) strontium isotope analyses.
Finnish_IA (ca. 350 AD) is probably a Saami-speaking individual, just like the Saami_IA with newly reported radiocarbon dates from Levänluhta ca. 400-600 AD (since Fennic peoples were then likely around the Gulf of Finland).
The conflicting strontium isotope data on marine dietary resources on certain samples from the supplementary material hint at a possible external origin of the diet of some of the previously reported (and possibly one newly reported) Saami Iron Age individuals, anywhere from some 25-30 km to the northwest through the river up to hundreds of km to the southwest of Levänluhta (i.e. the whole coast of the Bothnian Sea). It is unclear why they would prefer an origin of the dietary source in southern Baltic regions instead of some km to the west, though, unless that’s what they want to propose based on the sample’s admixture…
The coast of the Bothnian Sea (=the northern part of the Baltic Sea, between Sweden and Finland) lay only 25-30 km to the northwest, and accessible to the Iron Age people of the Levänluhta region via the Kyrönjoki river. (…) For individual JA2065/DA236, the low 87Sr/86Sr value (0.71078) would imply an exceptionally heavy reliance on Baltic Sea resources. The δ13C and δ15N values of the individual are near comparable (especially considering within-Baltic latitudinal gradients in δ13C; Torniainen et al. 2017) to the δ13C and δ15N values of a Middle Neolithic population on the Baltic island of Gotland (Eriksson, 2004) interpreted to have subsisted primarily on seals.
These new data on the samples give us some more information than what we already had, because the early date of Finnish_IA implies that there was little East Asian admixture (if any at all) in west Finland during the Roman Iron Age, which pushes still farther forward in time the expected appearance of Siberian ancestry among Saamic (first) and Fennic populations (later). It is unclear whether this East Asian ancestry found in Finnish_IA is actually related to DGC, or it is rather related to the ENA-like ancestry found already in Baltic hunter-gatherers (i.e. in some EHG samples from Karelia), for which Baikal_EN is a good proxy in Lazaridis et al. (2018).
The paper thus finds increased (probably actual) Siberian ancestry in modern Finns compared to this Iron Age Saami individual. Coupled with the later Saami Iron Age samples, dated one to three centuries later and showing the start of the Siberian ancestry influx, we can begin to establish when the expansion of Siberian ancestry happened in central Finland, and thus quite likely when the Saami began to expand to the north and east and admix with Palaeo-Laplandic peoples.
One sample of haplogroup N1a1a1a1a4a1-M1982, Yana_MED, is found in the Arctic region (north-eastern Yakutia) ca. 1100 AD. Since it is derived from N1a1a1a1a-L392, it might be a surprise for some to find it in a clearly non-Uralic speaking environment at the same time other subclades of this haplogroup were admixing in the west with well-established Finno-Saamic, Volga-Finnic, Ugric, and Samoyedic populations…
On the growing doubts that these data – contradicting the CWC=IE theory – are creating among geneticists (from the supplementary materials):
The Proto-Saami language evolved in southern Finland and Karelia in the Early Iron Age, an area now host to Finnish and the closely related Karelian, but with Saami toponyms showing that the latter two languages are intrusive here (Saarikivi 2004). Saami-speaking populations are thought to have retreated to Lapland during the Middle Iron Age (300–800 AD), where it diverged into the modern Saami dialects. Genetically, the northward retreat of the Saami language correlates with the documented decrease of Saami ancestry in Southern Finland between the Iron Age and the modern period (cf. Lamnidis et al. 2018).
On the way to Lapland, the Saami replaced at least two linguistically obscure groups. This can be inferred from 1) an influx of non-Uralic loanwords into Proto-Saami in the Finnish Lakeland area, and 2) an influx of non-Uralic, non-Germanic words into Saami dialects in Lapland (Aikio 2012). Both of these borrowing events imply contact with non-Saami-speaking groups, e.g. non-Uralic-speaking hunter-gatherers that may have left a genetic and linguistic footprint on modern Saami populations.
The linguistic prehistory of Finland thus does not allow for a straightforward interpretation of the genetic data. The detection of East Asian ancestry in the genetically Saami individual is indicative of a population movement from the east (cf. Lamnidis et al. 2018, Rootsi et al. 2007), one that given the affinities with the ~7.6 ky old individuals from the Devil’s Gate Cave may have been a western extension of the Neosiberian turnover. However, it remains unclear whether this gene flow should be associated with the arrival of Uralic speakers, thus providing further support for a Uralic homeland in Eastern Eurasia, or with an earlier immigration of pre-Uralic, so-called “Paleo-Lakelandic” groups.
I think the genetic interpretation is already straightforward, though. We had a sneak peek at how this late admixture with non-Uralians (mainly Palaeo-Lakelandic and Palaeo-Laplandic peoples from Lovozero and related asbestos ware cultures) is going to unfold among expanding Saami-speaking populations thanks to Lamnidis et al. (2018):
Also, still no trace of R1a in far East Asia (reported as M17 ca. 5300 BC near Lake Baikal by Moussa et al. 2016), so I still have doubts about my previous assessment that R1a split into M17 (and thus also M417) in Siberia, with those expanding hunter-gatherer pottery.
Marital structure. The intensity of interethnic marriages puts the existence of the Ulchi population at risk. The colorful ethnic composition of the Ulchi settlements is reflected in the marriage structure [see featured image]. We found that the proportion of single-ethnic marriages of the Ulchi is on average 51%. The greatest number of such marriages takes place in the village of Bulava. Marriages of Ulchi with Russians are in second place. Marriages with indigenous peoples of the Far East, Nanais, Nivkhs, Evenks, and others, are in third place. Thus, almost half of the Ulchi marriages are with representatives of other nationalities. Such a significant level of interethnic mixing makes it possible to talk about intense processes of assimilation of this indigenous people and puts to the forefront the problem of loss of the unique gene pool of the Ulchi.
Haplogroup C (its branch M48) was genotyped for its five subbranches with markers M86, B470, F13686, B93, and the marker at position 16645386 (GRCh37), which was found by our team for the first time. Variant B93 is rare in the Ulchi, and 14 samples (that is, more than a quarter of the entire gene pool of the Ulchi, Fig. 2) belong to M86 and its subvariants. Therefore, we genotyped STR markers of C-M86 carriers for the Ulchi and neighboring Amur populations and analyzed the relationships of detected haplotypes on the phylogenetic network (Fig. 3, STR haplotypes are available from authors upon request).
(…) On the network, different clusters are associated with different populations: most Mongols belong to F13686, all Evenks of the Amur River region with this haplogroup form a subcluster within F13686, and part of Upper Nanais is the basis of cluster B470.
An estimate of the age of the entire haplogroup C-F12355 obtained from the data of genome-wide sequencing of seven specimens is 2400 ± 500 years (O.P. Balanovsky, unpublished data). That is, the common ancestor of all the studied representatives of various peoples with this haplogroup lived not so long ago, the first millennium BC. The formation time of cluster F13686 is somewhat later: 1990 ± 600 years.
(…) obvious traces of the interaction of the gene pool of the Ulchi with neighboring and remote peoples of the Far East and Central Asia in the time range of the last one to three thousand years were revealed. This shows that the results of work  on the similarity of the gene pool of the ancient (age of 7500 years) Neolithic genomes of the Amur River region to the Ulchi probably indicate not the uniqueness of the Ulchi, but the fact that this ancient gene pool was preserved in a vast circle of populations of the Far East interwoven with gene flows both with each other and, to a lesser extent, with populations of Central Asia.
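To make the arithmetic behind the quoted age estimates explicit, here is a minimal Python sketch that converts an age in years before present into a calendar range. The reference year (2018) and the function names are assumptions made for illustration; the quoted text does not state which year the ages are counted from.

```python
def to_calendar(years_ago, ref_year=2018):
    """Convert an age in years before ref_year into a calendar label.

    ref_year=2018 is an assumption; the quoted text does not state it.
    """
    year = ref_year - years_ago  # astronomical numbering: 0 means 1 BC
    return f"{1 - year} BC" if year <= 0 else f"AD {year}"


def calendar_range(age, error, ref_year=2018):
    """Return (oldest, central, youngest) calendar labels for age +/- error."""
    return (
        to_calendar(age + error, ref_year),
        to_calendar(age, ref_year),
        to_calendar(age - error, ref_year),
    )


# Haplogroup C-F12355: 2400 +/- 500 years
print(calendar_range(2400, 500))  # ('883 BC', '383 BC', 'AD 118')
# Cluster F13686: 1990 +/- 600 years
print(calendar_range(1990, 600))  # ('573 BC', 'AD 28', 'AD 628')
```

Under that assumption, the central estimate for C-F12355 falls in the first millennium BC, as the excerpt says, while the central estimate for cluster F13686 lands around the turn of the era, with a wide interval on either side.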
The expansion of C2b1a2a-M86 (among many basal C2-M217 samples) is thus possibly associated with the spread of Tungusic, which puts C2b1a at the root of the Micro-Altaic expansion, with a formation date ca. 12700 BC, TMRCA 12500 BC (and not only Mongolian). This shows that Micro-Altaic is connected with a local population which shows a clear continuity since at least 3500 BC. This, however, tells us little about the origin of the language.
That leaves the ancestral N lineages found among Far East Asians as Palaeo-Siberian in origin, and their late expansions to the west not particularly linked with any of the known Palaeo-Siberian ethnolinguistic groups, let alone a supposed “Uralo-Altaic” language…
It has been known for a long time that the Caucasus must have hosted many (at least partially) isolated populations, probably helped by geographical boundaries, setting it apart from open Eurasian areas.
David Reich writes in his book the following about India:
The genetic data told a clear story. Around a third of Indian groups experienced population bottlenecks as strong or stronger than the ones that occurred among Finns or Ashkenazi Jews. We later confirmed this finding in an even larger dataset that we collected working with Thangaraj: genetic data from more than 250 jati groups spread throughout India (…)
Rather than an invention of colonialism as Dirks suggested, long-term endogamy as embodied in India today in the institution of caste has been overwhelmingly important for millennia. (…)
The Han Chinese are truly a large population. They have been mixing freely for thousands of years. In contrast, there are few if any Indian groups that are demographically very large, and the degree of genetic differentiation among Indian jati groups living side by side in the same village is typically two to three times higher than the genetic differentiation between northern and southern Europeans. The truth is that India is composed of a large number of small populations.
There is little doubt now, based on findings spanning thousands of years, that the Mesolithic and Neolithic Caucasus hosted various very small populations, even if the ancestral components may be reduced to the few known to date (such as ANE, EHG, AME*, ENA, CHG, and other “deep” ancestral components).
NOTE. I will call the ancestral component of Dzudzuana/Anatolian hunter-gatherers Ancient Middle Easterner (AME), to give a clear idea of its likely extension during the Late Upper Palaeolithic, and to avoid using the more simplistic Dzudzuana, unless it is useful to mention these specific local samples.
Genetic labs have a strong fixation with ancestry. I guess the use of complex statistical methods gives professionals and laymen alike the feeling of dealing with “Science”, as opposed to academic fields where you have to interpret data. I think language reveals a lot about the way people think, and the fact that ancestral components are called ‘lineages’ – while not wrong per se – is a clear symptom of the lack of interest in the true lineages: Y-DNA haplogroups.
It has become quite clear that male-biased migrations are often the ones which can be confidently followed for actual population movements and ethnolinguistic identification, at least until the Iron Age. The frequently used Palaeolithic clusters offer a clear example of why ancestry does not represent what some people believe: They merely give a basic idea of sizeable population replacements by distant peoples.
Both concepts are important: sizeable and distant peoples. For example, during the Upper Palaeolithic in Europe there was a sizeable population replacement of the Aurignacian Goyet cluster by the Gravettian Vestonice cluster (probably from populations of far eastern Russia) coupled with the arrival of haplogroup I, although during the thousands of years that this material culture lasted, the previously expanded C1a2 lineages did not disappear, and there were probably different resurgence and admixture events.
Haplogroup I certainly expanded with the Gravettian culture to Iberia, where the Goyet ancestry did not change much – probably because of male-driven migrations – to the extent that during the Magdalenian expansions haplogroup I expanded with an ancestry closer to Goyet, in what is called a ‘resurgence’ of the Goyet cluster – even though there is a clear replacement of male lines.
The Villabruna (WHG) cluster is another good example. It probably spread with haplogroup R1b-L754, which – based on the extra ‘East Asian’ affinity of some samples and on modern samples from the Middle East – came probably from the east through a southern route, and not too long before the expansion of WHG likely from around the Black Sea, although this is still unclear. The finding of haplogroup I in samples of mostly WHG ancestry could confuse people that do not care about timing, sub-structured populations, and gene flow.
NOTE. If you don’t understand why ‘clusters’ that span thousands of years don’t really matter for the many Palaeolithic population expansions that certainly happened among hunter-gatherers in Europe, just take a look at what happened with Bell Beakers expanding from Yamna into western Europe within 500 years.
If we don’t tread carefully when talking about population migrations, these terms are bound to confuse people. Just as the fixation on “steppe ancestry” – which marks the arrival in Chalcolithic Europe of peoples from the Pontic-Caspian region – has confused a lot of researchers to this day.
When I began to write about the Indo-European demic diffusion model, my concern was to find a single spot where a North-West Indo-European proto-language could have expanded from ca. 2000 BC (our most common guesstimate). Based on the 2015 papers, and in spite of their conclusions, I thought it had become clear that Corded Ware was not it, and it was rather Bell Beakers. I assumed that Uralic was spoken to the north (as was the traditional belief), and thus Corded Ware expanded from the forest zone, hence steppe ancestry would also be found there with other R1a lineages.
With the publication of Mathieson et al. (2017) and Olalde et al. (2017), I changed my mind, seeing how “steppe ancestry” did in fact appear quite late, hence it was likely to be the result of very specific population movements, probably directly from the Caucasus. Later, Mathieson published in a revision the sample from Alexandria of hg R1a-M417 (probably R1a-Z645, possibly Z93+), which further supported the idea that the migration of Corded Ware peoples started near the North Pontic forest-steppe (as I included in the next revision).
The question remains the same one I repeated recently, though: where do the extra Caucasus components (i.e. beyond EHG) of Eneolithic Ukraine/Corded Ware and Khvalynsk/Yamna come from?
Considering 2-way mixtures, we can model Karelia_HG as deriving 34 ± 2.8% of its ancestry from a Villabruna-related source, with the remainder mainly from ANE represented by the AfontovaGora3 (AG3) sample from Lake Baikal ~17kya.
AG3 was likely of haplogroup Q1a (as reported by YFull, see Genetiker), and probably the ANE ancestry found in Eastern Europe accompanied a Palaeolithic migration of Q1a2-M25 (formed ca. 22600 BC, TMRCA ca. 14300 BC).
Combined with what we know about the Eneolithic Steppe and Caucasus populations – it is likely that ANE ancestry remained the most important component of some of the small ghost populations of the Caucasus until their emergence with the Lola culture.
The first sample we have now attributed to the EHG cluster is Sidelkino, from the Samara region (ca. 9300 BC), mtDNA U5a2. In Damgaard et al. (Science 2018), Yamnaya could be modelled as deriving from a CHG population related to Kotias Klde (54%), with the remainder from an ANE population related to Sidelkino (>46%), with the following split events:
A split event, where the CHG component of Yamnaya splits from KK1. The model inferred this time at 27 kya (though we note the larger models in Sections S2.12.4 and S2.12.5 inferred a more recent split time).
A split event, where the ANE component of Yamnaya splits from Sidelkino. This was inferred at about 11 kya.
A split event, where the ANE component of Yamnaya splits from Botai. We inferred this to occur 17 kya. Note that this is above the Sidelkino split time, so our model infers Yamnaya to be more closely related to the EHG Sidelkino, as expected.
An ancestral split event between the CHG and ANE ancestral populations. This was inferred to occur around 40 kya.
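The reasoning in the quoted list (a more recent split implies a closer relationship) can be sketched in a few lines of Python. The split times are the ones quoted above; the component labels and the function name are illustrative assumptions, not identifiers from the paper.

```python
# Split times in thousands of years ago (kya), as quoted above.
# The labels are illustrative names, not identifiers from the paper.
SPLITS_KYA = [
    ("Yamnaya_CHG", "KK1", 27),
    ("Yamnaya_ANE", "Sidelkino", 11),
    ("Yamnaya_ANE", "Botai", 17),
]
ANCESTRAL_CHG_ANE_SPLIT_KYA = 40


def closest_reference(component, splits=SPLITS_KYA):
    """Return the (reference, split time) pair with the most recent split."""
    candidates = [(ref, t) for comp, ref, t in splits if comp == component]
    return min(candidates, key=lambda pair: pair[1])


# The ANE component splits from Sidelkino later than from Botai,
# so Sidelkino is the closer reference in this model.
print(closest_reference("Yamnaya_ANE"))  # ('Sidelkino', 11)

# Every within-component split must be younger than the CHG/ANE split.
print(all(t < ANCESTRAL_CHG_ANE_SPLIT_KYA for _, _, t in SPLITS_KYA))  # True
```

This only restates the ordering of the inferred dates; it says nothing about their uncertainty, which the quoted text itself flags for the 27 kya estimate.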
Other samples classified as of the EHG cluster:
Popovo2 (ca. 6250 BC) of hg J1, mtDNA U4d – Po2 and Po4 from the same site (ca. 6550 BC) show continuity of mtDNA.
Karelia_HG, from Juzhnii Oleni Ostrov (ca. 6300 BC): I0211/UzOO40 (ca. 6300 BC) of hg J1(xJ1a), mtDNA U4a; and I0061/UzOO74 of hg R1a1(xR1a1a), mtDNA C1
UzOO77 and UzOO76 from Juzhnii Oleni Ostrov (ca. 5250 BC) of mtDNA R1b.
Samara_HG from Lebyanzhinka (ca. 5600 BC) of hg R1b1a, mtDNA U5a1d.
About the enigmatic Anatolia_Neolithic-related ancestry found in Pontic-Caspian steppe samples, this is what Wang et al. (2018) had to say:
We focused on model of mixture of proximal sources such as CHG and Anatolian Chalcolithic for all six groups of the Caucasus cluster (Eneolithic Caucasus, Maykop and Late Maykop, Maykop-Novosvobodnaya, Kura-Araxes, and Dolmen LBA), with admixture proportions on a genetic cline of 40-72% Anatolian Chalcolithic related and 28-60% CHG related (Supplementary Table 7). When we explored Romania_EN and Greece_Neolithic individuals as alternative southeast European sources (30-46% and 36-49%), the CHG proportions increased to 54-70% and 51-64%, respectively. We hypothesize that alternative models, replacing the Anatolian Chalcolithic individual with yet unsampled populations from eastern Anatolia, South Caucasus or northern Mesopotamia, would probably also provide a fit to the data from some of the tested Caucasus groups.
The first appearance of ‘Near Eastern farmer related ancestry’ in the steppe zone is evident in Steppe Maykop outliers. However, PCA results also suggest that Yamnaya and later groups of the West Eurasian steppe carry some farmer related ancestry as they are slightly shifted towards ‘European Neolithic groups’ in PC2 (Fig. 2D) compared to Eneolithic steppe. This is not the case for the preceding Eneolithic steppe individuals. The tilting cline is also confirmed by admixture f3-statistics, which provide statistically negative values for AG3 as one source and any Anatolian Neolithic related group as a second source
Detailed exploration via D-statistics in the form of D(EHG, steppe group; X, Mbuti) and D(Samara_Eneolithic, steppe group; X, Mbuti) show significantly negative D values for most of the steppe groups when X is a member of the Caucasus cluster or one of the Levant/Anatolia farmer-related groups (Supplementary Figs. 5 and 6). In addition, we used f- and D-statistics to explore the shared ancestry with Anatolian Neolithic as well as the reciprocal relationship between Anatolian- and Iranian farmer-related ancestry for all groups of our two main clusters and relevant adjacent regions (Supplementary Fig. 4). Here, we observe an increase in farmer-related ancestry (both Anatolian and Iranian) in our Steppe cluster, ranging from Eneolithic steppe to later groups. In Middle/Late Bronze Age groups especially to the north and east we observe a further increase of Anatolian farmer related ancestry consistent with previous studies of the Poltavka, Andronovo, Srubnaya and Sintashta groups and reflecting a different process not especially related to events in the Caucasus.
(…) Surprisingly, we found that a minimum of four streams of ancestry is needed to explain all eleven steppe ancestry groups tested, including previously published ones (Fig. 2; Supplementary Table 12). Importantly, our results show a subtle contribution of both Anatolian farmer-related ancestry and WHG-related ancestry (Fig.4; Supplementary Tables 13 and 14), which was likely contributed through Middle and Late Neolithic farming groups from adjacent regions in the West. The discovery of a quite old AME ancestry has rendered this probably unnecessary, because this admixture from an Anatolian-like ghost population could be driven even by small populations from the Caucasus.
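For readers unfamiliar with the statistics mentioned in the excerpt, here is a minimal Python sketch of an admixture f3-statistic and a D-statistic computed from population allele frequencies. It is a simplified illustration, not the pipeline used by Wang et al.: it uses made-up frequencies at four sites, ignores finite-sample bias corrections and standard errors, and all variable names are hypothetical.

```python
def f3(c, a, b):
    """Admixture f3(C; A, B): mean of (c - a) * (c - b) over sites.

    A negative value suggests that C is a mixture of populations
    related to A and B. No sample-size correction is applied here.
    """
    return sum((ci - ai) * (ci - bi) for ci, ai, bi in zip(c, a, b)) / len(c)


def d_stat(w, x, y, z):
    """D(W, X; Y, Z) from allele frequencies.

    Numerator: sum of (w - x) * (y - z).
    Denominator: sum of (w + x - 2wx) * (y + z - 2yz).
    With this sign convention, a negative value means X shares
    more alleles with Y than W does.
    """
    num = sum((wi - xi) * (yi - zi) for wi, xi, yi, zi in zip(w, x, y, z))
    den = sum(
        (wi + xi - 2 * wi * xi) * (yi + zi - 2 * yi * zi)
        for wi, xi, yi, zi in zip(w, x, y, z)
    )
    return num / den


# Hypothetical allele frequencies at four sites (illustration only).
ehg = [0.2, 0.6, 0.8, 0.1]
caucasus = [0.8, 0.2, 0.4, 0.7]
outgroup = [0.1, 0.5, 0.9, 0.3]

# A toy "steppe" population: 70% EHG-like plus 30% Caucasus-like.
steppe = [0.7 * e + 0.3 * c for e, c in zip(ehg, caucasus)]

print(f3(steppe, ehg, caucasus) < 0)                # True: admixture signal
print(d_stat(ehg, steppe, caucasus, outgroup) < 0)  # True: extra Caucasus affinity
```

In this toy setup the negative f3 and the negative D(EHG, steppe; Caucasus, outgroup) both follow directly from how the mixed population was constructed, which is the pattern the excerpt reports for most steppe groups.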
While it is not yet fully clear, the increased Anatolian_Neolithic-like ancestry in Ukraine_Eneolithic samples (see below) makes it unlikely that all such ancestry in Corded Ware groups comes from a GAC-related contribution. It is likely that at least part of it represents contributions from populations of the Caucasus, based on the mostly westward population movements in the steppe from ca. 4600 BC on, including the Suvorovo-Novodanilovka expansion, and especially the Kuban-Maykop expansion during the final Eneolithic into the North Pontic area.
NOTE. Since CHG-like groups from the Caucasus may have combinations of AME and ANE ancestry similar to Yamna (which may thus appear as ‘steppe ancestry’ in the North Pontic area), it is impossible to interpret with precision the following ADMIXTURE graphic:
The East Asian contribution to WHG samples (like Loschbour or La Braña), as specified in Fu et al. (2016), does not seem to be related to Baikal_EN, and appears possibly (in the ADMIXTURE analysis) integrated into the Villabruna component. I guess this implies that the shared alleles with East Asians are quite early, and potentially due to the expansion of R1b-L754 from the East.
It would be interesting to know the specific material culture Sidelkino belonged to – i.e. if it was related to the expansion of the North-Eastern Technocomplex – and its Y-DNA. The Post-Swiderian expansion into eastern Europe, probably associated with the expansion of R1b-P297 lineages (including R1b-M73, found later in Botai and in Baltic HG) is supposed to have begun during the 11th millennium BC, but migrations to the Urals and beyond are probably concentrated in the 9th millennium, so this sample is possibly slightly early for R1b.
NOTE. User Rozenfeld at Anthrogenica posted this, which I think is interesting (in case anyone wants to try a Y-SNP call):
there is something strange with Sidelkino EHG: first, its archaeological context is not described in the supplementary. Second, its sex is not listed in the supplementary tables. Third, after looking for info about this sample, I found that: “Сиделькино-3. Для снятия вопроса о половой принадлежности индивида была проведена генетическая экспертиза, выявившая принадлежность останков мужчине.”(translation: Sidelkino-3. To resolve the question about sex of the remains, the genetic analysis was conducted, which showed that remains belonged to male), source: http://static.iea.ras.ru/books/7487_Traditsii.pdf
So either they haven’t mentioned his Y-DNA in the paper for some reason, or there are more than one Sidelkino sample and the male one has not yet been published. The coverage of the Sidelkino sample from the paper is 2.9, more than enough to tell Y-DNA haplogroup.
My speculative guess right now about specific population movements in far eastern Europe, based on the few data we have:
The expansion of the North-Eastern Technocomplex first around the 9th millennium BC, most likely expanded R1b-P297 ca. 11300 BC, judging by its TMRCA, with both R1b-M73 (TMRCA 5300 BC) and R1b-M269 (TMRCA 4400 BC) info (with extra El Mirón ancestry) back, and thus Eurasiatic.
The expansion of haplogroup J1 to the north may have happened before or after the R1b-P297 expansion. Judging by the increase in AG3-related ancestry near Karelia compared to Baltic_HG, it is possible that it expanded just after R1b-P297 (hence possibly J1-Y6304? TMRCA 9700 BC). Its long-lasting presence in the Caucasus is supported by the Satsurblia (ca. 11300 BC) and the Dolmen BA (ca. 1300 BC) samples.
The expansion of R1a-M17 ca. 6600 BC is still likely to have happened from the east, based on the R1a-M17 samples found in Baikalic cultures slightly later (ca. 5300 BC). The presence of elevated Baikal_EN ancestry in Karelia HG and in Samara HG, and the finding of R1a-M417 samples in the Forest Zone after the Mesolithic suggests a connection with the expansion of Hunter-Gatherer pottery, from the Elshanka culture in the Samara region northward into the Forest Zone and westward into the North Pontic area.
The expansion of R1b-M73 ca. 5300 BC is likely to be associated with the emergence of a group east of the Urals (related to the later Botai culture, and potentially Pre-Yukaghir). Its presence in a Narva sample from Donkalnis (ca. 5200 BC) suggests either an early split and spread of both R1b-P297 lineages (M73 and M269) through Eastern Europe, or maybe a back-migration with hunter-gatherer pottery.
R1b-M269 spread successfully ca. 4400 BC (and R1b-L23 ca. 4100 BC, both based on TMRCA), and this successful expansion is probably to be associated with the Khvalynsk-Novodanilovka expansion. We already know that Samara_HG ca. 5600 BC was R1b1a, so it is likely that R1b-M269 appeared (or ‘resurged’) in the Volga-Ural region shortly after the expansion of R1a-M17, whose expansion through the region may be inferred by the additional AG3 and Baikal_EN ancestry. What is interesting about Samara_HG, compared to the previous Sidelkino sample, is the introduction of more El Mirón-related ancestry, typical of WHG populations (and thus characteristic of Baltic groups).
NOTE. The TMRCA dates are obviously gross approximations, because a) the actual rate of mutation is unknown and b) TMRCA estimates are based on the convergence of lineages that survived. The potential finding of R1a-Z645 (possibly Z93+) in Ukraine Eneolithic (ca. 4000 BC), and the potential finding of R1b-L23 in Khvalynsk ca. 4250 BC complicate things further, in terms of dates and origins of any subclade.
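To illustrate point a), here is a rough Python sketch of how a SNP-counting TMRCA estimate depends on the assumed mutation rate. The formula is a generic simplification (mean number of derived SNPs divided by the expected number of mutations per year over the callable region), and every input value below is hypothetical; none of them comes from YFull or from any of the papers discussed.

```python
def tmrca_years(mean_snps, callable_bp, rate_per_bp_per_year):
    """Rough TMRCA: mean SNPs since the ancestor / mutations expected per year.

    All three inputs are assumptions supplied by the user.
    """
    return mean_snps / (callable_bp * rate_per_bp_per_year)


# Hypothetical inputs, for illustration only.
MEAN_SNPS = 50           # average derived SNPs per lineage below the node
CALLABLE_BP = 8_000_000  # base pairs of Y chromosome reliably covered

for rate in (0.7e-9, 0.8e-9, 0.9e-9):  # assumed mutations per bp per year
    years = tmrca_years(MEAN_SNPS, CALLABLE_BP, rate)
    print(f"rate={rate:.1e}: TMRCA ~ {years:,.0f} years")
```

Because the estimate is inversely proportional to the rate, a modest change in the assumed rate shifts the resulting date by many centuries. Point b), the dependence on which lineages happen to survive, is not captured by this sketch at all.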
The question thus remains as it was long ago: did R1b-M269 lineages expand (‘return’) from the east, near the Urals, or directly from the north? Were they already near Samara at the same time as the expansion of hunter-gatherer pottery, and were not much affected by it? Or did they ‘resurge’ from populations admixed with Caucasus-related ancestry after the expansion of R1a-M17 with this pottery (since there are different stepped expansions from the Samara region)? We could even ask, did R1a-M17 really expand from the east, i.e. are the dates on Baikalic subclades from Moussa et al. (2016) reliable? Or did R1a-M17 expand from some pockets in the Pontic-Caspian steppe, taking over the expansion of HG pottery at some point?
The most interesting aspect from the new paper (regarding Indo-Uralic migrations) is that Ancient Middle Easterner (AME) ancestry will probably be a better proxy for the Anatolia_Neolithic component found in Ukraine Mesolithic to Eneolithic, and possibly also for some of the “more CHG-like” component found among Pontic-Caspian steppe populations, all likely derived from different admixture events with groups from the Caucasus.
NOTE. Even the supposed gene flow of Neolithic Iranian ancestry into the Caucasus can be put into question, since that means possibly a Dzudzuana-like population with greater “deep ancestry” proportion than the one found in CHG, which may still be found within the Caucasus.
If it was not clear already that following ‘steppe ancestry’ wherever it appears is a rather lame way of following Indo-European migrations, every single sample from the Caucasus and their admixture with Pontic-Caspian steppe populations will probably show that “steppe ancestry” is in fact formed by a variety of steppe-related ancestral components, impossible to follow coherently with a single population. Exactly what is happening already with the Siberian ancestry.
If the paper on the Dzudzuana samples has shown something, it is that the expansion of an ANE-like population shook the entire Caucasus area up to the Zagros Mountains, creating the ANE – AME cline represented by CHG and Iran_N, with further contributions of “deep ancestries” (probably from the south) complicating the picture further.
If this happens with few known samples, and we know of an ANE-like ghost population in the Caucasus (appearing later in the Lola culture), we can already guess that the often repeated “CHG component” found in Ukraine_Eneolithic and Khvalynsk will not be the same (except the part mediated by the Novodanilovka expansion).
This ANE-like expansion happened probably in the Late Upper Palaeolithic, and reached Northern Europe probably after the expansion of the Villabruna cluster (ca. 12000 BC), judging by the advance of AG3-like and ENA-like ancestry in later WHG samples.
The population movements during the Mesolithic and Early Neolithic in the North Pontic area are quite complicated: the extra AME ancestry is probably connected to the admixture with populations from the Caucasus, while the close similarity of Ukraine populations with Scandinavian ones (with an increase in Villabruna ancestry from Mesolithic to Neolithic samples) probably reveals population movements related to the expansion of Maglemose-related groups.
These Maglemose-related groups were probably migrants from the north-west, originally from the Northern European Plains, who occupied the previous Swiderian territory, and then expanded into the North Pontic area. The overwhelming presence of I2a (likely all I2a2a1b1b) lineages in Ukraine Neolithic supports this migration.
The likely picture of Mesolithic-Neolithic migrations in the North Pontic area right now is then:
Expansion of R1a-M459 from the east ca. 12000 BC – probably coupled with AG3 and also some Baikal_EN ancestry. First sample is I1819 from Vasilievka (ca. 8700 BC), another is from Dereivka ca. 6900 BC.
Expansion of R1b-V88 from the Balkans in the west ca. 9700 BC, based on its TMRCA and also the Balkan hunter-gatherer population overwhelmingly of this haplogroup from the 10th millennium until the Neolithic. First sample is I1734 from Vasilievka (ca. 7252 BC), which suggests that it replaced the male population there, based on their similar EHG-like admixture (and lack of sizeable WHG increase), and shared mtDNA U5b2, U5a2.
Expansion of I2a-Y5606 probably ca. 6800 BC based on its TMRCA with Janislawice culture. Supporting this is the increase in WHG contribution to Neolithic samples, including the spread of U4 subclades compared to the previous period.
Expansion of R1a-M17 starting probably ca. 6600 BC in the east (see above).
NOTE. The first sample of haplogroup I appears in the Mesolithic: I1763 (ca. 8100 BC) of haplogroup I2a1, probably related to an older Upper Palaeolithic expansion.
It is becoming more and more clear with each new paper that – unless the number of very ancient samples increases – the use of Y-chromosome haplogroups remains one of the most important tools for academics; this is especially so in the steppes, in light of the diversity found in populations from the Caucasus. A clear example comes from the Yamna – Corded Ware similarities:
The presence of haplogroups Q and R1a-M459 (xM17) in Khvalynsk along with a R1b1a sample, which some interpreted as being akin to modern ‘mixed’ populations in the past, is likely to point instead to a period of Khvalynsk-Novodanilovka expansion with R1b-M269, where different small populations from the steppe were being integrated into the common Khvalynsk stock, but where differences are seen in material culture surrounding their burials, as supported by the finding of R1b1 in the Kuban area already in the first half of the 5th millennium. The case would be similar to the early ‘mixed’ Icelandic population.
Only after the emergence of the Samara culture (in the second half of the 6th millennium BC), with a sample of haplogroup R1b1a, does the connection with Early Proto-Indo-Europeans become obvious; and only after the appearance of late Sredni Stog and haplogroup R1a-M417 (ca. 4000 BC) is its connection with Uralic also clear. In previous population movements, I think more haplogroups were involved in migrations of small groups, and only some communities among them were eventually successful, expanding to be dominant, creating ever growing cultures during their expansions.
Indeed, if you think in terms of Uralic and Indo-European just as converging languages, and forget their potential genetic connection, then the genetic + linguistic picture becomes simplified, and the upper frontier of the 6th millennium BC with a division North Pontic (Mariupol) vs. Volga-Ural (Samara) is enough. However, tracing their movements backwards – with cultural expansions from west to east (with the expansion of farming), and earlier east to west (with hunter-gatherer pottery), and still earlier west to east (with the north-eastern technocomplex), offers an interesting way to prove their potential connection to macrofamilies, at least in terms of population movements.
I am quite convinced right now that it would be possible to connect the expansion of R1b-L754 subclades with a speculative Nostratic (given the R1b-V88 connection with Afroasiatic, and the obvious connection of R1b-P297 with Eurasiatic). Paradoxically, the connection of an Indo-Uralic community in the steppes (after the separation of Yukaghir) with any lineage expansion (R1a-M17, R1b-M269, or even Q, I or J1) seems somehow blurrier than one year ago, possibly just because there are too many open possibilities.
David Reich says about the admixture with Neanderthals, which he helped discover:
At the conclusion of the Neanderthal genome project, I am still amazed by the surprises we encountered. Having found the first evidence of interbreeding between Neanderthals and modern humans, I continue to have nightmares that the finding is some kind of mistake. But the data are sternly consistent: the evidence for Neanderthal interbreeding turns out to be everywhere. As we continue to do genetic work, we keep encountering more and more patterns that reflect the extraordinary impact this interbreeding has had on the genomes of people living today.
I think this is a shared feeling among many of us who have made proposals about anything, to fear that we have made a gross, evident mistake, and constantly look for flaws. However, it seems to me that geneticists are more preoccupied with being wrong in their developed statistical methods, in the theoretical models they are creating, and not so much about errors in the true ancient ethnolinguistic picture human population genetics is (at least in theory) concerned about. Their publications are, after all, constantly associating genetic finds with cultures and (whenever possible) languages, so this aspect of their research should not be taken lightly.
Seeing how David Anthony or Razib Khan (among many others) have changed their previously preferred migration models as new data was published, and they continue to be respected in their own fields, I guess we can be confident that professionals with integrity are going to accept whatever new picture appears. While I don’t think that genetic finds can change what we can reconstruct with comparative grammar, I am also ready to revise guesstimates and routes of expansion of certain dialects if R1a-Z645 is shown to have accompanied Late Proto-Indo-Europeans during their expansion with Yamna, and later integrated somehow with Corded Ware.
However, taking into account the obsession of some with an ancestral, uninterrupted R1a—Indo-European association, and the lack of actual political repercussion of Neanderthal admixture, I think the most common nightmare that all genetic researchers should be worried about is to keep inflating this “Yamnaya ancestry”-based hornet’s nest, which has been constantly stirred up for the past two years, by rejecting it – or, rather, specifying it into its true complex nature.
This succession of corrections and redefinitions, coupled with the distinct Y-DNA bottleneck of each steppe population, will eventually lead to a completely different ethnolinguistic picture of the Pontic-Caspian region during the Eneolithic, which is likely to eventually piss off not only reasonable academics stubbornly attached to the CWC-IE idea, but also a part of those interested in daydreaming about their patrilineal ancestors.
Sometimes it’s better to just rip off the band-aid once and for all…
Interesting excerpts (emphasis mine, some links to images and tables deleted for clarity):
Late Bronze Age (LBA) Srubnaya-Alakulskaya individuals carried mtDNA haplogroups associated with Europeans or West Eurasians (17) including H, J1, K1, T2, U2, U4, and U5 (table S3). In contrast, the Iron Age nomads (Cimmerians, Scythians, and Sarmatians) additionally carried mtDNA haplogroups associated with Central Asia and the Far East (A, C, D, and M). The absence of East Asian mitochondrial lineages in the more eastern and older Srubnaya-Alakulskaya population suggests that the appearance of East Asian haplogroups in the steppe populations might be associated with the Iron Age nomads, starting with the Cimmerians.
#UPDATE (5 OCT 2018): Some Y-SNP calls have been published in a Molgen thread, with:
Srubna samples have possibly two R1a-Z280, three R1a-Z93.
Cimmerians may not have R1b: cim357 is reported as R1a.
Some Scythians have low coverage to the point where it is difficult to assign even a reliable haplogroup (they report hg I2 for scy301, or E for scy197, probably based on some shared SNPs?), but those which can be reliably assigned seem R1b-Z2103 [hence probably the use of question marks and asterisks in the table, and the assumption of the paper that all Scythians are R1b-L23]:
The most recent subclade is found in scy305: R1b-Z2103>Z2106 (Z2106+, Y12538/Z8131+)
scy304: R1b-Z2103 (M12149/Y4371/Z8128+).
scy009: R1b-P312>U152>L2 (P312+, U152?, L2+)?
Sarmatians are apparently all R1a-Z93 (including tem002 and tem003);
Srubnaya-Alakulskaya individuals exhibited genetic affinity to northern and northeastern present-day Europeans, and these results were also consistent with outgroup f3 statistics.
The Cimmerian individuals, representing the time period of transition from Bronze to Iron Age, were not homogeneous regarding their genetic similarities to present-day populations according to the PCA. F3 statistics confirmed the heterogeneity of these individuals in comparison with present-day populations
The Scythians reported in this study, from the core Scythian territory in the North Pontic steppe, showed high intragroup diversity. In the PCA, they are positioned as four visually distinct groups compared to the gradient of present-day populations:
A group of three individuals (scy009, scy010, and scy303) showed genetic affinity to north European populations (…).
A group of four individuals (scy192, scy197, scy300, and scy305) showed genetic similarities to southern European populations (…).
A group of three individuals (scy006, scy011, and scy193) located between the genetic variation of Mordovians and populations of the North Caucasus (…). In addition, one Srubnaya-Alakulskaya individual (kzb004), the most recent Cimmerian (cim357), and all Sarmatians fell within this cluster. In contrast to the Scythians, and despite being from opposite ends of the Pontic-Caspian steppe, the five Sarmatians grouped close together in this cluster.
A group of three Scythians (scy301, scy304, and scy311) formed a discrete group between the SC and SE and had genetic affinities to present-day Bulgarian, Greek, Croatian, and Turkish populations (…).
Finally, one individual from a Scythian cultural context (scy332) is positioned outside of the modern West Eurasian genetic variation (Fig. 1C) but shared genetic drift with East Asian populations.
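For readers unfamiliar with the outgroup f3 statistics mentioned in these excerpts, here is a minimal sketch of the idea in Python. It is not the paper's pipeline: the function name and the allele frequencies are made up for illustration, and the sketch omits the sample-size bias correction and the block-jackknife standard errors that dedicated tools such as ADMIXTOOLS apply. It only shows the core quantity, the average of (o - a) * (o - b) across SNPs, which grows with the genetic drift that populations A and B share after splitting from the outgroup O.

```python
def outgroup_f3(freq_outgroup, freq_a, freq_b):
    """Naive outgroup f3(O; A, B): mean of (o - a) * (o - b) over SNPs.

    Inputs are per-SNP allele frequencies (values between 0 and 1) for the
    outgroup O and the two populations A and B. No bias correction is applied.
    """
    if not (len(freq_outgroup) == len(freq_a) == len(freq_b)):
        raise ValueError("all frequency lists must cover the same SNPs")
    if len(freq_outgroup) == 0:
        raise ValueError("at least one SNP is required")
    products = [
        (o - a) * (o - b)
        for o, a, b in zip(freq_outgroup, freq_a, freq_b)
    ]
    return sum(products) / len(products)


# Hypothetical allele frequencies at four SNPs (illustration only).
outgroup = [0.0, 0.0, 0.0, 0.0]
ancient = [0.8, 0.6, 0.2, 0.4]   # the ancient individual or group being tested
modern_x = [0.8, 0.6, 0.2, 0.4]  # a present-day population similar to it
modern_y = [0.1, 0.1, 0.9, 0.9]  # a present-day population less similar to it

f3_x = outgroup_f3(outgroup, ancient, modern_x)
f3_y = outgroup_f3(outgroup, ancient, modern_y)
print(f3_x, f3_y)
```

With these toy numbers the value for modern_x is higher than for modern_y, which is the sense in which an ancient sample is said to share more drift with one present-day population than with another.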
The presence of an SA component (as well as finding of metals imported from Tien Shan Mountains in Muradym 8) could therefore reflect a connection to the complex networks of the nomadic transmigration patterns characteristic of seasonal steppe population movements. These movements, although dictated by the needs of the nomads and their animals, shaped the economic and social networks linking the outskirts of the steppe and facilitated the flow of goods between settled, semi-nomadic, and nomadic peoples. In contrast, all Cimmerians carried the Siberian genetic component. Both the PCA and f4 statistics supported their closer affinities to the Bronze Age western Siberian populations (including Karasuk) than to Srubnaya. It is noteworthy that the oldest of the Cimmerians studied here (cim357) carried almost equal proportions of Asian and West Eurasian components, resembling the Pazyryks, Aldy-Bel, and Iron Age individuals from Russia and Kazakhstan (12). The second oldest Cimmerian (cim358) was also the only one with both uniparental markers pointing toward East Asia. The Q1* Y chromosome sublineage of Q-M242 is widespread among Asians and Native Americans and is thought to have originated in the Altai Mountains (24).
In contrast to the eastern steppe Scythians (Pazyryks and Aldy-Bel) that were closely related to Yamnaya, the western North Pontic Scythians were instead more closely related to individuals from Afanasievo and Andronovo groups. Some of the Scythians of the western Pontic-Caspian steppe lacked the SA and the East Eurasian components altogether and instead were more similar to a Montenegro Iron Age individual (3), possibly indicating assimilation of the earlier local groups by the Scythians.
Toward the end of the Scythian period (fourth century CE), a possible direct influx from the southern Ural steppe zone took place, as indicated by scy332. However, it is possible that this individual might have originated in a different nomadic group despite being found in a Scythian cultural context.
I am surprised to find this new R1b-L23-based bottleneck in Eastern Iranian expansions so late, but admittedly, based on data from later times in the Pontic-Caspian steppe near the Caucasus, it was always a possibility. The fact that pockets of R1b-L23 lineages remained somehow ‘hidden’ in early Indo-Iranian communities has been clear since Narasimhan et al. (2018), as I predicted could happen, and is compatible with the limited archaeological data on Sintashta-Potapovka populations outside fortified settlements. I already said that Corded Ware was out of Indo-European migrations, and this further supports it.
Even with all these data coming just from a north-west Pontic steppe region (west of the Dnieper), these ‘Cimmerians’ (or rather the ‘Proto-Scythian’ nomadic cultures appearing before ca. 800 BC in the Pontic-Caspian steppes) are shown to be probably formed by diverse peoples from Central Asia who brought about the first waves of Siberian ancestry (and Asian lineages) seen in the western steppes. You can read about a Cimmerian-related culture, Ananino, key for the evolution of Finno-Permic peoples.
Also interesting about the Y-DNA bottleneck seen here is the rejection of the supposed continuous western expansions of R1a-Z645 subclades with steppe tribes since the Bronze Age, and thus a clearer link of the Hungarian Árpád dynasty (of R1a-Z2123 lineage) to either the early Srubna-related expansions or, much more likely, to the actual expansions of Hungarian tribes near the Urals in historic times.
NOTE. I will add the information of this paper to the upcoming post on Ugric and Samoyedic expansions, and the late introduction of Siberian ancestry to these peoples.
A few interesting lessons to be learned:
Remember the fantasy story about that supposed steppe nomadic pastoralist society sharing different Y-DNA lineages? You know, that Yamna culture expanding with R1b from Khvalynsk-Repin into the whole Pontic-Caspian steppes and beyond, developing R1b-dominated Afanasevo, Bell Beaker, and Poltavka, but suddenly appearing (in the middle of those expansions through the steppes) as a different culture, Corded Ware, to the north (in the east-central European forest zone) and dominated by R1a? Well, it hasn’t happened with any other steppe migration, so…maybe Proto-Indo-Europeans were that kind of especially friendly language-teaching neighbours?
Remember that ‘pure-R1a’ Indo-Slavonic society emerged from Sintashta ca. 2100 BC? (or even Graeco-Aryan??) Hmmmm… Another good fantasy story that didn’t happen; just like a central-east European Bronze Age Balto-Slavic R1a continuity didn’t happen, either. So, given that cultures from around Estonia are those showing the closest thing to R1a continuity in Europe until the Iron Age, I assume we have to get ready for the Gulf of Finland Balto-Slavic soon.
Remember that ‘pure-R1a’ expansion of Indo-Europeans based on the Tarim Basin samples? This paper means ipso facto an end to the Tarim Basin – Tocharian artificial controversy. The Pre-Tocharian expansion is represented by Afanasevo, and whether or not (Andronovo-related) groups of R1a-Z645 lineages replaced part or eventually all of its population before, during, or after the Tocharian expansion into the Tarim Basin, this does not change the origin of the language split and expansion from Yamna to Central Asia; just like this paper does not change the fact that these steppe groups were Proto-Iranian (Srubna) and Eastern Iranian (Scythian) speakers, regardless of their dominant haplogroup.
Do you smell that fresher air? It’s the Central and East European post-Communist populist and ethnonationalist bullshit (viz. pure blond R1a-based Pan-Nordicism / pro-Russian Pan-Slavism / Pan-Eurasianism, as well as Pan-Turanism and similar crap from the 19th century) going down the toilet with each new paper.
#EDIT (5 OCT 2018): It seems I was too quick to rant about the consequences of the paper without taking into account the complexity of the data presented. This is not the first time this impulsivity has struck; I guess it depends on my mood and on the time I have to write a post on the specific work day…
While the data on Srubna, Cimmerians, and Sarmatians shows clearer Y-DNA bottlenecks (of R1a-Z645 subclades) with the new data, the Scythian samples remain controversial, because of the many doubts about the haplogroups (although the most certain cases are R1b-Z2103), their actual date, and cultural attribution. However, I doubt they belong to other peoples, given the expansionist trends of steppe nomads before, during, and after Scythians (as shown in statistical analyses), so most likely they are Scythian or ‘Para-Scythian’ nomadic groups that probably came from the east, whether or not they incorporated Balkan populations. This is further supported by the remaining R1b-P312 and R1b-Z2103 populations in and around the modern Eurasian steppe region.
You can find an interesting and detailed take on the data published (in Russian) at Vol-Vlad’s LiveJournal (you can read an automatic translation from Google). I think that post is maybe too detailed in debunking all information associated with the supposed Scythians (to the point where just a single sample seems to be an actual Scythian?!), but it is nevertheless interesting to read about the potential pitfalls of the study.