Updated phylogenetic tree of haplogroup Q-M242 points to Palaeolithic expansions


New paper (behind paywall) Paternal origin of Paleo-Indians in Siberia: insights from Y-chromosome sequences by Wei et al., Eur. J. Hum. Genet. (2018)

Interesting excerpts (for Eurasian migrations):

Differentiation and diffusion in Palaeolithic Siberia

Based on the phylogenetic analyses and the current distributions of relative sub-lineages, we propose that the prehistoric population differentiation in Siberia after the LGM (post-LGM) provided the genetic basis for the emergence of the Paleo-Indian, American aborigine, population. According to the phylogenetic tree of Y-chromosome haplogroup C2-M217 (Fig. 2 and Figure S1), eight sub-lineages emerged in a short period between 15.3 kya and 14.3 kya (Table S5). Within these sub-lineages, haplogroups C2-M48, C2-F1918, and C2- F1756 are predominant paternal lineages in modern Altaic-speaking populations [46, 51, 52]. Samples of haplogroups C2-F8535 and C2-P53.1 were found in two Turkic- and Mongolic-speaking minorities in China (Table S1). Both archeological and genetic data suggest that Altaic-speaking populations are results of population expansion in the past several thousand years in the Altai Mountain, Mongolia Plateau, and Amur River region [51–54].

By contrast, three other sub-lineages, C2-B79, C2-B77, and C2-P39, appear only in Koryaks and Native Americans [16, 35]. The latitude of the Altai Mountain, the Mongolia Plateau, and Amur River region are much lower than that of Beringia, where the ancestors of Native Americans finally separated from their close relatives in Siberia. Therefore, the phylogeographic patterns of sub-lineages of C2-M217 in this study reveal a major splitting event between populations in a lower latitude region of Siberia and ancestors of Koryaks and Native Americans during the post-LGM period.

The sub-lineages of the Y-chromosome Q-M242 haplogroup were found in populations throughout the Eurasia continent. According to available data, the Q1-L804 lineage is exclusively found in Northwest Europe, while Q1-M120 is primarily restricted to East Asia [48]. Additionally, the lineage Q1-L330 is the predominant paternal lineage in Altai, Tuva, and Kets in South Siberia [34–36, 55]. A number of Q1-M242 samples have also been found in ancient remains from South Siberia and adjacent regions [56, 57]. Other sub-lineages of Q-M242 are scattered widely in different geographic regions of Eurasia, including Q1-L275, Q1-M25, and Q1-Y2659 [14, 35, 37, 58]. Additionally, the Y-chromosome of a 6000–5100 BCE sample (I4550) from Zvejnieki, Latvia has been identified as Q1-L56 [59]. These findings suggest that the sub-lineages of Q-M242 started to diffuse throughout Eurasia in a very ancient period.

Founding paternal lineages of American aborigines and their most closely related lineages among Eurasia populations

Emergence of Paleo-Indian populations

The revised phylogenetic tree of Y-chromosome haplogroup Q-M242 in this study provides clues regarding the origin of Native American lineages Q1-M3 and Q1-Z780 (Fig. 3). According to our estimates, haplogroup Q1-L54 expanded rapidly between 17.2 kya and 15.0 kya and finally gave rise to two major founding paternal lineages of Native American populations, known as Q1-Z780 and Q1-M3. Ancient DNA studies indicate that the early population in South Siberia, represented by MA1 genomes, had a genetic influence on both modern western European and Native American populations [7]. Therefore, we conclude that the accumulated diversity of sub-lineages of Q-M242 before 15.3 kya resulted from the in situ differentiation of Q-M242 in Central Eurasia and South Siberia since the Paleolithic Age, and the appearance of the Paleo-Indian population is part of the great human diffusion throughout the Eurasia after the Last Glacial Maximum.

The Southern Caucasus PIE homeland

Image modified from Wang et al. (2018). Samples projected in PCA of 84 modern-day West Eurasian populations (open symbols). Previously known clusters have been marked and referenced. An EHG and a Caucasus ‘clouds’ have been drawn, leaving Pontic-Caspian steppe and derived groups between them.See the original file here.

The origin of Q-M242 in Zvejnieki, like those of Lola (Q1a2-M25) and Steppe Maykop (Q1a2-M25) from Wang et al. (2018) are therefore most likely migrations throughout North Eurasia dated to the Palaeolithic.

As you might remember, the sample of haplogroup Q1a from Khvalynsk was the closest one (in the PCA, see above) to those we now know most likely represent one or more groups of the steppe north of the Caucasus, which were absorbed during the formation and expansion of Khvalynsk.

NOTE. In fact, the position of this early Khvalynsk sample in the PCA is near the Steppe Eneolithic cluster, in turn near ANE (with the Lola sample Q1a2-M25, circle in dark blue/violet above), and Steppe Maykop (which includes the other Q1a2-M25 sample).

It is often assumed that these populations absorbed in the Pontic-Caspian steppe were dominated by haplogroup J, due to the oldest representatives of CHG ancestry (Kotias Klde and Satsurblia).

However, it would not be surprising now to find out that (one or more of) these “CHG/ANE-rich” groups from the steppe (possibly the Kairshak culture in the North Caspian region) were in fact dominated by Q1-M25 subclades.

If this is the case, I don’t know where the proponents of the (south of the) Caucasus homeland will retreat to.


Origins of equine dentistry in Mongolia in the early first millennium BC

New paper (behind paywall) Origins of equine dentistry, by Taylor et al. PNAS (2018).

Interesting excerpts (emphasis mine):

The practice of horse dentistry by contemporary nomadic peoples in Mongolia, coupled with the centrality of horse transport to Mongolian life, both now and in antiquity, raises the possibility that dental care played an important role in the development of nomadic life and domestic horse use in the past. To investigate, we conducted a detailed archaeozoological study of horse remains from tombs and ritual horse inhumations across the Mongolian Steppe, assessing evidence for anthropogenic dental modifications and comparing our findings with broader patterns in horse use and nomadic material culture.

We conducted a detailed study of archaeological horse collections spanning the past 3,200 y, including those from the Late Bronze Age DSK complex (ca. 1200–700 BCE, n = 70), Early Iron Age Slab Burial culture (ca. 700–300 BCE, n = 4), Pazyryk culture (ca. 600–200 BCE, n = 2), Late Iron Age Xiongnu Empire (ca. 200 BCE–200 CE, n = 3), Early Middle Ages post-Xiongnu period (ca. 100–550 CE, n = 3), and Turkic Khaganate (ca. 600–800 CE, n = 3).

A (top): Contemporary Mongolian herder engaged in horseback riding, using left-handed rein position causing asymmetric pressures to the horse’s skull. Photo by Orsoo Bayarsaikhan. B(center) contemporary Mongolian horse skulls, showing asymmetric and skewed thinning to the nasal bones caused by bridle pressure. C(bottom) Asymmetric deformation to the cranial bones of a Deer Stone-Khirigsuur horse (left), alongside an early Middle Ages horse with a similar feature (right). Modified from Taylor and Tuvshinjargal (2018).


This Late Bronze Age dental modification counts among the earliest documented instances of equine veterinary care, and the oldest known evidence for horse dentistry. At first glance, the detailed historical record of early equine veterinary care in places such as China, Greece, Rome, and Syria, which spans the late second millennium BCE through the early centuries CE (11, 15, 16), might imply that equine dentistry emerged in the sedentary civilizations of the Old World. However, the earliest textual references describe only nonsurgical medicinal treatments and make few mentions of oral health (11). Recent archaeological discoveries suggest that human care of domestic animals was practiced by hunter-gatherers as far back as the Paleolithic (46), and that pastoralists may have occasionally practiced surgical procedures on domestic animals as early as the Neolithic in Europe (47). The evidence presented here indicates that horse dentistry was developed by nomadic pastoralists living on the steppes of Mongolia and northeast Asia during the Late Bronze Age, concurrent with the local adoption of the metal bit and many centuries before the first mention of dental practices in historical accounts from sedentary Old World civilizations.

Our results reveal a fundamental link between equine dentistry and the emergence of horsemanship in the steppes of Eurasia. At the turn of the first millennium BCE, militarized, horse-mounted peoples reshaped the social and economic landscape of many areas of the Eurasian continent. Conflagrations with equestrian peoples, such as those between the Persian Empire and the Pontic “Scythians,” plagued alluvial civilizations from the Near East to India and China, while large-scale movements of people linked East and West in never-before-seen ways (48). The archaeological and historical records indicate that the earliest horseback riding was accomplished without stirrups or saddles, and probably using only bitless or organic-mouthpiece bridles (49, 50). The bronze snaffle bit, and the improved control it provided, was a key technological development that enabled the use of horseback riding for more stressful and difficult activities, such as long-distance transportation and warfare (32). We argue that these technological improvements in horse control were preceded and sustained by innovations in veterinary dentistry by nomadic peoples living in the continental interior. By increasing herd survival and mitigating behavioral and health issues caused by horse equipment, innovations in equine dentistry improved the reliability of horseback riding for ancient nomads, enabling horses to be used for nonpastoral activities like warfare, high-speed riding, and distance travel.

Damage to the retained wolf tooth in a 4-5 year old mummified horse, dating to the 2-4th centuries CE from the site of Urd Ulaan-Uneet in western Mongolia


Archaeozoological data from Mongolian horses indicate that the nomadic practice of equine dentistry dates back more than 3,000 y to the DSK complex, a Late Bronze Age culture associated with the first mounted horseback riding and mobile pastoralism in eastern Eurasia. Attempted removal of deciduous incisors through sawing of the exterior suggests experimentation with dental extraction, but not the removal of wolf teeth. The appearance of extracted first premolars in the first millennium BCE coincides with the arrival of metal bits in the archaeological record and oral trauma linked with metal bit use, suggesting that innovations in dental practice were an adaptation to the mechanical changes in horse equipment. These bronze and metal bits provided greater control over the horse, facilitating the development of military uses for the horse, but also introduced new dental problems with the first premolar. Our results indicate that, coincident with the earliest evidence for metal bit use, wolf tooth extraction was practiced in Mongolia by ca. 750 BCE and continued through the early Middle Ages. These results push back the earliest dates for equine dentistry by more than a millennium and suggest that nomadic peoples developed key innovations in veterinary care that enabled more sophisticated horse control, ultimately changing the structure of communication, exchange, and military power in ancient Eurasia.


Genetic history of admixture across inner Eurasia; Botai shows R1b-M73


Open access Characterizing the genetic history of admixture across inner Eurasia, by Jeong et al. (2018).

Abstract (emphasis mine):

The indigenous populations of inner Eurasia, a huge geographic region covering the central Eurasian steppe and the northern Eurasian taiga and tundra, harbor tremendous diversity in their genes, cultures and languages. In this study, we report novel genome-wide data for 763 individuals from Armenia, Georgia, Kazakhstan, Moldova, Mongolia, Russia, Tajikistan, Ukraine, and Uzbekistan. We furthermore report genome-wide data of two Eneolithic individuals (~5,400 years before present) associated with the Botai culture in northern Kazakhstan. We find that inner Eurasian populations are structured into three distinct admixture clines stretching between various western and eastern Eurasian ancestries. This genetic separation is well mirrored by geography. The ancient Botai genomes suggest yet another layer of admixture in inner Eurasia that involves Mesolithic hunter-gatherers in Europe, the Upper Paleolithic southern Siberians and East Asians. Admixture modeling of ancient and modern populations suggests an overwriting of this ancient structure in the Altai-Sayan region by migrations of western steppe herders, but partial retaining of this ancient North Eurasian-related cline further to the North. Finally, the genetic structure of Caucasus populations highlights a role of the Caucasus Mountains as a barrier to gene flow and suggests a post-Neolithic gene flow into North Caucasus populations from the steppe.

Interesting excerpts:

On North Eurasians

In a PCA of Eurasian individuals, we find that PC1 separates eastern and western Eurasian populations, PC2 splits eastern Eurasians along a north-south cline, and PC3 captures variation in western Eurasians with Caucasus and northeastern European populations at opposite ends (Figure 2A and Figures S1-S2). Inner Eurasians are scattered across PC1 in between, largely reflecting their geographic locations. Strikingly, inner Eurasian populations seem to be structured into three distinct west-east genetic clines running between different western and eastern Eurasian groups, instead of being evenly spaced in PC space. Individuals from northern Eurasia, speaking Uralic or Yeniseian languages, form a cline connecting northeast Europeans and the Uralic (Samoyedic) speaking Nganasans from northern Siberia (“forest-tundra” cline). Individuals from the Eurasian steppe, mostly speaking Turkic and Mongolic languages, are scattered along two clines below the forest-tundra cline. Both clines run into Turkic- and Mongolic-speaking populations in southern Siberia and Mongolia, and further into Tungusic-speaking populations in Manchuria and the Russian Far East in the East; however, they diverge in the west, oneheading to the Caucasus and the other heading to populations of the Volga-308 Ural area (the “southern steppe” and “steppe-forest” clines, respectively; Figure 2 and Figure S2).
The forest-tundra cline populations derive most of their eastern Eurasian ancestry from a component most enriched in Nganasans, while those on the steppe-forest and southern steppe clines have this component together with another component most enriched in populations from the Russian Far East, such as Ulchi and Nivkh. The southern steppe cline groups are distinct from the others in their western Eurasian ancestry profile, in the sense that they have a high proportion of a component most enriched in Mesolithic Caucasus hunter-gatherers (“CHG”) and Neolithic Iranians (“Iran_N”) and frequently harbor another component enriched in South Asians (Figure S4).

qpAdm-based admixture models for the forest-tundra cline populations. For populations to the east of the Urals (Enets, Selkups, Kets, and Mansi), EHG+Yamnaya+Nganasan provides a good fit, except for Mansi, for which adding WHG significantly increases the model fit. For the rest of the groups, WHG+LBK_EN+Yamnaya+Nganasan in general provides a good fit. 5 cM jackknifing standard errors are marked by the horizontal bar.

For the forest-tundra cline populations, for which currently no relevant Holocene ancient genomes are available, we took a more generalized approach of using proxies for contemporary Europeans: WHG, WSH (represented by “Yamnaya_Samara”), and early Neolithic European farmers (EEF; represented by “LBK_EN”; Table S2). Adding Nganasans as the fourth reference, we find that most Uralic-speaking populations in Europe (i.e. west of the Urals) and Russians are well modeled by this four-way admixture model (χ 2 p ≥ 0.05 for all but three groups; Figure 5 and Table S8). Nganasan-related ancestry substantially contributes to their gene pools and cannot be removed from the model without a significant decrease in model fit (4.7% to 29.1% contribution; χ 2 p ≤ 1.12×10-8; Table S8). The ratio of contributions from three European references varies from group to group, probably reflecting genetic exchange with neighboring non-Uralic groups. For example, Saami from northern Fennoscandia contain a higher WHG and lower WSH contribution (16.1% and 41.3%, respectively) than Udmurts or Besermyans from the Volga river region do (4.9-6.6% and 50.7-53.2%, respectively), while the three groups have similar amounts of Nganasan-related ancestry (25.5-29.1%).

The Caucasus Mountains form a barrier to gene flow

By applying EEMS to the Caucasus region, we identify a strong barrier to gene flow separating North and South Caucasus populations. This genetic barrier coincides with the Greater Caucasus mountain ridge even to small scale: a weaker barrier in the middle, overlapping with Ossetia, matches well with the region where the ridge also becomes narrow. We also observe weak barriers running in the north-south direction that separate northeastern populations from northwestern ones. Together with PCA, EEMS results suggest that the Caucasus Mountains have posed a strong barrier to human migration.

The Greater Caucasus mountain ridge as a barrier to 856 genetic exchange. Barriers (brown) and conduits (green) of gene flow around the Caucasus region are estimated by the EEMS program. Red diamonds show the location of vertices to which groups are assigned. A strong barrier to gene flow overlaps with the Greater Caucasus mountain ridge reflecting the genetic differentiation between populations of the north and south of the Caucasus. The barrier becomes considerably weaker in the middle where present-day Ossetians live.

On the Botai individuals

The Y-chromosome of the male Botai individual (TU45) belongs to the haplogroup R1b (Table 411 S6). However, it falls into neither a predominant European branch R1b-L5165 nor into a R1b-GG400 branch found in Yamnaya individuals. Thus, phylogenetically this Botai individual should belong to the R1b-M73 branch which is frequent in the Eurasian steppe (Figure S9). This branch was also found in Mesolithic samples from Latvia as well as in numerous modern southern Siberian and Central Asian groups.

The Botai genomes provide a critical snapshot of the genetic profile of pre-Bronze Age steppe populations. Our admixture modeling positions Botai primarily on an ancient genetic cline of the pre-Neolithic western Eurasian hunter-gatherers: stretching from the post-Ice Age western European hunter-gatherers (e.g. WHG) to EHG in Karelia and Samara to the Upper Paleolithic southern Siberians (e.g. AG3). Botai’s position on this cline, between EHG and AG3, fits well with their geographic location and suggests that ANE-related ancestry in the East did have a lingering genetic impact on Holocene Siberian and Central Asian populations at least till the time of Botai.
The most recent clear connection with the Botai ancestry can be found in the Middle Bronze Age Okunevo individuals (Figure S6C). In contrast, additional EHG-related ancestry is required to explain the forest-tundra populations to the east of the Urals (Figure 5 and Table S8). Their multi-way mixture model may in fact portrait a prehistoric two-way mixture of a WSH population and a hypothetical eastern Eurasian one that has an ANE-related contribution higher than that in Nganasans. Botai and Okunevo individuals prove the existence of such ANE ancestry-rich populations. Pre-Bronze Age genomes from Siberia will be critical for testing this hypothesis.

The first two PCs summarizing the genetic structure within 2,077 Eurasian individuals. The two PCs generally mirror geography. PC1 separates western and eastern Eurasian populations, with many inner Eurasians in the middle. PC2 separates eastern Eurasians along the north-south cline and also separates Europeans from West Asians. Ancient individuals (color-filled shapes), including two Botai individuals, are projected onto PCs calculated from present-day individuals.

So, to sum up:

  • Northern Eurasia forms a Uralic – Yeniseian cline from east to west, with contribution from Steppe, WHG, and Siberian ancestry. Siberian ancestry is represented by Palaeo-Siberian Nganasans, who adopted Samoyedic quite late. It was already known that the different waves of Siberian ancestry are too late and do not represent the spread of Uralic languages, so that leaves us with Steppe and WHG.
  • The Caucasus Mountains were a long-lasting prehistoric barrier to gene flow (as recently shown in Y-DNA, too).
  • The Botai sample (ca. 3632-3100 BC) represents thus the furthest east that R1b-P297 subclades had expanded (we did know that, and that they didn’t have close genetic links with Khvalynsk, so the haplogroup spread there probably much earlier). It expanded R1b-M269’s sister clade R1b-M73 (also found in the Baltic region), and the Botai are on the ‘eastern’ end of an ancient genetic cline stretching from WHG to EHG to Afontova Gora.

EDIT (23 MAY 2018) Both samples share mtDNA, and the male one shares Y-DNA, with those reported in Damgaard et al. (Nature 2018); although dates are slightly different (3371-3354 calBC for BOT 14), it is within the range given for this one; for the female, the dates are similar (3521-3377 calBC for BOT2016, 3517-3367 cal. BCE for this one). The lack of data on their origin may point to the fact that we only have different bone samples from the same two Botai individuals. So probably still 50% R1b-M73 (with the other 50% being N2* from BOT15)…

It seems therefore not only that R1b-M269 is bound to split from the parent haplogroup in or around the steppe or forest-steppe: the Mesolithic spread of haplogroup R1b in North Eurasia is wider and its relevance thus greater than previously thought.

We may need to rethink the role of haplogroup R1a in spreading EHG and Indo-Uralic from east to west…

Featured image, from the supplementary materials: Frequency distribution map of the Y-chromosomal haplogroup R1b-P343(xM269) identified in the Eneolithic Botai individual. All modern Eurasian samples with this haplogroup tested to date for the downstream markers fall into R1b-M73 branch, suggesting Botai sample be one of its earliest representatives.


Eurasian steppe dominated by Iranian peoples, Indo-Iranian expanded from East Yamna


The expected study of Eurasian samples is out (behind paywall): 137 ancient human genomes from across the Eurasian steppes, by de Barros Damgaard et al. Nature (2018).

Dicussion (emphasis mine):

Our findings fit well with current insights from the historical linguistics of this region (Supplementary Information section 2). The steppes were probably largely Iranian-speaking in the first and second millennia bc. This is supported by the split of the Indo-Iranian linguistic branch into Iranian and Indian33, the distribution of the Iranian languages, and the preservation of Old Iranian loanwords in Tocharian34. The wide distribution of the Turkic languages from Northwest China, Mongolia and Siberia in the east to Turkey and Bulgaria in the west implies large-scale migrations out of the homeland in Mongolia since about 2,000 years ago35. The diversification within the Turkic languages suggests that several waves of migration occurred36 and, on the basis of the effect of local languages, gradual assimilation to local populations had previously been assumed37. The East Asian migration starting with the Xiongnu accords well with the hypothesis that early Turkic was the major language of Xiongnu groups38. Further migrations of East Asians westwards find a good linguistic correlate in the influence of Mongolian on Turkic and Iranian in the last millennium39. As such, the genomic history of the Eurasian steppes is the story of a gradual transition from Bronze Age pastoralists of West Eurasian ancestry towards mounted warriors of increased East Asian ancestry—a process that continued well into historical times.

This paper will need a careful reading – better in combination with Narasimhan et al. (2018), when their tables are corrected – , to assess the actual ‘Iranian’ nature of the peoples studied. Their wide and long-term dominion over the steppe could also potentially explain some early samples from Hajji Firuz with steppe ancestry.

Principal component analyses. The principal components 1 and 2 were plotted for the ancient data analysed with the present-day data (no projection bias) using 502 individuals at 242,406 autosomal SNP positions. Dimension 1 explains 3% of the variance and represents a gradient stretching from Europe to East Asia. Dimension 2 explains 0.6% of the variance, and is a gradient mainly represented by ancient DNA starting from a ‘basal-rich’ cluster of Natufian hunter-gatherers and ending with EHGs. BA, Bronze Age; EMBA, Early-to-Middle Bronze Age; SHG, Scandinavian hunter-gatherers.

For the moment, at first sight, it seems that, in terms of Y-DNA lineages:

  • R1b-Z93 (especially Z2124 subclades) dominate the steppes in the studied periods.
  • R1b-P312 is found in Hallstatt ca. 810 BC, which is compatible with its role in the Celtic expansion.
  • R1b-U106 is found in a West Germanic chieftain in Poprad (Slovakia) ca. 400 AD, during the Migration Period, hence supporting once again the expansion of Germanic tribes especially with R1b-U106 lineages.
  • A new sample of N1c-L392 (L1025) lineage dated ca. 400 AD, now from Lithuania, points again to a quite late expansion of this lineage to the region, believed to have hosted Uralic speakers for more than 2,000 years before this.
  • A sample of haplogroup R1a-Z282 (Z92) dated ca. 1300 AD in the Golden Horde is probably not quite revealing, not even for the East Slavic expansion.
  • Also, interestingly, some R1b(xM269) lineages seem to be associated with Turkic expansions from the eastern steppe dated around 500 AD, which probably points to a wide Eurasian distribution of early R1b subclades in the Mesolithic.

NOTE. I have referenced not just the reported subclades from the paper, but also (and mainly) further Y-SNP calls studied by Open Genomes. See the spreadsheet here.

Interesting also to read in the supplementary materials the following, by Michaël Peyrot (emphasis mine):

1. Early Indo-Europeans on the steppe: Tocharians and Indo-Iranians

The Indo-European language family is spread over Eurasia and comprises such branches and languages as Greek, Latin, Germanic, Celtic, Sanskrit etc. The branches relevant for the Eurasian steppe are Indo-Aryan (= Indian) and Iranian, which together form the Indo-Iranian branch, and the extinct Tocharian branch. All Indo-European languages derive from a postulated protolanguage termed Proto-Indo-European. This language must have been spoken ca 4500–3500 BCE in the steppe of Eastern Europe21. The Tocharian languages were spoken in the Tarim Basin in present-day Northwest China, as shown by manuscripts from ca 500–1000 CE. The Indo-Aryan branch consists of Sanskrit and several languages of the Indian subcontinent, including Hindi. The Iranian branch is spread today from Kurdish in the west, through a.o. Persian and Pashto, to minority languages in western China, but was in the 2nd and 1st millennia BCE widespread also on the Eurasian steppe. Since despite their location Tocharian and Indo-Iranian show no closer relationship within Indo-European, the early Tocharians may have moved east before the Indo-Iranians. They are probably to be identified with the Afanasievo Culture of South Siberia (ca 2900 – 2500 BCE) and have possibly entered the Tarim Basin ca 2000 BCE103.

The Indo-Iranian branch is an extension of the Indo-European Yamnaya Culture (ca 3000–2400 BCE) towards the east. The rise of the Indo-Iranian language, of which no direct records exist, must be connected with the Abashevo / Sintashta Culture (ca 2100 – 1800 BCE) in the southern Urals and the subsequent rise and spread of Andronovo-related Culture (1700 – 1500 BCE). The most important linguistic evidence of the Indo-Iranian phase is formed by borrowings into Finno-Ugric languages104–106. Kuz’mina (2001) identifies the Finno-Ugrians with the Andronoid cultures in the pre-taiga zone east of the Urals107. Since some of the oldest words borrowed into Finno-Ugric are only found in Indo-Aryan, Indo-Aryan and Iranian apparently had already begun to diverge by the time of these contacts, and when both groups moved east, the Iranians followed the Indo-Aryans108. Being pushed by the expanding Iranians, the Indo-Aryans then moved south, one group surfacing in equestrian terminology of the Anatolian Mitanni kingdom, and the main group entering the Indian subcontinent from the northwest.

Summary map. Depictions of the five main migratory events associated with the genomic history of the steppe pastoralists from 3000 bc to the present. a, Depiction of Early Bronze Age migrations related to the expansion of Yamnaya and Afanasievo culture. b, Depiction of Late Bronze Age migrations related to the Sintashta and Andronovo horizons. c, Depiction of Iron Age migrations and sources of admixture. d, Depiction of Hun-period migrations and sources of admixture. e, Depiction of Medieval migrations across the steppes.

2. Andronovo Culture: Early Steppe Iranian

Initially, the Andronovo Culture may have encompassed speakers of Iranian as well as Indo-Aryan, but its large expansion over the Eurasian steppe is most probably to be interpreted as the spread of Iranians. Unfortunately, there is no direct linguistic evidence to prove to what extent the steppe was indeed Iranian speaking in the 2nd millennium BCE. An important piece of indirect evidence is formed by an archaic stratum of Iranian loanwords in Tocharian34,109. Since Tocharian was spoken beyond the eastern end of the steppe, this suggests that speakers of Iranian spread at least that far. In the west of the Tarim Basin the Iranian languages Khotanese and Tumshuqese were spoken. However, the Tocharian B word etswe ‘mule’, borrowed from Iranian *atswa- ‘horse’, cannot derive from these languages, since Khotanese has aśśa- ‘horse’ with śś instead of tsw. The archaic Iranian stratum in Tocharian is therefore rather to be connected with the presence of Andronovo people to the north and possibly to the east of the Tarim Basin from the middle of the 2nd millennium BCE onwards110.

Since Kristiansen and Allentoft sign the paper (and Peyrot is a colleague of Kroonen), it seems that they needed to expressly respond to the growing criticism about their recent Indo-European – Corded Ware Theory. That’s nice.

They are obviously trying to reject the Corded Ware – Uralic links that are on the rise lately among Uralicists, now that Comb Ware is not a suitable candidate for the expansion of the language family.

IECWT-proponents are apparently not prepared to let it go quietly, and instead of challenging the traditional Neolithic Uralic homeland in Eastern Europe with a recent paper on the subject, they selected an older one which partially fit, from Kuz’mina (2001), now shifting the Uralic homeland to the east of the Urals (when Kuz’mina asserts it was south of the Urals).

Different authors comment later in this same paper about East Uralic languages spreading quite late, so even their text is not consistent among collaborating authors.

Also interesting is the need to resort to the questionable argument of early Indo-Aryan loans – which may have evidently been Indo-Iranian instead, since there is no way to prove a difference between both stages in early Uralic borrowings from ca. 4,500-3,500 years ago…

EDIT (10/5/2018) The linguistic supplement of the Science paper deals with different Proto-Indo-Iranian stages in Uralic loans, so on the linguistic side at least this influence is clear to all involved.

A rejection of such proposals of a late, eastern homeland can be found in many recent writings of Finnic scholars; see e.g. my references to Parpola (2017), Kallio (2017), or Nordqvist (2018).

NOTE. I don’t mind repeating it again: Uralic is one possibility (the most likely one) for the substrate language that Corded Ware migrants spread, but it could have been e.g. another Middle PIE dialect, similar to Proto-Anatolian (after the expansion of Suvorovo-Novodanilovka chiefs). I expressly stated this in the Corded Ware substrate hypothesis, since the first edition. What was clear since 2015, and should be clear to anyone now, is that Corded Ware did not spread Late PIE languages to Europe, and that some east CWC groups only spread languages to Asia after admixing with East Yamna. If they did not spread Uralic, then it was a language or group of languages phonetically similar, which has not survived to this day.

Their description of Yamna migrations is already outdated after Olalde et al. & Mathieson et al. (2018), and Narasimhan et al. (2018), so they will need to update their model (yet again) for future papers. As I said before, Anthony seems to be one step behind the current genetic data, and the IECWT seems to be one step behind Anthony in their interpretations.

At least we won’t have the Yamna -> Corded Ware -> BBC nonsense anymore, and they expressly stated that LPIE is to be associated with Yamna, and in particular the “Indo-Iranian branch is an extension of the Indo-European Yamnaya Culture (ca 3000–2400 BCE) to the East” (which will evidently show an East Yamna / Poltavka society of R1b-L23 subclades), so that earlier Eneolithic cultures have to be excluded, and Balto-Slavic identification with East Europe is also out of the way.


Ancient nomadic tribes of the Mongolian steppe dominated by a single paternal lineage

The genome of an ancient Rouran individual reveals an important paternal lineage in the Donghu population, by Li et al. Am J Phys Anthropol (2018), 1–11.

Abstract (emphasis mine):

Following the Xiongnu and Xianbei, the Rouran Khaganate (Rouran) was the third great nomadic tribe on the Mongolian Steppe. However, few human remains from this tribe are available for archaeologists and geneticists to study, as traces of the tombs of these nomadic people have rarely been found. In 2014, the IA‐M1 remains (TL1) at the Khermen Tal site from the Rouran period were found by a Sino‐Mongolian joint archaeological team in Mongolia, providing precious material for research into the genetic imprint of the Rouran.

Materials and methods
The mtDNA hypervariable sequence I (HVS‐I) and Y‐chromosome SNPs were analyzed, and capture of the paternal non‐recombining region of the Y chromosome (NRY) and whole‐genome shotgun sequencing of TL1 were performed. The materials from three sites representing the three ancient nationalities (Donghu, Xianbei, and Shiwei) were selected for comparison with the TL1 individual.

The mitochondrial haplotype of the TL1 individual was D4b1a2a1. The Y‐chromosome haplotype was C2b1a1b/F3830 (ISOGG 2015), which was the same as that of the other two ancient male nomadic samples (ZHS5 and GG3) related to the Xianbei and Shiwei, which were also detected as F3889; this haplotype was reported to be downstream of F3830 by Wei et al. (2017).

We conclude that F3889 downstream of F3830 is an important paternal lineage of the ancient Donghu nomads. The Donghu‐Xianbei branch is expected to have made an important paternal genetic contribution to Rouran. This component of gene flow ultimately entered the gene pool of modern Mongolic‐ and Manchu‐speaking populations.

The ancient males (TL1, ZHS5, and GG3) was grouped under C2b1a1b1/F3880 on the Y-DNA haplogroup C lineage using BEAST


These results suggested that TL1 likely presents a close paternal relationship to the Donghu people and may have even descended from a branch of the ancient Donghu-Xianbei people, based on the conclusion that haplogroup C2b1a/F3918 can be considered the paternal branch of the ancient Donghu people (Zhang et al., 2018). The Y-chromosome phylogenetic tree showed that TL1 shared a branch with modern Mongolian-Buryats, Hezhen, Xibo, Yugur, and Kazakh, suggesting that the TL1 individual from the Rouran period should also generally present close paternal genetic relationships with modern Mongolic- and Manchu-speaking peoples.

In general, the Rouran Khaganate originated from an alliance of the ancient Eurasian steppe nomads, which disintegrated and disappeared with the progress of history. This group was complex, and its origin cannot be explained based only on one individual. However, we can trace the genetic imprint of the Rouran people through genome analysis of the TL1 individual. On the basis of the comparison with other ancient nomadic people (Donghu, Xianbei, and Shiwei) and data on modern individuals from published articles (Lippold et al., 2014; Wei et al., 2017) (Supporting Information S5), we found that they all share the same haplotype implying shared paternal ancestry between the Donghu, Xianbei and Rouran populations. Furthermore, this gene flow (mainly haplogroup C2b1a/F3918) did not stop with the disappearance of the Rouran, and a portion was instead passed on in other groups, such as the ancient Shiwei people (later than Rouran), eventually reaching the gene pool of modern Mongolic- and Manchu-speaking populations (Mongolian-Buryats, Hezhen, Xibo, et al).

Interesting to see now confirmed with ancient DNA the proposal of a C3*-DYS448del cluster as the paternal lineage defining ancient Mongolian tribes, a theory based on ancient and modern samples – since it is found in low frequency in almost all Mongolic- and Turkic-speaking populations.

This is yet another proof of how prehistoric ethnolinguistic expansions are usually accompanied by haplogroup expansion and reduction in variability.

I wonder what other ancient chiefdom-type steppe-based nomadic groups were also dominated by a single paternal lineage