Proto-Tocharians: From Afanasievo to the Tarim Basin through the Tian Shan


A reader commented recently that there is little information about Indo-Europeans from Central and East Asia in this blog. Regardless of the scarce archaeological data compared to European prehistory, I think it is premature to write anything detailed about population movements of Indo-Iranians in Asia, especially now that we are awaiting the updates of Narasimhan et al (2018).

Furthermore, there was little hope that Tocharians would be different than neighbouring Andronovo-like populations (see a recent post on my predicted varied admixture of Common Tocharians), so the history of both unrelated Late PIE languages would have had to be explained by the admixture of Afanasievo-related groups with peoples of Andronovo descent and their acculturation.

However, data reported recently by Ning, Wang et al. Current Biology (2019) confirmed that peoples of mainly Afanasievo ancestry – as opposed to those of Corded Ware-related ancestry expanding with the Srubna-Andronovo horizon – spread the Tocharian branch of Proto-Indo-European from the Altai into the Tian Shan area, surviving essentially unadmixed into the Early Iron Age.

This genetic continuity of Tocharians will no doubt help us disentangle a great part the ethnolinguistic history of speakers of the Tocharian branch of Proto-Indo-European, from Pre-Proto-Tocharians of Afanasievo to Common Tocharians of the Late Bronze Age/Iron Age eastern Tian Shan.

NOTE. Tocharian’s isolation from the rest of Late PIE dialects and its early and intense language contacts have always been the key to support an early migration and physical separation of the group, hence the traditional association with Afanasievo, a late Repin/early Yamna offshoot. Even with the current incomplete archaeological and genetic picture, there is no other option left for the expansion of Tocharian.

It is not possible to use the currently available ancestry data to map the evolution of Afanasievo ancestry, lacking a proper geographical and temporal transect of Central and East Asian groups. In spite of this, Ning, Wang, et al. (2019) is a huge leap forward, discarding some archaeological models, and leaving only a few potential routes by which Tocharians may have spread southward from the Altai.

NOTE. I have updated the maps of prehistoric cultures accordingly, with colours – as always – reflecting the language/ancestry evolution of the different groups, even though the archaeological data of some groups of Xinjiang remains scarce, so their ethnolinguistic attribution – and the colours picked for them – remain tentative.

A rough timeline of related archaeological sites from North Eurasia. Image modified from Yang (2019).


The recent book Ancient China and its Eurasian Neighbors. Artifacts, Identity and Death in the Frontier, 3000–700 BCE, by Linduff, Sun, Cao, and Liu, Cambridge University Press (2017) offers an interesting summary of the introduction of metalworking into western China.

Here are some relevant excerpts (emphasis mine):

Although [the Xinjiang] route is not uniformly agreed upon (Shelach-Lavi 2009: 134–46), this western transmission has been thought to have passed through eastern Kazakhstan, especially as it is manifest in Semireiche, with Yamnaya, Afanasievo (copper) and Andronovo (tin bronze) peoples (Mei 2000: Fig. 3). From Xinjiang this knowledge has been thought to have traveled through the Gansu Corridor via the Qijia peoples (Bagley 1999) and then into territories controlled by dynastic China. The dating of this process is still a problem, as the sites and their contents in Xinjiang are consistently later than those in Gansu, suggesting that the point of contact was in Gansu and that the knowledge then spread from there westward.

1. Eneolithic Altai

Afanasievo expansion ca. 3300-2600 BC. See full culture and ancient DNA maps.

The Afanasievo sites, as they are identified in Mongolia, for instance, make up an Eneolithic culture analogous to that of southern Siberia (3100/2500–2000 BCE) in the Upper Yenissei Valley that is characterized by copper tools and an economy reliant on horse, sheep and cattle breeding as well as hunting. (…) The Afanasievo is best known through study of its burials, which typically include groups of round barrows (kurgans), each up to 12 m in diameter with a stone kerb and covering a central pit grave containing multiple inhumations. In their Siberian context, burial pottery types and styles have suggested contacts with the slightly earlier Kelteminar culture of the Aral and Caspian Sea area.

The Afanasievo culture monuments, located in the northern Altai and in the Minusinsk Basin (the western Sayan), have been seen as analogous evidence for cross-Eurasian exchange. These complexes contain small collections of metal, and many of the items are made of brass, although golden, silver and iron ornaments were also identified. A mere one-fourth of these objects are tools and ornaments, while the rest consist of unshaped remains and semi-manufactured objects. Its metallurgical tradition has recently been dated by Chernykh to as early as 3100 to 2700 BCE (1992),making it more compatible chronologically with the early brass-using sites in Shaanxi mentioned above. Kovalev and Erdenebaatar have excavated barrows in Bayan-Ulgii, Mongolia, that have been carbon-dated to the first half of the third millennium BCE and associated by ceramic types and styles and burial patterns with the Afanasievo (Kovalev and Erdenebaatar 2009: 357–58). These mounded kurgans were covered with stone and housed rectangular, wooden-faced tombs that included Afanasievo-type bronze awls, plates and small “leaf-shaped” knife blades (Kovalev and Erdenebaatar 2009: Figs. 6 and 7).

They also excavated sites belonging to the more recently identified Chemurchek archaeological culture, located in the foothills of the Mongolian Altai (Kovalev 2014, 2015) (Fig. 2.6). These sites are carbon-dated to the same period as the Afanasievo burials or to c. 3100/2500–1800 BCE (six barrows in Khovd aimag and four in Bayan-Ulgo aimag). In the rectangular stone kerbed Chemurchek slab burials (Ulaaanhus sum, Bayan-ul’gi aimag and so forth), bronze items included awls; and at Khovd aimag, Bulgan sum, in addition to stone sculptures, three lead and one bronze ring were excavated (Kovalev and Erdenebaatar 2009: Figs. 2 and 3; Fig. 2.6). Although we will not know if they were produced locally until much further investigation is undertaken, these discoveries do document knowledge of various uses and types of metal objects in western and south central Mongolia. The types of metal items thus far recovered are simple tools (awls) and rings (ornamental?) not unlike those associated with Andronovo archaeological cultures as well.

This is a complex circumstance where archaeological evidence is not complete, but raises very important questions about transmission of metallurgical knowledge to and from areas in present-day China. In the 1970s some Afanasievo mounds were excavated in Central Mongolia by a Soviet–Mongolian expedition led by V. V. Volkov and E. A. Novgorodova (Novgorodova 1989: 81–85). Unfortunately, these mounds did not yield metal objects, only ceramics, but they show that the Afanasievo culture with the Eneolithic metallurgical tradition of manufacturing pure copper items had already moved east at least far as central Mongolia. In 2004, Kovalev and Erdenebaatar investigated a large Afanasievo mound, Kulala ula, in the extreme northwest of Mongolia, near the Russian border (Kovalev and Erdenebaatar 2009). There they found a copper knife and awl (Fig. 2.5). There are five C14 dates on wood, coal and human bones from this mound, which belong to the period 2890–2570 BCE. This shows that the Afanasievo culture were carriers of technology and produced artifacts in the first half of the third millennium BCE and that they also moved south along the foothills of the Mongolian Altai. Afanasievo culture in Altai and the Minusinsk basin is dated by C14 to 3600–2500 BCE (Svyatko et al. 2009; Polyakov 2010). In the north of Xinjiang in the Altai district, several typical egg-shaped vessels and two censers of Afanasievo types were found. Some of these have been obtained from the stone boxes (chambers of megalithic graves of the Chemurchek culture) (Kovalev 2011). Thus, the Afanasievo tradition of pure copper metallurgy must have spread to the northern foothills of the Tienshan Mountains no later than the mid-third millennium BCE. The links with Afanasievo and local cultures adjacent to and south of the mountains into present-day China can now be assumed.

Afanasievo – Chemurchek evolution ca. 2600-2200 BC. See full culture and ancient DNA maps.

2. Bronze Age Altai

Kovalev and Erdenebaatar (2014a) and later Tishkin, Grushin, Kovalev and Munkhbayar (2015) in Western Mongolia conducted large-scale excavations of megalithic barrows of the Chemurchek culture (dated about 2600–1800 BCE). This peculiar culture appeared in Dzungaria and the Mongolian Altai in the second quarter of the third millennium BCE and for some time existed together with the late Afanasievo culture, as evidenced by the findings of Afanasievo ceramics in Chemurchek graves, in the stone boxes. Unfortunately, in China we do not yet know of any metal object related,without doubt, to the Chemurchek culture. Kovalev, Erdenebaatar, Tishkin and Grushin found several leaden ear rings and one ring of tin bronze in three excavated Chemurchek stone boxes (Kovalev and Erdenebaatar 2014a; Tishkin et al. 2015). Such lead rings are typical for Elunino culture,which occupied the entire West Altai after 2400–2300 BCE (Tishkin et al. 2015). This culture had developed a tradition of bronze metallurgy with various dopants, primarily tin. Thus, the tradition of bronze metallurgy as early as this time could have penetrated the Mongolian Altai far to the south. In addition, in the Hadat ovoo Chemurchek stone box, Kovalev and Erdenebaatar discovered stone vessels refurbished with the help of copper “patches,” indicating the presence there of metallurgical production (Fig. 2.7) (Kovalev and Erdenebaatar 2014a). In one of the secondary

Chemurchek graves unearthed by Kovalev and Erdenebaatar in Bayan-Ulgi (2400–2220 BCE), a bronze awl was found (Kovalev and Erdenebaatar 2009). Kovalev and Erdenebaatar also discovered a new culture in the territory of Mongolia (Map 2.3), one that begins immediately after Chemurchek – Munkh-Khairkhan culture (Kovalev and Erdenebaatar 2009, 2014b). To date, about 17 mounds of this culture have been excavated in Khovd, Zavkhan, Khovsgol, Bulgan aimag of Mongolia. This culture dates from about 1800 to 1500 BCE, that is, contemporary with the Andronovo culture. Therefore, the Andronovo culture does not extend far into the territory of Mongolia. Three knives without dedicated handles or stems and five awls have been found in the Munkh-Khairkhan culture mounds (Fig. 2.8). All these products are made of tin bronze. (…) Additionally, eight Late Bronze Age burials (c. 1400–1100 BCE) were unearthed in the Bulgan sum of Khovd aimag and belong to another previously unknown culture called Baitag. And in the Gobi Altai, a new group of “Tevsh” sites dating to the Late Bronze Age were defined in Bayankhongor and South Gobi aimags (Miyamoto and Obata 2016: 42–50). From these Tevsh and Baitag sites, we see the expansion of burial goods to include beads of semiprecious stones (carnelian), bronze beads, buttons and rings and even the famous elaborate golden hair ornaments (Tevsh uul;Bogd sum;Uverkhanagia aimag) from the Baitag barrows (Kovalev and Erdenebaatar 2009: Fig. 5; Miyamoto and Obata 2016).

2.1. Chemurchek

About the Chemurchek culture, from A re-analysis of the Qiemu’erqieke (Shamirshak) cemeteries, Xinjiang, China, by Jia and Betts JIES (2010) 38(4):

The major characteristics of Qiemu’erqieke Phase I include:

  1. Burials with two orientations of approximately 20° or 345°.
  2. Rectangular enclosures built using large stone slabs. The size of the enclosure varies from a maximum of 28 x 30 m.*to a minimum of 10.5 x 4.4 m. (Figure 8, Table 2).
  3. *The stone enclosure located near Hayinar is the largest one at approximately 30 x 40 m. based on pacing of the site during a visit by the authors in 2008.

  4. Almost life-sized anthropomorphic stone stelae erected along one side of the stone enclosures (Lin Yun 2008).
  5. Single enclosures tend to contain one or more than one burial, all or some with stone cist coffins.
  6. The cist coffin is usually constructed using five large stone slabs, four for the sides and one on top, leaving bare earth at the base (Zhang Yuzhong 2007). Sometimes the insides of the slabs have simple painted designs (Zhang Yuzhong 2005).
  7. Primary and secondary burials occur in the same grave.
  8. Some decapitated bodies (up to 20) may be associated with the main burial in one cist.
  9. Bodies are commonly placed on the back or side with the legs drawn up.
  10. Grave goods include stone and bronze arrowheads, handmade gray or brown round-bottomed ovoid jars, and small numbers of flat-bottomed jars (Fig. 7).
  11. Clay lamps appear to occur together with roundbottomed jars.
  12. Complex incised decoration on ceramics is common but some vessels are undecorated.
  13. The stone vessels are distinctive for the high quality of manufacture.
  14. Stone moulds indicate relatively sophisticated metallurgical expertise.
  15. Artefacts made from pure copper occur.
  16. Sheep knucklebones (astragali) imply a tradition (as in historical and modern times) of keeping knucklebones for ritual or other purposes. They also indicate the herding of domestic sheep as part of the subsistence economy.
Chemurchek culture ca. 2200-1750 BC. See full culture and ancient DNA maps.

Chemurchek dating

Available evidence suggests that the date range for Qiemu’erqieke Phase I should fall from the later third into the early second millennium BC. There are several reasons to suggest that the time span is around the early second millennium BC. Lin Yun (2008) (…) maintains that the bronze artefacts found in Phase I show a greater sophistication in the level of copper alloy technology than that of the pure copper artefacts common to the Afanasievo tradition. On this basis it might be suggested that the Afanasievo could be considered to be Chalcolithic with a time span across much of the third millennium BC ( Gorsdorf et al. 2004: 86, Fig. 1). Qiemu’erqieke Phase I, however, should more properly be considered as Bronze Age.

Lin Yun also used the bronze arrowhead from burial Ml 7 to narrow down the date of Qiemu’erqieke Phase I. Two arrowheads were found in this burial, one of them leaf shaped with a single barb on the back (Fig. 7:4). A similar arrowhead, together with its casting mould, has been found at the Huoshaogou site of Siba tradition (Li Shuicheng 2005, Sun Shuyun and Han Rufen 1997), in Gansu province, northwest China, dated around 2000-1800 BC (Li Shuicheng and Shui Tao 2000) . This supports a date in the early second millennium BC for the Qiemu’erqieke arrowhead. The painted, round-bottomed jar from the Tianshanbeilu cemetery Qia Weiming, Betts and Wu Xinhua 2008: Fig. 7, bottom left) has been considered as a hybrid between the Upper Yellow River Bronze Age cultures of Siba in northwest China and the steppe tradition of Qiemu’erqieke in west Siberia (Li Shuicheng 1999). If this assumption is correct, the date of Tianshanbeilu, around 2000 BC, can be used as a reference for Qiemu’erqieke Phase I (Jia Weiming, Betts and Wu Xinhua 2008, Lin Yun 2008, Li Shuicheng 1999). Stone arrowheads found in Qiemu’erqieke Phase I also imply that the date is likely to fall within the earlier part of the Bronze Age as no such stone arrowheads have yet been found elsewhere in sites of the Bronze Age in Xinlang dated after the beginning of the second millennium BC.*
*For example Chawuhu and Xiaohe cemeteries (Xinjiang Institute of Archaeology 1999, 2003).

Pottery of Afanasevo and East European traits from the Chemurchek complex. Image modified from Kovalev (2017).

(…) Pottery “oil burners” (goblet-like ceramic vessels, possibly lamps) have been found in three traditions: Afanasievo (Gryaznov and Krizhevskaya 1986:21), Okunevo and Qiemu’erqieke. It is believed that this oil-burner found in Siberia and the Altai is a heritage from the Yamnaya and Catacomb
cultures (Sulimirski 1970: 225, 425; Shishlina 2008:46) in the Caspian steppe further to the west, but does not seem to exist in known Andronovo cultures.
The oil-burner tends to disappear after around 2300 BC during the mid-Okunevo period. It is, however, possible that the tradition continues longer in the Qiemu’erqieke sites.

The construction of the stone enclosures also reveals a close connection between Qiemu’erqieke Phase I and the mid and late Okunevo tradition (Sokolova 2007). Slab built stone enclosures emerged in both the Okunevo and Afanasievo traditions (Gryaznov and Krizhevskaya 1986:15-23, Kovalev 2008, Sokolova 2007, Anthony 2007:310, Koryakova and Epimakhov 2007). In the early Afanasievo the enclosure is circular with no cist coffin (Anthony 2007:310, Gryaznov and Krizhevskaya 1986:20), but in the early stage of the Okunevo square stone enclosures with a single cist burial are dominant. Square or rectangular stone enclosures are a marked feature of Qiemu’erqieke Phase I, suggesting temporal relationships between Qiemu’erqieke Phase I and the Okunevo. In Okunevo chronological group II, possibly with influence from the Anfanasievo, circular stone enclosures appeared in combination with rectangular enclosures within individual cemeteries, referred to by Sokolova (2007: table 2) as hybrid examples. By Okunevo chronological group III, rectangular stone slab enclosures with multi-burials emerged again. This is the dominant form in Qiemu’erqieke Phase I. Okunevo burial traditions changed again to single cist burials in the late stage around chronological group V ( Sokol ova 2007). A specific mortuary rite of decapitated burials exists in both the Qiemu’erqieke and Okunevo traditions (Sokolova 2007, Chen Kwang-tzuu and Hiebert 1995), as does the occasional occurrence of painted designs on the interior of the slabs forming the cists ( e.g., Khavrin 1997: 70, fig. 4; 77: tab. IV.5). Based on these comparisons, the date of Qiemu’erqieke Phase I may well parallel that of the Okunevo from at least chronological group II around 2400 BC (Gorsdorf et al. 2004: fig. 1).

Khuh Udzuuriin I-1 elite barrow (ca. 2470-2190 BC). Modified from Image modified from Kovalev (2014).

In addition to the pottery making tradition, the anthropomorphic stone stelae may also have earlier antecedents. In the Okunevo assemblage there are anthropomorphic stelae that are longer, thinner and more abstract than those of Qiemu’erqieke. There is no indication of such stelae in the Afanasievo tradition (Gryaznov and Krizhevskaya 1986:15-23). However, further to the west, anthropomorphic stone stelae are associated with the Kemi-Oba and Yamnya cultures around the third millennium BC (Telegin and Mallory 1994; Figure 13). Some major characteristics of these stelae such as the icons on the front face of the stelae (Telegin and Mallory 1994:8-9) also appear on stelae found in Qiemu’erqieke Phase I. Recalling the oil burners that may have been inherited from the Yamnya culture and which are found in the Afansievo, Okunevo and Qiemu’erqieke Phase I, it migh t be possible to speculate that Qiemu’erqieke Phase I has its origins even earlier than the first half of the third millennium BC. This idea has also been suggested by Kovalev ( 1999).

Despite the affinities with the Okunevo cultural tradition, Qiemu’erqieke Phase I appears to be a discrete regional variant. The ceramic assemblage shows traits unique to this cluster of sites, while the anthropomorphic stelae are also distinctive markers of this tradition.

Khuh Udzuur anthropomorphic stone stela, oriented toward the south – south-east. Image modified from Kovalev (2014).

3. Bronze Age Xinjiang

I recently reported on this blog the description of Xiaohe and Gumugou cemeteries from interesting Master’s thesis Shifting Memories: Burial Practices and Cultural Interaction in Bronze Age China: A study of the Xiaohe-Gumugou cemeteries in the Tarim Basin, by Yunyun Yang, Uppsala University, Department of Archaeology and Ancient History (2019).

It also offered a full summary of findings from prehistoric sites of Xinjiang related to the arrival of a cultural package from the Altai region, ultimately connected to Afanasievo. Relevant excerpts include the following (emphasis mine):

In Bronze Age Xinjiang, burials were diverse but also show some common features between different geographic sections. The main three mountains, including Kunlun Mountains, Tian Shan (mountains) and Altai Mountains, enclose the Tarim Basin, and the Dzungaria Basin, but leave the eastern part of the Tarim Basin and the south-eastern part of the Dzungaria Basin open (with easy access to the surroundings). The Hami Basin is located at the transitional area, connecting the two basins. Burials are mainly spread along the edge of the mountain ranges.

An assumption of the spreading/expansion routes stone burial construct.

3.1. The Lop Nur region

In the Lop Nur region, the Xiaohe cemetery (2000-1450 BCE) and the Gumugou cemetery (1900-1800 BCE) had many common features shared, and so is the Keliyahe northern cemetery:

  • Cemeteries were located in sandy areas;
  • Rectangular/boat-shaped wooden coffins with monuments of wooden planks or poles;
  • Coffins had no bottoms;
  • The dead were placed lying straight on the back;
  • The dead were commonly buried in single graves.

The Gumugou cemetery contained six special sun-radiating-spokes burial pattern in addition to the normal burials, which were similar to the wooden coffin graves of the Xiaohe cemetery.

NOTE. For more on Xiaohe and Gumugou, see the recent post on Proto-Tocharians. See other papers on the Andronovo horizon for other Early to Middle Bronze Age cultural groups less clearly associated with the Xiaohe horizon, like Hazandu, Xintala, or the Chust culture.

From Shuicheng (2006):

An assemblage of early bronzes had been recovered from northwestern Xinjiang and the periphery of Dzungaria 准噶尔 Basin. It comprises a variety of utilitarian tools and weapons, and a small number of apparels. These artifacts bear the stamps of Andronovo Culture in form, artifact type and decorative pattern. The metallographic analysis on selected artifacts indicates that they comprise mainly of tin-bronzes that contain 2–10% of tin. Moreover, the chemical compositions of these artifacts are similar to that of the Andronovo Culture. Latter date (first half of the 1st millennium BC) artifacts of the assemblage include a small number of arsenic bronzes. In all, during the period between the mid-2nd and mid-1st millennium BC, copper and bronze artifacts coexisted in this region, albeit tin-bronze comprised the majority. The composition of alloy did not show significant change over time. Some colleagues pointed out that the Nulasai 奴拉赛 site at Nileke 尼勒克 County in the Yili 伊犁 River basin of Xinjiang was the pioneer in the use of “sulphuric ore–ice copper–copper”technology. It is also the only early smelting site in Euro-Asia that arsenic ore was added to deliberately produce an alloy

Prehistoric cultures of Xinjiang during the Middle Bronze Age. See full culture and ancient DNA maps.

3.2. The Hami Basin-the Balikun Grassland

From Yang (2019):

The Hami Basin-the Balikun Grassland area is located at the eastern part of Tian Shan. The area is divided in a northern basin and a southern basin by the east-west stretch of the Tian Shan. In the Hami Basin-the Balikun Grassland area, the main type of burials were earth-pit graves in the early Bronze Age, and burials of stone-pit with barrows became more common in the late Bronze Age. The Hami-Tianshan-Beilu cemetery is a representative of the earth-pit graves. The features of the Hami-Tianshan-Beilu cemetery (2000-1500 bce) here were:

  • Rectangular earth pit graves;
  • The dead were often in a hocker position lying on one side;
  • Commonly a single dead in one grave.
The Balikun grassland today (source).

The Hami-Wubu cemetery (earlier than 1000 bce) and the Yanbulake cemetery (1200-600 bce) are representatives of another common earth-pit graves. Common features here were:

  • Rectangular earth pits, with two storeys and/or roofed with wooden boards;
  • The dead was placed in a hocker position lying on one side;
  • Mostly a single dead in one grave.

Later there appeared more stone-pit graves in this area, and the features can be summarized as:

  • Round burial mounds, commonly constructed by stones or a mix of stones and earth;
  • Burial mounds with a sunken top or a normal (dome) top;
  • The diameter of the burial mounds varied between 3 and 25.4 m (but not necessarily limited in this scope);
  • Circular or rectangular stone kerbs;
  • Rectangular stone pits, constructed by earth, or stones, or a mix of earth and stones;
  • Rectangular stone pits contained wooden coffins (represented by the Yiwu Baiqi’er cemetery).
Some representatives of stone burials in the Hami Basin – the Balikun Grassland in the Iron Age (Adapted from: Xinjiang 2011, 29-41). Image modified from Yang (2019).

In the Hami Basin, the Bronze Age cemeteries show common burial features like earth pits and hocker position of the dead. With similar pottery styles in the Hami-Tianshan-Beilu cemetery to those in the Machang and Siba cultures (Xinjiang 2011: 17), it suggests possible cultural influence or people’s migrating from the Hexi Corridor in the east.

In the Balikun Grassland, burials in an earlier time contained mostly earth-pit graves but also a small number of stone-pit graves. The pebbles were imbedded in the floors and the walls of the graves in a rectangular shape, e.g. the Balikun-Nanwan cemetery (1600-1000 bce). In a later time, there appeared huge burial mounds with a sunken top, and with the diameters of the burial mounds varying from 3 to 25.4 m, e.g. the Balikun-Dongheigou cemetery and the Balikun-Heigouliang cemetery. The Yiwu-Bai’erqi and the Yiwu-Kuola cemeteries contained either round stone burial mounds or circular stone kerbs on the ground surface. Considering the three burial elements including burial mounds, stone pits and circular kerbs, the later period cemeteries in the Balikun Grassland were actually similar to cemeteries from the southern edge of the Altai Mountain area.

From Shuicheng (2006):

The Nanwan 南湾 cemetery site at Kuisu 奎苏 Town, Balikun 巴里坤 (1600–1100 BC) also yielded an assemblage of early bronzes. The style of its early phase artifacts is similar to that of the burials distributed in the North Tianshan Route. Some sorts of cultural connection should have existed between the two.

The dates of Yanbulake 焉不拉克 Culture (1300–700 BC) are comparatively late. Its metallurgy was a continuation of the western China tradition. Artifact types include a variety of utilitarian tools, weapons and apparels.

Prehistoric cultures of Xinjiang during the Late Bronze Age. See full culture and ancient DNA maps.

3.3. The Turpan Basin-the middle part of Tian Shan

From Yang (2019):

Turpan Basin is located at the western part of the Hami Basin, and lies at the southern edge of the eastern Tian Shan. In the Turpan Basin-the middle part of Tian Shan area, the main representative of the Bronze Age cemeteries is the Yanghai Nr.1 cemetery. The features here were:

  • Elliptic earth pit graves, commonly covered by round logs on the top;
  • Some graves contained burial beds made of round logs or reeds;
  • The dead were mainly placed lying straight on the back;
  • Mostly a single dead in one grave.

In Iron Age, the stone burials became dominant, but the stone burials varied in different regions of the Turpan Basin-the middle part of Tian Shan area. Graves containing burial mounds, stone pit, and circular stone kerbs are represented by the Shanshan-Ertanggou cemetery, the Tuokexun-Alagou cemetery, the Urumqi-Chaiwobu cemetery and the Urumqi-Yizihu-Sayi cemetery, etc. The stone funeral construction features here are similar to those contemporary cemeteries in the Hami Basin-the Balikun Grassland area.

3.4. The southern edge of the western and middle part of Tian Shan

In the southern edge of the western and middle part of Tian Shan area, the main representatives of the late Bronze Age cemeteries are the Hejing-Chawuhu Nr.4 cemetery (around 1000-500 bce), the Hejing-Xiaoshankou cemetery, the Baicheng-cemetery, etc. The main burial features of the late Bronze Age and the early Iron Age cemeteries (see Fig.12) here were:

  • Burial mounds, constructed by stones or a mix of stones and earth;
  • Irregular circular or rectangular stone kerbs;
  • Stone pit graves in a bell-shape or a rectangular shape;
  • Stone pit graves constructed by imbedding pebbles or stone slabs in walls and floors;
  • The dead were often placed lying on their back with bent legs;
  • The dead were commonly reburied a second time with multiple burials.

From the late Bronze Age to the early Iron Age in this area, the burial traditions tended to be in a more varied way. In the stone burials with stone kerbs, there is a mixture of stone pit and earth pit graves. The burial features of the Iron Age cemeteries in this section were similar to those contemporary both in the Hami Basin-the Balikun Grassland area and in the Turpan Basin-the middle part of Tian Shan area.

From Shuicheng (2006):

The Chawuhu 察吾呼 Culture (1100–500 BC) distributes on the foothills between the middle section of the Tianshan Mountain Ranges and Tarim River. Its bronze assemblage comprises a variety of weapons, utilitarian tools and small apparels. They show no apparent temporal change in form and type through the four cultural phases. In addition, bronzes bear the Chawuhu characteristics were found in Hejing 和静, Baicheng 拜城 and Luntai 轮台 (Bügür). Yet, sites distributed along the Tarim River, such as Heshuo 和硕, Kuga 库车and Aksu 阿克苏, yielded remains of a bronze culture different from that of Chawuhu. Bronzes recovered include double-eared socketed axe, arrowheads, awls, knives, needles and bracelets. Their absolute dates have been estimated to be earlier than that of Chawuhu.

Prehistoric cultures of Xinjiang during the Early Iron Age. See full culture and ancient DNA maps

3.5. The Pamir Plateau

From Yang (2019):

A typical Bronze Age cemetery from the Pamir Plateau area is the Tashenku’ergan-Xiabandi cemetery (around 1000-500 bce). The burial features here were:

  • Mainly inhumations, but also a few cremations;
  • Burial mounds, constructed of stones;
  • Irregular circular or rectangular stone kerbs;
  • Mostly a single dead in one grave;
  • The dead was placed in a hocker position lying on one side.

The adoption of burial customs from the east supports the migration of Afanasievo-related peoples from the Tian Shan up to the Pamir Plateau, strongly influencing the findings of the Xiabandi cemetery, which has been dated from an early Bronze Age phase (ca. 1500-300 BC) to a late date up to ca. 600 BC.

While it is today unclear how far the Afanasievo admixture reached into the western Xinjiang, it seems that the Pamir Plateau remained culturally connected to neighbouring Andronovo-related cultures in pottery and metallurgical innovations, hence their language probably belonged – during most part of the Bronze and Iron Ages – to the Indo-Iranian branch, even though specific dialects might have changed with each new attested group.

In particular, it is possible that the early Andronovo groups related to the Xiaohe Horizon spoke Indo-Aryan or West Iranian dialects, while Saka-related groups replaced them – or an intermediate Tocharian-speaking group – with East Iranian dialects. A close interaction with West Iranian would justify the known ancient borrowings of Tocharian, although they could also be explained by contacts with Chust-related groups farther west. For more on this, see Ged Carling’s work on the different layers of Iranian loans.

Xinjiang BA/IA Summary

From Yang (2019):

In the early Bronze Age, there are distinct regional differences in the burial customs in and surrounding the Tarim Basin. At the southern edge of the Altai Mountains area, the burial customs included stone burial mounds, stone pit graves, circular or rectangular stone kerbs and stone human sculptures; the dead were placed lying straight on the back. In the Hami Basin-the Balikun Grassland area, the burial customs included earth pit graves; the dead were placed in a hocker position lying on one side. In the Turpan Basin-the middle part of Tian Shan area, the burial customs included earth pit graves; the dead were placed lying straight on the back. In the Lop Nur region, the burial customs included wooden coffins buried in sand; the dead were placed lying straight on the back.

But from the late Bronze Age to the early Iron Age, there was a common shift in burial customs from earth pit graves to stone burials in the Hami Basin-the Balikun Grassland area and in the Turpan Basin-the middle part of Tian Shan area. The main features of the stone burials include stone burial mounds, circular or rectangular stone kerbs, and the stone pit graves in the cemeteries. Similar stone burial customs commonly appeared at the southern edge of the western and middle part of Tian Shan area and the Pamir Plateau area in Iron Age. The burial features in most areas are in a mixture of both the earth pit graves and stone pit graves, especially in the Hami Basin-the Balikun Grassland area and the Turpan Basin-the middle part of Tian Shan area.


From Shuicheng (2006):

Historians of metallurgy conducted metallographic analyses on a sample of 234 metal specimens recovered from 16 localities in eastern Xinjiang. They concluded that the metallurgic industry in eastern Xinjiang could be roughly partitioned into three developmental phases. The early phase is represented by the burials distributed in the North Tianshan Route. The majority of the metal assemblage was tin-bronzes; however, copper and arsenic-bronzes maintained considerable proportions. The middle phase is represented by the burials at Yanbulake. During this phase, tin-bronze still maintained the majority; the proportion of arsenic-bronze increased, and some of them were high arsenic-bronzes. The late phase is represented by the burials at Heigouliang 黑沟梁. The composition of lead increased in the bronze alloy in the expense of arsenic. In addition, this phase witnessed the appearance of high tin-bronze that composed up to 16% of tin and the appearance of brass, that is, an alloy of copper and zinc. The bronze alloy consistently contained significant amount of impurities regardless of temporal difference. Casting and forging technologies coexisted throughout the three phases. The early bronzes (2000–500 BC) of eastern Xinjiang, in general, contained arsenic; however, the composition of arsenic was usually under 8%, but a few artifacts contained more than 20% arsenic. In all, arsenic had long been used in the alloy-forming of the early bronzes in eastern Xinjiang. Consequently, arsenic-bronzes were widely found in the prehistoric archaeology of the region. The artifact types, chemical compositions and manufacture techniques of the bronze assemblage of the burials of the North Tianshan Route are similar to those of Siba Culture, indicating that eastern Xinjiang had played a significant role in the East-West interactions.

An assemblage of early bronzes had been recovered from northwestern Xinjiang and the periphery of Dzungaria 准噶尔 Basin. It comprises a variety of utilitarian tools and weapons, and a small number of apparels. These artifacts bear the stamps of Andronovo Culture in form, artifact type and decorative pattern. The metallographic analysis on selected artifacts indicates that they comprise mainly of tin-bronzes that contain 2–10% of tin. Moreover, the chemical compositions of these artifacts are similar to that of the Andronovo Culture. Latter date (first half of the 1st millennium BC) artifacts of the assemblage include a small number of arsenic-bronzes. In all, during the period between the mid-2nd and mid-1st millennium BC, copper and bronze artifacts coexisted in this region, albeit tin-bronze comprised the majority.

Prehistoric cultures of Xinjiang during the Late Iron Age. See full culture and ancient DNA maps.

Tocharians in population genomics

Prehistoric population movements between the Altai and the Tian Shan are difficult to pinpoint, not the least because of the division of these territories among three different countries and their archaeological teams, only recently (more) open to the international scholarship.

The available schematic archaeological picture, where migrations could only be roughly inferred, has been recently updated to a great extent by Ning, Wang et al. (2019), whose genetic analysis of the samples is as thorough as anyone could have asked for, with a level of detail which matches the complex genetic picture of the region by the Iron Age.

As a summary, here is what they described about the samples from Shirenzigou (ca 400-200 BC), corresponding to the Iron Age populations of the Hami Basin-the Balikun Grassland area, and closely related to the preceding Yanbulake Culture:

As shown in Figure S3, the Steppe_MLBA populations including Srubnaya, Andronovo, and Sintashta were shifted toward farming populations compared with Yamnaya groups and the Shirenzigou samples. This observation is consistent with ADMIXTURE analysis that Steppe_MLBA populations have an Anatolian and European farmer-related component that Yamnaya groups and the Shirenzigou individuals do not seem to have. The analysis consistently suggested Yamnaya-related Steppe populations were the better source in modeling the West Eurasian ancestry in Shirenzigou.

Biplot of f3-outgroup tests illustrating the Kostenki14 and Anatolia_N like ancestries in Shirenzigou individuals. Most Shirenzigou individuals were on a cline with Yamnaya and European hunter-gatherer groups, lacking the European farmer ancestry as compared to the Steppe_MLBA populations such as Andronovo, Srubnaya and Sintashta [S1-S5]. Horizontal and vertical bars represent ± 3 standard errors, corresponding to form of outgroup f3 tests on the x axis and y axis respectively.

We continued to use qpAdm to estimate the admixture proportions in the Shirenzigou samples by using different pairs of source populations, such as Yamnaya_Samara, Afanasievo, Srubnaya, Andronovo, BMAC culture (Bustan_BA and Sappali_ Tepe_BA) and Tianshan_Hun as the West Eurasian source and Han, Ulchi, Hezhen, Shamanka_EN as the East Eurasian source. In all cases, Yamnaya, Afanasievo, or Tianshan_Hun always provide the best model fit for the Shirenzigou individuals, while Srubnaya, Andronovo, Bustan_BA and Sappali_Tepe_BA only work in some cases.

Table S2. P values in modelling a two-way (P=rank 1) admixture in Shirenzigou samples using each of the four populations (Bustan_BA, Sappali_Tepe_BA, Andronovo.SG, Srubnaya) together with Han Chinese as two sources [S6], Related to Figure 2. We used the following set of outgroups populations: Dinka, Ust_Ishim, Kostenki14, Onge, Papuan, Australian, Iran_N, EHG, LBK_EN.


In the PCA, ADMIXTURE, outgroup f3 statistics [see Figure S4], as well as f4 statistics (Table S3), we observed the Shirenzigou individuals were closer to the present day Tungusic and Mongolic-speaking populations in northern Asia than to the populations in central and southern China, suggesting the northern populations might contribute more to the Shirenzigou individuals. Based on this, we then modeled Shirenzigou as a three-way admixture of Yamnaya_Samara, Ulchi (or Hezhen) and Han to infer the source from the East Eurasia side that contributed to Shirenzigou. We found the Ulchi or Hezhen and Han-related ancestry had a complicated and unevenly distribution in the Shirenzigou samples. The most Shirenzigou individuals derived the majority of their East Eurasian ancestry from Ulchi or Hezhen-related populations, while the following two individuals M820 and M15-2 have more Han related than Ulchi/ Hezhen-related ancestry

It is unclear whether the Chemurchek population will show a sizeable local contribution from neighbouring groups. The fact that Okunevo shows 20% Yamnaya-related ancestry strongly supports the nature of neighbouring stone-grave-building peoples of the Altai and the northern Tian Shan as mostly Afanasievo-like, and the apparent lack of contributions of Srubna/Andronovo-like ancestry in the early Hami-Balikun stone burial builders also speaks for radical population replacement events reaching the areas south of Tian Shan, at least initially.

While ancestry cannot settle linguistic questions, it seems that nomads of the Gansu and Qinghai grasslands retained an ancestry close to Andronovo, whereas nomads of the Hami Basin-Balikun grasslands and related populations of Xinjiang remained closely related to Afanasievo. This doesn’t preclude that the ancestors of the Yuezhi became acculturated under the influence of peoples from eastern Xinjiang, but all data combined suggest an isolation of both populations – relative to other groups and to each other – and it is therefore more likely that they spoke Indo-Iranian-related languages rather than a language of the Tocharian branch.


In an interesting twist of events, despite the initially reported hg. R1b and Q, Tocharians from Shirenzigou actually show a haplogroup diversity comparable to that attested in other late Iron Age populations: a similar diversity is seen, for example, among Germanic, Baltic, and Balto-Finnic peoples of the Baltic region; among East Germanic or Scythians of the north Pontic region; or among Mediterranean peoples sampled to date. Iron Age peoples show thus a complex sociopolitical setting that overcame the previous patrilineal homogeneity of Bronze Age expansions.

PCA and ADMIXTURE for Shirenzigou Samples. Modified from the original to include in black squares samples related to Yamnaya. Modified from the paper to include labels of modern populations and a dotted lines with the cline formed by Shirenzigou, from (Yamnaya-like) Afanasievo to Central and East Asian-like populations. In red circles, samples with best fit for Andronovo-like ancestry. In green circles, samples with Han-related admixture.

M15-2 (with Han-related ancestry) is of the rare haplogroup Q1a-M120, while the samples with highest Steppe_MLBA-related ancestry are of hg. R1b-PH155, which points to their recent origin among Yuezhi, or to Hun-related populations showing an admixture related to the proto-historic nomads of the Gansu and Qinghai grasslands.

The expansion of Chemurchek-related peoples was probably associated more with hg. Q1a (dubious if it’s a Pre-ISOGG 2017 nomenclature, hence possibly Q1b), a haplogroup that might be found in Khvalynsk as a “significant minority” according to Anthony (2019), and it might also be attested in sampled individuals from Afanasievo in its late phase. This might be, therefore, a case similar to the early expansion of Indo-Europeans with R1b-V1636 lineages through the Volga – North Caucasus region, and of the later expansion with I2a-L699 lineages into the Balkans.

Haplogroup Q1a2-M25 is found in individual X3, whose Steppe ancestry is likely a combination of Afanasievo plus Andronovo-like ancestry heavily admixed with Hezhen/Ulchi-like populations, in line with the expected recent contacts with the neighbouring Xiongnu, Yuezhi, and other population movements affecting eastern Xinjiang.

Sample M4, which packs the most Afanasievo-like ancestry, is of hg. R1a-Z645, which – like sample M8R1 of hg. O – is most likely related to haplogroup resurgence events of local populations, which left the predominant Afanasievo-like admixture brought by builders of stone burials essentially intact, evidenced by the almost 100% of R1a found in the Xiaohe cemetery – and in most of the early Andronovo horizon – and among expanding Kangju and Wusun, as well as by the prevalence of hg. O among sampled East Asian populations.

A question that will only be answered with more samples is how and when the prevalent R1b-L23 and Q1b lineages among Afanasievo-related peoples began to be replaced to reach the high variability seen in Shirenzigou. Given the pastoralist nature of peoples around Tian Shan, the succeeding expansions of Proto-Tocharians, and the late isolation of different Common Tocharian groups, it is more than likely that this variability represents a late and local phenomenon within Xinjiang itself.

Peoples of Xinjiang during Antiquity. See full culture and ancient DNA maps.


Tocharians are one of the main pillars that confirm the Late Proto-Indo-European homeland of the R1b-rich populations of the Don-Volga region. There is already:

Just like the East Bell Beaker expansion from Yamnaya Hungary has confirmed that Corded Ware peoples did not partake in spreading Indo-European languages (spreading Uralic languages instead), data on the expansion of Tocharian speakers from Afanasievo to the Tian Shan was always there; population genomics is merely helping to connect the dots.

In summary, genetic research is supporting the expected linguistic expansions of the Neolithic and Bronze Age step by step, slowly but surely.


North-West Indo-Europeans of Iberian Beaker descent and haplogroup R1b-P312


The recent data on ancient DNA from Iberia published by Olalde et al. (2019) was interesting for many different reasons, but I still have the impression that the authors – and consequently many readers – focused on not-so-relevant information about more recent population movements, or even highlighted the least interesting details related to historical events.

I have already written about the relevance of its findings for the Indo-European question in an initial assessment, then in a more detailed post about its consequences, then about the arrival of Celtic languages with hg. R1b-M167, and later in combination with the latest hydrotoponymic research.

This post is thus a summary of its findings with the help of natural neighbour interpolation maps of the reported Germany_Beaker and France_Beaker ancestry for individual samples. Even though maps are not necessary, visualizing geographically the available data facilitates a direct comprehension of the most relevant information. What I considered key points of the paper are highlighted in bold, and enumerated.

NOTE. To get “more natural” maps, extrapolation for the whole Iberian Peninsula is obtained by interpolation through the use of external data from the British Isles, Central Europe, and Africa. This is obviously not ideal, but – lacking data from the corners of the Iberian Peninsula – this method gives a homogeneous look to all maps. Only data in direct line between labelled samples in each map is truly interpolated for the Iberian Peninsula, while the rest would work e.g. for a wider (and more simplistic) map of European Bronze Age ancestry components.


Iberian Chalcolithic groups and expansion of the Proto-Beaker package. See full map.

The Proto-Beaker package may or may not have expanded into Central Europe with typical Iberia_Chalcolithic ancestry. A priori, it seems a rather cultural diffusion of traits stemming from west Iberia roughly ca. 2800 BC.

Map of Y-DNA haplogroups among Iberia Chalcolithic samples. See full map.

The situation during the Chalcolithic is only relevant for the Indo-European question insofar as it shows a homogeneous Iberia_Chalcolithic-like ancestry with typical Y-chromosome (and mtDNA) haplogroups of the Iberian Neolithic dominating over the whole Peninsula until about 2500 BC. This might represent an original Basque-Iberian community.

Map of mtDNA haplogroups among Iberia Chalcolithic samples. See full map.

Bell Beaker period

Iberian Bell Beaker groups and potential routes of expansion. See full map.

The expansion of the Bell Beaker folk brought about a cultural and genetic change in all Europe, to the point where it has been rightfully considered by Mallory (2013) – the last one among many others before him – the vector of expansion of North-West Indo-European languages. Olalde et al. (2019) proved two main points in this regard, which were already hinted in Olalde et al. (2018):

(1) East Bell Beakers brought hg. R1b-L23 and Yamnaya ancestry to Iberia, ergo the Bell Beaker phenomenon was not a (mere) local development in Iberia, but involved the expansion of peoples tracing their ancestry to the Yamnaya culture who eventually replaced a great part of the local population.

Natural neighbor interpolation of Germany_Beaker ancestry in Iberia during the Bell Beaker period (ca. 2600-2250 BC). See full map.

(2) Classical Bell Beakers have their closest source population in Germany Beakers, and they reject an origin close to Rhine Beakers (i.e. Beakers from the British Isles, the Netherlands, or northern France), ergo the Single Grave culture was not the origin of the Bell Beaker culture, either (see here).

Map of Y-DNA haplogroups among Iberian Bell Beaker samples. See full map.
Map of mtDNA haplogroups among Iberian Bell Beaker samples. See full map.

Early Bronze Age

Iberian Early Bronze Age groups and likely population and culture expansions. See full map.

Interestingly, the European Early Bronze Age in Iberia is still a period of adjustments before reaching the final equilibrium. Unlike the situation in the British Isles, where Bell Beakers brought about a swift population replacement, Iberia shows – like the Nordic Late Neolithic period – centuries of genomic balancing between Indo-European- and non-Indo-European-speaking peoples, as could be suggested by hydrotoponymic research alone.

(3) Palaeo-Indo-European-speaking Old Europeans occupied first the whole Iberian Peninsula, before the potential expansion of one or more non-Indo-European-speaking groups, which confirms the known relative chronology of hydrotoponymic layers of Iberia.

Natural neighbor interpolation of Germany_Beaker ancestry in Iberia during the Early Bronze Age period (ca. 2250-1750 BC). See full map.

This balancing is seen in terms of Germany_Beaker vs. Iberia_Chalcolithic ancestry, but also in terms of Y-chromosome haplogroups, with the most interesting late developments happening in southern Iberia, around the territory where El Argar eventually emerged in radical opposition to the Bell Beaker culture.

Map of Y-DNA haplogroups among Iberia Early Bronze Age samples. See full map.

(4) Bell Beakers and descendants expanded under male-driven migrations, proper of the Indo-European patrilineal tradition, seen in Yamnaya and even earlier in Khvalynsk:

We obtained lower proportions of ancestry related to Germany_Beaker on the X-chromosome than on the autosomes (Table S14), although the Z-score for the differences between the estimates is 2.64, likely due to the large standard error associated to the mixture proportions in the X-chromosome.


Map of mtDNA haplogroups among Iberia Early Bronze Age samples. See full map.

Regarding the PCA, Iberia Bronze Age samples occupy an intermediate cluster between Iberia Chalcolithic and Bell Beakers of steppe ancestry, with Yamnaya-rich samples from the north (Asturias, Burgos) representing the likely source Old European population whose languages survived well into the Roman Iron Age:

PCA of ancient European samples. Marked and labelled are Bronze Age groups and relevant samples. See full image.

Middle Bronze Age

Iberian Middle Bronze Age groups and likely population and culture expansions. See full map.

During the Middle Bronze Age, the equilibrium reached earlier is reversed, with a (likely non-Indo-European-speaking) Argaric sphere of influence expanding to the west and north featuring Iberia Chalcolithic and lesser amount of Germany_Beaker ancestry, present now in the whole Peninsula, although in varying degrees.

Natural neighbor interpolation of Germany_Beaker ancestry in Iberia during the Middle Bronze Age period (ca. 1750-1250 BC). See full map.

All Iberian groups were probably already under a bottleneck of R1b-DF27 lineages, although it is likely that specific subclades differed among regions:

Map of Y-DNA haplogroups among Iberia Middle Bronze Age samples. See full map.
Map of mtDNA haplogroups among Iberia Middle Bronze Age samples. See full map.

Late Bronze Age

Iberian Late Bronze Age groups and likely population and culture expansions. See full map.

The Late Bronze Age represents the arrival of the Urnfield culture, which probably expanded with Celtic-speaking peoples. A Late Bronze Age transect before their genetic impact still shows a prevalent Germany_Beaker-like Steppe ancestry, probably peaking in north/west Iberia:

Natural neighbor interpolation of Germany_Beaker ancestry in Iberia during the Late Bronze Age period (ca. 1250-750 BC). See full map.

(5) Galaico-Lusitanians were descendants of Iberian Beakers of Germany_Beaker ancestry and hg. R1b-M269. Autosomal data of samples I7688 and I7687, of the Final Bronze (end of the reported 1200-700 BC period for the samples), from Gruta do Medronhal (Arrifana, Coimbra, Portugal) confirms this.

In the 1940s, human bones, metallic artifacts (n=37) and non-human bones were discovered in the natural cave of Medronhal (Arrifana, Coimbra). All these findings are currently housed in the Department of Life Sciences of the University of Coimbra and are analyzed by a multidisciplinary team. The artifacts suggest a date at the beginning of the 1st millennium BC, which is confirmed by radiocarbon date of a human fibula: 890–780 cal BCE (2650±40 BP, Beta–223996). This natural cave has several rooms and corridors with two entrances. No information is available about the context of the human remains. Nowadays these remains are housed mixed and correspond to a minimum number of 11 individuals, 5 adults and 6 non-adults.

In particular, sample I7687 shows hg. R1b-M269, with no available quality SNPs, positive or negative, under it (see full report). They represent thus another strong support of the North-West Indo-European expansion with Bell Beakers.

Map of Y-DNA haplogroups among Iberian Late Bronze Age samples. See full map.
Map of mtDNA haplogroups among Iberian Late Bronze Age samples. See full map.

NOTE. To understand how the region around Coimbra was (Proto-)Lusitanian – and not just Old European in general – until the expansion of the Turduli Oppidani, see any recent paper on Bronze Age expansion of warrior stelae, hydrotoponymy, anthroponymy, or theonymy (see e.g. about Spear-vocabulary).

Iron Age

Iberian Pre-Roman Iron Age groups and likely population and culture expansions. See full map.

In a complex period of multiple population movements and language replacements, the temporal transect in Olalde et al. (2019) offers nevertheless relevant clues for the Pre-Roman Iron Age:

(6) The expansion of Celtic languages was associated with the spread of France_Beaker-like ancestry, most likely already with the LBA Urnfield culture, since a Tartessian and a Pre-Iberian samples (both dated ca. 700-500 BC) already show this admixture, in regions which some centuries earlier did not show it. Similarly, a BA sample from Álava ca. 910–840 BC doesn’t show it, and later Celtiberian samples from the same area (ca. 4th c. BC and later) show it, depicting a likely north-east to west/south-west routes of expansion of Celts.

Natural neighbor interpolation of France_Beaker ancestry in Iberia during the Pre-Roman Iron Age period (ca. 750-250 BC). See full map.

(7) The distribution of Germany_Beaker ancestry peaked, by the Iron Age, among Old Europeans from west Iberia, including Galaico-Lusitanians and probably also Astures and Cantabri, in line with what was expected before genetic research:

Natural neighbor interpolation of Germany_Beaker ancestry in Iberia during the Pre-Roman Iron Age period (ca. 750-250 BC). See full map.

A probably more precise picture of the Final Bronze – Early Iron Age transition is obtained by including the Final Bronze samples I2469 from El Sotillo, Álava (ca. 910-875 BC) as Celtic ancestry buffer to the west, and the sample I3315 from Menorca (ca. 904-861 BC), lacking more recent ones from intermediate regions:

Natural neighbor interpolation of Germany_Beaker ancestry in Iberia during the Final Bronze Age – Early Iron Age transition. See full map.
Natural neighbor interpolation of France_Beaker ancestry in Iberia during the Final Bronze Age – Early Iron Age transition. See full map.

In terms of Y-DNA and mtDNA haplogroups, the situation is difficult to evaluate without more samples and more reported subclades:

Map of Y-DNA haplogroups among Iberian Iron Age samples. See full map.
Map of mtDNA haplogroups among Iberian Iron Age samples. See full map.

In the PCA, Proto-Lusitanian samples occupy an intermediate cluster between Iberian Bronze Age and Bronze Age North (see above), including the Final Bronze sample from Álava, while Celtic-speaking peoples (including Pre-Iberians and Iberians of Celtic descent from north-east Iberia) show a similar position – albeit evidently unrelated – due to their more recent admixture between Iberian Bronze Age and Urnfield/Hallstatt from Central Europe:

PCA of ancient European samples. Marked and labelled are Iron Age groups and relevant samples. See full image.

(8) Iberian-speaking peoples in north-east Iberia represent a recent expansion of the language from the south, possibly accompanied by an increase in Iberia_Chalcolithic/Germany_Beaker admixture from east/south-east Iberia.

(9) Modern Basques represent a recent isolation + Y-DNA bottlenecks after the Roman Iron Age population movements, probably from Aquitanians migrating south of the Pyrenees, admixing with local peoples, and later becoming isolated during the Early Middle Ages and thereafter:

[Modern Basques] overlap genetically with Iron Age populations showing substantial levels of Steppe ancestry.

Assuming that France_Beaker ancestry is associated with the Urnfield culture (spreading with Celtic-speaking peoples), Vasconic speakers were possibly represented by some population – most likely from France – whose ancestry is close to Rhine Beakers (see here).

Alternatively, a Vasconic language could have survived in some France/Iberia_Chalcolithic-like population that got isolated north of the Pyrenees close to the Atlantic Façade during the Bronze Age, and who later admixed with Celtic-speaking peoples south of the Pyrenees, such as the Vascones, to the point where their true ancestry got diluted.

In any case, the clear Celtic Steppe-like admixture of modern Basques supports for the time being their recent arrival to Aquitaine before the proto-historical period, which is in line with hydrotoponymic research.


The most interesting aspects to discuss after the publication of Olalde et al. (2019) would have been thus the nature of controversial Palaeohispanic peoples for which there is not much linguistic data, such as:

  • the Astures and the Cantabri, usually considered Pre-Celtic Indo-European (see here);
  • the Vaccaei, usually considered Celtic;
  • the Vettones, traditionally viewed as sharing the same language as Lusitanians due to their apparent shared hydrotoponymic, anthroponymic, and/or theonymic layers, but today mostly viewed as having undergone Celticization and helped the westward expansion of Celtic languages (and archaeologically clearly divided from Old European hostile neighbours to the west by their characteristic verracos);
  • the Pellendones or the Carpetani, who were once considered Pre-Celtic Indo-Europeans, too;
  • the nature of Tartessian as Indo-European, or maybe even as “Celtic”, as defended by Koch;
  • or the potential remote connection of Basque and Iberian languages in a common trunk featuring Iberian/France_Chalcolithic ancestry (also including Palaeo-Sardo).
Pre-Roman Palaeohispanic peoples ca. 300 BC. See full map. Image modified from the version at Wikipedia, a good example of how to disseminate the wrong ideas about Palaeohispanic languages.

Despite these interesting questions still open for discussion, the paper remarked something already known for a long time: that modern Basques had steppe ancestry and Y-DNA proper of the Yamnaya 5,000 years ago, and that Bell Beakers had brought this steppe ancestry and R1b-P312 lineages to Iberia. This common Basque-centric interpretation of Iberian prehistory is the consequence of a 19th-century tradition of obsessively imagining Vasconic-speaking peoples in their medieval territories extrapolated to Cro-Magnons and Atapuerca (no, really), inhabiting undisturbed for millennia a large territory encompassing the whole Iberia and France, “reduced” or “broken” only with the arrival of Celts just before the Roman conquests. A recursive idea of “linguistic autochthony” and “genetic purity” of the peoples of Iberia that has never had any scientific basis.

Similarly, this paper offered the Nth proof already in population genomics that traditional nativist claims for the origin of the Bell Beaker folk in Western Europe were wrong, both southern (nativist Iberian origin) and northern European (nativist Lower Rhine origin). Both options could be easily rejected with phylogeography since 2015, they were then rejected in Olalde et al. and Mathieson et al (2017), then again with the update of many samples in Olalde et al. (2018) and Mathieson et al (2018), and it has most clearly been rejected recently with data from Wang et al. (2018) and its Yamnaya Hungary samples. Findings from Olalde et al. (2019) are just another nail to coffins that should have been well buried by now.

Even David Anthony didn’t have any doubt in his latest model (2017) about the Carpathian Basin origin of North-West Indo-Europeans (see here), and his latest update to the Proto-Indo-European homeland question (2019) shows that he is convinced now about R1b bottlenecks and proper Pre-Yamnaya ancestry stemming from a time well before the Bell Beaker expansion. This won’t be the last setback to supporters of zombie theories: like the hypotheses of an Anatolian, Armenian, or OIT origin of the PIE homeland, other mythical ideas are so entrenched in nationalist and/or nativist tradition that many supporters will no doubt prefer them to die hard, under the most numerous and shameful rejections of endlessly remade reactionary models.


Yamnaya ancestry: mapping the Proto-Indo-European expansions


The latest papers from Ning et al. Cell (2019) and Anthony JIES (2019) have offered some interesting new data, supporting once more what could be inferred since 2015, and what was evident in population genomics since 2017: that Proto-Indo-Europeans expanded under R1b bottlenecks, and that the so-called “Steppe ancestry” referred to two different components, one – Yamnaya or Steppe_EMBA ancestry – expanding with Proto-Indo-Europeans, and the other one – Corded Ware or Steppe_MLBA ancestry – expanding with Uralic speakers.

The following maps are based on formal stats published in the papers and supplementary materials from 2015 until today, mainly on Wang et al. (2018 & 2019), Mathieson et al. (2018) and Olalde et al. (2018), and others like Lazaridis et al. (2016), Lazaridis et al. (2017), Mittnik et al. (2018), Lamnidis et al. (2018), Fernandes et al. (2018), Jeong et al. (2019), Olalde et al. (2019), etc.

NOTE. As in the Corded Ware ancestry maps, the selected reports in this case are centered on the prototypical Yamnaya ancestry vs. other simplified components, so everything else refers to simplistic ancestral components widespread across populations that do not necessarily share any recent connection, much less a language. In fact, most of the time they clearly didn’t. They can be interpreted as “EHG that is not part of the Yamnaya component”, or “CHG that is not part of the Yamnaya component”. They can’t be read as “expanding EHG people/language” or “expanding CHG people/language”, at least no more than maps of “Steppe ancestry” can be read as “expanding Steppe people/language”. Also, remember that I have left the default behaviour for color classification, so that the highest value (i.e. 1, or white colour) could mean anything from 10% to 100% depending on the specific ancestry and period; that’s what the legend is for… But, fere libenter homines id quod volunt credunt.


  1. Neolithic or the formation of Early Indo-European
  2. Eneolithic or the expansion of Middle Proto-Indo-European
  3. Chalcolithic / Early Bronze Age or the expansion of Late Proto-Indo-European
  4. European Early Bronze Age and MLBA or the expansion of Late PIE dialects

1. Neolithic

Anthony (2019) agrees with the most likely explanation of the CHG component found in Yamnaya, as derived from steppe hunter-fishers close to the lower Volga basin. The ultimate origin of this specific CHG-like component that eventually formed part of the Pre-Yamnaya ancestry is not clear, though:

The hunter-fisher camps that first appeared on the lower Volga around 6200 BC could represent the migration northward of un-admixed CHG hunter-fishers from the steppe parts of the southeastern Caucasus, a speculation that awaits confirmation from aDNA.

Natural neighbor interpolation of CHG ancestry among Neolithic populations. See full map.

The typical EHG component that formed part eventually of Pre-Yamnaya ancestry came from the Middle Volga Basin, most likely close to the Samara region, as shown by the sampled Samara hunter-gatherer (ca. 5600-5500 BC):

After 5000 BC domesticated animals appeared in these same sites in the lower Volga, and in new ones, and in grave sacrifices at Khvalynsk and Ekaterinovka. CHG genes and domesticated animals flowed north up the Volga, and EHG genes flowed south into the North Caucasus steppes, and the two components became admixed.

Natural neighbor interpolation of EHG ancestry among Neolithic populations. See full map.

To the west, in the Dnieper-Dniester area, WHG became the dominant ancestry after the Mesolithic, at the expense of EHG, revealing a likely mating network reaching to the north into the Baltic:

Like the Mesolithic and Neolithic populations here, the Eneolithic populations of Dnieper-Donets II type seem to have limited their mating network to the rich, strategic region they occupied, centered on the Rapids. The absence of CHG shows that they did not mate frequently if at all with the people of the Volga steppes (…)

Natural neighbor interpolation of WHG ancestry among Neolithic populations. See full map.

North-West Anatolia Neolithic ancestry, proper of expanding Early European farmers, is found up to border of the Dniester, as Anthony (2007) had predicted.

Natural neighbor interpolation of Anatolia Neolithic ancestry among Neolithic populations. See full map.

2. Eneolithic

From Anthony (2019):

After approximately 4500 BC the Khvalynsk archaeological culture united the lower and middle Volga archaeological sites into one variable archaeological culture that kept domesticated sheep, goats, and cattle (and possibly horses). In my estimation, Khvalynsk might represent the oldest phase of PIE.

(…) this middle Volga mating network extended down to the North Caucasian steppes, where at cemeteries such as Progress-2 and Vonyuchka, dated 4300 BC, the same Khvalynsk-type ancestry appeared, an admixture of CHG and EHG with no Anatolian Farmer ancestry, with steppe-derived Y-chromosome haplogroup R1b. These three individuals in the North Caucasus steppes had higher proportions of CHG, overlapping Yamnaya. Without any doubt, a CHG population that was not admixed with Anatolian Farmers mated with EHG populations in the Volga steppes and in the North Caucasus steppes before 4500 BC. We can refer to this admixture as pre-Yamnaya, because it makes the best currently known genetic ancestor for EHG/CHG R1b Yamnaya genomes.

From Wang et al (2019):

Three individuals from the sites of Progress 2 and Vonyuchka 1 in the North Caucasus piedmont steppe (‘Eneolithic steppe’), which harbour EHG and CHG related ancestry, are genetically very similar to Eneolithic individuals from Khvalynsk II and the Samara region. This extends the cline of dilution of EHG ancestry via CHG-related ancestry to sites immediately north of the Caucasus foothills

Natural neighbor interpolation of Pre-Yamnaya ancestry among Neolithic populations. See full map. This map corresponds roughly to the map of Khvalynsk-Novodanilovka expansion, and in particular to the expansion of horse-head pommel-scepters (read more about Khvalynsk, and specifically about horse symbolism)

NOTE. Unpublished samples from Ekaterinovka have been previously reported as within the R1b-L23 tree. Interestingly, although the Varna outlier is a female, the Balkan outlier from Smyadovo shows two positive SNP calls for hg. R1b-M269. However, its poor coverage makes its most conservative haplogroup prediction R-M343.

The formation of this Pre-Yamnaya ancestry sets this Volga-Caucasus Khvalynsk community apart from the rest of the EHG-like population of eastern Europe.

Natural neighbor interpolation of non-Pre-Yamnaya EHG ancestry among Eneolithic populations. See full map.

Anthony (2019) seems to rely on ADMIXTURE graphics when he writes that the late Sredni Stog sample from Alexandria shows “80% Khvalynsk-type steppe ancestry (CHG&EHG)”. While this seems the most logical conclusion of what might have happened after the Suvorovo-Novodanilovka expansion through the North Pontic steppes (see my post on “Steppe ancestry” step by step), formal stats have not confirmed that.

In fact, analyses published in Wang et al. (2019) rejected that Corded Ware groups are derived from this Pre-Yamnaya ancestry, a reality that had been already hinted in Narasimhan et al. (2018), when Steppe_EMBA showed a poor fit for expanding Srubna-Andronovo populations. Hence the need to consider the whole CHG component of the North Pontic area separately:

Natural neighbor interpolation of non-Pre-Yamnaya CHG ancestry among Eneolithic populations. See full map. You can read more about population movements in the late Sredni Stog and closer to the Proto-Corded Ware period.

NOTE. Fits for WHG + CHG + EHG in Neolithic and Eneolithic populations are taken in part from Mathieson et al. (2019) supplementary materials (download Excel here). Unfortunately, while data on the Ukraine_Eneolithic outlier from Alexandria abounds, I don’t have specific data on the so-called ‘outlier’ from Dereivka compared to the other two analyzed together, so these maps of CHG and EHG expansion are possibly showing a lesser distribution to the west than the real one ca. 4000-3500 BC.

Natural neighbor interpolation of WHG ancestry among Eneolithic populations. See full map.

Anatolia Neolithic ancestry clearly spread to the east into the north Pontic area through a Middle Eneolithic mating network, most likely opened after the Khvalynsk expansion:

Natural neighbor interpolation of Anatolia Neolithic ancestry among Eneolithic populations. See full map.
Natural neighbor interpolation of Iran Chl. ancestry among Eneolithic populations. See full map.

Regarding Y-chromosome haplogroups, Anthony (2019) insists on the evident association of Khvalynsk, Yamnaya, and the spread of Pre-Yamnaya and Yamnaya ancestry with the expansion of elite R1b-L754 (and some I2a2) individuals:

Y-DNA haplogroups in West Eurasia during the Early Eneolithic in the Pontic-Caspian steppes. See full map, and see culture, ADMIXTURE, Y-DNA, and mtDNA maps of the Early Eneolithic and Late Eneolithic.

3. Early Bronze Age

Data from Wang et al. (2019) show that Corded Ware-derived populations do not have good fits for Eneolithic_Steppe-like ancestry, no matter the model. In other words: Corded Ware populations show not only a higher contribution of Anatolia Neolithic ancestry (ca. 20-30% compared to the ca. 2-10% of Yamnaya); they show a different EHG + CHG combination compared to the Pre-Yamnaya one.

Supplementary Table 13. P values of rank=2 and admixture proportions in modelling Steppe ancestry populations as a three-way admixture of Eneolithic steppe Anatolian_Neolithic and WHG using 14 outgroups.
Left populations: Test, Eneolithic_steppe, Anatolian_Neolithic, WHG.
Right populations: Mbuti.DG, Ust_Ishim.DG, Kostenki14, MA1, Han.DG, Papuan.DG, Onge.DG, Villabruna, Vestonice16, ElMiron, Ethiopia_4500BP.SG, Karitiana.DG, Natufian, Iran_Ganj_Dareh_Neolithic.

Yamnaya Kalmykia and Afanasievo show the closest fits to the Eneolithic population of the North Caucasian steppes, rejecting thus sizeable contributions from Anatolia Neolithic and/or WHG, as shown by the SD values. Both probably show then a Pre-Yamnaya ancestry closest to the late Repin population.

Modelling results for the Steppe and Caucasus cluster. Admixture proportions based on (temporally and geographically) distal and proximal models, showing additional AF ancestry in Steppe groups and additional gene flow from the south in some of the Steppe groups as well as the Caucasus groups. See tables above. Modified from Wang et al. (2019). Within a blue square, Yamnaya-related groups; within a cyan square, Corded Ware-related groups. Green background behind best p-values. In red circle, SD of AF/WHG ancestry contribution in Afanasevo and Yamnaya Kalmykia, with ranges that almost include 0%.

EBA maps include data from Wang et al. (2018) supplementary materials, specifically unpublished Yamnaya samples from Hungary that appeared in analysis of the preprint, but which were taken out of the definitive paper. Their location among Yamnaya settlers from Hungary is speculative, although most uncovered kurgans in Hungary are concentrated in the Tisza-Danube interfluve.

Natural neighbor interpolation of Pre-Yamnaya ancestry among Early Bronze Age populations. See full map. This map corresponds roughly with the known expansion of late Repin/Yamnaya settlers.

The Y-chromosome bottleneck of elite males from Proto-Indo-European clans under R1b-L754 and some I2a2 subclades, already visible in the Khvalynsk sampling, became even more noticeable in the subsequent expansion of late Repin/early Yamnaya elites under R1b-L23 and I2a-L699:

Y-DNA haplogroups in West Eurasia during the Yamnaya expansion. See full map and maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Early Chalcolithic and Yamnaya Hungary.

Maps of CHG, EHG, Anatolia Neolithic, and probably WHG show the expansion of these components among Corded Ware-related groups in North Eurasia, apart from other cultures close to the Caucasus:

NOTE. For maps with actual formal stats of Corded Ware ancestry from the Early Bronze Age to the modern times, you can read the post Corded Ware ancestry in North Eurasia and the Uralic expansion.

Natural neighbor interpolation of non-Pre-Yamnaya CHG ancestry among Early Bronze Age populations. See full map.
Natural neighbor interpolation of non-Pre-Yamnaya EHG ancestry among Early Bronze Age populations. See full map.
Natural neighbor interpolation of WHG ancestry among Early Bronze Age populations. See full map.
Natural neighbor interpolation of Anatolia Neolithic ancestry among Early Bronze Age populations. See full map.
Natural neighbor interpolation of Iran Chl. ancestry among Early Bronze Age populations. See full map.

4. Middle to Late Bronze Age

The following maps show the most likely distribution of Yamnaya ancestry during the Bell Beaker-, Balkan-, and Sintashta-Potapovka-related expansions.

4.1. Bell Beakers

The amount of Yamnaya ancestry is probably overestimated among populations where Bell Beakers replaced Corded Ware. A map of Yamnaya ancestry among Bell Beakers gets trickier for the following reasons:

  • Expanding Repin peoples of Pre-Yamnaya ancestry must have had admixture through exogamy with late Sredni Stog/Proto-Corded Ware peoples during their expansion into the North Pontic area, and Sredni Stog in turn had probably some Pre-Yamnaya admixture, too (although they don’t appear in the simplistic formal stats above). This is supported by the increase of Anatolia farmer ancestry in more western Yamna samples.
  • Later, Yamnaya admixed through exogamy with Corded Ware-like populations in Central Europe during their expansion. Even samples from the Middle to Upper Danube and around the Lower Rhine will probably show increasing contributions of Steppe_MLBA, at the same time as they show an increasing proportion of EEF-related ancestry.
  • To complicate things further, the late Corded Ware Espersted family (from ca. 2500 BC or later) shows, in turn, what seems like a recent admixture with Yamnaya vanguard groups, with the sample of highest Yamnaya ancestry being the paternal uncle of other individuals (all of hg. R1a-M417), suggesting that there might have been many similar Central European mating networks from the mid-3rd millennium BC on, of (mainly) Yamnaya-like R1b elites displaying a small proportion of CW-like ancestry admixing through exogamy with Corded Ware-like peoples who already had some Yamnaya ancestry.
Natural neighbor interpolation of Yamnaya ancestry among Middle to Late Bronze Age populations (Esperstedt CWC site close to BK_DE, label is hidden by BK_DE_SAN). See full map. You can see how this map correlated with the map of Late Copper Age migrations and Yamanaya into Bell Beaker expansion.

NOTE. Terms like “exogamy”, “male-driven migration”, and “sex bias”, are not only based on the Y-chromosome bottlenecks visible in the different cultural expansions since the Palaeolithic. Despite the scarce sampling available in 2017 for analysis of “Steppe ancestry”-related populations, it appeared to show already a male sex bias in Goldberg et al. (2017), and it has been confirmed for Neolithic and Copper Age population movements in Mathieson et al. (2018) – see Supplementary Table 5. The analysis of male-biased expansion of “Steppe ancestry” in CWC Esperstedt and Bell Beaker Germany is, for the reasons stated above, not very useful to distinguish their mutual influence, though.

Based on data from Olalde et al. (2019), Bell Beakers from Germany are the closest sampled ones to expanding East Bell Beakers, and those close to the Rhine – i.e. French, Dutch, and British Beakers in particular – show a clear excess “Steppe ancestry” due to their exogamy with local Corded Ware groups:

Only one 2-way model fits the ancestry in Iberia_CA_Stp with P-value>0.05: Germany_Beaker + Iberia_CA. Finding a Bell Beaker-related group as a plausible source for the introduction of steppe ancestry into Iberia is consistent with the fact that some of the individuals in the Iberia_CA_Stp group were excavated in Bell Beaker associated contexts. Models with Iberia_CA and other Bell Beaker groups such as France_Beaker (P-value=7.31E-06), Netherlands_Beaker (P-value=1.03E-03) and England_Beaker (P-value=4.86E-02) failed, probably because they have slightly higher proportions of steppe ancestry than the true source population.


The exogamy with Corded Ware-like groups in the Lower Rhine Basin seems at this point undeniable, as is the origin of Bell Beakers around the Middle-Upper Danube Basin from Yamnaya Hungary.

To avoid this excess “Steppe ancestry” showing up in the maps, since Bell Beakers from Germany pack the most Yamnaya ancestry among East Bell Beakers outside Hungary (ca. 51.1% “Steppe ancestry”), I equated this maximum with BK_Scotland_Ach (which shows ca. 61.1% “Steppe ancestry”, highest among western Beakers), and applied a simple rule of three for “Steppe ancestry” in Dutch and British Beakers.

NOTE. Formal stats for “Steppe ancestry” in Bell Beaker groups are available in Olalde et al. (2018) supplementary materials (PDF). I didn’t apply this adjustment to Bk_FR groups because of the R1b Bell Beaker sample from the Champagne/Alsace region reported by Samantha Brunel that will pack more Yamnaya ancestry than any other sampled Beaker to date, hence probably driving the Yamnaya ancestry up in French samples.

The most likely outcome in the following years, when Yamnaya and Corded Ware ancestry are investigated separately, is that Yamnaya ancestry will be much lower the farther away from the Middle and Lower Danube region, similar to the case in Iberia, so the map above probably overestimates this component in most Beakers to the north of the Danube. Even the late Hungarian Beaker samples, who pack the highest Yamnaya ancestry (up to 75%) among Beakers, represent likely a back-migration of Moravian Beakers, and will probably show a contribution of Corded Ware ancestry due to the exogamy with local Moravian groups.

Despite this decreasing admixture as Bell Beakers spread westward, the explosive expansion of Yamnaya R1b male lineages (in words of David Reich) and the radical replacement of local ones – whether derived from Corded Ware or Neolithic groups – shows the true extent of the North-West Indo-European expansion in Europe:

Y-DNA haplogroups in West Eurasia during the Bell Beaker expansion. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Late Copper Age and of the Yamnaya-Bell Beaker transition.

4.2. Palaeo-Balkan

There is scarce data on Palaeo-Balkan movements yet, although it is known that:

  1. Yamnaya ancestry appears among Mycenaeans, with the Yamnaya Bulgaria sample being its best current ancestral fit;
  2. the emergence of steppe ancestry and R1b-M269 in the eastern Mediterranean was associated with Ancient Greeks;
  3. Thracians, Albanians, and Armenians also show R1b-M269 subclades and “Steppe ancestry”.

4.3. Sintashta-Potapovka-Filatovka

Interestingly, Potapovka is the only Corded Ware derived culture that shows good fits for Yamnaya ancestry, despite having replaced Poltavka in the region under the same Corded Ware-like (Abashevo) influence as Sintashta.

This proves that there was a period of admixture in the Pre-Proto-Indo-Iranian community between CWC-like Abashevo and Yamnaya-like Catacomb-Poltavka herders in the Sintashta-Potapovka-Filatovka community, probably more easily detectable in this group because of the specific temporal and geographic sampling available.

Supplementary Table 14. P values of rank=3 and admixture proportions in modelling Steppe ancestry populations as a four-way admixture of distal sources EHG, CHG, Anatolian_Neolithic and WHG using 14 outgroups.
Left populations: Steppe cluster, EHG, CHG, WHG, Anatolian_Neolithic
Right populations: Mbuti.DG, Ust_Ishim.DG, Kostenki14, MA1, Han.DG, Papuan.DG, Onge.DG, Villabruna, Vestonice16, ElMiron, Ethiopia_4500BP.SG, Karitiana.DG, Natufian, Iran_Ganj_Dareh_Neolithic.

Srubnaya ancestry shows a best fit with non-Pre-Yamnaya ancestry, i.e. with different CHG + EHG components – possibly because the more western Potapovka (ancestral to Proto-Srubnaya Pokrovka) also showed good fits for it. Srubnaya shows poor fits for Pre-Yamnaya ancestry probably because Corded Ware-like (Abashevo) genetic influence increased during its formation.

On the other hand, more eastern Corded Ware-derived groups like Sintashta and its more direct offshoot Andronovo show poor fits with this model, too, but their fits are still better than those including Pre-Yamnaya ancestry.

Natural neighbor interpolation of non-Pre-Yamnaya EHG ancestry among Middle to Late Bronze Age populations. See full map.
Natural neighbor interpolation of non-Pre-Yamnaya CHG ancestry among Middle to Late Bronze Age populations. See full map.
Natural neighbor interpolation of Anatolia Neolithic ancestry among Middle to Late Bronze Age populations. See full map.
Natural neighbor interpolation of Iran Chl. ancestry among Middle to Late Bronze Age populations. See full map.

NOTE For maps with actual formal stats of Corded Ware ancestry from the Early Bronze Age to the modern times, you should read the post Corded Ware ancestry in North Eurasia and the Uralic expansion instead.

The bottleneck of Proto-Indo-Iranians under R1a-Z93 was not yet complete by the time when the Sintashta-Potapovka-Filatovka community expanded with the Srubna-Andronovo horizon:

Y-DNA haplogroups in West Eurasia during the European Early Bronze Age. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Early Bronze Age.

4.4. Afanasevo

At the end of the Afanasevo culture, at least three samples show hg. Q1b (ca. 2900-2500 BC), which seemed to point to a resurgence of local lineages, despite continuity of the prototypical Pre-Yamnaya ancestry. On the other hand, Anthony (2019) makes this cryptic statement:

Yamnaya men were almost exclusively R1b, and pre-Yamnaya Eneolithic Volga-Caspian-Caucasus steppe men were principally R1b, with a significant Q1a minority.

Since the only available samples from the Khvalynsk community are R1b (x3), Q1a(x1), and R1a(x1), it seems strange that Anthony would talk about a “significant minority”, unless Q1a (potentially Q1b in the newer nomenclature) will pop up in some more individuals of those ca. 30 new to be published. Because he also mentions I2a2 as appearing in one elite burial, it seems Q1a (like R1a-M459) will not appear under elite kurgans, although it is still possible that hg. Q1a was involved in the expansion of Afanasevo to the east.

Y-DNA haplogroups in West Eurasia during the Middle Bronze Age. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Middle Bronze Age and the Late Bronze Age.

Okunevo, which replaced Afanasevo in the Altai region, shows a majority of hg. Q1b, but also some R1b-M269 samples proper of Afanasevo, suggesting partial genetic continuity.

NOTE. Other sampled Siberian populations clearly show a variety of Q subclades that likely expanded during the Palaeolithic, such as Baikal EBA samples from Ust’Ida and Shamanka with a majority of Q1b, and hg. Q reported from Elunino, Sagsai, Khövsgöl, and also among peoples of the Srubna-Andronovo horizon (the Krasnoyarsk MLBA outlier), and in Karasuk.

From Damgaard et al. Science (2018):

(…) in contrast to the lack of identifiable admixture from Yamnaya and Afanasievo in the CentralSteppe_EMBA, there is an admixture signal of 10 to 20% Yamnaya and Afanasievo in the Okunevo_EMBA samples, consistent with evidence of western steppe influence. This signal is not seen on the X chromosome (qpAdm P value for admixture on X 0.33 compared to 0.02 for autosomes), suggesting a male-derived admixture, also consistent with the fact that 1 of 10 Okunevo_EMBA males carries a R1b1a2a2 Y chromosome related to those found in western pastoralists. In contrast, there is no evidence of western steppe admixture among the more eastern Baikal region region Bronze Age (~2200 to 1800 BCE) samples.

This Yamnaya ancestry has been also recently found to be the best fit for the Iron Age population of Shirenzigou in Xinjiang – where Tocharian languages were attested centuries later – despite the haplogroup diversity acquired during their evolution, likely through an intermediate Chemurchek culture (see a recent discussion on the elusive Proto-Tocharians).

Haplogroup diversity seems to be common in Iron Age populations all over Eurasia, most likely due to the spread of different types of sociopolitical structures where alliances played a more relevant role in the expansion of peoples. A well-known example of this is the spread of Akozino warrior-traders in the whole Baltic region under a partial N1a-VL29-bottleneck associated with the emerging chiefdom-based systems under the influence of expanding steppe nomads.

Y-DNA haplogroups in West Eurasia during the Early Iron Age. See full map and see maps of cultures, ADMIXTURE, Y-DNA, and mtDNA of the Early Iron Age and Late Iron Age.

Surprisingly, then, Proto-Tocharians from Shirenzigou pack up to 74% Yamnaya ancestry, in spite of the 2,000 years that separate them from the demise of the Afanasevo culture. They show more Yamnaya ancestry than any other population by that time, being thus a sort of Late PIE fossils not only in their archaic dialect, but also in their genetic profile:


The recent intrusion of Corded Ware-like ancestry, as well as the variable admixture with Siberian and East Asian populations, both point to the known intense Old Iranian and Old/Middle Chinese contacts. The scarce Proto-Samoyedic and Proto-Turkic loans in Tocharian suggest a rather loose, probably more distant connection with East Uralic and Altaic peoples from the forest-steppe and steppe areas to the north (read more about external influences on Tocharian).

Interestingly, both R1b samples, MO12 and M15-2 – likely of Asian R1b-PH155 branch – show a best fit for Andronovo/Srubna + Hezhen/Ulchi ancestry, suggesting a likely connection with Iranians to the east of Xinjiang, who later expanded as the Wusun and Kangju. How they might have been related to Huns and Xiongnu individuals, who also show this haplogroup, is yet unknown, although Huns also show hg. R1a-Z93 (probably most R1a-Z2124) and Steppe_MLBA ancestry, earlier associated with expanding Iranian peoples of the Srubna-Andronovo horizon.

All in all, it seems that prehistoric movements explained through the lens of genetic research fit perfectly well the linguistic reconstruction of Proto-Indo-European and Proto-Uralic.


Iron Age Tocharians of Yamnaya ancestry from Afanasevo show hg. R1b-M269 and Q1a1

New open access Ancient Genomes Reveal Yamnaya-Related Ancestry and a Potential Source of Indo-European Speakers in Iron Age Tianshan, by Ning et al. Current Biology (2019).

Interesting excerpts (emphasis mine, changes for clarity):

Here, we report the first genome-wide data of 10 ancient individuals from northeastern Xinjiang. They are dated to around 2,200 years ago and were found at the Iron Age Shirenzigou site. We find them to be already genetically admixed between Eastern and Western Eurasians. We also find that the majority of the East Eurasian ancestry in the Shirenzigou individuals is related to northeastern Asian populations, while the West Eurasian ancestry is best presented by ∼20% to 80% Yamnaya-like ancestry. Our data thus suggest a Western Eurasian steppe origin for at least part of the ancient Xinjiang population. Our findings furthermore support a Yamnaya-related origin for the now extinct Tocharian languages in the Tarim Basin, in southern Xinjiang.


The dominant mtDNA lineages of the Shirenzigou people are commonly found in modern and ancient West Eurasian populations, such as U4, U5, and H, while they also have East Eurasian-specific haplogroups A, D4, and G3, preliminarily documenting admixed ancestry from eastern and western Eurasia.

The admixture profile is also shown on the paternal Y chromosome side that 4 out of 6 males in Shirenzigou (Figure S2) belong to the West Eurasian-specific haplogroup R1b (n = 2) and East Eurasian-specific haplogroup Q1a (n = 2), the former is predominant in ancient Yamnaya and nearly 100% in Afanasievo, different from the Middle and Late Bronze Age Steppe groups (Steppe_MLBA) such as Andronovo, [Potapovka], Srubnaya, and Sintashta whose Y chromosomal haplogroup is mainly R1a.



We first carried out principal component analysis (PCA) to assess the genetic affinities of the ancient individuals qualitatively by projecting them onto present-day Eurasian variation (Figure 2). We observed a distinct separation between East and West Eurasians. Our ancient Shirenzigou samples and present-day populations from Central Asia and northwestern China form a genetic cline from East to West in the first PC. The distribution of Shirenzigou samples on the cline is relatively scattered with two major clusters, one being closer to modern-day Uygurs and Kazakhs and the other being closer to recently published ancient Saka and Huns from the Tianshan in Kazakhstan (…).

We applied a formal admixture test using f3 statistics in the form of f3 (Shirenzigou; X, Y) where X and Y are worldwide populations that might be the genetic sources for the Shirenzigou individuals. We observed the most significant signals of admixture in the Shirenzigou samples when using Yamnaya_Samara or Srubnaya as the West Eurasian source and some Northern Asians or Koreans as the East Eurasian source (Table S1). We also plotted the outgroup f3 statistics in the form of f3 (Mbuti; X, Anatolia_Neolithic) and f3 (Mbuti; X, Kostenki14) to visualize the allele sharing between population X and Anatolian farmers. As shown in Figure S3, the Steppe_MLBA populations including Srubnaya, Andronovo, and Sintashta were shifted toward farming populations compared with Yamnaya groups and the Shirenzigou samples. This observation is consistent with ADMIXTURE analysis that Steppe_MLBA populations have an Anatolian and European farmer-related component that Yamnaya groups and the Shirenzigou individuals do not seem to have. The analysis consistently suggested Yamnaya-related Steppe populations were the better source in modeling the West Eurasian ancestry in Shirenzigou.

PCA and ADMIXTURE for Shirenzigou Samples. Modified from the original to include in black squares samples related to Yamnaya.

Genetic Composition of Iron Age Shirenzigou Individuals

We continued to use qpAdm to estimate the admixture proportions in the Shirenzigou samples by using different pairs of source populations, such as Yamnaya_Samara, Afanasievo, Srubnaya, Andronovo, BMAC culture (Bustan_BA and Sappali_Tepe_BA) and Tianshan_Hun as the West Eurasian source and Han, Ulchi, Hezhen, Shamanka_EN as the East Eurasian source. In all cases, Yamnaya, Afanasievo, or Tianshan_Hun always provide the best model fit for the Shirenzigou individuals, while Srubnaya, Andronovo, Bustan_BA and Sappali_Tepe_BA only work in some cases. The Yamnaya_Samara or Afanasievo-related ancestry ranges from ∼20% to 80% in different Shirenzigou individuals, consistent with the scattered distribution on the East-West cline in the PCA


(…) we then modeled Shirenzigou as a three-way admixture of Yamnaya_Samara, Ulchi (or Hezhen) and Han to infer the source from the East Eurasia side that contributed to Shirenzigou. We found the Ulchi or Hezhen and Han-related ancestry had a complicated and unevenly distribution in the Shirenzigou samples. The most Shirenzigou individuals derived the majority of their East Eurasian ancestry from Ulchi or Hezhen-related populations, while the following two individuals M820 and M15-2 have more Han related than Ulchi/Hezhen-related ancestry.

One important question remains, though: how and when did these Proto-Tocharian speakers migrate from the Afanasevo culture in the Altai into the Tarim Basin? The traditional answer, now more likely than ever, is through the Chemurchek culture. See e.g. A re-analysis of the Qiemu’erqieke (Shamirshak) cemeteries, Xinjiang, China, by Jia and Betts JIES (2010) 38(4).

Also, given the apparent lack of (extra farmer ancestry that characterizes) Corded Ware ancestry, if the results were already suspicious before, how likely are now the published R1a(xZ93) and/or radiocarbon dates of the Xiaohe mummies from Li et al. (2010, 2015)? Because, after all, one should have expected in such a late date a generalized admixture with neighbouring Srubna/Andronovo-like populations.


Sea Peoples behind Philistines were Aegeans, including R1b-M269 lineages

New open access paper Ancient DNA sheds light on the genetic origins of early Iron Age Philistines, by Feldman et al. Science Advances (2019) 5(7):eaax0061.

Interesting excerpts (modified for clarity, emphasis mine):

Here, we report genome-wide data from human remains excavated at the ancient seaport of Ashkelon, forming a genetic time series encompassing the Bronze to Iron Age transition. We find that all three Ashkelon populations derive most of their ancestry from the local Levantine gene pool. The early Iron Age population was distinct in its high genetic affinity to European-derived populations and in the high variation of that affinity, suggesting that a gene flow from a European-related gene pool entered Ashkelon either at the end of the Bronze Age or at the beginning of the Iron Age. Of the available contemporaneous populations, we model the southern European gene pool as the best proxy for this incoming gene flow. Last, we observe that the excess European affinity of the early Iron Age individuals does not persist in the later Iron Age population, suggesting that it had a limited genetic impact on the long-term population structure of the people in Ashkelon.

Ancient genomes (marked with color-filled symbols) projected onto the principal components inferred from present-day west Eurasians (gray circles). The newly reported Ashkelon populations are annotated in the upper corner.

Genetic discontinuity between the Bronze Age and the early Iron Age people of Ashkelon

In comparison to ASH_LBA, the four ASH_IA1 individuals from the following Iron Age I period are, on average, shifted along PC1 toward the European cline and are more spread out along PC1, overlapping with ASH_LBA on one extreme and with the Greek Late Bronze Age “S_Greece_LBA” on the other. Similarly, genetic clustering assigns ASH_IA1 with an average of 14% contribution from a cluster maximized in the Mesolithic European hunter-gatherers labeled “WHG” (shown in blue in Fig. 2B) (15, 22, 26). This component is inferred only in small proportions in earlier Bronze Age Levantine populations (2 to 9%).

In agreement with the PCA and ADMIXTURE results, only European hunter-gatherers (including WHG) and populations sharing a history of genetic admixture with European hunter-gatherers (e.g., as European Neolithic and post-Neolithic populations) produced significantly positive f4-statistics (Z ≥ 3), suggesting that, compared to ASH_LBA, ASH_IA1 has additional European-related ancestry.

We find that the PC1 coordinates positively correlate with the proportion of WHG ancestry modeled in the Ashkelon individuals, suggesting that WHG reasonably tag a European-related ancestral component within the ASH_IA1 individuals.

We plot the ancestral proportions of the Ashkelon individuals inferred by qpAdm using Iran_ChL, Levant_ChL, and WHG as sources ±1 SEs. P values are annotated under each model. In cases when the three-way model failed (χ2P < 0.05), we plot the fitting two-way model. The WHG ancestry is necessary only in ASH_IA1.

The best supported one (χ2P = 0.675) infers that ASH_IA1 derives around 43% of ancestry from the Greek Bronze Age “Crete_Odigitria_BA” (43.1 ± 19.2%) and the rest from the ASH_LBA population.

(…) only the models including “Sardinian,” “Crete_Odigitria_BA,” or “Iberia_BA” as the candidate population provided a good fit (χ2P = 0.715, 49.3 ± 8.5%; χ2P = 0.972, 38.0 ± 22.0%; and χ2P = 0.964, 25.8 ± 9.3%, respectively). We note that, because of geographical and temporal sampling gaps, populations that potentially contributed the “European-related” admixture in ASH_IA1 could be missing from the dataset.

The transient impact of the “European-related” gene flow on the Ashkelon gene pool

The ASH_IA2 individuals are intermediate along PC1 between the ASH_LBA ones and the earlier Bronze Age Levantines (Jordan_EBA/Lebanon_MBA) in the west Eurasian PCA (Fig. 2A). Notably, despite being chronologically closer to ASH_IA1, the ASH_IA2 individuals position closer, on average, to the earlier Bronze Age individuals.

See more information on Y-DNA SNP calls, including ASH067 as R1b-M269 (xL151).

The transient excess of European-related genetic affinity in ASH_IA1 can be explained by two scenarios. The early Iron Age European-related genetic component could have been diluted by either the local Ashkelon population to the undetectable level at the time of the later Iron Age individuals or by a gene flow from a population outside of Ashkelon introduced during the final stages of the early Iron Age or the beginning of the later Iron Age.

By modeling ASH_IA2 as a mixture of ASH_IA1 and earlier Bronze Age Levantines/Late Period Egyptian, we infer a range of 7 to 38% of contribution from ASH_IA1, although no contribution cannot be rejected because of the limited resolution to differentiate between Bronze Age and early Iron Age ancestries in this model.

Hg. R1b-M269 and the Aegean

I already predicted this relationship of Philistines and Aegeans (Greeks in particular) months ago, based on linguistics, archaeology, and phylogeography, although it was (and still is) yet unclear if these paternal lineages might have come from other nearby populations which might be descended from Common Anatolians instead, given the known intense contacts between Helladic and West Anatolian groups.

The alternative view: The Sea Peoples can be traced back to the Aegean, so they could also have consisted of Luwian petty kingdoms, who had formed an alliance and attacked Hatti from the south.

The deduction process for the Greek connection was quite simple:

Palaeo-Balkan populations

We know that R1b-Z2103 expanded with Yamna, including West Yamna settlers: they appear in Vučedol, which means they formed part of the earliest expansion waves of Yamna settlers into the Carpathian Basin, and they also appear scattered among Bell Beakers (apart from dominating East Yamna and Afanasevo), which suggests that they were possibly one of the most successful lineages during the late Repin/early Yamna expansion.

The “Steppe ancestry” associated with I2a-L699 samples among Balkan BA peoples may have also been associated with recent Bronze Age expansions, and this haplogroup’s presence among modern Balkan peoples may also suggest that it expanded with Palaeo-Balkan languages. Nevertheless, we don’t know which specific lineages and “Steppe ancestry” they represent, sadly.

These samples may well be related to remnants of previous Balkan populations like Cernavodă or Ezero, because there has been no peer-reviewed attempt at distinguishing Khvalynsk-/Novodanilovka- from Sredni Stog- from Yamnaya-related populations (see here), and some groups that are associated with this ancestry, like Corded Ware, are known to be culturally distinct from Yamna.

In any case, Proto-Greeks from the southern Balkans (say, Sitagroi IV and related groups) are probably going to show, based on Palaeo-Balkan substrate and Pre-Greek substrate and on the available Mycenaean samples, a process of decreasing proportion of R1b-Z2103 lineages relative to local ones, and a relatively similar cline of Yamna:EEF ancestry from northern to southern areas, at least in the periods closest to the Yamna expansion.

NOTE. The finding of “archaic” R1b-L389 (R1b-V1636) and R1a-M198 subclades among modern Greeks and the likely Neolithic origin of these paternal lineages around the Caucasus suggest that their presence in Greece may be from any of the more recent migrations that have happened between Anatolia and the Balkans, especially during the Common Era, rather than Indo-Anatolian migrations; probably very very recently.

Bronze Age cultures in the Balkans and the Aegean. See full map including ancient samples with Y-DNA, mtDNA, and ADMIXTURE.

Minoans and haplogroup J

In the Aegean, it is already evident that the population changed language partly through cultural diffusion, probably through elite domination of Proto-Greek speakers. Whether that happened before the invasion into the Greek Peninsula or after it is unclear, as we discussed recently, because we only have one reported Y-chromosome haplogroup among Mycenaeans, and it is J (probably continuing earlier lineages).

Now we have more samples from the so-called Emporion 2 cluster in Olalde et al. (2019), which shows Mycenaean-like eastern Mediterranean ancestry and 3 (out of 3) samples of haplogroup J, which – given the origin of the colony in Phocea – may be interpreted as the prevalence of West Anatolian-like ancestry and lineages in the eastern part of the Aegean (and possibly thus south Peloponnese), in line with the modern situation.

NOTE. It does not seem likely that those R or R1b-L23 samples from the Emporion 1 cluster are R1b-Z2103, based on their West European-like ancestry, although they still may be, because – as we know – ancestry (unlike haplogroup) changes too easily to interpret it as an ancestral ethnolinguistic marker.

PCA of ancient samples related to the Aegean, with Minoans, Mycenaeans (including the Emporion 2 cluster in the background) Anatolia N-Ch.-BA and Levantine BA-LBA populations, including Tel Shadud samples. See more PCAs of ancient Eurasian populations.

Greeks and haplogroup R1b-M269

Therefore, while the presence of R1b-Z2103 among ancient Balkan peoples connected to the Yamna expansion is clear, one might ask if R1b-Z2103 really spread up to the Peloponnese by the time of the Mycenaean Civilization. That has only one indirect answer, and it’s most likely yes.

We already had some R1b-Z2103 among Thracians and around the Armenoid homeland, which offers another clue at the migration of these lineages from the Balkans. The distribution of different “archaic” R1b-Z2103 subclades among modern Balkan populations and around the Aegean offered more support to this conclusion.

But now we have two interesting ancient populations that bear witness to the likely intrusion of R1b-M269 with Proto-Greeks:

An Ancient Greek of hg. R1b

A single ancient sample supports the increase in R1b-Z2103 among Greeks during the “Dorian” invasions that triggered the Dark Ages and the phenomenon of the Aegean Sea Peoples. It comes from a Greek lab study, showing R1b1b (i.e. R1b-P297 in the old nomenclature) as the only Y-chromosome haplogroup obtained from the sampling of the Gulf of Amurakia ca. 470-30 BC, i.e. before the Roman foundation of Nikopolis, hence from people likely from Anaktorion in Ancient Acarnania, of Corinthian origin.


Even with the few data available – and with the caution necessary for this kind of studies from non-established labs, which may be subject to many different kinds of errors – one could argue that the western Greek areas, which received different waves of migrants from the north and shows a higher distribution of R1b-Z2103 in modern times, was probably more heavily admixed with R1b-Z2103 than southern and eastern areas, which were always dominated by Greek-speaking populations more heavily admixed with locals.

The Dorian invasion and the Greek Dark Ages may thus account for a renewed influx of R1b-Z2103 lineages accompanying the dialects that would eventually help form the Hellenic Koiné. In a sense, it is only natural that demographically stronger populations around the Bronze Age Aegean would suffer a limited (male) population replacement with the succeeding invasions, starting with a higher genetic impact in the north-west and diminishing as they progressed to the south and the east, coupled with stepped admixture events with local populations.

This would be therefore the late equivalent of what happened at the end of the 3rd millennium BC, with Mycenaeans and their genetic continuity with Minoans.

Distribution of Pre-Greek place-names ending in -ssos/-ssa or -sos/-sa. See original images and more on the south/east cline distribution of Pre-Greek place-names here.

Sea peoples of hg. R1b-M269

Thanks to Wang et al. (2018) supplementary materials we knew that one of the two Levantine LBA II samples from Tel Shadud (final 13th–early 11th c. BC) published in van den Brink (2017) was of hg. R1b-M269 – in fact, the one interpreted as a Canaanite official residing at this site and emulating selected funerary aspects of Egyptian mortuary culture.

Both analyzed samples, this elite individual and a commoner of hg. J buried nearby, were genetically similar and indistinguishable from local populations, though:

Principal Components Analysis of L112 and L126 was carried out within the framework described in Lazaridis et al. (2016). This analysis showed that the two individuals cluster genetically, with similar estimated proportions of ancestry from diverse West Eurasian ancestral sources. These results are consistent with the hypothesis that they derive from the same population, or alternatively that they derive from two quite closely related populations.

We know that ancestry changes easily within a few generations, so there was not much information to go on, except for the fact that – being R1b-M269 – this individual could trace his paternal ancestor at some point to Proto-Indo-Europeans.

One might think that, because many haplogroups in this spreadsheet were wrong, this is also wrong; nevertheless, many haplogroups are correctly identified by Yleaf, and finding R1b-M269 in the Levant after the expansion of Sea Peoples could not be that surprising, because they were most likely related to populations of the Aegean Sea. Any other related hg. R1b (R1b-M73, R1b-V88, even R1b-V1636) wouldn’t fit as well as R1b-M269.


However, the early expansion of Proto-Indo-Aryans into the Middle East, as well as the later expansion of Armenians from the Balkans through Anatolia and of West Iranians from the east may have all potentially been related to this sample. But still, the previous linguistic and archaeological theories concerning the Philistines and the expansion of Sea Peoples in the Levant made this sample a likely (originally) Greek “Dorian” lineage, rather than the other (increasingly speculative) alternatives.

In any case, it was obvious to anyone – that is, to anyone with a minimum knowledge of how population genomics works – that just the two samples from van den Brink (2017) couldn’t be used to get to any conclusions about the ancestral origin of these individuals (or their differences) beyond Levantine peoples, because their ancestry was essentially (i.e. statistically) the same as the other few available ancient samples from nearby regions and similar periods.

If anything, the PCA suggested an origin of the R1b sample closer to Aegean populations relative to the J individual (see PCA above), and this should have been supported also by amateur models, without any possible confirmation (as with the ASH_IA2 cluster in this paper). However, if you have followed online discussions of Tel Shadud R1b-M269 sample since it was mentioned first on Eupedia months ago – including another wave of misguided speculation based on the ancestry of both individuals triggered by a discussion on this blog -, you have once more proof of how misleading ancestry analyses can be in the wrong hands.

NOTE. This is the Nth proof (and that only in 2019) of how it’s best to just avoid amateur analyses and interpretations altogether, as I did in the recent publication of the books. All those who didn’t take into account whatever was commented about the ancestry of these samples haven’t lost a single bit of relevant information on Levantine peoples, and have had more time for useful reads, compared to those dedicated to endless void speculation, once again gone awfully wrong, as does everything related to cocky ancient DNA crackpottery 😉

Late Bronze Age population movements in the Eastern Mediterranean and the Middle East. See full map including ancient DNA samples with Y-DNA, mtDNA, and ADMIXTURE.

Admittedly, though, even accepting the evident Mediterranean origin of this lineage, one could have argued that this sample may have been of R1b-L151 subclade, if one were inclined to support the theory that Italic peoples were behind Sea Peoples expanding east – and consequently that the ancestors of Etruscans had migrated eastward into the Aegean (e.g. into Lemnos), so that it could be asserted that Tyrsenian might have been a remnant language of an ancient population of northern Italy.


Fortunately, some of the samples recovered in Feldman et al. (2019) that could be analyzed (those of the cluster ASH_IA1) offer a very specific time frame where European ancestry appeared (ca. 1250 BC) before it subsequently became fully diluted (as seen in cluster ASH_IA2) among the prevalent Levantine ancestry of the area.

Also fortunately, this precise cluster shows another R1b-M269 sample, likely R1b-Z2103 (because it is probably xL151), and this sample together with others from the same cluster prove that the ancestry related to the original southern European incomers was:

  1. Recent, related thus to LBA population movements, as expected; and
  2. More closely related to coeval Aegeans, including Mycenaeans with Steppe-related ancestry.

NOTE. I say “fortunately” because, as you can imagine if you have dealt with amateurish discussions long enough, without this cluster with evident Aegean ancestry and the R1b-M269 (Z2103) sample precisely associated to it, some would enter again in endless comment loops created by ancestry magicians, showing how Aegean peoples were not behind Sea Peoples, or not behind Philistines, or not behind the R1b-M269 among Philistines, depending on their specific agendas.

Map of the Sea People invasions in the Aegean Sea and Eastern Mediterranean at the end of the Late Bronze Age (blue arrows).. Some of the major cities impacted by the raids are denoted with historical dates. Inland invasions are represented by purple arrows. From Kaniewski et al. (2011). Some of the major cities impacted by the raids are denoted with historical dates. Inland invasions are represented by purple arrows.

The results of the paper don’t solve the question of the exact origin of all Sea Peoples (not even that of Philistines), but it is quite clear that most of those forming this seafaring confederation must have come from sites around the Aegean Sea. This supports thus the traditional origin attributed to them, including a hint at the likely expansion of Eastern Mediterranean ancestry and lineages into the Italian Peninsula precisely from the Aegean, as some oral communications have already disclosed.

As an indirect conclusion from the findings in this paper, then, we can now more confidently support that Tyrsenian speakers most likely expanded into the Appenines and the Alps originally from a Tyrsenian-speaking LBA population from Lemnos, due to the social unrest in the whole Aegean region, and might have become heavily admixed with local Italic peoples quite quickly, as it happened with Philistines, resulting in yet another case of language expansion through (the simplistically called) elite domination.


Even more interesting than these specific findings, this paper confirms yet another hypothesis based on phylogeography, and proves once again two important starting points for ancient DNA interpretation that I have discussed extensively in this blog:

  • The rare R1b-M269 Y-chromosome lineage of Tel Shadud offered ipso facto the most relevant clue about the ancestral geographical origin of this Canaanite elite male’s paternal family, most likely from the north-west based on ancient phylogeography, which indirectly – in combination with linguistics and archaeology – supported the ancestral ethnolinguistic identification of Philistines with the Aegean and thus with (a population closest to) Ancient Greeks.
  • Ancestry analyses are often fully unreliable when assessing population movements, especially when few samples from incomplete temporal-geographical transects are assessed in isolation, because – unlike paternal (and maternal) haplogroups – ancestry might change fully within a few generations, depending on the particular anthropological setting. Their investigation is thus bound by many limitations – of design, statistical, and anthropological (i.e. archaeological and linguistic) – which are quite often not taken into account.

These cornerstones of ancient DNA interpretation have been already demonstrated to be valid not only for Levantine populations, as in this case, but also for Balkan peoples, for Bell Beakers, for steppe populations (like Khvalynsk, Sredni Stog, Yamna, Corded Ware), for Basques, for Balto-Slavs, for Ugrians and Samoyeds, and for many other prehistoric peoples.

I rest my case.


Yekaterinovsky Cape, a link between the Samara culture and early Khvalynsk


We already had conflicting information about the elite individual from the Yekaterinovsky Cape and the materials of his grave, which seemed quite old:

For the burial of 45 in the laboratory of the University of Pennsylvania, a 14C date was obtained: PSUAMS-2880 (Sample ID 16068)> 30 kDa gelatin Russia. 12, Ekaterinovka Grave 45 14C age (BP) 6325 ± 25 δ 13C (‰) –23.6 δ15 N (‰) 14.5. The results of dating suggest chronological proximity with typologically close materials from Yasinovatsky and Nikolsky burial grounds (Telegini et al. 2001: 126). The date obtained also precedes the existing dates for the Khvalynsk culture (Morgunova 2009: 14–15), which, given the dominance of Mariupol traits of the burial rite and inventory, confirms its validity. However, the date obtained for human bones does not exclude the possibility of a “reservoir effect” when the age can increase three or more centuries (Shishlin et al. 2006: 135–140).

Now the same date is being confirmed by the latest study published on the site, by Korolev, Kochkina, and Stachenkov (2019) and it seems it is really going to be old. Abstract (in part the official one, in part newly translated for clarity):

For the first time, pottery of the Early Eneolithic burial ground Ekaterinovsky Cape is published. Ceramics were predominantly located on the sacrificial sites in the form of compact clusters of fragments. As a rule, such clusters were located above the burials, sometimes over the burials, some were sprinkled with ocher. The authors have identified more than 70 vessels, some of which have been partially reconstructed. Ceramic was made with inclusion of the crushed shell into molding mass. The rims of vessels had the thickened «collar»; the bottoms had a rounded shape. The ornament was located on the rims and the upper part of the potteries. Fully decorated vessels are rare. The vessels are ornamented with prints of comb and rope stamps, with small pits. A particularity of ceramics ornamentation is presented by the imprints of soft stamps (leather?) or traces of leather form for the making of vessels. The ornamentation, made up of «walking comb» and incised lines, was used rarely as well as the belts of pits made decoration under «collar» of a rim. Some features of the ceramics decoration under study relate it with ceramics of the Khvalynsk culture. The ceramics of Ekaterinovsky Cape burial ground is attributed by the authors to the Samara culture. The ceramic complex under study has proximity to the ceramics from Syezzhe burial ground and the ceramics of the second phase of Samara culture. The chronological position is determined by the authors as a later period than the ceramics from the Syezzhe burial ground, and earlier than the chronological position of ceramics of the Ivanovka stage of the Samara culture and the Khvalynsk culture.

Ceramics from Ekaterinovsky Cape burial ground. 1–2, 4–5, 7–11 – ceramics from aggregations; 3, 6 – ceramics from the cultural layer.

More specifically:

Based on ceramic fragments from a large vessel from a cluster of sq.m. 14, the date received was: SPb-2251–5673 ± 120 BP. The second date was obtained in fragments from the aggregation [see picture above] from the cluster of sq.m. 45–46: SPb-2252–6372 ± 100 BP. The difference in dating indicates that the process of determining the chronology of the burial ground is far from complete, although we note that the earlier date almost coincided with the date obtained from the human bone from individual 45 (Korolev, Kochkina, Stashenkov, 2018, p. 300).

Therefore, the ceramics of the burial ground Ekaterinovsky Cape possess an originality that determines the chronological position of the burial ground between the earliest materials of the burial type in Syezzhe and the Khvalynsk culture. Techno-typological features of dishes make it possible to attribute it to the Samara culture at the stage preceding the appearance of Ivanovska-Khvalynsk ceramics.

It seems that this site showed cultural influences from the upstream region near the Kama-Vyatka interfluve, too, according to Korolev, Kochkina, Stashenkov, and Khokhlov (2018):

In 2017, excavation of burial ground Ekaterinovsky Cape were continued, located in the area of the confl uence of the Bezenchuk River in the Volga River. During the new excavations, 14 burials were studied. The skeleton of the buried were in a position elongated on the back, less often – crooked on the back with knees bent at the knees. In one burial (No. 90), a special position of the skeleton was recorded. In the burial number 90 in the anatomical order, parts of the male skeleton. This gave grounds for the reconstruction of his original position in a semi-sitting position with the support of elbows on the bottom of the pit. Noteworthy inventory: on the pelvic bones on the left lay a bone spoon, near the right humerus, the pommel of a cruciform club was found. A conclusion is made about the high social status of the buried. The results of the analysis of the burial allow us to outline the closest circle of analogies in the materials of Khvalynsky I and Murzikhinsky burial grounds.

Important sites mentioned in both papers and in this text:

To sum up, it seems that the relative dates we have used until now have to be corrected: older Khvalynsk I Khvalynsk II individuals, supposedly dated ca. 5200-4000 BC (most likely after 4700 BC), and younger Yekaterinovsky individuals, supposedly of the fourth quarter of the 5th millennium (ca. 4250-4000 BC), are possibly to be considered, in fact, roughly reversed, if not chronologically, at least culturally speaking.

Interestingly, this gives a new perspective to the presence of a rare fish- or reptile-headed pommel-scepter, which would be natural in a variable period of expansion of the horse and horse-related symbolism, a cultural trait rooted in the Samara culture attested in Syezzhe before the unification of the symbol of power under the ubiquitous Khvalynsk-Suvorovo horse-headed scepters and related materials.

Ekaterinovsky Cape Burial Ground. Inventory of the burial no 90: 1, 2 – stone pommel of the mace; 3, 4 – bone article.

The Khvalynsk chieftain

If the reported lineages from Yekaterinovsky Cape are within the R1b-P297 tree, but without further clades, as Yleaf comparisons may suggest, there is not much change to what we have, and R1b-M269 could actually represent a part of the local population, but also incomers from the south (e.g. the north Caspian steppe hunter-gatherers like Kairshak), the east (with hunter-gatherer pottery), or the west near the Don River (in contact with Mariupol-related cultures, as the authors inferred initially from material culture).

Just like R1a-M417 became incorporated into the Sredni Stog groups after the Novodanilovka-Suvorovo expansion, probably as incoming hunter-gatherer pottery groups from the north admixing with peoples of “Steppe ancestry”, R1b-M269 lineages might have expanded explosively only during the Repin expansion, and maybe (like R1b-L51 later) they formed just a tiny part of the clans that dominated the steppe during the Khvalynsk-Novodanilovka community.

On the other hand, the potential finding of various R1b-M269/L23 samples in Yekaterinovsky Cape (including an elite individual) would suggest now, as it was supported in the original report by Mathieson et al. (2015), that these ancient R1b lineages found in the Volga – Ural region are in fact most likely all R1b-M269 without enough coverage to obtain proper SNP calls, which would simplify the picture of Neolithic expansions (yet again). From the supplementary materials:

10122 / SVP35 (grave 12). Male (confirmed genetically), age 20-30, positioned on his back with raised knees, with 293 copper artifacts, mostly beads, amounting to 80% of the copper objects in the combined cemeteries of Khvalynsk I and II. Probably a high-status individual, his Y-chromosome haplotype, R1b1, also characterized the high-status individuals buried under kurgans in later Yamnaya graves in this region, so he could be regarded as a founder of an elite group of patrilineally related families. His MtDNA haplotype H2a1 is unique in the Samara series.

Khvalynsk cemetery and grave gifts. Grave 90 contained copper beads and rings, a harpoon, flint blades, and a bird-bone tube. Both graves (90 and 91) were partly covered by Sacrificial Deposit 4 with the bones from a horse, a sheep, and a cow. Center: grave goods from the Khvalynsk cemetery-copper rings and bracelets, polished stone mace heads, polished stone bracelet, Cardium shell ornaments, boars tusk chest ornaments, flint blades, and bifiacial projectile points. Bottom: shell-tempered pottery from the Khvalynsk cemetery. After Agapov, Vasiliev, and Pestrikova 1990; and Ryndina 1998, Figure 31. Modified from Anthony (2007).

This remarkable Khvalynsk chieftain, whose rich assemblage may correspond to the period of domination of the culture all over the Pontic-Caspian steppes, has been consistently reported as of hg. R1b-L754 in all publications, including Wang et al. (2018/2019) tentative SNP calls in the supplementary materials (obtained with Yleaf, as the infamous Narasimhan et al. 2018 samples), but has been variously reported by amateurs as within the R1b-M73, R1b-V88, or (lately) R1b-V1636 trees, which makes it unlikely that quality of the sample is allowing for a proper SNP call.

The fact that Mathieson et al. (2015) considered it a member of the R1b-M269 clans appearing later in Yamna seems on point right now, especially if samples from Yekaterinovka are all within this tree. The relevance of R1b-L23 in the expansion of Repin and Yamna is reminiscent of the influence of successful clans among Yamna offshoots, such as Bell Beakers, and among Bell Beaker offshoots during the Bronze Age all over Europe.

Taking these younger expansions as example, it seems quite likely based on cultural links that (at least part of) the main clans of Khvalynsk were of R1b-M269 lineage, stemming from a R1b-dominated Samara culture, in line with the known succeeding expansions and the expected strictly patriarcal and patrilineal society of Proto-Indo-Europeans, which would have exacerbated the usual reduction in Y-chromosome haplogroup variability that happens during population expansions, and the aversion towards foreign groups while the culture lasted.

Cultures of the Pontic-Caspian steppes and forest-steppes and surrounding areas during the Neolithic.

The finding of R1b-L23 in Yekaterinovka, associated with the Samara culture, before or during the Khvalynsk expansion, and close to the Khvalynsk site, would make this Khvalynsk chieftain most likely a member of the M269 tree (paradoxically, the only R1b-L754 branch amateurs have not yet reported for it). Similarly, the sample of a “Samara hunter-gatherer” of Lebyazhinka, of hg. R1b-P297, could also be under this tree, just like most R1b-M269 from Yamna are downstream from R1b-L23, and most reported R1b-M269 or R1b-L23 from Bell Beakers are under R1b-L151.

On the other hand, we know of the shortcomings of attributing a haplogroup expansion to the best known rulers, such as the famous lineages previously wrongly attributed to Niall of the Nine Hostages or Genghis Khan. The known presence of R1b-V1636 up to modern Greeks would be in line with an ancient steppe expansion that we know will show up during the Neolithic, although it could also be a sign of a more recent migration from the Caucasus. The presence of a sister clade of R1b-L23, R1b-PF7562, among modern Balkan populations, may also be attributed to a pre-Yamna steppe expansion.

Y-DNA samples from Khvalynsk and neighbouring cultures. See full version here.

On SNP calls

I reckon that even informal reports on SNP calls, like any other analyses, should be offered in full: not only with a personal or automatic estimation of the result, but with a detailed explanation of the good, dubious, and bad calls, alternatives to that SNP estimation, and a motivated reasoning of why one branch should be preferred over others. Downloading a sample and giving an instruction using a free software tool is never enough, as it became crystal clear recently for the hilariously biased and flawed qpAdm reports on Dutch Bell Beakers as the ‘missing link’ between Corded Ware and Bell Beakers…

Another example I can recall is the report of a R1a-Z93 subclade in the R1a-M417 sample ca. 4000 BC from Alexandria, which seems rather unlikely, seeing how this subclade must have split and expanded explosively with R1a-Z645 to the east with eastern Corded Ware groups, i.e. 1,000 years later, just like Z282 lineages expanded mainly to the north-east. But then again, as with the Khvalynsk chieftain, I have only seen indirect reports of that supposed SNP (including Y26+!), so we should just stick with its officially reported R1a-M417 lineage. This upstream haplogroup was, in fact, repeated with Yleaf’s tentative estimates in Wang et al. (2019) supplementary materials…

The combination of inexperienced, biased, or simply careless design, analyses, and reports, including SNP calls and qpAdm analyses (whether in forums or publications), however well-intentioned (or not) they might be, are hindering a proper analysis of data, adding to the difficulties we already have due to the scarcity of samples, their limited coverage, and the lack of proper context.

Some people like to repeat ad nauseam that archaeology and/or linguistics are ‘not science’ whenever they don’t fit their beliefs and myths based on haplogroup and/or ancestry. But it’s becoming harder and harder to rely on certain genetic data, too, and on their infinite changing interpretations, much more than it is to rely on linguistic and archaeological research, including data, assessments, and discussions that are open for anyone to review…if one is truly interested in them.

Iberia: East Bell Beakers spread Indo-European languages; Celts expanded later


New paper (behind paywall), The genomic history of the Iberian Peninsula over the past 8000 years, by Olalde et al. Science (2019).

NOTE. Access to article from Reich Lab: main paper and supplementary materials.


We assembled genome-wide data from 271 ancient Iberians, of whom 176 are from the largely unsampled period after 2000 BCE, thereby providing a high-resolution time transect of the Iberian Peninsula. We document high genetic substructure between northwestern and southeastern hunter-gatherers before the spread of farming. We reveal sporadic contacts between Iberia and North Africa by ~2500 BCE and, by ~2000 BCE, the replacement of 40% of Iberia’s ancestry and nearly 100% of its Y-chromosomes by people with Steppe ancestry. We show that, in the Iron Age, Steppe ancestry had spread not only into Indo-European–speaking regions but also into non-Indo-European–speaking ones, and we reveal that present-day Basques are best described as a typical Iron Age population without the admixture events that later affected the rest of Iberia. Additionally, we document how, beginning at least in the Roman period, the ancestry of the peninsula was transformed by gene flow from North Africa and the eastern Mediterranean.

Interesting excerpts:

From the Bronze Age (~2200–900 BCE), we increase the available dataset (6, 7, 17) from 7 to 60 individuals and show how ancestry from the Pontic-Caspian steppe (Steppe ancestry) appeared throughout Iberia in this period (Fig. 1, C and D), albeit with less impact in the south (table S13). The earliest evidence is in 14 individuals dated to ~2500–2000 BCE who coexisted with local people without Steppe ancestry (Fig. 2B). These groups lived in close proximity and admixed to form the Bronze Age population after 2000 BCE with ~40% ancestry from incoming groups (Fig. 2B and fig. S6).

Y-chromosome turnover was even more pronounced (Fig. 2B), as the lineages common in Copper Age Iberia (I2, G2, and H) were almost completely replaced by one lineage, R1b-M269. These patterns point to a higher contribution of incoming males than females, also supported by a lower proportion of nonlocal ancestry on the X-chromosome (table S14 and fig. S7), a paradigm that can be exemplified by a Bronze Age tomb from Castillejo del Bonete containing a male with Steppe ancestry and a female with ancestry similar to Copper Age Iberians.


For the Iron Age, we document a consistent trend of increased ancestry related to Northern and Central European populations with respect to the preceding Bronze Age (Figs. 1, C and D, and 2B). The increase was 10 to 19% (95% confidence intervals given here and in the percentages that follow) in 15 individuals along the Mediterranean coast where non-Indo-European Iberian languages were spoken; 11 to 31% in two individuals at the Tartessian site of La Angorrilla in the southwest with uncertain language attribution; and 28 to 43% in three individuals at La Hoya in the north where Indo-European Celtiberian languages were likely spoken (fig. S6 and tables S11 and S12).

This trend documents gene flow into Iberia during the Late Bronze Age or Early Iron Age, possibly associated with the introduction of the Urnfield tradition (18). Unlike in Central or Northern Europe, where Steppe ancestry likely marked the introduction of Indo-European languages (12), our results indicate that, in Iberia, increases in Steppe ancestry were not always accompanied by switches to Indo-European languages.

I think it is obvious they are extrapolating the traditional (not that well-known) linguistic picture of Iberia during the Iron Age, believing in continuity of that picture (especially non-Indo-European languages) during the Urnfield period and earlier.

What this data shows is, as expected, the arrival of Celtic languages in Iberia after Bell Beakers and, by extension, in the rest of western Europe. Somewhat surprisingly, this may have happened during the Urnfield period, and not during the La Tène period.

Also important are the precise subclades:

We thus detect three Bronze Age males who belonged to DF27 (154, 155), confirming its presence in Bronze Age Iberia. The other Iberian Bronze Age males could belong to DF27 as well, but the extremely low recovery rate of this SNP in our dataset prevented us to study its true distribution. All the Iberian Bronze Age males with overlapping sequences at R1b-L21 were negative for this mutation. Therefore, we can rule out Britain as a plausible proximate origin since contemporaneous British males are derived for the L21 subtype.

New open access paper Survival of Late Pleistocene Hunter-Gatherer Ancestry in the Iberian Peninsula, by Villalba-Mouco et al. Cell (2019):

BAL0051 could be assigned to haplogroup I1, while BAL003 carries the C1a1a haplogroup. To the limits of our typing resolution, EN/MN individuals CHA001, CHA003, ELT002 and ELT006 share haplogroup I2a1b, which was also reported for Loschbour [73] and Motala HG [13], and other LN and Chalcolithic individuals from Iberia [7, 9], as well as Neolithic Scotland, France, England [9], and Lithuania [14]. Both C1 and I1/ I2 are considered typical European HG lineages prior to the arrival of farming. Interestingly, CHA002 was assigned to haplogroup R1b-M343, which together with an EN individual from Cova de Els Trocs (R1b1a) confirms the presence of R1b in Western Europe prior to the expansion of steppe pastoralists that established a related male lineage in Bronze Age Europe [3, 6, 9, 13, 19]. The geographical vicinity and contemporaneity of these two sites led us to run genomic kinship analysis in order to rule out any first or second degree of relatedness. Early Neolithic individual FUC003 carries the Y haplogroup G2a2a1, commonly found in other EN males from Neolithic Anatolia [13], Starçevo, LBK Hungary [18], Impressa from Croatia and Serbia Neolithic [19] and Czech Neolithic [9], but also in MN Croatia [19] and Chalcolithic Iberia [9].

See also

Ahead of the (Indo-European – Uralic) game: in theory and in numbers


There is a good reason for hope, for those who look for a happy ending to the revolution of population genomics that is quickly turning into an involution led by beliefs and personal interests. This blog is apparently one of the the most read sites on Indo-European peoples, if not the most read one, and now on Uralic peoples, too.

I’ve been checking the analytics of our sites, and judging by the numbers of the English blog, (without the other languages) is quickly turning into the most visited one from Academia Prisca‘s sites on Indo-European languages, beyond (and its parent sites in other languages), which host many popular files for download.

If we take into account file downloads (like images or PDFs), and not only what Google Analytics can record, has not more users than all other websites of Academia Prisca, but at this pace it will soon reach half the total visits, possibly before the end of 2019.

Overall, we have evolved from some 10,000 users/year in 2006 to ~300,000 active users/year and >1,000,000 page+file views/year in 2018 (impossible to say exactly without spending too much time on this task). Nothing out of the ordinary, I guess, and obviously numbers are not a quality index, but rather a hint at increasing popularity of the subject and of our work.

NOTE. The mean reading time is ~2:40 m, which I guess fits the length of most posts, and most visitors read a mean of ~2+ pages before leaving, with increasing reader fidelity over time.

Number of active users of, according to Google Analytics since before the start of the new blog. Notice the peaks corresponding to the posts below (except the last one, corresponding to the publication of A Song of Sheep and Horses).

The most read posts of 2018, now that we can compare those from the last quarter, are as follows:

  1. – The series on the Corded Ware-Uralic theory, with a marked increase in readers, especially with the last three posts:
    1. Finno-Permic and the expansion of N-L392/Siberian ancestry,
    2. “Siberian ancestry” and Ugric-Samoyedic expansions, and
    3. Haplogroups R1a and N in Finno-Ugric and Samoyedic
  2. Haplogroup is not language, but R1b-L23 expansion was associated with Proto-Indo-Europeans
  3. The history of the simplistic ‘haplogroup R1a — Indo-European’ association
  4. On the origin of haplogroup R1b-L51 in late Repin / early Yamna settlers
  5. On the origin and spread of haplogroup R1a-Z645 from eastern Europe
  6. The Caucasus a genetic and cultural barrier; Yamna dominated by R1b-M269; Yamna settlers in Hungary cluster with Yamna
  7. Something is very wrong with models based on the so-called ‘Yamnaya admixture’ – and archaeologists are catching up (II)
  8. Olalde et al. and Mathieson et al. (Nature 2018): R1b-L23 dominates Bell Beaker and Yamna, R1a-M417 resurges in East-Central Europe during the Bronze Age
  9. Early Indo-Iranian formed mainly by R1b-Z2103 and R1a-Z93, Corded Ware out of Late PIE-speaking migrations
  10. “Steppe ancestry” step by step: Khvalynsk, Sredni Stog, Repin, Yamna, Corded Ware

NOTE. Of course, the most recent posts are the most visited ones right now, but that’s because of the constant increase in the number of visitors.

I think it is obvious what the greatest interest of readers has been in the past two years. You can see the pattern by looking at the most popular posts of 2017, when the blog took off again:

  1. Germanic–Balto-Slavic and Satem (‘Indo-Slavonic’) dialect revisionism by amateur geneticists, or why R1a lineages *must* have spoken Proto-Indo-European
  2. The renewed ‘Kurgan model’ of Kristian Kristiansen and the Danish school: “The Indo-European Corded Ware Theory”
  3. The new “Indo-European Corded Ware Theory” of David Anthony
  4. Correlation does not mean causation: the damage of the ‘Yamnaya ancestral component’, and the ‘Future American’ hypothesis
  5. The Aryan migration debate, the Out of India models, and the modern “indigenous Indo-Aryan” sectarianism

The most likely reason for the radical increase in this blog’s readership is very simple, then: people want to know what is really happening with the research on ancestral Indo-Europeans and Uralians, and other blogs and forums are not keeping up with that demand, being content with repeating the same ideas again and again (R1a-CWC-IE, R1b-BBC-Vasconic, and N-Comb Ware-Uralic), despite the growing contradictions. As you can imagine, once you have seen the Yamna -> Bell Beaker migration model of North-West Indo-European, with Corded Ware obviously representing Uralic, you can’t unsee it.

The online bullying, personal attacks, and similar childish attempts to silence those who want to talk about this theory elsewhere (while fringe theories like R1a/CHG-OIT, R1b-Vasconic, or the Anatolian/Armenian-CHG hypotheses, to name just a few, are openly discussed) has had, as could be expected, the opposite effect to what was intended. I guess you can say this blog and our projects have profited from the first relevant Streisand effect of population genomics, big time.

If this trend continues this year (and other bloggers’ or forum users’ faith in miracles is not likely to change), I suppose that after the Yamna Hungary samples are published (with the expected results) this blog is going to be the most read in 2020 by a great margin… I can only infer that this tension is also helping raise the interest in (and politicization of) the question, hence probably the overall number of active users and their participation in other blogs and forums is going to increase everywhere in 2019, too, as this debate becomes more and more heated.

So, what I infer from the most popular posts and the numbers is that people want criticism and controversy, and if you want blood you’ve got it. Here it is, my latest addition to the successful series criticizing the “Corded Ware/R1a–Indo-European” pet theories, a post I wrote two-three months ago, slightly updated with the newest comedy, and a sure success for 2019 (already added to the static pages of the menu):

The “Indo-European Corded Ware theory” doesn’t hold water

This is how I feel when I see spikes in visits with more and more returning users linked to my controversial posts 😉

Are you not entertained?! Are you not entertained?! Is this not why you are here?!