I have tried running supervised ADMIXTURE models by selecting distant populations based on PCAs and qpAdm results. The most accurate approximations to what the software should offer appear with a small K number, between K=5 and K=7, whether supervised or unsupervised, and adding more ancestral populations gives some weird results the more distant (in time) populations are from these selected samples.
Labels for ancestral components are used following those commonly referred to in the literature, although supervised ADMIXTURE using corresponding available samples (viz. Anatolia Neolithic for AHG, Iran Hotu and/or CHG for IHG, AG2, AG3 and Mal’ta for ANE, etc.) offer slightly different, less smooth outputs for some periods, especially among more recent populations.
Outputs depend on many different factors, and these files are intended as an overview of the evolution of these simplistic components. The number of available samples per period, the potential ancestry changes within each conventionally selected period, or whether or not each available sample is representative of the territory they were recovered from, among many other factors, influence the outputs and the maps.
NOTE. In summary, ADMIXTURE results like these below might be used to develop new ideas, to be then formally tested; they cannot be used to support anything. Don’t be like the Copenhagen group, randomly selecting “Steppe ancestry” with K=4, identifying this component as “Indo-Europeans”, and correlating its evolution with changes in vegetation composition in yet another obvious correlation = causation argument among many confounding factors left unaccounted for…
Static ADMIXTURE + culture maps
Colours correspond to the components as labelled in the video and in the files below.
The following maps offer natural neighbour interpolations of ancestral components in ancient DNA samples grouped by periods (conventionally selected following the same pattern as in the Prehistory Atlas).
Extrapolation (inferred ancestry beyond the frame created by available samples per map) is obtained by adding distant external locations (such as Greenland, Arctic, Alaska…) with a value of 0.
Videos offer a dynamic timeline.
Click on the images to see a version with higher resolution.
This ancestry peaks among Baikal HG, Ust’Belaya, Nganasans, or Ulchi, hence the different labels used.
Iran HG ancestry
ADMIXTURE maps by period
Click on each image for a higher resolution version.
Early Bronze Age
Middle Bronze Age
Late Bronze Age
Early Iron Age
Late Iron Age
These are the samples used for interpolations in each period (except for modern populations, which are those included in the Reich Lab curated dataset):
Lower Danube and Balkan cultures affected by Anatolian- and steppe-related (i.e. Khvalynsk-Novodanilovka) migrations.
This multiethnic interaction of the western steppe fits therefore the complex archaeological description of events in the North Pontic, Lower Danube, and Dnieper-Dniester regions. Here are some interesting samples related to those long-lasting contacts:
1. I3719 (mtDNA H1, Y-DNA I2a2a) Ukraine Neolithic sample from Dereivka ca. 4949–4799 BC, described in Mathieson et al. (2018) as of “entirely northwestern-Anatolian-Neolithic-related ancestry”.
3. The Yamna Bulgaria outlier (Y-DNA I2a2a1b1b), 3012-2900 calBCE, shows apparently an admixture with cultures of that region (but 1,500 years later).
Trypillia and Corded Ware
4. There is one ‘Trypillia outlier’ among five samples from the Verteba cave in Wang et al. (2018): I1927 (Y-DNA G2a2b2a1a1b1a1a1, mtDNA H1b), ca. 3619-2936 BC, a sample published previously in Nikitin et al. (2017) and Mathieson et al. (2017). We were very quick to dismiss Trypillia (three samples of haplogroup G2a, one sample E) and GAC as a source of Corded Ware admixture, but archaeology clearly shows important population movements at the end of the fourth millennium between late Trypillia groups, GAC, and post-Sredni Stog populations, and genetics is showing that in both cultures, too.
I am not a fan of the ‘lack of samples’ argument, but (similar to Old Hittite samples related to all Anatolian speakers) one site is not enough to describe a culture that spanned millennia and many different early and late groups. One among five Trypillian samples (from a single site), showing a late date (ca. 3228 BC) compared to the other samples (ca. 3700 BC), and quite close to the only three Ukraine Eneolithic samples we have may mean much more than what we may a priori think, i.e. some simplistic unidirectional punctual ‘intrusion’ of steppe ancestry, and instead hint at the known close contacts of late Trypillian groups and North Pontic cultures, including also the Caucasus.
NOTE. The big difference in PCA among GAC-like Hungary LCA – EBA samples (see above two star symbols close to Ukraine Neolithic outlier in the PCA, in contrast with the other three at the bottom) may also be significant, although we don’t have any data about their culture, sites, or the relationship between them.
Greece Neolithic outlier: Proto-Anatolians?
5. Especially interesting is I6423, one of the Greece Neolithic samples referred to in Wang et al. (2018), which is obviously an outlier among the three used in the paper. It does not seem to correspond to any of the ancient DNA samples published to date; it is not in Hofmanova et al. (2016), in Lazaridis et al. (2017), or in Mathieson et al. (2018).
Since the Neolithic in Greece could mean any period from ca. 6500 BC to ca. 3200 BC, I guess we are talking here about some migration related to the expansion of Khvalynsk-Novodanilovka chieftains after ca. 4500 BC, because it appears on the PCA precisely on the same spot as Varna and Smyadovo outliers, and its ADMIXTURE shows similar components…
So, this may be the smoking gun of Proto-Anatolian (or maybe early Common Anatolian) expansion with steppe migrants up to the border of Western Anatolia, and we may be able to get rid of those unfounded doubts about Anatolian origins once and for all…
NOTE. Also interesting seems another Greece Neolithic sample, I6420, in ADMIXTURE, although its position in the PCA (near Minoans and Mycenaeans) does not necessarily point to potential steppe influence, but rather to the extra ‘eastern (Caucasus/Iran-related) ancestry’ contribution found in Minoans and in Mycenaeans (and Anatolia Chalcolithic) compared to previous samples of the region. The third Greece Nelithic sample, I5427 (mtDNA K1a24), from Diros, Alepotrypa Cave, is dated 6005-5879 BC (mean 5892 BC), and appeared first in Mathieson et al. (2018).
If this Greece Neolithic sample is not related to Yamna migrations – and its use for statistical analysis of Caucasus samples from Wang et al. (2018) suggests that it is not – , it may have important consequences:
If it is located near the Western Anatolian coast – especially near Troy – there won’t be much to add about the potential site of entry of Common Anatolian languages into Anatolia… I have read some comments about how ‘impossible’ it was for steppe migrants and their language to ‘invade’ the more advanced cultures of Anatolia from the west, but it seems as ‘impossible’ as it was for Barbarians to invade the Roman Empire and impose their language as elites in certain regions. (And yes, we have at least one important weak political period among Middle Eastern cultures in the early 3rd millennium BC, similar to the period of the Fall of the Western Roman Empire).
If it is located somewhere more ‘central’ in the Greek Peninsula, then it could also be used to support the Anatolian nature of the controversial Pre-Greek (‘Pelasgian’) substrate. While we know that Greek (at least since Mycenaean) shows a strong Pre-Greek cultural and linguistic heritage (also reflected in its genetic continuity), the nature of that language is usually believed to be non-Indo-European, and Anatolian contacts are rather few and coincident with the Mycenaean period. I don’t think this sample can tell much about the Pre-Greek language, though, because – if it is really Neolithic, and comparing it with later Minoan and Mycenaean samples – it seems a clear outlier.
If it is, however, related to later Yamna migrations after ca. 3000 BC (and, like the ‘Ukraine Eneolithic’ sample that is likely from Catacomb, it is classified as Neolithic just because it cannot be attributed to precise Helladic periods), then we may be in front of the first obvious Yamna migrants in Greece. If that is the case (which I doubt), the sample wouldn’t be so informative for PIE dialectal expansions, because by now it is evident that we will find steppe ancestry and R1b-Z2103 subclades accompanying Yamna migrants in the southern Balkans, and probably well into Mycenaean Greece.
NOTE. Whatever the case, I am sure that for those fond of absurd autochthonous continuity theories, as well as for anti-steppe conspirationists, this sample will be just another good way of arguing for anything, ranging from a rejection of the Middle PIE – Late PIE division, to a support for some mythic ancient autochtonous Proto-Graeco-Anatolian group, or maybe some ancient Graeco–Indo-Slavonic split, or whatever new dialectal stage one can invent to support the own genealogical fantasies…
So, if it actually is a Neolithic sample, let’s hope that it shows a clear R1b-M269 (xL23 or early L23) subclade distinct from those (likely Z2103) expanded later with Late PIE-speaking Yamna (and probably to be found among Mycenaeans), so that there can be no more place for ethnic fantasies.
EDIT (28 JUL 2018): Added information on Greece Neolithic and Trypillia samples
An interesting observation is that steppe zone individuals directly north of the Caucasus (Eneolithic Samara and Eneolithic steppe) had initially not received any gene flow from Anatolian farmers. Instead, the ancestry profile in Eneolithic steppe individuals shows an even mixture of EHG and CHG ancestry, which argues for an effective cultural and genetic border between the contemporaneous Eneolithic populations in the North Caucasus, notably Steppe and Caucasus. Due to the temporal limitations of our dataset, we currently cannot determine whether this ancestry is stemming from an existing natural genetic gradient running from EHG far to the north to CHG/Iran in the south or whether this is the result of farmers with Iranian farmer/ CHG-related ancestry reaching the steppe zone independent of and prior to a stream of Anatolian farmer-like ancestry, where they mixed with local hunter-gatherers that carried only EHG ancestry.
Concerning the influences from the south, our oldest dates from the immediate Maykop predecessors Darkveti-Meshoko (Eneolithic Caucasus) indicate that the Caucasus genetic profile was present north of the range ~6500 BP, 4500 calBCE. This is in accordance with the Neolithization of the Caucasus, which had started in the flood plains of the great rivers in the South Caucasus in the 6th millennium BCE from where it spread to the West and Northwest Caucasus during the 5th millennium BCE9, 49. It remains unclear whether the local CHG ancestry profile (represented by Late Upper Palaeolithic/Mesolithic individuals from Kotias Klde and Satsurblia in today’s Georgia) was also present in the North Caucasus region before the Neolithic. However, if we take the Caucasus hunter-gatherer individuals from Georgia as a local baseline and the oldest Eneolithic Caucasus individuals from our transect as a proxy for the local Late Neolithic ancestry, we notice a substantial increase in Anatolian farmer-related ancestry. This in all likelihood is linked to the process of Neolithization, which also brought this type of ancestry to Europe. As a consequence, it is possible that Neolithic groups could have reached the northern flanks of the Caucasus earlier50 (Supplementary Information 1) and in contact with local hunter gatherers facilitated the exploration of the steppe environment for pastoralist economies. Hence, additional sampling from older individuals is needed to fill this temporal and spatial gap.
The newest paper of the Reich/Jena group has brought samples (probably) much nearer to the actual CHG and ANE contribution seen in Eneolithic steppe peoples than the previously available Kotias Klde, Satsurblia, Afontova Gora 3, or Mal’ta.
It is impossible to say without direct access to the samples, but it is very likely that we will soon be able to break down different gross contributions from groups similar to these Steppe/Caucasus Neolithic ancestral groups into the diverse Eneolithic cultures of the Pontic-Caspian steppe, and thus trace more precisely each of these cultures to their genetic (and thus ethnolinguistic) heirs.
Some more representative samples from Eneolithic steppe, steppe-forest and forest zone cultures of Eastern Europe will probably help with the fine-scale structure of different Chalcolithic groups, especially the homeland of early Corded Ware groups.
These new samples seem another good reason (like the Botai and R1b-M73) to rethink the role of (what I assumed were) different westward Mesolithic Eurasian waves of expansion influencing the formation of an Indo-Uralic and Indo-European community in Eastern Europe, and return to the simpler idea of local contributions from North Caucasus and steppe peoples absorbed by expanding EHG-like groups.
This post should probably read “Consequences of Narasimhan et al. (2018),” too, since there seems to be enough data and materials published by the Copenhagen group in Nature and Science to make a proper interpretation of the data that will appear in their corrected tables.
The finding of late Khvalynsk/early Yamna migrations, identified with early LPIE migrants almost exclusively of R1b-L23 subclades is probably one of the most interesting findings in the recent papers regarding the Indo-European question.
Although there are still few samples to derive fully-fledged theories, they begin to depict a clearer idea of waves that shaped the expansion of Late Proto-Indo-European migrants in Eurasia during the 4th millennium BC, i.e. well before the expansion of North-West Indo-European, Palaeo-Balkan, and Indo-Iranian languages.
Late Khvalynsk expansions and archaic Late PIE
Like Anatolian, Tocharian has been described as having a more archaic nature than the rest of Late PIE. However, Pre-Tocharian belongs to the Late PIE trunk, clearly distinguishable phonetically and morphologically from Anatolian.
It is especially remarkable that – even though it expanded into Asia – it has more in common with North-West Indo-European, hence its classification (together with NWIE) as part of a Northern group, unrelated to Graeco-Aryan.
The linguistic supplement by Kroonen et al. accepts that peoples from the Afanasevo culture (ca. 3000-2500 BC) are the most likely ancestors of Tocharians.
NOTE. For those equating the Tarim Mummies (of R1a-Z93 lineages) with Tocharians, you have this assertion from the linguistic supplement, which I support:
An intermediate stage has been sought in the oldest so-called Tarim Mummies, which date to ca. 1800 BCE (Mallory and Mair 2000; Wáng 1999). However, also the language(s) spoken by the people(s) who buried the Tarim Mummies remain unknown, and any connection between them and the Afanasievo culture on the one hand or the historical speakers of Tocharian on the other has yet to be demonstrated (cf. also Mallory 2015; Peyrot 2017).
New samples of late Khvalynsk origin
These are are the recent samples that could, with more or less certainty, correspond to migration waves from late Khvalynsk (or early Yamna), from oldest to most recent:
The Namazga III samples from the Late Eneolithic period (in Turkmenistan), dated ca. 3360-3000 BC (one of haplogroup J), potentially showing the first wave of EHG-related steppe ancestry into South Asia. Not related to Indo-Iranian migrations.
NOTE. A proper evaluation with further samples from Narasimhan et al. (2018) is necessary, though, before we can assert a late Khvalynsk origin of this ancestry.
Afanasevo samples, dated ca. 3081-2450 BC, with all samples dated before ca. 2700 BC uniformly of R1b-Z2103 subclades, sharing a common genetic cluster with Yamna, showing together the most likely genomic picture of late Khvalynsk peoples.
NOTE 1. Anthony (2007) put this expansion from Repin ca. 3300-3000 BC, while his most recent review (2015) of his own work put its completion ca. 3000-2800. While the migration into Afanasevo may have lasted some time, the wave of migrants (based on the most recent radiocarbon dates) must be set at least before ca. 3100 BC from Khvalynsk.
NOTE 2. I proposed that we could find R1b-L51 in Afanasevo, presupposing the development of R1b-L51 and R1b-Z2103 lineages with separating clans, and thus with dialectal divisions. While finding this is still possible within Khvalynsk regions, it seems we will have a division of these lineages already ca. 4250-4000 BC, which would require a closer follow-up of the different inner late Khvalynsk groups and their samples. For the moment, we don’t have a clear connection through lineages between North-West Indo-European groups and Tocharian.
Subsequent and similar migration waves are probably to be suggested from the new sample of Karagash, beyond the Urals (attributed to the Yamna culture, hence maintaining cultural contacts after the migration waves), of R1b-Z2103 subclade, ca. 3018-2887 BC, potentially connected then to the event that caused the expansion of Yamna migrants westward into the Carpathians at the same time. Not related to Indo-Iranian migrations.
The isolated Darra-e Kur sample, without cultural adscription, ca. 2655 BC, of R1b-L151 lineage. Not related to Indo-Iranian migrations.
The Hajji Firuz samples: I4243 dated ca. 2326 BC, female, with a clear inflow of steppe ancestry; and I2327 (probably to be dated to the late 3rd millennium BC or after that), of R1b-Z2103 lineage. Not related to Indo-Iranian migrations.
NOTE. A new radiocarbon dating of I2327 is expected, to correct the currently available date of 5900-5000 BC. Since it clusters nearer to Chalcolithic samples from the site than I4243 (from the same archaeological site), it is possible that both are part of similar groups receiving admixture around this period, or maybe I2327 is from a later period, coinciding with the Iron Age sample F38 from Iran (Broushaki et al. 2016), with which it closely clusters. Also, the finding of EHG-related ancestry in Maykop samples dated ca. 3700-3000 BC (maybe with R1b-L23 subclades) offers another potential source of migrants for this Iranian group.
NOTE. Samples from Narasimhan et al. (2018) still need to be published in corrected tables, which may change the actual subclades shown here.
These late Khvalynsk / early Yamna migration waves into Asia are quite early compared to the Indo-Iranian migrations, whose ancestors can only be first identified with Volga-Ural groups of Yamna/Poltavka (ca. 3000-2400 BC), with its fully formed language expanding only with MLBA waves ca. 2300-1200 BC, after mixing with incoming Abashevo migrants.
While the authors apparently forget to reference the previous linguistic theories whereby Tocharian is more archaic than the rest of Late PIE dialects, they refer to the ca. 1,000-year gap between Pre-Tocharian and Proto-Indo-Iranian migrations, and thus their obvious difference:
The fact that Tocharian is so different from the Indo-Iranian languages can only be explained by assuming an extensive period of linguistic separation.
Potential linguistic substrates in the Middle East
A few words about relevant substrate language proposals.
What Gordon Whittaker proposes is a North-West Indo-European-related substratum in Sumerian language and texts ca. 3500 BC, which may explain some non-Sumerian, non-Semitic word forms. It is just one of many theories concerning this substratum.
This is a summary of his findings from his latest writing on the subject (a chapter of a book on Indo-European phonetics, from the series Copenhagen Studies in Indo-European):
In Sumerian and Akkadian vocabulary, the cuneiform writing system, and the names of deities and places in Southern Mesopotamia a body of lexical material has been preserved that strongly suggests influence emanating from a superstrate of Indo-European origin. his Indo-European language, which has been given the name Euphratic, is, at present, attested only indirectly through the filters of Sumerian and Akkadian. The attestations consist of words and names recorded from the mid-4th millennium BC (Late Uruk period) onwards in texts and lexical lists. In addition, basic signs that originally had a recognizable pictorial structure in proto-cuneiform preserve (at least from the early 3rd millennium on) a number of phonetic values with no known motivation in Sumerian lexemes related semantically to the items depicted. This suggests that such values are relics from the original logographic values for the items depicted and, thus, that they were inherited from a language intimately associated with the development of writing in Mesopotamia. Since specialists working on proto-cuneiform, most notably Robert K. Englund of the Cuneiform Digital Library Initiative, see little or no evidence for the presence of Sumerian in the corpus of archaic tablets, the proposed Indo-European language provides a potential solution to this problem. It has been argued that this language, Euphratic, had a profound influence on Sumerian, not unlike that exerted by Sumerian and Akkadian on each other, and that the writing system was the primary vehicle of this influence. he phonological sketch drawn up here is an attempt to chart the salient characteristics of this influence, by comparing reconstructed Indo-European lexemes with similarly patterned ones in Sumerian (and, to a lesser extent, in Akkadian).
His original model, based on phonetic values in basic proto-cuneiform signs, is quite imaginative and a very interesting read, if you have the time. His Academia.edu account hosts most of his papers on the subject.
We could speculate about the potential expansion of this substrate language with the commercial contacts between Uruk and Maykop (as I did), now probably more strongly supported because of the EHG found in Maykop samples.
NOTE. We could also put it in relation with the Anatolian language of Mari, but this would require a new reassessment of its North-West Indo-European nature.
Nevertheless, this theory is far from being mainstream, anywhere. At least today.
NOTE. The proposal remains still hypothetic, because of the flaws in the Indo-European parallels – similar to Koch’s proposal of Indo-European in Tartessian inscriptions. A comprehensive critic approach to the theory is found in Sylvie Vanséveren’s A “new” ancient Indo-European language? On assumed linguistic contacts between Sumerian and Indo-European “Euphratic”, in JIES (2008) 36:3&4.
References to Gutian are popping up related to the Hajji Firuz samples of the mid-3rd millennium.
The hypothesis was put forward by Henning (1978) in purely archaeological terms.
This is the relevant excerpt from the book:
(…) Comparativists have asserted that, in spite of its late appearance, Tokharian is a relatively archaic form of Indo-European.3 This claim implies that the speakers of this group separated from their Indo-European brethren at a comparatively early date. They should accordingly have set out on their migrations rather early, and should have appeared within the Babylonian sphere of influence also rather early. Earlier, at any rate, than the Indo-Iranians, who spoke a highly developed (therefore probably later) form of Indo-European. Moreover, as some of the Indo-Iranians after their division into Iranians and Indo-Aryans4 appeared in Mesopotamia about 1500 B.C., we should expect the Proto-Tokharians about 2000 B.C. or even earlier.
If, armed with these assumptions as our working hypothesis, we look through the pages of history, we find one nation – one nation only – that perfectly fulfills all three conditions, which, therefore, entitles us to recognize it as the “Proto-Tokharians”. Tis name was Guti; the intial is also spelled with q (a voiceless back velar or pharyngeal), but the spelling with g is the original one. The closing -i is part of the name, for the Akkadian case-endings are added to it, nom. Gutium etc. Guti (or Gutium, as some scholars prefer) was valid for the nation, considered as an entity, but also for the territory it occupied.
The text goes on to follow the invasion of Babylonia by the Guti, and further eastward expansions supposedly connected with these, to form the attested Tocharians.
Among the Gutian rulers is one Elulumesh, whose name is evidently Akkadian Elulum slightly “Gutianized” by the Gutian case(?) ending -eš.40 This Gutian ruler Elulum is obviously the same man whom we find participating in the scramble for power after the death of Shar-kali-sharrii; his name appears there in Sumerian form without mimation as Elulu.
The Gutian dynasty, from ca. 22nd c. BC appears as follows:
I don’t think we could derive a potential relation to any specific Indo-European branch from this simple suffix repeated in Gutian rulers, though.
The hypothesis of the Tocharian-like nature of the Guti (apart from the obvious error of considering them as the ancestors of Tocharians) remains not contrasted in new works since. It was cited e.g. by Gamkrelidze and Ivanov (1995) to advance their Armenian homeland, and by Mallory and Adams in their Encyclopedia (1997).
It lies therefore in the obscurity of undeveloped archaeological-linguistic hypotheses, and its connection with the attested R1b-Z2103 samples from Iran is not (yet) warranted.
A lot of interesting data, I will try to analyse its main implications, if only superficially, in sections.
Anatolia_EBA from Ovaören, and Anatolia_MLBA (this including Assyrian and Old Hittite samples), all from Kalehöyük, show almost no change in Y-DNA lineages (three samples J2a, one G2a), and therefore an origin of these people in common with CHG and Iranian Neolithic populations is likely. No EHG ancestry is found. And PCA cluster is just somehow closer to Europe, but not to EHG populations.
NOTE. Hittite is attested only in the late first half of the 2nd millennium, although the authors cite (in the linguistic supplement) potential evidence from the palatial archives of the ancient city of Ebla in Syria to argue that Indo-European languages may have been already spoken in the region in the late 3rd millennium BCE.
Regarding the Assyrian samples (one J2a) from Ovaören:
Layer V of GT-137 was the richest in terms of architectural finds and dates to the Early Bronze Age II. In this layer, 2 different structures and a well were uncovered. The well was filled with stones, pottery, and human skeletons (Figs. S2 and S3). In total, skeletons belonging to 22 individuals, including adults, young adults, and children, must belong to the disturbed Early Bronze Age II graves adjacent to the well (103). Pottery and stones found below the skeletons demonstrate that the water well was consciously filled and closed. The fill consists of dumped stones, sherds and skeletons, and the closing stones demonstrate that the water well was consciously filled and cancelled.
Regarding the site most likely associated with the emergence of Old Hittite (two samples J2a1, one G2a2b1), this is what we know:
The Middle Bronze Age at Kaman-Kalehöyük represented by stratum IIIc yields material remains (seals and ceramics) contemporary with the international trade system managed by expatriate Assyrian merchants evidenced at the nearby site of Kültepe/Kanesh. It is therefore also referred to as belonging to the “Assyrian Colony Period” (98). The stratum has revealed three burned architectural units, and it has been suggested that the seemingly site-wide conflagration might be connected to a destruction event linked with the emergence of the Old Hittite state (99). (…) Omura (100) suggests that the rooms could belong to a public building, and that it might even be a small trade center based on the types of artifacts recovered. Omura (100) has concluded that the evidence from the first complex indicates a battle between 2 groups took place at the site. It is possible that a group died inside the buildings, mostly perishing in the fire, while another group died in the courtyard.
The PCA (Fig. 2B) indicates that all the Anatolian genome sequences from the Early Bronze Age ( -2200 BCE) and Late Bronze Age (-1600 BCE) cluster with a previously sequenced Copper Age ( -3900- 3700 BCE) individual from Northwestern Anatolia and lie between Anatolian Neolithic (Anatolia_ N) samples and CHG samples but not between Anatolia_N and EHG samples.
(…) we are not able to reject a two-population qpAdm model in which these groups derive -60% of their ancestry from Anatolian farmers and -40% from CHG-related ancestry (p-value = 0.5). This signal is not driven by Neolithic Iranian ancestry.
NOTE. Anatolian Iron Age samples, from the Hellenistic period, which was obviously greatly influenced by different, later Indo-European migrations, does show a change in PCA.
Regarding CHG ancestry:
Ancient DNA findings suggest extensive population contact between the Caucasus and the steppe during the Copper Age (-5000-3000 BCE) (1, 2, 42). Particularly, the first identified presence of Caucasian genomic ancestry in steppe populations is through the Khvalynsk burials (2, 47) and that of steppe ancestry in the Caucasus is through Armenian Copper Age individuals (42). These admixture processes likely gave rise to the ancestry that later became typical of the Yamnaya pastoralists (7), whose IE language may have evolved under the influence of a Caucasian language, possibly ‘from the Maykop culture (50, 55). This scenario is consistent with both the “Copper Age steppe” (4) and the “Caucasian” models for the origin of the Proto-Anatolian language (56).
The CHG specific ancestry and the absence of EHG-related ancestry in Bronze Age Anatolia would be in accordance with intense cultural interactions between populations in the Caucasus and Anatolia observed during the late 5th millennium BCE that seem to come to an end in the first half of the 4th millennium BCE with the village-based egalitarian Kura-Araxes society (59, 60), thus preceding the emergence and dispersal of Proto-Anatolian.
Our results indicate that the early spread of IE languages into Anatolia was not associated with any large-scale steppe-related migration, as previously suggested (61). Additionally, and in agreement with the later historical record of the region (62), we find no correlation between genetic ancestry and exclusive ethnic or political identities among the populations of Bronze Age Central Anatolia, as has previously been hypothesized ( 63).
The Anatolian question
There is no steppe ancestry or R1b-M269 lineages near early historic Hittites. Yet.
Nevertheless, we already know about potentially similar cases:
N1c lineages and Siberian ancestry arrived late in North-East Europe, modifying the ancestry of North-East European groups – with each region showing its own different late waves of N lineages or Siberian ancestry. Even after the known bottlenecks and the subsequent expansion of recently arrived haplogroups and ancestry, there was not much cultural (or ethnolinguistic) impact.
So there seems to be thus no theoretical problem in accepting:
That neither steppe ancestry nor R1b-M269 subclades, already diminished in Bulgaria in the mid-5th millennium, did reach Anatolia, but only those Common Anatolian-speaking Aegean groups over whose ancestors Proto-Anatolians (marked by incoming EHG ancestry) would have previously dominated in the Balkans.
That steppe ancestry and R1b-M269 subclades did in fact arrive in the Aegean, but EHG was further diluted among the CHG-related population by the time of the historic Anatolian-speaking peoples in central Anatolia. Or, the most likely option, that their trace have not been yet found. Probably the western Luwian peoples, near Troy, were genetically closer to Common Anatolians.
What we can assert right now is that Proto-Anatolian must have separated quite early for this kind of data to show up. This should mean an end to the Late PIE origin of Anatolian, if there was some lost soul from the mid-20th century still rooting for this.
As I said in my review of Lazaridis’ latest preprint, we will have to wait for the appropriate potential routes of expansion of Proto-Anatolian to be investigated. As he answered, the lack of EHG poses a problem for steppe expansion into Anatolia, but there is still no better alternative model proposed.
This is what the authors have to say:
Our findings are thus consistent with historical models of cultural hybridity and “Middle Ground” in a multi-cultural and multi-lingual but genetically homogeneous Bronze Age Anatolia (68, 69). Current linguistic estimations converge on dating the Proto-Anatolian split from residual PIE to the late 5th or early 4th millennia BCE (58, 70) and place the breakup of Anatolian IE inside Turkey prior to the mid-3rd millennium (53, 71,72).
We cannot at this point reject a scenario in which the introduction of the Anatolian IE languages into Anatolia was coupled with the CHG-derived admixture prior to 3700 BCE, but note that this is contrary to the standard view that PIE arose in the steppe north of the Caucasus (4) and that CHG ancestry is also associated with several non-IE-speaking groups, historical and current. Indeed, our data are also consistent with the first speakers of Anatolian IE coming to the region by way of commercial contacts and small-scale movement during the Bronze Age. Among comparative linguists, a Balkan route for the introduction of Anatolian IE is generally considered more likely than a passage through the Caucasus, due, for example, to greater Anatolian IE presence and language diversity in the west (73). Further discussion of these options is given in the archaeological and linguistic supplementary discussions (48, 49).
If you are asking yourselves why the Danish school (of Allentoft, Kristiansen, and Kroonen, co-authors of this paper) was not so fast to explain the findings the same way the proposed their infamous Indo-European – steppe ancestry association (i.e. ancestry = language, ergoCHG = PIE in this case), and resorted to mainstream anthropological models instead to explain the incongruence, I can think of two main reasons:
The possibility of having an early PIE around the Caucasus, potentially closely related not only to Uralic to the north, but also to Caucasian languages, Sumerian, Afroasiatic, Elamo-Dravidian, etc. could be a good reason for those excited with these few samples to begin dealing with macro-language proposals, such as Eurasiatic and Nostratic. If demonstrated to be true, a Northern Iranian origin of Middle PIE would also help relieve a little bit the pressure that some are feeling about the potentially male-driven Indo-European continuity (even if not “autochthonous”) associated with the expansion of R1b-L23 subclades.
Interesting data from an early East Yamna offshoot at Karagash, ca. 3018-2887 BC, of R1b-Z2106 lineage, which shows some ancestry, lineage, and cultural continuity in Sholpan, ca. 2620-2468 BC, in Kazakhstan.
On the formation of Yamna and its CHG contribution, from the supplementary material:
An admixture event, where Yamnaya is formed from a CHG population related to KK1 [=Kotias, dated ca. 7800 BC] and an ANE population related to Sidelkino and Botai. We inferred 54% of the Yamnaya ancestry to come from CHG and the remaining 46% to come from ANE.
A split event, where the CHG component of Yamnaya splits from KK1. The model inferred this time at 27 kya (though we note the larger models in Sections S2.12.4 and S2.12.5 inferred a more recent split time [see below graphic]).
A split event, where the ANE component of Yamnaya splits from Sidelkino. This was inferred at about about 11 kya.
A split event, where the ANE component of Yamnaya splits from Botai. We inferred this to occur 17 kya. Note that this is above the Sidelkino split time, so our model infers Yamnaya to be more closely related to the EHG Sidelkino, as expected.
An ancestral split event between the CHG and ANE ancestral populations. This was inferred to occur around 40 kya.
On the expansion of domestication
CHG is not found in Botai, no gene flow from Yamna is found in its samples, and they are more related to East Asians, while Yamna is related to West Eurasians:
The lack of evidence of admixture between Botai horse herders and western steppe pastoralists is consistent with these latter migrating through the central steppe but not settling until they reached the Altai to the east (4). More significantly, this lack of admixture suggests that horses were domesticated by hunter-gatherers not previously familiar with farming, as were the cases for dogs (38) and reindeer (39). Domestication of the horse thus may best parallel that of the reindeer, a food animal that can be milked and ridden, which has been proposed to be domesticated by hunters via the “prey path” (40); indeed anthropologists note similarities in cosmological beliefs between hunters and reindeer herders (41). In contrast, most animal domestications were achieved by settled agriculturalists (5).
NOTE. I am not sure, but they seem to hint that there were separate events of horse domestication and horse-riding technique by the Botai and Yamna populations due to their lack of genetic contribution from the latter to the former. I guess they did not take into account farming spreading to the steppe without genetic contribution beyond the Dnieper… In fact, the superiority in horse-riding shown by the expanding Yamna peoples – as they state – should also serve to suggest from where the original technique expanded.
On the expansion of Yamna, and the different expansion of Steppe MLBA (with Indo-Iranian speakers) into Asia, further supporting Narasimhan et al. (2018), they have this to say:
However, direct influence of Yamnaya or related cultures of that period is not visible in the archaeological record, except perhaps for a single burial mound in Sarazm in present-day Tajikistan of contested age (44, 45). Additionally, linguistic reconstruction of proto-culture coupled with the archaeological chronology evidences a Late (-2300-1200 BCE) rather than Early Bronze Age (-3000-2500 BCE) arrival of the Indo-Iranian languages into South Asia (16, 45, 46). Thus, debate persists as to how and when Western Eurasian genetic signatures and IE languages reached South Asia.
Samples from the Namazga region (current Turkmenistan) from the Iron Age show an obvious influence from steppe MLBA (ca. 2300-1200 BC), and not steppe EBA (i.e. Yamna), population, in contrast with samples from the Chalcolithic (ca. 3300 BC), which don’t show this influence. This helps distinguish prior contacts with Iran Neolithic from the actual steppe population that expanded Indo-Iranian into Asia.
Very interesting therefore the Namazga CA sample (ca. 855 BC), of R1a-Z93 subclade, showing the sign of immigrant Indo-Aryans in the region. For more on this we will need an evaluation in common with the corrected data from Narasimhan et al. (2018), and all, including de Barros (Nature 2018), in combination with statistical methods to ascertain differences between early Indo-Aryans and Iranians.
Siberian peoples and N1c lineages
We have already seen how the paper on Eurasian steppe samples tries to assign Uralic to Neolithic peoples east of the Urals. The association with Okunevo is unlikely, since most are of haplogroup Q1a2, but they seem to suggest (combining both papers) that they accompanied N lineages from Siberian hunter-gatherers (present e.g. in Botai or Shamanka II, during the Early Neolithic), and formed part of (or suffered from) different demic diffusion waves:
These serial changes in the Baikal populations are reflected in Y-chromosome lineages (Fig. SA; figs. S24 to S27, and tables S13 and SI4). MAI carries the R haplogroup, whereas the majority of Baikal_EN males belong to N lineages, which were widely distributed across Northern Eurasia (29), and the Baikal_LNBA males all carry Q haplogroups, as do most of the Okunevo_EMBA as well as some present-day Central Asians and Siberians.
NOTE. Also interesting to see no R1a in Baikal hunter-gatherers after ca. 3500 BC, and a prevalence of N lineages as supported in a previous paper on the Kitoi culture, which some had questioned in the past.
In fact, the only N1c1 sample comes from Ust’Ida Late Neolithic, 180km to the north of Lake Baikal, apparently before the expansion of Q1a2a lineages during the EBA period. While this sample may be related to those expanded later in Finno-Ugric territory (although it may only be related to those expanded much later with Yakuts), other samples are not clearly from those found widely distributed among North-East Europeans only after the Iron Age, or – as in the case of Shamanka II (N1c2), they are clearly not of the same haplogroup.
Regarding Y-DNA data, once again almost 100% of samples from late Khvalynsk/Yamna and derived cultures (like Afanasevo and Bell Beaker) are R1b-L23, no single R1a-M417 lineage found, and few expected by now, if any, within Late Proto-Indo-European territory.
While they claim to take Y-DNA into account to assess migrations – as they do for example with Asian cultures – , their previous model of a Yamna “R1a-R1b community” remains oddly unchanged, and they even insist on it in the supplementary materials, as they do in their parallel Nature paper.
They have also expressly mitigated the use of ancestral components to assess populations, citing the ancestral and modern association of CHG ancestry with different ethnolinguistic groups in the Middle East, to dismiss any rushed conclusions on the origin of Anatolian, and consequently of Middle PIE. And they did so evidently because it did not fit the anthropological data that is mainstream today (supporting a Balkan route), which is the right thing to do.
However, they have apparently not stopped to reconsider the links of CWC and steppe ancestry to ancestral and modern Uralic peoples – although they expressly mention the strong connection with modern Karelians in the supplementary material.
Also, after Narasimhan et al. (2018), there is a clear genetic continuity with East Yamna (in ancestry as in R1b-L23 subclades), so their interpretations about Indo-Iranian in this paper and especially de Barros (Nature 2018) – regarding the Abashevo -> Sintashta/Srunba/Andronovo connection – come, again, too late.
In duelling 2015 Nature papers6,7, the teams arrived at broadly similar conclusions: an influx of herders from the grassland steppes of present-day Russia and Ukraine — linked to Yamnaya cultural artefacts and practices such as pit burial mounds — had replaced much of the gene pool of central and Western Europe around 4,500–5,000 years ago. This was coincident with the disappearance of Neolithic pottery, burial styles and other cultural expressions and the emergence of Corded Ware cultural artefacts, which are distributed throughout northern and central Europe. “These results were a shock to the archaeological community,” Kristiansen says.
Still, not everyone was satisfied. In an essay8 titled ‘Kossinna’s Smile’, archaeologist Volker Heyd at the University of Bristol, UK, disagreed, not with the conclusion that people moved west from the steppe, but with how their genetic signatures were conflated with complex cultural expressions. Corded Ware and Yamnaya burials are more different than they are similar, and there is evidence of cultural exchange, at least, between the Russian steppe and regions west that predate Yamnaya culture, he says. None of these facts negates the conclusions of the genetics papers, but they underscore the insufficiency of the articles in addressing the questions that archaeologists are interested in, he argued. “While I have no doubt they are basically right, it is the complexity of the past that is not reflected,” Heyd wrote, before issuing a call to arms. “Instead of letting geneticists determine the agenda and set the message, we should teach them about complexity in past human actions.”
Many archaeologists are also trying to understand and engage with the inconvenient findings from genetics. (…)
[Carlin:] “I would characterize a lot of these papers as ‘map and describe’. They’re looking at the movement of genetic signatures, but in terms of how or why that’s happening, those things aren’t being explored,” says Carlin, who is no longer disturbed by the disconnect. “I am increasingly reconciling myself to the view that archaeology and ancient DNA are telling different stories.” The changes in cultural and social practices that he studies might coincide with the population shifts that Reich and his team are uncovering, but they don’t necessarily have to. And such biological insights will never fully explain the human experiences captured in the archaeological record.
Reich agrees that his field is in a “map-making phase”, and that genetics is only sketching out the rough contours of the past. Sweeping conclusions, such as those put forth in the 2015 steppe migration papers, will give way to regionally focused studies with more subtlety.
This is already starting to happen. Although the Bell Beaker study found a profound shift in the genetic make-up of Britain, it rejected the notion that the cultural phenomenon was associated with a single population. In Iberia, individuals buried with Bell Beaker goods were closely related to earlier local populations and shared little ancestry with Beaker-associated individuals from northern Europe (who were related to steppe groups such as the Yamnaya). The pots did the moving, not the people.
This final paragraph apparently sums up a view that Reich has of this field, since he repeats it:
Reich concedes that his field hasn’t always handled the past with the nuance or accuracy that archaeologists and historians would like. But he hopes they will eventually be swayed by the insights his field can bring. “We’re barbarians coming late to the study of the human past,” Reich says. “But it’s dangerous to ignore barbarians.”
I would say that the true barbarians didn’t have a habit or possibility to learn from the higher civilizations they attacked or invaded. Geneticists, on the other hand, only have to do what they expect archaeologists to do: study.
The Late Neolithic Corded Ware Culture (c. 2800–2300 BC) of Northern Europe is characterised by specific sets of grave goods and mortuary practices, but the organic components of these grave sets are poorly represented in the archaeological record. New microscopic analyses of soil samples collected during the 1930s from the Perttulanmäki grave in western Finland have, however, revealed preserved Neolithic animal hairs. Despite mineralisation, the species of animal has been successfully identified and offers the oldest evidence for domestic goat in Neolithic Finland, indicating a pastoral herding economy. The mortuary context of the goat hair also suggests that animals played a significant role in the Corded Ware belief system.
Although the material culture used in Corded Ware funerary rituals is well known, a full appreciation of the associated mortuary practices is still lacking. In fact, even though evidence for internal wooden and stone structures is commonly documented in Corded Ware mortuary contexts (e.g. Fischer 1956; Malmer 1962; Hansen 1994; Vander Linden 2007), detailed information on the ways that structures within the grave might have been furnished is largely missing. The use of textiles, mats and furs to cover grave pit walls and floors is commonly documented in Yamnaya Culture graves (Heyd 2011). This culture represents the best-known proxy for the incoming gene-flow that occurred in Europe during the third millennium BC, resulting in the formation of the Corded Ware phenomenon (Allentoft et al. 2015; Haak et al. 2015; Kristiansen et al. 2017). Similar practices may, therefore, have occurred in the Corded Ware tradition. In fact, the Corded Ware graves already show strong affinities to the Yamnaya burial rituals; for example, in the practice of a single inhumation under a barrow (Kristiansen et al. 2017: 336). New information on Corded Ware mortuary practices has come from the northern periphery of the cultural area. The results are based on microscopic analyses conducted on soil samples collected from the Perttulanmäki Corded Ware grave in western Finland (Figure 1). These analyses suggest that a goat skin had been placed in the grave. This discovery is important as it provides clear evidence of a mortuary practice that has only in rare cases been previously suspected (e.g. Torvinen 1979; Meurkens et al. 2015).
The animal accompaniment could be present in various forms. Several Corded Ware burials in the Baltic area have been furnished with artefacts made of domestic animal bone (Zagorska 2006: 103; Lõugas et al. 2007: 25–26; Larsson 2009: 63). That all milk residues from Finnish Corded Ware pottery were found exclusively in beakertype ‘drinking’ vessels (Cramp et al. 2014: 4) further supports this idea. These beakers are usually found in grave deposits (Edgren 1970: 76–77; Larsson 2009: 352), so the animal could have also been represented by placing milk, or a vessel connected with milk, in the grave (Edgren 1970: 76–77; Larsson 2009: 352). This being said, it must be noted that, due to the vast distribution area of the Corded Ware phenomenon, the same objects or symbols might not have been connected with the same ideas (Furholt 2014: 82). Moreover, some prehistoric societies might have also repeated the ritual practice simply as tradition—after its original meaning had been forgotten (Nilsson Stutz 2003: 319).