Enigmatic *-nt-Stems: an investigation of the secondary -t- of the Greek neuter nouns in *-men- and *-r/n-


Interesting Master’s thesis: Enigmatic *-nt-Stems: an investigation of the secondary -t- of the Greek neuter nouns in *-men- and *-r/n-, by Stephanie Stringer, Université de Montréal (2018).


This paper aims to provide an explanation of the secondary -t- found in the oblique stem of ancient Greek neuters such as πρᾶγμα, πράγματος and ἧπαρ, ἥπατος. After a brief overview of the Greek data, and a survey of the relevant nominal classes in Greek and Indo-European, previous hypotheses are evaluated. To this end, several problems of nominal morphology are discussed, including the existence of a PIE suffix *-m(e)ntom, the secondary -t-s of certain animate nouns, the ablatival suffix *-tos, the Hittite ergative; and the ablaut of neuter active participles. Certain phonological issues are also addressed. Since the majority of hypotheses formulated to explain the secondary -nt- inflection of Greek neuters date from the nineteenth century, attempts are made to re-evaluate their conclusions in the light of more recent research, particularly that related to ablaut classes. Also considered are a number of twenty-first century works which purport to explain the Greek data as part of a larger Indo-European phenomenon.

This paper makes no attempt, however, to explain the PIE origins of either the *r/n-, or of the *nt- stems. It concludes that the best explanation of the Greek declensional pattern is to be found in the analogy between stems in -nt- and those in *-mn- or *-r/n-.

Interesting excerpts, from the conclusion (emphasis mine):

In comparison with other proposed solutions, there are relatively few objections to be levelled at Schmidt’s theory, in this slightly modified form. Anghelina (2010) criticises it on the ground that it does not provide an explanation of how the -t- came to be inserted into the r/n-stems, but this criticism has already been addressed. Anghelina also objects to the idea that participles might affect the declension of nouns. Given that participles can function syntactically as nouns, and that their declension is formally identical, except for the distinction of gender, it is difficult to see why the inflection of one might not affect the inflection of the other. Furthermore, it appears to have done so within the history of Greek. In addition to the neuters, a number of masculine n-stems are inflected as nt-stems, although related formations within Greek attest to the secondary nature of the -t-, e.g. δράκων, -οντος, but δράκαινα, λέων, λέοντος, but λέαινα etc. Perhaps Anghelina would prefer to explain these cases also as developments from the dative plural, before the ablaut was levelled, but in general, the influence of participles is accepted as an explanation. One could also point to the influence of the pronominal declension on the endings of thematic stems in PIE.

Sihler (2008, 297) argued that if one accepted the nt-stems as a model it was “hard to progress beyond a vague likelihood” and “the supposed model paradigm has been everywhere replaced.” The first of these criticisms is valid, in a sense. One cannot conclusively demonstrate that the nt-stems served as the model for the men- and r/n-stems. Only that the model was available, and that the outcome of the change conforms with it. However, it does not seem that Sihler’s caution is more pertinent in the case of this theory than in any other proposed explanation of morphological change.

Sihler’s second criticism is true as well. The starting point, a NA sg. nt. -n̥, G sg. -n̥t-os is indeed only preserved, at best, in a few relics. All the same, it can be assumed with some confidence to have existed at the right time. Furthermore, the analogy is unobjectionable. The nt. NA sg. ending -n̥ could genuinely belong to an nt-stem as well as to an n-stem. It is no surprise that the neuters were systematically replaced, while the masculines and feminines showed only sporadic transition to the nt-declension. In the neuter both the NA were liable to re-interpretation as a t-stem, while in the animate forms only the nominative was. The m(e)n-stems are a highly uniform group, and it is easy to understand how a change could spread relatively quickly. The connection to the r/n-stems is slightly more tenuous, but they do have more in common with the m(e)n-stems than with any other group. (A m(e)r/m(e)n- suffix does exist, but given that it is quite rare, and given that its only two representatives in early Greek, τέκμαρ and τέκμωρ are attested only in the NA sg., it is hard to see that this subclass can have played a significant role.)

The nt-stem theory provides an adequate explanation of the Greek situation. That was indeed the very limited aim of this paper. In very general terms, it may also provide an explanation for some of the “stray” t’s one finds attached at times to n-stems in PIE or other languages. Given the co-existence, whatever their origin, of both -nt- and -n-, and in fact t-stems, and given that both -n- and -t- were under certain conditions liable to be lost or assimilated to surrounding sounds, one might expect to find a certain degree of erratic fluctuation between the two classes. Such an observation is so vague as to be quite unhelpful, but at least it is not contradicted by known facts.

In opting for a solution that seems to account for the facts in Greek, one is forced to leave many other phenomena unexplained. Although it would be more satisfying if one were able to draw together the -t- of the NA *-r/n- in Sanskrit and the -t- of the nearly synonymous suffixes -man-, -vant-, man, mant, vasanta, gimmant- etc., it seems at present they can only be connected if one ignores many of the details of each specific situation. For the time being, it appears they must be dismissed as similar, but essentially unrelated, or at least only very indirectly related phenomena. It is entirely possible that further research will reverse this conclusion.

The Caucasus a genetic and cultural barrier; Yamna dominated by R1b-M269; Yamna settlers in Hungary cluster with Yamna


Open access: The genetic prehistory of the Greater Caucasus, by Wang et al., bioRxiv (2018).

The Caucasus Mountains as a prehistoric barrier

I think the essential message we can extract from the paper is that the Caucasus was a long-lasting cultural and genetic barrier, although (obviously) it was not insurmountable.

Our results show that at the time of the eponymous grave mound of Maykop, the North Caucasus piedmont region was genetically connected to the south. Even without direct ancient DNA data from northern Mesopotamia, the new genetic evidence suggests an increased assimilation of Chalcolithic individuals from Iran, Anatolia and Armenia and those of the Eneolithic Caucasus during 6000-4000 calBCE23, and thus likely also intensified cultural connections. Within this sphere of interaction, it is possible that cultural influences and continuous subtle gene flow from the south formed the basis of Maykop.

The zoomed map shows the location of sites in the Caucasus. The size of the circle reflects number of individuals that produced genome-wide data. The dashed line illustrates a hypothetical geographic border between genetically distinct Steppe and Caucasus clusters.

Also, unlike more recent times, the North Caucasian piedmont and foothill of the Caucasus region was more strongly connected to Northern Iran than to the steppe, at least until the Bronze Age.

(…) our data shows that the northern flanks were consistently linked to the Near East and had received multiple streams of gene flow from the south, as seen e.g. during the Maykop, Kura-Araxes and late phase of the North Caucasus culture.

Northern Caucasus dominated by R1b, southern Caucasus by J and G2

Comparison of Y-chromosome (A) and mitochondrial (B) haplogroup distribution in the Steppe and Caucasus cluster.

The first samples from the Eneolithic (one ca. 4300 BC?, the other ca. 4100 BC) are R1b1, without further subclades, so it is difficult to say if they were V88. On the PCA, they seem to be an important piece of the early Khvalynsk -> early Yamna transition period, since they cluster closer to (or even among) subsequent Yamna samples.

From 3000 BC onwards, all samples from the Northern Caucasus group of Yamna are R1b-M269, which right now is probably no surprise for anyone.

The Catacomb culture is dominated by R1b-Z2103, which agrees with what we saw in the unclassified Ukraine Eneolithic sample. However, the new samples (clustering close to Yamna, but slightly ‘to the south’ of it) don’t seem to cluster closely to that first sample, so that one may still remain a real ‘outlier’, showing incoming influence (through exogamy) from the north.

If anyone was still wondering, no R1a in any of the samples, either. This, and the homogeneous R1b-Z2103 community in Catacomb (a culture in an intermediate region between Late Yamna to the West, and Poltavka to the East), together with Poltavka dominated by R1b-Z2103, too, should put an end to the idea that Steppe MLBA (Sintashta-Petrovka/Potapovka) somehow formed in the North Pontic steppe and appeared directly in the Volga-Ural region. A Uralic/Indo-Iranian community it is, then.

The admixed population from the Caucasus probably points to an isolated region of diverse peoples and languages even in this period, which justifies the strong differences among the historic language families attested in the Caucasus.

So, not much space for Anatolian migrating with those expected Maykop samples with EHG ancestry, unless exogamy is proposed as a source of language change.

ADMIXTURE and PCA results, and chronological order of ancient Caucasus individuals. Samples from Hungary are surrounded by red circles (see below for ADMIXTURE data) (a) ADMIXTURE results (k=12) of the newly genotyped individuals (filled symbols with black outlines) sorted by genetic clusters (Steppe and Caucasus) and in chronological order (coloured bars indicate the relative archaeological dates, (b) white circles the mean calibrated radiocarbon date and the errors bars the 2-sigma range. (d) shows these projected onto a PCA of 84 modern-day West Eurasian populations (open symbols).

Yamna Hungary, and the previous Yamna “outliers”

Those western “Yamna outliers”, as I expected, were part of some late Khvalynsk/early Yamna groups that cluster “to the south” of eastern Yamna samples:

Another important observation is that all later individuals in the steppe region, starting with Yamnaya, deviate from the EHG-CHG admixture cline towards European populations in the West. This documents that these individuals had received Anatolian farmer-related ancestry, as documented by quantitative tests and recently also shown for two Yamnaya individuals from Ukraine (Ozera) and one from Bulgaria24. For the North Caucasus region, this genetic contribution could have occurred through immediate contact with groups in the Caucasus or further south. An alternative source, explaining the increase in WHG-related ancestry, would be contact with contemporaneous Chalcolithic/EBA farming groups at the western periphery of the Yamnaya culture distribution area, such as Globular Amphora and Tripolye (Cucuteni–Trypillia) individuals from Ukraine, which also have been shown to carry Anatolian Neolithic farmer-derived ancestry24.

On the other hand, it is interesting that – although no information is released about these samples – Yamna Bulgaria is now a clear outlier, among very “Yamnaya”-like Yamna settlers from Hungary, most likely from the Carpathian basin, and new Yamna LCA/EBA samples, possibly from Late Yamna (see them also marked in the PCA above):

Modified image, with red rectangles surrounding (unexplained) Hungarian samples (c) ADMIXTURE results of relevant prehistoric individuals mentioned in the text (filled symbols)

The important admixture of Yamna settlers with native populations, seen in expanding East Bell Beakers of R1b-L23 lineages from ca. 2500 BC on, must have therefore happened at the same time as the adoption of the proto-Bell Beaker package, i.e. precisely during the Carpathian Basin / Lower Danube settlements, and not in West Yamna.

Modified image, with red rectangles surrounding (unexplained) Yamna samples Modelling results for the Steppe and Caucasus cluster. Admixture proportions based on (temporally and geographically) distal and proximal models, showing additional Anatolian farmer-related ancestry in Steppe groups as well as additional gene flow from the south in some of the Steppe groups as well as the Caucasus groups

So, it can’t get clearer that Late Neolithic Baltic and Corded Ware migrants, sharing R1a-Z645 lineages and a different admixture, related to Eneolithic North Pontic groups such as Sredni Stog (see above ADMIXTURE graphics of CWC and Eneolithic Ukraine samples), did not come from West Yamna migrants, either.

So much for the R1a/R1b Yamna community that expanded Late PIE into Corded Ware.

NOTE. Andrew Gelman has coined a term for a curious phenomenon (taken from an anonymous commenter): “Eureka bias”, which refers not only to how researchers stick to previously reported incorrect results or interpretations, but also to how badly they react to criticism, even if they understand that it is well-founded. Directly applicable to the research groups that launched the Yamna-CWC idea (and the people who followed them) based on the fallacious “Yamnaya ancestry” concept, and who are still rooting for some version of it, from now on with exogamy, patron-client relationships, Eneolithic Indo-Slavonic, and whatnot. Unless, that is, Anthony’s latest model is right, and Yamna Hungary is suddenly full of R1a-Z645 samples…

Images used are from the article.


Eurasian steppe dominated by Iranian peoples, Indo-Iranian expanded from East Yamna


The expected study of Eurasian samples is out (behind paywall): 137 ancient human genomes from across the Eurasian steppes, by de Barros Damgaard et al., Nature (2018).

Discussion (emphasis mine):

Our findings fit well with current insights from the historical linguistics of this region (Supplementary Information section 2). The steppes were probably largely Iranian-speaking in the first and second millennia bc. This is supported by the split of the Indo-Iranian linguistic branch into Iranian and Indian33, the distribution of the Iranian languages, and the preservation of Old Iranian loanwords in Tocharian34. The wide distribution of the Turkic languages from Northwest China, Mongolia and Siberia in the east to Turkey and Bulgaria in the west implies large-scale migrations out of the homeland in Mongolia since about 2,000 years ago35. The diversification within the Turkic languages suggests that several waves of migration occurred36 and, on the basis of the effect of local languages, gradual assimilation to local populations had previously been assumed37. The East Asian migration starting with the Xiongnu accords well with the hypothesis that early Turkic was the major language of Xiongnu groups38. Further migrations of East Asians westwards find a good linguistic correlate in the influence of Mongolian on Turkic and Iranian in the last millennium39. As such, the genomic history of the Eurasian steppes is the story of a gradual transition from Bronze Age pastoralists of West Eurasian ancestry towards mounted warriors of increased East Asian ancestry—a process that continued well into historical times.

This paper will need a careful reading – better in combination with Narasimhan et al. (2018), when their tables are corrected – to assess the actual ‘Iranian’ nature of the peoples studied. Their wide and long-term dominion over the steppe could also potentially explain some early samples from Hajji Firuz with steppe ancestry.

Principal component analyses. The principal components 1 and 2 were plotted for the ancient data analysed with the present-day data (no projection bias) using 502 individuals at 242,406 autosomal SNP positions. Dimension 1 explains 3% of the variance and represents a gradient stretching from Europe to East Asia. Dimension 2 explains 0.6% of the variance, and is a gradient mainly represented by ancient DNA starting from a ‘basal-rich’ cluster of Natufian hunter-gatherers and ending with EHGs. BA, Bronze Age; EMBA, Early-to-Middle Bronze Age; SHG, Scandinavian hunter-gatherers.

For the moment, at first sight, it seems that, in terms of Y-DNA lineages:

  • R1a-Z93 (especially Z2124 subclades) dominates the steppes in the studied periods.
  • R1b-P312 is found in Hallstatt ca. 810 BC, which is compatible with its role in the Celtic expansion.
  • R1b-U106 is found in a West Germanic chieftain in Poprad (Slovakia) ca. 400 AD, during the Migration Period, hence supporting once again the expansion of Germanic tribes especially with R1b-U106 lineages.
  • A new sample of N1c-L392 (L1025) lineage dated ca. 400 AD, now from Lithuania, points again to a quite late expansion of this lineage to the region, believed to have hosted Uralic speakers for more than 2,000 years before this.
  • A sample of haplogroup R1a-Z282 (Z92) dated ca. 1300 AD in the Golden Horde is probably not quite revealing, not even for the East Slavic expansion.
  • Also, interestingly, some R1b(xM269) lineages seem to be associated with Turkic expansions from the eastern steppe dated around 500 AD, which probably points to a wide Eurasian distribution of early R1b subclades in the Mesolithic.

NOTE. I have referenced not just the reported subclades from the paper, but also (and mainly) further Y-SNP calls studied by Open Genomes. See the spreadsheet here.

Interesting also to read in the supplementary materials the following, by Michaël Peyrot (emphasis mine):

1. Early Indo-Europeans on the steppe: Tocharians and Indo-Iranians

The Indo-European language family is spread over Eurasia and comprises such branches and languages as Greek, Latin, Germanic, Celtic, Sanskrit etc. The branches relevant for the Eurasian steppe are Indo-Aryan (= Indian) and Iranian, which together form the Indo-Iranian branch, and the extinct Tocharian branch. All Indo-European languages derive from a postulated protolanguage termed Proto-Indo-European. This language must have been spoken ca 4500–3500 BCE in the steppe of Eastern Europe21. The Tocharian languages were spoken in the Tarim Basin in present-day Northwest China, as shown by manuscripts from ca 500–1000 CE. The Indo-Aryan branch consists of Sanskrit and several languages of the Indian subcontinent, including Hindi. The Iranian branch is spread today from Kurdish in the west, through a.o. Persian and Pashto, to minority languages in western China, but was in the 2nd and 1st millennia BCE widespread also on the Eurasian steppe. Since despite their location Tocharian and Indo-Iranian show no closer relationship within Indo-European, the early Tocharians may have moved east before the Indo-Iranians. They are probably to be identified with the Afanasievo Culture of South Siberia (ca 2900 – 2500 BCE) and have possibly entered the Tarim Basin ca 2000 BCE103.

The Indo-Iranian branch is an extension of the Indo-European Yamnaya Culture (ca 3000–2400 BCE) towards the east. The rise of the Indo-Iranian language, of which no direct records exist, must be connected with the Abashevo / Sintashta Culture (ca 2100 – 1800 BCE) in the southern Urals and the subsequent rise and spread of Andronovo-related Culture (1700 – 1500 BCE). The most important linguistic evidence of the Indo-Iranian phase is formed by borrowings into Finno-Ugric languages104–106. Kuz’mina (2001) identifies the Finno-Ugrians with the Andronoid cultures in the pre-taiga zone east of the Urals107. Since some of the oldest words borrowed into Finno-Ugric are only found in Indo-Aryan, Indo-Aryan and Iranian apparently had already begun to diverge by the time of these contacts, and when both groups moved east, the Iranians followed the Indo-Aryans108. Being pushed by the expanding Iranians, the Indo-Aryans then moved south, one group surfacing in equestrian terminology of the Anatolian Mitanni kingdom, and the main group entering the Indian subcontinent from the northwest.

Summary map. Depictions of the five main migratory events associated with the genomic history of the steppe pastoralists from 3000 bc to the present. a, Depiction of Early Bronze Age migrations related to the expansion of Yamnaya and Afanasievo culture. b, Depiction of Late Bronze Age migrations related to the Sintashta and Andronovo horizons. c, Depiction of Iron Age migrations and sources of admixture. d, Depiction of Hun-period migrations and sources of admixture. e, Depiction of Medieval migrations across the steppes.

2. Andronovo Culture: Early Steppe Iranian

Initially, the Andronovo Culture may have encompassed speakers of Iranian as well as Indo-Aryan, but its large expansion over the Eurasian steppe is most probably to be interpreted as the spread of Iranians. Unfortunately, there is no direct linguistic evidence to prove to what extent the steppe was indeed Iranian speaking in the 2nd millennium BCE. An important piece of indirect evidence is formed by an archaic stratum of Iranian loanwords in Tocharian34,109. Since Tocharian was spoken beyond the eastern end of the steppe, this suggests that speakers of Iranian spread at least that far. In the west of the Tarim Basin the Iranian languages Khotanese and Tumshuqese were spoken. However, the Tocharian B word etswe ‘mule’, borrowed from Iranian *atswa- ‘horse’, cannot derive from these languages, since Khotanese has aśśa- ‘horse’ with śś instead of tsw. The archaic Iranian stratum in Tocharian is therefore rather to be connected with the presence of Andronovo people to the north and possibly to the east of the Tarim Basin from the middle of the 2nd millennium BCE onwards110.

Since Kristiansen and Allentoft sign the paper (and Peyrot is a colleague of Kroonen), it seems that they needed to expressly respond to the growing criticism about their recent Indo-European – Corded Ware Theory. That’s nice.

They are obviously trying to reject the Corded Ware – Uralic links that are on the rise lately among Uralicists, now that Comb Ware is not a suitable candidate for the expansion of the language family.

IECWT-proponents are apparently not prepared to let it go quietly, and instead of challenging the traditional Neolithic Uralic homeland in Eastern Europe with a recent paper on the subject, they selected an older one which partially fit, from Kuz’mina (2001), now shifting the Uralic homeland to the east of the Urals (when Kuz’mina asserts it was south of the Urals).

Different authors comment later in this same paper about East Uralic languages spreading quite late, so even their text is not consistent among collaborating authors.

Also interesting is the need to resort to the questionable argument of early Indo-Aryan loans – which may have evidently been Indo-Iranian instead, since there is no way to prove a difference between both stages in early Uralic borrowings from ca. 4,500-3,500 years ago…

EDIT (10/5/2018) The linguistic supplement of the Science paper deals with different Proto-Indo-Iranian stages in Uralic loans, so on the linguistic side at least this influence is clear to all involved.

A rejection of such proposals of a late, eastern homeland can be found in many recent writings of Finnic scholars; see e.g. my references to Parpola (2017), Kallio (2017), or Nordqvist (2018).

NOTE. I don’t mind repeating it again: Uralic is one possibility (the most likely one) for the substrate language that Corded Ware migrants spread, but it could have been e.g. another Middle PIE dialect, similar to Proto-Anatolian (after the expansion of Suvorovo-Novodanilovka chiefs). I expressly stated this in the Corded Ware substrate hypothesis, since the first edition. What was clear since 2015, and should be clear to anyone now, is that Corded Ware did not spread Late PIE languages to Europe, and that some east CWC groups only spread languages to Asia after admixing with East Yamna. If they did not spread Uralic, then it was a language or group of languages phonetically similar, which has not survived to this day.

Their description of Yamna migrations is already outdated after Olalde et al. & Mathieson et al. (2018), and Narasimhan et al. (2018), so they will need to update their model (yet again) for future papers. As I said before, Anthony seems to be one step behind the current genetic data, and the IECWT seems to be one step behind Anthony in their interpretations.

At least we won’t have the Yamna -> Corded Ware -> BBC nonsense anymore, and they expressly stated that LPIE is to be associated with Yamna, and in particular the “Indo-Iranian branch is an extension of the Indo-European Yamnaya Culture (ca 3000–2400 BCE) to the East” (which will evidently show an East Yamna / Poltavka society of R1b-L23 subclades), so that earlier Eneolithic cultures have to be excluded, and Balto-Slavic identification with East Europe is also out of the way.


Brexit forces relocation of one of today’s main Yamna research projects to Finland


Archaeologist Volker Heyd is bringing his ERC Advanced Grant to Helsinki, as the University of Helsinki has proudly reported.

Some interesting excerpts (emphasis mine):

With his research group, Heyd wants to map out how the Yamnaya culture, also known as the Pit Grave culture, migrated from the Eurasian steppes to prehistoric south-eastern Europe approximately 3,000 years BCE. Most of the burial mounds typical of the Yamnaya culture have already been destroyed, but new techniques enable their identification and study.

The project is using multidisciplinary methods to solve the mystery. Archaeologists are collaborating with scholars of biological and environmental sciences, using the methods of funerary archaeology, landscape archaeology and remote sensing that are at the group’s disposal. From the field of biological sciences, the group is making use of genetics/DNA analysis, biological anthropology and biogeochemistry. As for environmental sciences, their contribution is in the form of palaeoclimatology, which studies climate before modern meteorological observations, and soil formation processes.

The project, coordinated by the discipline of archaeology at the University of Helsinki, will also welcome researchers from Mainz, London, Bristol and Budapest, in addition to which the group will collaborate with Czech, Slovak and Polish colleagues. Field studies and sample collection for the project will be conducted in Romania, Bulgaria, Hungary and Serbia.

In Helsinki, Volker Heyd’s main collaborator is Professor Heikki Seppä from the Department of Geosciences and Geography on the Kumpula Campus, while the team will also be hiring three postdoctoral researchers.

Yamna – East Bell Beaker migration 3000-2300 BC, after Heyd (2007, 2012)

Yamnaya from the east changed Europe forever

The researchers wish to understand how the Yamnaya migrated to Europe and how the arrival of a new culture changed an entire continent.

How many people actually arrived? Taking the scale of the changes, some estimates range in the millions, but according to Volker Heyd, the number of people representing the Yamnaya culture in southeast Europe was around several tens of thousands. It is indeed remarkable how such a relatively small group of people has had such a significant and far-reaching impact on Europe.

The Yamnaya also brought with them new cultural and social norms that have had far-reaching consequences. For instance, patriarchy and monogamy seem to be part of the Yamnaya legacy. Another established theory speculates that marriages made women migrate and travel even across great distances.

In accordance with primogeniture, the first-born son of the family inherited his parents’ possessions, while the younger siblings had to make their own way through other means. Among other things, this practice guaranteed ample human resources for the legions of the Roman Empire, which enabled its establishment and expansion, and later filled the ranks of medieval monasteries across Europe.

Another interesting question is what made representatives of the Yamnaya culture migrate from the eastern European steppes to the west. Heyd believes that the underlying reason may have been climate change. The Yamnaya were almost exclusively dependent on animal husbandry. As the climate changed – when rainfalls decreased in the east – they may have been forced to migrate west to secure the welfare of their cattle.

North-East Europe and Corded Ware

Heyd has already been here as a visiting professor in the Helsinki University Humanities programme since the beginning of the year, working on another project. Together with Postdoctoral Researcher Kerkko Nordqvist, he is investigating the prehistoric settlement of north-eastern Europe 3,000 – 6,000 years ago with research methods similar to the new Yamnaya project. One of their central research questions is what made people migrate to this region, and which innovations they brought with them. In this case also, the reasons behind the migration may be related to changes in the environment and climate.

This is probably bad news for research in the UK (I say probably because I guess many Brexiteers will be happy to have fewer foreign researchers in their country), but it is great news that both researchers, Heyd and Nordqvist (whose Ph.D. thesis includes research on the Corded Ware culture that I have recently mentioned), will be able to collaborate to assess Indo-European and Uralic migrations.

Heyd’s website at the University of Bristol states that he is currently working on:

  1. ‘The Milking Revolution in Temperate Neolithic Europe (NeoMilk)’. Funded by an ERC Advanced Grant, European Union, to R. Evershed. See, for further information: www.neomilk-erc.eu
  2. ‘The Yamnaya Impact’: Archaeology and scientific research of/into the Yamnaya populations of Southeastern Europe and their impact on contemporary local and neighboring 3rd millennium BC societies as well as their role in the emergence of the Corded Ware and Bell Beaker complexes in Europe.
  3. ‘The Prehistoric Peopling of Northeastern Europe’: Inter-/crossdisciplinary studies on the archaeology, anthropology, linguistics, and bio- and environmental sciences of early Uralic speakers and their first horizon of interactions with Indo-European speakers. This wider project is in cooperation with colleagues from Helsinki and Turku Universities in Finland, as well as from Russia, Estonia and Poland.
  4. ‘Czech Republic’: I am closely cooperating with the Institute of Archaeology, Czech Academy of Sciences, in Prague for two research projects funded by the Czech Grant Agency in which we measure various isotopes from human remains in Bristol to understand past mobility and diet. The Humboldt-Kolleg conference ‘Reinecke’s Heritage’ (with P. Pavúk, M. Ernée and J. Peska) held in June 2017 at Chateau Křtiny/Moravia is also part of this cooperation. See, for further information: http://ukar.ff.cuni.cz/reinecke.

Image modified from Narasimhan et al. (2018), including the most likely proto-language identification of different groups. Original description “Modeling results including Admixture events, with clines or 2-way mixtures shown in rectangles, and clouds or 3-way mixtures shown in ellipses”. See the original full image here.

On the genetic aspect, we have gross Yamna migrations today as clearly depicted as they will ever be: late Khvalynsk/Yamna expanded Late Proto-Indo-European languages, and Bell Beakers brought North-West Indo-European to almost all of Europe, as predicted in Harrison and Heyd (2007). Full stop.

There is still fine-grained population structure, though, as Lazaridis puts it, to be detected in migratory movements contemporary or subsequent to the Yamna settlements in South-East Europe and the East Bell Beaker expansion.

We will probably lack a comprehensive description of local archaeological cultural exchanges – to fit the potential dialectal developments and expansions – to be coupled with small-scale migratory movements in genetics, as more samples are made available.

This work from the University of Helsinki will hopefully provide the necessary detailed anthropological foundations to be used with future genetic studies to obtain a more precise picture of the formation and expansion of North-West Indo-Europeans.


Lazaridis’ evolutionary history of human populations in Europe

Preprint of a review by Iosif Lazaridis, The evolutionary history of human populations in Europe.

Interesting excerpts:

Steppe populations during the Eneolithic to Bronze Age were a mix of at least two elements[28], the EHG who lived in eastern Europe ~8kya and a southern population element related to present-day Armenians[28], and ancient Caucasus hunter-gatherers[22], and farmers from Iran[24]. Steppe migrants made a massive impact in Central and Northern Europe post- 5kya[28,43]. Some of them expanded eastward, founding the Afanasievo culture[43] and also eventually reached India[24]. These expansions are probable vectors for the spread of Late Proto-Indo-European[44] languages from eastern Europe into both mainland Europe and parts of Asia, but the lack of steppe ancestry in the few known samples from Bronze Age Anatolia[45] raises the possibility that the steppe was not the ultimate origin of Proto-Indo-European (PIE), the common ancestral language of Anatolian speakers, Tocharians, and Late Proto-Indo Europeans. In the next few years this lingering mystery will be solved: either Anatolian speakers will be shown to possess steppe-related ancestry absent in earlier Anatolians (largely proving the steppe PIE hypothesis), or they will not (largely falsifying it, and pointing to a Near Eastern PIE homeland).

Our understanding of the spread of steppe ancestry into mainland Europe is becoming increasingly crisp. Samples from the Bell Beaker complex[46] are heterogeneous, with those from Iberia lacking steppe ancestry that was omnipresent in those from Central Europe, casting new light on the “pots vs. people” debate in archaeology, which argues that it is dangerous to propose a tight link between material culture and genetic origins. Nonetheless, it is also dangerous to dismiss it completely. Recent studies have shown that people associated with the Corded Ware culture in the Baltics[23,33] were genetically similar to those from Central Europe and to steppe pastoralists[28,43], and the people associated with the Bell Beaker culture in Britain traced ~90% of their ancestry to the continent, being highly similar to Bell Beaker populations there. Bell Beaker-associated individuals were bearers of steppe ancestry into the British Isles that was also present in Bronze Age Ireland[47], and Iron Age and Anglo-Saxon England[48]. The high genetic similarity between people from the British Isles and those of the continent makes it more difficult to trace migrations into the Isles. This high similarity masks a very detailed fine-scale population structure that has been revealed by study of present-day individuals[49]; a similar type of analysis applied to ancient DNA has the potential to reveal fine-grained population structure in ancient European populations as well.

Steppe ancestry did arrive into Iberia during the Bronze Age[50], but to a much lesser degree. A limited effect of steppe ancestry in Iberia is also shown by the study of mtDNA[51], which shows no detectible change during the Chalcolithic/Early Bronze Age[51], in contrast to central Europe[52]. Sex-biased gene flow has been implicated in the spread of steppe ancestry into Europe[33,53], although the presence and extent of such bias has been debated[54,55]. One aspect of the demographies of males and females was clearly different, as paternally-inherited Y-chromosome lineages experienced a bottleneck <10 kya which is not evident in maternally-inherited mtDNA[56], suggesting that many men living today trace their patrilineal ancestry to a relatively small number of men of the Neolithic and Bronze Ages.

Modified image, from the preprint. “A sketch of European evolutionary history based on ancient DNA. Bronze Age Europeans (~4.5-3kya) were a mixture of mainly two proximate sources of ancestry: (i) the Neolithic farmers of ~8-5kya who were themselves variable mixtures of farmers from Anatolia and hunter-gatherers of mainland Europe (WHG), and (ii) Bronze Age steppe migrants of ~5kya who were themselves a mixture of hunter-gatherers of eastern Europe (EHG) and southern populations from the Near East (…)”

Firstly, Tocharian (mentioned side by side with Anatolian and LPIE) has long been considered by linguists to be more archaizing than the rest, hence the linguistic proposal that it separated first (found to correspond beautifully with the expansion of Khvalynsk/Repin into Afanasevo); but it separated first from the common Late PIE trunk. Anatolian clearly separated earlier, from a Middle PIE stage.

Secondly, while Genomics could no doubt falsify the Balkan route for Anatolian, and make us come back to a Maykop route from the steppe (or even a Near Eastern PIE homeland, who knows), I doubt such falsification could come simply from sampled “Anatolian speakers”:

If there is no steppe ancestry in Anatolian speakers (of the 2nd millennium BC), a dismissal of the mainstream migration model could happen only when both potential routes of expansion, the selected cultures from the Balkans and the Caucasus, are sampled in the appropriate time period since the estimated separation (i.e. from the 5th millennium BC), until one of the two routes shows the right migration picture.

On the other hand, if some samples from either Romania/Bulgaria or the Caucasus (and/or Anatolian speakers) show steppe ancestry and/or R1b-M269 lineages, as is expected, then the matter won’t need much more explanation.

In fact, the text goes on to describe how male lineages experienced a bottleneck after ca. 8000 BC, i.e. accompanying Neolithisation – probably including the formation of Sredni Stog and early Khvalynsk, as is now becoming clear – when explaining how it is possible to demonstrate that East Bell Beaker migrants (of R1b-L23 lineages, it is to be understood) with little steppe ancestry reached Iberia.

This was already pointed out not long ago by David Reich, and I am glad to see more scholars showing the importance of taking phylogeography into account over statistical methods when assessing migrations, even if it is only used in those cases in which it does not disrupt previous interpretations too much, like that of the 2015 papers and the proposal of the ‘Yamnaya ancestral component’.

I found it refreshing that for the first time Corded Ware migrants – or, rather, their shared genetic relationship with Eneolithic steppe groups – were accepted (if only indirectly) as a confounding factor in assessing migrations of Bell Beakers. It is a step in the right direction, and it is a relief to read this from someone working with the Reich Lab.

Not just a few (and not only amateurs) are still scratching their heads trying to explain with the most imaginative (and unnecessary) novel migration routes the elevated steppe ancestry and closer relation (PCA cluster, FST, F3, etc.) to CWC and Yamna (due evidently to the absorbed CWC population) in some of the recently published Bell Beaker samples from Central Europe, the Netherlands, and later in Great Britain, compared to samples of South-East Europe near the Middle to Upper Danube region, the obvious homeland of East Bell Beakers, formed from Yamna settlers.

I found it also interesting that Lazaridis mentioned a southern population element related to CHG and Iran farmers. This should help dissipate the hype that some have artificially created as of late over a potential Northern Iranian homeland based on a single paragraph from David Reich’s book.

EDIT (9 MAY 2018). Lazaridis posted an answer to my questioning of potential Proto-Anatolian origins divided in tweets (I post a link to the first tweet, then the text in full):

The steppe hypothesis predicts some genetic input from eastern Europe (EHG) to Anatolia.

– Bronze Age Anatolians (Lazaridis et al. 2017) from historically IE-speaking Pisidia lack EHG; more samples obviously needed


  1. Additional Anatolian samples will have EHG: consistent with steppe PIE
  2. Additional Anatolian samples will not have EHG, then either:
    1. Steppe not PIE homeland
    2. Steppe PIE homeland but linguistic impact in Anatolia vastly greater than genetic impact

Tentative steppe->Anatolia movements reach Balkans early (Mathieson et al. 2018) and Armenia (some EHG in Lazaridis et al. 2016).

But not the last leg to Anatolia_ChL (Lazaridis et al. 2016) or Anatolia_BA (Lazaridis et al. 2017).

  • If Anatolians consistently don’t have EHG, steppe PIE is very difficult to affirm; Near Eastern alternative likely (contributing CHG/Iran_N-related ancestry to both western Anatolia/steppe)
  • If Anatolians have EHG, one could further investigate by what route they got it.

One way or another PIE homeland problem is almost solved IMHO, which is what my review tries to get at in that short section


Ancient DNA study reveals HLA susceptibility locus for leprosy in medieval Europeans


Open access Ancient DNA study reveals HLA susceptibility locus for leprosy in medieval Europeans, by Krause-Kyora et al., Nature Communications (2018)


Leprosy, a chronic infectious disease caused by Mycobacterium leprae (M. leprae), was very common in Europe till the 16th century. Here, we perform an ancient DNA study on medieval skeletons from Denmark that show lesions specific for lepromatous leprosy (LL). First, we test the remains for M. leprae DNA to confirm the infection status of the individuals and to assess the bacterial diversity. We assemble 10 complete M. leprae genomes that all differ from each other. Second, we evaluate whether the human leukocyte antigen allele DRB1*15:01, a strong LL susceptibility factor in modern populations, also predisposed medieval Europeans to the disease. The comparison of genotype data from 69 M. leprae DNA-positive LL cases with those from contemporary and medieval controls reveals a statistically significant association in both instances. In addition, we observe that DRB1*15:01 co-occurs with DQB1*06:02 on a haplotype that is a strong risk factor for inflammatory diseases today.

Relationship of 53 medieval leprosy-positive Danes to contemporary Europeans. Principal component analysis plot for 53 medieval St. Jørgen individuals in relation to European population samples from the 1000 Genomes project. (CEU, Northern Europeans from Utah; GBR, British in England and Scotland; IBS, Iberian population in Spain; TSI, Tuscans in Italy; FIN, Finnish in Finland)

The study shows mtDNA haplogroups comparable to those of northern Europeans today, and findings in general indicate no major genome-wide changes in the Danish population structure in the past 1000 years.

The paper may be of interest for earlier migrations:


Discovered via Iain Mathieson:


Proto-Indo-European homeland south of the Caucasus?

User Camulogène Rix at Anthrogenica posted an interesting excerpt of Reich’s new book in a thread on ancient DNA studies in the news (emphasis mine):

Ancient DNA available from this time in Anatolia shows no evidence of steppe ancestry similar to that in the Yamnaya (although the evidence here is circumstantial as no ancient DNA from the Hittites themselves has yet been published). This suggests to me that the most likely location of the population that first spoke an Indo-European language was south of the Caucasus Mountains, perhaps in present-day Iran or Armenia, because ancient DNA from people who lived there matches what we would expect for a source population both for the Yamnaya and for ancient Anatolians. If this scenario is right the population sent one branch up into the steppe-mixing with steppe hunter-gatherers in a one-to-one ratio to become the Yamnaya as described earlier- and another to Anatolia to found the ancestors of people there who spoke languages such as Hittite.

The thread has since, predictably, become a trolling hell, and it seems not to have been working properly for hours now.

Reich’s proposal based on ancestral components to explain the formation of a people and language is a continuation of their emphasis on ancestry to explain cultures and languages. It seems quite interesting to see this happen again, given their current trend to surreptitiously modify their previous ‘Yamnaya ancestry’ concept and Yamnaya millennia-long R1a-R1b community (that supposedly explains a Yamna -> Corded Ware -> Bell Beaker migration) to a more general ‘steppe people’ sharing a ‘steppe ancestry’ who spoke a ‘steppe language’.

Interesting arrows of dispersal of steppe ancestry, from Yamna -> Corded Ware -> Bell Beaker, from David Reich’s new book (yes, from 2018, number one bestseller on Amazon.com).

This new idea based on ancestral components suffers thus from the same essential methodological problems, which equate it – yet again – to pure speculation:

  1. It is a conclusion based on the genomic analysis of few individuals from distant regions and different periods, and – maybe more disturbingly – on the lack of steppe ancestry in the few samples at hand.
  2. Wait, what? Steppe ancestry? So they are trying to derive potential genetic connections among specific prehistoric cultures with a poorly depicted genetic sketch, based on previous flawed concepts (instead of on anthropological disciplines), which seems a rather long stretch for any scientist, whether they are content with seeing themselves as barbaric scientific conquerors of academic disciplines or not. In other words, statistics is also science (in fact, the main one to assert anything in almost any scientific field), and you cannot overcome essential errors (design, sampling, hypothesis testing) merely by using a priori correct statistical methods. Results obtained this way constitute a statistical fallacy.

  3. Even if the sampling and hypothesis testing were fine, to derive anthropological models from genomic investigation is completely wrong. Ancestral component ≠ population.
  4. To include not only potential migrations, but also languages spoken by these potential migrants? It’s sad that we have a need to repeat it, but if ancestral component ≠ population, how could ancestral component = language?

The Proto-Indo-European-speaking community

This is what we know about the formation of a Proto-Indo-European community (i.e. a community speaking a reconstructible Proto-Indo-European language) in the Pontic-Caspian steppe, which is based on linguistic reconstruction and guesstimates, tracing archaeological cultures backwards from cultures known to have spoken ancient (proto-)languages, and helping both disciplines with anthropological models (for which ancient genomics is only helping select certain details) of migration or – rarely – cultural diffusion:

NOTE. The following dates are obviously simplified. Read here a more detailed linguistic assessment based on phonology.

Most likely Pre-Proto-Anatolian migration with Suvorovo-Novodanilovka chiefs in the North Pontic steppe and the Balkans.
  • ca. 5000 BC. Early Proto-Indo-European (or Indo-Uralic) spoken probably during the formation and development of a loose Early Khvalynsk – Sredni Stog I cultural-historical community over the Pontic-Caspian steppe region, whose indigenous population probably had mainly Caucasus hunter-gatherer ancestry.
  • ca. 4500 BC. Khvalynsk probably speaking Middle Proto-Indo-European expands, most likely including Suvorovo-Novodanilovka chiefs into the North Pontic steppe, and probably expanding R1b-M269 lineages for the first time.
  • ca. 4000 BC. Separated communities develop, including North Pontic cultures probably gradually dominated by R1a-Z645 (potentially speaking Proto-Uralic); and Khvalynsk (and Repin) cultures probably dominated by R1b-L23 lineages, most likely developing a Late Proto-Indo-European already separated from Proto-Anatolian.
  • ca. 3500 BC. A Proto-Corded Ware population dominated by R1a-Z645 expands to the north, and slightly later an early Yamna community develops from Late Khvalynsk and Repin, expanding to the west of the Don River, and to the east into Afanasevo. This is most likely the period of reduction of variability and expansion of subclades of R1a-Z645 and R1b-L23 that we expect to see with more samples.
  • ca. 3000 BC. Expansion of Corded Ware migrants in northern Europe, and Yamna migrants along the Danube and into the Balkans, with further reduction and expansion of certain subclades.
  • ca. 2500 BC. Expansion of Bell Beaker migrants dominated by R1b-L51 subclades in Europe, and late Corded Ware migrants in east Yamna expanding R1a-Z93 subclades.

All these events are compatible with language reconstruction in mainstream European schools since at least the 1980s, are supported by traditional archaeological research of the past 20 years, and are being confirmed with Genomics.

For those willingly lost in a myriad of new dreams boosted by the shallow comment contained in David Reich’s paragraph on CHG ancestry, even he does not doubt that the origin of Late Proto-Indo-European lies in Yamna, to the north of the Caucasus, based on Anthony’s (2007) account:

Both images from the book, posted by Twitter user Jasper at https://twitter.com/jaspergregory.

NOTE: By the way, David Anthony, one of the main sources of information for Reich’s group, never considered Corded Ware to have received Yamna migrants, and although he changed his model due to the conclusions of the 2015 papers, he has recently changed it again to adapt it to the inconsistencies found in phylogeography.

CHG ancestry and PIE homeland south of the Caucasus

As for the potential origins of CHG ancestry in early Proto-Indo-European speakers, I already stated clearly my opinion quite recently. They may be attributed to:

Just to be clear, an expansion of Proto-Anatolian to the south, through the Caucasus, cannot be discarded today. It will remain a possibility until Maykop and more Balkan Chalcolithic and Anatolian-speaking samples are published.

However, an original Early Proto-Indo-European community south of the Caucasus seems to me highly unlikely, based on anthropological data, which should drive any conclusion. From what I could read, here are the rather simplistic arguments used:

  • Gimbutas and Maykop: Maykop was thought to be (in Gimbutas’ times) a rather late archaeological culture, directly connected to a Transcaucasian Copper Age culture ca. 2400-2300 BC. It has been demonstrated in recent years that this culture is substantially older, and even then language guesstimates for a Late PIE / Proto-Anatolian would not fit a migration to the north. While our ignorance may certainly be used to derive far-fetched conclusions about potential migrations from and to it, using Gimbutas (or any archaeological theory until the 1990s) today does not make any sense. Still less if we think that she favoured a steppe homeland.

NOTE. It seems that the Reich Lab may already have access to Maykop samples, so this suggested Proto-Indo-European – Maykop connection may have some real foundation. Regardless, we already know that intense contacts happened, so there will be no surprise (unless Y-DNA shows some sort of direct continuity from one to the other).

  • Gamkrelidze & Ivanov: they argued for an Armenian homeland (and are thus at the origin of yet another autochthonous continuity theory), but they did so to support their glottalic theory, i.e. merely to support what they saw as favouring their linguistic model (with Armenian being the most archaic dialect). The glottalic theory is supported today – as far as I know – mainly by Kortlandt, Jagodziński, or (Nostraticist) Bomhard, but even they most likely would not need to argue for an Armenian homeland. In fact, their support of a Graeco-Aryan group (also supported by Gamkrelidze & Ivanov) would be against this, at least in archaeological terms.
  • Colin Renfrew and the Anatolian homeland: This conceptual umbrella of language spreading with farming everywhere has changed so much and so many times in the past 20 years, with so many glottochronological and archaeological estimates circulating, that you can support anything by now using them. Mostly used today for abstract models of long-lasting language contacts, cultural diffusion, and constellation analogies. Anyway, he strives to keep up-to-date information to revise the model, that much is certain:
  • Glottochronology, phylogenetic trees, Swadesh list analysis, statistical estimates, psychics, pyramid power, and healing crystals: no, please, no.
Science Magazine
“A first line of evidence comes from linguistic analysis based on quantitative lexical data, which returned a tree compatible with the Anatolian hypothesis

In principle, unlike many other recent autochthonous continuity theories, I doubt there can be much racial-based opposition anywhere in the world to an origin of Proto-Indo-European in the Middle East, where the oldest civilizations appeared – apart, obviously, from modern Northeast and Northwest Caucasian, Kartvelian, or Semitic speakers, who may in turn have to revisit their autochthonous continuity theories radically…

Nevertheless, it is obvious that prehistoric (and many historic) migrations are signalled by the reduction in variability and expansion of certain Y-DNA haplogroups, and not just by ancestral components. That is generally accepted, although the reasons for this almost universal phenomenon are not always clear.

In fact, Proto-Anatolian and Common Anatolian speakers need not share any ancestral component, PCA cluster, or any other statistical parameter related to steppe populations, not even the same Y-DNA haplogroups, given that approximately three thousand years might have passed between their split from an Indo-Hittite community and the first attested Anatolian-speaking communities… We must carefully follow their tracks from Anatolia ca. 1500 BC to the steppe ca. 4500 BC, otherwise we risk creating another mess like the Corded Ware one.

In my opinion, the substantial contribution of EHG ancestry and R1a-M417 lineages to the Pontic-Caspian steppe (probably ca. 6500 BC) from Central or East Eurasia is the most recent sizeable genomic event in the region, and thus the best candidate for the community that expanded a language ancestral to Proto-Indo-European – whether you call it Pre-Proto-Indo-European, Pre-Indo-Uralic, or Eurasiatic, depending on your preferences.

An early (and substantial) contribution of CHG ancestry in Khvalynsk relative to North Pontic cultures, if it is found with new samples, may actually be a further proof of the Caucasian substrate of Proto-Indo-European proposed by Kortlandt (or Bomhard) as contributing to the differentiation of Middle PIE from Uralic. Genomics could thus help support, again, traditional disciplines in accepting or rejecting academic controversial theories.


In the case of an Early PIE (or Indo-Uralic) homeland, genomic data is scarce. But all traditional anthropological disciplines point to the Pontic-Caspian steppe, so we should stick to it, regardless of the informal suggestion written by a renowned geneticist in one paragraph of a book conceived as an introduction to the field.

It seems we are not learning much from the hundreds of peer-reviewed, statistically (superficially, at least) sound genetic papers whose anthropological conclusions have been proven wrong by now. A lot of people should be spending their time learning about the complex, endless methods at hand in this kind of research – not just bioinformatics – instead of fruitlessly speculating about wild unsubstantiated proposals.

As a final note, I would like to remind some in the discussion, who seem to dismiss the identification of CHG with Proto-Indo-European by supporting a “R1a-R1b” community for PIE, of their previous commitment to ancestral components in identifying peoples and languages, and thus their support for Reich’s (and his group’s) fundamental premises.

You cannot have it both ways. At least David Reich is being consistent.


First Iberian R1b-DF27 sample, probably from incoming East Bell Beakers


I had some more time to read the paper by Valdiosera et al. (2018) and its supplementary material.

One of the main issues since the publication of Olalde et al. (2018) (and its hundreds of Bell Beaker samples) was the lack of clear Y-DNA R1b-DF27 subclades among East Bell Beaker migrants, which left us wondering when the subclade entered the Iberian Peninsula, since it could have (theoretically) happened at any time from the Chalcolithic to the Iron Age.

My prediction was that this lineage found today widespread among the Iberian population crossed the Pyrenees quite early, during the Chalcolithic, with migrating East Bell Beakers expanding North-West Indo-European dialects, and that it spread slowly afterwards.

The first ancient sample clearly identified as of R1b-DF27 subclade is found in this paper, at the Late Bronze Age site Cueva de los Lagos. Although it is unidentified and has no radiocarbon date, the site as a whole is associated with the Cogotas culture and its Boquique ceramic decoration.

Y-DNA and mtDNA haplogroups, from the paper. Sequencing statistics and contamination rates for newly generated sequence data.

It was found in the northern part of the Cogotas culture territory (which lies mainly between Castile and Aragon, in North-Central Spain) and shows evident steppe admixture. It has become obvious with the latest papers (including this one) that R1b-M269 lineages intruded south of the Pyrenees associated with East Bell Beaker migrations.

The Proto-Cogotas culture is associated with a Bell Beaker substrate influenced by either El Argar or Atlantic Bronze, and the specific type of ceramics found at this Cogotas culture site is probably from the mid-2nd millennium, which is too early for the Celtic expansion.

Supervised ADMIXTURE results.

Nevertheless, due to the quite likely late date of the sample (in the centuries around 1500 BC), there is still a possibility that incoming R1b-DF27 lineages were not among the early R1b-M269 lineages found in the Iberian Chalcolithic, and were associated with later migrations from Central Europe, potentially linked to the expansion of the Urnfield culture, and thus nearer to an Italo-Celtic community.

Diachronic map of migrations in Europe ca. 1250-750 BC.

In any of these scenarios, a Pre-Celtic expansion of North-West Indo-European in Iberia (possibly associated with Lusitanian) is still the best explanation for the origin and expansion of (at least some) modern Iberian R1b-DF27 lineages, including those found among the Basque-speaking population.

This implies that the ‘indigenous’ Neolithic lineages of Iberia (like I2 and G2a2) were replaced with subsequent internal gene flows and founder effects, such as those that evidently happened (probably quite recently) among Basques, even though indigenous languages show an obvious continuity.

I would say this is the last nail in the coffin for autochthonous Y-DNA continuity theories for Spain and France (i.e. for the traditional Vasconic-Uralic hypothesis), but we know that data is never enough for any die-hard continuist… so let’s just say another nail in the coffin for endless autochthonous continuity theories.

EDIT (18 & 26 MAR 2018): Genetiker has published Y-SNP calls for both R1b samples, showing this one is R1b1a1a2a1a2a-BY15964 (see modern members of this subclade in ytree), and that the other one is R1b1a1a2a~L23.


Language continuity despite population replacement in Remote Oceania


New article (behind paywall) Language continuity despite population replacement in Remote Oceania, by Posth et al., Nat. Ecol. Evol. (2018).


Recent genomic analyses show that the earliest peoples reaching Remote Oceania—associated with Austronesian-speaking Lapita culture—were almost completely East Asian, without detectable Papuan ancestry. However, Papuan-related genetic ancestry is found across present-day Pacific populations, indicating that peoples from Near Oceania have played a significant, but largely unknown, ancestral role. Here, new genome-wide data from 19 ancient South Pacific individuals provide direct evidence of a so-far undescribed Papuan expansion into Remote Oceania starting ~2,500 yr BP, far earlier than previously estimated and supporting a model from historical linguistics. New genome-wide data from 27 contemporary ni-Vanuatu demonstrate a subsequent and almost complete replacement of Lapita-Austronesian by Near Oceanian ancestry. Despite this massive demographic change, incoming Papuan languages did not replace Austronesian languages. Population replacement with language continuity is extremely rare—if not unprecedented—in human history. Our analyses show that rather than one large-scale event, the process was incremental and complex, with repeated migrations and sex-biased admixture with peoples from the Bismarck Archipelago.

So, despite the population replacement in Oceania seen recently in Genomics, the people of present-day Vanuatu continue to speak languages descended from those spoken by the initial Austronesian inhabitants, rather than any Papuan language of the incoming migrants.

Professor Gray, Director of the Department of Linguistic and Cultural Evolution at the MPI-SHH, says:

Population replacement with language continuity is extremely rare – if not unprecedented – in human history. The linguist Bob Blust has long argued for a model in which a separate Papuan expansion reaches Vanuatu soon after initial Austronesian settlement, with the initial, and likely undifferentiated, Austronesian language surviving as a lingua franca for diverse Papuan migrant groups.

Dr. Adam Powell, senior author of the study and also of the MPI-SHH, continues,

The demographic history suggested by our ancient DNA analyses provides really strong support for this historical linguistic model, with the early arrival and complex, incremental process of genetic replacement by people from the Bismarck Archipelago. This provides a compelling explanation for the continuity of Austronesian languages despite the almost complete replacement of the initial genetic ancestry of Vanuatu.

Maps showing the migrations in the area, including, in the final map, the migrations revealed by the current study. Credit: Hans Sell, adapted from Skoglund et al. Genomic insights into the peopling of the Southwest Pacific. Nature (2016).

I think we can safely disagree now with their assertion. We are seeing more and more cases of language continuity in spite of population replacement quite clearly in Eurasian prehistory. At least:

All these cases can be explained with founder effects and gradual expansions after an initial arrival, maybe also initial close interaction between different ethnic groups, where one group (and its language) becomes the dominant one.

NOTE. Even if an alternative model is selected (say, that Corded Ware migrants spoke Indo-European languages), alternative language continuity events need to be proposed for some of these regions, so we are beyond their description as ‘rare language events’ already.

What is becoming clearer with ancient samples, therefore, is that there is little space for prehistoric cultural diffusion events (at least massive ones), which were quite popular explanations before the advent of genetic studies.


Consequences of O&M 2018 (I): The latest West Yamna “outlier”

This is the first of a series of posts analyzing the findings of the recent Nature papers Olalde et al. (2018) and Mathieson et al. (2018) (abbreviated O&M 2018).

As expected, the first Y-DNA haplogroup of a sample from the North Pontic region (apart from an indigenous European I2 subclade) during its domination by the Yamna culture is of haplogroup R1b-L23, and it is dated ca. 2890-2696 BC. More specifically, it is of Z2103 subclade, the main lineage found to date in Yamna samples. The site in question is Dereivka, “in the southern part of the middle Dnieper, at the boundary between the forest-steppe and the steppe zones”.

NOTE. A bit of history for those lost here, who appear to be many: the classical Yamna culture – from previous late Khvalynsk and (probably) Repin groups – spread west of the Don ca. 3300 BC, creating a cultural-historical community – and also an early offshoot into Asia – with mass migrations following some centuries later along the Danube to the Carpathian Basin, but also south into the Balkans, and north along the Prut. There is thus a very short time frame to find Yamna peoples shaping these massive migrations – the most likely speakers of Late Proto-Indo-European dialects – in Ukraine, compared to their most stable historical settlements east of the Don River.

There is no data on this individual in the supplementary material – since Eneolithic Dereivka samples come from stored dental remains – , but the radiocarbon date (if correct) is unequivocal: the Yamna cultural-historical community dominated over that region at that precise time. Why would the authors name it just “Ukraine_Eneolithic”? They surely took the assessment of archaeologists, and there is no data on it, so I agree this is the safest name to use for a serious paper. This would not be the first sample apparently too early for a certain culture (e.g. Catacomb in this case) which ends up being nevertheless classified as such. And it is also not impossible that it represents another close Ukraine Eneolithic culture, since ancestral cultural groups did not have borders…

NOTE. Why, on the other hand, was the sample from Zvejnieki – classified as of Latvia_LN – assumed to correspond to “Corded Ware” (like the recent samples from Plinkaigalis242 or Gyvakarai1), when we don’t have data on their cultures either? No conspiracy here, just taking assessments from different archaeologists in charge of these samples: those attributed to “Corded Ware” have equally been judged solely by radiocarbon date, but combining the known archaeological signs of herding arriving in the region around this time with the old belief (similar to the “Iberia is the origin of Bell Beaker peoples” meme) that “only the Corded Ware culture signals the arrival of herding in the Baltic”. This assumption has been contested recently by Furholt, in an anthropological model that is now mainstream, upheld also by Anthony.

We already know that, out of three previous West Yamna samples, one shows Anatolian Neolithic ancestry, the so-called “Yamna outlier”. We also know that one sample from Yamna in Bulgaria also shows Anatolian Neolithic ancestry, with a distinct ‘southern’ drift, clustering closely to East Bell Beaker samples, as we can still see in Mathieson et al. (2018), see below. So, two “outliers” (relative to East Yamna samples) out of four samples… Now a new, fifth sample from Ukraine is another “outlier”, coinciding with (and possibly somewhat late to be a part of) the massive migration waves into Central Europe and the Balkans predicted long ago by academics and now confirmed with genomics.

I think there are two good explanations right now for its ancestral components and position in PCA:

Modified image from Mathieson et al. (2018), including also approximate location of groups from Mittnik et al. (2018), and group (transparent shape outlined by dots) formed by new Bell Beaker samples from Olalde et al. (2018). “Principal components analysis of ancient individuals. Points for 486 ancient individuals are projected onto principal components defined by 777 present-day west Eurasian individuals (grey points). Present-day individuals are shown.”

a) The most obvious one, that the Dnieper-Dniester territory must have been a melting pot, as I suggested, a region which historically connected steppe, forest steppe, and forest zone with the Baltic, as we have seen with early Baltic Neolithic samples (showing likely earlier admixture in the opposite direction). The Yamna population, a rapidly expanding “elite group of patrilineally-related families” (words from the famous 2015 genetic papers, not mine), whose only common genetic trait is therefore Y-DNA haplogroup R1b-L23, must have necessarily acquired other ancestral components of Eneolithic Ukraine during the migrations and settlements west of the Don River.

How many generations are needed for ancestral components and PCA clusters to change to that extent, in regions where only some patrilocal chiefs but indigenous populations remain, and the population probably admixed due to exogamy, back-migrations, and “resurge” events? Not many, obviously, as we see from the differences among the many Bell Beaker samples of R1b-L23 subclades from Olalde et al. (2018).

b) That this sample shows the first genetic sign of the precise population that contributed to the formation of the Catacomb culture. Since it is a hotly debated topic where and how this culture actually formed to gradually replace the Yamna culture in the central region of the Pontic-Caspian steppe, this sample would be a good hint of how its population came to be.

For freely available articles on the Catacomb culture, see e.g. its entry in the Encyclopedia of Indo-European Culture, Catacomb culture wagons of the Eurasian steppes, or The Warfare of the Northern Pontic Steppe – Forest-Steppe Pastoral Societies: 2750 – 2000 B.C. There are also many freely available Russian and Ukrainian papers on anthropometry (a discipline I don’t especially like) which clearly show early radiocarbon dates for different remains.

This could then be not ‘just another West Yamna outlier’, but would actually show meaningful ‘resurge’ of Neolithic Ukraine ancestry in the Catacomb culture.

It could be meaningful to derive hypotheses, in the same way that the late Central European CWC sample from Esperstedt (of R1a-M417 subclade) shows recent exogamy directly from the (now more probably eastern part of the) steppe or steppe-forest, and thus implies great mobility among distant CWC groups. Although, given the BB samples with elevated steppe ancestry and close PCA cluster from Olalde et al. (2018), it could also just mean exogamy from a nearby region, around the Carpathian Basin where Yamna migrants settled…

If this were the case, it would then potentially mean a “continuity” break in the steppe, in the region that some looked for as a Balto-Slavic homeland, and which would have been only later replaced by Srubna peoples with steppe ancestry (and probably R1a-Z93 subclades). We would then be more obviously left with only two options: a hypothetical ‘Indo-Slavonic’ North Caspian group to the east (supported by Kortlandt), or a Central-East European homeland near Únětice, as one of the offshoots from the North-West Indo-European group (supported by mainstream Indo-Europeanists).

How to know which is the case? We have to wait for more samples in the region. For the moment, the date seems too early for the known radiocarbon dating of most archaeological remains of the Catacomb Culture.

Diachronic map of Late Copper Age migrations including steppe groups ca. 2600-2250 BC

An important consequence of the addition of these “Yamna outliers” for the future of research on Indo-European migrations is that, especially if confirmed as just another West Yamna sample – with more, similar samples – , early Palaeo-Balkan peoples migrating south of the Danube and later through Anatolia may need to be judged not only in terms of ancestral components or PCA (as in the paper on Minoans and Mycenaeans), but also and more decisively using phylogeography, especially with the earliest samples potentially connected with such migrations.

NOTE. Regarding the controversy (that some R1b European autochthonous continuists want to create) over the origin of the R1b-L151 lineages, we cannot state its presence for sure in Yamna territory right now, but we already have R1b-M269 in the eastern Pontic-Caspian steppe during the Neolithic-Chalcolithic transition, then R1b-L23 and subclades (mainly R1b-Z2103, but also one xZ2103, xL51 which suggests its expansion) in the region before and during the Yamna expansion, and now we have L51 subclades with elevated steppe ancestry in early East Bell Beakers, which most likely descended from Yamna settlers in the Carpathian Basin (yet to be sampled).

Even without express confirmation of its presence in the steppe, the alternative model of a Balkan origin seems unlikely, given the almost certain continuity of expanding Yamna clans as East Bell Beaker ones, in this clearly massive and relatively quick expansion that did not leave much time for founder effects. But, of course, it is not impossible to think about a previously hidden R1b-L151 community in the Carpathian Basin yet to be discovered, adopting North-West Indo-European (by some sort of founder effect) brought there by Yamna peoples of exclusively R1b-Z2103 lineages. Just as it is not impossible to think about a hidden and ‘magically’ isolated community of haplogroup R1a-M417 in Yamna waiting to be discovered… Just not very likely, either option.

As to why this sample or the other Bell Beaker samples “solve” the question of R1a-Z645 subclades (typical of Corded Ware migrants) not expanding with Yamna, it’s very simple: it doesn’t. What should have settled that question – in previous papers, at least since 2015 – is the absence of this subclade in elite chiefs of clans expanded from Khvalynsk, Yamna, or their only known offshoots Afanasevo and Bell Beaker. Now we only have still more proof, and no single ‘outlier’ in that respect.

No haplogroup R1a among hundreds of samples from a regionally extensive sampling of the only cultures mainstream archaeologists had thoroughly described as potentially representing Indo-European-speakers should mean, for any reasonable person (i.e. without a personal or professional involvement in an alternative hypothesis), that Corded Ware migrants (as expected) did not stem from Yamna, and thus did not spread Late Indo-European dialects.

This haplogroup’s hegemonic presence in North-Eastern Europe – and the lack of N1c lineages until after the Bronze Age – coinciding with dates when Uralicists have guesstimated Uralic dialectal expansion across this wide region makes the question of the language spread with CWC still clearer. The only surprise would have been to find a hidden and isolated community of R1a-Z645 lineages clearly associated with the Yamna culture.

NOTE. A funny (however predictable) consequence for R1a autochthonous continuists of Northern or Eastern European ancestry: forum commentators are judging if this sample was of the Yamna culture or spoke Indo-European based on steppe component and PCA cluster of the few eastern Yamna samples which define now (you know, with the infallible ‘Yamnaya ancestral component’) the “steppe people” who spoke the “steppe language”™ – including, of course, North-Eastern European Late Neolithic

Not that radiocarbon dates or the actual origin of this sample cannot be wrong, mind you, it just strikes me how twisted such biased reasonings may be, depending on the specific sample at hand… Denial, anger, and bargaining, including shameless circular reasoning – we know the drill: we have seen it a hundred times already, with all kinds of supremacists autochthonous continuists who still today manage to place an outdated mythical symbolism on expanding Proto-Indo-Europeans, or on regional ethnolinguistic continuity…

More detailed posts on the new samples from O&M 2018 and their consequences for the Indo-European demic diffusion to come, indeed…

