Arrival of steppe ancestry with R1b-P312 in the Mediterranean: Balearic Islands, Sicily, and Iron Age Sardinia

New preprint: The Arrival of Steppe and Iranian Related Ancestry in the Islands of the Western Mediterranean, by Fernandes, Mittnik, Olalde et al., bioRxiv (2019).

Interesting excerpts (emphasis in bold; modified for clarity):

Balearic Islands: The expansion of Iberian speakers

Mallorca_EBA dates to the earliest period of permanent occupation of the islands at around 2400 BCE. We parsimoniously modeled Mallorca_EBA as deriving 36.9 ± 4.2% of her ancestry from a source related to Yamnaya_Samara; (…). We next used qpAdm to identify “proximal” sources for Mallorca_EBA’s ancestry that are more closely related to this individual in space and time, and found that she can be modeled as a clade with the (small) subset of Iberian Bell Beaker culture associated individuals who carried Steppe-derived ancestry (p=0.442).

Suppl. Materials: The model used was with Bell_Beaker_Iberia_highsteppe, a group of outliers from Iberia buried in a Bell Beaker mortuary context who unlike most individuals from this context in that region had high proportions of Steppe ancestry (p=0.442).

Our estimates of Steppe ancestry in the two later Balearic Islands individuals are lower than the earlier one: 26.3 ± 5.1% for Formentera_MBA and 23.1 ± 3.6% for Menorca_LBA, but the Middle to Late Bronze Age Balearic individuals are not a clade relative to non-Balearic groups. Specifically, we find that f4(Mbuti.DG, X; Formentera_MBA, Menorca_LBA) is positive when X=Iberia_Chalcolithic (Z=2.6) or X=Sardinia_Nuragic_BA (Z=2.7). While it is tempting to interpret the latter statistic as suggesting a genetic link between peoples of the Talaiotic culture of the Balearic islands and the Nuragic culture of Sardinia, the attraction to Iberia_Chalcolithic is just as strong, and the mitochondrial haplogroup U5b1+16189+@16192 in Menorca_LBA is not observed in Sardinia_Nuragic_BA but is observed in multiple Iberia_Chalcolithic individuals. A possible explanation is that both the ancestors of Nuragic Sardinians and the ancestors of Talaiotic people from the Balearic Islands received gene flow from an unsampled Iberian Chalcolithic-related group (perhaps a mainland group affiliated to both) that did not contribute to Formentera_MBA.
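
For readers unfamiliar with the statistic quoted above, here is a minimal sketch of how an f4 value and a block-jackknife Z-score can be computed from allele frequencies. Everything in it is an assumption made for illustration: the frequencies are invented toy numbers, the function names are hypothetical, and the blocks are simply equal-sized runs of consecutive SNPs. Real analyses use dedicated software, hundreds of thousands of SNPs, and blocks defined by genomic position. The sketch only shows the sign convention: f4(Outgroup, X; C, D) is positive when X shares more alleles with D than with C.

```python
from math import sqrt


def f4(a, b, c, d):
    """Mean over SNPs of (a - b) * (c - d) for four allele-frequency lists."""
    n = len(a)
    return sum((a[i] - b[i]) * (c[i] - d[i]) for i in range(n)) / n


def f4_z(a, b, c, d, n_blocks=5):
    """Z-score from a delete-one-block jackknife (equal-sized blocks assumed)."""
    size = len(a) // n_blocks
    used = size * n_blocks
    estimates = []
    for k in range(n_blocks):
        # Keep every SNP except those in block k.
        keep = [i for i in range(used) if not (k * size <= i < (k + 1) * size)]
        estimates.append(
            f4([a[i] for i in keep], [b[i] for i in keep],
               [c[i] for i in keep], [d[i] for i in keep])
        )
    mean = sum(estimates) / n_blocks
    variance = (n_blocks - 1) / n_blocks * sum((e - mean) ** 2 for e in estimates)
    return f4(a, b, c, d) / sqrt(variance)


# Toy allele frequencies (invented): pop_d drifts together with x, pop_c does not.
outgroup = [0.5] * 10
x = [0.1, 0.9, 0.2, 0.8, 0.3, 0.7, 0.4, 0.6, 0.15, 0.85]
pop_c = [0.5] * 10
pop_d = [0.2, 0.8, 0.3, 0.7, 0.4, 0.6, 0.45, 0.55, 0.25, 0.75]

print("f4 =", f4(outgroup, x, pop_c, pop_d))
print("Z  =", f4_z(outgroup, x, pop_c, pop_d))
```

In the excerpt's terms, pop_c plays the role of Formentera_MBA and pop_d that of Menorca_LBA: a positive value means X is attracted to the second population of the pair, and swapping the two flips the sign.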

This sample, like another one in El Argar, is of hg. R1b-P312. So there you are, the data that connects the Proto-Iberian expansion (replacing IE-speaking Bell Beakers) to the Iberian Chalcolithic population, signaled by the increase in Iberian Chalcolithic ancestry after the arrival of Bell Beakers, most likely connected originally to the Argaric and post-Argaric expansions during the MBA.

PCA with previously published ancient individuals (non-filled symbols), projected onto variation from present-day populations (gray squares).

Steppe in Sardinia IA: Phocaeans from Italy?

Most Sardinians buried in a Nuragic Bronze Age context possessed uniparental haplogroups found in European hunter-gatherers and early farmers, including Y-haplogroup R1b1a[xR1b1a1a] which is different from the characteristic R1b1a1a2a1a2 spread in association with the Bell Beaker complex. An exception is individual I10553 (1226-1056 calBCE) who carried Y-haplogroup J2b2a, previously observed in a Croatian Middle Bronze Age individual bearing Steppe ancestry, suggesting the possibility of genetic input from groups that arrived from the east after the spread of first farmers. This is consistent with the evidence of material culture exchange between Sardinians and mainland Mediterranean groups, although genome-wide analyses find no significant evidence of Steppe ancestry so the quantitative demographic impact was minimal.

Another interesting piece of data: these remnant (Mesolithic) R1b-V88 lineages are closely connected to the Italian Peninsula, the most likely region from which these lineages expanded into Africa, in turn possibly connected to the expansion of Proto-Afroasiatic.

We detect definitive evidence of Iranian-related ancestry in an Iron Age Sardinian I10366 (391-209 calBCE) with an estimate of 11.9 ± 3.7% Iran_Ganj_Dareh_Neolithic related ancestry, while rejecting the model with only Anatolian_Neolithic and WHG at p=0.0066 (Supplementary Table 9). The only model that we can fit for this individual using a pair of populations that are closer in time is as a mixture of Iberia_Chalcolithic (11.9 ± 3.2%) and Mycenaean (88.1 ± 3.2%) (p=0.067). This model fits even when including Nuragic Sardinians in the outgroups of the qpAdm analysis, which is consistent with the hypothesis that this individual had little if any ancestry from earlier Sardinians.
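
To make the "rejected" versus "fits" language concrete: in qpAdm a low p-value means the proposed mixture model is rejected, and a higher one means it cannot be rejected. A minimal sketch, assuming the conventional 0.05 cutoff (the excerpt does not state the threshold used) and using hypothetical names throughout, with the two models quoted above as input:

```python
# Assumed cutoff: the conventional 0.05, not a value stated in the excerpt.
ALPHA = 0.05

# (target, sources, p-value) as quoted in the excerpt for individual I10366.
models = [
    ("I10366", ("Anatolian_Neolithic", "WHG"), 0.0066),
    ("I10366", ("Iberia_Chalcolithic", "Mycenaean"), 0.067),
]


def passing(candidates, alpha=ALPHA):
    """Return the models whose p-value does not fall below the cutoff."""
    return [m for m in candidates if m[2] >= alpha]


for target, sources, p in models:
    verdict = "fits" if p >= ALPHA else "rejected"
    print(f"{target} as {' + '.join(sources)}: p={p} -> {verdict}")
```

Note that p=0.067 only narrowly clears the assumed cutoff, so the Iberia_Chalcolithic + Mycenaean model is better read as "not rejected" than as strongly supported.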

Proportions of ancestry using a distal qpAdm framework on an individual basis (a), and based on qpWave clusters

Sicily EBA: The Lusitanian/Ligurian connection?

(…) While a previously reported Bell Beaker culture-associated individual from Sicily had no evidence of Steppe ancestry, (…) we find evidence of Steppe ancestry in the Early Bronze Age by ~2200 BCE. In distal qpAdm, the outlier Sicily_EBA11443 is parsimoniously modeled as harboring 40.2 ± 3.5% Steppe ancestry, and the outlier Sicily_EBA8561 is parsimoniously modeled as harboring 23.3 ± 3.5% Steppe ancestry. (…) The presence of Steppe ancestry in Early Bronze Age Sicily is also evident in Y chromosome analysis, which reveals that 4 of the 5 Early Bronze Age males had Steppe-associated Y-haplogroup R1b1a1a2a1a2 (Online Table 1). Two of these were Y-haplogroup R1b1a1a2a1a2a1 (Z195) which today is largely restricted to Iberia and has been hypothesized to have originated there 2500-2000 BCE. This evidence of west-to-east gene flow from Iberia is also suggested by qpAdm modeling where the only parsimonious proximate source for the Steppe ancestry we found in the main Sicily_EBA cluster is Iberians.

What’s this? An ancestral connection between Sicel Elymian and Galaico-Lusitanian or Ligurian (based on an origin in NE Iberia)? Impossible to say, especially if the languages of these early settlers were replaced later by non-Indo-European speakers from the eastern Mediterranean, and by Indo-European speakers from the mainland closely related to Proto-Italic during the LBA, but see below.

Regarding the comment on R1b-Z195, it is associated with modern Iberians, as DF27 in general, due to founder effects beyond the Pyrenees. It is a very old subclade, split directly from DF27 roughly at the same time as it split from the parent P312, i.e. it can be found anywhere in Europe, and it almost certainly accompanied the expansion of Celts from Central Europe under the subclade R1b-M167/SRY2627.

The connection is thus strong only because of the qpAdm modeling, since R1b-DF27 and subclade R1b-Z195 are certainly lineages expanded quite early, most likely with Yamna settlers in Hungary and East Bell Beakers.

In this case, if stemming from Iberia, it is most likely of subclade R1b-Z220 – or another Z195 (xM167) lineage – originally associated with the Old European substrate found in topo-hydronymy in Iberia, whose most likely remnants attested during the Iron Age were Lusitanians.

Left: Modern distribution of R1b-Z195 (YFull estimate 2700 BC); Right: Modern distribution of DF27. Both include later founder effects within Iberia, so the increase in the Basque country and the Crown of Aragon and the decrease in Portugal can safely be ignored. Contour maps of the derived allele frequencies of the SNPs analyzed in Solé-Morata et al. (2017).

We detect Iranian-related ancestry in Sicily by the Middle Bronze Age 1800-1500 BCE, consistent with the directional shift of these individuals toward Mycenaeans in PCA. Specifically, two of the Middle Bronze Age individuals can only be fit with models that in addition to Anatolia_Neolithic and WHG, include Iran_Ganj_Dareh_Neolithic. The most parsimonious model for Sicily_MBA3125 has 18.0 ± 3.6% Iranian-related ancestry (p=0.032 for rejecting the alternative model of Steppe rather than Iranian-related ancestry), and the most parsimonious model for Sicily_MBA has 14.9 ± 3.9% Iranian-related ancestry (p=0.037 for rejecting the alternative model).

The modern southern Italian Caucasus-related signal identified in Raveane et al. (2018) is plausibly related to the same Iranian-related spread of ancestry into Sicily that we observe in the Middle Bronze Age (and possibly the Early Bronze Age).

The non-Indo-European Sicanians and Elymians were possibly then connected to eastern Mediterranean groups before the expansion of the Sea Peoples.

For the Late Bronze Age group of individuals, qpAdm documented Steppe-related ancestry, modeling this group as 80.2 ± 1.8% Anatolia_Neolithic, 5.3 ± 1.6% WHG, and 14.5 ± 2.2% Yamnaya_Samara. Our modeling using sources more closely related in space and time also supports Sicily_LBA having Minoan-related ancestry or being derived from local preceding populations or individuals with ancestries similar to those of Sicily_EBA3123 (p=0.527), Sicily_MBA3124 (p=0.352), and Sicily_MBA3125 (p=0.095).

This increase in Steppe-related ancestry in a western site during the LBA most likely represents either an expansion from the Aegean or – maybe more likely, given the archaeological finds – a regional population similar to Sicily EBA re-emerging or rather being displaced from the eastern part of the island because of a westward movement from nearby Calabria.

Whether this population sampled spoke Indo-European or not at this time is questionable, since the Iron Age accounts show non-IE Elymians in this region.

Actually, Elymians seem to have spoken Indo-European, which fits well with the increase in steppe ancestry.

EDIT (21 MAR): Interesting about a proposed incoming Minoan-like ancestry is the potential origin of the Iran Neolithic-related ancestry that is going to appear in Central Italy during the LBA. This could then be potentially associated with Tyrsenians passing through the area, although the traditional description may be more compatible with an arrival of Sea Peoples from the Adriatic.

Sad to read this:

This manuscript is dedicated to the memory of Sebastiano Tusa of the Soprintendenza del Mare in Palermo, who would have been an author of this study had he not tragically died in the crash of Ethiopia Airlines flight 302 on March 10.

Cystic fibrosis probably spread with expanding Bell Beakers

New paper (behind paywall) Estimating the age of p.(Phe508del) with family studies of geographically distinct European populations and the early spread of cystic fibrosis, by Farrell et al., European Journal of Human Genetics (2018).

Interesting excerpts (emphasis mine):

Our results revealed tMRCA average values ranging from 4725 to 1175 years ago and support the estimates of Serre et al. (3000–6000 years ago) [11], rather than Morral et al. (52,000 years ago) [6], but the latter figure was challenged by Kaplan et al. [26] because of disagreement with assumptions used in their calculations. In addition, the tMRCA values from western European regions reported herein refine the results of Fichou et al. [7] from a study of Breton CF patients in which the Estiage analysis suggested that the most common recent ancestor lived 115 generations ago. That tMRCA value, however, may have underestimated the age of p.(Phe508del) in Brittany due to consideration of all the haplotypes, even those that were reconstructed with ambiguities, as well as a potential bias associated with consanguinity due to including both haplotypes in homozygous families. In the more stringent Estiage analyses reported herein, those potential biases were avoided for all populations, leading to estimates of the oldest tMRCA values corresponding to the Early Bronze Age in western Europe, which is generally agreed to begin around 3000 BCE. This finding extends our results from a direct investigation of aDNA in teeth from Iron Age burials near Vienna around 350 BCE and allows us to conclude that p.(Phe508del) was present in that region long before then. More specifically, in the Austrian families studied, the Estiage data revealed a mean tMRCA value of 3575 years ago, which converts to 1558 BCE (Middle Bronze Age) [22].
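
The "years ago" to BCE conversion in the excerpt is simple subtraction, and a short sketch makes the implied reference year explicit. The reference year of 2017 is an assumption inferred from the quoted pair (3575 years ago = 1558 BCE); the function name is hypothetical, and the arithmetic follows the excerpt in ignoring the missing year zero.

```python
# Assumption: reference year inferred from "3575 years ago ... 1558 BCE".
REFERENCE_YEAR = 2017


def years_ago_to_calendar(years_ago, reference_year=REFERENCE_YEAR):
    """Convert 'years ago' to a BCE/CE label by plain subtraction."""
    year = reference_year - years_ago
    if year > 0:
        return f"{year} CE"
    return f"{-year} BCE"


for age in (4725, 3575, 1175):
    print(age, "years ago ->", years_ago_to_calendar(age))
```

Under that assumption, the quoted range of average values (4725 to 1175 years ago) runs from 2708 BCE to 842 CE, and the Austrian mean of 3575 years ago reproduces the 1558 BCE given in the text.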

Perhaps most remarkably, the estimated ages of p.(Phe508del) in the three western European regions (France, Ireland, and Denmark) were similar with closely overlapping 95% CI values. This observation is also in line with previously documented spatial autocorrelograms expressing genetic and geographical distance for these populations [24]. Such data provide more insight about the ancient origin of CF in our judgment—both when and where—and lead us to propose that CFTR p.(Phe508del) is derived from ancestors who lived in western Europe during the Bronze Age, as early as 2700 BCE, and that its relatively rapid dissemination occurred because of human migrations around the northwestern Atlantic trading routes [21] and then towards central and eastern Europe [22]. Diffusion from northwestern to central Europe in approximately 1000 years is consistent with the prominent Bronze Age migrations evident in the archeological record [21, 22] and from genomic studies of aDNA [27]. On the other hand, we are assuming a discrete origin of the principal CF-causing variant, but it is possible that p.(Phe508del) arose more than once or earlier, and then reached western Europe subsequently through Neolithic migrations.

[About Bell Beakers] (…) More specifically, their distinctive Bell Beaker pottery appeared and spread across western and central Europe beginning around 3000–2750 BCE and then disappeared between 2200 and 1800 BCE [22, 29]. Their migrations are linked to the advent of western and central European metallurgy, as they manufactured and traded metal goods, especially weapons, while traveling over long distances [30]. Most relevant to our study is the evidence that they migrated in a direction and over a time period that fits well with the pattern of tMRCA data we found for the p.(Phe508del) variant. Olalde et al. [29] have shown that both migration and cultural transmission played a major role in diffusion of the “Beaker Complex” and led to a “profound demographic transformation” of Britain after 2400 BCE. Moreover, the cultural elements that unite the widely distributed Beaker folk are so obvious that some have considered them a distinct ethnicity of Bronze Age people [33].

From our results, we propose the novel concept that large scale, long term west-to-east migrations of the Bell Beaker Europeans [22, 28–30] during the Bronze Age, could explain the dissemination of p.(Phe508del) in Europe and its documented northwest-to-southeast gradient [4]. In fact, our tMRCA data show a temporal gradient also.

As you can see from the references, they consulted with Barry Cunliffe (or people accepting his theory), who is obsessed with Bell Beakers expanding Celtic languages from the British Isles. He is like the British equivalent of Danish scholar Kristian Kristiansen, and his obsession with Corded Ware = Indo-European (and Germanic = CWC Denmark), immutable no matter what genetic results might show.

The funny thing is, the interpretation of the paper is probably right. From what we can see in the data, it is quite possible that the disease spread with expanding Bell Beakers…only it spread from the East group in Hungary, i.e. from east to west. The regional difference in TMRCA and apparent west-east cline would point to the different expansions of affected lineages in the corresponding regions, and not to an origin in the British Isles.

Lazaridis’ evolutionary history of human populations in Europe

Preprint of a review by Iosif Lazaridis, The evolutionary history of human populations in Europe.

Interesting excerpts:

Steppe populations during the Eneolithic to Bronze Age were a mix of at least two elements[28], the EHG who lived in eastern Europe ~8kya and a southern population element related to present-day Armenians[28], and ancient Caucasus hunter-gatherers[22], and farmers from Iran[24]. Steppe migrants made a massive impact in Central and Northern Europe post-5kya[28,43]. Some of them expanded eastward, founding the Afanasievo culture[43] and also eventually reached India[24]. These expansions are probable vectors for the spread of Late Proto-Indo-European[44] languages from eastern Europe into both mainland Europe and parts of Asia, but the lack of steppe ancestry in the few known samples from Bronze Age Anatolia[45] raises the possibility that the steppe was not the ultimate origin of Proto-Indo-European (PIE), the common ancestral language of Anatolian speakers, Tocharians, and Late Proto-Indo Europeans. In the next few years this lingering mystery will be solved: either Anatolian speakers will be shown to possess steppe-related ancestry absent in earlier Anatolians (largely proving the steppe PIE hypothesis), or they will not (largely falsifying it, and pointing to a Near Eastern PIE homeland).

Our understanding of the spread of steppe ancestry into mainland Europe is becoming increasingly crisp. Samples from the Bell Beaker complex[46] are heterogeneous, with those from Iberia lacking steppe ancestry that was omnipresent in those from Central Europe, casting new light on the “pots vs. people” debate in archaeology, which argues that it is dangerous to propose a tight link between material culture and genetic origins. Nonetheless, it is also dangerous to dismiss it completely. Recent studies have shown that people associated with the Corded Ware culture in the Baltics[23,33] were genetically similar to those from Central Europe and to steppe pastoralists[28,43], and the people associated with the Bell Beaker culture in Britain traced ~90% of their ancestry to the continent, being highly similar to Bell Beaker populations there. Bell Beaker-associated individuals were bearers of steppe ancestry into the British Isles that was also present in Bronze Age Ireland[47], and Iron Age and Anglo-Saxon England[48]. The high genetic similarity between people from the British Isles and those of the continent makes it more difficult to trace migrations into the Isles. This high similarity masks a very detailed fine-scale population structure that has been revealed by study of present-day individuals[49]; a similar type of analysis applied to ancient DNA has the potential to reveal fine-grained population structure in ancient European populations as well.

Steppe ancestry did arrive into Iberia during the Bronze Age[50], but to a much lesser degree. A limited effect of steppe ancestry in Iberia is also shown by the study of mtDNA[51], which shows no detectible change during the Chalcolithic/Early Bronze Age[51], in contrast to central Europe[52]. Sex-biased gene flow has been implicated in the spread of steppe ancestry into Europe[33,53], although the presence and extent of such bias has been debated[54,55]. One aspect of the demographies of males and females was clearly different, as paternally-inherited Y-chromosome lineages experienced a bottleneck <10 kya which is not evident in maternally-inherited mtDNA[56], suggesting that many men living today trace their patrilineal ancestry to a relatively small number of men of the Neolithic and Bronze Ages.

Modified image, from the preprint. “A sketch of European evolutionary history based on ancient DNA. Bronze Age Europeans (~4.5-3kya) were a mixture of mainly two proximate sources of ancestry: (i) the Neolithic farmers of ~8-5kya who were themselves variable mixtures of farmers from Anatolia and hunter-gatherers of mainland Europe (WHG), and (ii) Bronze Age steppe migrants of ~5kya who were themselves a mixture of hunter-gatherers of eastern Europe (EHG) and southern populations from the Near East (…)”

Firstly, Tocharian (mentioned side by side with Anatolian and LPIE) has long been considered by linguists to be more archaic than the other branches, hence the linguistic proposal that it separated first (which corresponds beautifully with the expansion of Khvalynsk/Repin into Afanasevo); but it separated first from the common Late PIE trunk. Anatolian clearly separated earlier, from a Middle PIE stage.

Secondly, while Genomics could no doubt falsify the Balkan route for Anatolian, and make us come back to a Maykop route from the steppe (or even a Near Eastern PIE homeland, who knows), I doubt such falsification could come simply from sampled “Anatolian speakers”:

If there is no steppe ancestry in Anatolian speakers (of the 2nd millennium BC), a dismissal of the mainstream migration model could happen only when both potential routes of expansion, the selected cultures from the Balkans and the Caucasus, are sampled in the appropriate time period since the estimated separation (i.e. from the 5th millennium BC), until one of the two routes shows the right migration picture.

On the other hand, if some samples from either Romania/Bulgaria or the Caucasus (and/or Anatolian speakers) show steppe ancestry and/or R1b-M269 lineages, as is expected, then the matter won’t need much more explanation.

In fact, the text goes on to describe how male lineages experienced a bottleneck after ca. 8000 BC, i.e. accompanying Neolithisation (probably including the formation of Sredni Stog and early Khvalynsk, as is now becoming clear), when explaining how it is possible to demonstrate that East Bell Beaker migrants (of R1b-L23 lineages, it is to be understood) with little steppe ancestry reached Iberia.

This was already pointed out not long ago by David Reich, and I am glad to see more scholars showing the importance of taking phylogeography into account over statistical methods when assessing migrations, even if it is only used in those cases in which it does not disrupt previous interpretations too much, like that of the 2015 papers and the proposal of the ‘Yamnaya ancestral component’.

I found it refreshing that for the first time Corded Ware migrants – or, rather, their shared genetic relationship with Eneolithic steppe groups – were accepted (if only indirectly) as a confounding factor in assessing migrations of Bell Beakers. It is a step in the right direction, and it is a relief to read this from someone working with the Reich Lab.

Quite a few people (and not only amateurs) are still scratching their heads, trying to explain with the most imaginative (and unnecessary) novel migration routes the elevated steppe ancestry and closer relation (PCA cluster, FST, F3, etc.) to CWC and Yamna (evidently due to the absorbed CWC population) in some of the recently published Bell Beaker samples from Central Europe, the Netherlands, and later Great Britain, compared to samples from South-East Europe near the Middle to Upper Danube region, the obvious homeland of East Bell Beakers, formed from Yamna settlers.

I found it also interesting that Lazaridis mentioned a southern population element related to CHG and Iran farmers. This should help dissipate the hype that some have artificially created as of late over a potential Northern Iranian homeland based on a single paragraph from David Reich’s book.

EDIT (9 MAY 2018): Lazaridis posted an answer to my questioning of potential Proto-Anatolian origins, split across several tweets (I post a link to the first tweet, then the text in full):

The steppe hypothesis predicts some genetic input from eastern Europe (EHG) to Anatolia.

– Bronze Age Anatolians (Lazaridis et al. 2017) from historically IE-speaking Pisidia lack EHG; more samples obviously needed

Possibilities:

  1. Additional Anatolian samples will have EHG: consistent with steppe PIE
  2. Additional Anatolian samples will not have EHG, then either:
    1. Steppe not PIE homeland
    2. Steppe PIE homeland but linguistic impact in Anatolia vastly greater than genetic impact

Tentative steppe->Anatolia movements reach Balkans early (Mathieson et al. 2018) and Armenia (some EHG in Lazaridis et al. 2016).

But not the last leg to Anatolia_ChL (Lazaridis et al. 2016) or Anatolia_BA (Lazaridis et al. 2017).

  • If Anatolians consistently don’t have EHG, steppe PIE is very difficult to affirm; Near Eastern alternative likely (contributing CHG/Iran_N-related ancestry to both western Anatolia/steppe)
  • If Anatolians have EHG, one could further investigate by what route they got it.

One way or another PIE homeland problem is almost solved IMHO, which is what my review tries to get at in that short section
