North-West Indo-Europeans of Iberian Beaker descent and haplogroup R1b-P312

iron-age-early-mediterranean

The recent data on ancient DNA from Iberia published by Olalde et al. (2019) was interesting for many different reasons, but I still have the impression that the authors – and consequently many readers – focused on not-so-relevant information about more recent population movements, or even highlighted the least interesting details related to historical events.

I have already written about the relevance of its findings for the Indo-European question in an initial assessment, then in a more detailed post about its consequences, then about the arrival of Celtic languages with hg. R1b-M167, and later in combination with the latest hydrotoponymic research.

This post is thus a summary of its findings with the help of natural neighbour interpolation maps of the reported Germany_Beaker and France_Beaker ancestry for individual samples. Even though maps are not necessary, visualizing geographically the available data facilitates a direct comprehension of the most relevant information. What I considered key points of the paper are highlighted in bold, and enumerated.

NOTE. To get “more natural” maps, extrapolation for the whole Iberian Peninsula is obtained by interpolation through the use of external data from the British Isles, Central Europe, and Africa. This is obviously not ideal, but – lacking data from the corners of the Iberian Peninsula – this method gives a homogeneous look to all maps. Only data in direct line between labelled samples in each map is truly interpolated for the Iberian Peninsula, while the rest would work e.g. for a wider (and more simplistic) map of European Bronze Age ancestry components.

Chalcolithic

iberia-chalcolithic
Iberian Chalcolithic groups and expansion of the Proto-Beaker package. See full map.

The Proto-Beaker package may or may not have expanded into Central Europe with typical Iberia_Chalcolithic ancestry. A priori, it seems a rather cultural diffusion of traits stemming from west Iberia roughly ca. 2800 BC.

iberia-y-dna-map-chalcolithic
Map of Y-DNA haplogroups among Iberia Chalcolithic samples. See full map.

The situation during the Chalcolithic is only relevant for the Indo-European question insofar as it shows a homogeneous Iberia_Chalcolithic-like ancestry with typical Y-chromosome (and mtDNA) haplogroups of the Iberian Neolithic dominating over the whole Peninsula until about 2500 BC. This might represent an original Basque-Iberian community.

iberia-mtdna-map-chalcolithic
Map of mtDNA haplogroups among Iberia Chalcolithic samples. See full map.

Bell Beaker period

iberia-bell-beaker-period
Iberian Bell Beaker groups and potential routes of expansion. See full map.

The expansion of the Bell Beaker folk brought about a cultural and genetic change in all Europe, to the point where it has been rightfully considered by Mallory (2013) – the last one among many others before him – the vector of expansion of North-West Indo-European languages. Olalde et al. (2019) proved two main points in this regard, which were already hinted in Olalde et al. (2018):

(1) East Bell Beakers brought hg. R1b-L23 and Yamnaya ancestry to Iberia, ergo the Bell Beaker phenomenon was not a (mere) local development in Iberia, but involved the expansion of peoples tracing their ancestry to the Yamnaya culture who eventually replaced a great part of the local population.

iberia-ancestry-bell-beaker-germany_beaker
Natural neighbor interpolation of Germany_Beaker ancestry in Iberia during the Bell Beaker period (ca. 2600-2250 BC). See full map.

(2) Classical Bell Beakers have their closest source population in Germany Beakers, and they reject an origin close to Rhine Beakers (i.e. Beakers from the British Isles, the Netherlands, or northern France), ergo the Single Grave culture was not the origin of the Bell Beaker culture, either (see here).

iberia-y-dna-map-bell-beaker-period
Map of Y-DNA haplogroups among Iberian Bell Beaker samples. See full map.
iberia-mtdna-map-bell-beaker-period
Map of mtDNA haplogroups among Iberian Bell Beaker samples. See full map.

Early Bronze Age

iberia-early-bronze-age
Iberian Early Bronze Age groups and likely population and culture expansions. See full map.

Interestingly, the European Early Bronze Age in Iberia is still a period of adjustments before reaching the final equilibrium. Unlike the situation in the British Isles, where Bell Beakers brought about a swift population replacement, Iberia shows – like the Nordic Late Neolithic period – centuries of genomic balancing between Indo-European- and non-Indo-European-speaking peoples, as could be suggested by hydrotoponymic research alone.

(3) Palaeo-Indo-European-speaking Old Europeans occupied first the whole Iberian Peninsula, before the potential expansion of one or more non-Indo-European-speaking groups, which confirms the known relative chronology of hydrotoponymic layers of Iberia.

iberia-ancestry-early-bronze-age-germany_beaker
Natural neighbor interpolation of Germany_Beaker ancestry in Iberia during the Early Bronze Age period (ca. 2250-1750 BC). See full map.

This balancing is seen in terms of Germany_Beaker vs. Iberia_Chalcolithic ancestry, but also in terms of Y-chromosome haplogroups, with the most interesting late developments happening in southern Iberia, around the territory where El Argar eventually emerged in radical opposition to the Bell Beaker culture.

iberia-y-dna-map-early-bronze-age
Map of Y-DNA haplogroups among Iberia Early Bronze Age samples. See full map.

(4) Bell Beakers and descendants expanded under male-driven migrations, proper of the Indo-European patrilineal tradition, seen in Yamnaya and even earlier in Khvalynsk:

We obtained lower proportions of ancestry related to Germany_Beaker on the X-chromosome than on the autosomes (Table S14), although the Z-score for the differences between the estimates is 2.64, likely due to the large standard error associated to the mixture proportions in the X-chromosome.

germany-beaker-x-chromosome

iberia-mtdna-map-early-bronze-age
Map of mtDNA haplogroups among Iberia Early Bronze Age samples. See full map.

Regarding the PCA, Iberia Bronze Age samples occupy an intermediate cluster between Iberia Chalcolithic and Bell Beakers of steppe ancestry, with Yamnaya-rich samples from the north (Asturias, Burgos) representing the likely source Old European population whose languages survived well into the Roman Iron Age:

iberia-pca-bronze-age
PCA of ancient European samples. Marked and labelled are Bronze Age groups and relevant samples. See full image.

Middle Bronze Age

iberia-middle-bronze-age
Iberian Middle Bronze Age groups and likely population and culture expansions. See full map.

During the Middle Bronze Age, the equilibrium reached earlier is reversed, with a (likely non-Indo-European-speaking) Argaric sphere of influence expanding to the west and north featuring Iberia Chalcolithic and lesser amount of Germany_Beaker ancestry, present now in the whole Peninsula, although in varying degrees.

iberia-ancestry-middle-bronze-age-germany_beaker
Natural neighbor interpolation of Germany_Beaker ancestry in Iberia during the Middle Bronze Age period (ca. 1750-1250 BC). See full map.

All Iberian groups were probably already under a bottleneck of R1b-DF27 lineages, although it is likely that specific subclades differed among regions:

iberia-y-dna-map-middle-bronze-age
Map of Y-DNA haplogroups among Iberia Middle Bronze Age samples. See full map.
iberia-mtdna-map-middle-bronze-age
Map of mtDNA haplogroups among Iberia Middle Bronze Age samples. See full map.

Late Bronze Age

iberia-late-bronze-age
Iberian Late Bronze Age groups and likely population and culture expansions. See full map.

The Late Bronze Age represents the arrival of the Urnfield culture, which probably expanded with Celtic-speaking peoples. A Late Bronze Age transect before their genetic impact still shows a prevalent Germany_Beaker-like Steppe ancestry, probably peaking in north/west Iberia:

iberia-ancestry-late-bronze-age-germany_beaker
Natural neighbor interpolation of Germany_Beaker ancestry in Iberia during the Late Bronze Age period (ca. 1250-750 BC). See full map.

(5) Galaico-Lusitanians were descendants of Iberian Beakers of Germany_Beaker ancestry and hg. R1b-M269. Autosomal data of samples I7688 and I7687, of the Final Bronze (end of the reported 1200-700 BC period for the samples), from Gruta do Medronhal (Arrifana, Coimbra, Portugal) confirms this.

In the 1940s, human bones, metallic artifacts (n=37) and non-human bones were discovered in the natural cave of Medronhal (Arrifana, Coimbra). All these findings are currently housed in the Department of Life Sciences of the University of Coimbra and are analyzed by a multidisciplinary team. The artifacts suggest a date at the beginning of the 1st millennium BC, which is confirmed by radiocarbon date of a human fibula: 890–780 cal BCE (2650±40 BP, Beta–223996). This natural cave has several rooms and corridors with two entrances. No information is available about the context of the human remains. Nowadays these remains are housed mixed and correspond to a minimum number of 11 individuals, 5 adults and 6 non-adults.

In particular, sample I7687 shows hg. R1b-M269, with no available quality SNPs, positive or negative, under it (see full report). They represent thus another strong support of the North-West Indo-European expansion with Bell Beakers.

iberia-y-dna-map-late-bronze-age
Map of Y-DNA haplogroups among Iberian Late Bronze Age samples. See full map.
iberia-mtdna-map-late-bronze-age
Map of mtDNA haplogroups among Iberian Late Bronze Age samples. See full map.

NOTE. To understand how the region around Coimbra was (Proto-)Lusitanian – and not just Old European in general – until the expansion of the Turduli Oppidani, see any recent paper on Bronze Age expansion of warrior stelae, hydrotoponymy, anthroponymy, or theonymy (see e.g. about Spear-vocabulary).

Iron Age

iberia-iron-age-early
Iberian Pre-Roman Iron Age groups and likely population and culture expansions. See full map.

In a complex period of multiple population movements and language replacements, the temporal transect in Olalde et al. (2019) offers nevertheless relevant clues for the Pre-Roman Iron Age:

(6) The expansion of Celtic languages was associated with the spread of France_Beaker-like ancestry, most likely already with the LBA Urnfield culture, since a Tartessian and a Pre-Iberian samples (both dated ca. 700-500 BC) already show this admixture, in regions which some centuries earlier did not show it. Similarly, a BA sample from Álava ca. 910–840 BC doesn’t show it, and later Celtiberian samples from the same area (ca. 4th c. BC and later) show it, depicting a likely north-east to west/south-west routes of expansion of Celts.

iberia-ancestry-iron-age-france_beaker
Natural neighbor interpolation of France_Beaker ancestry in Iberia during the Pre-Roman Iron Age period (ca. 750-250 BC). See full map.

(7) The distribution of Germany_Beaker ancestry peaked, by the Iron Age, among Old Europeans from west Iberia, including Galaico-Lusitanians and probably also Astures and Cantabri, in line with what was expected before genetic research:

iberia-ancestry-iron-age-germany_beaker
Natural neighbor interpolation of Germany_Beaker ancestry in Iberia during the Pre-Roman Iron Age period (ca. 750-250 BC). See full map.

A probably more precise picture of the Final Bronze – Early Iron Age transition is obtained by including the Final Bronze samples I2469 from El Sotillo, Álava (ca. 910-875 BC) as Celtic ancestry buffer to the west, and the sample I3315 from Menorca (ca. 904-861 BC), lacking more recent ones from intermediate regions:

iberia-ancestry-ia-germany_beaker
Natural neighbor interpolation of Germany_Beaker ancestry in Iberia during the Final Bronze Age – Early Iron Age transition. See full map.
iberia-ancestry-ia-france_beaker
Natural neighbor interpolation of France_Beaker ancestry in Iberia during the Final Bronze Age – Early Iron Age transition. See full map.

In terms of Y-DNA and mtDNA haplogroups, the situation is difficult to evaluate without more samples and more reported subclades:

iberia-y-dna-map-iron-age
Map of Y-DNA haplogroups among Iberian Iron Age samples. See full map.
iberia-mtdna-map-iron-age
Map of mtDNA haplogroups among Iberian Iron Age samples. See full map.

In the PCA, Proto-Lusitanian samples occupy an intermediate cluster between Iberian Bronze Age and Bronze Age North (see above), including the Final Bronze sample from Álava, while Celtic-speaking peoples (including Pre-Iberians and Iberians of Celtic descent from north-east Iberia) show a similar position – albeit evidently unrelated – due to their more recent admixture between Iberian Bronze Age and Urnfield/Hallstatt from Central Europe:

iberia-pca-iron-age
PCA of ancient European samples. Marked and labelled are Iron Age groups and relevant samples. See full image.

(8) Iberian-speaking peoples in north-east Iberia represent a recent expansion of the language from the south, possibly accompanied by an increase in Iberia_Chalcolithic/Germany_Beaker admixture from east/south-east Iberia.

(9) Modern Basques represent a recent isolation + Y-DNA bottlenecks after the Roman Iron Age population movements, probably from Aquitanians migrating south of the Pyrenees, admixing with local peoples, and later becoming isolated during the Early Middle Ages and thereafter:

[Modern Basques] overlap genetically with Iron Age populations showing substantial levels of Steppe ancestry.

Assuming that France_Beaker ancestry is associated with the Urnfield culture (spreading with Celtic-speaking peoples), Vasconic speakers were possibly represented by some population – most likely from France – whose ancestry is close to Rhine Beakers (see here).

Alternatively, a Vasconic language could have survived in some France/Iberia_Chalcolithic-like population that got isolated north of the Pyrenees close to the Atlantic Façade during the Bronze Age, and who later admixed with Celtic-speaking peoples south of the Pyrenees, such as the Vascones, to the point where their true ancestry got diluted.

In any case, the clear Celtic Steppe-like admixture of modern Basques supports for the time being their recent arrival to Aquitaine before the proto-historical period, which is in line with hydrotoponymic research.

Conclusion

The most interesting aspects to discuss after the publication of Olalde et al. (2019) would have been thus the nature of controversial Palaeohispanic peoples for which there is not much linguistic data, such as:

  • the Astures and the Cantabri, usually considered Pre-Celtic Indo-European (see here);
  • the Vaccaei, usually considered Celtic;
  • the Vettones, traditionally viewed as sharing the same language as Lusitanians due to their apparent shared hydrotoponymic, anthroponymic, and/or theonymic layers, but today mostly viewed as having undergone Celticization and helped the westward expansion of Celtic languages (and archaeologically clearly divided from Old European hostile neighbours to the west by their characteristic verracos);
  • the Pellendones or the Carpetani, who were once considered Pre-Celtic Indo-Europeans, too;
  • the nature of Tartessian as Indo-European, or maybe even as “Celtic”, as defended by Koch;
  • or the potential remote connection of Basque and Iberian languages in a common trunk featuring Iberian/France_Chalcolithic ancestry (also including Palaeo-Sardo).
pre-roman-palaeohispanic-languages-peoples-iberia-300bc
Pre-Roman Palaeohispanic peoples ca. 300 BC. See full map. Image modified from the version at Wikipedia, a good example of how to disseminate the wrong ideas about Palaeohispanic languages.

Despite these interesting questions still open for discussion, the paper remarked something already known for a long time: that modern Basques had steppe ancestry and Y-DNA proper of the Yamnaya 5,000 years ago, and that Bell Beakers had brought this steppe ancestry and R1b-P312 lineages to Iberia. This common Basque-centric interpretation of Iberian prehistory is the consequence of a 19th-century tradition of obsessively imagining Vasconic-speaking peoples in their medieval territories extrapolated to Cro-Magnons and Atapuerca (no, really), inhabiting undisturbed for millennia a large territory encompassing the whole Iberia and France, “reduced” or “broken” only with the arrival of Celts just before the Roman conquests. A recursive idea of “linguistic autochthony” and “genetic purity” of the peoples of Iberia that has never had any scientific basis.

Similarly, this paper offered the Nth proof already in population genomics that traditional nativist claims for the origin of the Bell Beaker folk in Western Europe were wrong, both southern (nativist Iberian origin) and northern European (nativist Lower Rhine origin). Both options could be easily rejected with phylogeography since 2015, they were then rejected in Olalde et al. and Mathieson et al (2017), then again with the update of many samples in Olalde et al. (2018) and Mathieson et al (2018), and it has most clearly been rejected recently with data from Wang et al. (2018) and its Yamnaya Hungary samples. Findings from Olalde et al. (2019) are just another nail to coffins that should have been well buried by now.

Even David Anthony didn’t have any doubt in his latest model (2017) about the Carpathian Basin origin of North-West Indo-Europeans (see here), and his latest update to the Proto-Indo-European homeland question (2019) shows that he is convinced now about R1b bottlenecks and proper Pre-Yamnaya ancestry stemming from a time well before the Bell Beaker expansion. This won’t be the last setback to supporters of zombie theories: like the hypotheses of an Anatolian, Armenian, or OIT origin of the PIE homeland, other mythical ideas are so entrenched in nationalist and/or nativist tradition that many supporters will no doubt prefer them to die hard, under the most numerous and shameful rejections of endlessly remade reactionary models.

Related

European hydrotoponymy (II): Basques and Iberians after Lusitanians and “Ligurians”

bronze-age-languages-western-europe

The first layer in hydrotoponymy of Iberia is clearly Indo-European, in territories that were occupied by Indo-Europeans when Romans arrived, but also in most of those occupied by non-Indo-Europeans.

Among Indo-European peoples, the traditional paradigm – carried around in Wikipedia-like texts until our days – has been to classify their languages as “Pre-Celtic” despite the non-Celtic phonetics (especially the initial -p-), because the same toponyms appear in areas occupied by Celts (e.g. Parisii, Pictones, Pelendones, Palantia); or – even worse – just as “Celtic”, because of the famous -briga and related components. This was evidently not tenable at the end of the 20th century, and it is simply anachronistic today.

NOTE. Since Indo-Europeans and non-Indo-Europeans of Western Europe show strong Y-chromosome bottlenecks under R1b-P312 lineages, maps below show the evolution of cultural groups side by side with ADMIXTURE of ancient DNA samples instead. The map series on prehistorical migrations contains also Y-DNA and mtDNA maps.

Most excerpts below (emphasis mine) are translated from Spanish (see the original text here):

iberia-bell-beakers-steppe
Top Left: Arrival of Indo-European-speaking East Bell Beakers and likely disruption of the Basque-Iberian community (ca 2500 BC on). Top Right: corresponding (unsupervised) ADMIXTURE map of ancient DNA samples. Arrival of Central European ancestry (“Steppe ancestry”, roughly represented by the blue color), with other components still prevalent, roughly including Anatolia Neolithic (brown), WHG (red), and sporadically Northern African (violet). Notice the high proportion of Central European ancestry in central and north-western Iberia. See full maps including Y-DNA and mtDNA. Bottom: PCA of Bell Beaker and contemporaneous samples.

Palaeo-Indo-Europeans

While the non-Celtic Indo-European nature of Lusitanian is certain, the nature of the “Pre-Celtic” language spoken by peoples such as Cantabri, Astures, Pellendones, Carpetani and Vettones is still being discussed, due to the scarcity of material to work with.

Galaico-Lusitanian

From Hacia una definición del lusitano, by Vallejo (2013):

It is certain that the delimitation of the geographical area set by Tovar is still valid, basically determined by the known direct documents, that is, the traditionally accepted inscriptions (the classic ones of Lamas de Moledo, Arroyo de la Luz and Cabeço das Fráguas), in addition to the new ones from Arroyo and the recent one from Arronches, see Fig. 1), to which some others could be added: the new bilingual inscription from Viseu necessarily compels us to consider it as indigenous, because it contains terms that belong to the core of the language and not only onomastics (I refer to the nexus igo and the nicknames deibabor and deibobor). By virtue of this new incorporation, we can also consider other texts as indigenous, although they do not include a common lexicon (see Fig. 1, inscriptions 7 to 22), in the expectation that many Lusitanian scribes were consciously mixing two linguistic registers (code switching), one to refer to the deities (for which they frequently used indigenous inflection) and another for anthroponyms (always with Latin inflection).

iberia-early-bronze-age
Left: Early Bronze Age cultures in Iberia (in red, likely Indo-European groups; in green, likely non-Indo-European groups). Right: Unsupervised ADMIXTURE of ancient DNA samples. See full maps including Y-DNA and mtDNA.

Firstly, it is striking that this geographical profile drawn by the texts correspond almost exactly to the distribution of large series of anthroponyms and theonyms.* Among the abundant names of people we can highlight those with a large number of repetitions whose appearance is circumscribed to our region of study (see Fig. 2). Some of them are truly frequent and lack parallels on the outside, such as the stem Tanc / Tang- (of Tanginus) with no less than 130 attestations, or Tonc- / Tong- (of Tongius or Tongetamus) with 70. Others show also sufficiently representative figures as Camalus and Maelo (with 46 repetitions each), Celtius (with 29), Caturo or Sunua (with 23), Camira (with 22), Doquirus (with 20), Louesius (with 18), Al(l)ucquius (with 17) or Malge(i)nus (with 16). According to these quantities, it appears that these are not casual occurrences of names, taking into account that chance tends to be reduced to a minimum in the study of the Iberian Peninsula, since we can easily handle the entire peninsular corpus. In turn, Reue, Bandue, Nauiae and Crougiae are the theonyms that best represent the Lusitanian-Galician area, coinciding fundamentally (Figure 3) with the picture that anthroponymy and texts had drawn, although with less examples.

lusitanian-inscriptions-toponymy-anthroponymy-teonymy
Top left: Lusitanian (long and short) inscriptions; top right: Map of the distribution of statue-menhirs and south-western stelae, by Rodríguez-Corral (2014) [(1) stelae in Beira Alta and Tras-os-Montes (Portugal), and Orense (Galicia, Spain); (2) both in the same territory: northwestern statue-menhirs and southwestern stelae; (3) hybridization of both into the same material form (stela/stela-menhir from Pedra Alta)]; bottom left: Lusitanian teonymy; bottom right: Lusitanian anthroponymy.

* The other subdivision of the onomastics, toponymy, presents difficulty in the elaboration of series, by the few repetitions of segments, once the universal element -briga has been eliminated.

It is not only these groups of names and roots that help us define a large northwestern area, but, as I have had occasion to mention in other places, some onomastic data that share a similar distribution can also be added: the desinence -oi (with an assimilation in -oe / -ui) of theonymic dative singular, the ending -bo of dative plural, the presence of the noun-forming suffix -aiko-, in addition to other phonetic features such as the passage of e> ei in anthroponymy, the reduction ug> uo the step of w> b.

iberia-north-west-dna
Genetic isolation in modern north-western Iberia (northern Portugal / southern Galicia) is greater than in other Iberian regions, forming different ancestral clusters splitting before others (including Basques). Image from Bycroft et al. (2018). See explanatory video by Carracedo.

Astur-Cantabrian

From The concept of Onomastic Landscape: the case of the Astures, by Vallejo (2013):

(…) First of all, it seems that there is an independent onomastic area, which can be defined by a series of names and suffixes that are repeated there exclusively or predominantly. This area does not seem to correspond with what we know of the Lusitanian-Galician onomastics nor of the more coastal Asturian; it also differs from the Celtiberian area, with which it does not have features in common. In this way, and always in the conjectural terrain, we could find ourselves before an Indo-European non-Celtic language different from the Lusitanian language.

A peculiarity that will have to be investigated is the presence of an excessively wide border corridor, where the names of the southern Astures (Augustales) do not predominate, but neither those of the northern Astures (Transmontanos). Similarly, we will have to see the scope of the hypothesis that there might have been a language perhaps differentiated from that spoken in the Lusitanian, Galician or Celtiberian zones; the lower documentary richness of the Asturian zone of Transmontana makes it more difficult to guarantee that it is not the same linguistic area as the one we isolate among Asturian cities.

In any case, de Hoz, even taking into account the difficulty of an affirmation of this type, pointed out ambiguously that we could find ourselves in front of different languages. On the other hand, the absence of texts directly transmitted by this people leaves us without a definitive confirmation the argument that it is a linguistically differentiated region, but it does not invalidate it at all. These drawbacks require the suspension of the exact characterization of our area, awaiting advances in the field of epigraphy and methodology.

astur-cantabrian-toponymy

Non-Indo-Europeans

The following are mainly excerpts from Villar (2007, 2014):

villar-vascos
Lenguas, genes y culturas en la Prehistoria de Europa y Asia suroccidental (2007). Buy the ebook online (or the printed version, if available).

Basques

Anthroponymy

The information provided by place-names and hydronyms on the one hand and anthroponyms on the other is of undoubted historical value in both cases, but of different specific significance. Anthroponyms reflect the present situation at the moment when living people were using them. It is an aspect very sensitive to social changes of all kinds, reaching its highest level of instability when there is language change.

(…) the Pre-Roman anthroponymic inventory of the Basque Country and Navarre indicates that prior to the arrival of Romans the language spoken was Indo-European (reflected in the names used) in the territories of Caristii, Varduli and Autrigones, while in Vasconic territory (especially in the current Navarre) most of the speakers chose Iberian names. In the territories of the current Basque Country, only a negligible statistical proportion chose Basque names, whereas in Navarre it was a minority of the population. That’s how things were towards the 3rd century BC.

Hydro-Toponymy

Cities and rivers are not subject to the ephemeral life cycle of humans. Rivers have very long cycles that go far beyond the life time not only of individuals, but also of languages ​​and cultures. Cities are also generally very stable, although social circumstances occasionally cause one to be abandoned or destroyed, while new ones are created from time to time. That means that the names of rivers and cities are not subject to fashions or frequent change. Nor does a language change imply a renewal of the previous hydronymy and toponymy.

Speakers of the new languages ​​incorporated into a territory learn from the natives the hydronymic and toponymic system, producing what we call the “toponymic transmission”. (…) it requires a prolonged contact between the native population and the new occupants, which can only occur when the indigenous population is not annihilated quickly and radically.

iberia-middle-bronze-age
Top Left: Middle Bronze Age cultures in Iberia (in red, likely Indo-European groups; in green, likely non-Indo-European groups). Top Right: Unsupervised ADMIXTURE of ancient DNA samples. See full maps including Y-DNA and mtDNA. Bottom: Bottom: PCA of Bronze Age groups.

The ancient onomastic data of the Basque Country and Navarre can be summarized as follows:

  • Ancient hydronymy, the longest lasting onomastic component, is not Basque, but Indo-European in its entirety.
  • The old toponymy, which follows it in durability, is also Indo-European in its entirety, except Poampaelo (now Pamplona) and Oiarso (now Oyarzun).
  • And in anthroponymy, which reflects the language used at the time when those names were in use, is also massively Indo-European, although there are between 10-15% anthroponyms of Vasconic etymology.

(…) the existing data show that, while in Roman times in Hispania there were only a couple of place-names in the Pyrenean border and a dozen anthroponyms of Vasconic etymology, in Aquitaine there was an abundant antroponymy of that etymology.

iberia-late-bronze-age
Left: Late Bronze Age cultures in Iberia (in red, likely Indo-European groups; in green, likely non-Indo-European groups). Right: Unsupervised ADMIXTURE of ancient DNA samples. See full maps including Y-DNA and mtDNA.

This set of facts is most compatible with a hypothesis that postulated a late infiltration of this type of population from Aquitaine, which at the time of the Roman conquest had only reached to establish a bridgehead, consisting of a small population center in Navarre and Alto Aragón and nothing else, except some isolated individuals in the current provinces of Álava, Vizcaya and Guipúzcoa. The almost complete absence of old place-names of Vasconic etymology would be explained in this way: Vasconic speakers, recently arrived and still in small numbers, would not have had the possibility of altering in depth the toponymic heritage prior to their arrival, which was Indo-European.

The idea of ​​a late Vasconization of a part of those territories, in the High Middle Ages or late Antiquity, is not new. Already in the 1920s M. Gómez Moreno said about the modern Basque provinces, with the district of Estella in Navarra, that “personal nomenclature allows comparisons of definitive value, probative that there lived people of the Cantabrian-Asturian race [who for Gómez Moreno were Indo-European], without the slightest trace of perceptible Basqueness”. For him, the first Indo-European people to penetrate the peninsula would have been Ligurian, which evolved into Cantabrians, Asturians, Venetians, Lusitanians, Tormogi, Vacaeans, Autrigones, Caristii and Varduli.

iberia-early-iron-age
Top Left: Pre-Roman cultures in Iberia (in red/brown, Indo-European groups; in pink, Greek; in yellow, Phoenician; in green, likely non-Indo-European groups; Tartessian is disputed). Top Right: Unsupervised ADMIXTURE of ancient DNA samples. See full maps including Y-DNA and mtDNA. Bottom: PCA of Iron Age groups.

Aquitaine

If, as we said above, Basque speakers began to enter the Iberian Peninsula from the other side of the Pyrenees only from the Roman-Republican era, to intensify their presence in the following centuries we must assume that they were to the north of the Pyrenees already before those dates. And, indeed, the existence of this abundant Vasconic antroponymy shows that in the first centuries of our era – while Vasconic speakers in the Peninsula were very few in number, their population in Aquitaine was abundant.

In a provisional manner we can advance that [Aquitaine’s] hydronyms are also known in other places of Europe and easily compatible with Indo-European etymologies (Argantia, Aturis, Tarnes, Sigmanos); and among the place names there are also many that are compatible with non-Gallic Indo-European etymologies, or not necessarily Gallic (Curianum, Aquitania, Burdigala, Cadurci, Auscii, Eluii, Rutani, Cala- (gorris), Latusates, Cossion, Sicor, Oscidates, Vesuna, etc.).

In addition to those place names that we classify as generically Indo-European, there are not a few Celts (Lugdunum, Mediolanum, Noviomagos, Segodunon, Bituriges, Petrucorii, Pinpedunni), several Latins (Aquae Augustae, Convenae, ad Sextum, Augusta), and even some Celto-Latin hybrids (Augustonemeton, Augustoriton). On the other hand, there are hardly any names, neither serial nor not serial, that have a reasonable possibility of being explained by Vasconic etymology (Anderedon could be one of them).

Consequently, the onomastic question of Aquitaine is not compatible with the possibility that Vasconic is the “primordial element” there, either. On the contrary, it is compatible with the hypothesis that they arrived also late in Aquitaine, when hydro-toponymy was already established. They had to Vasconize all or part of the previous population, that turned to use to a large extent the Vasconic anthroponymy. But the previous toponymy remained and the Vasconization process was probably soon interrupted by Celticization first, and Romanization later.

aquitanian-tribes-vascones
Aquitani and neighbouring tribes around the Pyrenees, as described by the Romans (ca. 1st c. BC). The Basque language likely expanded south and west of the Pyrenees into Indo-European-speaking territories during the Roman period. The term ‘Vascones’ only became applied to Basque-speaking tribes in medieval times. Map modified from image by Sémhur at Wikipedia.

A prediction in genetics

This is how Francisco Villar and co-authors from the University of Salamanca saw what would happen with the genetic studies of modern Basques in 2007, based on the similarity with neighbouring Iberians and French, and the late intrusion of the language in its current territory:

Unfortunately, linguistics does not have the means to establish the moment of that arrival in terms of absolute chronology. In any case, this hypothesis is not incompatible with some peculiarities in the frequency of certain genes of the Basque-speaking population. Indeed, today we tend to attribute these peculiarities to the joint action of genetic drift and isolation; to which perhaps we could add a bottleneck in the Vasconic founding population that would one day settle in Aquitaine.

villar-indoeuropeos
Indoeuropeos, iberos, vascos y sus parientes (2014). Buy the ebook online (Or printed version, if available).

Also Villar, in 2014:

In the hypothesis that I propose, future speakers of Basque would have settled initially in Aquitaine, where there would have been an inevitable genetic diffusion with pre-existing [first stage] populations. On the other hand, Basque speakers from Aquitaine would have started to arrive to the Basque Country and Navarre only from Roman times (only a couple of Vasconic toponyms, at least one of them of recent creation; scarce anthroponyms of Vasconic etymology). The part of those populations that mixed with the pre-existing Palaeo-Indo-Europeans (Indo-European names of rivers; general Indo-European toponymy) saw how the uniqueness of their haplogroups, if there was any, was diluted, making it difficult to distinguish from the general [Indo-European] background; being a minority, it could had been even lost as a result of adverse genetic drift.

Olalde et al. (2019) confirmed this hypothesis that modern Basques are quite similar to investigated Iron Age Indo-Europeans from Iberia (such as Celtiberians sampled from the Basque Country):

For the Iron Age, we document a consistent trend of increased ancestry related to Northern and Central European populations with respect to the preceding Bronze Age. The increase was 10 to 19% (95% confidence intervals given here and in the percentages that follow) in 15 individuals along the Mediterranean coast where non-Indo-European Iberian languages were spoken; 11 to 31% in two individuals at the Tartessian site of La Angorrilla in the southwest with uncertain language attribution; and 28 to 43% in three individuals at La Hoya in the north where Indo-European Celtiberian languages were likely spoken. This trend documents gene flow into Iberia during the Late Bronze Age or Early Iron Age, possibly associated with the introduction of the Urnfield tradition.

Modern Basques show therefore, paradoxically, an ancestry similar to recent Iron Age Indo-European invaders (quite likely the ancestors of Celtiberians), which confirms the hypothesis of bottlenecks/founder effects followed by a very recent isolation of its population:

(…) the genetic profile of present-day Basques who speak the only non-Indo-European language in Western Europe [] overlap genetically with Iron Age populations showing substantial levels of Steppe ancestry.

iberia-roman-period
Left: Roman period in Iberia. Right: Unsupervised ADMIXTURE of ancient DNA samples. See full maps including Y-DNA and mtDNA. Notice increase of steppe ancestry in the north, associated with the (Late Bronze Age / Early Iron Age) arrival of Central Europeans.

Iberians

Regarding the Iberian language, the circumstances of analysis are less favorable. However, we can observe in the ancient toponymy of typically Iberian areas (the Spanish Levant and Catalonia) a considerable proportion of toponymy of Indo-European etymology, often identical to that which F. Villar (2000) has called “Southern-Iberian-Pyrenean”. In fact, its presence in the Levant is nothing else but a continuation from Catalonia to the South along the Mediterranean coast. Here are some examples: Caluba, Sorobis, Uduba, Lesuros, Urce / Urci, Turbula, Arsi / Arse, Asterum, Cartalias, Castellona, ​​Lassira, Lucentum, Saguntum, Trete, Calpe, Lacetani, Onusa, Palantia, Saetabis, Saetabicula, Sarna , Segestica, Sicana, Turia, Turicae, Turis.

Compatible with the Indo-European etymology can also be Blanda, Sebelacum, Sucro, Tader, Sigarra, Mastia, Contestania, Liria, Lauro, Indibilis, Herna, Edeta, Dertosa, Cesetania, Cossetani, Celeret, Bernaba, Biscargis, (…)

Finally, in other place names there are Indo-European components in hybrid toponymic syntagms, such as:

  1. orc- / urc-: Orceiabar, Urcarailur, Urceatin, Urcebas, Urcecere, Urcescer, Urceticer.
  2. Il-: Iltukoite, Iluro (3), Ilurci, Ilorci, Ilurcis, Ilucia, Iliturgi, Ilarcurris, Iluberitani, etc.

il-iberian

Examples like these show that in Catalonia and the Spanish Levant the Iberian language is not the deepest identifiable substrate language, but that it took root there when there was previously an Indo-European language that had created a considerable network of toponyms and hydronyms that we can recognize, and over which Iberians settled as a superstrate. The pre-existence of an Indo-European language in the historically Iberian area is further corroborated by the fact that its ancient hydronyms are all Indo-European, with the exception of a single river that has a name that is supposed to be Iberian: the Iberus (Ebro), of which obviously the country and its inhabitants took their name. No doubt ib- was an appellation for river, so that in the language that created that hydronym the Iber should have simply been “the river”. But we will see in the body of this work that ib- is in various places outside the Iberian Peninsula as an appellation for «river», which will force us to rethink its supposed Iberian affiliation. In fact, the Iberus had another name, Elaisos, whose etymology is compatible with Indo-European. As we know with certainty that after Iberians no other Indo-European peoples came to their territory before the Romans, the Indo-European creators of that hydronymy have had to be there before the Iberians. And its antiquity must be considerable because, as we have already said, the vast majority of its hydronyms (Alebus, Caluba, Lesuros, Palantia, Saetabis, Sigarra, Sucro, Tader, Turia and Uduba, Elaisos) belong to that anonymous Indo-European language that didn’t leave written texts or had historical continuity.

inscriptions-celtiberians-iberians-hispania
Inscriptions in Iberia ca. 2nd–1st c. BC. Purple squares show Celtiberian inscriptions, blue circles show Iberian inscriptions. Image modified from Hesperia – Banco de datos de lenguas paleohispánicas.

Villar (2014):

Not always that a language is settled in a territory is it able to eradicate the existing ones definitively. Even a political system as unitary and unifying as the Roman was not able to eradicate the Basque language. And nowadays in Latin America, despite the crushing cultural dominance of Spanish, despite the means for the schooling of a modern society, in spite of the media, a multitude of pre-Columbian languages ​​are spoken that coexist with the language of culture, the only one that is written in those countries. In those situations, which can be prolonged for quite a lot of time, there are individuals who only speak the language newly imposed, others who speak only the language that has resisted disappearing, and others who speak both, in a broad framework of bilingualism. My proposal is that something similar to that must have happened in the Iberian territory when the Romans arrived: A language of culture, Iberian, diversified into more or less distant local dialects, coexisted with several previous languages, equally differentiated from the dialectal point of view. This explains the irruption in the Iberian texts of non-Iberian anthroponyms and, above all, the existence there of a Palaeo-Indo-European hydro-toponymy that had remained in use not only because it was transmitted to Iberian speakers, but also because its native users were still present.

Related

European hydrotoponymy (I): Old European substrate and its relative chronology

old-european-hydronymy-toponymy

These first two posts on Old European hydro-toponymy contain excerpts mainly from Indoeuropeos, iberos, vascos y sus parientes, by Francisco Villar, Universidad de Salmanca (2014), but also from materials of Lenguas, genes y culturas en la Prehistoria de Europa y Asia suroccidental, by Villar et al. Universidad de Salamanca (2007). I can’t recommend both books hardly enough for anyone interested in the history of Pre-Roman peoples in Iberia and Western Europe.

NOTE. Both books also contain detailed information on hydrotoponymy of other regions, like Northern Europe, the Aegean and the Middle East, with some information about Asia, apart from (outdated) genetic data, but their main aim is obviously the Prehistory of Iberia and neighbouring regions like France, Italy, or Northern Africa.

Here are only some excerpts (emphasis mine), translated from Spanish (see the original texts here), accompanied by images from both books.

villar-indoeuropeos
Indoeuropeos, iberos, vascos y sus parientes (2014). Buy the ebook online (Or printed version, if available).

Alteuropäisch and Krahe

The investigation of “Old European” or Alteuropäisch, popularized by Krahe, began precisely with the study of some toponyms and personal names spread all over Europe, previously considered “Ligurian” (by H. d’Arbois de Jubainville and C. Jullian) or “Illyrian” (by J. Pokorny), with which those linguistic groups – in turn badly known – were given an excessive extension, based only on some lexical coincidences.

This is a comment made by the author about Krahe‘s data and his opinions, frequently used against his compiled data, which I find paradoxically applicable to Villar’s data and his tentative assignment of the relative linguistic chronology to an absolute one – including the expansion of a “Mesolithic” Indo-European vs. a “Neolithic” Basque / Iberian vs. a Bronze Age Celtic – when it is now clear that the sequence of events was much later than that:

It is very widespread today a derogatory and globally disqualifying attitude to everything that sounds like Alteuropäisch and Krahe, sometimes without the necessary discrimination between different hypotheses, or even between data and hypothesis. It is not fair that the version of H. Krahe and that of W. P. Schmid be disqualified in a single simplistic judgment as if they were the same thing. But it is a major mistake to reduce the value of the hydro-toponymic data of Europe by the mere fact that Krahe attributed an implausible historical explanation to them. The data are real and still need an adequate explanation within a real historical framework, despite the unfeasibility of Krahe’s explanation.

With that we reach a point that I want to highlight. Among those who are allergic to anything that involves deviating one iota of the Indo-European paradigm as a single event, an attitude gaining momentum considers that hydro-toponymy was introduced in the different regions of Europe and Southeast Asia by the same Indo-European languages ​​that appear historically occupying their territory. H. Krahe had argued strongly against this possibility, so now I will save myself a deeper refutation and I will limit myself to pointing out some difficulties that position is forced to face.

salo-salano
Sala, Sala, Sala, Sala, Sala, Sala, Sala, Sala, Sala, Sala, Sala, Salaca/Salis, Salaceni,
Salacia, Salacia, Salaeni, Salam, Salandona, Salangi, Salangi , Salaniana, Sãlantas,
Salapa, Salapeni, Salaphitanum, Salapia / Salpia / Salapina palus / Salpe, Salar, Salara, Salarama,
Salarbima, Salariga, Salars, Salas, Salat, Salauris, Salcitani, Sale, Sale, Sale, Sale
stagnum, Salecon, Saleia, Salentina, Salentini, Salernum, Salerni, Sales, Sali, Salia, Salia,
Salica, Salica, Salice, Salii, Salija, Salinẽlis, Salìnis, Salìnis, Salìnis, Salìnis, Salinsae, Salionca,
Salius, Salō, Salō, Saloca, Salodurum, Salona, Salonae, Salonenica, Salonia, Saloniana,
Salonime, Salonium, Salontia, Saluca, Salum, Salum, Salunatasi, Saluntum / Salluntum,
Salùpis, Sãlupis, Salur, Salurnis, Selepitani, Sõlis.

The defenders of that alternative have to assume that the process of dialectalization, that before the migrations from the Urheimat was separating into the different Indo-European branches, affected each of them in the phonetic aspect in the general naming vocabulary, but left them unaltered in its phonetic predialectal state with regards to hydro-toponymy, as well as a good part of the naming lexicon related to the concepts of “river, water” and the different qualities of water currents. For example, according to those sharing that opinion, the Hispanic Palantia of the area of Vaccei would be in fact Celtic, but in that name the loss of the initial /p/ that characterizes Celtic would not have been applicable. Similarly, the hydro-toponymy in Germania is largely exempt from the Lautverschiebung, in Greece the loss of initial /s/, etc. These names not only fail to suffer the dialectal innovations corresponding to their zones, but sometimes they present innovations different from the features of the dialect involved. For example the word *mori “sea, standing water” is sometimes found in the hydro-toponymy of Gaul in the form *mari instead of *mori proper of Celtic (Marantium, Marisanga, Marsus), which in the framework of the paradigm has to be inevitably interpreted as a non-Celtic innovation.

wako-wogo
Potential geographic relationship between a priori unrelated graphic-phonetic variants.

Names of this nature that appear in areas where a pre-Roman historical Indo-European language never existed remain unexplained, such as in North Africa, Arabia Felix or the Caucasus: Lake Pallantias in Libya; the Salat River in Mauritania Tingitana; Auso in Mauritania Caesariensis; the Alonta River in Georgia; the Abas River in Caucasian Albania; Salma and Salapeni in Arabia Felix; etc. Of course, for these cases it is always possible to deny any relationship of kinship between these forms and their European cognates, and attribute everything to the chance of random homophonies. Thus, once again, the annoying comparative data are sacrificed in the sacred altar of the paradigm, despite the fact that they are so numerous and consistent that if there were no blind faith in the current dogma, they would be sufficient to articulate a new paradigm over them.

The choice of each Indo-Europeanist between the non-Indo-European and the Indo-European interpretation to explain the prehistoric toponymy of Europe is not motivated by the fact that they manage partial sets of hydronyms that are more propitious alternatively for the one or the other option. On the contrary, frequently the same batch of materials is claimed by both trends as its own. An extreme example is that of Th. Vennemann, who considers simply as non-Indo-European (specifically Paleo-Basque) exactly the same material that H. Krahe used to support his Indo-European interpretation. Thus, the structure and linguistic characteristics of the studied material have little role in the choice of one or the other path, which is rather conditioned by convictions and adhesion to a varied range of personal beliefs, traditional dogmas and scientific paradigms.

villar-vascos
Lenguas, genes y culturas en la Prehistoria de Europa y Asia suroccidental (2007). Buy the ebook online (or the printed version, if available).

The linguistic column

The sequence of languages ​​that were successively spoken in any territory constitutes what by analogy [with the “geological column”] we could call its “ethno-linguistic column”.

Next I offer the list of the languages ​​detected in the compositional (and to a lesser extent derivational) toponymic syntagms in which the appellatives ub-, up-, ab-, ap-, ur-, il-, igi, tuk, -ip – analyzed in this work – are involved.

From the interaction of the different strata in words and hybrid syntagms we can, therefore, establish the linguistic column in the Iberian Peninsula and its neighboring territories (Western Europe and Northern Africa) with the following sequence:

1. A first stratum of very old chronology, which in a previous publication I have proposed to call Palaeo-Indo-European [“arqueo-indoeuropeo”]. The toponymic elements belonging to this stratum dealt with throughout this text are abundant: kerso-, turso-, alawo-, lako-, mido-, silo-, tibo-, etc.

They always function as determinant toponyms of a place-name in any other language. It never uses the name “city” (or “river”) in hybrid syntagms. Their place names (determinants) are combined with names of the following languages:

   a) Iberian in Iberia or Southern France: kiŕś-iltiŕ, tuŕś-iltiŕ, alaun-iltiŕte, lakunm ∙ -iltiŕte.

   b) The language of the igi in southern Iberia and perhaps Northern Africa: Cantigi, Saltigi, Sagigi, Sicingi.

   c) The southern language of the postponed -il: Mid-ili, Sil-ili, Tib-ili.

   d) The language of the postponed -ip: Lac-ipo, Ost-ipo, Vent-ipo.

   e) Celtic in Gaul: kerso-ialos > Cersolius > Cerseuil; Ibili-duros > Ibliodurus.

karo-karanto
Cariensi, Carantium, Carandonis, Carae, Caraca / Caracca, Carrinensis, Cariaca, Carneus, Carula, Carlae, Carieco, Cariocieco, Caricillum, Carona, Carnona, Caranta, Carantonus / Carantana, Caronte, Carantomum / Carantomium, Carronenses / Garronenses, Cares / Carus, Caranusca, Carona, Caro vicus, Carninia, Carus, Carnutes, Carnonis castrum, Carenses, Caralis / Carallis, Carni, Carnicum, Caraceni, Careia, Carici, Carant / Carrant, Carnonacae, Carontō, Cariolum, Caritani, Carinum, Carantani, Carnuntum, Cariniana Vallis, Cariones, Careotae, Caroia, Caria, Careum, Carnae, Caran, Carnasium, Carnus, Carneates, Carnium, Carenus, Karlasuwa, Carnias, Karahna, Karna, Cariuntis, Kariuna, Careotis, Karu, Caralitis, Carus, Carnasso, Cares, Carene, Caranum, Caria, Carina, Carura, Caralis, Coralis, Carana, Carnalis, Carinum, Carnus, Carium, Carnium, Carnus Carnuntus / Carnusii, Chariuntas, Carandra, Carna, Carana, Carine, Cariatae, Caralae, Carura, Carei, Carura, Caricum, Caranis, Caralia, Carustum, Carystus, Carastasei.

This first Palaeo-Indo-European layer also corresponds to:

Several Palaeo-Indo-European varieties that have ab-, ap-, ub-, up- as a name for «river». To them belong also numerous place names (balsa-, siko-, wol-, etc.) that act as first members composed in both monoglotic and hybrid syntagmas.

Palaeo-Indo-European varieties in which ur- is the name “river”.

ab-hydronyms

2. The second stratum in decreasing order of antiquity is formed by the language of the place name igi “city”, although its presence is only verified with certainty in Iberia (especially in the south) and Northern Africa:

   a) It sets the igi name in compounds with Palaeo-Indo-European toponyms as in Salt-, Ast-, Olont-, Cant-, Aur- (Hispania) and Sagigi, Sicingi (Northern Africa).

   b) It works as the first place-name of the compound when the second is il: Igilium, Igilgili, Singili.

3. The third stratum is the language of the name il “city”:

   a) It puts the nickname il as determined in hybrid syntagms with Palaeo-Indo-European determinants: Mid-ili, Sil-ili, Tib-ili.

   b) It puts the nickname il as determined in hybrid syntagms with determinant toponyms igi: Igilium, Igilgili, Singili.

   c) It puts the place names (determinants) in front of the name (determined) of the language -ip (Il-ipa, Il-ipula and Il-ipla).

il-toponyms

4. Fourth is the language of the name ip- “city”, which puts the name (determined) in syntagms with:

   a) Palaeo-Indo-European toponym (determinant): Lac-ipo, Ost-ipo, Vent-ipo.

   b) Toponym (determinant) il: Ilipa.

   c) Second generation hybrid toponym of Palaeo-Indo-European + il: Balsilippa.

   d) In the Balsilippa and Sicilippa conglomerates, the three strata appear in the expected sequence: Palaeo-Indo-European + il + ip.

ip-toponyms

5. In the fifth place of the sequence is the language of the tuk-:

   a) It puts the name tuk- in compounds in which the place-name is a Palaeo-Indo-European element: Acatucci (see Aduatuci in Germania).

   b) It puts the name tuk- “height, top” in compounds in which the place-name is an ip- fossilized as place-names: Iptuci, etc.

   c) On at least one occasion an ip-fossilized syntagm acts as a toponym opposite a Celtic name: Itucodon (<Iptuco-dunum).

NOTE. Even though Villar talks about this stratum -tuk in Germania (Aduatukus) and the British Isles (Itucodon), only one case is found in each territory.

tuk-variants

6. The last place is occupied by Celtic:

   a) In Itucodon it puts the name (dunum) in front of a complex toponym of two previous strata, ip- + tuk-; and in Iliodurus it gives the name duro- in front of an equally complex Ibliodurus (<Ibili + duro).

   b) In bilbiliz it puts the casual morpheme in a fossilized bi-member toponym of a previous stratum, one of whose components is il-: Bilbil-iz.

linguistica-cronologia-hispania
[First column modified to include relative instead of absolute chronology]

A hard change of paradigm

More effort did it cost me to accept that ub- is a dialectal variant of a known Indo-European word for “water, river”, of which previously knew three others: ap-, ab-, up-. The obviousness of the phonetic correlation ap- / ab- // up- / ub- together with the semantic link with rivers, which can be verified above all outside of Spain, but is also present in our Peninsula, forced my resistance little by little. And with it fell the first trench of the dogma, unshakable until that moment, that everything in the Peninsula in the south was to be non-Indo-European.

ub-ob-hydronyms

Along with this serial component, many other isolated place names were revealed as very likely of Indo-European etymology, both in the “Iberian” East and in the “Tartessian” South. So the ubiquity of Indo-European throughout the Peninsula began to impose itself to me painfully. I say painfully because I lacked a paradigm in which to fit the new perspective that was making its way into my mind, which was therefore suspended in nothing, without any theoretical support, leaving me with a feeling that I was losing my footing. And for a time I was reluctant to accept the profound implications that all of this had entailed.

All il languages, in any of their locations, exhibit a compositional behavior in hybrid toponymic syntagms that place them all in an intermediate position between the clearly [first/second layer] strata, with place-names for their human settlements semantically derived from water realities (ur), and those clearly attributable to the [fifth layer] with appellations derived from settlements in heights (briga, dunum). But in that intermediate segment of the column there are three strata: 1) il, 2) ip-, 3) tuk-. In Andalusia there is an additional one: the igi stratum, of opaque semantics, which immediately precedes the il stratum.

or-ur-hydronyms
Hydronyms in -or-, -ur-.

To postulate that any of the toponymic strata of our column imply a new linguistic stratum, certain additional requirements will be necessary. One of them is that, in addition to the name in question, the languages ​​involved should share other features that could not have been lent, such as the very precise order of elements in the compounds Toponym + Name coexisting with Name + Adjective. Or the sharing of additional lexical elements that are not usually subject to loans, such as the semantically basic adjectives beri «new» and bels «black».

Unfortunately, the toponymic method, like the Comparative Method itself, does not have the capacity to establish precise absolute chronologies. (…)

Linguistic chronology

old-european-hydro-toponymy
Old European hydrotoponymy. Baltic data compensated. Statistical method Kriging.

In Europe (Hispania, South of France, Germania, British Isles, Baltic) the oldest stratum that can be identified is an indeterminable number of palaeo-varieties of the Indo-European macro-family, which do not have a direct local relationship with historical Indo-European languages, to the extent that we can verify. In fact, we have seen that stratigraphic signs lead us to consider the main Indo-European pre-Roman language of Hispania, the Celtic language, as a stratum after the il language, which in turn is later than the peninsular Indo-European palaeo-varieties.

In North Africa there is also a Palaeo-Indo-European stratum present. But there is also a very old non-Indo-European stratum whose identity I can not define through the material used. Nor has it been possible for me to establish relative antiquity of one and the other on African soil.

Another of the languages ​​involved, which has il- as an appellation for “city” in the Southwest of Hispania and North Africa, could have some kind of kinship relationship with Basque on the one hand and the Iberian language on the other, but the same indirect form that I have just pointed out for the Indo-European palaeo-varieties with respect to the historical Indo-European languages. Or in other words: the language(s) of the place-names referred to in this work would be palaeo-varieties of a linguistic family to which two known historical languages, Iberian and Basque, may have belonged, although we can’t establish a relation of direct affiliation neither between those two historical languages ​​among themselves, nor between any of them and the palaeo-varieties of the prehistoric toponymy.

linguistica-cronologia-africa
[First column modified to include relative instead of absolute chronology]

In general, Celtic does not have in its historical territories the onomastic behavior of an ancestral language, but that of an intrusive language, whose presence there is not only more recent than other Indo-European varieties, but also after that of various non-Indo-European strata, which are themselves ranked between the oldest detected (Palaeo-Indo-European) and the last of Pre-Romans, which is Celtic itself. If we only detected two strata, the Indo-European and the Celtic ones, we could discuss if it is possible that both are one and the same, so that what we define as “Celtic” is nothing other than the modern in situ evolution of Palaeo-Indo-European. But examples like those of kiŕśiltiŕ, kerso-ialos, Cirsa or Itucodon, among many others analyzed throughout this book, make it unlikely. And, in addition, the mediation of several strata in the column between the Palaeo-Indo-European language of Cirsa, as well as the greater antiquity of the ip- and tuk- languages ​​in Spanish, Gallic and British territory, defines the latter as a new and more recent layer than the aforementioned, which burst into its historical sites during the Iron Age.

Because Archaeology continues to deny the existence of population movements of a size worthy of consideration in the Iron Age, it is necessary to accept that the Indo-European Problem remains intact. It is understandable that before this aporia, many minds who are uncomfortable living with doubts, prefer to adopt a creed (the traditional, the Neolithic or the continuist) and expose it as a certainty to their students in the classrooms or their colleagues in conferences and publications. It’s not my case. For me, with Voltaire, “le doute est désagréable, mais la certitude est ridicule”. Or with Manzoni: “E men male l’agitarsi nel dubbio, che riposar nell’errore”.

Continue reading on European hydrotoponymy (II): Basques, Iberians, and Etruscans after Old Europeans.

Related

A Song of Sheep and Horses, revised edition, now available as printed books

cover-song-sheep-and-horses

As I said 6 months ago, 2019 is a tough year to write a blog, because this was going to be a complex regional election year and therefore a time of political promises, hence tenure offers too. Now the preliminary offers have been made, elections have passed, but the timing has slightly shifted toward 2020. So I may have the time, but not really any benefit of dedicating too much effort to the blog, and a lot of potential benefit of dedicating any time to evaluable scientific work.

On the other hand, I saw some potential benefit for publishing texts with ISBNs, hence the updates to the text and the preparation of these printed copies of the books, just in case. While Spain’s accreditation agency has some hard rules for becoming a tenured professor, especially for medical associates (whose years of professional experience are almost worthless compared to published peer-reviewed papers), it is quite flexible in assessing one’s merits.

However, regional and/or autonomous entities are not, and need an official identifier and preferably printed versions to evaluate publications, such as an ISBN for books. I took thus some time about a month ago to update the texts and supplementary materials, to publish a printed copy of the books with Amazon. The first copies have arrived, and they look good.

series-song-sheep-horses-cover

Corrections and Additions

Titles
I have changed the names and order of the books, as I intended for the first publication – as some of you may have noticed when the linguistic book was referred to as the third volume in some parts. In the first concept I just wanted to emphasize that the linguistic work had priority over the rest. Now the whole series and the linguistic volume don’t share the same name, and I hope this added clarity is for the better, despite the linguistic volume being the third one.

Uralic dialects
I have changed the nomenclature for Uralic dialects, as I said recently. I haven’t really modified anything deeper than that, because – unlike adding new information from population genomics – this would require for me to do a thorough research of the most recent publications of Uralic comparative grammar, and I just can’t begin with that right now.

Anyway, the use of terms like Finno-Ugric or Finno-Samic is as correct now for the reconstructed forms as it was before the change in nomenclature.

west-east-uralic-schema

Mediterranean
The most interesting recent genetic data has come from Iberia and the Mediterranean. Lacking direct data from the Italian Peninsula (and thus from the emergence of the Etruscan and Rhaetian ethnolinguistic community), it is becoming clearer how some quite early waves of Indo-Europeans and non-Indo-Europeans expanded and shrank – at least in West Iberia, West Mediterranean, and France.

Finno-Ugric
Some of the main updates to the text have been made to the sections on Finno-Ugric populations, because some interesting new genetic data (especially Y-DNA) have been published in the past months. This is especially true for Baltic Finns and for Ugric populations.

ananino-culture-new

Balto-Slavic
Consequently, and somehow unsurprisingly, the Balto-Slavic section has been affected by this; e.g. by the identification of Early Slavs likely with central-eastern populations dominated by (at least some subclades of) hg. I2a-L621 and E1b-V13.

Maps
I have updated some cultural borders in the prehistoric maps, and the maps with Y-DNA and mtDNA. I have also added one new version of the Early Bronze age map, to better reflect the most likely location of Indo-European languages in the Early European Bronze Age.

As those in software programming will understand, major changes in the files that are used for maps and graphics come with an increasing risk of additional errors, so I would not be surprised if some major ones would be found (I already spotted three of them). Feel free to communicate these errors in any way you see fit.

bronze-age-early-indo-european
European Early Bronze Age: tentative langage map based on linguistics, archaeology, and genetics.

SNPs
I have selected more conservative SNPs in certain controversial cases.

I have also deleted most SNP-related footnotes and replaced them with the marking of each individual tentative SNP, leaving only those footnotes that give important specific information, because:

  • My way of referencing tentative SNP authors did not make it clear which samples were tentative, if there were more than one.
  • It was probably not necessary to see four names repeated 100 times over.
  • Often I don’t really know if the person I have listed as author of the SNP call is the true author – unless I saw the full SNP data posted directly – or just someone who reposted the results.
  • Sometimes there are more than one author of SNPs for a certain sample, but I might have added just one for all.
ancient-dna-all
More than 6000 ancient DNA samples compiled to date.

For a centralized file to host the names of those responsible for the unofficial/tentative SNPs used in the text – and to correct them if necessary -, readers will be eventually able to use Phylogeographer‘s tool for ancient Y-DNA, for which they use (partly) the same data I compiled, adding Y-Full‘s nomenclature and references. You can see another map tool in ArcGIS.

NOTE. As I say in the text, if the final working map tool does not deliver the names, I will publish another supplementary table to the text, listing all tentative SNPs with their respective author(s).

If you are interested in ancient Y-DNA and you want to help develop comprehensive and precise maps of ancient Y-DNA and mtDNA haplogroups, you can contact Hunter Provyn at Phylogeographer.com. You can also find more about phylogeography projects at Iain McDonald’s website.

Graphics
I have also added more samples to both the “Asian” and the “European” PCAs, and to the ADMIXTURE analyses, too.

I previously used certain samples prepared by amateurs from BAM files (like Botai, Okunevo, or Hittites), and the results were obviously less than satisfactory – hence my criticism of the lack of publication of prepared files by the most famous labs, especially the Copenhagen group.

Fortunately for all of us, most published datasets are free, so we don’t have to reinvent the wheel. I criticized genetic labs for not releasing all data, so now it is time for praise, at least for one of them: thank you to all responsible at the Reich Lab for this great merged dataset, which includes samples from other labs.

NOTE. I would like to make my tiny contribution here, for beginners interested in working with these files, so I will update – whenever I have time – the “How To” sections of this blog for PCAs, PCA3d, and ADMIXTURE.

-iron-age-europe-romans
Detail of the PCA of European Iron Age populations. See full versions.

ADMIXTURE
For unsupervised ADMIXTURE in the maps, a K=5 is selected based on the CV, giving a kind of visual WHG : NWAN : CHG/IN : EHG : ENA, but with Steppe ancestry “in between”. Higher K gave worse CV, which I guess depends on the many ancient and modern samples selected (and on the fact that many samples are repeated from different sources in my files, because I did not have time to filter them all individually).

I found some interesting component shared by Central European populations in K=7 to K=9 (from CEU Bell Beakers to Denmark LN to Hungarian EBA to Iberia BA, in a sort of “CEU BBC ancestry” potentially related to North-West Indo-Europeans), but still, I prefer to go for a theoretically more correct visualization instead of cherry-picking the ‘best-looking’ results.

Since I made fun of the search for “Siberian ancestry” in coloured components in Tambets et al. 2018, I have to be consistent and preferred to avoid doing the same here…

qpAdm
In the first publication (in January) and subsequent minor revisions until March, I trusted analyses and ancestry estimates reported by amateurs in 2018, which I used for the text adding my own interpretations. Most of them have been refuted in papers from 2019, as you probably know if you have followed this blog (see very recent examples here, here, or here), compelling me to delete or change them again, and again, and again. I don’t have experience from previous years, although the current pattern must have been evidently repeated many times over, or else we would be still talking about such previous analyses as being confirmed today…

I wanted to be one step ahead of peer-reviewed publications in the books, but I prefer now to go for something safe in the book series, rather than having one potentially interesting prediction – which may or may not be right – and ten huge mistakes that I would have helped to endlessly redistribute among my readers (online and now in print) based on some cherry-picked pairwise comparisons. This is especially true when predictions of “Steppe“- and/or “Siberian“-related ancestry have been published, which, for some reason, seem to go horribly wrong most of the time.

I am sure whole books can be written about why and how this happened (and how this is going to keep happening), based on psychology and sociology, but the reasons are irrelevant, and that would be a futile effort; like writing books about glottochronology and its intermittent popularity due to misunderstood scientist trends. The most efficient way to deal with this problem is to avoid such information altogether, because – as you can see in the current revised text – they wouldn’t really add anything essential to the content of these books, anyway.

Continue reading

Official site of the book series:
A Song of Sheep and Horses: eurafrasia nostratica, eurasia indouralica

Ancient Sardinia hints at Mesolithic spread of R1b-V88, and Western EEF-related expansion of Vasconic

nuragic-sardinia-neolithic

New preprint Population history from the Neolithic to present on the Mediterranean island of Sardinia: An ancient DNA perspective, by Marcus et al. bioRxiv (2019)

Interesting excerpts (emphasis mine, edited for clarity):

On the high frequency of R1b-V88

Our genome-wide data allowed us to assign Y haplogroups for 25 ancient Sardinian individuals. More than half of them consist of R1b-V88 (n=10) or I2-M223 (n=7).

Francalacci et al. (2013) identi fied three major Sardinia-specifi c founder clades based on present-day variation within the haplogroups I2-M26, G2-L91 and R1b-V88, and here we found each of those broader haplogroups in at least one ancient Sardinian individual. Two major present-day Sardinian haplogroups, R1b-M269 and E-M215, are absent.

Compared to other Neolithic and present-day European populations, the number of identi fied R1b-V88 carriers is relatively high.

(…)ancient Sardinian mtDNA haplotypes belong almost exclusively to macro-haplogroups HV (n = 16), JT (n = 17) and U (n = 9), a composition broadly similar to other European Neolithic populations.

r1b-v88-europe
Geographic and temporal distribution of R1b-V88 Y-haplotypes in ancient European samples. We plot the geographic position of all ancient samples inferred to carry R1b-V88 equivalent markers. Dates are given as years BCE (means of calibrated 2s radio-carbon dates). Multiple V88 individuals with similar geographic positions are vertically stacked. We additionally color-code the status of the R1b-V88 subclade R1b-V2197, which is found in most present-day African R1b-V88 carriers.

On the origin of a Vasconic-like Paleosardo with the Western EEF

(…) the Neolithic (and also later) ancient Sardinian individuals sit between early Neolithic Iberian and later Copper Age Iberian populations, roughly on an axis that differentiates WHG and EEF populations and embedded in a cluster that additionally includes Neolithic British individuals. This result is also evident in terms of absolute genetic differentiation, with low pairwise FST ~ 0.005 +- 0.002 between Neolithic Sardinian individuals and Neolithic western mainland European populations. Pairwise outgroup-f3 analysis shows a very similar pattern, with the highest values of f3 (i.e. most shared drift) being with Neolithic and Copper Age Iberia, gradually dropping off for temporally and geographically distant populations.

In explicit admixture models (using qpAdm, see Methods) the southern French Neolithic individuals (France-N) are the most consistent with being a single source for Neolithic Sardinia (p ~ 0:074 to reject the model of one population being the direct source of the other); followed by other populations associated with the western Mediterranean Neolithic Cardial Ware expansion.

sardinians-ancient-eef
Principal Components Analysis based on the Human Origins dataset. A: Projection of ancient individuals’ genotypes onto principal component axes de fined by modern Western Eurasians (gray labels).

Pervasive Western Hunter-Gatherer ancestry in Iberian/French/Sardinian population

Similar to western European Neolithic and central European Late Neolithic populations, ancient Sardinian individuals are shifted towards WHG individuals in the top two PCs relative to early Neolithic Anatolians Admixture analysis using qpAdm infers that ancient Sardinian individuals harbour HG ancestry (~ 17%) that is higher than early Neolithic mainland populations (including Iberia, ~ 8%), but lower than Copper Age Iberians (~ 25%) and about the same as Southern French Middle-Neolithic individuals (~ 21%).

sardinia-modern-ancient-nuragic-pca
Principal Components Analysis based on the Human Origins dataset. B: Zoom into the region most relevant for Sardinian individuals.

Continuity from Sardinia Neolithic through the Nuragic

We found several lines of evidence supporting genetic continuity from the Sardinian Neolithic into the Bronze Age and Nuragic times. Importantly, we observed low genetic differentiation between ancient Sardinian individuals from various time periods.

A qpAdm analysis, which is based on simultaneously testing f-statistics with a number of outgroups and adjusts for correlations, cannot reject a model of Neolithic Sardinian individuals being a direct predecessor of Nuragic Sardinian individuals (…) Our qpAdm analysis further shows that the WHG ancestry proportion, in a model of admixture with Neolithic Anatolia, remains stable at ~17% throughout three ancient time-periods.

sardinians-modern-ancient-pca-admixture
Present-day genetic structure in Sardinia reanalyzed with aDNA. A: Scatter plot of the rst two principal components trained on 1577 present-day individuals with grand-parental ancestry from Sardinia. Each individual is labeled with a location if at least 3 of the 4 grandparents were born in the same geographical location (\small” three letter abbreviations); otherwise with \x” or if grand-parental ancestry is missing with \?”. We calculated median PC values for each Sardinian province (large abbreviations). We also projected each ancient Sardinian individual on to the top two PCs (gray points). B/C: We plot f-statistics that test for admixture of modern Sardinian individuals (grouped into provinces) when using Nuragic Sardinian individuals as one source population. Uncertainty ranges depict one standard error (calculated from block bootstrap). Karitiana are used in the f-statistic calculation as a proxy for ANE/Steppe ancestry (Patterson et al., 2012).

Steppe influx in Modern Sardinians

While contemporary Sardinian individuals show the highest affinity towards EEF-associated populations among all of the modern populations, they also display membership with other clusters (Fig. 5). In contrast to ancient Sardinian individuals, present-day Sardinian individuals carry a modest “Steppe-like” ancestry component (but generally less than continental present-day European populations), and an appreciable broadly “eastern Mediterranean” ancestry component (also inferred at a high fraction in other present-day Mediterranean populations, such as Sicily and Greece).

Related

Arrival of steppe ancestry with R1b-P312 in the Mediterranean: Balearic Islands, Sicily, and Iron Age Sardinia

steppe-balearic-sicily-sardinia

New preprint The Arrival of Steppe and Iranian Related Ancestry in the Islands of the Western Mediterranean by Fernandes, Mittnik, Olalde et al. bioRxiv (2019)

Interesting excerpts (emphasis in bold; modified for clarity):

Balearic Islands: The expansion of Iberian speakers

Mallorca_EBA dates to the earliest period of permanent occupation of the islands at around 2400 BCE. We parsimoniously modeled Mallorca_EBA as deriving 36.9 ± 4.2% of her ancestry from a source related to Yamnaya_Samara; (…). We next used qpAdm to identify “proximal” sources for Mallorca_EBA’s ancestry that are more closely related to this individual in space and time, and found that she can be modeled as a clade with the (small) subset of Iberian Bell Beaker culture associated individuals who carried Steppe-derived ancestry (p=0.442).

Suppl. Materials: The model used was with Bell_Beaker_Iberia_highsteppe, a group of outliers from Iberia buried in a Bell Beaker mortuary context who unlike most individuals from this context in that region had high proportions of Steppe ancestry (p=0.442).

Our estimates of Steppe ancestry in the two later Balearic Islands individuals are lower than the earlier one: 26.3 ± 5.1% for Formentera_MBA and 23.1 ± 3.6% for Menorca_LBA, but the Middle to Late Bronze Age Balearic individuals are not a clade relative to non-Balearic groups. Specifically, we find that f4(Mbuti.DG, X; Formentera_MBA, Menorca_LBA) is positive when X=Iberia_Chalcolithic (Z=2.6) or X=Sardinia_Nuragic_BA (Z=2.7). While it is tempting to interpret the latter statistic as suggesting a genetic link between peoples of the Talaiotic culture of the Balearic islands and the Nuragic culture of Sardinia, the attraction to Iberia_Chalcolithic is just as strong, and the mitochondrial haplogroup U5b1+16189+@16192 in Menorca_LBA is not observed in Sardinia_Nuragic_BA but is observed in multiple Iberia_Chalcolithic individuals. A possible explanation is that both the ancestors of Nuragic Sardinians and the ancestors of Talaiotic people from the Balearic Islands received gene flow from an unsampled Iberian Chalcolithic-related group (perhaps a mainland group affiliated to both) that did not contribute to Formentera_MBA.

This sample, like another one in El Argar, is of hg. R1b-P312. So there you are, the data that connects the Proto-Iberian expansion (replacing IE-speaking Bell Beakers) to the Iberian Chalcolithic population, signaled by the increase in Iberian Chalcolithic ancestry after the arrival of Bell Beakers, most likely connected originally to the Argaric and post-Argaric expansions during the MBA.

balearic-sicily-sardinia-pca
PCA with previously published ancient individuals (non-filled symbols), projected onto variation from present-day populations (gray squares).

Steppe in Sardinia IA: Phocaeans from Italy?

Most Sardinians buried in a Nuragic Bronze Age context possessed uniparental haplogroups found in European hunter-gatherers and early farmers, including Y-haplogroup R1b1a[xR1b1a1a] which is different from the characteristic R1b1a1a2a1a2 spread in association with the Bell Beaker complex. An exception is individual I10553 (1226-1056 calBCE) who carried Y-haplogroup J2b2a, previously observed in a Croatian Middle Bronze Age individual bearing Steppe ancestry, suggesting the possibility of genetic input from groups that arrived from the east after the spread of first farmers. This is consistent with the evidence of material culture exchange between Sardinians and mainland Mediterranean groups, although genome-wide analyses find no significant evidence of Steppe ancestry so the quantitative demographic impact was minimal.

Another interesting data, these (Mesolithic) remnant R1b-V88 lineages closely related to the Italian Peninsula, the most likely region of expansion of these lineages into Africa, in turn possibly connected to the expansion of Proto-Afroasiatic.

We detect definitive evidence of Iranian-related ancestry in an Iron Age Sardinian I10366 (391-209 calBCE) with an estimate of 11.9 ± 3.7.% Iran_Ganj_Dareh_Neolithic related ancestry, while rejecting the model with only Anatolian_Neolithic and WHG at p=0.0066 (Supplementary Table 9). The only model that we can fit for this individual using a pair of populations that are closer in time is as a mixture of Iberia_Chalcolithic (11.9 ± 3.2%) and Mycenaean (88.1 ± 3.2%) (p=0.067). This model fits even when including Nuragic Sardinians in the outgroups of the qpAdm analysis, which is consistent with the hypothesis that this individual had little if any ancestry from earlier Sardinians.

yamnaya-samara
Proportions of ancestry using a distal qpAdm framework on an individual basis (a), and based on qpWave clusters

Sicily EBA: The Lusitanian/Ligurian connection?

(…) While a previously reported Bell Beaker culture-associated individual from Sicily had no evidence of Steppe ancestry, (…) we find evidence of Steppe ancestry in the Early Bronze Age by ~2200 BCE. In distal qpAdm, the outlier Sicily_EBA11443 is parsimoniously modeled as harboring 40.2 ± 3.5% Steppe ancestry, and the outlier Sicily_EBA8561 is parsimoniously modeled as harboring 23.3 ± 3.5% Steppe ancestry. (…) The presence of Steppe ancestry in Early Bronze Age Sicily is also evident in Y chromosome analysis, which reveals that 4 of the 5 Early Bronze Age males had Steppe-associated Y-haplogroup R1b1a1a2a1a2. (Online Table 1). Two of these were Y-haplogroup R1b1a1a2a1a2a1 (Z195) which today is largely restricted to Iberia and has been hypothesized to have originated there 2500-2000 BCE. This evidence of west-to-east gene flow from Iberia is also suggested by qpAdm modeling where the only parsimonious proximate source for the Steppe ancestry we found in the main Sicily_EBA cluster is Iberians.

What’s this? An ancestral connection between Sicel Elymian and Galaico-Lusitanian or Ligurian (based on an origin in NE Iberia)? Impossible to say, especially if the languages of these early settlers were replaced later by non-Indo-European speakers from the eastern Mediterranean, and by Indo-European speakers from the mainland closely related to Proto-Italic during the LBA, but see below.

Regarding the comment on R1b-Z195, it is associated with modern Iberians, as DF27 in general, due to founder effects beyond the Pyrenees. It is a very old subclade, split directly from DF27 roughly at the same time as it split from the parent P312, i.e. it can be found anywhere in Europe, and it almost certainly accompanied the expansion of Celts from Central Europe under the subclade R1b-M167/SRY2627.

The connection is thus strong only because of the qpAdm modeling, since R1b-DF27 and subclade R1b-Z195 are certainly lineages expanded quite early, most likely with Yamna settlers in Hungary and East Bell Beakers.

In this case, if stemming from Iberia, it is most likely of subclade R1b-Z220 – or another Z195 (xM167) lineage – originally associated with the Old European substrate found in topo-hydronymy in Iberia, whose most likely remnants attested during the Iron Age were Lusitanians.

r1b-df27-z195
Left: Modern distribution of R1b-Z195 (YFull estimate 2700 BC); Right: Modern distribution of DF27. Both include later founder effects within Iberia, so the increase in the Basque country and the Crown of Aragon and the decrease in Portugal can safely be ignored. Contour maps of the derived allele frequencies of the SNPs analyzed in Solé-Morata et al. (2017).

We detect Iranian-related ancestry in Sicily by the Middle Bronze Age 1800-1500 BCE, consistent with the directional shift of these individuals toward Mycenaeans in PCA. Specifically, two of the Middle Bronze Age individuals can only be fit with models that in addition to Anatolia_Neolithic and WHG, include Iran_Ganj_Dareh_Neolithic. The most parsimonious model for Sicily_MBA3125 has 18.0 ± 3.6% Iranian-related ancestry (p=0.032 for rejecting the alternative model of Steppe rather than Iranian-related ancestry), and the most parsimonious model for Sicily_MBA has 14.9 ± 3.9% Iranian-related ancestry (p=0.037 for rejecting the alternative model).

The modern southern Italian Caucasus-related signal identified in Raveane et al. (2018) is plausibly related to the same Iranian-related spread of ancestry into Sicily that we observe in the Middle Bronze Age (and possibly the Early Bronze Age).

The non-Indo-European Sicanians and Elymians were possibly then connected to eastern Mediterranean groups before the expansion of the Sea Peoples.

For the Late Bronze Age group of individuals, qpAdm documented Steppe-related ancestry, modeling this group as 80.2 ± 1.8% Anatolia_Neolithic, 5.3 ± 1.6% WHG, and 14.5 ± 2.2% Yamnaya_Samara. Our modeling using sources more closely related in space and time also supports Sicily_LBA having Minoan-related ancestry or being derived from local preceding populations or individuals with ancestries similar to those of Sicily_EBA3123 (p=0.527), Sicily_MBA3124 (p=0.352), and Sicily_MBA3125 (p=0.095).

This increase in Steppe-related ancestry in a western site during the LBA most likely represents either an expansion from the Aegean or – maybe more likely, given the archaeological finds – a regional population similar to Sicily EBA re-emerging or rather being displaced from the eastern part of the island because of a westward movement from nearby Calabria.

Whether this population sampled spoke Indo-European or not at this time is questionable, since the Iron Age accounts show non-IE Elymians in this region.

Actually, Elymians seem to have spoken Indo-European, which fits well with the increase in steppe ancestry.

EDIT (21 MAR): Interesting about a proposed incoming Minoan-like ancestry is the potential origin of the Iran Neolithic-related ancestry that is going to appear in Central Italy during the LBA. This could then be potentially associated with Tyrsenians passing through the area, although the traditional description may be more more compatible with an arrival of Sea Peoples from the Adriatic.

Sad to read this:

This manuscript is dedicated to the memory of Sebastiano Tusa of the Soprintendenza del Mare in Palermo, who would have been an author of this study had he not tragically died in the crash of Ethiopia Airlines flight 302 on March 10.

Related

Aquitanians and Iberians of haplogroup R1b are exactly like Indo-Iranians and Balto-Slavs of haplogroup R1a

eba-indo-iranian-balto-slavs

The final paper on Indo-Iranian peoples, by Narasimhan and Patterson (see preprint), is soon to be published, according to the first author’s Twitter account.

One of the interesting details of the development of Bronze Age Iberian ethnolinguistic landscape was the making of Proto-Iberian and Proto-Basque communities, which we already knew were going to show R1b-P312 lineages, a haplogroup clearly associated during the Bell Beaker period with expanding North-West Indo-Europeans:

From the Bronze Age (~2200–900 BCE), we increase the available dataset from 7 to 60 individuals and show how ancestry from the Pontic-Caspian steppe (Steppe ancestry) appeared throughout Iberia in this period, albeit with less impact in the south. The earliest evidence is in 14 individuals dated to ~2500–2000 BCE who coexisted with local people without Steppe ancestry. These groups lived in close proximity and admixed to form the Bronze Age population after 2000 BCE with ~40% ancestry from incoming groups. Y-chromosome turnover was even more pronounced, as the lineages common in Copper Age Iberia (I2, G2, and H) were almost completely replaced by one lineage, R1b-M269.

iberia-admixture-y-dna
Proportion of ancestry derived from central European Beaker/Bronze Age populations in Iberians from the Middle Neolithic to the Iron Age (table S15). Colors indicate the Y-chromosome haplogroup for each male. Red lines represent period of admixture. Modified from Olalde et al. (2019).

The arrival of East Bell Beakers speaking Indo-European languages involved, nevertheless, the survival of the two non-IE communities isolated from each other – likely stemming from south-western France and south-eastern Iberia – thanks to a long-lasting process of migration and admixture. There are some common misconceptions about ancient languages in Iberia which may have caused some wrong interpretations of the data in the paper and elsewhere:

NOTE. A simple reading of Iberian prehistory would be enough to correct these. Two recent books on this subject are Villar’s Indoeuropeos, iberos, vascos y otros parientes and Vascos, celtas e indoeuropeos. Genes y lenguas.

Iberian languages were spoken at least in the Mediterranean and the south (ca. “1/3 of Iberia“) during the Bronze Age.

Nope, we only know the approximate location of Iberian culture and inscriptions from the Late Iron Age, and they occupy the south-eastern and eastern coastal areas, but before that it is unclear where they were spoken. In fact, it seems evident now that the arrival of Urnfield groups from the north marks the arrival of Celtic-speaking peoples, as we can infer from the increase in Central European admixture, while the expansion of anthropomorphic stelae from the north-west must have marked the expansion of Lusitanian.

Vasconic was spoken in both sides of the Pyrenees, as it was in the Middle Ages.

Wrong. One of the worst mistakes I am seeing in many comments since the paper was published, although admittedly the paper goes around this problem talking about “Modern Basques”. Vasconic toponyms appear south of the Pyrenees only after the Roman conquests, and tribes of the south-western Pyrenees and Cantabrian regions were likely Celtic-speaking peoples. Aquitanians (north of the western Pyrenees) are the only known ancient Vasconic-speaking population in proto-historic times, ergo the arrival of Bell Beakers in Iberia was most likely accompanied by Indo-European languages which were later replaced by Celtic expanding from Central Europe, and Iberian expanding from south-east Iberia, and only later with Latin and Vasconic.

Ligurian is non-Indo-European, and Lusitanian is Celtic-like, so Iberia must have been mostly non-Indo-European-speaking.

The fragmentary material available on Ligurian is enough to show that phonetically it is a NWIE dialect of non-Celtic, non-Italic nature, much like Lusitanian; that is, unless you follow laryngeals up to Celtic or Italic, in which case you can argue anything about this or any other IE language, as people who reconstruct laryngeals for Baltic in the common era do.

EDIT (19 Mar 2019): It was not clear enough from this paragraph, because Ligurian-like languages in NE Iberia is just a hypothesis based on the archaeological connection of the whole southern France Bell Beaker region. My aim was to repeat the idea that Old European hydro-toponymy is older in NE Iberia (as almost anywhere in Iberia) than Iberian toponymy, so the initial hypothesis is that:

  1. a Palaeo-European language (as Villar puts it) expanded into most regions of Iberia in ancient times (he considered at some point the Mesolithic, but that is obviously wrong, as we know now); then
  2. Celts expanded at least to the Ebro River Basin; then
  3. Iberians expanded to the north and replaced these in NE Iberia; and only then
  4. after the Roman invasion, around the start of the Common Era, appear Vasconic toponyms south of the Pyrenees.

Lusitanian obviously does not qualify as Celtic, lacking the most essential traits that define Celticness…Unless you define “(Para-)Celtic” as Pre-Proto-Celtic-like, or anything of the sort to support some Atlantic continuity, in which case you can also argue that Pre-Italic or Pre-Germanic are Celtic, because you would be essentially describing North-West Indo-European

If Basques have R1b, it’s because of a culture of “matrilocality” as opposed to the “patrilocality” of Indo-Europeans

So wrong it hurts my eyes every time I read this. Not only does matrilocality in a regional group have few known effects in genetics, but there are many well-documented cases of population replacement (with either ancestry or Y-DNA haplogroups, or both) without language replacement, without a need to resort to “matrilineality” or “matrilocality” or any other cultural difference in any of these cases.

In fact, it seems quite likely now that isolated ancient peoples north of the Pyrenees will show a gradual replacement of surviving I2a lineages by neighbouring R1b, while early Iberian R1b-DF27 lineages are associated with Lusitanians, and later incoming R1b-DF27 lineages (apart from other haplogroups) are most likely associated with incoming Celts, which must have remained in north-central and central-east European groups.

NOTE. Notice how R1a is fully absent from all known early Indo-European peoples to date, whether Iberian IE, British IE, Italic, or Greek. The absence of R1a in Iberia after the arrival of Celts is even more telling of the origin of expanding Celts in Central Europe.

I haven’t had enough time to add Iberian samples to my spreadsheet, and hence neither to the ASoSaH texts nor maps/PCAs (and I don’t plan to, because it’s more efficient for me to add both, Asian and Iberian samples, at the same time), but luckily Maciamo has summed it up on Eupedia. Or, graphically depicted in the paper for the southeast:

iberia-haplogroups
Y chromosome haplogroup composition of individuals from southeast Iberia during the past 2000 years. The general Iberian Bronze and Iron Age population is included for comparison. Modified from Olalde et al. (2019).

Does this continued influx of Y-DNA haplogroups in Iberia with different cultures represent permanent changes in language? Are, therefore, modern Iberian languages derived from Lusitanian, Sorothaptic/Celtic, Greek, Phoenician, East or West Germanic, Hebrew, Berber, or Arabic languages? Obviously not. Same with Italy (see the recent preprint on modern Italians by Raveane et al. 2018), with France, with Germany, or with Greece.

If that happens in European regions with a known ancient history, why would the recent expansions and bottlenecks of R1b in modern Basques (or N1c around the Baltic, or R1a in Slavs) in the Middle Ages represent an ancestral language surviving into modern times?

Indo-Iranians

If something is clear from Narasimhan, Patterson, et al. (2018), is that we know finally the timing of the introduction and expansion of R1a-Z645 lineages among Indo-Iranians.

We could already propose since 2015 that a slow admixture happened in the steppes, based on archaeological finds, due to settlement elites dominating over common peoples, coupled with the known Uralic linguistic traits of Indo-Iranian (and known Indo-Iranian influence on Finno-Ugric) – as I did in the first version of the Indo-European demic diffusion model.

The new huge sampling of Sintashta – combined with that of Catacomb, Poltavka, Potapovka, Andronovo, and Srubna – shows quite clearly how this long-term admixture process between Uralic peoples and Indo-Iranians happened between forest-steppe CWC (mainly Abashevo) and steppe groups. The situation is not different from that of Iberia ca. 2500-2000 BC; from Narasimhan, Patterson, et al. (2018):

We combined the newly reported data from Kamennyi Ambar 5 with previously reported data from the Sintashta 5 individuals (10). We observed a main cluster of Sintashta individuals that was similar to Srubnaya, Potapovka, and Andronovo in being well modeled as a mixture of Yamnaya-related and Anatolian Neolithic (European agriculturalist-related) ancestry.

Even with such few words referring to one of the most important data in the paper about what happened in the steppes, Wang et al. (2018) help us understand what really happened with this simplistic concept of “steppe ancestry” regarding Yamna vs. Corded Ware differences:

anatolia-neolithic-steppe-eneolithic
Image modified from Wang et al. (2018). Marked are: in red, approximate limit of Anatolia_Neolithic ancestry found in Yamna populations; in blue, Corded Ware-related groups. “Modelling results for the Steppe and Caucasus 1128 cluster. Admixture proportions based on (temporally and geographically) distal and proximal models, showing additional Anatolian farmer-related ancestry in Steppe groups as well as additional gene flow from the south in some of the Steppe groups as well as the Caucasus groups (see also Supplementary Tables 10, 14 and 20).”

As with Iberia (or any prehistoric region), the details of how exactly this language change happened are not evident, but we only need a plausible explanation coupled with archaeology and linguistics. Poltavka, Potapovka, and Sintashta samples – like the few available Iberian ones ca. 2500-2000 BC – offer a good picture of the cohabitation of R1b-L23 (mainly Z2103) and R1a-Z645 (mainly Z93+): a glimpse at the likely presence of R1a-Z93 within settlements – which must have evolved as the dominant elites – in a society where the majority of the population was initially formed by nomad herders (probably most R1b-Z2103), who were usually buried outside of the main settlements.

Will the upcoming Narasimhan, Patterson et al. (2019) deal with this problem of how R1a-M417 replaced R1b-M269, and how the so-called “Steppe_MLBA” (i.e. Corded Ware) ancestry admixed with “Steppe_EMBA” (i.e. Yamnaya) ancestry in the steppes, and which one of their languages survived in the region (that is, the same the Reich Lab has done with Iberia)? Not likely. The ‘genetic wars’ in Iberia deal with haplogroup R1b-P312, and how it was neither ‘native’ nor associated with Basques and non-Indo-European peoples in general. The ‘genetic wars’ in South Asia are concerned with the steppe origin of R1a, to prove that it is not a ‘native’ haplogroup to India, and thus neither are Indo-Aryan languages. To each region a politically correct account of genetic finds, with enough care not to fully dismiss national myths, it seems.

NOTE. Funnily enough, these ‘genetic wars’ are the making of geneticists since the 1990s and 2000s, so we are still in the midst of mostly internal wars caused by what they write. Just as genetic papers of the 2020s will most likely be a reaction to what they are writing right now about “steppe ancestry” and R1a. You won’t find much change to the linguistic reconstruction in this whole period, except for the most multicolored glottochronological proposals…

The first author of the paper has engaged, as far as I could see in Twitter, in dialogue with Hindu nationalists who try to dismiss the arrival of steppe ancestry and R1a into South Asia as inconclusive (to support the potential origin of Sanskrit millennia ago in the Indus Valley Civilization). How can geneticists deal with the real problem here (the original ethnolinguistic group expanding with Corded Ware), when they have to fend off anti-steppists from Europe and Asia? How can they do it, when they themselves are part of the same societies that demand a politically correct presentation of data?

This is how the data on the most likely Indo-Iranian-speaking region should be presented in an ideal world, where – as in the Iberia paper – geneticists would look closely to the Volga-Ural region to discover what happened with Proto-Indo-Iranians from their earliest to their latest stage, instead of constantly looking for sites close to the Indus Valley to demonstrate who knows what about modern Indian culture:

indo-iranian-admixture-similar-iberians
Tentative map of the Late PIE and Indo-Iranian community in the Volga-Ural steppes since the Eneolithic. Proportion of ancestry derived from central European Corded Ware peoples. Colors indicate the Y-chromosome haplogroup for each male. Red lines represent period of admixture. Modified from Olalde et al. (2019).

Now try and tell Hindu nationalists that Sanskrit expanded from an Early Bronze Age steppe community of R1b-rich nomadic herders that spoke Pre-Indo-Iranian, which was dominated and eventually (genetically) mostly replaced by elite Uralic-speaking R1a peoples from the Russian forest, hence the known phonetic (and some morphological) traits that remained. Good luck with the Europhobic shitstorm ahead..

Balto-Slavic

Iberian cultures, already with a majority of R1b lineages, show a clear northward expansion over previously Urnfield-like groups of north-east Iberia and Mediterranean France (which we now know probably represent the migration of Celts from central Europe). Similarly, Eastern Balts already under a majority of R1a lineages expanded likely into the Baltic region at the same time as the outlier from Turlojiškė (ca. 1075 BC), which represents the first obvious contacts of central-east Europe with the Baltic.

Iberia shows a more recent influx of central and eastern Mediterranean peoples, one of which eventually succeeded in imposing their language in Western Europe: Romans were possibly associated mainly with R1b-U152, apart from many other lineages. Proto-Slavs probably expanded later than Celts, too, connected to the disintegration of the Lusatian culture, and they were at some point associated with R1a-M458 and R1a-Z280(xZ92) lineages, apart from others already found in Early Slavs.

pca-balto-slavs-tollense-valley
PCA of central-eastern European groups which may have formed the Balto-Slavic-speaking community derived from Bell Beaker, evident from the position ‘westwards’ of CWC in the PCA, and surrounding cultures. Left: Early Bronze Age. Right: Tollense Valley samples.

This parallel between Iberia and eastern Europe is no coincidence: as Europe entered the Bronze Age, chiefdom-based systems became common, and thus the connection of ancestry or haplogroups with ethnolinguistic groups became weaker.

What happened earlier (and who may represent the Pre-Balto-Slavic community) will be clearer when we have enough eastern European samples, but basically we will be able to depict this admixture of NWIE-speaking BBC-derived peoples with Uralic-speaking CWC-derived groups (since Uralic is known to have strongly influenced Balto-Slavic), similar to the admixture found in Indo-Iranians, more or less like this:

iberian-admixture-balto-slavic
Tentative map of the North-West Indo-European and Balto-Slavic community in central-eastern Europe since the East Bell Beaker expansion. Proportion of ancestry derived from Corded Ware peoples. Colors indicate the Y-chromosome haplogroup for each male. Red lines represent period of admixture. Modified from Olalde et al. (2019).

The Early Scythian period marked a still stronger chiefdom-based system which promoted the creation of alliances and federation-like groups, with an earlier representation of the system expanding from north-eastern Europe around the Baltic Sea, precisely during the spread of Akozino warrior-traders (in turn related to the Scythian influence in the forest-steppes), who are the most likely ancestors of most N1c-V29 lineages among modern Germanic, Balto-Slavic, and Volga-Finnic peoples.

Modern haplogroup+language = ancient ones?

It is not difficult to realize, then, that the complex modern genetic picture in Eastern Europe and around the Urals, and also in South Asia (like that of the Aegean or Anatolia) is similar to the Iron Age / medieval Iberian one, and that following modern R1a as an Indo-European marker just because some modern Indo-European-speaking groups showed it was always a flawed methodology; as flawed as following R1b for ancient Vasconic groups, or N1c for ancient Uralic groups.

Why people would argue that haplogroups mean continuity (e.g. R1b with Basques, N1c with Finns, R1a with Slavs, etc.) may be understood, if one lives still in the 2000s. Just like why one would argue that Corded Ware is Indo-European, because of Gimbutas’ huge influence since the 1960s with her myth of “Kurgan peoples”. Not many denied these haplogroup associations, because there was no reason to do it, and those who did usually aligned with a defense of descriptive archaeology.

However, it is a growing paradox that some people interested in genetics today would now, after the Iberian paper, need to:

  • accept that ancient Iberians and probably Aquitanians (each from different regions, and probably from different “Basque-Iberian dialects” in the Chalcolithic, if both were actually related) show eventually expansions with R1b-L23, the haplogroup most obviously associated with expanding Indo-Europeans;
  • acknowledge that modern Iberians have many different lineages derived from prehistoric or historic peoples (Celts, Phoenicians, Greeks, Romans, Jews, Goths, Berbers, Arabs), which have undergone different bottlenecks, the last ones during the Reconquista, but none of their languages have survived;
  • realize that a similar picture is to be found everywhere in central and western Europe since the first proto-historic records, with language replacement in spite of genetic continuity, such as the British Isles (and R1b-L21 continuity) after the arrival of Celts, Romans, Anglo-Saxons, Vikings, or Normans;
  • but, at the same time, continue blindly asserting that haplogroup R1a + “steppe ancestry” represent some kind of supernatural combination which must show continuity with their modern Indo-Iranian or Balto-Slavic language from time immemorial.
sintashta-y-dna
Replacement of R1b-L23 lineages during the Early Bronze Age in eastern Europe and in the Eurasian steppes: emergence of R1a in previous Yamnaya and Bell Beaker territories. Modified from EBA Y-DNA map.

Behave, pretty please

The ‘conservative’ message espoused by some geneticists and amateur genealogists here is basically as follows:

  • Let’s not rush to new theories that contradict the 2000s, lest some people get offended by granddaddy not being these pure whatever wherever as they believed, and let’s wait some 5, 10, or 20 years, as long as necessary – to see if some corner of the Yamna culture shows R1a, or some region in north-eastern Europe shows N1c, or some Atlantic Chalcolithic sample shows R1b – to challenge our preferred theories, if we actually need to challenge anything at all, because it hurts too much.
  • Just don’t let many of these genetic genealogists or academics of our time be unhappy, pretty please with sugar on top, and let them slowly adapt to reality with more and more pet theories to fit everything together (past theories + present data), so maybe when all of them are gone, within 50 or 70 years, society can smoothly begin to move on and propose something closer to reality, but always as politically correct as possible for the next generations.
  • For starters, let’s discuss now (yet again) that Bell Beakers may not have been Indo-European at all, despite showing (unlike Corded Ware) clearly Yamna male lineages and ancestry, because then Corded Ware and R1a could not have been Indo-European and that’s terrible, so maybe Bell Beakers are too brachycephalic to speak Indo-European or something, or they were stopped by the Fearsome Tisza River, or they are not pure Dutch Single Grave in The South hence not Indo-European, or whatever, and that’s why Iron Age Iberians or Etruscans show non-Indo-European languages. That’s not disrespectful to the history of certain peoples, of course not, but talking about the evident R1a-Uralic connection is, because this is The South, not The North, and respect works differently there.
  • Just don’t talk about how Slavs and Balts enter history more than 1,500 years later than Indo-European peoples in Western and Southern Europe, including Iberia, and assume a heroic continuity of Balts and Slavs as pure R1a ‘steppe-like’ peoples dominating over thousands of kms. in the Baltic, Fennoscandia, eastern Europe, and northern Asia for 5,000 years, with multiple Balto-Slavs-over-Balto-Slavs migrations, because these absolute units of Indo-European peoples were a trip and a half. They are the Asterix and Obelix of white Indo-European prehistory.
  • Perhaps in the meantime we can also invent some new glottochronological dialectal scheme that fits the expansion of Sredni Stog/Corded Ware with (Germano-?)Indo-Slavonic separated earlier than any other Late PIE dialect; and Finno-Volgaic later than any other Uralic dialect, in the Middle Ages, with N1c.
balto-slavic-pca
Genetic structure of the Balto-Slavic populations within a European context according to the three genetic systems, from Kushniarevich et al. (2015). Pure Balto-Slavs from…hmm…yeah this…ancient…region…or people…cluster…Whatever, very very steppe-like peoples, the True Indo-Europeans™, so close to Yamna…almost as close as Finno-Ugrians.

To sum up: Iberia, Italy, France, the British Isles, central Europe, the Balkans, the Aegean, or Anatolia, all these territories can have a complex history of periodic admixture and language replacement everywhere, but some peoples appearing later than all others in the historical record (viz. Basques or Slavs) apparently cannot, because that would be shameful for their national or ethnic myths, and these should be respected.

Ignorance of the own past as a blank canvas to be filled in with stupid ethnolinguistic continuity, turned into something valuable that should not be challenged. Ethnonationalist-like reasoning proper of the 19th century. How can our times be called ‘modern’ when this kind of magical thinking is still prevalent, even among supposedly well-educated people?

Related

Haplogroup R1b-M167/SRY2627 linked to Celts expanding with the Urnfield culture

bronze-age-late-urnfield

As you can see from my interest in the recently published Olalde et al. (2019) Iberia paper, once you accept that East Bell Beakers expanded North-West Indo-European, the most important question becomes how did its known dialects spread to their known historic areas.

We already had a good idea about the expansion of Celts, based on proto-historical accounts, fragmentary languages, and linguistic guesstimates, but the connection of Celtic with either Urnfield or slightly later Hallstatt/La Tène was always blurred, due to the lack of precise data on population movements.

The latest paper on Iberia is interesting for many details, such as:

  • The express dismissal of the newest pet theory based on the simplistic “steppe ancestry = IE”: the obsessive comparisons of Dutch Bell Beakers as the origin of basically anything that moves in Europe.
  • A discrete influx of North African ancestry in certain samples before the Moorish invasion (which was probably mediated by peoples of North African rather than Levantine admixture).
  • The finding of very Mycenaean-like Greek colonies of the 5th century (interestingly, under R1b lineages).
iberia-celts-romans
Modified from section of PCA of ancient samples by Olalde et al. (2019). “IE Iberia” refers to Pre-Celtic Indo-European languages of Iberia, such as Galaico-Lusitanian in the west (see more on Lusitanian), and a potentially Ligurian-related language in the North-East and southern France.

The paper is, however, of particular importance from the perspective of historical linguistics. It confirms that:

  • Celtic-speaking peoples expanded in Iberia likely during the Late Bronze Age – Early Iron Age (probably with the Urnfield culture, before 1000 BC) with North/Central European ancestry.

NOTE. The paper marks what are believed to be the boundaries of non-Indo-European languages during the Iron Age in later times, extrapolating that situation to the past. Mediterranean sites with Iberian traits (ca. 6th century on) were probably non-Indo-European-speaking tribes, but it is unclear what happened in the centuries before their sampling, and there are no clear boundaries. These incoming Celts from central Europe with the Urnfield culture makes it very likely that the Iberian expansion to the north happened later, incorporating thus this central European ancestry in the process. The southern (orientalizing, Tartessian) site of La Angorrilla shows incineration and influence from Phoenician settlers, and their actual language is also far from clear. The other investigated samples, with higher central European contribution, are from Celtiberian sites.

  • The slightly later arrival of (Phoenician, Greek and) Latin-speaking peoples into Iberia is marked by Central/Eastern Mediterranean and North African ancestry.
iberia-migrations-celts-romans
Expansion of different ancestry components in Iberia during Prehistory. Modified from Olalde et al. (2019) to include labels with populations expanding with each component.

While both confirm what was more or less already known about the oldest attested NWIE dialects, and further support the role of East Bell Beakers in expanding North-West Indo-European, the first part is interesting for two main reasons:

  1. Koch’s Celtic from the West hypothesis, which made a recent comeback with a renewed model based on “steppe ancestry”, is once again rejected in population genomics, as expected. At this point I doubt this will mean anything to the supporters of the theory (because you can propose as many “Celtic-over-Celtic” layers as you want), but if you are not obsessed with autochthonous continuity of Celtic languages in the Atlantic area we might begin to judge the most correct dialectal split (and thus classification) among those proposed to date, based on ancestry and haplogroup expansions.
  2. We believed in the 2000s that the expansion of haplogroup R1b-M167 (TMRCA ca. 1100 BC for YTree or 1700 BC for YFull) was coupled with the expansion of Iberians from the Pyrenees, in turn (thus) closely related to Basques. This non-IE presence has been contested with toponymic data in linguistics, and with the testing of many modern samples and the subsequent discovery of the widespread distribution of the subclade in western and northern Europe. Now it has become even more likely (lacking confirmation with aDNA) that this haplogroup expanded with Celts.

NOTE. Regarding R1b SNPs, YTree has more samples (and thus more SNPs) to work with estimates, due to its connection with FTDNA groups, so it is in principle more reliable (although estimates were calculated in 2017). Nevertheless, the methods to estimate the age of the MRCA are different between YTree and YFull.

df27-m167-z262-mcdonald
YTree estimations of TMRCA for R1b-Z262 (left) and R1b-M167 (right).

Why this is important has to do with the realization that Celts must have expanded explosively in all directions during the estimated range for Common Celtic (ca. 1500-1000 BC), and as such R1b-M167 is probably going to be one of the clear Y-DNA markers of the Celtic expansion, when it appears in the ancient DNA record, maybe in new SNP calls from samples of the Olalde et al. (2019) paper, or in future Urnfield/Hallstatt/La Tène papers.

Sister clades derived from R1b-Z262 (TMRCA ca. 1650 BC for YTree, or 2700 for YFull), although sharing a quite old origin, may have taken part in the same communities that expanded R1b-M167, likely from some point in central Europe, possibly as remnants of a previous (Tumulus culture?) central European expansion, as the sample SZ5 from Szólád (R1b-CTS1595) and the distribution of modern samples suggest.

r1b-df27-m167-sry2627
Left: Modern distribution of upstream clade L176.2 (YFull R1b-CTS4188); Right: Modern distribution of M167. Both include later expansions within Iberia (probably with the Crown of Aragon during the Reconquista). Contour maps of the derived allele frequencies of the SNPs analyzed in Solé-Morata et al. (2017).

EDIT (23 APRIL): In Hernández et al. (2018), the TMRCA of R1b-M167 is reported as 3372-3718 ybp:

The youngest sub-branch, R1b-M167, dates to approximately 3.5 kya (95% CI= 2.5-5.3 kya), i.e. even after the Bronze Age.

r1b-df27-m167-europe
Contour (surface) maps displaying the frequencies of Y-chromosome haplogroup and its sub-lineages across Europe and the Mediterranean basin. Modified from Hernández et al. (2018).

NOTE. Admittedly, the maps are mainly based on Iberian samples and certain limited sampling elsewhere, so most of the frequencies displayed in other territories are extrapolated. Since the percentage of R1b-M167 in France is estimated to be ca. 3%, and in Bavaria ca. 5%, the distribution in Central Europe is probably much higher, and around the Mediterranean much lower than represented in them.

The Celtic expansion might not have been a mass migration of peoples replacing all male lines of their controlled territories (as was common in the Neolithic and Chalcolithic), because of the Bronze Age dominant chiefdom-based system that relied on alliances, but it is becoming clear that Early Celts are also going to show the expansion of certain successful male lineages.

Oh, and you can say goodbye to the autochthonous “Vasconic = R1b-DF27” (latest heir of the “Vasconic = R1b-P312”) theory, too, if – for some strange reason – you hadn’t already.

EDIT (16 MAR) Just in case the wording is not clear: the fact that this haplogroup most likely expanded with Celts does not mean that its lineages didn’t become eventually incorporated into Iberian cultures and adopted non-IE languages: some of them probably did at some point, in some regions of northern Iberia, and most were certainly later incorporated to the Roman civilization and spoke Latin, then to the medieval kingdoms with their languages, and so on until the present day… Only those eventually associated with Iron Age Aquitanians may have retained their non-IE language, unless those lineages today associated with Basques were incorporated later to the Basque-speaking regions by expanding medieval kingdoms. A complex picture repeated everywhere in Europe: no haplogroup+language continuity in sight, anywhere.

NOTE: This here is currently the most likely interpretation of data based on estimations of mutations; it is not confirmed with ancient samples.

Related