Over the past week or so, since the publication of new Corded Ware samples in Narasimhan, Patterson et al. (2019) and after finding out that the R1a-M417 star-like phylogeny may have started ca. 3000 BC, I have been ruminating the relevance of contradictory data about the Ukraine_Eneolithic_o sample from Alexandria, its potential wrong radiocarbon date, and its implications for the Indo-European question.
How many other similar ‘controversial’ samples are there which we haven’t even considered? And what mechanisms are in place to control that the case of Hajji_Firuz_CA I2327 is not repeated?
Ukraine Eneolithic outlier I6561
It was not the first time that I (or many others) have alternatively questioned its subclade or its date, but the contradictory data seem to keep piling up. We can still explain all these discrepancies by assuming that the radiocarbon date is correct – seeing how it is a direct and newly reported lab analysis – because it is an isolated individual from a poorly sampled region, so he may actually be the first one to show features proper of later Corded Ware-related samples.
The individual seems to be especially relevant for the Indo-European and Uralic homeland question. The last one to mention this sample in a publication was Anthony (2019), who considered it in common with two other Eneolithic samples from Dereivka to show how Anatolian farmer-related ancestry first appeared in the recently opened CHG mating network of the Pontic-Caspian steppes and forest-steppes during the Middle Eneolithic, after the expansion of Khvalynsk:
The currently oldest sample with Anatolian Farmer ancestry in the steppes in an individual at Aleksandriya, a Sredni Stog cemetery on the Donets in eastern Ukraine. Sredni Stog has often been discussed as a possible Yamnaya ancestor in Ukraine (Anthony 2007: 239- 254). The single published grave is dated about 4000 BC (4045–3974 calBC/ 5215±20 BP/ PSUAMS-2832) and shows 20% Anatolian Farmer ancestry and 80% Khvalynsk-type steppe ancestry (CHG&EHG). His Y-chromosome haplogroup was R1a-Z93, similar to the later Sintashta culture and to South Asian Indo-Aryans, and he is the earliest known sample to show the genetic adaptation to lactase persistence (I3910-T). Another pre-Yamnaya grave with Anatolian Farmer ancestry was analyzed from the Dnieper valley at Dereivka, dated 3600-3400 BC (grave 73, 3634–3377 calBC/ 4725±25 BP/ UCIAMS-186349). She also had 20% Anatolian Farmer ancestry, but she showed less CHG than Aleksandriya and more Dereivka-1 ancestry, not surprising for a Dnieper valley sample, but also showing that the old fifth-millennium-type EHG/WHG Dnieper ancestry survived into the fourth millennium BC in the Dnieper valley (Mathieson et al. 2018).
The main problem is that this sample has more than one inconsistent, anachronistic data compared to its reported precise radiocarbon date ca. 4045–3974 calBCE (5215±20BP, PSUAMS-2832). I summarized them on Twitter:
First known R1a-M417 sample, with subclade R1a-Y26 (Y2-), with formation date and TMRCA ca. 2750 BC (CI 95% ca. 3750–1950 BC), and proper of much later Steppe_MLBA bottlenecks. The closest available sample would be the Poltavka outlier of hg. R1a-Z94 (ca. 2700 BC), from a mixed cemetery that could belong to a later (likely Abashevo) layer; the closest related subclade is probably found in sample I12450 of Butkara_IA (ca. 800 BC).
NOTE. The formation date of upper clade R1a-Z93 is estimated ca. 3000 BC, with a CI 95% ca. 3550–2550 BC, suggesting that the actual TMRCA range for the subclade has most likely a lower maximum formation date than estimated with the available samples under Y3.
Ancestry and PCA cluster like Steppe_MLBA (see PCA below), different from neighbouring Sredni Stog samples of the roughly coetaneous Dereivka site (ca. 3600-3400 BC), and from a later Yamnaya sample from Dereivka (ca. 2800 BC), even more shifted toward WHG-related ancestry.
Allele for lactase persistence (I3910-T), found only much later among Bell Beakers, and still later in Sintashta and Steppe_MLBA samples. This suggests a strong selection in northern Europe and South Asia stemming from steppe-related (and not forest-steppe-related) peoples, postdating the age of massive Indo-European migrations.
My impression is that the Hajji_Firuz Chalcolithic outlier, initially dated ca. 5900-5500 BC, had much less reason to be questioned than this sample, since Pre-Yamnaya ancestry was (and apparently is still) believed by members of the Reich Lab to have come from south of the Caucasus, and to have arrived around that time or earlier to the North Caspian steppe, i.e. before the 5th millennium BC.
The formation date of its initially reported haplogroup, R1b-Z2103, is ca. 4100 BC (CI 95% 4800-3500 BC), which seems also roughly compatible with that date and site – at least as compatible as R1a-Y3(xY2) is for ca. 4000 BC -, so it could have been interpreted as a migrant from the South Caspian region, potentially related to Proto-Anatolians, especially before the description of the Caucasus genetic barrier in Wang et al (2018). For some reason, though, the Hajji_Firuz sample was questioned, but this one didn’t even merited an interrogation mark.
All in all, my guess is that genomic data of I6561 would have been a priori more compatible with a later period, during the expansion of East Corded Ware groups: at least Middle Dnieper culture, potentially Multi-Cordoned Ware culture, but most likely a Srubnaya-related one, given the most likely SNP mutation and TMRCA date, and the haplogroup variability found in the few samples available from that culture.
I tried to start a thread on the possibility that the radiocarbon date was wrong, and IF it were, how likely it would be that formal stats could actually show this, or how could we automatically prevent ancestry magicfiascos.
In other words: if this guy were a Srubnaya-related individual actually dated e.g. ca. 1700 BC, and someone would try to ‘prove’ – based on the current open source tools alone – that he was the ancestor of expanding peoples of the 4th and 3rd millennium BC (i.e. Balkan outliers, Yamnaya, Corded Ware, you name it), could these results be formally challenged?
I was hoping for some original brainstorming where people would propose crazy, essentially impossible to understand statistical models, say plotting dozens of well-studied mutations of different geographically related ancient samples with their reported dates, to visually highlight samples that don’t exactly fit with such a feature-based time series analysis; I mean, the kind of theoretical models I wouldn’t even be able to follow after the first two tweets or so. I didn’t receive an answer like that, but still:
Is it possible, with the currently available tools in population genomics, to tell whether this sample is ancestral to or derived from other Steppe-related populations, before the date is tested again? And how certain would these tests be…?
The question in this case is more theoretical, though. IF this case were wrongly 14C-dated, would it be possible to detect this e.g. by assessing the number of mutations (like LP and the subclade) and the type of ancestry that shouldn't be there? More or less as w/ I2327
Yfull dates are likely underestimated. I generally just multiply by 1.25. So, the subclade for this sample doesn't seem out of place. My opinion is that Sintashta and Andronovo don't come from CWC due to dated splits, etc., but formed in Eastern Europe.
I have nothing to add to these answers, because I agree that all contradictory data are circumstancial.
The current absolute lack of this kind of validity checks for ancestry models is disappointing, though, and leaves the so-called outliers in a dangerous limbo between “potentially very interesting samples” and “potentially wrongly dated samples”. Radiocarbon date is thus – together with compatibility of population source in terms of archaeological cultures and their potential relationship – a necessary variable to take into account in any statistical design: an error in one of these variables means a catastrophic error in the whole model.
For example, in these qpAdm models, I assumed Srubnaya, Ukraine_Eneolithic_outlier, and Bulgaria_MLBA samples were roughly coetaneous and potentially related to the Srubnaya-Sabatinovka–Noua cultural horizon, hence stemming from a source close to:
Abashevo-like individuals (whose best proxy to date should be Poltavka_outlier I0432) potentially admixed with Poltavka-like herders; or
Potapovka-like individuals potentially admixed with Catacomb-like peoples (whose best proxy until recently were probably Yamnaya_Kalmykia*).
Apart from the lack of more models for comparison (I’m not going to dedicate more time to this), the results can’t be interpreted without a proper sampling and context, either, because (1) Poltavka_o may actually be from a much later group closely related to Srubnaya; (2) Bulgaria_MLBA is only one sample; and (3) there are only two samples from Potapovka; so the models here presented are basically useless, as many similar models that have been tested looking just for a formal “best fit”.
So feel free to chime in and contribute with ideas as to how to detect in the future whether a sample is ancestral to or derived from others. I will post here informative answers from Twitter, too, if there are any. I don’t think a discussion about the potentially wrong date in this specific sample is very useful, because this seems impossible to prove or disprove at this point. Just what tools or data would you use to at least try and assess whether samples are compatible with its reported date or not – preferably in some kind of automated sieve that takes dozens or hundreds of samples into account.
On the bright side, there is so much more than formal stats to arrive to relevant inferences about prehistoric populations, their movements and languages. That’s why I6561 didn’t matter for the conclusion by Anthony (2019) that it was the R1b-rich Eneolithic Don-Volga-Caucasus region the most likely Indo-Anatolian and Late Proto-Indo-European homeland, due to the creation of a wide Eneolithic mating network with extended exogamy practices, where Y-chromosome bottlenecks seem to be one of the main genomic data to take into account from the Neolithic to the Middle Bronze Age.
And that is the same reason why it doesn’t matter that much for the Proto-Indo-European or Uralic question for me, either.
Interesting excerpts (emphasis mine, changes for clarity):
Here, we report the first genome-wide data of 10 ancient individuals from northeastern Xinjiang. They are dated to around 2,200 years ago and were found at the Iron Age Shirenzigou site. We find them to be already genetically admixed between Eastern and Western Eurasians. We also find that the majority of the East Eurasian ancestry in the Shirenzigou individuals is related to northeastern Asian populations, while the West Eurasian ancestry is best presented by ∼20% to 80% Yamnaya-like ancestry. Our data thus suggest a Western Eurasian steppe origin for at least part of the ancient Xinjiang population. Our findings furthermore support a Yamnaya-related origin for the now extinct Tocharian languages in the Tarim Basin, in southern Xinjiang.
The dominant mtDNA lineages of the Shirenzigou people are commonly found in modern and ancient West Eurasian populations, such as U4, U5, and H, while they also have East Eurasian-specific haplogroups A, D4, and G3, preliminarily documenting admixed ancestry from eastern and western Eurasia.
The admixture profile is also shown on the paternal Y chromosome side that 4 out of 6 males in Shirenzigou (Figure S2) belong to the West Eurasian-specific haplogroup R1b (n = 2) and East Eurasian-specific haplogroup Q1a (n = 2), the former is predominant in ancient Yamnaya and nearly 100% in Afanasievo, different from the Middle and Late Bronze Age Steppe groups (Steppe_MLBA) such as Andronovo, [Potapovka], Srubnaya, and Sintashta whose Y chromosomal haplogroup is mainly R1a.
We first carried out principal component analysis (PCA) to assess the genetic affinities of the ancient individuals qualitatively by projecting them onto present-day Eurasian variation (Figure 2). We observed a distinct separation between East and West Eurasians. Our ancient Shirenzigou samples and present-day populations from Central Asia and northwestern China form a genetic cline from East to West in the first PC.The distribution of Shirenzigou samples on the cline is relatively scattered with two major clusters, one being closer to modern-day Uygurs and Kazakhs and the other being closer to recently published ancient Saka and Huns from the Tianshan in Kazakhstan (…).
We applied a formal admixture test using f3 statistics in the form of f3 (Shirenzigou; X, Y) where X and Y are worldwide populations that might be the genetic sources for the Shirenzigou individuals. We observed the most significant signals of admixture in the Shirenzigou samples when using Yamnaya_Samara or Srubnaya as the West Eurasian source and some Northern Asians or Koreans as the East Eurasian source (Table S1). We also plotted the outgroup f3 statistics in the form of f3 (Mbuti; X, Anatolia_Neolithic) and f3 (Mbuti; X, Kostenki14) to visualize the allele sharing between population X and Anatolian farmers. As shown in Figure S3, the Steppe_MLBA populations including Srubnaya, Andronovo, and Sintashta were shifted toward farming populations compared with Yamnaya groups and the Shirenzigou samples. This observation is consistent with ADMIXTURE analysis that Steppe_MLBA populations have an Anatolian and European farmer-related component that Yamnaya groups and the Shirenzigou individuals do not seem to have. The analysis consistently suggested Yamnaya-related Steppe populations were the better source in modeling the West Eurasian ancestry in Shirenzigou.
Genetic Composition of Iron Age Shirenzigou Individuals
We continued to use qpAdm to estimate the admixture proportions in the Shirenzigou samples by using different pairs of source populations, such as Yamnaya_Samara, Afanasievo, Srubnaya, Andronovo, BMAC culture (Bustan_BA and Sappali_Tepe_BA) and Tianshan_Hun as the West Eurasian source and Han, Ulchi, Hezhen, Shamanka_EN as the East Eurasian source. In all cases, Yamnaya, Afanasievo, or Tianshan_Hun always provide the best model fit for the Shirenzigou individuals, while Srubnaya, Andronovo, Bustan_BA and Sappali_Tepe_BA only work in some cases. The Yamnaya_Samara or Afanasievo-related ancestry ranges from ∼20% to 80% in different Shirenzigou individuals, consistent with the scattered distribution on the East-West cline in the PCA
(…) we then modeled Shirenzigou as a three-way admixture of Yamnaya_Samara, Ulchi (or Hezhen) and Han to infer the source from the East Eurasia side that contributed to Shirenzigou. We found the Ulchi or Hezhen and Han-related ancestry had a complicated and unevenly distribution in the Shirenzigou samples. The most Shirenzigou individuals derived the majority of their East Eurasian ancestry from Ulchi or Hezhen-related populations, while the following two individuals M820 and M15-2 have more Han related than Ulchi/Hezhen-related ancestry.
Also, given the apparent lack of (extra farmer ancestry that characterizes) Corded Ware ancestry, if the results were already suspicious before, how likely are now the published R1a(xZ93) and/or radiocarbon dates of the Xiaohe mummies from Li et al. (2010, 2015)? Because, after all, one should have expected in such a late date a generalized admixture with neighbouring Srubna/Andronovo-like populations.
Interesting excerpts (emphasis mine, modified for clarity):
To understand the genetic structure and influence of the Viking expansion, we sequenced the genomes of 442 ancient humans from across Europe and Greenland ranging from the Bronze Age (c. 2400 BC) to the early Modern period (c. 1600 CE), with particular emphasis on the Viking Age. We find that the period preceding the Viking Age was accompanied by foreign gene flow into Scandinavia from the south and east: spreading from Denmark and eastern Sweden to the rest of Scandinavia. Despite the close linguistic similarities of modern Scandinavian languages, we observe genetic structure within Scandinavia, suggesting that regional population differences were already present 1,000 years ago.
Maps illustrating the following texts have been made based on data from this and other papers:
Maps showing ancestry include only data from this preprint (which also includes some samples from Sigtuna).
Maps showing haplogroups of ancient DNA samples based on their age include data from all published papers, but with slightly modified locations to avoid overcrowding (randomized distance approx. ± 0.1 long. and lat.).
We find that the transition from the BA to the IA is accompanied by a reduction in Neolithic farmer ancestry, with a corresponding increase in both Steppe-like ancestry and hunter-gatherer ancestry. While most groups show a slight recovery of farmer ancestry during the VA, there is considerable variation in ancestry across Scandinavia. In particular, we observe a wide range of ancestry compositions among individuals from Sweden, with some groups in southern Sweden showing some of the highest farmer ancestry proportions (40% or more in individuals from Malmö, Kärda or Öland).
Ancestry proportions in Norway and Denmark on the other hand appear more uniform. Finally we detect an influx of low levels of “eastern” ancestry starting in the early VA, mostly constrained among groups from eastern and central Sweden as well as some Norwegian groups. Testing of putative source groups for this “eastern” ancestry revealed differing patterns among the Viking Age target groups, with contributions of either East Asian- or Caucasus-related ancestry.
Overall, our findings suggest that the genetic makeup of VA Scandinavia derives from mixtures of three earlier sources: Mesolithic hunter-gatherers, Neolithic farmers, and Bronze Age pastoralists. Intriguingly, our results also indicate ongoing gene flow from the south and east into Iron Age Scandinavia. Thus, these observations are consistent with archaeological claims of wide-ranging demographic turmoil in the aftermath of the Roman Empire with consequences for the Scandinavian populations during the late Iron Age.
Genetic structure within Viking-Age Scandinavia
We find that VA Scandinavians on average cluster into three groups according to their geographic origin, shifted towards their respective present-day counterparts in Denmark, Sweden and Norway. Closer inspection of the distributions for the different groups reveals additional complexity in their genetic structure.
We find that the ‘Norwegian’ cluster includes Norwegian IA individuals, who are distinct from both Swedish and Danish IA individuals which cluster together with the majority of central and eastern Swedish VA individuals. Many individuals from southwestern Sweden (e.g. Skara) cluster with Danish present-day individuals from the eastern islands (Funen, Zealand), skewing towards the ‘Swedish’ cluster with respect to early and more western Danish VA individuals (Jutland).
Some individuals have strong affinity with Eastern Europeans, particularly those from the island of Gotland in eastern Sweden. The latter likely reflects individuals with Baltic ancestry, as clustering with Baltic BA individuals is evident in the IBS-UMAP analysis and through f4-statistics.
Genetic clustering using IBS-UMAP suggested genetic affinities of some Viking Age individuals with Bronze Age individuals from the Baltic. To further test these, we quantified excess allele sharing of Viking Age individuals with Baltic BA compared to early Viking Age individuals from Salme using f4 statistics. We find that many individuals from the island of Gotland share a significant excess of alleles with Baltic BA, consistent with other evidence of this site being a trading post with contacts across the Baltic Sea.
The earliest N1a-VL29 sample available comes from Iron Age Gotland (VK579) ca. AD 200-400 (see Iron Age Y-DNA maps), which also proves its presence in the western Baltic before the Viking expansion. The distribution of N1a-VL29 and R1a-Z280 (compared to R1a in general) among Vikings also supports a likely expansion of both lineages in succeeding waves from the east with Akozino warrior-traders, at the same time as they expanded into the Gulf of Finland.
Vikings in Estonia
(…) only one Viking raiding or diplomatic expedition has left direct archaeological traces, at Salme in Estonia, where 41 Swedish Vikings who died violently were buried in two boats accompanied by high-status weaponry. Importantly, the Salme boat-burial predates the first textually documented raid (in Lindisfarne in 793) by nearly half a century. Comparing the genomes of 34 individuals from the Salme burial using kinship analyses, we find that these elite warriors included four brothers buried side by side and a 3rd degree relative of one of the four brothers. In addition, members of the Salme group had very similar ancestry profiles, in comparison to the profiles of other Viking burials. This suggests that this raid was conducted by genetically homogeneous people of high status, including close kin. Isotope analyses indicate that the crew descended from the Mälaren area in Eastern Sweden thus confirming that the Baltic-Mid-Swedish interaction took place early in the VA.
N1a-VL29 lineages spread again later eastwards with Varangians, from Sweden into north-eastern Europe, most likely including the ancestors of the Rurikid dynasty. Unsurprisingly, the arrival of Vikings with Swedish ancestry into the East Baltic and their dispersal through the forest zone didn’t cause a language shift of Balto-Finnic, Mordvinic, or East Slavic speakers to Old Norse, either…
NOTE. For N1a-Y4339 – N1a-L550 subclade of Swedish origin – as main haplogroup of modern descendants of Rurikid princes, see Volkov & Seslavin (2019) – full text in comments below. Data from ancient samples show varied paternal lineages even among early rulers traditionally linked to Rurik’s line, which explains some of the discrepancies found among modern descendants:
A sample from Chernihiv (VK542) potentially belonging to Gleb Svyatoslavich, the 11th century prince of Tmutarakan/Novgorod, belongs to hg. I2a-Y3120 (a subclade of early Slavic I2a-CTS10228) and has 71% “Modern Polish” ancestry (see below).
Izyaslav Ingvarevych, the 13th century prince of Dorogobuzh, Principality of Volhynia/Galicia, is probably behind a sample from Lutsk (VK541), and belongs to hg. R1a-L1029 (a subclade of R1a-M458), showing ca. 95% of “Modern Polish” ancestry.
Firstly, modern Finnish individuals are not like ancient Finnish individuals, modern individuals have ancestry of a population not in the reference; most likely Steppe/Russian ancestry, as Chinese are in the reference and do not share this direction. Ancient Swedes and Norwegians are more extreme than modern individuals in PC2 and 4. Ancient UK individuals were more extreme than Modern UK individuals in PC3 and 4. Ancient Danish individuals look rather similar to modern individuals from all over Scandinavia. By using a supervised ancient panel, we have removed recent drift from the signal, which would have affected modern Scandinavians and Finnish populations especially. This is in general a desirable feature but it is important to check that it has not affected inference.
The story for Modern-vs-ancient Finnish ancestry is consistent, with ancient Finns looking much less extreme than the moderns. Conversely, ancient Norwegians look like less-drifted modern Norwegians; the Danish admixture seen through the use of ancient DNA is hard to detect because of the extreme drift within Norway that has occurred since the admixture event. PC4 vs PC5 is the most important plot for the ancient DNA story: Sweden and the UK (along with Poland, Italy and to an extent also Norway) are visibly extremes of a distribution the same “genes-mirror-geography” that was seen in the Ancient-palette analysis. PC1 vs PC2 tells the same story – and stronger, since this is a high variance-explained PC – for the UK, Poland and Italy.
Evidence for Pictish Genomes
The four ancient genomes of Orkney individuals with little Scandinavian ancestry may be the first ones of Pictish people published to date. Yet a similar (>80% “UK ancestry) individual was found in Ireland (VK545) and five in Scandinavia, implying that Pictish populations were integrated into Scandinavian culture by the Viking Age.
Our interpretation for the Orkney samples can be summarised as follows. Firstly, they represent “native British” ancestry, rather than an unusual type of Scandinavian ancestry. Secondly, that this “British” ancestry was found in Britain before the Anglo-Saxon migrations. Finally, that in Orkney, these individuals would have descended from Pictish populations.
(…) ‘UK’ represents a group from which modern British and Irish people all receive an ancestry component. This information together implies that within the sampling frame of our data, they are proxying the ‘Briton’ component in UK ancestry; that is, a pre-Roman genetic component present across the UK. Given they were found in Orkney, this makes it very likely that they were descended from a Pictish population.
Modern genetic variation within the UK sees variation between ‘native Briton’ populations Wales, Scotland, Cornwall and Ireland as large compared to that within the more ‘Anglo-Saxon’ English. This is despite subsequent gene flow into those populations from English-like populations. We have not attempted to disentangle modern genetic drift from historically distinct populations. Roman-era period people in England, Wales, Ireland and Scotland may not have been genetically close to these Orkney individuals, but our results show that they have a shared genetic component as they represent the same direction of variation.
As in the case of mitochondrial DNA, the overall distribution profile of the Y chromosomal haplogroups in the Viking Age samples was similar to that of the modern North European populations. The most frequently encountered male lineages were the haplogroups I1, R1b and R1a.
Haplogroup I (I1, I2)
The distribution of I1 in southern Scandinavia, including a sample from Sealand (VK532) ca. AD 100 (see Iron Age Y-DNA maps) proves that it had become integrated into the West Germanic population already before their expansions, something that we already suspected thanks to the sampling of Germanic tribes.
Haplogroup R1b (M269, U106, P312)
Especially interesting is the finding of R1b-L151 widely distributed in the historical Nordic Bronze Age region, which is in line with the estimated TMRCA for R1b-P312 subclades found in Scandinavia, despite the known bottleneck among Germanic peoples under U106. Particularly telling in this regard is the finding of rare haplogroups R1b-DF19, R1b-L238, or R1b-S1194. All of that points to the impact of Bell Beaker-derived peoples during the Dagger period, when Pre-Proto-Germanic expanded into Scandinavia.
Also interesting is the finding of hg. R1b-P297 in Troms, Norway (VK531) ca. 2400 BC. R1b-P297 subclades might have expanded to the north through Finland with post-Swiderian Mesolithic groups (read more about Scandinavian hunter-gatherers), and the ancestry of this sample points to that origin.
However, it is also known that ancestry might change within a few generations of admixture, and that the transformation brought about by Bell Beakers with the Dagger Period probably reached Troms, so this could also be a R1b-M269 subclade. In fact, the few available data from this sample show that it comes from the natural harbour Skarsvågen at the NW end of the island Senja, and that its archaeologist thought it was from the Viking period or slightly earlier, based on the grave form. From Prescott (2017):
In 1995, Prescott and Walderhaug tentatively argued that a dramatic transformation took place in Norway around the Late Neolithic (2350 BCE), and that the swift nature of this transition was tied to the initial Indo-Europeanization of southern and coastal Norway, at least to Trøndelag and perhaps as far north as Troms. (…)
The Bell Beaker/early Late Neolithic, however, represents a source and beginning of these institution and practices, exhibits continuity to the following metal age periods and integrated most of Northern Europe’s Nordic region into a set of interaction fields. This happened around 2400 BCE, at the MNB to LN transition.
NOTE. This particular sample is not included in the maps of Viking haplogroups.
Among the ancient samples, two individuals were derived haplogroups were identified as E1b1b1-M35.1, which are frequently encountered in modern southern Europe, Middle East and North Africa. Interestingly, the individuals carrying these haplogroups had much less Scandinavian ancestry compared to the most samples inferred from haplotype based analysis. A similar pattern was also observed for less frequent haplogroups in our ancient dataset, such as G (n=3), J (n=3) and T (n=2), indicating a possible non-Scandinavian male genetic component in the Viking Age Northern Europe. Interestingly, individuals carrying these haplogroups were from the later Viking Age (10th century and younger), which might indicate some male gene influx into the Viking population during the Viking period.
As the paper says, the small sample size of rare haplogroups cannot distinguish if these differences are statistically relevant. Nevertheless, both E1b samples have substantial Modern Polish-like ancestry: one sample from Gotland (VK474), of hg. E1b-L791, has ca. 99% “Polish” ancestry, while the other one from Denmark (VK362), of hg. E1b-V13, has ca. 35% “Polish”, ca. 35% “Italian”, as well as some “Danish” (14%) and minor “British” and “Finnish” ancestry.
Given the E1b-V13 samples of likely Central-East European origin among Lombards, Visigoths, and especially among Early Slavs, and the distribution of “Polish” ancestry among Viking samples, VK362 is probably a close description of the typical ancestry of early Slavs. The peak of Modern Polish-like ancestry around the Upper Pripyat during the (late) Viking Age suggests that Poles (like East Slavs) have probably mixed since the 10th century with more eastern peoples close to north-eastern Europeans, derived from ancient Finno-Ugrians:
Similarly, the finding of R1a-M458 among Vikings in Funen, Denmark (VK139), in Lutsk, Poland (VK541), and in Kurevanikha, Russia (VK160), apart from the early Slav from Usedom, may attest to the origin of the spread of this haplogroup in the western Baltic after the Bell Beaker expansion, once integrated in both Germanic and Balto-Slavic populations, as well as intermediate Bronze Age peoples that were eventually absorbed by their expansions. This contradicts, again, my simplistic initial assessment of R1a-M458 expansion as linked exclusively (or even mainly) to Balto-Slavs.
The earliest archaeological evidence of horse milking, harnessing, and corralling is found in the ∼5,500-year-old Botai culture of Central Asian steppes (Gaunitz et al., 2018, Outram et al., 2009; see Kosintsev and Kuznetsov, 2013 for discussion). Botai-like horses are, however, not the direct ancestors of modern domesticates but of Przewalski’s horses (Gaunitz et al., 2018). The genetic origin of modern domesticates thus remains contentious, with suggested candidates in the Pontic-Caspian steppes (Anthony, 2007), Anatolia (Arbuckle, 2012, Benecke, 2006), and Iberia (Uerpmann, 1990, Warmuth et al., 2011). Irrespective of the origins of domestication, the horse genome is known to have been reshaped significantly within the last ∼2,300 years (Librado et al., 2017, Wallner et al., 2017, Wutke et al., 2018). However, when and in which context(s) such changes occurred remains largely unknown.
To clarify the origins of domestic horses and reveal their subsequent transformation by past equestrian civilizations, we generated DNA data from 278 equine subfossils with ages mostly spanning the last six millennia (n = 265, 95%) (Figures 1A and 1B; Table S1; STAR Methods). Endogenous DNA content was compatible with economical sequencing of 87 new horse genomes to an average depth-of-coverage of 1.0- to 9.3-fold (median = 3.3-fold; Table S2). This more than doubles the number of ancient horse genomes hitherto characterized. With a total of 129 ancient genomes, 30 modern genomes, and new genome-scale data from 132 ancient individuals (0.01- to 0.9-fold, median = 0.08-fold), our dataset represents the largest genome-scale time series published for a non-human organism (Tables S2, S3, and S4; STAR Methods).
Discovering Two Divergent and Extinct Lineages of Horses
Domestic and Przewalski’s horses are the only two extant horse lineages (Der Sarkissian et al., 2015). Another lineage was genetically identified from three bones dated to ∼43,000–5,000 years ago (Librado et al., 2015, Schubert et al., 2014a). It showed morphological affinities to an extinct horse species described as Equus lenensis (Boeskorov et al., 2018). We now find that this extinct lineage also extended to Southern Siberia, following the principal component analysis (PCA), phylogenetic, and f3-outgroup clustering of an ∼24,000-year-old specimen from the Tuva Republic within this group (Figures 3, 5A and S7A). This new specimen (MerzlyYar_Rus45_23789) carries an extremely divergent mtDNA only found in the New Siberian Islands some ∼33,200 years ago (Orlando et al., 2013) (Figure 6A; STAR Methods) and absent from the three bones previously sequenced. This suggests that a divergent ghost lineage of horses contributed to the genetic ancestry of MerzlyYar_Rus45_23789. However, both the timing and location of the genetic contact between E. lenensis and this ghost lineage remain unknown.
Modeling Demography and Admixture of Extinct and Extant Horse Lineages
Phylogenetic reconstructions without gene flow indicated that IBE differentiated prior to the divergence between DOM2 and Przewalski’s horses (Figure 3; STAR Methods). However, allowing for one migration edge in TreeMix suggested closer affinities with one single Hungarian DOM2 specimen from the 3rd mill. BCE (Dunaujvaros_Duk2_4077), with extensive genetic contribution (38.6%) from the branch ancestral to all horses (Figure S7B).This, and the extremely divergent IBE Y chromosome (Figure 6B), suggest that a divergent but yet unidentified ghost population could have contributed to the IBE genetic makeup.
Rejecting Iberian Contribution to Modern Domesticates
The genome sequences of four ∼4,800- to 3,900-year-old IBE specimens characterized here allowed us to clarify ongoing debates about the possible contribution of Iberia to horse domestication (Benecke, 2006, Uerpmann, 1990, Warmuth et al., 2011). Calculating the so-called fG ratio (Martin et al., 2015) provided a minimal boundary for the IBE contribution to DOM2 members (Cahill et al., 2013) (Figure 7A). The maximum of such estimate was found in the Hungarian Dunaujvaros_Duk2_4077 specimen (∼11.7%–12.2%), consistent with its TreeMix clustering with IBE when allowing for one migration edge (Figure S7B). This specimen was previously suggested to share ancestry with a yet-unidentified population (Gaunitz et al., 2018). Calculation of f4-statistics indicates that this population is not related to E. lenensis but to IBE (Figure 7B; STAR Methods). Therefore, IBE or horses closely related to IBE, contributed ancestry to animals found at an Early Bronze Age trade center in Hungary from the late 3rd mill. BCE. This could indicate that there was long-distance exchange of horses during the Bell Beaker phenomenon (Olalde et al., 2018). The fG minimal boundary for the IBE contribution into an Iron Age Spanish horse (ElsVilars_UE4618_2672) was still important (~9.6%–10.1%), suggesting that an IBE genetic influence persisted in Iberia until at least the 7th century BCE in a domestic context. However, fG estimates were more limited for almost all ancient and modern horses investigated (median = ~4.9%–5.4%; Figure 7A).
Iron Age horses
Y chromosome nucleotide diversity (π) decreased steadily in both continents during the last ∼2,000 years but dropped to present-day levels only after 850–1,350 CE (Figures 2B and S2E; STAR Methods). This is consistent with the dominance of an ∼1,000- to 700-year-old oriental haplogroup in most modern studs (Felkel et al., 2018, Wallner et al., 2017). Our data also indicate that the growing influence of specific stallion lines post-Renaissance (Wallner et al., 2017) was responsible for as much as a 3.8- to 10.0-fold drop in Y chromosome diversity.
We then calculated Y chromosome π estimates within past cultures represented by a minimum of three males to clarify the historical contexts that most impacted Y chromosome diversity. This confirmed the temporal trajectory observed above as Byzantine horses (287–861 CE) and horses from the Great Mongolian Empire (1,206–1,368 CE) showed limited yet larger-than-modern diversity. Bronze Age Deer Stone horses from Mongolia, medieval Aukštaičiai horses from Lithuania (C9th–C10th [ninth through the tenth centuries of the Common Era]), and Iron Age Pazyryk Scythian horses showed similar diversity levels (0.000256–0.000267) (Figure 2A). However, diversity was larger in La Tène, Roman, and Gallo-Roman horses, where Y-to-autosomal π ratios were close to 0.25. This contrasts to modern horses, where marked selection of specific patrilines drives Y-to-autosomal π ratios substantially below 0.25 (0.0193–0.0396) (Figure 2A). The close-to-0.25 Y-to-autosomal π ratios found in La Tène, Roman, and Gallo-Roman horses suggest breeding strategies involving an even reproductive success among stallions or equally biased reproductive success in both sexes (Wilson Sayres et al., 2014).
Lineage is used in this paper, as in many others in genetics, as defined by a specific ancestry. I keep that nomenclature below. It should not be confused with the “lineages” or “lines” referring to Y-chromosome (or mtDNA) haplogroups.
Supporting the “archaic” nature of the Hungarian BBC horses expanding from the Pontic-Caspian steppes are:
Among Y-chromosome lines, the common group formed by Botai-Borly4 (closely related to DOM2), Scythian horses from Aldy Bel (Arzhani), Iron Age horses from Estonia (Ridala), horses from the Xiongnu culture (Uushgiin Uvur), and Roman horses from Autricum (Chartres).
Among mtDNA lines, the common group formed by Botai samples, LebyazhinkaIV NB35, and different Eurasian domesticates, including many ancient Western European ones, which reveals a likely expansion of certain subclades east and west with the Repin culture.
(…) DOM2 contributed 22% to the ancestor of Przewalski’s horses ca. 9.47 kya, suggesting the Holocene optimum, rather than the Eneolithic Botai culture (∼5.5 kya), as a period of population contact. This pre-Botai introgression could explain the Y chromosome topology, where Botai horses were reported to carry two different segregating haplogroups: one occupied a basal position in the phylogeny while the other was closely related to DOM2. Multiple admixture pulses, however, are known to have occurred along the divergence of DOM2 and the Botai-Borly4 lineage, including 2.3% post-Borly4 contribution to DOM2, and a more recent 6.8% DOM2 intogression into Przewalski’s horses (Gaunitz et al., 2018). Model C2 parameters accommodate all these as a single admixture pulse, likely averaging the contributions of all these multiple events.
The paper cannot offer a detailed picture of ancient horse domestication, but it is yet another step in showing how Repin/Yamna is the most likely source of expansion of horse domesticates in Eurasia. Even more interestingly, Yamna settlers in Hungary probably expanded an ancient lineage of that horse at the same time as they spread with the Classical Bell Beaker culture. Remarkable parallels are thus found between:
The expansion of an ancient line of horse domesticates related to Yamna Hungary/East Bell Beakers seems to be confirmed by the pre-Iberian sample from Vilars I, Els Vilars4618 2672 (ca. 700-550 BC), likely of Iberian Beaker descent, showing a lineage older than the Indo-Iranian ones, which later replaced most European lines.
NOTE. For known contacts between Yamna and Proto-Beakers just before the expansion of East Bell Beakers, see a recent post on Vanguard Yamna groups.
The findings of the paper confirm the expansion of the horse firstly (and mainly) through the steppe biome, mimicking the expansion of Proto-Indo-Europeans first, and then replaced gradually (or not so gradually) by lines brought to Europe during westward expansions of Bronze Age, Iron Age, and later specialized horse-riding steppe cultures. The expansion also correlates well with the known spread of animal traction and pastoralism before 2000 BC:
Yamna expansion to the west “with horses and wagons”, with a more homogeneous ancestry in modern Europeans due to later migrations from the east (and north):
DR: inference is that two major migrations: farmers from Anatolia, followed by steppe pastoralists. Who are they? They took horses and wagons and spread. See rapid 90% pop turn over in Britain. Similar timing in Iberia, but a bit less turnover, and more period of overlap
To extend our knowledge of patterns of genomic diversity in Africa, we generated high coverage (> 30×) genome sequencing data from 43 geographically diverse Africans originating from 22 ethnic groups, representing a broad array of ethnic, linguistic, cultural, and geographic diversity (Additional file 1: Table S1). These include a number of populations of anthropological interest that have never previously been characterized for high-coverage genome sequence diversity such as Afroasiatic-speaking El Molo fishermen and Nilo-Saharan-speaking Ogiek hunter-gatherers (Kenya); Afroasiatic-speaking Aari, Agaw, and Amhara agro-pastoralists (Ethiopia); Niger-Congo-speaking Fulani pastoralists (Cameroon); Nilo-Saharan-speaking Kaba (Central African Republic, CAR); and Laka and Bulala (Chad) among others. We integrated this data with 49 whole genome sequences generated as part of the Simons Genome Diversity Project (SGDP)  (…)
Results and discussion
We found that the CRHG populations from central Africa, including the Mbuti from the Demographic Republic of Congo (DRC), Biaka from the CAR, and Baka, Bakola, and Bedzan from Cameroon, also form a basal lineage in the phylogeny. The other two hunter-gatherer populations, Hadza and Sandawe, living in Tanzania, group with populations from eastern Africa (Fig. 2). The two Nilo-Saharan-speaking populations, the Mursi from southern Ethiopia and the Dinka from southern Sudan, group into a single cluster, which is consistent with archeological data indicating that the migration of Nilo-Saharan populations to eastern Africa originated from a source population in southern Sudan in the last 3000 years [4, 23, 24, 25].
The Fulani people are traditionally nomadic pastoralists living across a broad geographic range spanning Sudan, the Sahel, Central, and Western Africa. The Fulani in our study, sampled from Cameroon, clustered with the Afroasiatic-speaking populations in East Africa in the phylogenetic analysis, indicating a potential language replacement from Afroasiatic to Niger-Congo in this population (Fig. 2). Prior studies suggest a complex history of the Fulani; analyses of Y chromosome variation suggest a shared ancestry with Nilo-Saharan and Afroasiatic populations , whereas mtDNA indicates a West African origin . An analysis based on autosomal markers found traces of West Eurasian-related ancestry in this population , which suggests a North African or East African origin (as North and East Africans also have such ancestry likely related to expansions of farmers and herders from the Near East) and is consistent with the presence at moderate frequency of the −13,910T variant associated with lactose tolerance in European populations [15, 16].
Phylogenetic reconstruction of the relationship of African individuals under a model allowing for migration using TREEMIX  largely recapitulates the NJ phylogeny with the exception of the Fulani who cluster near neighboring Niger-Congo-speaking populations with whom they have admixed (Additional file 2: Figure S1). Interestingly, TREEMIX analysis indicates evidence for gene flow between the Hadza and the ancestors of the Ju|‘hoan and Khomani San, supporting genetic, linguistic, and archeological evidence that Khoesan-speaking populations may have originated in Eastern Africa [28, 29, 30].
About the Fulani, this is what the referenced study of Y‐chromosome variation among 15 Sudanese populations by Hassan et al. (2008), had to say:
Haplogroups A-M13 and B-M60 are present at high frequencies in Nilo-Saharan groups except Nubians, with low frequencies in Afro-Asiatic groups although notable frequencies of B-M60 were found in Hausa (15.6%) and Copts (15.2%).
Haplogroup E (four different haplotypes) accounts for the majority (34.4%) of the chromosome and is widespread in the Sudan. E-M78 represents 74.5% of haplogroup E, the highest frequencies observed in Masalit and Fur populations. E-M33 (5.2%) is largely confined to Fulani and Hausa, whereas E-M2 is restricted to Hausa. E-M215 was found to occur more in Nilo-Saharan rather than Afro-Asiatic speaking groups.
In contrast, haplogroups F-M89, I-M170, J-12f2, and JM172 were found to be more frequent in the Afro-Asiatic speaking groups. J-12f2 and J-M172 represents 94% and 6%, respectively, of haplogroup J with high frequencies among Nubians, Copts, and Arabs.
Haplogroup K-M9 is restricted to Hausa and Gaalien with low frequencies and is absent in Nilo-Saharan and Niger-Congo.
Haplogroup R-M173 appears to be the most frequent haplogroup in Fulani, and haplogroup R-P25 has the highest frequency in Hausa and Copts and is present at lower frequencies in north, east, and western Sudan.
Haplogroups A-M51, A-M23, D-M174, H-M52, L-M11, OM175, and P-M74 were completely absent from the populations analyzed.
In this talk, I will describe how the new science of genome-wide ancient DNA can provide insights into past spreads of language and culture. I will discuss five examples: (1) the spread of Indo-European languages to Europe and South Asia in association with Steppe pastoralist ancestry, (2) the spread of Austronesian languages to the open Pacific islands in association with Taiwanese aboriginal-associated ancestry, (3) the spread of Austroasiatic languages through southeast Asia in association with the characteristic ancestry type that is also represented in western Indonesia suggesting that these languages were once widespread there, (4) the spread of Afroasiastic languages through in East Africa as part of the Pastoral Neolithic farming expansion, and (5) the spread of Na-Dene languages in North America in association with Proto-Paleoeskimo ancestry. I will highlight the ways that ancient DNA can meaningfully contribute to our understanding of language expansions—increasing the plausibility of some scenarios while decreasing the plausibility of others—while emphasizing that with genetic data by itself we can never definitively determine what languages ancient people spoke.
EDIT (3 MAY 2019): Apparently, there was not much to take from the talk:
DR: finally talks about spread of afroasiatic languages in northern Africa. Shows a complicated model. Idk man there's a lot of populations.
This seminar (and maybe some new paper on the Neolithic expansion in Africa) could shed light on population movements that may be related to the spread of Afroasiatic dialects. Until now, it seems that Bantu peoples have been more interesting for linguistics and archaeology, and South and East Africans for anthropology.
The process behind the introduction and development of farming in Africa is still unclear. It is not known how many independent invention events there were in the continent and to which extent the various first instances of farming in northern Africa are linked. Based on the archeological record, it was proposed that at least three regions in Africa may have developed agriculture independently: the Sahara/Sahel (around 7 ka), the Ethiopian highlands (7-4 ka), and western Africa (5-3 ka). In addition to these developments, the Nile River Valley is thought to have adopted agriculture (around 7.2 ka), from the Neolithic Revolution in the Middle East (Chapter 12 – Jobling et al. 2014; Chapter 35, 37 – Mitchell and Lane 2013). From these diverse centers of origin, farmers or farming practices spread to the rest of Africa, with domesticate animals reaching the southern tip of Africa ~2 ka and crop farming ~1,8 ka (Mitchell 2002; Huffman 2007)
Similar to the case in Europe and the 1990s-2000s wrong haplogroup history based on the modern distribution of R1b, R1a, N, or I2, it is possible that neither of the most often mentioned haplogroups linked to the Afroasiatic expansion, E and J, were responsible for its early spread within Africa, despite their widespread distribution in certain modern Afroasiatic-speaking areas. The fact that such assessments include implausible glottochronological dates spanning up to 20,000 years for the parent language, combined with regional language continuities despite archaeological changes, makes them even more suspicious.
Similar to the case with Indo-Europeans and the “steppe ancestry” concept of the 2010s, it may be that the often-looked-for West Eurasian ancestry among Africans is the effect of recent migrations, unrelated to the Afroasiatic expansion. The results of this paper could be offering another sign of how this ancestry may have expanded only quite recently westwards from East Africa through the Sahel, after the Semitic expansion to the south:
1. From approximately 1000 BC, accompanying Nilo-Saharan peoples.
Arguably, since the Fulani caste system wasn’t as elaborate in northern Nigeria, eastern Niger, and Cameroon, these specific groups would be a good example of the admixture with eastern populations, based on the (proportionally) huge amount of slaves they dealt with.
Similarly, it could be argued that the castes-based social stratification in most other territories (including Sudan) would have helped them keep a genetic make-up similar to their region of origin in terms of ancient lineages, hence similar to Chadic populations from west to east.
Reich’s assertion of the association of the language expansion with the spread of Pastoral Neolithic is still too vague, but – based on previous publications of ancient DNA in Africa and the Levant – I don’t have high hopes for a revolutionary paper in the near future. Without many samples and proper temporal transects, we are stuck with speculations based on modern distributions and scarce historical data.
About the potential genetic make-up of Cameroon before the arrival of the Neolithic, from the recent SAA 84th Annual Meeting (Abstracts in PDF):
Lipson, Mark (Harvard Medical School), Mary Prendergast (Harvard University), Isabelle Ribot (Université de Montréal), Carles Lalueza-Fox (Institute of Evolutionary Biology CSIC-UPF) and David Reich (Harvard Medical School)
 Ancient Human DNA from Shum Laka (Cameroon) in the Context of African Population History We generated genome-wide DNA data from four people buried at the site of Shum Laka in Cameroon between 8000–3000 years ago. One individual carried the deeply divergent Y chromosome haplogroup A00 found at low frequencies among some present-day Niger-Congo speakers, but the genome-wide ancestry profiles for all four individuals are very different from the majority of West Africans today and instead are more similar to West-Central African hunter-gatherers. Thus, despite the geographic proximity of Shum Laka to the hypothesized birthplace of Bantu languages and the temporal range of our samples bookending the initial Bantu expansion, these individuals are not representative of a Bantu source population. We present a phylogenetic model including Shum Laka that features three major radiations within Africa: one phase early in the history of modern humans, one close to the time of the migration giving rise to non-Africans, and one in the past several thousand years. Present-day West Africans and some East Africans, in addition to Central and Southern African hunter-gatherers, retain ancestry from the first phase, which is therefore still represented throughout the majority of human diversity in Africa today.
Second in popularity for the expansion of haplogroup N1a-L392 (ca. 4400 BC) is, apparently, the association with Turkic, and by extension with Micro-Altaic, after the Uralic link preferred in Europe; at least among certain eastern researchers.
According to the views of a number of authoritative researchers, the Yakut ethnos was formed in the territory of Yakutia as a result of the mixing of people from the south and the autochthonous population .
These three major Sakha paternal lineages may have also arrived in Yakutia at different times and/ or from different places and/or with a difference in several generations instead, or perhaps Y-chromosomal STR mutations may have taken place in situ in Yakutia. Nevertheless, the immediate common ancestor(s) from the Asian Steppe of these three most prevalent Sakha Y-chromosomal STR haplotypes possibly lived during the prominence of the Turkic Khaganates, hence the near-perfect matches observed across a wide range of Eurasian geography, including as far as from Cyprus in the West to Liaoning, China in the East, then Middle Lena in the North and Afghanistan in the South (Table 3 and Figure 5). There may also be haplotypes closely-related to ‘the dominant Elley line’ among Karakalpaks, Uzbeks and Tajiks, however, limitations in the loci coverage for the available dataset (only eight Y-chromosomal STR loci) precludes further conclusions on this matter .
According to the results presented here, very similar Y-STR haplotypes to that of the original Elley line were found in the west: Afghanistan and northern Cyprus, and in the east: Liaoning Province, China and Ulaanbaator, Northern Mongolia. In the case of the dominant Omogoy line, very closely matching haplotypes differing by a single mutational step were found in the city of Chifen of the Jirin Province, China. The widest range of similar haplotypes was found for the Yakut haplotype Unknown: In Mongolia, China and South Korea. For instance, haplotypes differing by a single step mutation were found in Northern Mongolia (Khalk, Darhad, Uryankhai populations), Ulaanbaator (Khalk) and in the province of Jirin, China (Han population).
Notably, Tat-C-bearing Y-chromosomes were also observed in ancient DNA samples from the 2700-3000 years-old Upper Xiajiadian culture in Inner Mongolia, as well as those from the Serteya II site at the Upper Dvina region in Russia and the ‘Devichyi gory’ culture of long barrow burials at the Nevel’sky district of Pskovsky region in Russia. A 14-loci Y-chromosomal STR median-joining network of the most prevalent Sakha haplotypes and a Tat-C-bearing haplotype from one of the ancient DNA samples recovered from the Upper Xiajiadian culture in Inner Mongolia (DSQ04) revealed that the contemporary Sakha haplotype ‘Xuo’ (Table 2, Haplotype ID “Xuo”) classified as that of ‘the Xiongnu clan’ in our current study, was the closest to the ancient Xiongnu haplotype (Figure 6). TMRCA estimate for this 14-loci Y-chromosomal STR network was 4357 ± 1038 years or 2341 ± 1038 BCE, which correlated well with the Upper Xiajiadian culture that was dated to the Late Bronze Age (700-1000 BCE).
Also, a simple look at the TMRCA and modern distribution was enough to hypothesize long ago the lack of connection of N1c-L392 with Altaic or Uralic peoples. From Ilumäe et al. (2016):
Previous research has shown that Y chromosomes of the Turkic-speaking Yakuts (Sakha) belong overwhelmingly to hg N3 (formerly N1c1). We found that nearly all of the more than 150 genotyped Yakut N3 Y chromosomes belong to the N3a2-M2118 clade, just as in the Turkic-speaking Dolgans and the linguistically distant Tungusic-speaking Evenks and Evens living in Yakutia (Table S2). Hence, the N3a2 patrilineage is a prime example of a male population of broad central Siberian ancestry that is not intrinsic to any linguistically defined group of people. Moreover, the deepest branch of hg N3a2 is represented by a Lebanese and a Chinese sample. This finding agrees with the sequence data from Hallast et al., where one Turkish Y chromosome was also assigned to the same sub-clade. Interestingly, N3a2 was also found in one Bhutan individual who represents a separate sub-lineage in the clade. These findings show that although N3a2 reflects a recent strong founder effect primarily in central Siberia (Yakutia, Sakha), the sub-clade has a much wider distribution area with incidental occurrences in the Near East and South Asia.
The most striking aspect of the phylogeography of hg N is the spread of the N3a3’6-CTS6967 lineages. Considering the three geographically most distant populations in our study—Chukchi, Buryats, and Lithuanians—it is remarkable to find that about half of the Y chromosome pool of each consists of hg N3 and that they share the same sub-clade N3a3’6. The fractionation of N3a3’6 into the four sub-clades that cover such an extraordinarily wide area occurred in the mid-Holocene, about 5.0 kya (95% CI = 4.4–5.7 kya). It is hard to pinpoint the precise region where the split of these lineages occurred. It could have happened somewhere in the middle of their geographic spread around the Urals or further east in West Siberia, where current regional diversity of hg N sub-lineages is the highest (Figure 1B). Yet, it is evident that the spread of the newly arisen sub-clades of N3a3’6 in opposing directions happened very quickly. Today, it unites the East Baltic, East Fennoscandia, Buryatia, Mongolia, and Chukotka-Kamchatka (Beringian) Eurasian regions, which are separated from each other by approximately 5,000–6,700 km by air. N3a3’6 has high frequencies in the patrilineal pools of populations belonging to the Altaic, Uralic, several Indo-European, and Chukotko-Kamchatkan language families. There is no generally agreed, time-resolved linguistic tree that unites these linguistic phyla. Yet, their split is almost certainly at least several millennia older than the rather recent expansion signal of the N3a3’6 sub-clade, suggesting that its spread had little to do with linguistic affinities of men carrying the N3a3’6 lineages.
It was thus clear long ago that N1c-L392 lineages must have expanded explosively in the 5th millennium through Northern Eurasia, probably from a region to the north of Lake Baikal, and that this expansion – and succeeding ones through Northern Eurasia – may not be associated to any known language group until well into the common era.
Some interesting excerpts, relevant for the latest papers (emphasis mine):
The Archaeology of the Early Slavs
(…) One of the most egregious problems with the current model of the Slavic migration is that it is not at all clear where it started. There is in fact no agreement as to the exact location of the primitive homeland of the Slavs, if there ever was one. The idea of tracing the origin of the Slavs to the Zarubyntsi culture dated between the 3rd century BC and the first century AD is that a gap of about 200 years separates it from the Kiev culture (dated between the 3rd and the 4th century AD), which is also attributed to the Slavs. Furthermore, another century separates the Kiev culture from the earliest assemblages attributed to the Prague culture. It remains unclear as to where the (prehistoric) Slavs went after the first century, and whence they could return, two centuries later, to the same region from which their ancestors had left. The obvious cultural discontinuity in the region of the presumed homeland raises serious doubts about any attempts to write the history of the Slavic migration on such a basis. There is simply no evidence of the material remains of the Zarubyntsi, Kiev, or even Prague culture in the southern and southwestern direction of the presumed migration of the Slavs towards the Danube frontier of the Roman Empire.
Moreover, the material culture revealed by excavations of 6th- to 7th-century settlements and, occasionally, cremation cemeteries in northwestern Russia, Belarus, Poland, Moravia, and Bohemia is radically different from that in the lands north of the Danube river, which according to the early Byzantine sources were inhabited at that time by Sclavenes: no settlement layout with a central, open area; no wheel-made pottery or pottery thrown on a tournette; no clay rolls inside clay ovens; few, if any clay pans; no early Byzantine coins, buckles, or remains of amphorae; no fibulae with bent stem, and few, if any bow fibulae. Conversely, those regions have produced elements of material culture that have no parallels in the lands north of the river Danube: oval, trough-like settlement features (which are believed to be remains of above-ground, log-houses); exclusively handmade pottery of specific forms; very large settlements, with over 300 houses; fortified sites that functioned as religious or communal centers; and burials under barrows. With no written sources to inform about the names and identities of the populations living in the 6th and 7th centuries in East Central and Eastern Europe, those contrasting material culture profiles could hardly be interpreted as ethnic commonality. In other words, there is no serious basis for attributing to the Sclavenes (or, at least, to those whom early Byzantine authors called so) any of the many sites excavated in Russia, Belarus, Poland, Moravia, and Bohemia.
There is of course evidence of migrations in the 6th and 7th centuries, but not in the directions assumed by historians. For example, there are clear signs of settlement discontinuity in northern Germany and in northwestern Poland. German archaeologists believe that the bearers of the Prague culture who reached northern Germany came from the south (from Bohemia and Moravia), and not from the east (from neighboring Poland or the lands farther to the east). At any rate, no archaeological assemblage attributed to the Slavs either in northern Germany or in northern Poland may be dated earlier than ca. 700. In Poland, settlement discontinuity was postulated, to make room for the new, Prague culture introduced gradually from the southeast (from neighboring Ukraine). However, there is increasing evidence of 6th-century settlements in Lower Silesia (western Poland and the lands along the Middle Oder) that have nothing to do with the Prague culture. Nor is it clear how and when did the Prague culture spread over the entire territory of Poland. No site of any of the three archaeological cultures in Eastern Europe that have been attributed to the Slavs (Kolochin, Pen’kivka, and Prague/Korchak) has so far been dated earlier than the sites in the Lower Danube region where the 6th century sources located the Sclavenes. Neither the Kolochin, nor the Pen’kivka cultures expanded westwards into East Central or Southeastern Europe; on the contrary, they were themselves superseded in the late 7th or 8th century by other archaeological cultures originating in eastern Ukraine. Meanwhile, there is an increasing body of archaeological evidence pointing to very strong cultural influences from the Lower and Middle Danube to the Middle Dnieper region during the 7th century—the opposite of the alleged direction of Slavic migration.
When did the Slavs appear in those regions of East Central and Eastern Europe where they are mentioned in later sources? A resistant stereotype of the current scholarship on the early Slavs is that “Slavs are Slavonic-speakers; Slavonic-speakers are Slavs.”* If so, when did people in East Central and Eastern Europe become “Slavonic speakers”? There is in fact no evidence that the Sclavenes mentioned by the 6th-century authors spoke Slavic (or what linguists now call Common Slavic). Nor can the moment be established (with any precision), at which Slavic was adopted or introduced in any given region of East Central and Eastern Europe.** To explain the spread of Slavic across those regions, some have recently proposed the model of a koiné, others that of a lingua franca. The latter was most likely used within the Avar polity during the last century of its existence (ca. 700 to ca. 800).
*Ziółkowski, “When did the Slavs originate?” p. 211. On the basis of the meaning of the Old Church Slavonic word ięzyk (“language,” but also “people” or “nation”), Darden, “Who were the Sclaveni?” p. 138 argues that the meaning of the name the Slavs gave to themselves was closely associated with the language they spoke.
**Uncertainty in this respect dominates even in recent studies of contacts between Slavic and Romance languages (particularly Romanian), even though such contacts are presumed to have been established quite early (Paliga, “When could be dated ‘the earliest Slavic borrowings’?”; Boček, Studie). Recent studies of the linguistic interactions between speakers of Germanic and speakers of Slavic languages suggest that the adoption of place names of Slavic origin was directly linked to the social context of language contact between the 9th and the 13th centuries (Klír, “Sociální kontext”).
During the 6th century, the area between the Danube and the Tisza in what is today Hungary, was only sparsely inhabited, and probably a “no man’s land” between the Lombard and Gepid territories. It is only after ca. 600 that this area was densely inhabited, as indicated by a number of new cemeteries that came into being along the Tisza and north of present-day Kecskemét. There can therefore be no doubt about the migration of the Avars into the Carpathian Basin, even though it was probably not a single event and did not involve only one group of population, or even a cohesive ethnic group.
The number of graves with weapons and of burials with horses is particularly large in cemeteries excavated in southwestern Slovakia and in neighboring, eastern Austria. This was a region of special status on the border of the qaganate, perhaps a “militarized frontier.” From that region, the Avar mores and fashions spread farther to the west and to the north, into those areas of East Central Europe in which, for reasons that are still not clear, Avar symbols of social rank were particularly popular, as demonstrated by numerous finds of belt fittings. Emulating the success of the Avar elites sometimes involved borrowing other elements of social representation, such as the preferential deposition of weapons and ornamented belts. For example, in the early 8th century, a few males were buried in Carinthia (southern Austria) with richly decorated belts imitating those in fashion in the land of the Avars, but also with Frankish weapons and spurs. Much like in the Avar-age cemeteries in Slovakia and Hungary, the graves of those socially prominent men are often surrounded by many burials without any grave goods whatsoever.
Carantania was a northern neighbor of the Lombard duchy of Friuli, which was inhabited by Slavs. According to Paul the Deacon, who was writing in the late 780s, those Slavs called their country Carantanum, by means of a corruption of the name of ancient Carnuntum (a former Roman legionary camp on the Danube, between Vienna and Bratislava). Carantanians were regarded as Slavs by the author of a report known as the Conversion of the Bavarians and Carantanians, and written in ca. 870 in order to defend the position of the archbishop of Salzburg against the claims of Methodius, the bishop of Pannonia.94 According to this text, a duke named Boruth was ruling over Carantania when he was attacked by Avars in ca. 740. He called for the military assistance of his Bavarian neighbors. The Bavarian duke Odilo (737–748) obliged, defeated the Avars, but in the process also subdued the Carantanians to his authority. Once Bavarian overlordship was established in Carantania, Odilo took with him as hostages Boruth’s son Cacatius and his nephew Chietmar (Hotimir). Both were baptized in Bavaria. During the 743 war between Odilo and Charles Martel’s two sons, Carloman and Pepin (the Mayors of the Palace in Austrasia and Neustria, respectively), Carantanian troops fought on the Bavarian side. The Bavarian domination cleared the field for missions of conversion to Christianity sent by Virgil, the new bishop of Salzburg (746–784). Many missionaries were of Bavarian origin, but some were Irish monks.
Several Late Avar cemeteries dated to the last quarter of the 8th century are known from the lands north of the middle course of the river Danube, in what is today southern Slovakia and the valley of the Lower Morava [see image below]. By contrast, only two cemeteries have so far been found in Moravia (the eastern part of the present-day Czech Republic), along the middle and upper course of the Morava and along its tributary, the Dyje. In both Dolní Dunajovice and Hevlín, the latest graves may be dated by means of strap ends and belt mounts with human figures to the very end of the Late Avar period. (…)
The archaeological evidence pertaining to burial assemblages dated to the early 9th century is completely different. Shortly before or after 800, all traces of cremation—with or without barrows—disappear from the valley of the Morava river and southwestern Slovakia, two regions in which cremation had been the preferred burial rite during the previous centuries. This dramatic cultural change has often been interpreted as a direct influence of both Avar and Frankish burial rites, but it coincides in time with the adoption of Christianity by local elites. In spite of conversion, however, the representation of status through furnished burial continued well into the 9th century. Unlike Avar-age sites in Hungary and the surrounding regions, many men were buried in 9th-century Moravia together with their spurs, in addition to such weapons as battle axes, “winged” lance heads, or swords with high-quality steel blades of Frankish production.
When the Magyars inflicted a crushing defeat on the Bavarians at Bratislava (July 4, 907), the fate of Moravia was sealed as well. Moravia and the Moravians disappear from the radar of the written sources, and historians and archaeologists alike believe that the polity collapsed as a result of the Magyar raids.
(…) although there can be no doubt about the relations between Uelgi and the sites in Hungary attributed to the first generations of Magyars, those relations indicate a migration directly from the Trans-Ural lands, and not gradually, with several other stops in the forest-steppe and steppe zones of Eastern Europe. In the lands west of the Ural Mountains, the Magyars are now associated with the Kushnarenkovo (6th to 8th century) and Karaiakupovo (8th to 10th century) cultures, and with such burial sites as Sterlitamak (near Ufa, Bashkortostan) and Bol’shie Tigany (near Chistopol, Tatarstan).* However, the same problem with chronology makes it difficult to draw the model of a migration from the lands along the Middle Volga. Many parallels for the so typically Magyar sabretache plates found in Hungary are from that region. They have traditionally been dated to the 9th century, but more recent studies point to the coincidence in time between specimens found in Eastern Europe and those from Hungary.
* Ivanov, Drevnie ugry-mad’iary; Ivanov and Ivanova, “Uralo-sibirskie istoki”; Boldog et al., “From the ancient homelands,” p. 3; Ivanov, “Similarities.” Ivanov, “Similarities,” p. 562 points out that the migration out of the lands along of the Middle Volga is implied by the disappearance of both cultures (Kushnarenkovo and Karaiakupovo) in the mid-9th century. For the Kushnarenkovo culture, see Kazakov, “Kushnarenkovskie pamiatniki.” For the Karaiakupovo culture, see Mogil’nikov, “K probleme.”
Given that the Magyars are first mentioned in relation to events taking place in the Lower Danube area in the 830s, the Magyar sojourn in Etelköz must have been no longer than 60 years or so—a generation. (…)
It has become obvious by now that one’s impression of the Magyars as “Easterners” and “steppe-like” was (and still is) primarily based on grave finds, while the settlement material is considerably more aligned with what is otherwise known from other contemporary settlement sites in Central and Southeastern Europe. The dominant feature on the 10th- and 11th-century settlements in Hungary is the sunken-floored building of rectangular plan, with a stone oven in a corner. Similarly, the pottery resulting from the excavation of settlement sites is very similar to that known from many other such sites in Eastern Europe. Moreover, while clear changes taking place in burial customs between ca. 900 and ca. 1100 are visible in the archaeological record from cemeteries, there are no substantial differences between 10th- and the 11th-century settlements in Hungary. (…)
As a matter of fact, the increasing quantity of paleobotanical and zooarchaeological data from 10th-century settlements strongly suggests that the economy of the first generations of Magyars in Hungary was anything but nomadic. To call those Magyars “half-nomad” is not only wrong, but also misleading, as it implies that they were half-way toward civilization, with social changes taking place that must have had material culture correlates otherwise visible in the burial customs.
The origin of “Slavs” (i.e. that of “Slavonic” as a language, whatever the ancestral Proto-Slavic ethnic make-up was) is almost as complicated as the origin of Albanians, Basques, Balts, or Finns. Their entry into history is very recent, with few reliable sources available until well into the Middle Ages. If you add our ignorance of their origin with the desire of every single researcher or amateur out there to connect them to the own region (or, still worse, to all the regions where they were historically attested), we are bound to find contradictory data and a constantly biased selection of information.
Furthermore, it is extremely complicated to connect any recent population to its ancestral (linguistic) one through haplogroups prevalent today, and just absurd to connect them through ancestral components. This, which was already suspected for many populations, has been confirmed recently for Basques in Olalde et al. (2019) and will be confirmed soon for Finns with a study of the Proto-Fennic populations in the Gulf of Finland.
NOTE. Yes, the “my parents look like Corded Ware in this PCA” had no sense. Ever. Why adult people would constantly engage in that kind of false 5,000-year-old connections instead of learning history – or their own family history – escapes all comprehension. But if something is certain about human nature, is that we will still see nativism and ancestry/haplogroup fetishism for any modern region or modern haplogroups and their historically attested ethnolinguistic groups.
As you can see from my maps and writings, I prefer neat and simple concepts: in linguistics, in archaeology, and in population movements. Hence my aversion to this kind of infinite proto-historical accounts (and interpretations of them) necessary to ascertain the origins of recent peoples (Slavs in this case), and my usual preference for:
Clear dialectal classifications, whether or not they can be as clear cut as I describe them. The only thing that sets Slavic apart from other recent languages is its connection with Baltic, luckily for both. Even though this connection is disputed by some linguists, and the question is always far from being resolved, a homeland of Proto-Balto-Slavic would almost necessarily need to be set to the north of the Carpathian Mountains in the Bronze Age (or at least close to them).
NOTE. A dismissal of a connection with Baltic would leave Slavic a still more complicated orphan, and its dialectal classification within Late PIE more dubious. Its union with Balto-Slavic locates it close to Germanic, and thus as a Bronze Age North-West Indo-European dialect close to northern Germany. So bear with me in accepting this connection, or enter the linguistic hell of arguing for Indo-Slavonic of R1a-Z93 mixed with Temematic….
A priori “pots = people” assumption, which may lead to important errors, but fewer than the usual “pots != people” of modern archaeologists. The traditional identification of the Common Slavic expansion with the Prague-Korchak culture – however undefined this culture may be – has clear advantages: it may be connected (although admittedly with many archaeological holes) with western cultures expanding east during the Bronze Age, and then west again after the Iron Age, and thus potentially also with Baltic.
A simplistic “haplogroup expansion = ethnolinguistic expansion”, which is quite useful for prehistoric migrations, but enters into evident contradictions as we approach the Iron Age. Common Slavs may be speculatively (for all we know) associated with an expansion of recent R1a-M458 lineages – among other haplogroups – from the east, and possibly Balto-Slavic as an earlier expansion of older subclades from the west, as I proposed in A Clash of Chiefs.
NOTE. The connection of most R1a-Z280 lineages is more obviously done with ancient Finno-Ugric peoples, as it is clear now (see here and here).
Slavs appeared first in the Danube?
No matter what my personal preference is, one can’t ignore the growing evidence, and it seems that Florin Curta‘s long-lasting view of a Danubian origin of expansion for Common Slavic, including its condition as a lingua franca of late Avars, won’t be easy to reject any time soon:
1) Theories concerning Chernyakhov as a Slavic homeland will apparently need to be fully rejected, due to the Germanic-like ancestry that will be reported in the study by Järve et al. (2019).
EDIT (3 MAY 2019). From their poster Shift in the genetic landscape of the western Eurasian Steppe not due to Scythian dominance, but rather at the transition to the Chernyakhov culture (Ostrogoths) (download PDF):
(…) the transition from the Scythian to the Chernyakhov culture (~2,100–1,700 cal BP) does mark a shift in the Ponto-Caspian genetic landscape. Our results agree well with the Ostrogothic origins of the Chernyakhov culture and support the hypothesis that Scythian dominance was cultural rather than achieved through population replacement.
2) Therefore, unless Przeworsk shows the traditionally described mixture of populations in terms of ancestry and/or haplogroups, it will also be a sign of East Germanic peoples expanding south (and potentially displacing the ancestors of Slavs in either direction, east or south).
It would seem we are stuck in a Danubian vs. Kievan homeland for Common Slavs, then:
3) About the homeland in the Kiev culture, two early Avar females from Szólád have been commented to cluster “among Modern Slavic populations” based on some data in Amorim et al. (2018).
Rather than supporting an origin of Slavs in common with modern Russians, Poles, and Ukranians as observed in the PCA, though, the admixture of AV1 and AV2 (ca. AD 540-640) paradoxically supports an admixture of Modern Slavs of Eastern Europe in common with early Avar peoples (an Altaic-speaking population) and other steppe groups with an origin in East Asia… So this admixture would actually support a western origin of the Common Slavs with which East Asian Avars may have admixed, and whose descendants are necessarily sampled at later times.
4) Favouring Curta’s Danubian origin (or even an origin near Bohemia) at the moment are thus:
The “western” cluster of Early Slavs from Brandýsek, Bohemia (ca. AD 600-900).
Two likely Slavic individuals from Usedom, in Mecklenburg-Vorpommern (AD 1200) show hg. R1a-M458 and E1b-M215 (Freder 2010).
An early West Slav individual from Hrádek nad Nisou in Northern Bohemia (ca. AD 1330) also shows E1b-M215 (Vanek et al. 2015).
One sample from Székkutas-Kápolnadülő (SzK/239) among middle or late Avars (ca. AD 650-710), a supposed Slavonic-speaking polity, of hg. E1b-V13.
Two samples from Karosc (K1/13, and K2/6) among Hungarian conquerors (ca. AD 895-950), likely both of hg. E1b-V13, probably connected to the alliance with Moravian elites.
Possibly a West Slavic sample from Poland in the High Middle Ages (see below).
A later Hungarian sample (II/53) from the Royal Basilica, where King Béla was interred, of hg. E1b1, supports the importance of this haplogroup among elite conquerors, although its original relation to the other buried individuals is unknown.
Even assuming that the R1a sample reported from the late Avar period is of a subclade typically associated with Slavs (I know, circular reasoning here), which is not warranted, we would have already 6 E1b1b vs. 1-2 R1a-M458 in populations that can be actually assumed to represent early Slavonic speakers (unlike many earlier cultures potentially associated with them), clearly earlier than other Slavic-speaking populations that will be sampled in eastern Europe. It is more and more likely that Early Slavs are going to strengthen Curta’s view, and this may somehow complicate the link of Proto-Slavic with eastern European BA cultures like Trzciniec or Lusatian.
NOTE. I am still expecting a clear expansion associated with Prague-Korchak, though, including a connection with bottlenecks based on R1a-M458 in the Middle Ages, whether the expansion is eventually shown to be from the west (i.e. Bohemia -> Prague -> Korchak), or from the east (i.e. Kiev -> Korchack -> Prague), and whether or not this cultural community was later replaced by other ‘true’ Slavonic-speaking cultures through acculturation or population movements.
5) Back to Przeworsk and the “north of the Carpathians” homeland (i.e. between the Upper Oder and the Upper Dniester), but compatible with Curta’s view: Even if Common Slavic is eventually evidenced to be driven by small migrations north and south of the Danube during the Roman Iron Age, before turning into a mostly “R1a-rich” migration or acculturation to the north in Bohemia and then east (which is what this early E1b-V13 connection suggests), this does not dismiss the traditional idea that Late Bronze Age – Iron Age central-eastern Europe was the Proto-Slavic homeland, i.e. likely the Pomeranian culture disturbed by the East Germanic migrations first (in Przeworsk), and the migrations of steppe nomads later (around the Danube).
Even without taking into account the connection with Baltic, the relevance of haplogroup E1b-V13 among Early Slavs may well be a sign of an ancestral population from the northern or eastern Carpathian region, supported by the finding of this haplogroup among the westernmost Scythians. The expansion of some modern E1b-CTS1273 lineages may link Slavic ancestrally with the Lusatian culture, which is an eastern (very specific) Urnfield culture group, stemming from central-east Europe.
An important paper in this respect is the upcoming Zenczak et al., where another hg. E1b1 will be added to the list above: such a sample is expected from Poland (from Kowalewko, Maslomecz, Legowo or Niemcza), either from the Roman Iron Age or Early Middle Ages, close to an early population of likely Scandinavian origin (eight I1 samples), apart from other varied haplogroups, with little relevance of R1a. Whether this E-V13 sample is an Iron Age one (justifying the bottleneck under E-V13 to the south) or, maybe more likely, a late one from the Middle Ages (maybe supporting a connection of the Gothic/Slavic E1b bottleneck with southern Chernyakhov or further west along the Danube) is unclear.
The finding of south-eastern European ancestry and lineages in both, Early Slavs and East Germanic tribes* suggests therefore a Slavonic homeland near (or within) the Przeworsk culture, close to the Albanoid one, as proposed based on topohydronymy. This may point to a complex process of acculturation of different eastern European populations which formed alliances, as was common during the Iron Age and later periods, and which cannot be interpreted as a clear picture of their languages’ original homeland and ancestral peoples (in the case of East Germanic tribes, apparently originally expanding from Scandinavia under strong I1 bottlenecks).
* Iberian samples of the Visigothic period in Spain show up to 25% E1b-V13 samples, with a mixture of haplogroups including local and foreign lineages, as well as some more E1b-V13 samples later during the Muslim period. Out of the two E1b samples from Longobards in Amorim et al. (2018), only SZ18 from Szólád (ca. AD 412-604) is within E1b-V13, in a very specific early branch (SNP M35.2), further locating the expansion of hg. E1b-V13 near the Danube. Samples of haplogroup J (maybe J2a) or G2a among Germanic tribes (and possibly in Poland’s Roman Iron Age / Early Middle Ages) are impossible to compare with early Hungarian ones without precise subclades.
I already interpreted the earlier Slavic samples we had as a sign of a Carpathian origin and very recent bottlenecks under R1a lineages among Modern Slavs:
The finding of haplogroup E1b1b-M215 in two independent early West Slavic individuals further supports that the current distribution of R1a1a1b1a-Z282 lineages in Slavic populations is the product of recent bottlenecks. The lack of a precise subclade within the E1b1b-M215 tree precludes a proper interpretation of a potential origin, but they are probably under European E1b1b1a1b1-L618 subclade E1b1b1a1b1a-V13 (formed ca. 6100 BC, TMRCA ca. 2800 BC), possibly under the mutation CTS1273 (formed ca. 2600 BC, TMRCA ca. 2000 BC), in common with other ancient populations around the Carpathians (see below §viii.11. Thracians and Albanians). This gross geographic origin would support the studies of the Common Slavic homeland based on toponymy (Figure 66), which place it roughly between the Upper Oder and the Upper Dniester, north of the Carpathians (Udolph 1997, 2016).
EDIT (8 APR 2019): Another interesting data is the haplogroup distribution among Modern Slavs and neighbouring peoples (see Wikipedia). For example, the bottleneck seen in Modern Albanians, under Z5017 subclade, also points to an origin of the expansion of E1b-V13 subclades among multiethnic groups around the Lower Danube coinciding with the Roman Iron Age, given the estimates for the arrival of Proto-Albanian close to the Latin and Greek linguistic frontier.
Remarkable is also its distribution among Rusyns, East Slavs from the Carpathians not associated with the Kievan Rus’, isolated thus quite soon from East Slavic expansions to the east. They were reported to show ca. 35% hg. E1b-V13 globally in FTDNA, with a frequency similar to or higher than R1a, in common with South Slavic peoples*, reflecting thus a situation similar to the source of East Slavs before further R1a-based bottlenecks (and/or acculturation events) to the east:
* Although probably due in part to founder effects and biased familial sampling, this should be assumed to be common to all FTDNA sampling, anyway.
Repeating what should be already evident: in complex organizations and/or demographically dense populations (more common since the Iron Age), we can’t expect language change to happen in the same way as during the known Neolithic or Chalcolithic population replacements, be it in Finland, Hungary, Iberia, or Poland. For example, no matter whether Romans (2nd c. BC) brought some R1b-U152 and other Mediterranean lineages to Iberia; Germanic peoples entering Hispania (AD 5th c.) were of typically Germanic lineages or not; Muslims who spoke mainly Berber (AD 8th c.) and were mainly of hg. E1b-M81 (and J?) brought North African ancestry; etc. the language or languages of Iberia changed (or not) with the political landscape: neither with radical population replacements (or full population continuity), nor with the dominant haplogroups’ ancestral language.
Y-chromosome haplogroups are, in those cases, useful for ascertaining a more recent origin of the population. Like the finding of certain R1a-Z645, I2a-L621 & N-L392 lineages among Hungarians shows a recent origin near the Trans-Urals forest-steppes, or the finding of I1, R1b-U106 & E1b-V13 among Visigoths shows a recent origin near the Danube, the finding of Early Slavs (ca. AD 6th-7th c.) originally with small elite groups of hg. R1a-M458 & E1b-V13 from the Lower/Middle Danube – if strengthened with more Early Slavic samples, with Slavonic partially expanding as a lingua franca in some regions – is not necessarily representative of the Proto-Slavic community, just as it is clearly not representative of the later expansion of Slavic dialects. It would be representative, though, of the same processes of acculturation repeated all over Eurasia at least since the Iron Age, where no genetic continuity can be found with ancestral languages.
A new paper (behind paywall) offers insight into the prevalent presence of R1a-Z93 among eastern Scytho-Siberian groups (most likely including Samoyedic speakers in the forest-steppes), and a new hint to the westward expansion of haplogroups Q and N (probably coupled with the so-called “Siberian ancestry”) from the east with different groups of Iron Age steppe nomads:
From an archeological and historical point of view, the term “Scythians” refers to Iron Age nomadic or seminomadic populations characterized by the presence of three types of artifacts in male burials: typical weapons, specific horse harnesses and items decorated in the so-called “Animal Style”. This complex of goods has been termed the “Scythian triad” and was considered to be characteristic of nomadic groups belonging to the “Scythian World” (Yablonsky 2001). This “Scythian World” includes both the Classic (or European) Scythians from the North Pontic region (7th–3th century BC) and the Southern Siberian (or Asian) populations of the Scythian period (also called Scytho-Siberians). These include, among others, the Sakas from Kazakhstan, the Tagar population from the Minusinsk Basin (Republic of Khakassia), the Aldy-Bel population from Tuva (Russian Federation) and the Pazyryk and Sagly cultures from the Altai Mountains.
In this work, we first aim to address the question of the familial and social organization of Scytho-Siberian groups by studying the genetic relationship of 29 individuals from the Aldy-Bel and Sagly cultures using autosomal STRs. (…) were obtained from 5 archeological sites located in the valley of the Eerbek river in Tuva Republic, Russia (Fig. 1). All the mounds of this archeological site were excavated but DNA samples were not collected from all of them. 14C dates mainly fall within the Hallstatt radiocarbon calibration plateau (ca. 800–400 cal BC) where the chronological resolution is poor. Only one date falls on an earlier segment of calibration curve: Le 9817–2650 ± 25 BP, i.e. 843–792 cal BC with a probability of 94.3% (using the OxCal v4.3.2 program). This sample (Bai-Dag 8, Kurgan 1, grave 10) is not from one of the graves studied but was used to date the kurgan as a whole.
Y-chromosome haplogroups were first assigned using the ISOGG 2018 nomenclature. In order to improve the precision of haplogroup definition, we also analyzed a set of Y-chromosome SNP (Supplementary Table 2). Nine samples belonged to the R1a-M513 haplogroup (defined by marker M513) and two of these nine samples were characterized as belonging to the R1a1a1b2-Z93 haplogroup or one of its subclades. Six samples belonged to the Q1b1a-L54 haplogroup and five of these six samples belonged to the Q1b1a3-L330 subclade. One sample belonged to the N-M231 haplogroup.
The distribution of these haplogroups in the population must be confronted with the prevalence of kinship among the samples. Although five individuals belonged to haplogroup Q1b1a3-L330, three of them (ARZ-T18, ARZ-T19 and ARZ-T20) were paternally related (Fig. 2). It must, therefore, be considered that haplogroup Q1b1a3-L330 is present in three independent instances (given that the remaining two instances exhibit no close familial relationship with other samples or one another). All five were buried on the Eki-Ottug 1 archaeological site (although in two different kurgans).
In the same way, although two groups, of two and three individuals, shared haplotypes belonging to the R1a-M513 haplogroup, these groups likely include a father/son pair (ARZ-T2 and ARZ-T12). Therefore, among nine R1a-M513 men, we found six independent haplotypes, one being present in two independent instances. All R1a-M513 haplotypes, however, including those attributed to the R1a1a1b2-Z93 subclade, only differed by one-step mutations, across 5 loci at most. All R1a-M513 individuals were buried on the same site, Eki-Ottug 2, in a single Kurgan.
Haplogroup R1a-M173 was previously reported for 6 Scytho-Siberian individuals from the Tagar culture (Keyser et al. 2009) and one Altaian Scytho-Siberian from the Sebÿstei site (Ricaut et al. 2004a), whereas haplogroup R1a1a1b2-Z93 (or R1a1a1b-S224) was described for one Scythian from Samara (Mathieson et al. 2015) and two Scytho-Siberians from Berel and the Tuva Republic (Unterländer et al. 2017). On the contrary, North Pontic Scythians were found to belong to the R1b1a1a2 haplogroup (Krzewińska et al. 2018), showing a distinction between the two groups of Scythians. (…) The absence of R1b lineages in the Scytho-Siberian individuals tested so far and their presence in the North Pontic Scythians suggest that these 2 groups had a completely different paternal lineage makeup with nearly no gene flow from male carriers between them.
The seven other male individuals studied in this work were found to carry Eastern Eurasian Y haplogroups Q1b1a and one of its subclades (n = 6) and N (n = 1). Haplogroup Q1b1a-L54 was previously described in four males from the Bronze Age in the Altai Mountains (Hollard et al. 2014, 2018) and was clearly associated with Siberian populations (Regueiro et al. 2013).
The N-M231 haplogroup emerged from haplogroup K in Southern Asia around 21,000 years BCE, maybe in Southern China (Shi et al. 2013; Ilumäe et al. 2016). Previous studies attested to its presence in samples from Neolithic and Bronze Age in China (Li et al. 2011; Cui et al. 2013). Waves of northwestern expansion of this haplogroup are described as beginning during the Paleolithic period (Derenko et al. 2006; Shi et al. 2013) but traces of this expansion in archeological samples were reported only in two Scytho-Siberian males from the Altai (Pilipenko et al. 2015).
The sample of haplogroup N comes from the Aldy-Bel culture (ARZ-T15), from the Eerbek site, but has no radiocarbon date. All Q1b-L330 samples come from the Sagly culture, and three are paternally related. The other Q1b-L54 sample is from other tombs in one kurgan at Aldy Bel.