There remain ongoing discussions about the origins of the ethnic Russian population. The ancestors of ethnic Russians were among the Slavic tribes that separated from the early Indo-European Group, which included ancestors of modern Slavic, Germanic and Baltic speakers, who appeared in the northeastern part of Europe ca. 1,500 years ago. Slavs were found in the central part of Eastern Europe, where they came in direct contact with (and likely assimilation of) the populations speaking Uralic (Volga-Finnish and Baltic- Finnish), and also Baltic languages [11–13]. In the following centuries, Slavs interacted with the Iranian-Persian, Turkic and Scandinavian peoples, all of which in succession may have contributed to the current pattern of genome diversity across the different parts of Russia. At the end of the Middle Ages and in the early modern period, there occurred a division of the East Slavic unity into Russians, Ukrainians and Belarusians. It was the Russians who drove the colonization movement to the East, although other Slavic, Turkic and Finnish peoples took part in this movement, as the eastward migrations brought them to the Ural Mountains and further into Siberia, the Far East, and Alaska. During that interval, the Russians encountered the Finns, Ugrians, and Samoyeds speakers in the Urals, but also the Turkic, Mongolian and Tungus speakers of Siberia. Finally, in the great expanse between the Altai Mountains on the border with Mongolia, and the Bering Strait, they encountered paleo-Asiatic groups that may be genetically closest to the ancestors of the Native Americans. Today’s complex patchwork of human diversity in Russia has continued to be augmented by modern migrations from the Caucasus, and from Central Asia, as modern economic migrations take shape.
In the current study, we annotated whole genome sequences of individuals currently living on the territory of Russia and identifying themselves as ethnic Russian or as members of a named ethnic minority (Fig. 1). We analyzed genetic variation in three modern populations of Russia (ethnic Russians from Pskov and Novgorod regions and ethnic Yakut from the Sakha Republic), and compared them to the recently released genome sequences collected from 52 indigenous Russian populations. The incidence of function-altering mutations was explored by identifying known variants and novel variants and their allele frequencies relative to variation in adjacent European, East Asian and South Asian populations. Genomic variation was further used to estimate genetic distance and relationships, historic gene flow and barriers to gene flow, the extent of population admixture, historic population contractions, and linkage disequilibrium patterns. Lastly, we present demographic models estimating historic founder events within Russia, and a preliminary HapMap of ethnic Russians from the European part of Russia and Yakuts from eastern Siberia.
The collection of identified SNPs was used to inspect quantitative distinctions among 264 individuals from across Eurasia (Fig. 1) using Principal Component Analysis (PCA) (Fig. 2). The first and the second eigenvectors of the PCA plot are associated with longitude and latitude, respectively, of the sample locations and accurately separate Eurasian populations according to geographic origin. East European samples cluster near Pskov and Novgorod samples, which fall between northern Russians, Finno-Ugric peoples (Karelian, Finns, Veps etc.), and other Northeastern European peoples (Swedes, Central Russians, Estonian, Latvians, Lithuanians, and Ukrainians) (Fig. 2b). Yakut individuals map into the Siberian sample cluster as expected (Fig. 2a). To obtain an extended view of population relationships, we performed a maximum likelihood-based estimation of ancestry and population structure using ADMIXTURE (Fig. 2c). The Novgorod and Pskov populations show similar profiles with their Northeastern European ancestors while the Yakut ethnic group showed mixed ancestry similar to the Buryat and Mongolian groups.
Possible admixture sources of the Genome Russia populations were addressed more formally by calculating F3 statistics, which is an allele frequency-based measure, allowing to test if a target population can be modeled as a mixture of two source populations . Results showed that Yakut individuals are best modeled as an admixture of Evens or Evenks with various European populations (Supplemental Table S4). Pskov and Novgorod showed admixture of European with Siberian or Finno-Ugric populations, with Lithuanian and Latvian populations being the dominant European sources for Pskov samples.
So, Russians expanding in the Middle Ages as acculturaded Finno-Volgaic peoples.
Interesting excerpts (emphasis mine, some references deleted for clarity):
The town of Sigtuna in eastern central Sweden was one of the pioneer urban hubs in the vast and complex communicative network of the Viking world. The town that is thought to have been royally founded was planned and organized as a formal administrative center and was an important focal point for the establishment of Christianity . The material culture in Sigtuna indicates that the town had intense international contacts and hosted several cemeteries with a Christian character. Some of them may have been used by kin-based groups or by people sharing the same sociocultural background. In order to explore the character and magnitude of mobility and migration in a late Viking Age town, we generated and analyzed genomic (n = 23) and strontium isotope (n = 31) data from individuals excavated in Sigtuna.
The mitochondrial genomes were sequenced at 1.5× to 367× coverage. Most of the individuals were assigned to haplogroups commonly found in current-day Europeans, such as H, J, and U [14, 26, 27]. All of these haplotypes are present in Scandinavia today.
The Y chromosome haplogroups were assigned in seven males. The Y haplogroups include I1a, I2a, N1a, G2a, and R1b. Two identified lineages (I2a and N1a) have not been found in modern-day Sweden or Norway [28, 29]. Haplogroups I and N are associated with eastern and central Europe, as well as Finno-Ugric groups . Interestingly, I2a was previously identified in a middle Neolithic Swedish hunter-gatherer dating to ca. 3,000 years BCE .
In Sigtuna, the genetic diversity in the late Viking Age was greater than the genetic diversity in late Neolithic and Bronze Age cultures (Unetice and Yamnaya as examples) and modern East Asians; it was on par with Roman soldiers in England but lower than in modern-day European groups (GBR and FIN; Figure 2B). Within the town, the group excavated at church 1 has somewhat greater diversity than that at cemetery 1. Interestingly, the diversity at church 1 is nearly as high as that observed in Roman soldiers in England, which is remarkable, since the latter was considered to be an exceptionally heterogeneous group in contemporary Europe .
Different sex-related mobility patterns for Sigtuna inhabitants have been suggested based on material culture, especially ceramics. Building on design and clay analyses, some female potters in Sigtuna are thought to have grown up in Novgorod in Rus’ . Moreover, historical sources mention female mobility in connection to marriage, especially among the elite from Rus’ and West Slavonic regions [41, 42]. Male mobility is also known from historical sources, often in connection to clergymen moving to the town .
Interestingly, we found a number of individuals from Sigtuna to be genetically similar to the modern-day human variation of eastern Europeans, and most harbor close genetic affinities to Lithuanians (Figure 2A). The strontium isotope ratios in 28 adult individuals with assigned biological sex and strontium values obtained from teeth (23 M1 and five M2) show that 70% of the females and 44% of the males from Sigtuna were non-locals (STAR Methods). The difference in migrant ratios between females and male mobility patterns was not statistically significant (Fisher’s exact test, p = 0.254 for 28 individuals and p = 0.376 for 16 individuals). Hence, no evidence of a sex-specific mobility pattern was found.
(…) As these social groups are not mirrored by our genetic or strontium data, this suggests that the inclusion in them was not based on kinship. Therefore, it appears as if socio-cultural factors, not biological bonds, governed where people were interred (i.e., the choice of cemetery).
Interesting from this paper is the higher genetic (especially Y-DNA) diversity found in more recent periods (see e.g. here) compared to Neolithic and Bronze Age cultures, which is probably the reason behind some obviously wrong interpretations, e.g. regarding links between Yamna and Corded Ware populations.
The sample 84001, a “first-generation short-distance migrant” of haplogroup N1c-L392 (N1a in the new nomenclature) brings yet more proof of how:
Admixture changes completely within a certain number of generations. In this case, the N1c-L392 sample clusters within the genetic variation of modern Norwegians, near to the Skane Iron Age sample, and not with its eastern origin (likely many generations before).
This haplogroup appeared quite late in Fennoscandia but still managed to integrate and expand into different ethnolinguistic groups; in this case, this individual was probably a Viking of Nordic language, given its genetic admixture and its non-local (but neighbouring Scandinavian) strontium values.
Complete mitochondrial genomics is an effective tool for studying the demographic history of human populations, but there is still a deficit of mitogenomic data in European populations. In this paper, we present results of study of variability of 80 complete mitochondrial genomes in two Hungarian populations from eastern part of Hungary (Szeged and Debrecen areas). The genetic diversity of Hungarian mitogenomes is remarkably high, reaching 99.9% in a combined sample. According to the analysis of molecular variance (AMOVA), European populations showed a low, but statistically significant level of between-population differentiation (Fst = 0.61%, p = 0), and two Hungarian populations demonstrate lack of between-population differences. Phylogeographic analysis allowed us to identify 71 different mtDNA sub-clades in Hungarians, sixteen of which are novel. Analysis of ancestry-informative mtDNA sub-clades revealed a complex genetic structure associated with the genetic impact of populations from different parts of Eurasia, though the contribution from European populations is the most pronounced. At least 8% of ancestry-informative haplotypes found in Hungarians demonstrate similarity with East and West Slavic populations (sub-clades H1c23a, H2a1c1, J2b1a6, T2b25a1, U4a2e, K1c1j, and I1a1c), while the influence of Siberian populations is not so noticeable (sub-clades A12a, C4a1a, and probably U4b1a4).
Our analysis of ancestry-informative mtDNA sub-clades revealed a complex genetic structure associated with the genetic impact of populations from different parts of Europe. At least 8% of ancestry-informative haplotypes found in Hungarians demonstrate similarity with East (Russians and Ukrainians) and West (Poles and Slovaks) Slavic populations (sub-clades H1c23a, H2a1c1, J2b1a6, T2b25a1, U4a2e, K1c1j, and I1a1c). This observation is consistent with the results of mtDNA studies of medieval populations living in the Hungarian-Slavic contact zone of the Carpathian Basin in the 9th–12th centuries AD (Csákyová et al. 2016). Taken together, these data confirm earlier historical and archaeological reports on mixed populations of medieval Slavs and Magyars, based on the research into cemeteries discovered in Central Europe (Csősz et al. 2016; Csákyová et al. 2016). On the other hand, we cannot confirm the Hungarian-Slavic contacts using molecular dating of the identified mtDNA sub-clades, since their age exceeds the estimated time of the contact period and varies from 1.3 kya (for K1c1j) to 5.2 kya (for T2b25a1) (Figure S1). One of an issue may be sample size problem, because some haplotypes may be missed in the sampling, and this can lead to an overestimate of the age of the mtDNA sub-clade (Richards et al. 2000).
However, it is known that the evolutionary ages of most mtDNA lineages specific to Eastern and Central Europeans correspond to approximately 4 kya (from 2.3 to 5.9 kya) (Malyarchuk et al. 2008, 2017; Mielnik-Sikorska et al. 2013; Översti et al. 2017), thus coinciding with the time of the Bronze Age expansion of Eastern Europeans in accordance with the Kurgan model established by archaeologists and paleogeneticists (Gimbutas 1971; Allentoft et al. 2015; Haak et al. 2015). Thus, similar haplotypes among Hungarians and Slavs and other European ethnic groups can be a reflection of the common genetic substratum which predates the formation of the most modern European populations. Therefore, mtDNA sub-clades H5a1m, T2a1c, and W3a1d1 (with the ages varying from 2.6 to 3.9 kya, based on complete mtDNA mutation rate), which are shared by Hungarians and Finno-Ugric peoples, such as Estonians and Finns, may testify these pan-European relationships (Figure S1). Another example is the sub-clade J2b1a6, which unites the mtDNA haplotypes of the ancient and modern population of Eastern and Central Europe from the Iron Age to the present (Figure S1).
The recent publication of Narasimhan et al. (2018) has outdated the draft of this post a bit, and it has made it at the same time still more interesting.
While we wait for the publication of the dataset (and the actual Y-DNA haplogroups and precise subclades with the revision of the paper), and as we watch the wrath of Hindu nationalists vented against the West (as if the steppe was in Western Europe) and science itself, we have already seen confirmation from the Reich Lab of their new approach to Late Proto-Indo-European migrations.
Yamna/Steppe EMBA, previously identified as the direct source of “steppe” ancestry (AKA ‘Yamnaya‘ ancestry) and Late Indo-European migrations in Asia – through Corded Ware, it is to be understood – has been officially changed. In the case of Indo-Iranian migrations it is the “Steppe MLBA cloud”, after a direct contribution to it of Yamna/Steppe EMBA, which expanded Indo-Iranian, as I predicted ancient DNA could support.
In Twitter, the main author responded the following when asked for this change regarding the origin of steppe ancestry in Asian migrants (emphasis mine):
Our reasons are:
The Turan samples show no elevated steppe ancestry till 2000BC.
MLBA is R1a
Indus periphery doesn’t have steppe ancestry but Swat does, and EMBA doesn’t work both in terms of time or genetic ancestry to explain the difference.
I am glad to see finally recognized that Y-DNA haplogroups and time have to be taken into account, and happy also to see an end to the by now obsolete ‘ADMIXTURE/PCA-only relevance’ in Human Ancestry. The timing of archaeological migrations, the cultural attribution of each sample, and the role of Y-DNA variability reduction and expansion have been finally recognized as equally important to assess potential migrations, as I requested.
This change was already in the making some months ago, when David Anthony – who has worked with the group for this paper and others before it – already changed his official view on Corded Ware – from his previous support of the 2015 model. His latest theory, which linked Yamna settlements in Hungary with a potential mixed society of migrants (of R1b-L23 and R1a-Z645 lineages) from West Yamna, is most likely wrong, too, but it was clearly a brave step forward in the right direction.
The only reasonable model now is that Yamna expanded Late Proto-Indo-European languages with steppe ancestry + R1b-L23 subclades.
You can either accept this change, or you can deny it and wait until one sample of R1a-Z645 appears in West Yamna or central Europe, or one sample of R1b-L23 appears in Corded Ware (as it is obvious it could happen), to keep spreading the wrong ideas still some more years, while the rest of the world goes on: Mallory, Anthony, and other archaeologists co-authoring the latest paper (probably part of the stronger partnership with academics that we were going to see), who had formally put forward complex, detailed theories, investing their time and name in them, have rejected their previous migration models to develop new ones based on the most recent findings. If they can do that, I am sure any amateur geneticist out there can, too.
The Balto-Slavic dialect and its homeland
An interesting question in Linguistics and Archaeology, now that Corded Ware cannot be identified as “Indo-Slavonic” or any other imaginary ancient group (like Indo-Slavo-Germanic), remains thus mostly unchanged since before the famous 2015 genetic papers:
Was Balto-Slavic a dialect of the expanding North-West Indo-European language, a Northern LPIE dialect, as we support, based on morphological and lexical isoglosses?
Or was it part of an Indo-Slavonic group in East Yamna, i.e. a Graeco-Aryan dialect, based mainly on the traditional Satem-Centum phonological division?
I am a strong supporter of Balto-Slavic being a member of a North-West Indo-European group. That’s probably because I educated myself first with the main Spanish books* on Proto-Indo-European reconstruction, and its authors kept repeating this consistent idea, but I have found no relevant data to reject it in the past 15 years.
* Today two of the three volumes are available in English, although they are from the early 1990s, hence a bit outdated. They also maintain certain peculiarities from Adrados’ own personal theories, such as multiple (coloured) laryngeals, 5 cases – with a common ancestral oblique case – for Middle PIE, etc. But it has lots of detailed discussions on the different aspects of the reconstruction. It is not an easy introductory manual to the field, though; for that you have already many famous short handbooks out there, like those of Fortson (N.American), Beekes (Leiden), or Meier-Brügger (Germany).
Fernando and I have always maintained that North-West Indo-European must have formed a very recent community, probably connected well into the early 2nd millennium BC for certain recent isoglosses to spread among its early dialects, based on our guesstimates*, and on our belief that it formed at some point not just a dialect continuum, but probably a common language, so we estimated that the expansion was associated with the pan-European influence of Únětice and close early Bronze Age European contacts.
NOTE. I know, you must be thinking “linguistic guesstimates? Bollocks, that’s not Science”. Right? Wrong. When you learn a dozen languages from different branches, half a dozen ancient ones, and then still study some reconstructed proto-languages from them, you begin to make your own assumptions about how the language changes you perceive could have developed according to your mental time frames. If you just learned a second language and some Latin in school, and try to make assumptions as to how language changes, or you believe you can judge it with this limited background, you have evidently the wrong idea of what a guesstimate is. I accept criticism to this concept from a scientist used only to statistical methods, since it comes from pure ignorance of what it means. And I accept alternative guesstimates from linguists whose language backgrounds may differ (and thus their perception of language change). However, I would not accept a glottochronological or otherwise (supposedly) statistical model instead (or a religious model, for that matter), so we have no alternatives to guesstimates for the moment.
In fact, guesstimates and dialectalization have paved the way to the steppe hypothesis, first with the kurgan hypothesis by Marija Gimbutas, then complemented further in the past 60 years by linguists and archaeologists into a detailed Khvalynsk -> Yamna -> Afanasevo/Bell Beaker/Sintashta-Andronovo expansion model, now confirmed with genomics. So either you trust us (or any other polyglot who deals with Indo-European matters, like Adrados, Lehmann, Beekes, Kloekhorst, Kortlandt, etc.), or you begin learning ancient languages and obtaining your own guesstimates, whichever way you prefer. The easy way of numbers + computer science does not exist yet, and is quite far from happening – until we can understand how our brains summarize and select important details involved in obtaining estimates – , no matter what you might be reading (even in Nature or Science) recently…
Data from the 2015 papers changed my understanding of the original NWIE-speaking community, and I have since shifted my preffered anthropological model (from a Northern dialect in Yamna spreading into a loose NWIE-speaking Corded Ware -> Únětice) to a quite close group formed by late Yamna settlers in the Carpathian Basin, expanded as East Bell Beakers, and later continuing with close contacts through Central European EBA.
NOTE. As you can read, we initially rejected Gimbutas’ and Anthony’s (2007) notion of a Late PIE splitting suddenly into all known dialects (viz. Italo-Celtic with Vučedol/Bell Beaker), and looked thus for a common NWIE spread with Corded Ware migrants, with help from inferences of modern haplogroup distribution (as was common in the early 2000s). Language reconstruction was the foundation of that model, and it was right in its own way. It probably gave the wrong idea to geneticists and archaeologists, who quite easily accepted some results from the 2015 papers as supporting this model. But it also helped us develop a new model and predict what would happen in future papers, as demonstrated in O&M 2018. Any alternative linguistic and archaeological model could explain what is seen today in genomics, but our model of North-West Indo-European reconstruction is obviously at present the best fit for it.
Nevertheless, one of the most important Balticists and Slavicists alive, Frederik Kortlandt, posits that there was in fact an Indo-Slavonic group, so one has to take that possibility into account. Not that his ideas are flawless, of course: he defends the glottalic theory – which is still held today by just a handful of researchers – , and I strongly oppose his description of Balto-Slavic and Germanic oblique cases in *-m- (against other LPIE *-bh-) as an ancestral remnant related to Anatolian (an ending which few scholars would agree corresponds to what he claims), since that would probably represent an older split than warranted in our model. I believe genetics is proving that the dialectalization of Late PIE happened as Fernando López-Menchero and I described.
NOTE. The idea with these examples of how he has been wrong in LPIE and MPIE reconstruction is not to observe the common ad hominem arguments used by amateur geneticists to dismiss academic proposals (“he said that and was wrong, ergo he is wrong now”). It is to bring into attention that the argument from authority is important for the academic community insofar as it creates a common ground, i.e. especially when there are many relevant scholars agreeing on the same subject. But, indeed, any model can and should be challenged, and all authorities are capable of being wrong, and in fact they often are.
The most common explanation today for the dialectal development *-m- is an innovation (not an archaism), whether morphological (viz. Ita. and Gk. them. pl *-i) or phonological (as I defend); and the most commonly repeated model for the satemization trend (even for those supporting a three-dorsal theory for PIE) is areal contact, whether driven by a previous (most likely Uralic) substratum, or not. Hence, if Kortlandt’s main different phonological and morphological assessments of the parent language are flawed, and they are the basis for his dialectal scheme, it should be revised.
The ‘atomic bomb’ that Indo-Slavonic proponents launched, in my opinion, was Holzer’s Temematic (born roughly at the same time as the renewed Old European concept in North-West Indo-European model of Oettinger) – and indeed Kortlandt’s acceptance of it. It seems to me like the linguistic equivalent of the archaeological “patron-client relationship” proposed by Anthony for a cultural diffusion of Late PIE into different Corded Ware regions: almost impossible to be fully rejected, if the Indo-Slavonic superstrate is proposed for a relatively early time.
In my opinion, the shared morphological layer with North-West Indo-European is obviously older than Iranian influence on Slavic, and I think this is communis opinio today. But how could we disentangle the dialectalization of Balto-Slavic, if there is (as it seems) an ancestral substrate layer (most likely Uralic) common to both Balto-Slavic and Indo-Iranian? It seems a very difficult task.
The expansion of Balto-Slavic
In any case, there are two, and only two mainstream choices right now.
NOTE. Mainstream, as in representing trends current today among Indo-Europeanists, so that many programs around the world would explain these alternative models to their students, or they would easily appear in most handbooks. Not like the word “mainstream” you read in any comment out there by anyone who has never been interested in Indo-European studies, and uses any text from any author, written who knows how long ago, merely to justify their ethnic preconceptions coupled with certain genomic finds.
You can agree with:
A) The Spanish and German schools of thought, together with many American and British scholars, as well as archaeologists like Heyd, Mallory, or Prescott, and now Anthony, too: the language ancestral to Balto-Slavic, Germanic, and Italo-Celtic accompanied expanding West Yamna/East Bell Beakers into Europe, and then their speakers – like the rest of peoples everywhere in Europe – admixed later in the different regions.
B) Frederik Kortlandt and other Indo-Slavicists. The ‘original’ Balto-Slavic would have spread with Srubna (and likely Potapovka before it), as a product of the admixture of East Yamna’s Indo-Slavonic with incoming Corded Ware migrants (this would correspond to my description of Indo-Iranian). ‘True’ Balto-Slavic speakers would have then absorbed the Temematic-speaking migrants (equivalent to early Balto-Slavic migrants as described in the demic diffusion model) spreading from the west, most likely in the steppe. Later developments from the steppe would have then brought Baltic to the north, and Slavic to the west.
Therefore, in both cases the language spoken by early R1a-Z645 lineages in Únětice or Mierzanowice/Nitra EBA cultures would have been an eastern North-West Indo-European dialect associated with expanding Bell Beakers, and closely related to Germanic and Italo-Celtic. In the second case, the ancient samples we see genetically closer to modern West Slavs could thus be identified with those speaking the Temematic substrate absorbed later by Balto-Slavic, or maybe by Balts migrating northward, and Slavs spreading west- and southward.
NOTE. In any case, we know that R1a-Z645 subclades resurged in Central-East Europe after the expansion of Bell Beakers, potentially showing an ancient link with the prevalent R1a subclades in the region today. We know that some ancient Central European populations cluster near modern West Slavs, but in other interesting regions (like the British Isles, Central Europe, Scandinavia, or Iberia) we also see close clusters, and nevertheless observe historically documented radical ethnolinguistic changes, as well as many different subsequent genetic inflows and founder effects, that have significantly altered the anthropological picture in these regions, so it could very well be that the lineages we find in ancient samples do not correspond to modern West Slavic lineages, or even similar ancient and modern lineages could show a radical cultural discontinuity (as is likely the case in this to-and-from-the-steppe migration scheme).
Since we are going to see signs of both – west and east admixture – in early Slavic communities near the steppe, and the distribution from South, West, and East Slavs will include a wide “cloud” connecting Central, East, and South-East Europe, as it is evident already from early Germanic samples, it may be interesting to shift our attention to the Tollense valley and Lusatian samples, and their predominant Y-DNA haplogroups. Once again, tracking male-driven migrations from Central Europe to the Baltic region and the steppe, and back again to much of Central and South Europe, will determine which groups expanded this eastern NWIE dialect initially and in later times.
Since Baltic and Slavic languages are attested quite late, genetics is likely to help us select among the different available models for Balto-Slavic, although (it is worth repeating it) these lineages may not be the same that later expanded each dialect.
NOTE. Bronze and Iron Age samples might begin to depict the true Balto-Slavic migration map. Apart from the strong differences in the satemization processes seen among Baltic, Slavic, and Indo-Iranian, from an archaeological point of view the geographic location of the earliest attested Baltic languages and the prehistoric developments of the region seem to me almost incompatible with a homeland in the steppe. Anyway, in the worst-case scenario – for those of us who work with Balto-Slavic to reconstruct North-West Indo-European – there is consensus that there must an eastern North-West Indo-European language (which some would call Temematic), whose common traits with Germanic and Italo-Celtic we use to reconstruct their parent language. The question remains thus mostly theoretical, of limited pragmatic use for the reconstruction.
The third way: Baltic Late Neolithic
I have referred to Kristiansen and his group‘s position regarding Corded Ware as Indo-European as flawed before. While their latest interpretation (and language identification) was wrong, Kristiansen’s original idea of long-lasting contacts in the Dnieper-Dniester region with the area occupied by late Trypillia developing a Proto-Corded Ware culture was probably right, as we are seeing now.
New data in Mittnik et al. 2018 show some interesting early Late Neolithic samples from the Baltic region – Zvejnieki, Gyvakarai1 (R1a-Z645) and Plinkaigalis242 – , proving what I predicted: that elevated steppe ancestry and R1a-Z645 subclades would be found in the Dnieper-Dniester region unrelated to the Yamna expansion, and, it seems, to migrants of the Corded Ware A-horizon.
Funnily enough, this shows that there were probably ancient interactions in the region, as originally asserted by Kristiansen, and probably following some of Victor Klochko‘s proposed exchange paths, but earlier than predicted by him.
Funny also how Anthony, too – like Kristiansen – , may have been right all along since 2007, in proposing that Corded Ware (the nuclear Corded Ware migrants) stemmed from the Dnieper-Dniester region roughly at the same time as Yamna migrants expanded west, and that they did not have any direct genetic connection (in terms of migrations) with each other.
Both researchers, who collaborated with the latest genomic research, remade their models, and have to revise now their most recent proposals with the new data, influencing each new paper published with their pressure to be right in their previous models, and with new genomic data compelling them to change their theories under the pressure not to be too wrong again, in this strange vicious circle. Had they remained silent and committed to their archaeological theories, they could have been right all along, each one in their own way.
NOTE. BTW, in case you see ad hominem here too, I feel compelled to say that only thanks to their commitment to disentangle the truth about ancient migrations, and their readiness to collaborate with genetic research – unlike many others in their field – we know today what we know. If they have been wrong many times, it is because they have tried to connect the genetic dots as they were told. Only because of their readiness to explore their science further they should be praised by all. But, again, that does not mean that they cannot be wrong in their models…
Thanks to Anthony’s latest change of mind, we don’t have to hear the “cultural diffusion” argument anymore, and I consider this a great advance for the field.
NOTE. Not that there could not be prehistoric cultural diffusion events of language (i.e. not accompanied by genetic admixture), of course, but such theories, almost impossible to disprove, probably need much more than a simple “patron-client relationship” proposal and anthropometry to justify them, in a time when we will be able to see almost every meaningful personal exchange in Genomics…
Today – since the finding of Ukraine_Eneolithic sample I6561, of haplogroup R1a-Z93, dated ca. 4200 BC, and likely from the Sredni Stog culture – it seems more likely than ever that the expansion of R1a-Z645 subclades was in fact associated with the spread of steppe admixture probably near the North Pontic forest-steppe region, most likely from the Dnieper-Dniester or Upper Dniester region.
The appearance of a ‘late’ Z93 subclade already at such an early date, with steppe admixture, makes it still more likely that the Proto-Corded Ware culture, from where Corded Ware migrants of R1a-Z645 lineages later spread, was probably associated with this wide region.
NOTE. A migration of Yamna settlers northward along the Prut dated ca. 3000 BC or later could have justified the appearance of steppe admixture in the Dnieper-Dniester region, as I proposed for the Zvejnieki sample, although dates from Baltic samples are likely too early for that. For this to be corroborated, migrants should be accompanied up to a certain region by R1b-L23 lineages, and this could mean in turn a revival of Anthony’s original model of cultural diffusion of 2007. The most likely scenario, however, as predicted by Heyd, given the early appearance of steppe admixture and R1a-Z93 subclades in the forest-steppe during the 5th millennium, is that the admixture happened much earlier than that, fully unrelated to Late PIE migrations.
The modern Baltic and Slavic conundrum
As for some people of Northern European ancestry previously supporting a bulletproof Yamna (R1a/R1b) -> Corded Ware migration that was obviously wrong; now supporting different Sredni Stog -> Corded Ware groups representing Indo-Slavonic (and Germanic??) in a model that is clearly wrong: how are these attempts different from Western Europeans supporting the autochthonous continuity of R1b-P312 lineages against all recent data, from Indians supporting the autochthonous continuity of R1a-M417 lineages no matter what, and from the more recent trend of autochthonous continuity theories for N1c lineages and Uralic in Eastern Europe?
Modern Germanic-speaking peoples can trace their common language to Nordic Iron Age Proto-Germanic, Celts to La Tène’s expansion of Proto-Celtic, and Romance speakers to the Roman expansion (and to an earlier Proto-Italic), all three dating approximately to the Iron Age. Proto-Slavic is dated much later than that, and probably Proto-Baltic too (or maybe earlier depending on the dialectal proposal), with Balto-Slavic being possibly coeval with Pre-Proto-Germanic and Italo-Celtic, but probably slightly later than that. Also, the language ancestral to Slavic may be (like a theoretical Proto-Romance language) impossible to reconstruct with precision, due to multiple substrate (or superstrate?) influences on the wide territory where Proto-Slavic formed and expanded from, in close alliance with steppe communities of different ethnolinguistic backgrounds.
We know that proto-historic Germanic, Celtic, and Italic peoples spread from relatively small regions, and had almost nothing to do with historic groups speaking their daughter languages, let alone modern speakers. Baltic and Slavic are not different.
NOTE. We have read that Weltzin samples clustered closely to Central Europeans (especially Austrians), and at a certain distance from modern Poles. That’s the conclusion of Sell’s PhD thesis, and it may be right, if you take only modern samples for comparison. However, if you have read or thought that they represented some kind of “ancestral Germanic vs. Slavic” battle, please imagine Trump’s voice for my opinion: Wrroonng, wrroonng, wrroonng. They cluster closely with Bell Beaker migrants, Poland BA, and Únětice (in this order), which we now know thanks to the data from O&M 2018 and Mittnik et al. 2018. And we also know who they don’t cluster close too: Corded Ware and Trzciniec samples. Therefore, people from the region near the most likely homelands of Pre-Proto-Germanic and Proto-Balto-Slavic are – as expected – likely descendants from Bell Beaker migrants in Central Europe. The genetic relationship of those ancient samples to modern inhabitants of Central-East Europe? Not obvious – at all.
We also know (and have known for a long time, well before these recent papers) that the oldest attested Indo-European languages – Mycenaean, early Anatolian languages, and Indo-Aryan (through certain words in Mitanni inscriptions) – do not show continuity from the places where they were first attested to the Late and Middle Proto-Indo-European (steppe) homeland either. There should be no problem then in accepting that there is no linguistic, archaeological, or common sense reason to support that Balto-Slavic is older or shows more regional continuity than other IE languages from Europe.
NOTE. Oh yes, Balts saying “Baltic is the most similar language to PIE” I hear you thinking? Uh-huh, sure. And according to some Greeks (supported e.g. by the conclusions from Lazaridis et al. 2017) Mycenaeans were ‘autochthonous’, and Proto-Greek the most similar to PIE. For many Hindus, Vedic Sanskrit is in fact PIE), and the latest paper by Narasimhan et al. (2018) only reinforces this idea (don’t ask me why). Also, Caucasian scholar Gamkrelidze (with Ivanov) supported the origin of the language precisely in the Caucasus, with Armenian being thus the purest language. For Italians fans of Virgil and the Roman Empire, Latin (like Aeneas) comes from Anatolian linguistically and genetically, hence it must be the ‘oldest’ IE dialect alive… No, wait, Danish scholars Kroonen and Iversen quite recently asserted that Germanic is the oldest to branch off, then it should thus be nearest to PIE! I think you can see a pattern here…And don’t forget about the new Vasconic-Uralic hypotheses going on now, with Vasconic fans of R1b changing from Palaeolithic to Mesolithic, and now to European Neolithic and whatnot, or Uralic fans of N1c changing now from Mesolithic EHG to Siberia (for ancestry) or Central Asia (for N1c subclades), or whatever is necessary to believe in ‘continuity’ of their people following the newest genetic papers… Just pick whatever theory you want, call it “mainstream”, and that’s it.
So, if there is no reliable archaeological model connecting Bronze or Iron Age cultures to Eastern European cultures which are supposed to represent the Proto-Slavic and Proto-Baltic homelands…why on earth would any reasonable amateur (not to speak about scholars) dare propose any sort of genetic or linguistic continuity for thousands of years from PIE to early Slavs, a people whose first blurry appearance in historical records happened during the Middle Ages in rather turbulent and genetically admixed regions? It does not make any sense, and it had all odds against it. Blond hair, blue eyes, lactase persistence? Sure, and ABO group, brachycephaly, anthropometry… All very scientifish.
Human ancestry can only help refinesolid academic theories, it cannot create one. Every new pet theory used to satisfy modern cultural pre- and misconceptions has failed, and it will fail again, and again, and again…
To have an own anthropological model of prehistoric migration requires time and study. It is not enough to play with software and to misuse traditional academic disciplines just to ‘prove’ some completely irrelevant, meaningless, and false continuity.
Simultaneous collective burials appear quite regularly in early medieval linear cemeteries. Despite their relatively regular occurrence, they are seen as extraordinary as the interred individuals’ right to be buried in a single grave was ignored for certain reasons. Here, we present a study examining the possible familial relationship of early medieval individuals buried in this way by using aDNA analysis of mitochondrial HVR-I, Y-STRs, and autosomal miniSTRs. We can show that biological relatedness may have been an additional reason for breaking the usual burial custom besides a common cause of death, such as the Plague, which is a precondition for a simultaneous burial. Finally, with our sample set, we also see that signs of interaction between individuals such as holding hands which are often interpreted by archeologists as signs of biological or social relatedness, do not always reflect true genetic kin relationships.
Most of the burials studied are from the mid-6th and early 7th century, and all are from collective burials:
Of the simultaneous burials nine graves are proven or potential (due to contemporaneity) Plague burials (Feldman et al., 2016; Harbeck et al., 2013) and one grave is attributed to interpersonal violence against the background of the early medieval feud system (Schneider, 2008). The remaining simultaneous and the two successive burials did not reveal hints on their individuals’ cause of death.
The distribution of lineages includes R1b, R1a, and I (one family each) in Altenerding-Klettham, and T, R1b, and R1a (two families) in Aschheim-Bajuwarenring.
There were, for example:
A father and son R1a in a “warrior grave”:
Showing traces of perimortal sharp traumata (AE 888), both men seem to have died in succession of a physical conflict (Sage, 1984). It must remain open, whether this conflict was executed as a blood vengeance in connection with the medieval feud system (Schneider, 2008; Steuer, 2008) or any other kind of interpersonal violence. Attacks and interpersonal violence are also often believed to be a precondition for individuals being buried together.
It has been assumed that burials of several men with weaponry, so-called “warrior graves”, are burials which reflect the early medieval feud system (Schneider, 2008; Steuer, 2008) in the very sophisticated but implausible assumption, that women and children might have been spared in those conflicts. While feuds were actually struggles between familiae, friends and servants of a particular family could be also involved, which would explain the deposition of nonrelated individuals in such burials.
Two children, half-siblings, one of haplogroup R1b, in a shared coffin.
A non-genetic family of an elderly man of haplogroup I and a child being protected:
The early medieval concept of familia not only comprised the (biological) nuclear family and individuals certainly entered a family clan by marriage. This leaves room for any possible social (i.e. non-genetic) relation that may have allowed these two individuals to be buried in a common grave.
It is tempting for me to hail the mixed genetic pool among late Germanic tribes found in recent genetic studies, as I have done for Proto-Balto-Slavic territory and Iberia.
It is indeed possible that the mostly R1b-L11 and I1 subclades seen in late medieval West Germanic-speaking populations (and in modern West Germanic speakers) are in fact the result of later internal migratory flows and founder effects.
However, Bavarians – like the recently studied Lombards (with a predominance of R1b and I lineages), and especially Goths (apparently showing ‘eastern’ ancestry) – occupied territories of mixed ‘Barbarian’ populations after the invasion of the Huns and their allies, and settled near Slavs and Avars.
EDIT (18 MAR 2018). We should add here for this southern Germanic territory the Merovingian burials (ca. 7th c.) from Ergolding, with 3 samples of haplogroup R1b, and 2 samples of G2a, published by Vanek, Saskova, & Koch (2009).
Earlier, expanding Proto-Germanic tribes may not show this variable admixture and haplogroups we are seeing right now, though.
Текст «Слова о полку Игореве» (далее «Слово») дошел до нас в двух неточных (отредактированных) копиях со списка нач. XVI в. и нескольких выписках из него. Наслоения, привнесенные переписчиком нач. XVI в. (или несколькими переписчиками) – редактура в русле 2 го южнославянского влияния и поздние диалектизмы – непоследовательны (§9.3.1) и не настолько исказили стихотворный текст рубежа XII–XIII вв., чтобы сделать невозможной его реконструкцию. «Слово» по своему жанру (светская поэзия) не принадлежит к текстам, которые по многу раз переписывались в монастырских скрипториях. Поэтому не исключено, что рукопись нач. XVI в. является хотя и небрежной, но первой по счету копией древнерусского оригинала.
«Слово» могло звучать приблизительно так, как я предлагаю в своей реконструкции, морфология и акцентология языка его автора могли быть устроены так, как я предполагаю, и оно могло быть создано в реконструируемой мною системе стихосложения. Однако в действительности многое могло быть устроено иначе. Реконструкция акцентологической системы и две другие гипотезы (о неравносложной силлаботонике и об опциональном прояснении слабых редуцированных) замкнуты друг на друге и образуют circulus in probando. Реконструируемая для «Слова» акцентологическая система выводится из праславянской реконструкции и подтверждается данными современных диалектов, однако она не засвидетельствована в древнерусских памятниках. Слабым местом моей реконструкции является прояснение слабых редуцированных в позициях, где оно нужно исключительно из метрических соображений. В работе, подобной этой, невозможно избежать домыслов и рискованных допущений, ряд выдвинутых гипотез находится «на грани фола», однако в целом моя реконструкция построена на фактах и их интерпретациях, являясь таким образом научным исследованием. В работе используютмя результаты смежных наук ‒ в первую очередь стиховедения. Представленная в настоящей книге реконструкция «Слова» является первым опытом системного моделирования стихотворного текста на гипотетическом древнерусском диалекте XII‒XIII в., существование которого весьма вероятно. Мне хотелось бы надеяться, что моя работа внесет свою скромную лепту в изучение великого памятника древнерусской литературы.
The Tale of Igor’s Campaign is probably the oldest Slavic epic available, recorded later than what oral tradition and linguistic details reflect, like the oldest Indo-Iranian texts. It contains many details interesting for Proto-Slavic (and North-West Indo-European) language and culture reconstruction.
Although the exact origins and migratory patterns of R1a and R1b are still under rigorous investigation, it seems that they are linked to Bronze Age migrations from the Western Eurasian Steppe and Eastern Europe into Southern (including Greece) and Western Europe. Apparently, such migrations (especially as regards R1a) into Cyprus were limited.
Additionally, the Greek population has received considerable migrations during the Byzantine era and the Middle Ages from other Balkanic populations, such as Slavs[62,63], Aromanians (Vlachs), and Albanians (Arvanites)[65,66]. The former, is very likely to have increased R1a frequencies among Greeks. In fact, Fig 3 (also S7 Table) indicate that R1a increases gradually with increasing latitude in Greece. There is no historical evidence for such migrations into Cyprus during the same period.
The only Greek sub-population showing close genetic proximity to Cypriots (in terms of Y-haplogroup composition) is Cretan Greeks (Figs 3 and 4). It could be speculated that Cypriots and Cretans experienced very similar migratory events over the centuries, which were characterized by high influx from populations rich in haplogroups J2a and G2, and moderate in R1b, while very limited influx from populations rich in haplogroups R1a and I (Eastern and Northern/ Central Europe), as well as from populations rich in J1 (Middle East) and E-M81 (North Africa).
If R1b-M269 lineages are linked – as I have proposed – to Yamna migrations, and especially R1b-Z2103 to Palaeo-Balkan migrations, whereas R1a-M417 is to be linked to Corded Ware migrations, the reason for this latitude-dependent (and also longitude-dependent) presence of R1a-M417 subclades in Greece and is probably linked to the expansion of Slavic R1a-Z282 lineages to the north and west of Greece (and from there through intermarriages and migrations within Greece into other regions), and Iranian R1a-Z93 lineages to the east traditional Greek territory, into Asia Minor. The expansion of Balkan peoples (including Slavs, Albanian, and Aromanian peoples) might have brought with them R1b-M269*, I2, or R1a-Z282 subclades.