A Game of Thrones in Indo-European: proto-languages in Westeros and Essos, and population genomics


I think proto-languages can be applied to basically any appropriate prehistoric setting, and especially to science fiction and fantasy settings. I often viewed the lack of interest in them as based on the idea that they are not fantastic enough, that they would render a fantastic world too realistic to allow for an adequate immersion of the reader (or viewer) into a new world.

With time, I have become more and more convinced that most authors don’t use proto-languages (or tweaked versions of them) simply because they can’t, and resort to the easier way: inventing some rules and words based on some basic ideas and sounds they feel would fit a certain culture or people, to get going. After all, world-building is about a good enough, not too detailed description, and books are about characters and settings, not worlds.

After the end of the 7th season of the Game of Thrones TV series, of which I have become a great fan, I had some season finale grief to deal with, so I thought about applying what we know about Proto-Indo-Europeans to the fantasy world. Since all book translations deal with English names as if they were translations of the Common Tongue (e.g. Spanish “Invernalia” or “Poniente” for “Winterfell” or “Westeros”), the idea of a translation into Proto-Indo-European seemed quite interesting.

NOTE. I understand that, for some, the idea that “the original language is the best” would make them reject this. However, just take into account the millions who enjoy the books and the TV series only in their native language, and know nothing about the ‘original’ version…

Here are the text and images:

A Dance with Old Tongues

As you can see, the idea of the Common Tongue being Late Proto-Indo-European brings about a whole new (infinite) world of dialectal evolution, language contacts, and population expansions which must be established for the whole setting to work. This is what the text I began to write was about: to use languages (and related populations) of ca. 6000-1500 BC, and to avoid anachronisms and impossible language relationships.

As an added advantage, fans of role-playing games could expand their world with the use of the language correspondences and the maps. This way, instead of “Northern English” being spoken in the North, and “Spanish English” being spoken in Dorne, according to some selections that have been naturally criticized, you have ancient languages that fit with the ancient setting, and which were actually related to each other.

Equivalence of languages of the known world with coeval proto-languages. Solid red lines divide Graeco-Aryan from Northern Indo-European dialects (Tocharian is separated from North-West Indo-European by a dotted red line). See all maps.

I also began drawing a fantasy map, my first one (even though I have been a member of Cartographer’s Guild for years), which eventually helped me with my updates of maps of prehistoric migrations, and even with the use of arrows and colors for scientific publications. I drew details mainly to illustrate the text, not to offer a comprehensive translated world. Most of the work was done in the summer of 2017, with some map changes made in 2018 with the help of fans’ maps and works.

NOTE. I have reviewed it during some long travels lately, and included names of “bloodlines” (i.e. haplogroups), which I now find more interesting for helping people understand bottlenecks during prehistoric migrations; I have also added a map using pie charts. If this doesn’t fit well with the whole picture, it’s because it’s a recent addition. The rest is more or less the same as one or two years ago.

I don’t have time now to correct much of what I wrote. I have forgotten most of the relevant details from the books, especially The World of Ice & Fire, which I think helped me a lot with this, and I am sure that after writing A Song of Sheep and Horses (now you know the reason for the book names) I would deal with some language identifications and cognates differently.

I decided to publish it to liven up our Facebook page of Modern Indo-European now that the 8th season is near, so that people can participate and try to translate (translatable) names and expressions into Proto-Indo-European, to see how it would work out. You can also request access to our Modern Indo-European and Proto-Indo-European groups; both are administered mainly by Fernando.

If you think this whole idea is crazy, or a huge waste of time, I agree; this is how you waste your time when you like fantasy, comic books, etc. But I am a great fan of fantasy and fiction, and I had a lot of free time back then, so I couldn’t help it…

On the other hand, if you feel that mixing fantasy (or SF) with the Proto-Indo-European question (especially population genomics) is a bad idea, I may have agreed with that two years ago, and maybe this is the reason why I hesitated to publish it then.

However, today we can read a whole new (2018 and 2019) bunch of “steppe ancestry=Indo-European” fantasies: invisible Nganasan reindeer hordes, a Fearsome Tisza River where Yamna settlers mysteriously disappear, shapeshifting Dutch CWC peoples who change haplogroups, languages dependent on cephalic types, or Yamna/Bell Beaker expanding Vasconic… So what’s the matter with some more fantasy?

Happy new year 2019…and enjoy our new books!


Sorry for the last weeks of silence; I have been rather busy lately. I have more projects going on, and (because of that) I also wanted to finish a project I have been working on for many months already.

I have therefore decided to publish a provisional version of the text, in the hope that it will be useful in the following months, when I won’t be able to update it as often as I would like to:

EDIT (20 JAN 2019): For those of you who are more comfortable reading in your native language, I have placed some links to automatic translations by Google Translate. They might work especially well for the texts of A Game of Clans & A Clash of Chiefs.

Don’t forget to check out the maps included in the supplementary materials: I have added Y-DNA, mtDNA, and ADMIXTURE data using GIS software. The PCA graphics are also important to follow the main text.

NOTE. The files are hosted on my server. I have also uploaded them to Academia.edu and ResearchGate, in case the website is too slow.

I would have preferred to wait for a thorough revision of the section on archaeology and the linguistic sections on Uralic, but I doubt I will have time when the reviews come, so it was either now or maybe next December…

I say so in the introduction, but it is evident that certain aspects of the book are tentative to say the least: the farther back we go from Late Proto-Indo-European, the less clear are many aspects. Also, linguistically I am not convinced about Eurasiatic or Nostratic, although they do have a certain interest when we try to offer a comprehensive view of the past, including ethnolinguistic identities.

I cannot be an expert in everything, and these books cover a lot. I am bound to publish many corrections as new information appears and more reviews are sent. For example, just days ago (before SNP calls of Wang et al. 2018 were published) some paragraphs implied that AME might have expanded Nostratic from the Middle East. Now it does not seem so, and I changed them just before uploading the text. That’s how tentative certain routes are, and how much all of this may change. And that only if we accept a Nostratic phylum…

NOTE. Since the first book I wrote was the linguistic one, and I have spent the last months updating the archaeology + genetics part, now many of you will probably understand 1) why I am so convinced about certain language relationships and 2) how I used many posts to clarify certain ideas and receive comments. Many posts probably offer a good timeline of what I worked with, and when.

Acknowledgements

I did not add this section to the books, because they are still not ready for print, but I think this is due somewhere now. It is impossible to reference all who have directly or indirectly contributed to this, so this is a list of those I feel have played an important role.

I am indebted to the following people (which does not mean that they share my views, obviously):

First and foremost, to Fernando López-Menchero, for having the patience to review in detail many parts on Indo-European linguistics, knowing that I won’t accept many of his comments anyway. The additional information he offers is invaluable, but I didn’t want to turn this into a huge linguistic encyclopaedia with unending discussions of tiny details of each reconstructed word. I think it is already too big as it is.

I would not have thought about doing this if it were not for the interest of Wekwos (Xavier Delamarre) in publishing a full book about the Indo-European demic diffusion model (in the second half of 2017, I think). It was they who suggested that I extend the content, when all I had done until then was write an essay and draw some maps in my free time between depositing the PhD thesis and defending it.

Sadly, as much as I would like to publish a book with a professional publisher, I don’t think ancient DNA lends itself to the traditional format, so my requests (mainly to have free licenses and to be able to revise the text at will, as new genetic papers are published) were logically not acceptable. Also, the main aim of all volumes, especially the linguistic one, is the teaching of essentials of Late Proto-Indo-European and related languages, and this objective would be thwarted by selling each volume for $50-70 and only in printed format. I prefer a wider distribution.

At first I didn’t think much of this proposal, because I do not benefit from this kind of publication in my scientific field, but with time my interest in writing a whole, comprehensive book on the subject grew to the point where it was already an ongoing project, probably by the start of 2018.

I would not have been in contact with Wekwos if it were not for user Camulogène Rix at Anthrogenica, so thanks for that and for the interest in this work.

I would not have thought of writing this either if not for the spontaneous support (with an unexpected phone call!) of a professor of the Complutense University of Madrid, Ángel Gómez Moreno, who is interested in this subject – as is his wife, a professor of Classics more closely associated with Indo-European studies, who helped me with a search for Indo-Europeanists.

EDIT (1 JAN 2019): I remembered that Karin Bojs sent me her book after reading the demic diffusion model. I may have also thought about writing a whole book back then, but mid-2017 is probably too early for the project.

Professor Kortlandt is still to review the text, but he contributed to both previous essays in some very interesting ways, so I hope he can help me improve the parts on Uralic, and maybe alternative accounts of expansion for Balto-Slavic, depending on the time depth that he would consider warranted according to the Temematic hypothesis.

The maps are evidently (for those who are interested in genetics) in part the result of the effort of the late Jean Manco: as you can see from the maps including Y-DNA and mtDNA samples, I have benefitted from her way of organising data and publishing it. Similarly, the work of Iain McDonald in assessing the potential migration routes of R1b and R1a in Europe with the help of detailed maps was behind my idea for the first maps, and consequently behind these, too.

I should thank all the people responsible for the release of free datasets to work with, including the Reich and Jena labs, the Veeramah Lab, and also researchers from the Max Planck Institute or the Mainz Palaeogenetics group, who didn’t mind sharing datasets with me.

Readers of this blog with interesting comments have also been essential for the improvement of the texts. You can probably see some of your many contributions there. I may not answer many comments, because I am always busy (and sometimes I just don’t have anything interesting to say), but I try to read all of them.

EDIT (1 JAN 2019) I think I should mention at least Chetan, Egg, or Robert George; but then I would leave out old europe, Sgr Ganesh, or Tileman Ehlen; and if I include them I would leave out others…

I am also indebted to users of other sites, like Anthrogenica, whose particular points of view and deep knowledge of some very specific aspects are sometimes very useful. In particular, user Anglesqueville helped me to fix some issues with the merging of datasets to obtain the PCAs and ADMIXTURE, and prepared some individual samples to merge them.

Even without posting anything, Google Analytics keeps sending me messages about increasing user fidelity (returning users), and stats haven’t really changed (which probably means more people are reading old posts), so thank you for that.

I hope you enjoy the books.

Happy new year!

Migrations in the Levant region during the Chalcolithic, also marked by distinct Y-DNA


Open access Ancient DNA from Chalcolithic Israel reveals the role of population mixture in cultural transformation, by Harney et al. Nature Communications (2018).

Interesting excerpts (emphasis mine, reference numbers deleted for clarity):

Introduction

The material culture of the Late Chalcolithic period in the southern Levant contrasts qualitatively with that of earlier and later periods in the same region. The Late Chalcolithic in the Levant is characterized by increases in the density of settlements, introduction of sanctuaries, utilization of ossuaries in secondary burials, and expansion of public ritual practices as well as an efflorescence of symbolic motifs sculpted and painted on artifacts made of pottery, basalt, copper, and ivory. The period’s impressive metal artifacts, which reflect the first known use of the “lost wax” technique for casting of copper, attest to the extraordinary technical skill of the people of this period.

The distinctive cultural characteristics of the Late Chalcolithic period in the Levant (often related to the Ghassulian culture, although this term is not in practice applied to the Galilee region where this study is based) have few stylistic links to the earlier or later material cultures of the region, which has led to extensive debate about the origins of the people who made this material culture. One hypothesis is that the Chalcolithic culture in the region was spread in part by immigrants from the north (i.e., northern Mesopotamia), based on similarities in artistic designs. Others have suggested that the local populations of the Levant were entirely responsible for developing this culture, and that any similarities to material cultures to the north are due to borrowing of ideas and not to movements of people.

Previous genome-wide ancient DNA studies from the Near East have revealed that at the time when agriculture developed, populations from Anatolia, Iran, and the Levant were approximately as genetically differentiated from each other as present-day Europeans and East Asians are today. By the Bronze Age, however, expansion of different Near Eastern agriculturalist populations – Anatolian, Iranian, and Levantine – in all directions and admixture with each other substantially homogenized populations across the region, thereby contributing to the relatively low genetic differentiation that prevails today. [Lazaridis et al.] showed that the Levant Bronze Age population from the site of ‘Ain Ghazal, Jordan (2490–2300 BCE) could be fit statistically as a mixture of around 56% ancestry from a group related to Levantine Pre-Pottery Neolithic agriculturalists (represented by ancient DNA from Motza, Israel and ‘Ain Ghazal, Jordan; 8300–6700 BCE) and 44% related to populations of the Iranian Chalcolithic (Seh Gabi, Iran; 4680–3662 calBCE). [Haber et al.] suggested that the Canaanite Levant Bronze Age population from the site of Sidon, Lebanon (~1700 BCE) could be modeled as a mixture of the same two groups albeit in different proportions (48% Levant Neolithic-related and 52% Iran Chalcolithic-related). However, the Neolithic and Bronze Age sites analyzed so far in the Levant are separated in time by more than three thousand years, making the study of samples that fill in this gap, such as those from Peqi’in, of critical importance.

This procedure produced genome-wide data from 22 ancient individuals from Peqi’in Cave (4500–3900 calBCE) (…)

Discussion

We find that the individuals buried in Peqi’in Cave represent a relatively genetically homogenous population. This homogeneity is evident not only in the genome-wide analyses but also in the fact that most of the male individuals (nine out of ten) belong to the Y-chromosome haplogroup T, a lineage thought to have diversified in the Near East. This finding contrasts with both earlier (Neolithic and Epipaleolithic) Levantine populations, which were dominated by haplogroup E, and later Bronze Age individuals, all of whom belonged to haplogroup J.

Detailed sample background data for each of the 22 samples from which we successfully obtained ancient DNA. Additionally, background information for all samples from Peqi’in that were screened is included in Supplementary Data 1. *Indicates that Y-chromosome haplogroup call should be interpreted with caution, due to low coverage data.

Our finding that the Levant_ChL population can be well-modeled as a three-way admixture between Levant_N (57%), Anatolia_N (26%), and Iran_ChL (17%), while the Levant_BA_South can be modeled as a mixture of Levant_N (58%) and Iran_ChL (42%), but has little if any additional Anatolia_N-related ancestry, can only be explained by multiple episodes of population movement. The presence of Iran_ChL-related ancestry in both populations – but not in the earlier Levant_N – suggests a history of spread into the Levant of peoples related to Iranian agriculturalists, which must have occurred at least by the time of the Chalcolithic. The Anatolian_N component present in the Levant_ChL but not in the Levant_BA_South sample suggests that there was also a separate spread of Anatolian-related people into the region. The Levant_BA_South population may thus represent a remnant of a population that formed after an initial spread of Iran_ChL-related ancestry into the Levant that was not affected by the spread of an Anatolia_N-related population, or perhaps a reintroduction of a population without Anatolia_N-related ancestry to the region. We additionally find that the Levant_ChL population does not serve as a likely source of the Levantine-related ancestry in present-day East African populations.
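NOTE. The argument in the excerpt above can be checked with simple arithmetic. What follows is only a rough sketch using the point estimates quoted there, not the method used in the paper: it asks what Levant_BA_South would look like if it were just Levant_ChL plus extra Iran_ChL-related ancestry. Setting the "little if any" Anatolia_N share of Levant_BA_South to zero is my assumption, and the function names are made up for the illustration.

```python
# Point estimates quoted in the excerpt above. "Little if any" Anatolia_N
# in Levant_BA_South is set to 0.0 here, which is an assumption.
LEVANT_CHL = {"Levant_N": 0.57, "Anatolia_N": 0.26, "Iran_ChL": 0.17}
LEVANT_BA_SOUTH = {"Levant_N": 0.58, "Anatolia_N": 0.00, "Iran_ChL": 0.42}
IRAN_CHL = {"Levant_N": 0.00, "Anatolia_N": 0.00, "Iran_ChL": 1.00}


def mix(pop_a, pop_b, share_a):
    """Ancestry of a population made of share_a of pop_a and the rest pop_b."""
    return {k: share_a * pop_a[k] + (1 - share_a) * pop_b[k] for k in pop_a}


def share_needed(pop_a, pop_b, component, target):
    """Share of pop_a for which the mixture reaches `target` in one component."""
    return (target - pop_b[component]) / (pop_a[component] - pop_b[component])


# How much Levant_ChL would be left if extra Iran_ChL-related ancestry alone
# had raised the Iran_ChL component from 17% to the 42% of Levant_BA_South?
share_chl = share_needed(
    LEVANT_CHL, IRAN_CHL, "Iran_ChL", LEVANT_BA_SOUTH["Iran_ChL"]
)
predicted = mix(LEVANT_CHL, IRAN_CHL, share_chl)

for component, value in predicted.items():
    reported = LEVANT_BA_SOUTH[component]
    print(f"{component}: predicted {value:.2f}, reported {reported:.2f}")
```

Under these assumptions, such a descendant population would keep roughly 18% Anatolia_N and only about 40% Levant_N, against 58% Levant_N and close to no Anatolia_N reported for Levant_BA_South. A single extra pulse of Iran_ChL-related ancestry into Levant_ChL therefore does not reproduce the Bronze Age sample, which is consistent with the excerpt's conclusion that several population movements are needed.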

These genetic results have striking correlates to material culture changes in the archaeological record. The archaeological finds at Peqi’in Cave share distinctive characteristics with other Chalcolithic sites, both to the north and south, including secondary burial in ossuaries with iconographic and geometric designs. It has been suggested that some Late Chalcolithic burial customs, artifacts and motifs may have had their origin in earlier Neolithic traditions in Anatolia and northern Mesopotamia. Some of the artistic expressions have been related to finds and ideas and to later religious concepts such as the gods Inanna and Dumuzi from these more northern regions. The knowledge and resources required to produce metallurgical artifacts in the Levant have also been hypothesized to come from the north.

Our finding of genetic discontinuity between the Chalcolithic and Early Bronze Age periods also resonates with aspects of the archeological record marked by dramatic changes in settlement patterns, large-scale abandonment of sites, many fewer items with symbolic meaning, and shifts in burial practices, including the disappearance of secondary burial in ossuaries. This supports the view that profound cultural upheaval, leading to the extinction of populations, was associated with the collapse of the Chalcolithic culture in this region.

Genetic structure of analyzed individuals. a Principal component analysis of 984 present-day West Eurasians (shown in gray) with 306 ancient samples projected onto the first two principal component axes and labeled by culture. b ADMIXTURE analysis of 984 present-day West Eurasians and 306 ancient samples with K = 11 ancestral components. Only ancient samples are shown.

Comments

I think the most interesting aspect of this paper is – as usual – the expansion of peoples associated with a single Y-DNA haplogroup. Given that the expansion of Semitic languages in the Middle East – like that of Anatolian languages from the north – must have happened after ca. 3100 BC, coinciding with the collapse of the Uruk period, these Chalcolithic north Levant peoples are probably not related to the later Semitic expansion in the region. This can be said to be supported by their lack of relationship with later Levantine migrations into Africa. The replacement of haplogroup E before the arrival of haplogroup J suggests still more clearly that Natufians and their main haplogroup were not related to the Afroasiatic expansions.

Distribution of Semitic languages. From Wikipedia.

On the other hand, while their ancestry points to neighbouring regional origins, their haplogroup T1a1a (probably T1a1a1b2) may be closely related to that of other Semitic peoples to the south, as found in east Africa and Arabia. This may be due either to a northern migration of these Chalcolithic Levantine peoples from southern regions in the 5th millennium BC, or maybe to a later migration of Semitic peoples from the Levant to the south, coupled with the expansion of this haplogroup, but associated with a distinct population. As we know, ancestry can change within a few generations of intense admixture, while Y-DNA haplogroups are not commonly admixed in prehistoric population expansions.
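NOTE. To see how quickly this can happen, here is a toy calculation. It is only a sketch under a deliberately extreme assumption of mine: in every generation the men of an incoming paternal line have children with women of entirely local ancestry. The function name and its parameter are made up for the example.

```python
def founder_autosomal_share(generations, local_share_per_generation=0.5):
    """Expected share of the founders' autosomal ancestry in a paternal line.

    Toy assumption: in each generation, `local_share_per_generation` of the
    child's genome comes from a parent with no founder ancestry at all.
    """
    return (1 - local_share_per_generation) ** generations


for g in range(6):
    share = founder_autosomal_share(g)
    # The Y chromosome is passed from father to son as a whole, so the
    # haplogroup of the paternal line stays the same in every generation.
    print(f"generation {g}: founder ancestry {share:.1%}, Y-DNA haplogroup unchanged")
```

In this toy model the founders' ancestry falls to about 3% after five generations, while the Y-DNA haplogroup of the paternal line is still the original one. Real populations are of course messier, but the asymmetry between ancestry and haplogroup is the point.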

Without more data from ancient DNA, it is difficult to say. Haplogroup T1a1 is found in Morocco (ca. 3780-3650 calBC), which could point to a recent expansion of a Berbero-Semitic branch; but also in a sample from Balkans Neolithic ca. 5800-5400 calBCE, which could suggest an Anatolian origin of the specific subclades encountered here. In any case, a potential origin of Proto-Semitic anywhere near this wide Near Eastern region ca. 4500-3500 BC cannot be ruled out, knowing that their ancestors probably came from Africa.

Distribution of haplogroup T of Y-chromosome. From Wikipedia.

Also interesting from this paper is that we are yet to find a single prehistoric population expansion not associated with a reduction of variability and expansion of Y-DNA haplogroups. It seems that the supposedly mixed Yamna community remains the only (hypothetical) example in history where expanding patrilineal clans did not share a Y-DNA haplogroup…
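NOTE. "Reduction of variability" can be put into a number. The sketch below uses a standard unbiased haplotype diversity estimate, h = n/(n-1) × (1 − Σp²), which is my choice of measure and not something taken from the paper. The Peqi’in counts follow the excerpt (nine of ten males in haplogroup T, the tenth lumped as "other"); the mixed sample is purely hypothetical and only there for contrast.

```python
def haplogroup_diversity(counts):
    """Unbiased haplotype diversity: n / (n - 1) * (1 - sum of squared frequencies)."""
    n = sum(counts.values())
    if n < 2:
        raise ValueError("at least two samples are needed")
    sum_squared_freqs = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1 - sum_squared_freqs)


# From the excerpt: nine out of ten males belong to haplogroup T.
PEQIIN_MALES = {"T": 9, "other": 1}

# Hypothetical sample of the same size with five equally common lineages.
HYPOTHETICAL_MIXED = {"A": 2, "B": 2, "C": 2, "D": 2, "E": 2}

print(f"Peqi'in males:       {haplogroup_diversity(PEQIIN_MALES):.2f}")
print(f"Hypothetical mixed:  {haplogroup_diversity(HYPOTHETICAL_MIXED):.2f}")
```

With these inputs the Peqi’in males come out at about 0.20, against about 0.89 for the made-up mixed sample. With only ten individuals this is no more than an illustration, but it shows what a community dominated by one paternal lineage looks like in such a statistic.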
