Villabruna cluster in Late Epigravettian Sicily supports South Italian corridor for R1b-V88


New preprint Late Upper Palaeolithic hunter-gatherers in the Central Mediterranean: new archaeological and genetic data from the Late Epigravettian burial Oriente C (Favignana, Sicily), by Catalano et al. bioRxiv (2019).

Interesting excerpts (emphasis mine):

Grotta d’Oriente is a small coastal cave located on the island of Favignana, the largest (~20 km2) of a group of small islands forming the Egadi Archipelago, ~5 km from the NW coast of Sicily.

The Oriente C funeral pit opens in the lower portion of layer 7, specifically sublayer 7D. Two radiocarbon dates on charcoal from the sublayers 7D (12149±65 uncal. BP) and 7E, 12132±80 uncal. BP are consistent with the associated Late Epigravettian lithic assemblages (Lo Vetro and Martini, 2012; Martini et al., 2012b) and refer the burial to a period between about 14200-13800 cal. BP, when Favignana was connected to the main island (Agnesi et al., 1993; Antonioli et al., 2002; Mannino et al. 2014).

A-B) Geographic location of Grotta d’Oriente.

The anatomical features of Oriente C are close to those of Late Upper Palaeolithic populations of the Mediterranean and show strong affinity with other Palaeolithic individuals of Sicily. As suggested by Henke (1989) and Fabbri (1995) the hunter-gatherer populations were morphologically rather uniform.

Genetic analysis

We confirmed the originally reported mitochondrial haplogroup assignment of U2’3’4’7’8’9. This haplogroup is present in both pre- and post-LGM populations, but is rare by the Mesolithic, when U5 dominates (Posth et al.2016).

Lipson et al. (2018) (their supplementary Figure S5.1) and Villalba-Mouco et al. (2019) (their Figure 2A) showed that European Late Palaeolithic and Mesolithic hunter-gatherers fall along two main axes of genetic variation. Multidimensional scaling (MDS) of f3-statistics shows that these axes form a “V” shape (Fig. 3). (…)

Focusing further on Oriente C, we find that it shares most drift with individuals from Northern Italy, Switzerland and Luxembourg, and less with individuals from Iberia, Scandinavia, and East and Southeast Europe (Fig. 4A-B). Shared drift decreases significantly with distance (Fig. 4C) and with time (Fig. 4D) although in a linear model of drift with distance and time as a covariate, only distance (p=1.3×10-6) and not time (p=0.11) is significant. Consistent with the overall E-W cline in hunter-gatherer ancestry, genetic distance to Oriente C increases more rapidly with longitude than latitude, although this may also be affected by geographic features. For example, Oriente C shares significantly more drift with the 8,000 year-old 1,400 km distant individual from Loschbour in Luxembourg (Lazaridis et al.,2014), than with the 9,000 year old individual from Vela Spila in Croatia (Mathieson et al.,2018) only 700 km away as shown by the D-statistic (Patterson et al.,2012) D (Mbuti, Oriente C, Vela Spila, Villabruna); Z=3.42. Oriente C’s heterozygosity was slightly lower than Villabruna (14% lower at 1240k transversion sites), but this difference is not significant (bootstrap P=0.12).

Multidimensional scaling of outgroup f3-statistics for Late 531 Upper Palaeolithic and Mesolithic hunter-gatherers.

Discussion and Conclusion

The robust record of radiocarbon dates proves that they reached Sicily not before 15-14 ka cal. BP, several millennia after the LGM peak. In our opinion, in fact, the hypothesis about an early colonization of Sicily by Aurignacians (Laplace, 1964; Chilardi et al., 1996) must be rejected, on the basis of a recent reinterpretation of the techno-typological features of the lithic industries from Riparo di Fontana Nuova (Martini et al., 2007; Lo Vetro and Martini, 2012; on this topic see also Di Maida et al., 2019).

These analyses have implications for understanding the origin and diffusion of the hunter-gatherers that inhabited Europe during the Late Upper Palaeolithic and Mesolithic. Our findings indicate that Oriente C shows a strong genetic relationship with Western European Late Upper Palaeolithic and Mesolithic hunter-gatherers, suggesting that the “Western hunter-gatherers” was a homogeneous population widely distributed in the Central Mediterranean, presumably as a consequence of continuous gene flow among different groups, or a range expansion following the LGM.

The same statistic as in A plotted with geographic position

The South Italian corridor

Once again, a hypothesis based on phylogeography – apart from scarce archaeological and palaeolinguistic data (“Semitic”-like topo-hydronymy and substrates in Europe) – seems to be confirmed step by step. Since the finding of the Villabruna individual of hg. R1b-L754 (likely R1b-V88, like south-eastern European lineages expanded with WHG ancestry), it was quite likely to find out that southern Europe was the origin of the expansion of R1b-V88 into Africa.

The most likely explanation for the presence of “archaic” R1b-V88 subclades among modern Sardinians was, therefore, that they represented a remnant from a Late Upper Palaeolithic/Early Mesolithic population that had not been replaced in subsequent migrations, and thus that the migration of these lineages into Northern Africa and the Green Sahara happened during a period when Italy was connected by a shallower Mediterranean (and more land connections) to Northern Africa.

Likely Late Epigravettian/Mesolithic expansion of R1b-V88 into Northern Africa. See full map.

Nevertheless, the arguments for a quite recent expansion of R1b-V88 through the Mediterranean and into Africa keep being repeated, probably based on ancestry from the few ancient (and many modern) populations that have been investigated to date, a simplistic approach prone to important errors that overarch whole migration models.

For example, in the recent paper by Marcus et al. (2019) the presence of these lineages among ancient Sardinians (from the late 4th millennium BC on) is interpreted as an expansion of R1b-V88 with the Cardial Neolithic based on their ancestry, disregarding the millennia-long gap between these samples and the presence of this haplogroup in Palaeolithic/Mesolithic Northern Iberia and Northern Italy, and the comparatively much earlier splits in the phylogenetic tree and dispersal among African populations.

Afroasiatic and Nostratic

I was asked recently if I really believed that we could reconstruct Proto-Nostratic and connect it with any ancestral population. My answer is simple: until the Chalcolithic – when the whole picture of Indo-Europeans, Uralians, Egyptians or Semites becomes quite clear – we have just very few (linguistic, archaeological, genetic) dots which we would like to connect, and we do so the best we can. The earlier the population and proto-language, the more difficult this task becomes.

NOTE. 1) I tentatively connected hg. R with Nostratic in a previous text – when it appeared that R1a expanded from around Lake Baikal, hence Eurasiatic; R1b from the south with AME-WHG ancestry, hence Afroasiatic; and R2 with Dravidian.

2) After that, I though it was more likely to be connected to AME ancestry and the Middle East, because of the apparent expansion of WHG from south-eastern Europe, and the potential association of Afroasiatic and (Elamo-?)Dravidian to Middle Eastern populations.

3) However, after finding more and more R1b samples expanding through northern Eurasia, spreading through the (then wider) steppe regions; and R1a essentially surviving among other groups in eastern Europe for thousands of years without being associated to significant migrations (like, say, hg. C after the Palaeolithic), it didn’t seem like this division was accurate, hence my most recent version.

But, in essence, it’s all about connecting the dots, and we have very few of them…

Phylogenetic tree from Pagel et al. (2013), partially in agreement with Kortlandt’s view on Eurasiatic. “Consensus phylogenetic tree of Eurasiatic superfamily (A) superimposed on Eurasia and (B) rooted tree with estimated dates of origin of families and of superfamily. (A) Unrooted consensus tree with branch lengths (solid lines) shown to scale and illustrating the correspondence between the tree and the contemporary north-south and east-west geographical positions of these language families. Abbreviations: P (proto) followed by initials of language family: PD, proto-Dravidian; PK, proto-Kartvelian; PU, proto-Uralic; PIE, proto–Indo-European; PA, proto-Altaic; PCK, proto–Chukchi-Kamchatkan; PIY, proto–Inuit-Yupik. The dotted line to PIY extends the inferred branch length into the area in which Inuit-Yupik languages are currently spoken: it is not a measure of divergence. The cross-hatched line to PK indicates that branch has been shortened (compare with B). The branch to proto-Dravidian ends in an area that Dravidian populations are thought to have occupied before the arrival of Indo-Europeans (see main text). (B) Consensus tree rooted using proto-Dravidian as the outgroup. The age at the root is 14.45 ± 1.75 kya (95% CI = 11.72–18.38 kya) or a slightly older 15.61 ± 2.29 kya (95% CI = 11.72–20.40 kya) if the tree is rooted with proto-Kartvelian. The age assumes midpoint rooting along the branch leading to proto-Dravidian (rooting closer to PD would produce an older root, and vice versa), and takes into account uncertainty around proto–Indo-European date of 8,700 ± 544 (SD) y following ref. 35 and the PCK date of 692 ± 67 (SD) y ago.”

In linguistics, I trust traditional linguists who tend to trust other more experimental linguists (like Hyllested or Kortlandt) who consider that – in their experience – an Indo-Uralic and a Eurasiatic phylum can be reconstructed. Similarly, linguists like Kortlandt are apparently (partially) supportive of attempts like that of Allan Bomhard with Nostratic – although almost everyone is critic of the Muscovite school‘s attachment to the Brugmannian reconstruction, stuck in pre-laryngeal Proto-Indo-Anatolian and similar archaisms.

I mostly use Nostratic as a way to give a simplistic ethnolinguistic label to the genetically related prehistoric peoples whose languages we will probably never know. I think it’s becoming clear that the strongest connection right now with the expansion of potential Eurasiatic dialects is offered by ANE-related populations (hence Y-chromosome bottlenecks under hg. R, Q, probably also N), however complicated the reconstruction of that hypothetic community (and its dialectalization) may be.

Therefore, the multiple expansions of lineages more or less closely associated to ANE-related peoples – like R1b-V88 in the case of Afrasian, or R2 in the case of Dravidians – are the easiest to link to the traditionally described Nostratic dialects and their highly hypothetic relationship.

Reconstruction of North African vegetation during past green Sahara periods. Estimated and reconstructed MAP for the Holocene GSP (6–10 kyr BP) projected onto a cross-section along the eastern Sahara (left panel) and map view of reconstructed MAP, vegetation and physiographic elements [7,8,11,45] (right panel). Image from Larrasoaña et al. (2013).

What should be clear to anyone is that the attempt of many modern Afroasiatic speakers to connect their language to their own (or their own community’s main) haplogroups, frequently E and/or J, is flawed for many reasons; it was simplistic in the 2000s, but it is absurd after the advent of ancient DNA investigation and more recent investigation on SNP mutation rates. R1b-V88 should have been on the table of discussions about the expansion of Afroasiatic communities through the Green Sahara long ago, whether one supports a Nostratic phylum or not.

The fact that the role of R1b bottlenecks and expansions in the spread of Afroasiatic is usually not even discussed despite their likely connection with the most recent population expansions through the Green Sahara fitting a reasonable time frame for Proto-Afroasiatic reconstruction, a reasonable geographical homeland, and a compatible dialectal division – unlike many other proposed (E or J) subclades – reveals (once again) a lot about the reasons behind amateur interest in genetics.

Just like seeing the fixation in (and immobility of) recent writings about the role of I1, I2, or (more recently) R1a in the Proto-Indo-European expansion, R1b with Vasconic, or N1c with Proto-Uralic.

NOTE. That evident interest notwithstanding, it is undeniable that we have a much better understanding of the expansions of R1b subclades than other haplogroups, probably due in great part to the easier recovery of ancient DNA from Eurasia (and Europe in particular), for many different – sociopolitical, geographical, technological – reasons. It is quite possible that a more thorough temporal transect of ancient DNA from the Middle East and Africa might radically change our understanding of population movements, especially those related to the Afroasiatic expansion. I am referring in this post to interpretations based on the data we currently have, despite that potential R1b-based bias.


Ancient genomes from North Africa evidence Neolithic migrations to the Maghreb

BioRxiv preprint now published (behind paywall) Ancient genomes from North Africa evidence prehistoric migrations to the Maghreb from both the Levant and Europe, by Fregel et al., PNAS (2018).

NOTE. I think one of the important changes in this version compared to the preprint is the addition of the recent Iberomaurusian samples.

Abstract (emphasis mine):

The extent to which prehistoric migrations of farmers influenced the genetic pool of western North Africans remains unclear. Archaeological evidence suggests that the Neolithization process may have happened through the adoption of innovations by local Epipaleolithic communities or by demic diffusion from the Eastern Mediterranean shores or Iberia. Here, we present an analysis of individuals’ genome sequences from Early and Late Neolithic sites in Morocco and from Early Neolithic individuals from southern Iberia. We show that Early Neolithic Moroccans (∼5,000 BCE) are similar to Later Stone Age individuals from the same region and possess an endemic element retained in present-day Maghrebi populations, confirming a long-term genetic continuity in the region. This scenario is consistent with Early Neolithic traditions in North Africa deriving from Epipaleolithic communities that adopted certain agricultural techniques from neighboring populations. Among Eurasian ancient populations, Early Neolithic Moroccans are distantly related to Levantine Natufian hunter-gatherers (∼9,000 BCE) and Pre-Pottery Neolithic farmers (∼6,500 BCE). Late Neolithic (∼3,000 BCE) Moroccans, in contrast, share an Iberian component, supporting theories of trans-Gibraltar gene flow and indicating that Neolithization of North Africa involved both the movement of ideas and people. Lastly, the southern Iberian Early Neolithic samples share the same genetic composition as the Cardial Mediterranean Neolithic culture that reached Iberia ∼5,500 BCE. The cultural and genetic similarities between Iberian and North African Neolithic traditions further reinforce the model of an Iberian migration into the Maghreb.

Ancestry inference in ancient samples from North Africa and the Iberian Peninsula. PCA analysis using the Human Origins panel (European, Middle Eastern, and North African populations) and LASER projection of aDNA samples.

Relevant excerpts:

FST and outgroup-f3 distances indicate a high similarity between IAM and Taforalt. As observed for IAM, most Taforalt sample ancestry derives from Epipaleolithic populations from the Levant. However, van de Loosdrecht et al. (17) also reported that one third of Taforalt ancestry was of sub-Saharan African origin. To confirm whether IAM individuals show a sub-Saharan African component, we calculated f4(chimpanzee, African population; Natufian, IAM) in such a way that a positive result for f4 would indicate that IAM is composed both of Levantine and African ancestries. Consistent with the results observed for Taforalt, f4 values are significantly positive for West African populations, with the highest value observed for Gambian and Mandenka (Fig. 3 and SI Appendix, Supplementary Note 10). Together, these results indicate the presence of the same ancestral components in ∼15,000-y old and ∼7,000-y-old populations from Morocco, strongly suggesting a temporal continuity between Later Stone Age and Early Neolithic populations in the Maghreb. However, it is important to take into account that the number of ancient genomes available for comparison is still low and future sampling can provide further refinement in the evolutionary history of North Africa.

Genetic analyses have revealed that the population history of modern North Africans is quite complex (11). Based on our aDNA analysis, we identify an Early Neolithic Moroccan component that is (i) restricted to North Africa in present-day populations (11); (ii) the sole ancestry in IAM samples; and (iii) similar to the one observed in Later Stone Age samples from Morocco (17). We conclude that this component, distantly related to that of Epipaleolithic communities from the Levant, represents the autochthonous Maghrebi ancestry associated with Berber populations. Our data suggests that human populations were isolated in the Maghreb since Upper Paleolithic times. Our hypothesis is in agreement with archaeological research pointing to the first stage of the Neolithic expansion in Morocco as the result of a local population that adopted some technological innovations, such as pottery production or farming, from neighboring areas.

By 3,000 BCE, a continuity in the Neolithic spread brought Mediterranean-like ancestry to the Maghreb, most likely from Iberia. Other archaeological remains, such as African elephant ivory and ostrich eggs found in Iberian sites, confirm the existence of contacts and exchange networks through both sides of the Gibraltar strait at this time. Our analyses strongly support that at least some of the European ancestry observed today in North Africa is related to prehistoric migrations, and local Berber populations were already admixed with Europeans before the Roman conquest. Furthermore, additional European/ Iberian ancestry could have reached the Maghreb after KEB people; this scenario is supported by the presence of Iberian-like Bell-Beaker pottery in more recent stratigraphic layers of IAM and KEB caves. Future paleogenomic efforts in North Africa will further disentangle the complex history of migrations that forged the ancestry of the admixed populations we observe today.

Ancestry inference in ancient samples from North Africa and the Iberian Peninsula. (B) ADMIXTURE analysis using the Human Origins dataset (European, Middle Eastern, and North African populations) for modern and ancient samples (K = 8). (D) Detail of ADMIXTURE analysis using the Human Origins dataset (European, Middle Eastern, North African, and sub-Saharan African populations) for modern and ancient samples, including Taforalt.

Also, from the main author’s Twitter account:

I just realized that the paragraph with information on data availability is missing! Sequence data in the European Nucleotide Archive (PRJEB22699). Consensus mtDNA sequences are available at the National Center of Biotechnology Information (Accession Numbers MF991431-MF991448).

I find it hard to believe that this genetic continuity from Upper Palaeolithic to Late Neolithic could be representative of an autochthonous development of Afroasiatic. An important population movement – likely more than one – must be found in ancient DNA influencing North-Central and North-East Africa, probably during the time of the Green Sahara corridor.

See here:

Distribution of Southern Iberian haplogroup H indicates exchanges in the western Mediterranean

Recent open access paper The distribution of mitochondrial DNA haplogroup H in southern Iberia indicates ancient human genetic exchanges along the western edge of the Mediterranean, by Hernández, Dugoujon, Novelletto, Rodríguez, Cuesta and Calderón, BMC Genetics (2017).

Abstract (emphasis mine):

The structure of haplogroup H reveals significant differences between the western and eastern edges of the Mediterranean, as well as between the northern and southern regions. Human populations along the westernmost Mediterranean coasts, which were settled by individuals from two continents separated by a relatively narrow body of water, show the highest frequencies of mitochondrial haplogroup H. These characteristics permit the analysis of ancient migrations between both shores, which may have occurred via primitive sea crafts and early seafaring. We collected a sample of 750 autochthonous people from the southern Iberian Peninsula (Andalusians from Huelva and Granada provinces). We performed a high-resolution analysis of haplogroup H by control region sequencing and coding SNP screening of the 337 individuals harboring this maternal marker. Our results were compared with those of a wide panel of populations, including individuals from Iberia, the Maghreb, and other regions around the Mediterranean, collected from the literature.

Both Andalusian subpopulations showed a typical western European profile for the internal composition of clade H, but eastern Andalusians from Granada also revealed interesting traces from the eastern Mediterranean. The basal nodes of the most frequent H sub-haplogroups, H1 and H3, harbored many individuals of Iberian and Maghrebian origins. Derived haplotypes were found in both regions; haplotypes were shared far more frequently between Andalusia and Morocco than between Andalusia and the rest of the Maghreb. These and previous results indicate intense, ancient and sustained contact among populations on both sides of the Mediterranean.

Our genetic data on mtDNA diversity, combined with corresponding archaeological similarities, provide support for arguments favoring prehistoric bonds with a genetic legacy traceable in extant populations. Furthermore, the results presented here indicate that the Strait of Gibraltar and the adjacent Alboran Sea, which have often been assumed to be an insurmountable geographic barrier in prehistory, served as a frequently traveled route between continents.

a, b, c. Interpolated frequency surfaces of clade H and its main sub-clades (H1 and H3). Frequencies (%) are showed in a colour scale. See information about the populations used in Additional files 4 and 5. Map templates were taken from Natural Earth free map repository (

I usually find mtDNA data, especially studies like this one based on modern populations, very difficult to interpret for anthropological purposes. It is well-known that there are important differences in the pattern of Y-DNA and mtDNA expansion and distribution.

A paragraph in this respect caught my attention:

The patterns of variation in the Y-chromosome between western and eastern Andalusians, based on 416 males, have also been investigated for a set of Y-Short Tandem Repeats (Y-STRs) and Y-SNPs [53, 54, 55], Calderón et al., unpublished data] in combination to mtDNA analyses ([18, 19] and present study). In general, for both uniparental makers, Andalusians exhibit a typical western European genetic background, with peak frequencies of mtDNA Hg H and Y-chromosome Hg R1b1b2-M269 (45% and 60%, respectively). Interestingly, our results have further revealed that the influence of African female input is far more significant when compared to male influence in contemporary Andalusians. The lack of correspondence between the maternal and paternal genetic profiles of human populations reflects intrinsic differences in migratory behavior related to sex-biased processes and admixture, as well as differences in male and female effective population sizes related to the variance in reproductive success affected, for example, by polygyny [56, 57].

I think that the greater reduction in patrilineal lineages compared to maternal lineages we usually see during and after prehistoric or historic migrations have more to do with the renown Uí Néill family case and with war-related casualties (since combatants were usually men) than with other more popular explanations, such as enslavement of women or polygyny.

The most successful paternal lines (anywhere in the world) were probably those who remained in power for a long time (be it a patriarchal society based on families, clans, or more complex organizational units), who were richer and thus more capable of having healthy offspring, who in turn were able to survive longer and have more children who inherited power, etc.

In case of recent migrations or population movements that disrupt the previously established organization, after a certain number of generations, successful patrilocal families (usually from incoming lineages) might slowly dominate over a whole region, with poorer families (usually of ‘indigenous’ lineages) suffering a greater – especially perinatal and child – mortality, without any obvious (pre)historic event associated to these gradual changes.

This gradual replacement of paternal lineages is compatible with the adoption of the native language by newcomers. If the number of migrants is greater that the native population, and especially if their technology is more advanced, then a more radical change including ethnolinguistic identification is more likely.

I don’t deny the (pre)historic existence of radical replacement of male populations with continuity of female lineages due to massacres of men, female slavery, or polygyny, but they are probably not the main explanation for most regional differences seen in paternal lineages, and should thus be used with caution.

Gradual replacement and founder effects are also the most logical explanation for why autochthonous continuity myths (that the modern regional prevalence of few successful lineages tended to create in the 2000s) haven’t been corroborated by ancient DNA; e.g. R1b-DF27 in Basques, N1c-M178 in Finnic populations, R1a-Z283 in Slavs, etc. There is nothing different in those areas from other recent founder effects and internal migratory flows seen everywhere in Europe in the past millennia.

Paper discovered via a link by Alberto Gonzalez on Facebook group Iberia ADN


Pleistocene North African genomes link Near Eastern and sub-Saharan African human populations


Pleistocene North African genomes link Near Eastern and sub-Saharan African human populations, by van de Loosdrecht et al. Science (2018).


North Africa is a key region for understanding human history, but the genetic history of its people is largely unknown. We present genomic data from seven 15,000-year-old modern humans from Morocco, attributed to the Iberomaurusian culture. We find a genetic affinity with early Holocene Near Easterners, best represented by Levantine Natufians, suggesting a pre-agricultural connection between Africa and the Near East. We do not find evidence for gene flow from Paleolithic Europeans into Late Pleistocene North Africans. The Taforalt individuals derive one third of their ancestry from sub-Saharan Africans, best approximated by a mixture of genetic components preserved in present-day West and East Africans. Thus, we provide direct evidence for genetic interactions between modern humans across Africa and Eurasia in the Pleistocene.


We analyzed the genetic affinities of the Taforalt individ-uals by performing principal component analysis (PCA) and model-based clustering of worldwide data (Fig. 2). When pro-jected onto the top PCs of African and West Eurasian popu-lations, the Taforalt individuals form a distinct cluster in an intermediate position between present-day North Africans (e.g., Amazighes (Berbers), Mozabite and Saharawi) and East Africans (e.g., Afar, Oromo and Somali) (Fig. 2A). Consist-ently, we find that all males with sufficient nuclear DNA preservation carry Y haplogroup E1b1b1a1 (M-78; table S16). This haplogroup occurs most frequently in present-day North and East African populations (18). The closely related E1b1b1b (M-123) haplogroup has been reported for Epipaleolithic Natufians and Pre-Pottery Neolithic Levantines (“Levant_N”) (16). Unsupervised genetic clustering also suggests a connection of Taforalt to the Near East. The three major components that comprise the Taforalt genomes are maximized in early Holocene Levantines, East African hunter-gatherer Hadza from north-central Tanzania, and West Africans (K = 10; Fig. 2B). In contrast, present-day North Africans have smaller sub-Saharan African components with minimal Hadza-related contribution (Fig. 2B).

Taforalt harboring an ancestry that contains additional affinity with South, East and Central African outgroups. None of the present-day or ancient Holocene African groups serve as a good proxy for this unknown ancestry, because adding them as the third source is still insufficient to match the model to the Taforalt gene pool.

Mitochondrial consensus sequences of the Taforalt indi-viduals belong to the U6a (n = 6) and M1b (n = 1) haplogroups (15), which are mostly confined to present-day populations in North and East Africa (7). U6 and M1 have been proposed as markers for autochthonous Maghreb ancestry, which might have been originally introduced into this region by a back-to-Africa migration from West Asia (6, 7). The occurrence of both haplogroups in the Taforalt individuals proves their pre-Holocene presence in the Maghreb.
(…) the diversification of haplogroup U6a and M1 found for Taforalt is dated to ~24,000 yBP (fig. S23), which is close in time to the earliest known appearance of the Iberomaurusian in Northwest Africa (25,845-25,270 cal. yBP at Tamar Hat (26)).

A summary of the genetic profile of the Taforalt individuals. (A) The top two PCs calculated from present-day African, Near Eastern and South European individuals from 72 populations. The Taforalt individuals are projected thereon (red-colored circles). Selected present-day populations are marked by colored symbols. Labels for other populations (marked by small grey circles) are provided in fig. S8. (B) ADMIXTURE results of chosen African and Middle Eastern populations (K = 10). Ancient individuals are labeled in red color. Major ancestry components in Taforalt are maximized in early Holocene Levantines (green), West Africans (purple) and East African Hadza (brown). The ancestry component prevalent in pre-Neolithic Europeans (beige) is absent in Taforalt.

The relationships of the Iberomaurusian culture with the preceding MSA, including the local backed bladelet technologies in Northeast Africa, and the Epigravettian in southern Europe have been questioned (13). The genetic profile of Taforalt suggests substantial Natufian-related and sub-Saharan African-related ancestries (63.5% and 36.5%, respec-tively), but not additional ancestry from Epigravettian or other Upper Paleolithic European populations. Therefore, we provide genomic evidence for a Late Pleistocene connection between North Africa and the Near East, predating the Neolithic transition by at least four millennia, while rejecting a potential Epigravettian gene flow from southern Europe into northern Africa within the resolution of our data.

It seems that the Taforalt gene pool (ca. 13000-12000 BC) cannot be explained by a connection with Upper Palaeolithic Europeans, but a more archaic admixture, so the authors cannot prove a migration through the Strait of Gibraltar or Sicily.

Nevertheless, these results apparently suggest:

  • That there is no contact before ca. 12000 BC through the Strait of Gibraltar; therefore the Sicilian route I support for the migration of R1b-V88 lineages is still the most likely one.
  • That the North African connection with Natufians is quite old – for which we already had modern Y-DNA investigation – , and therefore unlikely to be related to the Afroasiatic expansion.

I am glad I had some more time this week to read at least some interesting parts of the published papers, because the information to process is becoming insanely huge…


Expansion of peoples associated with spread of haplogroups: Mongols and C3*-F3918, Arabs and E-M183 (M81)


The expansion of peoples is known to be associated with the spread of a certain admixture component, joint with the expansion and reduction in variability of a haplogroup. In other words, few male lineages are usually more successful during the expansion.

Known examples include:

Two recent interesting papers add prehistoric cases of potential expansion of cultures associated with haplogroups:

1. Whole Y-chromosome sequences reveal an extremely recent origin of the most common North African paternal lineage E-M183 (M81), by Solé-Morata et al., Scientific Reports (2017).


E-M183 (E-M81) is the most frequent paternal lineage in North Africa and thus it must be considered to explore past historical and demographical processes. Here, by using whole Y chromosome sequences from 32 North African individuals, we have identified five new branches within E-M183. The validation of these variants in more than 200 North African samples, from which we also have information of 13 Y-STRs, has revealed a strong resemblance among E-M183 Y-STR haplotypes that pointed to a rapid expansion of this haplogroup. Moreover, for the first time, by using both SNP and STR data, we have provided updated estimates of the times-to-the-most-recent-common-ancestor (TMRCA) for E-M183, which evidenced an extremely recent origin of this haplogroup (2,000–3,000 ya). Our results also showed a lack of population structure within the E-M183 branch, which could be explained by the recent and rapid expansion of this haplogroup. In spite of a reduction in STR heterozygosity towards the West, which would point to an origin in the Near East, ancient DNA evidence together with our TMRCA estimates point to a local origin of E-M183 in NW Africa.

Distribution of E-M183 subclades among North Africa, the Near East and the Iberian Peninsula. Pie chart sectors areas are proportional to haplogroup frequency and are coloured according to haplogroup in the schematic tree to the right. n: sample size. Map was generated using R software.

An interesting excerpt, from the discussion:

Regarding the geographical origin of E-M183, a previous study suggested that an expansion from the Near East could explain the observed east-west cline of genetic variation that extends into the Near East. Indeed, our results also showed a reduction in STR heterozygosity towards the West, which may be taken to support the hypothesis of an expansion from the Near East. In addition, previous studies based on genome-wide SNPs reported that a North African autochthonous component increase towards the West whereas the Near Eastern decreases towards the same direction, which again support an expansion from the Near East. However, our correlations should be taken carefully because our analysis includes only six locations on the longitudinal axis, none from the Near East. As a result, we do not have sufficient statistical power to confirm a Near Eastern origin. In addition, rather than showing a west-to-east cline of genetic diversity, the overall picture shown by this correlation analysis evidences just low genetic diversity in Western Sahara, which indeed could be also caused by the small sample size (n = 26) in this region. Alternatively, given the high frequency of E-M183 in the Maghreb, a local origin of E-M183 in NW Africa could be envisaged, which would fit the clear pattern of longitudinal isolation by distance reported in genome-wide studies. Moreover, the presence of autochthonous North African E-M81 lineages in the indigenous population of the Canary Islands, strongly points to North Africa as the most probable origin of the Guanche ancestors. This, together with the fact that the oldest indigenous inviduals have been dated 2210 ± 60 ya, supports a local origin of E-M183 in NW Africa. Within this scenario, it is also worth to mention that the paternal lineage of an early Neolithic Moroccan individual appeared to be distantly related to the typically North African E-M81 haplogroup30, suggesting again a NW African origin of E-M183. A local origin of E-M183 in NW Africa > 2200 ya is supported by our TMRCA estimates, which can be taken as 2,000–3,000, depending on the data, methods, and mutation rates used.

The TMRCA estimates of a certain haplogroup and its subbranches provide some constraints on the times of their origin and spread. Although our time estimates for E-M78 are slightly different depending on the mutation rate used, their confidence intervals overlap and the dates obtained are in agreement with those obtained by Trombetta et al Regarding E-M183, as mentioned above, we cannot discard an expansion from the Near East and, if so, according to our time estimates, it could have been brought by the Islamic expansion on the 7th century, but definitely not with the Neolithic expansion, which appeared in NW Africa ~7400 BP and may have featured a strong Epipaleolithic persistence. Moreover, such a recent appearance of E-M183 in NW Africa would fit with the patterns observed in the rest of the genome, where an extensive, male-biased Near Eastern admixture event is registered ~1300 ya, coincidental with the Arab expansion. An alternative hypothesis would involve that E-M183 was originated somewhere in Northwest Africa and then spread through all the region. Our time estimates for the origin of this haplogroup overlap with the end of the third Punic War (146 BCE), when Carthage (in current Tunisia) was defeated and destroyed, which marked the beginning of Roman hegemony of the Mediterranean Sea. About 2,000 ya North Africa was one of the wealthiest Roman provinces and E-M183 may have experienced the resulting population growth.

2. The Y-chromosome haplogroup C3*-F3918, likely attributed to the Mongol Empire, can be traced to a 2500-year-old nomadic group, by Zhang et al., Journal of Human Genetics (2017)


The Mongol Empire had a significant role in shaping the landscape of modern populations. Many populations living in Eurasia may have been the product of population mixture between ancient Mongolians and natives following the expansion of Mongol Empire. Geneticists have found that most of these populations carried the Y-haplogroup C3* (C-M217). To trace the history of haplogroup (Hg) C3* and to further understand the origin and development of Mongolians, ancient human remains from the Jinggouzi, Chenwugou and Gangga archaeological sites, which belonged to the Donghu, Xianbei and Shiwei, respectively, were analysed. Our results show that nine of the eleven males of the Gangga site, two of the eight males of Chengwugou site and all of the twelve males of Jinggouzi site were found to have mutations at M130 (Hg C), M217 (Hg C3), L1373 (C2b, ISOGG2015), with the absence of mutations at M93 (Hg C3a), P39 (Hg C3b), M48 (Hg C3c), M407 (Hg C3d) and P62 (Hg C3f). These samples were attributed to the Y-chromosome Hg C3* (Hg C2b, ISOGG2015), and most of them were further typed as Hg C2b1a based on the mutation at F3918. Finally, we inferred that the Y-chromosome Hg C3*-F3918 can trace its origins to the Donghu ancient nomadic group.

The development of Mongolia and the frequencies of haplogroup C3* in modern Eurasians. a The development of Mongolia. b The frequencies of haplogroup C3 in modern Eurasians. The dotted line represents the approximate boundary between the Xiongnu and the Donghu. The black and grey arrows denote the migration of the Donghu and Mongolians, respectively

Featured image: Diachronic map of Iron Age migrations ca. 750-250 BC.