Recent Africa origin with hybridization, and back to Africa 70,000 years ago


Open access Carriers of mitochondrial DNA macrohaplogroup L3 basal lineages migrated back to Africa from Asia around 70,000 years ago, by Cabrera et al. BMC Evol Biol (2018) 18(98).

Abstract (emphasis mine):


The main unequivocal conclusion after three decades of phylogeographic mtDNA studies is the African origin of all extant modern humans. In addition, a southern coastal route has been argued for to explain the Eurasian colonization of these African pioneers. Based on the age of macrohaplogroup L3, from which all maternal Eurasian and the majority of African lineages originated, the out-of-Africa event has been dated around 60-70 kya. On the opposite side, we have proposed a northern route through Central Asia across the Levant for that expansion and, consistent with the fossil record, we have dated it around 125 kya. To help bridge differences between the molecular and fossil record ages, in this article we assess the possibility that mtDNA macrohaplogroup L3 matured in Eurasia and returned to Africa as basal L3 lineages around 70 kya.


The coalescence ages of all Eurasian (M,N) and African (L3 ) lineages, both around 71 kya, are not significantly different. The oldest M and N Eurasian clades are found in southeastern Asia instead near of Africa as expected by the southern route hypothesis. The split of the Y-chromosome composite DE haplogroup is very similar to the age of mtDNA L3. An Eurasian origin and back migration to Africa has been proposed for the African Y-chromosome haplogroup E. Inside Africa, frequency distributions of maternal L3 and paternal E lineages are positively correlated. This correlation is not fully explained by geographic or ethnic affinities. This correlation rather seems to be the result of a joint and global replacement of the old autochthonous male and female African lineages by the new Eurasian incomers.


These results are congruent with a model proposing an out-of-Africa migration into Asia, following a northern route, of early anatomically modern humans carrying pre-L3 mtDNA lineages around 125 kya, subsequent diversification of pre-L3 into the basal lineages of L3, a return to Africa of Eurasian fully modern humans around 70 kya carrying the basal L3 lineages and the subsequent diversification of Eurasian-remaining L3 lineages into the M and N lineages in the outside-of-Africa context, and a second Eurasian global expansion by 60 kya, most probably, out of southeast Asia. Climatic conditions and the presence of Neanderthals and other hominins might have played significant roles in these human movements. Moreover, recent studies based on ancient DNA and whole-genome sequencing are also compatible with this hypothesis.


You can also read the recent interesting open access review How did Homo sapiens evolve? by Julia Galway-Witham, Chris Stringer, Science (2018) 360:6395 1296-1298.


Bantu distinguished from Khoe by uniparental markers, not genome-wide autosomal admixture


The role of matrilineality in shaping patterns of Y chromosome and mtDNA sequence variation in southwestern Angola, by Oliveira et al. bioRxiv (2018).

Interesting excerpts (emphasis mine):

The origins of NRY diversity in SW Angola

In accordance with our previous mtDNA study9, the present NRY analysis reveals a major division between the Kx’a-speaking !Xun and the Bantu-speaking groups, whose paternal genetic ancestry does not display any old remnant lineages, or a clear link to pre-Bantu eastern African migrants introducing Khoe-Kwadi languages and pastoralism into southern Africa (cf. 15). This is especially evident in the distribution of the eastern African subhaplogroup E1b1b1b2b29, which reaches the highest frequency in the !Xun (25%) and not in the formerly Kwadi-speaking Kwepe (7%). This observation, together with recent genome-wide estimates of 9-22% of eastern African ancestry in other Kx’a and Tuu-speaking groups35, suggests that eastern African admixture was not restricted to present-day Khoe-Kwadi speakers. Alternatively, it is likely that the dispersal of pastoralism and Khoe-Kwadi languages involved a series of punctuated contacts that led to a wide variety of cultural, genetic and linguistic outcomes, including possible shifts to Khoe-Kwadi by originally Bantu-speaking peoples36.

Although traces of an ancestral pre-Bantu population may yet be found in autosomal genome-wide studies, the extant variation in both uniparental markers strongly supports a scenario in which all groups of the Angolan Namib share most of their genetic ancestry with other Bantu groups but became increasingly differentiated within the highly stratified social context of SW African pastoral societies11.

Y chromosome phylogeny, haplogroup distribution and map of the sampling locations. The phylogenetic tree was reconstructed in BEAST based on 2,379 SNPs and is in accordance with the known Y chromosome topology. Main haplogroup clades and their labels are shown with different colors. Age estimates are reported in italics near each node, with the TMRCA of main haplogroups shown with their corresponding color. A map of the sampling locations, re-used with permission from Oliveira et al. (2018) 9, is shown on the bottom left, and the haplogroup distribution per population is shown on the bottom right, with color-coding corresponding to the phylogenetic tree.

The influence of socio-cultural behaviors on the diversity of NRY and mtDNA

A comparison of the NRY variation with previous mtDNA results for the same groups 9 identifies three main sex-specific patterns. First, gene flow from the Bantu into the !Xun is much higher for male than for female lineages (31% NRY vs. 3% mtDNA), similar to the reported male-biased patterns of gene flow from Bantu to Khoisan-speaking groups33, and from non-Pygmies to Pygmies in Central Africa 37. A comparable trend, involving exclusive introgression of NRY eastern African lineages into the !Xun (25%) was also found. (…)

Secondly, the levels of intrapopulation diversity in the Bantu-speaking peoples from the Namib were found to be consistently higher for mtDNA than for the NRY, reflecting the marked association between the Bantu expansion and the relatively young NRY E1b1a1a1 haplogroup, which has no parallel in mtDNA25,39. (…)

In the context of the Bantu expansions, these patterns have been mostly interpreted as the result of polygyny and/or higher levels of assimilation of females from resident forager communities38,40. However, most groups from the Angolan Namib are only mildly polygynous11 and ethnographic data suggest that the actual rates of polygyny in many populations may be insufficient to significantly reduce Nem2,41. In addition, the finding of a large Nef/ Nem ratio in the Himba (Fig. S5), who have almost no Khoisan-related mtDNA lineages9, indicates that female biased introgression cannot fully explain the observed patterns.

An alternative explanation may be sought in the prevailing matrilineal descent rules, which might have created a sex-specific structuring effect, similar to that proposed for patrilineal groups from Central Asia (…)

Bayesian skyline plots (BSP) of effective population size change through time, based on mtDNA (red) and the NRY (black). Thick lines show the mean estimates and dashed lines show the 95% HPD intervals. The vertical line highlights the 2 ky before present mark. Effective sizes are plotted on a log scale. Generation times of 25 and 31 years were assumed for mtDNA and the NRY, respectively32.

The third important sex-specific pattern observed in this study is the much lower amount of between-group differentiation for NRY than for mtDNA among Bantu-speaking populations (4.4% NRY vs. 20.2% mtDNA), in spite of the patrilocal residence patterns of all ethnic groups (Table S5). This difference can hardly be explained by unequal levels of introgression of “Khoisan” mtDNA lineages into the Bantu, since the percentage of mtDNA variation remains high (18.8%) when the Kuvale, who have high frequencies of “Khoisan”-related mtDNA, are excluded from the comparisons. It therefore seems more plausible that differentiation is higher in the mtDNA simply because there is more ancestral mtDNA than NRY variation that can be sorted among different populations (see 45). Moreover, due to the matriclanic organization of all Bantu-speaking communities, factors enhancing inter-group differentiation, like kin-structured migration and kin-structured founder effects46, would have been restricted to mtDNA. Finally, it is also likely that the discrepancy between among-group divergence of mtDNA and NRY might have been influenced by higher migration rates in males than females. In fact, although all Bantu-speaking populations have patrilocal residence patterns, the observance of endogamy rules severely constrains the between-group mobility of females. In this context, the children from extramarital unions involving members from different populations tend to be raised in the mother’s group, effectively increasing male versus female migration rates. Moreover, it is likely that, in the highly hierarchized setting of the Namib, most intergroup extramarital unions would involve men from dominant groups and women from peripatetic communities. This hypothesis is indirectly supported by the finding that in NRY-based clusters (but not in mtDNA) pastoralist populations are grouped together with peripatetic communities that share their cultural traits (Figs. S6 and 3b), suggesting that migration of NRY lineages follows a path that is similar to horizontally transmitted cultural features.


Yamna/Afanasevo elite males dominated by R1b-L23, Okunevo brings ancient Siberian/Asian population


Open access paper New genetic evidence of affinities and discontinuities between bronze age Siberian populations, by Hollard et al., Am J Phys Anthropol. (2018) 00:1–11.

NOTE. This seems to be a peer-reviewed paper based on a more precise re-examination of the samples from Hollard’s PhD thesis, Peuplement du sud de la Sibérie et de l’Altaï à l’âge du Bronze : apport de la paléogénétique (2014).

Interesting excerpts:

Afanasevo and Yamna

The Afanasievo culture is the earliest known archaeological culture of southern Siberia, occupying the Minusinsk-Altai region during the Eneolithic era 3600/3300 BC to 2500 BC (Svyatko et al., 2009; Vadetskaya et al., 2014). Archeological data showed that the Afanasievo culture had strong affinities with the Yamnaya and pre-Yamnaya Eneolithic cultures in the West (Grushin et al., 2009). This suggests a Yamnaya migration into western Altai and into Afanasievo. Note that, in most current publications, “the Yamnaya culture” combines the so-called “classical Yamnaya culture” of the Early Bronze Age and archeological sites of the preceding Repin culture in the middle reaches of the Don and Volga rivers. In the present article we conventionally use the term Yamnaya in the same sense, in which case the beginning of the “Yamnaya culture” can be dated after the middle of the 4th millennium BC, when the Afanasievo culture appeared in the Altai.

Because of numerous traits attributed to early Indo-Europeans and cultural relations with Kurgan steppe cultures, members of the Afanasievo culture are believed to have been Indo-European speakers (Mallory and Mair, 2000). In a recent whole-genome sequencing study, Allentoft et al. (2015) concluded that Eastern Yamnaya individuals and Afanasievo individuals were genetically indistinguishable. Moreover, this study and one published concurrently by Haak et al. (2015) analyzed 11 Eastern Yamnaya males and showed that all of them belonged to the R1b1a1a (formerly R1b1a) (…)

Early Chalcolithic migrations ca. 3300-2600 BC.

Published works indicate that R1b was a predominant haplogroup from the late Neolithic to the early Bronze Age, notably in the Bell Beaker and Yamnaya cultures (Allentoft et al., 2015; Haak et al., 2015; Lee et al., 2012; Mathieson et al., 2015). Nearly 100% of the Afanasievo men we typed belonged to the R1b1a1a subhaplogroup and, for at least three of them, more precisely to the L23 (xM412) subclade. (…)

(…) our results therefore support the hypothesis of a genetic link between Afanasievo and Yamnaya. This also suggests that R1b was indeed dominant in the early Bronze Age Siberian steppe, at least in individuals that were buried in kurgans (possibly an elite part of the population). The geographical and temporal distribution of subhaplogroup R1b1a1a supports the hypothesis of population expansion from West to East in the Eurasian steppe during this period. It should however be noted that the Yamnaya burials from which the samples for DNA analysis were obtained (Allentoft et al., 2015; Haak et al., 2015; Mathieson et al., 2015) were dated within the limits of the Afanasievo period. Ancestors of both East Yamnaya and Afanasievo populations must therefore be sought in the context of earlier Eneolithic cultures in Eastern Europe. Sufficient Y-chromosomal data from such Eneolithic populations is, unfortunately, not yet available.

Mitochondrial- (A) and Y- (B) haplogroup distribution in studied populations

Okunevo and paternal lineage shift in South Siberia

Results obtained in the current study, from more than a dozen Okunevo individuals belonging to the earliest stage of Okunevo culture, that is the Uibat period (2500–2200 BC) (Lazaretov, 1997), suggest a discontinuity in the genetic pool between Afanasievo and Okunevo cultures. Although Y-chromosomal data obtained for bearers of the Okunevo culture showed that one individual carried haplogroup R1b, most Okunevo Y-haplogroups are representative of an Asian component represented by paternal lineages Q and NO1.

Okunevo carrier of Y-haplogroup Q1b1a-L54, which also supports this hypothesis (L54 being a marker of the lineage from which M3, the main Ameridian lineage, arose). Okunevo people could therefore be a remnant paleo-Siberian population with possible Afanasievo input, as suggested by the presence of the R1b1a1a2a subhaplogroup in one individual.

Late Chalcolithic migrations ca. 2600-2250 BC.

Replacement of Asian Indo-European elite lineages by R1a

Published genetic data from the late Bronze Age Andronovo culture from the Minusinsk Basin (Keyser et al., 2009), the Sintashta culture from Russia (Allentoft et al., 2015) and the Srubnaya culture from the region of Samara (Mathieson et al., 2015), show that males did not belong to Y-haplogroup R1b but mostly to R1a clades: there appears to have been a change in the dominant Y-chromosomal haplogroup between the early and the late Bronze Age in these regions. Moreover, as described in Allentoft et al. (2015), the Andronovo and Sintashta peoples were closely related to each other but clearly distinct from both Yamnaya and Afanasievo. Although these results do not imply that Y-haplogroup R1b was entirely absent in these later populations, they could correspond to a replacement of the elite between these two main periods and therefore a difference in the haplogroups of the men that were preferentially buried.

Early Bronze Age migrations ca. 2250-1750 BC.

Afanasevo and the Tarim Basin

The discovery, in the Tarim Basin, of well-preserved mummies from the Bronze Age allows for the construction of two hypotheses regarding the peopling of the Xinjiang province at this period. The “steppe hypothesis,” argues for a link with nomadic steppe herders (Hemphill and Mallory, 2004), possibly represented in this case by Afanasievo populations and their descendants (Mallory and Mair, 2000). However, newly published cultural data from the burial grounds of Gumugou (Wang, 2014) and Xiaohe (Xinjiang, 2003, 2007) shows material culture and burial rites incompatible with the Afanasievo culture. The earliest 14C date for Tarim Basin burials would place them at the turn of the 2nd millenium BC (Wang, 2013), 500 years after the Afanasievo period.

Instead, early Gumugou and Xiaohe burial grounds were contemporary with the start of the Andronovo period. Likewise, the Bronze Age population of the Xinjiang at Gumugou/Qäwrighul is not phenotypically closest to Afanasievo but to the Andronovo (Fedorovo) group of northeastern Kazakhstan and western Altai (Kozintsev, 2009). Our investigations demonstrate that Y-chromosomal lineage composition is also compatible with the notion that the ancient Tarim population was genetically distinct from the Afanasievo population. The only Y-haplogroup found by Li et al. (2010) in the Bronze Age Tarim Basin population was Y-haplogroup R1a, which suggests a proximity of this population with Andronovo groups rather than Afanasievo groups.

I don’t think these finds are much of a surprise based on what we already know, or need much explanation…

I would add that, once again, we have more proof that the movement of Okunevo and related ancient Siberian migrants from Central or North Asia will not be able to explain the presence of Uralic languages spread over North-East Europe and Scandinavia already during the Bronze Age.

Also interesting is to read in more peer-reviewed papers the idea of Late Indo-European speakers clearly linked to the expansion of patrilineally-related elite males marked by haplogroup R1b-L23, most likely since Eneolithic Khvalynsk/Repin cultures.


Native American genetic continuity and oldest mtDNA hg A2ah in the Andean region

Native American gene continuity to the modern admixed population from the Colombian Andes: Implication for biomedical, population and forensic studies by Criollo-Rayo et al., Forensic Sci Int Genet (2018), in press, corrected proof.

Abstract (emphasis mine):

Andean populations have variable degrees of Native American and European ancestry, representing an opportunity to study admixture dynamics in the populations from Latin America (also known as Hispanics). We characterized the genetic structure of two indigenous (Nasa and Pijao) and three admixed (Ibagué, Ortega and Planadas) groups from Tolima, in the Colombian Andes. DNA samples from 348 individuals were genotyped for six mitochondrial DNA (mtDNA), seven non-recombining Y-chromosome (NRY) region and 100 autosomal ancestry informative markers. Nasa and Pijao had a predominant Native American ancestry at the autosomal (92%), maternal (97%) and paternal (70%) level. The admixed groups had a predominant Native American mtDNA ancestry (90%), a substantial frequency of European NRY haplotypes (72%) and similar autosomal contributions from Europeans (51%) and Amerindians (45%). Pijao and nearby Ortega were indistinguishable at the mtDNA and autosomal level, suggesting a genetic continuity between them. Comparisons with multiple Native American populations throughout the Americas revealed that Pijao, had close similarities with Carib-speakers from distant parts of the continent, suggesting an ancient correlation between language and genes. In summary, our study aimed to understand Hispanic patterns of migration, settlement and admixture, supporting an extensive contribution of local Amerindian women to the gene pool of admixed groups and consistent with previous reports of European-male driven admixture in Colombia.

Ancestral uniparental haplogroups and diversity in Tolima. Geography of sampling locations. The
top and middle sections show the frequency of Native American mtDNA haplogroups and NRY lineages for all
populations. Gene diversity is shown below their respective pie chart. The lower part depicts the geography of the
region where the sampling sites of Ortega and Pijao are closely located in Tolima’s Magdalena river valley and
Ibague, Planadas and Nasa located in the Andes cordilleras (additional geographic details are shown in SF1).

Highlights from the paper:

  • MtDNA suggest a pre/post Columbian genetic continuity in the Colombian Andes.
  • Y-chromosome diversity follows a clinal gradient in the studied region.
  • Sex-biased/male-driven admixture process, involving Pijao women with European men.
  • Admixed closer to Indigenous resguardos have a higher Native American ancestry.

Also interesting is the recent paper Mitochondrial lineage A2ah found in a pre‐Hispanic individual from the Andean region, by Russo et al., in American Journal of Human Biology (2018), with an interesting sample from the Regional Developments II period (540 ± 60 BP).

Phylogeny of the A2ah mitochondrial lineage based on HVR I sequences. Both MaximumParsimony andMaximumLikelihood reconstructions led to the same typology. The tree was rooted with the RSRS. Sample ID: Cueva: Pukara de La Cueva, STACRUZ: Santa Cruz, BNI: Beni, BR: South-eastern Brazil, TobaChA: TobaGranChaco


Canid Y-chromosome phylogeny reveals distinct haplogroups among Neolithic European dogs


Open access Analysis of the canid Y-chromosome phylogeny using short-read sequencing data reveals the presence of distinct haplogroups among Neolithic European dogs, by Oetjens et al., BMC Genomics (2018) 19:350.

Interesting excerpts (modified for clarity, emphasis mine):


Canid mitochondrial phylogenies show that dogs and wolves are not reciprocally monophyletic. The mitochondrial tree contains four deeply rooted clades encompassing dogs and many grey wolf groups. These four clades form the basis of dog mitochondrial haplogroup assignment, known as haplogroups A-D. The time of the most recent common ancestor (TMRCA) of haplogroups A-D significantly predates estimates for domestication based on archeological and genetic evidence. Instead, these clades may represent variation present among the founding population of the dog lineage or the results of wolf introgressions into dog populations. The relative frequencies of mitochondria haplogroups are not stable over time, with changes reflecting processes such as drift, migration, and population growth. Although the mitochondria A and B haplogroups are most common in contemporary European dogs, surveys of ancient samples indicate that the majority of ancient European dogs carried the C or D mitochondrial haplotype. This apparent turnover in mitochondrial haplogroups may reflect the migration of a distinct dog population into Europe over the past 15,000 years.

Maximum likelihood phylogeny of 118 candid Y-chromosomes A Y-chromosome haplogroup tree produced by RAxML (8.1.13) using the GTR+ I model is depicted. Clades in the tree have been collapsed by haplogroup assignment. The number of samples within each collapsed node is indicated in parentheses next to the haplogroup assignment. For each node, percent bootstrap support out of 1000 iterations is indicated above the branch. The locations of three ancient samples, based on the presence of diagnostic mutations, are indicated in red


Using the variation discovered from sequence data, we applied a Bayesian MCMC approach to estimate TMRCAs for each haplotype group. Our estimated Y-chromosome mutation rate (3.07 × 10− 10 substitutions per site per year, relaxed clock model) falls within the range of a previous estimate by Ding et al. who used a similar calibration and estimate 1.35 × 10− 10– 4.31 × 10− 10 substitutions per site per year. The TMRCAs we estimated are substantially older than mitochondria phylogenies calibrated with tip dates of ancient samples, which report clade-specific TMRCAs < 25,000 years ago. We note that our Y-chromosome TMRCA estimates are extremely sensitive to our assumptions about the age of the root of the tree and should be interpreted with caution due to the uncertainty in this single calibration point. However, the relative ages of the branches and the chronological order of haplogroup divergences are more robust than the absolute estimated dates.

In general, the relationships between Y-chromosome haplogroups and autosomal ancestry we report are very similar to the relationships described in Shannon et al. As noted earlier, our dataset includes a subset of wolves with Y-chromosomes assigned to a dog Y-haplogroup. However, ADMIXTURE analysis does not indicate substantial recent dog ancestry in these samples, suggesting that their placement on the Y-chromosome phylogeny reflects variation in Y-chromosome haplotypes that was present in the ancestral population and therefore predates the domestication process or is the result of ancient introgression events whose signature of autosomal ancestry has been diluted.

The relationship between autosomal ancestry and Y-chromosome haplogroups Major groupings of canine ancestry are shown based on a principal components analysis of autosomal markers from 499 village dogs from Shannon et al. a. The geographic origin of each sample is indicated by color. The 104 male dogs used in this study are projected onto the resulting principal components and colored based on haplogroup (b). Village dogs from (a) are shown as transparent dots in (b)


Using sequencing data, we find that the estimated TMRCA of dog Y haplogroups predates dog domestication. We further reveal the placement of several wolf Y-chromosomes within deep branches of dog haplogroup clades. Using an expanded set of mutations diagnostic for each haplogroup, we find that distinct Y haplogroups were present in Europe during the Neolithic and that CTC, a ~ 4700 year old ancient dog from Germany has a Y-chromosome that shares diagnostic alleles with wolves found in India.

Other studies

On the same subject, you can read another recent study, bioRxiv preprint New Evidence of the Earliest Domestic Dogs in the Americas, by Perri et al. (2018); and also a recent, open access paper (see above featured image) Ancient European dog genomes reveal continuity since the Early Neolithic, by Botigué et al., Science Communications (2017).

While Proto-Indo-European- and Proto-Uralic-speakers had a close relationship with dogs (revealed in their reconstructed language and attributed archaeological cultures), I think it will be very difficult to ascertain any population movement based on them, unless there is a clear, well-established archaeological relationship between a specific culture and dog-breeding.

Nevertheless, I would say that this kind of studies are more likely to give some information related to these and other cultures than, for example, the study of honeybees in honey-hunting vs. beekeeping cultures (see e.g. The Complex Demographic History and Evolutionary Origin of the Western Honey Bee, Apis Mellifera, by Cridland, Tsutsui, and Ramírez GBE 2017), which was also related to the development of both PIE and PU cultures.

See also:

Post-Neolithic Y-chromosome bottleneck explained by cultural hitchhiking and competition between patrilineal clans

Open access study Cultural hitchhiking and competition between patrilineal kin groups explain the post-Neolithic Y-chromosome bottleneck, by Zeng, Aw, and Feldman, Nature Communications (2018).

Abstract (emphasis mine):

In human populations, changes in genetic variation are driven not only by genetic processes, but can also arise from cultural or social changes. An abrupt population bottleneck specific to human males has been inferred across several Old World (Africa, Europe, Asia) populations 5000–7000 BP. Here, bringing together anthropological theory, recent population genomic studies and mathematical models, we propose a sociocultural hypothesis, involving the formation of patrilineal kin groups and intergroup competition among these groups. Our analysis shows that this sociocultural hypothesis can explain the inference of a population bottleneck. We also show that our hypothesis is consistent with current findings from the archaeogenetics of Old World Eurasia, and is important for conceptions of cultural and social evolution in prehistory.

Relevant excerpts:

Tree of Y-chromosome genotypes from samples found among cultures with hunter-gatherer subsistence, and agropastoralist subsistence. The blue background represents hunter-gatherer subsistence while the green background represents agropastoralist subsistence. Letters in red circles match individuals from sites with their archaeological context. Note that R1b-P321 is synonymous with R1b-S116. Adapted from Figs. 3, 4, 5 and 6 of Kivisild67, with addition of information from Olalde et al.64. The vertical axis represents time; the position of branch points represent the ages of branch-defining mutations, with nomenclature and age from yfull (

Our hypothesis explains the bottleneck as a consequence of intergroup competition between patrilineal kin groups, which caused cultural hitchhiking between Y-chromosomes and cultural groups and reduction in Y-chromosomal diversity. Competition between demes can dramatically reduce genetic diversity within a population1, especially if the population is structured such that variation is greater between demes than within demes. Culturally transmitted kinship ideals and norms can cause homophilous sorting and limit interdemic gene flow, creating homogeneous demes that differ strongly from one another. Patrilineal corporate kin groups, with coresiding male group members descending from a common male ancestor, would produce such an effect on Y-chromosomes only, as patrilineal corporate kin groups generally coexist with female exogamy40, which would homogenize the mitochondrial gene pools of different groups41,42.

With intergroup competition between patrilineal corporate kin groups, two mechanisms would operate to reduce Y-chromosomal diversity. First, patrilineal corporate kin groups produce high levels of Y-chromosomal homogeneity within each social group due to common descent, as well as high levels of between-group variation. Second, the presence of such groups results in violent intergroup competition preferentially taking place between members of male descent groups, instead of between unrelated individuals. Casualties from intergroup competition then tend to cluster among related males, and group extinction is effectively the extinction of lineages.

There is evidence that other analogous situations involving gene-culture hitchhiking in culturally-defined social groups may have affected genetic diversity. Central Asian pastoralists, who are organized into patriclans, have high levels of intergroup competition and demonstrate ethnolinguistic and population-genetic turnover down into the historical period59. They also have a markedly lower diversity in Y-chromosomal lineages than nearby agriculturalists42,60. In fact, Central Asians are the only population whose male effective population size has not recovered from the post-Neolithic bottleneck; it remains disproportionately reduced, compared to female estimates using mtDNA4. Central Asians are also the only population to have star-shaped expansions of Y-chromosomes within the historical period, which may be due to competitive processes that led to the disproportionate political success of certain patrilineal clans60.

The simulation offers an interesting graphic. I had been thinking for some time about developing an interactive image with waves of expansion showing how only few haplogroups expand and thus their variability is reduced in successive migration waves, because a lot of people seemed not to be willing to accept this:

Schematic of the steps in the simulation, according to the order described in the algorithm. a (i) Patrilineal (PT) starting conditions, where cultural groups strictly determine haplogroup type. a (ii) The non-patrilineal (NPT) condition where they are perfectly uncorrelated. b The killing step, with a more (PT) and less (NPT) patrilineal starting condition. The number of deaths in each group is inversely related to group size. The blue cultural group goes extinct in both cases. This causes the haplogroup represented by the diamonds to go extinct in PT, but no haplogroup extinction occurs in NPT. c The mutation step, where a small number of individuals in the largest haplogroup change their haplogroup. d The regeneration step, where (i) is a replica of (b) PT (iii), and (d) (ii) shows how the original number of individuals before the killing step is restored by proportionally increasing the number of individuals in all cells. e Group fission step. Where an empty row occurs, the largest cultural group splits, and half the individuals form a new cultural group in the empty row. The step in which we remove cultural groups that are too small—between (c, d) (see Methods)—is not shown

You only have to imagine this process happening in many successive waves of expansion (external as well as internal to each culture) since the first Neolithic expansions in the steppe in the late-6th millennium BC, even before the formation of the Khvalynsk-Sredni Stog cultural-historical community, to understand what happened in the next thousands of years with evolving patrilineal clans and their distinct cultures.

The whole paper is an interesting read. It’s great to see sociology and genetics finally catch up and interact to develop more complex anthropological hypotheses.

The fact that this paper appears in mid-2018 and geneticists are beginning to discuss this only now when their statistical methods fail to explain the obvious (see David Reich’s recent interview) seems anachronistic, though, because all this was quite clear already in 2015 – at least for those who were looking for mainstream Yamna – Bell Beaker connections, instead of inventing new migration pathways to justify the results of certain statistical analyses

Anyway, better late than never.

Also, they use YFull estimates, which vindicates my use of them in the Indo-European demic diffusion model (2017). On the other hand, their use of these estimates right now in 2018 for R1a-M417 and R1b-M269 – when we know of a R1a-Z93 case much older than YFull’s estimated 5,000 YBP for this subclade, and possibly for R1b-L23, too, is the biggest pitfall in their temporal assessment, although the bottlenecks seen in Chalcolithic expansions seem to have indeed began during the Mesolithic-Neolithic transition in the steppe.

So, say goodbye (if you haven’t already) to dat fantasy ‘steppe people’ of mixed R1a/R1b descent cooperating with the same mixed steppe language, all represented by the Yamnaya™ ancestral component, and say hello to distinct, competing ethnolinguistic steppe groups during the Neolithic.


Decline of genetic diversity in ancient domestic stallions in Europe

Open access research article Decline of genetic diversity in ancient domestic stallions in Europe, by Wutke et al., Science (2018), 4(4):eaap9691.

Abstract (emphasis mine):

Present-day domestic horses are immensely diverse in their maternally inherited mitochondrial DNA, yet they show very little variation on their paternally inherited Y chromosome. Although it has recently been shown that Y chromosomal diversity in domestic horses was higher at least until the Iron Age, when and why this diversity disappeared remain controversial questions. We genotyped 16 recently discovered Y chromosomal single-nucleotide polymorphisms in 96 ancient Eurasian stallions spanning the early domestication stages (Copper and Bronze Age) to the Middle Ages. Using this Y chromosomal time series, which covers nearly the entire history of horse domestication, we reveal how Y chromosomal diversity changed over time. Our results also show that the lack of multiple stallion lineages in the extant domestic population is caused by neither a founder effect nor random demographic effects but instead is the result of artificial selection—initially during the Iron Age by nomadic people from the Eurasian steppes and later during the Roman period. Moreover, the modern domestic haplotype probably derived from another, already advantageous, haplotype, most likely after the beginning of the domestication. In line with recent findings indicating that the Przewalski and domestic horse lineages remained connected by gene flow after they diverged about 45,000 years ago, we present evidence for Y chromosomal introgression of Przewalski horses into the gene pool of European domestic horses at least until medieval times.

The frequencies of Y chromosome haplotypes started to change during the Late Bronze Age (1600–900 BCE).
Inferred temporal trajectories of haplotype frequencies. Each haplotype is displayed by a different color. The shaded area represents the 95% highest-density region. The trajectories were constructed taking the median values across frequencies from the simulations of the Bayesian posterior sample. The small chart represents the stacked frequencies; the amplitude of each colored area is proportional to the median haplotype frequencies (normalized) at a given time. The x and y axes of the small chart match those in the large one. Ka, thousands of years.

Interesting excerpts:

The first record of the modern domestic Y chromosome haplotype stems from two Bronze Age samples of similar age. Notably, both samples were found in two distantly located regions: present-day Slovakia (2000–1600 BCE, dated by archaeological context) and western Siberia (14C-dated: 1609–1436 cal. BCE). Although a very recent study proposes an oriental origin of this haplotype (14), we cannot determine the geographical origin of Y-HT-1 with certainty, because this haplotype has not been found thus far in predomestic or wild stallions. There are two possible scenarios: (i) Y-HT-1 emerged within the domestic population by mutation and (ii) Y-HT-1 was already present in wild horses and entered the domestic population either at the beginning of domestication (but initially restricted to Asian horses) or later by introgression (from wild Y-HT-1 carrying studs during the Iron Age). Crosses between domestic animals and their wild counterparts have been observed in several domestic species (15–18); thus, the simplest explanation would be that we missed Y-HT-1 in older samples because of limited geographical sampling. However, the estimated haplotype age is contemporary (Fig. 4) with the assumed starting point of horse domestication ~4000–3500 BCE (19), rendering it likely that Y-HT-1 originated within the domestic horse gene pool. Still, we cannot rule out definitively that it appeared before domestication.

Independent of its geographical origin, Y-HT-1 progressively replaced all other haplotypes—except for one additional lineage that is restricted to Yakutian horses (11). Considering our data, this trend in paternal diversity toward dominance of the modern lineage appears to start in the Bronze Age and becomes even more pronounced during the Iron Age. The Bronze Age was a time of large-scale human migrations across Eurasia (20–22), movements that were undoubtedly facilitated by the spread of horses as a means of transport and warfare. At that time, the western Eurasian steppes were inhabited by highly mobile cultures that largely relied on horses (20, 21, 23, 24). The genetic admixture of northern and central European humans with Caucasians/eastern Europeans did correlate with the spread of the Yamnaya culture from the Pontic-Caspian steppe (25), an area that has repeatedly been suggested as the center of horse domestication (19, 26, 27). Given the importance of domestic horses, it appears that deliberate selection/rejection of certain stallions by these people might have contributed to the loss of paternal diversity. The spread of humans out of this region might also have resulted in the spread of Y-HT-1 from Asia to Europe. This scenario also agrees with recent findings that the low male diversity of extant horses is not caused by recruiting only a limited number of stallions during early domestication (13).

Decline of paternal diversity began in Asia.
Maps displaying age, locality, and haplotype (different colors) of each successfully genotyped sample.

The presence of the Y chromosome haplotype carried by present-day Przewalski horses (Y-HT-2) in early domestic stallions and a European wild horse (Pie05; table S2) could be the result of introgression of Przewalski stallions. Although the original distribution of the Przewalski horse is unknown, it was probably much larger than that of the relict population in Mongolia that produced modern Przewalski horses and might even have extended into Central Europe. However, it is also possible that either Przewalski horses were among the initially domesticated horses or that Y-HT-2 occurred both in Przewalski horses and in those wild horses that are the ancestors of domestic horses, based on autosomal DNA data (30). Regardless of how Y-HT-2 entered the domestic gene pool, it was eventually lost, as were all haplotypes except Y-HT-1. In our sample set, Y-HT-2 was undetectable as early as the third time bin. However, it is possible that Y-HT-2 may have been present during this time period, but with a frequency below 0.11 (with 95% probability). The inferred time trajectories for Y-HT-2 frequencies suggest that it could nevertheless have persisted at very low frequencies until the Middle Ages (Fig. 3). On the basis of these simulations, this finding could be interpreted as a relic of this haplotype’s formerly higher frequency in the domestic horse gene pool. It is also possible that the presence of this haplotype could be the result of mating a wild stallion with a domestic mare, a frequently reported breeding practice when wild horses were still widely distributed. However, a significant contribution of the Przewalski horse to the gene pool of modern domestic horses has been almost ruled out by recent genomic studies (13, 31, 32).

Stallion lineages through time.
Temporal haplotype network of the four detected Y chromosome haplotypes. Age of the samples indicated by multiple layers separated by color; vertical lines connecting the haplotypes of consecutive layers/ages represent which haplotype was transferred into a later/younger period. Numbers constitute the respective number of individuals showing this particular haplotype for that period. Prz, Przewalski; Dom, domestic.


Y-DNA relevant in the postgenomic era, mtDNA study of Iron Age Italic population, and reconstructing the genetic history of Italians


Open Access Annals of Human Biology (2018), Volume 45, Issue 1, with the title Human population genetics of the Mediterranean.

Among the most interesting articles (emphasis mine):

Iron Age Italic population genetics: the Piceni from Novilara (8th–7th century BC), by Serventi, Panicucci, Bodega, et al.

Background: Archaeological data provide evidence that Italy, during the Iron Age, witnessed the appearance of the first communities with well defined cultural identities. To date, only a few studies report genetic data about these populations and, in particular, the Piceni have never been analysed.

Aims: To provide new data about mitochondrial DNA (mtDNA) variability of an Iron Age Italic population, to understand the contribution of the Piceni in shaping the modern Italian gene pool and to ascertain the kinship between some individuals buried in the same grave within the Novilara necropolis.

Subjects and methods: In a first set of 10 individuals from Novilara, we performed deep sequencing of the HVS-I region of the mtDNA, combined with the genotyping of 22 SNPs in the coding region and the analysis of several autosomal markers.

Results: The results show a low nucleotide diversity for the inhabitants of Novilara and highlight a genetic affinity of this ancient population with the current inhabitants of central Italy. No family relationship was observed between the individuals analysed here.

Conclusions: This study provides a preliminary characterisation of the mtDNA variability of the Piceni of Novilara, as well as a kinship assessment of two peculiar burials.

Reconstructing the genetic history of Italians: new insights from a male (Y-chromosome) perspective, by Grugni, Raveani, Mattioli, et al.

Background: Due to its central and strategic position in Europe and in the Mediterranean Basin, the Italian Peninsula played a pivotal role in the first peopling of the European continent and has been a crossroad of peoples and cultures since then.

Aim: This study aims to gain more information on the genetic structure of modern Italian populations and to shed light on the migration/expansion events that led to their formation.

Subjects and methods: High resolution Y-chromosome variation analysis in 817 unrelated males from 10 informative areas of Italy was performed. Haplogroup frequencies and microsatellite haplotypes were used, together with available data from the literature, to evaluate Mediterranean and European inputs and date their arrivals.

Results: Fifty-three distinct Y-chromosome lineages were identified. Their distribution is in general agreement with geography, southern populations being more differentiated than northern ones.

Conclusions: A complex genetic structure reflecting the multifaceted peopling pattern of the Peninsula emerged: southern populations show high similarity with those from the Middle East and Southern Balkans, while those from Northern Italy are close to populations of North-Western Europe and the Northern Balkans. Interestingly, the population of Volterra, an ancient town of Etruscan origin in Tuscany, displays a unique Y-chromosomal genetic structure.

Frequencies of the main Y-chromosome haplogroups E1b, J2 and R1b and their sub-clades in the 10 analysed Italian population samples. Black sectors in the primary pies are proportional to the frequency of the main haplogroup in each population. Coloured sectors in the secondary pies are proportional to the frequencies of sub-haplogroups within the relative main haplogroup.

Mitochondrial variability in the Mediterranean area: a complex stage for human migrations, by De Angelis, Scorrano, Martínez-Labarga, et al.

Context: The Mediterranean area has always played a significant role in human dispersal due to the large number of migratory events contributing to shape the cultural features and the genetic pool of its populations.

Objective: This paper aims to review and diachronically describe the mitogenome variability in the Mediterranean population and the main demic diffusions that occurred in this area over time.

Methods: Frequency distributions of the leading mitochondrial haplogroups have been geographically and chronologically evaluated. The variability of U5b and K lineages has been focussed to broaden the knowledge of their genetic histories.

Results: The mitochondrial genetic makeup of Palaeolithic hunter-gatherers is poorly defined within the extant Mediterranean populations, since only a few traces of their genetic contribution are still detectable. The Neolithic lineages are more represented, suggesting that the Neolithic revolution had a marked effect on the peopling of the Mediterranean area. The largest effect, however, was provided by historical migrations.

Conclusion: Although the mitogenome variability has been widely used to try and clarify the evolution of the Mediterranean genetic makeup throughout almost 50 000 years, it is necessary to collect whole genome data on both extinct and extant populations from this area to fully reconstruct and interpret the impact of multiple migratory waves and their cultural and genetic consequences on the structure of the Mediterranean populations.

Major migratory routes with the associated mtDNA haplogroups for the Upper Palaeolithic (solid lines) and the Neolithic (dashed lines) chronologies. Other hypothetical migratory routes are presented with dotted lines (see text for more details).

Mediterranean Y-chromosome 2.0—why the Y in the Mediterranean is still relevant in the postgenomic era, by Larmuseau & Ottoni.

Context: Due to its unique paternal inheritance, the Y-chromosome has been a highly popular marker among population geneticists for over two decades. Recently, the advent of cost-effective genome-wide methods has unlocked information-rich autosomal genomic data, paving the way to the postgenomic era. This seems to have announced the decreasing popularity of investigating Y-chromosome variation, which provides only the paternal perspective of human ancestries and is strongly influenced by genetic drift and social behaviour.

Objective: For this special issue on population genetics of the Mediterranean, the aim was to demonstrate that the Y-chromosome still provides important insights in the postgenomic era and in a time when ancient genomes are becoming exponentially available.

Methods: A systematic literature search on Y-chromosomal studies in the Mediterranean was performed.

Results: Several applications of Y-chromosomal analysis with future opportunities are formulated and illustrated with studies on Mediterranean populations.

Conclusions: There will be no reduced interest in Y-chromosomal studies going from reconstruction of male-specific demographic events to ancient DNA applications, surname history and population-wide estimations of extra-pair paternity rates. Moreover, more initiatives are required to collect population genetic data of Y-chromosomal markers for forensic research, and to include Y-chromosomal data in GWAS investigations and studies on male infertility.

Two-dimensional plot of the PCA of Y-chromosomal haplogroup frequencies of modern populations from Europe, the Near and Middle East and North Africa. Symbols are as in the legend. The inset shows the plot of factor coordinates of the variables used.

We are clearly seeing in the latest genomic papers that Y-DNA was indeed extremely important to assess ancient population movements.

See also: