Recent Africa origin with hybridization, and back to Africa 70,000 years ago


Open access Carriers of mitochondrial DNA macrohaplogroup L3 basal lineages migrated back to Africa from Asia around 70,000 years ago, by Cabrera et al. BMC Evol Biol (2018) 18(98).

Abstract (emphasis mine):


The main unequivocal conclusion after three decades of phylogeographic mtDNA studies is the African origin of all extant modern humans. In addition, a southern coastal route has been argued for to explain the Eurasian colonization of these African pioneers. Based on the age of macrohaplogroup L3, from which all maternal Eurasian and the majority of African lineages originated, the out-of-Africa event has been dated around 60-70 kya. On the opposite side, we have proposed a northern route through Central Asia across the Levant for that expansion and, consistent with the fossil record, we have dated it around 125 kya. To help bridge differences between the molecular and fossil record ages, in this article we assess the possibility that mtDNA macrohaplogroup L3 matured in Eurasia and returned to Africa as basal L3 lineages around 70 kya.


The coalescence ages of all Eurasian (M,N) and African (L3 ) lineages, both around 71 kya, are not significantly different. The oldest M and N Eurasian clades are found in southeastern Asia instead near of Africa as expected by the southern route hypothesis. The split of the Y-chromosome composite DE haplogroup is very similar to the age of mtDNA L3. An Eurasian origin and back migration to Africa has been proposed for the African Y-chromosome haplogroup E. Inside Africa, frequency distributions of maternal L3 and paternal E lineages are positively correlated. This correlation is not fully explained by geographic or ethnic affinities. This correlation rather seems to be the result of a joint and global replacement of the old autochthonous male and female African lineages by the new Eurasian incomers.


These results are congruent with a model proposing an out-of-Africa migration into Asia, following a northern route, of early anatomically modern humans carrying pre-L3 mtDNA lineages around 125 kya, subsequent diversification of pre-L3 into the basal lineages of L3, a return to Africa of Eurasian fully modern humans around 70 kya carrying the basal L3 lineages and the subsequent diversification of Eurasian-remaining L3 lineages into the M and N lineages in the outside-of-Africa context, and a second Eurasian global expansion by 60 kya, most probably, out of southeast Asia. Climatic conditions and the presence of Neanderthals and other hominins might have played significant roles in these human movements. Moreover, recent studies based on ancient DNA and whole-genome sequencing are also compatible with this hypothesis.


You can also read the recent interesting open access review How did Homo sapiens evolve? by Julia Galway-Witham, Chris Stringer, Science (2018) 360:6395 1296-1298.


Reconstruction of Y-DNA phylogeny helps also reconstruct Tibeto-Burman expansion


New paper (behind paywall) Reconstruction of Y-chromosome phylogeny reveals two neolithic expansions of Tibeto-Burman populations by Wang et al. Mol Genet Genomics (2018).

Interesting excerpts:

Archeological studies suggest that a subgroup of ancient populations of the Miaodigou culture (~ 6300–5500 BP) moved westward to the upper stream region of the Yellow River and created the Majiayao culture (~ 5400–4900 BP) (Liu et al. 2010), which was proposed to be the remains of direct ancestors of Tibeto-Burman populations (Sagart 2008). On the other hand, Han populations, the other major descendant group of the Yang-Shao culture (~ 7000–5500 BP), are composed of many other sub-lineages of Oα-F5 and extremely low frequencies of D-M174 (Additional files 1: Figure S1; Additional files 2: Table S1). Therefore, we propose that Oα-F5 may be one of the dominant paternal lineages in ancient populations of Yang-Shao culture and its successors.

In this study, we demonstrated that both sub-lineages of D-M174 and Oα-F5 are founding paternal lineages of modern Tibeto-Burman populations. The genetic patterns suggested that the ancestor group of modern Tibeto-Burman populations may be an admixture of two distinct ancient populations. One of them may be hunter–gatherer populations who survived on the plateau since the Paleolithic Age, represented by varied sub-lineages of sub-lineages of D-M174. The other one was comprised of farmers who migrated from the middle Yellow River basin, represented by sub-lineages of Oα-F5. In general, the genetic evidence in this study supports the conclusion that the appearance of the ancestor group of Tibeto-Burman populations was triggered by the Neolithic expansion from the upper-middle Yellow River basin and admixture with local populations on the Tibetan Plateau (Su et al. 2000).

Simplified phylogenetic tree showing sample locations. The size of the circle for each sampling location corresponds to the number of samples

Two neolithic expansion origins of Tibeto‑Burman populations

We also observed significant differences in the paternal gene pool of different subgroups of Tibeto-Burman populations. Haplogroup D-M174 contributed ~ 54% percent in a sampling of 2354 Tibetan males throughout the Tibetan Plateau (Qi et al. 2013). Previous studies have also found high frequencies of D-M174 in other populations on the Tibetan Plateau (Shi et al. 2008), including Sherpa (Lu et al. 2016) and Qiang (Wang et al. 2014). In contrast, haplogroup D-M174 is rare or absent from Tibeto-Burman populations from Northeast India and Burma (Shi et al. 2008). In populations of the Ngwi-Burmese language subgroup, the average frequencies of haplogroup D-M174 are ~ 5% (Dong et al. 2004; Peng et al. 2014). Furthermore, we found that lineage Oα1c1b-CTS5308 is mainly found in Tibeto-Burman populations from the Tibetan Plateau. In contrast, lineage Oα1c1a-Z25929 was found in Tibeto-Burman populations from Northeast India, Burma, and the Yunan and Hunan provinces of China (Additional files 1: Figure S1; Additional files 2: Table S1). In general, enrichment of lineage Oα1c1b- CTS5308 and high frequencies of D-M174 can be found in most Tibeto-Burman populations on the Tibetan Plateau and adjacent regions, whereas Tibeto-Burman populations from other regions tend to have lineage Oα1c1a-Z25929 and a little to no percentage of D-M174.

The inconsistent pattern we observed in the paternal gene pool of modern Tibeto-Burman populations suggested that there may be two distinct ancestor groups (Fig. 3). The proposed migration routes shown in Fig. 3 are somewhat different from those proposed by Su et al. (2000). According to our age estimation, most of the D1a2a-P47 samples belong to sub-lineage PH116, a young lineage that emerged ~ 2500 years ago (95% CI 1915–3188 years). On the other hand, continuous differentiation can be observed on a phylogenetic tree of lineages D1a1a1a1-PH4979 and D1a1a1a2-Z31591 since 6000 years ago. Therefore, we proposed that a group of ancient populations may have moved to the upper basin of the Yellow River and admixed intensively with local populations with high frequencies of haplogroup D-M174, including its sub-lineage D1a2a-P47 (Fig. 3). This ancestor group eventually gave birth to modern Tibeto-Burman populations on the Tibetan Plateau and adjacent regions. The other ancestor group moved toward the southwest and finally reached South East Asia (Burma and other locations) and the northeastern part of India (Fig. 3). This ancestor group may have had no or a minor admixture of D-M174 in their paternal gene pool.

Two proposed ancestor groups and migration routes for Tibeto-Burman populations

Long‑term admixture before expansion to a high‑altitude region

It is interesting to investigate the time gap between the appearance of Neolithic cultures in the northeastern part of the Tibetan Plateau and the final phase of human expansion across the Tibetan Plateau. The Majiayao culture (~ 5400–4900 BP) is the earliest Neolithic culture in the northeastern part of the Tibetan Plateau (Liu et al. 2010). However, previous archeological study has suggested that the final phase of diffusion into the high-altitude area of the Tibetan Plateau occurred at approximately 3.6 kya (Chen et al. 2015). Our genetic evidence in this study is consistent with this scenario based on archeological evidence. Based on Y-chromosome analysis in this study, many unique lineages of Tibeto-Burman populations emerged between 6000 years ago and 2500 years ago (Additional files 3: Table S2). The most recent common age of D1a2-PH116, a sub-lineage that spread throughout the Tibetan Plateau, is only 2500 years ago.

We propose that there may be two important factors for the observed age gap. First, living in a high-altitude environment may require some crucial physical characteristics that were lacking from Neolithic immigrants from the middle Yellow River Basin. Intense genetic admixture with local people who had survived on the Tibetan Plateau since the Paleolithic Age may have actually guaranteed the expansion of humans across the Tibetan Plateau. Therefore, a long period of admixture, lasting from 5.4 to 3.6 kya, may be necessary for the appearance of a population with beneficial genetic variants that was genetically adapted to the high-altitude environment. Second, technological innovations, such as the domestication of wheat and highland barley (Chen et al. 2015), establishment of yak pastoralism (Rhode et al. 2007), and introduction of other culture elements in the Bronze Age (Ma et al. 2016), are also important factors that facilitated permanent settlements with large population sizes in the high-altitude area of the Tibetan Plateau.


Close inbreeding and low genetic diversity in Inner Asian human populations despite geographical exogamy


Open access Close inbreeding and low genetic diversity in Inner Asian human populations despite geographical exogamy, by Marchi et al. Scientific Reports (2018) 8:9397.

Abstract (emphasis mine):

When closely related individuals mate, they produce inbred offspring, which often have lower fitness than outbred ones. Geographical exogamy, by favouring matings between distant individuals, is thought to be an inbreeding avoidance mechanism; however, no data has clearly tested this prediction. Here, we took advantage of the diversity of matrimonial systems in humans to explore the impact of geographical exogamy on genetic diversity and inbreeding. We collected ethno-demographic data for 1,344 individuals in 16 populations from two Inner Asian cultural groups with contrasting dispersal behaviours (Turko-Mongols and Indo-Iranians) and genotyped genome-wide single nucleotide polymorphisms in 503 individuals. We estimated the population exogamy rate and confirmed the expected dispersal differences: Turko-Mongols are geographically more exogamous than Indo-Iranians. Unexpectedly, across populations, exogamy patterns correlated neither with the proportion of inbred individuals nor with their genetic diversity. Even more surprisingly, among Turko-Mongols, descendants from exogamous couples were significantly more inbred than descendants from endogamous couples, except for large distances (>40 km). Overall, 37% of the descendants from exogamous couples were closely inbred. This suggests that in Inner Asia, geographical exogamy is neither efficient in increasing genetic diversity nor in avoiding inbreeding, which might be due to kinship endogamy despite the occurrence of dispersal.

Interesting excerpts:

Two cultural groups, which matrimonial systems are reported to differ, coexist in Inner Asia: Turko-Mongols are described as mainly exogamous while Indo-Iranians are thought to be mainly endogamous45. However, it is not always clear if exogamy refers to clan (ethnic) or village (geographical) exogamy. Here, we used a dataset of 16 populations representing 11 different ethnic groups from both cultural groups and we quantified geographical exogamy rates and distances in each population. Using an empirical threshold of 4 km, we confirmed that matrimonial behaviours differ as described in the literature, even though we found some exceptions: three Turko-Mongol populations (out of 14) have less than 50% exogamy, whereas one Indo-Iranian population (out of four) has more than 50% exogamy.(…).

Geographical distances between the birth places of couples in Turko-Mongols and Indo-Iranians. The geographical distances are plotted in log scale (km). Their densities are represented by population (dashed lines) or for the Indo-Iranian and Turko-Mongol groups (solid lines). We represented the average distances within couples per population using a Kernel’s density estimate implemented in R with a smoothing bandwidth of 0.2. See Supplementary Table 1B for population codes.

An additional important result of our study is that geographical distances are not negatively correlated with inbreeding, as could have been expected under an isolation-by-distance model65. Interestingly, a recent study based on a large genealogical dataset, collected across Western Europe and North America, and including birth places information, similarly found an absence of correlation between relatedness and the distance between couples, for the cohorts born before 185066. Our analyses within present-day Turko-Mongols reveal more specifically that the structure of the relationship between geographical distance and mating choice inbreeding is not linear, but rather tends to be bell-shaped, and thus cannot be correctly assessed with a single correlation test. Indeed, descendants from parents born 4 to 40 km apart are more inbred than descendants from endogamous couples (≤4 km) or from long-range exogamous ones (>40 km). As a consequence, close inbreeding exists despite geographical exogamy, and about a third of descendants from exogamous couples are inbred.

These results, in addition to those obtained by [Kaplanis et al. 2018]66, highlight the importance of using geographic distances rather than exogamy rates to characterize the impact of exogamy on inbreeding, as already described when studying patrilocality67. Indeed, when we compare mating choice inbreeding patterns for descendants from exogamous and endogamous couples defined for thresholds of 4, 10, 20 and 30 km, we find no significant differences (for number and total length of class C-ROHs and F-Median coefficient: MWU test p-values > 0.1). We only detect significantly lower values in descendants from exogamous couples for larger distances above 40 and 50 km (p-values < 0.03).

Genetic diversity (A) and inbreeding patterns (B,C) within populations. Grey lines in (B) represent inbreeding values corresponding to second-cousins and first-cousins. The grey line in (C) represents the homozygosity population baseline expected under panmixia. The number of samples per population is indicated between parentheses. See Supplementary Table 1B for population codes.

Our results also challenge the intuition that exogamy necessarily increases the genetic diversity within a population and therefore reduces drift inbreeding. Indeed, we found that Turko-Mongol populations have a lower genetic diversity (as measured by the mean haplotypic heterozygosity) and more intermediate ROHs associated with drift inbreeding than those of Indo-Iranians despite higher exogamous rates. (…)

Overall, this research sheds light on mating choice preferences: we showed that two thirds of partners that have not dispersed did mate with unrelated individuals, and that drift and mating choice inbreeding is variable, even among close-by populations. We also provide new insights into the relationship between dispersal and inbreeding in humans, based on genetic data, and demonstrate that geographical exogamy is not necessarily negatively associated with mating choice inbreeding, but rather can have a more complex non-linear relationship. Contrary to the common situation in many animals, this finding suggests that Inner Asian human populations who practise exogamy at small geographical scales might be focused on alliance strategies that result in kinship endogamy. (…)


Bantu distinguished from Khoe by uniparental markers, not genome-wide autosomal admixture


The role of matrilineality in shaping patterns of Y chromosome and mtDNA sequence variation in southwestern Angola, by Oliveira et al. bioRxiv (2018).

Interesting excerpts (emphasis mine):

The origins of NRY diversity in SW Angola

In accordance with our previous mtDNA study9, the present NRY analysis reveals a major division between the Kx’a-speaking !Xun and the Bantu-speaking groups, whose paternal genetic ancestry does not display any old remnant lineages, or a clear link to pre-Bantu eastern African migrants introducing Khoe-Kwadi languages and pastoralism into southern Africa (cf. 15). This is especially evident in the distribution of the eastern African subhaplogroup E1b1b1b2b29, which reaches the highest frequency in the !Xun (25%) and not in the formerly Kwadi-speaking Kwepe (7%). This observation, together with recent genome-wide estimates of 9-22% of eastern African ancestry in other Kx’a and Tuu-speaking groups35, suggests that eastern African admixture was not restricted to present-day Khoe-Kwadi speakers. Alternatively, it is likely that the dispersal of pastoralism and Khoe-Kwadi languages involved a series of punctuated contacts that led to a wide variety of cultural, genetic and linguistic outcomes, including possible shifts to Khoe-Kwadi by originally Bantu-speaking peoples36.

Although traces of an ancestral pre-Bantu population may yet be found in autosomal genome-wide studies, the extant variation in both uniparental markers strongly supports a scenario in which all groups of the Angolan Namib share most of their genetic ancestry with other Bantu groups but became increasingly differentiated within the highly stratified social context of SW African pastoral societies11.

Y chromosome phylogeny, haplogroup distribution and map of the sampling locations. The phylogenetic tree was reconstructed in BEAST based on 2,379 SNPs and is in accordance with the known Y chromosome topology. Main haplogroup clades and their labels are shown with different colors. Age estimates are reported in italics near each node, with the TMRCA of main haplogroups shown with their corresponding color. A map of the sampling locations, re-used with permission from Oliveira et al. (2018) 9, is shown on the bottom left, and the haplogroup distribution per population is shown on the bottom right, with color-coding corresponding to the phylogenetic tree.

The influence of socio-cultural behaviors on the diversity of NRY and mtDNA

A comparison of the NRY variation with previous mtDNA results for the same groups 9 identifies three main sex-specific patterns. First, gene flow from the Bantu into the !Xun is much higher for male than for female lineages (31% NRY vs. 3% mtDNA), similar to the reported male-biased patterns of gene flow from Bantu to Khoisan-speaking groups33, and from non-Pygmies to Pygmies in Central Africa 37. A comparable trend, involving exclusive introgression of NRY eastern African lineages into the !Xun (25%) was also found. (…)

Secondly, the levels of intrapopulation diversity in the Bantu-speaking peoples from the Namib were found to be consistently higher for mtDNA than for the NRY, reflecting the marked association between the Bantu expansion and the relatively young NRY E1b1a1a1 haplogroup, which has no parallel in mtDNA25,39. (…)

In the context of the Bantu expansions, these patterns have been mostly interpreted as the result of polygyny and/or higher levels of assimilation of females from resident forager communities38,40. However, most groups from the Angolan Namib are only mildly polygynous11 and ethnographic data suggest that the actual rates of polygyny in many populations may be insufficient to significantly reduce Nem2,41. In addition, the finding of a large Nef/ Nem ratio in the Himba (Fig. S5), who have almost no Khoisan-related mtDNA lineages9, indicates that female biased introgression cannot fully explain the observed patterns.

An alternative explanation may be sought in the prevailing matrilineal descent rules, which might have created a sex-specific structuring effect, similar to that proposed for patrilineal groups from Central Asia (…)

Bayesian skyline plots (BSP) of effective population size change through time, based on mtDNA (red) and the NRY (black). Thick lines show the mean estimates and dashed lines show the 95% HPD intervals. The vertical line highlights the 2 ky before present mark. Effective sizes are plotted on a log scale. Generation times of 25 and 31 years were assumed for mtDNA and the NRY, respectively32.

The third important sex-specific pattern observed in this study is the much lower amount of between-group differentiation for NRY than for mtDNA among Bantu-speaking populations (4.4% NRY vs. 20.2% mtDNA), in spite of the patrilocal residence patterns of all ethnic groups (Table S5). This difference can hardly be explained by unequal levels of introgression of “Khoisan” mtDNA lineages into the Bantu, since the percentage of mtDNA variation remains high (18.8%) when the Kuvale, who have high frequencies of “Khoisan”-related mtDNA, are excluded from the comparisons. It therefore seems more plausible that differentiation is higher in the mtDNA simply because there is more ancestral mtDNA than NRY variation that can be sorted among different populations (see 45). Moreover, due to the matriclanic organization of all Bantu-speaking communities, factors enhancing inter-group differentiation, like kin-structured migration and kin-structured founder effects46, would have been restricted to mtDNA. Finally, it is also likely that the discrepancy between among-group divergence of mtDNA and NRY might have been influenced by higher migration rates in males than females. In fact, although all Bantu-speaking populations have patrilocal residence patterns, the observance of endogamy rules severely constrains the between-group mobility of females. In this context, the children from extramarital unions involving members from different populations tend to be raised in the mother’s group, effectively increasing male versus female migration rates. Moreover, it is likely that, in the highly hierarchized setting of the Namib, most intergroup extramarital unions would involve men from dominant groups and women from peripatetic communities. This hypothesis is indirectly supported by the finding that in NRY-based clusters (but not in mtDNA) pastoralist populations are grouped together with peripatetic communities that share their cultural traits (Figs. S6 and 3b), suggesting that migration of NRY lineages follows a path that is similar to horizontally transmitted cultural features.


Complex history of dog origins and translocations in the Pacific revealed by ancient mitogenomes


Open access Complex history of dog (Canis familiaris) origins and translocations in the Pacific revealed by ancient mitogenomes, by Creig et al., Scientific Reports (2018).


Archaeological evidence suggests that dogs were introduced to the islands of Oceania via Island Southeast Asia around 3,300 years ago, and reached the eastern islands of Polynesia by the fourteenth century AD. This dispersal is intimately tied to human expansion, but the involvement of dogs in Pacific migrations is not well understood. Our analyses of seven new complete ancient mitogenomes and five partial mtDNA sequences from archaeological dog specimens from Mainland and Island Southeast Asia and the Pacific suggests at least three dog dispersal events into the region, in addition to the introduction of dingoes to Australia. We see an early introduction of dogs to Island Southeast Asia, which does not appear to extend into the islands of Oceania. A shared haplogroup identified between Iron Age Taiwanese dogs, terminal-Lapita and post-Lapita dogs suggests that at least one dog lineage was introduced to Near Oceania by or as the result of interactions with Austronesian language speakers associated with the Lapita Cultural Complex. We did not find any evidence that these dogs were successfully transported beyond New Guinea. Finally, we identify a widespread dog clade found across the Pacific, including the islands of Polynesia, which likely suggests a post-Lapita dog introduction from southern Island Southeast Asia.

A map of Southeast Asia and the Pacific showing the source location of the specimens and associated haplogroups (assignment to haplogroup follows Duleba and colleagues) and the median-joining network. The boundary between Near and Remote Oceania is also shown. Symbols identify the type of sequence: filled circle, ancient mitogenome; half circle, partial ancient sequence; hollow circle, modern mitogenome. Node colours represent the haplogroup, grey, A; red, A2b2, green, A2b3; yellow, A4’5; blue, B.


The dispersal of dogs across the Pacific is inseparably linked to the relationships between dogs and people. Unlike movement across continental landmasses, Pacific dogs must have been transported by people across the waters that separate islands. The ancient mitogenomes sequenced from archaeological dog specimens presented here offer a novel series of individual insights into the history of dog translocation from Southeast Asia as it occurred prior to the influence of modern European dog breeds. We generated seven mitogenomes and five partial sequences from ancient MSEA, ISEA and Pacific dogs, and four modern dingoes. Despite the small sample size, our results reveal levels of complexity and discontinuity in the introduction and movement of dogs, which are mirrored in the archaeological and linguistic evidence, suggesting at least three introductions of dogs to the wider Pacific region, in addition to the earlier appearance of the dingo in Australia. Further mtDNA studies of ancient dogs and modern village populations throughout the region may contribute additional data that can be used to evaluate these hypothesised dispersals. Autosomal and Y-chromosome analyses also have the potential to generate additional information about dog dispersal, which could reveal different dispersal signatures based on sex, or phenotypic characteristics, though the environmental conditions in the region are not particularly conducive to aDNA preservation.

Our molecular genetic analyses reveal one of the earliest dogs present in ISEA around 3,000 years ago from Timor-Leste possesses a mtDNA lineage not found elsewhere in the region. We also found similarities between mtDNA of modern dingoes and NGSDs and an ancient Taiwanese sequence, which supports previous observations about possible links between Y-chromosome markers of modern dingoes and a modern Taiwanese sample. More work is required to address whether these connections reflect the genetic diversity of a shared ancestral population in mainland China, or attest to a currently unknown dispersal event linking the two populations. Archaeological evidence for the introduction of dogs to Oceania as part of the LCC is extremely limited. Nonetheless, we demonstrate that mitogenomes from dogs in terminal Lapita and post-Lapita levels of archaeological sites along the south coast of mainland New Guinea also show affinities with an Iron Age dog specimen from Taiwan, raising the possibility of at least one introduction of dogs during Austronesian expansions ultimately from the north. Finally, we have identified a major late introduction of dogs across the islands of Oceania beginning around 2,000 years ago, which appears to have originated in MSEA, not Taiwan, and culminated in the establishment of dog populations in initial colonisation-era sites throughout East Polynesia.

Molecular phylogenetic analysis by maximum likelihood method, implemented in MEGA71. The evolutionary history shown inferred by using the maximum likelihood method based on the Hasegawa-Kishino-Yano model. The tree with the highest log likelihood (-25257.5243) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. Initial tree(s) for the heuristic search were obtained automatically by applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using the Maximum Composite Likelihood (MCL) approach, and then selecting the topology with superior log likelihood value. A discrete Gamma distribution was used to model evolutionary rate differences among sites (5 categories (+G, parameter = 0.0500)). The rate variation model allowed for some sites to be evolutionarily invariable ([+I], 0.0010% sites). The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 45 nucleotide sequences. There were a total of 16774 positions in the final dataset.

Also related, open access Elucidating biogeographical patterns in Australian native canids using genome wide SNPs, by Cairns et al., PLOS One (2018).


Dingoes play a strong role in Australia’s ecological framework as the apex predator but are under threat from hybridization and agricultural control programs. Government legislation lists the conservation of the dingo as an important aim, yet little is known about the biogeography of this enigmatic canine, making conservation difficult. Mitochondrial and Y chromosome DNA studies show evidence of population structure within the dingo. Here, we present the data from Illumina HD canine chip genotyping for 23 dingoes from five regional populations, and five New Guinea Singing Dogs to further explore patterns of biogeography using genome-wide data. Whole genome single nucleotide polymorphism (SNP) data supported the presence of three distinct dingo populations (or ESUs) subject to geographical subdivision: southeastern (SE), Fraser Island (FI) and northwestern (NW). These ESUs should be managed discretely. The FI dingoes are a known reservoir of pure, genetically distinct dingoes. Elevated inbreeding coefficients identified here suggest this population may be genetically compromised and in need of rescue; current lethal management strategies that do not consider genetic information should be suspended until further data can be gathered. D statistics identify evidence of historical admixture or ancestry sharing between southeastern dingoes and South East Asian village dogs. Conservation efforts on mainland Australia should focus on the SE dingo population that is under pressure from domestic dog hybridization and high levels of lethal control. Further data concerning the genetic health, demographics and prevalence of hybridization in the SE and FI dingo populations is urgently needed to develop evidence based conservation and management strategies.

Principal components analysis (PCA) based upon filtered whole genome SNP genotypes (58,512 sites) for 23 dingoes, 5 NGSD, 8 Borneo village dogs, 9 Vietnam village dogs, 10 Portugal village dogs and 8 Australian cattle dogs (‘Dataset B’). Colours represent population clusters: red for SE dingoes, purple for FI dingoes, blue for NW dingoes, dark green for NGSD, light green for Borneo village dogs, orange for Vietnam village dogs, yellow for Portugal village dogs and grey for Australian cattle dogs. (A) PC 1 versus PC 2. (B) PC 1 versus PC 3.

As I said in a previous post, the study of dogs may be useful to trace population migrations and to assess strong cultural contacts. Especially, as in this case, when crossbreeding among cultures is not easy…


Yamna/Afanasevo elite males dominated by R1b-L23, Okunevo brings ancient Siberian/Asian population


Open access paper New genetic evidence of affinities and discontinuities between bronze age Siberian populations, by Hollard et al., Am J Phys Anthropol. (2018) 00:1–11.

NOTE. This seems to be a peer-reviewed paper based on a more precise re-examination of the samples from Hollard’s PhD thesis, Peuplement du sud de la Sibérie et de l’Altaï à l’âge du Bronze : apport de la paléogénétique (2014).

Interesting excerpts:

Afanasevo and Yamna

The Afanasievo culture is the earliest known archaeological culture of southern Siberia, occupying the Minusinsk-Altai region during the Eneolithic era 3600/3300 BC to 2500 BC (Svyatko et al., 2009; Vadetskaya et al., 2014). Archeological data showed that the Afanasievo culture had strong affinities with the Yamnaya and pre-Yamnaya Eneolithic cultures in the West (Grushin et al., 2009). This suggests a Yamnaya migration into western Altai and into Afanasievo. Note that, in most current publications, “the Yamnaya culture” combines the so-called “classical Yamnaya culture” of the Early Bronze Age and archeological sites of the preceding Repin culture in the middle reaches of the Don and Volga rivers. In the present article we conventionally use the term Yamnaya in the same sense, in which case the beginning of the “Yamnaya culture” can be dated after the middle of the 4th millennium BC, when the Afanasievo culture appeared in the Altai.

Because of numerous traits attributed to early Indo-Europeans and cultural relations with Kurgan steppe cultures, members of the Afanasievo culture are believed to have been Indo-European speakers (Mallory and Mair, 2000). In a recent whole-genome sequencing study, Allentoft et al. (2015) concluded that Eastern Yamnaya individuals and Afanasievo individuals were genetically indistinguishable. Moreover, this study and one published concurrently by Haak et al. (2015) analyzed 11 Eastern Yamnaya males and showed that all of them belonged to the R1b1a1a (formerly R1b1a) (…)

Early Chalcolithic migrations ca. 3300-2600 BC.

Published works indicate that R1b was a predominant haplogroup from the late Neolithic to the early Bronze Age, notably in the Bell Beaker and Yamnaya cultures (Allentoft et al., 2015; Haak et al., 2015; Lee et al., 2012; Mathieson et al., 2015). Nearly 100% of the Afanasievo men we typed belonged to the R1b1a1a subhaplogroup and, for at least three of them, more precisely to the L23 (xM412) subclade. (…)

(…) our results therefore support the hypothesis of a genetic link between Afanasievo and Yamnaya. This also suggests that R1b was indeed dominant in the early Bronze Age Siberian steppe, at least in individuals that were buried in kurgans (possibly an elite part of the population). The geographical and temporal distribution of subhaplogroup R1b1a1a supports the hypothesis of population expansion from West to East in the Eurasian steppe during this period. It should however be noted that the Yamnaya burials from which the samples for DNA analysis were obtained (Allentoft et al., 2015; Haak et al., 2015; Mathieson et al., 2015) were dated within the limits of the Afanasievo period. Ancestors of both East Yamnaya and Afanasievo populations must therefore be sought in the context of earlier Eneolithic cultures in Eastern Europe. Sufficient Y-chromosomal data from such Eneolithic populations is, unfortunately, not yet available.

Mitochondrial- (A) and Y- (B) haplogroup distribution in studied populations

Okunevo and paternal lineage shift in South Siberia

Results obtained in the current study, from more than a dozen Okunevo individuals belonging to the earliest stage of Okunevo culture, that is the Uibat period (2500–2200 BC) (Lazaretov, 1997), suggest a discontinuity in the genetic pool between Afanasievo and Okunevo cultures. Although Y-chromosomal data obtained for bearers of the Okunevo culture showed that one individual carried haplogroup R1b, most Okunevo Y-haplogroups are representative of an Asian component represented by paternal lineages Q and NO1.

Okunevo carrier of Y-haplogroup Q1b1a-L54, which also supports this hypothesis (L54 being a marker of the lineage from which M3, the main Ameridian lineage, arose). Okunevo people could therefore be a remnant paleo-Siberian population with possible Afanasievo input, as suggested by the presence of the R1b1a1a2a subhaplogroup in one individual.

Late Chalcolithic migrations ca. 2600-2250 BC.

Replacement of Asian Indo-European elite lineages by R1a

Published genetic data from the late Bronze Age Andronovo culture from the Minusinsk Basin (Keyser et al., 2009), the Sintashta culture from Russia (Allentoft et al., 2015) and the Srubnaya culture from the region of Samara (Mathieson et al., 2015), show that males did not belong to Y-haplogroup R1b but mostly to R1a clades: there appears to have been a change in the dominant Y-chromosomal haplogroup between the early and the late Bronze Age in these regions. Moreover, as described in Allentoft et al. (2015), the Andronovo and Sintashta peoples were closely related to each other but clearly distinct from both Yamnaya and Afanasievo. Although these results do not imply that Y-haplogroup R1b was entirely absent in these later populations, they could correspond to a replacement of the elite between these two main periods and therefore a difference in the haplogroups of the men that were preferentially buried.

Early Bronze Age migrations ca. 2250-1750 BC.

Afanasevo and the Tarim Basin

The discovery, in the Tarim Basin, of well-preserved mummies from the Bronze Age allows for the construction of two hypotheses regarding the peopling of the Xinjiang province at this period. The “steppe hypothesis,” argues for a link with nomadic steppe herders (Hemphill and Mallory, 2004), possibly represented in this case by Afanasievo populations and their descendants (Mallory and Mair, 2000). However, newly published cultural data from the burial grounds of Gumugou (Wang, 2014) and Xiaohe (Xinjiang, 2003, 2007) shows material culture and burial rites incompatible with the Afanasievo culture. The earliest 14C date for Tarim Basin burials would place them at the turn of the 2nd millenium BC (Wang, 2013), 500 years after the Afanasievo period.

Instead, early Gumugou and Xiaohe burial grounds were contemporary with the start of the Andronovo period. Likewise, the Bronze Age population of the Xinjiang at Gumugou/Qäwrighul is not phenotypically closest to Afanasievo but to the Andronovo (Fedorovo) group of northeastern Kazakhstan and western Altai (Kozintsev, 2009). Our investigations demonstrate that Y-chromosomal lineage composition is also compatible with the notion that the ancient Tarim population was genetically distinct from the Afanasievo population. The only Y-haplogroup found by Li et al. (2010) in the Bronze Age Tarim Basin population was Y-haplogroup R1a, which suggests a proximity of this population with Andronovo groups rather than Afanasievo groups.

I don’t think these finds are much of a surprise based on what we already know, or need much explanation…

I would add that, once again, we have more proof that the movement of Okunevo and related ancient Siberian migrants from Central or North Asia will not be able to explain the presence of Uralic languages spread over North-East Europe and Scandinavia already during the Bronze Age.

Also interesting is to read in more peer-reviewed papers the idea of Late Indo-European speakers clearly linked to the expansion of patrilineally-related elite males marked by haplogroup R1b-L23, most likely since Eneolithic Khvalynsk/Repin cultures.


Native American genetic continuity and oldest mtDNA hg A2ah in the Andean region

Native American gene continuity to the modern admixed population from the Colombian Andes: Implication for biomedical, population and forensic studies by Criollo-Rayo et al., Forensic Sci Int Genet (2018), in press, corrected proof.

Abstract (emphasis mine):

Andean populations have variable degrees of Native American and European ancestry, representing an opportunity to study admixture dynamics in the populations from Latin America (also known as Hispanics). We characterized the genetic structure of two indigenous (Nasa and Pijao) and three admixed (Ibagué, Ortega and Planadas) groups from Tolima, in the Colombian Andes. DNA samples from 348 individuals were genotyped for six mitochondrial DNA (mtDNA), seven non-recombining Y-chromosome (NRY) region and 100 autosomal ancestry informative markers. Nasa and Pijao had a predominant Native American ancestry at the autosomal (92%), maternal (97%) and paternal (70%) level. The admixed groups had a predominant Native American mtDNA ancestry (90%), a substantial frequency of European NRY haplotypes (72%) and similar autosomal contributions from Europeans (51%) and Amerindians (45%). Pijao and nearby Ortega were indistinguishable at the mtDNA and autosomal level, suggesting a genetic continuity between them. Comparisons with multiple Native American populations throughout the Americas revealed that Pijao, had close similarities with Carib-speakers from distant parts of the continent, suggesting an ancient correlation between language and genes. In summary, our study aimed to understand Hispanic patterns of migration, settlement and admixture, supporting an extensive contribution of local Amerindian women to the gene pool of admixed groups and consistent with previous reports of European-male driven admixture in Colombia.

Ancestral uniparental haplogroups and diversity in Tolima. Geography of sampling locations. The
top and middle sections show the frequency of Native American mtDNA haplogroups and NRY lineages for all
populations. Gene diversity is shown below their respective pie chart. The lower part depicts the geography of the
region where the sampling sites of Ortega and Pijao are closely located in Tolima’s Magdalena river valley and
Ibague, Planadas and Nasa located in the Andes cordilleras (additional geographic details are shown in SF1).

Highlights from the paper:

  • MtDNA suggest a pre/post Columbian genetic continuity in the Colombian Andes.
  • Y-chromosome diversity follows a clinal gradient in the studied region.
  • Sex-biased/male-driven admixture process, involving Pijao women with European men.
  • Admixed closer to Indigenous resguardos have a higher Native American ancestry.

Also interesting is the recent paper Mitochondrial lineage A2ah found in a pre‐Hispanic individual from the Andean region, by Russo et al., in American Journal of Human Biology (2018), with an interesting sample from the Regional Developments II period (540 ± 60 BP).

Phylogeny of the A2ah mitochondrial lineage based on HVR I sequences. Both MaximumParsimony andMaximumLikelihood reconstructions led to the same typology. The tree was rooted with the RSRS. Sample ID: Cueva: Pukara de La Cueva, STACRUZ: Santa Cruz, BNI: Beni, BR: South-eastern Brazil, TobaChA: TobaGranChaco


Canid Y-chromosome phylogeny reveals distinct haplogroups among Neolithic European dogs


Open access Analysis of the canid Y-chromosome phylogeny using short-read sequencing data reveals the presence of distinct haplogroups among Neolithic European dogs, by Oetjens et al., BMC Genomics (2018) 19:350.

Interesting excerpts (modified for clarity, emphasis mine):


Canid mitochondrial phylogenies show that dogs and wolves are not reciprocally monophyletic. The mitochondrial tree contains four deeply rooted clades encompassing dogs and many grey wolf groups. These four clades form the basis of dog mitochondrial haplogroup assignment, known as haplogroups A-D. The time of the most recent common ancestor (TMRCA) of haplogroups A-D significantly predates estimates for domestication based on archeological and genetic evidence. Instead, these clades may represent variation present among the founding population of the dog lineage or the results of wolf introgressions into dog populations. The relative frequencies of mitochondria haplogroups are not stable over time, with changes reflecting processes such as drift, migration, and population growth. Although the mitochondria A and B haplogroups are most common in contemporary European dogs, surveys of ancient samples indicate that the majority of ancient European dogs carried the C or D mitochondrial haplotype. This apparent turnover in mitochondrial haplogroups may reflect the migration of a distinct dog population into Europe over the past 15,000 years.

Maximum likelihood phylogeny of 118 candid Y-chromosomes A Y-chromosome haplogroup tree produced by RAxML (8.1.13) using the GTR+ I model is depicted. Clades in the tree have been collapsed by haplogroup assignment. The number of samples within each collapsed node is indicated in parentheses next to the haplogroup assignment. For each node, percent bootstrap support out of 1000 iterations is indicated above the branch. The locations of three ancient samples, based on the presence of diagnostic mutations, are indicated in red


Using the variation discovered from sequence data, we applied a Bayesian MCMC approach to estimate TMRCAs for each haplotype group. Our estimated Y-chromosome mutation rate (3.07 × 10− 10 substitutions per site per year, relaxed clock model) falls within the range of a previous estimate by Ding et al. who used a similar calibration and estimate 1.35 × 10− 10– 4.31 × 10− 10 substitutions per site per year. The TMRCAs we estimated are substantially older than mitochondria phylogenies calibrated with tip dates of ancient samples, which report clade-specific TMRCAs < 25,000 years ago. We note that our Y-chromosome TMRCA estimates are extremely sensitive to our assumptions about the age of the root of the tree and should be interpreted with caution due to the uncertainty in this single calibration point. However, the relative ages of the branches and the chronological order of haplogroup divergences are more robust than the absolute estimated dates.

In general, the relationships between Y-chromosome haplogroups and autosomal ancestry we report are very similar to the relationships described in Shannon et al. As noted earlier, our dataset includes a subset of wolves with Y-chromosomes assigned to a dog Y-haplogroup. However, ADMIXTURE analysis does not indicate substantial recent dog ancestry in these samples, suggesting that their placement on the Y-chromosome phylogeny reflects variation in Y-chromosome haplotypes that was present in the ancestral population and therefore predates the domestication process or is the result of ancient introgression events whose signature of autosomal ancestry has been diluted.

The relationship between autosomal ancestry and Y-chromosome haplogroups Major groupings of canine ancestry are shown based on a principal components analysis of autosomal markers from 499 village dogs from Shannon et al. a. The geographic origin of each sample is indicated by color. The 104 male dogs used in this study are projected onto the resulting principal components and colored based on haplogroup (b). Village dogs from (a) are shown as transparent dots in (b)


Using sequencing data, we find that the estimated TMRCA of dog Y haplogroups predates dog domestication. We further reveal the placement of several wolf Y-chromosomes within deep branches of dog haplogroup clades. Using an expanded set of mutations diagnostic for each haplogroup, we find that distinct Y haplogroups were present in Europe during the Neolithic and that CTC, a ~ 4700 year old ancient dog from Germany has a Y-chromosome that shares diagnostic alleles with wolves found in India.

Other studies

On the same subject, you can read another recent study, bioRxiv preprint New Evidence of the Earliest Domestic Dogs in the Americas, by Perri et al. (2018); and also a recent, open access paper (see above featured image) Ancient European dog genomes reveal continuity since the Early Neolithic, by Botigué et al., Science Communications (2017).

While Proto-Indo-European- and Proto-Uralic-speakers had a close relationship with dogs (revealed in their reconstructed language and attributed archaeological cultures), I think it will be very difficult to ascertain any population movement based on them, unless there is a clear, well-established archaeological relationship between a specific culture and dog-breeding.

Nevertheless, I would say that this kind of studies are more likely to give some information related to these and other cultures than, for example, the study of honeybees in honey-hunting vs. beekeeping cultures (see e.g. The Complex Demographic History and Evolutionary Origin of the Western Honey Bee, Apis Mellifera, by Cridland, Tsutsui, and Ramírez GBE 2017), which was also related to the development of both PIE and PU cultures.

See also:

Shared ancestry of ancient Eurasian hepatitis B virus diversity linked to Bronze Age steppe


Ancient hepatitis B viruses from the Bronze Age to the Medieval period, by Mühlemann et al., Science (2018) 557:418–423.

NOTE. You can read the PDF at Dalia Pokutta’s account.

Abstract (emphasis):

Hepatitis B virus (HBV) is a major cause of human hepatitis. There is considerable uncertainty about the timescale of its evolution and its association with humans. Here we present 12 full or partial ancient HBV genomes that are between approximately 0.8 and 4.5 thousand years old. The ancient sequences group either within or in a sister relationship with extant human or other ape HBV clades. Generally, the genome properties follow those of modern HBV. The root of the HBV tree is projected to between 8.6 and 20.9 thousand years ago, and we estimate a substitution rate of 8.04 × 10−6–1.51 × 10−5 nucleotide substitutions per site per year. In several cases, the geographical locations of the ancient genotypes do not match present-day distributions. Genotypes that today are typical of Africa and Asia, and a subgenotype from India, are shown to have an early Eurasian presence. The geographical and temporal patterns that we observe in ancient and modern HBV genotypes are compatible with well-documented human migrations during the Bronze and Iron Ages1,2. We provide evidence for the creation of HBV genotype A via recombination, and for a long-term association of modern HBV genotypes with humans, including the discovery of a human genotype that is now extinct. These data expose a complexity of HBV evolution that is not evident when considering modern sequences alone.

Geographical distribution of analysed samples and modern genotypes. a (featured image), Distribution of modern human HBV genotypes. Genotypes relevant to this Letter are shown in colour. Coloured shapes indicate the locations of the HBV-positive samples included for further analysis. b (above this text), Locations of analysed Bronze Age samples are shown as circles and Iron Age and later samples are shown as triangles. Coloured markers indicate HBV-positive samples. Ancient genotype A samples are found in regions in which genotype D predominates today, and HBV-DA27 is of subgenotype D5 which today is found almost exclusively in India.

Interesting excerpts:

We find genotype A in south-western Russia by 4.3 ka (in samples RISE386 and RISE387) in individuals belonging to the Sintashta culture, and in a Hungarian sample (DA195) from the Scythian culture. The western Scythians are related to the Bronze Age cultures of western steppe populations2 and their shared ancestry suggests that the modern genotype A may descend from this ancient Eurasian diversity and not, as previously hypothesized, from African ancestors29,30. This is also consistent with the phylogeny (Fig. 2), as well as the fact that the three oldest ancient genotype A sequences (HBV-DA195, HBV-RISE386 and HBV-RISE387) lack the six-nucleotide insertion found in the youngest (HBV-DA119) and in all modern genotype A sequences. The ancestors of subgenotypes A1 and A3 could have been carried into Africa subsequently, via migration from western Eurasia31.

The ancient HBV genotype D sequences were all found in Central Asia. HBV-DA27, found in Kazakhstan and dated to 1.6 ka, falls basal to the modern subgenotype D5 sequences that today are found in the Paharia tribe from eastern India32. DA27 and the Paharia people in India are linked by their East Asian ancestry2,33.

Dated maximum clade credibility tree of HBV. A log-normal relaxed clock and coalescent exponential population prior were used. Grey horizontal bars indicate the 95% HPD interval of the age of the node. Larger numbers on the nodes indicate the median age and 95% HPD interval of the age (in parentheses) under a strict clock and Bayesian skyline tree prior. Clades of genotypes C (except clade C4), E, F, G and H are collapsed and shown as dots. The figure includes a possible tenth genotype, J, based on a single human isolate. Taxon names for ancient samples indicate era (BA, Bronze Age; IA, Iron Age or later), sample name, sample age in years, ISO 3166 three-letter abbreviation of country of sequence origin, and region of sequence origin. Taxon names for modern samples indicate human genotype or subgenotype or host species if non-human, GenBank accession number, sample age in years, ISO 3166 three-letter abbreviation of country of sequence origin, and region of sequence origin.

(…)Despite the age of the samples and the imperfect diagnostic test, our dataset contained a high proportion of HBV-positive individuals. The actual ancient prevalence during the Bronze Age and thereafter might have been higher, reaching or exceeding the prevalence typically found in contemporary indigenous populations5. This clearly establishes the potential of HBV as powerful proxy tool for research into human spread and interactions. The data from ancient genomes reveal aspects of complexity in HBV evolution that are not apparent when only modern sequences are considered. They show the existence of ancient HBV genotypes in locations incongruent with their present-day distribution, contradicting previously suggested geographical or temporal origins of genotypes or sub-genotypes; evidence for the creation of genotype A via recombination and the emergence of the genotype outside Africa; at least one now-extinct human genotype; ancient genotype-level localized diversity; and demonstrate that the viral substitution rate obtained from modern heterochronously sampled sequences is probably misleading. Together, these findings suggest that the difficulty in formulating a coherent theory for the origin and spread of HBV may be due to genetic evidence of an earlier evolutionary scenario being overwritten by relatively recent alterations, as has previously been suggested in the context of recombination24

See also:

Male-biased expansions and migrations also observed in Northwestern Amazonia

Open access preprint Cultural Innovations influence patterns of genetic diversity in Northwestern Amazonia, by Arias et al., bioRxiv (2018).

Abstract (emphasis mine):

Human populations often exhibit contrasting patterns of genetic diversity in the mtDNA and the non-recombining portion of the Y-chromosome (NRY), which reflect sex-specific cultural behaviors and population histories. Here, we sequenced 2.3 Mb of the NRY from 284 individuals representing more than 30 Native-American groups from Northwestern Amazonia (NWA) and compared these data to previously generated mtDNA genomes from the same groups, to investigate the impact of cultural practices on genetic diversity and gain new insights about NWA population history. Relevant cultural practices in NWA include postmarital residential rules and linguistic-exogamy, a marital practice in which men are required to marry women speaking a different language. We identified 2,969 SNPs in the NRY sequences; only 925 SNPs were previously described. The NRY and mtDNA data showed that males and females experienced different demographic histories: the female effective population size has been larger than that of males through time, and both markers show an increase in lineage diversification beginning ~5,000 years ago, with a male-specific expansion occurring ~3,500 years ago. These dates are too recent to be associated with agriculture, therefore we propose that they reflect technological innovations and the expansion of regional trade networks documented in the archaeological evidence. Furthermore, our study provides evidence of the impact of postmarital residence rules and linguistic exogamy on genetic diversity patterns. Finally, we highlight the importance of analyzing high-resolution mtDNA and NRY sequences to reconstruct demographic history, since this can differ considerably between males and females.

MDS plots for mtDNA and NRY. Stress values (within parentheses) are indicated in percentages.

Looking more precisely at the different groups (even with the resampling approach), there are no significant differences between matrilocal and patrilocal groups. At best, as the study proposes, “this is just one of the factors at play in structuring the observed genetic variation”.

Interesting excerpts:

(…) we found evidence that the patterns of genetic differentiation depend on the geographical scale of the study. The magnitude of between-population differentiation in the NRY compared to the mtDNA is smaller when looking at the continental scale than in NWA (Figure 6). This is in agreement with the findings of Wilkins and Marlowe (2006), who showed that the excess of between-population differentiation for the NRY in comparison to the mtDNA decreases when comparing more geographically distant populations. Heyer et al. (2012) and Wilkins and Marlowe (2006) have proposed that at a local scale the patterns of genetic diversity reflect cultural practices over a relatively small number of generations, whereas at a larger geographic scale the genetic diversity reflects old migration and/or old common ancestry patterns(Heyer et al. 2012; Wilkins and Marlowe 2006).

BSPs for the mtDNA and NRY sequences from NWA. The dotted lines indicate the 95% HPD intervals. Ne was corrected for generation time according to (Fenner 2005), using 26 years for mtDNA and 31 years for NRY.

The BSP plots and the diversity statistics indicate that overall the Ne of males has been smaller than that of females. One tentative explanation for this difference is that it reflects larger differences in reproductive success among males than among females. Some support for this explanation comes from the shape of the phylogenies (Supplementary Figures 1 and 6), since differences in reproductive success and the cultural transmission of fertility lead to imbalance phylogenies (Blum et al. 2006; Heyer et al. 2015). We estimated a common index of tree imbalance (Colless index) and calculated whether the mtDNA and NRY trees were more unbalanced than 1000 simulated trees generated under a Yule process (Bortolussi et al. 2006) (i.e. a simple pure birth process that assumes that the birth rate of new lineages is the same along the tree). We found that the NRY tree is more unbalanced than predicted by the Yule model (p-value=0.001), whereas the mtDNA tree is not significantly different from trees generated by the Yule model (p-value=0.628). It has been suggested that highly mobile hunter-gatherer societies, such as those typical of most of human prehistory, were polygynous bands (Dupanloup et al. 2003); similarly, nomadic horticulturalist Amazonian societies exhibit strong differences in reproductive success due to the common practice of polygyny, especially among community chiefs, whose offspring also enjoy a high fertility (Neel 1970; 1980; Neel and Weiss 1975).

Furthermore, a more recent expansion can be observed in the BSP based on the NRY, but not in the mtDNA BSP (Figure 5), indicating an expansion specifically in the paternal line. The reasons behind this recent male-biased population expansion, which starts ~3.5 kya, are as yet unclear. However, similar male-biased expansions have been observed in other studies using high-resolution NRY sequences (Batini et al. 2017; Karmin et al. 2015).