South-East Asia samples include shared ancestry with Jōmon


New paper (behind paywall) The prehistoric peopling of Southeast Asia, by McColl et al. (Science 2018) 361(6397):88-92 from a recent bioRxiv preprint.

Interesting is this apparently newly reported information including a female sample from the Ikawazu Jōmon of Japan ca. 570 BC (emphasis mine):

The two oldest samples — Hòabìnhians from Pha Faen, Laos [La368; 7950 with 7795 calendar years before the present (cal B.P.)] and Gua Cha, Malaysia (Ma911; 4415 to 4160 cal B.P.)—henceforth labeled “group 1,” cluster most closely with present-day Önge from the Andaman Islands and away from other East Asian and Southeast-Asian populations (Fig. 2), a pattern that differentiates them from all other ancient samples. We used ADMIXTURE (14) and fastNGSadmix (15) to model ancient genomes as mixtures of latent ancestry components (11). Group 1 individuals differ from the other Southeast Asian ancient samples in containing components shared with the supposed descendants of the Hòabìnhians: the Önge and the Jehai (Peninsular Malaysia), along with groups from India and Papua New Guinea.

We also find a distinctive relationship between the group 1 samples and the Ikawazu Jōmon of Japan (IK002). Outgroup f3 statistics (11, 16) show that group 1 shares the most genetic drift with all ancient mainland samples and Jōmon (fig. S12 and table S4). All other ancient genomes share more drift with present-day East Asian and Southeast Asian populations than with Jōmon (figs. S13 to S19 and tables S4 to S11). This is apparent in the fastNGSadmix analysis when assuming six ancestral components (K = 6) (fig. S11), where the Jōmon sample contains East Asian components and components found in group 1. To detect populations with genetic affinities to Jōmon, relative to present-day Japanese, we computed D statistics of the form D(Japanese, Jōmon; X, Mbuti), setting X to be different presentday and ancient Southeast Asian individuals (table S22). The strongest signal is seen when X=Ma911 and La368 (group 1 individuals), showing a marginally nonsignificant affinity to Jōmon (11). This signal is not observed with X = Papuans or Önge, suggesting that the Jōmon and Hòabìnhians may share group 1 ancestry (11).

Model for plausible migration routes into SEA. This schematic is based on ancestry patterns observed in the ancient genomes. Because we do not have ancient samples to accurately resolve how the ancestors of Jōmon and Japanese populations entered the Japanese archipelago, these migrations are represented by dashed arrows. A mainland component in Indonesia is depicted by the dashed red-green line. Gr, group; Kra, Kradai.

(…) Finally, the Jōmon individual is best-modeled as a mix between a population related to group 1/Önge and a population related to East Asians (Amis), whereas present-day Japanese can be modeled as a mixture of Jōmon and an additional East Asian component (Fig. 3 and fig. S29)

Interesting in relation to the oral communication of the SMBE O-03-OS02 Whole genome analysis of the Jomon remain reveals deep lineage of East Eurasian populations by Gakuuhari et al.:

Post late-Paleolithic hunter-gatherers lived throughout the Japanese archipelago, Jomonese, are thought to be a key to understanding the peopling history in East Asia. Here, we report a whole genome sequence (x1.85) of 2,500-year old female excavated from the Ikawazu shell-mound, unearthed typical remains of Jomon culture. The whole genome data places the Jomon as a lineage basal to contemporary and ancient populations of the eastern part of Eurasian continent, and supports the closest relationship with the modern Hokkaido Ainu. The results of ADMIXTURE show the Jomon ancestry is prevalent in present-day Nivkh, Ulchi, and people in the main-island Japan. By including the Jomon genome into phylogenetic trees, ancient lineages of the Kusunda and the Sherpa/Tibetan, early splitting from the rest of East Asian populations, is emerged. Thus, the Jomon genome gives a new insight in East Asian expansion. The Ikawazu shell-mound site locates on 34,38,43 north latitude, and 137,8, 52 east longitude in the central main-island of the Japanese archipelago, corresponding to a warm and humid monsoon region, which has been thought to be almost impossible to maintain sufficient ancient DNA for genome analysis. Our achievement opens up new possibilities for such geographical regions.


Complex history of dog origins and translocations in the Pacific revealed by ancient mitogenomes


Open access Complex history of dog (Canis familiaris) origins and translocations in the Pacific revealed by ancient mitogenomes, by Creig et al., Scientific Reports (2018).


Archaeological evidence suggests that dogs were introduced to the islands of Oceania via Island Southeast Asia around 3,300 years ago, and reached the eastern islands of Polynesia by the fourteenth century AD. This dispersal is intimately tied to human expansion, but the involvement of dogs in Pacific migrations is not well understood. Our analyses of seven new complete ancient mitogenomes and five partial mtDNA sequences from archaeological dog specimens from Mainland and Island Southeast Asia and the Pacific suggests at least three dog dispersal events into the region, in addition to the introduction of dingoes to Australia. We see an early introduction of dogs to Island Southeast Asia, which does not appear to extend into the islands of Oceania. A shared haplogroup identified between Iron Age Taiwanese dogs, terminal-Lapita and post-Lapita dogs suggests that at least one dog lineage was introduced to Near Oceania by or as the result of interactions with Austronesian language speakers associated with the Lapita Cultural Complex. We did not find any evidence that these dogs were successfully transported beyond New Guinea. Finally, we identify a widespread dog clade found across the Pacific, including the islands of Polynesia, which likely suggests a post-Lapita dog introduction from southern Island Southeast Asia.

A map of Southeast Asia and the Pacific showing the source location of the specimens and associated haplogroups (assignment to haplogroup follows Duleba and colleagues) and the median-joining network. The boundary between Near and Remote Oceania is also shown. Symbols identify the type of sequence: filled circle, ancient mitogenome; half circle, partial ancient sequence; hollow circle, modern mitogenome. Node colours represent the haplogroup, grey, A; red, A2b2, green, A2b3; yellow, A4’5; blue, B.


The dispersal of dogs across the Pacific is inseparably linked to the relationships between dogs and people. Unlike movement across continental landmasses, Pacific dogs must have been transported by people across the waters that separate islands. The ancient mitogenomes sequenced from archaeological dog specimens presented here offer a novel series of individual insights into the history of dog translocation from Southeast Asia as it occurred prior to the influence of modern European dog breeds. We generated seven mitogenomes and five partial sequences from ancient MSEA, ISEA and Pacific dogs, and four modern dingoes. Despite the small sample size, our results reveal levels of complexity and discontinuity in the introduction and movement of dogs, which are mirrored in the archaeological and linguistic evidence, suggesting at least three introductions of dogs to the wider Pacific region, in addition to the earlier appearance of the dingo in Australia. Further mtDNA studies of ancient dogs and modern village populations throughout the region may contribute additional data that can be used to evaluate these hypothesised dispersals. Autosomal and Y-chromosome analyses also have the potential to generate additional information about dog dispersal, which could reveal different dispersal signatures based on sex, or phenotypic characteristics, though the environmental conditions in the region are not particularly conducive to aDNA preservation.

Our molecular genetic analyses reveal one of the earliest dogs present in ISEA around 3,000 years ago from Timor-Leste possesses a mtDNA lineage not found elsewhere in the region. We also found similarities between mtDNA of modern dingoes and NGSDs and an ancient Taiwanese sequence, which supports previous observations about possible links between Y-chromosome markers of modern dingoes and a modern Taiwanese sample. More work is required to address whether these connections reflect the genetic diversity of a shared ancestral population in mainland China, or attest to a currently unknown dispersal event linking the two populations. Archaeological evidence for the introduction of dogs to Oceania as part of the LCC is extremely limited. Nonetheless, we demonstrate that mitogenomes from dogs in terminal Lapita and post-Lapita levels of archaeological sites along the south coast of mainland New Guinea also show affinities with an Iron Age dog specimen from Taiwan, raising the possibility of at least one introduction of dogs during Austronesian expansions ultimately from the north. Finally, we have identified a major late introduction of dogs across the islands of Oceania beginning around 2,000 years ago, which appears to have originated in MSEA, not Taiwan, and culminated in the establishment of dog populations in initial colonisation-era sites throughout East Polynesia.

Molecular phylogenetic analysis by maximum likelihood method, implemented in MEGA71. The evolutionary history shown inferred by using the maximum likelihood method based on the Hasegawa-Kishino-Yano model. The tree with the highest log likelihood (-25257.5243) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. Initial tree(s) for the heuristic search were obtained automatically by applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using the Maximum Composite Likelihood (MCL) approach, and then selecting the topology with superior log likelihood value. A discrete Gamma distribution was used to model evolutionary rate differences among sites (5 categories (+G, parameter = 0.0500)). The rate variation model allowed for some sites to be evolutionarily invariable ([+I], 0.0010% sites). The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 45 nucleotide sequences. There were a total of 16774 positions in the final dataset.

Also related, open access Elucidating biogeographical patterns in Australian native canids using genome wide SNPs, by Cairns et al., PLOS One (2018).


Dingoes play a strong role in Australia’s ecological framework as the apex predator but are under threat from hybridization and agricultural control programs. Government legislation lists the conservation of the dingo as an important aim, yet little is known about the biogeography of this enigmatic canine, making conservation difficult. Mitochondrial and Y chromosome DNA studies show evidence of population structure within the dingo. Here, we present the data from Illumina HD canine chip genotyping for 23 dingoes from five regional populations, and five New Guinea Singing Dogs to further explore patterns of biogeography using genome-wide data. Whole genome single nucleotide polymorphism (SNP) data supported the presence of three distinct dingo populations (or ESUs) subject to geographical subdivision: southeastern (SE), Fraser Island (FI) and northwestern (NW). These ESUs should be managed discretely. The FI dingoes are a known reservoir of pure, genetically distinct dingoes. Elevated inbreeding coefficients identified here suggest this population may be genetically compromised and in need of rescue; current lethal management strategies that do not consider genetic information should be suspended until further data can be gathered. D statistics identify evidence of historical admixture or ancestry sharing between southeastern dingoes and South East Asian village dogs. Conservation efforts on mainland Australia should focus on the SE dingo population that is under pressure from domestic dog hybridization and high levels of lethal control. Further data concerning the genetic health, demographics and prevalence of hybridization in the SE and FI dingo populations is urgently needed to develop evidence based conservation and management strategies.

Principal components analysis (PCA) based upon filtered whole genome SNP genotypes (58,512 sites) for 23 dingoes, 5 NGSD, 8 Borneo village dogs, 9 Vietnam village dogs, 10 Portugal village dogs and 8 Australian cattle dogs (‘Dataset B’). Colours represent population clusters: red for SE dingoes, purple for FI dingoes, blue for NW dingoes, dark green for NGSD, light green for Borneo village dogs, orange for Vietnam village dogs, yellow for Portugal village dogs and grey for Australian cattle dogs. (A) PC 1 versus PC 2. (B) PC 1 versus PC 3.

As I said in a previous post, the study of dogs may be useful to trace population migrations and to assess strong cultural contacts. Especially, as in this case, when crossbreeding among cultures is not easy…


Islands across the Indonesian archipelago show complex patterns of admixture


An open access article Complex patterns of admixture across the Indonesian archipelago, by Hudjashov et al. (2017), has appeared in Molecular Biology and Evolution, and clarifies further the Austronesian (AN) expansion.


Indonesia, an island nation as large as continental Europe, hosts a sizeable proportion of global human diversity, yet remains surprisingly under-characterized genetically. Here, we substantially expand on existing studies by reporting genome-scale data for nearly 500 individuals from 25 populations in Island Southeast Asia, New Guinea and Oceania, notably including previously unsampled islands across the Indonesian archipelago. We use high-resolution analyses of haplotype diversity to reveal fine detail of regional admixture patterns, with a particular focus on the Holocene. We find that recent population history within Indonesia is complex, and that populations from the Philippines made important genetic contributions in the early phases of the Austronesian expansion. Different, but interrelated processes, acted in the east and west. The Austronesian migration took several centuries to spread across the eastern part of the archipelago, where genetic admixture postdates the archeological signal. As with the Neolithic expansion further east in Oceania and in Europe, genetic mixing with local inhabitants in eastern Indonesia lagged behind the arrival of farming populations. In contrast, western Indonesia has a more complicated admixture history shaped by interactions with mainland Asian and Austronesian newcomers, which for some populations occurred more than once. Another layer of complexity in the west was introduced by genetic contact with maritime travelers from South Asia and strong demographic events in isolated local groups.

Among its results (emphasis is mine):

Most eastern Indonesian populations show traces of admixture that appear to reflect an expansion of AN speakers (Figure 4B, S3). There is a striking similarity between inferred events – each admixed population includes both a Philippine non-Kankanaey and western Indonesian-like source likely representing Holocene movements of Asian farming groups, as well as a Papuan-like source representing local indigenous ancestry. One reason for the lack of clear Taiwanese sources may be because the aboriginal populations of Taiwan were heavily affected by post-AN movements from mainland East Asia, most recently sinicization by Han Chinese, and thus no longer depict the ancestral AN gene pool (Mörseburg, et al. 2016). However, this notable pattern could equally be explained by the dominance of language and culture transfers during early phases of the Neolithic expansion from Taiwan into the Philippines, followed by people with predominantly Philippine ancestry driving later demic diffusion into the Indonesian archipelago. Interestingly, Mörseburg, et al. (2016), by using a different sample set and genotype-based analytical toolkit, indicated that the Kankanaey ethnic group from the Philippines is likely the closest living proxy of the source population that gave rise to the AN expansion. We did not detect this population among sources of admixture in eastern Indonesia, and therefore suggest that the place of individual Philippine groups in the AN expansion needs to be further addressed by better sampling in the Philippine archipelago.

Sumba and Flores, the two westernmost islands to the east of Wallace’s line, display a high proportion of Java and Bali surrogates in their AN admixing source. This suggests that the AN movement into eastern Indonesia, especially for Sumba and Flores, had earlier experienced some degree of genetic contact with western Indonesian groups. In contrast, the sources of AN admixture in Lembata, Alor, Pantar and Timor are dominated by Sulawesi (Figure 4B, S3, Table S3, S5). This generally agrees with expectations from the geography of the region, whereby AN groups exiting the southern Philippines were likely funneled into at least two streams, including a western path through Borneo and a central path through Sulawesi (Blust 2014).

Point estimates of genetic admixture times in eastern Indonesia lie within a narrow timeframe ranging between ca 185 BCE to 360 CE or 75 to 56 generations ago (95% CI 510 BCE – 475 CE or 87–52 generations) (Figure 4B, Table S3). These inferred dates are younger than some previous estimates (120–200 generations ago) (Xu, et al. 2012; Sanderson, et al. 2015; Sedghifar, et al. 2015). A major analysis of admixture in Indonesia estimated the date of AN contact in the eastern part of archipelago to be around 500 to 600 CE (ca 50 generations, CI estimates between 58–42 generations ago) (Lipson, et al. 2014), surprisingly young given the archaeological evidence. However, the study pooled a very small sample of genetically heterogeneous eastern Indonesian islands including, for example, Flores and Alor. As we show here (Figure 2, 4, 5, S3, Table S3, S5, S6), while the wave of AN speakers left a common genetic trace across the whole of eastern Indonesia, the details and dates of this contact vary considerably not only between islands (e.g., Flores and Alor), but also within individual islands (e.g., Flores Rampasasa vs. Flores Bama). The genetic dates, which were obtained here by denser geographical sampling of 8 eastern islands, a much larger number of individuals (28 per island on average) and a greater number of SNPs, are up to 30 generations older, predating the Common Era in many cases.

It therefore took migrants at least half a millennium to proceed from islands around Wallace’s line to the easternmost sampled part of eastern Indonesia. Nevertheless, observed dates for AN contact in eastern Indonesia are still approximately a millennium younger than the earliest Neolithic archaeological evidence in the region, and two explanations seem most likely here. First, the AN migration may have involved several waves of people leaving Taiwan, spanning multiple generations, which would bias date estimates later than the first arrival of the Neolithic archeological assemblage (Sedghifar, et al. 2015). Second, there may have been a substantial time gap between the spread of culture and technological traditions, and the beginning of extensive genetic contact between incoming farming groups and native inhabitants in Indonesia (Lansing, et al. 2011). The lack of considerable admixture with Papuan groups was recently noted in ancient Lapita individuals from Remote Oceania, whose genomes are mostly Asian and carry little to no Papuan ancestry, suggesting limited contact as they moved through Melanesia to previously uninhabited islands in the Pacific (Skoglund, et al. 2016). A lag in admixture between local and incoming Neolithic groups has also been observed in Europe, where hunter-gatherer and farming populations initially co-existed for nearly a thousand years without substantial genetic interaction (Malmström, et al. 2015).

austronesian-admixture Ancestral genomic components in regional populations. For every K, the modal solution with the highest number of ADMIXTURE runs is shown; individual ancestry proportions were averaged across all runs from the same mode and the number of runs (out of 50) assigned to the presented solution is shown in parentheses. Average cross validation statistics were calculated across all runs from the same mode (insert). The minimum cross-validation score is observed at K=9. Note major ancestry components in Indonesia and ISEA – Papuan (light purple), mainland Asian (light yellow) and AN (light blue) – as well as major differences in the distribution of these three ancestries between eastern and western Indonesia. Populations from the Philippines and Flores are abbreviated as ‘Ph.’ and ‘Fl.’, respectively.

Featured images are taken from the article.

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (, which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact

Related posts: