- March 2, 2020 at 11:50 am #27555
Announcements of changes to the compiled dataset of Y-DNA and mtDNA data for reported ancient samples, including analyses of BAM files, nomenclature, culture labelling, etc.
Official site for different formats is at
For direct download of the latest version published use https://haplogroup.info/
- March 4, 2020 at 10:08 am #27627
Files updated to v. 1.89, including newly reported samples from Sardinia and the Mediterranean, as well as some not so recent ones I had missed – Mazovian prince, Early Poles – and some updated SNPs of samples from different published papers.
- March 7, 2020 at 9:33 pm #27825
I have updated the file with SNP inferences of samples from Sirak et al. (2019).
I had to delete update dates and start anew, because the previous ones were all messed up, probably due to messing around with different formats (Excel, CSV, txt).
- March 8, 2020 at 6:29 pm #27888
These are some curiously similar SNP inferences around Lake Baikal, apparently N1a1*(xN1a1a), but nevertheless with multiple positives for N1a-L1026 equivalents, showing that this specific lineage (whichever it was) was widespread on both sides of the lake during the Neolithic.
For some reason, this last one didn’t make its way into YFull.
EDIT: According to Pribislav, they are N1a-pre-B187, from Y24317, a rare sister clade of N1a-708. ISOGG 2019 is really far behind new SNPs compared to FTDNA and YFull, and the current nomenclature doesn’t make much sense…
- This reply was modified 5 months, 3 weeks ago by Carlos Quiles. Reason: Pribislav SNP calls for Fofonovo and DA345
- March 8, 2020 at 6:37 pm #27889
The SNP calls for Villabruna show it is negative for V2219 and L389 subclades (although the L389 level is not covered). I’d say it was more likely of a basal subclade that hasn’t survived to this day.
The question is thus if the associated Epigravettian WHG expansion in Western Europe consisted mainly of this subclade, and V2219-associated peoples expanded in a different (later?) wave into SE Europe, or if it was a common L754-rich migration of which we can only see the effects after regional bottlenecks.
Sadly, Iboussieres31-2 has a too small coverage to help support any option.
- March 12, 2020 at 8:57 pm #27936
I have updated the dataset, including reported Neanderthal and Denisovan Y-DNA (ISOGG only).
I have also checked out some of the samples of hg. T. I can’t find Genetiker’s reported SNP for the Varna individual. The best I can do (like the original paper) is CT+.
It’s quite interesting that the R1a-Z93 from the Balkans shows SNP calls similar to the Glăvăneştii one, suggesting that it is an R1a-Z93* sample more closely related to Late Trypillian groups, and thus a potential resurgence event more than a Srubnaya-related migration:
I have also updated all maps of Y-DNA.
- March 13, 2020 at 9:52 am #27943
- March 16, 2020 at 10:54 am #27974
1. I have tested all Baltic Neolithic samples reported as R1b-L754 or P297: all have enough coverage to show they are of basal subclades P297* (xM73, xM269).
2. I also tried using Skoglund et al. (2014) PMDtools with different thresholds to improve damaged samples:
Unsuccessful with the Balkan Chalcolithic outlier from Smyadovo: all positive SNPs except BT are excluded, so we are stuck with the more risky: P-, but R+, R1b+, R1b-M269+ results. For some reason (maybe a specific threshold??) the authors assumed that the R-P280 call was acceptable, though.
Successful with the Samara HG sample: a low threshold (=0.1) confirms one R1b-M73-equivalent SNP, with two negative R1b-M269-equivalent reads, so the most plausible haplogroup seems to be M73, until proven otherwise.
3. I added samples from Egypt, including two newly reported from the Kurchatov Institute (no clear date or location), also the dubious R1b-M269 from the KV 55 coffin and the mtDNA of Djehutynakht in Loreille et al. (2018).
- March 23, 2020 at 9:57 am #28188
Changes into version 1.89.16 include:
1. Addition of mtDNA from Ancient mitogenomes show plateau populations from last 5200 years partially contributed to present-day Tibetans, by Ding et al. Proc R Soc B (2020).
2. Review of SNP inferences of Bronze Age R1b-Z2103 samples, including negative SNPs.
Now using Yleaf v. 2.2, but I didn’t see any marked differences with previous inferences made with Yleaf v.2.
- March 31, 2020 at 8:22 am #28724
In version 1.90.1 I added changes proposed by Kovalev to culture and group classification of samples from Jeong et al. (2020).
I have left the samples labelled as C2a… according to what I could find in Japanese pages, which suggest they belong to ISOGG 2019 C2b, even though no recent ISOGG nomenclature included them in the past 5 years… These include C2a1a1, C2a1a2, but particularly C2a1a3, whose corresponding C2b1a3?? I couldn’t find anywhere.
- April 6, 2020 at 9:17 am #29041
Updated version 1.90.4 with new mtDNA reported in Evaluation of DNA conservation in Nile-Saharan environment, Missiminia, in Nubia: Tracking maternal lineage of “X-Group”, by Yahia Mehdi Seddik Cherifi, Selma Amrani.
- April 9, 2020 at 6:53 pm #29159
Updated version 1.90.5, including corrections to I1 subclades (in my file) posted on YFull Facebook Group by Simon Hedley.
Included two mtDNA reported by Rogers et al. from WSU Human Biology Open Access preprints at https://digitalcommons.wayne.edu/humbiol_preprints/160
- April 9, 2020 at 8:21 pm #29160
- April 17, 2020 at 10:53 am #29327
Version 1.90.8 includes minor updates and mtDNA from the study Mitochondrial genomes from Bronze Age Poland reveal genetic continuity from the Late Neolithic and additional genetic affinities with the steppe populations, by Juras et al. J. Phys. Anthropol. (2020)
- April 22, 2020 at 10:53 am #29599
- May 8, 2020 at 10:38 am #29795
- May 21, 2020 at 12:50 pm #29856
Updated from version 1.91.9 to version 1.91.12, including the new Baikal samples from Yu et al. Cell (2020) and the Maros samples from Zegarac et al. bioRxiv (2020), as well as the few reported Trentino samples in Graeffen’s thesis (2020), the few mtDNA from East Asian genomes, or the few Inner Mongolia samples from Li et al. Phys. Anthr. (2020), which updates their previous Li et al. (2017) report.
- June 10, 2020 at 12:32 pm #30280
Update to version 1.93 with automated SNP calls from genotypes shared by Kolgeh (as suggested in comments).
- June 17, 2020 at 12:23 pm #30352
Version 2.0x includes new columns for:
- FTDNA haplotree
- YFull mtree
- Responsible for mtDNA SNP calls and the SNP calls published by them.
- Lactase Persistence – now separated from “other”, more focused on diseases.
The most interesting part is the correction of nomenclature and hyperlinks, so that the file may be accurately used for mtDNA phylogeography.
Newly reported samples – or recently found by me – have also been added.
The new standard versions don’t have fields specific for GIS maps.
Announcement is here.
- June 19, 2020 at 9:49 am #30358
- June 23, 2020 at 9:35 am #30513
Updated to version 2.01.14. I spent hours reviewing SNP calls for I2 subclades and adding positive, negative, and dubious SNPs. It was very interesting, but also very frustrating when I realized after spending so many hours during the weekend that I couldn’t find what I was looking for: a clear patrilineal connection between all Megalithic groups.
This post shows the result of that work:
- June 26, 2020 at 6:35 pm #30560
- July 5, 2020 at 2:08 pm #30720
- July 6, 2020 at 8:50 pm #30724
Updated to version 2.01.24, including mtDNA data from Umbri published in Modi et al. Scientific Reports (2020).
Given the recently published data from Norht-Eastern Europe and the Cis-Baikal Neolithic and EBA, I’ve also decided to change the symbol of the questionable Chalcolithic N-TAT from Chekunova (2014), as well as the two Baikalic Neolithic R1a-M198 from Moussa (2016), to an “Unknown” instead of N1c and R1a, respectively. I am not ready to strike them out yet. I still hope they will retest, retrace, or recheck those samples and/or perform radiocarbon dates in the near future.
- July 7, 2020 at 10:14 pm #30725
Updated to version 2.01.26, including update of Early Hungarians from Determination of the phylogenetic origins of the Árpád Dynasty based on Y chromosome sequencing of Béla the Third, by Nagy et al. Eur J Hum Genet (2020).
- July 21, 2020 at 9:55 am #31004
- August 2, 2020 at 1:57 pm #31292
Updated to version 2.02.01, including recently reported samples from the Tollense Valley.
Updated to version 2.02.07, including Xiongnu samples and updates based on STRs (mainly to R1a-Z2125) of previously reported R1a samples.
Also included are two updated mtDNA subclades from mitogenomes published in Furtwängler et al. (2020).
- August 4, 2020 at 6:12 pm #31316
Updated to version 2.02.07, including samples from A Paleogenomic Reconstruction of the Deep Population History of the Andes, by Nakatsuka, Lazaridis, et al. Cell (2020).
- August 11, 2020 at 8:20 pm #31425
- August 20, 2020 at 9:06 am #31461
- August 24, 2020 at 4:53 pm #31470
- August 26, 2020 at 8:52 pm #31514
Updated to version 2.02.33 with some updated Y-SNP calls from the FTDNA team.
- September 9, 2020 at 8:32 pm #31541
Updated to version 2.03.01:
- Completed update of references, including short name, full citation, DOI and (whenever available) accession codes.
- Updated subclades from the Mokrin necropolis and the Tollense valley.
- Some more old papers on ancient samples includes: mainly mtDNA, mostly from the Americas.
- September 15, 2020 at 10:45 am #31683
- You must be logged in to reply to this topic.