Somali Y Haplogroups | Somaliwave
  • Please, contribute interesting analytical writings you come across the web pertaining to Somalia and the Horn of Africa. Don't forget to include a cover image at the top of your post for visual appeal and a link to the source. You are welcome to post your own opinion pieces.

Somali Y Haplogroups

Malawax

Malaay iyo Muufo
Staff member
Wadani
🦨Primus
LV
10
 
Joined
Jul 30, 2023
Messages
301
Reaction score
2,580
Awards
1
YFull age estimation is currently in progress, and here are the latest changes.
The TMRCA for the following branches increased by 50 years:
E-Y163949
E-FT24258
E-FT81055
E-FTC83339
E-Y272476
E-S10293
E-FT77328
E-BY8085
J-Y445057

The TMRCA for E-Y515840 decreased by 100 years. This change occurred because @Saeed , @gacale , and the new Sheekhaal were not included in the current TMRCA estimation. Specifically, the exclusion of @Said led to an increase of 50 years in the TMRCA for E-FT77328. Additionally, with one fewer sample (@Saeed ) containing high SNPs used for age estimation, the average TMRCA between E-BY192465 and E-BY8081 decreased by 100 years.
E-Y18629 and E-Y18637 are still using the TMRCA of E-Y515840 because their calculated TMRCA is younger.
 
Last edited:
YFull age estimation is currently in progress, and here are the latest changes.
The TMRCA for the following branches increased by 50 years:
E-Y163949
E-FT24258
E-FT81055
E-FTC83339
E-Y272476
E-S10293
E-FT77328
E-BY8085

The TMRCA for E-Y515840 decreased by 100 years. This change occurred because @Saeed , @gacale , and the new Sheekhaal were not included in the current TMRCA estimation. Specifically, the exclusion of @Said led to an increase of 50 years in the TMRCA for E-FT77328. Additionally, with one fewer sample (@Saeed ) containing high SNPs used for age estimation, the average TMRCA between E-BY192465 and E-BY8081 decreased by 100 years.
E-Y18629 and E-Y18637 are still using the TMRCA of E-Y515840 because their calculated TMRCA is younger.

Appreciate the update as always brother! 🙏🏾
 
Samples not included in the current age estimations are:
E = YF129771, YF130450, YF129781, YF107912, YF084582, and YF102772.
J = YF117573
T = YF129443
These were excluded because they are either new or their BAM files were realigned to T2T.
YF117573 is neither new nor realigned to T2T, I don’t know why it wasn’t included. My sample is also not included again.
This can change after the update process is completed.
 
Last edited:
It seems that a Majeerteen sample was recently added to the YFull tree, even though it is not a new sample. It might have been private until now.
New Sample.png

We now have 99 known Somali samples on YFull.
 
Considerations Before Labeling All Samples Under the E-Z21175 Branch as Oromo

YFull uses an average mutation rate of 144.41 years, focusing only on SNPs from the combBED region for TMRCA estimation, with adjustments for each sample’s length coverage for age.
YFull age estimation methodology
Mutation Rate.png

The average mutation rate for Y chromosome SNPs used by FTDNA is between 80 and 144 years. This rate is highlighted in a video at Family Tree DNA's Big Y Block Tree starting at the 6:30 mark. Using FTDNA's mutation rate range (80 to 144 years), we can approximate the average mutation rate for Y chromosome SNPs in genealogical studies to be around 112 years (80+144)/2. However, it's important to note that the mutation rate can vary between different haplogroups and even between related branches.

Imagine a scenario 4,000 to 5,000 years before present (YBP) involving a man carrying the Z813 mutation. If this man had multiple sons, it’s possible that none or just one of his sons would develop a new mutation. The average mutation rate of 121 years suggests that it might take many years after the initial ancestor’s passing for new mutations to appear. It’s unlikely for each son to have a unique mutation used in genealogical studies.

Historically, groups of people did not subdivide based on their genetic mutations, as these changes are not visibly apparent. They likely remained as one community. Over time, some men within these groups would have sons with new mutations, while their brothers’ sons might not, although their grandkids or great-grandkids could have new and different mutations. As these groups spread and split, they did not segregate based on genetic mutations. Consequently, individuals with different mutations often remained in the same group, maintaining familial ties. This means that men with the Z21175 mutation could have stayed with their family members who did not have this specific mutation. Meanwhile, others with and without the mutation might have separated and moved to different geographical locations. This explains why we are likely to see more Oromo samples under E-Y17859 and more Somali samples under E-Z21175. I am only focusing on the Somali and Oromo samples, but I am not suggesting all the samples under Z813 are either Somali or Oromo.

If someone suggests that people who have been part of their Somali family for over a thousand years are not Somali, ask them to provide evidence to support their arguments. When their arguments are based on YFull and FTDNA branches, inquire if they understand the mutation rate for those SNPs and whether it could cause related people who stayed together for centuries to accumulate different mutations.

We know from @BluSkee and YFull profiles that two of the SDPP samples are YF095342 Howrarsame from Gedo region and YF081756 Dhulbahante from Sool region, and sample ID YF102772 belongs to Ajuuraan > Gareen > Walmuge from Baardheere Gedo. New Somali samples are also expected to join this branch.
There is no scientific evidence supporting the labeling of E-Z21175 as Oromo. Referring to individuals outside of their identified lineage is disrespectful. Therefore, I urge everyone to not alienate the Somalis under E-Z21175.

Useful information for understanding the mutation rate:
(PDF) Defining a New Rate Constant for Y-Chromosome SNPs based on Full Sequencing Data (researchgate.net)
Mutation Rate Variability across Human Y-Chromosome Haplogroups | Molecular Biology and Evolution | Oxford Academic
 
Considerations Before Labeling All Samples Under the E-Z21175 Branch as Oromo

YFull uses an average mutation rate of 144.41 years, focusing only on SNPs from the combBED region for TMRCA estimation, with adjustments for each sample’s length coverage for age.
YFull age estimation methodology
Mutation Rate.png

The average mutation rate for Y chromosome SNPs used by FTDNA is between 80 and 144 years. This rate is highlighted in a video at Family Tree DNA's Big Y Block Tree starting at the 6:30 mark. Using FTDNA's mutation rate range (80 to 144 years), we can approximate the average mutation rate for Y chromosome SNPs in genealogical studies to be around 112 years (80+144)/2. However, it's important to note that the mutation rate can vary between different haplogroups and even between related branches.

Imagine a scenario 4,000 to 5,000 years before present (YBP) involving a man carrying the Z813 mutation. If this man had multiple sons, it’s possible that none or just one of his sons would develop a new mutation. The average mutation rate of 121 years suggests that it might take many years after the initial ancestor’s passing for new mutations to appear. It’s unlikely for each son to have a unique mutation used in genealogical studies.

Historically, groups of people did not subdivide based on their genetic mutations, as these changes are not visibly apparent. They likely remained as one community. Over time, some men within these groups would have sons with new mutations, while their brothers’ sons might not, although their grandkids or great-grandkids could have new and different mutations. As these groups spread and split, they did not segregate based on genetic mutations. Consequently, individuals with different mutations often remained in the same group, maintaining familial ties. This means that men with the Z21175 mutation could have stayed with their family members who did not have this specific mutation. Meanwhile, others with and without the mutation might have separated and moved to different geographical locations. This explains why we are likely to see more Oromo samples under E-Y17859 and more Somali samples under E-Z21175. I am only focusing on the Somali and Oromo samples, but I am not suggesting all the samples under Z813 are either Somali or Oromo.

If someone suggests that people who have been part of their Somali family for over a thousand years are not Somali, ask them to provide evidence to support their arguments. When their arguments are based on YFull and FTDNA branches, inquire if they understand the mutation rate for those SNPs and whether it could cause related people who stayed together for centuries to accumulate different mutations.

We know from @BluSkee and YFull profiles that two of the SDPP samples are YF095342 Howrarsame from Gedo region and YF081756 Dhulbahante from Sool region, and sample ID YF102772 belongs to Ajuuraan > Gareen > Walmuge from Baardheere Gedo. New Somali samples are also expected to join this branch.
There is no scientific evidence supporting the labeling of E-Z21175 as Oromo. Referring to individuals outside of their identified lineage is disrespectful. Therefore, I urge everyone to not alienate the Somalis under E-Z21175.

Useful information for understanding the mutation rate:
(PDF) Defining a New Rate Constant for Y-Chromosome SNPs based on Full Sequencing Data (researchgate.net)
Mutation Rate Variability across Human Y-Chromosome Haplogroups | Molecular Biology and Evolution | Oxford Academic

I agree, people are so quick to judge without understanding how mutations work or without the full amount of data.

I am still trying to understand myself how mutations work and your breakdown was helpful.

There are so many people that haven’t tested I am sure there will be more Somalis under E-Z21175. There was a Mj that tested and received the same as the Dhulbahante under that clade but hid their results because he didn’t match the other Mjs.

0C64243C-337F-4231-B90C-7ECAFE104F4F.jpeg
 
I agree, people are so quick to judge without understanding how mutations work or without the full amount of data.

I am still trying to understand myself how mutations work and your breakdown was helpful.

There are so many people that haven’t tested I am sure there will be more Somalis under E-Z21175. There was a Mj that tested and received the same as the Dhulbahante under that clade but hid their results because he didn’t match the other Mjs.

0C64243C-337F-4231-B90C-7ECAFE104F4F.jpeg

I'm not surprised that some people hide their samples. This behavior stems from the false expectations created by linking ethnicity or clan to branches. I have tried to discourage this practice, but unfortunately, some members of our community are more interested in centering themselves and designating samples in other branches as outsiders rather than finding out how we are related.
 
I'm not surprised that some people hide their samples. This behavior stems from the false expectations created by linking ethnicity or clan to branches. I have tried to discourage this practice, but unfortunately, some members of our community are more interested in centering themselves and designating samples in other branches as outsiders rather than finding out how we are related.

Yes the qabyalaad is making it’s way to Ydna studies 😂. Hopefully we find our way out of it.
 
Last edited:
I am doing Big Y-700
I believe having my sample on FTDNA will increase my chances of finding a match. More importantly, the Somali community could benefit from it as well. Future members who take the cheaper Y-DNA tests might receive a haplogroup prediction based on my Y-700 results, even if they are not a perfect match.
 
I am doing Big Y-700
I believe having my sample on FTDNA will increase my chances of finding a match. More importantly, the Somali community could benefit from it as well. Future members who take the cheaper Y-DNA tests might receive a haplogroup prediction based on my Y-700 results, even if they are not a perfect match.

That’s true many of the kits previously on FTDNA helped new members big time coming up as a potential match.

Some of the recent members have a hard time understanding why they don’t have a match at times as well. There just isn’t enough people.
 
That’s true many of the kits previously on FTDNA helped new members big time coming up as a potential match.

Some of the recent members have a hard time understanding why they don’t have a match at times as well. There just isn’t enough people.

We have over 30 samples on YFull from labs other than FTDNA. This provides a lot of STR data that could be used for matching those samples. I can create a way to use the STRs on YFull to match people on FTDNA who have taken the 37-marker Y-DNA test and above, but it will require community collaboration.
 
We have over 30 samples on YFull from labs other than FTDNA. This provides a lot of STR data that could be used for matching those samples. I can create a way to use the STRs on YFull to match people on FTDNA who have taken the 37-marker Y-DNA test and above, but it will require community collaboration.

Insha Allah! I am willing to help with what information I may have access to.
 
Insha Allah! I am willing to help with what information I may have access to.

Thanks, but it can't be just the two of us. I'll try to identify good candidates from the FTDNA project. I am now a member of the Somali DNA Project. Is that the correct project name?
 
We now have 100 known Somali samples on YFull. The sample with the Omani flag has his clan information on FTDNA as Majeerteen > Ali Saleebaan, making him eligible to be on SomaliY.com
Omani.png
 
Thanks, but it can't be just the two of us. I'll try to identify good candidates from the FTDNA project. I am now a member of the Somali DNA Project. Is that the correct project name?

For sure! We need everyone’s help!

And yes that is the project, family tree or FTDNA Somali DNA project. It’s not active but a much needed presence on FTDNA.
 
My Y-STR results are ready. However, the Y-DNA matches page is currently stuck on "Generating matches." It looks like kit number 1016309 could match me on Y-37 with a genetic distance of 4. This doesn't guarantee anything, but there is potential.
 
My Y-STR results are ready. However, the Y-DNA matches page is currently stuck on "Generating matches." It looks like kit number 1016309 could match me on Y-37 with a genetic distance of 4. This doesn't guarantee anything, but there is potential.

Finally kit number 1016309 gained his first 37 marker match and he will be more than pleased to learn it may be from his sub clan!

37 marker match is a good indicator of a clade match 90% of the time.

6A80504C-7CD5-4EE7-8102-5F63F23D2B0E.jpeg
It seems the other 37 marker Harti Abgaal on FTDNA didn’t come up for him so far though. This doesn’t necessarily mean they won’t match on deeper testing.
 
Finally kit number 1016309 gained his first 37 marker match and he will be more than pleased to learn it may be from his sub clan!

37 marker match is a good indicator of a clade match 90% of the time.

6A80504C-7CD5-4EE7-8102-5F63F23D2B0E.jpeg
It seems the other 37 marker Harti Abgaal on FTDNA didn’t come up for him so far though. This doesn’t necessarily mean they won’t match on deeper testing.

This is great! Could you ask him if he's planning to upgrade to the Big Y-700? There's a sale going on right now, but I'm not sure if it applies to upgrading from 37 to 700. There's a good possibility of a new E-Y18629 subclade.
 
This is great! Could you ask him if he's planning to upgrade to the Big Y-700? There's a sale going on right now, but I'm not sure if it applies to upgrading from 37 to 700. There's a good possibility of a new E-Y18629 subclade.

Yes it’s wonderful! I will have our connect reach out to him and get back to you insha Allah!
 
The T-L208 side of the tree looks nice.
T-L208.png
 
Back
Top