Haplogroup H (Y-DNA)

Source: Wikipedia, the free encyclopedia.

Haplogroup H (Y-DNA)
Haplogroup H map
Possible time of origin~48,500 ybp
Possible place of originSouth Asia or West Asia[1] or Southern Central Asia[2][3]
AncestorHIJK
DescendantsH1 (L902/M3061);
H2 (P96);
H3 (Z5857)
Defining mutationsL901/M2939
Highest frequenciesSouth Asians and Roma

Haplogroup H (Y-DNA), also known as H-L901/M2939, is a

Y-chromosome haplogroup
.

The primary branch H1 (H-M69) and its subclades is one of the most predominant haplogroups amongst populations in South Asia, particularly its descendant H1a1 (M52). A primary branch of H-M52, H1a1a (H-M82), is found commonly among the Roma, who originated in South Asia and migrated into the Middle East and Europe, around the beginning of the 2nd millennium CE, and the Khmer people who got under influence from Indian populations.[4] The much rarer primary branch H3 (Z5857) is also concentrated in South Asia.

However, the primary branch H2 (P96) seems to have been found in sparse levels primarily in Europe and West Asia since prehistory. It has been found in remains of the Pre-Pottery Neolithic B (PPNB), which is part of the Pre-Pottery Neolithic, a Neolithic culture centered in upper Mesopotamia and the Levant, dating to c. 10,800 – c. 8,500 years ago, and also the later Linear Pottery culture and Neolithic Iberia.[5][6] H2 likely entered Europe during the Neolithic with the spread of agriculture.[6][7] Its present distribution is made up of various individual cases spread out throughout Europe and West Asia today.[8]

Structure

H-L901/M2939 is a direct descendant of Haplogroup GHIJK. There are, in turn, three direct descendants of H-L901/M2939 – their defining SNPs are as follows:

  • H1 (L902/M3061)
    • H1a previously haplogroup H1 (M69/Page45, M370)
    • H1b B108, Z34961, Z34962, Z34963, Z34964
  • H2 previously haplogroup F3,[9] (P96, L279, L281, L284, L285, L286, M282)
    • H2a FGC29299/Z19067
    • H2b Z41290
    • H2c Y21618, Z19080
  • H3 (Z5857)
    • H3a (Z5866)
    • H3b (Z13871)


Ancient distribution

H-L901/M2939 is believed to have split from HIJK 48,500 years before present.[10] It seems to represent the main Y-Chromosome haplogroup of the paleolithic inhabitants of South Asia. Possible site of introduction may be South Asia, since it is highly concentrated there.[11]

H1a

Shahr-i Sokhta and Gonur sites

Gonur, Turkmenistan. H1a1d2 - Bronze Age, 3200-1900 BCE. [12] [13]

H1a ANCIENT SAMPLES
Sample ID Location Radiocarbon Age Y-DNA
I11459 Shahr-i Sokhta, Iran 2875-2631 calBCE H1a1d2
I10409 Gonur, Turkmenistan 2280-2044 calBCE H1a1d2


Gogdara and Barikot sites

With limited ancient DNA testing in South Asia, accordingly there is a limited amount of ancient samples for H1a, despite it being a populous and well distributed haplogroup today. The first set of ancient DNA from South Asia was published in March 2018.[14] 65 samples were collected from the Swat Valley of northern Pakistan, 2 of which belonged to H1a.[14]

H1a ANCIENT SAMPLES
Date Subclade Location Country Culture Accompanying haplogroups Source
1100-900 BC H1a1 Gogdara, Swat Valley Pakistan Udegram Iron Age E1b1b1b2, E1b1b1b2a [14]
1000-800 BC H1a1 Barikot, Swat Valley Pakistan Barikot Iron Age [14]

H2

The earliest sample of H2 is found in the Pre-Pottery Neolithic B culture of the Levant 10,000 years ago.[15] From ancient samples, it is clear that H2 also has a strong association with the spread of agriculture from Anatolia into Europe, and is commonly found with haplogroup G2a.[16] H2 was found in Neolithic Anatolia, as well as in multiple later Neolithic cultures of Europe, such as the Vinča culture in Serbia,[17] and the Megalith culture of Western Europe.[17]

The 2021 study "Using Y-chromosome capture enrichment to resolve haplogroup H2 shows new evidence for a two-path Neolithic expansion to Western Europe"[7] found that while H2 is less than 0.2% in modern-day western European populations it was more common during the Neolithic, between 1.5 and 9%. They identified two major clades H2m and H2d. With respect to the current ISOGG nomenclature, H2m appears to be defined by a mix of H2, H2a, H2a1 and H2c1a SNPs while H2d appears to be defined by two H2b1 SNPs, and four additional SNPs which were previously undetected. They estimated TMRCA for H2d and H2m was  ~15.4 kya with H2m and H2d estimated TMRCAs of  ~11.8 and  ~11.9 kya respectively. H2 diversity probably existed in Near-Eastern hunter-gatherers and early farmers, and subsequently spread via the Neolithic expansion into Central and Western Europe. H2d was found along the inland/Danubian route into central Europe, but most H2m individuals are found along the Mediterranean route into Western Europe, the Iberian Peninsula and ultimately, Ireland.

There were also two occurrences of H2a found in the Neolithic Linkardstown burials in the southeast Ireland.[18] More Neolithic H2 samples have been found in Germany and France.[19]

H2 ancient samples
Date Location Country Culture Accompanying haplogroups Source
7300-6750 BC Motza Israel Levantine Pre-Pottery Neolithic B E1b1b1b2, T1a1, T1a2a (PPNB from Jordan) [15]
6500-6200 BC Barcin site, Yenişehir Valley Turkey Anatolian Neolithic G2a, I2C, C1a, J2a [20]
6500-6200 BC Barcin site, Yenişehir Valley Turkey Anatolian Neolithic G2a, I2C, C1a, J2a [20]
5832–5667 BC Старчево Serbia Vinča G2a [17]
5710–5662 BC Tell Kurdu, Amik Valley Turkey Anatolian Neolithic J1a2a, G2a2 [21]
5702–5536 BC Старчево Serbia Vinča G2a [17]
5400–5000 BC Szemely Hungary Vinča G2a2a, G2a2b2a1a [17]
3900–3600 BC La Mina site, Soria Spain
Megalithic
I2a2a1 [17]
3500–2500 BC Monte San Biagio, Latium Italy Rinaldone culture/Gaudo culture [22]
3925–3715 BC Arslantepe Turkey Early Bronze Age J2a1a1a2b2a, J1a2b1, E1b1b1b2a1a1, G2a2b1, J2a1a1a2b1b, R1b1a2 [21]
3366–3146 BC Arslantepe Turkey Early Bronze Age J2a1a1a2b2a, J1a2b1, E1b1b1b2a1a1, G2a2b1, J2a1a1a2b1b, R1b1a2 [21]
3336–3028 BC Dzhulyunitsa Bulgaria Bulgarian Bronze Age G2a2a1a2 [23]
2899–2678 BC El Portalon cave Spain Pre-
Bell Beaker
I2a2a [5]
2470–2060 BC Budapest-Bekasmegyer Hungary Kurgan Bell Beaker R1b1a1a2a1a2b1 [24]
1881–1700 BC Alalakh Turkey Levantine Middle Bronze Age II J1a2a1a2, J2b2, T1a1, L2-L595, J2a1a1a2b2a1b [25]
550–332 BC Beirut Lebanon Iron Age III Achaemenid period G2a2a1a2, G2a2b1a2, J1a2a1a2, I2a1b, Q1b [26]

Modern distribution

H1a

South Asia

H-M69 is common among populations of Bangladesh, India, Sri Lanka, Nepal and Pakistan, with lower frequency in Afghanistan.

Dhaka, Bangladesh (35.71%),[30] and H-M52 among Kalash (20.5%) in Pakistan.[31][28]

Haplogroup H is typically found among Indo-Aryan, Dravidian and Tribal (Indian as well as Pakistani Kalash) populations in the Indian subcontinent. In Europe it is mostly found among Roma, who belong predominantly (between 7% and 50%) to the H1a (M82) subclade.

Haplogroup H-M69 has been found in:

  • Bangladesh - 35.71% (15/42) in samples from Dhaka,[30] and 17.72% (115/649) among Bengali samples from various areas of the country.[32]
  • India - Divided into zones below:
    • South India – 27.2% (110/405) of a sample of unspecified ethnic composition.[33][34] Halpogroup H found among Dravidian tribal groups in highest frequency. Koraga people carry 87% and Koya tribe have 70% halpogroup H.[27][28] Gondi people carry around 62% of halpogroup H.[35] Another study has found haplogroup H-M69 in 26.4% (192/728) of an ethnically diverse pool of samples from various regions of India.[4]
    • West India - In Maharashtra - 33.3% of a sample of (68/204)[36] and in Gujarat, 20.69% (12/58) among Gujaratis in USA.[30] 13.8% (4/29) among unspesified Gujaratis in India,[37] 26% (13/50)among Dawoodi Bohra,[38] 27.27 (6/22) among Bhils,[39] 1.56% among Gujarati Brahmins[39]
    • North India - According to a study (Trivedi2007) it is found 24.5% (44/180) in both Caste and tribal population of North India. Most frequently found : 44.4% (8/18) among Uttar Pradesh (UP) Chamar,[4] 20.7% (6/29) among UP Rajputs,[4] 18.3% (9/49) among New Delhi Hindus,[40] 16.13% (5/31) among UP Brahmins,[39] 11.11% (6/54) among UP Kols,[39] 10.53% (2/19) among Himachal Brahmin,[39] 10.2% (5/49) in J&K Kashmiri Gujars,[39] 9.8% (5/51) in J&K Kashmiri Pandits,[39]
    • West Bengal Brahmins.[4]
  • Sri Lanka – in 25.3% (23/91) of a sample of unspecified ethnic composition[33][34] and in 10.3% (4/39) of a sample of Sinhalese.[28]
  • Newars (4/66 H-M82).[41] In another study, Y-DNA that belongs to Haplogroup H-M69 has been found in 25.7% (5/37 = 13.5% H-M69 from a village in Morang District, 9/57 = 15.8% H-M69 from a village in Chitwan District, and 30/77 = 39.0% H-M69 from another village in Chitwan District) of Tharus in Nepal.[42]
  • Hunza), including 5% (2/38) H-M82(xM36, M97, M39/M138) and 3% (1/38) H-M36.[43]

Roma people

Haplogroup H-M82 is a major lineage cluster in the Roma, especially

Balkan Roma, among whom it accounts for approximately as high as 60% of males.[44] A 2-bp deletion at M82 locus defining this haplogroup was also reported in one-third of males from traditional Roma populations living in Bulgaria, Spain, and Lithuania.[45]
High prevalence of Asian-specific Y chromosome haplogroup H-M82 supports their Indian origin and a hypothesis of a small number of founders diverging from a single ethnic group in India (Gresham et al. 2001).

Important studies show a limited introgression of the typical Roma Y-chromosome haplogroup H1 in several European groups, including approximately 0.61% in Gheg Albanians and 2.48% in Tosk Albanians.[46]

H1a in Roma populations
Population n/Sample size Percentage Source
Bulgarian Roma 98/248 39.5 [45]
Hungarian Roma 34/107 31.8 [47]
Kosovar Roma 25/42 59.5 [48]
Lithuanian Roma 10/20 50 [45]
Macedonian Roma 34/57 59.6 [44]
Portuguese Roma 21/126 16.7 [49]
Serbian Roma 16/46 34.8 [48]
Slovakian Roma 19/62 30.65 [47]
Spanish Roma 5/27 18.5 [45]

Europe, Caucasus, Central Asia & Middle East

Haplogroup H1a is found at much lower levels outside of the Indian subcontinent and the Roma populations but is still present in other populations:

East & South-East Asia

At the easternmost extent of its distribution, Haplogroup H-M69 has been found in

Chams from Binh Thuan, Vietnam (1/59 = 1.7% H-M69),[59] and Mongolians (1/149 = 0.7% H-M69).[33] The subclade H-M39/M138 has been observed in the vicinity of Cambodia, including one instance in a sample of six Cambodians[4] and one instance in a sample of 18 individuals from Cambodia and Laos.[43] A genome study about Khmer people resulted in an average amount of 16,5% of Khmer belonging to y-DNA H.[4]

H1b

H1b is defined by the SNPs - B108, Z34961, Z34962, Z34963, and Z34964.[63] Only discovered in 2015, H1b was detected in a single sample from an individual in Myanmar.[64] Due to only being classified recently, there are currently no studies recording H1b in modern populations.

H2

H2 (H-P96), which is defined by seven SNPs – P96, M282, L279, L281, L284, L285, and L286 – is the only primary branch found mainly outside South Asia.[63] Formerly named F3, H2 was reclassified as belonging to haplogroup H due to sharing the marker M3035 with H1.[65] While being found in numerous ancient samples, H2 has only been found scarcely in modern populations across West Eurasia.[5]

H2 in modern populations
Region Population n/Sample size Percentage Source
Central Asia Dolan 1/76 1.3 [66]
West Asia
UAE
1/164 0.6 [67]
West Asia South Iran 2/117 1.7 [68]
West Asia Assyrian 1/181 0.5 [69]
West Asia Armenia 5/900 0.6 [70]
Southern Europe Sardinia 2/1194 0.2 [71]

H3

H3 (Z5857) like H1, is also mostly centered in South Asia, albeit at much lower frequencies.[64]

Like other branches of H, due to it being newly classified it is not explicitly found in modern population studies. Samples belonging to H3 were likely labeled under F*.[64] In consumer testing, it has been found principally among South Indians and Sri Lankans, and other areas of Asia such as Arabia as well.[10]


The following gives a summary of most of the studies which specifically tested for the subclades H1a1a (H-M82) and H2 (H-P96), formerly F3, showing its distribution in different part of the world.[72]

Continent/subcontinental region Country &/or ethnicity Sample size H1a1a (M82) freq. (%) Source
East/Southeast Asia Cambodia 6 16.67 Sengupta et al. 2006
East/Southeast Asia Cambodia/Laos 18 5.56 Underhill et al. 2000
South Asia Nepal 188 4.25 Gayden et al. 2007
South Asia Afghanistan 204 3.43 Haber et al. 2012
South Asia Malaysian Indians 301 18.94 Pamjav et al. 2011
South Asia Terai-Nepal 197 10.66 Fornarino et al. 2009
South Asia Hindu New Delhi 49 10.2 Fornarino et al. 2009
South Asia Andhra Pradesh Tribals 29 27.6 Fornarino et al. 2009
South Asia Chenchu Tribe India 41 36.6 Kivisild et al. 2003
South Asia Koya Tribe India 41 70.7 Kivisild et al. 2003
South Asia West Bengal India 31 9.6 Kivisild et al. 2003
South Asia Konkanastha Brahmin India 43 9.3 Kivisild et al. 2003
South Asia Gujarat India 29 13.8 Kivisild et al. 2003
South Asia Lambadi India 35 8.6 Kivisild et al. 2003
South Asia Punjab India 66 4.5 Kivisild et al. 2003
South Asia Sinhalese Sri Lanka 39 10.3 Kivisild et al. 2003
South Asia Northwest India 842 14.49 Rai et al.2012
South Asia South India 1845 20.05 Rai et al.2012
South Asia Central India 863 14.83 Rai et al.2012
South Asia North India 622 13.99 Rai et al.2012
South Asia East India 1706 8.44 Rai et al.2012
South Asia West India 501 17.17 Rai et al.2012
South Asia Northeast India 1090 0.18 Rai et al.2012
South Asia Andaman Island 20 0 Thangaraj et al. 2003
Middle East and North Africa Saudi Arabia 157 0.64 Abu-Amero et al. 2009
Middle East and North Africa Turkish 523 0.19 Cinnioglu et al. 2004
Middle East and North Africa Iran 150 2 Abu-Amero et al. 2009
Middle East and North Africa Iran 938 1.2 Grugni et al. 2012

See also

References

  1. PMID 29410676
    .
  2. ^ .
  3. .
  4. ^ .
  5. ^ .
  6. ^ .
  7. ^ .
  8. ^ St Clairl M (March 2018). "Haplogroup H-M2713" (PDF). St. Clair Database.
  9. .
  10. ^ a b "H YTree". YFull.
  11. PMID 35046511
    .
  12. .
  13. ^ "Indus Valley DNA". 2 May 2022.
  14. ^ .
  15. ^ .
  16. .
  17. ^ .
  18. .
  19. .
  20. ^ .
  21. ^ .
  22. .
  23. .
  24. .
  25. .
  26. ^ Haber, Marc. "A Genetic History of the Near East from an aDNA Time Course Sampling Eight Points in the Past 4,000 Years". Cell.
  27. ^
    PMID 14761656
    .
  28. ^ .
  29. .
  30. ^ .
  31. ^ .
  32. ^ Mahmud H (November 2019). GENETIC DIVERSITY AMONG BANGALI AND SEVEN ETHNIC GROUPS OF BANGLADESH BASED ON Y-CHROMOSOME (PDF) (Ph.D. thesis). Dhaka, Bangladesh: Dhaka University. pp. 100–05.
  33. ^
    PMID 16328082
    .
  34. ^ .
  35. .
  36. .
  37. .
  38. .
  39. ^ .
  40. ^ .
  41. ^ .
  42. .
  43. ^ .
  44. ^ .
  45. ^ .
  46. .
  47. ^ .
  48. ^ .
  49. .
  50. ^ .
  51. ^ .
  52. ^ a b c d e Lu Yan (2011), "Genetic Mixture of Populations in Western China." Shanghai: Fudan University, 2011: 1-84. (Doctoral dissertation in Chinese: 陆艳, “中国西部人群的遗传混合”, 上海:复旦大学,2011: 1-84.)
  53. ^ a b c LIU Shuhu, NIZAM Yilihamu, RABIYAMU Bake, ABDUKERAM Bupatima, and DOLKUN Matyusup, "A study of genetic diversity of three isolated populations in Xinjiang using Y-SNP." Acta Anthropologica Sinica, 2018, 37(1): 146-156.
  54. S2CID 7017701
    .
  55. ^ .
  56. .
  57. .
  58. .
  59. ^ .
  60. .
  61. ^ .
  62. .
  63. ^ a b International Society of Genetic Genealogy (25 May 2016). "Y-DNA Haplogroup Tree 2016 Version: 11.144".
  64. ^ (PDF).
  65. .
  66. ^ Liu S, Nizam Y, Rabiyamu B, Abdukeram B, Dolkun M (2018). "A study of genetic diversity of three isolated populations in Xinjiang using Y-SNP". Acta Anthropologica Sinica.
  67. .
  68. .
  69. ^ "Assyrian". FamilyTreeDNA. Gene by Gene, Ltd.
  70. ^ "Armenian DNA Project". FamilyTreeDNA. Gene by Gene, Ltd.
  71. PMID 23908240
    .
  72. .

External links