SUPPLEMENTARY MATERIAL

Differential positive selection at malaria resistance genes in three indigenous populations of Peninsular Malaysia

Xuanyao Liu, Yushimah Yunus, Dongsheng Lu, Farhang Aghakhanian, Woei-Yuh Saw, Lian Deng, Mohammad Ali, Xu Wang, Fadzilah Ghazali7, Thuhairah Abdul Rahman, Shahrul Azlin Shaari, Mohd Zaki Salleh, Maude E Phipps, Rick Twee-Hee Ong, Shuhua Xu, Yik-Ying Teo, Boon-Peng Hoh

Positive Selection at Complement Receptor 1 in Che Wong

The extended haplotype homozygosity (EHH) of the chromosome 1 region between 206.96 and 210.84 Mb, encapsulating the complement receptor (CR1) gene, at a coding SNP rs6691117 (ValIle) for the derived guanine allele, did not decay below 0.25 even after two megabases (Figure S1A). Variants in the CR1 gene have previously been reported consistently for conferring a strong protection against falciparum-driven cerebral malaria, where the guanine allele at a coding SNP rs2274567 (HisArg) conferred a 3-fold protection for the heterozygotes and up to 38-fold protection for the homozygotes (Panda et al. 2012; Rout et al. 2011); and an intronic SNP rs11118133 (HindIII A>T) was reported to confer a 5-fold and 25-fold protection to cerebral malaria for heterozygote and homozygote carriers of the thymine allele in Asian populations (Rout et al. 2011). While both rs2274567 and rs11118133 were not present in our dataset, it was observed that these two SNPs were located in a region of high linkage disequilibrium with our index SNP rs6691117 (Figure S1B), and that the linkage disequilibrium between the two guanine alleles in our index SNP rs6691117 and rs2274567 was around r2 = 0.90 for East and Southeast Asian populations with population-level sequencing data (Table S24). The guanine allele at the nonsynonymous rs6691117 had also been previously reported to increase erythrocyte sedimentation rate (ESR) in a genome-wide association study on ESR in Europeans (Kullo et al. 2011) and Sardinians (Naitza et al. 2012), and was reported with a perfect LD (r2 = 1) with rs2274567 in the Sardinians (r2 = 0.74 in the Europeans). The extended haplotype form identified by haploPS in this region for the Che Wong carried the protective guanine allele at rs6691117 (Figure S1C), but there was no evidence of long haplotypes in the Jakun and Negrito (Figure S1D).

Positive selection at TNF/LTA in Negrito

HaploPS identified strong evidence of positive selection in both the Negrito (haploPS Punadjusted = 7.47 × 10-8, Padjusted = 0.0036) and Jakun (haploPS Punadjusted = 1.20 × 10-6, Padjusted = 0.023) where the extended haplotypes encompassed both TNF and LTA genes. Previous studies have identified six polymorphisms, including five SNPs in the promoter region of TNF and one SNP in the 5’UTR, that are involved in malaria-related phenotypes (Table S24), including rs1800750 (promoter SNP in TNF, -376A>G) where the guanine allele conferred between 3- to 5-fold protection against cerebral malaria onset in Africans (Knight et al. 1999). In addition, the AGGG haplotype with SNPs rs3093662 (TNF -851), TNF -238, rs1800629 (TNF -308) and rs1800750 (TNF -376) were reported to confer a 5-fold protection against cerebral malaria in samples from Myanmar (Ubalee et al. 2001). The extended haplotype forms identified by haploPS in both Jakun and Negrito were perfectly identical across overlapping sites (Haplotype Similarity Index, HSI = 1.00) and carried exactly the AGG configuration on rs3093662, rs1800629 and rs1800750 (Figure S2). The identity between the extended haplotype forms from Jakun and Negrito suggests that if malaria were responsible for driving the evolutionary adaptation, this would have happened prior to the split of the two populations.

Table S24. The LD between rs2274567, rs11118133 and index SNP rs6691117 in East and Southeast Asian populations with population-level sequencing data

rs2274567 / rs11118133
1KG CHB / 0.883519 / 0.883519
1KG CHS / 0.94152 / 0.94152
1KG JPT / 0.929663 / 0.964881
SSM / 0.928828 / 0.951227

Table S24. SNPs in TNF previously reported to be associated with malaria-related phenotypes.

SNP / Function / Risk / Protective / Phenotype / OR
rs1800750
(TNF 376) / Promoter / A / G / Cerebral malaria / Gambian 4.3 (1.5-12.8)
Kenyan 4.6 (1.3-15.7)1
rs1800629
(TNF 308) / Promoter / A / G / Severe malaria(SM) and cerebral malaria (CM) / Gambian (SM)
1.1(1.01-1.23)3
Gambian (CM)
1.29 (1.1-12.4)1
rs1800630
(TNF 863) / Promoter / A / C / Susceptibility to severe malaria / Indian 1.87 (1.03-3.4)2
rs1799964
(TNF 1031) / Promoter / C / T / Susceptibility to severe malaria / Indian 1.84 (1.01-3.38)2
rs3093662
(TNF 851) / Intron / G / A / Haplotype association with SM / 3
rs361525
(TNF 238) / Promoter / A / G / Haplotype association with SM / 3

1 Knight JC, Udalova I, Hill AV, Greenwood BM, Peshu N, Marsh K, Kwiatkowski D: A polymorphism that affects OCT-1 binding to the TNF promoter region is associated with severe malaria. Nat Genet 1999, 22(2):145-150.

2 Sinha S, Mishra SK, Sharma S, Patibandla PK, Mallick PK, Sharma SK, Mohanty S, Pati SS, Mishra SK, Ramteke BK, Bhatt R, Joshi H, Dash AP, Ahuja RC, Awasthi S, Indian Genome Variation Consortium, Venkatesh V, Habib S: Polymorphisms of TNF-enhancer and gene for FcgammaRIIa correlate with the severity of falciparum malaria in the ethnically diverse Indian population. Malar J 2008, 7:13.

3 Clark TG, Diakite M, Auburn S, Campino S, Fry AE, Green A, Richardson A, Small K, Teo YY, Wilson J, Jallow M, Sisay-Joof F, Pinder M, Griffiths MJ, Peshu N, Williams TN, Marsh K, Molyneux ME, Taylor TE, Rockett KA, Kwiatkowski DP: Tumor necrosis factor and lymphotoxin-alpha polymorphisms and severe malaria in African populations. J Infect Dis 2009, 199(4):569-575

Figure S1. Positive selection at CR1 in the Che Wong

(A) Decay of the extended haplotype homozygosity (EHH) for both the ancestral allele (blue dashed line) and derived allele (red solid line) around the focal SNP rs6691117. (B) Pairwise linkage disequilibrium (LD) between 21 SNPs in the CR1 gene region. The numbers in the squares represent the LD values between every possible pairs of SNPs as measured by r2. The target SNP rs6691117 is indicated by the red box, while the green and blue arrows represent the locations of rs2274567 and rs11118133 respectively. (C) Identifying the selected haplotype form surrounding CR1, where the extended haplotype form identified by haploPS is illustrated (bottom panel). The specific section that overlapped with CR1 is further zoomed into (top panel), and rs6691117 is indicated by the red line. (D) The longest haplotype forms identified by haploPS in Che Wong (red), Jakun (blue) and Negrito (green), between a haplotype frequency of 5% to 65%. At frequencies below 30%, the longest haplotype forms in Che Wong are significantly longer than those present in the Jakun and Negrito, which agree with the identification of a positive selection signal by multiple selection metrics in the Che Wong but not in the Jakun and Negrito.

Figure S2. Positive selection at TNF/LTA in Negrito. Extended haplotype forms around TNF in both the Jakun and the Negrito. The zoomed-in region illustrates the specific allelic variants reported to confer malaria protection.

REFERENCES

1Kwiatkowski, D. P. (2005). How malaria has affected the human genome and what human genetics can teach us about malaria. American Journal of Human Genetics, 77(2), 171-192. doi:10.1086/432519

3Auburn, S., Diakite, M., Fry, A. E., Ghansah, A., Campino, S., & Richardson, A., et al. (2008). Association of the GNAS locus with severe malaria. Human Genetics, 124(5), 499-506. doi:10.1007/s00439-008-0575-8

4.Pozzoli, U., Fumagalli, M., Cagliani, R, Comi G.P., Nereo B., et al. (2010) The role of protozoa-driven selection in shaping human genetic variability. Trends in Genetics, 26(3), 95-99

5.Schuldt, K., Kretz, C.C., Timmann, C., Sievertsen, J., Ehmen, C., Esser, C., et al. (2011) A -436C>A polymorphism in the human FAS gene promoter associated with severe childhood malaria. PLOS Genet 7(5):e1002066.doi:10.1371/journal.pgen.1002066

6.Malaria Genomic Epidemiology Network. (2014) Reappraisal of known malaria resistance loci in a large multicenter study: doi:10.1038/ng.2107

7.Band, G., Le, Q.S., Jostin, L., Pirinen, M., Kivinen, K., Jallow, M., et al. (2012). Imputation-Based Meta-Analysis of Severe Malaria in Three African Populations. PLOS Genet 9(5): e1003509. doi:10.1371/journal.pgen.1003509

8.Fumagalli M and Sironi M (2014) Human genome variability, natural selection and infectious diseases. Current Opinion in Immunology 30: 9-16.

9.Panda AK, Panda M, Tripathy R, Pattanaik SS, Ravindran B, Das BK (2012) Complement receptor 1 variants confer protection from severe malaria in Odisha, India. PLoS One 7: e49420. doi: 10.1371/journal.pone.0049420

10.Rout R, Dhangadamajhi G, Mohapatra BN, Kar SK, Ranjit M (2011) High CR1 level and related polymorphic variants are associated with cerebral malaria in eastern-India. Infect Genet Evol 11: 139-44. doi: 10.1016/j.meegid.2010.09.009

11.Kullo IJ, Ding K, Shameer K, McCarty CA, Jarvik GP, Denny JC, Ritchie MD, Ye Z, Crosslin DR, Chisholm RL, Manolio TA, Chute CG (2011) Complement receptor 1 gene variants are associated with erythrocyte sedimentation rate. Am J Hum Genet 89: 131-8. doi: S0002-9297(11)00212-6 [pii]10.1016/j.ajhg.2011.05.019

12.Naitza S, Porcu E, Steri M, Taub DD, Mulas A, Xiao X, Strait J, Dei M, Lai S, Busonero F, Maschio A, Usala G, Zoledziewska M, Sidore C, Zara I, Pitzalis M, Loi A, Virdis F, Piras R, Deidda F, Whalen MB, Crisponi L, Concas A, Podda C, Uzzau S, Scheet P, Longo DL, Lakatta E, Abecasis GR, Cao A, Schlessinger D, Uda M, Sanna S, Cucca F (2012) A genome-wide association scan on the levels of markers of inflammation in Sardinians reveals associations that underpin its complex regulation. PLoS Genet 8: e1002480. doi: 10.1371/journal.pgen.1002480

13.Knight JC, Udalova I, Hill AV, Greenwood BM, Peshu N, Marsh K, Kwiatkowski D (1999) A polymorphism that affects OCT-1 binding to the TNF promoter region is associated with severe malaria. Nat Genet 22: 145-50. doi: 10.1038/9649

14.Ubalee R, Suzuki F, Kikuchi M, Tasanor O, Wattanagoon Y, Ruangweerayut R, Na-Bangchang K, Karbwang J, Kimura A, Itoh K, Kanda T, Hirayama K (2001) Strong association of a tumor necrosis factor-alpha promoter allele with cerebral malaria in Myanmar. Tissue Antigens 58: 407-10.

15.Timmann C, Thye T, Vens M, Evans J, May J, Ehmen C, Sievertsen J, Muntau B, Ruge G, Loag W, Ansong D, Antwi S, Asafo-Adjei E, Nguah SB, Kwakye KO, Akoto AOY, Sylverken J, Brendel M, Schuldt K, Loley C, Franke A, Meyer CG, Agbenyega T, Ziegler A, Rolf D. Horstmann RD (2012) Genome-wide association study indicates two novel resistance loci for severe malaria. Nature 489: 443-446