- Open Access
Genetic risk score (GRS) constructed from polymorphisms in the PON1, IL-6, ITGB3, and ALDH2 genes is associated with the risk of coronary artery disease in Pakistani subjects
Lipids in Health and Diseasevolume 17, Article number: 224 (2018)
Coronary artery disease (CAD) is a major killer in today’s world. Pakistan is also affected by this non-communicable disease like other countries. It is a multifactorial disease and is influenced by many gene-gene and gene-environment interactions.
A total of 623 (219 controls, 404 cases) Pakistani subjects were genotyped for four SNPs, rs662 (PON1), rs5918 (ITGB3), rs671 (ALDH2), rs1800795 (IL-6) by PCR-RFLP. Various anthropometric parameters were noted and serum lipid profile was measured using commercially available kits. Statistical analysis was done by SPSS version 22. A Genetic Risk Score (GRS) was calculated from individual SNPs. The association of the SNPs and the GRS with CAD was checked using logistic regression.
The results showed that the risk allele frequencies of all variants were higher in the cases than the controls, however the difference was not statistically significant association (p > 0.0125). The mean GRS in the controls was 3.99 ± 1.42 and in cases, it was 4.29 ± 1.39, the difference between the groups was significant (p = 0.0109). logistic regression of individual SNPs and GRS with the CAD showed that independent SNPs were not significantly associated with the CAD however, the GRS had a strong association (p = 1.4 × 10− 4). The subjects were divided into three groups based on GRS (Gp 1 with GRS 0–2, Gp 2 with GRS 3–5 and Gp 3 with GRS 6–8). The analysis of the effect of the individual SNPs and GRS groups on different lipid profile parameters revealed no significant association of any of the tested SNPs with any lipid parameter, however, the GRS groups showed marginally significant for TC and highly significant association for TG, LDL-c and HDL-c.
In conclusion, use of a GRS can provide better information than individual SNPs. The larger the number of the SNPs included in the analysis, the better would be the risk prediction.
Cardiovascular diseases (CVDs) are disorders in the two systems, the heart and the circulatory system. Among the group of cardiovascular disorders, coronary artery disease (CAD) is the most frequent . The constriction of blood vessels due to atherosclerosis, leads to the poor blood supply to the heart . In the developing countries its prevalence ranked the highest . The increasing prevalence of coronary heart disease (CHD) in the South Asian countries poses a threat and huge burden to healthcare. The major reason of this increase was the adoption of modern lifestyle which also increases the risk of metabolic syndrome . According to the World Health Organization (WHO) report from Global Burden of Disease (GBD), cardiovascular diseases are responsible for 31% of all global deaths. The mortality rate due to coronary artery disease alone in the year 2012 was 7.4 million while the total deaths due to cardiovascular disease was 17.5 million . In Pakistan, one in every four adults suffered from coronary artery disease .
Coronary artery disease is a multifactorial disorder involving a complex interaction between environmental and genetic factors . The modifiable and non modifiable risk factors are two broad categories of conventional risk factors . Smoking, obesity, diabetes mellitus, hypertension, dyslipidemia, stress, depression and sedentary lifestyle are the modifiable risk factors while gender, age and family history are the non-modifiable risk factors . We selected four SNPs from different genes. The rs662 SNP is located in exon 6 of paraoxanase 1 (PON1) gene which results in the substitution of A to G (arginine (R) at the place of glutamine (Q) in the protein) . This single nucleotide polymorphism affects the catalytic activity for different substrates hydrolysis . The change in catalytic activity is substrate dependent as Q allele hydrolyzes soman, diazoxon and sarin rapidly while R allele hydrolyzes paraoxon more efficiently . The R allele carriers are more susceptible to cardiovascular disorders as Q192 is more effective in inhibiting oxidation of low density lipoprotein cholesterol as compared to R192 isoform . The second selected SNP is rs5918 from integrin beta 3 (ITGB3) gene, present on chromosome 17 long (q) arm at 21.32 position. The product of ITGB3 gene is a protein known as integrin beta III. It is also known as platelet glycoprotein IIIa, GP3A, GPIIIa or antigen CD61. A single base change in the exon 2 of this gene (C to T) results substitution of leucine (PlA1) amino acid for proline amino acid (PlA2) . This polymorphism results in the different spatial orientation and conformational change in protein in fibrinogen-binding region . It is suggested that this polymorphism has an important role in the progression coronary artery disease and coronary thrombosis  because a key event in acute coronary syndrome is the platelet aggregation with thrombus formation . The third SNP was rs671 in the aldehyde dehydrogenase 2 (ALDH2) gene located on chromosome 12q24.2 . Enzyme activity of ALDH2 reduces due to rs671 (G > A) polymorphism i.e. glutamate-to-lysine amino acid substitution at the protein level (also known as Glu504Lys) . The G allele encodes a functional ALDH2 enzyme needed for aldehyde detoxification, but substituted A allele makes a non-functional isozyme . The fourth selected SNP was from the promotor region of interleukin 6 (IL-6) gene and involves a change of guanine to cytosine, at position − 174. The encoded protein is important in inflammation resulting in increased oxidative stress inside the coronary arteries. These SNPs were selected for analysis in Pakistani subjects because 1) Pakistani population represents a unique ethnic group which allows the study of concentrated risk genetic markers 2) the selected SNPs have been reported to modulate serum lipids in Caucasians therefore their analysis in Pakistan can provide the information on the relationship of these genetic variants with lipids and 3) these SNPs have not been previously investigated in the Pakistani population and the current study is the first report of their investigation in our population. We therefore, aimed to genotype them to find their association pattern with CAD in our cohort and to compare the association of single SNPs and their cumulative genetic risk score.
For the current study, a total of 623 subjects (219 controls, 404 CAD cases) were recruited. The recruitment, inclusion and exclusion criteria have been described in detail earlier . The patients were angiographically confirmed CAD caes with stenosis of at least one major coronary vessel (50% 0r more of diameter) and diagnosed by cardiologists using biochemical markers like CK-MB, troponin T/I data, echocardiography, ECG, and radiological investigation. The patients were not using lipid-lowering and antihypertensive drugs. Only newly diagnosed cases were included in the study. The controls without any history of CAD at least in first-degree relative were selected . Exclusion criteria for cases were clinical diagnosis of cardiomyopathy, coagulopathy, collagenoses, presence of inflammatory and autoimmune diseases and acute poisoning and for controls were the presence of symptoms of CAD, myocardial infarction (MI), stroke, diabetes mellitus, inflammatory and autoimmune diseases and a familial history of cardiovascular diseases. Serum screening and any infectious blood sample like HIV, hepatitis B and C were not included. Study subjects below 40 years were also excluded. Ethical protocols were strictly followed included all procedures which were in compliance with the declaration of Helsinki and approval was obtained from the institutional ethical committee (Ethical Committee, School of Biological Sciences, University of the Punjab, Lahore, Pakistan).
For each study participant gender, age and smoking habit were recorded. The prevalence of comorbidities was noted for the cases and controls. Height (m) and weight (Kg) were measured and body mass index (BMI) in Kg/m2 was calculated for every study participant.
Venous blood was taken from the median cubital vein by using aseptic technique. 5 ml of blood sample was taken which was divided into two parts. In one part EDTA was added to prevent clotting of blood while the rest was poured in yellow vials with gel clot activator to accelerate blood clotting. The blood in EDTA vials was used for DNA isolation and stored at 4 °C while blood in gel vial was used for obtaining serum to be used for the determination of various biochemical parameters. Serum was separated from blood cells by centrifugation of gel activator vials at 14,000 rpm for 10 min and stored in sterilized eppendorf. Prior to used serum for determination of biochemical parameters, serum was screened for any infectious disease. Serum samples were screened by using a one-step device Accu-chek ® for hepatitis b (HBV), hepatitis c (HCV) and human immunodeficiency virus (HIV) infection.
Biochemical parameters determination
Serum lipid profile parameters including total cholesterol (TC), triglycerides (TG), low-density lipoprotein cholesterol (LDL-C) and high-density lipoprotein cholesterol (HDLC) were measured by using commercially variable kits (Spectrum Diagnostics, Obour City, Egypt). Determination of all optical density measurements was done by Epoch microplate reader (Biotek Instruments, Highlands Park, USA).
The salting out manual method was used for the extraction of genomic DNA from the peripheral white blood cells (WBC). The results were analyzed on 1% agarose gel. After isolation, all DNA samples were quantified before amplification using Epoch Biotek micro-plate reader (Biotek Instruments, Highlands Park, USA) and were standardized to the final working concentration of 5 ng/μl was made. For PONI, ITGB3 and ALDH2 polymorphisms, PCR-RFLP method was employed and for IL-6 SNP, tetra-ARMS PCR was used. The sequences of the primers used for amplification are as follows: rs662, forward 5’-TATTGTTGCTGTGGGACCTGAG-3′ and reverse 5’-CCTGAGAATCTGAGTAAATCCACT-3′ (product size 238 bp, restriction enzyme, AlwI); for rs5918, forward: 5’-GGATTATCCCAGGAAAGACCAC-3′, reverse: 5’-GACTTCCTCCTCAGACCTCCAC-3′ (product size 424 bp, restriction enzyme, MspI); for rs671, forward 5’-CCTGGGCAACAGAGAAAGAT-3′, and reverse 5’-AAACACTGATGGCCTCAAGC-3′ (product size 512 bp, restriction enzyme, HincII); for rs1800795 forward outer ACCTGGAGACGCCTTGAAGTAACT, reverse outer AAACCAAAGATGTTCTGAACTGAGT, forward inner GCCAGGCAGTCTACAACAGGCC, reverse inner GTGTTCTGGCTCTCCCTGTGTGC (outer product 186 bp, product for C allele 144 bp, product for G allele 86 bp). The PCR mixtures (25 μL) consisted of 1X Taq buffer ([NH4]2SO4), 2.5 mM MgCl2, dNTP’s mix 200 μM, Taq polymerase (Fermentas) 0.25 U/μl, forward and reverse primers 10 μM and 5 ng/μl DNA template. The PCR program consisted of initial denaturation at 95 °C for 2 min, followed by 35 cycles of denaturation (94 °C for 1 min, annealing at 65 °C for 1 min, extension at 72 °C for 2 min) and a final extension at 72 °C for 5 min. Restriction digestion reaction was optimized for amplified product concentration, duration of digestion and enzyme concentration per reaction volume. The reaction mixture for RFLP (15 μl) consisted of PCR product 10 μl, 10X buffers 1.5 μl, 0.3 U/ μl and 3.2 μl of nuclease-free water. RFLP products were seen by running 2% agarose gel electrophoresis and observing the gel under U.V transilluminator or Gel Doc system.
Microsoft Excel and statistical Package for the Social Sciences (SPSS, IBM statistics version 22) were used for the statistical analysis. Allele and genotype frequencies were calculated for each SNP and the study population was tested for Hardy Weinberg Equilibrium (HWE). The significance of difference of allele and genotype frequencies among cases and controls was checked by chi-squared test. Independent t-test was used to test the difference in mean values of quantitative variables between two groups. Logistic regression analysis was done to check the association of SNPs with the CAD. One way ANOVA (analysis of variance) was used to check the effect of the polymorphisms on lipid profile parameters. GRS was calculated for each study subject by following method: the genotypes were unanimously coded as 0 for homozygous protective genotype, 1 for heterozygous and 2 for homozygous polymorphic genotypes. A summation term was then created by adding the risk allele count for each participant. The mean GRS value of cases and controls was compared by t-test. As four SNPs were included, a subject could have a minimum of 0 and maximum of 8 risk alleles. The GRS was divided into three groups, Group I with risk allele count 0–2, Group II with risk allele count 3–5 and Group III with risk allele count 6–8. A corrected p-value less than 0.05 of 0.0125 was used as a statistical cutoff for all tests because of inclusion of four SNPs (0.05/4 = 0.0125).
Study subjects’ characteristics
The general characteristics of the study participants have been described in detail elsewhere . The controls had 119 males and 100 females while the cases had 238 males and 166 females. The mean age of the two groups differed significantly, however the controls on the average are of an older age indicating that they have been disease free for a longer time. The prevalence of hypertension and diabetes and smoking was high among patients as compared to the controls. The cases had a more atherogenic lipid profile compared to the controls. The baseline characteristics of the subjects are summarized in Table 1.
Allele and genotype frequencies of the selected SNPS
The allele/genotype frequencies of all the SNPs are given in Table 2. The minor allele frequency (MAF) for PON1 polymorphism in controls was 0.425 and in cases, it was 0.491 (OR: 1.15, CI: 0.898–1.470), p = 0.02). The MAF for ITGB3 polymorphism was 0.256 in controls and 0.318 in the cases (OR: 1.179, CI: 0.931–1.477, p = 0.05). The MAF for ALDH2 SNP was 0.326 in the controls and 0.349 in the cases (OR: 0.993, CI: 0.770–1.28, p = 0.421). The MAF for the IL-6 variant was 0.356 in the controls and 0.388 in the cases (OR: 1.21, CI: 0.943–1.540, p = 0.153). The MAFs for all SNPs were higher in the cases compared to the controls, however, the difference was statistically insignificant. The logistic regression analysis also revealed a non-significant association of the variants with the CAD.
Genetic risk score (GRS) analysis
The GRS was analyzed for descriptive parameters. The minimum GRS in the controls was 0 and in the cases was 1 while maximum GRS in the controls was 6 while in the cases it was 8. The mean GRS was 3.99 ± 1.42 in the controls and 4.29 ± 1.39 in the cases and this difference was statistically significant (p = 0.011). The association of GRS with the CAD was checked by logistic regression and was found to be significantly associated (OR: 4.12, CI: 1.003–6.781, p = 1.4 × 10− 4. The frequency of the subjects in each GRS group was analyzed. There were 66 (10.6%) subjects in group I (GRS = 0–2), 452 (72.6%) subjects in group II (GRS 3–5) and 105 subjects in group III (GRS 6–8).
Comparison of the effect of individual SNPs and GRS on lipid profile parameters
The effect of individual SNPs across the three genotypes and of GRS across three groups on lipid profile parameters was analyzed and the results are shown in Table 3. The PON1 SNP rs662 increases TG and decreases HDL-c when the subjects with at least one risk allele are compared to the common homozygotes, however, this effect is not statistically significant whereas the SNP has no effect on TC and LDL-C. The ITGB3 SNP rs5918 mildly increased TC, TC and LDL-C and decreased HDL-C but the difference was not significant. The ALDH2 SNP rs671 risk allele increased TC and TG but the effect was insignificant and had no effect on LDL-C and HDL-C. The IL-6 SNP rs1800795 moderately increased TC, TG and LDL-C and decreased HDL-C, but the effect was still insignificant. When the same analysis was done for the GRS groups, it was clear that the group III with highest GRS had significant effect on all lipid parameters with the strongest association with decrease in HDL-C (p = 1.5 × 10− 3) followed by increase in TG (p = 0.001), LDL-C (p = 0.005) and TC (p = 0.012).
In the current study, we selected a set of four SNPs from different genes known to affect the coronary arteries and genotyped them in a cohort of Pakistani individuals to construct a GRS and investigate whether the use of a GRS can provide better information compared to the individual SNPs. The Pakistani population represents a unique tool to investigate the contribution of genetic markers to diseases based on the restricted religious, social and cultural setting.
The allele and genotype frequencies of the selected SNPs showed that the cases had a higher MAF for all SNPs as compared to the controls, however, except the PON1 SNP, this difference couldn’t reach statistical significance. This is an indication of the low-modest effect size of the individual variants. At the same time, it must be kept in mind that the SNPs have been genotyped only in a set of participants and the frequencies can be different in the general population therefore, the effect sizes of the SNPs may seem different because of the smaller sample size not actually because they have little role in the CAD progression. These SNPs have previously been reported to be associated with CAD in different populations [19,20,21,22].
We showed that the GRS had a significant association with the CAD even when none of the individual variants had a statistically significant association with the disease. This indicated the cumulative power of the GRS over individual SNPs because the risk homozygote frequencies for all SNPs were higher in the cases than the controls but for single SNPs this difference could not achieve statistical significance. However, when GRS of these SNPs was used, the association was very apparent. The GRS between the cases and the controls overlapped and the difference was a bit smaller. This can, to some extent, be attributed to relatively small sample size. However, the difference is still conspicuous and if the number of the SNPs is increased as well as the sample size, the results can be highly applicable.
Regarding the association of individual, similar was the case for the relationship of the individual variants and the GRS with the lipid profile parameters. Individual variants showed mild but insignificant association with the atherogenic profile but the GRS had a highly significant association with the increased TC, TG, LDL-C and decreased HDL-C concentrations. A previous study of Pakistani individuals used the same approach to construct a GRS for 21 variants . This study reported similar findings as reported by us. The current study in combination with the previous study can be used to generate a panel of common variants that can be clinically implemented to calculate a lifetime risk of an individual for developing CAD.
The current study had the limitation of relatively small sample size and inability to include more SNPs. In addition, the more the biochemical parameters included, the more appropriately the mechanism of action of the SNPs could be elucidated. In future, therefore more research with larger sample size, more variants and biochemical parameters should be done on this unique ethnic group so that 1 day a strategy for preventing this fatal condition can be designed.
In conclusion, a GRS can provide better information for disease association compared to the single SNPs. However, the panel of such SNPs needs to be carefully designed so that the included SNPs are representative of all the candidate and GWAS genes, are of modest effect size and intermediate frequency and have been previously reported to be important in disease predisposition in various ethnicities. This information can then be added to the conventional risk factors so that the high risk individuals can be diagnosed prior to the onset of disease.
- ALDH2 :
Aldehyde dehydrogenase 2
Coronary artery disease
Global Burden of Disease
Genetic risk score
high-density lipoprotein cholesterol
- IL-6 :
- ITGB3 :
integrin beta 3
Low-density lipoprotein cholesterol
- PON1 :
Single nucleotide polymorphism
World Health Organization
Pranavchand R, Reddy B. Current status of understanding of the genetic etiology of coronary heart disease. J Postgrad Med. 2013;59:30–41.
Shrivastava AK, Singh HV, Raizada A, Singh SK. C-reactive protein, inflammation and coronary heart disease. The Egyptian Heart Journal. 2015;67:89–97.
Beaney KE, Cooper JA, Shahid SU, Ahmed W, Qamar R, Drenos F, Crockard MA, Humphries SE. Clinical utility of a coronary heart disease risk prediction gene score in UK healthy middle aged men and in the Pakistani population. PLoS One. 2015;10:e0130754.
Li S, Fonarow GC, Mukamal KJ, Liang L, Schulte PJ, Smith EE, DeVore A, Hernandez AF, Peterson ED, Bhatt DL. Sex and race/ethnicity–related disparities in care and outcomes after hospitalization for coronary artery disease among older adults. Circulation: Cardiovascular Quality and Outcomes. 2016;9:S36–44.
Habib S. Coronary artery disease in women. Pakistan Heart Journal. 2012;44.
Agrawal S, Mastana S. Genetics of coronary heart disease with reference to ApoAI-CIII-AIV gene region. World J Cardiol. 2014;6:755–63.
Sekhri T, Kanwar R, Wilfred R, Chugh P, Chhillar M, Aggarwal R, Sharma Y, Sethi J, Sundriyal J, Bhadra K, et al. Prevalence of risk factors for coronary artery disease in an urban Indian population. BMJ Open. 2014;4:e005346.
Bhalli MA, Kayani AM, Samore NA. Frequency of risk factors in male patients with acute coronary syndrome. Journal of the College of Physicians and Surgeons Pakistan. 2011;21:271–5.
Deshpande CS, Singhal RS, Mukherjee MS. Association of paraoxonase1 gene Q192R polymorphism and apolipoprotein B in Asian Indian women with coronary artery disease risk. Genetic testing and molecular biomarkers. 2013;17:140–6.
Ahmad I, Narang R, Venkatraman A, Das N. Two-and three-locus haplotypes of the paraoxonase (PON1) gene are associated with coronary artery disease in Asian Indians. Gene. 2012;506:242–7.
Eom S-Y, Kim Y-S, Lee C-J, Lee C-H, Kim Y-D, Kim H. Effects of intronic and exonic polymorphisms of paraoxonase 1 (PON1) gene on serum PON1 activity in a Korean population. J Korean Med Sci. 2011;26:720–5.
Mikkelsson J, Perola M, Laippala P, Savolainen V, Pajarinen J, Lalu K, Penttilä A, Karhunen P. Glycoprotein IIIa Pl (a) polymorphism associates with progression of coronary artery disease and with myocardial infarction in an autopsy series of middle-aged men who died suddenly. Arterioscler Thromb Vasc Biol. 1999;19:2573–8.
Mikkelsson J, Perola M, Penttilä A, Goldschmidt-Clermont PJ, Karhunen PJ. The GPIIIa (β3 integrin) PlA polymorphism in the early development of coronary atherosclerosis. Atherosclerosis. 2001;154:721–7.
Mikkelsson J, Perola M, Laippala P, Savolainen V, Pajarinen J, Lalu K, Penttilä A, Karhunen PJ. Glycoprotein IIIa PlA polymorphism associates with progression of coronary artery disease and with myocardial infarction in an autopsy series of middle-aged men who died suddenly. Arterioscler Thromb Vasc Biol. 1999;19:2573–8.
Ehlers CL, Liang T, Gizer IR. ADH and ALDH polymorphisms and alcohol dependence in Mexican and native Americans. The American journal of drug and alcohol abuse. 2012;38:389–94.
J-y G, L-w L. ALDH2 Glu504Lys polymorphism and susceptibility to coronary artery disease and myocardial infarction in east Asians: a meta-analysis. Arch Med Res. 2014;45:76–83.
Shahid SU, Cooper JA, Beaney KE, Li K, Rehman A, Humphries SE. Effect of SORT1, APOB and APOE polymorphisms on LDL-C and coronary heart disease in Pakistani subjects and their comparison with Northwick Park Heart Study II. Lipids in health and disease. 2016;15:83.
Shahid SU, Cooper JA, Rehman A, Humphries SE. Association of ACE and NOS3 gene polymorphisms with blood pressure in a case control study of coronary artery disease in Punjab, Pakistan. Pakistan Journal of Zoology. 2016;48:1125–32.
Guo Y-J, Chen L, Bai Y-P, Li L, Sun J, Zhang G-G, Yang T-L, Xia J, Li Y-J, Chen X-P. The ALDH2 Glu504Lys polymorphism is associated with coronary artery disease in Han Chinese: relation with endothelial ADMA levels. Atherosclerosis. 2010;211:545–50.
Liu T, Zhang X, Zhang J, Liang Z, Cai W, Huang M, Yan C, Zhu Z, Han Y. Association between PON1 rs662 polymorphism and coronary artery disease. Eur J Clin Nutr. 2014;68:1029.
Khatami M, Heidari MM, Soheilyfar S. Common rs5918 (PlA1/A2) polymorphism in the ITGB3 gene and risk of coronary artery disease. Archives of medical sciences Atherosclerotic diseases. 2016;1:e9.
Sun G, Wu G, Meng Y, Du B, Li Y. IL-6 gene promoter polymorphisms and risk of coronary artery disease in a Chinese population. Genet Mol Res. 2014;13:7718–24.
Shahid SU, Cooper JA, Beaney KE, Li K, Rehman A, Humphries SE. Genetic risk analysis of coronary artery disease in Pakistani subjects using a genetic risk score of 21 variants. Atherosclerosis. 2017;258:1–7.
Higher Education Commission of Pakistan is acknowledged for technical support to the study.
University of the Punjab provided the financial support for the study.
Availability of data and materials
All the necessary information has been provided along with the manuscript, however, the corresponding author can be contacted for any information related to this paper.
Ethics approval and consent to participate
The study was approved by the institutional ethics committee (Ethical Committee, School of Biological Sciences, University of the Punjab, Pakistan) and all procedures were carried out in compliance with the Helsinki Declaration.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.