Skip to main content

Advertisement

Sequence analysis and variant identification at the APOC3 gene locus indicates association of rs5218 with BMI in a sample of Kuwaiti’s

  • 239 Accesses

Abstract

Background

APOC3 is important in lipid transport and metabolism with limited studies reporting genetic sequence variations in specific ethnic groups. The present study aimed to analyze the full APOC3 sequence among Kuwaiti Arabs and test the association of selected variants with lipid levels and BMI.

Methods

Variants were identified by Sanger sequencing the entire APOC3 gene in 100 Kuwaiti Arabs. Variants and their genotypes were fully characterized and used to construct haplotype blocks. Four variants (rs5128, rs2854117, rs2070668, KUAPOC3N3 g.5196 A > G) were selected for testing association with serum lipid levels and BMI in a cohort (n = 733).

Results

APOC3 sequence (4.3 kb) of a Kuwaiti Arab was deposited in Genbank (accession number KJ437193). Forty-two variants including 3 novels were identified including an “A” insertion at genomic positions 116,700,599–116,700,600 (promoter region) and two substitutions in intron 1 at genomic positions 116,700,819 and 116,701,159. Only three variants, (rs5128, rs2854117, and rs2070668) were analyzed for association of which rs5128 showed a trend for association with increased BMI, TG and VLDL levels that was further investigated using multivariate analysis. A significant association of rs5128 with BMI (p <  0.05) was observed following a dominant genetic model with increased risk by an OR of 4.022 (CI: 1.13–14.30).

Conclusion

The present study is the first to report sequence analysis of APOC3 in an Arab ethnic group. This study supports the inclusion of rs5128 as a marker for assessing genetic risk to dyslipidemia and obesity and the inclusion of the novel variant g.5196 A > G for population stratification of Arabs.

Introduction

Apolipoprotein C3 (APOC3) has been implicated as an important candidate gene involved in plasma lipid level variation and other metabolic abnormalities. The APOC3 gene resides within the APOA5-APOA4-APOC3-APOA1 multi-gene cluster on human chromosome 11q23 [1]. The APOC3 gene is 3367 bp comprising 4 exons that encode a 99 amino acid glycoprotein which is synthesized mostly in the liver and to a lesser degree in the intestine, where it undergoes intracellular cleavage of 20-aminoacid-residue signal peptide yielding the mature 79 amino acid APOC3 [2]. APOC3 is a major protein constituent of triglyceride-rich lipoproteins (TRLs) including very low-density lipoprotein (VLDL), and chylomicron (CM) wherein it has a principal role in the regulation of triglyceride rich lipoproteins (TRL) catabolism [3]. APOC3 impairs lipolysis of TRL by inhibiting lipoprotein lipase (LPL) and the hepatic uptake of TRLs by remnant receptors. High circulating concentration of APOC3 was shown to be associated with increased levels of triglycerides (TG) in blood [4, 5] and in metabolic disorders including dyslipidemia [6, 7]. Dyslipidemia covers a broad spectrum of lipid abnormalities including elevated levels of plasma triglyceride (TG) and total cholesterol (TC), an increase in intermediate-density lipoprotein (IDL), presence of small dense low-density lipoprotein (LDL) particles, and a decreased level of high-density lipoprotein (HDL) [8]. Heritability studies revealed a strong genetic component to dyslipidemia, ranging from 0.20 to 0.60 in which these estimates are likely reflecting contributions from numerous gene variants including APOC3 [9,10,11,12,13,14].

The important role of APOC3 in lipid transport and metabolism deems it necessary to reveal the full genetic profile of single nucleotide polymorphisms (SNPs) at this locus and how they may correlate to inter-ethnic susceptibility of dyslipidemia and other metabolic abnormalities. Most of the genetic variants studied in the APOC3 gene are localized in the promoter region including the five common DNA polymorphisms (C-641A, G-630A, T-625 deletion, C-482 T, and T-455C) that show a minor allelic frequency of about 40% in the general population [15,16,17,18]. Sequence analysis of the full locus may provide an effective opportunity to assess the full spectrum of variants and how they may influence plasma lipid levels including both common and rare variants. Recently, exome sequencing of the APOCIII gene locus identified rare loss of function mutations associated with lower triglyceride levels in a large cohort [19]. Very few studies have investigated the role of APOC3 in the genetic predisposition to variation in lipid levels among Arabs [20] and none have reported full sequence analysis of the APOC3 gene locus in any Arab population (as far as our knowledge) while limited studies reported sequence analysis on Asians [21].

This paper aimed to investigate sequence variation at the APOC3 gene locus to identify potential variants that may contribute to the variation in lipid levels and specifically the genetic profile of an Arab population. This study was based on the hypothesis that there are genetic variants that vary in their frequency between different human populations so a common SNP in one ethnic population could be rare in another. Moreover, some APOC3 SNPs could be significantly associated with variation in lipid levels but are dependent on ethnicity. Therefore, the data generated in this study may allow better identification and selection of associated variants with variation in serum lipid levels and/or BMI.

Methods

Study population

This study involved two sample phases. The first was sequencing DNA samples from 100 apparently healthy subjects (50 females and 50 males) with Arabian origin from the Kuwaiti population. The age of the subjects ranged from 18 to 69 with an average age of 30.42 years. The BMI of the subjects ranged from 19 to 30 with an average BMI of 23.946 kg/m. The samples included Kuwait Arabs with normal lipid levels. Each sample had documented phenotypic data including medical and family history of hypertension, hypertriglyceridemia, hypercholesterolemia, diabetes and cardiovascular diseases as well as a pedigree with ethnic background. The pedigree was used to trace the ethnic background of both paternal and maternal lineages back at least four generations. The second phase involved 633 samples of the general the Kuwaiti population for the association of selected variants and validation of the novel variants making the cohort for the genetic association study to 733. Random Kuwaitis were recruited during routine check-up at different governmental hospitals around Kuwait. The inclusion criteria included participants who are Kuwaiti natives with documented ancestry and were above the age of 18 years. The exclusion criteria were those with documented diagnosis of diabetes mellitus type 2, hypertension and heart disease. The study population consisted of 438 females and 295 males. The age of the subjects ranged from 18 to 76. Demographic information, along with detailed personal medical and family history from all the participants (Table 1). The inclusion criteria included participants who are Kuwaiti natives with documented ancestry and were above the age of 18 years. The exclusion criteria were those with documented diagnosis of diabetes mellitus type 2 and/or hypertension. All the participants in this study were devoid of diabetes and/or hypertension. A summary of the procedures followed on the collected blood samples is illustrated in Fig. 1.

Fig. 1
figure1

Summary flowchart representing the methodology used for re-sequencing the APOC3 gene locus in this study

Table 1 Demographic and clinical features of the Kuwaiti cohort (n = 733)

Collection of blood samples and biochemical analysis

Venous blood samples were taken after a 12-h fast. The levels of serum TC, TG, HDL-C, and LDL-C were determined by enzymatic methods with commercially available kits and performed on a UniCel DxC 800 Synchron Clinical System from Beckman Coulter (USA) in the Clinical Chemistry facility at Al-Amiri Hospital (Kuwait).

TC was measured using an enzymatic colorimetric method that breaks it down into water and quinon-imine red dye which is directly proportional to TC concentration. For TG, multienzymatic reaction using glycerol kinase, glycerol-3-phosphate oxidase and peroxidase was performed in a sequence to get a red dye. TG concentration is proportional to the intensity of the color generated and measured photometrically. For HDL-C measurement, a unique detergent was used that is not only able to selectively solubilize cholesterol in HDL but also to inhibit its release from other lipoproteins. Released HDL-C was then determined enzymatically using cholesterol esterase and cholesterol oxidase to produce a color product that could be measured at 560 nm. Friedewald formula was used to calculate the concentration of both LDL-C and VLDL-C (LDL-C = TC – HDL-C – (TG/2.2, VLDL-C = TG/2.2) [22]. The reference values used in this study are those set by Kuwait Ministry of Health where: TC = 3.0–5.17 mmol/L, TG = 0.40–1.7 mmol/L, HDL-C = 0.91–2.5 mmol/L and LDL-C = 1.8–3.2 mmol/L.”

DNA extraction and APOC3 sequencing

Total genomic DNA was extracted from whole blood samples, based on the technique described by Miller using proteinase K and salting-out procedures [23]. The 3.4 Kb APOC3 gene along with the flanking sequences was amplified using nine sets of custom designed overlapping primers (Primer3 Input software version 0.4.0: //Frodo.wi.mit.edu/). The primers and polymerase chain reaction (PCR) conditions are provided in Additional file 1: Table S1. DNA template was first amplified by PCR using Gen Amp® Fast PCR Master Mix in an Applied Biosystem Fast thermal cycler (Version 1.01, Life Technologies, USA) (Additional file 1: Table S2) followed by purification using NucleoSpin® Extract II (Clontech Laboratories, Inc., Version No. PR48598) Kit and formaldehyde denaturation. Sequencing reactions were performed on both DNA strands using BigDye X. Terminator v.1.1 Cycle Sequencing Kits (Additional file 1: Table S2). Sanger bidirectional sequencing was performed using the Gene Analyzer 3130XL (Life Technologies, Applied Biosystems, USA), supported by ABI DNA Sequencing Analysis Software v5.2. The sequences from each pair of reaction were aligned together and checked for sequence accuracy using Clustal W pairwise sequence alignment. Multiple sequence alignment among all samples were compared to the reference sequence (NG_008949.1) in the GenBank database (http://www.ncbi.nlm.nih.gov, NCBI) to identify all the APOC3 variants among the 100 Kuwaiti Arab samples sequences.

Validation and association of common and novel SNPs identified

Three common SNPs (MAF >  0.05) identified (rs5128, rs2854117, and rs2070668) were selected for association analysis with variation in lipid levels along with the two novel variants (KUAPOC3N2, and KUAPOC3N3) which were tested for validation among larger cohort (n = 733). Allelic discrimination (VIC- and FAM-labeled) using real-time PCR (ABI 7800HT Realtime PCR (GS01/02) was performed for all of the five selected variants. Assay-on-demand of the commercially available TaqMan assays were ordered for the four common variants, while a set of customized primer and probe were used for the two novel variants (Table 2). The reaction was carried according to the instruction of the manufacturer (Applied Biosystem) using TaqmanTM Genotyping Master Mix (Applied Biosystems # 4371355). For quality control, samples were tested on duplicates to estimate genotyping reproducibility; concordance exceeded 99%.

Table 2 Selected SNPs and their relevant information. Reported Minor Allele Frequency and predicted consequence were obtained from http://www.ncbi.nlm.nih.gov/snp/, NCBI

Statistical analysis

Genotypic and allelic frequencies were determined by simple gene counting. The chi-square test was used to test the Hardy-Weinberg equilibrium (HWE) within the sample population. Haploview program v4.2 was used to check linkage disequilibrium (LD) between SNPs and construct haplotypes. The possible association of lipid profile with APOC3 polymorphisms was initially examined with regards to age, gender and BMI using SPSS v21.0. The R software v3.3.1 was used for further analysis utilizing the following packages SNPassoc, psych, genetics, and MASS [24]. Kruskal-Wallis ANOVA test was performed, and the results were reported as mean ± standard error. Log-transformation was applied across all of the lipid profile values (HDL-C, LDL-C, VLDL, and TG) so as to achieve an approximate normal distribution. Additionally, logistic regression model was used to check for any possible association between the studied SNPs and lipid profile parameters. Genetic modeling of the significant variants was performed. A p-value of 0.05 was considered statistically significant.

Results

APOC3 sequence analysis

The full nucleotide sequence of the 4.3 Kb APOC3 gene locus among 100 Kuwaiti Arabs was analyzed excluding the 224 bp repetitive segment spanning nucleotide positions 2366–2589 due to the technical difficulties generated by the sequence analysis software. The newly defined APOC3 gene sequence in the Kuwaiti Arab samples was deposited in the NCBI gene bank with accession number (GenBank: KJ437193). Sequence analysis identified 45 different polymorphisms including 42 previously reported SNPs and 3 novel SNPs (Fig. 2).

Fig. 2
figure2

Parts of the APOC3 reverse sequence electropherograms showing the three identified novel SNPs. Each novel SNP is indicated with an arrow on the figure and was confirmed by sequence alignment with the sequence generated by the forward primer. a A novel heterozygote (del/A) within the TATA box in the promoter region. b A novel heterozygote (A/G) within the first intron. c A novel heterozygote (G/A) within the first intron

All three novel variants were found in a heterozygous state. The first novel variant (KUAPOC3N1) is an insertion of one nucleotide (A) within the promoter region (25 bases upstream of the gene) between positions g.4976 and g.4977 relative to the GenBank sequence (NG_008949.1) corresponding to genomic positions 116,700,599 and 116,700,600 in the newly generated sequence (accession number: KJ437193) and was detected in two normolipidemic individuals. The other two novel SNPs were found within the first intron of the APOC3 locus, each of them in one individual of normal lipid profile. One variant (KUAPOC3N2) resulted from A to G transition at genomic position 116,700,819 on the newly generated APOC3 sequence (KJ437193) (NG_008949.1 g.5196 A > G). The other variant (KUAPOC3N3) resulted from G to A transition at genomic position 116,701,159 (KJ437193) (NG_008949.1 g. 5536G > A).

The remaining 42 variants were mainly SNPs with only 1 InDel. In general, a higher number of transitions type substitutions (n = 31) was observed compared to transversions (n = 10) (Fig. 3). Considering the individual substitutions, C to T (n = 21) was found to be predominant when compared to others G to A (n = 9), G to C (n = 4), G to T (n = 3), C to A (n = 2), and T to A (n = 1). Most of the identified SNPs were observed in non-coding regions, especially in intronic sequences totaling 26 SNPs (Fig. 4). There were 12 SNPs upstream of the gene, 2 SNPs in the 5′-UTR, and 3 SNPs in the 3′-UTR. Interestingly, there were only 2 SNPs found within the coding exons; the nonsense mutation rs76353203 (R19X) and the synonymous mutation rs4520 in exon 3 and exon 4 respectively.

Fig. 3
figure3

Substitution SNPs (n = 43) at the APOC3 gene locus among the analyzed 100 Kuwaiti Arab samples in term of their types

Fig. 4
figure4

Identified SNPs (n = 45) at the APOC3 gene locus among the analyzed Kuwaiti Arab samples (n = 100) in term of their location

SNPs and haplotype analysis

Analysis of the 45 SNPs in this study found 22 SNPs to have a minor allele frequency (MAF) more than 5% (Table 3) while the remaining 23 SNPs showed MAF < 5% (Additional file 1: Table S3). Most of the identified SNPs were found to be in HWE (p-value > 0.05) except for 6 SNPs (p-value < 0.05). Deviated SNPs included: rs2854117 (p-value < 0.001), rs734104 (p-value = 0.012), rs5142 (p-value = 0.0018), rs5141 (p-value = 0.001), rs645901 (p-value = 0.013), and rs5128 (p-value = 0.006). For all of the 6 SNPs, the homozygous genotypes were over-represented at the expense of heterozygous genotypes.

Table 3 Genotypic and allelic frequencies for the 5 selected APOC3 SNPs (n = 733) that were genotyped by Realtime PCR

As variants with low MAF (< 5%) are more prone to statistical error and false findings, only common variants were further analyzed. Haplotype analysis using 22 SNPs resulted in five common (> 5%) haplotypes in three blocks (Fig. 5). The first haplotype block consists of the 5′ promoter SNPs (rs12721080, rs2542052, rs10892037, rs11568823, rs2854116) and rs618354 within the first intron. The second haplotype block includes 3 consecutive polymorphisms within the first intron rs734104, rs2070669, and rs2070668. The third haplotype block includes rs5130 in intron 3 along with both rs5128 and rs4225 within the untranslated region of exon 4. Linkage analysis between SNPs were measured, with results showing four SNPs (rs2542052, rs10892037, rs11568823, and rs2854116) to be in complete LD (r2 = 1) (Additional file 1: Table S4).

Fig. 5
figure5

Linkage disequilibrium structure and haplotypic architecture in APOC3. a Haploview plot defining haplotype block structure of the APOC3 gene locus. Haplotype blocks are outlined in bold. Shading indicates strength of linkage disequilibrium between the SNPs as measured by r2, which is provided in the intersecting squares. r2 is not displayed for squares with r2 = 1. A diagram of the APOC3 gene structure is provided over the plot where the first and last SNPs are on the left and right sides of the diagram, respectively. Under the SNP ID numbers are the index numbers, shown in bold, for the SNPs based on the map file. b Haplotypes in the haplotype blocks across the APOC3. There are three haplotype blocks across the gene. The haplotype frequencies are shown to the right of each haplotype. Only haplotypes having a frequency > 1% are shown. The SNP numbers across the top of the haplotypes correspond to those in the Haploview plot. A multiallelic D′ statistic, which indicates the level of recombination between two blocks, is shown in the crossing area. Connections from one block to the next were shown for haplotypes of > 10% frequency with thick lines and > 1% frequency with thin lines

Validation and association of selected variants with serum lipid levels

Three SNPs (rs2854117, rs2070668, rs5128) were selected for further association analysis as they have been reported to be associated with lipid variation in other populations [21,22,23,24,25,26,27]. Genotyping of the three selected variants and the two novels were determined based on the allelic discrimination assay using real-time PCR. The genotypes obtained for the novel variants in the initial sequencing procedure were also observed by real-time PCR (Additional file 2: Figure S1). The allele frequencies in the cohort obtained from real-time PCR (Table 3) were mostly found to be consistent with the frequencies obtained using the sequencing method (n = 100) (Additional file 1: Table S3). Deviation from HWE was observed for rs5128, most likely an outcome of excessive homozygous carriers of the mutant wild type C allele in the studied population. The two novel variants were only identified in heterozygous states in < 1% of the cohort (n = 733) in which Novel 3 was identified in 10 additional samples while Novel 2 was not identified in any other sample than the originally sequenced sample. Novel 2 also showed deviation from HWE (p <  0.005) most likely due to status as a very rare allele and therefore was excluded from further association tests.

Analysis for genetic association of the selected SNPs with serum lipid level and BMI employing Kruskal-Wallis ANOVA after adjustment for sex, age and BMI did not show any significant associations (p-value > 0.05) (Table 4). However, rs5128 showed a trend for association with higher TG levels in homozygous (1.10 mmol/L ± 0.15) as well as heterozygous (1.00 mmol/L ± 0.05) of the minor allele when compared to homozygous for the wild type allele (0.90 mmol/L ± 0.04).

Table 4 Association of the 4 APOC3 SNPs with lipid profiles in the Kuwaiti Cohort (n = 733)

Multivariate analysis and genetic modeling of the associated variant (rs5128)

Genetic modelling was used to test the effect of rs5128 minor allele which showed (Additional file 1: Table S5) a strong significant association (p = 0.02) of the dominant model in which carriers of the minor allele had an increased risk for high BMI (OR: 3.78 (1.19–11.94). In addition, carriers of rs5128 minor allele showed slightly increased levels of TG (OR: 0.89 (0.80–0.97)) and VLDL (OR: 1.11 (1.01–1.22)) based on the dominant model (p <  0.05). Significant association (p = 0.03) of rs5128 remained after multivariate analysis in which the OR value was 4.022 (CI: 1.13–14.30) for BMI implicated the minor “C” as a “risk” allele for higher BMI (Table 5).

Table 5 Multivariate analysis on the effect of APOC3 rs5128 on BMI, TG and VLDL in the cohort (n = 733) assuming a dominant genetic model

Discussion

The current study reports for the first time the full genetic profile of the APOC3 gene locus (excluding the repetitive sequence) among Arab ethnicity (Kuwaiti Arabs) which included 42 previously reported SNPs and 3 novel variants.

Sequence and mutation analysis provided insight on the locus structure and its conservation. The ratio of substitution mutations (n = 43) to indels (n = 2) was expected to be high since InDels are subjected to strong purifying selection in order to avoid their severe functional constraint as they are more likely to disrupt protein structures or to interfere with the functions of coding, splicing, and regulatory sequence elements [28, 29]. Moreover, the rate of transition substitution was 3.3 times the rate of transversion at the APOC3 gene locus in the studied population. Part of this transition bias is thought to be driven by underlying chemical and structural properties of DNA that favor transition mutations as they are thermodynamically more stable [30].

It has been documented that the rates of SNPs are known to vary across the functional components near genes [31, 32]. In this study, the observed SNPs were more frequent in noncoding regions (n = 43: intronic = 26, upstream = 12, 3′-UTR =3, 5′-UTR =2) than in the coding exons (n = 2), a signature of purifying selection against changes. However, this feature does not preclude a functional effect, as SNPs in noncoding sequences may have regulatory roles especially with alternative splicing [33, 34]. Low occurrence of SNPs in the coding exons (n = 2) could be explained by selective evolutionary pressure to maintain their structural and functional integrity [32], especially since the studied population were apparently healthy at the time of data collection. However, low variability occurs not only in protein-coding regions but also in non-coding regions harboring exons including both 5′-UTR (n = 2) and 3′-UTR (n = 3), this accentuates the importance of conservation among such regions because of their sequence-dependent role in gene regulation through mRNA processing and translation [32,33,34,35].

Overabundance of SNPs in noncoding regions, especially introns (n = 26), could be explained simply by the evolution pressure in regions with less genomic sequence conservation being relatively low compared to regions encoding sequence-dependent functions [32]. However, this hypothesis cannot explain the observed polymorphisms accumulation in the promoter region, wherein 12 SNPs including a novel variant were detected in our study. One possible explanation is that these variants were introduced with some functional importance or role throughout the human evolutionary history in which APOC3 gene was under continuous natural selection pressure and alteration by mutation, genetic drift, and gene flow [36].

The allele frequency distribution at the APOC3 loci was compared to other reported populations. The overall pattern of allelic frequencies of APOC3 common SNPs (MAF higher than 5%) in the sampled population of Kuwaiti Arabs (n = 100) were found to be fairly comparable to the frequencies of other populations obtained from the 1000GENOMES and HapMap deposited in ensembl including Caucasion, American, Asian, and European (Additional file 1: Table S3) [37]. However, some SNPs showed large interethnic variations in their allelic frequencies. The largest ethnic variation in the allelic frequencies of the common APOC3 SNPs (MAF higher than 5%) was observed when compared to African population [37].

Most polymorphisms in APOC3 gene promoter are thought to play some role in the regulation of APOC3 expression. Therefore, the evaluation of the allelic distribution of some common promoter variations (rs2542052, rs10892037, rs11568823, rs2854116, and rs2854117) in various ethnic groups is crucial in understanding the interethnic variability in APOC3 activity. The common SNPs at sites A-641C (rs2542052), A-630G (rs10892037), −625insT (rs11568823) of the promoter region are representative to each other. Kuwaiti Arabs (n = 100) showed higher frequencies (49.5%) for the previously reported “variant” promoter alleles at sites -641A, −630A, and -625del when compared to Caucasions (42%) [15, 16]. In regard to the two common SNPs, T-455C (rs2854116) and C-482 T(rs2854117) observed within the insulin responsive element (IRE) in the promoter region, the allelic frequencies differed markedly among ethnic groups. The frequency of the -482C allele (rs2854117) detected in Kuwaiti Arabs (37%) is lower than that of Chinese Han population (53.45%) [4]. Kozlitina and his colleagues investigated the frequency of the same polymorphism in different ethnic groups and reported the highest frequency in African American (71.2%) and less common in Hispanics (38.9%) and Europeans (36.6%). These interethnic differences in the allele frequencies of the two common variatiants within IRE suggest that there may be potential ethnic differences in APOC3 expression downregulation pathway activity and IRE sensitivity [26]. The frequency of the common variant rs5128 (C3238G), in the 3′-UTR was also found to be different from other reports. A higher frequency of the rare 3238G allele (19.6%) was observed in Kuwaiti Arabs than Caucasians (0–11%), comparable to the frequencies reported for Saudi Arabians (18%), Iranians (14%) and Costa Ricans (19%) [17, 25, 38,39,40,41]; while being lower than those reported for Northwest Indian subpopulations (22.6–26.2%) Asian Indians (31.3%) [42, 43].

The haplotype structure exhibited a complete LD (r2 = 1) observed between 4 promoter SNPs, rs2542052 (A-641C), rs10892037 (A-630G), rs11568823 (− 625insT) and rs2854116 (T-455C), all of which were within 186 base pairs. Considering the polymorphism at site − 455 (rs2854116), no previous studies (to our knowledge) have shown complete LD with the 3 concordant promoter SNPs listed above. Dammerman study [15] on Caucasians showed a strong LD rather than a complete LD in which the − 625 (rs11568823) genotype predicted the − 455 (rs2854116) genotype in 169 out of 173 subjects [43]. Moreover, Brown and his colleagues also generated a very strong LD in the place of complete LD within this polymorphisms pair (rs11568823 and rs2854116) [17].

In the present study, 2 additional haplotypes blocks were observed beside the common promoter haplotype reported in other studies. Both of them are three-marker haplotypes found within moderate disequilibrium regions. The first of the two encompasses 3 sequential SNPs within the first intron (rs734104, rs2070669, and rs2070668) while the second block covers rs5130 in intron 3 along with both rs5128 and rs4225 within the untranslated region of exon 4. The emergence of both haplotypes in Kuwaiti Arabs population could be explained in view of population-level behavior of alleles at adjacent loci or interethnic allele frequency differences. The aggregation of the above-mentioned SNPs in each haplotype suggest that each 3 variants may functionally cooperate in the Kuwaiti Arabs population. Based on these findings, the variants analyzed for association in this study were selected according to their haplotype group.

Studies of the association between various APOC3 polymorphisms and lipid profile have reported apparently conflicting findings across different populations [18, 43,44,45,46]. In the studied cohort (n = 733) and among the 5 tested APOC3 SNPs (rs5128, rs2854117, rs2070668, and Novels SNP 2 and 3), only rs5128 showed an association with increased BMI, TG and VLDL levels (Table 4) that was further validated using multivariate analysis. The rs5128 variant was found to be significantly associated (p <  0.05) with BMI among the studied cohort following a dominant genetic model (Table 5) and increased risk by an OR of 4.022 (CI: 1.13–14.30). Such an association was in concordance with other studies reported in different population [18,19,20,21,22,23,24,25,26,27]. It must be noted that the deviation from HWE observed for rs5128 was most likely an outcome of excessive homozygous carriers of the mutant wild type C allele conferring to the above findings and is not an outcome of genotyping error. Randomly selected samples were genotyped, and the results were consistent. In addition, other studied variants such as rs2070668 conferred to HWE further supporting the potential involvement of rs5128 with BMI in a known population prevalent with obesity.

The contributed small effect of the rs5128 minor allele, localized in the 3″UTR of exon 4, could be the outcome of possible increased transcriptional activity of the gene resulting in higher plasma levels of APOC3 protein. Studies have reported that high APOC3 levels is directly correlated to increased levels of TG, VLDL, and BMI [47]. There are three different possible mechanisms involved in the elevation of TG levels by APOC3. First, APOC3 is an inhibitor of lipoprotein lipase (LPL), a key rate limiting enzyme in the hydrolysis of TG-rich particles thereby increased levels of APOC3 would increases the inhibition of LPL [47]. Second, APOC3 promotes the assembly and secretion of VLDL in the liver yielding higher circulation of VLDL in the bloodstream [48, 49]. Thirdly, at higher concentrations APOC3, inhibition of hepatic lipase (HL) activity may also occur leading to a delayed catabolism of TG-rich particles [50]. Other indirect mechanisms by which the APOC3 could affect lipid metabolism resulting in accumulation of TG have been suggested [51]. A more recent study reported that another variant (rs4225) in the vicinity had a role in the regulation of gene expression and increased TG levels through the introduction of an miR-4271 binding site [21]. Furthermore, it could be postulated that such variants could have a modified affect related to nutraceuticals in regulating lipid levels. These molecules as well as functional food ingredients have been shown [52] to affect lipid levels and more specifically may reduce VLDL levels and that the mechanism behind this could be under genetic control.

Conclusion

APOC3 was found to be highly polymorphic in the studied Kuwaiti Arab population in which 42 previously reported SNPs and 3 novel SNPs were identified, one of which as characterized as “very rare” variant which may make a useful maker for ethnic identification. Only rs5128 showed an association with increased BMI, TG and VLDL levels in which the G allele is a risk variant and was found to deviate from HWE most likely as a result of its association and not the outcome of sampling error. This study supports the inclusion of rs5128 a marker for assessing genetic risk to dyslipidemia. For future studies and considering the importance of the repetitive sequences in genetic control processes, it would be interesting to analyze the repetitive sequence of the APOC3 gene among different ethnic groups employing a specific and reproducible genotyping protocol. Repeats have always presented technical challenges for sequence alignment and assembly programs. Polymorphisms of the repetitive sequence may be genotyped by targeted PCR with primers flanking the repetitive sequence and examining the resolving the products on high resolution gels that would facilitate the identification of the repeat alleles. The limitation of this study is in the lack of apolipoprotein and LPL levels.

Availability of data and materials

Additional data and analysis are provided in the supplementary files. Any other data may be made available upon request from the corresponding author.

Abbreviations

3′-UTR:

3′-untranslated region

5′-UTR:

5′-untranslated region

APOC3 :

Apolipoprotein C3 human gene

APOC3:

Apolipoprotein C3 human protein

BMI:

Body mass index

bp:

Base pair

dbSNP:

The single nucleotide polymorphisms database

DNA:

Deoxyribonucleic acid

GMAF:

Global minor allele frequency

HDL-C:

High-density-lipoprotein-cholesterol

HWE:

Hardy-Weinberg equilibrium

Indel:

Insertion or deletion

Kb:

Kilo base

LDL-C:

Low-density lipoprotein-cholesterol

NCBI:

National center for biotechnology information

PCR:

Polymerase chain reaction

rs:

Reference sequence

SNPs:

Single nucleotide polymorphisms

TC:

Total cholesterol

TG:

Triglycerides

References

  1. 1.

    Pennacchio LA, Olivier M, Hubacek JA, Cohen JC, Cox DR, Fruchart JC, Krauss RM, Rubin EM. An apolipoprotein influencing triglycerides in humans and mice revealed by comparative sequencing. Science. 2001;294(5540):169–73.

  2. 2.

    Verrijken A, Beckers S, Francque S, Hilden H, Caron S, Zegers D, Ruppert M, Hubens G, Van Marck E, Michielsen P, Staels B, Taskinen MR, Van Hul W, Van Gaal L. A gene variant of PNPLA3, but not of APOC3, is associated with histological parameters of NAFLD in an obese population. Obesity (Silver Spring). 2013;21(10):2138–45.

  3. 3.

    Parzianello L, Oliveira G, Coelho JC. Apolipoprotein CIII polymorphism and triglyceride levels of a Japanese population living in southern Brazil. Braz J Med Biol Res. 2008;41(6):462–7.

  4. 4.

    Yu J, Wang HM, Yang SM, Yuan J, Chen LY, Chen CL, Huang DF, Wang YG, Ju SQ, Zhu JY. The effect of APOC3 promoter polymorphisms on the risk of hypertriglyceridemia in Chinese Han population with or without type 2 diabetes mellitus. Labmedicine. 2010;41(1):34–9.

  5. 5.

    Davidson J, Rotondo D. Control of serum triglyceride levels by the apolipoprotein C3 gene and its relationship to cardiovascular disease. Curr Opin Lipidol. 2018;29(3):271–2.

  6. 6.

    Sacks FM, Alaupovic P, Moye LA, Cole TG, Sussex B, Stampfer MJ, Pfeffer MA, Braunwald E. VLDL, apolipoproteins B, CIII, and E, and risk of recurrent coronary events in the cholesterol and recurrent events (CARE) trial. Circulation. 2000;102(16):1886–92.

  7. 7.

    Mendivil CO, Rimm EB, Furtado J, Chiuve SE, Sacks FM. Low-density lipoproteins containing apolipoprotein C-III and the risk of coronary heart disease. Circulation. 2011;124(19):2065–72.

  8. 8.

    Task Force for the management of dyslipidaemias of the European Society of, C., S. the European Atherosclerosis, Catapano AL, Reiner Z, De Backer G, Graham I, Taskinen MR, Wiklund O, Agewall S, Alegria E, Chapman MJ, Durrington P, Erdine S, Halcox J, Hobbs R, Kjekshus J, Filardi PP, Riccardi G, Storey RF, Wood D, E. S. C. C. f. P. Guidelines and Committees. ESC/EAS guidelines for the management of dyslipidaemias: the task force for the management of dyslipidaemias of the European Society of Cardiology (ESC) and the European Atherosclerosis Society (EAS). Atherosclerosis. 2011;217(Suppl 1):S1–44.

  9. 9.

    Edwards KL, Newman B, Mayer E, Selby JV, Krauss RM, Austin MA. Heritability of factors of the insulin resistance syndrome in women twins. Genet Epidemiol. 1997;14(3):241–53.

  10. 10.

    Kronenberg F, Coon H, Ellison RC, Borecki I, Arnett DK, Province MA, Eckfeldt JH, Hopkins PN, Hunt SC. Segregation analysis of HDL cholesterol in the NHLBI Family Heart Study and in Utah pedigrees. Eur J Hum Genet. 2002;10(6):367–74.

  11. 11.

    Wang X, Paigen B. Genetics of variation in HDL cholesterol in humans and mice. Circ Res. 2005;96(1):27–42.

  12. 12.

    Goode EL, Cherny SS, Christian JC, Jarvik GP, de Andrade M. Heritability of longitudinal measures of body mass index and lipid and lipoprotein levels in aging twins. Twin Res Hum Genet. 2007;10(5):703–11.

  13. 13.

    Herbeth B, Samara A, Ndiaye C, Marteau JB, Berrahmoune H, Siest G, Visvikis-Siest S. Metabolic syndrome-related composite factors over 5 years in the STANISLAS family study: genetic heritability and common environmental influences. Clin Chim Acta. 2010;411(11–12):833–9.

  14. 14.

    Cole CB, Nikpay M, McPherson R. Gene-environment interaction in dyslipidemia. Curr Opin Lipidol. 2015;26(2):133–8.

  15. 15.

    Dammerman M, Breslow JL. Genetic basis of lipoprotein disorders. Circulation. 1995;91(2):505–12.

  16. 16.

    Surguchov AP, Page GP, Smith L, Patsch W, Boerwinkle E. Polymorphic markers in apolipoprotein C-III gene flanking regions and hypertriglyceridemia. Arterioscler Thromb Vasc Biol. 1996;16(8):941–7.

  17. 17.

    Brown S, Ordovas JM, Campos H. Interaction between the APOC3 gene promoter polymorphisms, saturated fat intake and plasma lipoproteins. Atherosclerosis. 2003;170(2):307–13.

  18. 18.

    Song Y, Zhu L, Richa M, Li P, Yang Y, Li S. Associations of the APOC3 rs5128 polymorphism with plasma APOC3 and lipid levels: a meta-analysis. Lipids Health Dis. 2015;14:32.

  19. 19.

    Tg NHL, Hdl Working Group of the Exome Sequencing Project, Blood I, Crosby J, Peloso GM, Auer PL, Crosslin DR, Stitziel NO, Lange LA, Lu Y, Tang ZZ, Zhang H, Hindy G, Masca N, Stirrups K, Kanoni S, Do R, Jun G, Hu Y, Kang HM, Xue C, Goel A, Farrall M, Duga S, Merlini PA, Asselta R, Girelli D, Olivieri O, Martinelli N, Yin W, Reilly D, Speliotes E, Fox CS, Hveem K, Holmen OL, Nikpay M, Farlow DN, Assimes TL, Franceschini N, Robinson J, North KE, Martin LW, DePristo M, Gupta N, Escher SA, Jansson JH, Van Zuydam N, Palmer CN, Wareham N, Koch W, Meitinger T, Peters A, Lieb W, Erbel R, Konig IR, Kruppa J, Degenhardt F, Gottesman O, Bottinger EP, O'Donnell CJ, Psaty BM, Ballantyne CM, Abecasis G, Ordovas JM, Melander O, Watkins H, Orho-Melander M, Ardissino D, Loos RJ, McPherson R, Willer CJ, Erdmann J, Hall AS, Samani NJ, Deloukas P, Schunkert H, Wilson JG, Kooperberg C, Rich SS, Tracy RP, Lin DY, Altshuler D, Gabriel S, Nickerson DA, Jarvik GP, Cupples LA, Reiner AP, Boerwinkle E, Kathiresan S. Loss-of-function mutations in APOC3, triglycerides, and coronary disease. N Engl J Med. 2014;371(1):22–31.

  20. 20.

    Tas S. Strong association of a single nucleotide substitution in the 3′-untranslated region of the apolipoprotein-CIII gene with common hypertriglyceridemia in Arabs. Clin Chem. 1989;35(2):256–9.

  21. 21.

    Hu SL, Cui GL, Huang J, Jiang JG, Wang DW. An APOC3 3′UTR variant associated with plasma triglycerides levels and coronary heart disease by creating a functional miR-4271 binding site. Sci Rep. 2016;6:32700.

  22. 22.

    Friedewald WS, Levy RI, Fredrickson DS. Estimation of the concentration of low-density lipoprotein cholesterol in plasma, without use of the preparative ultracentrifuge. Clin Chem. 1972;18(6):499–502.

  23. 23.

    Miller SA, Dykes DD, Polesky HF. A simple salting out procedure for extracting DNA from human nucleated cells. Nucleic Acids Res. 1988;16(3):1215.

  24. 24.

    R Core Team. R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2018.

  25. 25.

    Bandegi AR, Firoozrai M, Akbari Eidgahi MR, Kokhaei P. SstI polymorphism of the apolipoprotein CIII gene in Iranian hyperlipidemic patients: a study in Semnan Province. Iran J Basic Med Sci. 2011;14(6):506–13.

  26. 26.

    Kozlitina J, Boerwinkle E, Cohen JC, Hobbs HH. Dissociation between APOC3 variants, hepatic triglyceride content and insulin resistance. Hepatology. 2011;53(2):467–74.

  27. 27.

    Cui F, Li K, Li Y, Zhang X, An C. Apolipoprotein C3 genetic polymorphisms are associated with lipids and coronary artery disease in a Chinese population. Lipids Health Dis. 2014;13:170.

  28. 28.

    Chen K, McLellan MD, Ding L, Wendl MC, Kasai Y, Wilson RK, Mardis ER. PolyScan: an automatic indel and SNP detection approach to the analysis of human resequencing data. Genome Res. 2007;17(5):659–66.

  29. 29.

    Chen JQ, Wu Y, Yang H, Bergelson J, Kreitman M, Tian D. Variation in the ratio of nucleotide substitution and indel rates across genomes in mammals and bacteria. Mol Biol Evol. 2009;26(7):1523–31.

  30. 30.

    Rosenberg MS, Subramanian S, Kumar S. Patterns of transitional mutation biases within and among mammalian genomes. Mol Biol Evol. 2003;20(6):988–93.

  31. 31.

    Shastry BS. SNPs in disease gene mapping, medicinal drug development and evolution. J Hum Genet. 2007;52(11):871–80.

  32. 32.

    Castle JC. SNPs occur in regions with less genomic sequence conservation. PLoS One. 2011;6(6):e20660.

  33. 33.

    Hull J, Campino S, Rowlands K, Chan MS, Copley RR, Taylor MS, Rockett K, Elvidge G, Keating B, Knight J, Kwiatkowski D. Identification of common genetic variation that modulates alternative splicing. PLoS Genet. 2007;3(6):e99.

  34. 34.

    Cooper GM, Shendure J. Needles in stacks of needles: finding disease-causal variants in a wealth of genomic data. Nat Rev Genet. 2011;12(9):628–40.

  35. 35.

    Hesketh J. 3′-Untranslated regions are important in mRNA localization and translation: lessons from selenium and metallothionein. Biochem Soc Trans. 2004;32(Pt 6):990–3.

  36. 36.

    Guo Y, Jamison DC. The distribution of SNPs in human gene regulatory regions. BMC Genomics. 2005;6:140.

  37. 37.

    Genomes Project, C, Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, Handsaker RE, Kang HM, Marth GT, McVean GA. An integrated map of genetic variation from 1,092 human genomes. Nature. 2012;491(7422):56–65.

  38. 38.

    Rees A, Stocks J, Sharpe CR, Vella MA, Shoulders CC, Katz J, Jowett NI, Baralle FE, Galton DJ. Deoxyribonucleic acid polymorphism in the apolipoprotein A-1-C-III gene cluster. Association with hypertriglyceridemia. J Clin Invest. 1985;76(3):1090–5.

  39. 39.

    Paul-Hayase H, Rosseneu M, Robinson D, Van Bervliet JP, Deslypere JP, Humphries SE. Polymorphisms in the apolipoprotein (apo) AI-CIII-AIV gene cluster: detection of genetic variation determining plasma apo AI, apo CIII and apo AIV concentrations. Hum Genet. 1992;88(4):439–46.

  40. 40.

    Garenc C, Aubert S, Laroche J, Girouard J, Vohl MC, Bergeron J, Rousseau F, Julien P. Population prevalence of APOE, APOC3 and PPAR-alpha mutations associated to hypertriglyceridemia in French Canadians. J Hum Genet. 2004;49(12):691–700.

  41. 41.

    Johansen K, Skotnicki A, Tan JC, Kwaasi AA, Skotnicki M. Apolipoprotein A-I/C-III gene cluster polymorphism in Saudi Arabians, Filipinos and Caucasians. Clin Genet. 1990;37(3):194–7.

  42. 42.

    Singh P, Singh M, Bhatnagar DP, Kaur T, Mastana S. Apolipoprotein C3 (SstI) gene variability in Northwest India: a global perspective. Int J Hum Genet. 2008;8(1–2):51–60.

  43. 43.

    Chhabra S, Narang R, Krishnan LR, Vasisht S, Agarwal DP, Srivastava LM, Manchanda SC, Das N. Apolipoprotein C3 SstI polymorphism and triglyceride levels in Asian Indians. BMC Genet. 2002;3:9.

  44. 44.

    Dammerman M, Sandkuijl LA, Halaas JL, Chung W, Breslow JL. An apolipoprotein CIII haplotype protective against hypertriglyceridemia is specified by promoter and 3′ untranslated region polymorphisms. Proc Natl Acad Sci U S A. 1993;90(10):4562–6.

  45. 45.

    Hokanson JE, Kinney GL, Cheng S, Erlich HA, Kretowski A, Rewers M. Susceptibility to type 1 diabetes is associated with ApoCIII gene haplotypes. Diabetes. 2006;55(3):834–8.

  46. 46.

    Pollin TI, Damcott CM, Shen H, Ott SH, Shelton J, Horenstein RB, Post W, McLenithan JC, Bielak LF, Peyser PA, Mitchell BD, Miller M, O’Connell JR, Shuldiner AR. A null mutation in human APOC3 confers a favorable plasma lipid profile and apparent cardioprotection. Science. 2008;322(5908):1702–5.

  47. 47.

    Johansen CT, Kathiresan S, Hegele RA. Genetic determinants of plasma triglycerides. J Lipid Res. 2011;52(2):189–206.

  48. 48.

    Sundaram M, Zhong S, Bou Khalil M, Links PH, Zhao Y, Iqbal J, Hussain MM, Parks RJ, Wang Y, Yao Z. Expression of apolipoprotein C-III in McA-RH7777 cells enhances VLDL assembly and secretion under lipid-rich conditions. J Lipid Res. 2010;51(1):150–61.

  49. 49.

    Qin W, Sundaram M, Wang Y, Zhou H, Zhong S, Chang CC, Manhas S, Yao EF, Parks RJ, McFie PJ, Stone SJ, Jiang ZG, Wang C, Figeys D, Jia W, Yao Z. Missense mutation in APOC3 within the C-terminal lipid binding domain of human ApoC-III results in impaired assembly and secretion of triacylglycerol-rich very low-density lipoproteins: evidence that ApoC-III plays a major role in the formation of lipid precursors within the microsomal lumen. J Biol Chem. 2011;286(31):27769–80.

  50. 50.

    Jong MC, Rensen PC, Dahlmans VE, van der Boom H, van Berkel TJ, Havekes LM. Apolipoprotein C-III deficiency accelerates triglyceride hydrolysis by lipoprotein lipase in wild-type and apoE knockout mice. J Lipid Res. 2001;42(10):1578–85.

  51. 51.

    Luo M, Peng D. The emerging role of apolipoprotein C-III: beyond effects on triglyceride metabolism. Lipids Health Dis. 2016;15(1):184.

  52. 52.

    Scicchitano P, Cortese F, Ricci G, Carbonara S, Moncelli M, Iacoviello M, Cecere A, Gesualdo M, Zito A, Caldarola P, Scrutinio D, Lagioia R, Riccioni G, Ciccone MM. Ivabradine, coronary artery disease, and heart failure: beyond rhythm control. Drug Des Devel Ther. 2014;8:689–700.

Download references

Acknowledgments

The authors would like to acknowledge the General Facility Project (GS 01/02) for the use of the ABI 3130xl Gene Analyzer. The authors extend their deepest appreciation and gratitude to all the participants in this study and to the technical assistance provided by Mrs. Babitha G. Annice and Mrs. Sheela Thankakon.

Funding

This research was supported and funded by Kuwait University Research Administration (Project YS01/12).

Author information

ZM and SA contributed equally. ZM performed all the experiments, analyzed all the data and prepared the manuscript, AA participated in the study design, supervised and assisted the statistical analysis and revised the manuscript. HA conducted the statistical analysis for the association of the selected variants among the cohort samples WA supervised and assisted with the sequence alignment, its annotations and submission of SNPs and revised the manuscript. SA prepared the project proposal and study design, supervised the molecular genetic analysis and sample collection and documentation, assisted with the data interpretation and preparation of the manuscript. All the authors have read and approved the final manuscript.

Correspondence to Suzanne A. Al-Bustan.

Ethics declarations

Ethics approval and consent to participate

Informed consent and ethical approval were obtained for this study in accordance to the revised 2000 Helsinki guiltiness of 1975. Ethical approval was granted by the Ethical Committee at Kuwait University as well as the Ethics Board at the Ministry of Health in Kuwait for conducting this study.

Consent for publication

All authors have read and approved the manuscript for publication.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1: Table S1. Primers sets designed to sequence the full APOC3 gene. Table S2. PCR thermal profile used for all the primer sets of the APOC3 gene in this study during both amplification and sequencing reactions. Table S3. A summary of genotypic and allelic frequencies for the APOC3 SNPs showing within the total population (n = 100). Listed are the number of the SNP when used in the linkage disequilibrium and haplotype analysis (first column), the dbSNP reference number. Table S4. Pairwise test of linkage-disequilibrium as measured by r2 between the identified 22 segregating SNPs at the APOC3 gene locus (MAF > 5%). Table S5. Genetic modeling of APOC3 rs5128 with BMI, TG, and VLDL. Table S6. Minor allelic frequencies of commonly studied APOC3 SNPs (MAF > 5%) in reported in various populations.

Additional file 2: Figure S1. Amplification plots for the genotypes of the identified novel APOC3 variants as observed by real-time PCR.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Malalla, Z.H., Al-Serri, A.E., AlAskar, H.M. et al. Sequence analysis and variant identification at the APOC3 gene locus indicates association of rs5218 with BMI in a sample of Kuwaiti’s. Lipids Health Dis 18, 224 (2019) doi:10.1186/s12944-019-1165-6

Download citation

Keywords

  • APOC3
  • Sequence variants
  • Genetic association
  • Lipid levels
  • BMI
  • Arabs