Evaluation of Sex-Stratified Polygenic Risk Scores for Type 2 Diabetes Mellitus and Glycemic Traits in the Framingham Heart Study
Article information
Abstract
Background
Diabetes is a multifactorial disease with significant genetic predisposition. Polygenic risk scores (PRS) have been developed to estimate an individual’s genetic risk of a disease. Traditionally, PRS utilize sex-combined genome-wide association studies (GWAS) due to the limited availability of sex-stratified summary statistics. This study explores sex-dimorphic genetic effects and evaluates the potential benefits of incorporating sex-stratified effects in PRS for type 2 diabetes mellitus (T2DM) and glycemic traits by comparing PRS performance derived from sex-combined versus sex-stratified GWAS.
Methods
We performed a sex-heterogeneity test across sex-specific GWAS and identified nine signals with sex-dimorphic effects for T2DM. PRS[sex-combined] and PRS[sex-stratified] were developed using sex-combined and sex-stratified GWAS results for T2DM (41,444 cases and 354,539 controls), fasting glucose (n=120,595) and fasting insulin (n=98,210). We evaluated these PRS models in 8,379 participants (1,303 cases and 7,076 controls) from the Framingham Heart Study not included in the PRS derivation.
Results
Our findings suggest that sex-combined PRS currently offer better predictive performance for T2DM and glycemic traits.
Conclusion
These results highlight the need for larger sex-stratified studies and the optimization of sex-stratified risk models for clinical practice.
Highlights
• This study explores sex differences in genetic risk for T2DM.
• Nine loci were identified with sex-dimorphic effects on the risk of T2DM.
• Sex-stratified and combined genetic scores were built and evaluated.
• Findings highlight the need for larger sex-specific studies to improve prediction.
INTRODUCTION
Diabetes is a growing global health challenge, with its prevalence expected to increase from 540 million individuals in 2021 to an estimated 783 million by 2045 [1]. Type 2 diabetes mellitus (T2DM), accounting for over 90% of these diabetes cases, is a multifactorial chronic condition influenced by both genetic and environmental factors and their interaction [2]. Genome-wide association studies (GWAS) have identified over 600 loci associated with T2DM risk [3]. Glycemic traits, such as fasting glucose (FG) and fasting insulin (FI), play a critical role in T2DM pathophysiology, and hundreds of variants associated with these traits have been identified in large-scale GWAS [4-8].
Polygenic risk scores (PRS), which aggregate the effects of multiple genetic variants, quantify an individual’s genetic susceptibility to diseases [9,10]. PRS can identify high-risk individuals who may not be classified accurately through clinical factors alone, aiding in early diagnosis and prevention efforts [11]. PRS can be constructed using methods such as P value thresholds followed by linkage disequilibrium (LD) pruning, which selects a subset of genetic variants from GWAS summary statistics [12], or shrinkage-based approaches that incorporate millions of variants while accounting for LD [13-15]. Recent efforts in PRS method development aim to improve prediction across diverse ancestries and integrate functional annotations to inform pathway-specific processes [16-19].
Most GWAS results used to construct PRS are from sex-combined analyses, assuming similar genetic effects between sexes. However, there is compelling evidence that T2DM genetic architecture may differ by sex. Women face greater relative risks of cardiovascular complications and mortality once diabetes develops [20,21]. Sex hormones and differences in fat distribution drive variation in glucose metabolism and insulin sensitivity [22,23]. In addition, recent studies have revealed sex-dimorphic effects in genetic associations for T2DM and glycemic traits. For example, T2DM risk signals near c-Maf inducing protein (CMIP), KLF transcription factor 14 (KLF14), and transcription factor 12 (TCF12) show larger effects in females, while the signal near ankyrin 1 (ANK1) displays a larger effect in males. FG signals at insulin receptor substrate 1 (IRS1) and zinc finger protein 12 (ZNF12) also showed sex-dimorphic effects [24,25].
Given these findings, we conducted a sex-specific T2DM GWAS meta-analysis using the latest genetic data from the UK Biobank (UKBB), Mass General Brigham Biobank (MGBB), and the Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort. We also leveraged publicly available sex-specific GWAS summary statistics for FG and FI to derive PRS models and evaluated these models in 8,379 (1,303 cases and 7,076 controls) participants from the Framingham Heart Study (FHS). Our goal was to determine under what conditions sex-stratified PRS improve prediction accuracy over sex-combined PRS models for T2DM and glycemic traits and to assess the extent of any observed differences.
METHODS
Sex-specific and sex-combined GWAS
We utilized sex-specific and sex-combined GWAS summary statistics for T2DM, FG, and FI. Sex-specific T2DM GWAS were conducted across 41,444 cases (24,809 males and 16,635 females) and 354,539 controls (157,022 males and 197,517 females) using data from UKBB, MGBB, and GERA cohorts, all of predominantly European ancestry [26]. Genotype imputation was performed using the Trans-Omics for Precision Medicine (TOPMed) reference panel. Genome-wide genetic variants were tested for association using logistic regression models for binary T2DM outcome fitted using REGENIE [27], adjusting for age, body mass index (BMI), principal components (PCs) and imputation batch. Cohort-specific results were further meta-analyzed using METa AnaLysis Helper (METAL) [28].
For FG and FI, we obtained publicly available sex-specific GWAS results [24], including 67,506 males and 73,089 females for FG and 47,806 males and 50,404 females for FI across 73 studies. All participants were free of diabetes and of European ancestry. Each study performed sex-specific FG and FI GWAS under additive genetic models adjusting for age, study sites and PCs. Summary statistics were imputed for unmeasured variants using LD patterns from the 1000 Genomes reference panel via SSImp [29] and then meta-analyzed using Genome-Wide Association Meta-Analysis (GWAMA) [30]. We performed sex-combined GWAS by inverse variance weighting of sex-specific summary statistics using GWAMA.
We also conducted a two-step sex-heterogeneity test to identify variants with sex-dimorphic effects [30,31]. In the first step, we applied a 2-degree-of-freedom (df) test that jointly evaluates both the main genetic effect and the interaction with sex, using a genome-wide significance threshold (P<5×10–8). In the second step, we tested for significant differences in effect sizes between sexes among the variants identified in step one. To correct for multiple testing, we used a Bonferroni-adjusted threshold based on the effective number of independent variants identified in step 1, estimated using the eigenvalue-based approach by Li and Ji [32].
PRS construction in the Framingham Heart Study
FHS participants were genotyped and imputed on the Michigan Imputation Server using Minimac3 with the Haplotype Reference Consortium (HRC, release 1.1, April 2016) reference panel [33]. Standard quality control was applied prior to imputation, including variant call rate ≥97%, Hardy–Weinberg equilibrium P≥1×10−6, <1,000 Mendelian errors, and minor allele frequency (MAF) ≥1%. Phasing was performed with SHAPEIT2 incorporating duoHMM to account for parental genotypes. Variants with imputation quality r2<0.3 were excluded.
Because FHS did not contribute to the GWAS for T2DM, FG, and FI, we developed PRS models using GWAS summary statistics and applied them to FHS participants. We used the PRS-continuous shrinkage (CS) method, which applies continuous shrinkage to estimate posterior effect sizes of genetic variants from GWAS summary statistics [14]. PRS-CS is a leading approach for constructing PRS in complex traits, as it leverages genome-wide LD structure, eliminates the need for arbitrary P value thresholds and LD pruning required by traditional approaches, and has demonstrated robust predictive performance across a variety of traits [34].
An external LD panel from the 1000 Genomes project was used for LD information in the PRS-CS [35], chosen to closely match the ancestry of the validation cohort. We calculated sex-stratified and sex-combined PRS, standardizing each to have a mean of zero and a standard deviation of one. Therefore, two sets of standardized PRS in FHS participants for each outcome: PRS[sex-stratified] and PRS[sex-combined].
PRS analysis in the Framingham Heart Study
Outcome definitions
We evaluated the associations of PRS with four outcomes: prevalent T2DM, incident T2DM, FG and FI in the FHS. T2DM status was determined based on physician diagnosis, self-reports, and diabetes medication use, and supplemented by glycemia measures (FG ≥7 mmol/L, glycosylated hemoglobin ≥6.5%, 2-hour oral glucose tolerance test ≥11.1 mmol/L, or random/non-FG ≥11.1 mmol/L). Individuals with pregnancy at diagnosis or age at diagnosis <25 years were excluded. Prevalent T2DM was defined using all available exams, and for incident T2DM analyses. Individuals with T2DM at baseline were excluded. Time-to-event was defined from the DNA draw date to the date of T2DM diagnosis or the end of followup, with a maximum follow-up of 26 years. For FG and FI analyses, we included only participants free of diabetes, using their baseline glycemic measures. FG was analyzed in mmol/L and FI was analyzed in pmol/L, with FI values log-transformed to reduce skewness.
PRS models
We compared PRS[sex-stratified] and PRS[sex-combined] models in females, males and all participants. For prevalent T2DM and the glycemic traits, we fit generalized mixed models (loglink for T2DM and linear for FG/FI) that incorporated the kinship matrix to account for familial relatedness. For incident T2DM, we used Cox proportional hazards model with robust sandwich estimators clustered by family ID to account for familial correlation.
All models were adjusted for age, sex, BMI, generation indicator, as FHS includes multiple generations [36,37], and PCs. The first 10 PCs were included in FG and FI analyses, and nominally significant PCs (P<0.05) were added in T2DM analyses. Odds ratios (ORs) were estimated for prevalent T2DM, hazard ratios (HRs) for incident T2DM, and standardized effect sizes for glycemic traits.
Comparison and evaluation
Model performances were evaluated in testing sets with 10-fold cross validation, where family members were kept intact to ensure familial independence across folds. We additionally constructed a PRS[mix] model defined as a weighted sum of PRS[sex-stratified] and PRS[sex-combined], with weights derived from a model including both sex-stratified and sex-combined PRS in the training set.
Evaluation metrics included the area under the receiver operating characteristic (AUC) for prevalent T2DM, Akaike information criterion for incident T2DM and R2 increment (i.e., R2 increase from null model with covariates only) for glycemic traits FG and FI. We further compared AUCs for prevalent T2DM models using Delong’s test, which evaluated whether AUC differences between models were statistically significant [38,39]. For incident T2DM, we performed partial likelihood to compare the performances of two PRS models.
Statistical analyses were performed using R version 4.2.1 (R Foundation for Statistical Computing, Vienna, Austria), and statistical significance was determined at a P value threshold of 0.05 unless otherwise noted. An overview of the study design and analytic workflow is provided in Fig. 1.
Study design and analytic workflow. Sex-specific and sex-combined type 2 diabetes mellitus (T2DM) genome-wide association studies (GWAS) were conducted in the UK Biobank (UKBB), Mass General Brigham Biobank (MGBB), and Genetic Epidemiology Research on Adult Health and Aging (GERA) cohorts. Fasting glucose (FG) and fasting insulin (FI) sex-specific and sex-combined GWAS results were obtained from The Malignant Germ Cell International Consortium (MaGIC) [24]. A two-step heterogeneity test was applied to identify variants with sex-dimorphic effects. Polygenic risk score (PRS) were constructed in the Framingham Heart Study (FHS) using PRS-continuous shrinkage (CS) with both sex-specific and sex-combined GWAS as input, along with the 1000 Genomes European linkage disequilibrium (LD) panel. In the FHS validation cohort, PRS were evaluated for four outcomes: prevalent T2DM, incident T2DM, FG, and FI. Mixed effects models and Cox proportional hazards model accounting for familial relatedness were used, adjusting for age, body mass index (BMI), generation, and principal components (PCs). Model performances were assessed with 10-fold cross validation using area under the receiver operating characteristic (AUC), Akaike information criterion (AIC), and R2 increment. GWAMA, Genome-Wide Association Meta-Analysis; OR, odds ratio; HR, hazard ratio. aSex-specific GWAS summary statistics are publicly available from Lagou et al. [24].
Ethics statement
This research was conducted using data and resources from the FHS of the National Heart Lung and Blood Institute of the National Institutes of Health and Boston University School of Medicine (Contract No. N01-HC-25195, HHSN268201500001I and 75N92019D00031). This study was approved by the Institutional Review Board (IRB) of the Boston University Medical Campus. All FHS participants provided written informed consent.
RESULTS
Sex-dimorphic effects at T2DM loci
We identified 12,651 variants that reached genome-wide significance (P<5×10−8) in the 2-df test which evaluates both main effects and their interaction with sex. Among these, nine T2DM-associated signals demonstrated significant sex-dimorphic effects, based on a Bonferroni correlation that accounted for the effective number of independent tests (0.05/739, heterogeneity P<6.8×10−5) with a MAF >0.0005 (Table 1, Supplementary Table 1, Supplementary Fig. 1). Several loci showed heterogeneity in association strengths or even opposite directions of effects between sexes, emphasizing the complexity of sex-dimorphic genetic regulation in T2DM.
The variant rs7524924 near lysophospholipase like 1 (LYPLAL1) showed a stronger protective effect in females than males (MAF=0.43, female OR=0.92, male OR=0.99, heterogeneity P=6.0×10−5). This variant is an adipose-specific expression quantitative trait locus (eQTL) for LYPLAL1-AS1 and has been associated with waist-hip ratio (WHR), a key obesity phenotype that influences T2DM risk in a sex-dimorphic manner [40].
Similarly, rs258494 near POC5 (MAF=0.44, female OR= 1.08, male OR=1.01, heterogeneity P=6.5×10–5) and rs3923113 near growth factor receptor bound protein 14 (GRB14)/cordon-bleu WH2 repeat protein like 1 (COBLL1) (MAF=0.43, female OR=0.86, male OR=0.92, heterogeneity P=3.3×10–5) also demonstrated stronger effects in females. As an adipose eQTL for COBLL1, rs3923113 has been associated with insulin resistance and FI levels, and showed sex-dimorphic effects on WHR trait [41].
We also identified rare variants with significant sex-dimorphic effects. Allele A of rs141203601, located upstream of NK6 homeobox 1 (NKX6-1), showed a highly protective effect in females but was associated with increased T2DM risk in males (MAF=0.0010, female OR=0.20, male OR=6.92, heterogeneity P=1.1×10–8). Another rare variant, rs146239492 near kinesin family member 2B (KIF2B), also showed significant sex-dimorphic effects (MAF=0.0031, female OR=0.43, male OR=5.20, heterogeneity P=3.8×10–8).
The variant rs73795193 near fibroblast growth factor 1 (FGF1) showed an increased risk in females but not in males (MAF=0.04, female OR=1.26, male OR=0.96, heterogeneity P=5.2×10–7). Though this variant has limited evidence of association with T2DM, FGF1 plays a key role in glucose homeostasis and adipogenesis, which may explain its involvement in sex-specific metabolic regulation [42].
Other loci with significant sex-heterogeneous effects included KLF14 (rs59326756, MAF=0.29, female OR=1.15, male OR=1.05, heterogeneity P=4.1×10–7), phosphodiesterase 3A (PDE3A) (rs7134375, MAF=0.39, female OR=0.93, male OR=1.00, heterogeneity P=2.7×10–5), and CMIP (rs12443634, MAF=0.27, female OR=0.87, male OR=0.96, heterogeneity P=2.6×10–7).
We did not investigate sex-heterogenous variants for FG and FI, as these variants were previously reported [24].
Participant characteristics in FHS
This study includes 8,379 FHS participants across three generational cohorts. All participants are of European ancestry. The characteristics of these participants are summarized in Table 2.
PRS of T2DM, FG, and FI
The correlation between PRS[sex-stratified] and PRS[sex-combined]) was higher for T2DM than for FG and FI (Pearson correlation: ρT2D=0.82, ρFG=0.77 and ρFI=0.66) (Supplementary Fig. 2). Associations between PRS and T2DM, FG, and FI were assessed separately for three groups: (1) all participants; (2) males; and (3) females. For prevalent T2DM analysis in all participants, OR was 1.67 per each standard deviation for PRS [sex-stratified] (95% CI for OR, 1.56 to 1.79) compared to 1.87 for PRS[sex-combined] (95% CI for OR, 1.74 to 2.01). For incident T2DM in all participants, HR was 1.45 per each standard deviation for PRS[sex-stratified] (95% CI for HR, 1.31 to 1.60) and 1.58 for PRS[sex-combined] (95% CI for HR, 1.42 to 1.76). Although these effect sizes were numerically different, overlapping confidence intervals for ORs and HRs indicated no statistically significant differences between the models. We observed similar overlapping patterns between PRS[sex-stratified] and PRS[sex-combined] for FG and log-transformed FI (Fig. 2).
Effects of two polygenic risk score (PRS) on prevalent type 2 diabetes mellitus (T2DM), incident T2DM, fasting glucose (FG), and fasting insulin (FI). Forest plots show the effect sizes of PRS[sex-stratified] and PRS[sex-combined] models for all participants, females and males across four outcomes. Odds ratios (ORs) are presented for prevalent T2DM, hazard ratios (HRs) for incident T2DM, and regression coefficient (beta) for FG and log-transformed FI. Effect sizes represent changes per 1 standard deviation increase in the PRS. Horizontal lines indicate 95% confidence interval (CI) for each effect size.
Combined PRS prediction on T2DM, FG, and FI
We then developed a model, called PRS[mix], that combines both sex-stratified and sex-combined PRS using a training dataset from FHS and tests it in an independent FHS subset of participants. For the combined model PRS[mix], it consistently assigned less than 25% weight to PRS[sex-stratified] across all outcomes. The contribution of PRS[sex-stratified] was relatively higher for glycemic traits (FG and FI) than for T2DM outcomes (Supplementary Fig. 3).
Across all four outcomes, PRS[sex-combined] constantly showed the best predictive accuracy. This may suggest that sex-combined PRS model captures enough variability without sex-stratification (Table 3). For prevalent T2DM, Delong’s test showed significant differences between PRS[sex-stratified] and both PRS[sex-combined] and PRS[mix] models (P=9.6×10−7 and P=2.1×10−7). In incident T2DM analysis, partial likelihood ratio tests indicated that both PRS[sex-combined] and PRS[mix] models provided significantly different fits compared to the PRS[sex-stratified] model (P=0.017 and P=0.013, respectively) (Table 4).
DISCUSSION
This study investigates the role of sex-dimorphic genetic effects in the identification of new variants associated with T2DM, and the development of PRS models for prediction of T2DM and glycemic traits (FG and FI). Among the identified T2DM loci, variant rs12443634 at CMIP locus presented a greater risk effect in females than in males. We also observed a similar pattern for rs2925979, a variant in high LD (R2>0.8) with variant rs12443634, which showed a modestly increased risk in females but not in males, supporting a stronger female-specific effect at this locus (Supplementary Table 1). Notably, rs2925979 was identified with significant sex-dimorphic effects in a previous large-scale T2DM GWAS meta-analysis [25]. In a Chinese family-based study, rs2925979 T allele was associated with 29% higher odds of T2DM in females but did not show a significant association in males [43]. Interestingly, their study found this allele was also associated with lower BMI and body fat percentage in non-diabetic females, suggesting pleiotropic effects of CMIP on both T2DM and obesity-related phenotypes. These findings underscore the importance of further investigation of the independent mechanisms underlying CMIP’s sex-dimorphic effects. In addition to CMIP region, we confirmed sex-dimorphic effects near GRB14/COBLL1 and KLF14, which only showed nominally differences in risk effects in a previous study [25].
Novel loci with sex-dimorphic effects were also identified, including LYPLAL1, NKX6-1, POC5, FGF1, PDE3A, and KIF2B. These findings contribute to the growing evidence of sex-specific genetic architecture in T2DM. LYPLAL1 has been implicated in lipid metabolism and fat distribution, and previous studies supported the sex-specific mechanism at this locus [44,45].
NKX6-1 is a transcription factor essential for pancreatic β-cell function, and its dysregulation may contribute to divergent T2DM risk between sexes [46]. FGF1 plays a role in adipogenesis and glucose homeostasis, with evidence suggesting differential gene expression by sex [47]. Understanding the biological relevance of these findings could provide insight into how sex-specific regulation of gene expression contributes to metabolic differences between males and females.
Our findings reveal that PRS derived from sex-combined GWAS better predict outcomes than those derived from sex-stratified GWAS for the outcomes investigated. These results suggest that current sample sizes in sex-stratified GWAS of T2DM, FG, and FI may be insufficient to detect subtle genetic contributions and improve predictive accuracy relative to sex-combined models.
Reduced statistical power is a key limitation of sex-stratified models. Despite identifying nine loci with significant sex-dimorphic effects for T2DM, the sample size in each sex-specific GWAS may have been too small to capture additional associations. Although the T2DM GWAS used to derive PRS were conducted in very large cohorts (UKBB, MGBB, and GERA), stratifying by sex effectively reduces the sample size within each analysis. This reduced power likely explains the weaker performance of sex-stratified PRS compared to sex-combined PRS. As larger sex-specific GWAS become available in the future, sex-stratified PRS may capture additional sex-dimorphic effects, and their predictive performance could improve. Consequently, sex-stratified PRS may not fully capture genetic risk variation, while sex-combined PRS can leverage shared genetic effects across sexes for greater detection power. Moreover, a relatively small number of variants with significant sex-dimorphic effects for the outcomes investigated limits the ability of sex-stratified PRS to demonstrate a performance advantage. Therefore, using sex-combined PRS with larger sample sizes remains advantageous in current research settings for T2DM, FG, and FI. Future more powerful sex-specific GWAS should identify additional sex-specific effects and may improve the performance of sex-dimorphic effects.
Another interesting observation was that PRS[sex-stratified] contributed more to PRS[mix] model for glycemic traits than for T2DM outcomes. This may reflect stronger sex-specific genetic influences on intermediate glycemic traits, which allow sex-stratified PRS to capture additional variance that contributes to prediction accuracy. Future studies could investigate how to best optimize the integration of sex-specific and sex-combined PRS across different traits and outcomes.
From a clinical perspective, PRS are valuable tools for identifying individuals at high genetic risk for diseases like T2DM, particularly among those perceived to have low risk based on classic clinical risk factors [11]. However, our results suggest that current sex-specific PRS models offer limited additional predictive value over sex-combined models for T2DM and glycemic traits. This finding highlights the limitation of current sex-stratified GWAS sample sizes, underscores the need for enlarging such datasets to enhance precision and cautions against premature clinical application until then. Future research should aim to increase the sample sizes of sex-stratified GWAS, particularly in non-European populations. This would improve the power to detect sex-dimorphic genetic effects and refine PRS models. Additionally, integrating multi-omics data and functional annotations may provide further insights into sex-specific mechanisms underlying T2DM and related traits. Encouragingly, studies of other complex traits, such as schizophrenia, coronary artery disease, and basal cell carcinoma, have demonstrated the clinical utility of sex-stratified PRS for risk stratification, supporting the need for continued research in this area [48-50].
In conclusion, while sex-combined PRS currently offer better predictive performance for T2DM and glycemic traits, ongoing research into sex-dimorphic effects remains crucial for advancing personalized medicine and improving disease risk prediction. Expanding sample sizes and incorporating functional insights will be essential for optimizing the integration of sex-stratified models into clinical practice.
SUPPLEMENTARY MATERIALS
Supplementary materials related to this article can be found online at https://doi.org/10.4093/dmj.2025.0557.
T2DM variants with sex-dimorphic effects in the meta-analysis of UKBB, MGBB, and GERA
Regional association plots (±500 kb) for loci showing sex-dimorphic effects on type 2 diabetes mellitus (T2DM). These plots were generated by the locuszoomr package [1] with linkage disequilibrium (LD) information based on the European population from the 1000 Genomes Program [2]. (A) Regional plots for the lysophospholipase like 1 (LYPLAL1) locus with sex-dimorphic effects on T2DM. The top panel shows the female-specific genome-wide association study (GWAS) results, and the bottom panel shows the male-specific results. The purple diamond indicates the index variant rs7524924, defined as the variant with the smallest P value in the sex-heterogeneity test within the region. (B) Regional plots for the growth factor receptor bound protein 14 (GRB14)/cordon-bleu WH2 repeat protein like 1 (COBLL1) locus with sex-dimorphic effects on T2DM. The top panel shows the female-specific GWAS results, and the bottom panel shows the male-specific results. The purple diamond indicates the index variant rs3923113, defined as the variant with the smallest P value in the sex-heterogeneity test within the region. (C) Regional plots for the NK6 homeobox 1 (NKX6-1) locus with sex-dimorphic effects on T2DM. The top panel shows the female-specific GWAS results, and the bottom panel shows the male-specific results. The purple diamond indicates the index variant rs141203601, defined as the variant with the smallest P value in the sex-heterogeneity test within the region. (D) Regional plots for the POC5 locus with sex-dimorphic effects on T2DM. The top panel shows the female-specific GWAS results, and the bottom panel shows the male-specific results. The purple diamond indicates the index variant rs258494, defined as the variant with the smallest P value in the sex-heterogeneity test within the region. (E) Regional plots for the fibroblast growth factor 1 (FGF1) locus with sex-dimorphic effects on T2DM. The top panel shows the female-specific GWAS results, and the bottom panel shows the male-specific results. The purple diamond indicates the index variant rs73795193, defined as the variant with the smallest P value in the sex-heterogeneity test within the region. (F) Regional plots for the KLF transcription factor 14 (KLF14) locus with sex-dimorphic effects on T2DM. The top panel shows the female-specific GWAS results, and the bottom panel shows the male-specific results. The purple diamond indicates the index variant rs59326756, defined as the variant with the smallest P value in the sex-heterogeneity test within the region. (G) Regional plots for the phosphodiesterase 3A (PDE3A) locus with sex-dimorphic effects on T2DM. The top panel shows the female-specific GWAS results, and the bottom panel shows the male-specific results. The purple diamond indicates the index variant rs7134375, defined as the variant with the smallest P value in the sex-heterogeneity test within the region. (H) Regional plots for the c-Maf inducing protein (CMIP) locus with sex-dimorphic effects on T2DM. The top panel shows the female-specific GWAS results, and the bottom panel shows the male-specific results. The purple diamond indicates the index variant rs12443634, defined as the variant with the smallest P value in the sex-heterogeneity test within the region. (I) Regional plots for the kinesin family member 2B (KIF2B) locus with sex-dimorphic effects on T2DM. The top panel shows the female-specific GWAS results, and the bottom panel shows the male-specific results. The purple diamond indicates the index variant rs146239492, defined as the variant with the smallest P value in the sex-heterogeneity test within the region.
Distribution of two standardized polygenic risk score (PRS) for (A) type 2 diabetes mellitus (T2DM), (B) fasting glucose (FG), and (C) fasting insulin (FI) across Framingham Heart Study (FHS) participants. Panels show the distributions of PRS[sex-stratified] and PRS[sex-combined] for T2DM, FG, and FI. PRS were standardized to have a mean of zero and standard deviation of one. The overlap of distributions indicates similarity between the two types of PRS. The dashed horizontal line represents the mean PRS at zero. The y-axis shows the density of PRS values.
Contribution of polygenic risk score (PRS) [sex-stratified] to PRS[mix] by outcome. Box plots illustrate the contribution of PRS[sex-stratified] to the development of PRS[mix] across four outcomes. Contributions were evaluated using 10-fold cross validation and remained consistently below 25% for all outcomes. The blue bars indicate the median contribution percentage, while the red dot and number indicate the average contribution. The contribution of PRS[sex-stratified] is relatively higher for glycemic traits (fasting glucose [FG] and fasting insulin [FI]) compared to prevalent and incident type 2 diabetes mellitus (T2DM).
Notes
CONFLICTS OF INTEREST
No potential conflict of interest relevant to this article was reported.
AUTHOR CONTRIBUTIONS
Conception or design: N.W., Y.Z., J.B.M., A.K.M., C.T.L., J.D., J.M.M.
Acquisition, analysis, or interpretation of data: all authors.
Drafting the work or revising: N.W., Y.Z., C.T.L., J.D., J.M.M.
Final approval of the manuscript: all authors.
FUNDING
Ningyuan Wang, Yixin Zhang, James B. Meigs, Alisa K. Manning, Ching-Ti Liu, Josée Dupuis are supported by NIDDK UM1DK078616. Josep M. Mercader is supported by American Diabetes Association grant #11-22-ICTSPM-16 and by NHGRI U01HG011723, by the National Institute Of Diabetes And Digestive And Kidney Diseases of the National Institutes of Health under Award Number R01DK137993 and U01 DK140757, AMP CMD award from RFP 6 from the Foundation for the National Institutes of Health, and a Medical University of Bialystok (MUB) grant from the Ministry of Science and Higher Education (Poland). This work is supported by the Novo Nordisk Foundation (NNF21SA0072102).
ACKNOWLEDGMENTS
The authors thank the participants of the Framingham Heart Study, as well as the study team for their contributions.
