Mode
Text Size
Log in / Sign up

Polygenic risk score enrichment reduces sample size needs for coronary artery disease and inflammatory bowel disease trials.

Polygenic risk score enrichment reduces sample size needs for coronary artery disease and inflammato…
Photo by CDC / Unsplash
Key Takeaway
Consider that PRS-enriched trial designs may reduce sample size requirements for specific genetic targets, but optimal thresholds vary by disease.

This in silico framework used an observational cohort design with UK Biobank participants having genomic and electronic health record data. The study modeled trial designs that restrict enrollment to the upper 75%, 50%, or 25% of a polygenic risk score (PRS) distribution, compared to unenriched designs drawing from the full population. The primary outcomes were disease prevalence, statistical power, sample size requirements, and time-to-event accrual.

Main results showed that PRS-enriched designs increased disease prevalence and improved empirical power relative to unenriched cohorts. For required per-arm sample sizes at 80% power, enrichment to the upper 25% of PRS distribution reduced needs by approximately 60% for CAD-PCSK9 and 78% for IBD-IL23R. Time-to-event accrual was accelerated in enriched designs. However, for glaucoma-ANGPTL7, the most restrictive threshold did not yield additional gains over moderate enrichment due to reduced sample size attenuating the detectable difference.

No safety or tolerability data were reported, as this was an in silico analysis. Key limitations include that the framework uses naturally occurring protective genetic variants as analogs for therapeutic interventions, optimal PRS thresholds are disease-context dependent, and reduced sample size may attenuate detectable differences in some diseases. The practice relevance is that this provides a scalable foundation for integrating genetic risk into clinical trial design using population-scale genomic data.

The study cautions that results are based on an in silico evaluation using UK Biobank data and are not a clinical trial or experimental intervention. Findings are limited to three model gene-disease pairs and should not be generalized universally.

Study Details

Study typeCohort
EvidenceLevel 3
PublishedApr 2026
View Original Abstract ↓
Background: Clinical trials are essential for therapeutic development but increasingly face challenges due to imprecise inclusion criteria, leading to low event rates and the need for large sample sizes. This inefficiency makes modern trials costly and time-consuming. Despite the availability of extensive clinical, genomic, and biological data, current trial enrollment strategies do not fully leverage this information. Incorporating genomic information into trial design could enable risk-based participant enrichment by preferentially enrolling individuals with higher disease risk, thereby increasing event rates and improving trial efficiency. Methods: In this study, we developed an in silico framework for evaluating prognostic enrichment guided by polygenic risk scores (PRS) in clinical trial design using genomic and electronic health record data from large-scale biobanks. Naturally occurring protective genetic variants were used as analogs of therapeutic interventions, with variant carriers treated as 'treatment' arms and non-carriers as 'control' arms. We compared unenriched designs, in which carriers and non-carriers were drawn from the full population, against PRS-enriched designs in which both arms were restricted to participants in the upper 75%, 50%, or 25% of the PRS distribution, respectively. Across these four designs, we quantified disease prevalence, statistical power, sample size requirements, and time-to-event accrual. Results: We applied this approach to the UK Biobank using three model gene-disease pairs: the protective variant p.Arg46Leu in PCSK9 for coronary artery disease (CAD), p.Gln175His in ANGPTL7 for glaucoma, and p.Arg381Gln in IL23R for inflammatory bowel disease (IBD). Across all three disease contexts, PRS-enriched designs increased disease prevalence, improved empirical power, and accelerated event accrual relative to unenriched cohorts. At 80% power, restricting enrollment to the upper 25% of the PRS distribution reduced required per-arm sample sizes by approximately 60% for CAD-PCSK9 and 78% for IBD-IL23R. Consistent reductions in time-to-event were also observed across enriched strata, suggesting that PRS-enriched trials could achieve target event counts with both smaller sample sizes and shorter follow-up. However, for glaucoma-ANGPTL7, the most restrictive threshold did not yield additional gains over moderate enrichment, as reduced sample size attenuated the detectable difference between arms. These results highlight the need to balance enrichment for higher-risk participants against retaining a sufficient eligible population, and underscore that optimal PRS thresholds are disease-context dependent. Conclusions: These findings establish a generalizable, data-driven framework for prospectively evaluating PRS-guided prognostic enrichment prior to trial initiation. In general, PRS-guided study designs lead to improved empirical power, lower required sample sizes, and faster trials. As population-scale genomic data become increasingly available within healthcare systems and biobanks, this framework provides a scalable foundation for integrating genetic risk information into clinical trial design.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.