Mode
Text Size
Log in / Sign up

Polygenic risk score enrichment reduces sample size needs for coronary artery disease and inflammatory bowel disease trialsSmarter trial designs could cut study sizes by 60% for heart disease

AI-generated summary of the cited source, checked by automated accuracy review. How we work

Key Takeaway
Consider that PRS-enriched trial designs may reduce sample size requirements for specific genetic targets, but optimal thresholds vary by disease.

This in silico framework used an observational cohort design with UK Biobank participants having genomic and electronic health record data. The study modeled trial designs that restrict enrollment to the upper 75%, 50%, or 25% of a polygenic risk score (PRS) distribution, compared to unenriched designs drawing from the full population. The primary outcomes were disease prevalence, statistical power, sample size requirements, and time-to-event accrual.

Main results showed that PRS-enriched designs increased disease prevalence and improved empirical power relative to unenriched cohorts. For required per-arm sample sizes at 80% power, enrichment to the upper 25% of PRS distribution reduced needs by approximately 60% for CAD-PCSK9 and 78% for IBD-IL23R. Time-to-event accrual was accelerated in enriched designs. However, for glaucoma-ANGPTL7, the most restrictive threshold did not yield additional gains over moderate enrichment due to reduced sample size attenuating the detectable difference.

No safety or tolerability data were reported, as this was an in silico analysis. Key limitations include that the framework uses naturally occurring protective genetic variants as analogs for therapeutic interventions, optimal PRS thresholds are disease-context dependent, and reduced sample size may attenuate detectable differences in some diseases. The practice relevance is that this provides a scalable foundation for integrating genetic risk into clinical trial design using population-scale genomic data.

The study cautions that results are based on an in silico evaluation using UK Biobank data and are not a clinical trial or experimental intervention. Findings are limited to three model gene-disease pairs and should not be generalized universally.

Imagine a clinical trial that needs far fewer people to prove a new drug works. That’s the promise of a new approach that uses genetic risk scores to focus on volunteers most likely to get sick. Researchers tested this idea using UK Biobank data, simulating how it would work for coronary artery disease, glaucoma, and inflammatory bowel disease. They found that by enrolling only people in the top 25% of genetic risk, the required sample size for some trials could drop by about 60% for heart disease and 78% for IBD. This could also speed up how quickly trials gather results. But the benefit wasn’t universal; for glaucoma, the most restrictive approach didn’t add extra gain because the smaller sample size made it harder to see a difference. This was a computer simulation, not a real trial, and the best risk threshold depends on the specific disease. Still, it shows how genetic data might help build faster, more efficient studies in the future.

What this means for you:
Focusing trials on people with high genetic risk could make studies smaller and faster, but it doesn’t work the same for every disease.

Study Details

Study typeCohort
EvidenceLevel 3
PublishedApr 2026
View Original Abstract ↓
Background: Clinical trials are essential for therapeutic development but increasingly face challenges due to imprecise inclusion criteria, leading to low event rates and the need for large sample sizes. This inefficiency makes modern trials costly and time-consuming. Despite the availability of extensive clinical, genomic, and biological data, current trial enrollment strategies do not fully leverage this information. Incorporating genomic information into trial design could enable risk-based participant enrichment by preferentially enrolling individuals with higher disease risk, thereby increasing event rates and improving trial efficiency. Methods: In this study, we developed an in silico framework for evaluating prognostic enrichment guided by polygenic risk scores (PRS) in clinical trial design using genomic and electronic health record data from large-scale biobanks. Naturally occurring protective genetic variants were used as analogs of therapeutic interventions, with variant carriers treated as 'treatment' arms and non-carriers as 'control' arms. We compared unenriched designs, in which carriers and non-carriers were drawn from the full population, against PRS-enriched designs in which both arms were restricted to participants in the upper 75%, 50%, or 25% of the PRS distribution, respectively. Across these four designs, we quantified disease prevalence, statistical power, sample size requirements, and time-to-event accrual. Results: We applied this approach to the UK Biobank using three model gene-disease pairs: the protective variant p.Arg46Leu in PCSK9 for coronary artery disease (CAD), p.Gln175His in ANGPTL7 for glaucoma, and p.Arg381Gln in IL23R for inflammatory bowel disease (IBD). Across all three disease contexts, PRS-enriched designs increased disease prevalence, improved empirical power, and accelerated event accrual relative to unenriched cohorts. At 80% power, restricting enrollment to the upper 25% of the PRS distribution reduced required per-arm sample sizes by approximately 60% for CAD-PCSK9 and 78% for IBD-IL23R. Consistent reductions in time-to-event were also observed across enriched strata, suggesting that PRS-enriched trials could achieve target event counts with both smaller sample sizes and shorter follow-up. However, for glaucoma-ANGPTL7, the most restrictive threshold did not yield additional gains over moderate enrichment, as reduced sample size attenuated the detectable difference between arms. These results highlight the need to balance enrichment for higher-risk participants against retaining a sufficient eligible population, and underscore that optimal PRS thresholds are disease-context dependent. Conclusions: These findings establish a generalizable, data-driven framework for prospectively evaluating PRS-guided prognostic enrichment prior to trial initiation. In general, PRS-guided study designs lead to improved empirical power, lower required sample sizes, and faster trials. As population-scale genomic data become increasingly available within healthcare systems and biobanks, this framework provides a scalable foundation for integrating genetic risk information into clinical trial design.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.