Polygenic risk score enrichment reduces sample size needs for coronary artery disease and inflammatory bowel disease trials.
This in silico framework used an observational cohort design with UK Biobank participants having genomic and electronic health record data. The study modeled trial designs that restrict enrollment to the upper 75%, 50%, or 25% of a polygenic risk score (PRS) distribution, compared to unenriched designs drawing from the full population. The primary outcomes were disease prevalence, statistical power, sample size requirements, and time-to-event accrual.
Main results showed that PRS-enriched designs increased disease prevalence and improved empirical power relative to unenriched cohorts. For required per-arm sample sizes at 80% power, enrichment to the upper 25% of PRS distribution reduced needs by approximately 60% for CAD-PCSK9 and 78% for IBD-IL23R. Time-to-event accrual was accelerated in enriched designs. However, for glaucoma-ANGPTL7, the most restrictive threshold did not yield additional gains over moderate enrichment due to reduced sample size attenuating the detectable difference.
No safety or tolerability data were reported, as this was an in silico analysis. Key limitations include that the framework uses naturally occurring protective genetic variants as analogs for therapeutic interventions, optimal PRS thresholds are disease-context dependent, and reduced sample size may attenuate detectable differences in some diseases. The practice relevance is that this provides a scalable foundation for integrating genetic risk into clinical trial design using population-scale genomic data.
The study cautions that results are based on an in silico evaluation using UK Biobank data and are not a clinical trial or experimental intervention. Findings are limited to three model gene-disease pairs and should not be generalized universally.