Composite Endpoints Show Weaker Treatment Effects Than EDSS Alone in Multiple Sclerosis Trials

Photo by Brett Jordan / Unsplash

medRxiv Published April 24, 2026 Medically reviewed April 24, 2026 Study authors: Bovis, F.; Montobbio, N.; Signori, A.; Kalincik, T.; Arnold, D. L.; Tintore, M.; Kappos, L.; Sormani… DOI ↗ By Dr. Ji-eun Park, MD · Brain, Mind & Pain

Key Takeaway

Note composite endpoints do not systematically improve treatment effect detection in multiple sclerosis trials.

This post-hoc analysis of individual patient data from ten phase III randomised controlled trials included 9369 participants with relapsing-remitting and progressive multiple sclerosis. The study followed patients for 24 weeks to evaluate composite endpoints constructed from the Expanded Disability Status Scale, the timed 25-foot walk test, and the nine-hole peg test.

Comparisons against the Expanded Disability Status Scale alone revealed that OR-type composite endpoints showed weaker treatment effects. The effect size reached {Delta}Z up to +2.26 with an interaction p = 0.004 indicating a reduction in sensitivity. While the combination of the Expanded Disability Status Scale and the nine-hole peg test showed numerically stronger treatment effects in progressive multiple sclerosis with {Delta}Z = -1.65, the interaction p = 0.051 was not statistically significant.

Timed walk test event rates generated the highest rates up to 46.8%, whereas nine-hole peg test event rates generated the lowest rates as low as 2.1%. Safety data including adverse events and serious adverse events were not reported. Limitations note that composite endpoints do not systematically improve treatment effect detection. Increased event capture driven by the timed walk test introduces noise that dilutes the treatment signal rather than amplifying it.

The combination of global disability and upper limb measures represents a promising direction for future endpoint development in progressive multiple sclerosis trials, warranting validation. Composite endpoints do not systematically improve treatment effect detection in multiple sclerosis trials based on current evidence.

Study Details

Study typeRct

Sample sizen = 9,369

EvidenceLevel 2

PublishedApr 2026

View Original Abstract ↓

Disability worsening is the critical long-term outcome in multiple sclerosis, yet the Expanded Disability Status Scale incompletely captures neurological deterioration and has limited sensitivity in the short time windows of clinical trials. Composite endpoints incorporating functional measures have been proposed to address these limitations, but whether they reliably improve detection of treatment effects has not been established across trials. We conducted a post-hoc analysis of individual patient data from ten phase III randomised controlled trials (ASCEND, BRAVO, CONFIRM, DEFINE, EXPAND, INFORMS, OLYMPUS, OPERA I/II, and ORATORIO; n = 9,369), spanning relapsing-remitting and progressive multiple sclerosis. Confirmed disability worsening was defined using harmonised criteria with the msprog package and confirmed at 24 weeks. Treatment effects were estimated using Cox proportional hazards models and combined across trials in a one-stage individual patient data framework. Composite endpoints were constructed from the Expanded Disability Status Scale, the timed 25-foot walk test, and the nine-hole peg test using logical unions (OR-type), intersections (AND-type), and majority-vote structures. Sensitivity to treatment effect was quantified using Z-scores (the ratio of the pooled log-hazard ratio to its standard error) and compared to the Expanded Disability Status Scale reference using interaction tests. Event rates varied across components: the timed walk test generated the highest rates (up to 46.8%) while the nine-hole peg test generated the lowest (as low as 2.1%). OR-type composite endpoints showed weaker treatment effects than the Expanded Disability Status Scale alone, with the largest reductions in sensitivity observed for endpoints incorporating the timed walk test ({Delta}Z up to +2.26; interaction p = 0.004). These findings were confirmed across disease subtypes and were pronounced in relapsing-remitting trials, where no composite endpoint outperformed the Expanded Disability Status Scale. In progressive multiple sclerosis, the combination of the Expanded Disability Status Scale and the nine-hole peg test showed numerically stronger treatment effects ({Delta}Z = -1.65), though interaction tests did not reach statistical significance (p = 0.051). Composite endpoints do not systematically improve treatment effect detection in multiple sclerosis trials. Increased event capture driven by the timed walk test introduces noise that dilutes the treatment signal rather than amplifying it, highlighting that event rate and endpoint quality are not interchangeable. Upper limb function assessed by the nine-hole peg test provides complementary and specific information, particularly in progressive disease. The combination of global disability and upper limb measures represents a promising direction for future endpoint development in progressive multiple sclerosis trials, warranting validation.

Composite Endpoints Show Weaker Treatment Effects Than EDSS Alone in Multiple Sclerosis Trials

Study Details

Intravenous tenecteplase improves functional outcomes but increases hemorrhage risk in non-large vessel occlusion acute ischemic stroke.

Tenecteplase improves stroke recovery but increases bleeding risk in late treatment window

Clinical research that matters. Delivered to your inbox.

Composite Endpoints Show Weaker Treatment Effects Than EDSS Alone in Multiple Sclerosis Trials

More on Multiple Sclerosis

Study Details

Intravenous tenecteplase improves functional outcomes but increases hemorrhage risk in non-large vessel occlusion acute ischemic stroke.

Tenecteplase improves stroke recovery but increases bleeding risk in late treatment window

Clinical research that matters. Delivered to your inbox.

Related in Neurology

From Other Specialties