Mode
Text Size
Log in / Sign up

AI endoscopic score shows better agreement with ulcerative colitis remission than Mayo score in RCTAI endoscopic scoring shows better agreement with ulcerative colitis symptoms in trial

AI-generated summary of the cited source, checked by automated accuracy review. How we work

Key Takeaway
Consider that AI endoscopic scoring shows better agreement with ulcerative colitis remission than Mayo score in this RCT.

This was a phase 3 randomized controlled trial in 387 subjects with ulcerative colitis from the TrueNorth trial. The study compared an artificial intelligence endoscopic cumulative disease score (CDS) to the standard Mayo endoscopic score (MES) over a 12.0-month follow-up. The primary outcome was agreement with symptoms and health-related quality of life.

In endoscopically active subjects achieving partial Mayo symptomatic remission, the CDS was lower (83.8 vs 186.9; P < 0.0001). The agreement of EuroQol 5 Dimension with CDS was better than with MES (κ = 0.53 vs 0.44). A low CDS cutoff (<40) had better agreement with partial Mayo remission (κ = 0.57 vs 0.72; area under the curve 0.85).

The difference between ozanimod and placebo remission with adequate endoscopic criteria increased from 22.6% to 28.6%. Safety data, including adverse events, serious adverse events, discontinuations, and tolerability, were not reported.

Key limitations include the observational analysis within an RCT, where association is reported, not causation. The practice relevance is that AI-enabled endoscopic scoring has better agreement with symptomatic remission and HR-QoLs compared with MES and could quantitatively redefine adequate mucosal healing.

This study tested an artificial intelligence (AI) system for scoring ulcerative colitis during endoscopy. It compared the AI system to the standard Mayo endoscopic score in 387 patients from a clinical trial. The main goal was to see how well each scoring method matched patient symptoms and quality of life.

The AI system showed better agreement with patient-reported symptoms and quality of life measures than the standard score. Patients with lower AI scores were more likely to be in symptomatic remission. The study also found that a low AI score cutoff had good agreement with remission status.

No safety concerns were reported for this analysis, as it focused on scoring agreement. The main reason to be careful is that this is an observational analysis within a larger trial; it shows an association, not that the AI system causes better outcomes. The results are from a phase 3 trial, so the certainty is high for the reported agreement measures.

Readers should understand that this study suggests AI scoring may align better with how patients feel, but it does not show that using AI changes treatment decisions or improves health outcomes. More research is needed to see if this scoring method leads to better patient care.

What this means for you:
AI endoscopic scoring agreed better with patient symptoms in ulcerative colitis, but this does not prove it improves outcomes.

Study Details

Study typeRct
EvidenceLevel 2
Follow-up12.0 mo
PublishedApr 2026
View Original Abstract ↓
INTRODUCTION: Ulcerative colitis (UC) disease assessment considers both mucosal healing and symptoms which can disagree. We compared the artificial intelligence (AI) endoscopic cumulative disease score (CDS) with the Mayo endoscopic score (MES) for agreement with symptoms and health related quality of life (HR-QoL) in UC. METHODS: Endoscopic video was obtained from TrueNorth, a 52-week phase 3 trial comparing ozanimod (OZA) with placebo (PBO) for UC. End of maintenance CDS was compared with endoscopically inactive (MES 0, 1) vs active (MES 2, 3) groups by partial Mayo remission (PMS, ≤2) and treatment using the Mann-Whitney U test. CDS and MES were compared with EuroQol 5 Dimension HR-QoL measures using the Kruskal-Wallis test. Low CDS cutoff for PMS remission used the Youden index, with agreement to PMS assessed using Cohen's κ. RESULTS: In 387 subjects, among endoscopically active subjects (MES 2, 3), the CDS was lower in those achieving PMS symptomatic remission (83.8 vs 186.9, P < 0.0001). EuroQol 5 Dimension had better agreement with CDS than MES (κ = 0.53 vs 0.44) and detected QoL differences within MES 2, 3 subjects for patient-reported health dimensions. Compared with low endoscopic maximum intensity (MES 0, 1), low-cumulative disease burden (CDS <40, area under the curve 0.85) had better agreement with PMS remission (κ = 0.57 vs 0.72). When defining adequate endoscopic criteria as PMS remission plus either low MES or low CDS, the difference between OZA vs PBO remission increased from 22.6% to 28.6%. DISCUSSION: AI-enabled endoscopic scoring has better agreement with symptomatic remission and HR-QoLs compared with MES and could quantitatively redefine adequate mucosal healing.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.