This cohort study used data from the Childhood Arthritis and Rheumatology Research Alliance (CARRA) Registry, comprising 8093 children with non-systemic juvenile idiopathic arthritis. Researchers developed machine learning models, including Dynamic Bayesian Networks (DBN) and Convolutional Neural Networks (CNN), to predict the attainment of inactive disease. An independent test cohort was used for evaluation.
For inactive disease prediction, the DBN model had an AUC of 0.76, precision of 0.73, recall of 0.83, F1-score of 0.78, and accuracy of 0.71. The CNN model had an AUC of 0.76, precision of 0.71, recall of 0.63, F1-score of 0.67, and accuracy of 0.70. No p-values or confidence intervals were reported.
Safety and tolerability data were not reported for these models. Key limitations include suboptimal care patterns in the retrospective data, which limit the capacity to learn treatment-outcome relationships, and the need for prospective comparative studies. The practice relevance suggests machine learning can predict disease trajectories with good discriminative performance, but causal relationships are not established.
The models are based on retrospective data, and the follow-up duration was not reported. Do not overstate the ability to learn treatment-outcome relationships from this retrospective cohort.
View Original Abstract ↓
Background: Despite advances in therapy, optimal management of juvenile idiopathic arthritis (JIA) remains challenging. The ability to predict disease progression in JIA can improve personalized treatment decisions, but few reliable clinical predictors have been identified. We developed machine learning approaches to predict disease trajectories in children with JIA. Methods: Using data from the Childhood Arthritis and Rheumatology Research Alliance (CARRA) Registry (years 2015-2024), we developed machine learning models to predict attainment of inactive disease in children with non-systemic JIA. We applied Dynamic Bayesian Networks (DBN) to model temporal dependencies and causal relationships, and Convolutional Neural Networks (CNN) to capture complex non-linear patterns. Model input included demographic factors, longitudinal clinical factors, and medication use in the preceding 12 months. Findings: A total of 8,093 participants were included. When tested on an independent test cohort, both DBN (AUC:0.76; precision:0.73; recall:0.83; F1-score:0.78; accuracy:0.71) and CNN (AUC:0.76; precision:0.71; recall:0.63; F1-score:0.67; accuracy:0.70) models achieved comparable performance in predicting inactive disease. Disease activity levels in the preceding 12 months, presence of enthesitis and uveitis were the strongest predictors. Causal relationships captured in the DBN model revealed suboptimal care patterns, likely shaped by insurance constraints and a predominantly reactive approach to JIA management. Interpretation: Our study demonstrates that machine learning approaches can predict disease trajectories in JIA with good discriminative performance. Unlike prior studies that predict outcomes at single timepoints, our models are the first to predict inactive disease longitudinally. However, suboptimal care patterns in retrospective data limit the capacity to learn treatment-outcome relationships, underscoring critical opportunities to improve JIA care and the need for prospective comparative studies to better inform prediction models.