Mode
Text Size
Log in / Sign up

Machine learning model identifies RLS in ESRD patients with AUC 0.791

Machine learning model identifies RLS in ESRD patients with AUC 0.791
Photo by Trnava University / Unsplash
Key Takeaway
Note that a machine learning model shows moderate accuracy for RLS screening in ESRD patients pending external validation.

This cohort study included 396 patients with end-stage renal disease undergoing dialysis. The researchers developed a machine learning-based classification model to identify restless legs syndrome status. The model was compared against other machine learning algorithms including logistic regression, random forest, support vector machine, gradient boosting machine, K-nearest neighbors, decision tree, artificial neural network, multivariate adaptive regression splines, and quadratic discriminant analysis.

In the testing set, the support vector machine demonstrated optimal performance with an area under the curve of 0.791 (95% CI: 0.702–0.879). The model achieved an accuracy of 0.761, a sensitivity of 0.711, a specificity of 0.797, and an F1-score of 0.711. The Brier score was 0.183. SHAP analysis identified beta-2-microglobulin and anemia as the most influential variables with a mean absolute SHAP value of 0.131.

Safety and tolerability data were not reported. The study had a key limitation requiring prospective external validation in multi-center cohorts before clinical implementation. This tool may facilitate screening of prevalent RLS cases and inform clinical decision-making, but the evidence is observational and does not establish causality.

Study Details

Study typeCohort
EvidenceLevel 3
PublishedApr 2026
View Original Abstract ↓
BackgroundRestless legs syndrome (RLS) is a common and debilitating complication in end-stage renal disease (ESRD) patients undergoing dialysis, significantly impairing sleep quality and quality of life. Screening of prevalent cases remains challenging. This study aimed to develop and validate an interpretable machine learning-based classification model for identifying RLS status in ESRD patients.MethodsA total of 396 ESRD patients (173 hemodialysis, 223 peritoneal dialysis) were enrolled from April to October 2024. Patients were randomly divided into training (70%, n = 287) and testing (30%, n = 109) sets. Feature selection was performed using LASSO regression with five-fold cross-validation, followed by Akaike Information Criterion (AIC) refinement. Nine machine learning algorithms were developed: Logistic Regression (LR), Random Forest (RF), Support Vector Machine (SVM), Gradient Boosting Machine (GBM), K-Nearest Neighbors (KNN), Decision Tree (DT), Artificial Neural Network (ANN), Multivariate Adaptive Regression Splines (MARS), and Quadratic Discriminant Analysis (QDA). Model performance was evaluated using discrimination (AUC-ROC), calibration (Brier score, calibration curves), and clinical utility (Decision Curve Analysis, DCA). SHapley Additive exPlanations (SHAP) was employed to enhance model interpretability.ResultsFive variables were selected: β2-microglobulin, hemoglobin, diabetes mellitus, coronary heart disease, and alcohol consumption. SVM demonstrated optimal performance with AUC of 0.791 (95% CI: 0.702–0.879) in the testing set, outperforming other models. SVM achieved accuracy of 0.761, sensitivity of 0.711, specificity of 0.797, F1-score of 0.711, and Brier score of 0.183. Calibration curves showed good agreement between estimated and observed probabilities. DCA confirmed favorable net clinical benefit across threshold probabilities. SHAP analysis identified β2-microglobulin (mean |SHAP| = 0.131) and anemia as the most influential variables with diabetes, coronary heart disease, and alcohol consumption contributing moderately. SHAP dependence plots revealed interactions between β2-microglobulin and hemoglobin, as well as diabetes modifying the protective effect of higher hemoglobin.ConclusionWe developed and validated an interpretable SVM-based classification model for identifying RLS in ESRD patients using readily available clinical variables. This model demonstrates promising performance and requires prospective external validation in multi-center cohorts before clinical implementation. This tool may facilitate screening of prevalent RLS cases and inform clinical decision-making.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.