This retrospective case-control study assessed risk factors for the comorbidity of cataract and age-related macular degeneration among 640 participants, comprising 264 cases and 376 controls. The analysis examined exposures including smoking, aging, C-reactive protein (CRP) levels, drusen severity, and lens opacity measured via LOCS III. A machine learning model utilizing XGBoost was trained and validated to stratify comorbidity risk.
The model demonstrated robust discriminative performance with an area under the curve (AUC) of 0.895 (95% CI: 0.85–0.93). Analysis of the interaction between smoking and aging revealed a non-linear synergistic effect, indicating exponentially higher comorbidity risk in individuals aged over 75 years. Additionally, a distinct saturation threshold effect was observed regarding CRP levels.
Safety data, including adverse events, discontinuations, and tolerability, were not reported in this study. Key limitations include the retrospective design and the inability to explicitly distinguish between association and causation. The study describes findings as 'associated with' or 'synergistic effects' rather than causal relationships. Generalizability beyond the 640 participants is uncertain.
This research establishes a robust, clinically applicable risk stratification tool for cataract and AMD comorbidity. Clinicians should interpret these results as associations derived from observational data rather than proof of causation. The model's performance metrics provide a basis for risk assessment within the specific population studied.
View Original Abstract ↓
BackgroundThe comorbidity of cataract and age-related macular degeneration (AMD) poses a significant public health burden. Traditional linear statistical models often fail to capture complex, non-linear interactions among risk factors. This study aimed to develop an interpretable machine learning framework to predict comorbidity risk and elucidate the synergistic effects of systemic and ocular factors.MethodsA retrospective case-control study was conducted involving 640 participants (264 comorbidity cases and 376 controls). Fifteen multi-dimensional clinical features were extracted. Four machine learning algorithms—Logistic Regression, Random Forest, SVM, and XGBoost—were trained and validated. Model performance was assessed via AUROC, AUPRC, and calibration curves. SHapley Additive exPlanations (SHAP) and LIME were employed to visualize global and local interpretability.ResultsThe XGBoost model demonstrated robust discriminative performance (AUC = 0.895, 95% CI: 0.85–0.93) and calibration compared to other algorithms. SHAP analysis identified drusen severity and lens opacity (LOCS III) as dominant ocular predictors, while C-reactive protein (CRP) and smoking were critical systemic contributors. Notably, interaction analysis revealed a non-linear synergistic effect: smoking was associated with an exponentially higher comorbidity risk in individuals aged >75 years, whereas CRP exhibited a distinct saturation threshold effect. Decision curve analysis confirmed the model's high net clinical benefit across a wide range of threshold probabilities.ConclusionThis study establishes a robust, clinically applicable risk stratification tool for cataract and AMD comorbidity. By uncovering non-linear interactions between aging, lifestyle, and inflammation, it provides valuable evidence-based support for personalized screening and preventive intervention.