This retrospective cohort study evaluated the performance of an interpretable machine learning framework (XGBoost model) for predicting detrusor underactivity in 538 urodynamically evaluated patients with benign prostatic hyperplasia (BPH). The model used five baseline clinical, anatomical, and uroflowmetry parameters and was compared against traditional multivariable logistic regression.
The primary outcome was discriminatory performance measured by area under the curve (AUC). The XGBoost model achieved an AUC of 0.958, significantly outperforming logistic regression which had an AUC of 0.787. Additionally, the XGBoost model exhibited superior calibration and higher net clinical benefit across varied threshold probabilities.
Safety and tolerability were not reported. The study's main limitation is that it was based on internal validation only, without external validation in independent cohorts. Therefore, the results should be considered preliminary.
For clinical practice, this model may help optimize preoperative patient selection and mitigate surgical failures in borderline clinical scenarios, but further validation is needed before routine use.
View Original Abstract ↓
ObjectiveTo develop and internally validate an interpretable, non-invasive machine learning framework to predict detrusor underactivity (DU) in patients with benign prostatic hyperplasia (BPH).MethodsThis retrospective cohort study enrolled 538 urodynamically evaluated BPH patients. A rigorous multidimensional feature selection pipeline (LASSO, Boruta, and Recursive Feature Elimination) distilled 15 baseline clinical, anatomical, and uroflowmetry parameters into a parsimonious five-feature subset. Five supervised machine learning algorithms were trained and systematically compared. Shapley Additive exPlanations (SHAP) analysis was integrated for global and local interpretability.ResultsThe optimized XGBoost model demonstrated superior discriminatory performance (AUC = 0.958), significantly outperforming traditional multivariable logistic regression (AUC = 0.787). XGBoost consistently exhibited superior calibration and higher net clinical benefit across varied threshold probabilities. Crucially, SHAP global dependence plots revealed non-linear pathological trajectories, notably demonstrating a U-shaped risk profile for bladder wall thickness (BWT) that was not captured by classical linear statistical detection. Local SHAP visualizations effectively translated complex probabilistic outputs into individualized clinical reasoning.ConclusionThe interpretable XGBoost framework serves as a robust non-invasive risk stratification tool for DU, decoding complex non-linear clinical interactions. This algorithm holds significant potential to optimize preoperative patient selection and mitigate surgical failures in borderline clinical scenarios.Clinical trial registrationIdentifier 2026-048.