This mixed-methods study (retrospective and prospective cohorts) evaluated a machine learning prediction model for obstructive sleep apnea screening in 4,036 participants. The model utilized gradient boosting algorithms (XGBoost, SVM, ANN, multinomial logistic regression) with a 15-item questionnaire and was compared against the STOP-Bang questionnaire (AUC: 0.68) and Berlin questionnaire (AUC: 0.72).
The model demonstrated superior diagnostic accuracy with AUCs of 0.92 for mild OSA, 0.94 for moderate OSA, and 0.97 for severe OSA. A clinical nomogram derived from the model showed a C-index of 0.93. Economic analysis suggested a 39.7% reduction in screening costs and 3.5-fold increase in case detection efficiency compared to existing screening paradigms.
Safety and tolerability data were not reported. The study used propensity score matching (1:4 ratio) to address potential confounding factors, but other limitations including study setting, follow-up duration, and funding sources were not reported. The authors describe this as a clinically deployable visual prediction framework, but prospective validation in diverse clinical populations is needed before implementation.
While the model shows promising accuracy improvements over existing screening tools, clinicians should interpret these findings cautiously given the observational nature of the evidence and lack of reported safety data. The economic benefits require confirmation in real-world healthcare settings.
View Original Abstract ↓
This study aimed to develop and validate a machine learning-enhanced screening questionnaire utilizing gradient boosting algorithms and to establish a clinically deployable visual prediction framework with superior diagnostic accuracy compared to existing screening paradigms.
We conducted a mixed-methods study analyzing polysomnography data from 4,036 participants between September 2019 and August 2025. The study included a retrospective cohort of 3,847 participants and a prospective cohort of 189 participants. We developed a 15-item questionnaire combining components from the modified Epworth Sleepiness Scale (ESS) and Snoring, Tiredness, Observed apneas, Blood pressure, Age, Neck circumference, and Gender (STOP-Bang) items. We evaluated four machine learning algorithms: XGBoost, support vector machine (SVM), artificial neural network (ANN), and multinomial logistic regression model. Performance was measured using the area under the curve (AUC), net reclassification improvement (NRI), calibration metrics, and decision curve analysis, while propensity score matching (1:4 ratio) addressed potential confounding factors.
XGBoost outperformed traditional screening tools, achieving AUC values of 0.92, 0.94, and 0.97 for mild, moderate, and severe obstructive sleep apnea (OSA), respectively, compared to the STOP-Bang questionnaire (AUC: 0.68) and the Berlin questionnaire (AUC: 0.72). The clinical nomogram exhibited excellent calibration characteristics with a C-index of 0.93. SHapley Additive exPlanations (SHAP) analysis identified neck circumference as the primary predictive feature (mean |SHAP| = 0.42), followed by body mass index (0.38) and witnessed apneas (0.35). Economic analysis revealed a 39.7% reduction in screening costs with a 3.5-fold increase in case detection efficiency.
The gradient boosting-enhanced OSA screening model represents a paradigmatic advancement in the diagnosis of sleep disorders, offering clinically actionable risk stratification through interpretable visualization while maintaining implementation feasibility. This methodological innovation provides a framework for artificial intelligence integration in clinical decision support, with potential applications extending beyond sleep medicine.