Mode
Text Size
Log in / Sign up

Machine learning shows no clear advantage over logistic regression in orthopedic sports medicine prediction models

Machine learning shows no clear advantage over logistic regression in orthopedic sports medicine pre…
Photo by Markus Winkler / Unsplash
Key Takeaway
Interpret ML prediction models cautiously; current evidence shows no clear advantage over logistic regression in orthopedic sports medicine.

This systematic review and meta-analysis examined 52 articles on clinical prediction models using machine learning (ML) in orthopedic sports medicine. The analysis compared ML models to traditional logistic regression (LR), focusing on methodological quality, performance reporting, and area under the curve (AUC) metrics. The study population and setting were not reported in the input data.

In comparisons at low risk of bias (125 comparisons), ML showed no performance benefit over LR, with a logit(AUC) difference of 0.00 lower for ML (95% CI: -0.18 to 0.18). In high risk-of-bias comparisons (43 comparisons), ML had a logit(AUC) 0.08 higher than LR (95% CI: -0.22 to 0.48). Random forest algorithms demonstrated superior performance with a logit(AUC) of 0.11 (95% CI: 0.00 to 0.21). Safety and tolerability data were not reported.

Key limitations include methodological heterogeneity across studies and potential bias in outcome imbalance and management of continuous predictors. The Level of Evidence is IV, indicating a systematic review and meta-analysis with more than 2 negative criteria. The authors note that random forest algorithms were associated with higher performance, but this represents association rather than causation.

For clinical practice, no conclusive recommendations can be made regarding the superiority of ML over LR in orthopedic sports medicine prediction models. The findings suggest that improvements in methodology and standardized reporting are required before ML models can be reliably recommended over traditional statistical approaches. Clinicians should interpret these results cautiously given the heterogeneity and potential biases in the underlying studies.

Study Details

Study typeMeta analysis
EvidenceLevel 1
PublishedApr 2026
View Original Abstract ↓
IMPORTANCE: The modeling methodology and reporting of performance metrics in the development of machine learning (ML) models in orthopedic sports medicine have not yet been systematically assessed. OBJECTIVE: The purpose of this study was to systematically review this literature for clinical prediction models utilizing ML and to evaluate the methodological quality of modeling and performance reporting, as well as to compare the performance of ML with logistic regression (LR) predictions where applicable. EVIDENCE REVIEW: A systematic search was conducted of the MEDLINE, Scopus, and Embase databases in September 2025 for articles pertaining to clinical prediction models using ML in orthopedic sports medicine. Study demographics, outcomes, modeling workflow, and risk-of-bias information were collected. A random-effects meta-regression controlling for article and sample size was performed to compare the differences, where applicable, in performance benefit, measured by area under the curve (AUC), of utilizing ML models over LR. FINDINGS: A total of 1033 articles were screened, resulting in the inclusion of 52 articles in the final analysis. The most frequently utilized ML algorithm was random forest, followed by boosted trees and support vector machines. Most noteworthy sources of potential bias were encountered in outcome imbalance and the management of continuous predictors. A total of 25 studies performed a total of 168 pairwise comparisons between ML and LR. For 43 comparisons at high risk of bias, logit-transformed AUC regression (logit(AUC)) was 0.08 (-0.22-0.48) higher for ML, for 125 comparisons at low risk of bias, logit(AUC) was 0.00 (-0.18-0.18) lower for ML. Overall, random forest (RF) models demonstrated superior performance with a logit(AUC) of 0.11 (0.00-0.21). CONCLUSION AND RELEVANCE: While RF algorithms were associated with higher performance relative to traditional methods in well-constructed prediction problems (i.e. adequately powered datasets with appropriate feature handling, class balance considerations, and rigorous validation), no conclusive recommendations can be made regarding the superiority of ML over LR, given the current evidence. Improvements in methodology and standardized reporting of performance metrics are required for the useful interpretation of future comparisons. LEVEL OF EVIDENCE: IV, Systematic Review and Meta-Analysis with more than 2 negative criteria.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.