Mode
Text Size
Log in / Sign up

Meta-analysis finds X-ray deep learning models show high accuracy for ankle and foot fracture detection

Meta-analysis finds X-ray deep learning models show high accuracy for ankle and foot fracture detect…
Photo by CDC / Unsplash
Key Takeaway
Consider deep learning models as a potentially accurate diagnostic aid for ankle/foot fractures, pending further clinical validation.

A systematic review and meta-analysis evaluated the diagnostic performance of X-ray-based deep learning models for detecting ankle and foot fractures. The analysis included 14 studies from 506 reviewed, though specific population characteristics and clinical settings were not reported. The intervention involved deep learning models analyzing X-ray images, with no specific comparator detailed.

The primary outcome was diagnostic accuracy. The cumulative pooled sensitivity was 93.2% (95% CI 88.8-95.9%) and specificity was 94.5% (95% CI 90.1-97.0%), with low heterogeneity (I² 6.6%). The pooled F1 score was 0.94 (95% CI 0.88-0.97). Subgroup analyses revealed models detecting calcaneal fractures performed significantly better, with sensitivity of 95.1% (95% CI 93.0-96.6%), specificity of 98.3% (95% CI 97.00-99.0%), and a diagnostic odds ratio of 1751.8 (95% CI 445.9-6882.5). Performance did not differ significantly between models using multi-view versus single-view X-rays or based on dataset type.

Safety and tolerability data were not reported. Key limitations include the need for larger sample studies, external validation, and clinical implementation research. The analysis did not compare model performance to a specific clinical standard or radiologist performance, and the population and clinical setting were not described. The results are based on aggregated study data rather than direct clinical application.

For practice, this meta-analysis suggests deep learning models can be considered a notably accurate method for detecting ankle and foot fractures, particularly calcaneal fractures. However, clinicians should interpret these findings cautiously as they represent diagnostic accuracy associations from aggregated research, not evidence of clinical effectiveness or superiority to standard care. Further validation in real-world clinical settings is required before considering implementation.

Study Details

Study typeMeta analysis
EvidenceLevel 1
PublishedApr 2026
View Original Abstract ↓
OBJECTIVES: Lower extremity fractures are prevalent in vulnerable populations leading to significant burdens, which highlight the need for timely and precise diagnosis. Integrating deep learning with imaging findings has shown promising results for enhancing fracture detection accuracy. This study assesses the diagnostic accuracy of AI models using X-ray images to detect ankle and foot fractures and investigates the probable factors influencing their performance. MATERIALS AND METHODS: A comprehensive search of four databases was done up to September 30th, 2025. Studies that investigated the accuracy of deep learning models for the detection of ankle and foot fractures utilizing X-rays were included. A bivariate random-effects model was used to perform meta-analysis. RESULTS: A total of 506 studies were reviewed, and 14 were included in the meta-analysis. Analysis of all the representative models of the included studies had a cumulative sensitivity and specificity of 93.2% and 94.5% (95% CI 88.8-95.9%, 90.1-97.0%, respectively, I 6.6%). The pooled F1 score was estimated at 0.94 (95% CI 0.88-0.97). Subgroup analysis revealed no difference in the sensitivity or specificity of studies using multi-view vs. single-view X-rays (P = 0.64, P = 0.89, respectively). Models detecting calcaneal fractures performed significantly better than models detecting foot or ankle fractures, with a pooled sensitivity of 95.1% (95% CI 93.0-96.6%, P = 0.008), specificity of 98.3% (95% CI 97.00-99.0%), and a DOR of 1751.8 (95% CI 445.9-6882.5). The type of dataset used (validation vs. test, internal testing vs. external testing) did not significantly affect performance. CONCLUSION: Deep learning models can be considered a notably accurate method for the detection of ankle and foot fractures. Further studies with larger samples, external validation, and clinical implementations are required. PROSPERO REGISTRATION: This systematic review and meta-analysis study was registered with PROSPERO, registration number CRD42024624044.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.