Mode
Text Size
Log in / Sign up

Meta-analysis finds X-ray deep learning models show high accuracy for ankle and foot fracture detectionCan AI spot broken ankles and feet on X-rays as well as the research suggests?

AI-generated summary of the cited source, checked by automated accuracy review. How we work

Key Takeaway
Consider deep learning models as a potentially accurate diagnostic aid for ankle/foot fractures, pending further clinical validation.

A systematic review and meta-analysis evaluated the diagnostic performance of X-ray-based deep learning models for detecting ankle and foot fractures. The analysis included 14 studies from 506 reviewed, though specific population characteristics and clinical settings were not reported. The intervention involved deep learning models analyzing X-ray images, with no specific comparator detailed.

The primary outcome was diagnostic accuracy. The cumulative pooled sensitivity was 93.2% (95% CI 88.8-95.9%) and specificity was 94.5% (95% CI 90.1-97.0%), with low heterogeneity (I² 6.6%). The pooled F1 score was 0.94 (95% CI 0.88-0.97). Subgroup analyses revealed models detecting calcaneal fractures performed significantly better, with sensitivity of 95.1% (95% CI 93.0-96.6%), specificity of 98.3% (95% CI 97.00-99.0%), and a diagnostic odds ratio of 1751.8 (95% CI 445.9-6882.5). Performance did not differ significantly between models using multi-view versus single-view X-rays or based on dataset type.

Safety and tolerability data were not reported. Key limitations include the need for larger sample studies, external validation, and clinical implementation research. The analysis did not compare model performance to a specific clinical standard or radiologist performance, and the population and clinical setting were not described. The results are based on aggregated study data rather than direct clinical application.

For practice, this meta-analysis suggests deep learning models can be considered a notably accurate method for detecting ankle and foot fractures, particularly calcaneal fractures. However, clinicians should interpret these findings cautiously as they represent diagnostic accuracy associations from aggregated research, not evidence of clinical effectiveness or superiority to standard care. Further validation in real-world clinical settings is required before considering implementation.

When you twist your ankle or hurt your foot, getting a clear answer from an X-ray is crucial. A new analysis of 14 studies suggests that artificial intelligence (AI) models designed to read these X-rays might be very good at their job. The review found that, when looking at the combined data, these deep learning models correctly identified fractures about 93% of the time and correctly ruled them out about 95% of the time.

The analysis looked at how well different versions of these AI tools performed. Interestingly, models that used multiple X-ray angles didn't do any better than those using just one view. The AI was particularly sharp at spotting fractures in the heel bone (calcaneus). What we don't know from this review is who exactly was in these studies or what the real-world clinical setting was like.

It's important to remember what this review does and doesn't tell us. The results are promising, but they come from combining data from separate research projects. The authors themselves note that more studies with larger groups of patients and real-world testing are needed. Crucially, this analysis didn't compare the AI's performance directly against radiologists, so we can't say if it's better, worse, or just different from a human expert's read.

What this means for you:
AI shows promise for reading foot and ankle X-rays, but it hasn't been tested head-to-head against doctors yet.

Study Details

Study typeMeta analysis
EvidenceLevel 1
PublishedApr 2026
View Original Abstract ↓
OBJECTIVES: Lower extremity fractures are prevalent in vulnerable populations leading to significant burdens, which highlight the need for timely and precise diagnosis. Integrating deep learning with imaging findings has shown promising results for enhancing fracture detection accuracy. This study assesses the diagnostic accuracy of AI models using X-ray images to detect ankle and foot fractures and investigates the probable factors influencing their performance. MATERIALS AND METHODS: A comprehensive search of four databases was done up to September 30th, 2025. Studies that investigated the accuracy of deep learning models for the detection of ankle and foot fractures utilizing X-rays were included. A bivariate random-effects model was used to perform meta-analysis. RESULTS: A total of 506 studies were reviewed, and 14 were included in the meta-analysis. Analysis of all the representative models of the included studies had a cumulative sensitivity and specificity of 93.2% and 94.5% (95% CI 88.8-95.9%, 90.1-97.0%, respectively, I 6.6%). The pooled F1 score was estimated at 0.94 (95% CI 0.88-0.97). Subgroup analysis revealed no difference in the sensitivity or specificity of studies using multi-view vs. single-view X-rays (P = 0.64, P = 0.89, respectively). Models detecting calcaneal fractures performed significantly better than models detecting foot or ankle fractures, with a pooled sensitivity of 95.1% (95% CI 93.0-96.6%, P = 0.008), specificity of 98.3% (95% CI 97.00-99.0%), and a DOR of 1751.8 (95% CI 445.9-6882.5). The type of dataset used (validation vs. test, internal testing vs. external testing) did not significantly affect performance. CONCLUSION: Deep learning models can be considered a notably accurate method for the detection of ankle and foot fractures. Further studies with larger samples, external validation, and clinical implementations are required. PROSPERO REGISTRATION: This systematic review and meta-analysis study was registered with PROSPERO, registration number CRD42024624044.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.