Systematic review and meta-analysis of AI for infant hip ultrasound accuracy
This is a systematic review and diagnostic test-accuracy meta-analysis of AI applied to two-dimensional or three-dimensional hip ultrasound for developmental dysplasia of the hip in infants 12 months or younger. The analysis synthesized data from 9 studies comprising 6,351 hips.
The authors report a pooled sensitivity of 0.92 (95% CI 0.86-0.95) and a pooled specificity of 0.96 (95% CI 0.91-0.98) for AI-assisted ultrasound compared with expert Graf-based interpretation or follow-up consensus. Feasibility signals included operator training times and scan acquisition time reductions, though specific values were not reported.
Key limitations noted by the authors include frequently high or unclear risk of bias for patient selection and the index test, and limited economic reporting. The authors state that larger multicenter studies with external validation and robust economic evaluation are needed.
Practice relevance is restrained; the authors suggest AI-assisted ultrasound may help standardize hip imaging and facilitate safe use by nonexpert operators. However, they caution against inferring superiority beyond the reported diagnostic accuracy and against clinical implementation without external validation and economic evaluation.