Deep learning pipeline detects vertebral fractures with 92.6% sensitivity and 95.8% specificity in older adults during routine CT scans
This guideline presents a deep learning pipeline designed for vertebral segmentation, fracture detection, fracture severity grading, and Schmorl's node identification. The system was evaluated using 5,000 CT scans acquired for non-spinal indications in routine clinical care settings involving older adults. The comparator was radiologist consensus.
Performance metrics for the top-1 anatomical labeling accuracy were 96% when at least one spinal extremity was visible and 86% when both extremities were absent. Top-3 anatomical labeling accuracy reached 100% in both settings. The mean Intersection over Union was 0.94. Vertebra-level accuracy for fracture classification was 92.7%.
Sensitivity for fractured vertebrae was 92.6% and specificity was 95.8%. The F1-score for fractured vertebrae was 91.6%. Accuracy for Schmorl's node detection was 91.9% with an F1-score of 93.5%. Overall accuracy for fracture severity grading was 85.0% and the overall pipeline success rate was 85%. Multi-label consistency with radiologist consensus was 90%.
The authors note that this provides a practical framework for integrating AI-assisted vertebral screening into routine radiology practice. Adverse events, discontinuations, and tolerability were not reported. Funding or conflicts of interest were not reported. Follow-up duration was not reported.