Mode
Text Size
Log in / Sign up

Preprint review of data augmentation strategies for spine abnormality classification in a specific dataset

Preprint review of data augmentation strategies for spine abnormality classification in a specific d…
Photo by Faustina Okeke / Unsplash
Key Takeaway
Note that this preprint on data augmentation for spine imaging is based on a dataset, not a clinical trial.

This preprint research article focuses on evaluating various data augmentation strategies for classifying spine abnormalities. The scope includes geometric transformations, synthetic image generation using Generative Adversarial Networks (GANs), and a hybrid augmentation technique, all compared against a GAN-only approach. The analysis was conducted using the VinDr-SpineXR dataset, where the sample size and setting were not reported.

The primary outcome assessed was classifier performance, which reached a validation accuracy of approximately 99%. Secondary outcomes included computational overhead. The study did not report adverse events, tolerability, or discontinuations, as these are not applicable to this computational analysis. The authors noted a limitation regarding dataset imbalance within the VinDr-SpineXR dataset.

Because the study is a preprint based on a dataset rather than a clinical trial, and no clinical outcomes were reported, the certainty of the results is limited. The authors caution that these findings should not be overextended to clinical practice without further validation in prospective trials involving actual patient populations.

Study Details

EvidenceLevel 5
PublishedApr 2026
View Original Abstract ↓
Computer vision and deep learning techniques, including convolutional neural networks (CNNs) and transformers, have increased the performance of medical image classification systems. However, training deep learning models using medical images is a challenging task that necessitates a substantial amount of annotated data. In this paper, we implement data augmentation strategies to tackle dataset imbalance in the VinDr-SpineXR dataset, which has a lower number of spine abnormality X-ray images compared to normal spine X-ray images. Geometric transformations and synthetic image generation using Generative Adversarial Networks are explored and applied to the abnormal classes of the dataset, and classifier performance is validated using VGG-16 and InceptionNet to identify the most effective augmentation technique. Additionally, we introduce a hybrid augmentation technique that addresses class imbalance, reduces computational overhead relative to a GAN-only approach, and achieves ~99% validation accuracy with both classifiers across all three case studies. Keywords: Data augmentation, Generative Adversarial Network, VGG-16, InceptionNet, Class imbalance, Computer vision, Spine X-ray, Radiology.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.