Preprint review of data augmentation strategies for spine abnormality classification in a specific dataset

Photo by Faustina Okeke / Unsplash

medRxiv Published April 18, 2026 Medically reviewed April 25, 2026 Study authors: Sivakumar, E.; Anand, A. DOI ↗ By Dr. Lars van Dijk, PhD · Surgical, Procedural & Diagnostic

Key Takeaway

Note that this preprint on data augmentation for spine imaging is based on a dataset, not a clinical trial.

This preprint research article focuses on evaluating various data augmentation strategies for classifying spine abnormalities. The scope includes geometric transformations, synthetic image generation using Generative Adversarial Networks (GANs), and a hybrid augmentation technique, all compared against a GAN-only approach. The analysis was conducted using the VinDr-SpineXR dataset, where the sample size and setting were not reported.

The primary outcome assessed was classifier performance, which reached a validation accuracy of approximately 99%. Secondary outcomes included computational overhead. The study did not report adverse events, tolerability, or discontinuations, as these are not applicable to this computational analysis. The authors noted a limitation regarding dataset imbalance within the VinDr-SpineXR dataset.

Because the study is a preprint based on a dataset rather than a clinical trial, and no clinical outcomes were reported, the certainty of the results is limited. The authors caution that these findings should not be overextended to clinical practice without further validation in prospective trials involving actual patient populations.

Study Details

EvidenceLevel 5

PublishedApr 2026

View Original Abstract ↓

Computer vision and deep learning techniques, including convolutional neural networks (CNNs) and transformers, have increased the performance of medical image classification systems. However, training deep learning models using medical images is a challenging task that necessitates a substantial amount of annotated data. In this paper, we implement data augmentation strategies to tackle dataset imbalance in the VinDr-SpineXR dataset, which has a lower number of spine abnormality X-ray images compared to normal spine X-ray images. Geometric transformations and synthetic image generation using Generative Adversarial Networks are explored and applied to the abnormal classes of the dataset, and classifier performance is validated using VGG-16 and InceptionNet to identify the most effective augmentation technique. Additionally, we introduce a hybrid augmentation technique that addresses class imbalance, reduces computational overhead relative to a GAN-only approach, and achieves ~99% validation accuracy with both classifiers across all three case studies. Keywords: Data augmentation, Generative Adversarial Network, VGG-16, InceptionNet, Class imbalance, Computer vision, Spine X-ray, Radiology.

Preprint review of data augmentation strategies for spine abnormality classification in a specific dataset

Study Details

Muscle ultrasonography shows high diagnostic accuracy for fasciculation detection in ALS

A Simple Scan Could Speed Up an ALS Diagnosis

Clinical research that matters. Delivered to your inbox.

Preprint review of data augmentation strategies for spine abnormality classification in a specific dataset

Study Details

Muscle ultrasonography shows high diagnostic accuracy for fasciculation detection in ALS

A Simple Scan Could Speed Up an ALS Diagnosis

Clinical research that matters. Delivered to your inbox.

Related in Radiology & Imaging

From Other Specialties