Mode
Text Size
Log in / Sign up

Meta-analysis shows radiomics-based machine learning models predict recurrence risk in non-small cell lung cancer patients.

Meta-analysis shows radiomics-based machine learning models predict recurrence risk in non-small cel…
Photo by Growtika / Unsplash
Key Takeaway
Note methodological limitations and lack of standardization in current radiomics models for NSCLC recurrence prediction.

This systematic review and meta-analysis assessed the accuracy of radiomics-based machine learning models in predicting the risk of recurrence among patients with non-small cell lung cancer (NSCLC). The study pooled data from multiple sources, encompassing a total sample size of 7,964 patients. The setting of the included studies was not reported in the source data. The primary outcome measured was the concordance index (c-index), which quantifies the ability of a model to correctly rank patients by their risk of recurrence. No specific comparator group was defined in the analysis, as the focus was on the intrinsic performance of the models.

The analysis reported distinct performance metrics based on the model composition and patient treatment history. In the training set, the c-index for radiomics-based machine learning models alone was 0.850 (95% CI 0.834-0.866, 95% PI 0.623-1.004). When the models were applied specifically to patients receiving stereotactic body radiation therapy, the c-index improved to 0.876 (95% CI 0.853-0.900). For patients undergoing surgery combined with other adjuvant treatment regimens, the c-index was 0.825 (95% CI 0.804-0.848).

The predictive performance also varied depending on whether clinical features were integrated with the radiomics data. In the training set, models combining radiomics with clinical features yielded a c-index of 0.833 (95% CI 0.822-0.854, 95% PI 0.717-0.945). In the validation set, the standalone radiomics model achieved a c-index of 0.878 (95% CI 0.854-0.902, 95% PI 0.681-1.000), while the combined model with clinical features resulted in a c-index of 0.854 (95% CI 0.830-0.878, 95% PI 0.655-0.992).

Safety and tolerability data were not reported in the included studies. Consequently, no adverse event rates, serious adverse events, discontinuation rates, or general tolerability profiles could be determined from this meta-analysis. The absence of this data limits the ability to assess the clinical safety of implementing these models in routine practice.

A critical limitation identified in this analysis was the methodological heterogeneity and lack of standardization across the included studies. The average Radiomics Quality Score (RQS) was 27.4%, indicating significant variability in study design and reporting quality. This low average score suggests that the evidence base is currently immature and prone to bias. Furthermore, the study phase was not reported, and funding sources or potential conflicts of interest were not disclosed.

These findings provide evidence-based support for the subsequent development or updating of radiomics-based machine learning models. However, the results should not be overextended to claim immediate clinical utility or generalizability beyond the specific cohorts included in the meta-analysis. The observed associations reflect predictive accuracy but do not establish causation or guarantee improved patient outcomes in diverse clinical settings.

Several important questions remain unanswered. The lack of standardization in radiomics acquisition and analysis protocols raises concerns about reproducibility. Additionally, the absence of safety data and the low quality scores suggest that further high-quality prospective studies are needed before these models can be reliably integrated into clinical decision-making pathways for NSCLC management.

Study Details

Study typeMeta analysis
Sample sizen = 7,964
EvidenceLevel 1
PublishedApr 2026
View Original Abstract ↓
BACKGROUND: During the diagnosis and treatment of non-small cell lung cancer (NSCLC), detecting the risk of its recurrence in an early phase is still challenging. Recent studies have investigated the radiomics-based machine learning (ML) models for detecting the risk of recurrence in NSCLC. However, there is still insufficient systematic evidence to prove its efficiency. OBJECTIVE: This study is designed to systematically evaluate the effectiveness of radiomics-based ML in predicting the risk of recurrence in NSCLC, aiming to provide evidence-based support for the subsequent development of scoring tools to forecast recurrence risk. METHODS: For acquiring research on radiomics-based models for forecasting the risk of recurrence in NSCLC, Cochrane Library, Web of Science, PubMed, and Embase were systematically retrieved, up to October 24, 2025. Studies on analyzing the recurrence of NSCLC using radiomics-based ML were included, while those in which only texture analysis was conducted or radiomics-based ML was not constructed were excluded. The Radiomics Quality Score (RQS) was used to appraise the eligible studies. Subgroup analyses were conducted according to the variables of the model, the background of treatment, the stage of lung cancer, and the pathological type. RESULTS: Ultimately, 30 eligible studies in total were included, covering 7964 patients with NSCLC. According to the meta-analysis, the c-index of radiomics-based ML models for forecasting the risk of recurrence in NSCLC was 0.850 (95% CI 0.834-0.866, 95% prediction interval [PI] 0.623-1.004) in the training set. Specifically, the pooled c-index was 0.876 (95% CI 0.853-0.900) among the patients receiving the stereotactic body radiation therapy and 0.825 (95% CI 0.804-0.848) among those who received surgeries combined with other adjuvant treatment regimens. The c-index of the radiomics-based ML models combined with clinical features for forecasting the risk of recurrence in NSCLC was 0.833 (95% CI 0.822-0.854, 95% PI 0.717-0.945) in the training set. In contrast, the c-index of radiomics-based ML models for forecasting the risk of recurrence in NSCLC was 0.878 (95% CI 0.854-0.902, 95% PI 0.681-1.000) in the validation set. The c-index of radiomics-based ML models combined with clinical features for forecasting the risk of recurrence in NSCLC was 0.854 (95% CI 0.830-0.878, 95% PI 0.655-0.992) in the validation set. The average RQS across the included studies was 27.4%, revealing methodological limitations and an absence of standardization. CONCLUSIONS: This study is the first to confirm that radiomics-based ML models effectively predict the risk of recurrence in NSCLC. This study provides evidence-based support for the subsequent development or updating of radiomics-based ML models. However, the current methodological application of radiomics remains concerning. Therefore, in the future, research should standardize the workflow for implementing radiomics-based ML and incorporate multicenter imaging data to enhance its generalizability.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.