Mode
Text Size
Log in / Sign up

Machine learning models provide statistically significant improvements in 5-year survival prediction for gastric cancer patientsAI Models Slightly Improve Gastric Cancer Survival Prediction

AI-generated summary of the cited source, checked by automated accuracy review. How we work

Key Takeaway
Note that ML models show statistically significant improvements in 5-year survival prediction over conventional models.

This systematic review and meta-analysis evaluated the predictive performance of machine learning (ML) models versus conventional statistical models for 5-year overall survival (5-OS) in 15,643 adult patients with histologically confirmed gastric cancer who underwent curative-intent surgery. The analysis focused on AI models utilizing routinely collected electronic health record (EHR) data.

The meta-analysis found a modest but statistically significant improvement in AUC for ML-based models compared to conventional statistical models (pooled mean difference 0.04; 95% CI 0.02 to 0.07; p = 0.001). Additionally, boosting algorithms demonstrated a modest but statistically significant advantage over bagging methods (difference of 0.02; p = 0.04) in predicting survival outcomes.

While the results indicate that AI-based models provide clinically meaningful improvements in prediction, the evidence is derived from retrospective studies rather than prospective trials. The authors suggest that selecting the optimal AI algorithm should depend on the specific structure and type of input data to maximize both predictive performance and practical utility in clinical decision support systems.

How this fits prior evidence

This meta-analysis extends prior coverage regarding machine learning applications in gastric cancer surgery. While previous evidence established that machine learning and postoperative indicators provide higher AUC for predicting anastomotic leak than regression models, this study specifically addresses the prediction of 5-year overall survival (5-OS) using routinely collected electronic health record data.

A new meta-analysis of 10 retrospective studies involving 15,643 patients found that artificial intelligence (AI) models can modestly improve predictions of 5-year survival after gastric cancer surgery compared to conventional statistical models.

The AI models, which used routine electronic health record data, showed a small but statistically significant increase in predictive accuracy, with an average improvement of 0.04 in the AUC score (a measure of how well the model distinguishes survivors from non-survivors). Among different AI techniques, boosting algorithms slightly outperformed bagging methods by 0.02 AUC.

Because this analysis is based on retrospective studies, not prospective clinical trials, the results should be interpreted with caution. The studies varied in design and patient populations, which may affect the reliability of the findings. No safety concerns were reported, as the study focused on prediction models rather than treatments.

For patients and doctors, these findings suggest that AI tools may eventually help personalize follow-up care after gastric cancer surgery. However, more research is needed before these models are ready for routine clinical use. Always discuss your individual prognosis and treatment plan with your healthcare team.

What this means for you:
AI models offer a small but meaningful boost in predicting survival after gastric cancer surgery, but more research is needed.

Common questions

How much better are AI models at predicting survival than standard methods?

AI models improved the AUC score by an average of 0.04, which is a modest but statistically significant increase. This means they are slightly better at distinguishing which patients will survive 5 years after surgery.

What kind of data do these AI models use?

The AI models use routinely collected electronic health record (EHR) data, such as lab results, tumor characteristics, and patient history. This means no extra tests are needed beyond what is already gathered in standard care.

Are these AI models ready for use in hospitals?

Not yet. This meta-analysis was based on retrospective studies, not prospective trials. More research is needed to confirm the findings and ensure the models work well in real-world clinical settings.

Study Details

Study typeMeta analysis
EvidenceLevel 1
PublishedJun 2026
View Original Abstract ↓
Artificial intelligence (AI) is increasingly being applied to prognostic modeling in oncology; however, many AI-based survival prediction models rely on complex multimodal data that are not routinely available in clinical practice. This systematic review and meta-analysis aimed to evaluate the performance of AI models based on routinely collected electronic health record (EHR) data for predicting 5-year overall survival (5-OS) in patients undergoing surgical treatment for gastric cancer A systematic literature search was conducted in PubMed, Scopus, Nature, MedRxiv, and bioRxiv databases for studies published between January 2015 and July 2025. We included studies reporting area under the receiver operating characteristic curve (AUC) values for AI-based 5-OS prediction. Retrospective studies of adult patients with histologically confirmed gastric cancer who underwent curative-intent surgery were eligible, while studies primarily using non-routine multimodal data or lacking AUC outcomes were excluded. Risk of bias was assessed using the PROBAST-AI tool. Meta-analyses were performed to compare machine learning-based models with conventional statistical approaches, as well as different AI algorithm classes, including bagging and boosting ensemble methods, neural networks, random forest, support vector machines, and logistic regression. Random or fixed-effects models were applied according to between-study heterogeneity. The primary outcome was the pooled mean difference in AUC between machine learning-based and conventional statistical models for 5-OS prediction. Secondary outcomes included comparative performance across different AI algorithm classes and identification of the most frequently selected prognostic features. Ten retrospective studies comprising 15,643 patients were included. Machine learning-based models demonstrated a modest but statistically significant improvement in predictive performance compared with conventional approaches, with a pooled mean AUC increase of 0.04 (95% CI 0.02–0.07; p = 0.001). Boosting algorithms showed a modest but statistically significant advantage over bagging methods (AUC increase 0.02; p = 0.04). The type of clinical input data, particularly the inclusion of blood-based biomarkers, influenced algorithm performance. The most consistently identified prognostic features across studies were age, T stage, tumor size, serum albumin or prealbumin level, and metastatic-to-examined lymph node ratio. and relevance: AI-based prognostic models utilizing routinely available clinical data provide clinically meaningful improvements in 5-year survival prediction after gastric cancer surgery, and selection of the optimal AI algorithm should be guided by the structure and type of input data to maximize both predictive performance and practical applicability in clinical decision support systems. https://www.crd.york.ac.uk/PROSPERO/view/CRD420261282797, PROSPERO CRD420261282797.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.