Machine learning models show moderate to high accuracy for predicting variceal bleeding in cirrhosis patients

Photo by Ben Maffin / Unsplash

Journal of medical Internet research Published April 12, 2026 Medically reviewed April 25, 2026 Study authors: Lian Yuan, Qiu Xinping, Wang Geli, Liu Nannan, Zhao Jing, Song Shanshan, Wang Shiqi, Sun Mingjun PubMed ↗ DOI ↗ By Dr. Amelia Tan, PhD · Internal Medicine & Chronic Disease

Key Takeaway

Consider ML prediction models as potentially useful for variceal bleeding risk stratification in cirrhosis, but await prospective validation.

This systematic review and meta-analysis evaluated the performance of machine learning prediction models for esophageal variceal bleeding (EVB) and esophagogastric variceal bleeding (EGVB) in patients with liver cirrhosis. The analysis included 21 studies with a total of 7,011 patients, though the specific clinical settings and geographic locations were not reported. The population consisted exclusively of patients with liver cirrhosis, with 1,412 patients (20.14%) developing EVB and 733 patients (10.45%) developing EGVB during the study periods. Follow-up durations were not consistently reported across the included studies.

The intervention examined was machine learning prediction models, which were developed using various input variables including clinical features, radiomics, endoscopic features, or combinations thereof. No specific comparator or standard prediction method was reported for comparison in this meta-analysis. The models were designed to predict the occurrence of EVB or EGVB in cirrhosis patients, with performance evaluated in validation sets rather than treatment outcomes.

For the primary outcome of predictive performance for EVB, the pooled c-index across studies was 0.85 (95% CI 0.77-0.92), with sensitivity of 0.93 (95% CI 0.87-0.96) and specificity of 0.66 (95% CI 0.46-0.82). For EGVB prediction, performance was slightly higher with a pooled c-index of 0.89 (95% CI 0.85-0.94), sensitivity of 0.77 (95% CI 0.66-0.85), and specificity of 0.81 (95% CI 0.67-0.90). These results indicate moderate to high discriminatory ability, though confidence intervals were wide for some estimates, particularly specificity for EVB prediction.

Key secondary outcomes included subgroup analyses by model variable type. For EVB prediction, models using clinical features alone achieved a c-index of 0.84 (95% CI 0.80-0.88), radiomics alone 0.82 (95% CI 0.69-0.96), combined radiomics and clinical features 0.78 (95% CI 0.67-0.89), and endoscopic features 0.97 (95% CI 0.95-1.00). For EGVB prediction, clinical feature models showed a c-index of 0.91 (95% CI 0.86-0.96) while combined radiomics and clinical features achieved 0.85 (95% CI 0.75-0.96). The endoscopic feature models demonstrated particularly high discrimination for EVB prediction, though this was based on limited studies.

Safety and tolerability data were not reported in this meta-analysis, as it focused on prediction model performance rather than therapeutic interventions. The analysis did not include information on adverse events, serious adverse events, or discontinuations related to model implementation or the diagnostic procedures used to obtain input variables.

Compared to traditional risk stratification methods for variceal bleeding in cirrhosis, such as Child-Pugh score, MELD score, or platelet count, these machine learning models appear to offer potentially superior discrimination based on the reported c-indices. However, direct comparisons with established clinical prediction rules were not performed in this analysis. The performance metrics are comparable to or exceed those reported for some existing prediction tools, though validation in head-to-head studies is needed.

Methodological limitations include the limited number of original studies included (21 total), which may affect the precision and generalizability of the pooled estimates. The analysis did not assess publication bias or between-study heterogeneity in detail. Most included studies were likely retrospective, introducing potential biases in data collection and model development. The wide confidence intervals for some performance metrics, particularly specificity for EVB prediction (0.46-0.82), indicate substantial uncertainty in these estimates.

Clinical implications suggest that machine learning approaches show promise for risk stratification of variceal bleeding in cirrhosis patients, potentially identifying high-risk individuals who might benefit from more intensive monitoring or prophylactic interventions. The high sensitivity for EVB prediction (0.93) could be valuable for ruling out low-risk patients, while the excellent discrimination of endoscopic feature models (c-index 0.97) supports the value of endoscopic information when available. However, these models require external validation in diverse clinical settings before implementation.

Unanswered questions include how these models perform compared to existing clinical prediction rules in prospective studies, whether they improve clinical outcomes when used to guide management decisions, and what the optimal model variables and algorithms are for different clinical contexts. The cost-effectiveness, implementation challenges, and effect on healthcare utilization of ML-based prediction tools also require investigation. Additionally, the generalizability to different cirrhosis etiologies and stages remains uncertain.

Study Details

Study typeMeta analysis

Sample sizen = 7,011

EvidenceLevel 1

PublishedApr 2026

PMID41950370

View Original Abstract ↓

BACKGROUND: Liver cirrhosis (LC) can lead to several complications. Esophageal variceal bleeding (EVB) and esophagogastric variceal bleeding (EGVB) are particularly severe, leading to a high risk of mortality. Early identification of esophageal varices and esophagogastric varices is essential. Several studies have constructed prediction models for EVB and EGVB in patients with LC. However, robust systematic evidence to prove their performance is lacking. OBJECTIVE: We included original studies that developed prediction models for esophageal or gastric variceal bleeding in patients with LC under different modeling variables. This study aimed to review the predictive performance of various models for EVB or EGVB in patients with LC, providing insights into the development or updating of simplified scoring tools in the future. METHODS: PubMed, Web of Science, Embase, and the Cochrane Library were searched up to August 21, 2024, to collect original full-text studies on machine learning (ML) in the prediction of EVB and EGVB in patients with LC. The models were evaluated using the Prediction Model Risk of Bias Assessment Tool (PROBAST). Subgroup analyses were carried out based on the modeling variables. RESULTS: In total, 21 studies were included, with 7011 patients with LC, among whom 1412 (20.14%) developed EVB and 733 (10.45%) developed EGVB. The meta-analysis results suggested that the pooled c-index, sensitivity, and specificity of the prediction model for predicting EVB in the validation set were 0.85 (95% CI 0.77-0.92), 0.93 (95% CI 0.87-0.96), and 0.66 (95% CI 0.46-0.82), respectively. The pooled c-index, sensitivity, and specificity of the prediction model for predicting EGVB in the validation set were 0.89 (95% CI 0.85-0.94), 0.77 (95% CI 0.66-0.85), and 0.81 (95% CI 0.67-0.90), respectively. The subgroup analysis based on modeling variables revealed that, for predicting EVB, the c-index in the validation set was 0.84 (95% CI 0.80-0.88) for models based on clinical features, 0.82 (95% CI 0.69-0.96) for radiomics-based models, 0.78 (95% CI 0.67-0.89) for models based on radiomics and clinical features, and 0.97 (95% CI 0.95-1.00) for models based on endoscopic features. Subgroup analyses based on modeling variables revealed that, for predicting EGVB, the c-index in the validation set was 0.91 (95% CI 0.86-0.96) for models based on clinical features and 0.85 (95% CI 0.75-0.96) for models based on radiomics and clinical features. CONCLUSIONS: ML methods are feasible for predicting EVB and EGVB in patients with LC. Nevertheless, the number of included original studies is limited. In the future, more studies with larger sample sizes are needed to promote the application of ML in the early assessment of EVB and EGVB in patients with LC in clinical practice.

Machine learning models show moderate to high accuracy for predicting variceal bleeding in cirrhosis patients

Study Details

Immune Checkpoint Inhibitors Improve Survival and Response Rates in Patients with Hepatocellular Carcinoma Compared to Standard Therapy

New Drug Gives Liver Cancer Patients More Time

Clinical research that matters. Delivered to your inbox.

Machine learning models show moderate to high accuracy for predicting variceal bleeding in cirrhosis patients

More on Liver Cirrhosis

Study Details

Immune Checkpoint Inhibitors Improve Survival and Response Rates in Patients with Hepatocellular Carcinoma Compared to Standard Therapy

New Drug Gives Liver Cancer Patients More Time

Clinical research that matters. Delivered to your inbox.

Related in Gastroenterology

From Other Specialties