Systematic review and meta-analysis evaluates ML and DL models for diagnosing MASH and fibrosis
This systematic review and meta-analysis included 106 studies, with 35 providing data for analysis (28 for ML, 7 for DL). The review evaluated ML and DL models for diagnosing MASH and liver fibrosis in patients with metabolic dysfunction-associated steatotic liver disease. The primary outcome was pooled area under the receiver operator characteristic curve (AUROC).
For diagnosing MASH, ML models showed a pooled AUROC of 0.833 (95% CI: 0.806-0.860), while DL models had a pooled AUROC of 0.841 (95% CI: 0.782-0.900). The best-performing ML model (LightGBM) achieved an AUROC of 0.920 (95% CI: 0.916-0.924), and the best-performing DL model (ResNet50) achieved an AUROC of 0.960 (95% CI: 0.951-0.969). For diagnosing fibrosis, ML models had a pooled AUROC of 0.826 (95% CI: 0.792-0.860), with CatBoost achieving the highest AUROC of 0.960 (95% CI: 0.950-0.970). DL models for fibrosis had a pooled AUROC of 0.875 (95% CI: 0.816-0.934).
The authors did not report specific limitations or safety data. The review highlights the potential of AI-driven approaches in MASLD management, but the absence of reported limitations and the variability in model performance warrant cautious interpretation. Further validation in diverse clinical settings is needed before widespread adoption.