Mode
Text Size
Log in / Sign up

Systematic review and meta-analysis of prediction models for diabetic kidney disease progression to ESRDKidney disease prediction models show promise but have major flaws

AI-generated summary of the cited source, checked by automated accuracy review. How we work

Key Takeaway
Consider that prediction models for diabetic kidney disease progression show promising discrimination but have major methodological limitations.

This is a systematic review and meta-analysis of prediction models for progression from diabetic kidney disease to end-stage renal disease. The authors synthesized data from 15 studies, with meta-analysis including training datasets from six studies (seven models) and validation datasets from three studies (three models). The primary outcome was discrimination performance, measured by area under the receiver operating characteristic curve (AUC).

The pooled AUC for training models was 0.896 (95% CI, 0.853–0.940), and for validation models it was 0.863 (95% CI, 0.803–0.923). The authors describe these results as promising for discrimination.

Key limitations noted include all 15 studies being at high risk of bias per PROBAST-AI, retrospective single-center designs, lack of blinded predictor assessment, and use of predictors not routinely available in practice. Other gaps are inadequate calibration, extreme heterogeneity, and scarce independent external validation. Seven studies were limited to biopsy-proven DKD, limiting applicability to routine clinical populations.

The authors conclude that published models show promising discrimination but have pervasive methodological limitations, extreme heterogeneity, and scarce independent external validation, severely restricting clinical generalizability. Practice relevance is restrained, as these models are not ready for routine clinical use.

Researchers reviewed many studies about computer models that try to predict which patients with diabetic kidney disease will develop kidney failure. They found that the models showed good accuracy in early testing. However, the studies had many serious problems that make the results hard to trust.

Most studies were small and only looked at patients from one hospital. They also often used test results that doctors do not normally have. This makes it hard to use these models in everyday care. The studies did not test the models on new groups of patients, which is very important.

The review found that all the studies had a high risk of bias. This means the results might not be correct. The models also varied a lot from one study to another. This extreme heterogeneity makes it difficult to know what the true accuracy is.

Because of these issues, the models cannot yet be used to guide patient care. More research is needed to fix these problems. Future studies should use better methods and test the models in different hospitals.

What this means for you:
Kidney disease prediction models look accurate but have too many flaws to be used in clinics.

Study Details

Study typeMeta analysis
EvidenceLevel 1
PublishedMay 2026
View Original Abstract ↓
BackgroundDiabetic kidney disease (DKD) is a major cause of end-stage renal disease (ESRD). Early identification of DKD patients at high risk of progressing to ESRD is essential, yet the overall performance, methodological quality, and translational readiness of prediction models for this transition remain unclear. To our knowledge, we conducted the first systematic review and meta-analysis focused specifically on prediction models for progression from established DKD to ESRD.MethodsWe searched PubMed, Embase, Cochrane Library, Web of Science, China National Knowledge Infrastructure (CNKI), Wanfang, VIP Chinese Journal Service Platform, and the Chinese Biomedical Literature Database (CBM) for English- and Chinese-language studies published through 27 September 2025 that developed or validated DKD to ESRD prediction models. Two reviewers independently screened records and extracted data. Risk of bias was assessed using the updated PROBAST-AI checklist. Reported AUCs were pooled using random-effects meta-analysis (Stata 18.0) with 95% confidence intervals (CIs). We performed sensitivity analyses, assessed publication bias, and conducted subgroup analyses by inclusion of pathological predictors and prediction horizon.ResultsFifteen studies met inclusion criteria. PROBAST-AI judged all 15 studies at high risk of bias, mainly due to retrospective single-center designs, lack of blinded predictor assessment, use of predictors not routinely available in practice, and inadequate calibration and external validation; seven studies were limited to biopsy-proven DKD, limiting their applicability to routine clinical populations. Meta-analysis included training datasets from six studies (seven models) and validation datasets from three studies (three models). Pooled AUCs were 0.896 (95% CI, 0.853–0.940) for training models and 0.863 (95% CI, 0.803–0.923) for validation models. Five prespecified sensitivity analyses yielded broadly similar pooled AUCs, but interpretation remained exploratory because of persistent heterogeneity and universal high risk of bias. Subgroup analyses found no significant differences by pathological predictor inclusion or prediction horizon.ConclusionsPublished DKD to ESRD models show promising discrimination in development and internal validation cohorts. However, pervasive methodological limitations, extreme heterogeneity, and scarce independent external validation severely restrict clinical generalizability. Prespecified sensitivity analyses yielded broadly similar pooled AUCs, but these results remain exploratory. Future prospective multicenter studies with rigorous external validation and calibration are urgently needed.Systematic Review Registrationhttps://www.crd.york.ac.uk/prospero/display_record.php?ID=CRD420251127778, identifier CRD420251127778.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.