Mode
Text Size
Log in / Sign up

Explainable AI system predicts cardiovascular risk with high accuracy but struggles with patient retention

Explainable AI system predicts cardiovascular risk with high accuracy but struggles with patient ret…
Photo by Joshua Chehov / Unsplash
Key Takeaway
Consider CardioAI a pilot-stage tool requiring validation in Nigerian clinical populations.

This system development study created and tested CardioAI, an explainable artificial intelligence system with two predictive modules. The cardiovascular risk module was trained on 1,025 patient records from the UCI Heart Disease dataset, while the patient retention module used 800 synthetic records. The system is proposed for pilot validation in Nigerian teaching hospitals.

The cardiovascular risk module demonstrated high performance on held-out test data, with AUC-ROC scores of 0.999 for XGBoost, 0.998 for Random Forest, 0.994 for MLP, and 0.927 for Logistic Regression. Cross-validated AUC with constrained tree depth was 0.97. SHAP analysis identified Lifestyle Risk Index, ST depression, resting blood pressure, exercise-induced angina, and cholesterol as the most influential predictors. In contrast, the patient retention module achieved an AUC-ROC of only 0.66 using Logistic Regression.

No safety or tolerability data were reported. Key limitations include the retention module being trained on synthetic data and its poor predictive performance (AUC 0.66), which demonstrates the difficulty of dropout prediction. The cardiovascular model was trained on a non-Nigerian dataset (UCI Heart Disease), limiting immediate generalizability. The system is freely deployable and open-source, designed for pilot validation rather than clinical implementation at this stage.

Study Details

EvidenceLevel 5
PublishedMar 2026
View Original Abstract ↓
BackgroundCardiovascular disease is the leading cause of mortality in Nigeria and across sub-Saharan Africa, with rising incidence attributable to urbanisation, sedentary lifestyles, and limited access to early detection tools. Concurrently, patient dropout from rehabilitation programs remains a critical operational challenge for Nigerian clinics, with many patients failing to return after their initial consultation. MethodsWe developed CardioAI, an Explainable Artificial Intelligence system comprising two predictive modules. The cardiovascular risk module trained four machine learning models -- Logistic Regression, Random Forest, Gradient Boosting (XGBoost), and a Multilayer Perceptron -- on a combined UCI Heart Disease dataset of 1,025 patient records. A novel Lifestyle Risk Index was engineered from five modifiable clinical markers. SHAP (SHapley Additive exPlanations) was applied for per-prediction feature attribution. The patient retention module trained three classifiers on a synthetic dataset of 800 records, modelling 10 operational and behavioural dropout factors. An NLP and OCR pipeline using Tesseract v5.5 and spaCy was implemented for clinical document processing. ResultsThe cardiovascular risk module achieved an AUC-ROC of 0.999 (XGBoost), 0.998 (Random Forest), 0.994 (MLP), and 0.927 (Logistic Regression) on the held-out test set. Cross-validated AUC with constrained tree depth was 0.97, confirming generalisation. SHAP analysis identified the Lifestyle Risk Index, ST depression, resting blood pressure, exercise-induced angina, and cholesterol as the five most influential predictors. The retention module achieved AUC-ROC of 0.66 (Logistic Regression), demonstrating the difficulty of dropout prediction with synthetic data. ConclusionsCardioAI demonstrates that explainable machine learning can provide clinically actionable cardiovascular risk assessment and patient retention intelligence in a low-resource Nigerian healthcare context. The system is freely deployable, open-source, and designed for pilot validation in teaching hospitals across Lagos and Port Harcourt.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.