Machine learning model using routine clinical indicators predicts coronary heart disease riskMachine learning model predicts heart disease risk with high accuracy

Frontiers in Medicine Published April 30, 2026 DOI ↗ Editorial oversight: Dr. Amelia Tan, PhD · Internal Medicine & Chronic Disease

AI-generated summary of the cited source, checked by automated accuracy review. How we work

Key Takeaway

Note that a machine learning model using routine clinical indicators shows high accuracy for CHD risk prediction.

This retrospective cohort model development and validation study evaluated a stacked ensemble machine learning model for predicting coronary heart disease (CHD) risk. The study utilized data from the Framingham Heart Study and a retrospective hospital cohort (2024–2025). The model incorporated routine clinical indicators, including age, systolic blood pressure, total cholesterol, and fasting glucose.

Internal validation was performed with a sample size of n = 4,240, yielding an AUC of 0.977, accuracy of 0.942, and F1 score of 0.944. External validation was conducted with a sample size of n = 200, demonstrating an AUC of 0.929 and accuracy of 0.885.

Safety and tolerability data, including adverse events or discontinuations, were not reported. The study focused on the predictive performance of the model across different cohorts.

While the model demonstrates strong discrimination for CHD risk and generalizes to an external cohort, it remains a tool for risk assessment based on routine measures. The clinical utility of this machine learning approach for cardiovascular risk assessment warrants further investigation in prospective settings.

Researchers developed a machine learning model to predict coronary heart disease (CHD) risk using common health indicators like blood pressure, age, cholesterol, and fasting glucose. The model was built and tested using data from the Framingham Heart Study, which included 4,240 people for internal validation, and a separate hospital cohort of 200 people for external validation.

The model performed well in both groups. In the internal validation, it achieved an AUC of 0.977 and accuracy of 94.2%. In the external hospital cohort, the AUC was 0.929 with 88.5% accuracy. These results suggest the model can reliably identify people at risk for CHD using information that is routinely collected during doctor visits.

No safety concerns were reported, as this was a data analysis study. The model is not yet ready for widespread clinical use, but it shows promise as a tool to help doctors assess cardiovascular risk without extra tests. More research is needed to confirm its benefits in real-world settings.

For now, this study highlights how existing health data might be used to improve heart disease prediction. If you have concerns about your heart health, talk to your doctor about your individual risk factors.

What this means for you:

A new model using routine health data accurately predicted heart disease risk, but more study is needed.

Study Details

Study typeCohort

EvidenceLevel 3

PublishedApr 2026

View Original Abstract ↓

BackgroundTo develop and externally validate a coronary heart disease (CHD) risk model from routine clinical indicators and identify key predictors.MethodsThe Framingham Heart Study cohort (n = 4,240) was used. Missing values and outliers were handled, and class imbalance was corrected with SMOTEENN/SMOTETomek. Data were split 7:3 for training and internal validation. A two-tier feature selection (chi-square, mutual information, ANOVA F-test) retained ten variables. A stacked ensemble of gradient boosting, random forest, and XGBoost with a logistic-regression meta-learner was trained. Performance was measured by AUC, accuracy, precision, recall, and F1. External validation used a retrospective hospital cohort (n = 200; 2024–2025). Model explanations were derived with SHAP.ResultsInternal validation yielded AUC 0.977 and accuracy 0.942 (F1: 0.944). External validation achieved AUC 0.929 and accuracy 0.885. SHAP identified systolic blood pressure, age, total cholesterol, and fasting glucose as leading contributors, with plausible nonlinear effects and interactions.ConclusionA model built from routinely available measures demonstrates strong discrimination for CHD risk and generalizes to an external cohort, offering a clinically interpretable tool for cardiovascular risk assessment.

Machine learning model using routine clinical indicators predicts coronary heart disease riskMachine learning model predicts heart disease risk with high accuracy

Study Details

Meta-analysis finds systemic lupus erythematosus elevates coronary artery disease risk

People with lupus face higher heart disease risk but genetics show no direct link

Clinical research that matters. Delivered to your inbox.

Machine learning model using routine clinical indicators predicts coronary heart disease riskMachine learning model predicts heart disease risk with high accuracy

More on Heart Failure

Study Details

Meta-analysis finds systemic lupus erythematosus elevates coronary artery disease risk

People with lupus face higher heart disease risk but genetics show no direct link

Clinical research that matters. Delivered to your inbox.

Related in Cardiology

From Other Specialties