Machine learning model using routine clinical indicators predicts coronary heart disease risk
This retrospective cohort model development and validation study evaluated a stacked ensemble machine learning model for predicting coronary heart disease (CHD) risk. The study utilized data from the Framingham Heart Study and a retrospective hospital cohort (2024–2025). The model incorporated routine clinical indicators, including age, systolic blood pressure, total cholesterol, and fasting glucose.
Internal validation was performed with a sample size of n = 4,240, yielding an AUC of 0.977, accuracy of 0.942, and F1 score of 0.944. External validation was conducted with a sample size of n = 200, demonstrating an AUC of 0.929 and accuracy of 0.885.
Safety and tolerability data, including adverse events or discontinuations, were not reported. The study focused on the predictive performance of the model across different cohorts.
While the model demonstrates strong discrimination for CHD risk and generalizes to an external cohort, it remains a tool for risk assessment based on routine measures. The clinical utility of this machine learning approach for cardiovascular risk assessment warrants further investigation in prospective settings.