Mode
Text Size
Log in / Sign up

Machine learning model predicts brain metastasis risk in breast cancer patients with high accuracy

Machine learning model predicts brain metastasis risk in breast cancer patients with high accuracy
Photo by Bhautik Patel / Unsplash
Key Takeaway
Consider this predictive model as preliminary evidence requiring validation before clinical use.

This cohort study analyzed 154,193 breast cancer patients in a training set, with 66,084 for internal validation and 765 for external validation, to develop a machine learning model predicting brain metastasis risk. Using XGBoost, the model identified risk factors including higher tumor grade, advanced T/N stage, advanced clinical stage, and PR positivity (P < 0.001), while protective factors included radiotherapy, chemotherapy, surgery, HR+/HER2- subtype, and unilateral tumors (P < 0.001). Multivariate analysis confirmed independent risk factors (poorer pathological grade, N3 lymph node status, later stage, PR positivity) and protective factors (radiotherapy, chemotherapy, surgery, non-HR-/HER2- subtypes, HER2 positivity).

The model demonstrated high performance with an AUC of 0.98 in 10-fold cross-validation, 0.99 in the internal test set, and 0.97 in external validation. AUPRC values were 0.933 in training, 0.864 in internal testing, and 0.648 in external validation. Decision curve analysis showed superior net benefit compared to alternative models, and SHAP analysis identified surgery as the primary protective factor, with stage and T classification as key risk enhancers.

No safety or tolerability data were reported. Limitations were not specified in the input, but the observational nature of cohort studies means associations cannot establish causality. The study suggests potential for personalized risk stratification, early screening, and resource optimization in breast cancer brain metastasis management, but clinical application requires further validation and prospective testing.

Study Details

Study typeCohort
EvidenceLevel 3
PublishedApr 2026
View Original Abstract ↓
BackgroundBreast cancer is the most common malignancy worldwide. Brain metastasis in breast cancer severely impacts prognosis, and the objective of this study is to develop a machine learning model for predicting the risk of brain metastasis in breast cancer patients to assist clinical management.MethodsUnivariate and multivariate logistic regression analyses were employed to screen the final included variables, and eight machine learning algorithms were utilized for model construction. Model performance was evaluated using receiver operating characteristic curves, precision-recall curves, decision curve analysis (DCA), and calibration curves, with the optimal model selected based on these metrics. The model was trained on a cohort of 154,193 patients, internally validated on 66,084 patients, and externally validated on 765 real-world cases, incorporating metrics such as area under the curve (AUC), area under the precision-recall curve (AUPRC), decision curves, and calibration plots, while SHAP analysis was applied to enhance interpretability. A web-based calculator was developed based on the optimal model to facilitate clinical application.ResultsUnivariate logistic regression identified higher tumor grade, advanced T/N stage, advanced clinical stage, and PR positivity as risk factors, whereas radiotherapy, chemotherapy, surgery, HR + /HER2- subtype, and unilateral tumors served as protective factors (P < 0.001). Multivariate analysis confirmed independent risk factors, including poorer pathological grade, N3 lymph node status, later stage, and PR positivity, and protective factors, including radiotherapy, chemotherapy, surgery, non-HR-/HER2- subtypes, and HER2 positivity. The XGBoost model achieved an AUC of 0.98 in 10-fold cross-validation, with AUCs of 0.99 and 0.97 in the internal test set and external validation set, respectively; AUPRC values were 0.933, 0.864, and 0.648; decision curve analysis demonstrated superior net benefit compared to alternative models within the 0.1–0.8 threshold range; calibration curves showed high concordance between predicted and observed event rates. SHAP analysis highlighted surgery as the primary protective factor, followed by stage and T classification as risk enhancers, revealing interactions among treatment variables.ConclusionThis study developed an interpretable and clinically deployable XGB model, accompanied by a web-based calculator, thereby advancing personalized risk stratification, early screening, and resource optimization in the management of breast cancer brain metastasis.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.