Mode
Text Size
Log in / Sign up

Multimodal machine learning model differentiation of benign from malignant pulmonary space-occupying lesions in cohort study

Multimodal machine learning model differentiation of benign from malignant pulmonary space-occupying…
Photo by Google DeepMind / Unsplash
Key Takeaway
Recognize the multimodal model's high AUC but note limitations regarding benign lesion confirmation and observational study design.

This cohort study included 384 patients with pulmonary space-occupying lesions (PSOLs). The primary objective was to differentiate benign from malignant lesions using diagnostic imaging and clinical data. Follow-up duration was at least 12 months for benign lesions confirmed by clinical-imaging follow-up.

Researchers compared a multimodal machine learning model integrating CT radiomics, PET metabolic parameters, and clinical data against single-modality models (Radiomics, Clinical, Metabolic) and other integrated models (Logistic regression, random forest, support vector machine).

The multimodal XGBoost integrated model demonstrated an AUC of 0.967, which was significantly higher than all other models (Bonferroni-adjusted P = 0.002–0.032). Comparative AUCs included 0.808 for the Radiomics Model, 0.732 for the Clinical Model, and 0.874 for the Metabolic Model.

Safety data, including adverse events and tolerability, were not reported. A key limitation noted was that benign lesions were confirmed by clinical-imaging follow-up for at least 12 months (18%). Association versus causation was not distinguished, and surrogate versus clinical outcomes were not distinguished.

While the study suggests potential to facilitate clinical translation, the observational nature and incomplete follow-up data warrant careful interpretation before clinical application. Further validation is needed.

Study Details

Study typeCohort
EvidenceLevel 3
PublishedApr 2026
View Original Abstract ↓
ObjectiveTo construct a multimodal machine learning model integrating computed tomography (CT) radiomics, Positron Emission Tomography (PET) metabolic parameters, and clinical data for differentiating benign from malignant pulmonary space-occupying lesions (PSOLs), and develop an interpretable nomogram for clinical application.MethodologyThis study enrolled 384 patients with PSOLs who underwent dual-time-point 1⁸F-FDG PET/CT examinations. The cohort was divided into a training set (n = 268, 145 malignant, 123 benign) and an independent temporal validation set (n = 116, 69 malignant, 47 benign) at a 7:3 ratio according to the chronological order of patient enrollment, to avoid data leakage and rigorously assess model generalizability. All malignant lesions were confirmed by pathological examination, while benign lesions were confirmed by pathology (82%) or clinical-imaging follow-up for at least 12 months (18%). CT radiomic features with Intraclass Correlation Coefficient (ICC) values >0.75 were selected, and a Radiomics-score (Rad-score) was generated using the Least Absolute Shrinkage and Selection Operator (LASSO) regression algorithm. Integrated models [Logistic regression, random forest (RF), support vector machine (SVM), eXtreme Gradient Boosting (XGBoost)] were developed by fusing the Rad-score, clinical variables, and PET metabolic parameters. Model performance was evaluated using the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, F1-score, and Brier score. Model calibration was assessed via calibration curves, and clinical utility was validated by decision curve analysis (DCA). Model interpretability was achieved using SHapley Additive exPlanations (SHAP) values for the optimal XGBoost model, and a clinically applicable, interpretable nomogram was constructed based on the core predictive features identified by SHAP analysis to facilitate clinical translation.ResultsA Rad-score was constructed from 17 optimally selected features. In the independent temporal validation set, the single-modality models achieved AUCs of 0.808 (Radiomics Model), 0.732 (Clinical Model), and 0.874 (Metabolic Model). Among all tested models, the XGBoost integrated model achieved the highest AUC of 0.967, which was significantly higher than that of all other models (Bonferroni-adjusted P = 0.002–0.032, all adjusted P 
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.