Multimodal machine learning model differentiation of benign from malignant pulmonary space-occupying lesions in cohort study

Photo by Google DeepMind / Unsplash

Frontiers in Medicine Published April 21, 2026 Medically reviewed April 25, 2026 DOI ↗ By Dr. Lars van Dijk, PhD · Surgical, Procedural & Diagnostic

Key Takeaway

Recognize the multimodal model's high AUC but note limitations regarding benign lesion confirmation and observational study design.

This cohort study included 384 patients with pulmonary space-occupying lesions (PSOLs). The primary objective was to differentiate benign from malignant lesions using diagnostic imaging and clinical data. Follow-up duration was at least 12 months for benign lesions confirmed by clinical-imaging follow-up.

Researchers compared a multimodal machine learning model integrating CT radiomics, PET metabolic parameters, and clinical data against single-modality models (Radiomics, Clinical, Metabolic) and other integrated models (Logistic regression, random forest, support vector machine).

The multimodal XGBoost integrated model demonstrated an AUC of 0.967, which was significantly higher than all other models (Bonferroni-adjusted P = 0.002–0.032). Comparative AUCs included 0.808 for the Radiomics Model, 0.732 for the Clinical Model, and 0.874 for the Metabolic Model.

Safety data, including adverse events and tolerability, were not reported. A key limitation noted was that benign lesions were confirmed by clinical-imaging follow-up for at least 12 months (18%). Association versus causation was not distinguished, and surrogate versus clinical outcomes were not distinguished.

While the study suggests potential to facilitate clinical translation, the observational nature and incomplete follow-up data warrant careful interpretation before clinical application. Further validation is needed.

Study Details

Study typeCohort

EvidenceLevel 3

PublishedApr 2026

View Original Abstract ↓

ObjectiveTo construct a multimodal machine learning model integrating computed tomography (CT) radiomics, Positron Emission Tomography (PET) metabolic parameters, and clinical data for differentiating benign from malignant pulmonary space-occupying lesions (PSOLs), and develop an interpretable nomogram for clinical application.MethodologyThis study enrolled 384 patients with PSOLs who underwent dual-time-point 1⁸F-FDG PET/CT examinations. The cohort was divided into a training set (n = 268, 145 malignant, 123 benign) and an independent temporal validation set (n = 116, 69 malignant, 47 benign) at a 7:3 ratio according to the chronological order of patient enrollment, to avoid data leakage and rigorously assess model generalizability. All malignant lesions were confirmed by pathological examination, while benign lesions were confirmed by pathology (82%) or clinical-imaging follow-up for at least 12 months (18%). CT radiomic features with Intraclass Correlation Coefficient (ICC) values >0.75 were selected, and a Radiomics-score (Rad-score) was generated using the Least Absolute Shrinkage and Selection Operator (LASSO) regression algorithm. Integrated models [Logistic regression, random forest (RF), support vector machine (SVM), eXtreme Gradient Boosting (XGBoost)] were developed by fusing the Rad-score, clinical variables, and PET metabolic parameters. Model performance was evaluated using the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, F1-score, and Brier score. Model calibration was assessed via calibration curves, and clinical utility was validated by decision curve analysis (DCA). Model interpretability was achieved using SHapley Additive exPlanations (SHAP) values for the optimal XGBoost model, and a clinically applicable, interpretable nomogram was constructed based on the core predictive features identified by SHAP analysis to facilitate clinical translation.ResultsA Rad-score was constructed from 17 optimally selected features. In the independent temporal validation set, the single-modality models achieved AUCs of 0.808 (Radiomics Model), 0.732 (Clinical Model), and 0.874 (Metabolic Model). Among all tested models, the XGBoost integrated model achieved the highest AUC of 0.967, which was significantly higher than that of all other models (Bonferroni-adjusted P = 0.002–0.032, all adjusted P

Multimodal machine learning model differentiation of benign from malignant pulmonary space-occupying lesions in cohort study

Study Details

Muscle ultrasonography shows high diagnostic accuracy for fasciculation detection in ALS

A Simple Scan Could Speed Up an ALS Diagnosis

Clinical research that matters. Delivered to your inbox.

Multimodal machine learning model differentiation of benign from malignant pulmonary space-occupying lesions in cohort study

Study Details

Muscle ultrasonography shows high diagnostic accuracy for fasciculation detection in ALS

A Simple Scan Could Speed Up an ALS Diagnosis

Clinical research that matters. Delivered to your inbox.

Related in Radiology & Imaging

From Other Specialties