Machine learning model aids early risk stratification for carbapenem-resistant Pseudomonas aeruginosa infection.

Photo by Navy Medicine / Unsplash

Frontiers in Medicine Published April 17, 2026 Medically reviewed April 25, 2026 DOI ↗ By Dr. Amelia Tan, PhD · Internal Medicine & Chronic Disease

Key Takeaway

Consider this machine-learning model a rapid, low-cost adjunct for early CRPA stratification, not a standalone screening instrument.

This retrospective cohort study assessed a clinico-laboratory machine-learning model utilizing routine complete blood count parameters to stratify early risk for carbapenem-resistant Pseudomonas aeruginosa (CRPA). The analysis included 1,666 patients in a training set and 471 patients in an external validation set from a primary center setting. The model was compared against standard empirical practices, with performance measured over a 48–72 hour interval before final antimicrobial susceptibility results were available.

In the training set, the model achieved an AUC of 0.993. Average 5-fold cross-validation yielded an AUC of 0.929 ± 0.005. The internal test set showed an AUC of 0.837 (95% CI: 0.779–0.893), with specificity of 0.972 and sensitivity of 0.507. The Brier score was 0.084. During external validation, the AUC was 0.898, specificity was 0.985, and sensitivity was 0.600, with a Brier score of 0.073.

Safety and tolerability data were not reported, as adverse events, serious adverse events, discontinuations, and tolerability metrics were not assessed. Key limitations include the inability of negative predictions to safely rule out CRPA and the model's exclusive role as a rule-in tool rather than a standalone screening instrument. The study was not reported as an RCT, and funding or conflicts were not reported.

Study Details

Study typeCohort

EvidenceLevel 3

PublishedApr 2026

View Original Abstract ↓

IntroductionTo address the delayed identification of carbapenem-resistant Pseudomonas aeruginosa (CRPA), we developed an interpretable machine-learning (ML) model for early risk stratification. Utilizing routine complete blood count (CBC) and demographic data, this tool targets the critical 48–72 hour interval before final antimicrobial susceptibility results.MethodsData from 1,666 patients with P. aeruginosa infection (223 CRPA) at a primary center were retrospectively analyzed, alongside an independent external validation cohort (n=471). Following the least absolute shrinkage and selection operator (LASSO) regression on 32 variables, eight ML algorithms were trained. Model interpretability and clinical utility were evaluated using Shapley Additive Explanations (SHAP) and decision curve analysis (DCA). Eight ML algorithms were trained using 5-fold cross-validation and Bayesian hyperparameter optimization. To ensure reproducibility and handle class imbalance, fixed random seeds were set, and a sensitivity analysis using the Synthetic Minority Over-sampling Technique (SMOTE) was conducted. Model calibration was assessed using the Brier score.ResultsLASSO identified seven predictors: sex, age, mean corpuscular volume (MCV), hemoglobin (HGB), platelet-to-lymphocyte ratio (PLR), systemic inflammatory response index (SIRI), and intensive care unit (ICU) admission status. Among the evaluated algorithms, the random forest (RF) model achieved the best discrimination. The training area under the receiver operating characteristic curve (AUC) was 0.993; it achieved an average 5-fold cross-validation AUC of 0.929 ± 0.005. In the internal test set, it achieved an AUC of 0.837 (95% CI: 0.779–0.893), specificity of 0.972, and sensitivity of 0.507, with excellent calibration (Brier score = 0.084). The model retained strong performance externally (AUC: 0.898, specificity: 0.985, sensitivity: 0.600, Brier score: 0.073). SHAP analysis indicated that HGB was the most influential feature, inversely associated with CRPA risk. Decision curve analysis supported the clinical utility across threshold probabilities ranging from 15% to 65%.DiscussionThis clinlabomics-based RF model provides a rapid, low-cost adjunct for early CRPA stratification. Given its exceptionally high specificity (>0.97) and modest sensitivity, it functions exclusively as a reliable clinical “rule-in” tool. Positive predictions can confidently guide early targeted therapy and strict infection control. However, negative predictions cannot safely rule out CRPA, emphasizing its role alongside standard empirical practices rather than as a standalone screening instrument.

Machine learning model aids early risk stratification for carbapenem-resistant Pseudomonas aeruginosa infection.

Study Details

Metformin, fluvoxamine, or ivermectin for non-hospitalized COVID-19 adults in a Phase 3 trial

Metformin Cuts Long Covid Risk by 40%

Clinical research that matters. Delivered to your inbox.

Machine learning model aids early risk stratification for carbapenem-resistant Pseudomonas aeruginosa infection.

Study Details

Metformin, fluvoxamine, or ivermectin for non-hospitalized COVID-19 adults in a Phase 3 trial

Metformin Cuts Long Covid Risk by 40%

Clinical research that matters. Delivered to your inbox.

Related in Infectious Disease

From Other Specialties