Mode
Text Size
Log in / Sign up

Machine learning model aids early risk stratification for carbapenem-resistant Pseudomonas aeruginosa infectionNew tool helps identify resistant bacteria early in hospital patients

AI-generated summary of the cited source, checked by automated accuracy review. How we work

Key Takeaway
Consider this machine-learning model a rapid, low-cost adjunct for early CRPA stratification, not a standalone screening instrument.

This retrospective cohort study assessed a clinico-laboratory machine-learning model utilizing routine complete blood count parameters to stratify early risk for carbapenem-resistant Pseudomonas aeruginosa (CRPA). The analysis included 1,666 patients in a training set and 471 patients in an external validation set from a primary center setting. The model was compared against standard empirical practices, with performance measured over a 48–72 hour interval before final antimicrobial susceptibility results were available.

In the training set, the model achieved an AUC of 0.993. Average 5-fold cross-validation yielded an AUC of 0.929 ± 0.005. The internal test set showed an AUC of 0.837 (95% CI: 0.779–0.893), with specificity of 0.972 and sensitivity of 0.507. The Brier score was 0.084. During external validation, the AUC was 0.898, specificity was 0.985, and sensitivity was 0.600, with a Brier score of 0.073.

Safety and tolerability data were not reported, as adverse events, serious adverse events, discontinuations, and tolerability metrics were not assessed. Key limitations include the inability of negative predictions to safely rule out CRPA and the model's exclusive role as a rule-in tool rather than a standalone screening instrument. The study was not reported as an RCT, and funding or conflicts were not reported.

This study looked at patients with Pseudomonas aeruginosa infections to see if a machine-learning model could predict carbapenem resistance early on. The team used routine complete blood count data to train the model and then tested it on separate groups of patients. The goal was to help doctors decide on antibiotics before final lab results were available, which usually take 48 to 72 hours.

The model showed strong performance in predicting resistance. In the main group, it correctly identified resistant cases with high accuracy. When tested on a separate group of patients from another setting, the model remained reliable. It was particularly good at confirming resistance when the model said it was present, with very few false alarms.

However, the study has important limits. If the model said a patient was not resistant, doctors cannot safely rule out the infection based on that alone. The tool is meant to work alongside standard practices, not replace them. This research suggests a low-cost way to help guide early treatment, but more study is needed before it is widely adopted.

What this means for you:
A new model helps predict resistant bacteria using blood tests, but negative results do not rule out infection.

Study Details

Study typeCohort
EvidenceLevel 3
PublishedApr 2026
View Original Abstract ↓
IntroductionTo address the delayed identification of carbapenem-resistant Pseudomonas aeruginosa (CRPA), we developed an interpretable machine-learning (ML) model for early risk stratification. Utilizing routine complete blood count (CBC) and demographic data, this tool targets the critical 48–72 hour interval before final antimicrobial susceptibility results.MethodsData from 1,666 patients with P. aeruginosa infection (223 CRPA) at a primary center were retrospectively analyzed, alongside an independent external validation cohort (n=471). Following the least absolute shrinkage and selection operator (LASSO) regression on 32 variables, eight ML algorithms were trained. Model interpretability and clinical utility were evaluated using Shapley Additive Explanations (SHAP) and decision curve analysis (DCA). Eight ML algorithms were trained using 5-fold cross-validation and Bayesian hyperparameter optimization. To ensure reproducibility and handle class imbalance, fixed random seeds were set, and a sensitivity analysis using the Synthetic Minority Over-sampling Technique (SMOTE) was conducted. Model calibration was assessed using the Brier score.ResultsLASSO identified seven predictors: sex, age, mean corpuscular volume (MCV), hemoglobin (HGB), platelet-to-lymphocyte ratio (PLR), systemic inflammatory response index (SIRI), and intensive care unit (ICU) admission status. Among the evaluated algorithms, the random forest (RF) model achieved the best discrimination. The training area under the receiver operating characteristic curve (AUC) was 0.993; it achieved an average 5-fold cross-validation AUC of 0.929 ± 0.005. In the internal test set, it achieved an AUC of 0.837 (95% CI: 0.779–0.893), specificity of 0.972, and sensitivity of 0.507, with excellent calibration (Brier score = 0.084). The model retained strong performance externally (AUC: 0.898, specificity: 0.985, sensitivity: 0.600, Brier score: 0.073). SHAP analysis indicated that HGB was the most influential feature, inversely associated with CRPA risk. Decision curve analysis supported the clinical utility across threshold probabilities ranging from 15% to 65%.DiscussionThis clinlabomics-based RF model provides a rapid, low-cost adjunct for early CRPA stratification. Given its exceptionally high specificity (>0.97) and modest sensitivity, it functions exclusively as a reliable clinical “rule-in” tool. Positive predictions can confidently guide early targeted therapy and strict infection control. However, negative predictions cannot safely rule out CRPA, emphasizing its role alongside standard empirical practices rather than as a standalone screening instrument.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.