Hybrid NLP and LLM system extracts deprescribing recommendations from electronic health records for older hospitalized patients.
This retrospective cohort study assessed a two-stage hybrid system combining rule-based natural language processing and open-source large language models. The system was evaluated using data from 850 patients aged 65 years or older who were hospitalized for 48 hours or more. The evaluation took place across six public hospitals in New South Wales, Australia. The primary outcome was the automated extraction of deprescribing recommendations from electronic health records. No comparator was reported for this technical validation study.
The system extracted 9,631 medications with a median of 11 per patient. It also identified 1,061 candidate sentences. Model 2 achieved an F1 score of 0.91 and an accuracy of 0.90. Processing time averaged 12.6 seconds. Inter-rater reliability showed substantial agreement with a Cohen's kappa of 0.70.
Safety and tolerability data were not reported. The study did not collect clinical outcome data or adverse event information. The most common misclassification was incorrectly identifying actions completed during hospitalization as post-discharge recommendations. Funding or conflicts of interest were not reported.
The practice relevance includes enabling cost-efficient, privacy-compliant local deployment. Clinicians should note that this study validates a technical tool rather than demonstrating clinical benefit. Do not infer clinical outcomes from a technical validation study. Do not infer causality between deprescribing and outcomes as no clinical outcome data were collected.