Validation study of an open-source LLM-enabled genetic testing recommendation pipeline in patients with rare genetic aortopathies.

Photo by Shubham Dhage / Unsplash

medRxiv Published April 8, 2026 Medically reviewed April 26, 2026 Study authors: Singhal, P.; Li, Z.; Yang, Z.; Nandi, T.; Morse, C.; Rodriguez, Z.; Rodriguez, A.; Kindratenko, V.; … DOI ↗ By Dr. Julia Lee, PhD · Oncology, Genomics & Drug Development

Key Takeaway

Consider this LLM pipeline a potential decision-support tool for rare genetic aortopathies, noting it is a validation study with one case requiring further evaluation.

This validation study assessed an open-source large language model (LLM) pipeline designed to support genetic testing recommendations for rare genetic aortopathies. The system utilized retrieval augmented generation (RAG) on curated corpora and was tested in a cohort of 499 patients drawn from the Penn Medicine BioBank, comprising 250 cases and 250 controls. No comparator was reported for this evaluation.

Performance metrics indicated robust algorithmic performance across multiple dimensions. The pipeline achieved a patient-level recommendation accuracy of 0.834. Additional metrics included a precision of 0.835, sensitivity of 0.831, specificity of 0.836, an F1-score of 0.833, and an F3-score of 0.832. Regarding patient categorization, the system successfully categorized 425 out of 499 patients. Statistical significance or confidence intervals were not reported for these outcomes.

Safety and tolerability data were not reported in this study. One specific case required further clinician evaluation due to incomplete information, highlighting a limitation in the dataset or system handling of specific scenarios. The study design was a validation study, not a clinical trial, which limits the ability to infer direct clinical efficacy or safety in routine practice.

The practice relevance lies in providing a potential decision-support tool to assist clinicians in the earlier recognition of rare genetic disease risks. However, because this was a validation study without a control arm or randomized design, the findings should be interpreted as preliminary evidence of technical feasibility rather than proof of clinical benefit. Clinicians should exercise caution when considering integration into standard care workflows.

Study Details

Sample sizen = 499

EvidenceLevel 5

PublishedApr 2026

View Original Abstract ↓

Rare genetic aortopathies are frequently undiagnosed due to phenotypic heterogeneity, and delayed diagnosis can lead to fatal cardiac outcomes. While genetic testing can enable early proactive interventions, it relies on primary care physicians to recognize a genetic basis for symptoms and then refer patients to clinical genetics. Broad-scale screening methods are needed to identify cases that do not fit an obvious diagnostic pattern. Clinical notes, rich in narrative details, may support the automated flagging of patients for genetic testing. Given the strength of Large Language Models (LLMs) in processing unstructured text, we developed an open-source LLM-enabled genetic testing recommendation pipeline, which leverages retrieval augmented generation (RAG) on curated genetic aortopathy-related corpora to utilize relevant clinical knowledge for identifying patients likely to benefit from genetic testing. The pipeline was validated using 22,510 patient progress notes from 500 individuals (250 cases, 250 controls) in the Penn Medicine BioBank, and successfully categorized 425 out of 499 patients, with one case requiring further clinician evaluation due to incomplete information. The pipeline achieved a patient-level recommendation accuracy of 0.834, precision of 0.835, sensitivity of 0.831, specificity of 0.836, F1-score of 0.833, and F3-score of 0.832. Our LLM-enabled workflow integrating RAG showed strong performance in recommending genetic testing for patients with rare genetic aortopathies. These findings illustrate the feasibility of using open-source LLMs to support identification of patients who may benefit from genetic testing based on free-text clinical notes, providing a potential decision-support tool to assist clinicians in earlier recognition of rare genetic disease risks.

Validation study of an open-source LLM-enabled genetic testing recommendation pipeline in patients with rare genetic aortopathies.

Study Details

Regional motif diversity score in cfDNA predicts pembrolizumab response in head and neck cancer

Can a DNA pattern predict who will heal from head and neck cancer immunotherapy?

Clinical research that matters. Delivered to your inbox.

Validation study of an open-source LLM-enabled genetic testing recommendation pipeline in patients with rare genetic aortopathies.

Study Details

Regional motif diversity score in cfDNA predicts pembrolizumab response in head and neck cancer

Can a DNA pattern predict who will heal from head and neck cancer immunotherapy?

Clinical research that matters. Delivered to your inbox.

Related in Genetics & Precision Medicine

From Other Specialties