Phenotype database PAVS shows utility for gene prioritization in rare disease cases

New database helps prioritize genes for rare disease diagnosis in Saudi Arabian patients

medRxiv
Published April 8, 2026
Study authors: Abdelhakim, M.; Althagafi, A.; Schofield, P.; Hoehndorf, R.
Editorial oversight: Dr. Julia Lee, PhD · Oncology, Genomics & Drug Development

AI-generated summary of the cited source, checked by automated accuracy review.

Key Takeaway

Consider PAVS as a potential population-specific resource for phenotype-driven gene prioritization in rare disease evaluation.

This observational database development and evaluation study assessed PAVS, a curated database integrating phenotype-associated variants. The database incorporated data from 5132 Saudi clinical cases, 522 cases from a mixed-population cohort, 1856 cases from the Deciphering Developmental Disorders study, and 9588 literature phenopackets. The primary outcome was the utility of phenotype annotations for gene prioritization using semantic similarity, compared against global literature-curated databases.
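To make the idea of phenotype-driven gene prioritization concrete, here is a minimal sketch of ranking genes by semantic similarity. It is not the method used in the study, which the summary does not specify. It assumes a toy ontology with placeholder term names (not real HPO identifiers), hypothetical gene profiles, and Jaccard overlap of ancestor-closed term sets as a simple stand-in for a semantic similarity measure.

```python
# Toy ontology: each term maps to its parent terms.
# Term names are hypothetical placeholders, not real HPO identifiers.
PARENTS = {
    "root": [],
    "neuro": ["root"],
    "seizure": ["neuro"],
    "ataxia": ["neuro"],
    "cardiac": ["root"],
    "arrhythmia": ["cardiac"],
}


def ancestors(term):
    """Return the term together with all of its ancestors."""
    seen = set()
    stack = [term]
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(PARENTS[current])
    return seen


def closure(terms):
    """Expand a list of terms to include every ancestor of every term."""
    expanded = set()
    for term in terms:
        expanded |= ancestors(term)
    return expanded


def jaccard_similarity(terms_a, terms_b):
    """Jaccard overlap of the ancestor-closed term sets (an assumed measure)."""
    a, b = closure(terms_a), closure(terms_b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def rank_genes(patient_terms, gene_profiles):
    """Sort genes from most to least similar to the patient's phenotypes."""
    scores = {
        gene: jaccard_similarity(patient_terms, profile)
        for gene, profile in gene_profiles.items()
    }
    # Ties are broken alphabetically so the ordering is deterministic.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical gene-to-phenotype profiles for illustration only.
GENE_PROFILES = {
    "GENE_A": ["seizure", "ataxia"],
    "GENE_B": ["arrhythmia"],
    "GENE_C": ["ataxia"],
}

ranking = rank_genes(["seizure"], GENE_PROFILES)
for gene, score in ranking:
    print(f"{gene}: {score:.2f}")
```

Expanding each term set with its ancestors lets related but non-identical phenotypes contribute to the score, which is the core reason ontology-based similarity can outperform exact term matching.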

The main result was that phenotypes in PAVS ranked the correct gene highly, with a reported ROCAUC of 0.89 for gene prioritization. No absolute counts, p-values, or confidence intervals were reported for this metric. Safety, tolerability, and adverse event data were not reported, because this was a database evaluation study rather than a clinical intervention trial.
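For readers unfamiliar with the metric, ROCAUC can be read as the probability that a randomly chosen correct gene scores higher than a randomly chosen incorrect one, with ties counted as half. The sketch below computes it directly from that definition. The scores and labels are made-up illustration values, not data from the study, and the summary does not say how the authors computed their figure.

```python
def roc_auc(scores, labels):
    """ROC AUC as the fraction of (positive, negative) pairs ranked correctly.

    labels use 1 for the correct gene and 0 for an incorrect gene.
    Ties between a positive and a negative count as half a correct pair.
    """
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative example")

    correct_pairs = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                correct_pairs += 1.0
            elif p == n:
                correct_pairs += 0.5
    return correct_pairs / (len(positives) * len(negatives))


# Made-up similarity scores and correctness labels for illustration only.
scores = [0.9, 0.8, 0.4, 0.3, 0.2]
labels = [1, 0, 1, 0, 0]

auc = roc_auc(scores, labels)
print(f"ROC AUC: {auc:.3f}")
```

A value of 1.0 means every correct gene outranks every incorrect one, and 0.5 corresponds to random ordering.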

The study did not explicitly report its key limitations. For practice, this work addresses a gap in population-specific genotype-phenotype resources and provides a benchmark for phenotype-driven variant prioritization in under-represented populations. However, clinicians should interpret these findings cautiously: they come from a database evaluation, the authors note clear differences from global literature-curated databases, and the study does not directly assess clinical outcomes or generalizability beyond the evaluation context.

Researchers created a new database called PAVS to help doctors diagnose rare diseases, especially in Saudi Arabian patients. The database combines genetic information with detailed descriptions of patient symptoms, known as phenotypes. It was built using data from over 5,000 Saudi clinical cases, along with thousands of other cases from international studies and medical literature.

The main goal was to see if this database could help scientists and doctors figure out which gene might be causing a patient's rare disease. When tested, the system was good at putting the correct gene near the top of the list of possibilities, with a performance score (ROCAUC) of 0.89, where 1.0 is a perfect ranking and 0.5 is no better than chance. This means it could be a useful tool for sorting through complex genetic data.

It is important to understand that this study only evaluated how well the database worked in a technical test. It was not a clinical trial that treated patients. The researchers note there are clear differences between their database and other global resources. While this tool addresses a gap in resources for under-represented populations, more research is needed to see how it improves actual diagnosis and care for patients in hospitals and clinics.

What this means for you:

A new database shows promise for helping diagnose rare diseases, but it is a research tool, not yet proven in everyday clinical care.

Study Details

Study type: Cohort

Evidence: Level 3

Published: Apr 2026

Original Abstract

Genotype-phenotype databases are essential for variant interpretation and disease gene discovery. Genetic variation differs among human populations, mainly in allele frequencies and haplotype patterns shaped by ancestry and demographic history. Population-specific genotypes can influence traits and disease risk; this makes population specific characterization important. Most existing resources focus on the characterization of a population's genetic background, but do not represent the resulting phenotypes. We have developed PAVS (Phenotype-Associated Variants in Saudi Arabia), a curated, publicly accessible database that integrates 5,132 Saudi clinical cases from four Saudi cohorts and 522 cases from analysis of a mixed-population cohort, together with 1,856 cases from the Deciphering Developmental Disorders study (DDD) and 9,588 literature phenopackets. Each case record describes patient-level phenotypes, encoded with the Human Phenotype Ontology (HPO), and links them to genomic variants, gene identifiers, zygosity, pathogenicity classifications, and disease diagnoses mapped to standardized disease terminologies. The data is represented in Phenopackets format and as a knowledge graph in RDF. Additionally, a web interface provides phenotype-based similarity search, gene and variant browsers, and an HPO hierarchy explorer. We evaluate the utility of the phenotype annotations for gene prioritization using semantic similarity. While there are clear differences to global literature-curated databases, phenotypes in PAVS can successfully rank the correct gene at high rank (ROCAUC: 0.89). PAVS addresses a gap in population-specific genotype-phenotype resources and provides a benchmark for phenotype-driven variant prioritization in under-represented populations.
