Phenotype database PAVS shows utility for gene prioritization in rare disease cases
This observational database development and evaluation study assessed PAVS, a curated database integrating phenotype-associated variants. The database incorporated data from 5132 Saudi clinical cases, 522 cases from a mixed-population cohort, 1856 cases from the Deciphering Developmental Disorders study, and 9588 literature phenopackets. The primary outcome was the utility of phenotype annotations for gene prioritization using semantic similarity, compared against global literature-curated databases.
The main result showed that phenotypes in PAVS could successfully rank the correct gene at a high rank, with a reported ROCAUC of 0.89 for gene prioritization performance. No specific absolute numbers, p-values, or confidence intervals were reported for this metric. Safety, tolerability, and adverse event data were not reported, as this was a database evaluation study rather than a clinical intervention trial.
Key limitations of the study were not explicitly reported. The practice relevance is that this work addresses a gap in population-specific genotype-phenotype resources and provides a benchmark for phenotype-driven variant prioritization in under-represented populations. However, clinicians should interpret these findings cautiously as they represent a database evaluation with clear differences compared to global literature-curated databases, and do not directly assess clinical outcomes or generalizability beyond the evaluation context.