Mode
Text Size
Log in / Sign up

Algorithm development study validates FindPart-w model for identifying SARS-CoV-2 lineage groups in the US

Algorithm development study validates FindPart-w model for identifying SARS-CoV-2 lineage groups in …
Photo by Ferenc Almasi / Unsplash
Key Takeaway
Consider FindPart-w algorithm for identifying SARS-CoV-2 lineage groups sharing reproduction numbers.

This study focuses on algorithm development and validation for SARS-CoV-2 strains, specifically Omicron subvariants, utilizing time-stamped lineage counts from the United States. The authors developed and tested a FindPart-w algorithm alongside a constrained RelRe model to identify groups of viral lineages that share the same relative effective reproduction numbers. The comparator used was the Pango lineage nomenclature system.

The primary outcome involved identifying these groups of lineages using two distinct data sources: hypothetical observation count data created by simulation and actual real-world data of time-stamped lineage counts from the United States. The study did not report specific effect sizes, absolute numbers, p-values, or confidence intervals for these outcomes. Furthermore, no adverse events, tolerability data, or discontinuations were reported as this was an algorithmic validation effort rather than a clinical trial.

The authors note that this work contributes to the future development of lineage designation systems that consider both genetic backgrounds and transmissibilities of lineages. Limitations regarding funding, conflicts of interest, and specific causality notes were not reported. The practice relevance is limited to methodological advancement rather than immediate clinical application, as no patient outcomes or safety profiles were assessed.

Study Details

EvidenceLevel 5
PublishedApr 2026
View Original Abstract ↓
The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has continuously evolved since its emergence in the human population in 2019. As of 1st August 2025, more than 1,700 Omicron subvariants have been designated by the Pango nomenclature system. The Pango nomenclature system designates a new lineage based on genetic and epidemiological information of SARS-CoV-2 strains. However, there is a possibility that strains that have similar genetic backgrounds and the same phenotype are given different Pango lineage names. In this paper, we propose a new algorithm, called FindPart-w, which can identify groups of viral lineages that share the same relative effective reproduction numbers. We introduced a new lineage replacement model, called the constrained RelRe model, which constrains groups of lineages to have the same relative effective reproduction numbers. The FindPart-w algorithm searches the equality constraints that minimise the Akaike Information Criterion of constrained RelRe models. Using hypothetical observation count data created by simulation, we found that the FindPart-w algorithm can identify groups of lineages having the same relative effective reproduction number in a practical computational time. Applying FindPart-w to actual real-world data of time-stamped lineage counts from the United States, we found that the Pango lineage nomenclature system may have given different lineage names to SARS-CoV-2 strains even if they have the same relative effective reproduction number and similar genetic backgrounds. In conclusion, this study showed that viruses that had the same relative effective reproduction number were identifiable from temporal count data of viral sequences. These findings will contribute to the future development of lineage designation systems that consider both genetic backgrounds and transmissibilities of lineages.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.