Mode
Text Size
Log in / Sign up

Meta-analysis of 7,750 saliva samples validates non-invasive disease prediction models for nasopharyngeal carcinoma, colorectal cancer, and PLHIV.

Meta-analysis of 7,750 saliva samples validates non-invasive disease prediction models for nasophary…
Photo by Testalize.me / Unsplash
Key Takeaway
Note that a meta-analysis of 7,750 saliva samples supports non-invasive disease prediction models with AUCs of 0.898–0.995.

This meta-analysis evaluated public 16S saliva data from 22 cohorts comprising 7,750 samples to investigate microbiota structure differences and multi-disease prediction model performance. The scope included nasopharyngeal carcinoma, colorectal cancer, and PLHIV, utilizing a negative control group for comparison. The study aimed to establish healthy baselines and assess the feasibility of non-invasive diagnosis using machine learning approaches.

The analysis identified nine core microbiota in the negative control group, including g:Streptococcus and g:Haemophilus_D_735815. Microbiota structure differences were observed at the genus level, where nasopharyngeal carcinoma groups resembled controls but diverged from colorectal cancer and PLHIV groups. Multi-class random forest models demonstrated robust classification performance, achieving an AUC between 0.898 and 0.995 for V3-V4 regions and 0.957 to 1 for V4 regions.

The authors acknowledge several limitations, including the constraint of single-disease-focused studies and the necessity to expand disease coverage. They also highlight the need to increase sample sizes and further investigate microbiota-disease associations. Safety data, adverse events, and tolerability were not reported in this review. While the validated feasibility of establishing healthy baselines via saliva microbiota is noted, the authors urge caution in interpreting these results as definitive diagnostic tools pending further investigation.

Study Details

Study typeMeta analysis
EvidenceLevel 1
PublishedApr 2026
View Original Abstract ↓
Saliva harbors a complex human microbiota closely linked to the occurrence and progression of various diseases. This meta-analysis of public 16S saliva data aimed to expand understanding of the microbiota’s associations with multiple diseases and explore its potential as molecular markers for multi-disease prediction, overcoming the limitation of single-disease-focused studies. From PubMed (2016–2024), 22 cohorts met the screening criteria (V3-V4 region 13 cohorts, V4 region 9 cohorts), comprising 7,750 samples. Bioinformatics analyses using QIIME2, Wekemo, and statistical modeling revealed saliva microbiota community characteristics, identified core microbes in the negative control group, and constructed a multi-disease prediction model based on 16S data. Key findings included: (1) significant differences in microbiota structure across physiological/pathological states (e.g., NPC groups resembled controls but diverged from colorectal cancer and PLHIV groups at the genus level); (2) Nine core microbiota, such as g:Streptococcus and g:Haemophilus_D_735815, were identified in the saliva samples of the negative control group; (3) robust classification performance of multi-class random forest models (AUC: 0.898–0.995 for V3-V4, 0.957–1 for V4). This study validated the feasibility of establishing healthy baselines via saliva microbiota and using machine learning for non-invasive disease diagnosis. Future research should expand disease coverage, increase sample sizes, and further investigate microbiota-disease associations to advance the development of non-invasive diagnostics.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.