Meta-analysis of 7,750 saliva samples validates non-invasive disease prediction models for nasopharyngeal carcinoma, colorectal cancer, and PLHIV.
This meta-analysis evaluated public 16S saliva data from 22 cohorts comprising 7,750 samples to investigate microbiota structure differences and multi-disease prediction model performance. The scope included nasopharyngeal carcinoma, colorectal cancer, and PLHIV, utilizing a negative control group for comparison. The study aimed to establish healthy baselines and assess the feasibility of non-invasive diagnosis using machine learning approaches.
The analysis identified nine core microbiota in the negative control group, including g:Streptococcus and g:Haemophilus_D_735815. Microbiota structure differences were observed at the genus level, where nasopharyngeal carcinoma groups resembled controls but diverged from colorectal cancer and PLHIV groups. Multi-class random forest models demonstrated robust classification performance, achieving an AUC between 0.898 and 0.995 for V3-V4 regions and 0.957 to 1 for V4 regions.
The authors acknowledge several limitations, including the constraint of single-disease-focused studies and the necessity to expand disease coverage. They also highlight the need to increase sample sizes and further investigate microbiota-disease associations. Safety data, adverse events, and tolerability were not reported in this review. While the validated feasibility of establishing healthy baselines via saliva microbiota is noted, the authors urge caution in interpreting these results as definitive diagnostic tools pending further investigation.