Mode
Text Size
Log in / Sign up

Meta-analysis of 529 samples identifies 108 gene signatures for active tuberculosis diagnosis.

Meta-analysis of 529 samples identifies 108 gene signatures for active tuberculosis diagnosis.
Photo by Warren Umoh / Unsplash
Key Takeaway
Note that machine learning classifiers using 108 gene signatures show high AUC (0.85-0.94) for active tuberculosis diagnosis in this meta-analysis.

This meta-analysis integrated transcriptomic data from five datasets comprising 529 samples to identify host-derived transcriptional signatures of active tuberculosis. The analysis utilized machine learning approaches to develop classifiers based on these gene expression profiles. No specific medications or interventions were evaluated, as the study focused on diagnostic biomarkers rather than therapeutic effects.

The analysis identified 108 core differentially expressed genes conserved across cohorts. Specifically, 80 genes were upregulated and 28 were downregulated. Pathway analysis revealed modest downregulation of NF-κB signaling (fold-change: -0.023, p = 0.02), antigen presentation (fold-change: -0.026, p = 0.08), and tuberculosis pathways (fold-change: -0.023, p = 0.05).

Machine learning classifiers demonstrated excellent discrimination with cross-validated AUCs ranging from 0.85 to 0.94 (mean: 0.89 ± 0.04). Sensitivity was balanced between 82% and 91%, while specificity ranged from 85% to 93%. No adverse events, serious adverse events, discontinuations, or tolerability data were reported, as safety was not a primary endpoint of this diagnostic biomarker study.

Key limitations include the observational nature of the transcriptomic data, which prevents causal inference. The study phase and specific population characteristics were not reported. Despite these constraints, the high diagnostic accuracy and biologically interpretable feature sets provide validated biomarkers for TB diagnosis, supporting clinical translation toward precision medicine approaches in global TB control.

Study Details

Study typeMeta analysis
EvidenceLevel 1
PublishedApr 2026
View Original Abstract ↓
BackgroundTuberculosis (TB) caused 1.23 million deaths in 2024, with accurate diagnosis hampered by population heterogeneity and limited biomarker generalizability. We developed an integrative framework combining multi-cohort transcriptomics and machine learning to identify host-derived transcriptional signatures of active TB.MethodsFive transcriptomic datasets (GSE83456, GSE107995, GSE158802, GSE19435, GSE25534) comprising 529 samples were analyzed. After standardized preprocessing, we performed differential expression analysis, inverse variance-weighted meta-analysis, and single-sample gene set enrichment analysis (ssGSEA) for three KEGG pathways. Machine learning classifiers were developed using logistic regression with SHapley Additive exPlanations (SHAP)-based interpretability.ResultsMeta-analysis identified 108 core differentially expressed genes (80 upregulated, 28 downregulated) conserved across all cohorts. Upregulated genes showed significant enrichment in interferon signaling, antigen presentation, and chemokine activity. Pathway analysis revealed modest downregulation in NF-κB signaling (fold-change: −0.023, p = 0.02), antigen presentation (fold-change: −0.026, p = 0.08), tuberculosis pathway (fold-change: −0.023, p = 0.05). Machine learning classifiers achieved excellent discrimination with cross-validated AUCs of 0.85–0.94 (mean: 0.89 ± 0.04), maintaining balanced sensitivity (82–91%) and specificity (85–93%). SHAP analysis identified interferon-stimulated genes (STAT1, IFITM1), chemokine receptors (CXCL10, CXCL9), and MHC class II molecules (HLA-DRA) as top predictive features, underscoring the biological relevance of the human host response to Mycobacterium tuberculosis.ConclusionOur integrative framework identifies a conserved 347-gene transcriptional signature and three key immune pathways that transcend population and technical heterogeneity. The high diagnostic accuracy and biologically interpretable feature sets provide validated biomarkers for TB diagnosis and support clinical translation toward precision medicine approaches in global TB control.Clinical trial registrationhttps://www.chictr.org.cn/, identifier ChiCTR2300074328.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.