Unsupervised clustering of glioma RNAseq data shows diagnostic and prognostic potential
This is an unsupervised clustering analysis of public RNAseq data from 692 TCGA gliomas (524 low-grade gliomas, 168 glioblastomas). The authors applied Dynamic Quantum Clustering (DQC) to the data and assessed diagnostic concordance, survival separation, and gene subset accuracy.
Key synthesized findings include a 90.9% posthoc diagnostic concordance between DQC clusters and clinical diagnosis, and 97.3% accuracy for a 554-gene subset in diagnostic separation. A GBM-rich cluster had a 97.1% positive predictive value. For low-grade gliomas, three pure subclusters showed ordered but different survival outcomes based on 90 genes.
The authors note limitations, including heterogeneous biology and analytic drift that may obscure structure, and that unsupervised methods require no clinical labels but posthoc concordance is assessed. The analysis is not a primary trial and results are from public data.
Practice relevance is illustrative: geometry-aware unsupervised learning can translate computational discovery into biology-based patient stratification and prognosis. However, the authors caution that no causal claims are made and clinical implementation requires further validation.