Deep learning models for glioma segmentation achieve high whole tumor accuracy with DSC 0.860
This is a systematic review and meta-analysis of deep learning models for glioma segmentation on preoperative MRI. The authors synthesized evidence from 88 identified models, with 36 included in quantitative analyses. The key finding is high whole tumor segmentation accuracy, with a pooled Dice Similarity Coefficient of 0.860 (95% CI 0.840-0.881). Performance was significantly lower for enhancing, non-enhancing, and tumor core delineation. Models using 3D and multiparametric MRI inputs consistently outperformed those without, and training on BraTS datasets was associated with higher performance. A trend toward improved accuracy for high-grade gliomas was not statistically significant. Training dataset size was not associated with performance. Multivariate meta-regression found only publication year independently predicted improved accuracy (beta=0.023, p=0.017). The authors note that no single factor explained the variability observed across studies and that factors driving heterogeneity are poorly understood. Practice relevance is limited; future studies should prioritize multivariable analyses to better define determinants of model performance and support application into everyday practice.