Meta-analysis links speech features to severity in Parkinsons, cerebral palsy, and ALS
This meta-analysis evaluates a training-free method designed to measure the degradation of phonological feature subspaces within frozen HuBERT representations. The scope encompasses 890 speakers across 10 corpora spanning 5 languages, focusing on conditions including Parkinsons disease, cerebral palsy, and amyotrophic lateral sclerosis. The primary outcome assessed was the correlation of consonant d features with clinical severity, using healthy control speech as a comparator.
The analysis found a significant negative correlation between consonant d features and clinical severity. A random-effects meta-analysis yielded a rho of -0.50 to -0.56, while the pooled Spearman rho ranged from -0.47 to -0.55. These results were statistically significant with a p-value less than 2 x 10^-4, and bootstrap 95% confidence intervals did not cross zero. Additionally, nasality d changes decreased monotonically from control to severe states, and all 12 features distinguished controls from severely dysarthric speakers with a p-value less than 0.001.
The authors highlight that the method requires no dysarthric training data and applies to any language with an existing MFA acoustic model, currently covering 29 languages. Limitations include the reliance on existing acoustic models and the observational nature of the data synthesis. The review suggests this approach could support remote monitoring of speech decline in neurodegenerative diseases and enable screening in settings where specialist assessment is unavailable.