Systematic review and meta-analysis of AI for melanoma risk assessment in pigmented skin lesions
This systematic review and meta-analysis examined diagnostic accuracy for malignancy risk assessment in pigmented skin lesions, including melanoma. It covered 17 diagnostic arms evaluated in real-world clinical settings: 10 dermoscopy arms, 6 standalone AI arms, and 1 AI-assisted clinician arm. The authors pooled sensitivity and specificity for each modality and compared the AI results with those of standard dermoscopy.
For dermoscopy, pooled sensitivity was 0.773 (95% CI, 0.648-0.863) and specificity was 0.793 (95% CI, 0.673-0.877). For standalone AI, pooled sensitivity was 0.757 (95% CI, 0.428-0.928) and specificity was 0.859 (95% CI, 0.619-0.958). For the single AI-assisted clinician arm, sensitivity was 1.000 and specificity was 0.837, though confidence intervals were not reported.
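To make the pooled figures easier to compare, the sketch below converts each sensitivity and specificity pair into positive and negative likelihood ratios using the standard formulas LR+ = sensitivity / (1 - specificity) and LR- = (1 - sensitivity) / specificity. This is an illustration only: the review as summarized here does not report likelihood ratios, the function name `likelihood_ratios` is hypothetical, and the calculation uses point estimates alone, so it ignores the wide confidence intervals (especially for standalone AI). The single AI-assisted clinician arm is left out because no confidence intervals were reported for it.

```python
from typing import Dict, Tuple


def likelihood_ratios(sensitivity: float, specificity: float) -> Tuple[float, float]:
    """Return (LR+, LR-) for one sensitivity/specificity pair.

    Hypothetical helper for illustration; not part of the review.
    """
    if not 0.0 <= sensitivity <= 1.0:
        raise ValueError("sensitivity must lie in [0, 1]")
    if not 0.0 < specificity < 1.0:
        # specificity of 0 or 1 would make one of the ratios undefined
        raise ValueError("specificity must lie strictly between 0 and 1")
    lr_pos = sensitivity / (1.0 - specificity)
    lr_neg = (1.0 - sensitivity) / specificity
    return lr_pos, lr_neg


# Pooled point estimates quoted in the text (sensitivity, specificity).
pooled: Dict[str, Tuple[float, float]] = {
    "dermoscopy": (0.773, 0.793),
    "standalone_ai": (0.757, 0.859),
}

for name, (sens, spec) in pooled.items():
    lr_pos, lr_neg = likelihood_ratios(sens, spec)
    print(f"{name}: LR+ = {lr_pos:.2f}, LR- = {lr_neg:.2f}")
```

Because the confidence intervals of the two modalities overlap substantially, any difference between the derived ratios should not be read as evidence that one modality outperforms the other.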
The authors noted that heterogeneity in AI performance was driven almost entirely by threshold effects (differences in the cut-off at which a lesion is called positive) rather than by differences in inherent model capacity. They also highlighted that more evidence is needed for AI-assisted clinicians. Limitations include the small number of AI-assisted clinician arms and the lack of reported follow-up duration.
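A threshold effect can be illustrated with a minimal sketch: one fixed set of model scores, read at different cut-offs, yields very different sensitivity and specificity pairs even though the model itself has not changed. The scores below are invented for illustration and do not come from the review; `sens_spec` is a hypothetical helper.

```python
from typing import Sequence, Tuple


def sens_spec(
    malignant_scores: Sequence[float],
    benign_scores: Sequence[float],
    threshold: float,
) -> Tuple[float, float]:
    """Sensitivity and specificity when scores >= threshold are called positive."""
    true_pos = sum(score >= threshold for score in malignant_scores)
    true_neg = sum(score < threshold for score in benign_scores)
    return true_pos / len(malignant_scores), true_neg / len(benign_scores)


# Invented risk scores from a single hypothetical model (not study data).
malignant = [0.35, 0.48, 0.55, 0.62, 0.70, 0.78, 0.85, 0.91]
benign = [0.05, 0.12, 0.20, 0.28, 0.33, 0.41, 0.52, 0.66]

# The same model, read at three different operating thresholds.
for threshold in (0.3, 0.5, 0.7):
    sens, spec = sens_spec(malignant, benign, threshold)
    print(f"threshold={threshold:.1f}: sensitivity={sens:.2f}, specificity={spec:.2f}")
```

Raising the cut-off trades sensitivity for specificity along a single underlying curve, which is why studies using different thresholds can look heterogeneous without differing in model capacity.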
Relevance to clinical practice is limited for now. The authors suggest that AI should be viewed as a complementary decision-support tool rather than a replacement for dermoscopic evaluation. The evidence base is early and incomplete, and clinicians should interpret the findings with caution.