This observational review assesses a deep learning (DL) framework designed for the multiclass classification of major mitral valve etiologies, including normal, rheumatic, degenerative, prolapse, and functional states. The analysis utilized a nationwide multicenter registry comprising 4,344 TTE examinations for development and 2,262 examinations for external testing, comparing DL performance against expert visual interpretation.
In the internal test dataset, the area under the receiver operating characteristic curve (AUROC) ranged from 0.968 to 0.997 across etiologies. External testing yielded an AUROC between 0.931 and 0.992. Sensitivity for mitral valve prolapse increased markedly with moderate or greater mitral regurgitation compared with mild MR, whereas sensitivity for degenerative disease remained persistently lower across MR severity levels. In cases involving multiple etiologies, the model correctly identified at least one expert-assigned etiology in 85.7% of instances.
The authors note that adverse events, discontinuations, and tolerability were not reported, as was the follow-up duration. Limitations regarding the study design were not reported. The practice relevance lies in the fact that DL-based analysis of limited, routinely acquired TTE views enables reliable multiclass classification, potentially supporting more consistent and scalable evaluation in routine echocardiographic practice.
View Original Abstract ↓
Background: Accurate etiologic classification of the mitral valve (MV) is essential for guiding clinical management but remains dependent on expert visual interpretation. Despite advances in artificial intelligence (AI)-based quantitative analysis, automated morphologic interpretation under routine imaging conditions remains limited. Objectives: To develop and validate a deep learning (DL) framework for multiclass classification for major MV etiologies using a limited routine transthoracic echocardiography (TTE) views. Methods: A multi-view DL model was developed to classify five MV etiologies (normal, rheumatic, degenerative, prolapse, and functional). The developmental dataset comprised 4,344 TTE examinations from a nationwide multicenter registry. Validation was performed using an internal test dataset and an independent external test dataset (2,262 TTE examinations). Prespecified subgroup analyses were conducted according to mitral regurgitation (MR) severity and automated image quality (IQ). Results: The model demonstrated robust performance across all MV etiologies in both internal and external datasets. In the internal test dataset, area under the receiver operating characteristic curve (AUROC) values ranged from 0.968 to 0.997 across etiologies, with higher performance observed for normal valves and rheumatic disease. In the external test dataset, discriminative performance remained preserved (AUROC, 0.931-0.992), despite differences in disease distribution and MR severity. Sensitivity for MV prolapse increased markedly with moderate or greater MR compared with mild MR, whereas degenerative disease showed persistently lower sensitivity across MR severity. Diagnostic performance remained stable across IQ strata, with comparable accuracy and macro-F1 scores in all-adequate and partially suboptimal examinations. In post-hoc analyses of cases with multiple MV etiologies, the model correctly identified at least one expert-assigned etiology in 85.7% of cases. Conclusions: DL-based analysis of limited, routinely acquired TTE views enables reliable multiclass classification of MV etiologies. This approach may complement quantitative automation and expert visual assessment, supporting more consistent and scalable MV evaluation in routine echocardiographic practice.