This study developed and validated a deep learning framework for cardiac segmentation using cine steady-state free precession MRI images. The CorSeg-CineSAX framework, based on the MedNeXt-L model with a three-step post-processing pipeline, was trained on data from 1,555 subjects across 12 centers and tested on 310 internal subjects plus 855 external subjects from 6 additional centers across 3 countries. The framework was evaluated against manual measurements for segmenting cardiac structures across 15 cardiac disease categories.
On internal testing, the framework achieved a Dice similarity coefficient (DSC) of 0.913 ± 0.037. External validation across multiple datasets showed similar performance with a DSC of 0.911 ± 0.040, representing a minimal cross-domain performance gap of 0.002. The framework eliminated containment violations (reduced from 7.5% to 0%) and gap errors (reduced from 1.8% to 0%), while reducing fragment rates from 9.0% to 1.3%.
For zero-shot generalization to 10 unseen disease categories, DSC ranged from 0.899 to 0.921. Intraclass correlation coefficients for left ventricular indices and right ventricular volumes showed excellent agreement (≥0.977) with manual measurements. No safety or tolerability data were reported as this was a technical validation study.
Key limitations include the lack of p-values or confidence intervals for reported metrics, no reported follow-up period, and the absence of clinical outcome validation. The study demonstrates technical feasibility but does not establish clinical utility or workflow integration. While promising for automated cardiac segmentation, this framework requires prospective clinical validation before considering implementation in practice.
View Original Abstract ↓
Background: Segmentation of the left ventricular myocardium, left ventricular cavity, and right ventricular cavity on short-axis cine cardiac magnetic resonance (CMR) images is essential for quantifying cardiac structure and function. However, existing automated segmentation tools are limited by small training datasets, narrow disease coverage, restrictive input format requirements, and the absence of anatomical plausibility constraints, hindering their clinical adoption. Methods: We constructed the largest annotated CMR short-axis segmentation dataset to date, comprising 1,555 subjects from 12 centers with five cardiac disease types and full cardiac cycle annotations totaling 319,175 labeled images. A MedNeXt-L model was trained using a 2D slice-by-slice strategy with full field-of-view input, eliminating dependencies on 3D volumes, temporal sequences, or region-of-interest(ROI) localization. A deterministic three-step post-processing pipeline was designed to enforce anatomical priors: connected component constraint, containment relationship constraint, and gap-filling constraint. The model was validated on an internal test set (310 subjects) and three independent public external datasets (ACDC, M&Ms1, M and Ms2; 855 subjects from 6 additional centers across 3 countries), spanning 15 cardiac disease categories-10 of which were never encountered during training. Results: The model achieved mean Dice similarity coefficients (DSC) of 0.913 {+/-} 0.037 and 0.911 {+/-} 0.040 on internal and external test sets, respectively, with a cross-domain performance gap of only 0.002. Post-processing eliminated all containment violations (7.5% [->] 0%) and gap errors (1.8% [->] 0%) while reducing fragment rates by 85.5% (9.0% [->] 1.3%). Zero-shot generalization to 10 unseen disease categories yielded DSC values ranging from 0.899 to 0.921. Automated clinical functional parameters demonstrated excellent agreement with manual measurements for left ventricular indices and right ventricular volumes (intraclass correlation coefficients [≥] 0.977). Conclusions: CorSeg-CineSAX provides a robust, open-source framework for fully automatic CMR short-axis segmentation across diverse clinical scenarios. All source code and pre-trained weights are publicly available at https://github.com/RunhaoXu2003/CorSeg.