Multi-task learning model for gastrointestinal cancer histopathology demonstrates high classification and segmentation performance in preclinical validation.
This methodological development study assessed a multi-task learning model for gastrointestinal cancer in a preclinical validation setting. The study population consisted of histopathological patches from the GasHisSDB and GCHTID datasets, totaling approximately 99,000 H&E-stained images. Evaluation was performed under controlled benchmark settings; no clinical follow-up period was reported.
The intervention was a multi-task learning model integrating Swin Transformer, DeepLabV3+, and R2U-Net for joint classification and segmentation; comparators were ResNet-50 and Swin-T. The model achieved a classification F1-score of 0.938 ± 0.007 and a segmentation Dice coefficient of 0.839 ± 0.009. Against ResNet-50, Swin-T showed a classification F1-score of 0.945 vs. 0.917 and an AUC of 0.965 vs. 0.907. Additional metrics included a LYM Dice of 0.781 for small-volume tissues and a cross-domain transfer F1-score of 0.902. Under staining perturbations, the Dice coefficient decreased by only 2.4%.
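For readers less familiar with the reported metrics, the Dice coefficient and F1-score can be computed as below. This is an illustrative sketch using NumPy, not the authors' evaluation code; the function names and the epsilon smoothing term are assumptions for the example.

```python
import numpy as np

def dice_coefficient(pred, target, eps=1e-7):
    """Dice coefficient between two binary segmentation masks:
    2 * |pred ∩ target| / (|pred| + |target|)."""
    pred = pred.astype(bool)
    target = target.astype(bool)
    intersection = np.logical_and(pred, target).sum()
    # eps guards against division by zero when both masks are empty
    return (2.0 * intersection + eps) / (pred.sum() + target.sum() + eps)

def f1_score(y_true, y_pred, eps=1e-7):
    """Binary classification F1: harmonic mean of precision and recall."""
    tp = np.logical_and(y_pred == 1, y_true == 1).sum()
    fp = np.logical_and(y_pred == 1, y_true == 0).sum()
    fn = np.logical_and(y_pred == 0, y_true == 1).sum()
    precision = tp / (tp + fp + eps)
    recall = tp / (tp + fn + eps)
    return 2 * precision * recall / (precision + recall + eps)

# Toy masks: one of three predicted-positive/true-positive pixels overlaps
pred_mask = np.array([[1, 1], [0, 0]])
true_mask = np.array([[1, 0], [0, 0]])
print(dice_coefficient(pred_mask, true_mask))  # → ~0.667

# Toy labels: tp=1, fp=1, fn=1, so precision = recall = F1 = 0.5
y_true = np.array([1, 1, 0, 0])
y_pred = np.array([1, 0, 1, 0])
print(f1_score(y_true, y_pred))  # → ~0.5
```

In multi-class settings such as this study, per-class Dice (e.g. the reported LYM Dice) is typically obtained by binarizing each class mask separately before applying the same formula.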
Safety data on adverse events, serious adverse events, and discontinuations were not reported, consistent with the preclinical nature of the work. Key limitations include validation conducted under controlled benchmark conditions using strongly annotated patch-level data. The framework is regarded as a clinically relevant preclinical validation system; feasibility for routine clinical implementation requires further verification through large-scale whole-slide image cohorts, prospective multicenter studies, and workflow integration assessments.