Mode
Text Size
Log in / Sign up

Machine learning quantification of Gleason Pattern 4 showed higher discrimination for adverse pathology than grade group in prostate cancer.

Machine learning quantification of Gleason Pattern 4 showed higher discrimination for adverse pathol…
Photo by Logan Voss / Unsplash
Key Takeaway
Note that ML quantification of GP4 showed higher discrimination than GG in this proof-of-principle cohort.

This cohort study, classified as a proof-of-principle investigation, included 726 patients with grade group 2-4 prostate cancer identified on systematic biopsy who subsequently underwent radical prostatectomy. The setting and specific funding sources were not reported. The primary exposure involved machine learning (ML) quantification approaches for Gleason Pattern 4 (GP4) utilizing the PAIGE-AI algorithm, which was compared against standard Grade Group (GG) assessment. The main outcome assessed was adverse radical prostatectomy pathology and biochemical recurrence.

Regarding discrimination for adverse radical prostatectomy pathology, the pixel-counting method yielded an area under the curve (AUC) of 0.648. Quantification of GP4 using the ML approach outperformed standard GG assessment, with an AUC of 0.627 versus 0.608 for GG. Findings regarding biochemical recurrence were consistent with adverse pathology, though specific effect sizes were not reported. The predictive value of GP3 was noted as non-predictive once GP4 was known. Safety data, including adverse events and tolerability, were not reported.

Key limitations include the use of a convenience sample and the study's designation as a proof-of-principle effort. The authors intend to apply this method on larger cohorts to determine the best prediction of oncologic outcomes. Consequently, practice relevance is currently limited to supporting the use of ML as a research tool to compare different GP4 quantification approaches. Larger cohorts are needed to determine the best prediction of oncologic outcome.

Study Details

Study typeCohort
Sample sizen = 726
EvidenceLevel 3
PublishedApr 2026
View Original Abstract ↓
Objective: To demonstrate the proof of principle that machine learning (ML) can be used to quantify Gleason Pattern (GP) 4 on digitized biopsy slides using multiple measurement approaches, allowing direct comparison of their prognostic performance. Methods: We assembled a convenience sample of 726 patients with grade group 2-4 prostate cancer on systematic biopsy who underwent radical prostatectomy between 2014 and 2023. Digitized biopsy slides were analyzed using a machine-learning algorithm (PAIGE-AI) to quantify GP4 using multiple measurement approaches, particularly with respect to how gaps between cancer foci (interfocal stroma) were handled. GP4 extent was quantified using linear measurements or a pixel-based area metric. Discrimination of each GP4 quantification approach, along with Grade Group (GG), was assessed for adverse radical prostatectomy pathology and biochemical recurrence. Results: We identified 15 different quantification approaches and observed differences between their discrimination. The highest discrimination was in the pixel-counting method (AUC 0.648). GP4 quantification outperformed GG for predicting adverse pathology (AUC 0.627 vs 0.608). Amount of GP3 was non-predictive once GP4 was known. These findings were consistent for BCR. Conclusions: We were able to measure slides using 15 distinct measurement approaches and replicated prior findings using ML to quantify GP4. Our findings support the use of ML as a research tool to compare different GP4 quantification approaches. We intend to use our method on larger cohorts to determine with which measurement approach best predicts oncologic outcome.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.