CT-based deep learning models show high accuracy predicting EGFR mutation status in lung adenocarcinomaMachine learning models help predict lung cancer mutations from scans

The Journal of international medical research Published July 5, 2026 Study authors: Yang Jia, Jiang Junyu, Peng Jin, Li Jie, Mi Peng, Chen Guangwen PubMed ↗ DOI ↗ Editorial oversight: Dr. Julia Lee, PhD · Oncology, Genomics & Drug Development

AI-generated summary of the cited source, checked by automated accuracy review. How we work

Key Takeaway

Consider CT-based deep learning models as a promising noninvasive tool for predicting EGFR mutation status in lung adenocarcinoma.

This meta-analysis evaluated the diagnostic performance of machine learning-based radiomics models for predicting epidermal growth factor receptor (EGFR) mutation status in a population of 6,628 Chinese patients with lung adenocarcinoma. The study aimed to determine how different computational approaches and imaging modalities influence the accuracy of identifying these critical mutations.

The analysis compared several variables including model architecture (deep learning versus conventional machine learning), imaging modality (Computed Tomography [CT] versus Positron Emission Tomography-Computed Tometry [PET-CT]), and validation methods (internal versus external cohorts). The primary outcome was the diagnostic performance of these radiomics models in predicting EGFR mutation status.

The pooled results for the radiomics models showed a sensitivity of 71% (95% confidence interval: 68-74) and a specificity of 81% (95% confidence interval: 78-84). The summary area under the curve (AUC) for these models was 0.85 (95% confidence interval: 0.82-0.88).

Comparative analyses revealed significant differences based on methodology and technology. Deep learning models demonstrated a significantly higher AUC of 0.871 compared to 0.798 for conventional machine learning models (P = 0.012). Furthermore, CT-based models yielded higher accuracy with an AUC of 0.879 compared to 0.828 for PET-CT-based models (P = 0.038). The study also found that models validated on independent external cohorts performed significantly better than those using internal validation, showing an AUC of 0.922 versus 0.841 (P = 0.006).

Safety and tolerability data were not reported as the study focused on diagnostic modeling rather than clinical intervention. The findings suggest that CT-based deep learning models, particularly those validated on external cohorts, offer a promising noninvasive pathway for predicting EGFR mutations.

Methodological limitations included the retrospective nature of the studies included in the meta-analysis and substantial heterogeneity among those studies. These factors may impact the generalizability of the results to broader populations outside of the specific Chinese cohort studied.

Clinical implications suggest that machine learning-based radiomics, specifically CT-based deep learning models with external validation, show promise for noninvasive prediction of EGFR mutations in lung adenocarcinoma patients. However, clinical translation requires large-scale, prospective, multicenter trials with standardized workflows to confirm these findings in a real-world setting.

Several questions remain regarding the integration of these models into standard diagnostic pipelines and whether they can consistently outperform traditional biopsy methods across diverse ethnicities and imaging hardware. Further research is needed to establish standardized radiomics features for consistent cross-platform application.

How this fits prior evidence

How this fits prior evidence: This finding addresses a gap in noninvasive diagnostic tools for lung adenocarcinoma. While previous reports have highlighted specific biomarkers like ADAM9 expression linked to survival, and clinical management of EGFR-TKI-resistant cases or complications from drugs like osimertinib, this study provides data on the predictive accuracy of radiomics models for identifying EGFR mutations specifically.

For people living with lung adenocarcinoma, knowing the specific genetic makeup of a tumor is vital. One common target is the EGFR mutation. Identifying this mutation helps doctors choose the most effective treatments for their patients. Currently, identifying these mutations often requires specialized laboratory tests on tissue samples. This research looks at whether computer-based tools can help predict these mutations using medical images instead.

The researchers conducted a meta-analysis, which is a study that combines data from many previous studies to find broad patterns. They looked at data from over 6,000 Chinese patients with lung adenocarcinoma. The goal was to see how well machine learning and radiomics models could predict EGFR mutation status based on different types of medical imaging.

The results showed that these computer models are quite effective at identifying the mutations. Specifically, the models had a high accuracy rate, with an area under the curve score of 0.85. The study found that deep learning models performed significantly better than traditional machine learning methods. Additionally, models using computed tomography (CT) scans were more accurate than those using positron emission tomography-computed tomography (PET-CT) scans. The most accurate results came from models that were tested on independent groups of patients.

While these results are promising, there are important reasons to remain cautious. This study was a meta-analysis of existing data, which means it is not a new clinical trial. Many of the original studies included in this analysis were retrospective, meaning they looked back at old data rather than following patients forward in real time. There was also a lot of variation between the different studies that were combined.

What does this mean for patients today? Right now, these computer models are not yet a replacement for standard laboratory testing. The technology shows great potential for being a non-invasive way to predict mutations, but it still needs much more testing. To be used in everyday clinics, the method would need large-scale trials and standardized procedures across many different hospitals. In short, while the computer models show strong promise for the future of lung cancer diagnosis, they are currently a research tool rather than an immediate change in how doctors treat patients. It is an encouraging step toward potentially faster ways to identify important genetic markers in lung cancer.

What this means for you:

Deep learning models using CT scans show promise for predicting lung cancer mutations but need more testing.

Study Details

Study typeMeta analysis

Sample sizen = 6,628

EvidenceLevel 1

PublishedJul 2026

PMID42396618

View Original Abstract ↓

ObjectiveThis systematic review and meta-analysis evaluates the diagnostic performance of machine learning-based radiomics models for predicting epidermal growth factor receptor mutation status in Chinese patients with lung adenocarcinoma.MethodsFollowing the Preferred Reporting Items for Systematic Reviews and Meta-Analyses 2020 guidelines and prospectively registered in the International Prospective Register of Systematic Reviews (CRD420251273027), a systematic search of PubMed, Embase, Web of Science, the Cochrane Library, Scopus, China National Knowledge Infrastructure, Wanfang, VIP, and Chinese Biomedical Literature Database was conducted from inception to 31 October 2025. Two reviewers independently screened studies, extracted data, and assessed bias using the Quality Assessment of Diagnostic Accuracy Studies-2 tool. A bivariate random-effects model was used to synthesize the data. Subgroup analyses were conducted for three factors: (a) imaging modality (computed tomography vs. positron emission tomography-computed tomography); (b) algorithm type (deep learning vs. conventional machine learning); and (3) validation strategy (external vs. internal).ResultsThirteen studies encompassing 6628 patients were included. The pooled sensitivity was 71% (95% confidence interval: 68-74), the pooled specificity was 81% (95% confidence interval: 78-84), and the summary area under the curve was 0.85 (95% confidence interval: 0.82-0.88). Deep learning models significantly outperformed conventional machine learning models (area under the curve: 0.871 vs. 0.798; P = 0.012). Computed tomography-based models yielded higher accuracy than positron emission tomography-computed tomography-based models (area under the curve: 0.879 vs. 0.828; P = 0.038). Models validated on independent external cohorts demonstrated superior performance compared with those relying solely on internal validation (area under the curve: 0.922 vs. 0.841; P = 0.006). Imaging modality was a significant source of heterogeneity (P < 0.05). No threshold effect or publication bias was detected.ConclusionMachine learning-based radiomics models exhibit promising diagnostic accuracy for the noninvasive prediction of epidermal growth factor receptor mutations in Chinese patients with lung adenocarcinoma. Computed tomography-based deep learning models subjected to independent external validation represent the current optimal approach. However, the retrospective nature and substantial heterogeneity of the included studies necessitate large-scale, prospective, multicenter trials with standardized workflows before clinical translation.

CT-based deep learning models show high accuracy predicting EGFR mutation status in lung adenocarcinomaMachine learning models help predict lung cancer mutations from scans

How this fits prior evidence

Study Details

Combination of Local Treatment and Systemic Therapy Improves Outcomes in Patients with Hepatocellular Carcinoma

Combining local and systemic treatments improves survival in liver cancer

Clinical research that matters. Delivered to your inbox.

CT-based deep learning models show high accuracy predicting EGFR mutation status in lung adenocarcinomaMachine learning models help predict lung cancer mutations from scans

How this fits prior evidence

More on Lung Adenocarcinoma

Study Details

Combination of Local Treatment and Systemic Therapy Improves Outcomes in Patients with Hepatocellular Carcinoma

Combining local and systemic treatments improves survival in liver cancer

Clinical research that matters. Delivered to your inbox.

Related in Oncology

From Other Specialties