Mode
Text Size
Log in / Sign up

Open-sourcing data and annotations for renal cell carcinoma yielded 142 annotated CT scans from 101 patients.

Open-sourcing data and annotations for renal cell carcinoma yielded 142 annotated CT scans from 101 …
Photo by asif mohomed / Unsplash
Key Takeaway
Note that open-sourcing data for RCC yielded 142 annotated CT scans, though re-annotation is often necessary.

This cohort study evaluated the utility of open-sourcing data and annotations for renal cell carcinoma (RCC). The analysis included a sample of 142 annotated CT scans derived from 101 patients, comprising 26 females and 75 males with a mean age of 56 years. The specific setting and publication type were not reported in the available data.

The primary outcome of the data processing involved the total number of annotated CT scans after exclusion and quality control, which resulted in 142 scans. The cohort was further stratified by histologic subtype, identifying 95 cases of clear cell RCC, 29 cases of papillary RCC, and 18 cases of chromophobe RCC.

No adverse events, serious adverse events, discontinuations, or tolerability data were reported as the study focused on data availability rather than clinical intervention. A key limitation noted was that re-annotation is often necessary due to limited access to public annotations. Funding sources and conflicts of interest were not reported.

The practice relevance of this work is to encourage accessible and reproducible AI research for renal cell carcinoma. Clinicians should recognize that while data sharing is beneficial, the quality of annotations may require additional verification before integration into clinical workflows.

Study Details

Study typeCohort
Sample sizen = 56
EvidenceLevel 3
PublishedApr 2026
View Original Abstract ↓
Background: Medical imaging, especially computed tomography and magnetic resonance imaging, is essential in clinical care of patients with renal cell carcinoma (RCC). Artificial intelligence (AI) research into computer-aided diagnosis, staging and treatment planning needs curated and annotated datasets. Across literature, The Cancer Genome Atlas (TCGA) datasets are widely used for model training and validation. However, re-annotation is often necessary due to limited access to public annotations, raising entry barriers and hindering comparison with prior work. Methods: We screened 1915 CT scans from three TCGA-RCC databases and employed a segmentation model to annotate kidney lesion. After a meta-data-based exclusion step, we hosted a reader study with all papillary (n=56), chromophobe (n=27) and 200 randomly selected clear cell RCC cases. Two students quality checked and corrected the data as well as annotated tumors and cysts. Uncertain cases were checked by a board-certified radiologist. Results: After data exclusion and quality control a total of 142 annotated CT scans from 101 patients (26 female, 75 male, mean age 56 years) remained. This includes 95 CTs with clear cell RCC, 29 with papillary RCC and 18 with chromophobe RCC. Images and voxel-level annotations of kidneys and lesions are open sourced at https://zenodo.org/records/19630298. Conclusion: By making the annotations open-source, we encourage accessible and reproducible AI research for renal cell carcinoma. We invite other researchers who have previously annotated any of these cohorts to share their annotations.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.