Mode
Text Size
Log in / Sign up

AI software modestly improves AUROC and reduces interpretation time in breast ultrasound readingAI software modestly improves breast ultrasound analysis speed and performance in small study

AI-generated summary of the cited source, checked by automated accuracy review. How we work

Key Takeaway
Interpret AI's modest AUROC gain and time savings cautiously without significant accuracy change.

In a retrospective multi-reader cohort study, six radiologists interpreted 258 breast ultrasound examinations (129 malignant and 129 benign lesions) with and without assistance from Vis-BUS, a commercial AI detection and analysis software. The study compared diagnostic performance and interpretation time between AI-assisted and unassisted reads.

With AI assistance, the pooled area under the receiver operating characteristic curve (AUROC) increased modestly from 0.921 to 0.953 (p = 0.002). Median interpretation time per case decreased from 6.0 to 3.0 seconds (p < 0.001). However, key diagnostic accuracy metrics showed no significant differences: accuracy was 79.1% vs. 83.9% (p = 0.061), sensitivity was 94.2% vs. 96.3% (p = 0.243), and specificity was 64.0% vs. 71.6% (p = 0.069).

Safety and tolerability data were not reported. Key limitations include the retrospective design and the use of a multi-reader study with a washout period, which may not reflect real-world clinical workflow. The study demonstrates an association between AI use and improved AUROC with faster interpretation, but the lack of significant change in accuracy, sensitivity, or specificity suggests the clinical impact on diagnostic performance may be limited. Prospective studies are needed to determine if these findings translate to improved patient outcomes.

Researchers wanted to see if artificial intelligence (AI) software could help radiologists analyze breast ultrasound images. They tested a commercial AI program called Vis-BUS. Six radiologists examined 258 breast ultrasound images, half showing cancerous lesions and half showing benign ones. They read the images once without AI help and once with it, with a break in between to avoid memory bias.

When radiologists used the AI software, one technical measure of performance improved slightly. Their reading time also dropped from about 6 seconds to 3 seconds per case. However, key measures of diagnostic accuracy—like how often they correctly identified cancer or ruled it out—did not change significantly with the AI.

This was a small, early study that looked back at existing images. It did not test the AI in a real clinical setting where doctors make immediate decisions. The study also did not report any safety issues or conflicts of interest. The results suggest AI might help speed up analysis, but it did not clearly make radiologists more accurate in this specific test. More research in real-world settings is needed to see if this tool is truly helpful for patients.

What this means for you:
Early study shows AI may speed up breast ultrasound reading but didn't improve diagnostic accuracy in this test.

Study Details

Study typeCohort
EvidenceLevel 3
PublishedApr 2026
View Original Abstract ↓
ObjectivesTo evaluate whether Vis-BUS, a commercial artificial intelligence (AI) breast ultrasound detection and analysis software, improves diagnostic discrimination and interpretation efficiency in breast ultrasound examinations.Materials and methodsThis retrospective multi-reader study included 258 breast ultrasound examinations (129 malignant and 129 benign lesions). Six radiologists independently interpreted all cases without AI and, after a two-week washout, with AI assistance. Diagnostic performance metrics, including the area under the receiver operating characteristic curve (AUROC), area under the precision–recall curve (AUPRC), accuracy, sensitivity, and specificity, were compared using multi-reader analysis. Median interpretation time per case was recorded and compared using paired statistical tests.ResultsVis-BUS assistance modestly increased the pooled AUROC (0.921 vs. 0.953, p = 0.002) and reduced median reading time (6.0 vs. 3.0 s, p  0.06). Accuracy (79.1% vs. 83.9%, p = 0.061), sensitivity (94.2% vs. 96.3%, p = 0.243), and specificity (64.0% vs. 71.6%, p = 0.069) showed no significant differences. Median interpretation time decreased from 6.0 to 3.0 s (p 
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.