Mode
Text Size
Log in / Sign up

AI-assisted echocardiography shows moderate agreement with cardiologists in ATTR-CM monitoringAI Ultrasounds Match Experts for Tracking a Stealth Heart Disease

AI-generated summary of the cited source, checked by automated accuracy review. How we work

Key Takeaway
Note moderate AI-cardiologist agreement in ATTR-CM; consider for longitudinal monitoring with caution.

This retrospective observational study assessed the performance of a fully automated, AI-assisted echocardiographic algorithm (Us2.ai) in 62 patients with transthyretin cardiomyopathy (ATTR-CM) undergoing serial annual echocardiograms. The primary outcome measured agreement and reproducibility of automated measurements compared to those obtained by a reference cardiologist, a second cardiologist, and a novice reader. Secondary outcomes included interrater agreement, intrarater variability, and AI repeatability.

For interventricular septum thickness (IVSd), the AI showed moderate agreement with the reference cardiologist, with an intraclass correlation coefficient (ICC) of 0.65 and a negative bias of -1.9 mm. For left ventricular end-diastolic volume (LVEDV), agreement was also moderate (ICC 0.51) with a negative bias of -39 mL. In contrast, interrater agreement between the two cardiologists was good for both IVSd (ICC 0.79, bias -0.2 mm) and LVEDV (ICC 0.84, bias +3 mL). Intrarater variability for both cardiologists was moderate to excellent, with limits of agreement ranging from 2.7 mm to 43 mL. AI repeatability limits of agreement were 3.6 mm and 37 mL, comparable to experienced cardiologists. The novice reader demonstrated higher variability, with limits of agreement of 5.1 mm and 61 mL.

No adverse events, serious adverse events, discontinuations, or tolerability issues were reported, as this was a diagnostic accuracy study. Key limitations include the retrospective study design and the lack of reported funding or conflicts of interest. The study supports the use of automated analysis for longitudinal echocardiographic monitoring in ATTR-CM, but results should be interpreted with caution given the association-only nature of the data and the moderate agreement observed between the AI and expert human readers.

The silent protein clogging hearts

ATTR cardiomyopathy (amyloid protein buildup in the heart) used to be considered rare.

Today, doctors realize it's much more common than once thought, especially in older adults. New treatments can slow it down — but only if changes in the heart are spotted early and accurately.

Tracking those changes is harder than it sounds.

Echocardiography (heart ultrasound) is the workhorse test for monitoring ATTR-CM.

But echoes have a well-known weakness: they depend heavily on the person doing them. Two cardiologists reading the same patient's echoes may get slightly different numbers — and those small differences can obscure real disease progression.

As new ATTR therapies become available, knowing whether the heart is getting worse — or holding steady — matters more than ever.

The old way vs. the new question

Traditionally, serial echoes have been measured by human readers who manually trace chambers, walls, and motion on the screen.

That works, but variation is inevitable — especially between a senior cardiologist and a less-experienced reader.

What's new: fully automated AI algorithms can now perform many of these measurements without human input. The question this study asked is whether AI is reliable enough to track patients over time, alongside — or instead of — human readers.

How it works, in simple terms

Picture an echocardiogram as a moving blueprint of the heart.

A cardiologist clicks through frames, marks the walls and chambers, and calculates measurements like ventricular wall thickness and chamber volume. It's careful work, but mouse clicks and judgment calls vary.

An AI algorithm, trained on thousands of echoes, does the same job almost instantly. It always "clicks" the same way given the same image — so in theory, it should be highly consistent.

Think of it like two bakers measuring flour by eye versus a kitchen scale — the scale may not be perfectly calibrated, but it's always the same scale every time.

The study at a glance

Researchers retrospectively analyzed 178 serial annual echocardiograms from 62 patients with ATTR-CM.

Each echo was evaluated four ways: by a reference (expert) cardiologist, by a second cardiologist, by a novice reader, and by an automated AI system (Us2.ai).

They used Bland-Altman analysis and intraclass correlation coefficients (ICCs) to judge agreement between readers and measured each reader's repeatability when re-reading images blindly.

The picture was mixed — but encouraging for AI in an important way.

AI showed moderate agreement with the reference cardiologist for two key measurements: interventricular septal thickness (IVSd) and left ventricular end-diastolic volume (LVEDV). The ICCs were 0.65 and 0.51.

Cardiologist-to-cardiologist agreement was better (ICC 0.79 and 0.84) with very small systematic differences.

But here's the more interesting result: AI's repeatability — how consistent it was when re-measuring the same images — was comparable to experienced cardiologists. The novice reader, by comparison, was noticeably less consistent.

This means AI isn't replacing the expert's eye, but it's already more reliable than a less-experienced one.

A subtle but important point

For tracking disease progression, repeatability may matter even more than absolute accuracy.

If an AI systematically undershoots a measurement by a small amount — but does so consistently every single time — it can still reliably flag real changes over time.

That's the key takeaway for longitudinal monitoring: detecting change is what matters, and AI did that well.

The authors suggest AI-based echo analysis could support long-term monitoring of ATTR-CM, especially in centers that lack a dedicated expert reader for every patient.

It won't replace experienced cardiologists anytime soon. But it could ease workload, standardize measurements across visits, and offer a reliable second opinion — particularly valuable when an echo is read by someone less experienced.

More broadly, this study fits the trend of AI tools earning their place as assistants rather than replacements in medical imaging.

If you have ATTR cardiomyopathy and your echoes are being reviewed by a dedicated cardiac team, you're likely in good hands already.

At centers where AI-assisted readings are available, those tools may add consistency from one annual echo to the next — helping your care team detect subtle progression sooner.

You don't need to ask for AI analysis specifically. But knowing it exists may help explain why your reports are sometimes accompanied by automated measurements alongside the cardiologist's.

Honest limitations

This was a single-center retrospective study of 62 patients with one AI system.

Results might not apply to other AI algorithms or other hospital settings. The "reference" in any agreement study is itself imperfect — so saying AI agrees "moderately" with an expert doesn't necessarily mean AI is wrong. Sample sizes for some sub-analyses were small.

And echocardiography is just one way to monitor ATTR-CM; imaging like cardiac MRI and blood biomarkers also play roles.

Expect AI-assisted echocardiography to expand rapidly over the next few years.

Larger multi-center studies will help pin down exactly how much variation AI tools add — and reduce — compared to human readers. Direct comparisons across AI platforms will also matter.

For ATTR-CM specifically, this is good news: as treatments evolve, reliable tracking is critical, and AI may help make that tracking more consistent across clinics of different sizes and expertise levels.

Study Details

Sample sizen = 62
EvidenceLevel 5
PublishedApr 2026
View Original Abstract ↓
Background: Detection of disease progression is key to personalize treatment strategies in transthyretin cardiomyopathy (ATTR-CM), particularly with emerging therapies. Echocardiography can detect subtle longitudinal changes but is limited by operator dependence. This study evaluates agreement and reproducibility of fully automated, AI-assisted echocardiographic measurements under real-world conditions. Methods: This retrospective study included 62 patients with ATTR-CM undergoing 178 serial annual echocardiograms assessed by a reference cardiologist, a second cardiologist, a novice reader, and a fully automated AI algorithm (Us2.ai). Interrater agreement was assessed using Bland-Altman analysis and intraclass correlation coefficients (ICCs). Intrarater variability for human readers was derived from repeated blinded measurements, with limits of agreement (LoA = mean difference +/- 1.96 x SD) defining the smallest detectable change. AI repeatability was assessed using within-study pairwise differences. Results: AI showed moderate agreement with the reference cardiologist for IVSd and LVEDV (ICC 0.65 and 0.51), with biases of -1.9 mm and -39 mL, respectively. Interrater agreement between cardiologists was good (ICC 0.79 and 0.84) with minimal bias (-0.2 mm and +3 mL). Intrarater variability was moderate to excellent for both cardiologists (LoA 3.0 mm and 43 mL for the reference cardiologist; 2.7 mm and 31 mL for the second cardiologist). AI demonstrated comparable repeatability (LoA 3.6 mm and 37 mL), while the novice showed higher variability (5.1 mm and 61 mL). Conclusion: AI-based measurements demonstrated repeatability comparable to experienced cardiologists. Despite moderate agreement and systematic differences in volumetric assessments, their reproducibility supports automated analysis for longitudinal echocardiographic monitoring.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.