Researchers analyzed seven studies to see how well machine learning (ML) models could predict acute kidney injury in children undergoing cardiac surgery. The study looked at several measures of accuracy, including sensitivity and specificity, to determine if these computer models could identify at-risk patients effectively.
The results showed that while some models performed very well during internal testing, their performance dropped when tested on external data. Specifically, externally validated models had a lower sensitivity of 0.70 compared to 0.84 in internally validated versions. The overall accuracy for the average performing models was also lower than those specifically tuned for internal use.
Because there was a lot of variation between the different studies and limited testing across multiple centers, these results are not yet ready to change standard hospital practices. While the technology shows promise for identifying kidney risks in young patients, more large-scale validation is needed before it can be used reliably by doctors in daily clinical care.