Researchers evaluated how well different artificial intelligence (AI) tools could screen titles and abstracts to find relevant information about Parkinson disease. They compared two specific tools, Loon Lens and Catchii, against human screening as the gold standard. The study aimed to see if these technologies could help scientists manage large amounts of medical data more efficiently.
The results showed that Loon Lens performed better than Catchii when compared to human reviewers. When looking at full-text articles, most disagreements between the AI and humans happened on papers the AI included but humans left out. The study also found that when the AI gave a high confidence score, it was much more likely to agree with the human experts.
Because AI technology is changing very quickly, these results are not yet a standard for every situation. Different tools perform differently depending on the specific task. Researchers should still test any AI tool thoroughly before using it in practice. These findings show that while AI can be a helpful assistant, it currently works best as a support tool rather than a replacement for human oversight.