arXiv:2606.00096v2 Announce Type: replace-cross Abstract: Visual agents employ external visual tools within visual chains of thought to incorporate fine-grained evidence. While prior work has mainly studied these tools in visual search tasks, their role in more complex visual reasoning remains underexplored. In this paper, we move beyond simple...