Echo-Memory: A Controlled Study of Memory in Action World Models Paper • 2606.09803 • Published 12 days ago • 32
SWE-Explore: Benchmarking How Coding Agents Explore Repositories Paper • 2606.07297 • Published 15 days ago • 116
XSkill: Continual Learning from Experience and Skills in Multimodal Agents Paper • 2603.12056 • Published Mar 12 • 34
AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials Paper • 2412.09605 • Published Dec 12, 2024 • 31
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper • 2412.04454 • Published Dec 5, 2024 • 71
MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures Paper • 2410.13754 • Published Oct 17, 2024 • 76