Preference Orchestrator: Prompt-Aware Multi-Objective Alignment for Large Language Models Paper • 2511.10656 • Published Nov 3, 2025
SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks Paper • 2606.09669 • Published 14 days ago • 45
SlimSearcher: Training Efficiency-Aware Web Agents via Adaptive Reward Gating Paper • 2606.07074 • Published 17 days ago • 12
SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks Paper • 2606.09669 • Published 14 days ago • 45
SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search Paper • 2605.29796 • Published 25 days ago • 25
CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents Paper • 2605.25624 • Published 28 days ago • 34
OpenWorldLib: A Unified Codebase and Definition of Advanced World Models Paper • 2604.04707 • Published Apr 6 • 204