- Token-Efficient Long Video Understanding for Multimodal LLMs
  Paper • 2503.04130 • Published • 97
- Temporal Preference Optimization for Long-Form Video Understanding
  Paper • 2501.13919 • Published • 23
- MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning
  Paper • 2503.07365 • Published • 61
Zhang Yuanhan
ZhangYuanhan
AI & ML interests
None yet
Recent Activity
upvoted a paper 4 days ago
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models
upvoted a paper 4 days ago
Rethinking the Divergence Regularization in LLM RL
upvoted a paper 2 months ago
FileGram: Grounding Agent Personalization in File-System Behavioral Traces