Submitted by
Yi-Fan Zhang
Papers
OmniVideo-100K: A Dataset for Audio-Visual Reasoning through Structured Scripts and Evidence Chains
HYDRA-X: Native Unified Multimodal Models with Holistic Visual Tokenizers