arxiv:2602.04705
Junyuan Shang
sjy1203
AI & ML interests
NLP
Recent Activity
upvoted a paper 2 days ago
Scaling Embeddings Outperforms Scaling Experts in Language Models
upvoted a paper 20 days ago
Native Audio-Visual Alignment for Generation
authored a paper 4 months ago
DHA: Learning Decoupled-Head Attention from Transformer Checkpoints via Adaptive Heads Fusion
Organizations
None yet