view article Article MTEB Leaderboard: From a slow demo to feature-rich leaderboard Samoed • 4 days ago • 21
Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses Paper • 2606.02373 • Published 15 days ago • 53
view article Article Designing the hf CLI as an agent-optimized way to work with the Hub celinah, Wauplin • 12 days ago • 56
Less Is More? When Dataset Context Hurts LLM-Generated Dataset Descriptions Paper • 2606.02334 • Published 15 days ago • 1
view article Article Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL +6 aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, lvwerra, sergiopaniego • 20 days ago • 41
view article Article Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic ibm-research • 15 days ago • 86
view article Article MONET: Lowering the Barrier to World Class Image Generation Research jasperai • 19 days ago • 10
MONET - Massive Open Non-redundant, Enriched, Text-to-image Collection A curated, deduped & recaptioned open image–text dataset of 104.9M samples released under the Apache2.0 licence. https://huggingface.co/blog/jasperai/ • 4 items • Updated 19 days ago • 11
MiniCPM5 Collection A SOTA 1B on-device LLM, small yet powerful. • 11 items • Updated 21 days ago • 27
view article Article Why Open Models Are the Only Sustainable Way to Teach AI penelopegittos • 25 days ago • 8
Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation Paper • 2605.19833 • Published 28 days ago • 134
Scaling Properties of Continuous Diffusion Spoken Language Models Paper • 2604.24416 • Published Apr 27 • 1