Daniel van Strien's picture

Building on HF

Daniel van Strien PRO

davanstrien

huggingface

·

https://danielvanstrien.xyz/

AI & ML interests

Machine Learning Librarian

Recent Activity

updated a dataset about 2 hours ago

librarian-bots/arxiv-cs-papers-lance

updated a Space about 3 hours ago

davanstrien/benchmark-race

updated a dataset about 7 hours ago

librarian-bots/model_cards_with_metadata

View all activity

Organizations

upvoted an article 4 days ago

Article

MTEB Leaderboard: From a slow demo to feature-rich leaderboard

Samoed

•

4 days ago

• 21

upvoted 2 articles 6 days ago

Article

Any Custom Frontend with Gradio's Backend

ysharma, abidlabs

•

Apr 1

• 38

Article

Migrating Your GitHub CI to Hugging Face Jobs

abidlabs

•

7 days ago

• 9

upvoted a paper 9 days ago

Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses

Paper • 2606.02373 • Published 15 days ago • 53

upvoted an article 12 days ago

Article

Designing the hf CLI as an agent-optimized way to work with the Hub

celinah, Wauplin

•

12 days ago

• 56

upvoted a paper 13 days ago

Less Is More? When Dataset Context Hurts LLM-Generated Dataset Descriptions

Paper • 2606.02334 • Published 15 days ago • 1

upvoted an article 13 days ago

Article

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

+6

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, lvwerra, sergiopaniego

•

20 days ago

• 41

upvoted an article 15 days ago

Article

Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic

ibm-research

•

15 days ago

• 86

upvoted an article 19 days ago

Article

MONET: Lowering the Barrier to World Class Image Generation Research

jasperai

•

19 days ago

• 10

upvoted a collection 19 days ago

MONET - Massive Open Non-redundant, Enriched, Text-to-image

A curated, deduped & recaptioned open image–text dataset of 104.9M samples released under the Apache2.0 licence. https://huggingface.co/blog/jasperai/ • 4 items • Updated 19 days ago • 11

upvoted a collection 21 days ago

MiniCPM5

A SOTA 1B on-device LLM, small yet powerful. • 11 items • Updated 21 days ago • 27

upvoted an article 25 days ago

Article

Why Open Models Are the Only Sustainable Way to Teach AI

penelopegittos

•

25 days ago

• 8

upvoted a paper 25 days ago

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

Paper • 2605.19833 • Published 28 days ago • 134

upvoted a collection 26 days ago

Hy-MT2

混元翻译模型2.0版本 • 11 items • Updated 21 days ago • 43

upvoted an article 29 days ago

Article

The Open Agent Leaderboard

ibm-research

•

29 days ago

• 13

upvoted a collection about 1 month ago

OCR models

11 items • Updated 26 days ago • 13

upvoted a paper about 1 month ago

Scaling Properties of Continuous Diffusion Spoken Language Models

Paper • 2604.24416 • Published Apr 27 • 1

upvoted an article about 2 months ago

Article

Granite 4.1 LLMs: How They’re Built

ibm-granite

•

Apr 29

• 81

upvoted an article 2 months ago

Article

The PR you would have opened yourself

pcuenq, awni

•

Apr 16

• 72

upvoted a collection 2 months ago

Qwen3.6

4 items • Updated Apr 22 • 407