junmingyang

jmyang

https://junming-yang.github.io/

junming-yang

AI & ML interests

LLM Alignment, VLM

Recent Activity

authored a paper 13 days ago

Preference Orchestrator: Prompt-Aware Multi-Objective Alignment for Large Language Models

authored a paper 13 days ago

SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks

upvoted a paper 13 days ago

SlimSearcher: Training Efficiency-Aware Web Agents via Adaptive Reward Gating

Organizations

None yet

authored 2 papers 13 days ago

Preference Orchestrator: Prompt-Aware Multi-Objective Alignment for Large Language Models

Paper • 2511.10656 • Published Nov 3, 2025

SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks

Paper • 2606.09669 • Published 14 days ago • 45

upvoted 2 papers 13 days ago

SlimSearcher: Training Efficiency-Aware Web Agents via Adaptive Reward Gating

Paper • 2606.07074 • Published 17 days ago • 12

SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks

Paper • 2606.09669 • Published 14 days ago • 45

upvoted a paper 21 days ago

SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search

Paper • 2605.29796 • Published 25 days ago • 25

upvoted a paper 27 days ago

CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents

Paper • 2605.25624 • Published 28 days ago • 34

liked a model about 2 months ago

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 14 days ago • 2.61M • 5k

upvoted a paper 3 months ago

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published Apr 6 • 204

updated a collection 4 months ago

Meta APO

Model of MetaAPO https://arxiv.org/abs/2509.23371 • 6 items • Updated Feb 28 • 2

upvoted a collection 4 months ago

Meta APO

Model of MetaAPO https://arxiv.org/abs/2509.23371 • 6 items • Updated Feb 28 • 2

updated a model 4 months ago

jmyang/MetaAPO-Qwen2.5-7B

0.5B • Updated Feb 28 • 4 • 1

published a model 4 months ago

jmyang/MetaAPO-Qwen2.5-7B

0.5B • Updated Feb 28 • 4 • 1

updated a model 4 months ago

jmyang/Qwen2.5-7B-rm

1B • Updated Feb 28 • 1

published a model 4 months ago

jmyang/Qwen2.5-7B-rm

1B • Updated Feb 28 • 1
