duongntd2/qwen3vl_4b_traffic_reasoning_localize_v2 Image-Text-to-Text • 4B • Updated Nov 18, 2025 • 1
Structured Distillation of Web Agent Capabilities Enables Generalization Paper • 2604.07776 • Published Apr 9 • 23
A3: Agent-as-Annotators Collection Models and data from "Structured Distillation of Web Agent Capabilities Enables Generalization" (arXiv:2604.07776) • 6 items • Updated Apr 14 • 1
Running Agents Featured 15 PTS Visualizer 🔍 15 Visualize pivotal tokens and thought anchors in language models
Lance: Unified Multimodal Modeling by Multi-Task Synergy Paper • 2605.18678 • Published 12 days ago • 75
Fast and Effective On-policy Distillation from Reasoning Prefixes Paper • 2602.15260 • Published Feb 16 • 1
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe Paper • 2604.13016 • Published Apr 14 • 109