Article Unlocking asynchronicity in continuous batching +1 ror, pcuenq, ariG23498 • 5 days ago • 46
Flow-OPD: On-Policy Distillation for Flow Matching Models Paper • 2605.08063 • Published 11 days ago • 96
Mean Mode Screaming: Mean–Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published 12 days ago • 184
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer Paper • 2605.15178 • Published 5 days ago • 74
Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling Paper • 2605.13301 • Published 6 days ago • 149
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling Paper • 2605.08083 • Published 11 days ago • 65
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents Paper • 2604.26752 • Published 20 days ago • 106
Heterogeneous Scientific Foundation Model Collaboration Paper • 2604.27351 • Published 19 days ago • 215
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 16 days ago • 157
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published 15 days ago • 330
Accelerating RL Post-Training Rollouts via System-Integrated Speculative Decoding Paper • 2604.26779 • Published 20 days ago • 13
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond Paper • 2604.22748 • Published 25 days ago • 226