Arbitrage: Efficient Reasoning via Advantage-Aware Speculation Paper • 2512.05033 • Published Dec 4, 2025 • 17
Let's (not) just put things in Context: Test-Time Training for Long-Context LLMs Paper • 2512.13898 • Published Dec 15, 2025 • 2
$V_1$: Unifying Generation and Self-Verification for Parallel Reasoners Paper • 2603.04304 • Published Mar 4 • 14
Reward Under Attack: Analyzing the Robustness and Hackability of Process Reward Models Paper • 2603.06621 • Published Feb 20
Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution Paper • 2604.07725 • Published Apr 10
Learning, Fast and Slow: Towards LLMs That Adapt Continually Paper • 2605.12484 • Published 4 days ago • 14