DFlash Collection Block Diffusion for Flash Speculative Decoding • 13 items • Updated 4 days ago • 47
Can LLMs Learn to Reason Robustly under Noisy Supervision? Paper • 2604.03993 • Published 5 days ago • 22
The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning Paper • 2603.29025 • Published 10 days ago • 12
Meta-Harness: End-to-End Optimization of Model Harnesses Paper • 2603.28052 • Published 10 days ago • 16
AgentHazard: A Benchmark for Evaluating Harmful Behavior in Computer-Use Agents Paper • 2604.02947 • Published 7 days ago • 18
Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory Paper • 2604.01007 • Published 8 days ago • 29
LightThinker++: From Reasoning Compression to Memory Management Paper • 2604.03679 • Published 6 days ago • 28
HippoCamp: Benchmarking Contextual Agents on Personal Computers Paper • 2604.01221 • Published 8 days ago • 27
Reasoning Shift: How Context Silently Shortens LLM Reasoning Paper • 2604.01161 • Published 8 days ago • 29
Embarrassingly Simple Self-Distillation Improves Code Generation Paper • 2604.01193 • Published 8 days ago • 33
CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery Paper • 2604.01658 • Published 8 days ago • 51
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 8 days ago • 161
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published 4 days ago • 89
MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome Paper • 2603.28407 • Published 10 days ago • 68