MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens Paper • 2603.23516 • Published Mar 6 • 46
How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings Paper • 2604.04323 • Published 3 days ago • 28
Xpertbench: Expert Level Tasks with Rubrics-Based Evaluation Paper • 2604.02368 • Published 13 days ago • 7
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published 20 days ago • 326
Meta-Reinforcement Learning with Self-Reflection for Agentic Search Paper • 2603.11327 • Published 28 days ago • 9
ActionParty: Multi-Subject Action Binding in Generative Video Games Paper • 2604.02330 • Published 7 days ago • 6
Meta-Harness: End-to-End Optimization of Model Harnesses Paper • 2603.28052 • Published 10 days ago • 16
FlashSampling: Fast and Memory-Efficient Exact Sampling Paper • 2603.15854 • Published 24 days ago • 9
LookaheadKV: Fast and Accurate KV Cache Eviction by Glimpsing into the Future without Generation Paper • 2603.10899 • Published 29 days ago • 7
EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models Paper • 2603.12252 • Published 28 days ago • 12
Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion Paper • 2603.06577 • Published Mar 6 • 48
Lost in Backpropagation: The LM Head is a Gradient Bottleneck Paper • 2603.10145 • Published 30 days ago • 12
Adaptive Loops and Memory in Transformers: Think Harder or Know More? Paper • 2603.08391 • Published 29 days ago • 1