VenusBench-Mobile: A Challenging and User-Centric Benchmark for Mobile GUI Agents with Capability Diagnostics Paper • 2604.06182 • Published Feb 6 • 1
FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling Paper • 2604.06916 • Published 1 day ago • 5
INSPATIO-WORLD: A Real-Time 4D World Simulator via Spatiotemporal Autoregressive Modeling Paper • 2604.07209 • Published 1 day ago • 9
TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders Paper • 2604.07340 • Published 1 day ago • 7
MARS: Enabling Autoregressive Models Multi-Token Generation Paper • 2604.07023 • Published 1 day ago • 16
Expert-Choice Routing Enables Adaptive Computation in Diffusion Language Models Paper • 2604.01622 • Published 7 days ago • 3
DARE: Diffusion Large Language Models Alignment and Reinforcement Executor Paper • 2604.04215 • Published 4 days ago • 17
Scientific Graphics Program Synthesis via Dual Self-Consistency Reinforcement Learning Paper • 2604.06079 • Published 2 days ago • 3
Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents Paper • 2604.06132 • Published 2 days ago • 100
Paper Circle: An Open-source Multi-agent Research Discovery and Analysis Framework Paper • 2604.06170 • Published 2 days ago • 20
How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings Paper • 2604.04323 • Published 3 days ago • 28
Beyond Accuracy: Unveiling Inefficiency Patterns in Tool-Integrated Reasoning Paper • 2604.05404 • Published 2 days ago • 35
Scaling Teams or Scaling Time? Memory Enabled Lifelong Learning in LLM Multi-Agent Systems Paper • 2604.03295 • Published 13 days ago • 7
Synthetic Sandbox for Training Machine Learning Engineering Agents Paper • 2604.04872 • Published 3 days ago • 9
PLUME: Latent Reasoning Based Universal Multimodal Embedding Paper • 2604.02073 • Published 7 days ago • 12
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 7 days ago • 159
Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies Paper • 2604.00830 • Published 7 days ago • 11