From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning Paper • 2606.17682 • Published 10 days ago • 26
Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space Paper • 2510.12603 • Published Oct 14, 2025 • 1