VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding Paper • 2606.05259 • Published 7 days ago • 35
Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation Paper • 2606.06428 • Published 6 days ago • 23
Does Seeing More Mean Knowing More? Mono-Anchored Advantage Normalization for Multi-Source Visual Reasoning Paper • 2605.25437 • Published 16 days ago • 16
Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation Paper • 2603.12793 • Published Mar 13 • 38
Thinking with Drafting: Optical Decompression via Logical Reconstruction Paper • 2602.11731 • Published Feb 12 • 36 • 4
Imagination Helps Visual Reasoning, But Not Yet in Latent Space Paper • 2602.22766 • Published Feb 26 • 44
Imagination Helps Visual Reasoning, But Not Yet in Latent Space Paper • 2602.22766 • Published Feb 26 • 44
Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance Paper • 2601.14171 • Published Jan 20 • 53
MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization Paper • 2510.08540 • Published Oct 9, 2025 • 110
Running MCP 189 Recommend Similar Papers 🌖 189 Get similar paper recommendations from a Hugging Face link
AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning Paper • 2505.24298 • Published May 30, 2025 • 34
Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL Paper • 2505.15436 • Published May 21, 2025 • 2