arxiv:2604.07023
Lei Wang
demolei
AI & ML interests
LLMs
Recent Activity
upvoted a paper about 4 hours ago
Why Multi-Step Tool-Use Reinforcement Learning Collapses and How Supervisory Signals Fix It upvoted a paper about 4 hours ago
GUI vs. CLI: Execution Bottlenecks in Screen-Only and Skill-Mediated Computer-Use Agents upvoted a paper about 4 hours ago
The Verification Horizon: No Silver Bullet for Coding Agent Rewards