Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind Paper • 2604.11666 • Published 2 days ago • 3
Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization Paper • 2602.23008 • Published Feb 26 • 37
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated Feb 26 • 96
PRInTS: Reward Modeling for Long-Horizon Information Seeking Paper • 2511.19314 • Published Nov 24, 2025 • 8
One Life to Learn: Inferring Symbolic World Models for Stochastic Environments from Unguided Exploration Paper • 2510.12088 • Published Oct 14, 2025 • 5
The Well Collection A 15TB collection of physics simulation datasets. • 18 items • Updated Mar 24, 2025 • 49
Video-Skill-CoT: Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning Paper • 2506.03525 • Published Jun 4, 2025 • 6
Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems Paper • 2504.09763 • Published Apr 14, 2025 • 12
Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning Paper • 2503.05641 • Published Mar 7, 2025 • 2