view article Article OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments +3 christian-washington, ajasuja, santosh-iima, lewtun, burtenshaw • Feb 12 • 32
AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents Paper • 2602.06855 • Published Feb 6 • 83
view article Article We Got Claude to Build CUDA Kernels and teach open models! +2 burtenshaw, evalstate, merve, pcuenq • Jan 28 • 156
TorchAO: PyTorch-Native Training-to-Serving Model Optimization Paper • 2507.16099 • Published Jul 21, 2025 • 7
Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs Paper • 2410.08806 • Published Oct 11, 2024 • 1
Compiler generated feedback for Large Language Models Paper • 2403.14714 • Published Mar 18, 2024 • 7
Priority Sampling of Large Language Models for Compilers Paper • 2402.18734 • Published Feb 28, 2024 • 19
EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine Paper • 2206.10558 • Published Jun 21, 2022 • 2
view article Article Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training +3 smohammadi, siro1, winglian, marcsun13, djsaunde • Aug 8, 2025 • 98
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs Paper • 2507.05687 • Published Jul 8, 2025 • 31
view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels drbh, danieldk • Aug 18, 2025 • 98