view article Article Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models nvidia • Dec 15, 2025 • 111
view article Article Why You Should Care About Partial Differential Equations (PDEs) hugging-science • Dec 12, 2025 • 45
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper • 2511.18538 • Published Nov 23, 2025 • 304
view article Article Open-source DeepResearch – Freeing our search agents +3 m-ric, albertvillanova, merve, thomwolf, clefourrier • Feb 4, 2025 • 1.32k
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning Paper • 2510.25992 • Published Oct 29, 2025 • 48
view article Article Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs +7 wenhuach, Haihao, weiweiz1, n1ck-guo, isaacmac, kding1, IlyasMoutawwakil, marcsun13, medmekk • Apr 29, 2025 • 44
view article Article Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment NormalUhr • Feb 11, 2025 • 119
view article Article From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate +2 mirinflim, aldopareja, muellerzr, stas • Jun 13, 2024 • 62