view article Article The Optimal Architecture for Small Language Models codelion • Dec 26, 2025 • 120
view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge NormalUhr • Feb 7, 2025 • 292
view article Article Ellora: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement codelion • Dec 3, 2025 • 14
view article Article Continuous batching from first principles +1 ror, ArthurZ, mcpotato • Nov 25, 2025 • 379
view article Article Supercharge your OCR Pipelines with Open Models +5 merve, ariG23498, davanstrien, hynky, andito, reach-vb, pcuenq • Oct 21, 2025 • 312