Assisted Generation: a new direction toward low-latency text generation (May 11, 2023)
A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes (Aug 17, 2022)
The Smol Training Playbook 📚: The secrets to building world-class LLMs