LLM pretraining datasets Collection A collection of datasets for LLM pretraining • 9 items • Updated May 5, 2025 • 19
Running on CPU Upgrade Featured 3.18k The Smol Training Playbook The secrets to building world-class LLMs
Running 105 Unlocking On-Policy Distillation for Any Model Family Visualize on-policy distillation for any model family
[lecture artifacts] aligning open language models Collection artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin • 63 items • Updated Apr 17, 2024 • 58
Running Agents Featured 253 Jupyter Agent 2 Generate Jupyter notebooks from natural language tasks
laion/CLIP-ViT-B-32-laion2B-s34B-b79K Zero-Shot Image Classification • 0.2B • Updated Jan 22, 2025 • 3.33M • 140