-
The Ultra-Scale Playbook
๐3.87kThe ultimate guide to training LLM on large GPU Clusters
-
The Smol Training Playbook
๐3.2kThe secrets to building world-class LLMs
-
FineWeb: decanting the web for the finest text data at scale
๐ท1.35kExplore and download the FineWeb webโscale text dataset
-
Unlocking On-Policy Distillation for Any Model Family
๐109Visualize onโpolicy distillation token alignment
Aditya Bhosale
croeasusking
ยท
AI & ML interests
None yet
Recent Activity
updated a collection 7 days ago
HF Books liked a Space 7 days ago
dlouapre/eiffel-tower-llama updated a collection 22 days ago
HF BooksOrganizations
None yet