-
LNS-Madam: Low-Precision Training in Logarithmic Number System using Multiplicative Weight Update
Paper • 2106.13914 • Published • 1 -
HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges
Paper • 2506.15196 • Published • 3 -
Ascend HiFloat8 Format for Deep Learning
Paper • 2409.16626 • Published • 1 -
Recipes for Pre-training LLMs with MXFP8
Paper • 2506.08027 • Published • 1
zhangwenbin
ExceedZhang
AI & ML interests
None yet
Recent Activity
liked a dataset about 1 hour ago
databricks/officeqa upvoted an article about 1 hour ago
Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic upvoted an article about 1 hour ago
Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrainsOrganizations
None yet