Chenggang Zhao
LyricZ
AI & ML interests
Building efficient machine learning systems.
Recent Activity
new activity about 15 hours ago
deepseek-ai/DeepSeek-V4-Pro:关于 "Observations and Proposals" 中激活函数建议的疑问:去掉 gate projection 为何能放宽 EP 带宽要求? liked a model about 22 hours ago
deepseek-ai/DeepSeek-V4-Pro authored a paper 4 months ago
mHC: Manifold-Constrained Hyper-Connections