arxiv:2604.20817
Deqing Fu PRO
deqing
AI & ML interests
None yet
Recent Activity
updated a model about 16 hours ago
deqing/convergent-llama-300M-muon-6digit-addition_6digit_custom6 upvoted a paper about 16 hours ago
Value-Aware Stochastic KV Cache Eviction for Reasoning Models submitted a paper about 17 hours ago
Value-Aware Stochastic KV Cache Eviction for Reasoning Models