Chenzehao's picture

4 4

Chenzehao

beichenhang

·

AI & ML interests

None yet

Recent Activity

liked a model about 21 hours ago

tencent/Hy-MT2-1.8B

liked a model 5 days ago

openbmb/MiniCPM-V-4.6

upvoted a paper 7 days ago

Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation

View all activity

Organizations

None yet

upvoted a paper 7 days ago

Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation

Paper • 2605.11739 • Published 13 days ago • 58

upvoted a paper about 2 months ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 351

upvoted 2 papers 4 months ago

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published Feb 9 • 290

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published Jan 13 • 158