Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Project of MoE reward model

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

zyhang1998  authored a paper 2 days ago
Synthetic Sandbox for Training Machine Learning Engineering Agents
zhuokai  authored a paper 2 days ago
Synthetic Sandbox for Training Machine Learning Engineering Agents
zyhang1998  submitted a paper 3 days ago
Synthetic Sandbox for Training Machine Learning Engineering Agents
View all activity

Yuhang Zhou's profile pictureShengyi Qian's profile pictureZhuokai Zhao's profile pictureJing Zhu's profile pictureXiaoyu Liu's profile picturewave's profile picture

MoeReward 's models 6

MoeReward/rl_checkpoints

Updated Jun 27, 2025

MoeReward/lora_checkpoint

Updated Mar 30, 2025

MoeReward/reward_lora_qwen_1_5_base

Updated Mar 21, 2025 • 1

MoeReward/reward_qwen_1_5

14B • Updated Mar 17, 2025 • 2

MoeReward/reward_lora_qwen_1_5

Updated Mar 17, 2025 • 2

MoeReward/sft_full_param_qwen_1_5

14B • Updated Mar 16, 2025
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs