Jiannan Xu
ansonxu
ยท
AI & ML interests
Trustworthy NLP, LLMs
Recent Activity
upvoted an article 25 days ago
From GRPO to DAPO and GSPO: What, Why, and How upvoted an article about 1 month ago
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge liked a dataset 8 months ago
Ar4ikov/civitai-sd-337k