arxiv:2504.15477
Rohan Surana
rohan2810
AI & ML interests
None yet
Recent Activity
upvoted a paper about 7 hours ago
MASS-DPO: Multi-negative Active Sample Selection for Direct Policy Optimization submitted a paper 1 day ago
F-GRPO: Factorized Group-Relative Policy Optimization for Unified Candidate Generation and RankingOrganizations
None yet