payelb/UltraFeedback_openbmb_TinyLlama-1.1B_aligned_with_semantic_MARS_deberta_RM Updated 18 days ago
payelb/UltraFeedback_openbmb_TinyLlama-1.1B_aligned_with_semantic_MARS_RM_roberta_semantic_MARS_RM Updated 19 days ago
payelb/UltraFeedback_openbmb_roberta-large_1k_fixed_MARS_semantic_refined Text Classification • 0.4B • Updated 19 days ago • 46
payelb/PKUSafeRLHF_roberta-large_1k_fixed_MARS_semantic_refined Text Classification • 0.4B • Updated 19 days ago • 39
payelb/PKUSafeRLHF_TinyLlama-1.1B_aligned_with_semantic_MARS_RM_roberta_semantic_MARS_RM Updated 19 days ago
payelb/HHRLHF_roberta-large_1k_fixed_MARS_semantic_refined Text Classification • 0.4B • Updated 19 days ago • 46
payelb/HHRLHF_TinyLlama-1.1B_aligned_with_semantic_MARS_RM_roberta_semantic_MARS_RM Updated 19 days ago
payelb/UltraFeedback_openbmb_Llama-3.2-1B_aligned_with_baseline_roberta_RM_KLsafe Updated 20 days ago