GoLongRL
-
GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment
Paper • 2605.19577 • Published • 58 -
Kwai-Klear/GoLongRL-30B-A3B
Text Generation • 31B • Updated • 449 • 11 -
Kwai-Klear/GoLongRL-4B
Text Generation • 4B • Updated • 225 • • 4 -
Kwai-Klear/GoLongRL
Viewer • Updated • 23k • 1.1k • 23