Robert Mueller
bordeauxred
AI & ML interests
RL, RLHF, RLAIF, meta learning
Recent Activity
updated a model about 5 hours ago
GoodStartLabs/qwen3-8b-openspiel-mix8-selfplay-randmix-1000iter
published a model about 5 hours ago
GoodStartLabs/qwen3-8b-openspiel-mix8-selfplay-randmix-1000iter
updated a model about 5 hours ago
GoodStartLabs/qwen3p5-4b-reasoning-games-async-2048tok-curriculum-500iter