arxiv:2310.16944
Nathan Habib PRO
AI & ML interests
Evals
Recent Activity
new activity 2 days ago
stepfun-ai/Step-3.7-Flash:Add SWE-bench Pro evaluation result new activity 2 days ago
stepfun-ai/Step-3.7-Flash:Add HLE with tools evaluation result liked a model 2 days ago
stepfun-ai/Step-3.7-Flash