GLM-5.1 / .eval_results /swe_bench_pro.yaml
ZHANGYUXUAN-zR's picture
Update .eval_results/swe_bench_pro.yaml (#10)
dd43008
- dataset:
id: ScaleAI/SWE-bench_Pro
task_id: SWE_Bench_Pro
value: 58.4
source:
url: https://huggingface.co/zai-org/GLM-5.1
name: Model Card
notes: high reasoning