🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do about 1 month ago • 38
FINAL Bench World's First Functional Metacognition Benchmark. "Not how much AI knows — but whether it knows what it doesn't know, and can fix it." FINAL-Bench/Metacognitive Viewer • Updated Feb 27 • 100 • 1.01k • 77 Running Featured 45 Leaderboard - FINAL Bench 'Metacognitive' 🚀 45 Metacognitive
FINAL Bench World's First Functional Metacognition Benchmark. "Not how much AI knows — but whether it knows what it doesn't know, and can fix it." FINAL-Bench/Metacognitive Viewer • Updated Feb 27 • 100 • 1.01k • 77 Running Featured 45 Leaderboard - FINAL Bench 'Metacognitive' 🚀 45 Metacognitive