Running on CPU Upgrade Agents 76 La Leaderboard πΈ 76 Evaluate open LLMs in the languages of LATAM and Spain.
vectara/hallucination_evaluation_model Text Classification β’ 0.1B β’ Updated Oct 20, 2025 β’ 127k β’ 354
Running Featured 671 The Tokenizer Playground π 671 Experiment with and compare different tokenizers
Runtime error 16 Spanish LLM Benchmark Annotation with Argilla π· 16 Collaborative effort on Spanish ARC-C, HellaSwag, and MMLU
TinyLlama/TinyLlama-1.1B-Chat-v1.0 Text Generation β’ 1B β’ Updated Mar 17, 2024 β’ 2.26M β’ β’ 1.6k