EvalEval Coalition

community

https://evalevalai.com/

evaluatingevals

Activity Feed Request to join this org

AI & ML interests

We’re building a research coalition on evaluating evaluations (EvalEval)! Hosted by Hugging Face, University of Edinburgh, and EleutherAI.

Recent Activity

j-chim updated a dataset about 3 hours ago

evaleval/entity-registry-data

evijit updated a dataset about 7 hours ago

evaleval/card_backend

evijit updated a bucket about 10 hours ago

evaleval/general-eval-card-storage

View all activity

Papers

Every Eval Ever: A Unifying Schema and Community Repository for AI Evaluation Results

Evaluation Cards: An Interpretive Layer for AI Evaluation Reporting

View all Papers

Articles

Introducing Evaluation Cards: A Live Interpretive Layer for Understanding the AI Evaluations Ecosystem

AI evals are becoming the new compute bottleneck

evaleval 's Spaces 6

Eval Cards

Standardized evaluation cards for AI models and benchmarks

Best Model Finder

Agentic search over the EEE datastore for your use case

Every Eval Ever Schema Review

Summarize schema discussions and add comments

eval-card-registry

README

BenchmarkCard Webhook

Receive and process benchmark data via webhook