arXiv AI recent: Every Eval Ever: A Unifying Schema and Community Repository for AI Evaluation Results
Researchers introduced Every Eval Ever, a shared schema and community-crowdsourced repository for AI evaluation results. The schema standardizes how evaluations are represented in a unifi...
The Every Eval Ever schema is source-agnostic and can ingest results from evaluation harnesses and papers. It has a companion instance-level schema and automatic converters from popular formats, evaluation harnesses, and leaderboards to the unified schema.
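
To make the idea of source-agnostic converters concrete, here is a minimal Python sketch of two differently shaped inputs being normalized into one record type. Everything in it is an assumption for illustration: the `UnifiedResult` fields, the `from_harness` and `from_leaderboard` helpers, and both input layouts are hypothetical and are not the actual Every Eval Ever schema or its converters.

```python
from dataclasses import dataclass, asdict
from typing import Any, Dict, List


@dataclass
class UnifiedResult:
    """Hypothetical unified record; field names are illustrative, not the real schema."""
    model: str
    benchmark: str
    metric: str
    score: float
    source: str  # where the number came from, e.g. "harness" or "leaderboard"


def from_harness(raw: Dict[str, Any]) -> List[UnifiedResult]:
    """Convert an assumed nested harness output: {"model": ..., "results": {task: {metric: value}}}."""
    records = []
    for task, metrics in raw["results"].items():
        for metric, value in metrics.items():
            records.append(
                UnifiedResult(raw["model"], task, metric, float(value), "harness")
            )
    return records


def from_leaderboard(row: Dict[str, Any]) -> UnifiedResult:
    """Convert an assumed flat leaderboard row with string-valued columns."""
    return UnifiedResult(
        row["Model"], row["Benchmark"], row["Metric"], float(row["Score"]), "leaderboard"
    )


if __name__ == "__main__":
    harness_output = {"model": "model-a", "results": {"task-1": {"accuracy": 0.5}}}
    leaderboard_row = {
        "Model": "model-a", "Benchmark": "task-2", "Metric": "accuracy", "Score": "71.2",
    }
    unified = from_harness(harness_output) + [from_leaderboard(leaderboard_row)]
    for record in unified:
        print(asdict(record))
```

Once both sources land in the same record type, downstream tooling can compare or aggregate results without knowing which harness or leaderboard produced them, which is the benefit a unified schema is meant to provide.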