Agent Bench
Free2 GitHub stars
Agent ToolAgnosticFile System
Overview
Agent Bench is an industrial-grade benchmarking engine designed for AI agents, allowing users to define test scenarios in YAML and run high-performance evaluations. It is ideal for developers and researchers looking to assess the performance of various AI agent stacks.