Claude Eval
Free12 GitHub stars
Agent ToolClaude CodeFile System
Overview
Claude Eval is an evaluation tool designed for assessing Claude Code using a simplified LLM-as-a-judge approach. It is ideal for developers and researchers looking to evaluate AI models effectively.