AgentStack
Back to directory

Claude Eval

Free
12 GitHub stars
Agent ToolClaude CodeFile System

Overview

Claude Eval is an evaluation tool designed for assessing Claude Code using a simplified LLM-as-a-judge approach. It is ideal for developers and researchers looking to evaluate AI models effectively.

Visit resource