Judgeval
Free1.0k GitHub stars
Platform & FrameworkAgnosticFile System
Overview
Judgeval is a continuous-improvement stack designed for agents, providing tools for monitoring and enhancing their performance. It is ideal for developers and researchers working on AI agents who seek to improve their capabilities through data-driven evaluations.