AgentStack
Back to directory

Judgeval

Free
1.0k GitHub stars
Platform & FrameworkAgnosticFile System

Overview

Judgeval is a continuous-improvement stack designed for agents, providing tools for monitoring and enhancing their performance. It is ideal for developers and researchers working on AI agents who seek to improve their capabilities through data-driven evaluations.

Visit resource