AgentStack
Back to directory

LLM as a Judge by IBM

Free
35 GitHub stars
Platform & FrameworkAgnosticFile System

Overview

This framework utilizes LLMs to automate the evaluation of Agentic AI, RAG, and Text2SQL at scale, serving as a proxy for human judgment. It is designed for developers and researchers looking to implement automated evaluation processes in their AI projects.

Visit resource
Connect LLM as a Judge by IBM to Local OS / File System | AgentStack