LLM as a Judge by IBM
Free35 GitHub stars
Platform & FrameworkAgnosticFile System
Overview
This framework utilizes LLMs to automate the evaluation of Agentic AI, RAG, and Text2SQL at scale, serving as a proxy for human judgment. It is designed for developers and researchers looking to implement automated evaluation processes in their AI projects.