LLM as a Judge by IBM

Free

35 GitHub stars

Platform & FrameworkAgnosticFile System

Overview

This framework utilizes LLMs to automate the evaluation of Agentic AI, RAG, and Text2SQL at scale, serving as a proxy for human judgment. It is designed for developers and researchers looking to implement automated evaluation processes in their AI projects.

Visit resource