AgentStack

One Eval

Free
137 GitHub stars
Agent Tool, Agnostic, File System

Overview

One Eval is an automated system that uses agents to evaluate large language models (LLMs). It is aimed at data scientists and AI researchers who want to benchmark LLMs and analyze their performance.
