One Eval
Free, 137 GitHub stars
Agent Tool, Agnostic, File System
Overview
One Eval is an automated, agent-based system for evaluating large language models (LLMs). It is aimed at data scientists and AI researchers who need to benchmark and analyze LLM performance.