AgentStack

TolokaForge

Free
8 GitHub stars
Agent Tool, Agnostic, Web Scraper

Overview

TolokaForge is a universal benchmarking harness for evaluating large language models (LLMs) on tasks that include tool use, browser interaction, and coding. It is aimed at researchers and developers who want to assess LLM performance across these scenarios.
