TolokaForge
Free8 GitHub stars
Agent ToolAgnosticWeb Scraper
Overview
TolokaForge is a universal benchmarking harness designed for evaluating large language models across various tasks including tool use, browser interactions, and coding. It is ideal for researchers and developers looking to assess the performance of LLMs in diverse scenarios.