AgentStack
Back to directory

Mind2Web-2 Benchmark

Free
111 GitHub stars
Learning ResourceAgnosticWeb Scraper

Overview

Mind2Web-2 is a benchmark designed to evaluate agentic search capabilities using agents as judges. It is ideal for researchers and developers interested in advancing the field of AI agents and their applications in web search.

Visit resource