AgentStack
Back to directory

Terminal Bench Science

Free
106 GitHub stars
Learning ResourceAgnosticFile System

Overview

Terminal Bench Science evaluates AI agents on complex real-world scientific workflows in the terminal. This resource is ideal for researchers and developers looking to understand the application of AI in scientific contexts.

Visit resource