Terminal Bench Science
Free106 GitHub stars
Learning ResourceAgnosticFile System
Overview
Terminal Bench Science evaluates AI agents on complex real-world scientific workflows in the terminal. This resource is ideal for researchers and developers looking to understand the application of AI in scientific contexts.