AgentStack
Back to directory

OptiLLM

Free
4.0k GitHub stars
Platform & FrameworkAgnosticFile System

Overview

OptiLLM is an optimizing inference proxy designed for large language models, enhancing their performance and efficiency. It is ideal for developers and researchers working with AI models who need to optimize inference processes.

Visit resource