OptiLLM
Free4.0k GitHub stars
Platform & FrameworkAgnosticFile System
Overview
OptiLLM is an optimizing inference proxy designed for large language models, enhancing their performance and efficiency. It is ideal for developers and researchers working with AI models who need to optimize inference processes.