OMLX

Free

15.1k GitHub stars

Platform & FrameworkOpenAI AssistantsFile System

Overview

OMLX is an LLM inference server designed for Apple Silicon, featuring continuous batching and SSD caching. It is ideal for developers looking to manage LLMs efficiently from the macOS menu bar.

Visit resource