VLLM MLX
Free · 1.2k GitHub stars
Platform & Framework · Claude Code · File System
Overview
VLLM MLX is a server for running large language models (LLMs) and vision-language models (VLMs) on Apple Silicon. It supports continuous batching and multimodal input, and is aimed at developers and researchers who want to run these models on macOS.
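To make the term "continuous batching" concrete, here is a minimal toy simulation of the general idea: free batch slots are refilled before every decode step, so a waiting request does not have to wait for the whole batch to drain. This is a sketch of the scheduling concept only, not VLLM MLX's actual implementation. The names `Request`, `tokens_left`, and `continuous_batching` are hypothetical, and each request is assumed to need at least one decode step.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Request:
    name: str
    tokens_left: int  # decode steps still needed (hypothetical workload, >= 1)


def continuous_batching(requests, max_batch_size):
    """Simulate step-level scheduling.

    Returns a dict mapping each request name to the step at which it finished.
    """
    waiting = deque(requests)
    running = []
    finished = {}
    step = 0
    while waiting or running:
        # Refill free slots before every decode step,
        # not only once the whole batch has drained.
        while waiting and len(running) < max_batch_size:
            running.append(waiting.popleft())

        step += 1
        for req in running:
            req.tokens_left -= 1  # one token per running request per step
            if req.tokens_left == 0:
                finished[req.name] = step

        # Completed requests leave immediately and free their slots.
        running = [r for r in running if r.tokens_left > 0]
    return finished


if __name__ == "__main__":
    reqs = [Request("a", 2), Request("b", 5), Request("c", 3)]
    print(continuous_batching(reqs, max_batch_size=2))
```

In this toy run, request "c" takes over the slot that "a" frees after step 2 instead of waiting for "b" to finish, which is the throughput benefit the feature name refers to.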