TokenSpeed
Free1.2k GitHub stars
Platform & FrameworkAgnosticFile System
Overview
TokenSpeed is a high-performance inference engine designed for large language models, enabling rapid processing and response times. It is ideal for developers and researchers looking to optimize their AI applications with efficient LLM capabilities.