NanoLLM
Free371 GitHub stars
Platform & FrameworkAgnosticFile System
Overview
NanoLLM provides optimized local inference for large language models with a user-friendly API, making it suitable for developers working on multimodal AI applications. It supports quantization, vision and language models, and integrates with vector databases for enhanced performance.