AutoGPTQ
Free5.1k GitHub stars
Platform & FrameworkAgnosticFile System
Overview
AutoGPTQ is an easy-to-use quantization package for large language models, providing user-friendly APIs based on the GPTQ algorithm. It is designed for developers and researchers looking to optimize LLMs for efficient inference and deployment.