AgentStack

AutoGPTQ

Free
5.1k GitHub stars
Platform & Framework, Agnostic, File System

Overview

AutoGPTQ is a quantization package for large language models that exposes a simple API built on the GPTQ algorithm. It is aimed at developers and researchers who want to shrink LLM weights to lower precision for more efficient inference and deployment.
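
To give a rough sense of what weight quantization means, the sketch below maps a small group of floating-point weights to 4-bit integer codes and back. It is only an illustration under stated assumptions: it uses plain round-to-nearest quantization, not the GPTQ algorithm (which additionally adjusts the remaining weights to compensate for quantization error), and the names `quantize_group` and `dequantize_group` are hypothetical helpers, not part of AutoGPTQ's API.

```python
# Illustrative only: plain round-to-nearest quantization of one weight group.
# This is NOT GPTQ and NOT AutoGPTQ's API; the function names are hypothetical.

def quantize_group(weights, bits=4):
    """Map a list of floats to integer codes in [0, 2**bits - 1]."""
    levels = (1 << bits) - 1              # 15 for 4-bit codes
    w_min, w_max = min(weights), max(weights)
    scale = (w_max - w_min) / levels
    if scale == 0.0:                      # constant group: any non-zero scale works
        scale = 1.0
    codes = []
    for w in weights:
        code = round((w - w_min) / scale)
        codes.append(min(levels, max(0, code)))  # clamp against float drift
    return codes, scale, w_min


def dequantize_group(codes, scale, w_min):
    """Reconstruct approximate floats from integer codes."""
    return [c * scale + w_min for c in codes]


if __name__ == "__main__":
    weights = [-0.8, -0.1, 0.0, 0.3, 0.75]
    codes, scale, w_min = quantize_group(weights, bits=4)
    restored = dequantize_group(codes, scale, w_min)
    worst = max(abs(a - b) for a, b in zip(weights, restored))
    print("codes:", codes)
    print("worst-case reconstruction error:", worst)
```

Each weight is stored as a small integer plus a shared scale and offset per group, which is where the memory saving comes from; the reconstruction error of this simple scheme is bounded by half the scale.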
