vLLM Ascend Plugin
Free, 2.1k GitHub stars
Integration, Agnostic, File System
Overview
This community-maintained hardware plugin lets vLLM run inference efficiently on Ascend hardware. It is aimed at developers and organizations that serve large language models and want to do so on Ascend devices.