AgentStack
Back to directory

LLM-D

Free
3.3k GitHub stars
Platform & FrameworkAgnosticFile System

Overview

LLM-D enables state-of-the-art inference performance using modern accelerators on Kubernetes. It is designed for developers and organizations looking to optimize their machine learning model serving capabilities in a distributed environment.

Visit resource