LLM-D
Free3.3k GitHub stars
Platform & FrameworkAgnosticFile System
Overview
LLM-D enables state-of-the-art inference performance using modern accelerators on Kubernetes. It is designed for developers and organizations looking to optimize their machine learning model serving capabilities in a distributed environment.