AgentStack
Back to directory

Metal Flash Attention

Free
602 GitHub stars
Platform & FrameworkAgnosticFile System

Overview

Metal Flash Attention is a high-performance implementation of the FlashAttention algorithm optimized for Metal. It is designed for developers and researchers working with AI models that require efficient attention mechanisms.

Visit resource