Metal Flash Attention
Free · 602 GitHub stars
Platform & Framework · Agnostic · File System
Overview
Metal Flash Attention is a high-performance implementation of the FlashAttention algorithm optimized for Metal, Apple's GPU programming framework. It is aimed at developers and researchers whose AI models need efficient attention computation on Apple hardware.
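For readers unfamiliar with FlashAttention, its core idea is to process keys and values in blocks while maintaining a running softmax, so the full attention score matrix never has to be stored. The sketch below illustrates that idea for a single query row in plain Python on the CPU. It is not Metal Flash Attention's API or its GPU kernel; the function names (`naive_attention`, `tiled_attention`) and the `block_size` parameter are hypothetical and chosen only for this illustration.

```python
import math


def _scaled_scores(q, keys):
    # Dot product of the query with each key, scaled by 1/sqrt(d).
    scale = 1.0 / math.sqrt(len(q))
    return [scale * sum(a * b for a, b in zip(q, k)) for k in keys]


def naive_attention(q, keys, values):
    # Reference version: computes all scores at once, then a full softmax.
    scores = _scaled_scores(q, keys)
    m = max(scores)
    weights = [math.exp(s - m) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    return [sum(w * v[d] for w, v in zip(weights, values)) / total
            for d in range(dim)]


def tiled_attention(q, keys, values, block_size=2):
    # Blockwise version: keeps only a running max, a running softmax
    # denominator, and an unnormalized output accumulator.
    dim = len(values[0])
    running_max = -math.inf
    denom = 0.0
    acc = [0.0] * dim
    for start in range(0, len(keys), block_size):
        k_block = keys[start:start + block_size]
        v_block = values[start:start + block_size]
        scores = _scaled_scores(q, k_block)
        new_max = max(running_max, max(scores))
        # Rescale earlier partial results to the new max
        # (exp(-inf) is 0.0, so the first block starts from zero).
        rescale = math.exp(running_max - new_max)
        denom *= rescale
        acc = [a * rescale for a in acc]
        for s, v in zip(scores, v_block):
            w = math.exp(s - new_max)
            denom += w
            for d in range(dim):
                acc[d] += w * v[d]
        running_max = new_max
    return [a / denom for a in acc]


if __name__ == "__main__":
    q = [0.1, -0.2, 0.3]
    keys = [[0.5, 0.1, -0.3], [0.2, 0.4, 0.6], [-0.7, 0.0, 0.9],
            [1.0, -1.0, 0.5], [0.3, 0.3, 0.3]]
    values = [[1.0, 0.0], [0.0, 1.0], [2.0, -1.0], [0.5, 0.5], [-1.0, 3.0]]
    print(naive_attention(q, keys, values))
    print(tiled_attention(q, keys, values, block_size=2))
```

Both functions should agree up to floating-point rounding. The blockwise version only ever holds one block of scores at a time, which is the property a GPU implementation exploits to keep working data in fast on-chip memory.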