Juechu Dong

Flex Attention: A Programming Model for Generating Optimized Attention Kernels

Dec 07, 2024
Via arXiv