Efficient CUDA implementations for Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention.
AndrewBoessen/cuda-native-sparse-attention
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|
| Name | Name | Last commit date | ||
|---|---|---|---|---|
Efficient CUDA implementations for Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention.