We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
34c9d76
There was an error while loading. Please reload this page.
CUDA: add attention sinks for tile and wmma (#15178) * CUDA: add attention sinks for tile and wmma * Review: formatting changes + remove syncthreads from tile + remove warp_reduce_max from wmma
1425f58
CUDA: attention sinks for mma FlashAttention (#15157)