u/adyaman

SageAttention v2 native port running on RDNA4
▲ 34 r/ROCm

SageAttention v2 native port running on RDNA4

https://github.com/thu-ml/SageAttention/pull/368

Please try it out and let me know here or on the PR if you face any problems or if you see any speedups. Thanks and enjoy!

Currently only tested on Windows, not linux, but it should work on Linux with hopefully no/minimal changes.

On my side on Windows with a 9070XT when using comfyui with `--use-sage-attention` and running WAN2.1 1.4b (fp8_e4m3fn weight dtype), I'm seeing about a 42% speedup on the diffusion step times vs. `--use-pytorch-cross-attention`.

u/adyaman — 9 days ago