Pull requests: ROCm/aiter

Pull requests list

Revert "add gen_fake for 4 gemm operators"
#1746 opened Dec 27, 2025 by azaidy
[Triton] skip_reduce for gemm_afp4wfp4_preshuffle
#1745 opened Dec 27, 2025 by k50112113
Causal blockwise fmha v3
#1744 opened Dec 26, 2025 by antsaukk Draft
1 task
moe tunner fix
#1743 opened Dec 26, 2025 by lalala-sh
1 task done
add groupnorm cuda kernel
#1742 opened Dec 26, 2025 by LiuYinfeng01
1 task
Fix nt
#1738 opened Dec 25, 2025 by Zzz9990 Draft
1 task
Wjx/ck tile moe merge
#1737 opened Dec 25, 2025 by Zzz9990 Draft
1 task
tmp test: a4w4 gemm tune
#1734 opened Dec 25, 2025 by minmengdie
1 task
[MLA] Update 950 MLA FP8 Kernel with Optimized V3 Pipeline
#1733 opened Dec 25, 2025 by liyjiang
1 task
CI: Optimize and collect op_tests summaries
#1731 opened Dec 25, 2025 by gyohuangxin Draft
2 tasks
[MLA] nhead64 and nhead32
#1730 opened Dec 25, 2025 by Zzz9990
1 task
[refractor] mha fwd api refractor
#1719 opened Dec 24, 2025 by minmengdie
1 task
Lxx/dev/gfx950 fwd v3 hd192x128 adjust dispatch
#1718 opened Dec 24, 2025 by shay-li77
1 task
add fake for MLA RoPE operator
#1714 opened Dec 23, 2025 by mqhc2020
1 task
[Fix] Add mutates_args to flash_attn_backward to fix AOTAutograd DDP …
#1712 opened Dec 23, 2025 by tomjen12
1 task done
rd only env
#1704 opened Dec 22, 2025 by zufayu
1 task
ahmed-bsod/gemm a8w8 gluon
#1684 opened Dec 18, 2025 by ahmed-bsod
[Draft] [Preview] Support gfx1201
#1681 opened Dec 18, 2025 by tjtanaa Draft
1 task
add dealing with memory access fault in Mp tuner
#1680 opened Dec 18, 2025 by yzhou103
1 task
use a16w4 for a4w4 decode.
#1664 opened Dec 17, 2025 by lalala-sh
1 task
[WIP] [TRITON] MHA reverse pid order
#1659 opened Dec 17, 2025 by cagrikymk Draft
fixed a bug in rmsnorm quantization fusion kernel and add a unit test
#1658 opened Dec 16, 2025 by scxiao
1 task done
[Triton] Shaoclee/triton fp4 gemm cat preshuffle
#1656 opened Dec 16, 2025 by k50112113