User problem
- Unfused expert GEMMs and cast functions
- Unfused activation function
Desired outcome
Enable all of the following features in the group linear module:
- MXFP8 fusion: GroupGEMM + (d)SwiGLU + quant + swizzle (TE/#2769, MCore#3971, Mbridge#2841)
- Grouped MXFP8 quantization (TE/#2769, MCore#3971)
- Use cuBLAS fused GEMMs for unfused group GEMM cases (TE/#2769, MCore#3971)
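For context, the unfused forward path that the fusion above would collapse into one kernel can be sketched roughly as below. This is a NumPy illustration only, not TE or MCore code: the function names, the gate/up column layout, the block size of 32, and the power-of-two scale rule (E4M3 max exponent taken as 8) are all assumptions made for the sketch. The backward (dSwiGLU) pass, the actual FP8 cast, and the scale swizzle are omitted.

```python
import numpy as np

def silu(x):
    return x / (1.0 + np.exp(-x))

def grouped_gemm(x, weights, tokens_per_expert):
    """One GEMM per expert over contiguous row slices of x (hypothetical helper)."""
    outs, start = [], 0
    for w, n in zip(weights, tokens_per_expert):
        outs.append(x[start:start + n] @ w)
        start += n
    return np.concatenate(outs, axis=0)

def swiglu(h):
    """Assumes the gate half and the up half are concatenated on the last axis."""
    gate, up = np.split(h, 2, axis=-1)
    return silu(gate) * up

def block_scales(y, block=32):
    """Assumed MX-style rule: one power-of-two scale per block of 32 values."""
    blocks = y.reshape(y.shape[0], -1, block)
    amax = np.abs(blocks).max(axis=-1)
    amax = np.where(amax == 0, 1.0, amax)  # avoid log2(0) for all-zero blocks
    return 2.0 ** (np.floor(np.log2(amax)) - 8)

# Three separate steps today; the request is to run them as one fused kernel.
rng = np.random.default_rng(0)
x = rng.standard_normal((5, 4))                            # 5 tokens, hidden size 4
weights = [rng.standard_normal((4, 64)) for _ in range(2)]  # 2 experts
tokens_per_expert = [2, 3]

h = grouped_gemm(x, weights, tokens_per_expert)  # (5, 64)
y = swiglu(h)                                    # (5, 32)
scales = block_scales(y)                         # (5, 1)
```

Each step here reads and writes the full activation tensor, which is the memory traffic the fused path is meant to avoid.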
Alternatives or workarounds considered
No response
Affected area
area:perf
Urgency / use case
Important but not blocking
Environment
No response