Add support to setting known_block_size by ruanjm · Pull Request #267 · ROCm/FlyDSL

ruanjm · 2026-03-23T08:09:18Z

This PR is for for supporting workgroup size 512. The AMDGPU backend defaults to max_flat_workgroup_size = 256.

Copilot

Pull request overview

Adds an optional known_block_size plumbing path from the @kernel decorator through KernelFunction to the emitted gpu.func, enabling AMDGPU backends to derive max_flat_workgroup_size for larger workgroups (e.g., 512 threads).

Changes:

Add known_block_size parameter to create_gpu_func() and forward it to gpu.GPUFuncOp.
Extend KernelFunction / @kernel decorator API to accept and store known_block_size.
Pass stored known_block_size when emitting the kernel gpu.func and document usage in the decorator docstring.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

python/flydsl/compiler/kernel_function.py

coderfeli · 2026-03-23T13:13:01Z

@ruanjm CI failed.

ruanjm · 2026-03-24T05:46:30Z

@ruanjm CI failed.

fixed.

Add support to setting known_block_size

2f226d1

Copilot AI review requested due to automatic review settings March 23, 2026 08:09

ruanjm added the enhancement New feature or request label Mar 23, 2026

Copilot started reviewing on behalf of ruanjm March 23, 2026 08:10 View session

Copilot AI reviewed Mar 23, 2026

View reviewed changes

python/flydsl/compiler/kernel_function.py Show resolved Hide resolved

python/flydsl/compiler/kernel_function.py Outdated Show resolved Hide resolved

python/flydsl/compiler/kernel_function.py Show resolved Hide resolved

ruanjm and others added 2 commits March 23, 2026 08:43

fix issue raised by Copilot.

bf4b80b

Merge branch 'main' into jruan/known_block_size

9d77f04

fix tests

5af360e

ruanjm force-pushed the jruan/known_block_size branch from 42d953a to 5af360e Compare March 24, 2026 03:57

coderfeli approved these changes Mar 25, 2026

View reviewed changes

coderfeli merged commit 4633681 into main Mar 25, 2026
8 checks passed

ruanjm deleted the jruan/known_block_size branch March 25, 2026 02:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support to setting known_block_size#267

Add support to setting known_block_size#267
coderfeli merged 4 commits intomainfrom
jruan/known_block_size

ruanjm commented Mar 23, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderfeli commented Mar 23, 2026

Uh oh!

ruanjm commented Mar 24, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

ruanjm commented Mar 23, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderfeli commented Mar 23, 2026

Uh oh!

ruanjm commented Mar 24, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants