Skip to content

remove lightllm_kernel#1296

Merged
hiworldwzj merged 2 commits intomainfrom
wzj_dev
May 8, 2026
Merged

remove lightllm_kernel#1296
hiworldwzj merged 2 commits intomainfrom
wzj_dev

Conversation

@hiworldwzj
Copy link
Copy Markdown
Collaborator

No description provided.

Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request removes the dependency on the lightllm_kernel CUDA/C++ extension across the codebase, including the attention, MoE, and quantization modules. Specifically, it deletes the light_ops utility, removes the cuda_grouped_topk implementation in favor of Triton-based kernels, and updates unit tests to focus on Triton performance and consistency rather than comparing against the removed CUDA baselines. I have no feedback to provide.

@hiworldwzj hiworldwzj merged commit cc7e8f4 into main May 8, 2026
1 check passed
@hiworldwzj hiworldwzj deleted the wzj_dev branch May 8, 2026 06:25
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant