remove lightllm_kernel by hiworldwzj · Pull Request #1296 · ModelTC/LightLLM

hiworldwzj · 2026-05-08T03:15:24Z

No description provided.

gemini-code-assist

Code Review

This pull request removes the dependency on the lightllm_kernel CUDA/C++ extension across the codebase, including the attention, MoE, and quantization modules. Specifically, it deletes the light_ops utility, removes the cuda_grouped_topk implementation in favor of Triton-based kernels, and updates unit tests to focus on Triton performance and consistency rather than comparing against the removed CUDA baselines. I have no feedback to provide.

remove lightllm_kernel

6b84b77

gemini-code-assist Bot reviewed May 8, 2026

View reviewed changes

fix

d960a19

hiworldwzj merged commit cc7e8f4 into main May 8, 2026
1 check passed

hiworldwzj deleted the wzj_dev branch May 8, 2026 06:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

remove lightllm_kernel#1296

remove lightllm_kernel#1296
hiworldwzj merged 2 commits intomainfrom
wzj_dev

hiworldwzj commented May 8, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

hiworldwzj commented May 8, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant