-
Notifications
You must be signed in to change notification settings - Fork 638
Pull requests: InternLM/lmdeploy
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Fix] fix a quant scale caculation bug in quant_utils.py
Bug:P2
#4233
opened Dec 24, 2025 by
43758726
Loading…
[ci] fix fail testcase and add generate testcase in pr test
#4231
opened Dec 23, 2025 by
zhulinJulia24
Loading…
Configurable max CTAs and NVLS usage for CUDA IPC communicator
improvement
#4227
opened Dec 20, 2025 by
lzhangzz
Loading…
fix: Fix Guided Decoding Crashes and State Corruption Issues
#4167
opened Nov 28, 2025 by
windreamer
Loading…
[WIP]: Support fp32 head for qwen and internlm models
#4160
opened Nov 27, 2025 by
RunningLeon
•
Draft
Add step_map to track token decoding order in DLLM
#4057
opened Oct 21, 2025 by
Auraithm
Loading…
4 tasks done
quant blocked fp8
enhancement
New feature or request
#4018
opened Sep 29, 2025 by
CUHKSZzxy
Loading…
4 of 5 tasks
add ppu quick start doc
documentation
Improvements or additions to documentation
#3841
opened Aug 14, 2025 by
guozixu2001
Loading…
fix: qwen3 nonstream parse with no or uncompleted think content
#3748
opened Jul 18, 2025 by
ywx217
Loading…
[ascend] support lora
enhancement
New feature or request
#3715
opened Jul 7, 2025 by
tangzhiyi11
•
Draft
Previous Next
ProTip!
Adding no:label will show everything without a label.