-
Notifications
You must be signed in to change notification settings - Fork 697
Pull requests: InternLM/lmdeploy
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix: detect tool open tag within reasoning content in _consume_reasoning
#4580
opened May 11, 2026 by
dbsd11
Loading…
3 tasks done
[security] fix(proxy): require auth for node management
#4579
opened May 11, 2026 by
Hinotoi-agent
Loading…
5 of 9 tasks
[Improve]: Drain queues when sleep engine
improvement
#4577
opened May 9, 2026 by
RunningLeon
Collaborator
Loading…
Fix health latency under concurrent VL request preparation
Bug:P0
#4570
opened May 7, 2026 by
CUHKSZzxy
Collaborator
Loading…
refactor(turbomind): consolidate CUDA error handling and add manual stacktracing
improvement
#4565
opened Apr 30, 2026 by
lzhangzz
Collaborator
Loading…
[Feature] Add guided decoding support for speculative decoding
enhancement
New feature or request
#4559
opened Apr 28, 2026 by
windreamer
Collaborator
Loading…
4 tasks done
[Feature] Implement New feature or request
/v1/embeddings endpoint for OpenAI-compatible API
enhancement
#4550
opened Apr 23, 2026 by
ZhijunLStudio
Contributor
Loading…
2 of 4 tasks
Test: update sleep/wakeup and abort scenarios
#4528
opened Apr 15, 2026 by
littlegy
Contributor
Loading…
style: add autopep8 pre-commit hook and apply PEP 8 formatting fixes
#4524
opened Apr 14, 2026 by
windreamer
Collaborator
Loading…
make fp8 model quantized by llm-compressor can be inferenced in turbomind
enhancement
New feature or request
#4509
opened Apr 8, 2026 by
43758726
Collaborator
Loading…
Integrate deep-ep nccl backend
enhancement
New feature or request
#4477
opened Mar 27, 2026 by
irexyc
Collaborator
Loading…
feat: Turbomind linear gdn prefix caching
enhancement
New feature or request
#4465
opened Mar 25, 2026 by
lapy
Contributor
Loading…
[Feature] Support n parameter in /v1/chat/completions and /v1/completions
improvement
#4419
opened Mar 17, 2026 by
ziyangliu-666
Loading…
Add model deployment best practice section in user guide
documentation
Improvements or additions to documentation
Fix Structured Output for GPT-OSS Models
Bug:P1
#4386
opened Mar 2, 2026 by
windreamer
Collaborator
•
Draft
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.