Skip to content

Pull requests: InternLM/lmdeploy

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

lmdeploy support kernel block size
#4421 opened Mar 17, 2026 by Tsundoku958 Draft
fix inference crashed on v100 with qwen3.5-0.8b
#4420 opened Mar 17, 2026 by lvhan028 Loading…
fix test_hf_overrides for transformers>5
#4418 opened Mar 17, 2026 by grimoire Loading…
chore: add CLAUDE.md and Claude Code skills
#4413 opened Mar 16, 2026 by CUHKSZzxy Loading…
2 tasks
Qwen3.5-27B-AWQ Turbomind V100 optimization
#4412 opened Mar 14, 2026 by lapy Loading…
4 tasks done
[WIP] Support qwen3-omni
#4411 opened Mar 13, 2026 by CUHKSZzxy Draft
1 of 4 tasks
fix metrics Bug:P1
#4410 opened Mar 13, 2026 by CUHKSZzxy Loading…
[ci] add nightly docker build workflow
#4406 opened Mar 12, 2026 by zhulinJulia24 Loading…
support qwen3.5 on volta improvement
#4405 opened Mar 12, 2026 by grimoire Loading…
[Ascend] support qwen3.5 27B
#4395 opened Mar 4, 2026 by wanfengcxz Draft
Builtin mrope improvement
#4393 opened Mar 4, 2026 by grimoire Loading…
add tool and reasoning test
#4388 opened Mar 2, 2026 by littlegy Loading…
Fix Structured Output for GPT-OSS Models
#4386 opened Mar 2, 2026 by windreamer Loading…
bump version to v0.12.2
#4378 opened Feb 28, 2026 by lvhan028 Loading…
8 tasks done
Support video inputs
#4360 opened Feb 13, 2026 by CUHKSZzxy Loading…
3 of 5 tasks
Improve proxy server improvement
#4354 opened Feb 12, 2026 by lvhan028 Loading…
Support MiniMax-M2 in TurboMind engine enhancement New feature or request
#4343 opened Feb 10, 2026 by zh-nj Loading…
[WIP]Support torch compile
#4336 opened Feb 8, 2026 by grimoire Draft
ProTip! Follow long discussions with comments:>50.