issue/350 - add ChatGLM causal LM model support #353
Open
JoeZhang-0x000 wants to merge 2 commits into InfiniTensor:main from
Conversation
- Add GLM4 model config adapter (csrc/models/glm4/)
- Add partial rotary support (llama_utils.hpp, llama_attention.*); see the sketch below
- Add forward_naive + GLM4 post-norm support (llama_decoder_layer.*)
- Add RoPE algo selection for GLM4 (llama_model.cpp)
- Add GLM4 special model creation path (rank_worker.cpp)
- Register glm4 in config_factory.cpp and auto_config.py
- Add GLM4 weight remapping (gate_up_proj split) in modeling_utils.py
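The partial rotary change applies RoPE to only the leading rotary_dim channels of each head (GLM models typically use a partial rotary factor of 0.5) and passes the remaining channels through unchanged. A minimal Python sketch of that idea, assuming a [batch, seq, heads, head_dim] layout and interleaved even/odd pairing; the function name and shapes are illustrative, not the actual llama_utils.hpp code:

```python
import torch

def apply_partial_rotary(x: torch.Tensor, cos: torch.Tensor, sin: torch.Tensor,
                         rotary_dim: int) -> torch.Tensor:
    """Rotate only the first `rotary_dim` channels of each head.

    x:        [batch, seq, heads, head_dim]
    cos, sin: [seq, rotary_dim // 2], precomputed per position
    """
    x_rot, x_pass = x[..., :rotary_dim], x[..., rotary_dim:]
    x1, x2 = x_rot[..., 0::2], x_rot[..., 1::2]   # interleaved even/odd pairs
    cos = cos[None, :, None, :]                   # broadcast over batch and heads
    sin = sin[None, :, None, :]
    rotated = torch.stack((x1 * cos - x2 * sin,
                           x1 * sin + x2 * cos), dim=-1).flatten(-2)
    return torch.cat((rotated, x_pass), dim=-1)   # non-rotary channels untouched
```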
- Add ChatGLM model config adapter (csrc/models/chatglm/)
- Extend rank_worker.cpp condition to include chatglm
- Register chatglm in config_factory.cpp and auto_config.py
- Add ChatGLM weight remapping (key rename + QKV/FFN split) in modeling_utils.py; see the sketch below
- Update test_infer.py for ChatGLM tokenizer special token handling
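ChatGLM checkpoints keep attention in one fused query_key_value matrix and the MLP input in one fused dense_h_to_4h matrix (gate and up stacked for SwiGLU), so both have to be split before the weights fit LLaMA-style per-projection keys. A hedged sketch of such a split, assuming the ChatGLM2/3 key layout; the key names, split order, and helper signature are assumptions, not the PR's modeling_utils.py code:

```python
import torch

def remap_chatglm_layer(sd: dict, layer: int, num_heads: int,
                        num_kv_heads: int, head_dim: int) -> dict:
    """Split one ChatGLM layer's fused weights into per-projection keys."""
    out = {}
    prefix = f"transformer.encoder.layers.{layer}."
    # Fused QKV weight: [q_size + 2 * kv_size, hidden] rows, in q/k/v order.
    qkv = sd[prefix + "self_attention.query_key_value.weight"]
    q_size, kv_size = num_heads * head_dim, num_kv_heads * head_dim
    q, k, v = torch.split(qkv, [q_size, kv_size, kv_size], dim=0)
    out[f"model.layers.{layer}.self_attn.q_proj.weight"] = q
    out[f"model.layers.{layer}.self_attn.k_proj.weight"] = k
    out[f"model.layers.{layer}.self_attn.v_proj.weight"] = v
    # Fused FFN input: dense_h_to_4h stacks [gate; up] along dim 0 for SwiGLU.
    gate, up = torch.chunk(sd[prefix + "mlp.dense_h_to_4h.weight"], 2, dim=0)
    out[f"model.layers.{layer}.mlp.gate_proj.weight"] = gate
    out[f"model.layers.{layer}.mlp.up_proj.weight"] = up
    return out
```

ChatGLM2/3 checkpoints also carry a query_key_value bias that would need the same three-way split; it is omitted here to keep the sketch short.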
Collaborator
Please handle GLM4's #352 first.


Summary
- Add ChatGLM model config adapter (csrc/models/chatglm/)
- Extend rank_worker.cpp condition to include "chatglm"
- Register "chatglm" in config_factory.cpp classic_models list and auto_config.py (see the registration sketch below)
- Add ChatGLM weight remapping (key rename + QKV/FFN split) in modeling_utils.py
- Update test_infer.py for ChatGLM tokenizer special token handling (see the decoding snippet at the end)
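Registering the new model type on the Python side could look roughly like the following; the registry dict, register_model() helper, and adapter fields here are hypothetical, not the actual auto_config.py API:

```python
# Hypothetical sketch of Python-side model registration; the registry dict,
# register_model() helper, and adapter fields are illustrative, not the
# actual auto_config.py API.
from dataclasses import dataclass

@dataclass
class ChatGLMConfigAdapter:
    """Maps ChatGLM config keys onto the engine's LLaMA-style names."""
    num_layers_key: str = "num_layers"            # ChatGLM: num_layers, not num_hidden_layers
    kv_heads_key: str = "multi_query_group_num"   # ChatGLM's name for the KV-head count

MODEL_CONFIG_ADAPTERS: dict[str, object] = {}

def register_model(model_type: str, adapter) -> None:
    """Make `model_type` resolvable by the auto-config lookup."""
    MODEL_CONFIG_ADAPTERS[model_type] = adapter

register_model("chatglm", ChatGLMConfigAdapter())
```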
Note: This branch is based on issue/349 (GLM4 support), since ChatGLM depends on GLM4's infrastructure changes (partial rotary, RoPE algo selection, and the rank_worker special path). Please merge #352 first.

Closes #350
Parent issue: #332
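On the tokenizer side, ChatGLM tokenizers prepend prompt markers such as [gMASK] and <sop>, so decoding has to skip special tokens or the markers leak into the generated text. A small illustrative snippet of that handling; the checkpoint id is a placeholder and this is not the PR's test_infer.py code:

```python
from transformers import AutoTokenizer

# Illustrative only: the checkpoint id is a placeholder, not taken from the PR.
tok = AutoTokenizer.from_pretrained("THUDM/chatglm3-6b", trust_remote_code=True)

ids = tok.encode("Hello")
print(tok.convert_ids_to_tokens(ids))   # ChatGLM prepends [gMASK] and <sop> markers

# Skip special tokens on decode so prompt markers and EOS never reach the output.
print(tok.decode(ids, skip_special_tokens=True))
```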