issue/348 - add Baichuan causal LM model support by JoeZhang-0x000 · Pull Request #351 · InfiniTensor/InfiniLM

JoeZhang-0x000 · 2026-05-07T06:30:28Z

Summary

Add Baichuan model config adapter (csrc/models/baichuan/)
Register "baichuan" in config_factory.cpp classic_models list and auto_config.py
Add Baichuan weight remapping (W_pack → q/k/v_proj) in modeling_utils.py
Update test_infer.py for Baichuan tokenization and chat prompt handling

Closes #348
Parent issue: #332

wooway777 · 2026-05-07T09:19:32Z

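Is this a tokenizer version or precision issue on my side? It looks like EOS is indeed never emitted.
In the first screenshot generation does not terminate normally; in the second, the output has no punctuation.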

wooway777 · 2026-05-07T09:21:57Z

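Does this change mean that only single-turn inference works, and that bench, accuracy tests, and serving are all unusable?
We would also need to spend some time looking at how to make this general.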

pengcheng888 · 2026-05-07T10:20:21Z

        it->second(model_config);
    } else {
-        std::vector<std::string> classic_models = {"llama", "qwen2", "minicpm", "fm9g", "fm9g7b"};
+        std::vector<std::string> classic_models = {"llama", "qwen2", "minicpm", "fm9g", "fm9g7b", "baichuan"};


Adding a new model does not require changing this. Please remove the changes to csrc/config/config_factory.cpp.

pengcheng888 · 2026-05-07T10:56:06Z

@@ -47,4 +47,7 @@ def from_pretrained(model_path):
            cfg.model_type = "minicpmv"
            return cfg



Adding a new model does not require changing this. Please remove the changes to python/infinilm/auto_config.py.

pengcheng888 · 2026-05-07T10:59:04Z

+
+namespace infinilm::models::baichuan {
+
+std::shared_ptr<infinilm::config::ModelConfig> create_baichuan_model_config(


BaichuanForCausalLM needs an explicit definition. Add: using BaichuanForCausalLM = infinilm::models::llama::LlamaForCausalLM;

pengcheng888 · 2026-05-07T10:59:50Z

+
+INFINILM_REGISTER_CAUSAL_LM_MODEL(
+    baichuan,
+    infinilm::models::llama::LlamaForCausalLM,


Remove #include "../llama/llama_for_causal_lm.hpp" from csrc/models/baichuan/baichuan_for_causal_lm.cpp.

Change infinilm::models::llama::LlamaForCausalLM to infinilm::models::baichuan::BaichuanForCausalLM.

pengcheng888 · 2026-05-07T11:05:04Z

+
+namespace {
+
+#ifndef USE_CLASSIC_LLAMA


New models do not need to go under the USE_CLASSIC_LLAMA macro. Please remove USE_CLASSIC_LLAMA from csrc/models/baichuan/baichuan_for_causal_lm.cpp.

wooway777 · 2026-05-07T11:33:28Z

+    return new_sd
+
+
+def maybe_remap_weights(state_dict, model):


Rename this function to adjust_state_dict, or simply remap_weights; "maybe" reads as a bit casual.
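
For illustration, one possible shape of the renamed entry point is sketched below. Only the (state_dict, model) signature comes from the diff; the _REMAPPERS table, the lookup through model.config.model_type, and the stand-in remapper are assumptions, not the PR's code.

```python
from types import SimpleNamespace


def _remap_baichuan_weights(state_dict):
    # Stand-in for the PR's Baichuan remapper; here it only returns a copy.
    return dict(state_dict)


# Hypothetical dispatch table: model_type -> remapping function.
_REMAPPERS = {"baichuan": _remap_baichuan_weights}


def remap_weights(state_dict, model):
    """Return a state dict adjusted for the model's architecture.

    Models without a registered remapper get their state dict back unchanged.
    """
    # Assumption: the model exposes its type as model.config.model_type.
    model_type = getattr(getattr(model, "config", None), "model_type", None)
    remapper = _REMAPPERS.get(model_type)
    return remapper(state_dict) if remapper else state_dict


# Usage with a minimal fake model object.
fake_model = SimpleNamespace(config=SimpleNamespace(model_type="baichuan"))
adjusted = remap_weights({"model.norm.weight": [1.0]}, fake_model)
```

With a name like remap_weights the "do nothing for other models" case is part of the documented contract rather than something implied by a "maybe" prefix.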

pengcheng888 · 2026-05-07T11:33:55Z

 }


+def _split_first_dim(tensor, sizes, name):


Move _split_first_dim inside _remap_baichuan_weights, as a helper used only by that function.

JoeZhang-0x000 requested a review from a team May 7, 2026 06:30

JoeZhang-0x000 force-pushed the issue/348 branch from 5a23225 to d3763f7 Compare May 7, 2026 07:07

JoeZhang-0x000 mentioned this pull request May 7, 2026

issue/332 - add GLM4, ChatGLM and Baichuan support #333

Closed

JoeZhang-0x000 wants to merge 1 commit into InfiniTensor:main from JoeZhang-0x000:issue/348
