Skip to content

Conversation

@voltjia
Copy link
Collaborator

@voltjia voltjia commented Dec 25, 2025

image

@voltjia voltjia requested a review from a team December 26, 2025 08:55
@voltjia voltjia changed the title issue/160: 将 cpp.LlamaForCausalLM 提出,变为 infinilm.infer_engine.InferEngine,并将 Config 构造逻辑拆分至 AutoConfig issue/160: 梳理 infinilm.infer_engine.InferEngine 相关接口 Dec 26, 2025
@voltjia
Copy link
Collaborator Author

voltjia commented Dec 26, 2025

合入此 PR 前请先确保 InfiniTensor/InfiniCore#861 已合入。

@voltjia
Copy link
Collaborator Author

voltjia commented Dec 26, 2025

image

@voltjia voltjia added the enhancement New feature or request label Dec 26, 2025
@voltjia
Copy link
Collaborator Author

voltjia commented Dec 26, 2025

python ./test/bench/test_benchmark.py --nvidia /data/shared/models/TinyLlama -1.1B-Chat-v1.0 --bench ceval 的部分输出如下:

image image

@voltjia voltjia changed the title issue/160: 梳理 infinilm.infer_engine.InferEngine 相关接口 issue/160: 梳理 InferEngine 相关接口 Dec 26, 2025
@voltjia
Copy link
Collaborator Author

voltjia commented Dec 26, 2025

image

@voltjia
Copy link
Collaborator Author

voltjia commented Dec 26, 2025

python ./test/bench/test_benchmark.py --nvidia /data/shared/models/TinyLlama -1.1B-Chat-v1.0 --bench ceval 的部分输出如下:

image image

@voltjia
Copy link
Collaborator Author

voltjia commented Dec 26, 2025

image

@voltjia
Copy link
Collaborator Author

voltjia commented Dec 26, 2025

python ./test/bench/test_benchmark.py --nvidia /data/shared/models/TinyLlama -1.1B-Chat-v1.0 --bench ceval 的部分输出如下:

image image

@voltjia voltjia merged commit 34c0770 into main Dec 26, 2025
@voltjia voltjia deleted the issue/160 branch December 26, 2025 11:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

4 participants