Feature/qwen3 #311
base: develop
Conversation
zpitroda commented on Apr 29, 2025
- Updated models from Qwen 2.5 to their Qwen 3 equivalents (a rough sketch of the mapping follows this list)
- Updated the transformers and torch Python packages
- Updated llama.cpp for Qwen3 support
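Roughly, that swap amounts to a name mapping like the sketch below. The Hugging Face repo names are illustrative assumptions, not necessarily the exact set Second Me uses:

```python
# Rough sketch of the model swap this PR describes; repo names are
# illustrative and the exact sizes shipped by Second Me may differ.
QWEN25_TO_QWEN3 = {
    "Qwen/Qwen2.5-0.5B-Instruct": "Qwen/Qwen3-0.6B",
    "Qwen/Qwen2.5-1.5B-Instruct": "Qwen/Qwen3-1.7B",
    "Qwen/Qwen2.5-3B-Instruct": "Qwen/Qwen3-4B",
    "Qwen/Qwen2.5-7B-Instruct": "Qwen/Qwen3-8B",
}

def upgrade_model_name(name: str) -> str:
    # Fall back to the original name for models without a Qwen3 equivalent.
    return QWEN25_TO_QWEN3.get(name, name)
```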
README.md (Outdated)
| For model deployment, we utilized [llama.cpp](https://github.com/ggml-org/llama.cpp), which provides efficient inference capabilities.
- Our base models primarily come from the [Qwen2.5](https://huggingface.co/Qwen) series.
+ Our base models primarily come from the [Qwen3](https://huggingface.co/Qwen) series.
I am not sure what Second Me's model update policy is. As a community user I definitely want to use Qwen3 given its SOTA capabilities (though I haven't tested it yet, so there may be some ins and outs).
Huge thanks for your work!
On top of that, it would be nice to add Qwen 3 support alongside the existing Qwen 2.5 models, i.e. add a new supported model rather than replace the existing Qwen 2.5 ones.
I was wondering that as well. I'm testing right now to ensure Qwen 3 doesn't break anything, but I don't know whether the Second Me team currently wants to update. I can also keep the 2.5 models and add the option for 3 as well, if that's preferable?
Updated convert_hf_to_gguf script and gguf-py package to support qwen3 models
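For context, converting a Qwen3 checkpoint with the updated script presumably looks something like this; the paths and quantization type are placeholder assumptions, not the exact commands in this PR:

```python
import subprocess

# Hypothetical invocation of llama.cpp's convert_hf_to_gguf.py for a Qwen3
# checkpoint; paths and the quantization type are placeholders.
subprocess.run(
    [
        "python", "convert_hf_to_gguf.py",
        "models/Qwen3-8B",                   # local Hugging Face snapshot
        "--outfile", "models/qwen3-8b.gguf",
        "--outtype", "q8_0",                 # or f16 / bf16, depending on deployment
    ],
    check=True,
)
```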
Disabled thinking mode and updated backend dockerfile to work with new llama.cpp
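For reference, on the transformers side Qwen3's chat template exposes an enable_thinking switch, so disabling thinking mode looks roughly like this (the model name is illustrative; the PR itself handles this on the llama.cpp side):

```python
from transformers import AutoTokenizer

# Illustrative only: Qwen3's Hugging Face chat template accepts an
# enable_thinking flag that suppresses the <think>...</think> block.
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen3-8B")
messages = [{"role": "user", "content": "Hello!"}]
prompt = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
    enable_thinking=False,  # disable thinking mode
)
```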
Everything is working except during inference it outputs the extra thinking block.
Added Qwen 2.5 models back along with 3
Hi, I've tested it a bit; it fails when downloading models...
Sorry about that! The base_dir variable was accidentally indented into the "if" block above it, but it should be working now.
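A minimal sketch of the shape of that bug, with hypothetical names and paths rather than the actual Second Me code:

```python
import os

model_dir = "models/qwen3-8b"  # hypothetical path

# Buggy shape: base_dir was indented into the if block, so it was undefined
# whenever the directory already existed, breaking the download step.
if not os.path.exists(model_dir):
    os.makedirs(model_dir)
    base_dir = os.path.dirname(model_dir)

# Fixed shape: the assignment is dedented so it always runs.
if not os.path.exists(model_dir):
    os.makedirs(model_dir)
base_dir = os.path.dirname(model_dir)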
Hi, it works now. So once the conflict is resolved, it will be added to the develop branch :)
@kevin-mindverse sounds good! I think I have a temporary solution to the extra block being output until the llama.cpp PR is merged; I'll try to have that and the conflict fixes pushed tomorrow.
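One plausible shape for such a workaround, sketched here as an assumption rather than the actual patch, is to strip the (empty) thinking block from the generated text:

```python
import re

# Hypothetical sketch: remove the <think>...</think> block that Qwen3
# can still emit even with thinking disabled.
def strip_think_block(text: str) -> str:
    return re.sub(r"<think>.*?</think>\s*", "", text, flags=re.DOTALL)

print(strip_think_block("<think>\n\n</think>\n\nHello!"))  # -> "Hello!"
```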
OK, I actually quickly pushed what I think should do it. It shouldn't break anything, but I haven't double-checked that it's fixed.
@yingapple I'm away from my computer this week and unable to test, but it should hopefully now only add no_think flags when using a Qwen3 model.
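A minimal sketch of that guard; the exact check in the PR may differ:

```python
def apply_no_think(prompt: str, model_name: str) -> str:
    # Gate the soft switch on the model name: only Qwen3 understands
    # /no_think, so don't append it for Qwen 2.5 models. (Sketch only.)
    if "qwen3" in model_name.lower():
        return prompt + " /no_think"
    return prompt

print(apply_no_think("Hello!", "Qwen3-8B"))    # "Hello! /no_think"
print(apply_no_think("Hello!", "Qwen2.5-7B"))  # unchanged
```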