Skip to content

[Feature] Add support for Neo++#1274

Open
XHPlus wants to merge 43 commits intomainfrom
neo_plus_clean
Open

[Feature] Add support for Neo++#1274
XHPlus wants to merge 43 commits intomainfrom
neo_plus_clean

Conversation

@XHPlus
Copy link
Copy Markdown
Contributor

@XHPlus XHPlus commented Apr 17, 2026

No description provided.

Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request implements multimodal image generation (x2i) support, introducing NeoChat and NeoChat MOE models and a dedicated x2i server architecture. Key changes include Triton kernels for KV cache offloading, API extensions for multimodal requests, and integration into the inference pipeline. Feedback identifies opportunities to improve code robustness by replacing empty exception handlers and resolving ambiguity regarding commented-out stream synchronization logic.

Comment on lines +170 to +171
except:
pass
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

The empty except block suppresses all exceptions, which can hide bugs. It is better to catch specific exceptions or at least log the error.

sync_event = torch.cuda.Event()
sync_event.record()
sync_event.wait(g_infer_context.get_overlap_stream())
# sync_event.synchronize()
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

Synchronizing the stream here might be necessary, but it is commented out. If it is required for stability, it should be uncommented; otherwise, it should be removed to avoid confusion.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants