[sglang_disagg] mori IO optimization and proxy server relocation#161
Merged
lcskrishna requested changes on Jun 2, 2026
- Unified launcher with MoRI/Mooncake backend selection (KV_TRANSFER_BACKEND)
- CX7 multi-rail NIC support, default to CX7 400G rail NICs
- xP/yD multi-node support for DP_MODE=0 (TP-only)
- DP_MODE=1 restricted to 1P1D (multi-node DP not yet supported)
- Condensed RDMA/NCCL/Gloo env config in mori_ep_env.sh
- Model flag catalog cleanup: dp settings only on DeepSeek-V3/R1
- Configurable benchmark combinations with random-range-ratio=1.0
- Dockerfile updated to rocm720 base image

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
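The backend selection and the DP_MODE restriction listed above can be read as a small set of launch-time checks. The following is only a sketch of that logic, not the launcher's actual code: the function name `validate_launch_config`, the lowercase backend spellings, and the returned dictionary are hypothetical; only the `KV_TRANSFER_BACKEND` and `DP_MODE` variable names and the 1P1D rule come from the commit message.

```python
# Hypothetical sketch of the launch-time checks described in the commit message.
# Only KV_TRANSFER_BACKEND, DP_MODE, and the 1P1D rule come from the source;
# every other name here is an illustrative assumption.

SUPPORTED_BACKENDS = {"mori", "mooncake"}  # assumed spellings of the two backends


def validate_launch_config(env, num_prefill, num_decode):
    """Check an xP/yD topology against the selected backend and DP mode."""
    backend = env.get("KV_TRANSFER_BACKEND", "").lower()
    if backend not in SUPPORTED_BACKENDS:
        raise ValueError(f"unsupported KV_TRANSFER_BACKEND: {backend!r}")

    dp_mode = int(env.get("DP_MODE", "0"))
    if dp_mode not in (0, 1):
        raise ValueError(f"DP_MODE must be 0 or 1, got {dp_mode}")

    if num_prefill < 1 or num_decode < 1:
        raise ValueError("need at least one prefill and one decode node")

    # DP_MODE=1 is restricted to 1P1D; DP_MODE=0 (TP-only) allows xP/yD.
    if dp_mode == 1 and (num_prefill, num_decode) != (1, 1):
        raise ValueError("DP_MODE=1 supports only 1P1D")

    return {
        "backend": backend,
        "dp_mode": dp_mode,
        "topology": f"{num_prefill}P{num_decode}D",
    }
```

In a real launcher, `env` would presumably be `os.environ`; passing a plain dictionary keeps the sketch easy to exercise without touching the process environment.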
7d31b82 to e971f22
@lcskrishna all requested changes are done.
Motivation
The initial motivation was to run the proxy on the prefill node. Other fixes were also made along the way.
Technical Details
Improved mori-io performance.
Test Plan
Validated 1P1D with all supported models, running the setup with MoRI, without MoRI, and with MoRI EP.
Test Result
Submission Checklist