Skip to content

mxfp8 support#2447

Draft
xiuhu17 wants to merge 57 commits into
NVIDIA-NeMo:mainfrom
xiuhu17:zhw/mxfp8_support
Draft

mxfp8 support#2447
xiuhu17 wants to merge 57 commits into
NVIDIA-NeMo:mainfrom
xiuhu17:zhw/mxfp8_support

Conversation

@xiuhu17
Copy link
Copy Markdown

@xiuhu17 xiuhu17 commented May 8, 2026

mxfp8 e2e rl support.

mxfp8 checkpoint conversion, weight update and fine-grained layer precision is adopted from:
radixark/miles#615 which is done by @zianglih.

Also need to wait sglang pr: sgl-project/sglang#24657 to support partial features for mxfp8 weight update

@copy-pr-bot
Copy link
Copy Markdown

copy-pr-bot Bot commented May 8, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@xiuhu17 xiuhu17 changed the title mxfp8 support [wip] mxfp8 support May 8, 2026
@xiuhu17 xiuhu17 changed the title [wip] mxfp8 support [WIP] mxfp8 support May 8, 2026
@xiuhu17 xiuhu17 changed the title [WIP] mxfp8 support mxfp8 support May 8, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants