(2) FA4 support (sglang-fa4 -> sglang) by avnermay · Pull Request #3 · togethercomputer/ssd

avnermay · 2026-03-28T15:13:29Z

No description provided.

… flashinfer dependency

…s in cross-node case

arnica-github-connector · 2026-03-28T20:00:25Z

ssd/engine/model_runner.py

        self.block_size = config.kvcache_block_size
        self.enforce_eager = config.enforce_eager
-        self.tokenizer = AutoTokenizer.from_pretrained(config.tokenizer_path if config.tokenizer_path else config.model, use_fast=True)
+        self.tokenizer = AutoTokenizer.from_pretrained(config.tokenizer_path if config.tokenizer_path else config.model, use_fast=True, trust_remote_code=True)


Static Code Analysis Risk: Together python huggingface trust remote code

trust_remote_code=True downloads and executes arbitrary Python code from the model repository without sandboxing (OWASP LLM03:2025 Supply Chain). A malicious or compromised model repo can achieve RCE on every host that loads the model (CWE-94). Pin to a verified commit hash and audit remote code before use, or use models that don't require trust_remote_code.

Severity: High 🚨
Status: Open 🔴

References:

https://cwe.mitre.org/data/definitions/94

https://huggingface.co/docs/transformers/main/en/main_classes/model#transformers.PreTrainedModel.from_pretrained

https://genai.owasp.org/llmrisk/llm032025-supply-chain/

https://hiddenlayer.com/research/weaponizing-machine-learning-models-with-ransomware/

Suggested reviewers 🧐: @avnermay

More details:

🌻 View in Arnica

If you see an issue, please contact Shasheen in the #security-engineering Slack channel.

Take action by replying with an [arnica] command 💬

Actions

Use [arnica] or [a] to interact with the Arnica bot to acknowledge or dismiss code risks.

To acknowledge the finding as a valid code risk: [arnica] ack <acknowledge additional details>

To dismiss the risk with a reason: [arnica] dismiss <fp|accept|capacity> <dismissal reason>

Examples

[arnica] ack This is a valid risk and I'm looking into it

[arnica] dismiss fp Dismissed - Risk Not Accurate: (i.e. False Positive)

[arnica] dismiss accept Dismiss - Risk Accepted: Allow the risk to exist in the system

[arnica] dismiss capacity Dismiss - No Capacity: This will need to wait for a future sprint

avnermay added 2 commits March 28, 2026 08:12

FA4 support

66b8b7b

Add tests and tree_mask.py so that FA4 works

65301a3

avnermay changed the title ~~FA4 support~~ FA4 support (sglang-fa4 -> sglang) Mar 28, 2026

avnermay changed the title ~~FA4 support (sglang-fa4 -> sglang)~~ (2) FA4 support (sglang-fa4 -> sglang) Mar 28, 2026

avnermay and others added 5 commits March 28, 2026 09:27

Merge branch 'avner/sglang' into avner/sglang-fa4

aa50214

Update pyproject.toml to reflect flash-attn 4 dependency, and no more…

d1c9215

… flashinfer dependency

Fix FA4 import

2463748

Add logging statement once draft process is waiting for target proces…

d86d0fb

…s in cross-node case

Trust remote code fix

1425f32

arnica-github-connector bot reviewed Mar 28, 2026

View reviewed changes

avnermay added 6 commits March 28, 2026 13:13

Add logging for draft model warmup

cb51158

Switch all attention calls to use FA4

bfcb931

Add tests for attention fa4

cce45eb

Upgrade transformers, pin FA4

080c4a3

DUMP_TENSORS=false fix

eb5e612

Switch from ssh to https git dependency in pyproject.toml

ff59fdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

(2) FA4 support (sglang-fa4 -> sglang)#3

(2) FA4 support (sglang-fa4 -> sglang)#3
avnermay wants to merge 13 commits intoavner/sglangfrom
avner/sglang-fa4

avnermay commented Mar 28, 2026

Uh oh!

arnica-github-connector bot Mar 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

avnermay commented Mar 28, 2026

Uh oh!

arnica-github-connector bot Mar 28, 2026

Choose a reason for hiding this comment

Static Code Analysis Risk: Together python huggingface trust remote code

References:

More details:

Actions

Examples

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant