-
Notifications
You must be signed in to change notification settings - Fork 0
Production LLM deployment specs for NVIDIA Blackwell GPUs (RTX Pro 6000, DGX Spark). Includes vLLM configurations, benchmarks, load balancer, and throughput calculators for NVFP4/FP8/MoE models.
PrimitiveContext/blackwell
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
About
Production LLM deployment specs for NVIDIA Blackwell GPUs (RTX Pro 6000, DGX Spark). Includes vLLM configurations, benchmarks, load balancer, and throughput calculators for NVFP4/FP8/MoE models.
Topics
Stars
Watchers
Forks
Packages 0
No packages published