File tree Expand file tree Collapse file tree
examples/models/voxtral_realtime Expand file tree Collapse file tree Original file line number Diff line number Diff line change 152152 " llm-release"
153153 ],
154154 "cacheVariables" : {
155- "EXECUTORCH_BUILD_CUDA" : " ON"
155+ "EXECUTORCH_BUILD_CUDA" : " ON" ,
156+ "CMAKE_CUDA_ARCHITECTURES" : " native"
156157 },
157158 "condition" : {
158159 "type" : " inList" ,
Original file line number Diff line number Diff line change 5757 "name" : " voxtral-realtime-cpu" ,
5858 "displayName" : " Build Voxtral Realtime runner (CPU)" ,
5959 "configurePreset" : " voxtral-realtime-cpu" ,
60+ "configuration" : " Release" ,
6061 "targets" : [
6162 " voxtral_realtime_runner"
6263 ]
7374 {
7475 "name" : " voxtral-realtime-cuda" ,
7576 "displayName" : " Build Voxtral Realtime runner (CUDA)" ,
77+ "configuration" : " Release" ,
7678 "configurePreset" : " voxtral-realtime-cuda" ,
7779 "targets" : [
7880 " voxtral_realtime_runner"
Original file line number Diff line number Diff line change @@ -198,7 +198,6 @@ capability to avoid "invalid device function" errors (the `int4mm` kernels
198198require SM 80+).
199199
200200``` powershell
201- $env:CMAKE_CUDA_ARCHITECTURES="80;86;89;90;120"
202201cmake --workflow --preset llm-release-cuda
203202Push-Location examples/models/voxtral_realtime
204203cmake --workflow --preset voxtral-realtime-cuda
Original file line number Diff line number Diff line change @@ -65,6 +65,20 @@ $ cmake --workflow --preset llm-debug-cuda
6565$ cmake --workflow --preset llm-debug-metal
6666```
6767
68+ > [ !NOTE]
69+ > ** CUDA architecture selection:** The ` llm-release-cuda ` (and ` llm-debug-cuda ` )
70+ > preset sets ` CMAKE_CUDA_ARCHITECTURES=native ` , which auto-detects the GPU
71+ > on the build machine at configure time. To target a different architecture,
72+ > override it with ` -D ` on the configure step:
73+ > ``` bash
74+ > cmake --preset llm-release-cuda -DCMAKE_CUDA_ARCHITECTURES=" 80;86;89;90;120"
75+ > cmake --build --preset llm-release-cuda --config Release
76+ > ` ` `
77+ > Note that ` cmake --workflow` does not accept ` -D` flags, so you must run
78+ > configure and build as separate steps when overriding. Also note that on
79+ > Windows, setting ` CMAKE_CUDA_ARCHITECTURES` via environment variable does
80+ > ** not** work with CMake presets — you must use the ` -D` flag.
81+
6882# ### Understanding workflow components
6983
7084A workflow preset typically consists of:
You can’t perform that action at this time.
0 commit comments