Open
Description
The example examples/vadd.jl should vectorize, but the output of @device_code_native ... shows no vector instructions.
There are two reasons for this behavior:
- In our LLVM source tree, the LoopVectorizer is disabled by means of its cost function so that it does not interfere with the RegionVectorizer (RV).
- Julia's GPUCompiler uses its own set of optimization passes, and that set does not call RV.
Possible fixes: either re-enable the LoopVectorizer or call RV in GPUCompiler's optimization step.
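To check whether a fix has any effect, it may help to test for vector types in the generated IR rather than reading the output by eye. The following is only a host-side sketch: it uses code_llvm from Julia's InteractiveUtils on the CPU, not @device_code_native on the device, and the names vadd! and has_vector_ir are hypothetical stand-ins rather than the contents of examples/vadd.jl.

```julia
using InteractiveUtils  # provides code_llvm

# Hypothetical stand-in for the kernel in examples/vadd.jl: element-wise add.
function vadd!(c, a, b)
    @inbounds for i in eachindex(c, a, b)
        c[i] = a[i] + b[i]
    end
    return c
end

# Return true if the optimized LLVM IR for `f` contains a float vector type
# such as `<8 x float>` or `<vscale x 4 x float>`.
function has_vector_ir(f, types)
    io = IOBuffer()
    code_llvm(io, f, types; debuginfo = :none)
    ir = String(take!(io))
    return occursin(r"\d+ x float>", ir)
end

argtypes = Tuple{Vector{Float32}, Vector{Float32}, Vector{Float32}}
println("vector IR present: ", has_vector_ir(vadd!, argtypes))
```

The same text match could in principle be applied to the device code once either vectorizer runs, assuming the device IR uses the usual LLVM vector type syntax.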