Enable dpnp build on AMD GPU by vlad-perevezentsev · Pull Request #2302 · IntelPython/dpnp

vlad-perevezentsev · 2025-02-10T15:24:54Z

This PR updates CMakeLists files and build_locally.py to enable building dpnp for AMD targets.

To build dpnp on AMD:

python scripts/build_locally.py --target-hip=gfx90a

To find the architecture, use

rocminfo | grep 'Name: *gfx.*'
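The lookup and build steps above can be combined in a small script. This is a minimal sketch and not part of the PR: the helper names `find_gfx_archs` and `build_command` are hypothetical, and the sample text only imitates the `Name: gfx...` lines that the grep pattern matches.

```python
import re

# Mirrors the grep pattern 'Name: *gfx.*': "Name:", optional blanks, then gfx<id>.
_GFX_RE = re.compile(r"Name:[ \t]*(gfx\w+)")


def find_gfx_archs(rocminfo_text):
    """Return the unique gfx architecture names, in order of first appearance."""
    archs = []
    for match in _GFX_RE.finditer(rocminfo_text):
        arch = match.group(1)
        if arch not in archs:
            archs.append(arch)
    return archs


def build_command(arch):
    """Assemble the build_locally.py invocation for one HIP architecture."""
    return ["python", "scripts/build_locally.py", f"--target-hip={arch}"]


# Illustrative input only (an assumption); real rocminfo output has many more fields.
sample = """
  Name:                    AMD EPYC 7763
  Name:                    gfx90a
  Name:                    amdgcn-amd-amdhsa--gfx90a:sramecc+:xnack-
"""
archs = find_gfx_archs(sample)
print(" ".join(build_command(archs[0])))
```

On a real machine the text would come from running `rocminfo`; the sketch keeps the parsing separate so it can be checked without AMD hardware.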

Have you provided a meaningful PR description?
Have you added a test, reproducer or referred to issue with a reproducer?
Have you tested your changes locally for CPU and GPU devices?
Have you made sure that new changes do not introduce compiler warnings?
Have you checked performance impact of proposed changes?
If this PR is a work in progress, are you filing the PR as a draft?

github-actions · 2025-02-10T16:12:02Z

Array API standard conformance tests for dpnp=0.18.0dev2=py312he4f9c94_48 ran successfully.
Passed: 1229
Failed: 0
Skipped: 9

github-actions · 2025-02-10T17:15:04Z

View rendered docs @ https://intelpython.github.io/dpnp/index.html

dpnp/backend/extensions/indexing/CMakeLists.txt

antonwolfy · 2025-02-17T19:10:02Z

scripts/build_locally.py

+        if not arch:
+            raise ValueError("--arch is required when --target=hip")
+        cmake_args += [
+            "-DDPNP_TARGET_HIP=ON",


Why do we need to define two variables? Can they be combined into a single one, as in dpctl: -DDPNP_TARGET_HIP={arch}?

Additionally, --target=cuda is the current dpnp approach, but:

dpctl and dpnp should consider supporting targeting specific CUDA architectures

--target=hip means that there is no way to build simultaneously for HIP and CUDA (which is very, very much an edge case, but should be considered)

For these reasons, I think it is most sensible to move away from the universal --target= approach to --target-cuda= and --target-hip= or something to that effect

@ndgrigorian it is a great suggestion.
I have added support for --target-hip and I am going to add --target-cuda instead of --target in the next PR.
Thanks
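The per-backend options discussed above might look like the following in an argparse-based script. This is a sketch under stated assumptions, not the code merged in this PR: `--target-cuda` is only planned for a follow-up PR, the single-variable form `-DDPNP_TARGET_HIP={arch}` is the reviewer's suggestion, and `-DDPNP_TARGET_CUDA` and the function names are hypothetical.

```python
import argparse


def make_parser():
    parser = argparse.ArgumentParser(description="Sketch of per-backend target options")
    # One option per backend, so HIP and CUDA can be requested in the same build.
    parser.add_argument("--target-hip", default=None,
                        help="AMD architecture, e.g. gfx90a")
    parser.add_argument("--target-cuda", default=None,
                        help="NVIDIA architecture, e.g. sm_80 (hypothetical option)")
    return parser


def cmake_args_for(args):
    """Translate parsed options into CMake definitions (variable names are assumptions)."""
    cmake_args = []
    if args.target_hip is not None:
        if not args.target_hip.strip():
            raise ValueError("--target-hip requires an architecture, e.g. gfx90a")
        cmake_args.append(f"-DDPNP_TARGET_HIP={args.target_hip}")
    if args.target_cuda is not None:
        if not args.target_cuda.strip():
            raise ValueError("--target-cuda requires an architecture, e.g. sm_80")
        cmake_args.append(f"-DDPNP_TARGET_CUDA={args.target_cuda}")
    return cmake_args


args = make_parser().parse_args(["--target-hip=gfx90a", "--target-cuda=sm_80"])
print(cmake_args_for(args))
```

Carrying the architecture in the option value removes the need for a separate `--arch` flag, and two independent options leave room for the HIP-plus-CUDA edge case raised above.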

scripts/build_locally.py

coveralls · 2025-03-18T11:28:02Z

coverage: 71.711%. remained the same
when pulling f41e5d8 on enable_amd_build
into e41ff80 on master.

doc/quick_start_guide.rst

scripts/build_locally.py

antonwolfy · 2025-03-31T13:41:21Z

CMakeLists.txt

    "Build DPNP with oneMKL Interfaces"
    OFF
 )
+set(HIP_TARGETS "" CACHE STRING "HIP architecture for target")


I assume there is no support for multiple values:

Suggested change

-set(HIP_TARGETS "" CACHE STRING "HIP architecture for target")
+set(HIP_TARGET "" CACHE STRING "HIP architecture for target")

At some point, it was clear in docs that only one architecture was supported at a time, but now it isn't as clear and should be tested

Also, there is new information in the extension guide

The compiler driver also offers alias targets for each target+architecture pair to make the command line shorter and easier to understand for humans. Thanks to the aliases, the -Xsycl-target-backend flags no longer need to be specified.

It shows that the command

icpx -fsycl -fsycl-targets=spir64_gen,amdgcn-amd-amdhsa,nvptx64-nvidia-cuda \
    -Xsycl-target-backend=spir64_gen '-device pvc' \
    -Xsycl-target-backend=amdgcn-amd-amdhsa --offload-arch=gfx1030 \
    -Xsycl-target-backend=nvptx64-nvidia-cuda --offload-arch=sm_80 \
    -o sycl-app sycl-app.cpp

is equivalent to

icpx -fsycl -fsycl-targets=intel_gpu_pvc,amd_gpu_gfx1030,nvidia_gpu_sm_80 \
    -o sycl-app sycl-app.cpp

so maybe both dpctl and dpnp can simplify by removing the need for -Xsycl-target-backend=amdgcn-amd-amdhsa --offload-arch=[X] completely

list of aliases:
https://intel.github.io/llvm/UsersManual.html

The alias list seems to claim that only one alias is supported at a time, so probably only one architecture can be targeted at once? That would be my guess.

I assume there is no support for multiple values:

I am using HIP_TARGETS instead of HIP_TARGET because oneMath requires HIP_TARGETS to be defined
https://github.com/uxlfoundation/oneMath/blob/4ad4dfb5db834117248ad5f8fbded5cfc1097005/CMakeLists.txt#L73

Isn't it only a declaration of a CMake variable, which doesn't affect oneMath?

According to the oneMath documentation, HIP_TARGETS must be set for ROCm builds.

so maybe both dpctl and dpnp can simplify by removing the need for -Xsycl-target-backend=amdgcn-amd-amdhsa --offload-arch=[X] completely

This is a great suggestion.

The compiler supports more than one target

Using aliases greatly simplifies the logic

In this PR I will switch to aliases for AMD.
In the next PR I will propose the same update for CUDA.
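To illustrate how aliases simplify the logic, the sketch below composes a single `-fsycl-targets` value from vendor/architecture pairs, following the `intel_gpu_pvc`, `amd_gpu_gfx1030`, `nvidia_gpu_sm_80` pattern quoted in this thread. The function `sycl_targets_flag` and the prefix table are hypothetical and not taken from dpnp; whether a given architecture actually has an alias should be checked against the compiler's alias list.

```python
# Alias prefixes inferred from the example quoted in this thread (an assumption).
_ALIAS_PREFIX = {
    "intel": "intel_gpu_",
    "amd": "amd_gpu_",
    "nvidia": "nvidia_gpu_",
}


def sycl_targets_flag(targets):
    """Build one -fsycl-targets flag from (vendor, architecture) pairs."""
    aliases = []
    for vendor, arch in targets:
        if vendor not in _ALIAS_PREFIX:
            raise ValueError(f"unknown vendor: {vendor!r}")
        aliases.append(_ALIAS_PREFIX[vendor] + arch)
    return "-fsycl-targets=" + ",".join(aliases)


flag = sycl_targets_flag([("intel", "pvc"), ("amd", "gfx1030"), ("nvidia", "sm_80")])
print(flag)
```

With aliases, the architecture travels inside the target name, so no per-backend `-Xsycl-target-backend ... --offload-arch=...` pairs have to be assembled.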

CMakeLists.txt

antonwolfy

Thank you @vlad-perevezentsev, overall LGTM

CHANGELOG.md

CMakeLists.txt

vlad-perevezentsev added 3 commits February 10, 2025 06:53

Enable CMake options to build dpnp on AMD

72bc4d4

Add build_locally args for AMD build

5f11917

Remove unused lines

c07e0a7

vlad-perevezentsev self-assigned this Feb 10, 2025

vlad-perevezentsev requested review from AlexanderKalistratov, antonwolfy and vtavana as code owners February 10, 2025 15:24

vlad-perevezentsev added 3 commits February 11, 2025 04:58

Remove ROCM_PATH logic

323bbb4

Support amd build for indexing extension

e111ce1

Merge master into enable_amd_build

ccc7b72

antonwolfy reviewed Feb 17, 2025

dpnp/backend/extensions/indexing/CMakeLists.txt (resolved)

antonwolfy reviewed Feb 17, 2025

scripts/build_locally.py (outdated, resolved)

antonwolfy added this to the 0.18.0 release milestone Feb 26, 2025

vlad-perevezentsev added 7 commits March 14, 2025 04:22

Merge master into enable_amd_build

efbab02

Support amd build for window extension

310cd82

Set HIP specific flags for MKL

c3adf4e

Update logic to use --target-hip

574ea90

Merge master into enable_amd_build

d6c5925

Add docs for dpnp build on AMD

5bca529

Remove unnecessary HIP_TARGETS validation in CMake

b858ae2

A small docs update

273113e

antonwolfy reviewed Mar 31, 2025

vlad-perevezentsev added 4 commits April 16, 2025 03:31

Improve validation of --target and --target-hip

c4da3ef

Clarify --target-hip usage in doc

e6c280e

Update SYCL target selection logic in CMakeLists

b27a8a1

Merge master into enable_amd_build

2238372

Avoid false HIP error when building for default target

5e2cc3d

antonwolfy reviewed May 6, 2025

CMakeLists.txt (outdated, resolved)

vlad-perevezentsev mentioned this pull request May 8, 2025

Avoid using sycl/ext/intel/math.hpp on non-Intel devices #2439

Merged

7 tasks

vlad-perevezentsev added 6 commits May 8, 2025 07:38

Merge master into enable_amd_build

2eaf883

Merge master into enable_amd_build

0877a32

Enable onemkl_interfaces when DPNP_SYCL_TARGETS match amd or cuda

395871e

Fix logic for multitarget builds

54f44cd

Use target aliases and clean up HIP configuration

e57ccf8

Merge master into enable_amd_build

c2ebe72

antonwolfy reviewed May 13, 2025

CMakeLists.txt (resolved)

vlad-perevezentsev added 2 commits May 14, 2025 02:44

Update CHANGELOG

7a89e0f

Merge master into enable_amd_build

07639f6

antonwolfy approved these changes May 14, 2025

CHANGELOG.md (outdated, resolved)

vlad-perevezentsev commented May 14, 2025

CMakeLists.txt (outdated, resolved)

vlad-perevezentsev commented May 14, 2025

CMakeLists.txt (outdated, resolved)

vlad-perevezentsev commented May 14, 2025

CMakeLists.txt (resolved)

vlad-perevezentsev added 4 commits May 15, 2025 02:38

Disallow HIP/CUDA targets without oneMKL interface flag

d2e5792

Merge remote-tracking branch 'origin/master' into enable_amd_build

a278f13

Update changelog

e2351ee

Merge master into enable_amd_build

f41e5d8

antonwolfy approved these changes May 15, 2025

antonwolfy merged commit fdf9ba7 into master May 15, 2025
62 of 68 checks passed

antonwolfy deleted the enable_amd_build branch May 15, 2025 19:13

Conversation

vlad-perevezentsev commented Feb 10, 2025 (edited)
github-actions bot commented Feb 10, 2025 (edited)
github-actions bot commented Feb 10, 2025 (edited)
antonwolfy Feb 17, 2025 (edited)
ndgrigorian Feb 21, 2025 (edited)
vlad-perevezentsev Mar 18, 2025
coveralls commented Mar 18, 2025 (edited)
antonwolfy Mar 31, 2025
ndgrigorian Apr 10, 2025 (edited)
ndgrigorian Apr 10, 2025
vlad-perevezentsev May 6, 2025
antonwolfy May 6, 2025
vlad-perevezentsev May 9, 2025
vlad-perevezentsev May 9, 2025
antonwolfy left a comment

4 participants
