Skip to content

OCPNODE-4043: Add DRA e2e tests to run on NVIDIA GPU#30758

Draft
sairameshv wants to merge 2 commits intoopenshift:mainfrom
sairameshv:nvidia_dra_ocp
Draft

OCPNODE-4043: Add DRA e2e tests to run on NVIDIA GPU#30758
sairameshv wants to merge 2 commits intoopenshift:mainfrom
sairameshv:nvidia_dra_ocp

Conversation

@sairameshv
Copy link
Member

@sairameshv sairameshv commented Feb 4, 2026

Add NVIDIA DRA E2E tests for OpenShift
Implements comprehensive E2E tests for NVIDIA Dynamic Resource Allocation (DRA) on OpenShift clusters with GPU nodes.

  • Skip the tests for non-GPU clusters
  • Automated prerequisite installation (GPU Operator + DRA Driver)
  • Single GPU allocation tests
  • Multi-GPU workload tests(Skips on a single GPU setup)
  • Resource lifecycle validation
  • README.md doc explaining the execution of e2e tests along with the installation of the pre-reqs

Tested on: OCP 4.21.0, Kubernetes 1.34.2, Tesla T4 GPU"

Implements comprehensive E2E tests for NVIDIA Dynamic Resource Allocation (DRA) on OpenShift clusters with GPU nodes.

- Skip the test for non-GPU clusters
- Automated prerequisite installation (GPU Operator + DRA Driver)
- Single GPU allocation tests
- Multi-GPU workload tests(Skips on a single GPU setup)
- Resource lifecycle validation

Tested on: OCP 4.21.0, Kubernetes 1.34.2, Tesla T4 GPU"

Signed-off-by: Sai Ramesh Vanka <svanka@redhat.com>
- README.md doc with a detailed description of running these tests along
  with installing the pre-requisites that helps the manual validation

Signed-off-by: Sai Ramesh Vanka <svanka@redhat.com>
@openshift-ci-robot
Copy link

Pipeline controller notification
This repo is configured to use the pipeline controller. Second-stage tests will be triggered either automatically or after lgtm label is added, depending on the repository configuration. The pipeline controller will automatically detect which contexts are required and will utilize /test Prow commands to trigger the second stage.

For optional jobs, comment /test ? to see a list of all defined jobs. To trigger manually all jobs from second stage use /pipeline required command.

This repository is configured in: automatic mode

@openshift-ci-robot openshift-ci-robot added the jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. label Feb 4, 2026
@openshift-ci-robot
Copy link

openshift-ci-robot commented Feb 4, 2026

@sairameshv: This pull request references OCPNODE-4043 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.22.0" version, but no target version was set.

Details

In response to this:

Add NVIDIA DRA E2E tests for OpenShift
Implements comprehensive E2E tests for NVIDIA Dynamic Resource Allocation (DRA) on OpenShift clusters with GPU nodes.

  • Skip the test for non-GPU clusters
  • Automated prerequisite installation (GPU Operator + DRA Driver)
  • Single GPU allocation tests
  • Multi-GPU workload tests(Skips on a single GPU setup)
  • Resource lifecycle validation
  • README.md doc explaining the execution of e2e tests along with the installation of the pre-reqs

Tested on: OCP 4.21.0, Kubernetes 1.34.2, Tesla T4 GPU"

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@openshift-ci openshift-ci bot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Feb 4, 2026
@openshift-ci
Copy link
Contributor

openshift-ci bot commented Feb 4, 2026

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

@openshift-ci
Copy link
Contributor

openshift-ci bot commented Feb 4, 2026

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: sairameshv
Once this PR has been reviewed and has the lgtm label, please assign bertinatto for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci-robot
Copy link

openshift-ci-robot commented Feb 4, 2026

@sairameshv: This pull request references OCPNODE-4043 which is a valid jira issue.

Details

In response to this:

Add NVIDIA DRA E2E tests for OpenShift
Implements comprehensive E2E tests for NVIDIA Dynamic Resource Allocation (DRA) on OpenShift clusters with GPU nodes.

  • Skip the tests for non-GPU clusters
  • Automated prerequisite installation (GPU Operator + DRA Driver)
  • Single GPU allocation tests
  • Multi-GPU workload tests(Skips on a single GPU setup)
  • Resource lifecycle validation
  • README.md doc explaining the execution of e2e tests along with the installation of the pre-reqs

Tested on: OCP 4.21.0, Kubernetes 1.34.2, Tesla T4 GPU"

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. jira/valid-reference Indicates that this PR references a valid Jira ticket of any type.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants