Conversation

@siegfriedweber
Member

Description

Use unique names for cluster-wide Prometheus resources

Prometheus was installed with the Helm parameter --take-ownership, which should have prevented the following error but did not:

Error: INSTALLATION FAILED: 3 errors occurred:
    * clusterroles.rbac.authorization.k8s.io "prometheus-stack-kube-prom-prometheus" already exists
    * clusterrolebindings.rbac.authorization.k8s.io "prometheus-stack-kube-prom-prometheus" already exists
    * mutatingwebhookconfigurations.admissionregistration.k8s.io "prometheus-stack-kube-prom-admission" already exists

Therefore, the namespace is now added to the Helm release name and, in turn, to the names of the cluster-wide resources.
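
A minimal sketch of the idea, not the actual change in this PR: the release name is derived from the test namespace, so resources named after the release differ between namespaces. The `prometheus-` prefix, the helper names `release_name` and `install_command`, the example namespaces, and the chart reference are assumptions made for illustration. The command is only printed, not executed.

```shell
#!/bin/sh
# Hypothetical helper: derive a per-namespace Helm release name so that
# cluster-wide resources named after the release do not collide.
release_name() {
    # $1: namespace the test runs in
    printf 'prometheus-%s' "$1"
}

# Build (but do not run) the install command.
# The chart reference is an assumption, not taken from this PR.
install_command() {
    namespace="$1"
    printf 'helm install %s prometheus-community/kube-prometheus-stack --namespace %s' \
        "$(release_name "$namespace")" "$namespace"
}

# Two different namespaces yield two different release names.
install_command "test-ns-1"; echo
install_command "test-ns-2"; echo
```

If the chart derives the names of its ClusterRole, ClusterRoleBinding, and webhook configuration from the release name, two test namespaces no longer compete for the same cluster-wide names.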

Definition of Done Checklist

  • Not all of these items are applicable to all PRs; the author should update this template to leave in only the boxes that are relevant
  • Please make sure all these things are done and tick the boxes

Author

  • Changes are OpenShift compatible
  • Integration tests passed (for non trivial changes)

Reviewer

  • Code contains useful comments
  • Code contains useful logging statements
  • (Integration-)Test cases added
  • Documentation added or updated. Follows the style guide.
  • Changelog updated
  • Cargo.toml only contains references to git tags (not specific commits or branches)

Acceptance

  • Feature Tracker has been updated
  • Proper release label has been added
  • Links to generated (nightly) docs added
  • Release note snippet added
  • Add type/deprecation label & add to the deprecation schedule
  • Add type/experimental label & add to the experimental features tracker

@siegfriedweber siegfriedweber requested a review from a team January 14, 2026 09:51
@siegfriedweber siegfriedweber self-assigned this Jan 14, 2026
@siegfriedweber siegfriedweber moved this to Development: Waiting for Review in Stackable Engineering Jan 14, 2026
Member

@xeniape xeniape left a comment

  • Is the assertion that Prometheus starts up no longer needed?
  • The test fails locally for me: a ServiceMonitor is created one step before Prometheus (which provides the ServiceMonitor CRD) is installed

@xeniape xeniape moved this from Development: Waiting for Review to Development: In Review in Stackable Engineering Jan 14, 2026
@siegfriedweber
Member Author

> The test fails locally for me: a ServiceMonitor is created one step before Prometheus (which provides the ServiceMonitor CRD) is installed

Argh. I should have tested the change in a fresh Kubernetes cluster.

> Is the assertion that Prometheus starts up no longer needed?

helm install ... --wait waits until all resources are in a ready state. Therefore, the assertion was never required. I removed it because the StatefulSet is now also suffixed with the namespace and this cannot be expressed in the assertion.

Member

@xeniape xeniape left a comment

Passes now 👍 thanks :)

@siegfriedweber siegfriedweber added this pull request to the merge queue Jan 14, 2026
Merged via the queue into main with commit c0275bb Jan 14, 2026
10 checks passed
@siegfriedweber siegfriedweber deleted the test/fix-metrics-test branch January 14, 2026 13:53
@siegfriedweber siegfriedweber moved this from Development: In Review to Development: Done in Stackable Engineering Jan 14, 2026
Labels

None yet

Projects

Status: Development: Done

3 participants