Replies: 29 comments
-
#36 - Common Observability APIThere is an ongoing experiment of extracting Gardener Observability out of core and into an extension (no document or plan exists yet). There are 2 main issues that need to be tackled for this to happen:
This topic covers the first part - Common Observability API. |
Beta Was this translation helpful? Give feedback.
-
#35 - gardener-apiserver: Enable
|
Beta Was this translation helpful? Give feedback.
-
#34 - PVC Autoscaler: PoC to automatically recovery from
|
Beta Was this translation helpful? Give feedback.
-
#33 - PVC Autoscaler: PoC to automate downscaling of
|
Beta Was this translation helpful? Give feedback.
-
#32 - Rsyslog-relp: Support
|
Beta Was this translation helpful? Give feedback.
-
#31 - Dev: Define AGENTS/SKILLS.md files/templates suitable for the Gardener orgThe AI tools/helpers adoption is growing, it would be nice repos in gardener org to be prepared with configuration files instructing the models how to behave better. |
Beta Was this translation helpful? Give feedback.
-
#30 - Extension Controllers: Custom histogram metric for reconciliation times with better bucket allocationsController runtime already provides such a metric, however its bucket allocation does not fit Gardener needs as the slowest bucket is for the range This metric has already enabled Prometheus native histogram which provides dynamic bucket allocation and it would have been sufficient to monitor "slow" controllers, however the Similar to gardener/gardener#10971, a custom metric with better bucket allocation can be introduced for the purpose of monitoring the reconciliation times of the extension controllers. |
Beta Was this translation helpful? Give feedback.
-
#29 - Lakom: Extension controller
|
Beta Was this translation helpful? Give feedback.
-
#28 - Functional local setup with Workload IdentityIt would be nice to have a functional local setup that is making use of the Workload Identity as credentials. It should be possible the seed cluster(s) to be configured via structured authentication to trust the Workload Identity tokens, the permissions bound to the Workload Identity token can be configured via RBAC. Controllers like MCM-provider-local should be able to use the Workload Identity token to create the machine (pod in the seed). Other controllers, like infrastructure, backup bucket|entry, DNS if applicable, should also make use of the Workload Identity token. Later, this should allow implementation of e2e tests with shoots using Workload Identity. |
Beta Was this translation helpful? Give feedback.
-
#27 - Local Setup: Secure the registry with HTTPS certWith gardener/gardener#13751 Gardener is adding support for registries using custom CA, this would allow registry.local.gardener.cloud to be served over HTTPS instead of HTTP, respectively all configurations configuring the registry as insecure can be removed. Other components, like lakom, will be able to remove the insecure option allowing registries served in plain text. |
Beta Was this translation helpful? Give feedback.
-
#26 - Diki: Implement a Pod Security Standards rulesetThe Pod Security Standards (PSS for short) define three policies to classify the security postures of Kubernetes Pods. Each of these policies is defined by an exhaustive list of guidelines and requirements, as shown in the official Kubernetes documentation. Diki could benefit from the implementation of a ruleset that checks if the Pods are adhering to those policies. Some additional tips/suggestions for the implementation:
ruleset:
rulesetID: "pod-security-standards"
rulesetName: "Pod Security Standards ruleset"
rulesetVersion: "alpha"
args:
minimalSecurityProfile: ("privileged"|"restricted"|"baseline") # the minimal allowed security profile
|
Beta Was this translation helpful? Give feedback.
-
#25 - Diki: Add support for Merge of rule OptionsCurrently there is an open issue in diki for this feature. There should be a way to merge |
Beta Was this translation helpful? Give feedback.
-
#24 - Diki as a ServiceCurrently, we recommend Gardener users to use the Diki CLI for running compliance scans in their Shoot clusters. This leads to users having to manage the execution of Diki runs themselves, including scheduling, configuration, report collection and storage. The current initiative is to provide a Diki service within Gardener that will improve the user experience and streamline compliance management. Development has been initiated with the diki-operator, which is a new Kubernetes operator that will be responsible for managing diki runs based on custom resource definitions (CRDs).
There is also a need for a new Shoot extension that will deploy the |
Beta Was this translation helpful? Give feedback.
-
#23 - Diki: Pod worker poolCurrently, To address this issue, there can be a pod worker pool that can be used by multiple rules. This worker pool should:
Note Currently, rules that create pods have the options to not create pods on all nodes via the |
Beta Was this translation helpful? Give feedback.
-
#22 - [GEP-28] Self-Hosted Shoot control plane Node restorationThe Gardener Core Sofia team has already clarified several scenarios for disaster recovery of a self-hosted Shoot cluster. The main prominent scenario is a control plane restoration. @Kostov6 has a PoC on how to restore a control plane in a new Node in a local setup. We would like to discuss and brainstorm about several questions:
We would love to have a brainstorming together. |
Beta Was this translation helpful? Give feedback.
-
#21 - Extensions: Leaking resources in (virtual-)garden clusterAnother thing we can improve is related to |
Beta Was this translation helpful? Give feedback.
-
#20 - Extensions: Use controller-runtime abstraction for validating/mutating webhooksCurrently we use a custom abstraction for validating/mutating webhooks in the Extensions API, which we can probably replace with the upstream custom validators and mutators provided by controller-runtime. Full details in this issue: |
Beta Was this translation helpful? Give feedback.
-
#19 - Extension: CloudNativePGWhat is this extension about?This extension integrates CloudNativePG into Gardener-managed Kubernetes clusters, providing automated deployment and lifecycle management of production-grade PostgreSQL databases. CloudNativePG OverviewCloudNativePG is a Kubernetes operator that manages PostgreSQL database clusters natively within Kubernetes. It provides:
Why?The Problem: Relational Data in Cloud-Native EcosystemsThe NeoNephos ecosystem hosts multiple projects that produce or consume data fundamentally unsuitable for etcd. While etcd excels as a distributed key-value store for Kubernetes state, it is explicitly not designed for:
Projects that can make use of PostgreSQL (non-exhaustive list)The following projects have data requirements that can make use of a proper relational database:
Why Not Use Managed Cloud Databases?While cloud providers offer managed PostgreSQL (AWS RDS, Azure Database, GCP Cloud SQL), there are compelling reasons to run self-managed PostgreSQL via CloudNativePG:
The Gardener Extension Value PropositionBy providing CloudNativePG as a Gardener extension, we enable:
|
Beta Was this translation helpful? Give feedback.
-
#18 - PoC: repo tools integration with extension repositoriesRepository owners should focus on writing go code that gets shipped as production ready images. Any maintenance work that is not around go code is a burden. More precisely that kind of maintenance is based around make targets and hack scripts. This is documented in an issue, but the problem has been discussed long before that. I currently have a POC, based on a subtree approach with the following story: Extraction of dependencies into a separate subtree and generalizing usage - this will help with setup of new repositories in the future. The idea came from the effort it took for the Re-vendoring Gardener versions that include changes to the hack helper scripts or make targets in By separating the shared scripts and common make targets from Actual goal: |
Beta Was this translation helpful? Give feedback.
-
#17 - OTEL data plane collector metric persistenceCurrently, an OTEL collector is being developed for shoot data planes, which will serve 2 purposes.
This tasks is to create a POC for the second point. A general idea is to use the already existing OTEL collector pipelines and add a processor/extractor that has the option for metric retention. The goal is a moving target, but initially, being able to retain metrics for 10-15 minutes would be a win, as network connectivity issues don't usually persist for too long and this would be useful for stakeholders. |
Beta Was this translation helpful? Give feedback.
-
#16 - [GEP-28] Ensure all system pods run on control plane nodesCurrently, it is not ensured (via node selector or similar placement strategies) that system components in a self-hosted shoot (pods in |
Beta Was this translation helpful? Give feedback.
-
#15 - [GEP-28] Implement public CA bundle discovery mechanismCurrently, when running Let's investigate this process and consider implementing it for our use-case as well. |
Beta Was this translation helpful? Give feedback.
-
#14 - [GEP-28] Eliminate static admin token from control plane components after
|
Beta Was this translation helpful? Give feedback.
-
#13 - Extensions: Helm v4 and handling of
|
Beta Was this translation helpful? Give feedback.
-
#12 - Resolve the Istio Metrics LeakCurrently, istio metrics are disabled because metrics for no longer existing kube-apiserver instances are served until istio finally restarts. This leads to a huge increase in metrics size, which can lead to congestion, cost explosion and metrics retention reduction. We should figure out how to report only the relevant istio metrics. |
Beta Was this translation helpful? Give feedback.
-
#11 - Dual-Stack Seed APIAs one of the last steps missing for full Gardener Dual-Stack support, the Seed API needs to be extended. |
Beta Was this translation helpful? Give feedback.
-
#10 - Reduce number of Istio Ingress GatewaysIn a standard multi-zonal seed cluster, there is one multi-zonal istio ingress gateway and one per availability zone. The multi-zonal istio ingress gateway could be replaced by usage of all single-zone istio ingress gateways. This could lead to higher resource usage, reduced costs and a less complicated setup. |
Beta Was this translation helpful? Give feedback.
-
#9 - More real loadbalancers for
|
Beta Was this translation helpful? Give feedback.
-
#8 - [GEP-28] Continue work on
|
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
π Hackathon Topic Voting Results
π Ranked Topics
gind(gardener-in-docker analogous tokind)kube-api-linterclass=seedPersistentVolumeClaimsexpansion failuresgardenadm connectprovider-localPersistentVolumeClaimscrdsGardenandSeedextensionsBeta Was this translation helpful? Give feedback.
All reactions