feat(code_executors): Add GkeCodeExecutor for sandboxed code execution on GKE #1629


Closed

syangx39 wants to merge 32 commits into google:main from syangx39:main

syangx39 commented Jun 24, 2025

close #2170

Summary

This PR introduces GkeCodeExecutor, a new code executor that provides a secure and scalable way to run LLM-generated code. It serves as an alternative to local or standard containerized executors by running code in the GKE Sandbox environment, which uses gVisor for workload isolation.

For each code execution request, it dynamically creates an ephemeral Kubernetes Job with a hardened Pod configuration, offering significant security benefits and ensuring that each code execution runs in a clean, isolated environment.

Key Features of GkeCodeExecutor

Dynamic Job Creation: Uses the Kubernetes batch/v1 API to create a new Job for each code snippet.
Secure Code Mounting: Injects code into the Pod via a temporary ConfigMap, which is mounted to a read-only file.
gVisor Sandboxing: Enforces execution within a gvisor runtime for kernel-level isolation.
Hardened Security Context: Pods run as non-root with all Linux capabilities dropped and a read-only root filesystem.
Resource Management: Applies configurable CPU and memory limits to prevent abuse.
Automatic Cleanup: Uses the ttl_seconds_after_finished feature on Jobs for robust, automatic garbage collection of completed Pods and Jobs.
Node Scheduling: The executor uses Kubernetes tolerations in its Pod specification. This allows the k8s scheduler to place the execution Pod onto a pre-configured gVisor-enabled node.
Module Integration: The GkeCodeExecutor is registered in the code_executors/__init__.py, making it available for use by agents. The ImportError handling is configured to check for the required kubernetes SDK.
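
The features above can be pictured as a single Job manifest. The following is a minimal sketch, not the PR's actual code: it builds the manifest as a plain dict (the Kubernetes Python client accepts dict bodies), and the function name, container name, ConfigMap key, user ID, resource limits, and TTL are all assumptions chosen for illustration. The toleration key shown is the taint that GKE Sandbox node pools carry, to the best of my knowledge.

```python
from typing import Any


def build_job_manifest(
    job_name: str,
    configmap_name: str,
    image: str = "python:3.11-slim",
    cpu_limit: str = "500m",   # assumed default, not from the PR
    mem_limit: str = "512Mi",  # assumed default, not from the PR
    ttl_seconds: int = 600,    # assumed default, not from the PR
) -> dict[str, Any]:
    """Builds a batch/v1 Job manifest with a hardened, gVisor-sandboxed Pod."""
    container = {
        "name": "code-runner",  # hypothetical container name
        "image": image,
        "command": ["python3", "/app/code.py"],
        # The code arrives through a read-only ConfigMap mount.
        "volumeMounts": [{"name": "code", "mountPath": "/app", "readOnly": True}],
        "securityContext": {
            "runAsNonRoot": True,
            "runAsUser": 1001,  # any non-zero UID; 1001 is an assumption
            "allowPrivilegeEscalation": False,
            "readOnlyRootFilesystem": True,
            "capabilities": {"drop": ["ALL"]},
        },
        "resources": {"limits": {"cpu": cpu_limit, "memory": mem_limit}},
    }
    pod_spec = {
        "restartPolicy": "Never",
        "runtimeClassName": "gvisor",
        # Lets the scheduler place the Pod on a tainted gVisor node pool.
        "tolerations": [{
            "key": "sandbox.gke.io/runtime",
            "operator": "Equal",
            "value": "gvisor",
            "effect": "NoSchedule",
        }],
        "containers": [container],
        "volumes": [{
            "name": "code",
            "configMap": {
                "name": configmap_name,
                "items": [{"key": "code.py", "path": "code.py"}],
            },
        }],
    }
    return {
        "apiVersion": "batch/v1",
        "kind": "Job",
        "metadata": {"name": job_name},
        "spec": {
            "backoffLimit": 0,  # do not re-run failed untrusted code
            "ttlSecondsAfterFinished": ttl_seconds,
            "template": {"spec": pod_spec},
        },
    }
```

A dict like this could be passed as the `body` argument of `BatchV1Api.create_namespaced_job`; the real executor may well build typed client objects instead.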

Execution Flow:

Agent invokes GkeCodeExecutor with the LLM-generated code.
GkeCodeExecutor.execute_code creates a temporary ConfigMap holding the code, then creates a k8s Job to run it.
This Job runs a standard python:3.11-slim container. The image is pulled once to the node and cached. The Job mounts the ConfigMap as /app/code.py.
The GkeCodeExecutor monitors the Job to completion, fetches stdout/stderr logs from the container, returns a CodeExecutionResult to the LlmAgent, and ensures all temporary resources are deleted.
The calling agent formats the result and provides a final response to the user. If the result contains an error, it will retry up to error_retry_attempts times.
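
The flow above can be sketched as one function with cleanup in a `finally` block. This is an illustrative outline under stated assumptions, not the PR's implementation: `SandboxBackend`, `FakeBackend`, `ExecutionResult`, `run_in_sandbox`, and the resource-name prefixes are all hypothetical names, and the backend stands in for the Kubernetes API calls so the sketch runs without a cluster.

```python
import uuid
from dataclasses import dataclass
from typing import Protocol


@dataclass
class ExecutionResult:
    """Simplified stand-in for the executor's result type."""
    stdout: str = ""
    stderr: str = ""


class SandboxBackend(Protocol):
    """Hypothetical wrapper around the Kubernetes calls the flow needs."""
    def create_configmap(self, name: str, code: str) -> None: ...
    def create_job(self, name: str, configmap_name: str) -> None: ...
    def wait_for_job(self, name: str, timeout_seconds: int) -> bool: ...
    def read_logs(self, name: str) -> str: ...
    def delete_configmap(self, name: str) -> None: ...


def run_in_sandbox(
    code: str, backend: SandboxBackend, timeout_seconds: int = 300
) -> ExecutionResult:
    """Runs one snippet: ConfigMap -> Job -> wait -> logs -> cleanup."""
    suffix = uuid.uuid4().hex[:10]
    configmap_name = f"code-src-{suffix}"
    job_name = f"adk-exec-{suffix}"
    try:
        backend.create_configmap(configmap_name, code)
        backend.create_job(job_name, configmap_name)
        succeeded = backend.wait_for_job(job_name, timeout_seconds)
        logs = backend.read_logs(job_name)
        # Logs of a failed Job are reported as stderr in this sketch.
        return ExecutionResult(stdout=logs) if succeeded else ExecutionResult(stderr=logs)
    finally:
        # Runs on success, failure, or exception; the Job itself is assumed
        # to be garbage-collected by its TTL.
        backend.delete_configmap(configmap_name)


class FakeBackend:
    """In-memory backend so the sketch can be exercised locally."""

    def __init__(self, succeed: bool = True):
        self.succeed = succeed
        self.configmaps: dict[str, str] = {}
        self.deleted: list[str] = []

    def create_configmap(self, name: str, code: str) -> None:
        self.configmaps[name] = code

    def create_job(self, name: str, configmap_name: str) -> None:
        pass

    def wait_for_job(self, name: str, timeout_seconds: int) -> bool:
        return self.succeed

    def read_logs(self, name: str) -> str:
        return "hello\n" if self.succeed else "Traceback (most recent call last): ..."

    def delete_configmap(self, name: str) -> None:
        self.configmaps.pop(name, None)
        self.deleted.append(name)
```

Putting the ConfigMap deletion in `finally` is the design point: the temporary resource is removed even when Job creation or log retrieval raises.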

Author

syangx39 commented Jul 7, 2025

@hangfei could you assign a reviewer for my PR

eliaslevy reviewed

eliaslevy commented Jul 9, 2025

@syangx39 what about blocking network access from the sandbox? Seems like we'd want to add that in. Or are you expecting the developer to create a network policy on his own and apply it to the jobs?

syangx39 commented

Author

syangx39 left a comment

Two action items are left for follow-up:

A follow-up PR on input/output file support with _create_pvc().
Regarding this comment: "@syangx39 what about blocking network access from the sandbox? Seems like we'd want to add that in. Or are you expecting the developer to create a network policy on his own and apply it to the jobs?"
gVisor's default network sandboxing in GKE specifically blocks access to sensitive host-level endpoints like the GKE metadata server. A pod running in the gVisor sandbox can still make network calls to public services on the internet.
If gke_code_executor blocks network access, the sandboxed code will not be able to pip install packages from the internet. So here is what I'm thinking:
The best way to handle dependencies is ahead of time. A developer would build an image with the required packages already installed via a requirements.txt file. They would then push this image to Artifact Registry and configure the executor to use their custom image instead of the default python:3.11-slim.
In any case, I will add a block_network_access flag for flexibility and default it to True. To keep this PR readable, I will do that in a follow-up PR.
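
For readers wondering what a developer-supplied policy might look like in the meantime, here is a minimal sketch of a deny-all-egress NetworkPolicy built as a plain dict. It is an assumption about one possible approach, not the planned block_network_access implementation: the function name and the pod label are hypothetical, and the policy only takes effect on clusters where network policy enforcement is enabled.

```python
from typing import Any


def build_deny_egress_policy(
    name: str, namespace: str, pod_labels: dict[str, str]
) -> dict[str, Any]:
    """Builds a NetworkPolicy that blocks all outbound traffic from matching Pods."""
    return {
        "apiVersion": "networking.k8s.io/v1",
        "kind": "NetworkPolicy",
        "metadata": {"name": name, "namespace": namespace},
        "spec": {
            # Applies only to Pods carrying these labels.
            "podSelector": {"matchLabels": pod_labels},
            # Declaring Egress with no allow rules denies all egress,
            # including DNS lookups.
            "policyTypes": ["Egress"],
            "egress": [],
        },
    }


# Hypothetical label; the executor's Pods would need to carry it.
policy = build_deny_egress_policy(
    "deny-sandbox-egress", "default", {"app": "adk-code-executor"}
)
```

With a policy like this in place, pip install from inside the sandbox fails, which is why the comment above recommends baking dependencies into a custom image.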

syangx39 requested a review from eliaslevy

July 11, 2025 05:54

eliaslevy approved these changes

Collaborator

hangfei commented Jul 16, 2025

What's the startup time for GkeCodeExecutor? Should the user start it first, or can it be triggered on demand?

hangfei reviewed

Collaborator

hangfei left a comment

Please create an issue and a doc to update the docs at the adk-docs repo.

src/google/adk/code_executors/gke_code_executor.py Outdated

    
                  each execution request. The user's code is mounted via a ConfigMap, and the

                  Pod is hardened with a strict security context and resource limits.

                  Key Features:

Collaborator

hangfei Jul 16, 2025

no need to add this to source code.

Author

syangx39 Jul 25, 2025

I think this 'Key Features' section serves as a high-level summary so a future developer can immediately grasp the component's security posture and design without having to read the whole implementation.

Do you think we could keep it for that reason? I feel it adds a lot of value for future maintainability.

Collaborator

hangfei Jul 30, 2025

fair point.

src/google/adk/code_executors/gke_code_executor.py Outdated

    
                  file at: contributing/samples/gke_agent_sandbox/deployment_rbac.yaml

                  """

                  namespace: str = "default"

                  image: str = "python:3.11-slim"

Collaborator

hangfei Jul 16, 2025

does it only work with 3.11?

Author

syangx39 Jul 25, 2025

Not necessarily 3.11, but it has to be Python: https://github.com/syangx39/adk-python/blob/5533fbf31085a98184b91bda48b36a515524ea84/src/google/adk/code_executors/gke_code_executor.py#L110

I can patch a follow-up PR to make the command configurable, turning it into a more generic script executor.

Collaborator

hangfei Jul 30, 2025

please. thanks.

hangfei reviewed

src/google/adk/code_executors/gke_code_executor.py Outdated

    
                  _batch_v1: client.BatchV1Api

                  _core_v1: client.CoreV1Api

                  def __init__(self, **data):

Collaborator

hangfei Jul 16, 2025

does this work for AI Studio api key or EasyGCP?

If not, is it possible to throw an exception when it inits?

Author

syangx39 Jul 25, 2025

The AI Studio API key is irrelevant for this component. The GkeCodeExecutor's only job is to communicate with the Kubernetes API, either via a ServiceAccount token or a kubeconfig file.

If load_kube_config() fails, it raises a kubernetes.config.ConfigException. This exception will halt the initialization and cause the GkeCodeExecutor constructor to fail, which I think is the correct behavior when no valid Kubernetes credentials can be found.

hangfei added the wip label

Collaborator

hangfei commented Jul 16, 2025

How long does it take to spin up the code executor and finish it?

Collaborator

hangfei commented Jul 16, 2025

Please add unit tests.

Please add a sample under contributing/samples

Collaborator

hangfei commented Jul 17, 2025

Please make sure you add corresponding docs in adk-docs before submission. Thanks.

syangx39 mentioned this pull request

FEAT: Add GkeCodeExecutor for Secure Code Execution on GKE #2170

Closed

Author

syangx39 commented Jul 25, 2025

"Please add a sample under contributing/samples"
Added. contributing/samples/gke_agent_sandbox/deployment_rbac.yaml is the permissions file and contributing/samples/code_execution/gke_sandbox.agent.py is the agent app. Keeping them in different places might confuse readers; maybe I should move both under contributing/samples/code_execution/?
"Please add unit tests."
Added.

tests/unittests/code_executors/test_gke_code_executor.py::TestGkeCodeExecutor::test_create_job_manifest_structure[VERTEX] PASSED                                                            [100%]

"What's the startup time for GkeCodeExecutor?"
We haven't measured this, but I can give you a ballpark figure.
Here's the breakdown of startup time:

API latency: the agent calls the k8s control plane to create the ConfigMap and Job. This is negligible (milliseconds).
Pod scheduling: the k8s scheduler assigns the new Pod to a gVisor-enabled node. On a healthy cluster with available nodes, this takes milliseconds to seconds.
Pod initialization:
- Image pull is the biggest factor. The node must have the container image. On the first run (cold start) it is downloaded from the registry; since python:3.11-slim is a lightweight image, this takes seconds. On later runs (cached) it is negligible.
- gVisor sandbox creation: slight overhead (milliseconds?)
In my experience, the first run usually takes a few seconds on a healthy cluster; later runs are near-instant because the image is cached. We're working on cold-start optimization.

"Should the user start it first, or can it be triggered on demand?"
Triggered on demand. code_executor.execute_code(...) automatically runs analysis code on any new data files the user provides.
"Please create an issue and a doc to update the docs at the adk-docs repo. Please make sure you add corresponding docs in adk-docs before submission."
Sure. Do you have a specific page where you want me to put it?
Maybe https://google.github.io/adk-docs/tools/built-in-tools/#code-execution?

syangx39 requested a review from hangfei

July 25, 2025 08:27

Collaborator

hangfei commented Jul 30, 2025

5. maybe https://google.github.io/adk-docs/tools/built-in-tools/#code-execution?

This works: https://google.github.io/adk-docs/tools/built-in-tools/#code-execution

Collaborator

hangfei commented Jul 30, 2025

Overall LGTM. Thanks!

Let's do the following before merge:

Have an adk-docs PR for this change.
Fix the tests.

syangx39 mentioned this pull request

Fixed google/adk-docs#619

Closed

This was referenced Aug 15, 2025

feat(code_executors): Add GkeCodeExecutor for sandboxed code execution on GKE google/adk-docs#620

Closed

feat(code_executors): Add GkeCodeExecutor for sandboxed execution on GKE google/adk-docs#621

Merged

Author

syangx39 commented Aug 15, 2025

Overall LGTM. Thanks!

Let's do the following before merge:

Have an adk-docs PR for this change.

Fix the tests.

I added the adk-docs PR here: https://github.com/google/adk-docs/pull/621/files
What do you mean by "fix the tests"? The unit tests are already added in tests/unittests/code_executors/test_gke_code_executor.py and they all pass. Here's the log:

$ pip install -e .[test,gke]
$ pytest tests/unittests/code_executors/test_gke_code_executor.py
========================================================== test session starts ==========================================================
platform linux -- Python 3.12.3, pytest-8.4.1, pluggy-1.6.0
rootdir: /home/user/adk-python
configfile: pyproject.toml
plugins: asyncio-1.1.0, mock-3.14.1, anyio-4.10.0, langsmith-0.4.14, xdist-3.8.0
asyncio: mode=Mode.AUTO, asyncio_default_fixture_loop_scope=function, asyncio_default_test_loop_scope=function
collected 14 items                                                                                                                      

tests/unittests/code_executors/test_gke_code_executor.py ..............                                                           [100%]

========================================================== 14 passed in 7.94s ===========================================================

Author

syangx39 commented Aug 19, 2025

@hangfei
The doc PR got merged. Please merge this PR as well for consistency.
https://google.github.io/adk-docs/tools/built-in-tools/#gke-code-executor

Collaborator

hangfei commented Aug 19, 2025

thanks.

There are some formatting and test failures. Please fix them.
Could you share the steps to test the code executor so we can test it?

seanzhou1023 reviewed

pyproject.toml Outdated

seanzhou1023 reviewed

src/google/adk/code_executors/__init__.py Outdated

GWeale self-requested a review

August 19, 2025 23:39

Author

syangx39 commented Aug 25, 2025

There are some formatting and test failures. Please fix them.

Could you share the steps to test the code executor so we can test it?

Sounds good. I forgot to add "kubernetes" to [test] in pyproject.toml; fixed.
For testing, this is the plan:

GkeCodeExecutor Validation Plan

Step 1: Set Up the GKE Autopilot Cluster

gcloud container clusters create-auto adk-sandbox-cluster \
  --project="<YOUR_PROJECT_ID>" \
  --region="<YOUR_GCP_REGION>"

Step 2: Deploy the Agent and Send Predictions

Apply RBAC permissions: kubectl apply -f deployment_rbac.yaml
Build and push the adk agent image.
Update deployment_agent.yaml with your image path and run kubectl apply -f ...
Forward the service port: kubectl port-forward svc/adk-agent-service 8080:80 -n adk-sandbox
Test gVisor sandbox execution:

curl -X POST http://localhost:8080/invoke \
-H "Content-Type: application/json" \
-d '{
    "prompt": "Write a Python script to query the GKE metadata server for the node'\''s instance ID and print the result. If it fails, print the error."
}'
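
For illustration, the script the agent is asked to generate in this validation step might look roughly like the sketch below. It is not output captured from the PR: `probe_metadata` is a hypothetical name, and the `urlopen` parameter exists only so the function can be exercised without network access. The URL and the `Metadata-Flavor: Google` header are the standard way to query the GCE/GKE metadata server.

```python
import urllib.error
import urllib.request

METADATA_URL = "http://metadata.google.internal/computeMetadata/v1/instance/id"


def probe_metadata(urlopen=urllib.request.urlopen, timeout: float = 3.0) -> str:
    """Queries the metadata server for the instance ID and reports the outcome."""
    request = urllib.request.Request(
        METADATA_URL, headers={"Metadata-Flavor": "Google"}
    )
    try:
        with urlopen(request, timeout=timeout) as response:
            return "instance id: " + response.read().decode()
    except (urllib.error.URLError, OSError) as err:
        # In a working sandbox this branch is the expected one, since the
        # metadata server should not be reachable from the Pod.
        return f"metadata server unreachable: {err}"


if __name__ == "__main__":
    print(probe_metadata())
```

If the sandbox is working as described earlier in the thread, the request should fail and the error branch should be the one printed.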

syangx39 requested a review from seanzhou1023

August 25, 2025 20:17

Collaborator

GWeale commented Sep 5, 2025

LGTM
all scenarios tested worked!

syangx39 added 11 commits

September 5, 2025 16:07


          [07/25] modify pyproject.toml to add gke-specific dependency

efd7a76


          [08/15] rename cpu_request


          [08/15] rename cpu_request

5b7595c


          [08/15] rename cpu_request

1a6eca2


          [08/25] Modify ADK

6010a3b


          [08/25] Modify ADK

4dc4753


          [08/25] Modify ADK

9ec5e8b


          [08/25] Modify ADK

3c962ff


          [08/25] Modify ADK

ae13800


          [08/25] Modify ADK

d3c1dd0


          [08/28] fix kubeconfig

67ebf38

GWeale force-pushed the main branch from c3e4dbf to 67ebf38 Compare

September 5, 2025 23:32

GWeale added 2 commits

September 5, 2025 17:20


          fix(gke): add future annotations for py3.9 compatibility #non-breaking

5a7002a


          fix(gke_code_executor): add annotations to the fil

7b40d22

GWeale force-pushed the main branch from f695a15 to 7b40d22 Compare

September 6, 2025 00:40

GWeale added 2 commits

September 5, 2025 17:44


          style(gke): use relative imports; no cli imports #non-breaking

2049d68


          style(gke): make imports relative; avoid cli import pattern #non-breaking

adb2ad4

GWeale force-pushed the main branch from 0e7306d to adb2ad4 Compare

September 6, 2025 00:49


          test(gke): expose client/config for monkeypatch; fix imports #non-breaking

75814e9

GWeale force-pushed the main branch from 579a027 to 75814e9 Compare

September 6, 2025 00:56

GWeale approved these changes

GWeale added ready to pull and removed wip labels


          chore(pyproject): keep-sorted ordering for kubernetes entries #non-breaking

cd6d518

GWeale force-pushed the main branch from 32bb89b to cd6d518 Compare

September 6, 2025 01:51

GWeale and others added 2 commits

September 5, 2025 18:51


          Merge branch 'main' into main

3f57383


          Merge branch 'main' into main

869ae1a

copybara-service bot pushed a commit that referenced this pull request


          feat: Add GkeCodeExecutor for sandboxed code execution on GKE #non-breaking

72ff9c6

Merge #1629

PiperOrigin-RevId: 804511467

Collaborator

GWeale commented Sep 8, 2025

Merged!

GWeale closed this
