feat(api): Implement connector support via static configuration #4263

jaideepr97 · 2025-12-01T10:04:15Z

What does this PR do?

Implements support for configuring static connectors in the stack via run.yaml. Major features of this PR:

Adds a new configurable object called connector
connectors can be configured under connectors in the run.yaml
config allows stack admins to specify connector_id, server URL and label
Adds connectors as a new internal API implementation
Implements API surface outlined in feat(api): add readonly connectors API #4258
updates meta-reference agent provider to have access to the connectors API implementation and handles plumbing
updates responses API MCP tool input to accept either server_url or connector_id and handles resolution

Examples:

run.yaml config:

version: 2
image_name: llamastack-crimson
container_image: null
apis: ...
providers: ...
storage: ...
models: ...
connectors:
  - connector_id: kubernetes
    url: "http://localhost:8080/mcp"
    connector_type: mcp
...

API requests:

curl -s http://localhost:8321/v1alpha/connectors | jq 

Response:
{
  "data": [
    {
      "connector_type": "mcp",
      "connector_id": "kubernetes",
      "url": "http://localhost:8080/mcp",
      "server_label": null,
      "server_name": null,
      "server_description": null,
      "server_version": null
    }
  ]
}

curl -s http://localhost:8321/v1alpha/connectors/kubernetes | jq 

Response:                                                   
{
  "connector_type": "mcp",
  "connector_id": "kubernetes",
  "url": "http://localhost:8080/mcp",
  "server_label": null,
  "server_name": "kubernetes-mcp-server",
  "server_description": null,
  "server_version": "v0.0.55"
}

curl -s http://localhost:8321/v1alpha/connectors/kubernetes/tools/resources_list | jq 

Response:
{
  "toolgroup_id": null,
  "name": "resources_list",
  "description": "List Kubernetes resources and objects in the current cluster by providing their apiVersion and kind and optionally the namespace and label selector\n(common apiVersion and kind include: v1 Pod, v1 Service, v1 Node, apps/v1 Deployment, networking.k8s.io/v1 Ingress)",
  "input_schema": {
    "properties": {
      "apiVersion": {
        "description": "apiVersion of the resources (examples of valid apiVersion are: v1, apps/v1, networking.k8s.io/v1)",
        "type": "string"
      },
      "kind": {
        "description": "kind of the resources (examples of valid kind are: Pod, Service, Deployment, Ingress)",
        "type": "string"
      },
      "labelSelector": {
        "description": "Optional Kubernetes label selector (e.g. 'app=myapp,env=prod' or 'app in (myapp,yourapp)'), use this option when you want to filter the pods by label",
        "pattern": "([A-Za-z0-9][-A-Za-z0-9_.]*)?[A-Za-z0-9]",
        "type": "string"
      },
      "namespace": {
        "description": "Optional Namespace to retrieve the namespaced resources from (ignored in case of cluster scoped resources). If not provided, will list resources from all namespaces",
        "type": "string"
      }
    },
    "required": [
      "apiVersion",
      "kind"
    ],
    "type": "object"
  },
  "output_schema": null,
  "metadata": {
    "endpoint": "http://localhost:8080/mcp"
  }
}

Client side usage example:

import os
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8321/v1", api_key=os.getenv("OPENAI_API_KEY"))
respB = client.responses.create(
        model=args.model,
        tools=[
            {
                "type": "mcp",
                "server_label": MCP_LABEL,
                "connector_id": "kubernetes",
                "require_approval": "never",
            }
        ],
        input=[{
            "role": "user",
            "content": (
                "List what kubernetes MCP tools you are allowed to use in this context.Tell me something about the cluster. \
                Try to call only the MCP tools that you have access to, and tell me which tools you called. If none are available, explain why."
            )
        }],
    )
    pretty_print_result("B: no restriction at the MCP tool (server) level, tool choice is mcp with server label and tool name", respB)

output:

=== B: no restriction at the MCP tool (server) level, tool choice is mcp with server label and tool name ===
Output text:
 I don't have access to user's context (e.g., cluster details). However, the available MCP tools for Kubernetes are:  
1. **configuration_view** - Shows Kubernetes configuration.  
2. **events_list** - Lists Kubernetes events.  
3. **namespaces_list** - Lists Kubernetes namespaces.  
4. **nodes_log** - Retrieves node logs.  
5. **nodes_stats_summary** - Gets node summary stats.  
6. **nodes_top** - Shows CPU and memory usage.  
7. **pods_list** - Lists all pods.  
8. **pods_list_in_namespace** - Lists pods in a namespace.  
9. **pods_delete** - Deletes pods.  
10. **pods_run** - Runs pods.  
11. **pods_get** - Retrieves pods.  
12. **resources_list** - Lists cluster resources.  

These tools help manage cluster configurations, track events, monitor nodes, and interact with pods. Let me know if you'd like to use specific ones!

Closes #4186 and #4061 (partially)

Test Plan

pending

NOTE: this PR builds on top of #4258 therefore it also contains changes from it. This PR should only be reviewed after #4258

cdoern

one comment so far, the implementation looks good especially compared to prompts which follows a similar structure.

cdoern · 2025-12-01T14:57:51Z

src/llama_stack/core/connectors/connectors.py

+    Registry,
+    ToolDef,
+)
+from llama_stack_api.common.errors import (


include this in the from llama_stack_api import. or If we missed these please add them to llama_stack_api's __init__.py thanks!

updated, thanks!

github-actions · 2025-12-01T20:41:57Z

✱ Stainless preview builds

This PR will update the llama-stack-client SDKs with the following commit message.

feat(api): Implement connector support via static configuration

Edit this comment to update it. It will appear in the SDK's changelogs.

✅ llama-stack-client-node studio · code · diff

Your SDK built successfully.
generate ⚠️ → build ✅ → lint ✅ → test ✅
npm install https://pkg.stainless.com/s/llama-stack-client-node/6de68db8887ad05a02df05d2f86e51fe77ce90a1/dist.tar.gz

✅ llama-stack-client-kotlin studio · code · diff

Your SDK built successfully.
generate ⚠️ → lint ✅ → test ❗

✅ llama-stack-client-python studio · code · diff

Your SDK built successfully.
generate ⚠️ → build ✅ → lint ✅ → test ✅
pip install https://pkg.stainless.com/s/llama-stack-client-python/e30c1e431887bd5d892282810e37d8349a49bc55/llama_stack_client-0.4.0a14-py3-none-any.whl

✅ llama-stack-client-go studio · code · diff

Your SDK built successfully.
generate ❗ → lint ❗ → test ❗
go get github.com/stainless-sdks/llama-stack-client-go@4d517d6907f9eeb14dafe5acec7e900b5e964585

This comment is auto-generated by GitHub Actions and is automatically kept up to date as you push.
Last updated: 2025-12-24 14:21:56 UTC

src/llama_stack/providers/inline/agents/meta_reference/responses/streaming.py

mergify · 2025-12-04T05:21:53Z

This pull request has merge conflicts that must be resolved before it can be merged. @jaideepr97 please rebase it. https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

jaideepr97 · 2025-12-15T14:29:39Z

added unit tests for this feature. Looked into adding an integration test as well, but ran into a chicken and egg problem with the connector needing to be configured in the stack at boot time, but the mcp servers will need to be spun up at runtime with a port not known beforehand

This feature is currently incomplete with only read operations supported for connectors. IMO, we should wait until add/delete endpoints are available (#4239 ) for connectors so that the feature is truly complete and it is easier to write integration tests when connectors can be registered/unregistered on the fly

src/llama_stack/core/connectors/connectors.py

jaideepr97 · 2025-12-19T16:29:12Z

cc @mattf @cdoern would be great if yall could also PTAL

ashwinb

lgtm

mattf

question about the future use of connectors: should each responses provider need to handle connectors or should the inference router convert connector_id to connector_url before dispatching to providers?

mattf · 2025-12-20T13:10:47Z

src/llama_stack/core/connectors/connectors.py

+        # Upsert: always write (idempotent for boot-time registration)
+        if await self.kvstore.get(self._get_key(connector_id)):
+            logger.debug(f"Found existing connector: {connector_id}, overwritting it")
+        await self.kvstore.set(self._get_key(connector_id), connector.model_dump_json())


this is unconditional overwriting. idempotent would be to accept if the value is the same and reject otherwise.

mattf · 2025-12-20T13:12:42Z

tests/unit/core/connectors/test_connectors.py

+        stored = await mock_kvstore.get(f"{KEY_PREFIX}my-mcp")
+        assert stored is not None
+
+    async def test_register_connector_idempotent(self, connector_service, mock_kvstore):


to validate idempotency, you also need a negative test, one where the id is the same but the values are different and the service rejects the registration.

mattf · 2025-12-20T13:18:31Z

client-sdks/stainless/openapi.yml

+        source:
+          $ref: '#/components/schemas/ConnectorSource'
+          description: "How the connector was registered: 'config' (from run.yaml) or 'api' (via API call)"
+          default: config


why expose this detail?

this field was added to be able to differentiate between stale config connectors and connectors that were registered via api
updated to exclude from schema

jaideepr97 · 2025-12-22T15:40:15Z

question about the future use of connectors: should each responses provider need to handle connectors or should the inference router convert connector_id to connector_url before dispatching to providers?

I'm not sure at the moment since we only have 1 agents/responses provider (do we plan to have more?)
that might require setting up a separate resolver layer that is outside of the agent provider scope but would also need to be MCP tool-aware if it is going to manipulate tools, if we are okay with that

cdoern

thanks for all of the back and forth on this one!

cdoern · 2025-12-22T18:56:46Z

@mattf PTAL, this is the last PR linked to an issue in the 0.4.0 milestone

mattf · 2025-12-23T19:16:02Z

docs/static/llama-stack-spec.yaml

+    ConnectorSource:
+      type: string
+      enum:
+      - config
+      - api
+      title: ConnectorSource
+      description: Source of connector registration.


this shouldn't be exposed

updated, should be fixed now

mattf · 2025-12-23T19:27:04Z

src/llama_stack/core/connectors/connectors.py

+        """Register a new connector (idempotent - updates if already exists)."""
+
+        connector = Connector(
+            connector_id=connector_id,
+            connector_type=connector_type,
+            source=source,
+            url=url,
+            server_label=server_label,
+            server_name=server_name,
+            server_description=server_description,
+        )
+
+        key = self._get_key(connector_id)
+        existing_connector_json = await self.kvstore.get(key)
+
+        if existing_connector_json:
+            existing_connector = Connector.model_validate_json(existing_connector_json)
+
+            # Only overwrite if the connector is an exact match; otherwise log and keep existing.
+            if existing_connector.model_dump() != connector.model_dump():
+                logger.info(
+                    "Connector %s already exists with different configuration; skipping overwrite",
+                    connector_id,
+                )
+                return existing_connector
+
+            logger.debug("Connector %s already exists and matches; overwriting with same value", connector_id)
+
+        # Persist full connector, including source (Field is excluded from schema/dumps by default).
+        connector_payload = connector.model_dump()
+        connector_payload["source"] = connector.source
+        await self.kvstore.set(key, json.dumps(connector_payload))
+
+        return connector


"idempotent - updates if already exists"

please self-review this code.

consider -

why update if the values are the same?

how will a user know their register attempt failed?

here's the logic I ended up with after another look:

config is the source of truth at boot time. config connectors can also never be updated, only registered. Therefore, config based connectors are allowed overwrites at registration time, so stack admins can take deliberate action to "update" any previously registered connector by updating the config and restarting the server. If the values are the same then it is a no-op

if a new connector has a different source (api) and has conflicts with an existing connector, then the registration fails, since that is technically an update operation and should happen via the update endpoint (once it is implemented in Add /admin level Create/Update/Delete endpoints for the connectors API #4239 ) and not at registration time

as for

how will a user know their register attempt failed?

the user will always be a stack admin since only they can register new connectors (via config or /admin api), so I left it as a log statement that they can check instead of raising an error

Signed-off-by: Jaideep Rao <jrao@redhat.com>

…andle stale config connectors Signed-off-by: Jaideep Rao <jrao@redhat.com>

Signed-off-by: Jaideep Rao <jrao@redhat.com>

… from openapi schema Signed-off-by: Jaideep Rao <jrao@redhat.com>

Signed-off-by: Jaideep Rao <jrao@redhat.com>

mattf

there's a debate about how to handle resources in config vs api. to be consistent, you should not include the source as an input param and resources defined in config can be unregistered + registered via api.

an admin for the service may not be the same as the service operator.

jaideepr97 · 2025-12-25T15:04:22Z

there's a debate about how to handle resources in config vs api. to be consistent, you should not include the source as an input param and resources defined in config can be unregistered + registered via api.

an admin for the service may not be the same as the service operator.

@ashwinb felt connectors should not be considered to be resources since they have no provider and are not routable and suggested to look at them as simple objects instead

Not sure how that informs the debate or connectors' subjectivity to it

jaideepr97 · 2025-12-26T15:46:45Z

@mattf providing some more context/thoughts/questions here based on further looking at what exists -

to be consistent, you should not include the source as an input param

I found RegistryEntrySource being used by models at present. So source tracking exists, but it seems to be baked into ResourceWithOwner and currently only utilized by models. It also doesn't seem to distinguish between config and api (as you referenced) and treats them both as "api"

Connectors are not resources therefore they do not inherit from ResourceWithOwner and currently handle source tracking directly in their own definition. If we want to stay consistent with the precedent of source tracking but not keep it in the connector definition I can extract it out to some parent class and inherit it from there but that seems rather forced and unnecessary given that this field is already not exposed

alternatively, if connectors should stick with the pattern of not differentiating between config and api as sources, I see the problem of stale objects occurring (which is a gap within the existing implementation IMO). Resources/objects previously configured via config but removed later are not automatically deleted from the KV store. This can only be handled in 2 ways I think:

identify if a seemingly stale object was registered via config or api - if api, leave it alone. if config, and it has been removed since, then that means it should be cleaned up at stack boot time.
leave it up to the api to manually query what objects have been registered and clean up stale ones

option 1 assumes that the config is meant to be an accurate reflection of the state of things at any given time as opposed to just a point of initialization. Maybe that is a key point that will decide what approach should be followed
How do we view the role of the config in this project?

If I have misunderstood, and you only meant to remove source from the function signature for register_connector then I would say that the source field is not going to be exposed to either config or API users, it will be set internally based on where it is coming from

resources defined in config can be unregistered + registered via api.

if we extend this to all objects, this would still be possible for connectors once API operations (other than read) are implemented in #4239 , as long as it is done in 2 steps - unregister.+ register. The only API operation I have blocked here is re-registering an existing connector via API in a single action.

an admin for the service may not be the same as the service operator.

That's fair - users registering connectors via API would not see the log statement. I think that should be handled in the follow up PR when that API is the focus. This PR focuses on the config based workflow, for which the logging is enough since only stack admins can set the config and see the logs. How does that sound?

All of this is also encapsulated by the open question about whether non-resources are expected to follow the same standards and patterns as resources

jaideepr97 requested review from ashwinb, bbrowning, cdoern, ehhuang, franciscojavierarceo, leseb, mattf and raghotham as code owners December 1, 2025 10:04

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Dec 1, 2025

jaideepr97 changed the title ~~feat: Implement connector support via static configuration~~ feat(api): Implement connector support via static configuration Dec 1, 2025

jaideepr97 mentioned this pull request Dec 1, 2025

Add support for OpenAI Connectors and MCP Registries #4061

Open

2 tasks

jaideepr97 force-pushed the impl-connectors branch from 174b8e9 to 5331e80 Compare December 1, 2025 10:24

cdoern reviewed Dec 1, 2025

View reviewed changes

jaideepr97 force-pushed the impl-connectors branch 2 times, most recently from 4fa016e to 01a76e0 Compare December 1, 2025 20:41

mattf reviewed Dec 2, 2025

View reviewed changes

src/llama_stack/providers/inline/agents/meta_reference/responses/streaming.py Outdated Show resolved Hide resolved

mergify bot added the needs-rebase label Dec 4, 2025

jaideepr97 force-pushed the impl-connectors branch from 01a76e0 to e6633bb Compare December 13, 2025 05:13

mergify bot removed the needs-rebase label Dec 13, 2025

jaideepr97 force-pushed the impl-connectors branch 4 times, most recently from f596311 to eb67320 Compare December 14, 2025 04:50

jaideepr97 force-pushed the impl-connectors branch 4 times, most recently from bf737bc to 78cdbfe Compare December 15, 2025 19:47

ashwinb requested changes Dec 15, 2025

View reviewed changes

src/llama_stack/core/connectors/connectors.py Outdated Show resolved Hide resolved

src/llama_stack/core/connectors/connectors.py Show resolved Hide resolved

jaideepr97 mentioned this pull request Dec 17, 2025

feat: implement fastapi router for connectors API #4402

Draft

jaideepr97 force-pushed the impl-connectors branch 2 times, most recently from e58b376 to 3cb63dc Compare December 19, 2025 12:51

ashwinb approved these changes Dec 19, 2025

View reviewed changes

mattf requested changes Dec 20, 2025

View reviewed changes

jaideepr97 force-pushed the impl-connectors branch from 3cb63dc to 3be402c Compare December 22, 2025 15:29

jaideepr97 force-pushed the impl-connectors branch from 3be402c to f834680 Compare December 22, 2025 18:17

cdoern approved these changes Dec 22, 2025

View reviewed changes

cdoern requested a review from mattf December 22, 2025 19:47

jaideepr97 mentioned this pull request Dec 23, 2025

fix: update responses limitations doc to track latest state #4392

Open

mattf requested changes Dec 23, 2025

View reviewed changes

jaideepr97 added 10 commits December 24, 2025 09:12

feat(api): implement connector support via static configuration

56e7d16

Signed-off-by: Jaideep Rao <jrao@redhat.com>

integrate connector_id support into agent provider

b98b8fd

Signed-off-by: Jaideep Rao <jrao@redhat.com>

make mcp import lazy

96fa6b2

Signed-off-by: Jaideep Rao <jrao@redhat.com>

add unit tests

0001b99

Signed-off-by: Jaideep Rao <jrao@redhat.com>

empty commit

ceb7f8d

stop treating connectors as resources, add source field to properly h…

69a558e

…andle stale config connectors Signed-off-by: Jaideep Rao <jrao@redhat.com>

make mcp a required core dependency

9815821

Signed-off-by: Jaideep Rao <jrao@redhat.com>

enforce connector idempotency, add unit test and exclude source field…

92e0fca

… from openapi schema Signed-off-by: Jaideep Rao <jrao@redhat.com>

change source to skipjsonschema

ed56c62

Signed-off-by: Jaideep Rao <jrao@redhat.com>

update logic for connector registration conflict cases

9f51de0

Signed-off-by: Jaideep Rao <jrao@redhat.com>

jaideepr97 force-pushed the impl-connectors branch from e65d620 to 9f51de0 Compare December 24, 2025 14:13

mattf requested changes Dec 25, 2025

View reviewed changes

Merge branch 'main' into impl-connectors

464969f

feat(api): Implement connector support via static configuration #4263

Are you sure you want to change the base?

feat(api): Implement connector support via static configuration #4263

Uh oh!

Conversation

jaideepr97 commented Dec 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Examples:

API requests:

Client side usage example:

Test Plan

Uh oh!

cdoern left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Dec 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✱ Stainless preview builds

Uh oh!

Uh oh!

mergify bot commented Dec 4, 2025

Uh oh!

jaideepr97 commented Dec 15, 2025

Uh oh!

Uh oh!

Uh oh!

jaideepr97 commented Dec 19, 2025

Uh oh!

ashwinb left a comment

Choose a reason for hiding this comment

Uh oh!

mattf left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jaideepr97 commented Dec 22, 2025

Uh oh!

cdoern left a comment

Choose a reason for hiding this comment

Uh oh!

cdoern commented Dec 22, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mattf left a comment

Choose a reason for hiding this comment

Uh oh!

jaideepr97 commented Dec 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jaideepr97 commented Dec 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

jaideepr97 commented Dec 1, 2025 •

edited

Loading

github-actions bot commented Dec 1, 2025 •

edited

Loading

jaideepr97 commented Dec 25, 2025 •

edited

Loading

jaideepr97 commented Dec 26, 2025 •

edited

Loading