TanStack · AlemTuzlak · Apr 23, 2026 · Apr 23, 2026 · Apr 23, 2026 · Apr 23, 2026
diff --git a/.changeset/otel-middleware.md b/.changeset/otel-middleware.md
@@ -0,0 +1,13 @@
+---
+'@tanstack/ai': minor
+---
+
+**OpenTelemetry middleware.** `otelMiddleware({ tracer, meter?, captureContent?, redact?, ... })` emits GenAI-semantic-convention traces and metrics for every `chat()` call.
+
+- Root span per `chat()` + child span per agent-loop iteration (named `chat <model> #<iteration>`) + grandchild span per tool call.
+- `gen_ai.client.operation.duration` (seconds) recorded **once per `chat()` call**; `gen_ai.client.token.usage` (tokens) recorded **per iteration** (one input + one output record). Metric attributes are kept low-cardinality — `gen_ai.response.model` and `gen_ai.response.id` are intentionally excluded.
+- `captureContent: true` attaches prompt/completion content as `gen_ai.{user,system,assistant,tool}.message` and `gen_ai.choice` span events. Redactor failures fail closed to a `"[redaction_failed]"` sentinel — raw content never leaks. Assistant text is capped at `maxContentLength` (default 100 000).
+- Four extension points for custom attributes, names, span-options, and end-of-span callbacks. Thrown callbacks are caught and logged to `console.warn` with a label so failures remain diagnosable.
+- `@opentelemetry/api` is an optional peer dependency. The middleware is exported from the dedicated subpath `@tanstack/ai/middlewares/otel` so that importing `@tanstack/ai/middlewares` does not eagerly require OTel.
+
+See `docs/advanced/otel.md` for the full guide.
diff --git a/docs/advanced/otel.md b/docs/advanced/otel.md
@@ -0,0 +1,171 @@
+---
+title: OpenTelemetry
+id: otel
+order: 4
+description: "Emit vendor-neutral OpenTelemetry traces and metrics from every TanStack AI chat() call, following the OTel GenAI semantic conventions."
+keywords:
+  - tanstack ai
+  - opentelemetry
+  - otel
+  - observability
+  - tracing
+  - metrics
+  - gen_ai
+  - semantic conventions
+---
+
+The `otelMiddleware` factory wires TanStack AI into your existing OpenTelemetry setup. Every `chat()` call produces a root span, one child span per agent-loop iteration, and one grandchild span per tool call — all with [GenAI semantic-convention attributes](https://opentelemetry.io/docs/specs/semconv/gen-ai/). It also records GenAI token and duration histograms when a `Meter` is provided.
+
+## Setup
+
+Install `@opentelemetry/api` — it's an optional peer dependency of `@tanstack/ai`:
+
+```bash
+pnpm add @opentelemetry/api
+```
+
+Wire up your OTel SDK however you already do (e.g. `@opentelemetry/sdk-node`). Then pass a `Tracer` (and optionally a `Meter`) into the middleware. The OTel middleware lives on its own subpath — importing it never affects users who don't need OTel:
+
+```ts
+import { chat } from '@tanstack/ai'
+import { otelMiddleware } from '@tanstack/ai/middlewares/otel'
+import { openaiText } from '@tanstack/ai-openai/adapters'
+import { trace, metrics } from '@opentelemetry/api'
+
+const otel = otelMiddleware({
+  tracer: trace.getTracer('my-app'),
+  meter: metrics.getMeter('my-app'),
+})
+
+const result = await chat({
+  adapter: openaiText('gpt-4o'),
+  messages: [{ role: 'user', content: 'hi' }],
+  middleware: [otel],
+  stream: false,
+})
+```
+
+## What gets emitted
+
+### Spans
+
+```text
+chat gpt-4o              (root, kind: INTERNAL)
+├── chat gpt-4o #0       (iteration, kind: CLIENT)
+│   ├── execute_tool get_weather
+│   └── execute_tool get_time
+└── chat gpt-4o #1       (iteration, kind: CLIENT)
+```
+
+Iteration spans are numbered (`#0`, `#1`, ...) so distinct iterations of the same chat are easy to pick apart in trace viewers.
+
+### Attribute reference
+
+| Level | Attribute | Value |
+| --- | --- | --- |
+| root / iteration | `gen_ai.system` | `openai`, `anthropic`, ... |
+| iteration | `gen_ai.operation.name` | `chat` |
+| root / iteration | `gen_ai.request.model` | requested model |
+| iteration | `gen_ai.response.model` | actual model |
+| iteration | `gen_ai.request.temperature` | from config |
+| iteration | `gen_ai.request.top_p` | from config |
+| iteration | `gen_ai.request.max_tokens` | from config |
+| iteration | `gen_ai.usage.input_tokens` | per iteration |
+| iteration | `gen_ai.usage.output_tokens` | per iteration |
+| iteration | `gen_ai.response.finish_reasons` | `[stop]`, `[tool_calls]`, ... |
+| root | `gen_ai.usage.input_tokens` | rolled up |
+| root | `gen_ai.usage.output_tokens` | rolled up |
+| root | `tanstack.ai.iterations` | iteration count |
+| tool | `gen_ai.tool.name` | tool name |
+| tool | `gen_ai.tool.call.id` | tool call id |
+| tool | `gen_ai.tool.type` | `function` |
+| tool | `tanstack.ai.tool.outcome` | `success` / `error` |
+
+### Metrics
+
+Two GenAI-standard histograms:
+
+- `gen_ai.client.operation.duration` (seconds) — recorded **once per `chat()` call**, covering all agent-loop iterations and tool execution. On error or abort the record carries an `error.type` attribute (the thrown error's `name`, or `"cancelled"` for aborts).
+- `gen_ai.client.token.usage` (tokens) — recorded **once per iteration** (two records: input and output), tagged with `gen_ai.token.type`.
+
+Both `gen_ai.response.id` and `gen_ai.response.model` are deliberately excluded from metric attributes to keep cardinality low (per-request custom-model names and request IDs would blow up the series set).
+
+## Privacy: capturing prompts and completions
+
+By default, only metadata lands on spans. To record prompt and completion content, set `captureContent: true`. Content is captured as OTel span events following the GenAI convention:
+
+- `gen_ai.user.message`, `gen_ai.system.message`, `gen_ai.assistant.message`, `gen_ai.tool.message`, `gen_ai.choice`
+
+Pass a `redact` function to strip PII before anything is recorded:
+
+```ts
+otelMiddleware({
+  tracer,
+  captureContent: true,
+  redact: (text) => text.replace(/\b\d{3}-\d{2}-\d{4}\b/g, '[SSN]'),
+})
+```
+
+If `redact` throws, the middleware writes the literal sentinel `"[redaction_failed]"` into the span event and logs a warning — it never falls back to the raw content. This is the load-bearing invariant for users who ship traces to third-party backends: a broken redactor should shut off capture, not leak prompts.
+
+Accumulated assistant text (the `gen_ai.choice` event) is capped at `maxContentLength` characters (default `100 000`); longer completions are truncated with a trailing `"…"` marker.
+
+Multimodal content (images, audio, video, documents) is represented as placeholder strings (`[image]`, `[audio]`, ...) to preserve message order without dumping binary data onto spans. Use `onSpanEnd` if you need richer multimodal capture.
+
+Prompt/system/user message events fire from `onConfig` at the start of every iteration, which means the full conversation history (as the adapter will re-send it) is re-emitted on each iteration span. This mirrors what the provider actually sees on the wire.
+
+## Extension points
+
+All four extensions are optional. Each wraps user code in try/catch — a thrown callback becomes a log line, never a broken chat.
+
+### `spanNameFormatter(info)`
+
+Override default span names. `info.kind` is `'chat' | 'iteration' | 'tool'`.
+
+```ts
+otelMiddleware({
+  tracer,
+  spanNameFormatter: (info) =>
+    info.kind === 'tool' ? `tool:${info.toolName}` : `chat:${info.ctx.model}`,
+})
+```
+
+### `attributeEnricher(info)`
+
+Add custom attributes to every span. Fires once per span.
+
+```ts
+otelMiddleware({
+  tracer,
+  attributeEnricher: () => ({
+    'tenant.id': getCurrentTenant(),
+  }),
+})
+```
+
+### `onBeforeSpanStart(info, options)`
+
+Mutate `SpanOptions` immediately before `tracer.startSpan(...)`. Useful for adding links, custom start times, or extra default attributes.
+
+### `onSpanEnd(info, span)`
+
+Fires just before every `span.end()`. Common uses: record custom events, emit per-tool metrics via your own `Meter`.
+
+```ts
+const toolDuration = meter.createHistogram('tool.duration')
+otelMiddleware({
+  tracer,
+  onSpanEnd: (info, span) => {
+    if (info.kind === 'tool') {
+      // span is still recording; read timestamps from your own store if needed
+      toolDuration.record(1, { 'tool.name': info.toolName })
+    }
+  },
+})
+```
+
+## Related
+
+- [Middleware](./middleware) — the lifecycle this middleware hooks into
+- [Debug Logging](./debug-logging) — quick console-output diagnostics, complementary to OTel
+- [Observability](./observability) — TanStack AI's built-in event client
diff --git a/docs/config.json b/docs/config.json
@@ -175,6 +175,10 @@
           "label": "Debug Logging",
           "to": "advanced/debug-logging"
         },
+        {
+          "label": "OpenTelemetry",
+          "to": "advanced/otel"
+        },
         {
           "label": "Observability",
           "to": "advanced/observability"

diff --git a/knip.json b/knip.json
@@ -22,6 +22,9 @@
     "packages/react-ai": {
       "ignore": []
     },
+    "packages/typescript/ai": {
+      "ignoreDependencies": ["@opentelemetry/api"]
+    },
     "packages/typescript/ai-anthropic": {
       "ignore": ["src/tools/**"]
     },

diff --git a/packages/typescript/ai/package.json b/packages/typescript/ai/package.json
@@ -25,6 +25,10 @@
       "types": "./dist/esm/middlewares/index.d.ts",
       "import": "./dist/esm/middlewares/index.js"
     },
+    "./middlewares/otel": {
+      "types": "./dist/esm/middlewares/otel.d.ts",
+      "import": "./dist/esm/middlewares/otel.js"
+    },
     "./adapter-internals": {
       "types": "./dist/esm/adapter-internals.d.ts",
       "import": "./dist/esm/adapter-internals.js"
@@ -65,7 +69,16 @@
     "@tanstack/ai-event-client": "workspace:*",
     "partial-json": "^0.1.7"
   },
+  "peerDependencies": {
+    "@opentelemetry/api": ">=1.9.0"
+  },
+  "peerDependenciesMeta": {
+    "@opentelemetry/api": {
+      "optional": true
+    }
+  },
   "devDependencies": {
+    "@opentelemetry/api": "^1.9.0",
     "@standard-schema/spec": "^1.1.0",
     "@vitest/coverage-v8": "4.0.14",
     "zod": "^4.2.0"

diff --git a/packages/typescript/ai/src/middlewares/index.ts b/packages/typescript/ai/src/middlewares/index.ts
@@ -11,3 +11,8 @@ export {
   type ContentGuardRule,
   type ContentFilteredInfo,
 } from './content-guard'
+
+// otelMiddleware is exported from the dedicated subpath
+// `@tanstack/ai/middlewares/otel` so that importing the main middlewares barrel
+// does not eagerly require `@opentelemetry/api` (which is an optional peer
+// dependency).