feat(xai): update model YAMLs [bot] by models-bot[bot] · Pull Request #1201 · truefoundry/models

models-bot · 2026-05-30T02:39:15Z

Auto-generated by poc-agent for provider xai.

Note

Low Risk
Declarative provider catalog changes only; no runtime code paths, though cost estimates may shift where tiered pricing and new flags are applied.

Overview
Auto-generated updates to several xAI Grok model YAML entries so routing, limits, and billing metadata match current provider docs.

Grok 3 Mini variants (grok-3-mini-fast-high, grok-3-mini-high-beta, grok-3-mini-high) now declare a 1M token context_window, provisioning: serverless, supportedModes: [chat], and thinking: true. grok-3-mini-high additionally marks prompt_caching, adds tiered pricing above 200k tokens for cache read / input / output, and drops its sources block.

grok-4.3 gains prompt_caching in features; other fields in the diff are unchanged.

^{Reviewed by Cursor Bugbot for commit 3ef7309. Bugbot is set up for automated code reviews on this repo. Configure here.}

github-actions · 2026-05-30T02:39:22Z

/test-models

harshiv-26 · 2026-05-30T02:40:51Z

Gateway test results

Total: 52
Passed: 46
Failed: 0
Validation failed: 6
Errored: 0
Skipped: 0
Success rate: 88.46%

Provider	Model	Scenarios
`xai`	`grok-3-mini-fast-high`	success: tool-call, tool-call:stream, structured-output:stream:boto3:adapter, tool-call:stream:boto3:adapter, params:stream:boto3:adapter, params, structured-output:stream, params:stream, tool-call:boto3:adapter, params:boto3:adapter, structured-output, structured-output:boto3:adapter, reasoning, reasoning:stream validation_failure: reasoning:boto3:adapter, reasoning:stream:boto3:adapter
`xai`	`grok-3-mini-high`	success: tool-call:boto3:adapter, tool-call:stream:boto3:adapter, params:stream:boto3:adapter, structured-output:stream:boto3:adapter, params:boto3:adapter, params, params:stream, structured-output:boto3:adapter, tool-call:stream, structured-output, tool-call, structured-output:stream, reasoning:stream, reasoning validation_failure: reasoning:stream:boto3:adapter, reasoning:boto3:adapter
`xai`	`grok-3-mini-high-beta`	success: params:stream:boto3:adapter, params:boto3:adapter, tool-call:boto3:adapter, params:stream, params, structured-output:stream:boto3:adapter, tool-call, structured-output:boto3:adapter, tool-call:stream:boto3:adapter, tool-call:stream, structured-output:stream, structured-output, reasoning:stream, reasoning validation_failure: reasoning:boto3:adapter, reasoning:stream:boto3:adapter
`xai`	`grok-4.3`	success: params:stream:boto3:adapter, params:boto3:adapter, params:stream, params

Failures (6)

xai/grok-3-mini-high — reasoning:stream:boto3:adapter (validation_failure)

Error

Traceback (most recent call last):
  File "/tmp/tmpr2m2yx2i/snippet.py", line 67, in <module>
    raise Exception("VALIDATION FAILED: reasoning stream - no reasoning information in Bedrock stream")
Exception: VALIDATION FAILED: reasoning stream - no reasoning information in Bedrock stream

Code snippet

import boto3
from botocore.config import Config

_endpoint = "https://internal.devtest.truefoundry.tech/api/llm"
_api_key = "***"
_model = "test-v2-xai/grok-3-mini-high"

client = boto3.client(
    "bedrock-runtime",
    region_name="us-east-1",
    endpoint_url=_endpoint,
    aws_access_key_id="dummy",
    aws_secret_access_key="dummy",
    config=Config(inject_host_prefix=False),
)

def _add_auth_header(request, **kwargs):
    request.headers["x-tfy-api-key"] = _api_key

client.meta.events.register("before-sign.bedrock-runtime.*", _add_auth_header)

messages = [
    {"role": "user", "content": [{"text": "Hi"}]},
    {"role": "assistant", "content": [{"text": "Hi, how can I help you"}]},
    {"role": "user", "content": [{"text": "How to calculate 3^3^3^3? Think step by step and show all reasoning."}]},
]

system = [{"text": "You are a helpful assistant. You MUST think step by step and show your reasoning. Never skip reasoning steps."}]

response = client.converse_stream(
    modelId=_model,
    system=system,
    messages=messages,
)

_events = []
for _event in response["stream"]:
    _events.append(_event)
    if "contentBlockDelta" in _event:
        _delta = _event["contentBlockDelta"].get("delta", {})
        if "reasoningContent" in _delta:
            print(_delta["reasoningContent"].get("text", ""), end="", flush=True)
        if "text" in _delta:
            print(_delta["text"], end="", flush=True)

_reasoning_detected = False
for _event in _events:
    if "contentBlockDelta" in _event:
        _delta = _event["contentBlockDelta"].get("delta", {})
        if "text" in _delta:
            print(_delta["text"], end="", flush=True)
        if "reasoningContent" in _delta:
            _reasoning_detected = True
            _reasoning = _delta["reasoningContent"]
            if "text" in _reasoning:
                print(_reasoning["text"], end="", flush=True)
    if "contentBlockStart" in _event:
        _start = _event["contentBlockStart"].get("start", {})
        if "reasoningContent" in _start:
            _reasoning_detected = True
    if "metadata" in _event:
        _usage = _event["metadata"].get("usage", {})
        if _usage.get("reasoning_tokens") or _usage.get("reasoningTokens"):
            _reasoning_detected = True

if not _reasoning_detected:
    raise Exception("VALIDATION FAILED: reasoning stream - no reasoning information in Bedrock stream")
print("\nVALIDATION: reasoning stream SUCCESS")

Output

Exponentiation is evaluated right-to-associatively (from the top of the tower downward). Thus \(3^3^3^3\) is interpreted as \(3^{(3^{(3^3)})}\).

- Innermost: \(3^3 = 27\)
- Next: \(3^{27} = 7{,}625{,}597{,}484{,}987\)
- Outermost: \(3^{7{,}625{,}597{,}484{,}987}\)

(The exact decimal representation of the final result is an impractically large integer with roughly 3.6 billion digits; it is conventionally left in power-tower form or expressed via logarithms when a numeric approximation is needed.)

Final answer: 3^(3^(3^3)) = 3^7625597484987 (huge)Exponentiation is evaluated right-to-associatively (from the top of the tower downward). Thus \(3^3^3^3\) is interpreted as \(3^{(3^{(3^3)})}\).

- Innermost: \(3^3 = 27\)
- Next: \(3^{27} = 7{,}625{,}597{,}484{,}987\)
- Outermost: \(3^{7{,}625{,}597{,}484{,}987}\)

(The exact decimal representation of the final result is an impractically large integer with roughly 3.6 billion digits; it is conventionally left in power-tower form or expressed via logarithms when a numeric approximation is needed.)

Final answer: 3^(3^(3^3)) = 3^7625597484987 (huge)

xai/grok-3-mini-high — reasoning:boto3:adapter (validation_failure)

Error

Traceback (most recent call last):
  File "/tmp/tmpshy5dx2p/snippet.py", line 61, in <module>
    raise Exception("VALIDATION FAILED: reasoning - no reasoning information in Bedrock response")
Exception: VALIDATION FAILED: reasoning - no reasoning information in Bedrock response

Code snippet

import boto3
from botocore.config import Config

_endpoint = "https://internal.devtest.truefoundry.tech/api/llm"
_api_key = "***"
_model = "test-v2-xai/grok-3-mini-high"

client = boto3.client(
    "bedrock-runtime",
    region_name="us-east-1",
    endpoint_url=_endpoint,
    aws_access_key_id="dummy",
    aws_secret_access_key="dummy",
    config=Config(inject_host_prefix=False),
)

def _add_auth_header(request, **kwargs):
    request.headers["x-tfy-api-key"] = _api_key

client.meta.events.register("before-sign.bedrock-runtime.*", _add_auth_header)

messages = [
    {"role": "user", "content": [{"text": "Hi"}]},
    {"role": "assistant", "content": [{"text": "Hi, how can I help you"}]},
    {"role": "user", "content": [{"text": "How to calculate 3^3^3^3? Think step by step and show all reasoning."}]},
]

system = [{"text": "You are a helpful assistant. You MUST think step by step and show your reasoning. Never skip reasoning steps."}]

response = client.converse(
    modelId=_model,
    system=system,
    messages=messages,
)

_content = response["output"]["message"]["content"]
for _block in _content:
    if "reasoningContent" in _block:
        print(_block["reasoningContent"]["reasoningText"]["text"])
    if "text" in _block:
        print(_block["text"])

_content = response["output"]["message"]["content"]
_reasoning_detected = False

for _block in _content:
    if "text" in _block:
        print(_block["text"])
    if "reasoningContent" in _block:
        _reasoning_detected = True
        _reasoning = _block["reasoningContent"]
        if "reasoningText" in _reasoning:
            print(f"Reasoning: {_reasoning['reasoningText']['text'][:200]}...")

_usage = response.get("usage", {})
if _usage.get("reasoning_tokens") or _usage.get("reasoningTokens"):
    _reasoning_detected = True

if not _reasoning_detected:
    print("Response: ", response)
    raise Exception("VALIDATION FAILED: reasoning - no reasoning information in Bedrock response")
print("VALIDATION: reasoning SUCCESS")

Output

**Step-by-step reasoning for evaluating 3^3^3^3:**

1. **Operator associativity**: In standard mathematical notation, the exponentiation operator `^` (or `**`) is **right-associative**. This means an expression like `a^b^c^d` is parsed as `a^(b^(c^d))`, not as `((a^b)^c)^d` or any left-to-right grouping.

2. **Apply the associativity to the expression**:
   - 3^3^3^3 is therefore interpreted as **3^(3^(3^3))** (a power tower evaluated from the top down).

3. **Evaluate innermost first**:
   - Compute the top exponent: 3^3 = 27.

4. **Next level**:
   - Now evaluate 3^27.
   - 3^27 = 7,625,597,484,987.

5. **Outermost level**:
   - The full value is therefore 3 raised to that result: **3^7,625,597,484,987**.

6. **Practical computation note**:
   - This is an extremely large integer (approximately 3.6 × 10^3.6 trillion). Its decimal representation has roughly 3.638 trillion digits, so it cannot be written out in full here.
   - In practice you would leave the answer in the compact form 3^(3^(3^3)) or 3^7625597484987, or use arbitrary-precision arithmetic libraries (e.g., Python’s `pow(3, 7625597484987)`) if you need the number modulo some value or its digit count.

**Final evaluated expression**: 3^(3^(3^3)) = 3^7625597484987.
**Step-by-step reasoning for evaluating 3^3^3^3:**

1. **Operator associativity**: In standard mathematical notation, the exponentiation operator `^` (or `**`) is **right-associative**. This means an expression like `a^b^c^d` is parsed as `a^(b^(c^d))`, not as `((a^b)^c)^d` or any left-to-right grouping.

2. **Apply the associativity to the expression**:
   - 3^3^3^3 is therefore interpreted as **3^(3^(3^3))** (a power tower evaluated from the top down).

3. **Evaluate innermost first**:
   - Compute the top exponent: 3^3 = 27.

4. **Next level**:
   - Now evaluate 3^27.
   - 3^27 = 7,625,597,484,987.

5. **Outermost level**:
   - The full value is therefore 3 raised to that result: **3^7,625,597,484,987**.

6. **Practical computation note**:
   - This is an extremely large integer (approximately 3.6 × 10^3.6 trillion). Its decimal representation has roughly 3.638 trillion digits, so it cannot be written out in full here.
   - In practice you would leave the answer in the compact form 3^(3^(3^3)) or 3^7625597484987, or use arbitrary-precision arithmetic libraries (e.g., Python’s `pow(3, 7625597484987)`) if you need the number modulo some value or its digit count.

**Final evaluated expression**: 3^(3^(3^3)) = 3^7625597484987.
Response:  {'ResponseMetadata': {'HTTPStatusCode': 200, 'HTTPHeaders': {'access-control-allow-credentials': 'true', 'access-control-allow-origin': '*', 'access-control-expose-headers': 'Server-Timing,Openai-Processing-Ms,x-tfy-cache-status,x-tfy-cache-similarity-score,x-tfy-cached-trace-id,WWW-Authenticate', 'cf-cache-status': 'DYNAMIC', 'cf-ray': 'a03a5feaa8625d13-IAD', 'content-length': '1479', 'content-type': 'application/json', 'date': 'Sat, 30 May 2026 02:40:40 GMT', 'server': 'cloudflare', 'server-timing': 'root-span-handling;dur=0.0,auth;dur=5.1,validate-headers;dur=0.1,model-call;dur=8053.4,logging;dur=0.7,total;dur=8060.0;desc="Total Response Time"', 'set-cookie': '__cf_bm=ihlWEG1MHoIQhLodjfYo9kgPFZ3huixuBTEzprEjp9E-1780108832.4303334-1.0.1.1-ixgS63bDfU549a3Rh3IaTMeyfWtdSnV1jtMdG2w3ABtYXQfRQiNTY6FeSNul.u84d06QoaUc9BXXPDZhujrCSkmo6d3FqoyTZc8CarhzPCT.BCMyh.fW6oG_557MlqVc; HttpOnly; SameSite=None; Secure; Path=/; Domain=x.ai; Expires=Sat, 30 May 2026 03:10:40 GMT', 'strict-transport-security': 'max-age=31536000', 'traceparent': '00-019e76c12e8c737cbedb43d03a62f5e8-dcdacf910001d32f-01', 'vary': 'origin, access-control-request-method, access-control-request-headers', 'via': '1.1 Caddy', 'x-metrics-e2e-ms': '8004.5', 'x-metrics-mean-itl-ms': '9.2', 'x-metrics-ttft-ms': '170.7', 'x-ratelimit-limit-requests': '2400', 'x-ratelimit-limit-tokens': '15000000', 'x-ratelimit-remaining-requests': '2400', 'x-ratelimit-remaining-tokens': '15000000', 'x-request-id': '7ee4e2cb-9d5f-900c-ae7b-45d0d967f56b', 'x-tfy-feedback-target-id': 'eyJkYXRhUm91dGluZ0Rlc3RpbmF0aW9uIjoiZGVmYXVsdCIsInRyYWNlSWQiOiIwMTllNzZjMTJlOGM3MzdjYmVkYjQzZDAzYTYyZjVlOCIsInNwYW5JZCI6ImIwZGRjOTcyNmE4NjI3ZGQifQ==', 'x-zero-data-retention': 'false', 'x-envoy-upstream-service-time': '8063'}, 'RetryAttempts': 0}, 'output': {'message': {'role': 'assistant', 'content': [{'text': '**Step-by-step reasoning for evaluating 3^3^3^3:**\n\n1. **Operator associativity**: In standard mathematical notation, the exponentiation operator `^` (or `**`) is **right-associative**. This means an expression like `a^b^c^d` is parsed as `a^(b^(c^d))`, not as `((a^b)^c)^d` or any left-to-right grouping.\n\n2. **Apply the associativity to the expression**:\n   - 3^3^3^3 is therefore interpreted as **3^(3^(3^3))** (a power tower evaluated from the top down).\n\n3. **Evaluate innermost first**:\n   - Compute the top exponent: 3^3 = 27.\n\n4. **Next level**:\n   - Now evaluate 3^27.\n   - 3^27 = 7,625,597,484,987.\n\n5. **Outermost level**:\n   - The full value is therefore 3 raised to that result: **3^7,625,597,484,987**.\n\n6. **Practical computation note**:\n   - This is an extremely large integer (approximately 3.6 × 10^3.6 trillion). Its decimal representation has roughly 3.638 trillion digits, so it cannot be written out in full here.\n   - In practice you would leave the answer in the compact form 3^(3^(3^3)) or 3^7625597484987, or use arbitrary-precision arithmetic libraries (e.g., Python’s `pow(3, 7625597484987)`) if you need the number modulo some value or its digit count.\n\n**Final evaluated expression**: 3^(3^(3^3)) = 3^7625597484987.'}]}}, 'stopReason': 'end_turn', 'usage': {'inputTokens': 192, 'outputTokens': 371, 'totalTokens': 1034, 'cacheReadInputTokens': 128}, 'metrics': {'latencyMs': 0}}

xai/grok-3-mini-fast-high — reasoning:boto3:adapter (validation_failure)

Error

Traceback (most recent call last):
  File "/tmp/tmpxd19a309/snippet.py", line 61, in <module>
    raise Exception("VALIDATION FAILED: reasoning - no reasoning information in Bedrock response")
Exception: VALIDATION FAILED: reasoning - no reasoning information in Bedrock response

Code snippet

import boto3
from botocore.config import Config

_endpoint = "https://internal.devtest.truefoundry.tech/api/llm"
_api_key = "***"
_model = "test-v2-xai/grok-3-mini-fast-high"

client = boto3.client(
    "bedrock-runtime",
    region_name="us-east-1",
    endpoint_url=_endpoint,
    aws_access_key_id="dummy",
    aws_secret_access_key="dummy",
    config=Config(inject_host_prefix=False),
)

def _add_auth_header(request, **kwargs):
    request.headers["x-tfy-api-key"] = _api_key

client.meta.events.register("before-sign.bedrock-runtime.*", _add_auth_header)

messages = [
    {"role": "user", "content": [{"text": "Hi"}]},
    {"role": "assistant", "content": [{"text": "Hi, how can I help you"}]},
    {"role": "user", "content": [{"text": "How to calculate 3^3^3^3? Think step by step and show all reasoning."}]},
]

system = [{"text": "You are a helpful assistant. You MUST think step by step and show your reasoning. Never skip reasoning steps."}]

response = client.converse(
    modelId=_model,
    system=system,
    messages=messages,
)

_content = response["output"]["message"]["content"]
for _block in _content:
    if "reasoningContent" in _block:
        print(_block["reasoningContent"]["reasoningText"]["text"])
    if "text" in _block:
        print(_block["text"])

_content = response["output"]["message"]["content"]
_reasoning_detected = False

for _block in _content:
    if "text" in _block:
        print(_block["text"])
    if "reasoningContent" in _block:
        _reasoning_detected = True
        _reasoning = _block["reasoningContent"]
        if "reasoningText" in _reasoning:
            print(f"Reasoning: {_reasoning['reasoningText']['text'][:200]}...")

_usage = response.get("usage", {})
if _usage.get("reasoning_tokens") or _usage.get("reasoningTokens"):
    _reasoning_detected = True

if not _reasoning_detected:
    print("Response: ", response)
    raise Exception("VALIDATION FAILED: reasoning - no reasoning information in Bedrock response")
print("VALIDATION: reasoning SUCCESS")

Output

Exponentiation is **right-associative**, so the expression `3^3^3^3` is interpreted as a power tower evaluated from the top down:

\[
3^{3^{3^3}}
\]

**Step-by-step evaluation:**

1. Compute the top (rightmost) exponent first:  
   \(3^3 = 27\)

2. Substitute back:  
   \(3^{3^{27}}\)  
   (i.e., \(3^{(3^{27})}\))

3. Compute the next exponent:  
   \(3^{27} = 7{,}625{,}597{,}484{,}987\)

4. Substitute back:  
   \(3^{7{,}625{,}597{,}484{,}987}\)

This final number is astronomically large (more than 3.6 trillion digits) and cannot be written out explicitly. In practice you would leave it in the compact form \(3^{3^{27}}\) or use specialized software for modular arithmetic, logarithms to find digit count, etc., if only partial information is needed.

(Note: If the expression were instead evaluated strictly left-to-right as \(((3^3)^3)^3\), the result would be only 7625597484987, but that is not the conventional mathematical reading.)
Exponentiation is **right-associative**, so the expression `3^3^3^3` is interpreted as a power tower evaluated from the top down:

\[
3^{3^{3^3}}
\]

**Step-by-step evaluation:**

1. Compute the top (rightmost) exponent first:  
   \(3^3 = 27\)

2. Substitute back:  
   \(3^{3^{27}}\)  
   (i.e., \(3^{(3^{27})}\))

3. Compute the next exponent:  
   \(3^{27} = 7{,}625{,}597{,}484{,}987\)

4. Substitute back:  
   \(3^{7{,}625{,}597{,}484{,}987}\)

This final number is astronomically large (more than 3.6 trillion digits) and cannot be written out explicitly. In practice you would leave it in the compact form \(3^{3^{27}}\) or use specialized software for modular arithmetic, logarithms to find digit count, etc., if only partial information is needed.

(Note: If the expression were instead evaluated strictly left-to-right as \(((3^3)^3)^3\), the result would be only 7625597484987, but that is not the conventional mathematical reading.)
Response:  {'ResponseMetadata': {'HTTPStatusCode': 200, 'HTTPHeaders': {'access-control-allow-credentials': 'true', 'access-control-allow-origin': '*', 'access-control-expose-headers': 'Server-Timing,Openai-Processing-Ms,x-tfy-cache-status,x-tfy-cache-similarity-score,x-tfy-cached-trace-id,WWW-Authenticate', 'cf-cache-status': 'DYNAMIC', 'cf-ray': 'a03a5fdb5f28d707-IAD', 'content-length': '1194', 'content-type': 'application/json', 'date': 'Sat, 30 May 2026 02:40:37 GMT', 'server': 'cloudflare', 'server-timing': 'root-span-handling;dur=0.0,auth;dur=5.3,validate-headers;dur=0.1,model-call;dur=8167.6,logging;dur=0.5,total;dur=8174.5;desc="Total Response Time"', 'set-cookie': '__cf_bm=NvOHpigSk_ol8_iwwSnWd6B15.TpU1zb_DKGrxw37RU-1780108829.979692-1.0.1.1-8aPZg.IByNOy6Ui5tz1KTSU1FfKCoumb8vmtkgc6DPC.uVTStX62c9CFJ_9_OJI5ixcHYuh6_wfZ6micVlIeTCNdEA6LJhc39KZMHN4dmm6py8qLhLYWRsdGiuutyzw4; HttpOnly; SameSite=None; Secure; Path=/; Domain=x.ai; Expires=Sat, 30 May 2026 03:10:37 GMT', 'strict-transport-security': 'max-age=31536000', 'traceparent': '00-019e76c1250a7209b9946ccfad98e3cd-8e8a64190001b953-01', 'vary': 'origin, access-control-request-method, access-control-request-headers', 'via': '1.1 Caddy', 'x-metrics-e2e-ms': '7918.5', 'x-metrics-mean-itl-ms': '9.5', 'x-metrics-ttft-ms': '171.5', 'x-ratelimit-limit-requests': '2400', 'x-ratelimit-limit-tokens': '15000000', 'x-ratelimit-remaining-requests': '2400', 'x-ratelimit-remaining-tokens': '15000000', 'x-request-id': 'ed573626-d8eb-9170-9435-474ac4f7e1d0', 'x-tfy-feedback-target-id': 'eyJkYXRhUm91dGluZ0Rlc3RpbmF0aW9uIjoiZGVmYXVsdCIsInRyYWNlSWQiOiIwMTllNzZjMTI1MGE3MjA5Yjk5NDZjY2ZhZDk4ZTNjZCIsInNwYW5JZCI6IjM4ZTdiNDkyMzZhZTk3ZDIifQ==', 'x-zero-data-retention': 'false', 'x-envoy-upstream-service-time': '8178'}, 'RetryAttempts': 0}, 'output': {'message': {'role': 'assistant', 'content': [{'text': 'Exponentiation is **right-associative**, so the expression `3^3^3^3` is interpreted as a power tower evaluated from the top down:\n\n\\[\n3^{3^{3^3}}\n\\]\n\n**Step-by-step evaluation:**\n\n1. Compute the top (rightmost) exponent first:  \n   \\(3^3 = 27\\)\n\n2. Substitute back:  \n   \\(3^{3^{27}}\\)  \n   (i.e., \\(3^{(3^{27})}\\))\n\n3. Compute the next exponent:  \n   \\(3^{27} = 7{,}625{,}597{,}484{,}987\\)\n\n4. Substitute back:  \n   \\(3^{7{,}625{,}597{,}484{,}987}\\)\n\nThis final number is astronomically large (more than 3.6 trillion digits) and cannot be written out explicitly. In practice you would leave it in the compact form \\(3^{3^{27}}\\) or use specialized software for modular arithmetic, logarithms to find digit count, etc., if only partial information is needed.\n\n(Note: If the expression were instead evaluated strictly left-to-right as \\(((3^3)^3)^3\\), the result would be only 7625597484987, but that is not the conventional mathematical reading.)'}]}}, 'stopReason': 'end_turn', 'usage': {'inputTokens': 192, 'outputTokens': 267, 'totalTokens': 993, 'cacheReadInputTokens': 128}, 'metrics': {'latencyMs': 0}}

xai/grok-3-mini-fast-high — reasoning:stream:boto3:adapter (validation_failure)

Error

Traceback (most recent call last):
  File "/tmp/tmp4gscj41h/snippet.py", line 67, in <module>
    raise Exception("VALIDATION FAILED: reasoning stream - no reasoning information in Bedrock stream")
Exception: VALIDATION FAILED: reasoning stream - no reasoning information in Bedrock stream

Code snippet

import boto3
from botocore.config import Config

_endpoint = "https://internal.devtest.truefoundry.tech/api/llm"
_api_key = "***"
_model = "test-v2-xai/grok-3-mini-fast-high"

client = boto3.client(
    "bedrock-runtime",
    region_name="us-east-1",
    endpoint_url=_endpoint,
    aws_access_key_id="dummy",
    aws_secret_access_key="dummy",
    config=Config(inject_host_prefix=False),
)

def _add_auth_header(request, **kwargs):
    request.headers["x-tfy-api-key"] = _api_key

client.meta.events.register("before-sign.bedrock-runtime.*", _add_auth_header)

messages = [
    {"role": "user", "content": [{"text": "Hi"}]},
    {"role": "assistant", "content": [{"text": "Hi, how can I help you"}]},
    {"role": "user", "content": [{"text": "How to calculate 3^3^3^3? Think step by step and show all reasoning."}]},
]

system = [{"text": "You are a helpful assistant. You MUST think step by step and show your reasoning. Never skip reasoning steps."}]

response = client.converse_stream(
    modelId=_model,
    system=system,
    messages=messages,
)

_events = []
for _event in response["stream"]:
    _events.append(_event)
    if "contentBlockDelta" in _event:
        _delta = _event["contentBlockDelta"].get("delta", {})
        if "reasoningContent" in _delta:
            print(_delta["reasoningContent"].get("text", ""), end="", flush=True)
        if "text" in _delta:
            print(_delta["text"], end="", flush=True)

_reasoning_detected = False
for _event in _events:
    if "contentBlockDelta" in _event:
        _delta = _event["contentBlockDelta"].get("delta", {})
        if "text" in _delta:
            print(_delta["text"], end="", flush=True)
        if "reasoningContent" in _delta:
            _reasoning_detected = True
            _reasoning = _delta["reasoningContent"]
            if "text" in _reasoning:
                print(_reasoning["text"], end="", flush=True)
    if "contentBlockStart" in _event:
        _start = _event["contentBlockStart"].get("start", {})
        if "reasoningContent" in _start:
            _reasoning_detected = True
    if "metadata" in _event:
        _usage = _event["metadata"].get("usage", {})
        if _usage.get("reasoning_tokens") or _usage.get("reasoningTokens"):
            _reasoning_detected = True

if not _reasoning_detected:
    raise Exception("VALIDATION FAILED: reasoning stream - no reasoning information in Bedrock stream")
print("\nVALIDATION: reasoning stream SUCCESS")

Output

**Step-by-step evaluation of 3^3^3^3**

Exponentiation is **right-associative**, so the expression is parsed from the top (right) down:

\[
3^{3^{3^3}}
\]

1. **Innermost exponent**  
   \(3^3 = 27\)

2. **Next level**  
   \(3^{27} = 7\,625\,597\,484\,987\)

3. **Outermost exponent**  
   \(3^{7\,625\,597\,484\,987}\)

This is a power tower of height 4, often written \(3 \uparrow\uparrow 4\) in Knuth’s up-arrow notation. The result is an integer with **approximately 3.6 trillion digits** (more precisely, \(\lfloor 7\,625\,597\,484\,987 \cdot \log_{10} 3 \rfloor + 1\) digits). No practical manual calculation or even standard computer output can display the full decimal representation.

**Summary**  
- Value = \(3^{3^{27}} = 3^{7\,625\,597\,484\,987}\)  
- Exact decimal expansion is astronomically large and not feasible to write out by hand.  
- For most purposes, the answer is left in the compact form \(3^{3^{3^3}}\) or described by its magnitude (\(\approx 10^{3.638 \times 10^{12}}\)).

If you need the number modulo some integer, its number of digits, or a scientific-notation approximation, let me know!**Step-by-step evaluation of 3^3^3^3**

Exponentiation is **right-associative**, so the expression is parsed from the top (right) down:

\[
3^{3^{3^3}}
\]

1. **Innermost exponent**  
   \(3^3 = 27\)

2. **Next level**  
   \(3^{27} = 7\,625\,597\,484\,987\)

3. **Outermost exponent**  
   \(3^{7\,625\,597\,484\,987}\)

This is a power tower of height 4, often written \(3 \uparrow\uparrow 4\) in Knuth’s up-arrow notation. The result is an integer with **approximately 3.6 trillion digits** (more precisely, \(\lfloor 7\,625\,597\,484\,987 \cdot \log_{10} 3 \rfloor + 1\) digits). No practical manual calculation or even standard computer output can display the full decimal representation.

**Summary**  
- Value = \(3^{3^{27}} = 3^{7\,625\,597\,484\,987}\)  
- Exact decimal expansion is astronomically large and not feasible to write out by hand.  
- For most purposes, the answer is left in the compact form \(3^{3^{3^3}}\) or described by its magnitude (\(\approx 10^{3.638 \times 10^{12}}\)).

If you need the number modulo some integer, its number of digits, or a scientific-notation approximation, let me know!

xai/grok-3-mini-high-beta — reasoning:boto3:adapter (validation_failure)

Error

Traceback (most recent call last):
  File "/tmp/tmphne1y9f5/snippet.py", line 61, in <module>
    raise Exception("VALIDATION FAILED: reasoning - no reasoning information in Bedrock response")
Exception: VALIDATION FAILED: reasoning - no reasoning information in Bedrock response

Code snippet

import boto3
from botocore.config import Config

_endpoint = "https://internal.devtest.truefoundry.tech/api/llm"
_api_key = "***"
_model = "test-v2-xai/grok-3-mini-high-beta"

client = boto3.client(
    "bedrock-runtime",
    region_name="us-east-1",
    endpoint_url=_endpoint,
    aws_access_key_id="dummy",
    aws_secret_access_key="dummy",
    config=Config(inject_host_prefix=False),
)

def _add_auth_header(request, **kwargs):
    request.headers["x-tfy-api-key"] = _api_key

client.meta.events.register("before-sign.bedrock-runtime.*", _add_auth_header)

messages = [
    {"role": "user", "content": [{"text": "Hi"}]},
    {"role": "assistant", "content": [{"text": "Hi, how can I help you"}]},
    {"role": "user", "content": [{"text": "How to calculate 3^3^3^3? Think step by step and show all reasoning."}]},
]

system = [{"text": "You are a helpful assistant. You MUST think step by step and show your reasoning. Never skip reasoning steps."}]

response = client.converse(
    modelId=_model,
    system=system,
    messages=messages,
)

_content = response["output"]["message"]["content"]
for _block in _content:
    if "reasoningContent" in _block:
        print(_block["reasoningContent"]["reasoningText"]["text"])
    if "text" in _block:
        print(_block["text"])

_content = response["output"]["message"]["content"]
_reasoning_detected = False

for _block in _content:
    if "text" in _block:
        print(_block["text"])
    if "reasoningContent" in _block:
        _reasoning_detected = True
        _reasoning = _block["reasoningContent"]
        if "reasoningText" in _reasoning:
            print(f"Reasoning: {_reasoning['reasoningText']['text'][:200]}...")

_usage = response.get("usage", {})
if _usage.get("reasoning_tokens") or _usage.get("reasoningTokens"):
    _reasoning_detected = True

if not _reasoning_detected:
    print("Response: ", response)
    raise Exception("VALIDATION FAILED: reasoning - no reasoning information in Bedrock response")
print("VALIDATION: reasoning SUCCESS")

Output

**Step-by-step calculation of 3^3^3^3**

Exponentiation is **right-associative**, meaning expressions are evaluated from the top (right) down.  
Thus:

3^3^3^3 = 3^(3^(3^3))

Now evaluate from the inside out:

1. Innermost exponent:  
   3^3 = 27

2. Next level:  
   3^27  
   This equals **7,625,597,484,987**

3. Outermost exponent:  
   3^(7,625,597,484,987)

The result is the enormous integer  
**3^7,625,597,484,987**

(This number has roughly 3.6 trillion digits and cannot be written out explicitly.)

**Final answer**  
3^7625597484987
**Step-by-step calculation of 3^3^3^3**

Exponentiation is **right-associative**, meaning expressions are evaluated from the top (right) down.  
Thus:

3^3^3^3 = 3^(3^(3^3))

Now evaluate from the inside out:

1. Innermost exponent:  
   3^3 = 27

2. Next level:  
   3^27  
   This equals **7,625,597,484,987**

3. Outermost exponent:  
   3^(7,625,597,484,987)

The result is the enormous integer  
**3^7,625,597,484,987**

(This number has roughly 3.6 trillion digits and cannot be written out explicitly.)

**Final answer**  
3^7625597484987
Response:  {'ResponseMetadata': {'HTTPStatusCode': 200, 'HTTPHeaders': {'access-control-allow-credentials': 'true', 'access-control-allow-origin': '*', 'access-control-expose-headers': 'Server-Timing,Openai-Processing-Ms,x-tfy-cache-status,x-tfy-cache-similarity-score,x-tfy-cached-trace-id,WWW-Authenticate', 'cf-cache-status': 'DYNAMIC', 'cf-ray': 'a03a5ff2dd4bf26e-IAD', 'content-length': '779', 'content-type': 'application/json', 'date': 'Sat, 30 May 2026 02:40:39 GMT', 'server': 'cloudflare', 'server-timing': 'root-span-handling;dur=0.0,auth;dur=3.3,validate-headers;dur=0.1,model-call;dur=5400.1,logging;dur=0.5,total;dur=5405.1;desc="Total Response Time"', 'set-cookie': '__cf_bm=4LeN4tdb_BrLQW6PZEjPnvBWzmlihI9t3qVu0CdeA18-1780108833.7363846-1.0.1.1-tynalVDUsAF7b7xECJuMqinIALfa9peYQJeocuh1nF4kyITLt61nrbqJ0tXsazfmiCpEnVZnxA8asLuL5ohHi4FqMhT4jy0YMR.S1k88VQD17THjWFDYLQ0UBhhcTQf9; HttpOnly; SameSite=None; Secure; Path=/; Domain=x.ai; Expires=Sat, 30 May 2026 03:10:39 GMT', 'strict-transport-security': 'max-age=31536000', 'traceparent': '00-019e76c133bc736d954549b21913f611-b67f567c00023fb7-01', 'vary': 'origin, access-control-request-method, access-control-request-headers', 'via': '1.1 Caddy', 'x-metrics-e2e-ms': '5366.3', 'x-metrics-mean-itl-ms': '8.8', 'x-metrics-ttft-ms': '251.9', 'x-ratelimit-limit-requests': '2400', 'x-ratelimit-limit-tokens': '15000000', 'x-ratelimit-remaining-requests': '2400', 'x-ratelimit-remaining-tokens': '15000000', 'x-request-id': '73e92ff2-09d5-9780-a957-dd054a81b416', 'x-tfy-feedback-target-id': 'eyJkYXRhUm91dGluZ0Rlc3RpbmF0aW9uIjoiZGVmYXVsdCIsInRyYWNlSWQiOiIwMTllNzZjMTMzYmM3MzZkOTU0NTQ5YjIxOTEzZjYxMSIsInNwYW5JZCI6ImMwYjk3OTdmZTBmYjkzYTIifQ==', 'x-zero-data-retention': 'false', 'x-envoy-upstream-service-time': '5409'}, 'RetryAttempts': 0}, 'output': {'message': {'role': 'assistant', 'content': [{'text': '**Step-by-step calculation of 3^3^3^3**\n\nExponentiation is **right-associative**, meaning expressions are evaluated from the top (right) down.  \nThus:\n\n3^3^3^3 = 3^(3^(3^3))\n\nNow evaluate from the inside out:\n\n1. Innermost exponent:  \n   3^3 = 27\n\n2. Next level:  \n   3^27  \n   This equals **7,625,597,484,987**\n\n3. Outermost exponent:  \n   3^(7,625,597,484,987)\n\nThe result is the enormous integer  \n**3^7,625,597,484,987**\n\n(This number has roughly 3.6 trillion digits and cannot be written out explicitly.)\n\n**Final answer**  \n3^7625597484987'}]}}, 'stopReason': 'end_turn', 'usage': {'inputTokens': 192, 'outputTokens': 174, 'totalTokens': 759, 'cacheReadInputTokens': 128}, 'metrics': {'latencyMs': 0}}

xai/grok-3-mini-high-beta — reasoning:stream:boto3:adapter (validation_failure)

Error

Traceback (most recent call last):
  File "/tmp/tmp6hq1w9m1/snippet.py", line 67, in <module>
    raise Exception("VALIDATION FAILED: reasoning stream - no reasoning information in Bedrock stream")
Exception: VALIDATION FAILED: reasoning stream - no reasoning information in Bedrock stream

Code snippet

import boto3
from botocore.config import Config

_endpoint = "https://internal.devtest.truefoundry.tech/api/llm"
_api_key = "***"
_model = "test-v2-xai/grok-3-mini-high-beta"

client = boto3.client(
    "bedrock-runtime",
    region_name="us-east-1",
    endpoint_url=_endpoint,
    aws_access_key_id="dummy",
    aws_secret_access_key="dummy",
    config=Config(inject_host_prefix=False),
)

def _add_auth_header(request, **kwargs):
    request.headers["x-tfy-api-key"] = _api_key

client.meta.events.register("before-sign.bedrock-runtime.*", _add_auth_header)

messages = [
    {"role": "user", "content": [{"text": "Hi"}]},
    {"role": "assistant", "content": [{"text": "Hi, how can I help you"}]},
    {"role": "user", "content": [{"text": "How to calculate 3^3^3^3? Think step by step and show all reasoning."}]},
]

system = [{"text": "You are a helpful assistant. You MUST think step by step and show your reasoning. Never skip reasoning steps."}]

response = client.converse_stream(
    modelId=_model,
    system=system,
    messages=messages,
)

_events = []
for _event in response["stream"]:
    _events.append(_event)
    if "contentBlockDelta" in _event:
        _delta = _event["contentBlockDelta"].get("delta", {})
        if "reasoningContent" in _delta:
            print(_delta["reasoningContent"].get("text", ""), end="", flush=True)
        if "text" in _delta:
            print(_delta["text"], end="", flush=True)

_reasoning_detected = False
for _event in _events:
    if "contentBlockDelta" in _event:
        _delta = _event["contentBlockDelta"].get("delta", {})
        if "text" in _delta:
            print(_delta["text"], end="", flush=True)
        if "reasoningContent" in _delta:
            _reasoning_detected = True
            _reasoning = _delta["reasoningContent"]
            if "text" in _reasoning:
                print(_reasoning["text"], end="", flush=True)
    if "contentBlockStart" in _event:
        _start = _event["contentBlockStart"].get("start", {})
        if "reasoningContent" in _start:
            _reasoning_detected = True
    if "metadata" in _event:
        _usage = _event["metadata"].get("usage", {})
        if _usage.get("reasoning_tokens") or _usage.get("reasoningTokens"):
            _reasoning_detected = True

if not _reasoning_detected:
    raise Exception("VALIDATION FAILED: reasoning stream - no reasoning information in Bedrock stream")
print("\nVALIDATION: reasoning stream SUCCESS")

Output

**3^3^3^3 is interpreted as the power tower 3^(3^(3^3)) because exponentiation is right-associative** (evaluated from the top/right downward).

Step-by-step evaluation:

1. Innermost (top of the tower): 3^3 = 27  
2. Next level: 3^27 = 7,625,597,484,987  
3. Outermost: 3^(3^27) = 3^7,625,597,484,987  

This is an extremely large number (approximately 10^(3.64 × 10^12), with trillions of digits), so it is normally left in the compact form 3^(3^(3^3)) rather than expanded.**3^3^3^3 is interpreted as the power tower 3^(3^(3^3)) because exponentiation is right-associative** (evaluated from the top/right downward).

Step-by-step evaluation:

1. Innermost (top of the tower): 3^3 = 27  
2. Next level: 3^27 = 7,625,597,484,987  
3. Outermost: 3^(3^27) = 3^7,625,597,484,987  

This is an extremely large number (approximately 10^(3.64 × 10^12), with trillions of digits), so it is normally left in the compact form 3^(3^(3^3)) rather than expanded.

Successes (46)

xai/grok-3-mini-high — tool-call:boto3:adapter (success)

Output

Tool: get_weather
Input: {'location': 'London'}
Tool: get_weather
Input: {'location': 'London'}
VALIDATION: tool-call SUCCESS

xai/grok-3-mini-high — tool-call:stream:boto3:adapter (success)

Output

Tool: get_weather
{"location":"London"}Tool: get_weather
{"location":"London"}
VALIDATION: tool-call stream SUCCESS

xai/grok-3-mini-high — params:stream:boto3:adapter (success)

Output

The capital of France is Paris.

xai/grok-3-mini-high — structured-output:stream:boto3:adapter (success)

Output

Tool: CalendarEvent
{"name":"science fair","date":"Friday","participants":["Alice","Bob"]}{
  "name": "science fair",
  "date": "Friday",
  "participa
... (truncated, 83 chars omitted)

xai/grok-3-mini-high — params:boto3:adapter (success)

Output

The capital of France is Paris.

xai/grok-3-mini-high — params (success)

Output

The capital of France is Paris.

xai/grok-3-mini-high — params:stream (success)

Output

The capital of France is Paris.

xai/grok-3-mini-high — structured-output:boto3:adapter (success)

Output

{
  "name": "science fair",
  "date": "Friday",
  "participants": [
    "Alice",
    "Bob"
  ]
}
{
  "name": "science fair",
  "date": "Friday",
  "pa
... (truncated, 82 chars omitted)

xai/grok-3-mini-high — tool-call:stream (success)

Output

{"location":"London"}
VALIDATION: tool-call stream SUCCESS

xai/grok-3-mini-high — structured-output (success)

Output

{
"name": "science fair",
"date": "Friday",
"participants": ["Alice", "Bob"]
}
VALIDATION: structured-output SUCCESS

xai/grok-3-mini-high — tool-call (success)

Output

Function: get_weather
Arguments: {"location":"London"}
VALIDATION: tool-call SUCCESS

xai/grok-3-mini-high — structured-output:stream (success)

Output

{
"name": "science fair",
"date": "Friday",
"participants": ["Alice", "Bob"]
}
VALIDATION: structured-output stream SUCCESS

xai/grok-3-mini-high — reasoning:stream (success)

Output

**The expression 3^3^3^3 is evaluated right-to-left** (right-associative), which is the standard convention for exponentiation in mathematics.

This m
... (truncated, 944 chars omitted)

xai/grok-3-mini-high — reasoning (success)

Output

**The expression 3^3^3^3 is evaluated right-associatively** (standard mathematical convention for exponentiation, also called right-associativity). Th
... (truncated, 1048 chars omitted)

xai/grok-3-mini-fast-high — tool-call (success)

Output

Function: get_weather
Arguments: {"location":"London"}
VALIDATION: tool-call SUCCESS

xai/grok-3-mini-fast-high — tool-call:stream (success)

Output

{"location":"London"}
VALIDATION: tool-call stream SUCCESS

xai/grok-3-mini-fast-high — structured-output:stream:boto3:adapter (success)

Output

Tool: CalendarEvent
{"name":"science fair","date":"Friday","participants":["Alice","Bob"]}{
  "name": "science fair",
  "date": "Friday",
  "participa
... (truncated, 83 chars omitted)

xai/grok-3-mini-fast-high — tool-call:stream:boto3:adapter (success)

Output

Tool: get_weather
{"location":"London"}Tool: get_weather
{"location":"London"}
VALIDATION: tool-call stream SUCCESS

xai/grok-3-mini-fast-high — params:stream:boto3:adapter (success)

Output

The capital of France is Paris.

xai/grok-3-mini-fast-high — params (success)

Output

The capital of France is Paris.

xai/grok-3-mini-fast-high — structured-output:stream (success)

Output

{
  "name": "science fair",
  "date": "Friday",
  "participants": ["Alice", "Bob"]
}
VALIDATION: structured-output stream SUCCESS

xai/grok-3-mini-fast-high — params:stream (success)

Output

The capital of France is Paris.

xai/grok-3-mini-fast-high — tool-call:boto3:adapter (success)

Output

Tool: get_weather
Input: {'location': 'London'}
Tool: get_weather
Input: {'location': 'London'}
VALIDATION: tool-call SUCCESS

xai/grok-3-mini-fast-high — params:boto3:adapter (success)

Output

The capital of France is Paris.

xai/grok-3-mini-fast-high — structured-output (success)

Output

{
"name": "science fair",
"date": "Friday",
"participants": ["Alice", "Bob"]
}
VALIDATION: structured-output SUCCESS

xai/grok-3-mini-fast-high — structured-output:boto3:adapter (success)

Output

{
  "name": "science fair",
  "date": "Friday",
  "participants": [
    "Alice",
    "Bob"
  ]
}
{
  "name": "science fair",
  "date": "Friday",
  "pa
... (truncated, 82 chars omitted)

xai/grok-3-mini-fast-high — reasoning (success)

Output

**Exponentiation is right-associative**, so the expression `3^3^3^3` is interpreted as a power tower evaluated from the top down (right to left):

`3^
... (truncated, 829 chars omitted)

xai/grok-3-mini-fast-high — reasoning:stream (success)

Output

**Step-by-step reasoning for evaluating 3^3^3^3:**

1. **Association rule**: Exponentiation is **right-associative** (evaluated from the top/right to 
... (truncated, 1566 chars omitted)

xai/grok-3-mini-high-beta — params:stream:boto3:adapter (success)

Output

The capital of France is Paris.

xai/grok-3-mini-high-beta — params:boto3:adapter (success)

Output

The capital of France is Paris.

xai/grok-3-mini-high-beta — tool-call:boto3:adapter (success)

Output

Tool: get_weather
Input: {'location': 'London'}
Tool: get_weather
Input: {'location': 'London'}
VALIDATION: tool-call SUCCESS

xai/grok-3-mini-high-beta — params:stream (success)

Output

The capital of France is **Paris**.

xai/grok-3-mini-high-beta — params (success)

Output

The capital of France is Paris.

xai/grok-3-mini-high-beta — structured-output:stream:boto3:adapter (success)

Output

Tool: CalendarEvent
{"name":"science fair","date":"Friday","participants":["Alice","Bob"]}{
  "name": "science fair",
  "date": "Friday",
  "participa
... (truncated, 83 chars omitted)

xai/grok-3-mini-high-beta — tool-call (success)

Output

Function: get_weather
Arguments: {"location":"London"}
VALIDATION: tool-call SUCCESS

xai/grok-3-mini-high-beta — structured-output:boto3:adapter (success)

Output

{
  "name": "science fair",
  "date": "Friday",
  "participants": [
    "Alice",
    "Bob"
  ]
}
{
  "name": "science fair",
  "date": "Friday",
  "pa
... (truncated, 82 chars omitted)

xai/grok-3-mini-high-beta — tool-call:stream:boto3:adapter (success)

Output

Tool: get_weather
{"location":"London"}Tool: get_weather
{"location":"London"}
VALIDATION: tool-call stream SUCCESS

xai/grok-3-mini-high-beta — tool-call:stream (success)

Output

{"location":"London"}
VALIDATION: tool-call stream SUCCESS

xai/grok-3-mini-high-beta — structured-output:stream (success)

Output

{
"name": "science fair",
"date": "Friday",
"participants": ["Alice", "Bob"]
}
VALIDATION: structured-output stream SUCCESS

xai/grok-3-mini-high-beta — structured-output (success)

Output

{
"name": "science fair",
"date": "Friday",
"participants": ["Alice", "Bob"]
}
VALIDATION: structured-output SUCCESS

xai/grok-3-mini-high-beta — reasoning:stream (success)

Output

**Step-by-step evaluation of \(3^3^3^3\)**

1. **Operator precedence / associativity**  
   Exponentiation is right-associative, so the expression  
 
... (truncated, 1201 chars omitted)

xai/grok-3-mini-high-beta — reasoning (success)

Output

**Step-by-step reasoning for evaluating 3^3^3^3:**

1. **Determine the order of operations**: Exponentiation is right-associative. This means a chain 
... (truncated, 968 chars omitted)

xai/grok-4.3 — params:stream:boto3:adapter (success)

Output

The capital of France is Paris.

xai/grok-4.3 — params:boto3:adapter (success)

Output

The capital of France is Paris.

xai/grok-4.3 — params:stream (success)

Output

The capital of France is Paris.

xai/grok-4.3 — params (success)

Output

Paris is the capital of France.

cursor

Cursor Bugbot has reviewed your changes and found 2 potential issues.

^{❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

^{Reviewed by Cursor Bugbot for commit 3ef7309. Configure here.}

cursor · 2026-05-30T02:42:32Z

 status: active
+supportedModes:
+    - chat
+thinking: true


Sources field accidentally removed from model YAML

Medium Severity

The sources field (previously pointing to https://docs.x.ai/developers/models/grok-3-mini-fast) was deleted from grok-3-mini-high.yaml. The two sibling model files updated in this same PR (grok-3-mini-fast-high.yaml and grok-3-mini-high-beta.yaml) both retain their sources field. This looks like an unintentional removal during the update, causing loss of provenance metadata for this model.

^{Reviewed by Cursor Bugbot for commit 3ef7309. Configure here.}

cursor · 2026-05-30T02:42:33Z

      output_cost_per_token: 2.5e-6
      region: "*"
+features:
+    - prompt_caching


Incomplete features list added to grok-4.3 model

Medium Severity

The newly added features section in grok-4.3.yaml only includes prompt_caching, while the corresponding grok-4.3-latest.yaml (same model, same pricing) declares function_calling, tool_choice, structured_output, system_messages, and prompt_caching. Adding an explicit but incomplete features list is potentially worse than having none, because consumers may treat the presence of the key as authoritative and conclude the model lacks capabilities it actually supports.

^{Reviewed by Cursor Bugbot for commit 3ef7309. Configure here.}

feat(xai): update model YAMLs [bot]

3ef7309

cursor Bot reviewed May 30, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(xai): update model YAMLs [bot]#1201

feat(xai): update model YAMLs [bot]#1201
models-bot[bot] wants to merge 1 commit into
mainfrom
bot/update-xai-20260530-023913

models-bot Bot commented May 30, 2026 •

edited by cursor Bot

Loading

Uh oh!

github-actions Bot commented May 30, 2026

Uh oh!

harshiv-26 commented May 30, 2026

Uh oh!

cursor Bot left a comment

Uh oh!

cursor Bot May 30, 2026

Uh oh!

cursor Bot May 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

models-bot Bot commented May 30, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented May 30, 2026

Uh oh!

harshiv-26 commented May 30, 2026

Gateway test results

Uh oh!

cursor Bot left a comment

Choose a reason for hiding this comment

Uh oh!

cursor Bot May 30, 2026

Choose a reason for hiding this comment

Sources field accidentally removed from model YAML

Uh oh!

cursor Bot May 30, 2026

Choose a reason for hiding this comment

Incomplete features list added to grok-4.3 model

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

models-bot Bot commented May 30, 2026 •

edited by cursor Bot

Loading