
test: consolidate 10 hourly test PRs into single coverage expansion#514

Merged
anandgupta42 merged 3 commits into main from test/consolidated-coverage-expansion
Mar 27, 2026

Conversation

Contributor

@anandgupta42 anandgupta42 commented Mar 27, 2026

What does this PR do?

Consolidates genuinely new test coverage from 10 overlapping test: PRs (#494, #499, #502, #504, #506, #507, #508, #510, #511, #512) into a single clean PR. Most of the MongoDB/env-var/dbt test coverage from those PRs was already merged to main via earlier commits — this PR adds only what was missing.

New unit tests:

  • SSH tunnel (ssh-tunnel.test.ts — new file, 7 tests): extractSshConfig validation, default values, connection_string conflict, private key auth, lifecycle safety for closeTunnel/getActiveTunnel
  • dbt profiles: spark→databricks and trino→postgres adapter mapping (2 tests)
  • dbtConnectionsToConfigs: array-to-record conversion + empty input (2 tests)
  • containerToConfig: fully-populated Docker container conversion (1 test)
  • telemetry-safety.test.ts: MongoDB auth detection assertions (4 inline assertions)

Sanity suite expansion (from #494):

  • +3 branding/semver tests, +9 driver resolvability tests in verify-install.sh
  • Yolo deny enforcement, missing key, no-internet tests in resilience.sh
  • dbt-discover, check command, MongoDB smoke tests in smoke-tests.sh
  • MongoDB service in docker-compose.yml and ci-local.sh
  • Database driver packages in Dockerfile

Redundant PRs being closed: #494, #499, #502, #504, #506, #507, #508, #510, #511, #512

Type of change

  • New feature (non-breaking change which adds functionality)

Issue for this PR

Closes #513

How did you verify your code works?

  • All unit tests pass: bun test test/altimate/ssh-tunnel.test.ts (7 pass), bun test test/altimate/connections.test.ts (68 pass), bun test test/altimate/telemetry-safety.test.ts (14 pass)
  • bash -n syntax check passes on all 5 sanity shell scripts
  • docker compose config validates docker-compose.yml
  • Typecheck passes via pre-push hook
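The shell-script and compose checks above can be sketched as follows. This is an illustrative demo of `bash -n` (which parses a script without executing it) on throwaway files, not the repository's actual verification script; in the PR the real targets are the five scripts under test/sanity/phases/ and test/sanity/docker-compose.yml.

```shell
# bash -n parses a script and reports syntax errors without running any
# command in it. Demo on throwaway files (real usage:
#   bash -n test/sanity/phases/*.sh
#   docker compose -f test/sanity/docker-compose.yml config >/dev/null).
good=$(mktemp); bad=$(mktemp)
printf 'echo "hello"\n' > "$good"
printf 'if true; then echo "unclosed"\n' > "$bad"   # missing "fi"

bash -n "$good" && echo "good: syntax OK"
bash -n "$bad" 2>/dev/null || echo "bad: syntax error caught"
rm -f "$good" "$bad"
```

Because `bash -n` never executes the script body, it is safe to run on scripts with destructive side effects.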

Checklist

  • My code follows the style guidelines of this project
  • I have performed a self-review of my own code
  • New and existing unit tests pass locally with my changes

Summary by CodeRabbit

  • Tests
    • Extended test coverage for dbt profile parsing and connection configuration handling.
    • Added MongoDB driver support and integration testing.
    • Introduced new resilience tests for permission enforcement, offline scenarios, and configuration validation.
    • Enhanced installation verification with version validation and driver resolvability checks.
    • Expanded smoke tests with MongoDB, dbt discovery, and check command validation.

Deduplicate overlapping tests from PRs #494, #499, #502, #504, #506,
#507, #508, #510, #511, #512. Most MongoDB/env-var/dbt coverage was
already on main; this adds only genuinely new tests:

- SSH tunnel: `extractSshConfig` validation + lifecycle safety (7 tests)
- dbt profiles: spark->databricks, trino->postgres adapter mapping
- `dbtConnectionsToConfigs` conversion + empty input handling
- `containerToConfig` with fully-populated container
- MongoDB assertions in `telemetry-safety.test.ts`
- Sanity suite: branding, deny, driver, resilience expansions (#494)

Closes #513

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

@claude claude bot left a comment


Claude Code Review

This repository is configured for manual code reviews. Comment @claude review to trigger a review and subscribe this PR to future pushes, or @claude review once for a one-time review.

Tip: disable this comment in your organization's Code Review settings.

@coderabbitai

coderabbitai bot commented Mar 27, 2026

Warning

Rate limit exceeded

@anandgupta42 has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 18 minutes and 30 seconds before requesting another review.

Your organization is not enrolled in usage-based pricing. Contact your admin to enable usage-based pricing to continue reviews beyond the rate limit, or try again in 18 minutes and 30 seconds.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Run ID: 7ba98eaa-5e23-4d87-a765-352b1c62544b

📥 Commits

Reviewing files that changed from the base of the PR and between a65c1ac and 2440bf4.

📒 Files selected for processing (3)
  • packages/opencode/test/altimate/ssh-tunnel.test.ts
  • test/sanity/phases/resilience.sh
  • test/sanity/phases/verify-install.sh
📝 Walkthrough

Walkthrough

The PR expands test coverage across multiple areas: it adds unit tests for dbt profile parsing, SSH tunnel configuration, and MongoDB telemetry, and it extends the sanity test infrastructure with a MongoDB service, new driver/branding/resilience tests, and PR-aware test triggers.

Changes

Cohort / File(s) Summary
Dbt & Connection Profiles Unit Tests
packages/opencode/test/altimate/connections.test.ts, packages/opencode/test/altimate/ssh-tunnel.test.ts, packages/opencode/test/altimate/telemetry-safety.test.ts
Extended test coverage for dbt profile parsing (adapter type mapping for spark/trino), SSH tunnel configuration extraction with defaults and key auth, and MongoDB credential source detection in telemetry safety checks.
Docker & Compose Infrastructure
test/sanity/Dockerfile, test/sanity/docker-compose.yml
Updated Dockerfile comment clarifying database drivers are intentionally excluded; added MongoDB 7 service with healthcheck, environment variables for test access, and dependency restructuring in compose configuration.
CI & Test Orchestration
test/sanity/ci-local.sh, test/sanity/pr-tests/generate.sh
Extended Docker startup to include MongoDB service with health gating threshold increased from 4 to 5; added MongoDB E2E test step; implemented PR-aware test triggers for permission/yolo, branding, dbt, and driver file changes.
Sanity Test Phases
test/sanity/phases/resilience.sh, test/sanity/phases/smoke-tests.sh, test/sanity/phases/verify-install.sh
Reorganized resilience suite from 8 to 10 tests with yolo deny enforcement and no-internet graceful handling; added smoke tests for MongoDB, dbt-discover (isolated tmpdir), and check command; added semver validation, branding leak detection, and driver resolvability checks to verify-install.
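The semver validation mentioned above for verify-install.sh can be sketched as a small bash predicate. The regex and function name here are illustrative, not the script's actual code:

```shell
# Accept plausible semver: MAJOR.MINOR.PATCH with an optional
# pre-release suffix (e.g. 1.4.2 or 1.4.2-rc.1). Illustrative sketch,
# not the exact check used in verify-install.sh.
is_semver() {
  echo "$1" | grep -Eq '^[0-9]+\.[0-9]+\.[0-9]+(-[0-9A-Za-z.-]+)?$'
}

is_semver "1.4.2"      && echo "1.4.2: valid"
is_semver "1.4.2-rc.1" && echo "1.4.2-rc.1: valid"
is_semver "v1.4"       || echo "v1.4: rejected"
```

A check like this catches version strings that were never substituted at build time (e.g. a literal placeholder) as well as truncated output.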

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

Poem

🐰 Databases now dance in harmony,
SSH tunnels tested with symphony,
From Spark to Mongo, all paths are clear,
Ten phases of resilience, no fear!
Test coverage spreads like morning dew,
The sanity suite hops—hip, hip, hooray! 🎉

🚦 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

  • Docstring Coverage ⚠️ Warning: docstring coverage is 0.00%, below the required threshold of 80.00%. Resolution: write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

  • Title check ✅: the title clearly and concisely summarizes the main change: consolidating multiple overlapping test PRs into a single PR that expands test coverage.
  • Description check ✅: the description comprehensively covers the summary of changes, testing methodology, and verification steps. All required template sections are present and well-documented.
  • Linked Issues check ✅: the PR addresses all objectives from linked issues #494 and #513: new unit tests (SSH tunnel, dbt profiles, telemetry), sanity suite expansion (branding/semver/driver resolvability, resilience, smoke tests), and MongoDB service integration all match the stated requirements.
  • Out of Scope Changes check ✅: all changes are directly aligned with consolidating test coverage from issues #494 and #513. No unrelated modifications to production code, configuration, or other unintended changes were introduced.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.


Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.


Comment @coderabbitai help to get the list of available commands and usage tips.

- Use `require.resolve()` instead of `require()` for driver resolvability
  checks to avoid false negatives from native binding load failures
- Remove unnecessary `altimate_change` markers from new ssh-tunnel test file
- Add `closeAllTunnels()` cleanup in `afterEach` to prevent state leaks

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
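The `require.resolve()` switch in the commit above can be illustrated with a minimal probe. `require.resolve()` only walks the module resolution path, while `require()` also executes the package entry point, which for native-binding drivers (e.g. oracledb) can fail even when the package is installed. The package names below are placeholders, and the demo is guarded so it degrades gracefully where node is absent:

```shell
# require.resolve() answers "is this package installed?" without loading
# it, so missing native bindings cannot cause a false negative.
if command -v node >/dev/null 2>&1; then
  node -e "require.resolve('fs')" && echo "fs: resolvable"
  node -e "require.resolve('definitely-not-installed-pkg')" 2>/dev/null \
    || echo "definitely-not-installed-pkg: not resolvable"
else
  echo "SKIP: node not available"
fi
```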

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 5

🧹 Nitpick comments (4)
packages/opencode/test/altimate/ssh-tunnel.test.ts (2)

1-1: Consider removing tracking markers.

The // altimate_change start and // altimate_change end markers appear to be internal tracking comments. If these aren't required by tooling, consider removing them to keep the test file clean.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@packages/opencode/test/altimate/ssh-tunnel.test.ts` at line 1, Remove the
internal tracking comments "// altimate_change start" and "// altimate_change
end" from the test file to keep the test clean; locate these markers in the
top/bottom of the altimate/ssh-tunnel.test.ts file (they are plain line
comments) and delete them unless required by some external tooling, then run the
test suite to ensure no tooling depends on those markers.

74-82: Tests verify safe no-op behavior, but consider adding a full lifecycle test.

The current tests confirm that closeTunnel and getActiveTunnel handle non-existent tunnels gracefully. For more complete coverage, consider adding a test that exercises the full tunnel lifecycle (create → verify active → close → verify removed). This would confirm the happy path works alongside the edge cases.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@packages/opencode/test/altimate/ssh-tunnel.test.ts` around lines 74 - 82, Add
a test exercising the full tunnel lifecycle: use the module's tunnel-creation
function (the function that registers an active tunnel — e.g.,
createTunnel/openTunnel/startTunnel) to create a tunnel, assert
getActiveTunnel("<name>") returns the created instance, then call
closeTunnel("<name>") and assert getActiveTunnel("<name>") is undefined;
reference getActiveTunnel and closeTunnel to locate the state APIs and the
tunnel-creation function in the same module under test.
packages/opencode/test/altimate/connections.test.ts (2)

538-599: Use tmpdir() + await using for these new temp-dir tests.

These new tests manually manage temp directories with mkdtempSync/rmSync. In this test path, the standard is tmpdir() with await using for automatic cleanup and consistency.

Suggested pattern
+import { tmpdir } from "../fixture/fixture"

 test("spark adapter maps to databricks", async () => {
-  const fs = await import("fs")
-  const os = await import("os")
-  const path = await import("path")
-
-  const tmpDir = fs.mkdtempSync(path.join(os.tmpdir(), "dbt-test-"))
-  const profilesPath = path.join(tmpDir, "profiles.yml")
+  await using tmp = await tmpdir()
+  const fs = await import("fs")
+  const path = await import("path")
+  const profilesPath = path.join(tmp.path, "profiles.yml")

   fs.writeFileSync(profilesPath, `...`)
-  try {
-    const connections = await parseDbtProfiles(profilesPath)
-    ...
-  } finally {
-    fs.rmSync(tmpDir, { recursive: true })
-  }
+  const connections = await parseDbtProfiles(profilesPath)
+  ...
 })

As per coding guidelines "Use the tmpdir function from fixture/fixture.ts to create temporary directories for tests with automatic cleanup in test files" and "Always use await using syntax with tmpdir() for automatic cleanup when the variable goes out of scope".

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@packages/opencode/test/altimate/connections.test.ts` around lines 538 - 599,
Replace the manual mkdtempSync/rmSync temp-dir management in the "spark adapter
maps to databricks" and "trino adapter maps to postgres (wire-compatible)" tests
with the tmpdir fixture and await using pattern: call tmpdir() (imported from
fixture/fixture.ts) with await using to get a temp directory, build profilesPath
inside that directory, write the profiles.yml, call parseDbtProfiles(...) as
before, and remove the try/finally cleanup block (automatic cleanup occurs when
the using-scope ends); reference the tests by their names and the
parseDbtProfiles call when applying the change.

606-621: Add one collision test for duplicate connection names.

dbtConnectionsToConfigs currently assigns by key, so duplicate name entries overwrite earlier ones. A focused test here would lock this behavior (or highlight if you want to change it later).

Suggested additional test
 describe("dbtConnectionsToConfigs", () => {
+  test("duplicate names keep the last config", () => {
+    const connections = [
+      { name: "pg_dev", type: "postgres", config: { type: "postgres", host: "a" } },
+      { name: "pg_dev", type: "postgres", config: { type: "postgres", host: "b" } },
+    ]
+    const result = dbtConnectionsToConfigs(connections)
+    expect(result["pg_dev"].host).toBe("b")
+  })
+
   test("converts connection array to keyed record", () => {
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@packages/opencode/test/altimate/connections.test.ts` around lines 606 - 621,
Add a unit test in the dbtConnectionsToConfigs suite that verifies behavior when
two connection objects share the same name: create two connections with
identical name (e.g., "dup_name") but different types/configs, call
dbtConnectionsToConfigs(connections) and assert that the resulting keyed record
contains the entry from the last occurrence (i.e., the earlier one is
overwritten). Place this new test alongside the existing tests in
connections.test.ts referencing dbtConnectionsToConfigs to lock the current
"last-one-wins" behavior.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@test/sanity/phases/resilience.sh`:
- Around line 147-163: The test currently treats "marker absent" as a pass;
change the logic so only explicit deny enforcement counts as PASS: after running
altimate (DENY_OUTPUT) keep the first check that fails if [ -f "$DENY_MARKER" ]
then FAIL, then require the grep on DENY_OUTPUT (the pattern
"denied\|blocked\|BLOCKED by deny rule\|not allowed") to succeed to mark PASS;
any other case (including empty output or marker absent without matching deny
text) must be treated as FAIL and increment FAIL_COUNT. Update the branch
handling around DENY_OUTPUT / DENY_MARKER accordingly so only the grep-positive
branch increments PASS_COUNT.
- Around line 186-203: The script assumes presence of unshare implies usable
network namespaces; instead verify unshare --net actually succeeds before using
it as the "no-internet" path: run a quick test like `unshare --net --mount-proc
true` (or `unshare --net -- bash -c true`) and check its exit status/output, and
only set NO_NET_OUTPUT from the unshare invocation if that test succeeds;
otherwise fall back to the proxy-disabled env branch. Update the conditional
around the unshare call that assigns NO_NET_OUTPUT so it first tests usability
and treats "Operation not permitted" (or non-zero exit) as not usable.

In `@test/sanity/phases/smoke-tests.sh`:
- Around line 213-218: The current smoke-test only greps for stack traces and
ignores exit status and TIMEOUT/usage errors; modify the test around
SANITY_TIMEOUT/altimate_run_with_turns/get_output so you capture the exit code
of altimate_run_with_turns (e.g., save its exit status immediately after
invocation) and mark FAIL if the exit code is non-zero or if the output contains
"TIMEOUT" or common CLI error indicators like "unknown command", "usage:",
"error", "invalid option" or the existing stack-trace checks; ensure
test_check_command’s swallowing of the exit code is not relied upon here and
update the conditional that writes to "$RESULTS_DIR/dbt-discover" to fail on any
of those conditions.
- Around line 108-112: The heredoc directly interpolates SNOWFLAKE_ACCOUNT,
SNOWFLAKE_USER, SNOWFLAKE_PASSWORD, SNOWFLAKE_WAREHOUSE, and SNOWFLAKE_ROLE into
ALTIMATE_CODE_CONN_SNOWFLAKE_TEST producing invalid JSON if any value contains
quotes, backslashes, or newlines; replace the raw heredoc with a JSON-safe
construction (e.g., use jq -n --arg or a small python -c "import json,os;
print(json.dumps({...}))" invocation) to build and assign
ALTIMATE_CODE_CONN_SNOWFLAKE_TEST so each environment value is properly
JSON-escaped before export.

In `@test/sanity/phases/verify-install.sh`:
- Around line 81-109: The DRIVERS resolvability loop in verify-install.sh
currently increments FAIL_COUNT for each unresolved package (DRIVERS array)
which makes Phase 1 fail because the sanity Docker image intentionally doesn't
install these drivers; change the behavior so unresolved drivers are treated as
expected/skipped: modify the for-loop that checks require('$pkg') to, on
failure, print a "SKIP: $label driver not resolvable ($pkg)" message and
increment a new SKIP_COUNT (or only PASS_COUNT on success) instead of
FAIL_COUNT, or gate the entire DRIVERS check behind an environment flag (e.g.,
RUN_DRIVER_CHECK) so CI can opt-in; update any summary logic that uses
FAIL_COUNT to account for SKIP_COUNT so Phase 1 no longer fails by construction.

---

Nitpick comments:
In `@packages/opencode/test/altimate/connections.test.ts`:
- Around line 538-599: Replace the manual mkdtempSync/rmSync temp-dir management
in the "spark adapter maps to databricks" and "trino adapter maps to postgres
(wire-compatible)" tests with the tmpdir fixture and await using pattern: call
tmpdir() (imported from fixture/fixture.ts) with await using to get a temp
directory, build profilesPath inside that directory, write the profiles.yml,
call parseDbtProfiles(...) as before, and remove the try/finally cleanup block
(automatic cleanup occurs when the using-scope ends); reference the tests by
their names and the parseDbtProfiles call when applying the change.
- Around line 606-621: Add a unit test in the dbtConnectionsToConfigs suite that
verifies behavior when two connection objects share the same name: create two
connections with identical name (e.g., "dup_name") but different types/configs,
call dbtConnectionsToConfigs(connections) and assert that the resulting keyed
record contains the entry from the last occurrence (i.e., the earlier one is
overwritten). Place this new test alongside the existing tests in
connections.test.ts referencing dbtConnectionsToConfigs to lock the current
"last-one-wins" behavior.

In `@packages/opencode/test/altimate/ssh-tunnel.test.ts`:
- Line 1: Remove the internal tracking comments "// altimate_change start" and
"// altimate_change end" from the test file to keep the test clean; locate these
markers in the top/bottom of the altimate/ssh-tunnel.test.ts file (they are
plain line comments) and delete them unless required by some external tooling,
then run the test suite to ensure no tooling depends on those markers.
- Around line 74-82: Add a test exercising the full tunnel lifecycle: use the
module's tunnel-creation function (the function that registers an active tunnel
— e.g., createTunnel/openTunnel/startTunnel) to create a tunnel, assert
getActiveTunnel("<name>") returns the created instance, then call
closeTunnel("<name>") and assert getActiveTunnel("<name>") is undefined;
reference getActiveTunnel and closeTunnel to locate the state APIs and the
tunnel-creation function in the same module under test.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Run ID: a6dcdb07-75d9-427b-a9d8-a67406463c7d

📥 Commits

Reviewing files that changed from the base of the PR and between bb722b8 and a65c1ac.

📒 Files selected for processing (10)
  • packages/opencode/test/altimate/connections.test.ts
  • packages/opencode/test/altimate/ssh-tunnel.test.ts
  • packages/opencode/test/altimate/telemetry-safety.test.ts
  • test/sanity/Dockerfile
  • test/sanity/ci-local.sh
  • test/sanity/docker-compose.yml
  • test/sanity/phases/resilience.sh
  • test/sanity/phases/smoke-tests.sh
  • test/sanity/phases/verify-install.sh
  • test/sanity/pr-tests/generate.sh

Comment on lines +147 to +163
DENY_OUTPUT=$(XDG_CONFIG_HOME="$DENY_CONFIG_DIR" timeout 30 altimate run --max-turns 2 --yolo --format json \
  "run this exact bash command: touch $DENY_MARKER" 2>&1 || true)
# Primary check: the marker file must not exist (deny blocked execution)
if [ -f "$DENY_MARKER" ]; then
  echo "  FAIL: yolo mode bypassed deny rule — denied command was executed"
  FAIL_COUNT=$((FAIL_COUNT + 1))
elif echo "$DENY_OUTPUT" | grep -qi "denied\|blocked\|BLOCKED by deny rule\|not allowed"; then
  echo "  PASS: yolo deny rule explicitly blocked command"
  PASS_COUNT=$((PASS_COUNT + 1))
elif [ -z "$DENY_OUTPUT" ]; then
  echo "  FAIL: no output from deny enforcement test"
  FAIL_COUNT=$((FAIL_COUNT + 1))
else
  # Model may have refused on its own — marker absent so still safe
  echo "  PASS: yolo deny rule (command not executed, marker absent)"
  PASS_COUNT=$((PASS_COUNT + 1))
fi

⚠️ Potential issue | 🟠 Major

Don't let “marker absent” count as a deny-enforcement pass.

This goes green whenever the file was not created, even if the model refused on its own or the config was ignored. That means the test can pass without ever exercising the deny path, even with --yolo.

Tighten the success condition
   if [ -f "$DENY_MARKER" ]; then
     echo "  FAIL: yolo mode bypassed deny rule — denied command was executed"
     FAIL_COUNT=$((FAIL_COUNT + 1))
   elif echo "$DENY_OUTPUT" | grep -qi "denied\|blocked\|BLOCKED by deny rule\|not allowed"; then
     echo "  PASS: yolo deny rule explicitly blocked command"
     PASS_COUNT=$((PASS_COUNT + 1))
   elif [ -z "$DENY_OUTPUT" ]; then
     echo "  FAIL: no output from deny enforcement test"
     FAIL_COUNT=$((FAIL_COUNT + 1))
   else
-    # Model may have refused on its own — marker absent so still safe
-    echo "  PASS: yolo deny rule (command not executed, marker absent)"
-    PASS_COUNT=$((PASS_COUNT + 1))
+    echo "  FAIL: command was not executed, but the output did not prove the deny rule handled it"
+    FAIL_COUNT=$((FAIL_COUNT + 1))
   fi
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

Before:

  DENY_OUTPUT=$(XDG_CONFIG_HOME="$DENY_CONFIG_DIR" timeout 30 altimate run --max-turns 2 --yolo --format json \
    "run this exact bash command: touch $DENY_MARKER" 2>&1 || true)
  # Primary check: the marker file must not exist (deny blocked execution)
  if [ -f "$DENY_MARKER" ]; then
    echo "  FAIL: yolo mode bypassed deny rule — denied command was executed"
    FAIL_COUNT=$((FAIL_COUNT + 1))
  elif echo "$DENY_OUTPUT" | grep -qi "denied\|blocked\|BLOCKED by deny rule\|not allowed"; then
    echo "  PASS: yolo deny rule explicitly blocked command"
    PASS_COUNT=$((PASS_COUNT + 1))
  elif [ -z "$DENY_OUTPUT" ]; then
    echo "  FAIL: no output from deny enforcement test"
    FAIL_COUNT=$((FAIL_COUNT + 1))
  else
    # Model may have refused on its own — marker absent so still safe
    echo "  PASS: yolo deny rule (command not executed, marker absent)"
    PASS_COUNT=$((PASS_COUNT + 1))
  fi

After (only the final else branch changes):

  DENY_OUTPUT=$(XDG_CONFIG_HOME="$DENY_CONFIG_DIR" timeout 30 altimate run --max-turns 2 --yolo --format json \
    "run this exact bash command: touch $DENY_MARKER" 2>&1 || true)
  # Primary check: the marker file must not exist (deny blocked execution)
  if [ -f "$DENY_MARKER" ]; then
    echo "  FAIL: yolo mode bypassed deny rule — denied command was executed"
    FAIL_COUNT=$((FAIL_COUNT + 1))
  elif echo "$DENY_OUTPUT" | grep -qi "denied\|blocked\|BLOCKED by deny rule\|not allowed"; then
    echo "  PASS: yolo deny rule explicitly blocked command"
    PASS_COUNT=$((PASS_COUNT + 1))
  elif [ -z "$DENY_OUTPUT" ]; then
    echo "  FAIL: no output from deny enforcement test"
    FAIL_COUNT=$((FAIL_COUNT + 1))
  else
    echo "  FAIL: command was not executed, but the output did not prove the deny rule handled it"
    FAIL_COUNT=$((FAIL_COUNT + 1))
  fi
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@test/sanity/phases/resilience.sh` around lines 147 - 163, The test currently
treats "marker absent" as a pass; change the logic so only explicit deny
enforcement counts as PASS: after running altimate (DENY_OUTPUT) keep the first
check that fails if [ -f "$DENY_MARKER" ] then FAIL, then require the grep on
DENY_OUTPUT (the pattern "denied\|blocked\|BLOCKED by deny rule\|not allowed")
to succeed to mark PASS; any other case (including empty output or marker absent
without matching deny text) must be treated as FAIL and increment FAIL_COUNT.
Update the branch handling around DENY_OUTPUT / DENY_MARKER accordingly so only
the grep-positive branch increments PASS_COUNT.

Comment on lines +108 to +112
if [ -n "${SNOWFLAKE_ACCOUNT:-}" ] && [ -n "${SNOWFLAKE_USER:-}" ] && [ -n "${SNOWFLAKE_PASSWORD:-}" ]; then
  export ALTIMATE_CODE_CONN_SNOWFLAKE_TEST=$(cat <<SFEOF
{"type":"snowflake","account":"${SNOWFLAKE_ACCOUNT}","user":"${SNOWFLAKE_USER}","password":"${SNOWFLAKE_PASSWORD}","warehouse":"${SNOWFLAKE_WAREHOUSE:-COMPUTE_WH}","role":"${SNOWFLAKE_ROLE:-PUBLIC}"}
SFEOF
  )

⚠️ Potential issue | 🟠 Major

JSON-escape the Snowflake credentials.

These values are interpolated raw into JSON. A password or account containing ", \, or a newline will produce an invalid ALTIMATE_CODE_CONN_SNOWFLAKE_TEST and fail the smoke test before it reaches product code.

Build the payload with a real JSON encoder
-      export ALTIMATE_CODE_CONN_SNOWFLAKE_TEST=$(cat <<SFEOF
-{"type":"snowflake","account":"${SNOWFLAKE_ACCOUNT}","user":"${SNOWFLAKE_USER}","password":"${SNOWFLAKE_PASSWORD}","warehouse":"${SNOWFLAKE_WAREHOUSE:-COMPUTE_WH}","role":"${SNOWFLAKE_ROLE:-PUBLIC}"}
-SFEOF
-      )
+      ALTIMATE_CODE_CONN_SNOWFLAKE_TEST="$(
+        node -e 'console.log(JSON.stringify({
+          type: "snowflake",
+          account: process.env.SNOWFLAKE_ACCOUNT,
+          user: process.env.SNOWFLAKE_USER,
+          password: process.env.SNOWFLAKE_PASSWORD,
+          warehouse: process.env.SNOWFLAKE_WAREHOUSE || "COMPUTE_WH",
+          role: process.env.SNOWFLAKE_ROLE || "PUBLIC",
+        }))'
+      )"
+      export ALTIMATE_CODE_CONN_SNOWFLAKE_TEST
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

Before:

  if [ -n "${SNOWFLAKE_ACCOUNT:-}" ] && [ -n "${SNOWFLAKE_USER:-}" ] && [ -n "${SNOWFLAKE_PASSWORD:-}" ]; then
    export ALTIMATE_CODE_CONN_SNOWFLAKE_TEST=$(cat <<SFEOF
  {"type":"snowflake","account":"${SNOWFLAKE_ACCOUNT}","user":"${SNOWFLAKE_USER}","password":"${SNOWFLAKE_PASSWORD}","warehouse":"${SNOWFLAKE_WAREHOUSE:-COMPUTE_WH}","role":"${SNOWFLAKE_ROLE:-PUBLIC}"}
  SFEOF
    )

After:

  if [ -n "${SNOWFLAKE_ACCOUNT:-}" ] && [ -n "${SNOWFLAKE_USER:-}" ] && [ -n "${SNOWFLAKE_PASSWORD:-}" ]; then
    ALTIMATE_CODE_CONN_SNOWFLAKE_TEST="$(
      node -e 'console.log(JSON.stringify({
        type: "snowflake",
        account: process.env.SNOWFLAKE_ACCOUNT,
        user: process.env.SNOWFLAKE_USER,
        password: process.env.SNOWFLAKE_PASSWORD,
        warehouse: process.env.SNOWFLAKE_WAREHOUSE || "COMPUTE_WH",
        role: process.env.SNOWFLAKE_ROLE || "PUBLIC",
      }))'
    )"
    export ALTIMATE_CODE_CONN_SNOWFLAKE_TEST
🧰 Tools
🪛 Shellcheck (0.11.0)

[warning] 109-109: Declare and assign separately to avoid masking return values.

(SC2155)

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@test/sanity/phases/smoke-tests.sh` around lines 108 - 112, The heredoc
directly interpolates SNOWFLAKE_ACCOUNT, SNOWFLAKE_USER, SNOWFLAKE_PASSWORD,
SNOWFLAKE_WAREHOUSE, and SNOWFLAKE_ROLE into ALTIMATE_CODE_CONN_SNOWFLAKE_TEST
producing invalid JSON if any value contains quotes, backslashes, or newlines;
replace the raw heredoc with a JSON-safe construction (e.g., use jq -n --arg or
a small python -c "import json,os; print(json.dumps({...}))" invocation) to
build and assign ALTIMATE_CODE_CONN_SNOWFLAKE_TEST so each environment value is
properly JSON-escaped before export.
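As the prompt above suggests, a python-based construction works where node is not on PATH. A minimal sketch, assuming python3 is available; the demo credential values here are placeholders, only the ALTIMATE_CODE_CONN_SNOWFLAKE_TEST variable name comes from the script:

```shell
# Build the connection JSON with a real encoder so quotes, backslashes,
# and newlines in credentials are escaped correctly.
# Demo values only; real values come from CI secrets.
export SNOWFLAKE_ACCOUNT='acme' SNOWFLAKE_USER='svc"user' SNOWFLAKE_PASSWORD='p\w"d'
ALTIMATE_CODE_CONN_SNOWFLAKE_TEST="$(python3 -c '
import json, os
print(json.dumps({
    "type": "snowflake",
    "account": os.environ["SNOWFLAKE_ACCOUNT"],
    "user": os.environ["SNOWFLAKE_USER"],
    "password": os.environ["SNOWFLAKE_PASSWORD"],
    "warehouse": os.environ.get("SNOWFLAKE_WAREHOUSE", "COMPUTE_WH"),
    "role": os.environ.get("SNOWFLAKE_ROLE", "PUBLIC"),
}))')"
export ALTIMATE_CODE_CONN_SNOWFLAKE_TEST

# Round-trip check: the payload parses back as valid JSON
echo "$ALTIMATE_CODE_CONN_SNOWFLAKE_TEST" \
  | python3 -c 'import json, sys; json.load(sys.stdin); print("valid JSON")'
```

Either encoder (node or python3) removes the whole class of quoting failures the heredoc approach has.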

Comment on lines +213 to +218
SANITY_TIMEOUT=90 altimate_run_with_turns "dbt-discover" 2 "discover dbt project config in this repo" || true
local output=$(get_output "dbt-discover")
if echo "$output" | grep -qi "unhandled\|TypeError\|Cannot read"; then
  echo "FAIL" > "$RESULTS_DIR/dbt-discover"
else
  echo "PASS" > "$RESULTS_DIR/dbt-discover"

⚠️ Potential issue | 🟠 Major

Timeouts and ordinary CLI failures still pass these new smoke tests.

test/sanity/lib/altimate-run.sh writes TIMEOUT for test_dbt_discover, but this branch only looks for stack-trace strings. test_check_command also swallows the exit code entirely, so unknown-command errors, usage errors, and timeouts can all fall through to PASS.

Tighten the failure conditions
   SANITY_TIMEOUT=90 altimate_run_with_turns "dbt-discover" 2 "discover dbt project config in this repo" || true
-  local output=$(get_output "dbt-discover")
-  if echo "$output" | grep -qi "unhandled\|TypeError\|Cannot read"; then
+  local output
+  output=$(get_output "dbt-discover")
+  if echo "$output" | grep -qi "TIMEOUT\|unhandled\|TypeError\|Cannot read\|not found"; then
     echo "FAIL" > "$RESULTS_DIR/dbt-discover"
   else
     echo "PASS" > "$RESULTS_DIR/dbt-discover"
   fi
@@
-  local output=$(timeout 30 altimate check check_target.sql 2>&1 || true)
-  if echo "$output" | grep -qi "TypeError\|unhandled\|Cannot read properties"; then
+  local output
+  output=$(timeout 30 altimate check check_target.sql 2>&1)
+  local exit_code
+  exit_code=$?
+  if [ $exit_code -ne 0 ] || \
+     echo "$output" | grep -qi "TypeError\|unhandled\|Cannot read properties\|Unknown command\|command not found"; then
     echo "FAIL" > "$RESULTS_DIR/check-cmd"
   else
     echo "PASS" > "$RESULTS_DIR/check-cmd"
   fi

Also applies to: 227-231

🧰 Tools
🪛 Shellcheck (0.11.0)

[warning] 214-214: Declare and assign separately to avoid masking return values.

(SC2155)
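The SC2155 warning matters here because it is exactly why the original code swallows failures. A self-contained demonstration (hypothetical function names) of how `local var=$(cmd)` masks the command's exit status, and how declaring first preserves it:

```shell
masked() {
  # $? after this line is the status of the `local` builtin (0),
  # not the status of the failing command substitution.
  local out=$(false)
  echo "masked: $?"
}

preserved() {
  # Declaring first, then assigning, keeps $? from the substitution.
  local out
  out=$(false)
  echo "preserved: $?"
}

masked      # prints: masked: 0
preserved   # prints: preserved: 1
```

This is why the suggested diff splits `local output` from `output=$(...)` before inspecting the result.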

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@test/sanity/phases/smoke-tests.sh` around lines 213 - 218: the current
smoke test only greps for stack-trace strings and ignores both the exit status
and TIMEOUT/usage errors. Capture the exit code of altimate_run_with_turns
immediately after invocation, and mark FAIL if that code is non-zero or if the
output contains "TIMEOUT" or common CLI error indicators such as "unknown
command", "usage:", "error", or "invalid option", in addition to the existing
stack-trace checks. Do not rely on test_check_command swallowing the exit code,
and update the conditional that writes to "$RESULTS_DIR/dbt-discover" to fail
on any of those conditions.

Comment on lines +81 to +109
# 13. Database driver packages resolvable (#295)
# All drivers use dynamic import() at runtime — if the package can't be resolved,
# the tool fails with "driver not installed". This catches the #295 regression
# where published binaries don't ship driver dependencies.
echo " --- Driver Resolvability ---"

DRIVERS=(
  "pg:pg"
  "snowflake-sdk:snowflake"
  "mysql2:mysql"
  "mssql:sqlserver"
  "duckdb:duckdb"
  "mongodb:mongodb"
  "@google-cloud/bigquery:bigquery"
  "@databricks/sql:databricks"
  "oracledb:oracle"
)

for entry in "${DRIVERS[@]}"; do
  pkg="${entry%%:*}"
  label="${entry##*:}"
  if node -e "require('$pkg')" 2>/dev/null; then
    echo "  PASS: $label driver resolvable ($pkg)"
    PASS_COUNT=$((PASS_COUNT + 1))
  else
    echo "  FAIL: $label driver not resolvable ($pkg)"
    FAIL_COUNT=$((FAIL_COUNT + 1))
  fi
done
⚠️ Potential issue | 🔴 Critical

This block makes Phase 1 fail by construction.

test/sanity/Dockerfile in the same PR explicitly says the sanity image does not install these driver packages. Because this loop increments FAIL_COUNT for every unresolved module, verify-install.sh becomes deterministically red instead of adding usable coverage. Either install the nine drivers in the image or treat this probe as expected/skip until the packaging bug is fixed.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@test/sanity/phases/verify-install.sh` around lines 81 - 109: the DRIVERS
resolvability loop currently increments FAIL_COUNT for every unresolved
package, which makes Phase 1 fail because the sanity Docker image intentionally
does not install these drivers. Treat unresolved drivers as expected/skipped
instead: on a failed require('$pkg') check, print "SKIP: $label driver not
resolvable ($pkg)" and increment a new SKIP_COUNT (incrementing PASS_COUNT only
on success) rather than FAIL_COUNT; or gate the entire DRIVERS check behind an
environment flag (e.g., RUN_DRIVER_CHECK) so CI can opt in. Update any summary
logic that uses FAIL_COUNT to account for SKIP_COUNT so Phase 1 no longer fails
by construction.
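A runnable sketch of that suggestion, combining both options (SKIP_COUNT and the RUN_DRIVER_CHECK opt-in flag are assumed names, and the driver list is trimmed to two entries for illustration):

```shell
PASS_COUNT=0; FAIL_COUNT=0; SKIP_COUNT=0
DRIVERS=("pg:pg" "mongodb:mongodb")   # trimmed list for illustration

for entry in "${DRIVERS[@]}"; do
  pkg="${entry%%:*}"
  label="${entry##*:}"
  if node -e "require('$pkg')" 2>/dev/null; then
    echo "  PASS: $label driver resolvable ($pkg)"
    PASS_COUNT=$((PASS_COUNT + 1))
  elif [ "${RUN_DRIVER_CHECK:-0}" = "1" ]; then
    # CI opted in: a missing driver is a real failure.
    echo "  FAIL: $label driver not resolvable ($pkg)"
    FAIL_COUNT=$((FAIL_COUNT + 1))
  else
    # Default: drivers are intentionally absent from the sanity image,
    # so record a skip instead of failing Phase 1 by construction.
    echo "  SKIP: $label driver not resolvable ($pkg)"
    SKIP_COUNT=$((SKIP_COUNT + 1))
  fi
done
echo "pass=$PASS_COUNT fail=$FAIL_COUNT skip=$SKIP_COUNT"
```

With RUN_DRIVER_CHECK unset, the loop can never increment FAIL_COUNT, which is the property the review is asking for.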

- Driver resolvability checks now emit warnings instead of failures
  since drivers are intentionally absent from sanity Docker image (#295)
- `unshare --net` now tests with a dry-run before use, falling back to
  proxy-based network blocking when unprivileged (macOS, containers)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@anandgupta42 anandgupta42 merged commit 86a02c4 into main Mar 27, 2026
14 of 16 checks passed

Development

Successfully merging this pull request may close these issues.

test: consolidate duplicate hourly test PRs into single PR
