Staging by Shashank0701-byte · Pull Request #160 · Shashank0701-byte/System-Craft

Shashank0701-byte · 2026-05-24T19:29:03Z

Summary by CodeRabbit

Chores
- Optimized autoscaling targets and behavior for more efficient, stable scaling.
- Updated monitoring setup and environment labeling to production.
- Enhanced health endpoint to emit metrics for better observability.
- Improved CI workflow to produce and push multiple Docker image tags and included deployment script examples.
- Added a load-testing script to validate performance and reliability.

…tion

chore(infra): prepare kubernetes manifests and ci pipeline for production

vercel · 2026-05-24T19:29:08Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
system-craft	Ready	Preview, Comment	May 24, 2026 7:45pm

coderabbitai · 2026-05-24T19:29:12Z

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 820961f4-8839-4b30-bc64-c4aa82053c43

📥 Commits

Reviewing files that changed from the base of the PR and between 8d744b4 and 6e0b30e.

⛔ Files ignored due to path filters (1)

package-lock.json is excluded by !**/package-lock.json

📒 Files selected for processing (3)

kubernetes/hpa.yaml
kubernetes/metrics-server-patch.yaml
tests/load/k6-spike.js

💤 Files with no reviewable changes (1)

kubernetes/metrics-server-patch.yaml

📝 Walkthrough

Walkthrough

Coordinated production-readiness updates: instrument the health endpoint, configure Kubernetes metrics-server, tune HPA (CPU 60%, memory 80% and behavior), update CI to tag/push both latest and commit-SHA and expand deploy scripting, set Prometheus target to production, and add a k6 spike load test.

Changes

Production readiness: monitoring, scaling, and deployment

Layer / File(s)	Summary
Health endpoint metrics instrumentation `app/api/health/route.ts`	Health route exported as `GET = withMetrics('/api/health', async (req: NextRequest) => ...)`, preserving `NextResponse.json({ status: 'ok' }, { status: 200 })`.
Kubernetes metrics collection `kubernetes/metrics-server-patch.yaml`	Adds `metrics-server` container args to pod template: cert dir, secure port, kubelet preferred address types, node-status port usage, and `--metric-resolution=15s`.
HPA resource-based scaling `kubernetes/hpa.yaml`	CPU target changed to `averageUtilization: 60`, memory target added `averageUtilization: 80`, and `behavior` configured (scaleUp stabilizationWindowSeconds: 60 with pods policy; scaleDown stabilizationWindowSeconds: 300).
CI/CD Docker image tagging and deploy `.github/workflows/ci.yml`	Build emits two tags (`latest` and commit SHA); Push step pushes both tags; Deploy step expanded to a multi-line script with the original echo and commented kubectl/CD example commands.
Production environment configuration `monitoring/prometheus/prometheus.yml`	Prometheus `systemcraft` job `static_configs` label `environment` changed from `development` to `production`.
Spike load testing `tests/load/k6-spike.js`	Adds k6 spike test: `options` with stages (50→200→500→0) and thresholds (p95 < 500ms, http_req_failed < 1%); default VU issues GET `/api/health` and asserts 200.

🎯 3 (Moderate) | ⏱️ ~25 minutes

Possibly related PRs

Shashank0701-byte/System-Craft#128: Related changes to GitHub Actions Docker build/push tagging and scanning logic.
Shashank0701-byte/System-Craft#125: Related HPA configuration changes and autoscaling setup.
Shashank0701-byte/System-Craft#119: Related CI deploy-step and commit-SHA echoing adjustments in the workflow.

"I hopped across the CI stream so bright,
Metrics hummed softly through the night,
HPA tuned its gentle song,
Prometheus watched all along,
I jittered load—now production feels right." 🐇✨

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 inconclusive)

Check name	Status	Explanation	Resolution
Title check	❓ Inconclusive	The title 'Staging' is vague and does not meaningfully describe the actual changes in this pull request, which include Docker workflow updates, metrics instrumentation, Kubernetes HPA configuration adjustments, and load testing additions.	Replace the title with a specific summary of the main changes, such as 'Add metrics instrumentation and update deployment infrastructure' or 'Configure HPA scaling and add load testing'.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Docstring Coverage	✅ Passed	No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch staging

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

netlify · 2026-05-24T19:29:45Z

✅ Deploy Preview for system-craft-staging ready!

Name	Link
🔨 Latest commit	`6e0b30e`
🔍 Latest deploy log	https://app.netlify.com/projects/system-craft-staging/deploys/6a1355284efefd0008143b74
😎 Deploy Preview	https://deploy-preview-160--system-craft-staging.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

coderabbitai

Actionable comments posted: 4

🧹 Nitpick comments (4)

app/api/health/route.ts (1)
4-6: 💤 Low value

Implementation correctly follows the withMetrics contract.

The req parameter is required by the AppRouteHandler type signature (as defined in src/lib/withMetrics.ts) even though it's not used in this handler's body. The static analysis warning about the unused parameter is a false positive in this case.

If you prefer to silence the linter warning, you can prefix the parameter with an underscore:
Optional: Silence the linter warning
-export const GET = withMetrics('/api/health', async (req: NextRequest) => {
+export const GET = withMetrics('/api/health', async (_req: NextRequest) => {
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@app/api/health/route.ts` around lines 4 - 6, The GET handler's unused req
parameter triggers a lint warning even though withMetrics requires it; update
the function signature for export const GET = withMetrics('/api/health', async
(req: NextRequest) => { ... }) to rename the parameter to _req (or _request) to
silence the linter while keeping the AppRouteHandler contract intact—locate the
GET export and the withMetrics wrapper to make this change.
tests/load/k6-spike.js (1)
22-26: 💤 Low value

The anonymous default export is correct k6 convention.

The static analysis warning about "Unexpected default export of anonymous function" is a false positive. k6 requires the virtual user (VU) code to be exported as a default function, and it's standard practice to use an anonymous function for this purpose.

However, consider adding a check for the response body to validate the endpoint returns the expected { status: 'ok' } payload:
Optional: Add response body validation
 export default function () {
     const res = http.get(`${BASE_URL}/api/health`);
-    check(res, { 'status 200': (r) => r.status === 200 });
+    check(res, { 
+        'status 200': (r) => r.status === 200,
+        'body is ok': (r) => r.json('status') === 'ok'
+    });
     // No Sleep to maximize load
 }
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/load/k6-spike.js` around lines 22 - 26, The anonymous default export is
fine for k6; update the health check in the exported function to also validate
the JSON body from the GET to `${BASE_URL}/api/health`: after calling http.get
and the existing check on r.status, parse the response body (use res.json() or
JSON.parse(res.body)) and add a check that the parsed object's status equals
'ok' (augment the check call or add a new check), referencing the exported
default function, http.get, check, BASE_URL, and the '/api/health' endpoint.
.github/workflows/ci.yml (2)
124-126: ⚡ Quick win

Optional defense-in-depth: move image ref construction out of inline ${{ ... }} in run:

The usual script-injection risk from ${{ ... }} in run: applies most when values are untrusted; here env.REPO is derived from GITHUB_REPOSITORY and github.sha is a hex SHA, and the push step only runs for non-pull_request events.

Defense-in-depth (and to avoid static-analyzer noise): set the image repo/tag in step env: and reference them in run:.
🔒 Proposed defense-in-depth hardening
       - name: Push Docker image
         if: github.event_name != 'pull_request'
+        env:
+          IMAGE_REPO: ghcr.io/${{ env.REPO }}/systemcraft-web
+          IMAGE_TAG: ${{ github.sha }}
         run: |
-          docker push ghcr.io/${{ env.REPO }}/systemcraft-web:latest
-          docker push ghcr.io/${{ env.REPO }}/systemcraft-web:${{ github.sha }}
+          docker push ${IMAGE_REPO}:latest
+          docker push ${IMAGE_REPO}:${IMAGE_TAG}
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In @.github/workflows/ci.yml around lines 124 - 126, Move the dynamic image
references out of the inline run script and into the step environment to avoid
`${{ ... }}` expansion inside run; specifically, in the CI step that currently
calls docker push with ghcr.io/${{ env.REPO }}/systemcraft-web:latest and
ghcr.io/${{ env.REPO }}/systemcraft-web:${{ github.sha }}, add step-level env
variables (e.g., IMAGE_LATEST and IMAGE_SHA using expressions for env.REPO and
github.sha) and then reference those env vars inside the run block (docker push
$IMAGE_LATEST and docker push $IMAGE_SHA) so the run script uses plain env-vars
instead of `${{ ... }}` interpolation.
160-164: Deploy step is a placeholder with good preparation.

The expanded deploy step with commented kubectl examples provides a clear path forward for actual deployment implementation. The approach of using commit SHA tags (line 163) aligns with production best practices for reproducibility.

The commented examples are syntactically correct—GitHub Actions will expand ${{ github.sha }} before shell execution. When you're ready to implement actual deployment, consider:

Adding kubectl authentication setup

Implementing deployment verification checks

Adding rollback capability

Would you like me to generate a complete deployment implementation or open an issue to track this task?
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In @.github/workflows/ci.yml around lines 160 - 164, The deploy step currently
only echoes a message and leaves kubectl/sed commands commented out; replace
this placeholder with a real deploy sequence that (1) authenticates to the
Kubernetes cluster (e.g., setup kubeconfig or use cloud provider login action),
(2) tags the image reference systemcraft-web:${{ github.sha }} in your manifests
(the current sed example targeting kubernetes/deployment.yaml is fine as the
replacement approach), (3) applies the manifests via kubectl apply, and (4) runs
post-deploy verification checks (kubectl rollout status, health probes) and a
simple rollback path (kubectl rollout undo) on failure; ensure
secrets/credentials are provided via GitHub Actions secrets and any
cloud-provider-specific auth actions are added before invoking kubectl.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@kubernetes/hpa.yaml`:
- Line 20: The current HPA config sets averageUtilization: 40 which is overly
aggressive given the CPU request of 250m; update the averageUtilization value in
the HorizontalPodAutoscaler spec (the averageUtilization field) to a more
conservative value (suggest 60 or 70) or confirm 40 with load-test evidence,
then adjust the threshold accordingly so scaling triggers align with expected
pod CPU usage and reduce unnecessary pod churn.
- Around line 27-33: The HPA's scaleUp is too aggressive: update
behavior.scaleUp.stabilizationWindowSeconds from 0 to a sane value (e.g., 30 or
60) and reduce the scale-up policy to add fewer pods less frequently by changing
the policy under behavior.scaleUp.policies (the entry with type: Pods) from
value: 4 and periodSeconds: 15 to something like value: 2 and periodSeconds: 30;
adjust these numbers as needed for your app's startup time and traffic patterns
to avoid thrashing.

In `@kubernetes/metrics-server-patch.yaml`:
- Line 12: Remove the --kubelet-insecure-tls flag from the metrics-server
manifest (the argument currently shown as "--kubelet-insecure-tls") and instead
configure metrics-server to verify kubelet TLS: either ensure kubelets present
valid certificates trusted by the cluster CA or add the
--kubelet-certificate-authority argument pointing to a mounted CA bundle (create
a Secret/ConfigMap and mount it into the pod if needed) so production uses
proper certificate verification rather than disabling TLS.

In `@tests/load/k6-spike.js`:
- Line 4: The import includes an unused symbol `sleep`; remove `sleep` from the
import statement so it only imports `check` (i.e., change `import { sleep, check
} from 'k6';` to `import { check } from 'k6';`) to reflect the intentional
no-sleep behavior mentioned in the comment.

---

Nitpick comments:
In @.github/workflows/ci.yml:
- Around line 124-126: Move the dynamic image references out of the inline run
script and into the step environment to avoid `${{ ... }}` expansion inside run;
specifically, in the CI step that currently calls docker push with ghcr.io/${{
env.REPO }}/systemcraft-web:latest and ghcr.io/${{ env.REPO
}}/systemcraft-web:${{ github.sha }}, add step-level env variables (e.g.,
IMAGE_LATEST and IMAGE_SHA using expressions for env.REPO and github.sha) and
then reference those env vars inside the run block (docker push $IMAGE_LATEST
and docker push $IMAGE_SHA) so the run script uses plain env-vars instead of
`${{ ... }}` interpolation.
- Around line 160-164: The deploy step currently only echoes a message and
leaves kubectl/sed commands commented out; replace this placeholder with a real
deploy sequence that (1) authenticates to the Kubernetes cluster (e.g., setup
kubeconfig or use cloud provider login action), (2) tags the image reference
systemcraft-web:${{ github.sha }} in your manifests (the current sed example
targeting kubernetes/deployment.yaml is fine as the replacement approach), (3)
applies the manifests via kubectl apply, and (4) runs post-deploy verification
checks (kubectl rollout status, health probes) and a simple rollback path
(kubectl rollout undo) on failure; ensure secrets/credentials are provided via
GitHub Actions secrets and any cloud-provider-specific auth actions are added
before invoking kubectl.

In `@app/api/health/route.ts`:
- Around line 4-6: The GET handler's unused req parameter triggers a lint
warning even though withMetrics requires it; update the function signature for
export const GET = withMetrics('/api/health', async (req: NextRequest) => { ...
}) to rename the parameter to _req (or _request) to silence the linter while
keeping the AppRouteHandler contract intact—locate the GET export and the
withMetrics wrapper to make this change.

In `@tests/load/k6-spike.js`:
- Around line 22-26: The anonymous default export is fine for k6; update the
health check in the exported function to also validate the JSON body from the
GET to `${BASE_URL}/api/health`: after calling http.get and the existing check
on r.status, parse the response body (use res.json() or JSON.parse(res.body))
and add a check that the parsed object's status equals 'ok' (augment the check
call or add a new check), referencing the exported default function, http.get,
check, BASE_URL, and the '/api/health' endpoint.

🪄 Autofix (Beta)

✅ Autofix completed

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: fb3aa5b1-1268-4976-8500-69d89f67f1be

📥 Commits

Reviewing files that changed from the base of the PR and between 1b502ac and 8d744b4.

📒 Files selected for processing (6)

.github/workflows/ci.yml
app/api/health/route.ts
kubernetes/hpa.yaml
kubernetes/metrics-server-patch.yaml
monitoring/prometheus/prometheus.yml
tests/load/k6-spike.js

coderabbitai · 2026-05-24T19:40:59Z

Note

Autofix is a beta feature. Expect some limitations and changes as we gather feedback and continue to improve it.

Fixes Applied Successfully

Fixed 4 file(s) based on 4 unresolved review comments.

Files modified:

kubernetes/hpa.yaml
kubernetes/metrics-server-patch.yaml
package-lock.json
tests/load/k6-spike.js

Commit: 6e0b30e300546b3da7bbd1de46f1e158a9b068a2

The changes have been pushed to the staging branch.

Time taken: 3m 13s

Fixed 4 file(s) based on 4 unresolved review comments. Co-authored-by: CodeRabbit <noreply@coderabbit.ai>

Shashank0701-byte added 2 commits May 25, 2026 00:55

chore(infra): prepare kubernetes manifests and ci pipeline for produc…

3349b31

…tion

Merge pull request #159 from Shashank0701-byte/feat/prometheus

8d744b4

chore(infra): prepare kubernetes manifests and ci pipeline for production

vercel Bot deployed to Preview May 24, 2026 19:29 View deployment

coderabbitai Bot reviewed May 24, 2026

View reviewed changes

Comment thread kubernetes/hpa.yaml Outdated

Comment thread kubernetes/hpa.yaml Outdated

Comment thread kubernetes/metrics-server-patch.yaml Outdated

Comment thread tests/load/k6-spike.js Outdated

fix: apply CodeRabbit auto-fixes

6e0b30e

Fixed 4 file(s) based on 4 unresolved review comments. Co-authored-by: CodeRabbit <noreply@coderabbit.ai>

vercel Bot deployed to Preview May 24, 2026 19:45 View deployment

Shashank0701-byte merged commit bb630f2 into main May 24, 2026
13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Staging#160

Staging#160
Shashank0701-byte merged 3 commits into
mainfrom
staging

Shashank0701-byte commented May 24, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

vercel Bot commented May 24, 2026 •

edited

Loading

Uh oh!

coderabbitai Bot commented May 24, 2026 •

edited

Loading

Walkthrough

Changes

Possibly related PRs

❌ Failed checks (1 inconclusive)

Uh oh!

netlify Bot commented May 24, 2026 •

edited

Loading

Uh oh!

coderabbitai Bot left a comment •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot commented May 24, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Shashank0701-byte commented May 24, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

vercel Bot commented May 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coderabbitai Bot commented May 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Possibly related PRs

❌ Failed checks (1 inconclusive)

Uh oh!

netlify Bot commented May 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for system-craft-staging ready!

Uh oh!

coderabbitai Bot left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot commented May 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Fixes Applied Successfully

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Shashank0701-byte commented May 24, 2026 •

edited by coderabbitai Bot

Loading

vercel Bot commented May 24, 2026 •

edited

Loading

coderabbitai Bot commented May 24, 2026 •

edited

Loading

netlify Bot commented May 24, 2026 •

edited

Loading

coderabbitai Bot left a comment •

edited

Loading

coderabbitai Bot commented May 24, 2026 •

edited

Loading