Skip to content

Conversation

@andyzhangx
Copy link
Contributor

No description provided.

Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR adds a new blog post about autoscaling KAITO inference workloads on AKS using KEDA. The post introduces the alpha autoscaling feature released in KAITO v0.8.0 and provides a comprehensive guide for enabling intelligent autoscaling based on service monitoring metrics.

Key Changes

  • New blog post documenting KAITO inference workload autoscaling with KEDA
  • Includes architecture overview, prerequisites, installation steps, and quickstart guide
  • Demonstrates using the new InferenceSet CRD with KEDA's external scaler pattern

@andyzhangx andyzhangx force-pushed the autoscale-inference-workloads-with-kaito branch from 6d99ae0 to 7ba066d Compare December 15, 2025 15:18
Copy link
Contributor

@sdesai345 sdesai345 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added some comments!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants