AI deployment
| Name | Type | Description | Notes |
|---|---|---|---|
| gpuCount | Long | Number of GPUs | [optional] |
| updatedAt | OffsetDateTime | Update time | [optional] [readonly] |
| deploymentUrl | String | Deployment URL (nullable) | [optional] |
| serviceLevel | String | Service level | [optional] |
| inferenceEngineVersion | InferenceEngineVersion | | [optional] |
| name | String | Deployment name | [optional] |
| state | StateEnum | Deployment state | [optional] |
| gpuType | String | GPU type family | [optional] |
| id | UUID | Deployment ID | [optional] [readonly] |
| replicas | Long | Number of replicas (>=0) | [optional] |
| stateDetails | String | Deployment state details | [optional] |
| createdAt | OffsetDateTime | Creation time | [optional] [readonly] |
| inferenceEngineParameters | List<String> | Optional extra inference engine server CLI args | [optional] |
| model | ModelRef | | [optional] |

Enum: StateEnum

| Name | Value |
|---|---|
| READY | "ready" |
| CREATING | "creating" |
| ERROR | "error" |
| DEPLOYING | "deploying" |
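
The name-to-value mapping above can be sketched as a Java enum. This is a minimal illustration, not the generated client code: the wrapper class `DeploymentStateExample`, the `getValue` accessor, and the `fromValue` lookup are assumptions introduced here, and the real `StateEnum` may differ in shape and in how it treats unknown values.

```java
public class DeploymentStateExample {

    // Hypothetical stand-in for StateEnum, built only from the table above.
    enum StateEnum {
        READY("ready"),
        CREATING("creating"),
        ERROR("error"),
        DEPLOYING("deploying");

        private final String value;

        StateEnum(String value) {
            this.value = value;
        }

        // Wire value as listed in the "Value" column.
        String getValue() {
            return value;
        }

        // Assumed helper: resolve a wire value to its constant,
        // rejecting anything not listed in the table.
        static StateEnum fromValue(String value) {
            for (StateEnum state : values()) {
                if (state.value.equals(value)) {
                    return state;
                }
            }
            throw new IllegalArgumentException("Unexpected value '" + value + "'");
        }
    }

    public static void main(String[] args) {
        StateEnum state = StateEnum.fromValue("deploying");
        System.out.println(state + " -> " + state.getValue()); // prints "DEPLOYING -> deploying"
    }
}
```

Throwing on an unknown value is one possible design choice; a client that has to tolerate states added by the server later might return a fallback constant instead.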