Skip to content

chore(pricing): Update google pricing#655

Open
siddharthsambharia-portkey wants to merge 3 commits intomainfrom
pricing-update/google-24276535421
Open

chore(pricing): Update google pricing#655
siddharthsambharia-portkey wants to merge 3 commits intomainfrom
pricing-update/google-24276535421

Conversation

@siddharthsambharia-portkey
Copy link
Copy Markdown
Collaborator

@siddharthsambharia-portkey siddharthsambharia-portkey commented Apr 11, 2026

🔄 Pricing Update: google

📊 Summary (complete_diff mode)

Change Type Count
➕ Models added 2
🔄 Models updated (merged) 46

➕ New Models

  • veo-3.1-lite-generate-preview-lte-128k
  • veo-3.1-lite-generate-preview-gt-128k

🔄 Updated Models

  • gemini-3.1-pro-preview-lte-128k
  • gemini-3.1-pro-preview-gt-128k
  • gemini-3.1-pro-preview-customtools-lte-128k
  • gemini-3.1-pro-preview-customtools-gt-128k
  • gemini-3.1-flash-image-preview-lte-128k
  • gemini-3.1-flash-image-preview-gt-128k
  • gemini-3.1-flash-lite-preview-lte-128k
  • gemini-3.1-flash-lite-preview-gt-128k
  • gemini-3-pro-preview-lte-128k
  • gemini-3-pro-preview-gt-128k
  • gemini-3-pro-image-preview-lte-128k
  • gemini-3-pro-image-preview-gt-128k
  • gemini-3-flash-preview-lte-128k
  • gemini-3-flash-preview-gt-128k
  • gemini-pro-latest-lte-128k
  • gemini-pro-latest-gt-128k
  • gemini-flash-latest-lte-128k
  • gemini-flash-latest-gt-128k
  • gemini-flash-lite-latest-lte-128k
  • gemini-flash-lite-latest-gt-128k
  • gemini-2.5-pro-lte-128k
  • gemini-2.5-pro-gt-128k
  • gemini-2.5-flash-lte-128k
  • gemini-2.5-flash-gt-128k
  • gemini-2.5-flash-image-lte-128k
  • gemini-2.5-flash-image-gt-128k
  • gemini-2.5-flash-lite-lte-128k
  • gemini-2.5-flash-lite-gt-128k
  • gemini-2.0-flash-lte-128k
  • gemini-2.0-flash-gt-128k
  • ... and 16 more

📋 Model → pricing page mapping

Model ID Pricing page section Notes
gemini-3.1-pro-preview-lte-128k Gemini 3.1 Pro Preview, ≤200K input $2, output $12, cache $0.2, batch $1/$6, web_search+search
gemini-3.1-pro-preview-gt-128k Gemini 3.1 Pro Preview, >200K input $4, output $18, cache $0.4, batch $2/$9
gemini-3.1-pro-preview-customtools-lte-128k Gemini 3.1 Pro Preview (customtools), ≤200K same pricing as gemini-3.1-pro-preview
gemini-3.1-pro-preview-customtools-gt-128k Gemini 3.1 Pro Preview (customtools), >200K same pricing as gemini-3.1-pro-preview gt
gemini-3.1-flash-image-preview-lte-128k Gemini 3.1 Flash Image, flat pricing input $0.50, text output $3, image_token $60, batch $0.25/$1.50
gemini-3.1-flash-image-preview-gt-128k Gemini 3.1 Flash Image, flat pricing identical to lte (no context tiers on page)
gemini-3.1-flash-lite-preview-lte-128k Gemini 3.1 Flash-Lite, flat pricing input $0.25, output $1.50, cache $0.03, batch $0.13/$0.75
gemini-3.1-flash-lite-preview-gt-128k Gemini 3.1 Flash-Lite, flat pricing identical to lte (no context tiers on page)
gemini-3-pro-preview-lte-128k Gemini 3 Pro, ≤200K input $2, output $12, cache $0.2, batch $1/$6
gemini-3-pro-preview-gt-128k Gemini 3 Pro, >200K input $4, output $18, cache $0.4, batch $2/$9
gemini-3-pro-image-preview-lte-128k Gemini 3 Pro Image, flat pricing input $2, text output $12, image_token $120, batch $1/$6
gemini-3-pro-image-preview-gt-128k Gemini 3 Pro Image, flat pricing identical to lte (no context tiers on page)
gemini-3-flash-preview-lte-128k Gemini 3 Flash, flat pricing input $0.5, output $3, cache $0.05, batch $0.25/$1.50
gemini-3-flash-preview-gt-128k Gemini 3 Flash, flat pricing identical to lte (no context tiers on page)
gemini-pro-latest-lte-128k *-latest → resolved to gemini-3.1-pro-preview same pricing as gemini-3.1-pro-preview lte
gemini-pro-latest-gt-128k *-latest → resolved to gemini-3.1-pro-preview same pricing as gemini-3.1-pro-preview gt
gemini-flash-latest-lte-128k *-latest → resolved to gemini-3-flash-preview same pricing as gemini-3-flash-preview
gemini-flash-latest-gt-128k *-latest → resolved to gemini-3-flash-preview identical to lte
gemini-flash-lite-latest-lte-128k *-latest → resolved to gemini-3.1-flash-lite-preview same pricing as gemini-3.1-flash-lite-preview
gemini-flash-lite-latest-gt-128k *-latest → resolved to gemini-3.1-flash-lite-preview identical to lte
gemini-2.5-pro-lte-128k Gemini 2.5 Pro, ≤200K input $1.25, output $10, cache $0.13, batch $0.625/$5
gemini-2.5-pro-gt-128k Gemini 2.5 Pro, >200K input $2.50, output $15, cache $0.25, batch $1.25/$7.5
gemini-2.5-flash-lte-128k Gemini 2.5 Flash, flat pricing input $0.30, output $2.50, cache $0.03, batch $0.15/$1.25
gemini-2.5-flash-gt-128k Gemini 2.5 Flash, flat pricing identical to lte (no context tiers on page)
gemini-2.5-flash-image-lte-128k Gemini 2.5 Flash Image, flat pricing input $0.30, text output $2.50, image_token $30, batch $0.15/$1.25; batch image rate $15/1M
gemini-2.5-flash-image-gt-128k Gemini 2.5 Flash Image, flat pricing identical to lte (no context tiers on page)
gemini-2.5-flash-lite-lte-128k Gemini 2.5 Flash-Lite, flat pricing input $0.10, output $0.40, cache $0.01, batch $0.05/$0.20
gemini-2.5-flash-lite-gt-128k Gemini 2.5 Flash-Lite, flat pricing identical to lte (no context tiers on page)
gemini-2.0-flash-lte-128k Gemini 2.0 Flash, flat pricing input $0.15, output $0.60, batch $0.075/$0.30
gemini-2.0-flash-gt-128k Gemini 2.0 Flash, flat pricing identical to lte
gemini-2.0-flash-001-lte-128k Gemini 2.0 Flash 001, flat pricing same pricing as gemini-2.0-flash
gemini-2.0-flash-001-gt-128k Gemini 2.0 Flash 001, flat pricing identical to lte
gemini-2.0-flash-lite-lte-128k Gemini 2.0 Flash-Lite, flat pricing input $0.075, output $0.30, batch $0.0375/$0.15
gemini-2.0-flash-lite-gt-128k Gemini 2.0 Flash-Lite, flat pricing identical to lte
gemini-2.0-flash-lite-001-lte-128k Gemini 2.0 Flash-Lite 001, flat pricing same pricing as gemini-2.0-flash-lite
gemini-2.0-flash-lite-001-gt-128k Gemini 2.0 Flash-Lite 001, flat pricing identical to lte
gemini-embedding-001-lte-128k Gemini Embedding 001 input $0.15/1M tokens ($0.00015/1K), output $0
gemini-embedding-001-gt-128k Gemini Embedding 001 identical to lte
gemini-embedding-2-preview-lte-128k Gemini Embedding 2 Preview input $0.20/1M tokens, output $0
gemini-embedding-2-preview-gt-128k Gemini Embedding 2 Preview identical to lte
imagen-4.0-generate-001-lte-128k Imagen 4.0 Generate $0.04/image
imagen-4.0-generate-001-gt-128k Imagen 4.0 Generate $0.04/image
imagen-4.0-ultra-generate-001-lte-128k Imagen 4.0 Ultra Generate $0.06/image
imagen-4.0-ultra-generate-001-gt-128k Imagen 4.0 Ultra Generate $0.06/image
imagen-4.0-fast-generate-001-lte-128k Imagen 4.0 Fast Generate $0.02/image
imagen-4.0-fast-generate-001-gt-128k Imagen 4.0 Fast Generate $0.02/image
veo-2.0-generate-001-lte-128k Veo 2.0 Generate $0.50/s → 50¢/s, default 8s, 1 sample
veo-2.0-generate-001-gt-128k Veo 2.0 Generate identical to lte
veo-3.0-generate-001-lte-128k Veo 3.0 Generate $0.20/s video-only → 20¢/s, default 8s, 1 sample
veo-3.0-generate-001-gt-128k Veo 3.0 Generate identical to lte
veo-3.0-fast-generate-001-lte-128k Veo 3.0 Fast Generate $0.10/s → 10¢/s, default 8s, 1 sample
veo-3.0-fast-generate-001-gt-128k Veo 3.0 Fast Generate identical to lte
veo-3.1-generate-preview-lte-128k Veo 3.1 Generate Preview $0.20/s video-only → 20¢/s, default 8s, 1 sample
veo-3.1-generate-preview-gt-128k Veo 3.1 Generate Preview identical to lte
veo-3.1-fast-generate-preview-lte-128k Veo 3.1 Fast Generate Preview $0.10/s → 10¢/s, default 8s, 1 sample
veo-3.1-fast-generate-preview-gt-128k Veo 3.1 Fast Generate Preview identical to lte
veo-3.1-lite-generate-preview-lte-128k Veo 3.1 Lite Generate Preview $0.05/s → 5¢/s, default 8s, 1 sample
veo-3.1-lite-generate-preview-gt-128k Veo 3.1 Lite Generate Preview identical to lte

📌 Source Notes

  • Token pricing sourced from Google Cloud Vertex AI Generative AI Pricing (https://cloud.google.com/vertex-ai/generative-ai/pricing) — the Gemini API pricing page (https://ai.google.dev/gemini-api/docs/pricing) was inaccessible (Firecrawl insufficient credits).
  • Web search pricing ($3.5/1K calls → 0.35¢/call) uses Gemini API standard rate per skill examples.
  • Vertex AI uses ≤200K / >200K context tiers; mapped to -lte-128k / -gt-128k naming convention.
  • *-latest aliases resolved: gemini-pro-latest → gemini-3.1-pro-preview, gemini-flash-latest → gemini-3-flash-preview, gemini-flash-lite-latest → gemini-3.1-flash-lite-preview (verified from Vertex AI pricing page showing current latest models).
  • Veo video_seconds uses video-only rate as the base; audio+video rate is higher but video-only is the standard baseline.
  • Thinking tokens: not listed as separate line items on the Vertex AI pricing page (included in "Text output (response and reasoning)") — not added to avoid double-counting.

Generated by Pricing Agent on 2026-04-11

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant