Complete pricing guide for Vertex AI. Compare all plans, analyze costs, and find the perfect tier for your needs.
Not sure if free is enough? See our Free vs Paid comparison →
Still deciding? Read our full verdict on whether Vertex AI is worth it →
mo
mo
mo
mo
Pricing sourced from Vertex AI · Last verified March 2026
Vertex AI uses a pay-as-you-go model with separate pricing per service. Foundation models like Gemini are billed per 1K input/output tokens (or per character for older PaLM models), while custom training is billed per machine-type node-hour. Predictions are billed per node-hour for online endpoints or per 1K records for batch jobs. AutoML, Feature Store, Pipelines, and Model Monitoring each have their own rate cards. Google offers a $300 free trial credit for new Google Cloud customers and a limited free tier for some Vertex AI components.
Yes. Vertex AI Model Garden offers first-party access to Anthropic's Claude family, Meta's Llama, Mistral, AI21, and many open-source models. These can be invoked through the same Vertex AI SDK and API as Gemini, with unified billing, IAM, logging, and VPC controls — making it a convenient single pane for multi-model deployments.
Google AI Studio (aistudio.google.com) is a free, consumer-friendly playground aimed at developers prototyping with Gemini via a simple API key. Vertex AI is the enterprise platform: it adds IAM-based authentication, VPC Service Controls, data residency, audit logging, provisioned throughput, fine-tuning, MLOps tooling, and SLAs. Production workloads should use Vertex AI; quick prototyping can start in AI Studio.
Yes. Vertex AI supports supervised fine-tuning for Gemini and select open models, distillation, RLHF, and adapter-based tuning methods. You can also bring custom containers and train from scratch on GPUs or TPUs using Vertex AI Training, then register and deploy via the Model Registry and Endpoints.
Yes. Vertex AI is covered by Google Cloud's compliance portfolio including HIPAA BAA, FedRAMP High, PCI DSS, ISO 27001/27017/27018, SOC 1/2/3, and regional data residency options. Customer data is not used to train Google's foundation models by default, and VPC Service Controls plus CMEK provide network isolation and customer-managed encryption keys.
AI builders and operators use Vertex AI to streamline their workflow.
Try Vertex AI Now →