Complete pricing guide for OpenAI Responses API. Compare all plans, analyze costs, and find the perfect tier for your needs.
Not sure if free is enough? See our Free vs Paid comparison →
Still deciding? Read our full verdict on whether OpenAI Responses API is worth it →
per use
Usage-based billing, no monthly minimum
per use
No long context pricing tier
per use
Reduced capability for complex reasoning
per use
Significantly higher cost per token
per use
Tool call charges are in addition to model token costs. Web search content tokens billed at model rates for GPT-4o/4.1; free for reasoning models.
Pricing sourced from OpenAI Responses API · Last verified March 2026
The Responses API adds built-in tools (web search, file search, code interpreter, computer use), server-side tool orchestration (the model chains multiple tool calls in one request), guaranteed structured outputs, and a richer conversation model. It's designed for agent workflows. Chat Completions still works but new features focus on Responses.
No. There is no API surcharge — you pay the same per-token rates regardless of which API you use (Responses, Chat Completions, Realtime, Batch, or Assistants). The only additional costs are for built-in tool usage: web search calls, file search calls, and container sessions.
Yes. Custom function definitions work alongside web search, file search, and code interpreter in the same request. The model can decide to use any combination of built-in and custom tools within a single orchestration loop.
MCP (Model Context Protocol) is a standard for connecting AI models to external tools and data sources. The Responses API supports MCP, meaning agents can invoke any MCP-compatible tool server — accessing databases, APIs, or custom services through a standardized interface.
All current OpenAI models including GPT-5.4, GPT-5.4-mini, GPT-5.4-nano, GPT-5.4-pro, reasoning models (o3, o4-mini), and legacy GPT-4o/4.1 series. Each model has different pricing and capability tradeoffs.
AI builders and operators use OpenAI Responses API to streamline their workflow.
Try OpenAI Responses API Now →Google's flagship AI assistant combining real-time web search, multimodal understanding, and native Google Workspace integration for productivity-focused users.
Compare Pricing →OpenAI's official open-source framework for building agentic AI applications with minimal abstractions. Production-ready successor to Swarm, providing agents, handoffs, guardrails, and tracing primitives that work with Python and TypeScript.
Compare Pricing →