OpenAI's primary API for building AI agents — combines text generation, built-in web search, file search, code interpreter, and computer use in a single endpoint with server-side tool orchestration.
OpenAI's primary API for building AI agents — includes built-in web search, code execution, file analysis, and schema-based structured outputs in a single endpoint.
OpenAI Responses API is an AI Models developer API for building text, vision, structured-output, and tool-using AI agents through one endpoint, using OpenAI-hosted model reasoning, usage-based token billing, and native tool orchestration for developers, product teams, and enterprises that want fewer custom agent infrastructure components.
The API creates model responses at https://api.openai.com/v1/responses and supports text inputs, image inputs, file inputs, text outputs, JSON outputs, streaming, function calling, and conversation state. Its main advantage over older single-turn generation endpoints is that tools are first-class: OpenAI documents built-in tools including web search, file search, computer use, code interpreter, MCP tools, and custom function calls. The request schema also exposes operational controls that matter in production, including maxtoolcalls to cap built-in tool calls, paralleltoolcalls defaulting to true, store defaulting to true, stream defaulting to false, and metadata with up to 16 key-value pairs where keys are limited to 64 characters and values to 512 characters.
Pricing is usage-based rather than a monthly SaaS subscription. As of the current OpenAI pricing page, Responses API itself is not priced separately; tokens are billed at the selected model's rates, with examples including GPT-5 nano at $0.05 per 1M input tokens, $0.005 per 1M cached input tokens, and $0.40 per 1M output tokens; GPT-5 mini at $0.25 input, $0.025 cached input, and $2.00 output; GPT-5.4 at $2.50 input, $0.25 cached input, and $15.00 output; and GPT-5.5 at $5.00 input, $0.50 cached input, and $30.00 output. Built-in tools add separate cost drivers such as web search at $10.00 per 1K calls, file search tool calls at $2.50 per 1K calls, file search storage at $0.10 per GB-day with the first GB free, and containers at $0.03 for 1 GB or $1.92 for 64 GB per 20-minute session per container. Teams should verify the latest OpenAI pricing page before committing production budgets, especially for high-volume token workloads, web search, file search, regional processing, and container-heavy data analysis workflows.
Was this helpful?
The API handles multi-step tool use internally — the model can chain web searches, file reads, code execution, and custom function calls within a single request without client-side orchestration loops.
Use Case:
A single API call where the model searches the web for competitor pricing, processes results with code interpreter to build a comparison table, and returns structured JSON — no client-side loop management needed.
Native web search powered by OpenAI's search infrastructure provides access to recent information where the selected model and tool configuration support it. Pricing and availability should be verified against OpenAI's current pricing documentation.
Use Case:
Building a research agent that answers questions about current events, recent product launches, or live pricing data without integrating a separate search API.
JSON Schema enforcement helps responses conform to a developer-specified format, reducing invalid values, missing fields, and schema violations compared with free-form text generation.
Use Case:
An extraction pipeline that processes invoices and returns {vendor, amount, date, line_items[]} in the requested format, with fewer malformed JSON outputs or hallucinated field names.
Sandboxed Python execution environment for data analysis, calculations, file processing, and chart generation. Container pricing and session rules should be verified against OpenAI's current pricing documentation before production deployment.
Use Case:
An analyst uploads a CSV of 100K sales records and asks for quarterly trends. The model writes Python code, executes it in a container, generates charts, and returns both the analysis and downloadable visualizations.
Screen-based interaction with desktop applications via screenshots and mouse/keyboard control — enabling automation of workflows in applications that lack APIs. Pricing and availability may vary by model and deployment context.
Use Case:
Automating data entry in a legacy ERP system by having the agent navigate the application UI, fill in forms, and verify submissions — useful for enterprise apps with no API access.
Prompt caching can reduce costs on repeated input prefixes where supported, and Batch API can process asynchronous workloads at reduced cost where available. Teams should verify current model support, discount levels, and timing guarantees in OpenAI documentation.
Use Case:
A content moderation system processes 1 million user posts daily using Batch API, with a shared system prompt that may benefit from prompt caching across all requests.
GPT-5 nano: $0.05 / 1M input tokens, $0.005 / 1M cached input tokens, $0.40 / 1M output tokens
GPT-5 mini: $0.25 / 1M input tokens, $0.025 / 1M cached input tokens, $2.00 / 1M output tokens
GPT-5.4: $2.50 / 1M input tokens, $0.25 / 1M cached input tokens, $15.00 / 1M output tokens
GPT-5.5: $5.00 / 1M input tokens, $0.50 / 1M cached input tokens, $30.00 / 1M output tokens
Web search: $10.00 / 1K calls; File Search Tool Call: $2.50 / 1K tool calls; File Search Storage: $0.10 / GB-day, first GB free; Containers: 1 GB for $0.03 or 64 GB for $1.92 per 20-minute session per container
Ready to get started with OpenAI Responses API?
View Pricing Options →We believe in transparent reviews. Here's what OpenAI Responses API doesn't handle well:
Weekly insights on the latest AI tools, features, and trends delivered to your inbox.
AI assistant
Google Gemini is a ai assistant tool for teams evaluating real workflows, pricing limits, strengths, drawbacks, and alternatives before committing.
AI Agent Builders
OpenAI Agents SDK is an open-source Python framework for building agentic apps with handoffs, guardrails, sessions, tracing, MCP tools, sandbox agents, and realtime voice agents.
No reviews yet. Be the first to share your experience!
Get started with OpenAI Responses API and see if it's the right fit for your needs.
Get Started →Take our 60-second quiz to get personalized tool recommendations
Find Your Perfect AI Stack →Explore 20 ready-to-deploy AI agent templates for sales, support, dev, research, and operations.
Browse Agent Templates →