Complete pricing guide for Datadog LLM Observability. Compare all plans, analyze costs, and find the perfect tier for your needs.
Not sure if free is enough? See our Free vs Paid comparison →
Still deciding? Read our full verdict on whether Datadog LLM Observability is worth it →
mo
mo
Pricing sourced from Datadog LLM Observability · Last verified March 2026
LangSmith and Langfuse are purpose-built LLM platforms focused on prompt engineering, dataset management, and developer-centric evaluation workflows. Datadog LLM Observability is built for production operations: it stitches LLM spans into the same distributed traces as your infrastructure, APM, and logs, and reuses Datadog's monitor, alerting, RBAC, and security detection systems. It is stronger for SRE and platform teams running AI in production, weaker for prompt iteration during development.
Datadog supports OpenAI, Anthropic, Amazon Bedrock, Azure OpenAI, Google Vertex AI, and other major providers, plus orchestration frameworks including LangChain, LlamaIndex, and OpenAI Assistants. Custom instrumentation is available through Datadog's SDKs for Python, Node.js, and other supported runtimes.
No. Datadog is a SaaS product and does not offer a self-hosted or on-prem version of LLM Observability. Teams with strict data residency requirements can choose between US, EU, and other regional Datadog sites, and sensitive data scrubbing can be applied client-side before telemetry is shipped.
Datadog offers built-in LLM-as-judge evaluations for quality, faithfulness, topic relevance, and toxicity, plus custom rule-based and code-based evaluators. Evaluations can run on sampled production traffic or on curated datasets, and results are stored alongside the trace so regressions are visible in the same UI as latency or cost spikes.
Yes. LLM Observability integrates with Datadog's Sensitive Data Scanner and detection rules engine to flag prompt injection attempts, jailbreaks, and PII or secrets that appear in prompts or responses. Findings can route to Datadog Cloud SIEM workflows for security teams to triage.
AI builders and operators use Datadog LLM Observability to streamline their workflow.
Try Datadog LLM Observability Now →Langfuse is an open-source LLM observability and engineering platform providing tracing, prompt management, evaluations, and dataset management for production AI applications.
Compare Pricing →Open-source LLM observability and AI gateway — logs every prompt, response, cost, and latency across 20+ providers with a one-line proxy or async SDK, plus caching, retries, and prompt experiments.
Compare Pricing →Phoenix is Arize's open-source LLM observability project, and it has quietly become the default way tens of thousands of teams see what their agents are actually doing in production. The pitch is simple: `pip install arize-phoenix`, instrument with OpenInference (or any OpenTelemetry-compatible library), and every LLM call, tool invocation, retrieval, and embedding shows up as a spanned timeline you can filter, search, and replay. No vendor account required, no proprietary SDK lock-in. The Open
Compare Pricing →LangSmith is LangChain's commercial observability, evaluation and prompt management platform for LLM apps and agents in production.
Compare Pricing →