Open-source LLM observability platform that helps debug AI applications through detailed tracing, evaluation, and prompt experimentation with notebook-first design.
An open-source tool that helps you see inside your AI's thinking — debug and improve AI performance with visual tracing.
Free
Contact sales
AI Observability
LangSmith is LangChain's commercial observability, evaluation and prompt management platform for LLM apps and agents in production.
MLOps
Weights & Biases is an end-to-end MLOps and AI developer platform that combines Models (experiment tracking, sweeps, model registry) with Weave (LLM/agent observability and evals). It is used by frontier labs and enterprise ML teams.
Testing & Quality
Open-source LLM evaluation framework with 50+ research-backed metrics including hallucination detection, tool use correctness, and conversational quality. Pytest-style testing for AI agents with CI/CD integration.
LLM Observability
Langfuse is an open-source LLM observability and engineering platform providing tracing, prompt management, evaluations, and dataset management for production AI applications.