
phoenix
AI Observability & Evaluation
The Lens
Phoenix is open source observability for AI apps. When an LLM feature misbehaves in production, this is how you see why. It traces every model call your app makes, captures the prompts and responses, and lets you run evals to score output quality over time. Arize built it on OpenTelemetry, and it runs in a notebook, a container, or on your own server.
Self-hosting is the point, and it delivers. Run the container, point your app's tracing at it, and you get traces, datasets, experiments, and prompt management in one UI with nothing leaving your infrastructure. That matters when your prompts carry customer data you can't ship to a vendor. The ops burden is moderate. This is a real service to keep running, not a library you import.
Solo developers and small teams should self-host this and skip the LLM-observability SaaS bills from the LangSmith and Datadog tier. You get tracing and evals for the cost of a container. Larger teams already paying for Arize's hosted platform get the managed version; the open release is the same engine minus the ops work.
The catch is the license. Phoenix is Elastic License 2.0, not true open source. You can self-host and use it freely, but you cannot stand it up as a competing hosted service. For anyone using it to debug their own app, that line never gets crossed. Just know it is source-available, not MIT.
Free vs Self-Hosted vs Paid
source availableFree (self-hosted): The full Phoenix platform, tracing, evals, datasets, experiments, prompt management, runs free under the Elastic License 2.0. No feature gating, no seat limits.
Self-hosted reality: A real service to run as a container or server. Moderate ops. Your data never leaves your infrastructure.
Paid (Arize): Arize sells a managed cloud and the larger Arize AX enterprise platform for teams that want hosting, scale, and support instead of running it themselves.
Free to self-host under the Elastic License. Source-available, not true open source, but that restriction never touches teams using it to debug their own apps.
Get tools like this every Wednesday
One featured tool, three on the radar. No fluff.
Similar Tools
License: Other
Review license manually.
Commercial use: ✗ Restricted
About
- Owner
- Arize AI (Organization)
- Stars
- 10,092
- Forks
- 915
Explore Further
More tools in the directory
everything-claude-code
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
212.9k ★hermes-agent
The agent that grows with you
190.3k ★dify
Production-ready platform for agentic workflow development.
144.8k ★




