The Lens

Promptfoo answers a question every team shipping LLM features eventually hits: did that prompt change actually make things better, or did I just break something I cannot see? It is a developer-first tool to evaluate prompts, compare models side by side, and red-team your app for vulnerabilities like prompt injection and data leakage. It runs as a CLI or library, supports the major providers, and is built to live in CI so regressions get caught before users do.

Ops burden is close to zero. You run npx promptfoo, write your test cases in YAML, and it runs entirely on your machine. There is no server to stand up for the core eval workflow, which is a big part of why developers like it. It plugs into your pipeline the same way your unit tests do.
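To make the YAML workflow concrete, here is a minimal sketch of what a config file might look like. The prompt text, the provider ID, and the test values are illustrative assumptions rather than anything from the project's own examples, so check the official docs for the current schema before relying on it.

```yaml
# promptfooconfig.yaml
# Hypothetical example: the prompt, model choice, and test data are assumptions.
description: Summary prompt regression check

# One or more prompts to evaluate; {{text}} is filled in from each test's vars.
prompts:
  - "Summarize in one sentence: {{text}}"

# Providers to run every prompt against; list several to compare side by side.
providers:
  - openai:gpt-4o-mini

# Each test supplies variables plus assertions the model output must satisfy.
tests:
  - vars:
      text: "Promptfoo runs evals locally from a YAML config."
    assert:
      # Case-insensitive substring check on the output.
      - type: icontains
        value: "promptfoo"
```

Assuming an API key for the chosen provider is set in your environment, running npx promptfoo@latest eval in the same directory should execute the tests and print a pass/fail table. Adding a second prompt or provider to the lists is how you would get the side-by-side comparison.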

The core is MIT-licensed and free, and the free tier does real work: full local evals, model comparisons, and red-teaming up to 10,000 probes a month. The paid Enterprise tier is custom-priced and buys the team layer: a centralized security dashboard, access controls with SSO, and managed or on-prem deployment. Solo developers and small teams can live on the free CLI indefinitely. Reach for Enterprise when you need shared results and SSO across a security team. Promptfoo can stand in for paid eval platforms like Braintrust and LangSmith. Worth noting: Promptfoo is now part of OpenAI, while staying MIT open source.

The catch is the one that defines most open-core tools. The free tier is yours, local and uncapped for evals, but a shared dashboard, team controls, and collaboration are where the paid pitch begins. For most individual developers, that line never gets crossed.


License: MIT License
