17 open source tools compared. Sorted by stars. Scroll down for our analysis.
| Tool | Stars | Velocity | Score |
|---|---|---|---|
| cc-switch: cross-platform desktop all-in-one assistant for Claude Code, Codex, OpenCode, openclaw & Gemini CLI | 67.7k | +7,495/wk | 86 |
| mempalace: the best-benchmarked open-source AI memory system, and it's free | 52.0k | +726/wk | 88 |
| NemoClaw: run OpenClaw more securely inside NVIDIA OpenShell with managed inference | 20.3k | +178/wk | 92 |
| hermes-webui: the best way to use Hermes Agent from the web or from your phone | 6.8k | +972/wk | 73 |
| byterover-cli: ByteRover CLI (brv), the portable memory layer for autonomous coding agents (formerly Cipher) | 4.7k | +30/wk | 59 |
| openclaw-control-center: turn OpenClaw from a black box into a local control center you can see, trust, and control | 4.0k | +14/wk | 78 |
| engram: persistent memory system for AI coding agents; agent-agnostic Go binary with SQLite + FTS5, MCP server, HTTP API, and CLI | 3.4k | +190/wk | 71 |
| Qclaw: use OpenClaw without the command line; easy enough for beginners | 2.7k | +27/wk | 71 |
| codex-console: integrated console supporting task management, batch processing, data export, auto-upload, log viewing, and packaging | 2.1k | +19/wk | 77 |
| boxlite: sandboxes for every agent; embeddable, stateful, with snapshots and hardware isolation | 2.0k | +123/wk | 71 |
| mirage: a unified virtual filesystem for AI agents | 2.0k | - | 67 |
| mcp-brasil: MCP server for 41 Brazilian public APIs | 1.6k | +20/wk | 69 |
| agent-governance-toolkit: policy enforcement, zero-trust identity, execution sandboxing, and reliability engineering for autonomous AI agents; covers 10/10 of the OWASP Agentic Top 10 | 1.5k | +80/wk | 69 |
| gitlab-mcp: the first GitLab MCP server | 1.5k | +28/wk | 67 |
| OpenSquirrel: a native Rust/GPUI control plane for running Claude Code, Codex, Cursor, and OpenCode side by side, for people who get distracted by agents | 1.4k | - | 71 |
| vm0: the easiest way to run natural-language-described workflows automatically | 1.1k | +6/wk | 54 |
Cc-switch wraps Claude Code, Codex, OpenCode, openclaw, and Gemini CLI into a single Tauri-based GUI. Cross-platform, open source, and free. Consider it a launcher that lets you switch between agents without context-switching between terminals. Setup is straightforward: download the app, configure your API keys, and pick which agents you want active. It doesn't add intelligence on top of the agents themselves. It's a convenience layer. The value is entirely in the unified interface and the ability to compare agent outputs side-by-side. Solo developers who already use multiple coding agents will get the most out of this. Teams probably don't need it since most teams standardize on one agent. If you only use one coding agent, there's nothing here for you. The catch: it's a wrapper, not a product. If the underlying agents change their CLI interfaces (which they do, frequently), cc-switch breaks until someone updates the integration. You're adding a dependency on a third-party GUI for tools that already work fine in a terminal.
MemPalace stores your AI conversation history verbatim and searches it semantically. Every Claude session, every project file, indexed locally with ChromaDB vector search, and no data leaves your machine. The structure is a metaphor: projects become wings, topics become rooms, so you can scope searches instead of querying a flat blob. Storing verbatim conversations rather than lossy summaries is the key insight; most memory systems compress your history into summaries and lose critical detail. It publishes 96.6% recall on LongMemEval with no LLM in the loop, and the benchmarks are reproducible from the repo. Install is pip plus pointing it at a directory. ChromaDB is the default backend, embeddings run on CPU with a 300MB model, no API key required. The MCP server exposes 29 tools so Claude Code (or ChatGPT, Cursor, Gemini) can read and write the palace directly during a session. Solo developers using Claude Code heavily: install it. The 'wake-up' command that loads relevant context for a new session is the pitch and it works. Small teams: each engineer runs their own palace, there is no shared knowledge layer yet. The catch: it's about two weeks old, and it's local-only by design, so your palace doesn't follow you across machines. The benchmarks are real but the operational track record is not. Breaking changes will happen, ChromaDB's storage footprint grows with your history, and fast-growing projects attract impostor domains. The README has a scam alert for a reason.
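The wing/room scoping can be pictured with a toy sketch. Plain substring matching stands in for MemPalace's actual embedding-based search here, and the data structure and function names are hypothetical, not its API:

```python
# Toy model of hierarchy-scoped retrieval: wings contain rooms, rooms contain memories.
# Substring matching stands in for the real vector search.
palace = {
    "work": {                                   # wing
        "auth-service": ["JWT rotation discussed in March session",
                         "Refresh tokens expire after 30 days"],
        "billing": ["Stripe webhooks are retried three times"],
    },
    "personal": {"blog": ["Draft post on Rust lifetimes"]},
}

def search(palace, query, wing=None):
    """Search all wings, or only one if a wing is named."""
    wings = [wing] if wing else list(palace)
    return [memory
            for w in wings
            for room in palace[w].values()
            for memory in room
            if query.lower() in memory.lower()]

print(search(palace, "jwt", wing="work"))  # scoped: only the "work" wing is searched
```

Scoping is the point: a query against one wing never touches the others, which is what keeps retrieval relevant as the history grows.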
NemoClaw runs OpenClaw (the open source coding agent) inside NVIDIA's OpenShell sandbox with managed inference, solving the real security risk of agents executing arbitrary code on your machine. Your agent gets GPU-accelerated model inference through NVIDIA's infrastructure while staying sandboxed. This is NVIDIA saying 'run your coding agents on our hardware, securely.' You get the performance of NVIDIA GPUs for inference without managing the infrastructure yourself. The sandbox prevents the agent from doing anything destructive to your system. Apache 2.0 licensed. The catch: this ties you to NVIDIA's ecosystem. You need NVIDIA hardware or their cloud infrastructure, no running this on Apple Silicon or AMD GPUs. It's OpenClaw-specific, so Claude Code and Cursor users are out. And 'managed inference' is a gateway to NVIDIA's paid compute. The tool is free but the GPU time may not be.
Hermes WebUI is a browser frontend for Hermes Agent, a self-hosted autonomous AI agent that holds memory across sessions, runs scheduled jobs, and integrates with messaging platforms. Free and MIT-licensed. Setup is moderate. You bring your own LLM API key (OpenAI, Anthropic, Google, DeepSeek, OpenRouter, others) and run the agent plus WebUI on your own hardware or VPS. Once running, the agent persists conversation context, learns from interactions, and can be triggered on a schedule. The web UI mirrors the CLI experience without locking you out when you close the terminal. For solo developers and small teams who want an AI agent that isn't tied to ChatGPT or Claude.ai, this is a real option. Your conversations, your memory, your hardware. The cost is your LLM API bill, which can climb fast if the agent is making frequent calls. Solo: probably $10 to $50 per month in API spend depending on usage. The catch is that "autonomous AI agent" is doing a lot of work in the description. These systems still hallucinate, still drift, still need supervision. Don't wire it into anything destructive without guardrails.
ByteRover adds a persistent memory layer that travels with you. It works as a CLI tool that sits alongside Claude Code, Codex, or any agent that reads context files. It's a portable brain for your coding assistant. Install it globally, run `brv init` in your project, and it creates a structured memory store. The agent can read and write to it during sessions, building up project knowledge over time. It stores things like architecture decisions, coding conventions, and task history. The data lives on your machine in JSON files. This solves a real problem for developers who spend the first 5 minutes of every AI session re-explaining their project. Solo developers and small teams get the most value. The memory is project-scoped, so each repo gets its own context. The catch: you're trusting a third-party tool to manage context that feeds directly into your AI agent. If the memory format drifts from what agents expect, or if the project goes unmaintained, you've got stale context files that might do more harm than good. And Claude Code already has its own CLAUDE.md convention for project context, so the overlap is real.
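The idea of a project-scoped memory file can be sketched in a few lines. The path and schema below are hypothetical illustrations, not ByteRover's actual format:

```python
import json
from pathlib import Path

# Hypothetical store path and schema; ByteRover's real layout may differ.
store = Path(".brv") / "memory.json"
store.parent.mkdir(exist_ok=True)

memory = {
    "architecture": ["API layer talks to Postgres via sqlc"],
    "conventions": ["snake_case for DB columns, camelCase in JSON"],
    "tasks": [],
}
store.write_text(json.dumps(memory, indent=2))

# Next session, the agent reloads the same context instead of being re-told.
loaded = json.loads(store.read_text())
print(loaded["conventions"][0])
```

Because the store is just JSON on disk, it can be committed to the repo or inspected by hand, which is also why a format drift between the tool and the agent is easy to spot.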
Openclaw-control-center gives you full visibility into a running agent. You get a dashboard that shows what OpenClaw is doing in real time, how much each task costs, and lets you set guardrails. It turns OpenClaw from 'fire and pray' into something you can actually trust and control. You see every API call, every decision branch, every token spent. You set budget limits, approve expensive operations, and kill tasks that go off the rails. MIT licensed, TypeScript. The catch: this is OpenClaw-specific. If you're using Claude Code, Cursor, or Codex, this does nothing for you. And 'control center' implies oversight, but you still need to understand what you're looking at. It surfaces the data, it doesn't interpret it for you. Early stage, so expect UI rough edges.
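A budget guardrail of this kind reduces to an accumulator that refuses work once a cap is hit. A minimal sketch, not the project's actual API:

```python
# Minimal spend guardrail: track cost per task and refuse work past the cap.
class BudgetGuard:
    def __init__(self, limit_usd):
        self.limit = limit_usd
        self.spent = 0.0

    def charge(self, cost_usd):
        """Record a cost, or raise before the budget is exceeded."""
        if self.spent + cost_usd > self.limit:
            raise RuntimeError(
                f"budget exceeded: ${self.spent + cost_usd:.2f} > ${self.limit:.2f}")
        self.spent += cost_usd

guard = BudgetGuard(limit_usd=1.00)
guard.charge(0.40)   # ok
guard.charge(0.40)   # ok, total $0.80
# guard.charge(0.40) would raise: $1.20 > $1.00
```

The check runs before the spend is committed, so a runaway task is stopped at the boundary rather than reported after the fact.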
Engram gives your coding agent persistent memory. It's a Go binary with SQLite and full-text search that any AI agent can read and write to, so context survives across sessions. It works via MCP server, HTTP API, or CLI, meaning it's agent-agnostic. Claude Code, Codex, OpenClaw, or anything else that speaks HTTP can use it. Your agent writes memories during a session and reads them back next time. Full-text search (FTS5) means it retrieves relevant context, not just raw dumps. MIT licensed, Go. The catch: persistent memory is only useful if the agent writes good memories. Garbage in, garbage out. If the agent stores irrelevant context, it pollutes future sessions. SQLite is great for single-user but won't scale to a team sharing one memory store. And the MCP protocol is still young; not every agent supports it natively.
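The SQLite + FTS5 core is easy to picture. This is a generic FTS5 sketch in Python, not Engram's actual schema or Go code:

```python
import sqlite3

# Generic FTS5 memory table; Engram's real schema will differ.
db = sqlite3.connect(":memory:")
db.execute("CREATE VIRTUAL TABLE memories USING fts5(content)")
db.executemany("INSERT INTO memories(content) VALUES (?)", [
    ("Auth service signs each JWT with a 15-minute expiry",),
    ("Database migrations live in ./migrations",),
    ("Frontend builds with Vite",),
])
# Full-text MATCH retrieves relevant context, not a raw dump of everything.
rows = db.execute(
    "SELECT content FROM memories WHERE memories MATCH 'jwt'"
).fetchall()
print(rows[0][0])
```

The query returns only the matching memory, which is the difference between retrieval and replaying an entire history into the agent's context window.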
Qclaw is a GUI wrapper for OpenClaw that removes the command-line barrier. If you want to use AI coding tools but the terminal feels intimidating, Qclaw puts a graphical interface on top of OpenClaw's capabilities. Chinese-language interface, built for users who prefer visual interaction over command-line workflows. It translates OpenClaw's CLI operations into clickable buttons and forms. The catch: Chinese-language only, and it wraps another tool rather than providing standalone functionality. You still need OpenClaw installed underneath. If you're comfortable with a terminal, OpenClaw directly is more flexible. And because it depends on another project's API, breaking changes upstream can break Qclaw.
Codex-console is an integrated control panel for that workflow. Task management, batch processing, data export, auto-upload, log viewing, and packaging in one place. Built in Python with MIT license. The project provides compatibility fixes and experience optimizations for managing multiple concurrent AI coding sessions. The catch: the README and documentation are entirely in Chinese. If you don't read Chinese, you'll be navigating the tool through translation or code reading. The project is a console/dashboard wrapper. It doesn't do the AI work itself, it just helps you manage it. And at with limited English documentation, community support outside Chinese-speaking developers will be thin.
Boxlite gives you lightweight sandboxes. Each sandbox is a stateful micro-VM with hardware isolation, snapshots, and an API to control it. Picture giving every AI agent its own disposable computer. The project is open source under Apache 2.0 and self-hosting is free. It's early but growing fast. The 'agent sandboxing' space is heating up as AI agents get more autonomous and need safer execution environments. The catch: this is emerging technology. The documentation and ecosystem are still maturing. Running Firecracker-based micro-VMs requires Linux with KVM support. No macOS, no Windows natively. And the question of whether you need full VM isolation versus Docker containers depends on your threat model. For most use cases, Docker is simpler. Boxlite is for when you can't trust the code being executed.
Mirage mounts S3 buckets, Google Drive, Slack, Gmail, and Redis side by side as one filesystem so an AI agent can use familiar Unix commands across all of them. Instead of teaching the agent five different SDKs, you point it at a virtual filesystem and let it `grep`, `cat`, `cp`, and pipe between services the way it would on a local disk. It's an abstraction layer designed for how agents already think. Install is pip or npm or a curl one-liner. Python 3.12+ or Node 20+, macOS or Linux. You provide credentials for whichever backends you want to mount (AWS, Google, Slack, etc.) and Mirage exposes them as paths. There's no central service; everything runs locally inside the agent's environment. Solo developers building agent workflows: this is the kind of glue you'd otherwise hand-roll, and having it as an Apache 2.0 package is useful. Small teams shipping agents: worth testing as part of your tooling stack. Large teams: monitor the project; it's young but the design is right. The catch: v0.0.1, released May 6th, 2026. First public release. The abstraction is interesting but the implementation is brand new. Expect rough edges and breaking changes. Pin the version and read release notes carefully.
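The core idea, one path namespace routed to many backends, can be sketched like this. The handlers are stubs standing in for real S3/Drive/Slack clients, and none of this is Mirage's implementation:

```python
# Toy virtual-filesystem router: the path prefix decides which backend serves a read.
def vfs_read(path, mounts):
    for prefix, backend in mounts.items():
        if path.startswith(prefix):
            return backend(path[len(prefix):])  # strip the mount prefix
    raise FileNotFoundError(path)

mounts = {
    "/s3/":     lambda key: f"(object {key!r} fetched from S3)",
    "/gdrive/": lambda fid: f"(file {fid!r} fetched from Drive)",
}

print(vfs_read("/s3/reports/q1.csv", mounts))
```

Layer `grep`-, `cat`-, and `cp`-style commands on top of reads like this and the agent never needs to know which SDK sits behind a given path.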
Mcp-brasil wraps 41 Brazilian public APIs into one standardized MCP interface your agent can query. The smart part: it doesn't dump all 200+ tools on your agent at once. BM25 search filters to show only relevant tools per query, and a query planner can combine multiple APIs in a single call. 24 of the APIs need no authentication at all. The remaining ones use 2 optional API keys you get with free registration. MIT licensed. Built in Python with async httpx, Pydantic v2, and rate limiting with backoff. The catch: this is Brazil-specific. If you're not working with Brazilian data, there's nothing here for you. And wrapping government APIs means you inherit their reliability issues: downtime, rate limits, and data quality are the upstream provider's problem, not the wrapper's. The project is brand new and maintained by what appears to be a single developer.
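BM25 is a classic lexical ranking, and a compact self-contained version (not mcp-brasil's code) shows how a query narrows 200+ tools down to a relevant few:

```python
import math
import re

def bm25_rank(query, docs, k1=1.5, b=0.75):
    """Rank documents against a query with BM25; returns indices, best match first."""
    tokenize = lambda s: re.findall(r"\w+", s.lower())
    tdocs = [tokenize(d) for d in docs]
    avgdl = sum(len(t) for t in tdocs) / len(tdocs)
    N = len(tdocs)

    def score(terms):
        s = 0.0
        for q in tokenize(query):
            tf = terms.count(q)                             # term frequency in this doc
            df = sum(1 for t in tdocs if q in t)            # document frequency
            idf = math.log((N - df + 0.5) / (df + 0.5) + 1)
            s += idf * tf * (k1 + 1) / (tf + k1 * (1 - b + b * len(terms) / avgdl))
        return s

    return sorted(range(N), key=lambda i: -score(tdocs[i]))

# Stand-in tool descriptions, not the project's real catalog.
tools = [
    "look up a Brazilian address by CEP postal code",
    "query vehicle registration records",
    "fetch the national holiday calendar",
]
print(bm25_rank("CEP address lookup", tools)[0])  # index of the best-matching tool
```

Only the top-ranked handful of tool descriptions would then be exposed to the agent, keeping its context small.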
Agent Governance Toolkit puts deterministic policy enforcement between your AI agents and the actions they take. Every tool call, resource access, and inter-agent message gets evaluated against policy before execution. Not prompt-based safety (which fails 27% of the time in red-team tests) but application-layer enforcement with a 0% violation rate. Works with any agent framework: LangChain, CrewAI, AutoGen, AWS Bedrock, Google ADK, Azure AI, and 20+ others. Ships with a CLI (`agt`), governance dashboard, and covers all 10 OWASP Agentic risks. SDKs for Python, TypeScript, Rust, Go, and .NET. Sub-millisecond policy checks. Free and open source under MIT. Solo devs building agents should use this from day one. Teams running agents in production need this or something like it. There is no excuse for shipping autonomous agents without action-level governance. The catch: this is still in public preview, so expect breaking changes before GA. It governs agent actions, not model outputs. For prompt-level safety, you still need a separate content moderation layer.
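Action-level enforcement reduces to a deterministic check that runs before every tool call. A minimal sketch with a hypothetical policy shape, not the `agt` SDK:

```python
# Hypothetical deterministic policy gate; every tool call passes through it first.
POLICY = {
    "allowed_tools": {"read_file", "search_code"},
    "blocked_paths": ("/etc/", "/root/.ssh/"),
}

def enforce(tool, args):
    """Allow or deny a tool call before it executes. Same call, same verdict: no model in the loop."""
    if tool not in POLICY["allowed_tools"]:
        raise PermissionError(f"policy: tool '{tool}' is not allowed")
    path = args.get("path", "")
    if any(path.startswith(p) for p in POLICY["blocked_paths"]):
        raise PermissionError(f"policy: path '{path}' is blocked")
    return True

enforce("read_file", {"path": "src/main.py"})   # passes
# enforce("delete_file", {"path": "src/main.py"}) would raise PermissionError
```

Because the gate is plain code rather than a prompt, an adversarial input can't talk it into an exception, which is the whole argument for application-layer enforcement.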
GitLab MCP fills the gap Anthropic left open. GitHub has an official MCP server for AI coding assistants. GitLab does not. This community-built server connects Claude Code, Cursor, Copilot, VS Code, and Codex to your GitLab instance, exposing merge requests, issues, pipelines, wiki, releases, and labels as callable tools. Setup is simple for local use: one npx command plus a personal access token. Self-hosted GitLab works fine with a custom API URL. For team deployments, there's a Docker image with OAuth2 support and multi-user remote authorization. Four auth methods cover everything from quick local testing to production multi-tenant setups. Solo developers on GitLab get AI coding assistant integration that was previously GitHub-only. Teams running self-hosted GitLab get the same MCP capabilities without migrating to GitHub. There's a read-only mode toggle for safety if you want to prevent the AI from making changes. The catch: community-maintained, not official GitLab or Anthropic. Feature parity depends on one maintainer keeping up with GitLab's API surface. The multi-user OAuth setup requires a public HTTPS endpoint and pre-registered GitLab app, which is non-trivial.
OpenSquirrel is a native desktop app that puts Codex, Cursor, and OpenCode in one window so you stop losing track of what each one is doing. A control plane for your AI coding agents, built in Rust with the GPUI framework. What's free: Everything. MIT licensed, fully open source. No paid tier, no cloud service. The pitch is honest: you're squirrely, you jump between agents, and you need a way to see them all at once without alt-tabbing through six terminal windows. The Rust/GPUI foundation means it's fast and native, not an Electron wrapper eating 2GB of RAM. The catch: this is early, so it's not battle-tested yet. GPUI (Zed's UI framework) is relatively new itself, so you're building on new foundations. If you only use one AI coding tool, this adds zero value. It's specifically for the multi-agent workflow that a growing number of developers are adopting.
VM0 runs AI coding agents in isolated cloud sandboxes on a schedule. Describe a workflow in natural language, point it at a repo, and it executes in a Firecracker microVM with full Claude Code compatibility. Think of it as cron for AI agents, with sandboxing built in. The platform gives you persistence (resume, fork, version sessions), observability (logs, metrics, network visibility), and integration with 35,000+ skills via the skills.sh ecosystem. Self-hosting means running the Firecracker VM infrastructure yourself, which is a real infrastructure commitment. Solo developers who want to automate repetitive coding tasks (daily CI fixes, dependency updates, report generation) get the most value here. Teams running multiple agents benefit from the orchestration layer. The catch: this is very early stage. The license isn't a standard OSS license, the docs are sparse, and you're building on a startup's roadmap. The managed cloud is the realistic path for most users, and pricing for that isn't finalized yet.