28 open source tools compared. Sorted by stars. Scroll down for our analysis.
| Tool | Stars | Velocity | Score |
|---|---|---|---|
everything-claude-code The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond. | 221.8k | +3951/wk | 86 |
gstack Use Garry Tan's exact Claude Code setup: 15 opinionated tools that serve as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA |
| 116.0k |
| +4714/wk |
| 91 |
caveman ๐ชจ why use many token when few token do trick โ Claude Code skill that cuts 65% of tokens by talking like caveman | 77.0k | +2367/wk | 86 |
graphify AI coding assistant skill (Claude Code, Codex, OpenCode, OpenClaw). Turn any folder of code, docs, papers, or images into a queryable knowledge graph | 72.2k | +3385/wk | 86 |
ponytail Makes your AI agent think like the laziest senior dev in the room. The best code is the code you never wrote. | 58.6k | +20871/wk | 82 |
career-ops AI-powered job search system built on Claude Code. 14 skill modes, Go dashboard, PDF generation, batch processing. | 55.8k | +1313/wk | 86 |
codex-plugin-cc Use Codex from Claude Code to review code or delegate tasks. | 21.7k | +435/wk | 83 |
prompt-master A Claude skill that writes the accurate prompts for any AI tool. Zero tokens or credits wasted. Full context and memory retention | 9.8k | +302/wk | 84 |
skills Claude Code skills based on The Minimalist Entrepreneur by Sahil Lavingia | 9.2k | +77/wk | 75 |
claude-obsidian Claude + Obsidian knowledge companion. Persistent, compounding wiki vault based on Karpathy's LLM Wiki pattern. /wiki /save /autoresearch | 7.9k | +770/wk | 77 |
gsd-2 A powerful meta-prompting, context engineering and spec-driven development system that enables agents to work for long periods of time autonomously without losing track of the big picture | 7.7k | - | 80 |
dbskill dontbesilent ็ๅไธ่ฏๆญ Skills for Claude Code | 7.0k | +218/wk | 63 |
awesome-codex-subagents A collection of 130+ specialized Codex subagents covering a wide range of development use cases. | 5.3k | +99/wk | 82 |
get-shit-done-redux Getting Shit Done, the Aftermath | 5.1k | +751/wk | 79 |
lottie Open-source skill and harness for generating production ready Lottie animations with codex/claude code | 3.8k | +372/wk | 73 |
chrome-cdp-skill Give your AI agent access to your live Chrome session โ works out of the box, connects to tabs you already have open | 3.1k | +13/wk | 74 |
diagram-design Thirteen editorial diagram types for Claude Code. Self-contained HTML + SVG. No shadows, no Mermaid-slop. | 2.6k | +34/wk | 71 |
wewrite ๅ ฌไผๅทๆ็ซ ๅ จๆต็จ AI Skill for Claude Code โ ็ญ็นๆๅ โ ้้ข โ ๅไฝ โ SEO โ ่ง่งAI โ ๆ็ โ ๅพฎไฟก่็จฟ็ฎฑ | 2.5k | +113/wk | 71 |
baoyu-design Run Claude Design locally as an Agent Skill โ Cursor, Claude Code & more. Produce polished UI mockups, prototypes, decks & wireframes as self-contained HTML, without claude.ai/design. Best with Opus 4.8. | 1.9k | +340/wk | 69 |
chromex A Codex-powered Chrome side-panel assistant for page context, tabs, voice, and image workflows. | 1.1k | +1/wk | 67 |
pixel2motion AI logo animation skill: turn raster logos into smooth SVG animation, animated HTML demos, GIF/video previews, and motion QA evidence. | 1.0k | +290/wk | 64 |
pm-claude-skills 174 professional Agent Skills (SKILL.md) + subagents + slash commands for Claude, ChatGPT, Gemini, Cursor, Codex & Hermes โ one source, every AI tool | 1.0k | - | 65 |
vibecode-pro-max-kit Your AI forgets. This remembers. Spec-driven coding harness for vibecoders, product owners, CEOs and real builders โ self-improving context memory, 12 agents, 32 skills. Kills context rot, ships features, not spaghetti. Claude Code & Codex. Any stack. 30 seconds | 995 | +71/wk | 63 |
security-audit-skill A coding-agent skill for multi-phase security audits with independently verified, machine-readable findings | 917 | - | 66 |
guard-skills Guard skills for coding agents, quality gates that catch AI-generated failure modes in code, tests, and docs | 894 | +64/wk | 61 |
qiaomu-goal-meta-skill Turn vague or complex Codex tasks into strong `/goal` commands with outcome, verification, constraints, boundaries, iteration policy, completion evide | 731 | +71/wk | 64 |
recall Stop wasting tokens and re-explaining your project every session. Recall gives Claude Code durable memory, entirely offline. | 552 | - | 60 |
compass-skills ๅธๅ๏ผไธชๆงๅ AI ไปปๅกๆปๆง Skills ็ณป็ป /COMPASS: Personal Alignment Skills OS for AI Agents | 490 | - | 62 |
Stay ahead of the category
New tools and momentum shifts, every Wednesday.
Everything Claude Code (ECC) is a plugin you install through the Claude Code marketplace that bundles 63 agents, 249 skills, 79 commands, hooks, MCP server configs, and security enforcement scripts into one harness. The pitch is: stop hand-rolling your CLAUDE.md and skills, install this and get a curated set that actually works. It's MIT licensed and works across Claude Code, Cursor, OpenCode, and Codex, not just Claude. The agents cover the usual suspects (code review, security analysis, planning, language-specific reviewers) and the skills span TDD, video editing, and dozens of niche workflows. Hooks auto-execute on editor events. The security scanning piece tries to catch prompt injection and rule violations before they hit your codebase. Solo devs and small teams using AI coding agents get the biggest lift here. Pulling in 63 agents you didn't ask for is overkill, but cherry-picking the ones that match your workflow is the real value. Large teams will probably want to fork it and trim aggressively. The catch: 249 skills is a lot of surface area to audit. You're trusting someone else's prompt engineering to run inside your editor. Read the security-relevant pieces before you turn the hooks on, and treat this as a starting template, not gospel.
gstack is Garry Tan's (YC CEO) Claude Code workflow as a public repo. It ships 23 specialized agents that act like a virtual engineering team, plus slash commands like /office-hours, /review, and /ship that drive end-to-end product work from one terminal. The agents play roles: CEO, designer, engineer manager, QA lead, and more. The idea is that a solo dev orchestrates the team through commands instead of doing every job in series. TypeScript core with some Go for the parts that need speed. Everything lives in .claude/, nothing touches your PATH, and a persistent Chromium browser daemon handles visual testing and QA loops. MIT licensed. Solo devs and tiny teams are the target. If you're already shipping by yourself with Claude Code, this is a real upgrade. Bigger teams will trip over the opinionated roles, since you already have a real designer and QA lead and don't need an agent pretending to be one. The catch: this is one founder's workflow turned into a kit. If your style doesn't match Garry's, you'll fight the agents instead of using them. It's also Claude Code only. No Cursor, no Codex, no swap-in for other harnesses.
Caveman strips the fluff from Claude Code responses. Install it with one command, activate with /caveman, and your AI assistant drops the pleasantries, hedging, and filler words while keeping full technical accuracy. Average savings: 65% fewer output tokens. Four intensity levels now: Lite stays professional but terse, Full drops articles and uses fragments, Ultra goes telegraphic, and Wenyan compresses into classical Chinese. Code blocks, error messages, git commits, and technical terms pass through untouched. Only the natural language gets compressed. A companion tool (caveman-compress) rewrites your CLAUDE.md and memory files to cut input tokens too. Works across 40+ AI coding agents, not just Claude Code. Cursor, Copilot, Windsurf, Cline, Codex, all supported. Heavy token users will feel the difference in both speed and cost. The catch: it started as a meme (Kevin from The Office) but the benchmarks are real, backed by a 2026 arxiv paper. Ultra mode can be hard to read. And the savings are output tokens only, so your thinking and reasoning costs stay the same.
Graphify reads your entire codebase, docs, PDFs, and even screenshots, then builds a knowledge graph you can actually navigate. It parses 19 languages via tree-sitter for code and uses an LLM for everything else. The result is an interactive HTML visualization showing how your architecture, concepts, and files connect. The first extraction pass costs real API tokens (Claude or GPT), proportional to your corpus size. After that, incremental updates via SHA256 caching mean re-runs only process changed files. The 71x token compression claim is real for subsequent queries, not the initial scan. Runs as a /graphify slash command inside Claude Code, Codex, or OpenCode. For developers onboarding to large or unfamiliar codebases: this is genuinely useful. Architecture reviews, cross-referencing code with design docs, understanding how a monorepo fits together. Exports to Neo4j, Obsidian vaults, or standalone wikis. The catch: it is a plugin, not a standalone tool. You need Claude Code or a compatible AI assistant as the runtime. Quality of inferred relationships depends on the underlying LLM, and the initial scan of a large repo is not cheap.
Ponytail is a skill that nudges your AI coding agent to write less code. It bakes in "lazy senior developer" instincts: reach for the standard library, use native language features, lean on dependencies you already have before writing anything custom. The pitch is that agents left alone tend to over-build, and ponytail pushes them toward the minimal solution a seasoned developer would actually ship. Setup is trivial, it installs as a skill or plugin and works across Claude Code, Codex, OpenCode, Cursor, Windsurf, and other agent platforms. There's nothing to host. It's MIT-licensed and free, and it's actively maintained, several releases deep. You add it once and it shapes how your agent approaches problems from then on. Developers who've watched an agent reinvent something the standard library already does, or bloat a file with code that didn't need to exist, are the audience. If you carefully review every line your agent writes anyway, the gain is smaller. The value is in steering output toward leaner, more maintainable code without you having to police it every time. The catch: it's a behavioral nudge, not a guarantee. It shifts the agent's tendencies, but the model still does what it does, and "minimal" is a judgment call it won't always get right. The claimed reductions in code volume are real in spirit, but treat them as a direction, not a promise. You're still the one who has to read what comes out.
Career-Ops turns your AI coding CLI into a full job search command center. Paste a job URL, get a structured A-F evaluation against your CV with weighted scoring across 10 dimensions. It generates tailored, ATS-optimized PDF resumes per application. One developer used it to evaluate 740+ offers and land a Head of Applied AI role. The system gets smarter over time. It builds an interview story bank in STAR format, scans 45+ pre-configured company portals (Anthropic, OpenAI, Vercel, n8n), and can batch-evaluate offers in parallel using sub-agents. The Go-based dashboard lets you browse your entire pipeline from the terminal. Running it needs Node.js and Playwright for the PDF generation and portal scanning. Solo job seekers running any AI coding CLI: this is a force multiplier. It used to be Claude Code only, but now it also works with Gemini CLI, OpenCode, Copilot, and Qwen, several of which have free tiers. It does not replace your judgment, it structures it. The human-in-the-loop design means the AI evaluates and you decide. The catch: the tool is free, but it leans on an AI CLI underneath, so your real cost depends on which provider you pick and how hard you run it. The pre-configured portals skew heavily toward AI and tech companies. And the first evaluations are rough until you feed it enough context about yourself.
OpenAI's Codex agent packaged as a Claude Code skill plugin. It lets you invoke Codex from inside Claude Code to review code or delegate tasks, connecting two AI coding agents so they can collaborate. Useful if you want a second opinion from a different model without switching tools. The integration is straightforward: install the skill, and you can ask Claude Code to hand off specific tasks to Codex. Code review is the primary use case, where having two different models look at the same code catches more issues than either alone. The catch: requires both Claude Code and OpenAI API access, so you're paying for two AI services to talk to each other. The value proposition only makes sense if you're already invested in both ecosystems. For most developers, one AI coding tool is enough.
Prompt Master writes the prompts for you. It's a Claude Code skill that generates accurate, context-aware prompts for any AI tool, optimized so you waste fewer tokens and get better output on the first try. The value proposition is simple: instead of trial-and-error with different prompt phrasings, you describe what you want and Prompt Master generates the prompt that actually works. It retains full context and memory across your session, so each prompt builds on what came before. MIT licensed. The catch: this is a skill that writes prompts for other AI tools, so you're adding an extra LLM call before every interaction. If your prompts are already working fine, this is overhead. And 'accurate prompts' is a bold claim. Prompt engineering is still more art than science, and what works for one model may not work for another.
This turns Sahil Lavingia's entire Minimalist Entrepreneur methodology into executable Claude Code skills. Nine skills you install in your terminal: validate your idea, scope an MVP, find first customers, set pricing, and more. Instead of reading a book and trying to apply it, you invoke a skill and your AI walks through Sahil's exact framework applied to your specific situation. It's interactive: it asks you questions, processes your answers, and gives you structured output. The skills are well-structured and the methodology is proven (Gumroad was built on it). The catch: this is a business methodology, not a technical tool. The quality of the output depends entirely on how good your inputs are. And it's one person's framework; if you disagree with the minimalist approach, you'll disagree with the advice. No pricing page for the skills themselves: they're free, but the book is $17 on Amazon if you want the full context.
Claude Obsidian turns your Obsidian vault into an autonomous knowledge engine powered by Claude Code. Instead of passive AI chat, it actively reads sources you drop in, extracts entities and concepts, creates cross-referenced wiki pages, and maintains a session context cache so the next conversation picks up where you left off. Ten skill commands cover everything from ingestion to vault linting to autonomous web research. Setup is a git clone and a shell script. The vault structure works directly in Obsidian with no plugin conflicts. It supports six wiki modes (Website, GitHub, Business, Personal, Research, Book/Course) and runs an 8-category vault linter that catches orphan notes, dead links, stale claims, and missing cross-references. Batch ingestion runs through parallel agents. Solo knowledge workers who already use Obsidian and Claude Code get a structured workflow for turning raw sources into an organized, interlinked wiki. The hot.md context cache is a clever solution to Claude Code's session boundary problem. The catch: you need a Claude Code subscription to run any of it, so it's free software that requires paid infrastructure. Heavy ingestion sessions burn through Claude Code context fast. The quality depends on prompt engineering that could break with model updates.
GSD-2 is a framework for keeping AI agents on track through long, complex tasks by giving them structured context and spec-driven goals. If you've used AI coding agents and watched them lose the plot halfway through a big refactor, this is the fix. The core idea is 'context engineering': you define specs that describe what the agent should build, break work into phases, and the framework ensures the agent always has the right information at the right time. Specs look like structured documents with acceptance criteria, constraints, and dependencies. Instead of the agent drowning in its own conversation history, GSD-2 feeds it focused context windows that keep each step scoped and grounded. The catch: growing fast, but the API is still evolving. Docs are catching up. This is a bet on a concept (structured agent orchestration) rather than a stable production tool. If you need something battle-tested today, look at established agent frameworks. If you want to experiment with the next wave of agent reliability, this is worth watching.
This is a prompt library that gives Claude structured frameworks for doing it. Picture pre-built consulting templates that turn Claude into a business analyst. What's free: Everything. It's a collection of Claude Code skills (prompt files) you drop into your project. No install, no dependencies, no account. The real value here is the structure. Instead of prompting Claude from scratch every time you need a SWOT analysis or process audit, these skills give it a repeatable framework. The prompts are well-organized and cover common business diagnostics. The catch: it's brand new (almost all growth in the last week), the description is partially in Chinese, and the license is listed as 'Other' which means you should read it before using commercially. The skills are also opinionated. They assume a specific diagnostic methodology that may not match how you work. And since these are just prompt files, the barrier to building your own is low.
This is a curated collection of subagents for OpenAI's Codex CLI, now past 170 entries across more than a dozen categories. Picture an app store of pre-built specialists: each subagent is a config file tuned for one job, testing, documentation, security review, database migration, and dozens more. You don't build these yourself. You browse the list, grab the ones that fit your workflow, and drop them into your Codex agents directory. They use Codex's native TOML format with Codex-specific fields like reasoning effort and sandbox mode. Codex does not spawn them automatically, you delegate to them explicitly. The collection is free. You pay for Codex usage through your OpenAI account. It is community-maintained and still growing, so the catalog keeps expanding. The catch: this is a curated list, not a framework. Quality varies across the entries. Some are polished, some are experiments. And it is Codex-only. It does not work with Claude Code, Cursor, or other AI coding tools, so the value evaporates the moment you switch assistants.
get-shit-done-redux is a system for keeping AI coding assistants reliable over long sessions. The problem it targets is context rot, the quality drop that happens as an agent fills its context window and starts forgetting what it was doing. GSD fixes this with a structured command loop and fresh subagent contexts, so the main window stays uncluttered. It works with Claude Code, OpenCode, Gemini CLI, and others. MIT licensed and free. The workflow is six commands: start a project, discuss the phase, plan it, execute through parallel agents, verify the result, then ship and repeat. Each executor gets its own clean context window, which keeps your primary session running at a fraction of capacity instead of choking on accumulated history. Install is a single npx command, so there is almost nothing to set up. Solo developers who run long agent sessions and watch quality degrade will get the most out of this. Small teams adopting a shared agent workflow can standardize on the same command loop. It is free at every scale, so the only cost is learning the discipline it imposes, which is the actual point. The catch is that this is methodology wrapped in tooling. It only helps if you commit to the loop. Bolt it onto a chaotic workflow and you will get chaos with extra steps. The structure is the value, and structure takes buy-in.
This turns "make me a loading spinner animation" into actual Lottie code through your AI coding agent. Lottie is the format apps use for lightweight vector animations, and creating them normally means After Effects and a plugin. diffusionstudio/lottie installs as a skill for agents like Claude Code, then converts an SVG or a plain-text description into working animation code you can drop into web, React Native, iOS, Android, or Flutter. Setup is one command, npx skills add diffusionstudio/lottie, and you're generating animations by prompting. There's nothing to host and no account. Because it produces standard Lottie JSON, the output plugs into the same players and tooling you'd already use. The whole thing is MIT-licensed and free. Developers who need simple animations but don't have a motion designer, or don't want to learn After Effects, are the target. It won't replace a real animator for complex, hand-crafted motion work. For spinners, icon transitions, and straightforward UI animation, it's a fast path from idea to code without leaving your editor. The catch: it leans on your AI agent, so the quality of what you get tracks the quality of your prompt and your model. Intricate animation is still beyond what a text prompt produces cleanly, this shines on the simple-to-moderate stuff. Think of it as a head start, not a finished motion-design studio.
This skill connects your agent to your live Chrome via the Chrome DevTools Protocol (CDP). Your agent can read pages, click buttons, fill forms, and navigate, in the browser you're already using. The difference from tools like Playwright is that this connects to existing tabs. Your agent can interact with pages where you're already authenticated, see what you see, and do what you'd do manually. MIT licensed, JavaScript. The catch: giving an AI agent access to your live browser session with all your logged-in accounts is a real security consideration. The agent can see everything you can see, including sensitive data in open tabs. There's no permission model beyond 'full access.' And CDP connections can be fragile; Chrome updates can break the protocol.
Diagram-design is a Claude Code skill for editorial-quality diagrams. Architecture sketches, flowcharts, sequence diagrams, quadrants, pyramids. 13 types total, all rendered as self-contained HTML and SVG with no JavaScript or build step. The output is opinionated: low density, restrained color, accent used sparingly on the one or two things that matter. It looks like something a design team made, not generic AI output. Install is clone or plugin, then tell Claude to onboard it to your website and it pulls your brand palette and typography from your homepage. After that every diagram uses your colors. The skill activates automatically when you ask for a diagram. Solo writers and technical bloggers: install it. Small teams with a brand style guide: install it team-wide and onboard to your site. Large teams with a design system: the editorial constraints may conflict with your existing design language, evaluate first. The catch: the plugin route puts the skill in a cache that updates overwrite, so style-guide customizations get wiped unless you clone the repo and symlink. And the opinionated style is the whole pitch. If you want Mermaid-style diagrams, this is the wrong tool.
Wewrite is a Claude Code skill that handles trending topic research, topic selection, article writing, SEO optimization, and publishing. Built for the Chinese content market. The pipeline goes from identifying trending topics on Chinese social platforms to generating articles optimized for WeChat's distribution algorithm. It covers the entire workflow that content teams typically do manually across multiple tools. The catch: Chinese-language tool for a Chinese platform. If you don't publish on WeChat, this is not for you. AI-generated content at scale raises quality questions regardless of platform, and WeChat has its own content policies around automated publishing that you need to understand before running this at volume.
Baoyu-design turns your AI coding agent into a UI designer. Describe a screen and it generates a polished, self-contained HTML mockup, prototype, wireframe, or even a slide deck, right inside Cursor or Claude Code. It's a local, MIT-licensed take on Anthropic's claude.ai/design feature, so you get that capability without the hosted product. It bundles two dozen specialized sub-skills for design systems, decks, mobile layouts, and exports, and it runs across Claude Code, Cursor, Codex, and a long list of other agents. Output is self-contained HTML, and it can export to PDF, editable PowerPoint, and into Figma or Canva. Install is a single command through the skills CLI. This is the open alternative to claude.ai/design, and it overlaps the territory of Figma AI, v0, and Lovable for spinning up mockups fast. It does not replace Figma as a collaborative design tool, it replaces the "generate me a first draft" step. Solo builders and small teams who want quick, throwaway UI drafts: install it and go. The catch: quality leans hard on running a strong model, Opus 4.8 specifically, inside a paid coding agent, so "free" assumes you already pay for the agent underneath. It's also days old and solo-maintained, so durability is unproven.
Chromex is a Chrome side-panel extension that connects your browser to OpenAI's Codex through a local native messaging bridge. Summarize pages, work across tabs and screenshots, edit images, transcribe voice, and run browser-control workflows with visible in-page indicators. MIT licensed. Setup is heavier than a typical extension: clone the repo, `npm install && npm run build`, run `install-native-host.mjs`, then load the unpacked extension at `chrome://extensions`. The architecture (Chrome extension to native host to local bridge to codex app-server) keeps your API key local; raw keys aren't stored in extension storage. Pick this if you live in Chrome, already run Codex, and want one assistant that sees the page you're on. Solo: free, you pay only for Codex tokens. Small teams: same. Large teams or non-Codex shops: skip; this is built around Codex specifically. The catch: Codex-only. Switch to Claude or Gemini for your CLI agent and Chromex doesn't follow. The native bridge is only as polished as the project, which is small and early. For a more mature Chrome AI assistant, Sider and Monica have more features and broader model support.
Pixel2motion takes a static logo and turns it into an animated SVG, through your AI coding agent. Hand it a PNG, JPG, or WebP and it produces a smooth animated vector, an interactive HTML demo, GIF and video previews, and automated checks on the animation quality. It is aimed squarely at brand and logo motion work, MIT-licensed and free. Setup is heavier than most agent skills. The rendering and QA run locally and lean on a real toolchain: Python 3.10+, Pillow, NumPy, and Playwright driving a headless Chrome. So 'install and go' here means installing that stack first. It is also young, around twenty commits, and tied to running inside Claude Code or Codex rather than working standalone. This is for designers and developers who already live in an AI agent and want logo animation without opening After Effects. Solo or team, it is free. Skip it if you do not work inside an agent, or if you only need a single animation, in which case a motion designer or a one-off tool is less setup than standing up Playwright and Chrome. The catch is the toolchain and the maturity. The idea is sharp and the built-in QA step is a nice touch, but you pay for it in local dependencies and early-project rough edges. For polished, hand-crafted brand motion, this is a starting point, not a replacement for a real animator.
pm-claude-skills is a library of ready-made skills you install into Claude Code or another AI agent to give it senior-professional workflows. Think PRDs, launch plans, and postmortems, around two hundred structured templates across twenty-plus fields, installed with one npx command and exportable to ChatGPT, Gemini, or Cursor. MIT, free. There is nothing to host; these are local Markdown files your agent reads. The value is entirely in the templates, so it lives or dies on whether the bundled frameworks match how you actually work. Across two hundred-plus skills the quality will be uneven, and a generic PRD template is only as useful as your willingness to adapt it. Treat it as a starting library, not gospel. This is for product managers and other knowledge workers using an AI agent who would rather start from a structured template than write every prompt from scratch. Solo or team, it is free. If you already have refined prompts you trust, or you bristle at one-size-fits-all frameworks, you will not get much from it. The catch is that this is content, not software. It does not make your agent smarter; it gives it scaffolding. Good scaffolding saves time. Generic scaffolding just adds a step. Skim what is in the box before you build a workflow around it.
vibecode-pro-max-kit is an installable bundle of agents, skills, and lifecycle hooks for Claude Code, Cursor, Codex, and similar AI coding environments. The pitch is a spec-driven workflow that pushes you through research, innovate, plan, and execute phases with explicit approval gates instead of letting the agent run free. MIT-licensed, free, installed by curl. Setup is a 30-second curl that drops 12 specialized agents, 31 skills, and hooks into your project's .claude/ directory. The framework's bet is that disciplined phase gates beat raw agent autonomy: you don't get past planning without approving the plan, and you don't get past execution without a spec to check against. Works across any tech stack since it's harness-level, not language-level. For solo developers who keep losing context with vibe-coded agent runs, this is a structured replacement. Small teams adopting it together get a shared coding workflow without having to enforce one manually. Larger orgs will want to read what the agents actually do before standardizing on someone else's prompt library, but the core idea (gate the agent, don't free-run it) is good practice. The catch: this is an opinionated workflow, not a tool. Skip the gates and you've installed 12 agents producing noise. The upstream also changes whenever the maintainer changes their mind, which is its own kind of dependency. If you want to enforce spec-driven coding with AI, this is one way; if you want the agent to be fast and loose, skip it.
Cloudflare's Security Audit Skill turns a coding agent into a vulnerability auditor. A skill is a module you add to an AI coding agent to give it a specific job. This one runs a six-phase audit: it maps your app, sends parallel agents to attack from different angles, then has separate agents try to disprove each finding before it gets reported. The philosophy is blunt, only report what you can actually exploit, with a concrete attack scenario, not a checklist of maybes. MIT licensed and free. There is no service to run. You install it into your agent with one command, npx skills add, and ask it to audit a codebase. The only real dependency is a coding agent that supports tool use and parallel sub-agents, plus Node for schema validation. The cost you do pay is model tokens, since the multi-phase, multi-agent design burns through a lot of them on a real codebase. The independent verification pass exists to cut false positives, which is the usual failure mode of AI security scanners. For solo developers and small teams without a security budget, this is a strong first pass and it is free. Larger teams should treat it as one input, not a replacement for a real pentest or a human reviewer. The catch: it is only as good as the agent running it and the tokens you feed it. It finds plausible issues and verifies them, but it does not replace someone who actually understands your threat model.
Guard-skills is a pack of quality gates for AI coding agents. You point your agent at them and they run a second pass over the code it just wrote, catching the specific ways AI-generated code tends to fail: swallowed errors, hardcoded "success" returns, hallucinated APIs, tests that assert nothing. Free and MIT licensed. Five skills cover the bases. One guards general clean code, one guards test quality (mock abuse, duplicate tests, implementation-detail assertions), and one treats your docs as claims and verifies each against the actual code. Two more are specific to WordPress and WooCommerce, handling escaping, sanitization, and money math. Install is a one-liner through the skills CLI, with nothing to host. If you let an AI agent write code, this is a cheap safety net for catching the slop before it ships. It's complementary to your real CI tooling, not a replacement for it. Useful at any team size, though heavier teams will still want proper linters and human review on top. The catch: it's brand new, just a couple of commits in, and tied to agents that support the skills format. Half the pack is WordPress and WooCommerce specific, so a general developer really gets three of the five skills. Worth watching as it grows.
qiaomu-goal-meta-skill is a skill for Claude Code and Codex that takes a vague request like 'build me an app' and turns it into a structured goal spec: clear success criteria, constraints, safety boundaries, and conditions where the agent should stop and ask. The point is to make an AI agent plan properly before it starts changing things. MIT, free, one command to install. There is nothing to run and no setup beyond adding the skill to your agent. It is narrowly scoped: it generates tight, bounded task specifications, with conservative defaults and pause-on-risk conditions baked in. It is also a young, single-commit project, and the documentation is primarily in Chinese with an English mirror, so set expectations accordingly. This is for people using a goal-style agent who keep getting burned by under-specified prompts and want guardrails between 'I have an idea' and 'the agent is editing files.' Solo or team, it is free. If you already write detailed, bounded task specs by hand, or you do not use a compatible agent, you will not get much from it. The catch is scope and maturity. It does one specific thing, generate disciplined goal specs, and it only helps inside a compatible agent. Treat it as a useful habit-former for agentic work, not a finished framework.
Recall fixes the annoying part of working with Claude Code across sessions: re-explaining your project every time you come back. It captures what happened in a session, the prompts, the responses, the files you touched, the commands you ran, and boils it down to a short summary the agent can read next time to pick up where you left off. MIT licensed, free, and it runs entirely on your machine. The clever bit is that it summarizes locally with classic text-ranking algorithms, TF-IDF and TextRank, so there is no extra LLM call and no token cost for the summary itself. It drops two files in a .recall/ folder: an append-only history log and an auto-generated context digest with your goal, key files, and next steps. It even redacts common secret patterns before writing anything to disk. No setup, works offline. This is a Claude Code plugin, not an MCP server, so it slots into the workflow without extra infrastructure. Solo developers juggling several projects get the most out of it: a 1 to 2K token digest is far cheaper than pasting your whole project history back in, and it works with whatever Claude subscription you already have. The catch is that a TF-IDF summary is compression, not understanding. It surfaces what looks important by word frequency, which is usually enough to orient the agent but occasionally misses the one detail that mattered. Skim the context file before you trust it to have remembered everything.
compass-skills is a set of Markdown skills that help an AI coding agent manage long, multi-session work. It gets the agent to clarify the task, keep a repo-local memory of goals as a kind of task tree, hand off context cleanly between sessions, and follow your collaboration preferences. Everything stays in local plaintext, nothing is uploaded. MIT, free. There is nothing to run and no remote piece; the state sits as plaintext in your repo. It is narrowly aimed at the problem of agents losing the thread across long projects and multiple sessions. It is also very new, a couple of weeks old and mostly one author, so the conventions may shift and longevity is unproven. Early but pointed at a real pain. This is for developers running Claude Code or Codex on long-running projects who keep losing context between sessions and want persistent task memory. Solo or team, it is free. If your work is short and single-session, or you do not want to adopt another agent convention layer, skip it. The catch is maturity. The idea, giving an agent durable memory and clean handoffs, is a real one, but this is an early, single-author project, so bet on the concept more than the specific implementation. If it sticks, great; if it stalls, you have not lost much.