addyosmani

    addyosmani/agent-skills

    Production-grade engineering skills for AI coding agents.

    cli
    agent-skills
    antigravity
    antigravity-ide
    claude-code
    cursor
    skills
    Shell
    MIT
    17.9K stars
    2.2K forks
    17.9K watching
    Updated 4/20/2026
    View on GitHub
    Backblaze Advertisement

    Loading star history...

    Health Score

    25.11

    Weekly Growth

    +4.6K

    +25.7% this week

    Contributors

    1

    Total contributors

    Open Issues

    40

    Generated Insights

    About agent-skills

    Agent Skills

    Production-grade engineering skills for AI coding agents.

    Skills encode the workflows, quality gates, and best practices that senior engineers use when building software. These ones are packaged so AI agents follow them consistently across every phase of development.

      DEFINE          PLAN           BUILD          VERIFY         REVIEW          SHIP
     ┌──────┐      ┌──────┐      ┌──────┐      ┌──────┐      ┌──────┐      ┌──────┐
     │ Idea │ ───▶ │ Spec │ ───▶ │ Code │ ───▶ │ Test │ ───▶ │  QA  │ ───▶ │  Go  │
     │Refine│      │  PRD │      │ Impl │      │Debug │      │ Gate │      │ Live │
     └──────┘      └──────┘      └──────┘      └──────┘      └──────┘      └──────┘
      /spec          /plan          /build        /test         /review       /ship
    

    Commands

    7 slash commands that map to the development lifecycle. Each one activates the right skills automatically.

    What you're doingCommandKey principle
    Define what to build/specSpec before code
    Plan how to build it/planSmall, atomic tasks
    Build incrementally/buildOne slice at a time
    Prove it works/testTests are proof
    Review before merge/reviewImprove code health
    Simplify the code/code-simplifyClarity over cleverness
    Ship to production/shipFaster is safer

    Skills also activate automatically based on what you're doing — designing an API triggers api-and-interface-design, building UI triggers frontend-ui-engineering, and so on.


    Quick Start

    Claude Code (recommended)

    Marketplace install:

    /plugin marketplace add addyosmani/agent-skills
    /plugin install agent-skills@addy-agent-skills
    

    SSH errors? The marketplace clones repos via SSH. If you don't have SSH keys set up on GitHub, either add your SSH key or switch to HTTPS for fetches only:

    git config --global url."https://github.com/".insteadOf "[email protected]:"
    

    Local / development:

    git clone https://github.com/addyosmani/agent-skills.git
    claude --plugin-dir /path/to/agent-skills
    
    Cursor

    Copy any SKILL.md into .cursor/rules/, or reference the full skills/ directory. See docs/cursor-setup.md.

    Gemini CLI

    Install as native skills for auto-discovery, or add to GEMINI.md for persistent context. See docs/gemini-cli-setup.md.

    Install from the repo:

    gemini skills install https://github.com/addyosmani/agent-skills.git --path skills
    

    Install from a local clone:

    gemini skills install ./agent-skills/skills/
    
    Windsurf

    Add skill contents to your Windsurf rules configuration. See docs/windsurf-setup.md.

    OpenCode

    Uses agent-driven skill execution via AGENTS.md and the skill tool.

    See docs/opencode-setup.md.

    GitHub Copilot

    Use agent definitions from agents/ as Copilot personas and skill content in .github/copilot-instructions.md. See docs/copilot-setup.md.

    Kiro IDE & CLI Skills for Kiro reside under ".kiro/skills/" and can be stored under Project or Global level. Kiro also supports Agents.md. See Kiro docs at https://kiro.dev/docs/skills/
    Codex / Other Agents

    Skills are plain Markdown - they work with any agent that accepts system prompts or instruction files. See docs/getting-started.md.


    All 20 Skills

    The commands above are the entry points. Under the hood, they activate these 20 skills — each one a structured workflow with steps, verification gates, and anti-rationalization tables. You can also reference any skill directly.

    Define - Clarify what to build

    SkillWhat It DoesUse When
    idea-refineStructured divergent/convergent thinking to turn vague ideas into concrete proposalsYou have a rough concept that needs exploration
    spec-driven-developmentWrite a PRD covering objectives, commands, structure, code style, testing, and boundaries before any codeStarting a new project, feature, or significant change

    Plan - Break it down

    SkillWhat It DoesUse When
    planning-and-task-breakdownDecompose specs into small, verifiable tasks with acceptance criteria and dependency orderingYou have a spec and need implementable units

    Build - Write the code

    SkillWhat It DoesUse When
    incremental-implementationThin vertical slices - implement, test, verify, commit. Feature flags, safe defaults, rollback-friendly changesAny change touching more than one file
    test-driven-developmentRed-Green-Refactor, test pyramid (80/15/5), test sizes, DAMP over DRY, Beyonce Rule, browser testingImplementing logic, fixing bugs, or changing behavior
    context-engineeringFeed agents the right information at the right time - rules files, context packing, MCP integrationsStarting a session, switching tasks, or when output quality drops
    source-driven-developmentGround every framework decision in official documentation - verify, cite sources, flag what's unverifiedYou want authoritative, source-cited code for any framework or library
    frontend-ui-engineeringComponent architecture, design systems, state management, responsive design, WCAG 2.1 AA accessibilityBuilding or modifying user-facing interfaces
    api-and-interface-designContract-first design, Hyrum's Law, One-Version Rule, error semantics, boundary validationDesigning APIs, module boundaries, or public interfaces

    Verify - Prove it works

    SkillWhat It DoesUse When
    browser-testing-with-devtoolsChrome DevTools MCP for live runtime data - DOM inspection, console logs, network traces, performance profilingBuilding or debugging anything that runs in a browser
    debugging-and-error-recoveryFive-step triage: reproduce, localize, reduce, fix, guard. Stop-the-line rule, safe fallbacksTests fail, builds break, or behavior is unexpected

    Review - Quality gates before merge

    SkillWhat It DoesUse When
    code-review-and-qualityFive-axis review, change sizing (~100 lines), severity labels (Nit/Optional/FYI), review speed norms, splitting strategiesBefore merging any change
    code-simplificationChesterton's Fence, Rule of 500, reduce complexity while preserving exact behaviorCode works but is harder to read or maintain than it should be
    security-and-hardeningOWASP Top 10 prevention, auth patterns, secrets management, dependency auditing, three-tier boundary systemHandling user input, auth, data storage, or external integrations
    performance-optimizationMeasure-first approach - Core Web Vitals targets, profiling workflows, bundle analysis, anti-pattern detectionPerformance requirements exist or you suspect regressions

    Ship - Deploy with confidence

    SkillWhat It DoesUse When
    git-workflow-and-versioningTrunk-based development, atomic commits, change sizing (~100 lines), the commit-as-save-point patternMaking any code change (always)
    ci-cd-and-automationShift Left, Faster is Safer, feature flags, quality gate pipelines, failure feedback loopsSetting up or modifying build and deploy pipelines
    deprecation-and-migrationCode-as-liability mindset, compulsory vs advisory deprecation, migration patterns, zombie code removalRemoving old systems, migrating users, or sunsetting features
    documentation-and-adrsArchitecture Decision Records, API docs, inline documentation standards - document the whyMaking architectural decisions, changing APIs, or shipping features
    shipping-and-launchPre-launch checklists, feature flag lifecycle, staged rollouts, rollback procedures, monitoring setupPreparing to deploy to production

    Agent Personas

    Pre-configured specialist personas for targeted reviews:

    AgentRolePerspective
    code-reviewerSenior Staff EngineerFive-axis code review with "would a staff engineer approve this?" standard
    test-engineerQA SpecialistTest strategy, coverage analysis, and the Prove-It pattern
    security-auditorSecurity EngineerVulnerability detection, threat modeling, OWASP assessment

    Reference Checklists

    Quick-reference material that skills pull in when needed:

    ReferenceCovers
    testing-patterns.mdTest structure, naming, mocking, React/API/E2E examples, anti-patterns
    security-checklist.mdPre-commit checks, auth, input validation, headers, CORS, OWASP Top 10
    performance-checklist.mdCore Web Vitals targets, frontend/backend checklists, measurement commands
    accessibility-checklist.mdKeyboard nav, screen readers, visual design, ARIA, testing tools

    How Skills Work

    Every skill follows a consistent anatomy:

    ┌─────────────────────────────────────────────┐
    │  SKILL.md                                   │
    │                                             │
    │  ┌─ Frontmatter ─────────────────────────┐  │
    │  │ name: lowercase-hyphen-name           │  │
    │  │ description: Use when [trigger]       │  │
    │  └───────────────────────────────────────┘  │
    │                                             │
    │  Overview         → What this skill does    │
    │  When to Use      → Triggering conditions   │
    │  Process          → Step-by-step workflow   │
    │  Rationalizations → Excuses + rebuttals     │
    │  Red Flags        → Signs something's wrong │
    │  Verification     → Evidence requirements   │
    └─────────────────────────────────────────────┘
    

    Key design choices:

    • Process, not prose. Skills are workflows agents follow, not reference docs they read. Each has steps, checkpoints, and exit criteria.
    • Anti-rationalization. Every skill includes a table of common excuses agents use to skip steps (e.g., "I'll add tests later") with documented counter-arguments.
    • Verification is non-negotiable. Every skill ends with evidence requirements - tests passing, build output, runtime data. "Seems right" is never sufficient.
    • Progressive disclosure. The SKILL.md is the entry point. Supporting references load only when needed, keeping token usage minimal.

    Project Structure

    agent-skills/
    ├── skills/                            # 20 core skills (SKILL.md per directory)
    │   ├── idea-refine/                   #   Define
    │   ├── spec-driven-development/       #   Define
    │   ├── planning-and-task-breakdown/   #   Plan
    │   ├── incremental-implementation/    #   Build
    │   ├── context-engineering/           #   Build
    │   ├── source-driven-development/     #   Build
    │   ├── frontend-ui-engineering/       #   Build
    │   ├── test-driven-development/       #   Build
    │   ├── api-and-interface-design/      #   Build
    │   ├── browser-testing-with-devtools/ #   Verify
    │   ├── debugging-and-error-recovery/  #   Verify
    │   ├── code-review-and-quality/       #   Review
    │   ├── code-simplification/          #   Review
    │   ├── security-and-hardening/        #   Review
    │   ├── performance-optimization/      #   Review
    │   ├── git-workflow-and-versioning/   #   Ship
    │   ├── ci-cd-and-automation/          #   Ship
    │   ├── deprecation-and-migration/     #   Ship
    │   ├── documentation-and-adrs/        #   Ship
    │   ├── shipping-and-launch/           #   Ship
    │   └── using-agent-skills/            #   Meta: how to use this pack
    ├── agents/                            # 3 specialist personas
    ├── references/                        # 4 supplementary checklists
    ├── hooks/                             # Session lifecycle hooks
    ├── .claude/commands/                  # 7 slash commands
    └── docs/                              # Setup guides per tool
    

    Why Agent Skills?

    AI coding agents default to the shortest path - which often means skipping specs, tests, security reviews, and the practices that make software reliable. Agent Skills gives agents structured workflows that enforce the same discipline senior engineers bring to production code.

    Each skill encodes hard-won engineering judgment: when to write a spec, what to test, how to review, and when to ship. These aren't generic prompts - they're the kind of opinionated, process-driven workflows that separate production-quality work from prototype-quality work.

    Skills bake in best practices from Google's engineering culture — including concepts from Software Engineering at Google and Google's engineering practices guide. You'll find Hyrum's Law in API design, the Beyonce Rule and test pyramid in testing, change sizing and review speed norms in code review, Chesterton's Fence in simplification, trunk-based development in git workflow, Shift Left and feature flags in CI/CD, and a dedicated deprecation skill treating code as a liability. These aren't abstract principles — they're embedded directly into the step-by-step workflows agents follow.


    Contributing

    Skills should be specific (actionable steps, not vague advice), verifiable (clear exit criteria with evidence requirements), battle-tested (based on real workflows), and minimal (only what's needed to guide the agent).

    See docs/skill-anatomy.md for the format specification and CONTRIBUTING.md for guidelines.


    License

    MIT - use these skills in your projects, teams, and tools.

    Discover Repositories

    Search across tracked repositories by name or description