zhongwei/gh-racurry-neat-little-package-plugins-box-factory

Files

Zhongwei Li 2b34b5aa74 Initial commit

2025-11-30 08:48:43 +08:00

7.4 KiB

Raw Permalink Blame History

name, description

name	description
box-factory-agent-design	Interpretive guidance for designing Claude Code agents. Helps apply official documentation effectively and avoid common pitfalls. Use when creating or reviewing agents.

Agent Design Skill

This skill provides interpretive guidance for creating Claude Code agents. ALWAYS fetch current official documentation before creating agents - this skill helps you understand what the docs mean, not replace them.

Required Reading Before Creating Agents

Fetch these docs with WebFetch every time:

https://code.claude.com/docs/en/sub-agents.md - Core specification and examples
https://code.claude.com/docs/en/settings#tools-available-to-claude - Verify tool names
https://code.claude.com/docs/en/model-config.md - Current model options

Critical Architecture Understanding

Agents operate in isolated context with a return-based model:

User ↔ Main Claude → Agent (isolated, returns results)

Critical implications:

Agents CAN'T ask users questions
Agents CAN'T interact with users directly
Agents SHOULD do actual work (run code, write files, analyze code) when appropriate
Main agent handles ALL user communication

The key distinction: No user interaction (no asking questions, no confirming), but full ability to do autonomous work within scope.

Common misconception: If your agent prompt includes phrases like "ask the user", "gather from user", "clarify with user" - you've misunderstood the architecture.

Decision Framework

Agent vs Command vs Skill

Use Agent when:

Need isolated context (won't pollute main conversation)
Want autonomous delegation (triggered by context)
Require specific tool restrictions
Task runs as part of larger workflows

Use Command when:

User explicitly triggers it
Simple, straightforward task
No need for context isolation

Use Skill when:

Knowledge needed by multiple contexts
Procedural expertise that's substantial
Progressive disclosure would save tokens

Tool Selection Philosophy

Match tools to autonomous responsibilities:

If agent's job is to write files → include Write/Edit
If agent only analyzes → Read, Grep, Glob only
Never include AskUserQuestion (agents can't use it)

Common mistake: Over-restricting tools because you're thinking "safety"

Reality: An agent whose job is generating code but only has Read tool can't do its job

Balance: Reviewers should be read-only; builders need write access

Description Field Design

The description field determines when Claude delegates to your agent. This is critical for autonomous invocation.

Official requirement: "Natural language explanation of when to invoke the subagent"

Quality test: Would Claude invoke this agent based on context alone, or only when explicitly asked?

Guidelines:

State WHEN to use (triggering conditions), not just WHAT it does
Be specific about context and use cases
Test empirically - if your agent isn't being invoked automatically, revise the description
Avoid overly generic descriptions that match too many scenarios

Common Gotchas

Gotcha #1: User Interaction Language

Problem: Agent prompts assume they can ask questions or confirm actions

Forbidden phrases anywhere in agent prompt:

"ask the user", "gather from user", "clarify with user"
"request from user", "prompt the user", "wait for input"
"check with user", "verify with user", "confirm with user"

Replace with:

"infer from context", "use provided parameters"
"make reasonable assumptions", "use available information"
"default to [specific behavior]"

Gotcha #2: Hardcoding Version-Specific Info

Problem: Docs change; hardcoded details become outdated

Instead of:

Available models: sonnet, opus, haiku
Use these tools: Read, Write, Edit, Bash

Do this:

See model-config documentation for current options
Refer to tools documentation for current capabilities

Gotcha #3: Tool Mismatches

Problem: Tools don't match the agent's autonomous responsibilities

Examples:

❌ Code generator with only Read (can't write generated code)
❌ Test runner without Bash (can't run tests)
❌ Code reviewer with Write/Edit (should be read-only)

Solution: Grant minimal necessary permissions for the agent's actual work

Common Antipatterns

Antipattern: Overly Broad Scope

What you'll see: "Full-stack engineer agent that handles everything"

Why it fails:

Unclear when to delegate
Context pollution
Violates single responsibility principle

Solution: Split into focused agents (frontend-dev, backend-dev, db-specialist)

Antipattern: Vague Delegation Triggers

What you'll see: Great functionality, vague description

Why it fails: Agent only fires on explicit request, not autonomously

Solution: Make description specific about triggering conditions and use cases

Antipattern: Interactive Assumptions

What you'll see: "Ask user for target directory", "Confirm with user before proceeding"

Why it fails: Agents can't interact with users

Solution: "Use provided directory parameter or default to ./src", "Proceed based on available context"

System Prompt Best Practices

Structure

Use consistent markdown hierarchy:

# Agent Name (H1 - single heading)

## Purpose
[Clear statement of role]

## Process
1. Step one
2. Step two

## Guidelines
- Key principle one
- Key principle two

## Constraints
- What NOT to do
- Boundaries and limitations

Content Quality

Be specific and actionable:

✅ "Run pytest -v and parse output for failures"
❌ "Run tests and check for problems"

Define scope clearly:

✅ "Only analyze Python files in src/ directory"
❌ "Analyze code"

Include constraints:

✅ "Never modify production configuration files"
✅ "Only analyze; never modify code"

Validation Workflow

Before finalizing an agent:

Fetch official docs - Verify against current specification
Check structure - Valid YAML frontmatter, required fields present
Scan for forbidden language - No user interaction phrases
Validate tools - Match autonomous responsibilities, no AskUserQuestion
Test description - Specific triggering conditions, not generic
Review system prompt - Single H1, clear structure, actionable instructions
Verify no hardcoding - No version-specific details that will become outdated

Path Resolution

When writing agents:

If caller specifies path → use exact path
If working in .claude/agents/ → use that
Default → .claude/agents/ (project-level)
User-level (~/.claude/agents/) → only when explicitly requested

Documentation References

Authoritative sources to fetch before creating agents:

Core specifications:

https://code.claude.com/docs/en/sub-agents.md - Agent structure, examples, patterns

Tool verification:

https://code.claude.com/docs/en/settings#tools-available-to-claude - Current tool list

Model selection:

https://code.claude.com/docs/en/model-config.md - Available models, selection guidance

Workflow patterns:

https://code.claude.com/docs/en/common-workflows.md - Real-world delegation patterns

Remember: This skill helps you interpret and apply those docs effectively. Always fetch current documentation for specifications and details.

7.4 KiB Raw Permalink Blame History