Initial commit

2025-11-30 08:48:43 +08:00
commit 2b34b5aa74
25 changed files with 7414 additions and 0 deletions
--- a/skills/agent-design/SKILL.md
+++ b/skills/agent-design/SKILL.md
@@ -0,0 +1,250 @@
+---
+name: box-factory-agent-design
+description: Interpretive guidance for designing Claude Code agents. Helps apply official documentation effectively and avoid common pitfalls. Use when creating or reviewing agents.
+---
+
+# Agent Design Skill
+
+This skill provides interpretive guidance for creating Claude Code agents. **ALWAYS fetch current official documentation before creating agents** - this skill helps you understand what the docs mean; it does not replace them.
+
+## Required Reading Before Creating Agents
+
+Fetch these docs with WebFetch every time:
+
+- **<https://code.claude.com/docs/en/sub-agents.md>** - Core specification and examples
+- **<https://code.claude.com/docs/en/settings#tools-available-to-claude>** - Verify tool names
+- **<https://code.claude.com/docs/en/model-config.md>** - Current model options
+
+## Critical Architecture Understanding
+
+Agents operate in **isolated context** with a **return-based model**:
+
+```
+User ↔ Main Claude → Agent (isolated, returns results)
+```
+
+**Critical implications:**
+
+- Agents CAN'T ask users questions
+- Agents CAN'T interact with users directly
+- Agents SHOULD do actual work (run code, write files, analyze code) when appropriate
+- Main agent handles ALL user communication
+
+**The key distinction:** No user interaction (no asking questions, no confirming), but full ability to do autonomous work within scope.
+
+**Common misconception:** If your agent prompt includes phrases like "ask the user", "gather from user", or "clarify with user", you've misunderstood the architecture.
+
+## Decision Framework
+
+### Agent vs Command vs Skill
+
+**Use Agent when:**
+
+- Need isolated context (won't pollute main conversation)
+- Want autonomous delegation (triggered by context)
+- Require specific tool restrictions
+- Task runs as part of larger workflows
+
+**Use Command when:**
+
+- User explicitly triggers it
+- Simple, straightforward task
+- No need for context isolation
+
+**Use Skill when:**
+
+- Knowledge needed by multiple contexts
+- Procedural expertise that's substantial
+- Progressive disclosure would save tokens
+
+### Tool Selection Philosophy
+
+**Match tools to autonomous responsibilities:**
+
+- If agent's job is to write files → include Write/Edit
+- If agent only analyzes → Read, Grep, Glob only
+- Never include AskUserQuestion (agents can't use it)
+
+**Common mistake:** Over-restricting tools in the name of "safety"
+
+**Reality:** An agent whose job is to generate code but that has only the Read tool can't do its job
+
+**Balance:** Reviewers should be read-only; builders need write access
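+
+A minimal sketch of the read-only reviewer case, assuming the frontmatter layout (`name`, `description`, `tools`) described in the sub-agents documentation. The agent name and description text are hypothetical, and the tool names are taken from the examples in this skill; verify both the field names and the tool names against the current docs before using them:
+
+```markdown
+---
+name: code-reviewer
+description: Reviews changed source files for bugs and style problems. Use after code has been written or modified and before it is committed.
+tools: Read, Grep, Glob
+---
+```
+
+A builder agent would follow the same shape but add Write/Edit, since its autonomous responsibility includes producing files.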
+
+## Description Field Design
+
+The `description` field determines when Claude delegates to your agent. This is critical for autonomous invocation.
+
+**Official requirement:** "Natural language explanation of when to invoke the subagent"
+
+**Quality test:** Would Claude invoke this agent based on context alone, or only when explicitly asked?
+
+**Guidelines:**
+
+- State WHEN to use (triggering conditions), not just WHAT it does
+- Be specific about context and use cases
+- Test empirically - if your agent isn't being invoked automatically, revise the description
+- Avoid overly generic descriptions that match too many scenarios
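+
+As an illustration of the WHEN-versus-WHAT guideline, here are two hypothetical `description` values for the same test-running agent. The wording is an untested example, not text from the official docs. The first only says what the agent does:
+
+```markdown
+description: Helps with testing
+```
+
+The second names the triggering conditions:
+
+```markdown
+description: Runs the project's test suite and reports failures. Use after source files change or when failing tests are mentioned.
+```
+
+Whether the second version actually triggers delegation still has to be checked empirically, as noted above.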
+
+## Common Gotchas
+
+### Gotcha #1: User Interaction Language
+
+**Problem:** Agent prompts assume they can ask questions or confirm actions
+
+**Forbidden phrases anywhere in the agent prompt:**
+
+- "ask the user", "gather from user", "clarify with user"
+- "request from user", "prompt the user", "wait for input"
+- "check with user", "verify with user", "confirm with user"
+
+**Replace with:**
+
+- "infer from context", "use provided parameters"
+- "make reasonable assumptions", "use available information"
+- "default to [specific behavior]"
+
+### Gotcha #2: Hardcoding Version-Specific Info
+
+**Problem:** Docs change; hardcoded details become outdated
+
+**Instead of:**
+
+```markdown
+Available models: sonnet, opus, haiku
+Use these tools: Read, Write, Edit, Bash
+```
+
+**Do this:**
+
+```markdown
+See model-config documentation for current options
+Refer to tools documentation for current capabilities
+```
+
+### Gotcha #3: Tool Mismatches
+
+**Problem:** Tools don't match the agent's autonomous responsibilities
+
+**Examples:**
+
+- ❌ Code generator with only Read (can't write generated code)
+- ❌ Test runner without Bash (can't run tests)
+- ❌ Code reviewer with Write/Edit (should be read-only)
+
+**Solution:** Grant minimal necessary permissions for the agent's actual work
+
+## Common Antipatterns
+
+### Antipattern: Overly Broad Scope
+
+**What you'll see:** "Full-stack engineer agent that handles everything"
+
+**Why it fails:**
+
+- Unclear when to delegate
+- Context pollution
+- Violates single responsibility principle
+
+**Solution:** Split into focused agents (frontend-dev, backend-dev, db-specialist)
+
+### Antipattern: Vague Delegation Triggers
+
+**What you'll see:** An agent with solid functionality but a vague description
+
+**Why it fails:** The agent is invoked only on explicit request, never autonomously
+
+**Solution:** Make description specific about triggering conditions and use cases
+
+### Antipattern: Interactive Assumptions
+
+**What you'll see:** "Ask user for target directory", "Confirm with user before proceeding"
+
+**Why it fails:** Agents can't interact with users
+
+**Solution:** "Use provided directory parameter or default to ./src", "Proceed based on available context"
+
+## System Prompt Best Practices
+
+### Structure
+
+Use consistent markdown hierarchy:
+
+```markdown
+# Agent Name (H1 - single heading)
+
+## Purpose
+[Clear statement of role]
+
+## Process
+1. Step one
+2. Step two
+
+## Guidelines
+- Key principle one
+- Key principle two
+
+## Constraints
+- What NOT to do
+- Boundaries and limitations
+```
+
+### Content Quality
+
+**Be specific and actionable:**
+
+- ✅ "Run pytest -v and parse output for failures"
+- ❌ "Run tests and check for problems"
+
+**Define scope clearly:**
+
+- ✅ "Only analyze Python files in src/ directory"
+- ❌ "Analyze code"
+
+**Include constraints:**
+
+- ✅ "Never modify production configuration files"
+- ✅ "Only analyze; never modify code"
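+
+Put together, a system prompt that follows the structure and content guidelines above might look like the sketch below. The agent ("Test Runner") and its steps are hypothetical, assembled from the examples in this section; it assumes the agent has been granted the tools needed to run commands and read files:
+
+```markdown
+# Test Runner
+
+## Purpose
+Run the Python test suite and report failures to the calling agent.
+
+## Process
+1. Run pytest -v and parse output for failures
+2. For each failure, read the failing test and the code under test in src/
+3. Return a summary listing each failing test and its likely cause
+
+## Constraints
+- Only analyze; never modify code
+- Never modify production configuration files
+```
+
+Note that the final step returns results to the caller rather than asking the user anything, which matches the return-based model.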
+
+## Validation Workflow
+
+Before finalizing an agent:
+
+1. **Fetch official docs** - Verify against current specification
+2. **Check structure** - Valid YAML frontmatter, required fields present
+3. **Scan for forbidden language** - No user interaction phrases
+4. **Validate tools** - Match autonomous responsibilities, no AskUserQuestion
+5. **Test description** - Specific triggering conditions, not generic
+6. **Review system prompt** - Single H1, clear structure, actionable instructions
+7. **Verify no hardcoding** - No version-specific details that will become outdated
+
+## Path Resolution
+
+When writing agent files, choose the location in this order:
+
+1. If caller specifies path → use exact path
+2. If already working in `.claude/agents/` → write the agent there
+3. Default → `.claude/agents/` (project-level)
+4. User-level (`~/.claude/agents/`) → only when explicitly requested
+
+## Documentation References
+
+Authoritative sources to fetch before creating agents:
+
+**Core specifications:**
+
+- <https://code.claude.com/docs/en/sub-agents.md> - Agent structure, examples, patterns
+
+**Tool verification:**
+
+- <https://code.claude.com/docs/en/settings#tools-available-to-claude> - Current tool list
+
+**Model selection:**
+
+- <https://code.claude.com/docs/en/model-config.md> - Available models, selection guidance
+
+**Workflow patterns:**
+
+- <https://code.claude.com/docs/en/common-workflows.md> - Real-world delegation patterns
+
+**Remember:** This skill helps you interpret and apply those docs effectively. Always fetch current documentation for specifications and details.