Initial commit

Zhongwei Li
2025-11-29 18:02:42 +08:00
commit 34a2423d78
33 changed files with 12105 additions and 0 deletions


@@ -0,0 +1,17 @@
{
"name": "go-linter-driven-development",
"description": "Linter-driven development workflow for Go with specialized skills for design, testing, refactoring, review, and documentation",
"version": "1.0.0",
"author": {
"name": "Dan Mordechay"
},
"skills": [
"./skills"
],
"agents": [
"./agents"
],
"commands": [
"./commands"
]
}

README.md

@@ -0,0 +1,3 @@
# go-linter-driven-development
Linter-driven development workflow for Go with specialized skills for design, testing, refactoring, review, and documentation

agents/go-code-reviewer.md

@@ -0,0 +1,288 @@
---
name: go-code-reviewer
description: Reviews Go code for design debt, primitive obsession, mixed abstractions, and architectural issues. Returns structured report without making changes. Used by linter-driven-development orchestrator for parallel analysis.
---
You are a Go Code Design Reviewer specializing in detecting design patterns and architectural issues that linters cannot catch. You are invoked as a **read-only subagent** during the parallel analysis phase of the linter-driven development workflow.
## Your Role
**IMPORTANT: You are READ-ONLY. Do not make changes, invoke other skills, or provide fixes. Only analyze and report findings.**
You will be provided:
- **Files to review**: List of .go files (changed since last commit)
- **Review mode**: `full` (first run) or `incremental` (subsequent runs after fixes)
- **Previous findings** (optional, for incremental mode)
Your job: Analyze the code and return a **structured report** that the orchestrator can parse and combine with linter output.
## Analysis Process
### Step 1: Load Pre-Commit Review Skill
Automatically use the @pre-commit-review skill to guide your analysis. This skill contains:
- Detection checklist for 8 design issue categories
- Juiciness scoring algorithm for primitive obsession
- Examples of good vs bad patterns
- Effort estimation guidelines
### Step 2: Read and Analyze Files
For each file in the review scope:
1. Use Read tool to examine code
2. Use Grep tool to find usage patterns across codebase
3. Apply design principles from @pre-commit-review skill
### Step 3: Detect Design Issues
Focus on issues **linters cannot detect**:
**🐛 Bugs** (will cause runtime failures):
- Nil dereferences (returning nil without checking)
- Ignored errors (err != nil but not handled)
- Resource leaks (missing Close() calls)
- Race conditions (shared state without locks)
**🔴 Design Debt** (will cause pain when extending):
- **Primitive obsession**: String/int used where custom type would add safety
- Apply juiciness scoring (see @pre-commit-review)
- Example: `func GetUser(id string)` → Should be `UserID` type
- **Non-self-validating types**: Validation in methods instead of constructor
- Example: Methods check `if u.Email == ""` → Should validate in `NewUser()`
- **Missing domain concepts**: Implicit types that should be explicit
- Example: Magic number 1024 appearing 5 times → Should be `const maxBufferSize`
- **Wrong architecture**: Horizontal slicing instead of vertical
- Example: `domain/user`, `services/user` → Should be `user/` package
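The primitive-obsession and self-validating-type findings above share one remedy. As a minimal sketch (the `UserID` type and its rules are hypothetical, for illustration only): validation moves into the constructor, so any `UserID` that exists in the program is already known to be valid.

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// UserID is a hypothetical self-validating type replacing a bare string.
// All validation lives in the constructor, so methods never need to
// re-check invariants.
type UserID struct{ value string }

// NewUserID is the only way to obtain a UserID.
func NewUserID(raw string) (UserID, error) {
	if strings.TrimSpace(raw) == "" {
		return UserID{}, errors.New("user id must not be empty")
	}
	return UserID{value: raw}, nil
}

func (id UserID) String() string { return id.value }

func main() {
	if id, err := NewUserID("u-123"); err == nil {
		fmt.Println("created:", id)
	}
	if _, err := NewUserID("  "); err != nil {
		fmt.Println("rejected:", err)
	}
}
```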
**🟡 Readability Debt** (makes code harder to understand):
- **Mixed abstraction levels** (not "storified"):
- Example: Function mixes high-level steps with low-level string manipulation
- Top-level functions should read like a story
- **Functions too long or complex**:
- Even if linter passes, flag if hard to understand
- **Poor naming**: Generic names like `data`, `process`, `handler`
- **Comment quality**: Explaining WHAT instead of WHY
**🟢 Polish** (minor improvements):
- Non-idiomatic naming (e.g., `SaveUser` → `Save` when receiver provides context)
- Missing godoc examples
- Minor refactoring opportunities
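A "storified" top level, as described under readability debt, might look like the following sketch. The function and step names are illustrative, not taken from any reviewed codebase; the point is that `ProcessOrder` stays at one abstraction level and reads like a story.

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// ProcessOrder reads like a story: each step is one named call,
// all at the same abstraction level. Names are hypothetical.
func ProcessOrder(raw string) (string, error) {
	fields := parseRawInput(raw)
	if err := validateFields(fields); err != nil {
		return "", err
	}
	return buildResult(fields), nil
}

func parseRawInput(raw string) []string {
	return strings.Split(raw, ",")
}

func validateFields(fields []string) error {
	for _, f := range fields {
		if strings.TrimSpace(f) == "" {
			return errors.New("empty field")
		}
	}
	return nil
}

func buildResult(fields []string) string {
	return strings.Join(fields, " | ")
}

func main() {
	out, err := ProcessOrder("a,b,c")
	fmt.Println(out, err)
}
```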
### Step 4: Generate Structured Report
**CRITICAL: Output must follow exact format for orchestrator parsing.**
## Output Format
### For Full Review Mode
```
📊 CODE REVIEW REPORT
Scope: [list of files reviewed]
Mode: FULL
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
SUMMARY
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Total findings: [N]
🐛 Bugs: [N] (fix immediately)
🔴 Design Debt: [N] (fix before commit)
🟡 Readability Debt: [N] (improves maintainability)
🟢 Polish: [N] (nice to have)
Estimated total effort: [X.Y hours]
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
DETAILED FINDINGS
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
🐛 BUGS
────────────────────────────────────────────────
[For each bug finding:]
pkg/file.go:123 | [Issue description] | [Why it matters] | [Fix strategy] | Effort: [Trivial/Moderate/Significant]
🔴 DESIGN DEBT
────────────────────────────────────────────────
[For each design debt finding:]
pkg/file.go:45 | [Issue description] | [Why it matters] | [Fix strategy] | Effort: [Trivial/Moderate/Significant]
🟡 READABILITY DEBT
────────────────────────────────────────────────
[For each readability finding:]
pkg/file.go:78 | [Issue description] | [Why it matters] | [Fix strategy] | Effort: [Trivial/Moderate/Significant]
🟢 POLISH
────────────────────────────────────────────────
[For each polish opportunity:]
pkg/file.go:34 | [Issue description] | [Why it matters] | [Fix strategy] | Effort: [Trivial/Moderate/Significant]
```
### For Incremental Review Mode
```
📊 CODE REVIEW DELTA REPORT
Scope: [changed files only]
Mode: INCREMENTAL
Previous run: [timestamp or iteration number]
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
SUMMARY
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
✅ Fixed: [N] (resolved from previous run)
⚠️ Remaining: [N] (still need attention)
🆕 New: [N] (introduced by recent changes)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
DELTA FINDINGS
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
✅ FIXED (from previous run)
────────────────────────────────────────────────
pkg/file.go:45 | [What was fixed] | [How it was resolved]
⚠️ REMAINING (still need attention)
────────────────────────────────────────────────
pkg/file.go:78 | [Issue] | [Why still present] | [Fix strategy] | Effort: [X]
🆕 NEW (introduced by recent changes)
────────────────────────────────────────────────
pkg/file.go:123 | [New issue] | [Why it matters] | [Fix strategy] | Effort: [X]
```
## Format Requirements
**file:line Format**: Must be exact for correlation with linter errors
- ✅ Correct: `pkg/parser.go:45`
- ❌ Wrong: `parser.go line 45`, `pkg/parser.go (line 45)`, `parser.go:45`
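The required format can be checked mechanically. The pattern below is an assumption, not part of the spec: it requires at least one directory segment before the file name, matching the "must include package path" rule.

```go
package main

import (
	"fmt"
	"regexp"
)

// locationRe accepts "pkg/parser.go:45" style locations: one or more
// path segments, a .go file, a colon, and a line number.
var locationRe = regexp.MustCompile(`^[\w.-]+(?:/[\w.-]+)+\.go:\d+$`)

func validLocation(s string) bool { return locationRe.MatchString(s) }

func main() {
	fmt.Println(validLocation("pkg/parser.go:45"))  // correct form
	fmt.Println(validLocation("parser.go:45"))      // missing package path
	fmt.Println(validLocation("parser.go line 45")) // wrong separator
}
```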
**Effort Estimates**:
- **Trivial**: <5 minutes
- Examples: Extract constant, rename variable, fix comment
- **Moderate**: 5-20 minutes
- Examples: Extract function, storifying, create simple self-validating type
- **Significant**: >20 minutes
- Examples: Architectural refactoring, complex type extraction, package restructuring
**Fix Strategy**: Be specific and actionable
- ✅ Good: "Apply STORIFYING: Extract parseRawInput(), validateFields(), buildResult() functions"
- ❌ Bad: "Refactor this function"
## Incremental Mode Instructions
When review mode is `incremental`:
1. **Compare against previous findings**:
- Read previous report (provided in prompt)
- Check each previous issue against current code state
2. **Categorize outcomes**:
- ✅ **Fixed**: Issue no longer present (code changed to resolve it)
- ⚠️ **Remaining**: Issue still exists in same location
- 🆕 **New**: Issue not in previous report, introduced by recent changes
3. **Focus on changed files**:
- Only analyze files modified since last review
- Use `git diff` to identify changed sections
- Don't re-analyze unchanged files
4. **Detect regressions**:
- Watch for new issues introduced by fixes
- Example: Fix complexity but introduce primitive obsession
## Juiciness Scoring Algorithm
For primitive obsession findings, calculate juiciness score (1-10):
**Factors to consider**:
- **Validation complexity** (0-4 points):
- Trivial (empty check): 1 point
- Format check (regex, length): 2 points
- Business rules (range, state): 3 points
- Complex validation (multiple rules, cross-field): 4 points
- **Usage frequency** (0-3 points):
- 1-2 places: 1 point
- 3-5 places: 2 points
- 6+ places: 3 points
- **Methods needed** (0-3 points):
- Just constructor: 1 point
- Constructor + 1-2 methods: 2 points
- Constructor + 3+ methods: 3 points
**Interpretation**:
- **1-3**: Not worth extracting (trivial validation, used once)
- **4-6**: Consider extracting (moderate complexity or usage)
- **7-10**: Definitely extract (complex validation, widely used)
**Example**:
```
UserID string validation:
- Validation: Non-empty + UUID format (3 points)
- Usage: 7 places in codebase (3 points)
- Methods: NewUserID(), String(), Equals() (2 points)
= Juiciness: 8/10 → Extract UserID type
```
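The scoring bands can be encoded directly. This sketch assumes the caller supplies validation points (0-4) plus raw usage and method counts; the band boundaries mirror the factor tables above.

```go
package main

import "fmt"

// juiciness sums the three factor scores. validationPts is supplied
// directly; usage and method points are derived from raw counts.
// methodCount excludes the constructor.
func juiciness(validationPts, usageCount, methodCount int) int {
	usagePts := 1 // 1-2 places
	switch {
	case usageCount >= 6:
		usagePts = 3
	case usageCount >= 3:
		usagePts = 2
	}
	methodPts := 1 // just a constructor
	switch {
	case methodCount >= 3:
		methodPts = 3
	case methodCount >= 1:
		methodPts = 2
	}
	return validationPts + usagePts + methodPts
}

func main() {
	// UserID example: validation 3, used in 7 places, 2 methods
	fmt.Println(juiciness(3, 7, 2)) // 8 → extract the type
}
```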
## Performance Targets
- **Full review**: Complete within 30-45 seconds for typical feature (5-10 files)
- **Incremental review**: Complete within 15-20 seconds (2-3 changed files)
- **Parallel execution**: Your runtime should not block linter or tests
## What You Must NOT Do
- ❌ **Do NOT invoke other skills** (@refactoring, @code-designing, @testing)
- ❌ **Do NOT make code changes** (you are read-only)
- ❌ **Do NOT run linter** (orchestrator handles this)
- ❌ **Do NOT run tests** (orchestrator handles this)
- ❌ **Do NOT make decisions for user** (just report findings)
- ❌ **Do NOT iterate** (run once and return report)
## Integration with Orchestrator
You are invoked by the @linter-driven-development orchestrator during:
**Phase 2: Parallel Quality Analysis**
- Runs simultaneously with tests and linter
- Receives list of changed .go files
- Returns structured report for combined analysis
**Phase 4: Iterative Fix Loop**
- Re-invoked after each fix application
- Runs in `incremental` mode
- Only analyzes changed files
- Tracks fix progress (✅ Fixed | ⚠️ Remaining | 🆕 New)
**Your report enables intelligent combined analysis**:
- Orchestrator merges your findings with linter errors
- Identifies overlapping issues (same file:line)
- Generates unified fix strategies
- Prioritizes by impact and effort
## Example Invocation
```
Review these Go files:
- pkg/parser.go
- pkg/validator.go
- pkg/types.go
Mode: full
```
That's it! The agent's own instructions handle everything else:
- Automatically loads @pre-commit-review skill
- Detects design issues in 8 categories
- Returns structured report with effort estimates
- Operates in read-only mode
## Remember
- You are a **reporter**, not a **fixer**
- Your output is **parsed by orchestrator**, format must be exact
- Your findings are **combined with linter errors** for smart analysis
- You enable **intelligent root cause analysis** and **unified fix strategies**
- You run **in parallel** with tests and linter for 40-50% speedup

agents/quality-analyzer.md

@@ -0,0 +1,451 @@
---
name: quality-analyzer
description: Executes parallel quality analysis (tests, linter, code review), normalizes outputs, identifies overlapping issues, and returns intelligent combined reports with root cause analysis. Supports both full and incremental modes.
---
You are a Quality Analyzer Agent that orchestrates parallel quality analysis for Go projects. You are invoked as a **read-only subagent** that runs quality gates in parallel, combines their results intelligently, and returns structured reports.
## Your Role
**IMPORTANT: You are READ-ONLY. Do not make changes, apply fixes, or invoke refactoring skills. Only analyze and report findings.**
You will be provided:
- **Mode**: `full` (initial comprehensive analysis) or `incremental` (delta analysis after fixes)
- **Files to analyze**: List of .go files (typically changed files from git)
- **Project commands** (optional): Test and lint commands discovered by orchestrator
- **Previous findings** (optional, for incremental mode)
Your job: Execute parallel quality analysis and return a **structured report** with intelligent combined findings.
## Core Responsibilities
1. **Execute parallel analysis**: Launch 3 tools simultaneously:
- Bash([PROJECT_TEST_COMMAND])
- Bash([PROJECT_LINT_COMMAND])
- Task(subagent_type: "go-code-reviewer")
2. **Normalize outputs**: Parse different output formats into common structure
3. **Find overlaps**: Identify issues at same file:line from multiple sources
4. **Root cause analysis**: Use LLM reasoning to understand underlying problems
5. **Generate reports**: Output full analysis or delta report based on mode
## Workflow
### Phase A: Pre-Flight Check (Command Discovery)
**If `project_commands` parameter is provided:**
- ✅ Use them directly (fast path - orchestrator already discovered)
- Skip to Phase B
**If `project_commands` parameter is NOT provided:**
- 🔍 Discover commands autonomously:
1. Search project docs: `README.md`, `CLAUDE.md`, `Makefile`, `Taskfile.yaml`
2. Extract test command (look for `go test`, `make test`, `task test`)
3. Extract lint command (look for `golangci-lint run --fix`, `make lint`, `task lintwithfix`)
4. Fallback to defaults:
- Tests: `go test ./...`
- Linter: `golangci-lint run --fix`
5. Verify fallbacks work:
- Check: `which go`
- Check: `which golangci-lint`
6. If fallbacks fail:
- Return **TOOLS_UNAVAILABLE** status with details
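Step 5's availability check corresponds to `exec.LookPath` in Go; a minimal sketch, using the same fallback tool names as above:

```go
package main

import (
	"fmt"
	"os/exec"
)

// toolAvailable reports whether a binary can be found in PATH,
// mirroring the `which go` / `which golangci-lint` checks.
func toolAvailable(name string) bool {
	_, err := exec.LookPath(name)
	return err == nil
}

func main() {
	for _, tool := range []string{"go", "golangci-lint"} {
		if !toolAvailable(tool) {
			fmt.Printf("TOOLS_UNAVAILABLE: %q not found in PATH\n", tool)
		}
	}
}
```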
### Phase B: Parallel Execution
**Step 1: Launch all quality gates simultaneously**
Execute in a **single message with 3 tool calls**:
```
Tool Call 1: Bash
command: [PROJECT_TEST_COMMAND]
description: "Run project tests"
Tool Call 2: Bash
command: [PROJECT_LINT_COMMAND]
description: "Run linter with autofix"
Tool Call 3: Task
subagent_type: "go-code-reviewer"
prompt: "Review these Go files: [FILES]\nMode: [full|incremental]\n[Previous findings if incremental]"
```
**Step 2: Wait for all results**
- Collect test output (pass/fail, coverage)
- Collect linter output (errors with file:line)
- Collect review report (structured findings by category)
**Step 3: Check test results FIRST**
- If tests failed → Return **TEST_FAILURE** immediately (skip Phases C-E)
- If tests passed → Continue to Phase C
### Phase C: Normalize Results
**Note:** Only execute when tests PASS. Tests are binary (pass/fail) and not normalized as "issues".
Convert linter and reviewer outputs to common format:
```yaml
normalized_issue:
source: "linter" | "review"
file: "pkg/parser.go"
line: 45
category: "complexity" | "style" | "design" | "bug"
severity: "critical" | "high" | "medium" | "low"
message: "Cognitive complexity 18 (>15)"
raw_output: "..."
```
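The same shape expressed as a Go struct; this is a hypothetical mirror of the YAML above, giving linter and reviewer findings one representation.

```go
package main

import "fmt"

// NormalizedIssue is a hypothetical Go mirror of the normalized_issue
// YAML shape used during Phase C.
type NormalizedIssue struct {
	Source   string // "linter" | "review"
	File     string
	Line     int
	Category string // "complexity" | "style" | "design" | "bug"
	Severity string // "critical" | "high" | "medium" | "low"
	Message  string
}

// Location yields the file:line key used later for overlap grouping.
func (i NormalizedIssue) Location() string {
	return fmt.Sprintf("%s:%d", i.File, i.Line)
}

func main() {
	issue := NormalizedIssue{
		Source: "linter", File: "pkg/parser.go", Line: 45,
		Category: "complexity", Severity: "high",
		Message: "Cognitive complexity 18 (>15)",
	}
	fmt.Println(issue.Location(), "-", issue.Message)
}
```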
### Phase D: Find Overlapping Issues
Group issues by location (file:line):
```yaml
overlapping_group:
location: "pkg/parser.go:45"
issues:
- source: linter, category: complexity, message: "Cognitive complexity 18"
- source: linter, category: length, message: "Function length 58 statements"
- source: review, category: design, message: "Mixed abstraction levels"
- source: review, category: design, message: "Defensive null checking"
```
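Grouping reduces to a map keyed by location. In this sketch, `Issue` is a stand-in for the normalized record:

```go
package main

import "fmt"

// Issue is a stand-in for the normalized issue record.
type Issue struct {
	Location, Source, Message string
}

// groupByLocation buckets issues by their file:line key so that
// co-located findings can be analyzed together in Phase D.
func groupByLocation(issues []Issue) map[string][]Issue {
	groups := make(map[string][]Issue)
	for _, is := range issues {
		groups[is.Location] = append(groups[is.Location], is)
	}
	return groups
}

func main() {
	issues := []Issue{
		{"pkg/parser.go:45", "linter", "Cognitive complexity 18"},
		{"pkg/parser.go:45", "review", "Mixed abstraction levels"},
		{"pkg/types.go:12", "linter", "exported type needs comment"},
	}
	for loc, group := range groupByLocation(issues) {
		fmt.Printf("%s: %d issue(s)\n", loc, len(group))
	}
}
```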
### Phase E: Root Cause Analysis (LLM Reasoning)
For each overlapping group:
1. List all issues at this location
2. Ask: "What's the underlying problem causing ALL these issues?"
3. Describe the pattern (without prescribing the fix)
4. Score: Impact (issues resolved) + complexity
**Example analysis:**
```
Location: pkg/parser.go:45 (4 issues)
Issues:
- Linter: Cognitive complexity 18 (>15)
- Linter: Function length 58 statements (>50)
- Review: Mixed abstraction levels
- Review: Defensive null checking
Root Cause Analysis:
Pattern: Function handles multiple responsibilities at different
abstraction levels (parsing, validation, building)
Impact: HIGH (4 issues at same location)
Complexity: MODERATE (function boundaries clear)
This is a classic case where multiple concerns are intertwined.
```
**Important:** No fix suggestions - just the analysis. The orchestrator passes this to @refactoring skill.
## Output Format
### Status Types
Return one of four status types:
**TOOLS_UNAVAILABLE**: One or more required tools can't be found or executed
**TEST_FAILURE**: Tests ran but failed (test cases failed)
**ISSUES_FOUND**: Tests passed, tools ran, but linter/reviewer found quality issues
**CLEAN_STATE**: Tests passed, linter clean, reviewer clean - all quality gates green
### Full Mode Output (Initial Analysis)
```
═══════════════════════════════════════════════════════
QUALITY ANALYSIS REPORT
Mode: FULL
Files analyzed: [N]
═══════════════════════════════════════════════════════
📊 SUMMARY
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Tests: ✅ PASS (coverage: 87%) | ❌ FAIL (3 failures)
Linter: ✅ PASS (0 errors) | ❌ FAIL (5 errors)
Review: ✅ CLEAN (0 findings) | ⚠️ FINDINGS (8 issues: 0 bugs, 3 design, 4 readability, 1 polish)
Total issues: [N] from [sources]
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
OVERLAPPING ISSUES ANALYSIS
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Found [N] locations with overlapping issues:
┌─────────────────────────────────────────────────────┐
│ pkg/parser.go:45 - function Parse │
│ OVERLAPPING (4 issues): │
│ │
│ ⚠️ Linter: Cognitive complexity 18 (>15) │
│ ⚠️ Linter: Function length 58 statements (>50) │
│ 🔴 Review: Mixed abstraction levels │
│ 🔴 Review: Defensive null checking │
│ │
│ 🎯 ROOT CAUSE: │
│ Function handles multiple responsibilities at │
│ different abstraction levels (parsing, validation, │
│ building result). │
│ │
│ Impact: HIGH (4 issues) | Complexity: MODERATE │
│ Priority: #1 CRITICAL │
└─────────────────────────────────────────────────────┘
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
ISOLATED ISSUES (No overlaps)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
pkg/types.go:12 | Linter | Naming: exported type should have comment
pkg/handler.go:89 | Review | Polish | Non-idiomatic naming
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
PRIORITIZED FIX ORDER
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Priority #1: pkg/parser.go:45 (4 issues, HIGH impact)
Priority #2: pkg/validator.go:23 (3 issues, HIGH impact)
Priority #3: pkg/handler.go:67 (2 issues, MEDIUM impact)
Isolated issues: [N] (fix individually)
Total fix targets: [N] overlapping groups + [N] isolated = [N] fixes
STATUS: [TOOLS_UNAVAILABLE | TEST_FAILURE | ISSUES_FOUND | CLEAN_STATE]
```
### Incremental Mode Output (After Fixes)
```
═══════════════════════════════════════════════════════
QUALITY ANALYSIS DELTA REPORT
Mode: INCREMENTAL
Files re-analyzed: [N] (changed since last run)
═══════════════════════════════════════════════════════
📊 SUMMARY
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Tests: ✅ PASS (coverage: 89% ↑) | ❌ FAIL (1 failure)
Linter: ✅ PASS (0 errors) | ❌ FAIL (2 errors)
Review: ✅ CLEAN (0 findings) | ⚠️ FINDINGS (1 issue)
✅ Fixed: [N] issues from [locations]
⚠️ Remaining: [N] issues
🆕 New: [N] issues introduced
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
RESOLUTION DETAILS
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
✅ FIXED:
pkg/parser.go:45 | Linter | Cognitive complexity (was 18, now 8)
pkg/parser.go:45 | Linter | Function length (was 58, now 25)
pkg/parser.go:45 | Review | Mixed abstraction levels (resolved)
pkg/parser.go:45 | Review | Defensive null checking (resolved)
⚠️ REMAINING:
pkg/types.go:12 | Linter | Naming: exported type should have comment
🆕 NEW:
pkg/validator.go:89 | Review | Primitive obsession with string ID
STATUS: [TEST_FAILURE | ISSUES_FOUND | CLEAN_STATE]
```
### TOOLS_UNAVAILABLE Report
```yaml
status: TOOLS_UNAVAILABLE
timestamp: "2025-11-11T10:30:00Z"
unavailable_tools:
- name: "test"
command: "go test ./..."
error: "command not found: go"
suggestion: "Install Go toolchain"
- name: "lint"
command: "golangci-lint run --fix"
error: "executable not found in PATH"
suggestion: "Install golangci-lint: https://golangci-lint.run/usage/install/"
message: "Cannot proceed: 2 tools unavailable. Fix tool issues and re-run."
```
### TEST_FAILURE Report
```yaml
status: TEST_FAILURE
timestamp: "2025-11-11T10:30:00Z"
tests:
total: 45
passed: 42
failed: 3
coverage: 87%
failures:
- test: "TestParser_Parse"
file: "pkg/parser_test.go:25"
error: "Expected 'foo', got 'bar'"
- test: "TestValidator_Validate"
file: "pkg/validator_test.go:42"
error: "Validation failed: missing required field"
raw_output: "... full test output ..."
message: "Tests failed. Fix failing tests before proceeding to quality analysis."
```
## Error Handling
### Partial Failure Handling
When tools execute but fail mid-execution, continue with available data:
**Linter crashes:**
```yaml
status: ISSUES_FOUND
tests: {passed: true, ...}
linter:
status: "error"
error: "Failed to parse linter output: unexpected format"
raw_output: "..."
reviewer: {status: "success", findings: [...]}
# Continue with just reviewer data
message: "Tests passed. Linter failed (parse error). Showing reviewer findings only."
```
**Reviewer fails:**
```yaml
status: ISSUES_FOUND
tests: {passed: true, ...}
linter: {status: "success", issues: [...]}
reviewer:
status: "error"
error: "Agent timeout after 300s"
# Continue with just linter data
message: "Tests passed. Code review failed (timeout). Showing linter findings only."
```
**Key Principle:** As long as tests pass, return ISSUES_FOUND/CLEAN_STATE and provide whatever quality data is available.
## File Parameter Usage
The `files` parameter is interpreted differently by each tool:
- **Tests**: IGNORES `files` parameter - always runs full test suite
- Reason: Catch regressions across entire codebase
- Command: `go test ./...` (all packages)
- **Linter**: IGNORES `files` parameter - runs configured command as-is
- Reason: Linters need package/project scope, not file-level scope
- Linters typically already target changes via flags like `--new-from-rev`
- Command examples:
- `golangci-lint run --config .golangci.yaml --new-from-rev=origin/dev --fix ./...`
- `golangci-lint run --fix` (if configured to use git diff internally)
- **Reviewer**: USES `files` parameter - reviews specific files only
- Reason: Focused review on new/changed code
- Passes file list to go-code-reviewer agent
## Performance Targets
- **Full analysis**: Complete within 60-90 seconds for typical feature (5-10 files)
- **Incremental analysis**: Complete within 30-45 seconds (2-3 changed files)
- **Parallel execution**: All 3 tools run simultaneously for maximum efficiency
## What You Must NOT Do
- ❌ **Do NOT apply fixes** (that's @refactoring skill's job)
- ❌ **Do NOT make decisions for user** (just report findings)
- ❌ **Do NOT do code review yourself** (delegate to go-code-reviewer agent)
- ❌ **Do NOT run iterative loops** (orchestrator handles that)
- ❌ **Do NOT invoke other skills** beyond go-code-reviewer agent
- ❌ **Do NOT make code changes** (you are read-only)
## Integration with Orchestrator
You are invoked by the @linter-driven-development orchestrator during:
**Phase 2: Quality Analysis (Agent is the Gate)**
- Orchestrator calls: `Task(subagent_type: "quality-analyzer", mode: "full", ...)`
- You execute parallel analysis and return combined report
- Orchestrator routes based on your status:
- TEST_FAILURE → Enter Test Focus Mode
- CLEAN_STATE → Skip to Documentation Phase
- ISSUES_FOUND → Proceed to Quality Fix Loop
**Phase 3: Iterative Quality Fix Loop**
- Orchestrator calls: `Task(subagent_type: "quality-analyzer", mode: "incremental", ...)`
- You verify fix progress with delta analysis
- Orchestrator uses delta report to:
- Continue to next fix (if progress made)
- Enter Test Focus Mode (if tests failed)
- Complete loop (if clean state achieved)
## Example Invocation
### Full Mode (Initial Analysis)
```
Analyze code quality for this Go project.
Mode: full
Project commands:
- Test: go test ./... -v -cover
- Lint: golangci-lint run --fix
Files to analyze:
- pkg/parser.go
- pkg/validator.go
- pkg/types.go
- pkg/handler.go
Run all quality gates in parallel and return combined analysis.
```
### Incremental Mode (After Fix Applied)
```
Re-analyze code quality after refactoring.
Mode: incremental
Project commands:
- Test: go test ./... -v -cover
- Lint: golangci-lint run --fix
Files to analyze (changed):
- pkg/parser.go
Previous findings:
{
"overlapping_groups": [
{
"location": "pkg/parser.go:45",
"issues": [
{"source": "linter", "message": "Cognitive complexity 18"},
{"source": "linter", "message": "Function length 58"},
{"source": "review", "message": "Mixed abstractions"},
{"source": "review", "message": "Defensive checking"}
]
}
],
"isolated_issues": [...]
}
Run quality gates and return delta report (what changed).
```
## Remember
- You are a **quality gate orchestrator**, not a **fixer**
- Your output is **parsed by orchestrator**, format must be exact
- Your findings enable **intelligent root cause analysis** and **unified fix strategies**
- You run all tools **in parallel** for maximum efficiency
- You return one of 4 statuses: **TOOLS_UNAVAILABLE** | **TEST_FAILURE** | **ISSUES_FOUND** | **CLEAN_STATE**
- Tests always take priority - return **TEST_FAILURE** immediately if tests fail

commands/go-ldd-analyze.md

@@ -0,0 +1,133 @@
---
name: go-ldd-analyze
description: Run quality analysis only - invoke quality-analyzer agent and display combined report without auto-fixing
---
Run comprehensive quality analysis with intelligent combining of test results, linter findings, and code review feedback.
**This command does NOT auto-fix anything** - it provides read-only analysis with overlapping issue detection and root cause analysis.
Execute these steps:
## Step 1: Discover Project Commands
Search project documentation to find test and lint commands:
1. **Read project files** in order of preference:
- `CLAUDE.md` (project-specific instructions)
- `README.md` (project documentation)
- `Makefile` (look for `test:` and `lint:` targets)
- `Taskfile.yaml` (look for `test:` and `lintwithfix:` tasks)
- `.golangci.yaml` (linter configuration)
2. **Extract commands**:
- **Test command**: Look for patterns like:
- `go test ./... -v -cover`
- `make test`
- `task test`
- **Lint command**: Look for patterns like:
- `golangci-lint run --fix`
- `golangci-lint run --config .golangci.yaml --new-from-rev=origin/dev --fix ./...`
- `make lint`
- `task lintwithfix`
3. **Fallback to defaults** if not found:
- Test: `go test ./...`
- Lint: `golangci-lint run --fix`
## Step 2: Identify Changed Files
Use git to find files that have changed:
```bash
git diff --name-only --diff-filter=ACMR HEAD | grep '\.go$'
```
If no git repository or no changes, analyze all `.go` files in the project (excluding vendor/, testdata/).
## Step 3: Invoke Quality Analyzer Agent
Call the quality-analyzer agent with discovered commands and files:
```
Task(subagent_type: "quality-analyzer")
Prompt:
"Analyze code quality for this Go project.
Mode: full
Project commands:
- Test: [discovered test command]
- Lint: [discovered lint command]
Files to analyze:
[list of changed .go files, one per line]
Run all quality gates in parallel and return combined analysis."
```
## Step 4: Display Report
The agent will return a structured report with one of four statuses:
**TOOLS_UNAVAILABLE**: Display the report and suggest installing missing tools
**TEST_FAILURE**: Display test failures and suggest fixing them before quality analysis
**ISSUES_FOUND**: Display combined report with overlapping issues analysis and prioritized fix order
**CLEAN_STATE**: Display success message - all quality gates passed
## Report Format
The agent returns:
- ✅/❌ **Tests**: Pass/fail status with coverage
- ✅/❌ **Linter**: Clean/errors count
- ✅/⚠️ **Review**: Clean/findings (bugs, design debt, readability debt, polish)
- 🎯 **Overlapping Issues**: Multiple issues at same file:line with root cause analysis
- 📋 **Isolated Issues**: Single issues that don't overlap
- 🔢 **Prioritized Fix Order**: Which issues to tackle first based on impact
## Example Usage
```bash
/go-ldd-analyze
```
This will:
1. Discover test and lint commands from your project docs
2. Find changed Go files from git
3. Run tests, linter, and code review in parallel
4. Display intelligent combined analysis with overlapping issue detection
## Use Cases
- ✅ Quick quality check before committing
- ✅ Understand what issues exist without making changes
- ✅ Get intelligent combined view of tests + linter + review findings
- ✅ See overlapping issues with root cause analysis
- ✅ Identify high-impact fixes (multiple issues at same location)
## Comparison with Other Commands
| Command | Purpose | Auto-Fix | Agent |
|---------|---------|----------|-------|
| `/go-ldd-autopilot` | Complete workflow (Phase 1-6) | ✅ Yes | No |
| `/go-ldd-quickfix` | Quality gates loop with auto-fix | ✅ Yes | No |
| `/go-ldd-review` | Final verification, no auto-fix | ❌ No | No |
| `/go-ldd-analyze` | Quality analysis with intelligent combining | ❌ No | ✅ Yes |
| `/go-ldd-status` | Show workflow status | N/A | No |
## Key Benefits
1. **Parallel Execution**: Runs tests, linter, and code review simultaneously
2. **Intelligent Combining**: Identifies overlapping issues at same file:line
3. **Root Cause Analysis**: Explains why multiple issues occur at same location
4. **Prioritized Fixes**: Suggests fix order based on impact (issues resolved)
5. **Read-Only**: No auto-fix, just analysis and reporting
6. **Autonomous**: Discovers commands automatically from project docs
## Notes
- This command is equivalent to running the quality-analyzer agent standalone
- For auto-fix capability, use `/go-ldd-quickfix` instead
- For final commit-ready verification, use `/go-ldd-review` instead
- For complete workflow with implementation, use `/go-ldd-autopilot` instead


@@ -0,0 +1,16 @@
---
name: go-ldd-autopilot
description: Start complete linter-driven autopilot workflow (Phase 1-6)
---
Invoke the @linter-driven-development skill to run the complete autopilot workflow from design through commit-ready.
The skill will automatically:
1. Run Pre-Flight Check (detect intent, find commands, verify Go project)
2. Execute all 6 phases with 2 quality gates
3. Use parallel analysis (tests + linter + go-code-reviewer agent)
4. Generate intelligent combined reports
5. Auto-fix all issues iteratively
6. Prepare commit-ready summary
This is the full workflow - use for implementing features or fixes from start to finish.


@@ -0,0 +1,31 @@
---
name: go-ldd-quickfix
description: Run quality gates loop until all green (tests+linter+review → fix → repeat)
---
Execute the quality gates loop for already-implemented code that needs cleanup.
Run these phases from @linter-driven-development skill:
**Phase 2**: Parallel Analysis
- Discover project test/lint commands
- Launch 3 tools simultaneously: tests, linter, go-code-reviewer agent
- Wait for all results
**Phase 3**: Intelligent Combined Report
- Merge findings from linter + review
- Identify overlapping issues at same file:line
- Analyze root causes
- Generate unified fix strategies
- Prioritize: Impact × Effort × Risk
**Phase 4**: Iterative Fix Loop
- Apply fixes using @refactoring skill (auto, no confirmation)
- Re-verify with parallel analysis (incremental review mode)
- Repeat until all green
**Loop until**:
✅ Tests pass | ✅ Linter clean | ✅ Review clean
Use this when code is already written but needs to pass quality gates.
Skip the implementation phase (Phase 1) and go straight to fixing issues.

commands/go-ldd-review.md

@@ -0,0 +1,25 @@
---
name: go-ldd-review
description: Check if code is commit-ready (final verification, no auto-fix)
---
Run final verification checks **without** the auto-fix loop.
Execute these steps:
1. **Discover commands** from project docs (README, CLAUDE.md, Makefile, etc.)
2. **Run in parallel**:
- Tests: Bash([PROJECT_TEST_COMMAND])
- Linter: Bash([PROJECT_LINT_COMMAND])
- Review: Task(subagent_type: "go-code-reviewer") with mode: full
3. **Generate commit readiness report**:
- ✅/❌ Tests: [pass/fail] + coverage
- ✅/❌ Linter: [clean/errors]
- ✅/❌ Review: [clean/findings]
- 📝 Files in scope: [list with +/- lines]
- 💡 Suggested commit message
**Does NOT auto-fix anything** - just reports current state.
Use when you want to verify code is ready without making changes.
Equivalent to Phase 2 (Parallel Analysis) + Gate 2 (Final Verification) only.

commands/go-ldd-status.md Normal file

@@ -0,0 +1,24 @@
---
name: go-ldd-status
description: Show current workflow status and progress
---
Display current implementation status:
📍 Current Context:
- Active plan: [file path or "conversation"]
- Current step: [step number]
- Phase: [design/testing/linting/refactoring/review/docs/commit]
📊 Last Results:
Tests: [status + coverage]
Linter: [status + error count]
Review: [status + finding count]
📝 Files Modified:
[list with +/- lines]
🎯 Next Action:
[What happens next in the workflow]
Perfect for: "where are we?", "what's the status?", "what's next?"

plugin.lock.json Normal file

@@ -0,0 +1,161 @@
{
"$schema": "internal://schemas/plugin.lock.v1.json",
"pluginId": "gh:buzzdan/ai-coding-rules:go-linter-driven-development",
"normalized": {
"repo": null,
"ref": "refs/tags/v20251128.0",
"commit": "4ff99ed5f38fde8710ce7c36158dc798d6f18e22",
"treeHash": "62e5783b92aceda28c42ea4fb4c791cc719fa685b896ded43098d5776182231f",
"generatedAt": "2025-11-28T10:14:26.671666Z",
"toolVersion": "publish_plugins.py@0.2.0"
},
"origin": {
"remote": "git@github.com:zhongweili/42plugin-data.git",
"branch": "master",
"commit": "aa1497ed0949fd50e99e70d6324a29c5b34f9390",
"repoRoot": "/Users/zhongweili/projects/openmind/42plugin-data"
},
"manifest": {
"name": "go-linter-driven-development",
"description": "Linter-driven development workflow for Go with six specialized skills: design, testing, refactoring, review, and documentation",
"version": "1.0.0"
},
"content": {
"files": [
{
"path": "README.md",
"sha256": "a9394d443ed29109e294be03d1e62d4b110827ff828317d03f2a4b484c75f2cc"
},
{
"path": "agents/quality-analyzer.md",
"sha256": "6cc12adac4943d798883dccf27e215c6b4292b714f03832a79cbc48d26a6fb57"
},
{
"path": "agents/go-code-reviewer.md",
"sha256": "14e43c33b15e1f3d812a87e2da5233bbdad66f98cfbc4ee31048e821d635300a"
},
{
"path": ".claude-plugin/plugin.json",
"sha256": "366f84401b60e9ee58a6e40ef13bfb7fe9bb3eb748ea8769d67329b7821445b0"
},
{
"path": "commands/go-ldd-status.md",
"sha256": "28980b4e98cf8448ccda05dbe164259ad7b7c95e61d6fb771bf80edf92e77d02"
},
{
"path": "commands/go-ldd-quickfix.md",
"sha256": "3aa19f4c298c5d3cecdad74bf7f4a1d75ba39a86bd1a4d3d50b3b58c0c67ab4c"
},
{
"path": "commands/go-ldd-autopilot.md",
"sha256": "c069ce1448ecbea615d395387e0ddc7a93b890df55cfaf8f713313ab8072548e"
},
{
"path": "commands/go-ldd-analyze.md",
"sha256": "476d1e24e88af7673a190fbb11e42072401b9fd0078038c0f9113a50db003568"
},
{
"path": "commands/go-ldd-review.md",
"sha256": "f6f03906ba38fbaf90973c02ed4ebd358a24563d27b78b7a51a09257945ee2ab"
},
{
"path": "skills/documentation/reference.md",
"sha256": "f88555911d15c6368574a5f306097b551ab99a33b980a60652bb2b83997c6e51"
},
{
"path": "skills/documentation/SKILL.md",
"sha256": "fcd7995fc561fd7728414933ce7e7f725fc2082e8563acb83036d2c6c8d3b38b"
},
{
"path": "skills/pre-commit-review/examples.md",
"sha256": "b444972c54a9a56a71976c5b96593c09bd3f1813596e9ba5987a411f50f17127"
},
{
"path": "skills/pre-commit-review/reference.md",
"sha256": "67509db971d575127a1f27b91354e2f5ff33a14d5a13e35ecd09d488086a9b45"
},
{
"path": "skills/pre-commit-review/SKILL.md",
"sha256": "8fac6b27aeeb899aaa40dfce2671c96598b02fad1d77a75d6bbc841e438a8fb5"
},
{
"path": "skills/testing/reference.md",
"sha256": "87af459765a54bc4c16efb1655ecb908eded65340be4c74b98beeacc90b971e1"
},
{
"path": "skills/testing/SKILL.md",
"sha256": "21a6cb2c55a01b2e6ef673f6ced8ced6d2919774d2ccb14c61e40c6ceaae5b53"
},
{
"path": "skills/testing/examples/jsonrpc-mock.md",
"sha256": "27de2720e3c1eeccdfef47819ef4944b546cb07339093299dc60203dbb10ec05"
},
{
"path": "skills/testing/examples/grpc-bufconn.md",
"sha256": "d74b30371d9f9dc92d39069fbca1847986899604c8c4d9bc8a75f07b8fc049b3"
},
{
"path": "skills/testing/examples/system-patterns.md",
"sha256": "5d932252305b3246a953ebdf57dbfeef31a467ed05e43a515c4e41040c8ee608"
},
{
"path": "skills/testing/examples/integration-patterns.md",
"sha256": "32c9725b6bec7349666959b76545bdbb2d9487ad0ca226d8884cdc4cd203e757"
},
{
"path": "skills/testing/examples/nats-in-memory.md",
"sha256": "5971af8fcbde23208a401c80f4fab27369979fed52bd9693b25b2e266a74e3c2"
},
{
"path": "skills/testing/examples/victoria-metrics.md",
"sha256": "39166610f54c583dd422384b4f87d83882d200e9351d754795fa22477908d1ca"
},
{
"path": "skills/testing/examples/test-organization.md",
"sha256": "94672c890b71b45b8ce97dace08a928f0c44750f6c9cc82cba75bd6ffab9bd45"
},
{
"path": "skills/testing/examples/httptest-dsl.md",
"sha256": "676044a837e158254d066d351053088b38e5c940b781fba0c396cfee355d818e"
},
{
"path": "skills/refactoring/examples.md",
"sha256": "d62ec9524458b87cbc43c5f7cb64c5a4e1c66da6011608d5f5f708cb89a71164"
},
{
"path": "skills/refactoring/reference.md",
"sha256": "1a3ed692db86eac381e76868da03086746f474f9acc0f195dfc998d48640000a"
},
{
"path": "skills/refactoring/code-design-ref.md",
"sha256": "3391ddc56eb8d214ee0945a232c176f809932c25362b4d540a6c48ad16fcb3fd"
},
{
"path": "skills/refactoring/SKILL.md",
"sha256": "7347830ea645b1a4aa51901cfabf723e157518f8aa1d0b3b165d2a9bd039cd2a"
},
{
"path": "skills/linter-driven-development/reference.md",
"sha256": "758de7374b698c5d4c10d3237e7ef857ad57c571069e7dd3d2af050ba534538e"
},
{
"path": "skills/linter-driven-development/SKILL.md",
"sha256": "406503f4056473f6e9513332fd6b7ad020dce62bf956edc15c6b359b6f979aab"
},
{
"path": "skills/code-designing/reference.md",
"sha256": "91c93ad6ce027f2498ed4a977d88a090765feee24c2d94f177e36c3404aa3217"
},
{
"path": "skills/code-designing/SKILL.md",
"sha256": "f3924c651ca89d4aec41f6b8a6f4498345d19adb1f22197a8c5be1fb9bcdf069"
}
],
"dirSha256": "62e5783b92aceda28c42ea4fb4c791cc719fa685b896ded43098d5776182231f"
},
"security": {
"scannedAt": null,
"scannerVersion": null,
"flags": []
}
}


@@ -0,0 +1,221 @@
---
name: code-designing
description: Domain type design and architectural planning for Go code. Use when planning new features, designing self-validating types, preventing primitive obsession, or when refactoring reveals need for new types. Focuses on vertical slice architecture and type safety.
---
# Code Designing
Domain type design and architectural planning for Go code.
Use when planning new features or identifying need for new types during refactoring.
## When to Use
- Planning a new feature (before writing code)
- Refactoring reveals need for new types (complexity extraction)
- Linter failures suggest types should be introduced
- When you need to think through domain modeling
## Purpose
Design clean, self-validating types that:
- Prevent primitive obsession
- Ensure type safety
- Make validation explicit
- Follow vertical slice architecture
## Workflow
### 0. Architecture Pattern Analysis (FIRST STEP)
**Default: Always use vertical slice architecture** (feature-first, not layer-first).
Scan codebase structure:
- **Vertical slicing**: `internal/feature/{handler,service,repository,models}.go`
- **Horizontal layering**: `internal/{handlers,services,domain}/feature.go` ⚠️
**Decision Flow**:
1. **Pure vertical** → Continue pattern, implement as `internal/[new-feature]/`
2. **Pure horizontal** → Propose: Start migration with `docs/architecture/vertical-slice-migration.md`, implement new feature as first vertical slice
3. **Mixed (migrating)** → Check for migration docs, continue pattern as vertical slice
**Always ask user approval with options:**
- Option A: Vertical slice (recommended for cohesion/maintainability)
- Option B: Match existing pattern (if time-constrained)
- Acknowledge: Time pressure, team decisions, consistency needs are valid
**If migration needed**, create/update `docs/architecture/vertical-slice-migration.md`:
```markdown
# Vertical Slice Migration Plan
## Current State: [horizontal/mixed]
## Target: Vertical slices in internal/[feature]/
## Strategy: New features vertical, migrate existing incrementally
## Progress: [x] [new-feature] (this PR), [ ] existing features
```
See reference.md section #3 for detailed patterns.
---
### 1. Understand Domain
- What is the problem domain?
- What are the main concepts/entities?
- What are the invariants and rules?
- How does this fit into existing architecture?
### 2. Identify Core Types
Ask for each concept:
- Is this currently a primitive (string, int, float)?
- Does it have validation rules?
- Does it have behavior beyond simple data?
- Is it used across multiple places?
If yes to any → Consider creating a type
### 3. Design Self-Validating Types
For each type:
```go
// Type definition
type TypeName underlyingType
// Validating constructor
func NewTypeName(input underlyingType) (TypeName, error) {
// Validate input
	if /* validation fails */ {
		var zero TypeName
		return zero, errors.New("why it failed")
	}
return TypeName(input), nil
}
// Methods on type (if behavior needed)
func (t TypeName) SomeMethod() result {
// Type-specific logic
}
```
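As a concrete sketch of this template, the `Port` type from reference.md (the 1 to 8999 valid range is the same assumption used there) looks like:

```go
package main

import (
	"errors"
	"fmt"
)

// Port is a self-validating type; an invalid value cannot be constructed.
type Port int

func NewPort(i int) (Port, error) {
	if i <= 0 || i >= 9000 {
		return 0, errors.New("port must be between 1 and 8999")
	}
	return Port(i), nil
}

func main() {
	p, err := NewPort(8080)
	fmt.Println(p, err) // 8080 <nil>

	_, err = NewPort(0)
	fmt.Println(err != nil) // true
}
```

Callers that receive a `Port` never re-validate it; the constructor is the single gate.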
### 4. Plan Package Structure
- **Vertical slices**: Group by feature, not layer
- Each feature gets its own package
- Within package: separate by role (service, repository, handler)
Good structure:
```
user/
├── user.go # Domain types
├── service.go # Business logic
├── repository.go # Persistence
└── handler.go # HTTP/API
```
Bad structure:
```
domain/user.go
services/user_service.go
repository/user_repository.go
```
### 5. Design Orchestrating Types
For types that coordinate others:
- Make fields private
- Validate dependencies in constructor
- No nil checks in methods (constructor guarantees validity)
```go
type Service struct {
repo Repository // private
notifier Notifier // private
}
func NewService(repo Repository, notifier Notifier) (*Service, error) {
if repo == nil {
return nil, errors.New("repo required")
}
if notifier == nil {
return nil, errors.New("notifier required")
}
return &Service{
repo: repo,
notifier: notifier,
}, nil
}
// Methods can trust fields are valid
func (s *Service) DoSomething() error {
// No nil checks needed
return s.repo.Save(...)
}
```
### 6. Review Against Principles
Check design against (see reference.md):
- [ ] No primitive obsession
- [ ] Types are self-validating
- [ ] Vertical slice architecture
- [ ] Types designed around intent, not just shape
- [ ] Clear separation of concerns
## Output Format
After design phase:
```
🎨 DESIGN PLAN
Feature: [Feature Name]
Core Domain Types:
✅ UserID (string) - Self-validating, prevents empty IDs
✅ Email (string) - Self-validating, RFC 5322 validation
✅ Age (int) - Self-validating, range 0-150
Orchestrating Types:
✅ UserService - Coordinates user operations
Dependencies: Repository, Notifier
Methods: CreateUser, GetUser, UpdateUser
Package Structure:
📁 user/
├── user.go # UserID, Email, Age, User
├── service.go # UserService
├── repository.go # Repository interface + implementations
├── notifier.go # Notifier interface + implementations
└── handler.go # HTTP handlers
Design Decisions:
- UserID is custom type to prevent passing empty/invalid IDs
- Email validation centralized in NewEmail constructor
- Vertical slice keeps all user logic in one package
- Repository as interface allows multiple backends (Postgres, in-memory for tests)
Integration Points:
- Consumed by: HTTP API (/users endpoints)
- Depends on: Database, Email service
- Events: UserCreated event published after creation
Next Steps:
1. Create types with validating constructors
2. Write unit tests for each type
3. Implement UserService
4. Write integration tests
Ready to implement? Use @testing skill for test structure.
```
## Key Principles
See reference.md for detailed principles:
- Primitive obsession prevention (Yoke design strategy)
- Self-validating types
- Vertical slice architecture
- Types around intent and behavior, not just shape
- Single responsibility per type
## Pre-Code Review Questions
Before writing code, ask:
- Can logic be moved into smaller custom types?
- Is this type designed around intent and behavior?
- Have I avoided primitive obsession?
- Is validation in the right place (constructor)?
- Does this follow vertical slice architecture?
Only after satisfactory answers, proceed to implementation.
See reference.md for complete design principles and examples.


@@ -0,0 +1,512 @@
# Code Design Principles
Core principles for designing Go types and architecture.
## 1. Primitive Obsession Prevention (Yoke Design Strategy)
### Principle
When it makes sense, avoid using primitives directly. Instead, create a type with proper named methods.
### When to Create a Type
A primitive should become a type when:
- It has validation rules
- It has behavior/logic associated with it
- It represents a domain concept
- It's used across multiple places
- Passing an invalid value would be a bug
### Pattern: Self-Validating Type
```go
// ❌ Primitive obsession
func CreateUser(id string, email string, port int) error {
if id == "" {
return errors.New("id required")
}
if !isValidEmail(email) {
return errors.New("invalid email")
}
if port <= 0 || port >= 9000 {
return errors.New("invalid port")
}
// ...
}
// ✅ Self-validating types
type UserID string
type Email string
type Port int
func NewUserID(s string) (UserID, error) {
if s == "" {
return "", errors.New("id required")
}
return UserID(s), nil
}
func NewEmail(s string) (Email, error) {
if !isValidEmail(s) {
return "", errors.New("invalid email")
}
return Email(s), nil
}
func NewPort(i int) (Port, error) {
if i <= 0 || i >= 9000 {
return 0, errors.New("port must be between 1 and 8999")
}
return Port(i), nil
}
func CreateUser(id UserID, email Email, port Port) error {
// No validation needed - types guarantee validity
// Pure business logic
}
```
### Benefits
- **Compile-time safety**: Can't pass wrong type
- **Centralized validation**: Rules in one place (constructor)
- **Self-documenting**: Type name explains purpose
- **Easier refactoring**: Change validation in one place
### Examples from Real Code
```go
// Parser complexity split into roles
type HeaderParser struct { /* ... */ }
type PathParser struct { /* ... */ }
type BodyParser struct { /* ... */ }
// Instead of one complex Parser with all logic
```
---
## 2. Self-Validating Types
### Principle
Types should validate their invariants in constructors. Methods should trust that the object is valid.
### Pattern: Private Fields + Validating Constructor
```go
// ❌ Non-self-validating
type UserService struct {
Repo Repository // Public, might be nil
}
func (s *UserService) CreateUser(user User) error {
if s.Repo == nil { // Defensive check in every method
return errors.New("repo is nil")
}
return s.Repo.Save(user)
}
// ✅ Self-validating
type UserService struct {
repo Repository // Private
}
func NewUserService(repo Repository) (*UserService, error) {
if repo == nil {
return nil, errors.New("repo is required")
}
return &UserService{repo: repo}, nil
}
func (s *UserService) CreateUser(user User) error {
// No nil check needed - constructor guarantees validity
return s.repo.Save(user)
}
```
### Nil is Not a Valid Value
- Never return nil values (except errors: `nil, err` or `val, nil` is okay)
- Never pass nil into a function
- Check arguments in constructor, not in methods
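The zero-value-plus-error convention can be sketched with a hypothetical `FindUser` helper (the name and fields are illustrative, not from the codebase):

```go
package main

import (
	"errors"
	"fmt"
)

type User struct {
	Name string
}

// FindUser returns the zero value together with an error instead of a
// nil pointer, so callers never receive nil as a "valid" result.
func FindUser(name string) (User, error) {
	if name == "" {
		return User{}, errors.New("name required")
	}
	return User{Name: name}, nil
}

func main() {
	_, err := FindUser("")
	fmt.Println(err != nil) // true
}
```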
### Avoid Defensive Coding
- Don't check for nil fields inside methods
- Constructor should guarantee object validity
- Methods can trust object state
---
## 3. Vertical Slice Architecture
### Principle
**Group by feature and behavior, not by technical layer.**
All code for a feature lives together in one directory.
### Examples
#### ❌ BAD: Horizontal Layers
```
internal/
├── handlers/health_handler.go
├── services/health_service.go
└── models/health.go
```
Problems: Feature scattered, hard to understand complete behavior, team conflicts
#### ✅ GOOD: Vertical Slice
```
internal/health/
├── handler.go
├── service.go
├── repository.go
└── models.go
```
Benefits: Feature colocated, easy to understand/extract, parallel work
### Migration Strategy
**New features**: Always implement as vertical slices
**Existing horizontal code**: Refactor incrementally
Create `docs/architecture/vertical-slice-migration.md`:
```markdown
# Vertical Slice Migration Plan
## Current State: [horizontal/mixed description]
## Target: Vertical slices in internal/[feature]/
## Strategy: New features vertical, refactor existing incrementally
## Progress: [x] new_feature (this PR), [ ] health, [ ] verification
```
**Never mix**: Don't have both `health/service.go` AND `services/health_service.go` for the same feature.
---
## 4. Types Around Intent and Behavior
### Principle
Design types around intent and behavior, not just shape.
### Ask Before Creating a Type
- What is the purpose of this type?
- What invariants must it maintain?
- What behavior does it have?
- Why does it exist (beyond grouping fields)?
### Pattern: Types with Behavior
```go
// ❌ Type is just data container
type Config struct {
Host string
Port int
}
// ✅ Type has behavior and validation
type ServerAddress struct {
host string
port int
}
func NewServerAddress(host string, port int) (ServerAddress, error) {
if host == "" {
return ServerAddress{}, errors.New("host required")
}
if port <= 0 || port > 65535 {
return ServerAddress{}, errors.New("invalid port")
}
return ServerAddress{host: host, port: port}, nil
}
func (a ServerAddress) String() string {
return fmt.Sprintf("%s:%d", a.host, a.port)
}
func (a ServerAddress) IsLocal() bool {
return a.host == "localhost" || a.host == "127.0.0.1"
}
```
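A quick usage sketch of `ServerAddress` (the type is repeated verbatim so the snippet runs standalone):

```go
package main

import (
	"errors"
	"fmt"
)

type ServerAddress struct {
	host string
	port int
}

func NewServerAddress(host string, port int) (ServerAddress, error) {
	if host == "" {
		return ServerAddress{}, errors.New("host required")
	}
	if port <= 0 || port > 65535 {
		return ServerAddress{}, errors.New("invalid port")
	}
	return ServerAddress{host: host, port: port}, nil
}

func (a ServerAddress) String() string {
	return fmt.Sprintf("%s:%d", a.host, a.port)
}

func (a ServerAddress) IsLocal() bool {
	return a.host == "localhost" || a.host == "127.0.0.1"
}

func main() {
	addr, err := NewServerAddress("localhost", 8080)
	if err != nil {
		panic(err)
	}
	// fmt uses the String method because ServerAddress implements Stringer.
	fmt.Println(addr, addr.IsLocal())
	// Output: localhost:8080 true
}
```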
---
## 5. Type File Organization
### Principle
Types with logic should be in their own file. Name the file after the type.
### Pattern
```
user/
├── user.go # User type
├── user_id.go # UserID type with validation
├── email.go # Email type with validation
├── service.go # UserService
└── repository.go # Repository interface
```
### When to Extract to Own File
- Type has multiple methods
- Type has complex validation
- Type has significant documentation
- Type is important enough to be easily found
---
## 6. Leaf vs Orchestrating Types
### Leaf Types
**Definition**: Types that do not depend on other custom types
**Characteristics:**
- Self-contained
- Minimal dependencies
- Pure logic
- Easy to test
**Example:**
```go
type UserID string
type Email string
type Age int
// These are leaf types - they depend only on primitives
```
**Testing:**
- Should have 100% unit test coverage
- Test only public API
- Use table-driven tests
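A sketch of the table-driven shape, run from `main` so the snippet is self-contained; in a real project the same table sits inside `func TestNewUserID(t *testing.T)` with `t.Run` subtests:

```go
package main

import (
	"errors"
	"fmt"
)

type UserID string

func NewUserID(s string) (UserID, error) {
	if s == "" {
		return "", errors.New("id required")
	}
	return UserID(s), nil
}

func main() {
	// Each case names an input and the expected validation outcome.
	cases := []struct {
		name    string
		input   string
		wantErr bool
	}{
		{"valid id", "usr_123", false},
		{"empty id", "", true},
	}
	for _, tc := range cases {
		_, err := NewUserID(tc.input)
		if (err != nil) != tc.wantErr {
			fmt.Printf("FAIL %s: err=%v, wantErr=%v\n", tc.name, err, tc.wantErr)
			continue
		}
		fmt.Printf("PASS %s\n", tc.name)
	}
}
```

Adding a new validation rule to the constructor then only requires a new row in the table.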
### Orchestrating Types
**Definition**: Types that coordinate other types
**Characteristics:**
- Depend on other types (composition)
- Implement business workflows
- Minimal logic (mostly delegation)
**Example:**
```go
type UserService struct {
repo Repository
notifier Notifier
}
// This orchestrates Repository and Notifier
```
**Testing:**
- Integration tests covering seams
- Test with real implementations, not mocks
- Can overlap with leaf type coverage
### Design Goal
**Most logic should be in leaf types.**
- Leaf types are easy to test and maintain
- Orchestrators should be thin wrappers
---
## 7. Abstraction Through Interfaces
### Principle
Don't create interfaces until you need them (avoid interface pollution).
### When to Create an Interface
- You have multiple implementations
- You need to inject dependency for testing
- You're defining a clear contract
### Pattern: Interface at Usage Point
```go
// In user/service.go
type Repository interface { // Defined where used
Save(ctx context.Context, u User) error
Get(ctx context.Context, id UserID) (*User, error)
}
type UserService struct {
repo Repository // Depends on interface
}
// In user/postgres.go
type PostgresRepository struct {
db *sql.DB
}
func (r *PostgresRepository) Save(ctx context.Context, u User) error {
// Implementation
}
// In user/inmem.go
type InMemoryRepository struct {
users map[UserID]User
}
func (r *InMemoryRepository) Save(ctx context.Context, u User) error {
// Implementation
}
```
### Benefits
- Easy to test (use in-memory implementation)
- Can swap implementations
- Clear contract
### Don't Over-Abstract
```go
// ❌ Interface pollution
type UserGetter interface {
Get(id UserID) (*User, error)
}
type UserSaver interface {
Save(u User) error
}
type UserDeleter interface {
Delete(id UserID) error
}
// ✅ Single cohesive interface
type Repository interface {
Get(ctx context.Context, id UserID) (*User, error)
Save(ctx context.Context, u User) error
Delete(ctx context.Context, id UserID) error
}
```
---
## 8. Design Checklist (Pre-Code Review)
Before writing code, review:
### Can Logic Move to Smaller Types?
- [ ] Are there primitives that should be types?
- [ ] Can complex logic be split into multiple types?
- [ ] Example: Parser → HeaderParser + PathParser
### Type Intent
- [ ] Is type designed around behavior, not just shape?
- [ ] Does type have clear responsibility?
- [ ] Why does this type exist?
### Validation
- [ ] Is validation in constructor?
- [ ] Are fields private?
- [ ] Can methods trust object validity?
### Architecture
- [ ] Is this a vertical slice (not horizontal layer)?
- [ ] Are related types in same package?
- [ ] Is package name specific (not generic)?
### Dependencies
- [ ] Are dependencies injected through constructor?
- [ ] Are dependencies interfaces (if needed)?
- [ ] Is constructor validating dependencies?
Only after satisfactory answers → proceed to write code.
---
## 9. Common Go Anti-Patterns to Avoid
### Goroutine Leaks
Always ensure goroutines can exit:
```go
// ❌ Goroutine leak
func StartWorker() {
go func() {
for {
// No way to exit
work()
}
}()
}
// ✅ Goroutine with exit
func StartWorker(ctx context.Context) {
go func() {
for {
select {
case <-ctx.Done():
return
default:
work()
}
}
}()
}
```
### Interface Pollution
Don't create interfaces until you need them.
### Premature Optimization
Measure before optimizing.
### Ignoring Context
Always respect context cancellation:
```go
func DoWork(ctx context.Context) error {
// Check context
if err := ctx.Err(); err != nil {
return err
}
// ...
}
```
### Mutex in Wrong Scope
Keep mutex close to data it protects:
```go
// ✅ Mutex with data
type SafeCounter struct {
mu sync.Mutex
count int
}
func (c *SafeCounter) Inc() {
c.mu.Lock()
defer c.mu.Unlock()
c.count++
}
```
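A usage sketch of `SafeCounter` under concurrency; the `Value` accessor is added here for illustration and is not part of the original snippet:

```go
package main

import (
	"fmt"
	"sync"
)

type SafeCounter struct {
	mu    sync.Mutex
	count int
}

func (c *SafeCounter) Inc() {
	c.mu.Lock()
	defer c.mu.Unlock()
	c.count++
}

// Value reads the count under the same mutex that protects writes.
func (c *SafeCounter) Value() int {
	c.mu.Lock()
	defer c.mu.Unlock()
	return c.count
}

func main() {
	var wg sync.WaitGroup
	c := &SafeCounter{}
	for i := 0; i < 100; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			c.Inc()
		}()
	}
	wg.Wait()
	fmt.Println(c.Value())
	// Output: 100
}
```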
---
## 10. Naming Conventions
### Package Names
- Use flatcase: `wekatrace`, not `wekaTrace` or `weka_trace`
- Avoid generic names: `util`, `common`, `helper`
- Avoid stdlib collisions: `metrics` collides with libs, use `wekametrics`
### Type Names
- Ergonomic: `version.Info` better than `version.VersionInfo`
- Context from package: `user.Service` better than `user.UserService`
- Avoid redundancy: method receiver provides context
### Method Names
```go
// ❌ Redundant
func (s *UserService) CreateUserAccount(u User) error
// ✅ Concise
func (s *UserService) Create(u User) error
```
### Idiomatic Go
- Write idiomatic Go code
- Follow Go community style and best practices
- Apply the Effective Go guidelines
---
## Summary: Design Principles
1. **Prevent primitive obsession** - Create types for domain concepts
2. **Self-validating types** - Validate in constructor, trust in methods
3. **Vertical slices** - Group by feature, not layer
4. **Intent and behavior** - Design types around purpose
5. **File per type** - Types with logic get own file
6. **Leaf types** - Most logic in self-contained types
7. **Interfaces when needed** - Don't over-abstract
8. **Pre-code review** - Think before coding
9. **Avoid anti-patterns** - Goroutine leaks, premature optimization, etc.
10. **Idiomatic naming** - Follow Go conventions


@@ -0,0 +1,246 @@
---
name: documentation
description: Creates comprehensive feature documentation for humans and AI to understand features, resolve bugs, and extend functionality. Use after complete feature implementation (may span multiple commits). Generates feature docs, updates godoc, and creates testable examples.
---
# Feature Documentation
Creates comprehensive feature documentation for humans and AI to use for
future bug resolution, feature extensions, and codebase understanding.
## When to Use
- After a complete feature is implemented (may span multiple commits)
- When adding significant new functionality to the codebase
- NOT for: individual commits, bug fixes, minor refactors
## Purpose
Generate documentation that helps:
- **Humans**: Understand what the feature does and how to use it
- **AI**: Context for future bug fixes and feature extensions
- **Team**: Onboarding and knowledge sharing
This is NOT a changelog - it's an introduction to the feature.
## Workflow
### 1. Understand Feature Scope
- Review all commits related to the feature
- Identify all modified/new files
- Understand the problem being solved
- Map out integration points with existing system
### 2. Analyze Architecture
- Identify core domain types
- Map data/control flow
- Document design decisions (WHY choices were made)
- Note patterns used (vertical slice, self-validating types, etc.)
### 3. Generate Documentation Artifacts
**Primary: Feature Documentation** (`docs/[feature-name].md`)
- Problem & solution: What problem does this solve?
- Architecture: How does it work?
- Usage examples: How do I use it?
- Integration: How does it fit into the system?
**Secondary: Code Comments**
- Update package godoc to reflect feature's role
- Add godoc to key types explaining their purpose
- Create testable examples (Example_* functions) when helpful
### 4. Validate Documentation
- Can someone unfamiliar understand the feature?
- Can AI use this for bug fixes without reading all code?
- Are design decisions clearly explained?
- Are integration points documented?
## Documentation Template
```markdown
# [Feature Name]
## Problem & Solution
**Problem**: [What user/system problem does this solve?]
**Solution**: [High-level approach taken]
## Architecture
### Core Types
- `TypeName` - [Purpose, why it exists, key responsibility]
- `AnotherType` - [Purpose, why it exists, key responsibility]
### Design Decisions
- **Why [Decision]**: [Rationale - connects to coding principles]
- Example: "UserID is a custom type (not string) to avoid primitive obsession and ensure validation"
- **Why [Pattern]**: [Rationale]
- Example: "Vertical slice structure groups all user logic together for easier maintenance"
### Data Flow
```
[Step-by-step flow diagram or description]
Input → Validation → Processing → Storage → Output
```
### Integration Points
- **Consumed by**: [What uses this feature]
- **Depends on**: [What this feature uses]
- **Events/Hooks**: [If applicable]
## Usage
### Basic Usage
```go
// Common case example
package main
func example() {
// Real, runnable code
}
```
### Advanced Scenarios
```go
// Complex case example
package main
func advancedExample() {
// Real, runnable code showing edge cases
}
```
## Testing Strategy
- **Unit Tests**: [What's covered, approach]
- **Integration Tests**: [What's covered, approach]
- **Coverage**: [Percentage and rationale]
## Future Considerations
- [Known limitations]
- [Potential extensions]
- [Related features that might be built on this]
## References
- [Related packages]
- [External documentation]
- [Design patterns used]
```
## Code Comment Guidelines
**Package-Level Documentation:**
```go
// Package [name] provides [high-level purpose].
//
// [2-3 sentences about what problem this solves and how]
//
// Core types:
// - Type1: [Purpose]
// - Type2: [Purpose]
//
// Example usage:
// [Simple example showing typical usage]
//
// Design notes:
// - [Key design decision]
// - [Why certain patterns were used]
package name
```
**Type-Level Documentation:**
```go
// TypeName represents [domain concept].
//
// [Explain why this type exists - design decision]
// [Explain validation rules if self-validating]
// [Explain thread-safety if relevant]
//
// Example:
// id, err := NewUserID("usr_123")
// if err != nil {
// // handle validation error
// }
type TypeName struct {
// ...
}
```
**Testable Examples:**
```go
// ExampleNewUserID demonstrates typical usage of NewUserID.
func ExampleNewUserID() {
	id, _ := NewUserID("usr_123")
	fmt.Println(id)
	// Output: usr_123
}
// ExampleNewUserID_validation shows validation behavior.
func ExampleNewUserID_validation() {
	_, err := NewUserID("")
	fmt.Println(err != nil)
	// Output: true
}
```
## Output Format
After generating documentation:
```
📚 FEATURE DOCUMENTATION COMPLETE
Feature: [Feature Name]
Generated Artifacts:
✅ docs/[feature-name].md (created)
✅ Package godoc updated in [package]/[file].go
✅ Type documentation for:
- TypeName1 ([file]:line)
- TypeName2 ([file]:line)
✅ Testable examples:
- ExampleTypeName1
- ExampleTypeName2_validation
Documentation covers:
- Problem & Solution overview
- Architecture with design decisions
- Core types: [list]
- Data flow diagram
- Integration points: [list]
- Usage examples (basic + advanced)
- Testing strategy
- Future considerations
📝 Next Steps:
1. Review docs/[feature-name].md for accuracy
2. Run `go test` to verify testable examples execute correctly
3. Consider: Does this help future you/AI understand the feature?
Would you like to:
1. Commit documentation as-is
2. Refine specific sections
3. Add more examples
4. Add testable examples to code
```
## Key Principles
**Documentation is NOT:**
- A changelog of commits
- Implementation details without context
- API reference without explanation
- Generated automatically without understanding
**Documentation IS:**
- Explaining WHY decisions were made
- Providing context for future changes
- Showing how pieces fit together
- Helping both humans and AI understand intent
**AI-Friendly Documentation:**
When AI tools read this documentation for bug fixes or extensions:
- They should understand the problem domain
- They should know which types are central
- They should understand design constraints
- They should see how it integrates with the system
See reference.md for complete documentation checklist and examples.


@@ -0,0 +1,346 @@
# Feature Documentation Reference
Complete guide for generating feature documentation that serves both humans and AI.
## Documentation Quality Checklist
### Problem & Solution Section
- [ ] Clear problem statement (what user/system pain point?)
- [ ] High-level solution approach
- [ ] Why this solution was chosen over alternatives
### Architecture Section
- [ ] All core types listed with their purpose
- [ ] Design decisions explained with rationale
- [ ] Connections to coding principles (primitive obsession prevention, vertical slice, etc.)
- [ ] Data flow diagram or clear description
- [ ] Integration points with existing system documented
### Usage Section
- [ ] Basic usage example with runnable code
- [ ] Advanced usage example showing edge cases
- [ ] Common patterns demonstrated
- [ ] Examples are copy-pasteable
### Testing Section
- [ ] Unit test approach explained
- [ ] Integration test approach explained
- [ ] Coverage metrics and rationale provided
### Future Considerations
- [ ] Known limitations documented
- [ ] Potential extensions noted
- [ ] Related features that could build on this mentioned
## Code Comments Checklist
### Package Documentation
- [ ] High-level purpose explained
- [ ] Problem domain described
- [ ] Core types listed with brief descriptions
- [ ] Simple usage example included
- [ ] Key design decisions noted
### Type Documentation
- [ ] Domain concept the type represents
- [ ] Why the type exists (design decision)
- [ ] Validation rules if self-validating
- [ ] Thread-safety guarantees if applicable
- [ ] Usage example for non-trivial types
### Testable Examples (Example_* functions)
- [ ] At least one Example_* for complex/core types
- [ ] Examples are runnable (not pseudocode)
- [ ] Examples show common use cases
- [ ] Output comments included for verification
- [ ] Examples live in test files
## AI-Friendly Documentation Patterns
### For Bug Fixes
AI needs to know:
- What was the original problem this feature solved?
- What are the core invariants that must be maintained?
- What assumptions were made?
- What are the integration points?
**Example:**
```markdown
## Design Invariants
- UserID must always be non-empty after construction
- Email validation follows RFC 5322
- UserService assumes repository is never nil (validated in constructor)
- Password hashes use bcrypt with cost factor 12
```
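These invariants map directly to a self-validating constructor: validation happens once, at the boundary, and the rest of the code can rely on it. A minimal sketch (the `usr_` prefix rule and names are assumptions for illustration, not a real codebase's API):

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// UserID is non-empty by construction: the only way to obtain one
// is through NewUserID, which rejects invalid input.
type UserID struct{ value string }

// NewUserID enforces the documented invariants at construction,
// so downstream code never needs to re-check them.
func NewUserID(raw string) (UserID, error) {
	if raw == "" {
		return UserID{}, errors.New("user id must be non-empty")
	}
	if !strings.HasPrefix(raw, "usr_") {
		return UserID{}, errors.New(`user id must start with "usr_"`)
	}
	return UserID{value: raw}, nil
}

func (id UserID) String() string { return id.value }

func main() {
	id, err := NewUserID("usr_12345")
	fmt.Println(id, err)

	_, err = NewUserID("")
	fmt.Println(err != nil)
}
```

Documenting the invariant next to such a constructor lets an AI (or a new teammate) fix bugs without re-deriving the validation rules from call sites.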
### For Feature Extensions
AI needs to know:
- What patterns are established?
- Where are the natural extension points?
- What constraints must be maintained?
**Example:**
```markdown
## Extension Points
- **New validation rules**: Add to NewUserID constructor
- **New storage backends**: Implement Repository interface
- **New notification channels**: Add implementation of Notifier interface
- **New authentication methods**: Implement Authenticator interface
```
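To make such extension points concrete, the interfaces can be sketched as follows (interface shapes and names are assumptions for illustration, not the project's actual API):

```go
package main

import "fmt"

type User struct{ ID, Email string }

// Repository is the persistence extension point: a new storage
// backend implements this interface without touching callers.
type Repository interface {
	Save(u User) error
}

// Notifier is the communication extension point: email, SMS, or
// push channels each provide their own implementation.
type Notifier interface {
	SendWelcome(u User) error
}

// memoryRepository is one possible backend, handy in tests.
type memoryRepository struct{ users map[string]User }

func (r *memoryRepository) Save(u User) error {
	r.users[u.ID] = u
	return nil
}

// logNotifier is a stand-in channel that just prints.
type logNotifier struct{}

func (logNotifier) SendWelcome(u User) error {
	fmt.Println("welcome sent to", u.Email)
	return nil
}

func main() {
	var repo Repository = &memoryRepository{users: map[string]User{}}
	var notify Notifier = logNotifier{}

	u := User{ID: "usr_1", Email: "alice@example.com"}
	_ = repo.Save(u)
	_ = notify.SendWelcome(u)
}
```

Naming the interface in the docs, as above, tells an extender exactly which type to implement and where the seam is.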
### For Understanding Data Flow
AI needs to see:
- Entry points (how is this triggered?)
- Key transformation steps (what happens in sequence?)
- Exit points (what are the possible outcomes?)
**Example:**
```markdown
## Data Flow
1. HTTP handler receives POST /users → CreateUserRequest
2. Request validation → NewUserID, NewEmail (self-validating types)
3. UserService.CreateUser → validates business rules
4. Repository.Save → persists to database
5. Notifier.SendWelcome → sends welcome email (async)
6. Returns: User struct or validation/business error
```
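A compressed, runnable analogue of that flow (all names hypothetical) shows how the documented steps line up with code:

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

type User struct{ ID, Email string }

type store struct{ users map[string]User }

func (s *store) save(u User) { s.users[u.ID] = u }

// createUser mirrors the documented flow: validate (fail fast),
// persist, then notify; errors exit at the earliest failing step.
func createUser(s *store, id, email string) (User, error) {
	if !strings.HasPrefix(id, "usr_") { // step 2: validation
		return User{}, errors.New("invalid user id")
	}
	if !strings.Contains(email, "@") {
		return User{}, errors.New("invalid email")
	}
	u := User{ID: id, Email: email}
	s.save(u)                                        // step 4: persistence
	fmt.Println("welcome email queued for", u.Email) // step 5: notification
	return u, nil                                    // step 6: success outcome
}

func main() {
	s := &store{users: map[string]User{}}
	if _, err := createUser(s, "usr_1", "alice@example.com"); err != nil {
		fmt.Println("error:", err)
	}
}
```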
## Documentation Anti-Patterns
### ❌ Implementation Details Without Context
```markdown
## Implementation
The CreateUser function calls validateEmail and then repo.Save.
It returns an error if validation fails.
```
*Why bad?*: Describes WHAT code does without WHY
### ✅ Context-Rich Explanation
```markdown
## Design Decision: Validation Before Persistence
CreateUser validates email format before database operations to:
1. Fail fast - avoid unnecessary database round-trips
2. Provide clear error messages - users get immediate feedback
3. Maintain data quality - only valid emails in database
Email validation is separate from UserID validation because emails
may need external verification (MX record checks) in the future,
while UserIDs are purely format-based.
```
### ❌ Feature List Without Purpose
```markdown
## Components
- UserID type
- Email type
- UserService
- Repository interface
- Notifier interface
```
*Why bad?*: No explanation of relationships or rationale
### ✅ Purpose-Driven Structure
```markdown
## Architecture
### Type Safety Layer (Primitive Obsession Prevention)
- **UserID**: Self-validating identifier (prevents empty/malformed IDs)
- **Email**: Self-validating email (prevents invalid formats, RFC 5322)
These types ensure validation happens once at construction, not repeatedly
throughout the codebase.
### Business Logic Layer
- **UserService**: Orchestrates user operations (creation, authentication)
- Depends on Repository for persistence
- Depends on Notifier for communication
- Contains no infrastructure code (pure business logic)
### Abstraction Layer (Dependency Inversion)
- **Repository interface**: Abstracts persistence (allows multiple backends)
- **Notifier interface**: Abstracts communication (email, SMS, push)
This vertical slice structure keeps all user logic contained in one package,
following the principle: "group by feature and role, not technical layer."
```
### ❌ Code Dump as "Example"
```markdown
## Usage
See user_test.go for usage examples.
```
*Why bad?*: Forces reader to hunt through test code
### ✅ Inline Runnable Examples
````markdown
## Basic Usage
Creating a new user with validated types:
```go
package main

import (
    "context"
    "fmt"

    "github.com/yourorg/project/user"
)

func main() {
    // Create validated types
    id, err := user.NewUserID("usr_12345")
    if err != nil {
        panic(err) // Invalid ID format
    }
    email, err := user.NewEmail("alice@example.com")
    if err != nil {
        panic(err) // Invalid email format
    }
    // Create user service (db and smtpConfig configured elsewhere)
    repo := user.NewPostgresRepository(db)
    notifier := user.NewEmailNotifier(smtpConfig)
    svc, _ := user.NewUserService(repo, notifier)
    // Create user
    u := user.User{
        ID:    id,
        Email: email,
        Name:  "Alice",
    }
    err = svc.CreateUser(context.Background(), u)
    if err != nil {
        fmt.Printf("Failed to create user: %v\n", err)
    }
}
```
````
## Quality Gates
Before considering documentation complete, verify:
### Clarity Test
- Can someone unfamiliar with the code read this and understand the feature?
- Are design decisions explained, not just described?
- Is technical jargon explained or avoided?
### AI Test
- Can AI use this to fix a bug without reading all implementation code?
- Are integration points clearly documented?
- Are invariants and assumptions explicit?
### Maintenance Test
- If the feature needs extension, is it clear where to add code?
- Are patterns documented so new code matches existing style?
- Are limitations and future considerations noted?
### Example Test
- Can examples be copy-pasted and run with minimal setup?
- Do examples demonstrate real-world usage patterns?
- Are edge cases covered in advanced examples?
## Common Documentation Scenarios
### Scenario 1: New Domain Type
Document:
- Why this type exists (what primitive obsession does it prevent?)
- What it validates
- How to construct it
- Where it's used in the system
### Scenario 2: New Service/Orchestrator
Document:
- What business operations it provides
- What dependencies it requires (and why)
- How it fits into existing architecture
- Integration points with other services
### Scenario 3: New Integration Point
Document:
- What external system/service is integrated
- Why this integration exists
- How data flows in/out
- Error handling strategy
- Retry/fallback behavior
### Scenario 4: Refactored Architecture
Document:
- What problem the refactor solved
- What changed architecturally
- Why this approach was chosen
- Migration notes (if applicable)
- Before/after comparison
## Testable Examples Best Practices
### When to Add Example Functions
- Complex types with non-obvious usage
- Types with validation rules
- Common use case patterns
- Non-trivial workflows
### Example Function Structure
```go
// ExampleTypeName_scenario describes what this example demonstrates.
// (Suffixes after "_" must start lowercase, or go vet flags the example as malformed.)
func ExampleTypeName_scenario() {
// Setup (minimal)
input := "example input"
// Usage (the point of the example)
result, err := SomeFunction(input)
if err != nil {
fmt.Printf("Error: %v\n", err)
return
}
// Output (demonstrating result)
fmt.Println(result)
// Output: expected output
}
```
### Multiple Examples for Same Type
```go
// ExampleUserID shows basic UserID creation.
func ExampleUserID() {
	id, _ := user.NewUserID("usr_123")
	fmt.Println(id)
	// Output: usr_123
}

// ExampleUserID_validation shows validation behavior.
func ExampleUserID_validation() {
	_, err := user.NewUserID("")
	fmt.Println(err != nil)
	// Output: true
}

// ExampleUserID_invalidFormat shows error handling.
func ExampleUserID_invalidFormat() {
	_, err := user.NewUserID("invalid")
	if err != nil {
		fmt.Println("validation failed")
	}
	// Output: validation failed
}
```
## Documentation Location Strategy
### Feature Documentation
- Location: `docs/[feature-name].md`
- One file per major feature
- Name should match package/feature name
### Package Documentation
- Location: Package-level godoc in main `.go` file
- Example: `user/user.go` has package documentation
### Type Documentation
- Location: Godoc on type definition
- Keep close to the code
### Examples
- Location: Test files (`*_test.go`)
- Use `_test` package for external perspective
---
name: linter-driven-development
description: |
Orchestrates complete autopilot workflow: design → test → lint → refactor → review → commit.
AUTO-INVOKES when user wants to implement code: "implement", "ready", "execute", "continue",
"do step X", "next task", "let's go", "start coding". Runs automatically through all phases
until commit-ready. Uses parallel linter+review analysis and intelligent combined reports.
For: features, bug fixes, refactors. Requires: Go project (go.mod).
---
# Linter-Driven Development Workflow
META ORCHESTRATOR for implementation workflow: design → test → lint → refactor → review → commit.
Use for any commit: features, bug fixes, refactors.
## When to Use
- Implementing any code change that should result in a commit
- Need automatic workflow management with quality gates
- Want to ensure: clean code + tests + linting + design validation
## Pre-Flight Check (ALWAYS RUN FIRST)
Before starting the autopilot workflow, verify all conditions are met:
### 1. Confirm Implementation Intent
Look for keywords indicating the user wants to implement code:
- **Direct keywords**: "implement", "ready", "execute", "do", "start", "continue", "next", "build", "create"
- **Step references**: "step 1", "task 2", "next task", "do step X"
- **Explicit invocation**: "@linter-driven-development"
### 2. Verify Go Project
Check that `go.mod` exists in the project root or parent directories.
### 3. Find Project Commands
Discover test and lint commands by reading project documentation:
**Search locations** (in order):
1. Project docs: `README.md`, `CLAUDE.md`, `agents.md`
2. Build configs: `Makefile`, `Taskfile.yaml`, `.golangci.yaml`
3. Git repository root for workspace-level commands
**Extract commands**:
- **Test command**: Look for `go test`, `make test`, `task test`, or similar
- **Lint command**: Look for `golangci-lint run --fix`, `make lint`, `task lintwithfix`, or similar
- **Prefer**: Commands with autofix capability (e.g., `--fix` flag)
**Fallback defaults** (if not found in docs):
- Tests: `go test ./...`
- Linter: `golangci-lint run --fix`
**If fallbacks don't work**:
- Ask user: "What commands should I use for testing and linting?"
- Document discovered commands in project docs for future runs
**Store discovered commands** for use throughout the workflow.
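The discovery step can be approximated with a shell sketch (file names and target names are assumptions; the agent adapts to whatever the project actually documents):

```shell
# Hypothetical command discovery: prefer documented commands,
# fall back to sensible Go defaults.
TEST_CMD="go test ./..."
LINT_CMD="golangci-lint run --fix"

if [ -f Taskfile.yaml ] && grep -q 'lintwithfix:' Taskfile.yaml; then
  LINT_CMD="task lintwithfix"
elif [ -f Makefile ] && grep -q '^lint:' Makefile; then
  LINT_CMD="make lint"
fi

if [ -f Makefile ] && grep -q '^test:' Makefile; then
  TEST_CMD="make test"
fi

echo "test: $TEST_CMD"
echo "lint: $LINT_CMD"
```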
### 4. Identify Plan Context
Scan conversation history (last 50 messages) for:
- Step-by-step implementation plan
- Which step the user wants to implement
- Any design decisions or architectural context
### 5. Decision Tree
**All conditions met → AUTOPILOT ENGAGED**
- Announce: "Engaging autopilot mode for [feature/step description]"
- Proceed directly to Phase 1
**Unclear intent or missing context → ASK FOR CONFIRMATION**
- "I detected you want to implement something. Should I start the autopilot workflow?"
- Clarify which step to implement if multiple options exist
**No plan found → SUGGEST CREATING PLAN FIRST**
- "I don't see an implementation plan. Would you like me to help create one first?"
- Offer to use @code-designing skill for design planning
**Not a Go project → EXPLAIN LIMITATION**
- "This skill requires a Go project with go.mod. Current project doesn't appear to be Go."
## Workflow Phases
### Phase 1: Implementation Foundation
**Design Architecture** (if new types/functions needed):
- Invoke @code-designing skill
- Output: Type design plan with self-validating domain types
- When in plan mode, invoke with plan mode flag
**Write Tests First**:
- Invoke @testing skill for guidance
- Write table-driven tests or testify suites
- Target: 100% coverage on new leaf types
**Implement Code**:
- Follow coding principles from coding_rules.md
- Keep functions <50 LOC, max 2 nesting levels
- Use self-validating types, prevent primitive obsession
- Apply storifying pattern for readable top-level functions
### Phase 2: Quality Analysis (Agent is the Gate)
**Invoke quality-analyzer agent** for parallel quality analysis:
```
Task(subagent_type: "quality-analyzer")
Prompt:
"Analyze code quality for this Go project.
Mode: full
Project commands:
- Test: [PROJECT_TEST_COMMAND from Pre-Flight Check]
- Lint: [PROJECT_LINT_COMMAND from Pre-Flight Check]
Files to analyze:
[list files from: git diff --name-only main...HEAD | grep '\.go$']
Run all quality gates in parallel and return combined analysis."
```
**The quality-analyzer agent automatically**:
- Executes tests, linter, and code review in parallel (40-50% faster)
- Normalizes outputs into common format
- Identifies overlapping issues (same file:line from multiple sources)
- Performs root cause analysis (why multiple issues occur together)
- Prioritizes fixes by impact (issues resolved per fix)
- Returns structured report with one of 4 statuses
**Route based on agent status**:
### Status: TEST_FAILURE → Enter Test Focus Mode
**When**: Agent returns TEST_FAILURE status (tests failed)
**Action**: Focus exclusively on fixing tests before any quality analysis
```
Loop until tests pass:
1. Display test failures from agent report
2. Analyze failure root cause
3. Apply fix to implementation or tests
4. Re-run quality-analyzer (mode: "full")
5. Check status:
- Still TEST_FAILURE → Continue loop
- ISSUES_FOUND or CLEAN_STATE → Exit Test Focus Mode, proceed with new status
Max 10 iterations. If stuck, ask user for guidance.
```
**Why Test Focus Mode**:
- Tests are Gate 1 - nothing else matters if tests fail
- Prevents wasting time on linter/review issues when code doesn't work
- Ensures quality analysis runs on working code
**After tests pass**: Re-run quality-analyzer and continue with new status
### Status: CLEAN_STATE → Skip to Phase 4 (Documentation)
**When**: Agent returns CLEAN_STATE status
- ✅ Tests passed
- ✅ Linter clean (0 errors)
- ✅ Review clean (0 findings)
**Action**: All quality gates passed! Skip the fix loop and proceed directly to Phase 4 (Documentation)
### Status: ISSUES_FOUND → Continue to Phase 3 (Fix Loop)
**When**: Agent returns ISSUES_FOUND status
- ✅ Tests passed
- ❌ Linter has errors OR ⚠️ Review has findings
**Action**: Display agent's combined report and proceed to Phase 3
**Agent Report Contains**:
- 📊 Summary: Tests, linter, review status
- 🎯 Overlapping issues with root cause analysis
- 📋 Isolated issues (single source only)
- 🔢 Prioritized fix order (by impact)
**Example Report** (generated by quality-analyzer agent):
```
═══════════════════════════════════════════════════════
QUALITY ANALYSIS REPORT
Mode: FULL
Files analyzed: 8
═══════════════════════════════════════════════════════
📊 SUMMARY
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Tests: ✅ PASS (coverage: 87%)
Linter: ❌ FAIL (5 errors)
Review: ⚠️ FINDINGS (8 issues: 0 bugs, 3 design, 4 readability, 1 polish)
Total issues: 13 from 3 sources
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
OVERLAPPING ISSUES ANALYSIS
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Found 3 locations with overlapping issues:
┌─────────────────────────────────────────────────────┐
│ pkg/parser.go:45 - function Parse │
│ OVERLAPPING (4 issues): │
│ │
│ ⚠️ Linter: Cognitive complexity 18 (>15) │
│ ⚠️ Linter: Function length 58 statements (>50) │
│ 🔴 Review: Mixed abstraction levels │
│ 🔴 Review: Defensive null checking │
│ │
│ 🎯 ROOT CAUSE: │
│ Function handles multiple responsibilities at │
│ different abstraction levels (parsing, validation, │
│ building result). │
│ │
│ Impact: HIGH (4 issues) | Complexity: MODERATE │
│ Priority: #1 CRITICAL │
└─────────────────────────────────────────────────────┘
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
PRIORITIZED FIX ORDER
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Priority #1: pkg/parser.go:45 (4 issues, HIGH impact)
Priority #2: pkg/validator.go:23 (3 issues, HIGH impact)
Priority #3: pkg/handler.go:67 (2 issues, MEDIUM impact)
Isolated issues: 4 (fix individually)
Total fix targets: 3 overlapping groups + 4 isolated = 7 fixes
STATUS: ISSUES_FOUND
```
### Status: TOOLS_UNAVAILABLE → Report Error
**When**: Agent returns TOOLS_UNAVAILABLE status (missing tools)
**Action**: Display agent report with missing tools and suggestions, ask user to install tools
### Phase 3: Iterative Fix Loop
**For each prioritized fix** (from agent's report):
1. **Apply Fix**:
- Invoke @refactoring skill with:
* File and function to fix
* All issues in that area (from agent's overlapping groups or isolated issues)
* Root cause analysis from agent (if available)
* Expected outcome
- @refactoring applies appropriate patterns:
* Early returns (reduce nesting)
* Extract function (break complexity)
* Storifying (uniform abstractions)
* Extract type (create domain types)
* Switch extraction (categorize cases)
* Extract constant (remove magic numbers)
2. **Verify Fix with Quality-Analyzer Agent (Incremental Mode)**:
```
Task(subagent_type: "quality-analyzer")
Prompt:
"Re-analyze code quality after refactoring.
Mode: incremental
Project commands:
- Test: [PROJECT_TEST_COMMAND]
- Lint: [PROJECT_LINT_COMMAND]
Files to analyze (changed):
[list files from: git diff --name-only HEAD~1 HEAD | grep '\.go$']
Previous findings:
[paste findings from previous quality-analyzer report]
Run quality gates and return delta report (what changed)."
```
**Agent returns delta report with status**:
- ✅ Fixed: Issues resolved since last run
- ⚠️ Remaining: Issues still present
- 🆕 New: Issues introduced by recent changes
3. **Route Based on Agent Status**:
**If status = TEST_FAILURE**:
- → Enter Test Focus Mode (refactoring broke tests)
- Loop until tests pass (same as Phase 2)
- Continue with new status
**If status = CLEAN_STATE**:
- → All issues resolved! Break out of fix loop
- Continue to Phase 4 (Documentation)
**If status = ISSUES_FOUND**:
- Check delta report for progress:
* ✅ If issues were fixed → Continue to next fix
* ⚠️ If no progress → Analyze why, try different approach
* 🆕 If new issues introduced → Fix them first
4. **Safety Limits**:
- Max 10 iterations per fix loop
- IF stuck (no progress after 3 attempts):
* Show current status and delta report
* Ask user for guidance
* User can review: `git diff`
**Example Delta Report** (from quality-analyzer agent):
```
═══════════════════════════════════════════════════════
QUALITY ANALYSIS DELTA REPORT
Mode: INCREMENTAL
Files re-analyzed: 1 (changed since last run)
═══════════════════════════════════════════════════════
📊 SUMMARY
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Tests: ✅ PASS (coverage: 89% ↑)
Linter: ✅ PASS (0 errors)
Review: ✅ CLEAN (0 findings)
✅ Fixed: 4 issues from pkg/parser.go:45
⚠️ Remaining: 0 issues
🆕 New: 0 issues introduced
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
RESOLUTION DETAILS
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
✅ FIXED:
pkg/parser.go:45 | Linter | Cognitive complexity (was 18, now 8)
pkg/parser.go:45 | Linter | Function length (was 58, now 25)
pkg/parser.go:45 | Review | Mixed abstraction levels (resolved)
pkg/parser.go:45 | Review | Defensive null checking (resolved)
STATUS: CLEAN_STATE ✅
Ready to proceed with next fix or move to documentation phase.
```
**Loop until agent returns CLEAN_STATE**:
- ✅ Tests pass
- ✅ Linter clean
- ✅ Review clean
### Phase 4: Documentation
Invoke @documentation skill:
1. Add/update package-level godoc
2. Add/update type-level documentation
3. Add/update function documentation (WHY not WHAT)
4. Add godoc testable examples (Example functions)
5. IF last plan step:
- Add feature documentation to docs/ folder
**Verify**:
- Run: `go doc -all ./...`
- Ensure examples compile
- Check documentation coverage
### Phase 5: Commit Ready
Generate comprehensive summary with suggested commit message.
- Linter passes ✅
- Tests pass with coverage ✅
- Design review complete ✅
- Documentation complete ✅
- Present commit message suggestion
## Output Format
```
📋 COMMIT READINESS SUMMARY
✅ Linter: Passed (0 issues)
✅ Tests: 95% coverage (2 new types, 15 test cases)
⚠️ Design Review: 3 findings (see below)
🎯 COMMIT SCOPE
Modified:
- user/service.go (+45, -12 lines)
- user/repository.go (+23, -5 lines)
Added:
- user/user_id.go (new type: UserID)
- user/email.go (new type: Email)
Tests:
- user/service_test.go (+120 lines)
- user/user_id_test.go (new)
- user/email_test.go (new)
⚠️ DESIGN REVIEW FINDINGS
🔴 DESIGN DEBT (Recommended to fix):
- user/service.go:45 - Primitive obsession detected
Current: func GetUserByID(id string) (*User, error)
Better: func GetUserByID(id UserID) (*User, error)
Why: Type safety, validation guarantee, prevents invalid IDs
Fix: Use @code-designing to convert remaining string usages
🟡 READABILITY DEBT (Consider fixing):
- user/service.go:78 - Mixed abstraction levels in CreateUser
Function mixes high-level steps with low-level validation details
Why: Harder to understand flow at a glance
Fix: Use @refactoring to extract validation helpers
🟢 POLISH OPPORTUNITIES:
- user/repository.go:34 - Function naming could be more idiomatic
SaveUser → Save (method receiver provides context)
📝 BROADER CONTEXT:
While reviewing user/service.go, noticed 3 more instances of string-based
IDs throughout the file (lines 120, 145, 203). Consider refactoring the
entire file to use UserID consistently for better type safety.
💡 SUGGESTED COMMIT MESSAGE
Add self-validating UserID and Email types
- Introduce UserID type with validation (prevents empty IDs)
- Introduce Email type with RFC 5322 validation
- Refactor CreateUser to use new types
- Achieve 95% test coverage with real repository implementation
Follows vertical slice architecture and primitive obsession principles.
────────────────────────────────────────
Would you like to:
1. Commit as-is (ignore design findings)
2. Fix design debt only (🔴), then commit
3. Fix design + readability debt (🔴 + 🟡), then commit
4. Fix all findings (🔴 🟡 🟢), then commit
5. Refactor entire file (address broader context), then commit
```
## Workflow Control
**Sequential Phases**: Each phase depends on previous phase completion
- Phase 1: Design and implementation must complete before quality analysis
- Phase 2: Quality analysis (via quality-analyzer agent) determines next phase
- Phase 3: Fix loop continues until all issues resolved (agent returns CLEAN_STATE)
- Phase 4: Documentation only after all quality gates pass
- Phase 5: Commit ready summary presented to user
**Status-Based Routing**: Agent determines workflow path
- **TEST_FAILURE** → Test Focus Mode (fix tests, retry Phase 2)
- **CLEAN_STATE** → Skip fix loop, go directly to documentation
- **ISSUES_FOUND** → Enter fix loop (Phase 3)
- **TOOLS_UNAVAILABLE** → Report error, ask user to install tools
**Parallel Execution**: Phase 2 and fix verification run 3 tools simultaneously (40-50% faster)
**Incremental Review**: After first run, agent only analyzes changed files for faster iteration
## Integration with Other Skills and Agents
This orchestrator **invokes** other skills and agents automatically:
**Skills**:
- @code-designing (Phase 1, if needed for type design)
- @testing (Phase 1, principles applied)
- @refactoring (Phase 3, when issues found)
- @documentation (Phase 4, always)
**Agents**:
- quality-analyzer agent (Phase 2 and Phase 3 verification)
- Internally delegates to go-code-reviewer agent for design analysis
- Executes tests and linter in parallel
- Returns intelligent combined reports with overlapping issue detection
After committing, consider:
- If feature complete → Feature fully documented in Phase 4
- If more work needed → Run this workflow again for next commit
# Linter-Driven Development Reference
This meta-orchestrator skill coordinates other skills. See individual skill reference files for detailed principles:
## Phase-Specific References
### Phase 1: Design
See: **code-designing/reference.md**
- Type design principles
- Primitive obsession prevention
- Self-validating types
- Vertical slice architecture
### Phase 2: Implementation & Testing
See: **testing/reference.md**
- Testing principles
- Table-driven tests
- Testify suites
- Real implementations over mocks
### Phase 3: Linter & Refactoring
See: **refactoring/reference.md**
- Linter signal interpretation
- Refactoring patterns
- Complexity reduction strategies
### Phase 4: Pre-Commit Review
See: **pre-commit-review/reference.md**
- Design principles checklist
- Debt categorization
- Review process
## Linter Commands
### Primary Command
```bash
task lintwithfix
```
Runs:
1. `go vet` - Static analysis
2. `golangci-lint fmt` - Format code
3. `golangci-lint run --fix` - Lint with auto-fix
### Fallback (if no taskfile)
```bash
golangci-lint run --fix
```
### Configuration
- Config file: `.golangci.yaml` in project root
- Always use golangci-lint v2
- Reference: https://github.com/golangci/golangci-lint/blob/HEAD/.golangci.reference.yml
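A minimal v2-style configuration sketch (the linter selection here is illustrative only; consult the reference file above for the full schema):

```yaml
# .golangci.yaml — illustrative v2 config, not a recommended set
version: "2"
linters:
  enable:
    - cyclop    # cyclomatic complexity
    - gocognit  # cognitive complexity
    - funlen    # function length
```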
## Linter Failure Signals
### Cyclomatic Complexity
**Signal**: Function too complex (too many decision points)
**Action**: Extract functions, simplify logic flow
**Skill**: @refactoring
### Cognitive Complexity
**Signal**: Function hard to understand (nested logic, mixed abstractions)
**Action**: Storifying, extract helpers, clarify abstraction levels
**Skill**: @refactoring
### Maintainability Index
**Signal**: Code difficult to maintain
**Action**: Break into smaller pieces, improve naming, reduce coupling
**Skill**: @refactoring + potentially @code-designing
## Coverage Targets
### Leaf Types
- **Target**: 100% unit test coverage
- **Why**: Leaf types contain core logic, must be bulletproof
- **Test**: Only the public API, from an external `_test` package
### Orchestrating Types
- **Target**: Integration test coverage
- **Why**: Test seams between components
- **Test**: Can overlap with leaf type coverage
## Commit Readiness Criteria
All must be true:
- ✅ Linter passes with 0 issues
- ✅ Tests pass
- ✅ Target coverage achieved (100% for leaf types)
- ✅ Design review complete (advisory, but acknowledged)
## Next Steps After Commit
### Feature Complete?
→ Invoke @documentation skill to create feature docs
### More Work Needed?
→ Run @linter-driven-development again for next commit
### Found Broader Issues During Review?
→ Create new task to address technical debt
---
name: pre-commit-review
description: ADVISORY validation of code against design principles that linters cannot enforce. Use after linter passes and tests pass to validate design quality. Categorizes findings as Design Debt, Readability Debt, or Polish Opportunities. Does NOT block commits.
---
# Pre-Commit Design Review
Expert design analysis that detects issues linters can't catch. Returns detailed report to caller with categorized findings and fix recommendations.
## What This Skill Does
**Pure Analysis & Reporting** - Generates report, doesn't fix anything or invoke skills.
### Input
- Files to review (specific files or all staged changes)
- Review mode: `full` (first run) or `incremental` (subsequent runs)
- Previous findings (optional, for incremental mode)
- Context (invoked by refactoring, orchestrator, subagent, or user)
### Output
- Structured report with categorized findings
- Each finding: `file:line`, issue, why it matters, fix strategy, effort estimate
- Prioritized by impact and effort
- Format: Parseable for combined analysis (when invoked by orchestrator)
### Invocation Modes
**1. Direct Skill Invocation** (User or Orchestrator)
- Full control, can invoke other skills
- Can make changes based on findings
- Interactive mode with user feedback
**2. Subagent Mode** (Task tool with go-code-reviewer)
- Read-only analysis, returns report only
- Cannot invoke other skills
- Used for parallel execution by orchestrator
- Designed for speed and focused analysis
### What Reviewer Detects (That Linters Can't)
- Primitive obsession (with juiciness scoring)
- Unstorified functions (mixed abstraction levels)
- Missing domain concepts (implicit types that should be explicit)
- Non-self-validating types (defensive code in methods)
- Poor comment quality (explaining what instead of why)
- File structure issues (too long, too many types)
- Generic package extraction opportunities
- Design bugs (nil deref, ignored errors, resource leaks)
- Test quality (weak assertions, missing use cases, mock overuse, conditionals in tests)
**See [reference.md](./reference.md) for complete detection checklist with examples**
## Who Invokes This Skill
1. **@refactoring skill** - After applying patterns, validates design quality remains high
2. **@linter-driven-development** - Phase 2 quality analysis, validates design quality in parallel with tests and linting
3. **User** - Manual code review before commit
## Workflow
### Full Review Mode (First Run)
```
1. Read all files under review (using Read tool)
2. Apply design principles checklist from reference.md (LLM reasoning)
3. Search for usage patterns across codebase (using Grep tool)
4. Categorize findings:
🐛 Bugs (nil deref, ignored errors, resource leaks)
🔴 Design Debt (types, architecture, validation)
🟡 Readability Debt (abstraction, flow, clarity)
🟢 Polish (naming, docs, minor improvements)
5. Generate structured report with recommendations
6. Return report to caller (doesn't invoke other skills or make fixes)
```
### Incremental Review Mode (Subsequent Runs)
Used after fixes have been applied to verify resolution and detect new issues.
```
1. Read ONLY changed files since last review (using git diff)
2. Compare against previous findings:
- Mark resolved issues as ✅ Fixed
- Identify issues that still exist
3. Analyze changed code for NEW issues introduced by fixes
4. Generate delta report:
- ✅ Fixed: Issues from previous run that are now resolved
- ⚠️ Remaining: Issues that still need attention
- 🆕 New: Issues introduced by recent changes
5. Return concise delta report (not full analysis)
```
**When to Use Incremental Mode:**
- After @refactoring skill applies fixes
- During the iterative fix loop (Phase 3) of the autopilot workflow
- User requests re-review after making changes
**Benefits:**
- Faster execution (only analyzes changed files)
- Clear feedback on what was fixed vs what remains
- Detects regressions introduced by fixes
## Detection Approach
**LLM-Powered Analysis** (not AST parsing or metrics calculation):
The reviewer reads code like a senior developer and applies design principles:
- Reads files with Read tool
- Searches patterns with Grep tool (find usages, duplications)
- Applies checklist from reference.md using LLM reasoning
- Pattern matches against anti-patterns
- Counts occurrences and calculates juiciness scores
- Generates findings with specific locations and fix guidance
**Division of Labor:**
- **Linter handles**: Complexity metrics, line counts, formatting, syntax
- **Reviewer handles**: Design patterns, domain modeling, conceptual issues
## Report Format
### Full Report (First Run)
```
📊 CODE REVIEW REPORT
Scope: [files reviewed]
Mode: FULL
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
SUMMARY
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Total findings: 18
🐛 Bugs: 2 (fix immediately)
🔴 Design Debt: 5 (fix before commit)
🟡 Readability Debt: 8 (improves maintainability)
🟢 Polish: 3 (nice to have)
Estimated fix effort: 3.5 hours
[Detailed findings by category]
[Recommendations by priority]
[Skills to use for fixes]
```
### Incremental Report (Subsequent Runs)
```
📊 CODE REVIEW DELTA REPORT
Scope: [changed files only]
Mode: INCREMENTAL
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
SUMMARY
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
✅ Fixed: 4 (resolved from previous run)
⚠️ Remaining: 2 (still need attention)
🆕 New: 1 (introduced by recent changes)
[Detailed delta findings]
```
### Structured Output for Orchestrator Parsing
When invoked as subagent for combined analysis, output follows strict format:
```
🐛 BUGS
────────────────────────────────────────────────
file:line | Issue description | Why it matters | Fix strategy | Effort: [Trivial/Moderate/Significant]
🔴 DESIGN DEBT
────────────────────────────────────────────────
file:line | Issue description | Why it matters | Fix strategy | Effort: [Trivial/Moderate/Significant]
🟡 READABILITY DEBT
────────────────────────────────────────────────
file:line | Issue description | Why it matters | Fix strategy | Effort: [Trivial/Moderate/Significant]
🟢 POLISH
────────────────────────────────────────────────
file:line | Issue description | Why it matters | Fix strategy | Effort: [Trivial/Moderate/Significant]
```
**Effort Estimates:**
- **Trivial**: <5 minutes (extract constant, rename variable)
- **Moderate**: 5-20 minutes (extract function, storifying, create simple type)
- **Significant**: >20 minutes (architectural refactoring, complex type extraction)
**file:line Format:** Must be exact for orchestrator to correlate with linter errors
- Example: `pkg/parser.go:45`
- NOT: `parser.go line 45` or `pkg/parser.go (line 45)`
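A parsing sketch may help consumers of this format. The `Finding` struct and `ParseFinding` helper below are hypothetical — the orchestrator's actual parsing is not specified here — but they show how the pipe-delimited layout decomposes:

```go
package main

import (
	"fmt"
	"strings"
)

// Finding is one parsed line from the structured report.
// Hypothetical type: the orchestrator's real model may differ.
type Finding struct {
	File   string
	Line   string
	Issue  string
	Why    string
	Fix    string
	Effort string
}

// ParseFinding splits "file:line | Issue | Why | Fix | Effort: X"
// into its components.
func ParseFinding(s string) (Finding, error) {
	parts := strings.Split(s, " | ")
	if len(parts) != 5 {
		return Finding{}, fmt.Errorf("expected 5 fields, got %d", len(parts))
	}
	loc := strings.SplitN(parts[0], ":", 2)
	if len(loc) != 2 {
		return Finding{}, fmt.Errorf("invalid location %q", parts[0])
	}
	return Finding{
		File:   loc[0],
		Line:   loc[1],
		Issue:  parts[1],
		Why:    parts[2],
		Fix:    parts[3],
		Effort: strings.TrimPrefix(parts[4], "Effort: "),
	}, nil
}

func main() {
	f, err := ParseFinding("pkg/parser.go:45 | Primitive obsession | Type safety | Create type | Effort: Moderate")
	if err != nil {
		panic(err)
	}
	fmt.Println(f.File, f.Line, f.Effort)
}
```

This is also why the `file:line` format must be exact — a loose format like `parser.go (line 45)` breaks mechanical correlation with linter output.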
**See [examples.md](./examples.md) for complete report examples**
## What This Skill Does NOT Do
- ❌ Invoke other skills (@refactoring, @code-designing, @testing)
- ❌ Fix anything or make code changes
- ❌ Make decisions on behalf of user
- ❌ Parse AST or calculate complexity metrics (linter does this)
- ❌ Run linter (caller does this)
- ❌ Iterate or loop (caller decides whether to re-invoke)
- ❌ Block commits (findings are advisory)
## Integration with Other Skills
### Invoked by @refactoring
```
Refactoring completes → invoke reviewer → analyze report:
- Bugs found? → Fix immediately, re-run linter
- Design debt found? → Apply another refactoring pattern
- All clean? → Return success to orchestrator
```
### Invoked by @linter-driven-development
```
Phase 4 (after linter passes):
1. Invoke reviewer on all staged changes
2. Receive categorized report
3. Present findings to user with options:
- Commit as-is (accept debt knowingly)
- Fix critical issues only (bugs + design debt)
- Fix all recommended (bugs + design + readability)
- Fix everything (including polish)
4. Based on user choice:
- Invoke @refactoring or @code-designing for chosen fixes
- Return to Phase 3 (linter loop)
- Iterate until user satisfied
```
### Invoked by User
```
Manual review request:
1. User invokes: @pre-commit-review on path/to/file.go
2. Receive detailed report
3. User decides how to proceed
4. User may invoke @refactoring or @code-designing for fixes
```
## Review Scope
**Primary Scope**: Changed code in commit
- All modified lines
- All new files
- Specific focus on design principle adherence
**Secondary Scope**: Context around changes
- Entire files containing modifications
- Flag patterns/issues outside commit scope (in BROADER CONTEXT section)
- Suggest broader refactoring opportunities if valuable
## Advisory Nature
**This review does NOT block commits.**
Purpose:
- ✅ Provide visibility into design quality
- ✅ Offer concrete improvement suggestions with examples
- ✅ Help maintain coding principles
- ✅ Guide refactoring decisions
Caller (or user) decides:
- Commit as-is (accept debt knowingly)
- Fix critical debt before commit (bugs, major design issues)
- Fix all debt before commit (comprehensive cleanup)
- Expand scope to broader refactor (when broader context issues found)
## Finding Categories
Findings are categorized by technical debt type and severity:
### 🐛 Bugs
**Will cause runtime failures or correctness issues**
- Nil dereferences, ignored errors, resource leaks
- Invalid nil returns, race conditions
- Fix immediately before any other work
### 🔴 Design Debt
**Will cause pain when extending/modifying code**
- Primitive obsession, missing domain types
- Non-self-validating types
- Wrong architecture (horizontal vs vertical)
- Fix before commit recommended
### 🟡 Readability Debt
**Makes code harder to understand and work with**
- Mixed abstraction levels, not storified
- Functions too long or complex
- Poor naming, unclear intent
- Fix improves team productivity
### 🟢 Polish Opportunities
**Minor improvements for consistency and quality**
- Non-idiomatic naming, missing examples
- Comment improvements, minor refactoring
- Low priority, nice to have
**See [reference.md](./reference.md) for detailed principles and examples for each category**
## Key Capabilities
**Detects 8 Issue Categories:**
1. Primitive Obsession - with juiciness scoring algorithm
2. Storifying - detects mixed abstraction levels
3. Missing Domain Concepts - identifies implicit types
4. Self-Validating Types - finds defensive code patterns
5. Comment Quality - analyzes what vs why
6. File Structure - checks size and responsibility boundaries
7. Testing Approach - validates test structure and quality
8. Design Bugs - catches common runtime issues
**For complete detection patterns and examples, see [reference.md](./reference.md)**
**For real-world review scenarios, see [examples.md](./examples.md)**
## Integration with Orchestrator
This skill is automatically invoked by @linter-driven-development workflow:
- **Phase 4**: Design review after linter passes
- **Iterative**: Re-invoked after fixes until clean or user accepts debt
- **Advisory**: Never blocks, always presents options
See [linter-driven-development workflow](../linter-driven-development/SKILL.md) for complete flow.

# Pre-Commit Review Examples
Real-world examples of design review findings with before/after comparisons.
## Example 0: Detection Workflow
Shows how the pre-commit-review skill analyzes code step-by-step using LLM reasoning.
### Input: user/service.go
```go
package user
type UserService struct {
Repo Repository
}
func (s *UserService) CreateUser(id string, email string) error {
// Validate ID
if id == "" {
return errors.New("empty id")
}
// Validate email
if !strings.Contains(email, "@") {
return errors.New("invalid email")
}
// Open DB
db, err := sql.Open("postgres", connStr)
if err != nil {
return err
}
defer db.Close()
// Build query
query := "INSERT INTO users (id, email) VALUES (?, ?)"
_, err = db.Exec(query, id, email)
if err != nil {
return err
}
// Send email
_ = sendEmail(email, "Welcome!")
return nil
}
```
### LLM Analysis Step-by-Step
**Step 1: Apply Primitive Obsession Check (reference.md #1)**
Questions LLM asks:
- Q: Are primitives being validated?
- A: Yes - 'id' string validated at line 9, 'email' string validated at line 14
- Q: Is same validation repeated elsewhere?
- A: [Uses Grep tool to search codebase]
- Found in handler.go:23 (same id validation)
- Found in repository.go:45 (same email validation)
- Q: Would types add value beyond validation?
- A: Yes
- Email: Complex validation pattern, domain concept
- UserID: Validation scattered, type safety benefit
- Q: Calculate juiciness score (see reference.md):
```
UserID:
- Validation in 3 places (+1)
- Simple empty check (+0)
- Used in 8 locations (+2)
Score: 3/10 → MEDIUM priority
Email:
- Validation in 3 places (+1)
- Regex validation possible (+3)
- Domain/LocalPart methods (+2)
- Used in 8 locations (+2)
Score: 8/10 → HIGH priority
```
**Finding identified:**
```
🔴 Design Debt: Primitive obsession
- string id → UserID type (MEDIUM juiciness)
- string email → Email type (HIGH juiciness)
```
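The scoring walkthrough above can be sketched as code. The weights below are assumptions inferred from this one example — the skill applies the rubric through LLM judgment, not a formula:

```go
package main

import "fmt"

// JuicinessInput captures the signals the reviewer gathers per candidate type.
// The weights are illustrative, reverse-engineered from the worked example.
type JuicinessInput struct {
	ValidationSites   int  // places the same validation is repeated
	ComplexValidation bool // regex, ranges, business rules possible
	MethodCount       int  // meaningful methods the type would carry
	UsageSites        int  // locations that would use the type
}

// Score returns a 0-10 juiciness estimate.
func (j JuicinessInput) Score() int {
	score := 0
	if j.ValidationSites >= 2 {
		score++ // validation repeated (+1)
	}
	if j.ComplexValidation {
		score += 3 // complex validation possible (+3)
	}
	if j.MethodCount >= 2 {
		score += 2 // rich behavior (+2)
	}
	if j.UsageSites >= 5 {
		score += 2 // widely used (+2)
	}
	if score > 10 {
		score = 10
	}
	return score
}

func main() {
	email := JuicinessInput{ValidationSites: 3, ComplexValidation: true, MethodCount: 2, UsageSites: 8}
	userID := JuicinessInput{ValidationSites: 3, UsageSites: 8}
	fmt.Println(email.Score(), userID.Score()) // 8 3
}
```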
---
**Step 2: Apply Storifying Check (reference.md #2)**
Questions LLM asks:
- Q: Does function read like a story?
- A: No - mixes abstraction levels:
- Lines 7-16: High-level validation
- Lines 18-23: Low-level DB connection setup
- Lines 25-31: SQL implementation details
- Lines 33-34: High-level notification
- Q: Can I explain flow in 3-5 clear steps?
- A: Should be: validateUser() → saveUser() → notifyUser()
But currently mixes infrastructure and business logic
**Finding identified:**
```
🟡 Readability Debt: Function not storified
Mixed abstraction levels in CreateUser
Should extract: validateUser(), saveUser(), notifyUser()
```
---
**Step 3: Apply Self-Validating Types Check (reference.md #3)**
Questions LLM asks:
- Q: Do methods check if struct fields are nil/invalid?
- A: No checks in this method, but...
- Q: Could UserService.Repo be nil?
- A: Yes - struct can be created without validation
```go
svc := &UserService{} // Repo is nil!
svc.CreateUser(...) // Will panic if we use Repo
```
- Q: Is there a constructor with validation?
- A: [Searches codebase] No NewUserService constructor found
**Finding identified:**
```
🔴 Design Debt: Non-self-validating type
UserService can be created with nil Repo
Need NewUserService constructor
```
---
**Step 4: Apply Design Bugs Check (reference.md #8)**
Questions LLM asks:
- Q: Potential nil dereferences?
- A: Not obvious in this function (but found UserService.Repo issue above)
- Q: Are errors ignored?
- A: YES! Line 34: `_ = sendEmail(email, "Welcome!")`
Silent failure, no visibility into delivery failures
- Q: Resources cleaned up?
- A: Yes, defer db.Close() at line 23 ✓
- Q: Context propagated?
- A: No ctx parameter, but no downstream calls needing it
**Finding identified:**
```
🐛 Bug: Ignored error
Line 34: _ = sendEmail() silently ignores failure
Should at least log the error
```
---
**Step 5: Apply Other Checks**
- Abstraction Levels (#4): Already covered in storifying
- Architecture (#5): Single file, can't assess pattern
- Naming (#6): Names are clear and idiomatic ✓
- Testing (#7): No test file in scope
- Comment Quality: Missing (no package docs, no function docs)
- File Structure: Single type, reasonable size ✓
---
### Analysis Complete - Categorize & Prioritize
**Bugs (Priority 1 - Fix immediately):**
1. Ignored error at line 34
**Design Debt (Priority 2 - Fix before commit):**
2. Primitive obsession: string email (HIGH juiciness: 8/10)
3. Primitive obsession: string id (MEDIUM juiciness: 3/10)
4. Non-self-validating: UserService
**Readability Debt (Priority 3 - Improves maintainability):**
5. Not storified: CreateUser function
---
### Output Report
```
📊 CODE REVIEW REPORT
Generated: 2025-11-09 15:45:00
Scope: user/service.go (1 file, 40 lines)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
SUMMARY
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Total findings: 5
🐛 Bugs: 1 (fix immediately)
🔴 Design Debt: 3 (fix before commit recommended)
🟡 Readability Debt: 1 (improves maintainability)
🟢 Polish: 0
Estimated fix effort: 57 minutes total
- Critical (bugs + high-juiciness design): 22 min
- Recommended (medium design + readability): 35 min
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
🐛 BUGS (1) - FIX IMMEDIATELY
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
1. Ignored error
Location: user/service.go:34
Code: _ = sendEmail(email, "Welcome!")
Issue: Email sending failure silently ignored
Impact: No visibility into delivery failures, hard to debug
Fix: Log error at minimum:
if err := sendEmail(email, "Welcome!"); err != nil {
log.Printf("failed to send welcome email: %v", err)
}
Better: Return error if critical:
if err := sendEmail(email, "Welcome!"); err != nil {
return fmt.Errorf("send welcome email: %w", err)
}
Effort: 2 min
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
🔴 DESIGN DEBT (3) - FIX BEFORE COMMIT RECOMMENDED
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
1. Primitive obsession: string email (HIGH juiciness: 8/10)
Locations: Line 6, 14, 28, 34
Also found in: handler.go:23, repository.go:45
Juiciness Score: 8/10
- Validation in 3 places (+1)
- Complex regex validation possible (+3)
- Methods: Domain(), LocalPart() (+2)
- Used in 8 locations (+2)
Current:
func CreateUser(id string, email string) error {
if !strings.Contains(email, "@") {
return errors.New("invalid email")
}
// ...
}
Better:
type Email string
func ParseEmail(s string) (Email, error) {
if !emailRegex.MatchString(s) {
return "", fmt.Errorf("invalid email: %s", s)
}
return Email(s), nil
}
func (e Email) Domain() string { /* ... */ }
func (e Email) LocalPart() string { /* ... */ }
func (e Email) String() string { return string(e) }
func CreateUser(id string, email Email) error {
// No validation needed, guaranteed valid
}
Why: Type safety, centralized validation, prevents invalid emails
Fix: Use @code-designing skill → Create Email type
Effort: 20 min
2. Primitive obsession: string id (MEDIUM juiciness: 3/10)
Locations: Line 6, 9, 28
Also found in: handler.go:23, repository.go:45
Juiciness Score: 3/10
- Validation in 3 places (+1)
- Simple empty check (+0)
- Used in 8 locations (+2)
Note: Borderline case. Judgment call on whether to create type.
Better:
type UserID string
func ParseUserID(s string) (UserID, error) {
if s == "" {
return "", errors.New("empty user id")
}
return UserID(s), nil
}
Why: Centralizes validation, type safety
Fix: Use @code-designing skill → Create UserID type
Effort: 10 min
3. Non-self-validating type: UserService
Location: user/service.go:4
Issue: UserService.Repo is public, can be nil
No constructor to validate dependencies
Current:
type UserService struct {
Repo Repository // Can be nil!
}
svc := &UserService{} // Invalid state allowed
Better:
type UserService struct {
repo Repository // Private
}
func NewUserService(repo Repository) (*UserService, error) {
if repo == nil {
return nil, errors.New("repo required")
}
return &UserService{repo: repo}, nil
}
func (s *UserService) CreateUser(id string, email Email) error {
// No nil checks needed - constructor guarantees validity
return s.repo.Save(...)
}
Why: Impossible to create invalid service, eliminates defensive checks
Fix: Use @code-designing skill → Add constructor
Effort: 10 min
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
🟡 READABILITY DEBT (1) - IMPROVES MAINTAINABILITY
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
1. Function not storified: CreateUser
Location: user/service.go:6-36
Issue: Mixes 3 abstraction levels:
- Lines 7-16: High-level validation
- Lines 18-31: Low-level DB connection/SQL
- Lines 33-34: High-level notification
Flow not clear at a glance, hard to test pieces independently.
Better:
func CreateUser(id string, email Email) error {
if err := validateUser(id, email); err != nil {
return err
}
if err := saveUser(id, email); err != nil {
return err
}
if err := notifyUser(email); err != nil {
return err
}
return nil
}
func validateUser(id string, email Email) error { /* ... */ }
func saveUser(id string, email Email) error { /* ... */ }
func notifyUser(email Email) error { /* ... */ }
Why: Reads like a story, testable pieces, clear intent
Fix: Use @refactoring skill → Storifying pattern
Effort: 15 min
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
RECOMMENDATIONS
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Priority 1: Fix immediately (2 min)
☐ Fix ignored error (log or return)
Priority 2: Fix before commit (40 min)
☐ Create Email type (HIGH juiciness) @code-designing
☐ Create UserID type (MEDIUM juiciness) @code-designing
☐ Add NewUserService constructor @code-designing
Priority 3: Improve maintainability (15 min)
☐ Storify CreateUser function @refactoring
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
SKILLS TO USE
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
@code-designing: For creating Email, UserID types and NewUserService
@refactoring: For storifying CreateUser function
Manual: For fixing ignored error (simple change)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
END OF REPORT
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
```
This example demonstrates how the reviewer skill applies the complete checklist from reference.md systematically, using LLM reasoning to detect issues that linters cannot catch.
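The `Email` type recommended in the report could be fleshed out roughly as follows. This is a sketch: the regex is a deliberately simple placeholder, not a full RFC 5322 validator.

```go
package main

import (
	"fmt"
	"regexp"
	"strings"
)

// Placeholder pattern: one local part, one "@", a dotted domain.
var emailRegex = regexp.MustCompile(`^[^@\s]+@[^@\s]+\.[^@\s]+$`)

type Email string

// ParseEmail validates once at the boundary; every Email value
// in the system is then guaranteed well-formed.
func ParseEmail(s string) (Email, error) {
	if !emailRegex.MatchString(s) {
		return "", fmt.Errorf("invalid email: %q", s)
	}
	return Email(s), nil
}

func (e Email) LocalPart() string {
	return strings.SplitN(string(e), "@", 2)[0]
}

func (e Email) Domain() string {
	return strings.SplitN(string(e), "@", 2)[1]
}

func (e Email) String() string { return string(e) }

func main() {
	e, err := ParseEmail("alice@example.com")
	if err != nil {
		panic(err)
	}
	fmt.Println(e.LocalPart(), e.Domain()) // alice example.com

	if _, err := ParseEmail("not-an-email"); err == nil {
		panic("expected validation error")
	}
}
```

Note how `Domain()` and `LocalPart()` can index `SplitN`'s result safely: `ParseEmail` already guarantees exactly one "@" is present.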
---
## Example 1: Primitive Obsession + Self-Validating Types
### Before (Design Debt 🔴)
```go
package user
type UserService struct {
DB *sql.DB // Might be nil
}
func (s *UserService) CreateUser(id string, email string) error {
// Defensive check
if s.DB == nil {
return errors.New("db is nil")
}
// Primitive validation
if id == "" {
return errors.New("id required")
}
if !strings.Contains(email, "@") {
return errors.New("invalid email")
}
// Business logic
_, err := s.DB.Exec("INSERT INTO users (id, email) VALUES ($1, $2)", id, email)
return err
}
```
**Review Findings:**
- 🔴 **Design Debt**: Primitive obsession on `id` and `email`
- 🔴 **Design Debt**: Non-self-validating type (`UserService.DB` might be nil)
### After (No Debt)
```go
package user
type UserID string
type Email string
func NewUserID(s string) (UserID, error) {
if s == "" {
return "", errors.New("id required")
}
return UserID(s), nil
}
func NewEmail(s string) (Email, error) {
if !strings.Contains(s, "@") {
return "", errors.New("invalid email")
}
return Email(s), nil
}
type UserService struct {
db *sql.DB
}
func NewUserService(db *sql.DB) (*UserService, error) {
if db == nil {
return nil, errors.New("db is required")
}
return &UserService{db: db}, nil
}
func (s *UserService) CreateUser(id UserID, email Email) error {
// No validation needed - types guarantee validity
_, err := s.db.Exec("INSERT INTO users (id, email) VALUES ($1, $2)", id, email)
return err
}
```
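A condensed, runnable check of the constructor guarantees above — the types are repeated inline so the sketch stands alone, with the same error messages as the example:

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

type UserID string
type Email string

func NewUserID(s string) (UserID, error) {
	if s == "" {
		return "", errors.New("id required")
	}
	return UserID(s), nil
}

func NewEmail(s string) (Email, error) {
	if !strings.Contains(s, "@") {
		return "", errors.New("invalid email")
	}
	return Email(s), nil
}

func main() {
	// Invalid inputs are rejected at construction time...
	if _, err := NewUserID(""); err == nil {
		panic("expected error for empty id")
	}
	if _, err := NewEmail("no-at-sign"); err == nil {
		panic("expected error for invalid email")
	}

	// ...so downstream code can accept UserID/Email without re-validating.
	id, _ := NewUserID("u-123")
	email, _ := NewEmail("a@b.com")
	fmt.Println(id, email) // u-123 a@b.com
}
```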
---
## Example 2: Mixed Abstraction Levels + Storifying
### Before (Readability Debt 🟡)
```go
func ProcessPayment(orderID string, amount float64) error {
// High-level: validation
if orderID == "" {
return errors.New("invalid order id")
}
if amount <= 0 {
return errors.New("invalid amount")
}
// Low-level: HTTP client setup
client := &http.Client{Timeout: 10 * time.Second}
req, err := http.NewRequest("POST", "https://api.payment.com/charge", nil)
if err != nil {
return err
}
req.Header.Set("Authorization", "Bearer "+os.Getenv("API_KEY"))
req.Header.Set("Content-Type", "application/json")
// Low-level: JSON marshaling
body := map[string]interface{}{
"order_id": orderID,
"amount": amount,
}
jsonBody, err := json.Marshal(body)
if err != nil {
return err
}
req.Body = io.NopCloser(bytes.NewReader(jsonBody))
// Low-level: HTTP call
resp, err := client.Do(req)
if err != nil {
return err
}
defer resp.Body.Close()
// High-level: logging
log.Printf("Payment processed for order %s", orderID)
return nil
}
```
**Review Findings:**
- 🟡 **Readability Debt**: Mixed abstraction levels (business + HTTP details)
- 🟡 **Readability Debt**: Function not storified (hard to see flow)
### After (No Debt)
```go
func ProcessPayment(orderID OrderID, amount Amount) error {
if err := validatePayment(orderID, amount); err != nil {
return err
}
if err := chargePaymentGateway(orderID, amount); err != nil {
return err
}
logPaymentSuccess(orderID)
return nil
}
func validatePayment(orderID OrderID, amount Amount) error {
// Validation logic only (already validated by types, but could have business rules)
return nil
}
func chargePaymentGateway(orderID OrderID, amount Amount) error {
// HTTP client logic encapsulated
client := newPaymentClient()
return client.Charge(orderID, amount)
}
func logPaymentSuccess(orderID OrderID) {
log.Printf("Payment processed for order %s", orderID)
}
```
---
## Example 3: Horizontal Layers → Vertical Slices
### Before (Design Debt 🔴)
```
project/
├── domain/
│ └── user.go
├── service/
│ └── user_service.go
├── repository/
│ └── user_repository.go
└── handler/
└── user_handler.go
```
**Review Finding:**
- 🔴 **Design Debt**: Horizontal layering instead of vertical slices
- Impact: User feature changes require touching 4 different directories
### After (No Debt)
```
project/
└── user/
├── user.go # Domain type
├── service.go # Business logic
├── repository.go # Persistence
├── handler.go # HTTP
└── user_test.go
```
**Benefits:**
- All user-related code in one place
- Easy to understand complete feature
- Independent testing/deployment
---
## Example 4: Generic Naming
### Before (Readability Debt 🟡)
```go
package common
type DataManager struct {
store Storage
}
func (m *DataManager) ProcessData(data interface{}) (interface{}, error) {
// ...
}
func HandleRequest(ctx context.Context, data map[string]interface{}) error {
// ...
}
```
**Review Findings:**
- 🟡 **Readability Debt**: Generic package name (`common`)
- 🟡 **Readability Debt**: Vague type name (`DataManager`)
- 🟡 **Readability Debt**: Meaningless function names (`ProcessData`, `HandleRequest`)
### After (No Debt)
```go
package user
type Service struct {
repo Repository
}
func (s *Service) Create(ctx context.Context, u User) error {
// ...
}
func (s *Service) Authenticate(ctx context.Context, credentials Credentials) (Token, error) {
// ...
}
```
---
## Example 5: Testing Anti-Patterns
### Before (Design Debt 🔴)
```go
package user // Same package
// Testing private function
func TestValidateEmailInternal(t *testing.T) {
assert.True(t, validateEmailInternal("test@example.com"))
}
// Heavy mocking
func TestCreateUser(t *testing.T) {
mockRepo := &MockRepository{}
mockEmailer := &MockEmailer{}
mockRepo.On("Save", mock.Anything).Return(nil)
mockEmailer.On("Send", mock.Anything).Return(nil)
svc := &UserService{
Repo: mockRepo,
Emailer: mockEmailer,
}
err := svc.CreateUser("123", "test@example.com")
assert.NoError(t, err)
mockRepo.AssertExpectations(t)
}
// Flaky with time.Sleep
func TestAsyncOperation(t *testing.T) {
go doAsyncWork()
time.Sleep(100 * time.Millisecond) // ❌ Flaky
assert.True(t, workCompleted)
}
```
**Review Findings:**
- 🔴 **Design Debt**: Testing private methods
- 🔴 **Design Debt**: Using mocks instead of real implementations
- 🔴 **Design Debt**: Flaky test with time.Sleep
### After (No Debt)
```go
package user_test // External package
// Test public API only
func TestService_CreateUser(t *testing.T) {
// Use real implementations
repo := user.NewInMemoryRepository()
emailer := user.NewTestEmailer()
svc, err := user.NewUserService(repo, emailer)
require.NoError(t, err)
id, _ := user.NewUserID("123")
email, _ := user.NewEmail("test@example.com")
u := user.User{ID: id, Email: email}
err = svc.CreateUser(context.Background(), u)
assert.NoError(t, err)
// Verify via public API
retrieved, err := svc.GetUser(context.Background(), id)
assert.NoError(t, err)
assert.Equal(t, email, retrieved.Email)
}
// No flakiness with channels
func TestAsyncOperation(t *testing.T) {
done := make(chan struct{})
go func() {
doAsyncWork()
close(done)
}()
select {
case <-done:
// Success
case <-time.After(1 * time.Second):
t.Fatal("timeout waiting for async work")
}
}
```
---
## Example 6: Complete Commit Review Output
### Scenario
Developer adds user authentication feature with some design issues.
### Review Output
```
📋 COMMIT READINESS SUMMARY
✅ Linter: Passed (0 issues)
✅ Tests: 87% coverage (5 new types, 23 test cases)
⚠️ Design Review: 5 findings (see below)
🎯 COMMIT SCOPE
Modified:
- user/service.go (+120, -30 lines)
- user/auth.go (new file, +85 lines)
Added:
- user/user_id.go (new type: UserID)
- user/password.go (new type: Password)
Tests:
- user/service_test.go (+95 lines)
- user/auth_test.go (new, +140 lines)
⚠️ DESIGN REVIEW FINDINGS
🔴 DESIGN DEBT (Recommended to fix):
1. user/service.go:67 - Primitive obsession on session token
Current: func CreateSession(userID UserID) (string, error)
Better: func CreateSession(userID UserID) (SessionToken, error)
Why: Session tokens should be validated types to prevent empty/invalid tokens
Fix: Use @code-designing to create SessionToken type with validation
2. user/auth.go:34 - Non-self-validating type
Current:
type Authenticator struct {
HashCost int // Could be invalid
}
Better:
func NewAuthenticator(hashCost int) (*Authenticator, error) {
if hashCost < 4 || hashCost > 31 {
return nil, errors.New("invalid hash cost")
}
// ...
}
Why: Constructor should validate, methods shouldn't need defensive checks
Fix: Use @code-designing to add validating constructor
🟡 READABILITY DEBT (Consider fixing):
3. user/auth.go:89 - Mixed abstraction levels in Authenticate()
Function mixes high-level auth flow with low-level bcrypt details
Why: Harder to understand auth logic at a glance
Fix: Use @refactoring to extract password comparison to separate function
4. user/service.go:45 - Function could be storified better
Current: validateAndCreateUser() does validation + creation in one function
Better: Split into validateUser() and createUser() for clarity
Why: Single responsibility, easier to test each part
Fix: Use @refactoring to split responsibilities
🟢 POLISH OPPORTUNITIES:
5. user/auth.go:12 - Less idiomatic naming
Current: ComparePasswordWithHash
Better: PasswordMatches
Why: More concise, Go-style naming
📝 BROADER CONTEXT:
While reviewing user/service.go, noticed email is still stored as string type
(line 23). Consider refactoring to use Email type consistently across the file
for better type safety (similar to UserID change in this commit).
💡 SUGGESTED COMMIT MESSAGE
Add user authentication with self-validating types
- Introduce UserID and Password self-validating types
- Implement Authenticator with bcrypt password hashing
- Add CreateSession and Authenticate methods
- Achieve 87% test coverage with real bcrypt testing
Follows primitive obsession principles with type-safe IDs and passwords.
────────────────────────────────────────
Would you like to:
1. Commit as-is (5 design findings remain)
2. Fix design debt only (🔴 items 1-2), then commit
3. Fix design + readability debt (🔴🟡 items 1-4), then commit
4. Fix all findings including polish (🔴🟡🟢 all items), then commit
5. Expand scope to refactor email type throughout file, then commit
```

File diff suppressed because it is too large

skills/refactoring/SKILL.md (new file, 759 lines)
---
name: refactoring
description: Linter-driven refactoring patterns to reduce complexity and improve code quality. Use when linter fails with complexity issues (cyclomatic, cognitive, maintainability) or when code feels hard to read/maintain. Applies storifying, type extraction, and function extraction patterns.
---
# Refactoring
Linter-driven refactoring patterns to reduce complexity and improve code quality.
## When to Use
- **Automatically invoked** by @linter-driven-development when linter fails
- **Automatically invoked** by @pre-commit-review when design issues detected
- **Complexity failures**: cyclomatic, cognitive, maintainability index
- **Architectural failures**: noglobals, gochecknoinits, gochecknoglobals
- **Design smell failures**: dupl (duplication), goconst (magic strings), ineffassign
- Functions > 50 LOC or nesting > 2 levels
- Mixed abstraction levels in functions
- Manual invocation when code feels hard to read/maintain
**IMPORTANT**: This skill operates autonomously - no user confirmation needed during execution
## Learning Resources
Choose your learning path:
- **Quick Start**: Use the patterns below for common refactoring cases
- **Complete Reference**: See [reference.md](./reference.md) for full decision tree and all patterns
- **Real-World Examples**: See [examples.md](./examples.md) to learn the refactoring thought process
- [Example 1](./examples.md#example-1-storifying-mixed-abstractions-and-extracting-logic-into-leaf-types): Storifying and extracting a single leaf type
- [Example 2](./examples.md#example-2-primitive-obsession-with-multiple-types-and-storifying-switch-statements): Primitive obsession with multiple types and switch elimination
## Analysis Phase (Automatic)
Before applying any refactoring patterns, the skill automatically analyzes the context:
### System Context Analysis
```
AUTOMATICALLY ANALYZE:
1. Find all callers of the failing function
2. Identify which flows/features depend on it
3. Determine primary responsibility
4. Check for similar functions revealing patterns
5. Spot potential refactoring opportunities
```
### Type Discovery
Proactively identify hidden types in the code:
```
POTENTIAL TYPES TO DISCOVER:
1. Data being parsed from strings → Parse* types
Example: ParseCommandResult(), ParseLogEntry()
2. Scattered validation logic → Validated types
Example: Email, Port, IPAddress types
3. Data that always travels together → Aggregate types
Example: UserCredentials, ServerConfig
4. Complex conditions → State/status types
Example: DeploymentStatus with IsReady(), CanProceed()
5. Repeated string manipulation → Types with methods
Example: FilePath with Dir(), Base(), Ext()
```
### Analysis Output
The analysis produces a refactoring plan identifying:
- Function's role in the system
- Potential domain types to extract
- Recommended refactoring approach
- Expected complexity reduction
## Refactoring Signals
### Linter Failures
**Complexity Issues:**
- **Cyclomatic Complexity**: Too many decision points → Extract functions, simplify logic
- **Cognitive Complexity**: Hard to understand → Storifying, reduce nesting
- **Maintainability Index**: Hard to maintain → Break into smaller pieces
**Architectural Issues:**
- **noglobals/gochecknoglobals**: Global variable usage → Dependency rejection pattern
- **gochecknoinits**: Init function usage → Extract initialization logic
- **Static/singleton patterns**: Hidden dependencies → Inject dependencies
**Design Smells:**
- **dupl**: Code duplication → Extract common logic/types
- **goconst**: Magic strings/numbers → Extract constants or types
- **ineffassign**: Ineffective assignments → Simplify logic
### Code Smells
- Functions > 50 LOC
- Nesting > 2 levels
- Mixed abstraction levels
- Unclear flow/purpose
- Primitive obsession
- Global variable access scattered throughout code
## Workflow (Automatic)
### 1. Receive Linter Failures
Automatically receive failures from @linter-driven-development:
```
user/service.go:45:1: cyclomatic complexity 15 of func `CreateUser` is high (> 10)
user/handler.go:23:1: cognitive complexity 25 of func `HandleRequest` is high (> 15)
```
### 2. Automatic Root Cause Analysis
The skill automatically diagnoses each failure:
- Does this code read like a story? → Apply storifying
- Can this be broken into smaller pieces? → Extract functions/types
- Does logic run on primitives? → Check for primitive obsession
- Is function long due to switch statement? → Extract case handlers
### 3. Automatic Pattern Application
Applies patterns in priority order without user intervention:
- **Early Returns**: Try first (least invasive)
- **Extract Function**: Break up complexity
- **Storifying**: Improve abstraction levels
- **Extract Type**: Create domain types (if juicy)
- **Switch Extraction**: Categorize cases
### 4. Automatic Verification Loop
- Re-run linter automatically
- If still failing, try next pattern
- Continue until linter passes
- Report final results
## Automation Flow
This skill operates completely autonomously once invoked:
### Automatic Iteration Loop
```
AUTOMATED PROCESS:
1. Receive trigger:
- From @linter-driven-development (linter failures)
- From @pre-commit-review (design debt/readability debt)
2. Apply refactoring pattern (start with least invasive)
3. Run linter immediately (no user confirmation)
4. If linter still fails OR review finds more issues:
- Try next pattern in priority order
- Repeat until both linter and review pass
5. If patterns exhausted and still failing:
- Report what was tried
- Suggest file splitting or architectural changes
```
### Pattern Priority Order
Apply patterns based on failure type:
**For Complexity Failures** (cyclomatic, cognitive, maintainability):
```
1. Early Returns → Reduce nesting quickly
2. Extract Function → Break up long functions
3. Storifying → Improve abstraction levels
4. Extract Type → Create domain types (only if "juicy")
5. Switch Extraction → Categorize switch cases
```
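Early returns come first because they reduce nesting without moving code. A sketch with a hypothetical `shipOrder` function — the guard-clause version behaves identically to the nested one:

```go
package main

import (
	"errors"
	"fmt"
)

// Before: three levels of nesting.
func shipOrderNested(paid, inStock, addressOK bool) error {
	if paid {
		if inStock {
			if addressOK {
				return nil // ship
			}
			return errors.New("bad address")
		}
		return errors.New("out of stock")
	}
	return errors.New("unpaid")
}

// After: guard clauses keep the happy path flat.
func shipOrder(paid, inStock, addressOK bool) error {
	if !paid {
		return errors.New("unpaid")
	}
	if !inStock {
		return errors.New("out of stock")
	}
	if !addressOK {
		return errors.New("bad address")
	}
	return nil // ship
}

func main() {
	fmt.Println(shipOrder(true, true, true))  // <nil>
	fmt.Println(shipOrder(true, false, true)) // out of stock
}
```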
**For Architectural Failures** (noglobals, singletons):
```
1. Dependency Rejection → Incremental bottom-up approach
2. Extract Type with dependency injection
3. Push global access up call chain one level
4. Iterate until globals only at entry points (main, handlers)
```
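Dependency rejection can be sketched as a before/after. All names here are illustrative; the "before" shape is shown in the comment:

```go
package main

import "fmt"

// Before: the function reaches for a package-level global directly.
//
//	var defaultLogger = log.New(os.Stderr, "", 0)
//	func processOrder(id string) { defaultLogger.Printf("processing %s", id) }
//
// After: the dependency is injected; concrete wiring lives only at the entry point.

type Logger interface {
	Printf(format string, args ...any)
}

type OrderProcessor struct {
	log Logger
}

func NewOrderProcessor(log Logger) OrderProcessor {
	return OrderProcessor{log: log}
}

func (p OrderProcessor) Process(id string) {
	p.log.Printf("processing %s", id)
}

// capturingLogger records messages; also handy in tests.
type capturingLogger struct{ lines []string }

func (c *capturingLogger) Printf(format string, args ...any) {
	c.lines = append(c.lines, fmt.Sprintf(format, args...))
}

func main() {
	// Only the entry point wires concrete dependencies.
	logger := &capturingLogger{}
	p := NewOrderProcessor(logger)
	p.Process("order-42")
	fmt.Println(logger.lines[0]) // processing order-42
}
```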
**For Design Smells** (dupl, goconst):
```
1. Extract Type → For repeated values or validation
2. Extract Function → For code duplication
3. Extract Constant → For magic strings/numbers
```
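For a `goconst` hit, extracting a typed constant looks roughly like this (names illustrative; the "before" shape is shown in the comment):

```go
package main

import "fmt"

// Before: the magic string "active" is compared in several places (a goconst hit).
//
//	if user.Status == "active" { ... }
//
// After: a typed constant centralizes the value and leaves room for methods later.

type Status string

const (
	StatusActive    Status = "active"
	StatusSuspended Status = "suspended"
)

func canLogin(s Status) bool {
	return s == StatusActive
}

func main() {
	fmt.Println(canLogin(StatusActive), canLogin(StatusSuspended)) // true false
}
```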
### No Manual Intervention
- **NO** asking for confirmation between patterns
- **NO** waiting for user input
- **NO** manual linter runs
- **AUTOMATIC** progression through patterns
- **ONLY** report results at the end
## Refactoring Patterns
### Pattern 1: Storifying (Mixed Abstractions)
**Signal**: Function mixes high-level steps with low-level details
```go
// ❌ Before - Mixed abstractions
func ProcessOrder(order Order) error {
// Validation
if order.ID == "" {
return errors.New("invalid")
}
// Low-level DB setup
db, err := sql.Open("postgres", connStr)
if err != nil { return err }
defer db.Close()
// SQL construction
query := "INSERT INTO..."
// ... many lines
return nil
}
// ✅ After - Story-like
func ProcessOrder(order Order) error {
if err := validateOrder(order); err != nil {
return err
}
if err := saveToDatabase(order); err != nil {
return err
}
return notifyCustomer(order)
}
func validateOrder(order Order) error { /* ... */ }
func saveToDatabase(order Order) error { /* ... */ }
func notifyCustomer(order Order) error { /* ... */ }
```
### Pattern 2: Extract Type (Primitive Obsession)
**Signal**: Complex logic operating on primitives OR unstructured data needing organization
#### Juiciness Test v2 - When to Create Types
**BEHAVIORAL JUICINESS** (rich behavior):
- ✅ Complex validation rules (regex, ranges, business rules)
- ✅ Multiple meaningful methods (≥2 methods)
- ✅ State transitions or transformations
- ✅ Format conversions (different representations)
**STRUCTURAL JUICINESS** (organizing complexity):
- ✅ Parsing unstructured data into fields
- ✅ Grouping related data that travels together
- ✅ Making implicit structure explicit
- ✅ Replacing map[string]interface{} with typed fields
**USAGE JUICINESS** (simplifies code):
- ✅ Used in multiple places
- ✅ Significantly simplifies calling code
- ✅ Makes tests cleaner and more focused
**Score**: Need "yes" in at least ONE category to create the type
#### Examples of Juicy vs Non-Juicy Types
```go
// ❌ NOT JUICY - Don't create type
func ValidateUserID(id string) error {
if id == "" {
return errors.New("empty id")
}
return nil
}
// Just use: if userID == ""
// ✅ JUICY (Behavioral) - Complex validation
type Email string
func ParseEmail(s string) (Email, error) {
if s == "" {
return "", errors.New("empty email")
}
if !emailRegex.MatchString(s) {
return "", errors.New("invalid format")
}
if len(s) > 255 {
return "", errors.New("too long")
}
return Email(s), nil
}
func (e Email) Domain() string { /* extract domain */ }
func (e Email) LocalPart() string { /* extract local */ }
func (e Email) String() string { return string(e) }
// ✅ JUICY (Structural) - Parsing complex data
type CommandResult struct {
FailedFiles []string
SuccessFiles []string
Message string
ExitCode int
Warnings []string
}
func ParseCommandResult(output string) (CommandResult, error) {
// Parse unstructured output into structured fields
// Making implicit structure explicit
}
// ✅ JUICY (Mixed) - Both behavior and structure
type ServiceEndpoint struct {
host string
port Port
}
func ParseEndpoint(s string) (ServiceEndpoint, error) {
// Parse "host:port/path" format
}
func (e ServiceEndpoint) URL() string { /* ... */ }
func (e ServiceEndpoint) IsSecure() bool { /* ... */ }
func (e ServiceEndpoint) WithPath(path string) string { /* ... */ }
```
**⚠️ Warning Signs of Over-Engineering:**
- Type with only one trivial method
- Simple validation (just empty check)
- Type that's just a wrapper without behavior
- Good variable naming would be clearer
**→ See [Example 2](./examples.md#first-refactoring-attempt-the-over-abstraction-trap)** for complete case study.
### Pattern 3: Extract Function (Long Functions)
**Signal**: Function > 50 LOC or multiple responsibilities
```go
// ❌ Before - Long function
func CreateUser(data map[string]interface{}) error {
// Validation (15 lines)
// ...
// Database operations (20 lines)
// ...
// Email notification (10 lines)
// ...
// Logging (5 lines)
// ...
return nil
}
// ✅ After - Extracted functions
func CreateUser(data map[string]interface{}) error {
user, err := validateAndParseUser(data)
if err != nil {
return err
}
if err := saveUser(user); err != nil {
return err
}
if err := sendWelcomeEmail(user); err != nil {
return err
}
logUserCreation(user)
return nil
}
```
### Pattern 4: Early Returns (Deep Nesting)
**Signal**: Nesting > 2 levels
```go
// ❌ Before - Deeply nested
func ProcessItem(item Item) error {
if item.IsValid() {
if item.IsReady() {
if item.HasPermission() {
// Process
return nil
} else {
return errors.New("no permission")
}
} else {
return errors.New("not ready")
}
} else {
return errors.New("invalid")
}
}
// ✅ After - Early returns
func ProcessItem(item Item) error {
if !item.IsValid() {
return errors.New("invalid")
}
if !item.IsReady() {
return errors.New("not ready")
}
if !item.HasPermission() {
return errors.New("no permission")
}
// Process
return nil
}
```
### Pattern 5: Switch Extraction (Long Switch)
**Signal**: Switch statement with complex cases
```go
// ❌ Before - Long switch in one function
func HandleRequest(reqType string, data interface{}) error {
switch reqType {
case "create":
// 20 lines of creation logic
case "update":
// 20 lines of update logic
case "delete":
// 15 lines of delete logic
default:
return errors.New("unknown type")
}
return nil
}
// ✅ After - Extracted handlers
func HandleRequest(reqType string, data interface{}) error {
switch reqType {
case "create":
return handleCreate(data)
case "update":
return handleUpdate(data)
case "delete":
return handleDelete(data)
default:
return errors.New("unknown type")
}
}
func handleCreate(data interface{}) error { /* ... */ }
func handleUpdate(data interface{}) error { /* ... */ }
func handleDelete(data interface{}) error { /* ... */ }
```
### Pattern 6: Dependency Rejection (Architectural Refactoring)
**Signal**: noglobals linter fails OR global/singleton usage detected
**Goal**: Create "islands of clean code" by incrementally pushing dependencies up the call chain
**Strategy**: Work from bottom-up, rejecting dependencies one level at a time
- DON'T do massive refactoring all at once
- Start at deepest level (furthest from main)
- Extract clean type with dependency injected
- Push global access up one level
- Repeat until global only at entry points
**Quick Example**:
```go
// ❌ Before - Global accessed deep in code
func PublishEvent(event Event) error {
conn, err := nats.Connect(env.Configs.NATsAddress)
// ... complex logic
}
// ✅ After - Dependency rejected up one level
type EventPublisher struct {
natsAddress string // injected, not global
}
func NewEventPublisher(natsAddress string) *EventPublisher {
return &EventPublisher{natsAddress: natsAddress}
}
func (p *EventPublisher) Publish(event Event) error {
conn, err := nats.Connect(p.natsAddress)
// ... same logic, now testable
}
// Caller pushed up (closer to main)
func SetupMessaging() *EventPublisher {
return NewEventPublisher(env.Configs.NATsAddress) // Global only here
}
```
**Result**: EventPublisher is now 100% testable without globals
**Key Principles**:
- **Incremental**: One type at a time, one level at a time
- **Bottom-up**: Start at deepest code, work toward main
- **Pragmatic**: Accept globals at entry points (main, handlers)
- **Testability**: Each extracted type is an island (testable in isolation)
**→ See [Example 3](./examples.md#example-3-dependency-rejection-pattern) for complete case study with config access patterns**
## Refactoring Decision Tree
When linter fails, ask these questions (see reference.md for details):
1. **Does this read like a story?**
- No → Extract functions for different abstraction levels
2. **Can this be broken into smaller pieces?**
- By responsibility? → Extract functions
- By task? → Extract functions
- By category? → Extract functions
3. **Does logic run on primitives?**
- Yes → Is this primitive obsession? → Extract type
4. **Is function long due to switch statement?**
- Yes → Extract case handlers
5. **Are there deeply nested if/else?**
- Yes → Use early returns or extract functions
## Testing Integration
### Automatic Test Creation
When creating new types or extracting functions during refactoring:
**ALWAYS invoke @testing skill** to write tests for:
- **Isolated types**: Types with injected dependencies (testable islands)
- **Value object types**: Types that may depend on other value objects but are still isolated
- **Extracted functions**: New functions created during refactoring
- **Parse functions**: Functions that transform unstructured data
### Island of Clean Code Definition
A type is an "island of clean code" if:
- ✅ Dependencies are explicit (injected via constructor)
- ✅ No global or static dependencies
- ✅ Can be tested in isolation
- ✅ Has 100% testable public API
**Examples of testable islands:**
- `NATSClient` with injected `natsAddress` string (no other dependencies)
- `Email` type with validation logic (no dependencies)
- `ServiceEndpoint` that uses `Port` value object (both are testable islands)
- `OrderService` with injected `Repository` and `EventPublisher` (all testable)
**Note**: Islands can depend on other value objects and still be isolated!
### Workflow
```
REFACTORING → TESTING:
1. Extract type during refactoring
2. Immediately invoke @testing skill
3. @testing skill writes appropriate tests:
- Table-driven tests for simple scenarios
- Testify suites for complex infrastructure
- Integration tests for orchestrating types
4. Verify tests pass
5. Continue refactoring
```
### Testing Delegation
- **Refactoring skill**: Makes code testable (creates islands)
- **@testing skill**: Writes all tests (structure, patterns, coverage)
**→ See @testing skill for test structure, patterns, and guidelines**
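As a sketch of what that delegation produces, here is a table-driven test for a simplified `Email` island (the regex is a stand-in, not the project's real one; in the codebase this body would live in `func TestParseEmail(t *testing.T)` with `t.Run` per case):

```go
package main

import (
	"errors"
	"fmt"
	"regexp"
)

// Simplified stand-in for the Email island from Pattern 2.
var emailRegex = regexp.MustCompile(`^[^@\s]+@[^@\s]+\.[^@\s]+$`)

type Email string

func ParseEmail(s string) (Email, error) {
	if s == "" {
		return "", errors.New("empty email")
	}
	if !emailRegex.MatchString(s) {
		return "", errors.New("invalid format")
	}
	return Email(s), nil
}

// Table-driven cases: each row names a scenario, so failures read clearly.
func runEmailTests() error {
	tests := []struct {
		name    string
		input   string
		wantErr bool
	}{
		{"valid address", "user@example.com", false},
		{"empty input", "", true},
		{"missing @", "user.example.com", true},
	}
	for _, tt := range tests {
		if _, err := ParseEmail(tt.input); (err != nil) != tt.wantErr {
			return fmt.Errorf("%s: err=%v, wantErr=%v", tt.name, err, tt.wantErr)
		}
	}
	return nil
}

func main() {
	fmt.Println(runEmailTests()) // <nil> when all cases pass
}
```

Because the island has no hidden dependencies, the test needs no mocks or setup.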
## Stopping Criteria
### When to Stop Refactoring
**STOP when ALL of these are met:**
```
✅ Linter passes
✅ All functions < 50 LOC
✅ Nesting ≤ 2 levels
✅ Code reads like a story
✅ No more "juicy" abstractions to extract
```
### Don't Over-Refactor
**Warning Signs of Over-Engineering:**
- Creating types with only one method
- Functions that just call another function
- More abstraction layers than necessary
- Code becomes harder to understand
- Diminishing returns on complexity reduction
**Pragmatic Approach:**
```
IF linter passes AND code is readable:
STOP - Don't extract more
EVEN IF you could theoretically extract more:
STOP - Avoid abstraction bloat
```
### Example Stopping Decision
```
Current State:
- Function: 45 LOC (was 120) ✅
- Complexity: 8 (was 25) ✅
- Nesting: 2 levels (was 4) ✅
- Created 2 juicy types (Email, PhoneNumber) ✅
Could extract UserID type but:
- Only validation is "not empty" ❌
- No other methods needed ❌
- Good naming is sufficient ❌
Decision: STOP HERE - Goals achieved, avoid bloat
```
## After Refactoring
### Verify
- [ ] Re-run `task lintwithfix` - Should pass
- [ ] Run tests - Should still pass
- [ ] Check coverage - Should maintain or improve
- [ ] Code more readable? - Get feedback if unsure
### May Need
- **New types created** → Use @code-designing to validate design
- **New functions added** → Ensure tests cover them
- **Major restructuring** → Consider using @pre-commit-review
## Output Format
```
🔍 CONTEXT ANALYSIS
Function: CreateUser (user/service.go:45)
Role: Core user creation orchestration
Called by:
- api/handler.go:89 (HTTP endpoint)
- cmd/user.go:34 (CLI command)
- test/fixtures.go:123 (test fixtures)
Potential types spotted:
- Email: Complex validation logic scattered
- UserID: Generation and validation logic
- UserCreationRequest: Multiple related fields
🔧 REFACTORING APPLIED
✅ Patterns Successfully Applied:
1. Early Returns: Reduced nesting from 4 to 2 levels
2. Storifying: Extracted validate(), save(), notify()
3. Extract Type: Created Email and PhoneNumber types
❌ Patterns Tried but Insufficient:
- Extract Function alone: Still too complex, needed types
🎯 Types Created (with Juiciness Score):
✅ Email type (JUICY - Behavioral + Usage):
- Behavioral: ParseEmail(), Domain(), LocalPart() methods
- Usage: Used in 5+ places across codebase
- Island: Testable in isolation
- → Invoke @testing skill to write tests
✅ PhoneNumber type (JUICY - Behavioral):
- Behavioral: Parse(), Format(), CountryCode() methods
- Validation: Complex international format rules
- Island: Testable in isolation
- → Invoke @testing skill to write tests
❌ Types Considered but Rejected (NOT JUICY):
- UserID: Only empty check, good naming sufficient
- Status: Just string constants, enum adequate
🏗️ ARCHITECTURAL REFACTORING (if applicable)
Trigger: noglobals linter failure
Global Dependencies Identified:
- env.Configs.NATsAddress: Used in 12 places
- env.Configs.DBHost: Used in 8 places
Dependency Rejection Applied:
✅ Level 1 (Bottom): Created NATSClient with injected address
✅ Level 2 (Middle): Created OrderService using clean types
⬆️ Pushed env.Configs to: main() and HTTP handlers (2 locations)
Islands of Clean Code Created:
- messaging/nats_client.go: Ready for testing (isolated, injected deps)
- order/service.go: Ready for testing (isolated, injected deps)
→ Invoke @testing skill to write tests for these islands
Progress:
- Before: 20 global accesses scattered throughout
- After: 2 global accesses (entry points only)
- Islands created: 2 new testable types
📊 METRICS
Complexity Reduction:
- Cyclomatic: 18 → 7 ✅
- Cognitive: 25 → 8 ✅
- LOC: 120 → 45 ✅
- Nesting: 4 → 2 ✅
📝 FILES MODIFIED
Modified:
- user/service.go (+15, -75 lines)
- user/handler.go (+5, -20 lines)
Created (Islands of Clean Code):
- user/email.go (new, +45 lines) → Ready for @testing skill
- user/phone_number.go (new, +38 lines) → Ready for @testing skill
Next: Invoke @testing skill to write tests for new islands
✅ AUTOMATION COMPLETE
Stopping Criteria Met:
✅ Linter passes (0 issues)
✅ All functions < 50 LOC
✅ Max nesting = 2 levels
✅ Code reads like a story
✅ No more juicy abstractions
Ready for: @pre-commit-review phase
```
## Learning from Examples
For real-world refactoring case studies that show the complete thought process:
**[Example 1: Storifying Mixed Abstractions](./examples.md#example-1-storifying-mixed-abstractions-and-extracting-logic-into-leaf-types)**
- Transforms a 48-line fat function into lean orchestration + isolated type
- Shows how to extract `IPConfig` type for collection and validation logic
- Demonstrates creating testable islands of clean code
**[Example 2: Primitive Obsession with Multiple Types](./examples.md#example-2-primitive-obsession-with-multiple-types-and-storifying-switch-statements)**
- Transforms a 60-line function into a 7-line story by extracting 4 isolated types
- Shows the Type Alias Pattern for config-friendly types
- Demonstrates eliminating switch statement duplication
- Fixed misleading function name (`validateCIDR``alignCIDRArgs`)
**[Example 3: Dependency Rejection Pattern](./examples.md#example-3-dependency-rejection-pattern)**
- Incremental elimination of global config access (`env.Configs.NATsAddress`)
- Shows bottom-up approach: create clean islands one level at a time
- Demonstrates testability benefits of dependency injection
- Pragmatic stopping point: globals only at entry points
See [examples.md](./examples.md) for complete case studies with thought process.
## Integration with Other Skills
### Invoked By (Automatic Triggering)
- **@linter-driven-development**: Automatically invokes when linter fails (Phase 3)
- **@pre-commit-review**: Automatically invokes when design issues detected (Phase 4)
### Iterative Loop
```
1. Linter fails → invoke @refactoring
2. Refactoring complete → re-run linter
3. Linter passes → invoke @pre-commit-review
4. Review finds design debt → invoke @refactoring again
5. Refactoring complete → re-run linter
6. Repeat until both linter AND review pass
```
### Invokes (When Needed)
- **@code-designing**: When refactoring creates new types, validate design
- **@testing**: Automatically invoked to write tests for new types/functions
- **@pre-commit-review**: Validates design quality after linting passes
See [reference.md](./reference.md) for complete refactoring patterns and decision tree.

---
# Type Design Subset for Refactoring
Quick reference for type design principles when refactoring.
For complete type design guidance, see @code-designing skill.
## When Refactoring Reveals Need for Types
### Primitive Obsession Signal
During refactoring, if you find:
- Validation repeated across multiple functions
- Complex logic operating on primitives (string, int, float)
- Parameters passed around without type safety
→ Create a self-validating type
### Pattern: Self-Validating Type
```go
type TypeName underlyingType
func NewTypeName(input underlyingType) (TypeName, error) {
// Validate
if /* invalid */ {
return zero, errors.New("why invalid")
}
return TypeName(input), nil
}
// Add methods if behavior needed
func (t TypeName) SomeMethod() result {
// Type-specific logic
}
```
## Type Design Checklist
When creating types during refactoring:
- [ ] **Constructor validates** - Check in New* function
- [ ] **Fields are private** - Prevent invalid state
- [ ] **Methods trust validity** - No nil checks
- [ ] **Type has behavior** - Not just data container
- [ ] **Type in own file** - If it has logic
## Examples
### Example 1: Port Validation
```go
// Before refactoring - Validation scattered
func StartServer(port int) error {
if port <= 0 || port >= 9000 {
return errors.New("invalid port")
}
// ...
}
func ConnectTo(host string, port int) error {
if port <= 0 || port >= 9000 {
return errors.New("invalid port")
}
// ...
}
// After refactoring - Self-validating type
type Port int
func NewPort(p int) (Port, error) {
if p <= 0 || p >= 9000 {
return 0, errors.New("port must be 1-8999")
}
return Port(p), nil
}
func StartServer(port Port) error {
// No validation needed
// ...
}
func ConnectTo(host string, port Port) error {
// No validation needed
// ...
}
```
### Example 2: Parser Complexity
```go
// Before refactoring - One complex Parser
type Parser struct {
// Too many responsibilities
}
func (p *Parser) Parse(input string) (Result, error) {
// 100+ lines parsing headers, path, body, etc.
}
// After refactoring - Separate types by role
type HeaderParser struct { /* ... */ }
type PathParser struct { /* ... */ }
type BodyParser struct { /* ... */ }
func (p *HeaderParser) Parse(input string) (Header, error) {
// Focused logic for headers only
}
func (p *PathParser) Parse(input string) (Path, error) {
// Focused logic for path only
}
func (p *BodyParser) Parse(input string) (Body, error) {
// Focused logic for body only
}
```
## Quick Decision: Create Type or Extract Function?
### Create Type When:
- Logic operates on a primitive
- Validation is repeated
- Type represents domain concept
- Behavior is cohesive
### Extract Function When:
- Logic is procedural (no state needed)
- Different abstraction level
- One-time operation
- No validation required
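A minimal side-by-side sketch of the two outcomes (both helpers are hypothetical): truncation is procedural with nothing to protect, so a function suffices; a port carries an invariant worth enforcing at construction, so it earns a type:

```go
package main

import (
	"errors"
	"fmt"
)

// Extract Function: procedural, no invariant to protect → plain helper.
func truncateForDisplay(s string, max int) string {
	if len(s) <= max {
		return s
	}
	return s[:max] + "..."
}

// Create Type: repeated validation of a domain concept → self-validating type.
type Port int

func NewPort(p int) (Port, error) {
	if p <= 0 || p > 65535 {
		return 0, errors.New("port must be 1-65535")
	}
	return Port(p), nil
}

func main() {
	fmt.Println(truncateForDisplay("hello world", 5)) // hello...
	if _, err := NewPort(70000); err != nil {
		fmt.Println(err) // port must be 1-65535
	}
}
```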
## Integration with Refactoring
After creating types during refactoring:
1. Run tests - Ensure they pass
2. Run linter - Should reduce complexity
3. Consider @code-designing - Validate type design
4. Update tests - Ensure new types have 100% coverage
## File Organization
When creating types during refactoring:
```
package/
├── original.go # Original file
├── new_type.go # New type in own file (if has logic)
└── original_test.go # Tests
```
---
For complete type design principles, see @code-designing skill.

---
# Refactoring Patterns Reference
Complete guide for linter-driven refactoring with decision tree and patterns.
## Refactoring Decision Tree
When linter fails or code feels complex, use this decision tree:
### Question 1: Does this code read like a story?
**Check**: Does it mix different levels of abstractions?
```go
// ❌ No - Mixes abstractions
func CreatePizza(order Order) Pizza {
pizza := Pizza{Base: order.Size} // High-level
// Low-level temperature control
for oven.Temp < cookingTemp {
time.Sleep(checkOvenInterval)
oven.Temp = getOvenTemp(oven)
}
return pizza
}
// ✅ Yes - Story-like
func CreatePizza(order Order) Pizza {
pizza := prepare(order)
bake(pizza)
return pizza
}
```
**Action**: Break it down so every line sits at the same level of abstraction. Hide the nitty-gritty details behind well-named methods.
### Question 2: Can this be broken into smaller pieces?
**By what**: Responsibility? Task? Category?
Breaking down can be done at all levels:
- Extract a variable
- Extract a function
- Create a new type
- Create a new package
```go
// ❌ Multiple responsibilities
func HandleUserRequest(w http.ResponseWriter, r *http.Request) {
// Parse request
var user User
json.NewDecoder(r.Body).Decode(&user)
// Validate
if user.Email == "" { /* ... */ }
// Save to DB
db.Exec("INSERT INTO...")
// Send response
json.NewEncoder(w).Encode(map[string]string{"status": "ok"})
}
// ✅ Separated by responsibility
func HandleUserRequest(w http.ResponseWriter, r *http.Request) {
user, err := parseUser(r)
if err != nil {
respondError(w, err)
return
}
if err := validateUser(user); err != nil {
respondError(w, err)
return
}
if err := saveUser(user); err != nil {
respondError(w, err)
return
}
respondSuccess(w)
}
```
### Question 3: Does logic run on a primitive?
**Check**: Is this primitive obsession?
If logic operates on string/int/float, consider creating a type.
```go
// ❌ Primitive obsession
func ValidateEmail(email string) bool {
return strings.Contains(email, "@")
}
func SendEmail(email string, subject, body string) error {
if !ValidateEmail(email) {
return errors.New("invalid email")
}
// Send
}
// ✅ Custom type
type Email string
func NewEmail(s string) (Email, error) {
if !strings.Contains(s, "@") {
return "", errors.New("invalid email")
}
return Email(s), nil
}
func SendEmail(email Email, subject, body string) error {
// No validation needed - type guarantees validity
// Send
}
```
**Note**: Cohesion matters more than minimizing coupling. Put logic where it belongs, even if doing so creates a dependency.
### Question 4: Is function long due to switch statement?
**Check**: Can cases be categorized and extracted?
```go
// ❌ Long switch statement
func ProcessEvent(eventType string, data interface{}) error {
switch eventType {
case "user_created":
// 20 lines
case "user_updated":
// 25 lines
case "user_deleted":
// 15 lines
// ... more cases
}
}
// ✅ Extracted case handlers
func ProcessEvent(eventType string, data interface{}) error {
switch eventType {
case "user_created":
return handleUserCreated(data)
case "user_updated":
return handleUserUpdated(data)
case "user_deleted":
return handleUserDeleted(data)
default:
return errors.New("unknown event type")
}
}
func handleUserCreated(data interface{}) error { /* ... */ }
func handleUserUpdated(data interface{}) error { /* ... */ }
func handleUserDeleted(data interface{}) error { /* ... */ }
```
### Question 5: Types with logic?
**Rule**: Types with logic should be in their own file. Name file after type.
```
user/
├── user.go # User type
├── user_id.go # UserID type with logic
├── email.go # Email type with logic
└── service.go # UserService
```
---
## Detailed Refactoring Patterns
### 1. Storifying (Abstraction Levels)
**Signal:**
- Linter: High cognitive complexity
- Code smell: Mixed high-level and low-level code
**Pattern:**
```go
// Before
func ProcessOrder(order Order) error {
// Validation
if order.ID == "" { return errors.New("invalid") }
if len(order.Items) == 0 { return errors.New("no items") }
for _, item := range order.Items {
if item.Price < 0 { return errors.New("negative price") }
}
// Database
db, err := sql.Open("postgres", os.Getenv("DB_URL"))
if err != nil { return err }
defer db.Close()
tx, err := db.Begin()
if err != nil { return err }
// SQL queries
_, err = tx.Exec("INSERT INTO orders...")
// ... many more lines
// Email
    client, err := smtp.Dial("smtp.example.com:587")
// ... email sending logic
return nil
}
// After
func ProcessOrder(order Order) error {
if err := validateOrder(order); err != nil {
return err
}
if err := saveToDatabase(order); err != nil {
return err
}
if err := notifyCustomer(order); err != nil {
return err
}
return nil
}
```
**Benefits:**
- Clear flow (validate → save → notify)
- Each function single responsibility
- Easy to test
- Easy to modify
**Real-world example:** See [Example 1 in examples.md](./examples.md#example-1-storifying-mixed-abstractions-and-extracting-logic-into-leaf-types) for a production case of storifying mixed abstractions and extracting a leaf type for IP collection logic
### 2. Extract Type (Primitive Obsession)
**Signal:**
- Linter: High cyclomatic complexity (due to validation)
- Code smell: Validation repeated across codebase
**Pattern:**
```go
// Before: Validation scattered
func CreateServer(host string, port int) (*Server, error) {
if host == "" {
return nil, errors.New("host required")
}
if port <= 0 || port > 65535 {
return nil, errors.New("invalid port")
}
// ...
}
func ConnectToServer(host string, port int) error {
if host == "" {
return errors.New("host required")
}
if port <= 0 || port > 65535 {
return errors.New("invalid port")
}
// ...
}
// After: Self-validating types
type Host string
type Port int
func NewHost(s string) (Host, error) {
if s == "" {
return "", errors.New("host required")
}
return Host(s), nil
}
func NewPort(p int) (Port, error) {
if p <= 0 || p > 65535 {
return 0, errors.New("port must be 1-65535")
}
return Port(p), nil
}
type ServerAddress struct {
host Host
port Port
}
func NewServerAddress(host Host, port Port) ServerAddress {
// No validation needed - types are already valid
return ServerAddress{host: host, port: port}
}
func (a ServerAddress) String() string {
return fmt.Sprintf("%s:%d", a.host, a.port)
}
func CreateServer(addr ServerAddress) (*Server, error) {
// No validation needed
// ...
}
func ConnectToServer(addr ServerAddress) error {
// No validation needed
// ...
}
```
**Benefits:**
- Validation centralized
- Type safety
- Reduced complexity
- Self-documenting
**Real-world example:** See [Example 2 in examples.md](./examples.md#example-2-primitive-obsession-with-multiple-types-and-storifying-switch-statements) for extracting multiple types from a 60-line function with primitive obsession. Shows the Type Alias Pattern for creating config-friendly types and eliminating switch statement duplication.
---
### 2.5. The Over-Abstraction Trap ⚠️
**Critical**: Not every primitive needs a type. The goal is **clarity**, not **type proliferation**.
#### Quick Decision Checklist
**Create types when they**:
- ✅ Have multiple meaningful methods (>1) with real logic
- ✅ Enforce invariants/validation at construction
- ✅ Hide complex implementation
- ✅ Need controlled mutation → use **private fields**, NOT wrappers
**DON'T create types when they**:
- ❌ Just wrap primitives with one trivial method
- ❌ Add ceremony without benefit
- ❌ Good naming achieves same clarity
#### Bad vs Good: One Example
```go
// ❌ Bad: Trivial wrapper - 8 lines, no benefit
type CIDRPresence bool
func (p CIDRPresence) IsSet() bool { return bool(p) }
// ✅ Good: Private fields - same safety, less code
type CIDRConfig struct {
clusterCIDRSet bool // Only parser can set
serviceCIDRSet bool
}
func (c CIDRConfig) ClusterCIDRSet() bool { return c.clusterCIDRSet }
```
#### Complete Teaching & Examples
**→ See [Example 2: Over-Abstraction Section](./examples.md#first-refactoring-attempt-the-over-abstraction-trap)**
Full case study includes:
- Complete thought process & comparisons
- 6 questions before creating a type
- Balance diagram & decision tree
- When to stop refactoring
---
### 3. Early Returns (Reduce Nesting)
**Signal:**
- Linter: High cyclomatic complexity
- Code smell: Nesting > 2 levels
**Pattern:**
```go
// Before: Deep nesting
func ProcessRequest(req Request) error {
if req.IsValid() {
if req.HasAuth() {
if req.HasPermission() {
// Do work
result, err := doWork(req)
if err != nil {
return err
}
return saveResult(result)
} else {
return errors.New("no permission")
}
} else {
return errors.New("not authenticated")
}
} else {
return errors.New("invalid request")
}
}
// After: Early returns
func ProcessRequest(req Request) error {
if !req.IsValid() {
return errors.New("invalid request")
}
if !req.HasAuth() {
return errors.New("not authenticated")
}
if !req.HasPermission() {
return errors.New("no permission")
}
result, err := doWork(req)
if err != nil {
return err
}
return saveResult(result)
}
```
**Benefits:**
- Reduced nesting (max 1 level)
- Easier to read (guard clauses up front)
- Lower cyclomatic complexity
### 4. Extract Function (Long Functions)
**Signal:**
- Function > 50 LOC
- Multiple distinct concerns
**Pattern:**
```go
// Before: Long function (80 LOC)
func RegisterUser(data map[string]interface{}) error {
// Parsing (15 lines)
email, ok := data["email"].(string)
if !ok { return errors.New("email required") }
// ... more parsing
// Validation (20 lines)
if email == "" { return errors.New("email required") }
if !strings.Contains(email, "@") { return errors.New("invalid email") }
// ... more validation
// Database (25 lines)
db, err := getDB()
if err != nil { return err }
// ... DB operations
// Email (15 lines)
smtp := getSMTP()
// ... email sending
// Logging (5 lines)
log.Printf("User registered: %s", email)
// ...
return nil
}
// After: Extracted functions
func RegisterUser(data map[string]interface{}) error {
user, err := parseUserData(data)
if err != nil {
return err
}
if err := validateUser(user); err != nil {
return err
}
if err := saveUserToDB(user); err != nil {
return err
}
if err := sendWelcomeEmail(user); err != nil {
return err
}
logUserRegistration(user)
return nil
}
func parseUserData(data map[string]interface{}) (*User, error) {
// 15 lines
}
func validateUser(user *User) error {
// 20 lines
}
func saveUserToDB(user *User) error {
// 25 lines
}
func sendWelcomeEmail(user *User) error {
// 15 lines
}
func logUserRegistration(user *User) {
// 5 lines
}
```
**Guidelines:**
- Aim for functions under 50 LOC
- Each function single responsibility
- Top-level function reads like a story
### 5. Switch Statement Extraction
**Signal:**
- Long function due to switch statement
- Each case is complex
**Pattern:**
```go
// Before
func RouteHandler(action string, params map[string]string) error {
switch action {
case "create":
// Validate create params
if params["name"] == "" { return errors.New("name required") }
// ... 15 more lines
return db.Create(...)
case "update":
// Validate update params
if params["id"] == "" { return errors.New("id required") }
// ... 20 more lines
return db.Update(...)
case "delete":
// Validate delete params
// ... 12 more lines
return db.Delete(...)
default:
return errors.New("unknown action")
}
}
// After
func RouteHandler(action string, params map[string]string) error {
switch action {
case "create":
return handleCreate(params)
case "update":
return handleUpdate(params)
case "delete":
return handleDelete(params)
default:
return errors.New("unknown action")
}
}
func handleCreate(params map[string]string) error {
// All create logic (15 lines)
}
func handleUpdate(params map[string]string) error {
// All update logic (20 lines)
}
func handleDelete(params map[string]string) error {
// All delete logic (12 lines)
}
```
### 6. Defer Complexity Extraction
**Signal:**
- Linter: Defer function has cyclomatic complexity > 1
**Pattern:**
```go
// Before: Complex defer
func ProcessFile(filename string) error {
f, err := os.Open(filename)
if err != nil {
return err
}
defer func() {
if err := f.Close(); err != nil {
if !errors.Is(err, fs.ErrClosed) {
log.Printf("Error closing file: %v", err)
}
}
}()
// Process file
return nil
}
// After: Extracted cleanup function
func ProcessFile(filename string) error {
f, err := os.Open(filename)
if err != nil {
return err
}
defer closeFile(f)
// Process file
return nil
}
func closeFile(f *os.File) {
if err := f.Close(); err != nil {
if !errors.Is(err, fs.ErrClosed) {
log.Printf("Error closing file: %v", err)
}
}
}
```
---
## Linter-Specific Refactoring
### Cyclomatic Complexity
**Cause**: Too many decision points (if, switch, for, &&, ||)
**Solutions:**
1. Extract functions for different branches
2. Use early returns to reduce nesting
3. Extract type with methods for primitive logic
4. Simplify boolean expressions
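Solution 4 can be as small as naming a compound condition as a predicate method, which moves the decision points out of the caller. A sketch (the `Order` fields here are hypothetical):

```go
package main

import "fmt"

type Order struct {
	Paid     bool
	Shipped  bool
	Returned bool
	Total    float64
}

// ❌ Before: the compound condition sits inline at every call site,
// and each && adds a decision point to the caller's complexity:
//   if o.Paid && !o.Shipped && !o.Returned && o.Total > 0 { ... }

// ✅ After: name the condition; callers drop to a single branch
func (o Order) IsRefundable() bool {
	return o.Paid && !o.Shipped && !o.Returned && o.Total > 0
}

func main() {
	o := Order{Paid: true, Total: 49.99}
	fmt.Println(o.IsRefundable()) // true
}
```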
### Cognitive Complexity
**Cause**: Code hard to understand (nested logic, mixed abstractions)
**Solutions:**
1. Storifying (clarify abstraction levels)
2. Extract nested logic to named functions
3. Use early returns
4. Break into smaller, focused functions
### Maintainability Index
**Cause**: Code that is hard to change — the index combines cyclomatic complexity, lines of code, and Halstead volume, so large, branchy functions score worst
**Solutions:**
1. All of the above
2. Improve naming
3. Add comments for complex logic
4. Reduce coupling
---
## Guidelines for Effective Refactoring
### Keep Functions Small
- Target: Under 50 LOC
- Max 2 nesting levels
- Single responsibility
### Prefer Simplicity
- Simple, straightforward solutions over complex ones
- Descriptive variable and function names
- Avoid magic numbers and strings
### Maintain Tests
- Tests should pass after refactoring
- Add tests for new functions if needed
- Maintain or improve coverage
### Avoid Global State
- No global variables
- Inject dependencies through constructors
- Keep state localized
---
## Common Refactoring Scenarios
### Scenario 1: Linter Says "Cyclomatic Complexity Too High"
1. Identify decision points (if, switch, loops)
2. Extract branches to separate functions
3. Consider early returns
4. Check for primitive obsession (move logic to type)
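Step 4 sketched (`Port` is a hypothetical example): once validation lives on the type, callers stop re-checking the raw int and their branch count drops.

```go
package main

import "fmt"

// Port wraps a raw int so the range check lives in one place.
type Port int

func NewPort(n int) (Port, error) {
	if n < 1 || n > 65535 {
		return 0, fmt.Errorf("port out of range: %d", n)
	}
	return Port(n), nil
}

// Privileged reports whether the port needs elevated rights.
func (p Port) Privileged() bool { return p < 1024 }
```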
### Scenario 2: Function Feels Hard to Test
1. Probably doing too much → Extract functions
2. Might have hidden dependencies → Inject through constructor
3. Might mix concerns → Separate responsibilities
### Scenario 3: Code Duplicated Across Functions
1. Extract common logic to shared function
2. Consider if primitives should be types (with methods)
3. Check if behavior belongs on existing type
### Scenario 4: Can't Name Function Clearly
1. Probably doing too much → Split responsibilities
2. Might be at wrong abstraction level
3. Reconsider what the function should do
---
## After Refactoring Checklist
- [ ] Linter passes (`task lintwithfix`)
- [ ] Tests pass (`go test ./...`)
- [ ] Coverage maintained or improved
- [ ] Code more readable
- [ ] Functions under 50 LOC
- [ ] Max 2 nesting levels
- [ ] Each function has clear purpose
---
## Integration with Design Principles
Refactoring often reveals design issues. After refactoring, consider:
**Created new types?**
→ Use @code-designing to validate type design
**Changed architecture?**
→ Ensure still following vertical slice structure
**Extracted significant logic?**
→ Ensure tests cover new functions (100% for leaf types)
---
## Summary: Refactoring Decision Tree
```
Linter fails or code complex
1. Does it read like a story?
No → Extract functions for abstraction levels
2. Can it be broken into smaller pieces?
Yes → By responsibility/task/category?
Extract functions/types/packages
3. Does logic run on primitives?
Yes → Is this primitive obsession?
Create custom type with methods
4. Is it long due to switch statement?
Yes → Extract case handlers
5. Deeply nested if/else?
Yes → Early returns or extract functions
Re-run linter → Should pass
Run tests → Should pass
If new types → Validate with @code-designing
```
**Remember**: Cohesion > Coupling. Put logic where it belongs.

skills/testing/SKILL.md Normal file


@@ -0,0 +1,255 @@
---
name: testing
description: Automatically invoked to write tests for new types, or use as testing expert advisor for guidance and recommendations. Covers unit, integration, and system tests with emphasis on in-memory dependencies. Use when creating leaf types, after refactoring, during implementation, or when testing advice is needed. Ensures 100% coverage on leaf types with public API testing.
---
# Testing Principles
Principles and patterns for writing effective Go tests.
## When to Use
### Automatic Invocation (Proactive)
- **Automatically invoked** by @linter-driven-development during Phase 2 (Implementation)
- **Automatically invoked** by @refactoring when new isolated types are created
- **Automatically invoked** by @code-designing after designing new types
- **After creating new leaf types** - Types that should have 100% unit test coverage
- **After extracting functions** during refactoring that create testable units
### Manual Invocation
- User explicitly requests tests to be written
- User asks for testing advice, recommendations, or "what to do"
- When testing strategy is unclear (table-driven vs testify suites)
- When choosing between dependency levels (in-memory vs binary vs test-containers)
- When adding tests to existing untested code
- When user needs testing expert guidance or consultation
**IMPORTANT**: This skill writes tests autonomously based on the code structure and type design, and also serves as a testing expert advisor.
## Testing Philosophy
**Test only the public API**
- Use `pkg_test` package name
- Test types through their constructors
- No testing private methods/functions
**Prefer real implementations over mocks**
- Use in-memory implementations (fastest, no external deps)
- Use HTTP test servers (httptest)
- Use temp files/directories
- Test with actual dependencies when beneficial
**Coverage targets**
- Leaf types: 100% unit test coverage
- Orchestrating types: Integration tests
- Critical workflows: System tests
## Test Pyramid
Three levels of testing, each serving a specific purpose:
**Unit Tests** (Base of pyramid - most tests here)
- Test leaf types in isolation
- Fast, focused, no external dependencies
- 100% coverage target for leaf types
- Use `pkg_test` package, test public API only
**Integration Tests** (Middle - fewer than unit)
- Test seams between components
- Test workflows across package boundaries
- Use real or in-memory implementations
- Verify components work together correctly
**System Tests** (Top - fewest tests)
- Black box testing from `tests/` folder
- Test entire system via CLI/API
- Test critical end-to-end workflows
- **Strive for independence in Go** (minimize external deps)
## Reusable Test Infrastructure
Build shared test infrastructure in `internal/testutils/`:
- In-memory mock servers with DSL (HTTP, DB, file system)
- Reusable across all test levels
- Test the infrastructure itself!
- Can expose as CLI tools for manual testing
**Dependency Priority** (minimize external dependencies):
1. **In-memory** (preferred): Pure Go, httptest, in-memory DB
2. **Binary**: Standalone executable via exec.Command
3. **Test-containers**: Programmatic Docker from Go
4. **Docker-compose**: Last resort, manual testing only
Goal: System tests should be **independent in Go** when possible.
See reference.md for comprehensive testutils patterns and DSL examples.
## Workflow
### Unit Tests Workflow
**Purpose**: Test leaf types in isolation, 100% coverage target
1. **Identify leaf types** - Self-contained types with logic
2. **Choose structure** - Table-driven (simple) or testify suites (complex setup)
3. **Write in pkg_test package** - Test public API only
4. **Use in-memory implementations** - From testutils or local implementations
5. **Avoid pitfalls** - No time.Sleep, no conditionals in cases, no private method tests
**Test structure:**
- Table-driven: Separate success/error test functions (complexity = 1)
- Testify suites: Only for complex infrastructure setup (HTTP servers, DBs)
- Always use named struct fields (linter reorders fields)
See reference.md for detailed patterns and examples.
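A hedged sketch of the table-driven shape (`ParseAge` is a hypothetical function under test): named struct fields survive linter reordering, and separate success/error functions remove the `wantErr` conditional from the loop body.

```go
package main

import (
	"fmt"
	"strconv"
	"testing"
)

// ParseAge is a hypothetical leaf function under test.
func ParseAge(s string) (int, error) {
	n, err := strconv.Atoi(s)
	if err != nil {
		return 0, fmt.Errorf("parse age: %w", err)
	}
	if n < 0 || n > 150 {
		return 0, fmt.Errorf("age out of range: %d", n)
	}
	return n, nil
}

// Success cases only: the loop asserts, it never branches on an
// expected-error flag.
func TestParseAge_Success(t *testing.T) {
	cases := []struct {
		name  string
		input string
		want  int
	}{
		{name: "adult", input: "30", want: 30},
		{name: "newborn", input: "0", want: 0},
	}
	for _, tc := range cases {
		t.Run(tc.name, func(t *testing.T) {
			got, err := ParseAge(tc.input)
			if err != nil {
				t.Fatalf("ParseAge(%q): %v", tc.input, err)
			}
			if got != tc.want {
				t.Errorf("got %d, want %d", got, tc.want)
			}
		})
	}
}

// Error cases live in their own function, so no wantErr bool.
func TestParseAge_Error(t *testing.T) {
	cases := []struct {
		name  string
		input string
	}{
		{name: "not a number", input: "abc"},
		{name: "out of range", input: "200"},
	}
	for _, tc := range cases {
		t.Run(tc.name, func(t *testing.T) {
			if _, err := ParseAge(tc.input); err == nil {
				t.Fatalf("ParseAge(%q): expected error", tc.input)
			}
		})
	}
}
```

In a real package these tests would sit in `pkg_test`, per the public-API rule above.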
### Integration Tests Workflow
**Purpose**: Test seams between components, verify they work together
1. **Identify integration points** - Where packages/components interact
2. **Choose dependencies** - Prefer: in-memory > binary > test-containers
3. **Write tests** - In `pkg_test` or `integration_test.go` with build tags
4. **Test workflows** - Cover happy path and error scenarios across boundaries
5. **Use real or testutils implementations** - Avoid heavy mocking
**File organization:**
```go
//go:build integration
package user_test
// Test Service + Repository + real/mock dependencies
```
See reference.md for integration test patterns with dependencies.
### System Tests Workflow
**Purpose**: Black box test entire system, critical end-to-end workflows
1. **Place in tests/ folder** - At project root, separate from packages
2. **Test via CLI/API** - exec.Command for CLI, HTTP client for APIs
3. **Minimize external deps** - Prefer: in-memory mocks > binary > test-containers
4. **Strive for Go independence** - Pure Go tests, no Docker when possible
5. **Test critical workflows** - User journeys, not every edge case
**Example structure:**
```go
// tests/cli_test.go
func TestCLI_UserWorkflow(t *testing.T) {
mockAPI := testutils.NewMockServer().
OnGET("/users/1").RespondJSON(200, user).
Build() // In-memory httptest.Server
defer mockAPI.Close()
cmd := exec.Command("./myapp", "get-user", "1",
"--api-url", mockAPI.URL())
output, err := cmd.CombinedOutput()
// Assert on output
}
```
See reference.md for comprehensive system test patterns.
## Key Test Patterns
**Table-Driven Tests:**
- Separate success and error test functions (complexity = 1)
- Always use named struct fields (linter reorders fields)
- No wantErr bool pattern (adds conditionals)
**Testify Suites:**
- Only for complex infrastructure (HTTP servers, DBs, OpenTelemetry)
- SetupSuite/TearDownSuite for expensive shared setup
- SetupTest/TearDownTest for per-test isolation
**Synchronization:**
- Never use time.Sleep (flaky, slow)
- Use channels with select/timeout for async operations
- Use sync.WaitGroup for concurrent operations
See reference.md for complete patterns with code examples.
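A minimal sketch of the channel-with-timeout pattern (the worker goroutine is hypothetical), replacing `time.Sleep` for async synchronization:

```go
package main

import (
	"errors"
	"time"
)

// waitForWorker blocks until the goroutine reports, or fails fast
// on timeout; no sleeping, no polling.
func waitForWorker() (string, error) {
	done := make(chan string, 1)
	go func() {
		// ... real work would happen here ...
		done <- "finished"
	}()
	select {
	case msg := <-done:
		return msg, nil
	case <-time.After(2 * time.Second):
		return "", errors.New("timeout waiting for worker")
	}
}
```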
## Output Format
After writing tests:
```
✅ TESTING COMPLETE
📊 Unit Tests:
- user/user_id_test.go: 100% (4 test cases)
- user/email_test.go: 100% (6 test cases)
- user/service_test.go: 100% (8 test cases)
🔗 Integration Tests:
- user/integration_test.go: 3 workflows tested
- Dependencies: In-memory DB, httptest mock server
🎯 System Tests:
- tests/cli_test.go: 2 end-to-end workflows
- tests/api_test.go: 1 full API workflow
- Infrastructure: In-memory mocks (pure Go, no Docker)
Test Infrastructure:
- internal/testutils/httpserver: In-memory mock API with DSL
- internal/testutils/mockdb: In-memory database mock
Test Execution:
$ go test ./... # All tests
$ go test -tags=integration ./... # Include integration tests
$ go test ./tests/... # System tests only
✅ All tests pass
✅ 100% coverage on leaf types
✅ No external dependencies required
Next Steps:
1. Run linter: task lintwithfix
2. If linter fails → use @refactoring skill
3. If linter passes → use @pre-commit-review skill
```
## Key Principles
See reference.md for:
- Table-driven test patterns
- Testify suite guidelines
- Real implementations over mocks
- Synchronization techniques
- Coverage strategies
## Testing Checklist
### Unit Tests
- [ ] All unit tests in pkg_test package
- [ ] Testing public API only (no private methods)
- [ ] Table-driven tests use named struct fields
- [ ] No conditionals in test cases (complexity = 1)
- [ ] Using in-memory implementations from testutils
- [ ] No time.Sleep (using channels/waitgroups)
- [ ] Leaf types have 100% coverage
### Integration Tests
- [ ] Test seams between components
- [ ] Use in-memory or binary dependencies (avoid Docker)
- [ ] Build tags for optional execution (`//go:build integration`)
- [ ] Cover happy path and error scenarios across boundaries
- [ ] Real or testutils implementations (minimal mocking)
### System Tests
- [ ] Located in tests/ folder at project root
- [ ] Black box testing via CLI/API
- [ ] Uses in-memory testutils mocks (pure Go)
- [ ] No external dependencies (no Docker required)
- [ ] Tests critical end-to-end workflows
- [ ] Fast execution, runs in CI without setup
### Test Infrastructure
- [ ] Reusable mocks in internal/testutils/
- [ ] Test infrastructure has its own tests
- [ ] DSL provides readable test setup
- [ ] Can be exposed as CLI for manual testing
See reference.md for complete testing guidelines and examples.


@@ -0,0 +1,318 @@
# gRPC Testing with bufconn and Rich Client Mocks
## When to Use This Example
Use this when:
- Testing gRPC servers
- Need bidirectional streaming tests
- Want in-memory gRPC (no network I/O)
- Testing server-client interactions
- Need rich DSL for readable tests
**Dependency Level**: Level 1 (In-Memory) - Uses `bufconn` for in-memory gRPC connections
**Key Insight**: When testing a **gRPC server**, mock the **clients** that connect to it. When testing a **gRPC client**, mock the **server**.
## Implementation
### Rich gRPC Client Mock with DSL
When your **System Under Test (SUT) is a gRPC server**, create rich client mocks:
```go
// internal/testutils/grpc_client_mock.go
package testutils
import (
"context"
"io"
"sync"
"testing"
"github.com/stretchr/testify/require"
"google.golang.org/grpc"
"google.golang.org/grpc/metadata"
pb "myproject/grpc_api/gen/go/traces/v1"
)
// TaskmonOnCluster is a rich gRPC client mock with DSL for testing gRPC servers.
// It connects to your gRPC server and provides helper methods for assertions.
type TaskmonOnCluster struct {
clusterRoutingKey string
taskmonID string
stream pb.RemoteTracesService_StreamTracesClient
mu sync.RWMutex
receivedQuery *pb.TracesQuery
receivedQueriesPayloads []string
}
// OpenTaskmonToWekaHomeStream creates a gRPC client mock that connects to your server.
// This is the constructor for the mock - returns a rich DSL object.
func OpenTaskmonToWekaHomeStream(
ctx context.Context,
client pb.RemoteTracesServiceClient,
clusterRoutingKey, taskmonID string,
) (*TaskmonOnCluster, error) {
// Inject metadata (like session tokens) into context
md := metadata.Pairs("X-Taskmon-session-token", clusterRoutingKey)
ctx = metadata.NewOutgoingContext(ctx, md)
// Open streaming connection to the server (your SUT)
stream, err := client.StreamTraces(ctx, grpc.Header(&md))
if err != nil {
return nil, err
}
return &TaskmonOnCluster{
stream: stream,
clusterRoutingKey: clusterRoutingKey,
taskmonID: taskmonID,
receivedQueriesPayloads: []string{},
}, nil
}
// SessionToken returns the session token (useful for assertions)
func (m *TaskmonOnCluster) SessionToken() string {
return m.clusterRoutingKey
}
// Close closes the stream (idempotent)
func (m *TaskmonOnCluster) Close() {
if m.stream == nil {
return
}
_ = m.stream.CloseSend() // best-effort close; error intentionally ignored
}
// ListenToStreamAndAssert is a helper that listens to server messages and asserts.
// This makes tests read like documentation!
func (m *TaskmonOnCluster) ListenToStreamAndAssert(
t *testing.T,
expectedQueryPayload,
resultPayload string,
) {
for {
query, err := m.stream.Recv()
if err == io.EOF {
break
}
require.NoError(t, err, "Failed to receive query from server")
// Store received data (thread-safe)
m.mu.Lock()
m.receivedQuery = query
m.receivedQueriesPayloads = append(m.receivedQueriesPayloads, string(query.TracesQueryPayload))
m.mu.Unlock()
// Assert expected payload
require.Equal(t, expectedQueryPayload, string(query.TracesQueryPayload))
// Send response back to server
response := &pb.TracesFromServer{
TraceServerRoute: query.TraceServerRoute,
TracesPayload: []byte(resultPayload),
MessageId: query.MessageId,
}
err = m.stream.Send(response)
require.NoError(t, err, "Failed to send response")
}
}
// LastReceivedQuery returns the last received query (thread-safe)
func (m *TaskmonOnCluster) LastReceivedQuery() *pb.TracesQuery {
m.mu.RLock()
defer m.mu.RUnlock()
return m.receivedQuery
}
// ReceivedQueriesPayloads returns all received payloads (thread-safe)
func (m *TaskmonOnCluster) ReceivedQueriesPayloads() []string {
m.mu.RLock()
defer m.mu.RUnlock()
return m.receivedQueriesPayloads
}
```
## Usage in Integration Tests
### Complete Test Suite Example
```go
//go:build integration
package integration_test
import (
"context"
"net"
"testing"
"time"
"github.com/stretchr/testify/suite"
"google.golang.org/grpc"
"google.golang.org/grpc/credentials/insecure"
"google.golang.org/grpc/test/bufconn"
nserver "github.com/nats-io/nats-server/v2/server"
pb "myproject/grpc_api/gen/go/traces/v1"
"myproject/internal/natsremotetraces"
"myproject/internal/remotetraces"
"myproject/internal/testutils"
)
type RemoteTracesTestSuite struct {
suite.Suite
lis *bufconn.Listener // In-memory gRPC connection
ctx context.Context
natsServer *nserver.Server // In-memory NATS
}
func (suite *RemoteTracesTestSuite) SetupSuite() {
suite.ctx = context.Background()
// Start in-memory NATS server (Level 1)
natsServer, err := testutils.RunNATsServer()
suite.Require().NoError(err)
suite.natsServer = natsServer
// Connect to NATS
natsAddress := "nats://" + natsServer.Addr().String()
nc, err := natsremotetraces.ConnectToRemoteTracesSession(suite.ctx, natsAddress, 2, 2, 10)
suite.Require().NoError(err)
// ** System Under Test: gRPC Server **
// Use bufconn for in-memory gRPC (no network I/O!)
suite.lis = bufconn.Listen(1024 * 1024)
s := grpc.NewServer()
// Your gRPC server implementation
remoteTracesServer := remotetraces.NewGRPCServer(nc, 10, 10, time.Second)
pb.RegisterRemoteTracesServiceServer(s, remoteTracesServer)
go func() {
if err := s.Serve(suite.lis); err != nil {
suite.NoError(err)
}
}()
}
func (suite *RemoteTracesTestSuite) bufDialer(ctx context.Context, _ string) (net.Conn, error) {
return suite.lis.DialContext(ctx)
}
func (suite *RemoteTracesTestSuite) TestStreamTraces() {
// Create gRPC client (connects to your server)
conn, err := grpc.NewClient("passthrough:///bufnet",
grpc.WithContextDialer(suite.bufDialer),
grpc.WithTransportCredentials(insecure.NewCredentials()),
)
suite.Require().NoError(err)
defer conn.Close()
client := pb.NewRemoteTracesServiceClient(conn)
// Create rich gRPC client mock (testutils DSL!)
clusterRoutingKey := "test-cluster-123"
taskmonMock, err := testutils.OpenTaskmonToWekaHomeStream(
suite.ctx, client, clusterRoutingKey, "taskmon-1")
suite.Require().NoError(err)
defer taskmonMock.Close()
expectedQuery := "fetch_traces_query"
expectedResult := "traces_result_data"
// Start listening (this makes the test readable!)
go taskmonMock.ListenToStreamAndAssert(suite.T(), expectedQuery, expectedResult)
// Send query to server (via NATS or HTTP API)
// ... your test logic here ...
// Assert using helper methods
suite.Eventually(func() bool {
return taskmonMock.LastReceivedQuery() != nil &&
string(taskmonMock.LastReceivedQuery().TracesQueryPayload) == expectedQuery
}, 5*time.Second, 500*time.Millisecond)
}
func TestRemoteTracesTestSuite(t *testing.T) {
suite.Run(t, new(RemoteTracesTestSuite))
}
```
## Why This Pattern is Excellent
1. **Rich DSL** - `OpenTaskmonToWekaHomeStream()` returns friendly object with helper methods
2. **Helper Methods** - `ListenToStreamAndAssert()`, `LastReceivedQuery()`, `ReceivedQueriesPayloads()`
3. **Thread-Safe** - Mutex protects shared state for concurrent access
4. **Readable Tests** - Tests read like documentation, clear intent
5. **In-Memory** - Uses `bufconn` (no network I/O, pure Go)
6. **Reusable** - Same mock for unit, integration, and system tests
7. **Event-Driven** - Can add channels for connection events if needed
## Key Design Principles
### Testing Direction
- **Testing a server?** → Mock the **clients** that connect to it
- **Testing a client?** → Mock the **server** it connects to
### DSL Benefits
- Use rich DSL objects with helper methods
- Make tests read like documentation
- Hide complexity behind clean interfaces
- Provide thread-safe state tracking
- Enable fluent assertions
### In-Memory with bufconn
`bufconn` provides an in-memory, full-duplex network connection:
- No network I/O overhead
- No port allocation needed
- Faster than TCP loopback
- Perfect for CI/CD
- Deterministic behavior
## Benefits
- **No Docker required** - Pure Go, works anywhere
- **No binary downloads** - Everything in-memory
- **No network I/O** - Unless testing actual network code
- **Perfect for CI/CD** - Fast, reliable, no external dependencies
- **Lightning fast** - Microsecond startup time
- **Thread-safe** - Concurrent test execution safe
## Alternative: Testing gRPC Clients
If you're testing a **gRPC client**, mock the **server** instead:
```go
// internal/testutils/grpc_server_mock.go
type MockGRPCServer struct {
pb.UnimplementedRemoteTracesServiceServer
mu sync.Mutex
receivedQueries []*pb.TracesQuery
}
func (m *MockGRPCServer) StreamTraces(stream pb.RemoteTracesService_StreamTracesServer) error {
// Mock server implementation
// Store received queries, send responses
// ...
return nil
}
// Usage
server := testutils.NewMockGRPCServer()
lis := bufconn.Listen(1024 * 1024)
s := grpc.NewServer()
pb.RegisterRemoteTracesServiceServer(s, server)
// ... test your client against this mock server
```
## Key Takeaways
1. **bufconn is Level 1** - In-memory, no external dependencies
2. **Mock the opposite end** - Server → mock clients, Client → mock server
3. **Rich DSL makes tests readable** - Helper methods, clear intent
4. **Thread-safe state tracking** - Use mutexes for concurrent access
5. **Reusable across test levels** - Same infrastructure everywhere
6. **Check for official test harnesses first** - Many libraries provide them (like NATS)


@@ -0,0 +1,281 @@
# HTTP Test Server with DSL Pattern
## When to Use This Example
Use this when:
- Testing HTTP clients or APIs
- Need simple, readable HTTP mocking
- Want to avoid complex mock frameworks
- Testing REST APIs, webhooks, or HTTP integrations
**Dependency Level**: Level 1 (In-Memory) - Uses stdlib `httptest.Server`
## Basic httptest.Server Pattern
### Simple HTTP Mock
```go
func TestAPIClient(t *testing.T) {
// Create test server
server := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
// Mock API response
w.WriteHeader(http.StatusOK)
json.NewEncoder(w).Encode(map[string]string{"status": "ok"})
}))
defer server.Close()
// Use real HTTP client with test server URL
client := NewAPIClient(server.URL)
result, err := client.GetStatus()
assert.NoError(t, err)
assert.Equal(t, "ok", result.Status)
}
```
## DSL Pattern for Readable Tests
### Without DSL (Verbose)
```go
func TestUserAPI(t *testing.T) {
handler := http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
if r.Method == "GET" && r.URL.Path == "/users/1" {
w.WriteHeader(200)
json.NewEncoder(w).Encode(map[string]string{"id": "1", "name": "Alice"})
} else if r.Method == "POST" && r.URL.Path == "/users" {
// ... more complex logic
} else {
w.WriteHeader(404)
}
})
server := httptest.NewServer(handler)
defer server.Close()
// ... test
}
```
### With DSL (Readable)
```go
func TestUserAPI(t *testing.T) {
mockAPI := httpserver.New().
OnGET("/users/1").
RespondJSON(200, User{ID: "1", Name: "Alice"}).
OnPOST("/users").
WithBodyMatcher(hasRequiredFields).
RespondJSON(201, User{ID: "2", Name: "Bob"}).
Build()
defer mockAPI.Close()
// Test reads like documentation!
client := NewAPIClient(mockAPI.URL())
user, err := client.GetUser("1")
// ... assertions
}
```
## Implementing the DSL
### Basic DSL Structure
```go
// internal/testutils/httpserver/server.go
package httpserver
import (
"encoding/json"
"net/http"
"net/http/httptest"
)
type MockServer struct {
routes map[string]map[string]mockRoute // method -> path -> handler
server *httptest.Server
}
type mockRoute struct {
statusCode int
response any
matcher func(*http.Request) bool
}
func New() *MockServerBuilder {
return &MockServerBuilder{
routes: make(map[string]map[string]mockRoute),
}
}
type MockServerBuilder struct {
routes map[string]map[string]mockRoute
}
func (b *MockServerBuilder) OnGET(path string) *RouteBuilder {
return &RouteBuilder{
builder: b,
method: "GET",
path: path,
}
}
func (b *MockServerBuilder) OnPOST(path string) *RouteBuilder {
return &RouteBuilder{
builder: b,
method: "POST",
path: path,
}
}
type RouteBuilder struct {
builder *MockServerBuilder
method string
path string
statusCode int
response any
matcher func(*http.Request) bool
}
func (r *RouteBuilder) RespondJSON(statusCode int, response any) *MockServerBuilder {
if r.builder.routes[r.method] == nil {
r.builder.routes[r.method] = make(map[string]mockRoute)
}
r.builder.routes[r.method][r.path] = mockRoute{
statusCode: statusCode,
response: response,
matcher: r.matcher,
}
return r.builder
}
func (r *RouteBuilder) WithBodyMatcher(matcher func(*http.Request) bool) *RouteBuilder {
r.matcher = matcher
return r
}
func (b *MockServerBuilder) Build() *MockServer {
mock := &MockServer{routes: b.routes}
handler := http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
methodRoutes, ok := mock.routes[r.Method]
if !ok {
w.WriteHeader(http.StatusMethodNotAllowed)
return
}
route, ok := methodRoutes[r.URL.Path]
if !ok {
w.WriteHeader(http.StatusNotFound)
return
}
if route.matcher != nil && !route.matcher(r) {
w.WriteHeader(http.StatusBadRequest)
return
}
w.WriteHeader(route.statusCode)
json.NewEncoder(w).Encode(route.response)
})
mock.server = httptest.NewServer(handler)
return mock
}
func (m *MockServer) URL() string {
return m.server.URL
}
func (m *MockServer) Close() {
m.server.Close()
}
```
## Simple In-Memory Patterns
### In-Memory Repository
```go
// user/inmem.go
package user
type InMemoryRepository struct {
mu sync.RWMutex
users map[UserID]User
}
func NewInMemoryRepository() *InMemoryRepository {
return &InMemoryRepository{
users: make(map[UserID]User),
}
}
func (r *InMemoryRepository) Save(ctx context.Context, u User) error {
r.mu.Lock()
defer r.mu.Unlock()
r.users[u.ID] = u
return nil
}
func (r *InMemoryRepository) Get(ctx context.Context, id UserID) (*User, error) {
r.mu.RLock()
defer r.mu.RUnlock()
u, ok := r.users[id]
if !ok {
return nil, ErrNotFound
}
return &u, nil
}
```
### Test Email Sender
```go
// user/test_emailer.go
package user
import (
"bytes"
"fmt"
"sync"
)
type TestEmailer struct {
mu sync.Mutex
buffer bytes.Buffer
}
func NewTestEmailer() *TestEmailer {
return &TestEmailer{}
}
func (e *TestEmailer) Send(to Email, subject, body string) error {
e.mu.Lock()
defer e.mu.Unlock()
fmt.Fprintf(&e.buffer, "To: %s\nSubject: %s\n%s\n\n", to, subject, body)
return nil
}
func (e *TestEmailer) SentEmails() string {
e.mu.Lock()
defer e.mu.Unlock()
return e.buffer.String()
}
```
## Benefits
- **Simple** - Built on stdlib, no external dependencies
- **Readable** - DSL makes tests self-documenting
- **Fast** - In-memory, microsecond startup
- **Flexible** - Easy to extend with new methods
- **Reusable** - Same pattern for all HTTP testing
## Key Takeaways
1. **Start with httptest.Server** - Simple and powerful
2. **Add DSL for readability** - When tests get complex
3. **Keep implementations simple** - In-memory maps, buffers
4. **Thread-safe** - Use mutexes for concurrent access
5. **Test your test infrastructure** - It's production code


@@ -0,0 +1,248 @@
# Integration Test Patterns
## Purpose
Integration tests verify that components work together correctly. They test the seams between packages, ensure proper data flow, and validate that integrated components behave as expected.
**When to Write**: After unit testing individual components, test how they interact.
## File Organization
### Option 1: In Package with Build Tags (Preferred)
```go
//go:build integration
package user_test
import (
"testing"
"myproject/internal/testutils"
)
func TestUserService_Integration(t *testing.T) {
// Integration test
}
```
### Option 2: Separate Package
```
user/
├── user.go
├── user_test.go # Unit tests
└── integration/
└── user_integration_test.go # Integration tests
```
## Pattern 1: Service + Repository (In-Memory)
**Use when**: Testing service logic with data persistence
```go
//go:build integration
package user_test
import (
"context"
"testing"
"github.com/stretchr/testify/require"
"myproject/user"
)
func TestUserService_CreateAndRetrieve(t *testing.T) {
// Setup: In-memory repository (Level 1)
repo := user.NewInMemoryRepository()
svc := user.NewUserService(repo, nil)
ctx := context.Background()
// Create user
userID, _ := user.NewUserID("usr_123")
email, _ := user.NewEmail("alice@example.com")
newUser := user.User{
ID: userID,
Name: "Alice",
Email: email,
}
err := svc.CreateUser(ctx, newUser)
require.NoError(t, err)
// Retrieve user
retrieved, err := svc.GetUser(ctx, userID)
require.NoError(t, err)
require.Equal(t, "Alice", retrieved.Name)
require.Equal(t, email, retrieved.Email)
}
```
## Pattern 2: Testing with Real External Service
**Use when**: Need to test against real service behavior (Victoria Metrics, NATS, etc.)
```go
//go:build integration
package metrics_test
import (
"context"
"testing"
"github.com/stretchr/testify/require"
"myproject/internal/testutils"
"myproject/metrics"
)
func TestMetricsIngest_WithVictoriaMetrics(t *testing.T) {
// Start real Victoria Metrics (Level 2 - binary)
vmServer, err := testutils.RunVictoriaMetricsServer()
require.NoError(t, err)
defer vmServer.Shutdown()
// Create service with real dependency
svc := metrics.NewIngester(vmServer.WriteURL())
// Test ingestion
err = svc.IngestMetric(context.Background(), "test_metric", 42.0)
require.NoError(t, err)
// Force flush and verify
vmServer.ForceFlush(context.Background())
results, err := testutils.QueryVictoriaMetrics(vmServer.QueryURL(), "test_metric")
require.NoError(t, err)
require.Len(t, results, 1)
}
```
## Pattern 3: Multi-Component Workflow
**Use when**: Testing complete workflows across multiple components
```go
//go:build integration
package workflow_test
import (
"context"
"testing"
"time"
"github.com/stretchr/testify/suite"
nserver "github.com/nats-io/nats-server/v2/server"
"myproject/internal/testutils"
"myproject/notification"
"myproject/user"
)
type UserWorkflowSuite struct {
suite.Suite
userRepo *user.InMemoryRepository
emailer *user.TestEmailer
natsServer *nserver.Server
userService *user.UserService
notifSvc *notification.NotificationService
}
func (s *UserWorkflowSuite) SetupSuite() {
// Setup in-memory NATS (Level 1)
natsServer, err := testutils.RunNATsServer()
s.Require().NoError(err)
s.natsServer = natsServer
// Setup components
s.userRepo = user.NewInMemoryRepository()
s.emailer = user.NewTestEmailer()
s.userService = user.NewUserService(s.userRepo, s.emailer)
natsAddr := "nats://" + natsServer.Addr().String()
s.notifSvc = notification.NewService(natsAddr)
}
func (s *UserWorkflowSuite) TearDownSuite() {
s.natsServer.Shutdown()
}
func (s *UserWorkflowSuite) TestCreateUser_TriggersNotification() {
ctx := context.Background()
// Subscribe to notifications
received := make(chan string, 1)
s.notifSvc.Subscribe("user.created", func(msg string) {
received <- msg
})
// Create user
userID, _ := user.NewUserID("usr_123")
email, _ := user.NewEmail("alice@example.com")
newUser := user.User{ID: userID, Name: "Alice", Email: email}
err := s.userService.CreateUser(ctx, newUser)
s.Require().NoError(err)
// Verify notification sent
select {
case msg := <-received:
s.Contains(msg, "Alice")
case <-time.After(2 * time.Second):
s.Fail("timeout waiting for notification")
}
// Verify email sent
emails := s.emailer.SentEmails()
s.Contains(emails, "alice@example.com")
}
func TestUserWorkflowSuite(t *testing.T) {
suite.Run(t, new(UserWorkflowSuite))
}
```
## Dependency Priority
1. **Level 1: In-Memory** (Preferred) - httptest, in-memory maps, NATS harness
2. **Level 2: Binary** (When needed) - Victoria Metrics, standalone services
3. **Level 3: Test-containers** (Last resort) - Docker containers, slow startup
## Best Practices
### DO:
- Test seams between components
- Use in-memory implementations when possible
- Test happy path and error scenarios
- Use testify suites for complex setup
- Focus on data flow and integration points
### DON'T:
- Don't test business logic (that's unit tests)
- Don't use heavy mocking (use real implementations)
- Don't require Docker unless absolutely necessary
- Don't duplicate unit test coverage
- Don't skip cleanup (always defer)
## Running Integration Tests
```bash
# Skip integration tests (default)
go test ./...
# Run with integration tests
go test -tags=integration ./...
# Run only integration tests
go test -tags=integration ./... -run Integration
# With coverage
go test -tags=integration -coverprofile=coverage.out ./...
```
## Key Takeaways
1. **Test component interactions** - Not individual units
2. **Prefer real implementations** - Over mocks when possible
3. **Use build tags** - Keep unit tests fast
4. **Reuse testutils** - Same infrastructure across tests
5. **Test workflows** - Not just individual operations


@@ -0,0 +1,263 @@
# JSON-RPC Server Mock with DSL
## When to Use This Example
Use this when:
- Testing JSON-RPC clients
- Need to mock JSON-RPC server responses
- Want configurable mock behavior per method
- Need to track and assert on received requests
- Testing with OpenTelemetry trace propagation
**Dependency Level**: Level 1 (In-Memory) - Uses `httptest.Server` for in-memory HTTP
**Key Insight**: When testing a **JSON-RPC client**, mock the **server** it calls. Use rich DSL for readable test setup.
## Implementation
### Rich JSON-RPC Server Mock
```go
// internal/testutils/jrpc_server_mock.go
package testutils
import (
"errors"
"fmt"
"net/http"
"net/http/httptest"
"github.com/gorilla/rpc/v2/json2"
"go.opentelemetry.io/otel"
"go.opentelemetry.io/otel/propagation"
"go.opentelemetry.io/otel/trace"
)
var ErrMethodNotFound = errors.New("method not found")
// TraceQuery holds received JSON-RPC queries for assertions
type TraceQuery struct {
Method string
Params string
}
// JrpcTraceServerMock is a rich JSON-RPC server mock with DSL.
// Uses httptest.Server for in-memory HTTP (Level 1).
type JrpcTraceServerMock struct {
tracer trace.Tracer
server *httptest.Server
mockResponses map[string]any // method -> response
queriesReceived []TraceQuery // for assertions
}
// StartJrpcTraceServerMock starts an in-memory JSON-RPC server.
// Returns a rich DSL object for configuring mock responses.
func StartJrpcTraceServerMock() *JrpcTraceServerMock {
mock := &JrpcTraceServerMock{
mockResponses: make(map[string]any),
tracer: otel.Tracer("trace-server-mock"),
}
mux := mock.createHTTPHandlers()
mock.server = httptest.NewServer(mux)
return mock
}
// AddMockResponse configures the mock to return a response for a method.
// This is the DSL - call it once per method you need to stub.
func (m *JrpcTraceServerMock) AddMockResponse(method string, response any) {
m.mockResponses[method] = response
}
// GetQueriesReceived returns all queries received (for assertions)
func (m *JrpcTraceServerMock) GetQueriesReceived() []TraceQuery {
return m.queriesReceived
}
// Close shuts down the server (idempotent)
func (m *JrpcTraceServerMock) Close() {
m.server.Close()
}
// Address returns the server address (for client configuration)
func (m *JrpcTraceServerMock) Address() string {
return m.server.Listener.Addr().String()
}
func (m *JrpcTraceServerMock) createHTTPHandlers() *http.ServeMux {
mux := http.NewServeMux()
codec := json2.NewCodec()
mux.HandleFunc("/reader", func(w http.ResponseWriter, r *http.Request) {
// Extract OpenTelemetry context for realistic testing
reqCtx := r.Context()
reqCtx = otel.GetTextMapPropagator().Extract(reqCtx, propagation.HeaderCarrier(r.Header))
reqCtx, span := m.tracer.Start(reqCtx, "jrpc-trace-server",
trace.WithSpanKind(trace.SpanKindServer))
defer span.End()
if r.Method != http.MethodPost {
w.WriteHeader(http.StatusMethodNotAllowed)
return
}
receivedReq := codec.NewRequest(r)
method, err := receivedReq.Method()
if err != nil {
w.WriteHeader(http.StatusBadRequest)
return
}
// Check if we have a mock response configured
if response, exists := m.mockResponses[method]; exists {
args := struct{}{}
if err := receivedReq.ReadRequest(&args); err != nil {
receivedReq.WriteError(w, http.StatusBadRequest, err)
return
}
// Store query for assertions
m.queriesReceived = append(m.queriesReceived, TraceQuery{
Method: method,
Params: fmt.Sprintf("%+v", args),
})
// Write mock response
receivedReq.WriteResponse(w, response)
return
}
		// Method not configured - drain the request, then report an error
		params := []string{}
		_ = receivedReq.ReadRequest(&params) // best-effort read; we reply with an error regardless
		receivedReq.WriteError(w, http.StatusBadRequest, ErrMethodNotFound)
})
return mux
}
```
## Usage Examples
### Setup in Test Suite
```go
func (suite *TaskmonTestSuite) SetupSuite() {
// Start in-memory JSON-RPC server mock (Level 1)
suite.jrpcServerMock = testutils.StartJrpcTraceServerMock()
// Configure mock responses using DSL
suite.jrpcServerMock.AddMockResponse("protocol", struct {
Version string `json:"version"`
Date string `json:"date"`
}{
Version: "3.18.0",
Date: "Sep-04-2018",
})
suite.jrpcServerMock.AddMockResponse("get_traces", struct {
Traces []string `json:"traces"`
}{
Traces: []string{"trace1", "trace2"},
})
// Configure your client to use the mock server
	// Address() returns host:port, so prepend the scheme for the client URL
	client := jrpc.NewClient("http://" + suite.jrpcServerMock.Address() + "/reader")
}
func (suite *TaskmonTestSuite) TearDownSuite() {
suite.jrpcServerMock.Close()
}
```
### Test with Assertions
```go
func (suite *TaskmonTestSuite) TestProtocolVersion() {
// Call your code that makes JSON-RPC requests
version, err := suite.taskmon.GetProtocolVersion()
suite.Require().NoError(err)
suite.Equal("3.18.0", version.Version)
// Assert on received queries
queries := suite.jrpcServerMock.GetQueriesReceived()
suite.Require().Len(queries, 1)
suite.Equal("protocol", queries[0].Method)
}
func (suite *TaskmonTestSuite) TestGetTraces() {
// Call your code
traces, err := suite.taskmon.GetTraces()
suite.Require().NoError(err)
suite.Equal([]string{"trace1", "trace2"}, traces)
// Verify the right method was called
queries := suite.jrpcServerMock.GetQueriesReceived()
suite.Require().Len(queries, 2) // protocol + get_traces
suite.Equal("get_traces", queries[1].Method)
}
```
## Why This Pattern is Excellent
1. **Rich DSL** - `AddMockResponse()` for easy, readable configuration
2. **Readable Setup** - Tests are self-documenting, clear intent
3. **In-Memory** - Uses `httptest.Server` (Level 1, no network I/O)
4. **Query Tracking** - `GetQueriesReceived()` for assertions on what was called
5. **OpenTelemetry Integration** - Realistic trace propagation for observability testing
6. **Idempotent Cleanup** - Safe to call `Close()` multiple times
7. **Flexible** - Configure any method/response combination dynamically
## Key Design Principles
### DSL for Configuration
Mock setup should read like configuration:
```go
mock.AddMockResponse("method_name", expectedResponse)
mock.AddMockResponse("another_method", anotherResponse)
```
### Query Tracking for Assertions
Always track what was received:
- Method names called
- Parameters passed
- Order of calls
- Number of calls
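The tracking side can be sketched with a tiny stdlib-only recorder. The `callRecorder` type here is hypothetical (not part of the mock above); it only illustrates the append-then-assert pattern:

```go
package main

import "fmt"

// callRecorder is a minimal, hypothetical sketch of query tracking:
// every handled request is appended so tests can assert on it later.
type callRecorder struct {
	calls []string // method names, in call order
}

func (r *callRecorder) record(method string) {
	r.calls = append(r.calls, method)
}

// callsTo counts how many times a method was invoked.
func (r *callRecorder) callsTo(method string) int {
	n := 0
	for _, c := range r.calls {
		if c == method {
			n++
		}
	}
	return n
}

func main() {
	rec := &callRecorder{}
	rec.record("protocol")
	rec.record("get_traces")
	rec.record("get_traces")

	fmt.Println(len(rec.calls))            // 3 (total calls)
	fmt.Println(rec.calls[0])              // protocol (call order preserved)
	fmt.Println(rec.callsTo("get_traces")) // 2
}
```

The same shape backs `GetQueriesReceived()` above: record everything, expose it read-only, assert in the test.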
### Built on httptest.Server
httptest.Server provides:
- In-memory HTTP (no network I/O)
- Automatic address allocation
- Clean lifecycle management
- Standard library, no dependencies
## Pattern Comparison
| Pattern | Use When |
|---------|----------|
| **httptest.Server** | Simple HTTP mocking |
| **NATS test harness** | Need real NATS (pub/sub) |
| **gRPC client mock** | Testing gRPC **server** |
| **JSON-RPC server mock** | Testing JSON-RPC **client** |
## Benefits
- **In-Memory** - No network I/O, pure Go
- **Fast** - Microsecond startup time
- **Configurable** - Dynamic response configuration per test
- **Trackable** - Full visibility into received requests
- **OpenTelemetry-aware** - Realistic trace propagation
- **Reusable** - Same infrastructure across test levels
## Key Takeaways
1. **Mock servers should have rich DSL** - Makes setup readable
2. **Track received requests** - Essential for assertions
3. **Use httptest.Server** - Perfect for HTTP-based protocols
4. **Make setup read like configuration** - Self-documenting tests
5. **Support trace propagation** - Realistic observability testing
6. **Idempotent cleanup** - Safe resource management

View File

@@ -0,0 +1,175 @@
# NATS In-Memory Test Server
## When to Use This Example
Use this when:
- Testing message queue integrations with NATS
- Need pub/sub functionality in tests
- Want fast, in-memory NATS server (no Docker, no binary)
- Testing event-driven architectures
**Dependency Level**: Level 1 (In-Memory) - Pure Go, official test harness
## Implementation
### Setup Test Infrastructure
Many official SDKs provide test harnesses. Here's NATS:
```go
// internal/testutils/nats.go
package testutils
import (
nserver "github.com/nats-io/nats-server/v2/server"
natsserver "github.com/nats-io/nats-server/v2/test"
"github.com/projectdiscovery/freeport"
)
// RunNATsServer runs a NATS server in-memory for testing.
// Uses the official NATS SDK test harness - no binary download needed!
func RunNATsServer() (*nserver.Server, error) {
opts := natsserver.DefaultTestOptions
// Allocate free port to prevent conflicts in parallel tests
tcpPort, err := freeport.GetFreePort("127.0.0.1", freeport.TCP)
if err != nil {
return nil, err
}
opts.Port = tcpPort.Port
// Start NATS server in-memory (pure Go!)
return natsserver.RunServer(&opts), nil
}
// RunNATsServerWithJetStream runs NATS with JetStream enabled
func RunNATsServerWithJetStream() (*nserver.Server, error) {
opts := natsserver.DefaultTestOptions
tcpPort, err := freeport.GetFreePort("127.0.0.1", freeport.TCP)
if err != nil {
return nil, err
}
opts.Port = tcpPort.Port
opts.JetStream = true
return natsserver.RunServer(&opts), nil
}
```
## Usage in Integration Tests
### Basic Pub/Sub Test
```go
//go:build integration
package integration_test
import (
"testing"
"time"
"github.com/nats-io/nats.go"
"github.com/stretchr/testify/require"
"myproject/internal/testutils"
)
func TestNATSPubSub_Integration(t *testing.T) {
// Start NATS server in-memory (Level 1 - pure Go!)
natsServer, err := testutils.RunNATsServer()
require.NoError(t, err)
defer natsServer.Shutdown()
// Connect to in-memory NATS
natsAddress := "nats://" + natsServer.Addr().String()
nc, err := nats.Connect(natsAddress)
require.NoError(t, err)
defer nc.Close()
// Test pub/sub
received := make(chan string, 1)
_, err = nc.Subscribe("test.subject", func(msg *nats.Msg) {
received <- string(msg.Data)
})
require.NoError(t, err)
// Publish message
err = nc.Publish("test.subject", []byte("hello"))
require.NoError(t, err)
// Wait for message
select {
case msg := <-received:
require.Equal(t, "hello", msg)
case <-time.After(1 * time.Second):
t.Fatal("timeout waiting for message")
}
}
```
### Real-World Usage Example (gRPC + NATS)
```go
// tests/gointegration/remote_traces_test.go
type RemoteTracesTestSuite struct {
suite.Suite
natsServer *nserver.Server
natsAddress string
nc *nats.Conn
// ... other fields
}
func (suite *RemoteTracesTestSuite) SetupSuite() {
// Start NATS server in-memory
natsServer, err := testutils.RunNATsServer()
suite.Require().NoError(err)
suite.natsServer = natsServer
suite.natsAddress = "nats://" + natsServer.Addr().String()
// Connect application to in-memory NATS
suite.nc, err = natsremotetraces.ConnectToRemoteTracesSession(
suite.ctx, suite.natsAddress, numWorkers, numWorkers, channelSize)
suite.Require().NoError(err)
// Start gRPC server with NATS backend
// ... rest of setup
}
func (suite *RemoteTracesTestSuite) TearDownSuite() {
suite.nc.Close()
suite.natsServer.Shutdown() // Clean shutdown
}
func (suite *RemoteTracesTestSuite) TestMessageFlow() {
// Test your application logic that uses NATS
// ...
}
```
## Why This is Excellent
- **Pure Go** - NATS server imported as library (no binary download)
- **Official** - Uses NATS SDK's official test harness
- **Fast** - Starts in microseconds
- **Reliable** - Same behavior as production NATS
- **Portable** - Works anywhere Go runs
- **No Docker** - No external dependencies
- **Parallel-Safe** - Free port allocation prevents conflicts
## Other Libraries with Test Harnesses
- **Redis**: `github.com/alicebob/miniredis` - Pure Go in-memory Redis
- **NATS**: `github.com/nats-io/nats-server/v2/test` (shown above)
- **PostgreSQL**: `github.com/fergusstrange/embedded-postgres` - Embedded PostgreSQL (manages the real binary)
- **MongoDB**: `github.com/tryvium-travels/memongo` - In-memory MongoDB
## Key Takeaways
1. **Check for official test harnesses first** - Many popular libraries provide them
2. **Use free port allocation** - Prevents conflicts in parallel tests
3. **Clean shutdown** - Always call `Shutdown()` in teardown
4. **Reusable infrastructure** - Same setup for unit, integration, and system tests

View File

@@ -0,0 +1,288 @@
# System Test Patterns
## Purpose
System tests are black-box tests that verify the entire application works correctly from an external perspective. They test via CLI or API, simulating real user interactions.
**Location**: `tests/` directory at project root (separate from package code)
## Principles
### Black Box Testing
- Test only via public interfaces (CLI, API)
- No access to internal packages
- Simulate real user behavior
- Test critical workflows end-to-end
### Independence in Go
- Strive for pure Go tests (no Docker required)
- Use in-memory mocks from `testutils`
- Binary dependencies when needed
- Avoid docker-compose in CI
## CLI Testing Patterns
### Pattern 1: Simple Command Execution
```go
// tests/cli_test.go
package tests
import (
"os/exec"
"strings"
"testing"
"github.com/stretchr/testify/require"
)
func TestCLI_Version(t *testing.T) {
// Execute CLI command
cmd := exec.Command("./myapp", "version")
output, err := cmd.CombinedOutput()
require.NoError(t, err)
require.Contains(t, string(output), "myapp version")
}
func TestCLI_Help(t *testing.T) {
cmd := exec.Command("./myapp", "--help")
output, err := cmd.CombinedOutput()
require.NoError(t, err)
require.Contains(t, string(output), "Usage:")
}
```
### Pattern 2: CLI with In-Memory Mocks
```go
// tests/cli_metrics_test.go
package tests
import (
"context"
"os/exec"
"testing"
"github.com/stretchr/testify/require"
"myproject/internal/testutils"
)
func TestCLI_MetricsIngest(t *testing.T) {
// Start Victoria Metrics (Level 2 - binary)
vmServer, err := testutils.RunVictoriaMetricsServer()
require.NoError(t, err)
defer vmServer.Shutdown()
// Test CLI against real Victoria Metrics
cmd := exec.Command("./myapp", "ingest",
"--metrics-url", vmServer.WriteURL(),
"--metric-name", "cli_test_metric",
"--value", "100")
output, err := cmd.CombinedOutput()
require.NoError(t, err)
require.Contains(t, string(output), "Metric ingested successfully")
// Verify with helpers
	require.NoError(t, vmServer.ForceFlush(context.Background()))
results, err := testutils.QueryVictoriaMetrics(vmServer.QueryURL(), "cli_test_metric")
require.NoError(t, err)
require.Len(t, results, 1)
}
```
### Pattern 3: CLI with File System
```go
// tests/cli_config_test.go
package tests
import (
"os"
"os/exec"
"path/filepath"
"testing"
"github.com/stretchr/testify/require"
)
func TestCLI_ConfigFile(t *testing.T) {
// Create temp directory
tempDir := t.TempDir()
configPath := filepath.Join(tempDir, "config.yaml")
// Write config file
configContent := `
server:
port: 8080
host: localhost
`
err := os.WriteFile(configPath, []byte(configContent), 0644)
require.NoError(t, err)
// Test CLI with config file
cmd := exec.Command("./myapp", "start", "--config", configPath, "--dry-run")
output, err := cmd.CombinedOutput()
require.NoError(t, err)
require.Contains(t, string(output), "Server would start on localhost:8080")
}
```
## API Testing Patterns
### Pattern 1: HTTP API with In-Memory Mocks
```go
// tests/api_test.go
package tests
import (
	"bytes"
	"encoding/json"
"net/http"
"os/exec"
"testing"
"time"
"github.com/stretchr/testify/require"
"myproject/internal/testutils"
)
func TestAPI_UserWorkflow(t *testing.T) {
// Start in-memory NATS (Level 1)
natsServer, err := testutils.RunNATsServer()
require.NoError(t, err)
defer natsServer.Shutdown()
natsAddr := "nats://" + natsServer.Addr().String()
// Start API server
	cmd := exec.Command("./myapp", "serve",
		"--port", "8080", // Fixed port for the example; prefer a free port parsed from logs
		"--nats-url", natsAddr)
	// Start in background
	err = cmd.Start()
	require.NoError(t, err)
	defer cmd.Process.Kill()
	apiURL := "http://localhost:8080"
	// Wait for the API to be ready - poll instead of a fixed sleep
	require.Eventually(t, func() bool {
		resp, err := http.Get(apiURL + "/users")
		if err != nil {
			return false
		}
		resp.Body.Close()
		return true
	}, 5*time.Second, 100*time.Millisecond)
// Test API workflow
// 1. Create user
createReq := map[string]string{
"name": "Alice",
"email": "alice@example.com",
}
body, _ := json.Marshal(createReq)
resp, err := http.Post(apiURL+"/users", "application/json", bytes.NewBuffer(body))
require.NoError(t, err)
defer resp.Body.Close()
require.Equal(t, http.StatusCreated, resp.StatusCode)
// Parse response
	var createResp map[string]string
	require.NoError(t, json.NewDecoder(resp.Body).Decode(&createResp))
	userID := createResp["id"]
// 2. Retrieve user
resp, err = http.Get(apiURL + "/users/" + userID)
require.NoError(t, err)
defer resp.Body.Close()
require.Equal(t, http.StatusOK, resp.StatusCode)
	var user map[string]string
	require.NoError(t, json.NewDecoder(resp.Body).Decode(&user))
	require.Equal(t, "Alice", user["name"])
}
```
## Architecture for Independence
### Dependency Injection Pattern
Design your application to accept dependency URLs:
```go
// cmd/myapp/main.go
func main() {
// Allow overriding dependencies via flags
natsURL := flag.String("nats-url", "nats://localhost:4222", "NATS server URL")
metricsURL := flag.String("metrics-url", "http://localhost:8428", "Metrics server URL")
flag.Parse()
// Use provided URLs (allows in-memory mocks in tests)
app := app.New(*natsURL, *metricsURL)
app.Run()
}
```
### Test with In-Memory Dependencies
```go
// tests/app_test.go
func TestApp_WithMocks(t *testing.T) {
// Start all mocks
natsServer, _ := testutils.RunNATsServer()
defer natsServer.Shutdown()
vmServer, _ := testutils.RunVictoriaMetricsServer()
defer vmServer.Shutdown()
// Test app with mocked dependencies (pure Go, no Docker!)
cmd := exec.Command("./myapp", "serve",
"--nats-url", "nats://"+natsServer.Addr().String(),
"--metrics-url", vmServer.WriteURL())
// ... test application
}
```
## Running System Tests
```bash
# Build application first
go build -o myapp ./cmd/myapp
# Run system tests
go test -v ./tests/...
# With coverage
go test -v -coverprofile=coverage.out ./tests/...
# Specific test
go test -v ./tests/... -run TestCLI_MetricsIngest
```
## Best Practices
### DO:
- Test via CLI/API only (black box)
- Use in-memory mocks from testutils
- Test critical end-to-end workflows
- Build binary before running tests
- Use temp directories for file operations
### DON'T:
- Don't import internal packages
- Don't test every edge case (that's unit/integration tests)
- Don't require Docker in CI
- Don't use sleep for timing (use polling/channels)
- Don't skip cleanup
## Key Takeaways
1. **Black box only** - Test via public interfaces
2. **Independent in Go** - No Docker required
3. **Use testutils mocks** - Reuse infrastructure
4. **Test critical paths** - Not every scenario
5. **Fast execution** - Should run quickly in CI

View File

@@ -0,0 +1,260 @@
# Test Organization and File Structure
## File Organization
### Basic Structure
```
user/
├── user.go
├── user_test.go # Unit tests for user.go
├── service.go
├── service_test.go # Unit tests for service.go
├── repository.go
└── repository_test.go # Unit tests for repository.go
```
### With Integration and System Tests
```
project/
├── user/
│ ├── user.go
│ ├── user_test.go # Unit tests (pkg_test)
│ ├── service.go
│ ├── service_test.go # Unit tests (pkg_test)
│ └── integration_test.go # Integration tests with //go:build integration
├── internal/
│ └── testutils/ # Reusable test infrastructure
│ ├── nats.go # In-memory NATS server
│ ├── victoria.go # Victoria Metrics binary management
│ └── httpserver/ # HTTP mock DSL
│ ├── server.go
│ └── server_test.go # Test the infrastructure!
└── tests/ # System tests (black box)
├── cli_test.go # CLI testing via exec.Command
└── api_test.go # API testing via HTTP client
```
## Package Naming
### Use `pkg_test` for Unit Tests
```go
// ✅ External package - tests public API only
package user_test
import (
"testing"
"github.com/yourorg/project/user"
)
func TestService_CreateUser(t *testing.T) {
// Test through public API
svc, _ := user.NewUserService(repo, notifier)
err := svc.CreateUser(ctx, testUser)
// ...
}
```
### Avoid Same Package Testing
```go
// ❌ Same package - can test private methods (don't do this)
package user
import "testing"
func TestInternalValidation(t *testing.T) {
// Testing private function - bad practice
result := validateEmailInternal("test@example.com")
// ...
}
```
## Build Tags for Integration Tests
### Using Build Tags
```go
//go:build integration
package user_test
import (
"context"
"testing"
"myproject/internal/testutils"
)
func TestUserService_Integration(t *testing.T) {
// Integration test with real dependencies
natsServer, _ := testutils.RunNATsServer()
defer natsServer.Shutdown()
// Test with real NATS
// ...
}
```
### Running Tests
```bash
# Run only unit tests (default - no build tags)
go test ./...
# Run unit + integration tests
go test -tags=integration ./...
# Run specific package integration tests
go test -tags=integration ./user
# Run system tests only
go test ./tests/...
# Run all tests
go test -tags=integration ./...
```
## Makefile/Taskfile Integration
### Taskfile.yml Example
```yaml
version: '3'
tasks:
test:
desc: Run unit tests
cmds:
- go test -v -race ./...
test:integration:
desc: Run integration tests
cmds:
- go test -v -race -tags=integration ./...
test:system:
desc: Run system tests
cmds:
- go test -v -race ./tests/...
test:all:
desc: Run all tests
cmds:
- task: test:integration
- task: test:system
test:coverage:
desc: Run tests with coverage
cmds:
- go test -v -race -coverprofile=coverage.out ./...
- go tool cover -html=coverage.out -o coverage.html
```
### Makefile Example
```makefile
.PHONY: test test-integration test-system test-all coverage
test:
go test -v -race ./...
test-integration:
go test -v -race -tags=integration ./...
test-system:
go test -v -race ./tests/...
test-all: test-integration test-system
coverage:
go test -v -race -coverprofile=coverage.out ./...
go tool cover -html=coverage.out -o coverage.html
```
## Test File Naming
### Unit Tests
- `*_test.go` - Standard test files
- Located next to the code being tested
- Use `pkg_test` package name
### Integration Tests
- `integration_test.go` or `*_integration_test.go`
- Use `//go:build integration` tag
- Can be in same directory or separate `integration/` folder
- Use `pkg_test` package name
### System Tests
- `*_test.go` in `tests/` directory at project root
- No build tags needed (separate directory)
- Use `tests` or `main_test` package name
## CI/CD Integration
### GitHub Actions Example
```yaml
name: Tests
on: [push, pull_request]
jobs:
unit-tests:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v3
- uses: actions/setup-go@v4
with:
go-version: '1.21'
- name: Run unit tests
run: go test -v -race ./...
integration-tests:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v3
- uses: actions/setup-go@v4
with:
go-version: '1.21'
- name: Run integration tests
run: go test -v -race -tags=integration ./...
system-tests:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v3
- uses: actions/setup-go@v4
with:
go-version: '1.21'
- name: Build application
run: go build -o myapp ./cmd/myapp
- name: Run system tests
run: go test -v -race ./tests/...
```
## testutils Package Structure
```
internal/testutils/
├── nats.go # NATS in-memory server helpers
├── victoria.go # Victoria Metrics binary management
├── prometheus.go # Prometheus payload helpers
├── grpc_client_mock.go # gRPC client mock with DSL
├── jrpc_server_mock.go # JSON-RPC server mock with DSL
└── httpserver/ # HTTP mock server with DSL
├── server.go
├── server_test.go # Test the infrastructure!
├── dsl.go
└── README.md
```
## Key Principles
1. **Co-locate unit tests** - Next to the code being tested
2. **Use pkg_test package** - Forces public API testing
3. **Build tags for integration** - Keep unit tests fast by default
4. **Separate system tests** - In `tests/` directory
5. **Test your test infrastructure** - Treat testutils as production code
6. **Reusable infrastructure** - Share across all test levels

View File

@@ -0,0 +1,566 @@
# Victoria Metrics Binary Test Server
## When to Use This Example
Use this when:
- Testing Prometheus Remote Write integrations
- Need real Victoria Metrics for testing metrics ingestion
- Testing PromQL queries
- Want production-like behavior without Docker
- Testing metrics pipelines end-to-end
**Dependency Level**: Level 2 (Binary) - Standalone executable via `exec.Command`
**Why Binary Instead of In-Memory:**
- Victoria Metrics is complex; reimplementing as in-memory mock isn't practical
- Need real PromQL engine behavior
- Need actual data persistence and querying
- Binary startup is fast (< 1 second) and requires no Docker
## Implementation
### Victoria Server Infrastructure
This example shows how to download, manage, and run Victoria Metrics binary for testing:
```go
// internal/testutils/victoria.go
package testutils
import (
"context"
"errors"
"fmt"
"net/http"
"os"
"os/exec"
"path/filepath"
"runtime"
"sync"
"time"
"github.com/projectdiscovery/freeport"
)
const (
DefaultVictoriaMetricsVersion = "v1.128.0"
VictoriaMetricsVersionEnvVar = "TEST_VICTORIA_METRICS_VERSION"
)
var (
ErrVictoriaMetricsNotHealthy = errors.New("victoria metrics did not become healthy")
ErrDownloadFailed = errors.New("download failed")
// binaryDownloadMu protects concurrent downloads (prevent race conditions)
binaryDownloadMu sync.Mutex
)
// VictoriaServer represents a running Victoria Metrics test instance
type VictoriaServer struct {
cmd *exec.Cmd
port int
dataPath string
writeURL string
queryURL string
version string
binaryPath string
shutdownOnce sync.Once
shutdownErr error
}
// WriteURL returns the URL for writing metrics (Prometheus Remote Write endpoint)
func (vs *VictoriaServer) WriteURL() string {
return vs.writeURL
}
// QueryURL returns the URL for querying metrics (Prometheus-compatible query endpoint)
func (vs *VictoriaServer) QueryURL() string {
return vs.queryURL
}
// Port returns the port Victoria Metrics is listening on
func (vs *VictoriaServer) Port() int {
return vs.port
}
// ForceFlush forces Victoria Metrics to flush buffered samples from memory to disk,
// making them immediately queryable. This is useful for testing to avoid waiting
// for the automatic flush cycle.
func (vs *VictoriaServer) ForceFlush(ctx context.Context) error {
url := fmt.Sprintf("http://localhost:%d/internal/force_flush", vs.port)
req, err := http.NewRequestWithContext(ctx, http.MethodGet, url, nil)
if err != nil {
return fmt.Errorf("failed to create force flush request: %w", err)
}
resp, err := http.DefaultClient.Do(req)
if err != nil {
return fmt.Errorf("failed to force flush: %w", err)
}
defer resp.Body.Close()
if resp.StatusCode != http.StatusOK {
return fmt.Errorf("force flush failed: status %d", resp.StatusCode)
}
return nil
}
// Shutdown stops Victoria Metrics and cleans up resources.
// Safe to call multiple times (idempotent).
func (vs *VictoriaServer) Shutdown() error {
vs.shutdownOnce.Do(func() {
if vs.cmd == nil || vs.cmd.Process == nil {
return
}
// Send interrupt signal for graceful shutdown
if err := vs.cmd.Process.Signal(os.Interrupt); err != nil {
vs.shutdownErr = err
return
}
// Wait for process to exit (with timeout)
done := make(chan error, 1)
go func() {
done <- vs.cmd.Wait()
}()
select {
case <-time.After(5 * time.Second):
vs.cmd.Process.Kill()
vs.shutdownErr = errors.New("shutdown timeout")
case err := <-done:
if err != nil && err.Error() != "signal: interrupt" {
vs.shutdownErr = err
}
}
// Cleanup data directory
if vs.dataPath != "" {
os.RemoveAll(vs.dataPath)
}
})
return vs.shutdownErr
}
// RunVictoriaMetricsServer starts a Victoria Metrics instance for testing.
// It downloads the binary if needed, starts the server, and waits for it to be healthy.
func RunVictoriaMetricsServer() (*VictoriaServer, error) {
version := getVictoriaMetricsVersion()
// Ensure binary exists (downloads if missing)
binaryPath, err := ensureVictoriaBinary(version)
if err != nil {
return nil, err
}
// Get free port (prevents conflicts in parallel tests)
freePort, err := freeport.GetFreePort("127.0.0.1", freeport.TCP)
if err != nil {
return nil, fmt.Errorf("failed to get free port: %w", err)
}
port := freePort.Port
// Create temporary data directory
dataPath, err := os.MkdirTemp("", "victoria-metrics-test-*")
if err != nil {
return nil, fmt.Errorf("failed to create temp directory: %w", err)
}
// Start Victoria Metrics
cmd := exec.Command(
binaryPath,
fmt.Sprintf("-httpListenAddr=:%d", port),
"-storageDataPath="+dataPath,
"-retentionPeriod=1d",
"-inmemoryDataFlushInterval=1ms", // Force immediate data flush for testing
)
cmd.Stdout = os.Stdout
cmd.Stderr = os.Stderr
if err := cmd.Start(); err != nil {
os.RemoveAll(dataPath)
return nil, fmt.Errorf("failed to start victoria metrics: %w", err)
}
baseURL := fmt.Sprintf("http://localhost:%d", port)
server := &VictoriaServer{
cmd: cmd,
port: port,
dataPath: dataPath,
writeURL: baseURL + "/api/v1/write",
queryURL: baseURL + "/api/v1/query",
version: version,
binaryPath: binaryPath,
}
// Wait for server to become healthy
if err := waitForHealth(baseURL); err != nil {
server.Shutdown()
return nil, err
}
return server, nil
}
func getVictoriaMetricsVersion() string {
if version := os.Getenv(VictoriaMetricsVersionEnvVar); version != "" {
return version
}
return DefaultVictoriaMetricsVersion
}
// ensureVictoriaBinary ensures the Victoria Metrics binary exists, downloading if necessary.
// Thread-safe with double-check locking to prevent race conditions.
func ensureVictoriaBinary(version string) (string, error) {
binaryName := fmt.Sprintf("victoria-metrics-%s-%s-%s", version, runtime.GOOS, getVMArch())
binaryPath := filepath.Join(".bin", binaryName)
// Quick check without lock (optimization)
if _, err := os.Stat(binaryPath); err == nil {
return binaryPath, nil
}
// Acquire lock to prevent concurrent downloads
binaryDownloadMu.Lock()
defer binaryDownloadMu.Unlock()
// Double-check after acquiring lock (another goroutine might have downloaded it)
if _, err := os.Stat(binaryPath); err == nil {
return binaryPath, nil
}
// Create .bin directory
if err := os.MkdirAll(".bin", 0755); err != nil {
return "", fmt.Errorf("failed to create .bin directory: %w", err)
}
// Download to temporary location with unique name
tempPath := fmt.Sprintf("%s.tmp.%d", binaryPath, os.Getpid())
defer os.Remove(tempPath)
downloadURL := fmt.Sprintf(
"https://github.com/VictoriaMetrics/VictoriaMetrics/releases/download/%s/victoria-metrics-%s-%s-%s.tar.gz",
version, runtime.GOOS, getVMArch(), version,
)
	// downloadAndExtract (helper not shown) fetches the tarball and extracts the binary to tempPath
	if err := downloadAndExtract(downloadURL, tempPath); err != nil {
		return "", fmt.Errorf("failed to download: %w", err)
	}
if err := os.Chmod(tempPath, 0755); err != nil {
return "", fmt.Errorf("failed to make binary executable: %w", err)
}
// Atomic rename - only one goroutine succeeds if multiple try
if err := os.Rename(tempPath, binaryPath); err != nil {
// If rename fails, check if another goroutine succeeded
if _, statErr := os.Stat(binaryPath); statErr == nil {
return binaryPath, nil // Another goroutine won the race
}
return "", fmt.Errorf("failed to rename binary: %w", err)
}
return binaryPath, nil
}
func getVMArch() string {
switch runtime.GOARCH {
case "amd64":
return "amd64"
case "arm64":
return "arm64"
default:
return runtime.GOARCH
}
}
func waitForHealth(baseURL string) error {
healthURL := baseURL + "/health"
maxRetries := 30
retryInterval := time.Second
ctx := context.Background()
for range maxRetries {
req, err := http.NewRequestWithContext(ctx, http.MethodGet, healthURL, nil)
if err != nil {
time.Sleep(retryInterval)
continue
}
resp, err := http.DefaultClient.Do(req)
if err == nil {
statusOK := resp.StatusCode == http.StatusOK
resp.Body.Close()
if statusOK {
return nil
}
}
time.Sleep(retryInterval)
}
return ErrVictoriaMetricsNotHealthy
}
```
### Helper Functions for Prometheus/Victoria Metrics Testing
Add practical helpers that make tests clear and maintainable:
```go
// internal/testutils/prometheus.go
package testutils
import (
"context"
"encoding/json"
"errors"
"fmt"
"io"
"net/http"
"net/url"
"testing"
"time"
"github.com/gogo/protobuf/proto"
"github.com/golang/snappy"
"github.com/prometheus/prometheus/prompb"
"github.com/stretchr/testify/require"
)
var (
ErrQueryFailed = errors.New("victoria metrics query failed")
ErrQueryNonSuccess = errors.New("query returned non-success status")
)
// CreatePrometheusPayload creates a valid Prometheus Remote Write payload
// with a sample metric. The payload is protobuf-encoded and snappy-compressed,
// ready to be sent to Victoria Metrics' /api/v1/write endpoint.
func CreatePrometheusPayload(metricName string, value float64, labels map[string]string) ([]byte, error) {
// Create timestamp (current time in milliseconds)
timestampMs := time.Now().UnixMilli()
// Build label pairs
labelPairs := make([]prompb.Label, 0, len(labels)+1)
labelPairs = append(labelPairs, prompb.Label{
Name: "__name__",
Value: metricName,
})
for name, val := range labels {
labelPairs = append(labelPairs, prompb.Label{
Name: name,
Value: val,
})
}
// Create a single time series with one sample
timeseries := []prompb.TimeSeries{
{
Labels: labelPairs,
Samples: []prompb.Sample{
{
Value: value,
Timestamp: timestampMs,
},
},
},
}
// Create WriteRequest
writeRequest := &prompb.WriteRequest{
Timeseries: timeseries,
}
// Marshal to protobuf
data, err := proto.Marshal(writeRequest)
if err != nil {
return nil, fmt.Errorf("failed to marshal protobuf: %w", err)
}
// Compress with snappy
compressed := snappy.Encode(nil, data)
return compressed, nil
}
// VMQueryResult represents a single result from a Victoria Metrics query.
type VMQueryResult struct {
Metric map[string]string // label name -> label value
Value []any // [timestamp, value_string]
}
// VMQueryResponse represents the full Victoria Metrics API response.
type VMQueryResponse struct {
Status string `json:"status"`
Data struct {
		ResultType string `json:"resultType"`
Result []VMQueryResult `json:"result"`
} `json:"data"`
}
// QueryVictoriaMetrics executes a PromQL query against Victoria Metrics.
// The query is performed via the /api/v1/query endpoint with time buffer
// for clock skew and delayed indexing.
func QueryVictoriaMetrics(queryURL, query string) ([]VMQueryResult, error) {
// Query with current time + 1 minute to catch any clock skew or delayed indexing
currentTime := time.Now().Add(1 * time.Minute)
fullURL := fmt.Sprintf("%s?query=%s&time=%d", queryURL, url.QueryEscape(query), currentTime.Unix())
// Execute HTTP request
ctx, cancel := context.WithTimeout(context.Background(), 5*time.Second)
defer cancel()
req, err := http.NewRequestWithContext(ctx, http.MethodGet, fullURL, nil)
if err != nil {
return nil, fmt.Errorf("failed to create request: %w", err)
}
resp, err := http.DefaultClient.Do(req)
if err != nil {
return nil, fmt.Errorf("failed to execute query: %w", err)
}
defer resp.Body.Close()
if resp.StatusCode != http.StatusOK {
return nil, fmt.Errorf("%w: %s", ErrQueryFailed, resp.Status)
}
// Read response body
bodyBytes, err := io.ReadAll(resp.Body)
if err != nil {
return nil, fmt.Errorf("failed to read response body: %w", err)
}
// Parse JSON response
var queryResp VMQueryResponse
if err := json.Unmarshal(bodyBytes, &queryResp); err != nil {
return nil, fmt.Errorf("failed to decode response: %w", err)
}
if queryResp.Status != "success" {
return nil, fmt.Errorf("%w: %s", ErrQueryNonSuccess, queryResp.Status)
}
return queryResp.Data.Result, nil
}
// AssertLabelExists checks if at least one result contains a label with the given name and value.
// Fails the test if the label is not found.
func AssertLabelExists(t *testing.T, results []VMQueryResult, labelName, labelValue string) {
t.Helper()
for _, result := range results {
if val, exists := result.Metric[labelName]; exists && val == labelValue {
return // Found it!
}
}
// Label not found - fail with helpful message
require.Fail(t, "Label not found",
"Expected to find label %s=%s in query results, but it was not present",
labelName, labelValue)
}
```
## Usage Examples
### Integration Test
```go
// internal/api/stats/prometheus_ingest_test.go
func TestPrometheusIngest_WithVictoriaMetrics(t *testing.T) {
// Start real Victoria Metrics server (Level 2)
vmServer, err := testutils.RunVictoriaMetricsServer()
require.NoError(t, err)
defer vmServer.Shutdown()
// Create valid Prometheus payload using helper
payload, err := testutils.CreatePrometheusPayload("test_metric", 42.0, map[string]string{
"service": "api",
"env": "test",
})
require.NoError(t, err)
	// Send to Victoria Metrics. Use http.NewRequest here, not httptest.NewRequest:
	// httptest requests set RequestURI, which http.Client refuses to send.
	req, err := http.NewRequest(http.MethodPost, vmServer.WriteURL(), bytes.NewBuffer(payload))
	require.NoError(t, err)
	req.Header.Set("Content-Type", "application/x-protobuf")
	req.Header.Set("Content-Encoding", "snappy")
	resp, err := http.DefaultClient.Do(req)
	require.NoError(t, err)
defer resp.Body.Close()
assert.Equal(t, http.StatusNoContent, resp.StatusCode)
// Force flush to make data queryable immediately
err = vmServer.ForceFlush(context.Background())
require.NoError(t, err)
// Query using helper
results, err := testutils.QueryVictoriaMetrics(vmServer.QueryURL(), `test_metric{service="api"}`)
require.NoError(t, err)
require.Len(t, results, 1)
// Assert using helper
testutils.AssertLabelExists(t, results, "env", "test")
}
```
### System Test
```go
// tests/prometheus_ingestion_test.go
func TestE2E_PrometheusIngestion(t *testing.T) {
// Same Victoria Metrics infrastructure!
vmServer, err := testutils.RunVictoriaMetricsServer()
require.NoError(t, err)
defer vmServer.Shutdown()
// Test CLI against real Victoria Metrics
cmd := exec.Command("./myapp", "ingest",
"--metrics-url", vmServer.WriteURL(),
"--metric-name", "cli_test_metric",
"--value", "100")
output, err := cmd.CombinedOutput()
require.NoError(t, err)
assert.Contains(t, string(output), "Metric ingested successfully")
	// Verify with helpers
	require.NoError(t, vmServer.ForceFlush(context.Background()))
results, err := testutils.QueryVictoriaMetrics(vmServer.QueryURL(), "cli_test_metric")
require.NoError(t, err)
require.Len(t, results, 1)
}
```
## Key Features
- **Binary download with OS/arch detection** - Works on macOS/Linux, amd64/arm64
- **Thread-safe download** - Mutex + double-check locking prevents race conditions
- **Free port allocation** - Prevents conflicts in parallel tests
- **Idempotent shutdown** - Safe to call multiple times with `sync.Once`
- **Resource cleanup** - Proper temp directory and process cleanup
- **Helper functions** - `ForceFlush()` for immediate data availability
- **Prometheus helpers** - Create payloads, query, assert on results
## Benefits
- **Production-like testing** - Testing against REAL Victoria Metrics, not mocks
- **Reusable** - Same `testutils` infrastructure for unit, integration, and system tests
- **Readable** - Helper functions make tests read like documentation
- **No Docker** - Runs without containers, works in any environment
- **Fast** - Binary starts in < 1 second
- **Portable** - Works anywhere Go runs
- **Maintainable** - Changes to test infrastructure are centralized
## Key Takeaways
1. **Binary level is good for complex services** - When in-memory is too complex
2. **Download management is critical** - Thread-safe, cached, version-controlled
3. **Helper functions make tests readable** - DSL for common operations
4. **Reuse across test levels** - Same infrastructure for unit, integration, system
5. **Force flush is essential** - Make data immediately queryable in tests

---
**File**: `skills/testing/reference.md` (new file, 689 lines)
# Testing Reference
Complete guide to Go testing principles and patterns.
## Core Testing Principles
### 1. Test Only Public API
- **Use `pkg_test` package name** - Forces external perspective
- **Test types via constructors** - No direct struct initialization
- **No testing private methods** - If you need to test it, make it public or rethink design
```go
// ✅ Good
package user_test
import "github.com/yourorg/project/user"
func TestService_CreateUser(t *testing.T) {
svc, _ := user.NewUserService(repo, notifier)
err := svc.CreateUser(ctx, testUser)
// ...
}
```
### 2. Avoid Mocks - Use Real Implementations
Instead of mocks, use:
- **HTTP test servers** (`httptest` package)
- **Temp files/directories** (`os.CreateTemp`, `os.MkdirTemp`)
- **In-memory databases** (SQLite in-memory, or custom implementations)
- **Test implementations** (TestEmailer that writes to buffer)
**Benefits:**
- Tests are more reliable
- Tests verify actual behavior
- Easier to maintain
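As a minimal sketch of the `httptest` approach: the stub server below plays the role of a real upstream API, so the client code under test talks to an actual HTTP server instead of a mock (names like `newStubAPI` and `fetchStatus` are illustrative, not from any library):

```go
package main

import (
	"fmt"
	"io"
	"net/http"
	"net/http/httptest"
)

// newStubAPI starts a real HTTP server that stands in for the upstream API.
// No mocking framework involved — just httptest from the standard library.
func newStubAPI() *httptest.Server {
	return httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		w.Header().Set("Content-Type", "application/json")
		fmt.Fprint(w, `{"status":"ok"}`)
	}))
}

// fetchStatus is a stand-in for the client code under test.
func fetchStatus(baseURL string) (string, error) {
	resp, err := http.Get(baseURL)
	if err != nil {
		return "", err
	}
	defer resp.Body.Close()
	body, err := io.ReadAll(resp.Body)
	if err != nil {
		return "", err
	}
	return string(body), nil
}

func main() {
	srv := newStubAPI()
	defer srv.Close()
	body, err := fetchStatus(srv.URL)
	if err != nil {
		panic(err)
	}
	fmt.Println(body) // {"status":"ok"}
}
```

Because the request goes through a real `net/http` round trip, the test exercises serialization, headers, and status handling — exactly the behavior a transport mock would hide.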
### 3. Coverage Strategy
**Leaf Types** (self-contained):
- **Target**: 100% unit test coverage
- **Why**: Core logic must be bulletproof
**Orchestrating Types** (coordinate others):
- **Target**: Integration test coverage
- **Why**: Test seams between components
**Goal**: Most logic in leaf types (easier to test and maintain)
---
## Table-Driven Tests
### When to Use
- Each test case has **cyclomatic complexity = 1**
- No conditionals inside t.Run()
- Simple, focused testing scenarios
### ❌ Anti-Pattern: wantErr bool
**DO NOT** use `wantErr bool` pattern - it violates complexity = 1 rule:
```go
// ❌ BAD - Has conditionals (complexity > 1)
func TestNewUserID(t *testing.T) {
tests := []struct {
name string
input string
want UserID
wantErr bool // ❌ Anti-pattern
}{
{name: "valid ID", input: "usr_123", want: UserID("usr_123"), wantErr: false},
{name: "empty ID", input: "", wantErr: true},
}
for _, tt := range tests {
t.Run(tt.name, func(t *testing.T) {
got, err := NewUserID(tt.input)
if tt.wantErr { // ❌ Conditional
assert.Error(t, err)
return
}
assert.NoError(t, err)
assert.Equal(t, tt.want, got)
})
}
}
```
### ✅ Correct Pattern: Separate Functions
**Always separate success and error cases:**
```go
// ✅ Success cases - Complexity = 1
func TestNewUserID_Success(t *testing.T) {
tests := []struct {
name string
input string
want UserID
}{
{name: "valid ID", input: "usr_123", want: UserID("usr_123")},
{name: "with numbers", input: "usr_456", want: UserID("usr_456")},
}
for _, tt := range tests {
t.Run(tt.name, func(t *testing.T) {
got, err := NewUserID(tt.input)
require.NoError(t, err) // ✅ No conditionals
assert.Equal(t, tt.want, got)
})
}
}
// ✅ Error cases - Complexity = 1
func TestNewUserID_Error(t *testing.T) {
tests := []struct {
name string
input string
}{
{name: "empty ID", input: ""},
{name: "whitespace only", input: " "},
}
for _, tt := range tests {
t.Run(tt.name, func(t *testing.T) {
_, err := NewUserID(tt.input)
assert.Error(t, err) // ✅ No conditionals
})
}
}
```
### Critical Rule: Named Struct Fields
**ALWAYS use named struct fields** - Linter reorders fields, breaking unnamed initialization:
```go
// ❌ BAD - Breaks when linter reorders fields
tests := []struct {
name string
input int
want string
}{
{"test1", 42, "result"}, // Will break
}
// ✅ GOOD - Works regardless of field order
tests := []struct {
name string
input int
want string
}{
{name: "test1", input: 42, want: "result"}, // Always works
}
```
---
## Testify Suites
### When to Use
ONLY for complex test infrastructure setup:
- Mock HTTP servers
- Database connections
- OpenTelemetry testing setup
- Temporary files/directories needing cleanup
- Shared expensive setup/teardown
### When NOT to Use
- Simple unit tests (use table-driven instead)
- Tests without complex setup
### Pattern
```go
package user_test
import (
"net/http/httptest"
"testing"
"github.com/stretchr/testify/suite"
)
type ServiceSuite struct {
suite.Suite
server *httptest.Server
svc *user.UserService
}
func (s *ServiceSuite) SetupSuite() {
s.server = httptest.NewServer(testHandler)
}
func (s *ServiceSuite) TearDownSuite() {
s.server.Close()
}
func (s *ServiceSuite) SetupTest() {
s.svc = user.NewUserService(s.server.URL)
}
func (s *ServiceSuite) TestCreateUser() {
err := s.svc.CreateUser(ctx, testUser)
s.NoError(err)
}
func TestServiceSuite(t *testing.T) {
suite.Run(t, new(ServiceSuite))
}
```
---
## Synchronization in Tests
### Never Use time.Sleep
Use channels or WaitGroups instead.
### Use Channels
```go
func TestAsyncOperation(t *testing.T) {
done := make(chan struct{})
go func() {
doAsyncWork()
close(done)
}()
select {
case <-done:
// Success
case <-time.After(1 * time.Second):
t.Fatal("timeout")
}
}
```
### Use WaitGroups
```go
func TestConcurrentOperations(t *testing.T) {
var wg sync.WaitGroup
results := make([]string, 10)
for i := 0; i < 10; i++ {
wg.Add(1)
go func(index int) {
defer wg.Done()
results[index] = doWork(index)
}(i)
}
wg.Wait()
// Assert on results
}
```
---
## Test Organization
### File Structure
```
user/
├── user.go
├── user_test.go # Tests for user.go (pkg_test)
├── service.go
├── service_test.go # Tests for service.go (pkg_test)
```
### Package Naming
```go
// ✅ External package - tests public API only
package user_test
import (
"testing"
"github.com/yourorg/project/user"
)
```
---
## Real Implementation Patterns
### In-Memory Repository
```go
package user
type InMemoryRepository struct {
mu sync.RWMutex
users map[UserID]User
}
func NewInMemoryRepository() *InMemoryRepository {
return &InMemoryRepository{
users: make(map[UserID]User),
}
}
func (r *InMemoryRepository) Save(ctx context.Context, u User) error {
r.mu.Lock()
defer r.mu.Unlock()
r.users[u.ID] = u
return nil
}
func (r *InMemoryRepository) Get(ctx context.Context, id UserID) (*User, error) {
r.mu.RLock()
defer r.mu.RUnlock()
u, ok := r.users[id]
if !ok {
return nil, ErrNotFound
}
return &u, nil
}
```
### Test Email Sender
```go
package user
import (
"bytes"
"fmt"
"sync"
)
type TestEmailer struct {
mu sync.Mutex
buffer bytes.Buffer
}
func NewTestEmailer() *TestEmailer {
return &TestEmailer{}
}
func (e *TestEmailer) Send(to Email, subject, body string) error {
e.mu.Lock()
defer e.mu.Unlock()
fmt.Fprintf(&e.buffer, "To: %s\nSubject: %s\n%s\n\n", to, subject, body)
return nil
}
func (e *TestEmailer) SentEmails() string {
e.mu.Lock()
defer e.mu.Unlock()
return e.buffer.String()
}
```
---
## Testable Examples (GoDoc Examples)
### When to Add
- Non-trivial types
- Types with validation
- Common usage patterns
### Pattern
```go
// ExampleNewUserID demonstrates basic usage.
func ExampleNewUserID() {
	id, _ := user.NewUserID("usr_123")
	fmt.Println(id)
	// Output: usr_123
}
// ExampleNewUserID_validation shows validation behavior.
func ExampleNewUserID_validation() {
	_, err := user.NewUserID("")
	fmt.Println(err != nil)
	// Output: true
}
```
---
## Testing Checklist
### Before Considering Tests Complete
**Structure:**
- [ ] Tests in `pkg_test` package
- [ ] Testing public API only
- [ ] Table-driven tests use named fields
- [ ] No conditionals in test cases
**Implementation:**
- [ ] Using real implementations, not mocks
- [ ] No time.Sleep (using channels/waitgroups)
- [ ] Testify suites only for complex setup
**Coverage:**
- [ ] Leaf types: 100% unit test coverage
- [ ] Orchestrating types: Integration tests
- [ ] Happy path, edge cases, error cases covered
---
## Summary
**The Golden Rule**: Cyclomatic complexity = 1 in all test cases
**Test Structure Choices:**
- **Table-driven tests**: Simple, focused scenarios
- **Testify suites**: Complex infrastructure setup only
**Test Philosophy:**
- Test only public API (`pkg_test` package)
- Use real implementations, not mocks
- Leaf types: 100% coverage
- Orchestrating types: Integration tests
**Common Pitfalls to Avoid:**
- ❌ Testing private methods
- ❌ Heavy mocking
- ❌ time.Sleep in tests
- ❌ Conditionals in test cases
- ❌ Unnamed struct fields in table tests
---
# Example Files - Reusable Testing Patterns
The following example files contain **transferable patterns** that apply to many scenarios, not just the specific technologies shown. Claude should read these files based on the **pattern needed**, not the specific technology mentioned.
## Pattern 1: In-Memory Test Harness (Level 1)
**File**: `examples/nats-in-memory.md`
**Pattern**: Using official test harnesses from Go libraries
**When to read:**
- Need to test with ANY service that provides an official Go test harness
- Testing message queues, databases, caches, or any service with in-memory test mode
- Want to avoid Docker but need realistic service behavior
**Applies to:**
- **NATS** (shown in example) - Message queue with official test harness
- **Redis** - `github.com/alicebob/miniredis` pure Go in-memory Redis
- **MongoDB** - `github.com/tryvium-travels/memongo` in-memory MongoDB
- **PostgreSQL** - `github.com/jackc/pgx/v5` with pgx mock
- **Any Go library with test package** - Check if dependency has `/test` package
**Key techniques to adapt:**
- Wrapping official harness with clean API
- Free port allocation for parallel tests
- Clean lifecycle management (Setup/Teardown)
- Thread-safe initialization
---
## Pattern 2: Binary Dependency Management (Level 2)
**File**: `examples/victoria-metrics.md`
**Pattern**: Download, manage, and run ANY standalone binary for testing
**When to read:**
- Need to test against ANY external binary executable
- No in-memory option available
- Want production-like testing without Docker
**Applies to:**
- **Victoria Metrics** (shown in example) - Metrics database
- **Prometheus** - Metrics and alerting
- **Grafana** - Dashboards and visualization
- **Any database binaries** - PostgreSQL, MySQL, Redis, etc.
- **Any CLI tools** - Language servers, formatters, linters
- **Custom binaries** - Your own services or third-party tools
**Key techniques to adapt:**
- OS/ARCH detection (`runtime.GOOS`, `runtime.GOARCH`)
- Thread-safe binary downloads with double-check locking
- Health check polling with retries
- Graceful shutdown with `sync.Once`
- Free port allocation
- Temp directory management
- Version management via environment variables
---
## Pattern 3: Mock Server with Generic DSL (Level 1)
**File**: `examples/jsonrpc-mock.md`
**Pattern**: Building generic mock servers with configurable responses using `AddMockResponse()`
**When to read:**
- Need to mock ANY request/response protocol
- Want readable test setup with DSL
- Testing clients that call external APIs
**Applies to:**
- **JSON-RPC** (shown in example) - RPC over HTTP
- **REST APIs** - Use same pattern with route matching
- **GraphQL** - Configure response per query
- **gRPC** - Adapt for protobuf messages
- **WebSocket** - Mock message responses
- **Any HTTP-based protocol** - SOAP, XML-RPC, custom protocols
**Key techniques to adapt:**
- Generic `AddMockResponse(identifier, response)` pattern
- Using `httptest.Server` as foundation
- Query/request tracking for assertions
- Configuration-based response mapping
- Thread-safe response storage
---
## Pattern 4: Bidirectional Streaming with Rich DSL (Level 1)
**File**: `examples/grpc-bufconn.md`
**Pattern**: In-memory bidirectional communication with rich client/server mocks
**When to read:**
- Testing ANY bidirectional streaming protocol
- Need full-duplex communication in tests
- Want to avoid network I/O
**Applies to:**
- **gRPC** (shown in example) - Uses bufconn for in-memory
- **WebSockets** - Adapt bufconn pattern
- **TCP streams** - Custom protocols over TCP
- **Unix sockets** - Inter-process communication
- **Any streaming protocol** - Server-Sent Events, HTTP/2 streams
**Key techniques to adapt:**
- `bufconn` for in-memory connections (gRPC-specific, but concept applies)
- Rich mock objects with helper methods
- Thread-safe state tracking with mutexes
- Assertion helpers (`ListenToStreamAndAssert()`)
- When testing **server** → mock the **clients**
- When testing **client** → mock the **server**
---
## Pattern 5: HTTP DSL and Builder Pattern (Level 1)
**File**: `examples/httptest-dsl.md`
**Pattern**: Building readable test infrastructure with DSL wrappers over stdlib
**When to read:**
- Want to wrap ANY test infrastructure with clean DSL
- Need fluent, readable test setup
- Building reusable test utilities
**Applies to:**
- **HTTP mocking** (shown in example) - httptest.Server wrapper
- **Any test infrastructure** - Databases, queues, file systems
- **Test data builders** - Fluent APIs for creating test data
- **Custom test harnesses** - Wrapping complex setups
**Key techniques to adapt:**
- Builder pattern with method chaining
- Fluent API design (`OnGET().RespondJSON()`)
- Separating configuration from execution
- Type-safe builders with Go generics
- Hiding complexity behind clean interfaces
---
## Pattern 6: Test Organization and Structure
**File**: `examples/test-organization.md`
**When to read:**
- Setting up test structure for new projects
- Adding build tags for integration tests
- Configuring CI/CD for tests
- Creating testutils package structure
**Universal patterns** (not technology-specific):
- File organization (`pkg_test` package naming)
- Build tags (`//go:build integration`)
- Makefile/Taskfile structure
- CI/CD configuration
- testutils package layout
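The build-tag convention above looks like this in practice — a one-line constraint at the top of the test file, matched by the `-tags` flag on the command line:

```go
//go:build integration

// Package user_test holds integration tests that only compile when the
// "integration" build tag is supplied:
//
//	go test -tags=integration ./...
package user_test
```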
---
## Pattern 7: Integration Test Workflows
**File**: `examples/integration-patterns.md`
**When to read:**
- Testing component interactions across package boundaries
- Need patterns for Service + Repository testing
- Testing workflows that span multiple components
**Universal patterns:**
- Pattern 1: Service + Repository with in-memory deps
- Pattern 2: Testing with real external services
- Pattern 3: Multi-component workflow with testify suites
- Dependency priority (in-memory > binary > test-containers)
---
## Pattern 8: System Test (Black Box)
**File**: `examples/system-patterns.md`
**When to read:**
- Writing black-box end-to-end tests
- Testing via CLI or API
- Need tests that work without Docker
**Universal patterns:**
- CLI testing with `exec.Command`
- API testing with HTTP client
- Dependency injection architecture
- Pure Go testing (no Docker)
---
## How Claude Should Use These Files
### Pattern-Based Reading Rules
**When user needs to test with external dependencies:**
1. **Has official Go test harness?** → Read `nats-in-memory.md`
- "Test with Redis/MongoDB/PostgreSQL/NATS"
- "Avoid Docker but need real service"
- Look for inspiration on wrapping official harnesses
2. **Need to download/run binary?** → Read `victoria-metrics.md`
- "Test with Prometheus/Grafana/any binary"
- "Manage binary dependencies"
- Learn OS/ARCH detection, download patterns, health checks
3. **Need to mock request/response?** → Read `jsonrpc-mock.md`
- "Mock REST/GraphQL/RPC/any HTTP API"
- "Build mock with DSL"
- Learn generic `AddMockResponse()` pattern
4. **Need bidirectional streaming?** → Read `grpc-bufconn.md`
- "Test gRPC/WebSocket/streaming protocol"
- "In-memory bidirectional communication"
- Learn rich mock patterns, thread-safe state
5. **Want readable test DSL?** → Read `httptest-dsl.md`
- "Build fluent test API"
- "Wrap test infrastructure"
- Learn builder pattern, method chaining
**When user asks about test structure:**
- "How should I organize tests?" → Read `test-organization.md`
- "How do I write integration tests?" → Read `integration-patterns.md`
- "How do I write system tests?" → Read `system-patterns.md`
### Key Principle
**Examples show specific technologies (NATS, Victoria Metrics, JSON-RPC) but teach transferable patterns.**
Claude should:
1. Identify the **pattern needed** (harness, binary, mock DSL, etc.)
2. Read the **example file** that demonstrates that pattern
3. **Adapt the techniques** to the user's specific technology
4. Use the example as a **template**, not a literal solution
### Default Behavior (No Example Needed)
For simple scenarios, use the core patterns in this file:
- Basic table-driven tests → Use patterns from this file
- Simple testify suites → Use patterns from this file
- Basic synchronization → Use patterns from this file
- Simple in-memory implementations → Use InMemoryRepository/TestEmailer from this file
**Read example files when patterns/techniques are needed, not just for specific tech.**
---
## Final Notes
This reference provides core testing principles and patterns. For detailed implementations and complete examples, refer to the example files listed above. Each example file is self-contained and can be read independently based on your testing needs.