gh-cskiro-claudex-meta-tools/skills/insight-skill-generator/examples/example-clustering-output.md
2025-11-29 18:16:56 +08:00

Example: Clustering Analysis Output

This example shows what the clustering phase produces when analyzing a project's insights.

Scenario

A project has been using the extract-explanatory-insights hook for 2 weeks, generating 12 insights across different categories.


Phase 1: Discovery Summary

Total Insights Found: 12
Date Range: 2025-11-01 to 2025-11-14
Unique Sessions: 8
Categories:

  • testing: 5 insights
  • hooks-and-events: 3 insights
  • architecture: 2 insights
  • performance: 2 insights

Preview:

  1. "Modern Testing Strategy with Testing Trophy" (testing, 2025-11-01)
  2. "Hook Deduplication Session Management" (hooks-and-events, 2025-11-03)
  3. "CPU Usage Prevention in Vitest" (testing, 2025-11-03)
  4. "BSD awk Compatibility in Hook Scripts" (hooks-and-events, 2025-11-05)
  5. "Semantic Query Priorities in React Testing Library" (testing, 2025-11-06)
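
The fields in this summary could be derived from parsed insight records roughly as follows. This is a minimal sketch, not the skill's actual implementation: the `Insight` record and the `summarize` function are hypothetical names, and the sample data is limited to the five previewed insights.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Insight:
    """Hypothetical record for one extracted insight."""
    title: str
    category: str
    created: date


def summarize(insights):
    """Return the total, the date range, and per-category counts."""
    dates = [i.created for i in insights]
    return {
        "total": len(insights),
        "date_range": (min(dates), max(dates)),
        "categories": Counter(i.category for i in insights),
    }


# Sample data: the five insights listed in the preview.
PREVIEW = [
    Insight("Modern Testing Strategy with Testing Trophy", "testing", date(2025, 11, 1)),
    Insight("Hook Deduplication Session Management", "hooks-and-events", date(2025, 11, 3)),
    Insight("CPU Usage Prevention in Vitest", "testing", date(2025, 11, 3)),
    Insight("BSD awk Compatibility in Hook Scripts", "hooks-and-events", date(2025, 11, 5)),
    Insight("Semantic Query Priorities in React Testing Library", "testing", date(2025, 11, 6)),
]

summary = summarize(PREVIEW)
for category, count in summary["categories"].most_common():
    print(f"{category}: {count} insights")
```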

Phase 2: Clustering Analysis

Cluster 1: Testing Strategy

Size: 3 insights
Similarity Score: 0.75 (high)
Recommended Complexity: Standard
Recommended Pattern: Validation

Insights:

  1. "Modern Testing Strategy with Testing Trophy"

    • Keywords: testing, integration, unit, e2e, trophy, kent-c-dodds
    • Category: testing
    • Date: 2025-11-01
    • Length: 156 lines
    • Has code examples: Yes
  2. "Semantic Query Priorities in React Testing Library"

    • Keywords: testing, react, semantic, query, getByRole, accessibility
    • Category: testing
    • Date: 2025-11-06
    • Length: 89 lines
    • Has code examples: Yes
  3. "What NOT to Test - Brittle Patterns"

    • Keywords: testing, avoid, brittle, implementation-details, user-behavior
    • Category: testing
    • Date: 2025-11-08
    • Length: 67 lines
    • Has code examples: No

Shared Keywords: testing (3), react (2), user (2), behavior (2), semantic (2)
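
The number after each shared keyword appears to be how many of the cluster's insights mention it. A minimal sketch of that count, using only the keyword lists shown for this cluster, follows; `shared_keywords` and its `min_count` parameter are hypothetical. Keyword lists alone reproduce only `testing (3)`, so the remaining entries presumably come from the insight bodies, which is an assumption this sketch does not model.

```python
from collections import Counter

# Keyword sets copied from the three insights in this cluster.
cluster_keywords = [
    {"testing", "integration", "unit", "e2e", "trophy", "kent-c-dodds"},
    {"testing", "react", "semantic", "query", "getByRole", "accessibility"},
    {"testing", "avoid", "brittle", "implementation-details", "user-behavior"},
]


def shared_keywords(keyword_sets, min_count=2):
    """Count how many insights use each keyword; keep those used by several."""
    counts = Counter(kw for kws in keyword_sets for kw in kws)
    return {kw: n for kw, n in counts.items() if n >= min_count}


shared = shared_keywords(cluster_keywords)
print(shared)
```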

Cluster Characteristics:

  • All in same category (testing)
  • Temporal span: 7 days
  • Common theme: User-focused testing approach
  • Total code examples: 5 blocks
  • Actionable items: 12
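
The temporal span is consistent with the difference between the earliest and latest insight dates in the cluster. A minimal sketch of that calculation, with `temporal_span_days` as a hypothetical helper name:

```python
from datetime import date


def temporal_span_days(dates):
    """Days between the earliest and latest insight in a cluster."""
    return (max(dates) - min(dates)).days


# Dates of the three insights in Cluster 1.
cluster_1_dates = [date(2025, 11, 1), date(2025, 11, 6), date(2025, 11, 8)]

span = temporal_span_days(cluster_1_dates)
print(f"Temporal span: {span} days")  # prints "Temporal span: 7 days"
```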

Suggested Skill Name: "user-focused-testing-guide"

Suggested Description: "Use PROACTIVELY when writing tests to ensure user-centric testing strategy following Testing Trophy methodology and React Testing Library best practices"

Skill Structure Recommendation:

SKILL.md sections:
- Overview (Testing Trophy philosophy)
- Phase 1: Query Selection (semantic queries)
- Phase 2: Test Writing (user workflows)
- Phase 3: Avoiding Brittle Tests
- Important Reminders (what NOT to test)
- Examples (from code blocks)

Cluster 2: Hook Development

Size: 2 insights
Similarity Score: 0.68 (medium-high)
Recommended Complexity: Minimal to Standard
Recommended Pattern: Phase-based

Insights:

  1. "Hook Deduplication Session Management"

    • Keywords: hook, deduplication, session, hash, state
    • Category: hooks-and-events
    • Date: 2025-11-03
    • Length: 124 lines
    • Has code examples: Yes (bash)
  2. "BSD awk Compatibility in Hook Scripts"

    • Keywords: hook, awk, bsd, macos, compatibility, portable
    • Category: hooks-and-events
    • Date: 2025-11-05
    • Length: 78 lines
    • Has code examples: Yes (bash)

Shared Keywords: hook (2), bash (2), script (2), compatibility (2)

Cluster Characteristics:

  • Same category (hooks-and-events)
  • Temporal span: 2 days (high proximity)
  • Common theme: Hook script development best practices
  • Total code examples: 4 blocks (all bash)
  • Actionable items: 8

Suggested Skill Name: "hook-script-development-guide"

Suggested Description: "Guides Claude Code hook script development with deduplication patterns and cross-platform compatibility best practices for macOS/Linux environments"

Skill Structure Recommendation:

SKILL.md sections:
- Overview (Hook development challenges)
- Phase 1: Deduplication Strategy (session vs content)
- Phase 2: Cross-Platform Compatibility (BSD awk)
- Phase 3: Testing and Validation
- Examples (bash scripts)
- Templates (hook-template.sh)

Cluster 3: Performance Optimization

Size: 2 insights
Similarity Score: 0.62 (medium)
Recommended Complexity: Minimal
Recommended Pattern: Phase-based

Insights:

  1. "CPU Usage Prevention in Vitest"

    • Keywords: vitest, cpu, performance, threads, watch-mode
    • Category: testing
    • Date: 2025-11-03
    • Length: 45 lines
    • Has code examples: Yes (npm scripts)
  2. "Bundle Size Optimization"

    • Keywords: bundle, size, webpack, optimization, tree-shaking
    • Category: performance
    • Date: 2025-11-09
    • Length: 92 lines
    • Has code examples: Yes (webpack config)

Shared Keywords: performance (2), optimization (2), build (2)

Cluster Characteristics:

  • Different categories (testing, performance)
  • Temporal span: 6 days
  • Common theme: Build/test performance
  • Total code examples: 3 blocks
  • Actionable items: 6

Suggested Skill Name: "build-performance-optimizer"

Suggested Description: "Analyzes build and test performance issues and generates optimization recommendations for Vitest test runs and webpack bundle sizes"

Skill Structure Recommendation:

SKILL.md sections:
- Overview (Performance impact on DX)
- Phase 1: Test Performance Analysis
- Phase 2: Bundle Size Analysis
- Phase 3: Optimization Implementation
- Important Reminders (check before test, monitor bundle)
- Examples (npm scripts, webpack config)

Standalone Insights

Standalone 1: "Hook State Management Patterns"

Quality Score: 0.85 (high - qualifies for standalone skill)
Category: hooks-and-events
Date: 2025-11-07
Length: 134 lines
Code Examples: Yes (3 blocks)

Why Standalone:

  • Doesn't cluster with other hook insights (different focus)
  • High quality with comprehensive coverage
  • Self-contained topic (state management)
  • Multiple actionable patterns

Suggested Skill Name: "hook-state-manager"

Suggested Description: "Automates state management setup for Claude Code hooks with persistent storage, cleanup, and safe concurrency patterns"


Standalone 2: "Architecture Decision Records"

Quality Score: 0.82 (high - qualifies for standalone skill)
Category: architecture
Date: 2025-11-12
Length: 156 lines
Code Examples: Yes (template)

Why Standalone:

  • Unique topic (no other architecture insights)
  • High quality with complete template
  • Valuable for documentation
  • Industry best practice

Suggested Skill Name: "adr-documentation-helper"

Suggested Description: "Guides creation of Architecture Decision Records (ADRs) following industry standards with templates and integration with project documentation"


Excluded Insights

"Git Branch Naming Convention"

Quality Score: 0.42 (low)
Category: version-control
Reason for Exclusion: Too simple, covered by existing conventions, no unique value

"TypeScript Strict Mode Benefits"

Quality Score: 0.38 (low)
Category: typescript
Reason for Exclusion: Common knowledge, well-documented elsewhere, not actionable enough
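
The split between standalone and excluded insights is consistent with a simple quality gate. The sketch below assumes the `standalone_quality` value of 0.8 mentioned under Option 4 is the cutoff; `partition_by_quality` is a hypothetical helper, and the qualitative exclusion reasons given above are not modeled.

```python
# Assumed cutoff, taken from the "0.8 → 0.7" tuning option in this example.
STANDALONE_QUALITY = 0.8

# Quality scores reported for the insights that did not join a cluster.
unclustered = {
    "Hook State Management Patterns": 0.85,
    "Architecture Decision Records": 0.82,
    "Git Branch Naming Convention": 0.42,
    "TypeScript Strict Mode Benefits": 0.38,
}


def partition_by_quality(scores, threshold=STANDALONE_QUALITY):
    """Split unclustered insights into standalone candidates and exclusions."""
    standalone = [title for title, s in scores.items() if s >= threshold]
    excluded = [title for title, s in scores.items() if s < threshold]
    return standalone, excluded


standalone, excluded = partition_by_quality(unclustered)
print("Standalone:", standalone)
print("Excluded:", excluded)
```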


User Decision Points

At this stage, the skill would present the following options to the user:

Option 1: Generate All Recommended Skills (5 skills)

  • user-focused-testing-guide (Cluster 1)
  • hook-script-development-guide (Cluster 2)
  • build-performance-optimizer (Cluster 3)
  • hook-state-manager (Standalone 1)
  • adr-documentation-helper (Standalone 2)

Option 2: Select Specific Skills

  • User picks which clusters/standalones to convert

Option 3: Modify Clusters

  • Split large clusters
  • Merge small clusters
  • Recategorize insights
  • Adjust complexity levels

Option 4: Tune Thresholds and Retry

  • Increase cluster_minimum (0.6 → 0.7) for tighter clusters
  • Decrease standalone_quality (0.8 → 0.7) for more standalone skills
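
To illustrate what raising `cluster_minimum` would do, the sketch below filters the three reported similarity scores against the default and the tighter value. The `Thresholds` dataclass and `surviving_clusters` function are hypothetical; a real retry would presumably re-run clustering and could change cluster membership, which this sketch does not attempt.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Thresholds:
    """Hypothetical container for the two tunable values named above."""
    cluster_minimum: float = 0.6
    standalone_quality: float = 0.8


# Similarity scores reported for the three clusters in this example.
cluster_scores = {
    "Testing Strategy": 0.75,
    "Hook Development": 0.68,
    "Performance Optimization": 0.62,
}


def surviving_clusters(scores, thresholds):
    """Keep clusters whose similarity meets the minimum."""
    return [name for name, s in scores.items() if s >= thresholds.cluster_minimum]


default = Thresholds()
tighter = replace(default, cluster_minimum=0.7)

kept_default = surviving_clusters(cluster_scores, default)
kept_tighter = surviving_clusters(cluster_scores, tighter)
print(kept_default)
print(kept_tighter)
```

With the scores reported here, only the Testing Strategy cluster (0.75) clears the tighter 0.7 minimum.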

Proceeding to Phase 3

If the user selects "user-focused-testing-guide", the skill proceeds to Phase 3: Interactive Skill Design with the following proposal:

Skill Design Proposal:

  • Name: user-focused-testing-guide
  • Description: "Use PROACTIVELY when writing tests to ensure user-centric testing strategy following Testing Trophy methodology and React Testing Library best practices"
  • Complexity: Standard
  • Pattern: Validation
  • Structure:
    • SKILL.md with validation workflow
    • data/insights-reference.md with 3 source insights
    • examples/query-examples.md with semantic query patterns
    • templates/test-checklist.md with testing checklist

The user can then customize the proposal before generation begins.
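
The proposed structure maps onto a small directory tree. The sketch below creates empty placeholder files for it in a temporary directory; `scaffold_skill` is a hypothetical helper, and placing the files under a directory named after the skill is an assumption rather than something this example specifies.

```python
import tempfile
from pathlib import Path

# File paths taken from the Structure list in the proposal.
PROPOSED_FILES = [
    "SKILL.md",
    "data/insights-reference.md",
    "examples/query-examples.md",
    "templates/test-checklist.md",
]


def scaffold_skill(root, name, files=PROPOSED_FILES):
    """Create empty placeholder files for a skill under root/name."""
    skill_dir = Path(root) / name
    created = []
    for rel in files:
        path = skill_dir / rel
        path.parent.mkdir(parents=True, exist_ok=True)
        path.touch()
        created.append(path)
    return created


with tempfile.TemporaryDirectory() as tmp:
    created = scaffold_skill(tmp, "user-focused-testing-guide")
    relative = sorted(p.relative_to(tmp).as_posix() for p in created)

for rel in relative:
    print(rel)
```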


This example demonstrates:

  1. How clustering groups related insights
  2. What information is presented for each cluster
  3. How standalone insights are identified
  4. Why some insights are excluded
  5. What decisions users can make
  6. How the process flows into Phase 3