zhongwei/gh-linus-mcmanamey-unify-2-1-plugin

Files

Zhongwei Li 506a828b22 Initial commit

2025-11-30 08:37:55 +08:00

4.9 KiB

Raw Blame History

name: prime-claude-md description: Distill CLAUDE.md to essentials, moving detailed knowledge into skills for on-demand loading. Reduces context pollution by 80-90%. args: [--analyze-only] | [--backup] | [--apply]

Prime CLAUDE.md

Distill your CLAUDE.md file to only essential information, moving detailed knowledge into skills.

Problem

Large CLAUDE.md files (400+ lines) are loaded into context for EVERY conversation:

Wastes 5,000-15,000 tokens per conversation
Reduces space for actual work
Slows Claude's responses
80% of the content is rarely needed

Solution

Prime your CLAUDE.md:

Keep only critical architecture and coding standards
Move detailed knowledge into skills (loaded on-demand)
Reduce from 400+ lines to ~100 lines
Save 80-90% context per conversation

Usage

Analyze Current CLAUDE.md

/prime-claude-md --analyze-only

Shows what would be moved to skills without making changes.

Create Backup and Apply

/prime-claude-md --backup --apply

Backs up current CLAUDE.md to CLAUDE.md.backup
Creates supporting skills with detailed knowledge
Replaces CLAUDE.md with distilled version
Documents what was moved where

Just Apply (No Backup)

/prime-claude-md --apply

What Gets Distilled

Kept in CLAUDE.md (Essential)

Critical architecture concepts (high-level only)
Mandatory coding standards (line length, blank lines, decorators)
Quality gates (syntax check, linting, formatting)
Essential commands (2-3 most common)
References to skills for details

Moved to Skills (Detailed Knowledge)

project-architecture skill:

Detailed medallion architecture
Pipeline execution flow
Data source details
Azure integration specifics
Configuration management
Testing architecture

project-commands skill:

Complete make command reference
All development workflows
Azure operations
Database operations
Git operations
Troubleshooting commands

pyspark-patterns skill:

TableUtilities method documentation
ETL class pattern details
Logging standards
DataFrame operation patterns
JDBC connection patterns
Performance tips

Results

Before Priming:

CLAUDE.md: 420 lines
Context cost: ~12,000 tokens per conversation
Skills: 0
Knowledge: Always loaded

After Priming:

CLAUDE.md: ~100 lines (76% reduction)
Context cost: ~2,000 tokens per conversation (83% savings)
Skills: 3 specialized skills
Knowledge: Loaded only when needed

Example Distilled CLAUDE.md

# CLAUDE.md

**CRITICAL**: READ `.claude/rules/python_rules.md`

## Architecture
Medallion: Bronze → Silver → Gold
Core: `session_optimiser.py` (SparkOptimiser, NotebookLogger, TableUtilities)

## Essential Commands
python3 -m py_compile <file>  # Must run
ruff check python_files/       # Must pass
make run_all                   # Full pipeline

## Coding Standards
- Line length: 240 chars
- No blank lines in functions
- Use @synapse_error_print_handler
- Use logger (not print)

## Skills Available
- project-architecture: Detailed architecture
- project-commands: Complete command reference
- pyspark-patterns: PySpark best practices

Benefits

Faster conversations: Less context overhead
Better responses: More room for actual work
On-demand knowledge: Load only what you need
Maintainable: Easier to update focused skills
Reusable pattern: Apply to any repository

Applying to Other Repositories

This command is repository-agnostic. To use on another repo:

Run /prime-claude-md --analyze-only to see what you have
Command will identify:
- Architectural concepts
- Command references
- Coding standards
- Configuration details
Creates appropriate skills based on content
Run /prime-claude-md --apply when ready

Files Created

.claude/
├── CLAUDE.md                          # Distilled (100 lines)
├── CLAUDE.md.backup                   # Original (if --backup used)
└── skills/
    ├── project-architecture/
    │   └── skill.md                   # Architecture details
    ├── project-commands/
    │   └── skill.md                   # Command reference
    └── pyspark-patterns/              # (project-specific)
        └── skill.md                   # Code patterns

Philosophy

CLAUDE.md should answer: "What's special about this repo?"

Skills should answer: "How do I do X in detail?"

Task Execution

I will:

Read current CLAUDE.md (both project and global if exists)
Analyze content and categorize
Create distilled CLAUDE.md (essential only)
Create supporting skills with detailed knowledge
If --backup: Save CLAUDE.md.backup
If --apply: Replace CLAUDE.md with distilled version
Generate summary report of changes

Current Project: Unify Data Migration (PySpark/Azure Synapse)

Let me analyze your CLAUDE.md and create the distilled version with supporting skills.

4.9 KiB Raw Blame History