zhongwei/gh-rafaelcalleja-claude-market-place-plugins-claudekit-skills

Fork 0

Files

Zhongwei Li 6ec3196ecc Initial commit

2025-11-30 08:48:52 +08:00

7.0 KiB

Raw Blame History

Advanced Features

Extended thinking, prompt caching, checkpointing, and memory management in Claude Code.

Extended Thinking

Deep reasoning for complex problems.

Enable Extended Thinking

Global configuration:

claude config set thinking.enabled true
claude config set thinking.budget 15000

Project settings (.claude/settings.json):

{
  "thinking": {
    "enabled": true,
    "budget": 10000,
    "mode": "auto"
  }
}

Command-line flag:

claude --thinking "architect microservices system"

Thinking Modes

auto: Claude decides when to use extended thinking manual: User explicitly requests thinking disabled: No extended thinking

{
  "thinking": {
    "mode": "auto",
    "budget": 10000,
    "minComplexity": 0.7
  }
}

Budget Control

Set token budget for thinking:

{
  "thinking": {
    "budget": 10000,      // Max tokens for thinking
    "budgetPerRequest": 5000,  // Per-request limit
    "adaptive": true      // Adjust based on task complexity
  }
}

Best Use Cases

Architecture design
Complex algorithm development
System refactoring
Performance optimization
Security analysis
Bug investigation

Example

claude --thinking "Design a distributed caching system with:
- High availability
- Consistency guarantees
- Horizontal scalability
- Fault tolerance"

Prompt Caching

Reduce costs by caching repeated context.

Enable Caching

API usage:

const response = await client.messages.create({
  model: 'claude-sonnet-4-5-20250929',
  system: [
    {
      type: 'text',
      text: 'You are a coding assistant...',
      cache_control: { type: 'ephemeral' }
    }
  ],
  messages: [...]
});

CLI configuration:

{
  "caching": {
    "enabled": true,
    "ttl": 300,
    "maxSize": "100MB"
  }
}

Cache Strategy

What to cache:

Large codebases
Documentation
API specifications
System prompts
Project context

What not to cache:

User queries
Dynamic content
Temporary data
Session-specific info

Cache Control

// Cache large context
{
  type: 'text',
  text: largeCodebase,
  cache_control: { type: 'ephemeral' }
}

// Update without invalidating cache
{
  type: 'text',
  text: newUserQuery
  // No cache_control = not cached
}

Cost Savings

With caching:

First request: Full cost
Subsequent requests: ~90% discount on cached tokens
Cache TTL: 5 minutes

Example:

Without caching:
Request 1: 10,000 tokens @ $3/M = $0.03
Request 2: 10,000 tokens @ $3/M = $0.03
Total: $0.06

With caching (8,000 tokens cached):
Request 1: 10,000 tokens @ $3/M = $0.03
Request 2: 2,000 new + 8,000 cached @ $0.30/M = $0.0024
Total: $0.0324 (46% savings)

Checkpointing

Automatically track and rewind changes.

Enable Checkpointing

claude config set checkpointing.enabled true

Settings:

{
  "checkpointing": {
    "enabled": true,
    "autoSave": true,
    "interval": 300,
    "maxCheckpoints": 50
  }
}

View Checkpoints

# List checkpoints
claude checkpoint list

# View checkpoint details
claude checkpoint show checkpoint-123

Restore Checkpoint

# Restore to checkpoint
claude checkpoint restore checkpoint-123

# Restore to time
claude checkpoint restore --time "2025-11-06T10:00:00Z"

# Restore specific files
claude checkpoint restore checkpoint-123 --files src/main.js

Create Manual Checkpoint

# Create checkpoint with message
claude checkpoint create "before refactoring auth module"

# Create at important moments
claude checkpoint create "working state before experiment"

Checkpoint Strategies

Auto-save checkpoints:

Before major changes
After successful tests
Every N minutes
Before destructive operations

Manual checkpoints:

Before risky refactors
At working states
Before experiments
After milestones

Example Workflow

# Create checkpoint before risky change
claude checkpoint create "before performance optimization"

# Make changes
claude "optimize database queries for 10x performance"

# If something breaks
claude checkpoint restore "before performance optimization"

# Or continue with improvements
claude checkpoint create "performance optimization complete"

Memory Management

Control how Claude remembers context across sessions.

Memory Locations

global: Share memory across all projects project: Project-specific memory none: Disable memory

# Set memory location
claude config set memory.location project

# Enable memory
claude config set memory.enabled true

Configuration

{
  "memory": {
    "enabled": true,
    "location": "project",
    "ttl": 86400,
    "maxSize": "10MB",
    "autoSummarize": true
  }
}

Memory Operations

# View stored memories
claude memory list

# View specific memory
claude memory show memory-123

# Clear all memories
claude memory clear

# Clear old memories
claude memory clear --older-than 7d

# Clear project memories
claude memory clear --project

What Gets Remembered

Automatically:

Project structure
Coding patterns
Preferences
Common commands
File locations

Explicitly stored:

Important context
Design decisions
Architecture notes
Team conventions

Memory Best Practices

Project memory:

Good for project-specific context
Shares across team members
Persists in .claude/memory/
Commit to version control (optional)

Global memory:

Personal preferences
General knowledge
Common patterns
Cross-project learnings

Disable memory when:

Working with sensitive data
One-off tasks
Testing/experimentation
Troubleshooting

Example

# Remember project architecture
claude "Remember: This project uses Clean Architecture with:
- Domain layer (core business logic)
- Application layer (use cases)
- Infrastructure layer (external dependencies)
- Presentation layer (API/UI)"

# Claude will recall this in future sessions
claude "Add a new user registration feature"
# Claude: "I'll implement this following the Clean Architecture..."

Context Windows

Manage large context effectively.

Maximum Context

Model context limits:

Claude Sonnet: 200k tokens
Claude Opus: 200k tokens
Claude Haiku: 200k tokens

Context Management

{
  "context": {
    "maxTokens": 200000,
    "autoTruncate": true,
    "prioritize": ["recent", "relevant"],
    "summarizeLong": true
  }
}

Strategies

Summarization:

Auto-summarize old context
Keep summaries of large files
Compress conversation history

Prioritization:

Recent messages first
Most relevant files
Explicit user priorities

Chunking:

Process large codebases in chunks
Incremental analysis
Parallel processing

7.0 KiB Raw Blame History

Advanced Features

Extended Thinking

Enable Extended Thinking

Thinking Modes

Budget Control

Best Use Cases

Example

Prompt Caching

Enable Caching

Cache Strategy

Cache Control

Cost Savings

Checkpointing

Enable Checkpointing

View Checkpoints

Restore Checkpoint

Create Manual Checkpoint

Checkpoint Strategies

Example Workflow

Memory Management

Memory Locations

Configuration

Memory Operations

What Gets Remembered

Memory Best Practices

Example

Context Windows

Maximum Context

Context Management

Strategies

See Also

7.0 KiB

Raw Blame History