Initial commit

2025-11-30 09:07:22 +08:00
commit fab98d059b
179 changed files with 46209 additions and 0 deletions
--- a/skills/methodology-bootstrapping/templates/experiment-template.md
+++ b/skills/methodology-bootstrapping/templates/experiment-template.md
@@ -0,0 +1,250 @@
+# Experiment Template
+
+Use this template to structure your methodology development experiment.
+
+## Directory Structure
+
+```
+my-experiment/
+├── README.md                    # Overview and objectives
+├── ITERATION-PROMPTS.md         # Iteration execution guide
+├── iteration-0.md               # Baseline iteration
+├── iteration-1.md               # First iteration
+├── iteration-N.md               # Additional iterations
+├── results.md                   # Final results and knowledge
+├── knowledge/                   # Extracted knowledge
+│   ├── INDEX.md                 # Knowledge catalog
+│   ├── patterns/                # Domain patterns
+│   ├── principles/              # Universal principles
+│   ├── templates/               # Code templates
+│   └── best-practices/          # Context-specific practices
+├── agents/                      # Specialized agents (if needed)
+├── meta-agents/                 # Meta-agent definitions
+└── data/                        # Analysis data and artifacts
+```
+
+## README.md Structure
+
+```markdown
+# Experiment Name
+
+**Status**: 🔄 In Progress | ✅ Converged
+**Domain**: [testing|ci-cd|observability|etc.]
+**Iterations**: N
+**Duration**: X hours
+
+## Objectives
+
+### Instance Objective (Agent Layer)
+[Domain-specific goal, e.g., "Reach 80% test coverage"]
+
+### Meta Objective (Meta-Agent Layer)
+[Methodology goal, e.g., "Develop transferable testing methodology"]
+
+## Approach
+
+1. **Observe**: [How you'll collect data]
+2. **Codify**: [How you'll extract patterns]
+3. **Automate**: [How you'll enforce methodology]
+
+## Success Criteria
+
+- V_instance(s) ≥ 0.80
+- V_meta(s) ≥ 0.80
+- System stable (M_n == M_{n-1}, A_n == A_{n-1})
+
+## Timeline
+
+| Iteration | Focus | Duration | Status |
+|-----------|-------|----------|--------|
+| 0 | Baseline | Xh | ✅ |
+| 1 | ... | Xh | 🔄 |
+
+## Results
+
+[Link to results.md when complete]
+```
+
+## Iteration File Structure
+
+```markdown
+# Iteration N: [Title]
+
+**Date**: YYYY-MM-DD
+**Duration**: X hours
+**Focus**: [Primary objective]
+
+## Objectives
+
+1. [Objective 1]
+2. [Objective 2]
+3. [Objective 3]
+
+## Execution
+
+### Observe Phase
+[Data collection activities]
+
+### Codify Phase
+[Pattern extraction activities]
+
+### Automate Phase
+[Tool/check creation activities]
+
+## Value Calculation
+
+### V_instance(s_n)
+- Component 1: 0.XX
+- Component 2: 0.XX
+- **Total**: 0.XX
+
+### V_meta(s_n)
+- Completeness: 0.XX
+- Effectiveness: 0.XX
+- Reusability: 0.XX
+- Validation: 0.XX
+- **Total**: 0.XX
+
+## System State
+
+- M_n: [unchanged|evolved]
+- A_n: [unchanged|new agents: ...]
+- Stable: [YES|NO]
+
+## Convergence Check
+
+- [ ] V_instance ≥ 0.80
+- [ ] V_meta ≥ 0.80
+- [ ] M_n == M_{n-1}
+- [ ] A_n == A_{n-1}
+- [ ] Objectives complete
+- [ ] ΔV < 0.02 for 2+ iterations
+
+**Status**: [NOT CONVERGED | CONVERGED]
+
+## Knowledge Extracted
+
+- Patterns: [list]
+- Principles: [list]
+- Templates: [list]
+
+## Next Iteration
+
+[If not converged, plan for next iteration]
+```
+
+## results.md Structure
+
+```markdown
+# Experiment Results
+
+**Status**: ✅ CONVERGED
+**Convergence Pattern**: [Standard Dual | Meta-Focused | Practical]
+**Final Iteration**: N
+**Total Duration**: X hours
+
+## Convergence State
+
+### Final Values
+- V_instance(s_N): 0.XX
+- V_meta(s_N): 0.XX
+
+### System State
+- M_N: [description]
+- A_N: [list of agents]
+- Iterations to convergence: N
+
+## Knowledge Output
+
+### Patterns (X total)
+1. [Pattern name](knowledge/patterns/pattern1.md)
+2. ...
+
+### Principles (X total)
+1. [Principle name](knowledge/principles/principle1.md)
+2. ...
+
+### Templates (X total)
+1. [Template name](knowledge/templates/template1.md)
+2. ...
+
+### Best Practices (X total)
+1. [Practice name](knowledge/best-practices/practice1.md)
+2. ...
+
+## Transferability Analysis
+
+**Estimated transferability**: XX%
+
+**What transfers**:
+- [Universal patterns]
+- [Core principles]
+- [General templates]
+
+**What needs adaptation**:
+- [Domain-specific items]
+- [Language-specific items]
+- [Platform-specific items]
+
+## Effectiveness Analysis
+
+**Speedup**: Xx vs ad-hoc
+**Time savings**: X hours saved per use
+**ROI**: X hours invested, Y hours saved
+
+## Lessons Learned
+
+### What Worked Well
+- [Success 1]
+- [Success 2]
+
+### Challenges
+- [Challenge 1]: [How addressed]
+- [Challenge 2]: [How addressed]
+
+### Future Improvements
+- [Improvement 1]
+- [Improvement 2]
+
+## Knowledge Extraction Instruction
+
+**For Claude Code**: When user asks "extract knowledge from this experiment", perform:
+
+1. **Read source files**:
+   - `knowledge/INDEX.md` - Inventory
+   - `knowledge/patterns/*.md` - Patterns
+   - `knowledge/principles/*.md` - Principles
+   - `knowledge/templates/*.md` - Templates
+   - `knowledge/best-practices/*.md` - Best practices
+
+2. **Create skill directory**:
+   ```bash
+   mkdir -p .claude/skills/[domain-name]/{reference,examples,templates,scripts,best-practices}
+   ```
+
+3. **Generate SKILL.md**:
+   - Extract description from INDEX.md summary
+   - Focus on "when to use" from patterns
+   - List allowed-tools based on domain
+   - Keywords from domain tags
+
+4. **Copy and consolidate files**:
+   - patterns/*.md → reference/patterns.md (consolidated)
+   - principles/*.md → reference/principles.md (consolidated)
+   - templates/*.md → templates/ (keep separate)
+   - best-practices/*.md → best-practices/ (keep separate)
+
+5. **Create examples**:
+   - Extract 1-2 worked examples from data/ or iteration files
+   - Show before/after, concrete steps
+
+6. **Make self-contained**:
+   - Remove references to experiments/
+   - Remove references to knowledge/
+   - Make all paths relative to skill directory
+
+7. **Validation**:
+   - Skill description matches "when to use"
+   - All internal links work
+   - No external dependencies
+```
--- a/skills/methodology-bootstrapping/templates/iteration-prompts-template.md
+++ b/skills/methodology-bootstrapping/templates/iteration-prompts-template.md
@@ -0,0 +1,297 @@
+# ITERATION-PROMPTS.md Template
+
+**Purpose**: Structure for agent iteration prompts in BAIME experiments
+**Usage**: Copy this template to `ITERATION-PROMPTS.md` in your experiment directory
+
+---
+
+## ITERATION-PROMPTS.md
+
+```markdown
+# Iteration Prompts for [Methodology Name]
+
+**Experiment**: [experiment-name]
+**Objective**: [Clear objective statement]
+**Target**: [Specific measurable goals]
+
+---
+
+## Iteration 0: Baseline & Observe
+
+**Objective**: Establish baseline metrics and identify core problems
+
+**Prompt**:
+```
+Analyze current [domain] state for [project]:
+
+1. Measure baseline metrics:
+   - [Metric 1]: Current value
+   - [Metric 2]: Current value
+   - [Metric 3]: Current value
+
+2. Identify problems:
+   - High frequency, high impact issues
+   - Pain points in current workflow
+   - Gaps in current approach
+
+3. Document observations:
+   - Time spent on tasks
+   - Quality indicators
+   - Blockers encountered
+
+4. Deliverables:
+   - baseline-metrics.md
+   - problems-identified.md
+   - iteration-0-summary.md
+
+Target time: 60 minutes
+```
+
+**Expected Output**:
+- Baseline metrics document
+- Prioritized problem list
+- Initial hypotheses for patterns
+
+---
+
+## Iteration 1: Core Patterns
+
+**Objective**: Create 2-3 core patterns addressing top problems
+
+**Prompt**:
+```
+Develop initial patterns for [domain]:
+
+1. Select top 3 problems from Iteration 0
+
+2. For each problem, create pattern:
+   - Problem statement
+   - Solution approach
+   - Code/process template
+   - Working example
+   - Time/quality metrics
+
+3. Apply patterns:
+   - Test on 2-3 real examples
+   - Measure time and quality
+   - Document results
+
+4. Calculate V_instance:
+   - [Metric 1]: Target vs Actual
+   - [Metric 2]: Target vs Actual
+   - Overall: V_instance = ?
+
+5. Deliverables:
+   - pattern-1.md
+   - pattern-2.md
+   - pattern-3.md
+   - iteration-1-results.md
+
+Target time: 90 minutes
+```
+
+**Expected Output**:
+- 2-3 documented patterns with examples
+- V_instance ≥ 0.50 (initial progress)
+- Identified gaps for Iteration 2
+
+---
+
+## Iteration 2: Expand & Automate
+
+**Objective**: Add 2-3 more patterns, create first automation tool
+
+**Prompt**:
+```
+Expand pattern library and begin automation:
+
+1. Refine Iteration 1 patterns based on usage
+
+2. Add 2-3 new patterns for remaining gaps
+
+3. Create automation tool:
+   - Identify repetitive task (done >3 times)
+   - Design tool to automate it
+   - Implement script/tool
+   - Measure speedup (Nx faster)
+   - Calculate ROI
+
+4. Calculate metrics:
+   - V_instance = ?
+   - V_meta = patterns_documented / patterns_needed
+
+5. Deliverables:
+   - pattern-4.md, pattern-5.md, pattern-6.md
+   - scripts/tool-name.sh
+   - tool-documentation.md
+   - iteration-2-results.md
+
+Target time: 90 minutes
+```
+
+**Expected Output**:
+- 5-6 total patterns
+- 1 automation tool (ROI > 3x)
+- V_instance ≥ 0.70, V_meta ≥ 0.60
+
+---
+
+## Iteration 3: Consolidate & Validate
+
+**Objective**: Reach V_instance ≥ 0.80, validate transferability
+
+**Prompt**:
+```
+Consolidate patterns and validate methodology:
+
+1. Review all patterns:
+   - Merge similar patterns
+   - Remove unused patterns
+   - Refine documentation
+
+2. Add final patterns if gaps exist (target: 6-8 total)
+
+3. Create additional automation tools if ROI > 3x
+
+4. Validate transferability:
+   - Can patterns apply to other projects?
+   - What needs adaptation?
+   - Estimate transferability %
+
+5. Calculate convergence:
+   - V_instance = ? (target ≥ 0.80)
+   - V_meta = ? (target ≥ 0.60)
+
+6. Deliverables:
+   - consolidated-patterns.md
+   - transferability-analysis.md
+   - iteration-3-results.md
+
+Target time: 90 minutes
+```
+
+**Expected Output**:
+- 6-8 consolidated patterns
+- V_instance ≥ 0.80 (target met)
+- Transferability score (≥ 80%)
+
+---
+
+## Iteration 4: Meta-Layer Convergence
+
+**Objective**: Reach V_meta ≥ 0.80, prepare for production
+
+**Prompt**:
+```
+Achieve meta-layer convergence:
+
+1. Complete methodology documentation:
+   - All patterns with examples
+   - All tools with usage guides
+   - Transferability guide for other languages/projects
+
+2. Measure automation effectiveness:
+   - Time manual vs with tools
+   - ROI for each tool
+   - Overall speedup
+
+3. Calculate final metrics:
+   - V_instance = ? (maintain ≥ 0.80)
+   - V_meta = 0.4×completeness + 0.3×transferability + 0.3×automation
+   - Check: V_meta ≥ 0.80?
+
+4. Create deliverables:
+   - complete-methodology.md (production-ready)
+   - tool-suite-documentation.md
+   - transferability-guide.md
+   - final-results.md
+
+5. If not converged: Identify remaining gaps and plan Iteration 5
+
+Target time: 90 minutes
+```
+
+**Expected Output**:
+- Complete, production-ready methodology
+- V_meta ≥ 0.80 (converged)
+- Dual convergence (V_instance ≥ 0.80, V_meta ≥ 0.80)
+
+---
+
+## Iteration 5+ (If Needed): Gap Closure
+
+**Objective**: Address remaining gaps to reach dual convergence
+
+**Prompt**:
+```
+Close remaining gaps:
+
+1. Analyze why convergence not reached:
+   - V_instance gaps: [specific metrics below target]
+   - V_meta gaps: [patterns missing, tools needed, transferability issues]
+
+2. Targeted improvements:
+   - Create patterns for specific gaps
+   - Improve automation for low ROI areas
+   - Enhance transferability documentation
+
+3. Re-measure:
+   - V_instance = ?
+   - V_meta = ?
+   - Check dual convergence
+
+4. Deliverables:
+   - gap-analysis.md
+   - additional-patterns.md (if needed)
+   - iteration-N-results.md
+
+Repeat until dual convergence achieved
+
+Target time: 60-90 minutes per iteration
+```
+
+**Stopping Criteria**:
+- V_instance ≥ 0.80 for 2 consecutive iterations
+- V_meta ≥ 0.80 for 2 consecutive iterations
+- No critical gaps remaining
+
+---
+
+## Customization Guide
+
+### For Different Domains
+
+**Testing Methodology**:
+- Replace metrics with: coverage%, pass rate, test count
+- Patterns: Test patterns (table-driven, fixture, etc.)
+- Tools: Coverage analyzer, test generator
+
+**CI/CD Pipeline**:
+- Replace metrics with: build time, failure rate, deployment frequency
+- Patterns: Pipeline stages, optimization patterns
+- Tools: Pipeline analyzer, config generator
+
+**Error Recovery**:
+- Replace metrics with: error classification coverage, MTTR, prevention rate
+- Patterns: Error categories, recovery patterns
+- Tools: Error classifier, diagnostic workflows
+
+### Adjusting Iteration Count
+
+**Rapid Convergence (3-4 iterations)**:
+- Strong Iteration 0 (2 hours)
+- Borrow patterns (70-90% reuse)
+- Focus on high-impact only
+
+**Standard Convergence (5-6 iterations)**:
+- Normal Iteration 0 (1 hour)
+- Create patterns from scratch
+- Comprehensive coverage
+
+---
+
+**Template Version**: 1.0
+**Source**: BAIME Framework
+**Usage**: Copy and customize for your experiment
+**Success Rate**: 100% across 13 experiments
+```