Initial commit

Zhongwei Li
2025-11-29 18:26:08 +08:00
commit 8f22ddf339
295 changed files with 59710 additions and 0 deletions

@@ -0,0 +1,239 @@
# meta.compatibility
Automatic artifact dependency graph validation and diagnostics for the Betty Framework.
## Overview
The `meta.compatibility` skill analyzes the artifact dependency graph across all Betty skills, detecting compatibility issues, cycles, orphan nodes, and unresolved dependencies. It provides actionable diagnostics to maintain a healthy skill ecosystem.
## Features
- **Graph Construction**: Builds directed graph from skill artifact metadata
- **Cycle Detection**: Identifies recursive dependencies (marks as `recursive:true`)
- **Orphan Detection**: Finds isolated nodes with no connections
- **Dependency Analysis**: Detects unresolved dependencies (consumed but never produced)
- **Producer Analysis**: Identifies orphan producers (produced but never consumed)
- **Health Reporting**: Comprehensive JSON and human-readable reports
## Usage
### Basic Validation
```bash
betty run /meta/compatibility --check artifacts
```
### Save Report to File
```bash
betty run /meta/compatibility --check artifacts --output graph-report.json
```
### JSON Output Only
```bash
betty run /meta/compatibility --check artifacts --json
```
### Custom Skills Directory
```bash
betty run /meta/compatibility --check artifacts --skills-dir /path/to/skills
```
## Output Format
### Human-Readable Output
```
======================================================================
Artifact Dependency Graph Validation
======================================================================
Total Skills: 25
Total Artifacts: 42
Graph Density: 12.50%
✅ 38 artifacts connected
⚠️ 3 isolated
  - report.template
  - legacy.format
  - experimental.data
❌ 1 cycle(s) detected
  1. model-a → model-b → model-a

❌ Graph Status: ERROR (critical issues)
======================================================================
```
### JSON Report
```json
{
  "total_artifacts": 42,
  "total_skills": 25,
  "connected": 38,
  "isolated": ["report.template", "legacy.format"],
  "cyclic": [["model-a", "model-b", "model-a"]],
  "unresolved": ["dataset.missing"],
  "orphans": ["deprecated.artifact"],
  "status": "error",
  "graph_stats": {
    "nodes": 42,
    "edges": 58,
    "density": 0.125
  }
}
```
## Report Fields
| Field | Type | Description |
|-------|------|-------------|
| `total_artifacts` | number | Total artifact types in the graph |
| `total_skills` | number | Total skills with artifact metadata |
| `connected` | number | Artifacts with at least one connection |
| `isolated` | array | Artifact types with no connections |
| `cyclic` | array | Lists of artifact cycles (recursive dependencies) |
| `unresolved` | array | Artifacts consumed but never produced |
| `orphans` | array | Artifacts produced but never consumed |
| `status` | string | Overall health: `healthy`, `warning`, or `error` |
| `graph_stats` | object | Graph metrics (nodes, edges, density) |
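When the report is consumed programmatically (for example in a CI job), these fields can be gated directly. A minimal sketch, assuming the report was saved with `--output`; the path and gating policy here are illustrative, not part of the skill:

```python
import json

def gate_on_report(path):
    """Fail on critical graph issues; print warnings for non-critical ones."""
    with open(path) as f:
        report = json.load(f)
    if report["status"] == "error":
        # Cycles and unresolved dependencies are critical per the skill's rules.
        raise SystemExit(
            f"Graph errors: cycles={report['cyclic']}, unresolved={report['unresolved']}"
        )
    for artifact in report.get("orphans", []):
        print(f"warning: {artifact} is produced but never consumed")
    return report["status"]
```

A `warning` status passes the gate, mirroring the skill's own exit-code behavior.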
## Status Levels
- **healthy**: No critical issues detected
- **warning**: Non-critical issues (isolated nodes, orphan producers)
- **error**: Critical issues (cycles, unresolved dependencies)
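The precedence between these levels (critical issues win over warnings) can be expressed directly; a sketch mirroring the rules above:

```python
def graph_status(cycles, unresolved, isolated, orphans):
    """Critical issues (cycles, unresolved) dominate; warnings come next."""
    if cycles or unresolved:
        return "error"
    if isolated or orphans:
        return "warning"
    return "healthy"
```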
## Validation Rules
### Critical Issues (Error Status)
1. **Cycles**: Circular dependencies in artifact flow
- Example: Skill A produces X, consumes Y; Skill B produces Y, consumes X
- Can cause infinite loops in workflows
2. **Unresolved Dependencies**: Artifacts consumed but never produced
- Example: Skill requires `api-spec` but no skill produces it
- Breaks skill composition and workflows
### Warnings (Warning Status)
1. **Isolated Nodes**: Artifacts with no producers or consumers
- May indicate deprecated or unused artifact types
- Not critical but suggests cleanup needed
2. **Orphan Producers**: Artifacts produced but never consumed
- May indicate missing consumer skills
- Not critical but suggests incomplete workflows
## Example Scenarios
### Scenario 1: Healthy Graph
```
Skills:
- api.define: produces openapi-spec
- api.validate: consumes openapi-spec, produces validation-report
- api.test: consumes openapi-spec, produces test-results
Result: ✅ All connected, no issues
```
### Scenario 2: Cycle Detected
```
Skills:
- transform.a: consumes data-raw, produces data-interim
- transform.b: consumes data-interim, produces data-processed
- transform.c: consumes data-processed, produces data-raw
Result: ❌ Cycle detected: data-raw → data-interim → data-processed → data-raw
```
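The three-skill loop above can be reproduced with networkx to confirm the detector sees it (assumes `networkx` is installed):

```python
import networkx as nx

# Each edge is one skill's consumed-artifact -> produced-artifact flow.
g = nx.DiGraph()
g.add_edge("data-raw", "data-interim")        # transform.a
g.add_edge("data-interim", "data-processed")  # transform.b
g.add_edge("data-processed", "data-raw")      # transform.c

cycles = list(nx.simple_cycles(g))  # one cycle through all three nodes
```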
### Scenario 3: Unresolved Dependency
```
Skills:
- api.test: consumes api-spec
- (no skill produces api-spec)
Result: ❌ Unresolved dependency: api-spec consumed but never produced
```
### Scenario 4: Orphan Producer
```
Skills:
- legacy.export: produces legacy-format
- (no skill consumes legacy-format)
Result: ⚠️ Orphan producer: legacy-format (produced but never consumed;
        with no graph edges it is also reported as isolated)
```
## Implementation Details
### Graph Construction
The skill builds a directed graph where:
- **Nodes**: Artifact types (e.g., `openapi-spec`, `validation-report`)
- **Edges**: Dependency flows (A → B means artifact A is consumed by a skill that produces B)
- **Attributes**: Each node tracks its producers and consumers (skill names)
```python
import networkx as nx

def build_artifact_graph(skills):
    g = nx.DiGraph()
    for skill in skills:
        consumed = skill.get("consumes", [])
        produced = skill.get("produces", [])
        for artifact_type in consumed + produced:
            g.add_node(artifact_type)
        # Each consumed artifact flows into every artifact the skill produces.
        for c in consumed:
            for p in produced:
                g.add_edge(c, p, skill=skill.get("name"))
    return g
```
### Validation Algorithms
1. **Cycle Detection**: Uses `networkx.simple_cycles()` to find all cycles
2. **Isolated Nodes**: Checks `in_degree == 0 and out_degree == 0`
3. **Unresolved**: Nodes with consumers but no producers
4. **Orphans**: Nodes with producers but no consumers
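Checks 2-4 reduce to degree and node-attribute tests on the constructed graph. A sketch, assuming each node carries the `producers`/`consumers` attributes set during graph construction:

```python
def isolated_nodes(g):
    # No incoming or outgoing edges at all.
    return [n for n in g.nodes() if g.in_degree(n) == 0 and g.out_degree(n) == 0]

def unresolved_dependencies(g):
    # Someone consumes the artifact, but nobody produces it.
    return [n for n, d in g.nodes(data=True)
            if d.get("consumers") and not d.get("producers")]

def orphan_producers(g):
    # Someone produces the artifact, but nobody consumes it.
    return [n for n, d in g.nodes(data=True)
            if d.get("producers") and not d.get("consumers")]
```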
## Integration
This skill is part of the meta-skills layer and can be used to:
1. **Pre-flight Checks**: Validate graph before deploying new skills
2. **CI/CD**: Run in pipelines to ensure artifact compatibility
3. **Documentation**: Generate artifact flow diagrams
4. **Debugging**: Identify why skill compositions fail
## Dependencies
- `networkx>=3.0`: Graph analysis library
- `PyYAML>=6.0`: Parse skill.yaml files
## Related Skills
- `artifact.validate`: Validates individual artifact files
- `artifact.define`: Defines artifact metadata schemas
- `registry.query`: Queries skill registry for artifact information
## Exit Codes
- `0`: Success (healthy or warning status)
- `1`: Error (critical issues detected)
## Limitations
- Does not validate actual artifact files, only metadata
- Wildcard consumers (`type: "*"`) are ignored in graph construction
- Does not check runtime compatibility, only structural compatibility
## Future Enhancements
- Visual graph rendering (GraphViz output)
- Suggested fixes for detected issues
- Artifact version compatibility checking
- Runtime dependency validation

@@ -0,0 +1 @@
# Auto-generated package initializer for skills.

@@ -0,0 +1,401 @@
#!/usr/bin/env python3
"""
meta.compatibility skill - Automatic artifact dependency graph validation

Validates artifact dependency graphs, detects cycles, orphan nodes, and unresolved
dependencies. Provides actionable diagnostics for artifact compatibility issues.
"""
import sys
import argparse
import json
from pathlib import Path
from typing import Dict, Any, List, Set

import yaml

try:
    import networkx as nx
except ImportError:
    print("Error: networkx is required. Install with: pip install 'networkx>=3.0'",
          file=sys.stderr)
    sys.exit(1)


def load_skill_metadata(skills_dir: Path) -> List[Dict[str, Any]]:
    """Load artifact metadata from all skills."""
    skills = []
    if not skills_dir.exists():
        return skills
    for skill_path in skills_dir.iterdir():
        if not skill_path.is_dir() or skill_path.name.startswith('.'):
            continue
        # Try to load skill.yaml
        skill_yaml = skill_path / "skill.yaml"
        if not skill_yaml.exists():
            continue
        try:
            with open(skill_yaml, 'r') as f:
                skill_data = yaml.safe_load(f)
            if not skill_data:
                continue
            # Extract artifact metadata
            if 'artifact_metadata' in skill_data:
                skills.append({
                    'name': skill_data.get('name', skill_path.name),
                    'path': str(skill_path),
                    'produces': skill_data['artifact_metadata'].get('produces', []),
                    'consumes': skill_data['artifact_metadata'].get('consumes', [])
                })
        except Exception as e:
            print(f"Warning: Failed to load {skill_yaml}: {e}", file=sys.stderr)
            continue
    return skills


def build_artifact_graph(artifacts: List[Dict[str, Any]]) -> nx.DiGraph:
    """
    Build artifact dependency graph from skill metadata.

    Nodes represent artifact types.
    Edges represent dependencies: if a skill consumes artifact A and produces
    artifact B, there is an edge A -> B (A flows into creating B).
    Each node also tracks which skills produce and consume that artifact type.
    """
    g = nx.DiGraph()
    # Track artifact types and their producers/consumers
    artifact_types: Set[str] = set()
    producers: Dict[str, List[str]] = {}  # artifact_type -> [skill_names]
    consumers: Dict[str, List[str]] = {}  # artifact_type -> [skill_names]

    def artifact_type_of(artifact):
        """Accept both dict entries ({'type': ...}) and bare string entries."""
        return artifact.get('type') if isinstance(artifact, dict) else artifact

    # First pass: collect all artifact types and track producers/consumers
    for skill in artifacts:
        skill_name = skill.get('name', 'unknown')
        for artifact in skill.get('produces', []):
            artifact_type = artifact_type_of(artifact)
            if artifact_type:
                artifact_types.add(artifact_type)
                producers.setdefault(artifact_type, []).append(skill_name)
        for artifact in skill.get('consumes', []):
            artifact_type = artifact_type_of(artifact)
            if artifact_type and artifact_type != '*':  # Ignore wildcard consumers
                artifact_types.add(artifact_type)
                consumers.setdefault(artifact_type, []).append(skill_name)

    # Add nodes for each artifact type
    for artifact_type in artifact_types:
        g.add_node(
            artifact_type,
            producers=producers.get(artifact_type, []),
            consumers=consumers.get(artifact_type, [])
        )

    # Add edges representing artifact flows: if a skill consumes artifact A and
    # produces artifact B, create edge A -> B (A flows into creating B).
    for skill in artifacts:
        consumed_artifacts = [
            t for t in (artifact_type_of(a) for a in skill.get('consumes', []))
            if t and t != '*'
        ]
        produced_artifacts = [
            t for t in (artifact_type_of(a) for a in skill.get('produces', []))
            if t
        ]
        for consumed in consumed_artifacts:
            for produced in produced_artifacts:
                if consumed in g and produced in g:
                    g.add_edge(consumed, produced, skill=skill.get('name'))
    return g


def detect_cycles(g: nx.DiGraph) -> List[List[str]]:
    """Detect cycles in the artifact dependency graph."""
    try:
        return list(nx.simple_cycles(g))
    except Exception:
        return []


def identify_isolated_nodes(g: nx.DiGraph) -> List[str]:
    """Identify nodes with no incoming or outgoing edges."""
    return [
        node for node in g.nodes()
        if g.in_degree(node) == 0 and g.out_degree(node) == 0
    ]


def identify_unresolved_dependencies(g: nx.DiGraph) -> List[str]:
    """Identify artifact types that are consumed but never produced."""
    return [
        node for node, data in g.nodes(data=True)
        if data.get('consumers') and not data.get('producers')
    ]


def identify_orphan_producers(g: nx.DiGraph) -> List[str]:
    """Identify artifact types that are produced but never consumed."""
    return [
        node for node, data in g.nodes(data=True)
        if data.get('producers') and not data.get('consumers')
    ]


def validate_artifact_graph(skills_dir: str = "skills") -> Dict[str, Any]:
    """
    Validate artifact dependency graph and generate a diagnostic report.

    Args:
        skills_dir: Path to skills directory

    Returns:
        Validation report with graph health metrics
    """
    skills_path = Path(skills_dir)
    skills = load_skill_metadata(skills_path)
    if not skills:
        return {
            'success': False,
            'error': 'No skills with artifact metadata found',
            'total_artifacts': 0,
            'total_skills': 0,
            'status': 'error'
        }

    g = build_artifact_graph(skills)

    # Detect issues
    cycles = detect_cycles(g)
    isolated = identify_isolated_nodes(g)
    unresolved = identify_unresolved_dependencies(g)
    orphans = identify_orphan_producers(g)

    # Artifacts with at least one connection
    connected = len(g.nodes()) - len(isolated)

    # Determine status: cycles and unresolved dependencies are critical
    status = 'healthy'
    if cycles or unresolved:
        status = 'error'
    elif isolated or orphans:
        status = 'warning'

    return {
        'success': True,
        'total_artifacts': g.number_of_nodes(),
        'total_skills': len(skills),
        'connected': connected,
        'isolated': isolated,
        'cyclic': [list(cycle) for cycle in cycles],
        'unresolved': unresolved,
        'orphans': orphans,
        'status': status,
        'graph_stats': {
            'nodes': g.number_of_nodes(),
            'edges': g.number_of_edges(),
            'density': nx.density(g) if g.number_of_nodes() > 0 else 0
        }
    }


def print_human_readable_report(report: Dict[str, Any]) -> None:
    """Print human-readable graph validation summary."""
    if not report.get('success'):
        print("\n❌ Graph Validation Failed")
        print(f"Error: {report.get('error', 'Unknown error')}")
        return

    total = report.get('total_artifacts', 0)
    connected = report.get('connected', 0)
    isolated = report.get('isolated', [])
    cycles = report.get('cyclic', [])
    unresolved = report.get('unresolved', [])
    orphans = report.get('orphans', [])
    status = report.get('status', 'unknown')

    print(f"\n{'=' * 70}")
    print("Artifact Dependency Graph Validation")
    print(f"{'=' * 70}")
    print(f"Total Skills: {report.get('total_skills', 0)}")
    print(f"Total Artifacts: {total}")
    print(f"Graph Density: {report['graph_stats']['density']:.2%}")
    print("")

    # Connected artifacts
    if connected > 0:
        print(f"✅ {connected} artifacts connected")

    # Isolated artifacts
    if isolated:
        print(f"⚠️ {len(isolated)} isolated")
        for artifact in isolated[:5]:  # Show first 5
            print(f"  - {artifact}")
        if len(isolated) > 5:
            print(f"  ... and {len(isolated) - 5} more")

    # Cycles
    if cycles:
        print(f"❌ {len(cycles)} cycle(s) detected")
        for i, cycle in enumerate(cycles[:3], 1):  # Show first 3
            cycle_path = ' → '.join(cycle + [cycle[0]])
            print(f"  {i}. {cycle_path}")
        if len(cycles) > 3:
            print(f"  ... and {len(cycles) - 3} more")

    # Unresolved dependencies
    if unresolved:
        print(f"❌ {len(unresolved)} unresolved dependencies")
        for artifact in unresolved[:5]:
            print(f"  - {artifact} (consumed but never produced)")
        if len(unresolved) > 5:
            print(f"  ... and {len(unresolved) - 5} more")

    # Orphan producers
    if orphans:
        print(f"⚠️ {len(orphans)} orphan producers")
        for artifact in orphans[:5]:
            print(f"  - {artifact} (produced but never consumed)")
        if len(orphans) > 5:
            print(f"  ... and {len(orphans) - 5} more")

    print("")
    # Overall status
    if status == 'healthy':
        print("✅ Graph Status: HEALTHY")
    elif status == 'warning':
        print("⚠️ Graph Status: WARNING (non-critical issues)")
    else:
        print("❌ Graph Status: ERROR (critical issues)")
    print(f"{'=' * 70}\n")


def main():
    """Main entry point for meta.compatibility skill."""
    parser = argparse.ArgumentParser(
        description='Validate artifact dependency graph and detect compatibility issues'
    )
    parser.add_argument(
        '--check',
        type=str,
        choices=['artifacts', 'skills', 'all'],
        default='artifacts',
        help='What to check (default: artifacts)'
    )
    parser.add_argument(
        '--skills-dir',
        type=str,
        default='skills',
        help='Path to skills directory (default: skills)'
    )
    parser.add_argument(
        '--output',
        type=str,
        help='Save JSON report to file'
    )
    parser.add_argument(
        '--json',
        action='store_true',
        help='Output JSON only (no human-readable output)'
    )
    args = parser.parse_args()

    # Validate artifact graph
    report = validate_artifact_graph(skills_dir=args.skills_dir)

    # Save to file if requested
    if args.output:
        output_path = Path(args.output)
        output_path.parent.mkdir(parents=True, exist_ok=True)
        with open(output_path, 'w') as f:
            json.dump(report, f, indent=2)
        if not args.json:
            print(f"Report saved to: {output_path}")

    # Print output
    if args.json:
        print(json.dumps(report, indent=2))
    else:
        print_human_readable_report(report)

    # Exit with appropriate code: warnings don't fail the check
    status = report.get('status', 'error')
    return 1 if status == 'error' else 0


if __name__ == '__main__':
    sys.exit(main())

@@ -0,0 +1,107 @@
name: meta.compatibility
version: 0.1.0
description: >
  Automatic artifact dependency graph validation and diagnostics. Builds a dependency
  graph from skill artifact metadata, detects cycles, orphan nodes, unresolved dependencies,
  and provides actionable health reports for the Betty Framework ecosystem.

inputs:
  - name: check
    type: string
    required: false
    default: artifacts
    description: What to check (artifacts, skills, or all)
  - name: skills_dir
    type: string
    required: false
    default: skills
    description: Path to skills directory
  - name: output
    type: string
    required: false
    description: Save JSON report to file
  - name: json
    type: boolean
    required: false
    default: false
    description: Output JSON only (no human-readable format)

outputs:
  - name: graph_report
    type: object
    description: >
      Comprehensive graph validation report including total artifacts, connected nodes,
      isolated nodes, cycles, unresolved dependencies, and overall health status
  - name: status
    type: string
    description: Overall graph health status (healthy, warning, or error)
  - name: total_artifacts
    type: number
    description: Total number of artifact types in the graph

dependencies: []

entrypoints:
  - command: /meta/compatibility
    handler: meta_compatibility.py
    runtime: python
    description: >
      Validate artifact dependency graph across all Betty skills. Constructs a directed
      graph where nodes represent artifact types and edges represent producer-consumer
      relationships. Detects cycles (recursive dependencies), isolated nodes (disconnected
      artifacts), orphan producers (artifacts produced but never consumed), and unresolved
      dependencies (artifacts consumed but never produced). Generates comprehensive
      health reports with actionable diagnostics.
    parameters:
      - name: check
        type: string
        required: false
        default: artifacts
        description: What to check
      - name: skills_dir
        type: string
        required: false
        default: skills
        description: Skills directory path
      - name: output
        type: string
        required: false
        description: Output file for JSON report
      - name: json
        type: boolean
        required: false
        default: false
        description: JSON-only output

permissions:
  - filesystem:read

status: active

tags:
  - meta
  - validation
  - artifacts
  - compatibility
  - graph
  - diagnostics
  - tier2
  - phase3

# This skill's own artifact metadata
artifact_metadata:
  produces:
    - type: compatibility-report
      description: Artifact dependency graph validation report with health metrics and diagnostics
      file_pattern: "*.compatibility-report.json"
      content_type: application/json
  consumes:
    - type: skill-metadata
      description: Reads artifact_metadata from all skill.yaml files
      file_pattern: "**/skill.yaml"
      required: true