Files

Zhongwei Li fab98d059b Initial commit

2025-11-30 09:07:22 +08:00

7.7 KiB

Raw Blame History

Automated Detection Rules

Version: 1.0 Purpose: Automated error pattern detection for validation Coverage: 95.4% of 1336 historical errors

Rule Engine

Architecture:

Session JSONL → Parser → Classifier → Pattern Matcher → Report

Components:

Parser: Extract tool calls, errors, timestamps
Classifier: Categorize errors by signature
Pattern Matcher: Apply recovery patterns
Reporter: Generate validation metrics

Detection Rules (13 Categories)

1. Build/Compilation Errors

Signature:

(syntax error|undefined:|cannot find|compilation failed)

Detection Logic:

def detect_build_error(tool_call):
    if tool_call.tool != "Bash":
        return False

    error_patterns = [
        r"syntax error",
        r"undefined:",
        r"cannot find",
        r"compilation failed"
    ]

    return any(re.search(p, tool_call.error, re.I)
               for p in error_patterns)

Frequency: 15.0% (200/1336) Priority: P1 (high impact)

2. Test Failures

Signature:

(FAIL|test.*failed|assertion.*failed)

Detection Logic:

def detect_test_failure(tool_call):
    if "test" not in tool_call.command.lower():
        return False

    return re.search(r"FAIL|failed", tool_call.output, re.I)

Frequency: 11.2% (150/1336) Priority: P2 (medium impact)

3. File Not Found

Signature:

(no such file|file not found|cannot open)

Detection Logic:

def detect_file_not_found(tool_call):
    patterns = [
        r"no such file",
        r"file not found",
        r"cannot open"
    ]

    return any(re.search(p, tool_call.error, re.I)
               for p in patterns)

Frequency: 18.7% (250/1336) Priority: P1 (preventable with validation)

Automation: validate-path.sh prevents 65.2%

4. File Size Exceeded

Signature:

(file too large|exceeds.*limit|size.*exceeded)

Detection Logic:

def detect_file_size_error(tool_call):
    if tool_call.tool not in ["Read", "Edit"]:
        return False

    return re.search(r"file too large|exceeds.*limit",
                     tool_call.error, re.I)

Frequency: 6.3% (84/1336) Priority: P1 (100% preventable)

Automation: check-file-size.sh prevents 100%

5. Write Before Read

Signature:

(must read before|file not read|write.*without.*read)

Detection Logic:

def detect_write_before_read(session):
    for i, call in enumerate(session.tool_calls):
        if call.tool in ["Edit", "Write"] and call.status == "error":
            # Check if file was read in previous N calls
            lookback = session.tool_calls[max(0, i-5):i]
            if not any(c.tool == "Read" and
                      c.file_path == call.file_path
                      for c in lookback):
                return True
    return False

Frequency: 5.2% (70/1336) Priority: P1 (100% preventable)

Automation: check-read-before-write.sh prevents 100%

6. Command Not Found

Signature:

(command not found|not recognized|no such command)

Detection Logic:

def detect_command_not_found(tool_call):
    if tool_call.tool != "Bash":
        return False

    return re.search(r"command not found", tool_call.error, re.I)

Frequency: 3.7% (50/1336) Priority: P3 (low automation value)

7. JSON Parsing Errors

Signature:

(invalid json|parse.*error|malformed json)

Detection Logic:

def detect_json_error(tool_call):
    return re.search(r"invalid json|parse.*error|malformed",
                     tool_call.error, re.I)

Frequency: 6.0% (80/1336) Priority: P2 (medium impact)

8. Request Interruption

Signature:

(interrupted|cancelled|aborted)

Detection Logic:

def detect_interruption(tool_call):
    return re.search(r"interrupted|cancelled|aborted",
                     tool_call.error, re.I)

Frequency: 2.2% (30/1336) Priority: P3 (user-initiated, not preventable)

9. MCP Server Errors

Signature:

(mcp.*error|server.*unavailable|connection.*refused)

Detection Logic:

def detect_mcp_error(tool_call):
    if not tool_call.tool.startswith("mcp__"):
        return False

    patterns = [
        r"server.*unavailable",
        r"connection.*refused",
        r"timeout"
    ]

    return any(re.search(p, tool_call.error, re.I)
               for p in patterns)

Frequency: 17.1% (228/1336) Priority: P2 (infrastructure)

10. Permission Denied

Signature:

(permission denied|access denied|forbidden)

Detection Logic:

def detect_permission_error(tool_call):
    return re.search(r"permission denied|access denied",
                     tool_call.error, re.I)

Frequency: 0.7% (10/1336) Priority: P3 (rare)

11. Empty Command String

Signature:

(empty command|no command|command required)

Detection Logic:

def detect_empty_command(tool_call):
    if tool_call.tool != "Bash":
        return False

    return not tool_call.parameters.get("command", "").strip()

Frequency: 1.1% (15/1336) Priority: P2 (easy to prevent)

12. Go Module Already Exists

Signature:

(module.*already exists|go.mod.*exists)

Detection Logic:

def detect_module_exists(tool_call):
    if tool_call.tool != "Bash":
        return False

    return (re.search(r"go mod init", tool_call.command) and
            re.search(r"already exists", tool_call.error, re.I))

Frequency: 0.4% (5/1336) Priority: P3 (rare)

13. String Not Found (Edit)

Signature:

(string not found|no match|pattern.*not found)

Detection Logic:

def detect_string_not_found(tool_call):
    if tool_call.tool != "Edit":
        return False

    return re.search(r"string not found|no match",
                     tool_call.error, re.I)

Frequency: 3.2% (43/1336) Priority: P1 (impacts workflow)

Composite Detection

Multi-stage errors:

def detect_cascading_error(session):
    """Detect errors that cause subsequent errors"""

    for i in range(len(session.tool_calls) - 1):
        current = session.tool_calls[i]
        next_call = session.tool_calls[i + 1]

        # File not found → Write → Edit chain
        if (detect_file_not_found(current) and
            next_call.tool == "Write" and
            current.file_path == next_call.file_path):
            return "file-not-found-recovery"

        # Build error → Fix → Rebuild chain
        if (detect_build_error(current) and
            next_call.tool in ["Edit", "Write"] and
            detect_build_error(session.tool_calls[i + 2])):
            return "build-error-incomplete-fix"

    return None

Validation Metrics

Overall Coverage:

Coverage = (Σ detected_errors) / total_errors
         = 1275 / 1336
         = 95.4%

Per-Category Accuracy:

True Positives: 1265 (99.2%)
False Positives: 10 (0.8%)
False Negatives: 61 (4.6%)

Precision: 99.2% Recall: 95.4% F1 Score: 97.3%

Usage

CLI:

# Classify all errors in session
meta-cc classify-errors session.jsonl

# Validate methodology against history
meta-cc validate \
  --methodology error-recovery \
  --history .claude/sessions/*.jsonl

MCP:

# Query by error category
query_tools(status="error")

# Get error context
query_context(error_signature="file-not-found")

Source: Bootstrap-003 Error Recovery (1336 errors analyzed) Status: Production-ready, 95.4% coverage, 97.3% F1 score

7.7 KiB Raw Blame History

Automated Detection Rules

Rule Engine

Detection Rules (13 Categories)

1. Build/Compilation Errors

2. Test Failures

3. File Not Found

4. File Size Exceeded

5. Write Before Read

6. Command Not Found

7. JSON Parsing Errors

8. Request Interruption

9. MCP Server Errors

10. Permission Denied

11. Empty Command String

12. Go Module Already Exists

13. String Not Found (Edit)

Composite Detection

Validation Metrics

Usage

7.7 KiB

Raw Blame History