zhongwei/gh-mvdmakesthings-skills-markdown-optimizer

Files

Zhongwei Li 735685a38f Initial commit

2025-11-30 08:41:42 +08:00

5.3 KiB

Raw Permalink Blame History

name, description, version

name	description	version
markdown-optimizer	Optimize markdown files for LLM consumption by adding YAML front-matter with metadata and TOC, normalizing heading hierarchy, removing noise and redundancy, converting verbose prose to structured formats, and identifying opportunities for Mermaid diagrams. Use when preparing technical documentation, notes, research, or knowledge base content for use as LLM reference material or in prompts.	1.0.0

Markdown Optimizer

Optimize markdown documents to maximize information density and LLM parsing efficiency while preserving semantic meaning.

Optimization Approaches

Automated Optimization (Recommended First Step)

Use the bundled script for initial optimization:

python scripts/optimize_markdown.py input.md output.md

The script automatically:

Adds YAML front-matter with title, token estimate, key concepts, and TOC
Normalizes heading hierarchy (ensures no skipped levels)
Removes noise (excessive horizontal rules, redundant empty lines)
Identifies diagram opportunities (process flows, relationships, architecture)
Generates structured metadata for LLM reference

Token estimates: The script adds metadata (~50-150 tokens) but identifies optimization opportunities that typically yield net reductions of 20-40% when manual optimizations are applied.

Manual Optimization (Apply After Automated)

After running the script, review and apply manual optimizations:

Review suggested_diagrams in front-matter - Create Mermaid diagrams for flagged sections
Convert verbose prose to structured formats - Use tables, definition lists where appropriate
Consolidate redundant examples - Merge similar code examples
Strip unnecessary emphasis - Remove excessive bold/italic that doesn't add semantic value

Consult references/optimization-patterns.md for detailed patterns and examples.

Workflow

For Single Documents

Run automated optimizer:

python scripts/optimize_markdown.py document.md document-optimized.md

Review output, especially:
- suggested_diagrams - sections flagged for visualization
- concepts - verify key topics are captured
- toc - ensure structure is logical
Apply manual optimizations using patterns from references/optimization-patterns.md
Create Mermaid diagrams for suggested sections
Verify all key information preserved

For Multiple Documents

When optimizing related documents, add relationship metadata:

---
title: "API Authentication"
related_docs:
  - api-reference.md
  - security-guide.md
dependencies:
  - python>=3.8
  - requests
---

This helps LLMs understand document connections when used as references.

Front-Matter Schema

The optimizer generates this structure:

---
title: "Document Title"              # From first H1 or filename
tokens: 1234                         # Estimated token count
optimized_for_llm: true              # Optimization flag
concepts:                            # Top 5 key concepts/topics
  - ConceptA
  - ConceptB
toc:                                 # Table of contents
  - Heading 1
    - Heading 2
      - Heading 3
suggested_diagrams:                  # Sections that could use visualization
  - section: "Section Name"
    type: flowchart                  # or: graph, architecture
---

Add manually when relevant:

related_docs: [file1.md, file2.md]  # Document relationships
dependencies: [tool1, tool2]         # Required tools/libraries
audience: developers                 # Target audience
status: published                    # Document status

Diagram Integration

When front-matter suggests diagrams, create them using Mermaid syntax. Common patterns:

Process Flow (type: flowchart)

flowchart TD
    A[Start] --> B[Step 1]
    B --> C{Decision?}
    C -->|Yes| D[Step 2]
    C -->|No| E[Alternative]

Relationships (type: graph)

graph LR
    A[Component A] --> B[Component B]
    A --> C[Component C]
    D[Component D] --> A

Architecture (type: architecture)

graph TB
    subgraph Frontend
        A[UI]
    end
    subgraph Backend
        B[API]
        C[Database]
    end
    A --> B
    B --> C

See references/optimization-patterns.md for comprehensive diagram patterns.

Best Practices

Do:

Run automated optimizer first to establish baseline
Review suggested diagrams - they often highlight unclear prose
Preserve all semantic information
Test that code examples still work
Verify cross-references remain intact

Don't:

Optimize creative writing or legal documents
Remove explanatory context that aids understanding
Over-compress at expense of clarity
Apply to already-concise technical specs

Quality Verification

After optimization, confirm:

Front-matter is complete and accurate
Key information preserved
Logical flow maintained
Token count reduced or value added
Document is more scannable

Integration with Other Skills

Optimized markdown works well as:

Reference material loaded by other skills (references/ directories)
Input to prompt construction
Knowledge base entries
Technical documentation ingested by LLMs

Store optimized documents in skill references/ directories when they provide domain knowledge that Claude should access on-demand.

5.3 KiB Raw Permalink Blame History