5.3 KiB
name, description, version
| name | description | version |
|---|---|---|
| markdown-optimizer | Optimize markdown files for LLM consumption by adding YAML front-matter with metadata and TOC, normalizing heading hierarchy, removing noise and redundancy, converting verbose prose to structured formats, and identifying opportunities for Mermaid diagrams. Use when preparing technical documentation, notes, research, or knowledge base content for use as LLM reference material or in prompts. | 1.0.0 |
Markdown Optimizer
Optimize markdown documents to maximize information density and LLM parsing efficiency while preserving semantic meaning.
Optimization Approaches
Automated Optimization (Recommended First Step)
Use the bundled script for initial optimization:
python scripts/optimize_markdown.py input.md output.md
The script automatically:
- Adds YAML front-matter with title, token estimate, key concepts, and TOC
- Normalizes heading hierarchy (ensures no skipped levels)
- Removes noise (excessive horizontal rules, redundant empty lines)
- Identifies diagram opportunities (process flows, relationships, architecture)
- Generates structured metadata for LLM reference
Token estimates: The script adds metadata (~50-150 tokens) but identifies optimization opportunities that typically yield net reductions of 20-40% when manual optimizations are applied.
Manual Optimization (Apply After Automated)
After running the script, review and apply manual optimizations:
- Review suggested_diagrams in front-matter - Create Mermaid diagrams for flagged sections
- Convert verbose prose to structured formats - Use tables, definition lists where appropriate
- Consolidate redundant examples - Merge similar code examples
- Strip unnecessary emphasis - Remove excessive bold/italic that doesn't add semantic value
Consult references/optimization-patterns.md for detailed patterns and examples.
Workflow
For Single Documents
-
Run automated optimizer:
python scripts/optimize_markdown.py document.md document-optimized.md -
Review output, especially:
suggested_diagrams- sections flagged for visualizationconcepts- verify key topics are capturedtoc- ensure structure is logical
-
Apply manual optimizations using patterns from references/optimization-patterns.md
-
Create Mermaid diagrams for suggested sections
-
Verify all key information preserved
For Multiple Documents
When optimizing related documents, add relationship metadata:
---
title: "API Authentication"
related_docs:
- api-reference.md
- security-guide.md
dependencies:
- python>=3.8
- requests
---
This helps LLMs understand document connections when used as references.
Front-Matter Schema
The optimizer generates this structure:
---
title: "Document Title" # From first H1 or filename
tokens: 1234 # Estimated token count
optimized_for_llm: true # Optimization flag
concepts: # Top 5 key concepts/topics
- ConceptA
- ConceptB
toc: # Table of contents
- Heading 1
- Heading 2
- Heading 3
suggested_diagrams: # Sections that could use visualization
- section: "Section Name"
type: flowchart # or: graph, architecture
---
Add manually when relevant:
related_docs: [file1.md, file2.md] # Document relationships
dependencies: [tool1, tool2] # Required tools/libraries
audience: developers # Target audience
status: published # Document status
Diagram Integration
When front-matter suggests diagrams, create them using Mermaid syntax. Common patterns:
Process Flow (type: flowchart)
flowchart TD
A[Start] --> B[Step 1]
B --> C{Decision?}
C -->|Yes| D[Step 2]
C -->|No| E[Alternative]
Relationships (type: graph)
graph LR
A[Component A] --> B[Component B]
A --> C[Component C]
D[Component D] --> A
Architecture (type: architecture)
graph TB
subgraph Frontend
A[UI]
end
subgraph Backend
B[API]
C[Database]
end
A --> B
B --> C
See references/optimization-patterns.md for comprehensive diagram patterns.
Best Practices
Do:
- Run automated optimizer first to establish baseline
- Review suggested diagrams - they often highlight unclear prose
- Preserve all semantic information
- Test that code examples still work
- Verify cross-references remain intact
Don't:
- Optimize creative writing or legal documents
- Remove explanatory context that aids understanding
- Over-compress at expense of clarity
- Apply to already-concise technical specs
Quality Verification
After optimization, confirm:
- Front-matter is complete and accurate
- Key information preserved
- Logical flow maintained
- Token count reduced or value added
- Document is more scannable
Integration with Other Skills
Optimized markdown works well as:
- Reference material loaded by other skills (
references/directories) - Input to prompt construction
- Knowledge base entries
- Technical documentation ingested by LLMs
Store optimized documents in skill references/ directories when they provide domain knowledge that Claude should access on-demand.