Initial commit

Zhongwei Li
2025-11-30 08:24:34 +08:00
commit 4ad7b3dd73
15 changed files with 4798 additions and 0 deletions


@@ -0,0 +1,12 @@
{
"name": "cloudflare-vectorize",
"description": "Build semantic search and RAG applications with Cloudflare Vectorize, a globally distributed vector database. Supports Workers AI and OpenAI embeddings, metadata filtering with 10 indexes, and namespace partitioning. Use when: creating vector indexes, querying embeddings, implementing semantic search or RAG, configuring metadata filters, or troubleshooting dimension mismatches, metadata index timing, insert vs upsert confusion, or filter syntax errors.",
"version": "1.0.0",
"author": {
"name": "Jeremy Dawes",
"email": "jeremy@jezweb.net"
},
"skills": [
"./"
]
}

README.md Normal file

@@ -0,0 +1,3 @@
# cloudflare-vectorize
Build semantic search and RAG applications with Cloudflare Vectorize, a globally distributed vector database. Supports Workers AI and OpenAI embeddings, metadata filtering with 10 indexes, and namespace partitioning. Use when: creating vector indexes, querying embeddings, implementing semantic search or RAG, configuring metadata filters, or troubleshooting dimension mismatches, metadata index timing, insert vs upsert confusion, or filter syntax errors.

SKILL.md Normal file

@@ -0,0 +1,387 @@
---
name: cloudflare-vectorize
description: |
Build semantic search with Cloudflare Vectorize V2 (Sept 2024 GA). Covers V2 breaking changes: async mutations,
5M vectors/index (was 200K), 31ms latency (was 549ms), returnMetadata enum, and V1 deprecation (Dec 2024).
Use when: migrating V1→V2, handling async mutations with mutationId, creating metadata indexes before insert,
or troubleshooting "returnMetadata must be 'all'", V2 timing issues, metadata index errors, dimension mismatches.
license: MIT
metadata:
keywords:
- vectorize v2
- vectorize ga september 2024
- vectorize breaking changes
- async mutations
- mutationId
- returnMetadata enum
- v1 deprecated december 2024
- metadata index before insert
- 5 million vectors
- 31ms latency
- topK 100
- range queries v2
- $gte $lte $in $nin
- wrangler 3.71.0
- vectorize migration
---
# Cloudflare Vectorize
Complete implementation guide for Cloudflare Vectorize - a globally distributed vector database for building semantic search, RAG (Retrieval Augmented Generation), and AI-powered applications with Cloudflare Workers.
**Status**: Production Ready ✅
**Last Updated**: 2025-10-21
**Dependencies**: cloudflare-worker-base (for Worker setup), cloudflare-workers-ai (for embeddings)
**Latest Versions**: wrangler@4.43.0, @cloudflare/workers-types@4.20251014.0
**Token Savings**: ~65%
**Errors Prevented**: 8
**Dev Time Saved**: ~3 hours
## What This Skill Provides
### Core Capabilities
- **Index Management**: Create, configure, and manage vector indexes
- **Vector Operations**: Insert, upsert, query, delete, and list vectors
- **Metadata Filtering**: Advanced filtering with 10 metadata indexes per index
- **Semantic Search**: Find similar vectors using cosine, euclidean, or dot-product metrics
- **RAG Patterns**: Complete retrieval-augmented generation workflows
- **Workers AI Integration**: Native embedding generation with @cf/baai/bge-base-en-v1.5
- **OpenAI Integration**: Support for text-embedding-3-small/large models
- **Document Processing**: Text chunking and batch ingestion pipelines
### Templates Included
1. **basic-search.ts** - Simple vector search with Workers AI
2. **rag-chat.ts** - Full RAG chatbot with context retrieval
3. **document-ingestion.ts** - Document chunking and embedding pipeline
4. **metadata-filtering.ts** - Advanced filtering patterns
---
## ⚠️ Vectorize V2 Breaking Changes (September 2024)
**IMPORTANT**: Vectorize V2 became GA in September 2024 with significant breaking changes.
### What Changed in V2
**Performance Improvements**:
- **Index capacity**: 200,000 → **5 million vectors** per index
- **Query latency**: 549ms → **31ms** median (18× faster)
- **TopK limit**: 20 → **100** results per query
- **Scale limits**: 100 → **50,000 indexes** per account
- **Namespace limits**: 100 → **50,000 namespaces** per index
**Breaking API Changes**:
1. **Async Mutations** - All mutations now asynchronous:
```typescript
// V2: Returns mutationId
const result = await env.VECTORIZE_INDEX.insert(vectors);
console.log(result.mutationId); // "xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx"
// Vector inserts/deletes may take a few seconds to be reflected
```
2. **returnMetadata Parameter** - Boolean → String enum:
```typescript
// ❌ V1 (deprecated)
{ returnMetadata: true }
// ✅ V2 (required)
{ returnMetadata: 'all' | 'indexed' | 'none' }
```
3. **Metadata Indexes Required Before Insert**:
- V2 requires metadata indexes created BEFORE vectors inserted
- Vectors added before metadata index won't be indexed
- Must re-upsert vectors after creating metadata index
**V1 Deprecation Timeline**:
- **December 2024**: Can no longer create V1 indexes
- **Existing V1 indexes**: Continue to work (other operations unaffected)
- **Migration**: Pass the `--deprecated-v1` flag to `wrangler vectorize` commands for V1 operations
**Wrangler Version Required**:
- **Minimum**: wrangler@3.71.0 for V2 commands
- **Recommended**: wrangler@4.43.0+ (latest)
### Check Mutation Status
```typescript
// Get index info to check the last mutation processed
const info = await env.VECTORIZE_INDEX.describe();
console.log(info.processedUpToMutation); // ID of the last processed mutation
console.log(info.processedUpToDatetime); // Timestamp processed up to
```
---
## Critical Setup Rules
### ⚠️ MUST DO BEFORE INSERTING VECTORS
```bash
# 1. Create the index with FIXED dimensions and metric
npx wrangler vectorize create my-index \
--dimensions=768 \
--metric=cosine
# 2. Create metadata indexes IMMEDIATELY (before inserting vectors!)
npx wrangler vectorize create-metadata-index my-index \
--property-name=category \
--type=string
npx wrangler vectorize create-metadata-index my-index \
--property-name=timestamp \
--type=number
```
**Why**: Metadata indexes MUST exist before vectors are inserted. Vectors added before a metadata index was created won't be filterable on that property.
### Index Configuration (Cannot Be Changed Later)
```bash
# Dimensions MUST match your embedding model output:
# - Workers AI @cf/baai/bge-base-en-v1.5: 768 dimensions
# - OpenAI text-embedding-3-small: 1536 dimensions
# - OpenAI text-embedding-3-large: 3072 dimensions
# Metrics determine similarity calculation:
# - cosine: Best for normalized embeddings (most common)
# - euclidean: Absolute distance between vectors
# - dot-product: For non-normalized vectors
```
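To keep `wrangler vectorize create` flags and runtime checks consistent, the model-to-dimensions mapping above can be captured in code. A minimal sketch (the lookup table mirrors the comments above; the helper name is illustrative):

```typescript
// Model → permanent index settings. Dimensions and metric cannot be
// changed after index creation, so validate against this table early.
const EMBEDDING_MODELS = {
  '@cf/baai/bge-base-en-v1.5': { dimensions: 768, metric: 'cosine' },
  'text-embedding-3-small': { dimensions: 1536, metric: 'cosine' },
  'text-embedding-3-large': { dimensions: 3072, metric: 'cosine' },
} as const;

function expectedDimensions(model: keyof typeof EMBEDDING_MODELS): number {
  return EMBEDDING_MODELS[model].dimensions;
}
```

Checking `embedding.data[0].length` against `expectedDimensions(model)` before upserting catches dimension mismatches at write time instead of at query time.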
## Wrangler Configuration
**wrangler.jsonc**:
```jsonc
{
"name": "my-vectorize-worker",
"main": "src/index.ts",
"compatibility_date": "2025-10-21",
"vectorize": [
{
"binding": "VECTORIZE_INDEX",
"index_name": "my-index"
}
],
"ai": {
"binding": "AI"
}
}
```
## TypeScript Types
```typescript
export interface Env {
VECTORIZE_INDEX: VectorizeIndex;
AI: Ai;
}
interface VectorizeVector {
id: string;
values: number[] | Float32Array | Float64Array;
namespace?: string;
metadata?: Record<string, string | number | boolean | string[]>;
}
interface VectorizeMatches {
matches: Array<{
id: string;
score: number;
values?: number[];
metadata?: Record<string, any>;
namespace?: string;
}>;
count: number;
}
```
## Metadata Filter Operators (V2)
Vectorize V2 supports advanced metadata filtering with range queries:
```typescript
// Equality (implicit $eq)
{ category: "docs" }
// Not equals
{ status: { $ne: "archived" } }
// In/Not in arrays
{ category: { $in: ["docs", "tutorials"] } }
{ category: { $nin: ["deprecated", "draft"] } }
// Range queries (numbers) - NEW in V2
{ timestamp: { $gte: 1704067200, $lt: 1735689600 } }
// Range queries (strings) - prefix searching
{ url: { $gte: "/docs/workers", $lt: "/docs/workersz" } }
// Nested metadata with dot notation
{ "author.id": "user123" }
// Multiple conditions (implicit AND)
{ category: "docs", language: "en", "metadata.published": true }
```
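Because filters are limited to 2048 bytes of compact JSON (see the limits below), it can help to serialize and size-check a filter before querying. A small sketch (the helper name is an assumption, not part of the Vectorize API):

```typescript
// Guard against the 2048-byte filter limit before querying.
// The limit applies to the compact (no-whitespace) JSON encoding.
function assertFilterSize(filter: object, maxBytes = 2048): string {
  const compact = JSON.stringify(filter);
  const bytes = new TextEncoder().encode(compact).length;
  if (bytes > maxBytes) {
    throw new Error(`Filter is ${bytes} bytes; max is ${maxBytes}`);
  }
  return compact;
}
```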
## Metadata Best Practices
### 1. Cardinality Considerations
**Low Cardinality (Good for $eq filters)**:
```typescript
// Few unique values - efficient filtering
metadata: {
category: "docs", // ~10 categories
language: "en", // ~5 languages
published: true // 2 values (boolean)
}
```
**High Cardinality (Avoid in range queries)**:
```typescript
// Many unique values - avoid large range scans
metadata: {
user_id: "uuid-v4...", // Millions of unique values
timestamp_ms: 1704067200123 // Use seconds instead
}
```
### 2. Metadata Limits
- **Max 10 metadata indexes** per Vectorize index
- **Max 10 KiB metadata** per vector
- **String indexes**: First 64 bytes (UTF-8)
- **Number indexes**: Float64 precision
- **Filter size**: Max 2048 bytes (compact JSON)
### 3. Key Restrictions
```typescript
// ❌ INVALID metadata keys
metadata: {
"": "value", // Empty key
"user.name": "John", // Contains dot (reserved for nesting)
"$admin": true, // Starts with $
"key\"with\"quotes": 1 // Contains quotes
}
// ✅ VALID metadata keys
metadata: {
"user_name": "John",
"isAdmin": true,
"nested": { "allowed": true } // Access as "nested.allowed" in filters
}
```
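The key restrictions above can be enforced before upserting, so invalid metadata fails locally instead of at the API. A minimal validator sketch:

```typescript
// Validate metadata keys against Vectorize's restrictions:
// non-empty, no dots (reserved for nesting), no quotes, no leading $.
function isValidMetadataKey(key: string): boolean {
  if (key.length === 0) return false;    // empty key
  if (key.startsWith('$')) return false; // $ prefix reserved
  if (key.includes('.')) return false;   // dot reserved for nested access
  if (key.includes('"')) return false;   // quotes not allowed
  return true;
}
```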
## Common Errors & Solutions
### Error 1: Metadata Index Created After Vectors Inserted
```
Problem: Filtering doesn't work on existing vectors
Solution: Delete and re-insert vectors OR create metadata indexes BEFORE inserting
```
### Error 2: Dimension Mismatch
```
Problem: "Vector dimensions do not match index configuration"
Solution: Ensure embedding model output matches index dimensions:
- Workers AI bge-base: 768
- OpenAI small: 1536
- OpenAI large: 3072
```
### Error 3: Invalid Metadata Keys
```
Problem: "Invalid metadata key"
Solution: Keys cannot:
- Be empty
- Contain . (dot)
- Contain " (quote)
- Start with $ (dollar sign)
```
### Error 4: Filter Too Large
```
Problem: "Filter exceeds 2048 bytes"
Solution: Simplify filter or split into multiple queries
```
### Error 5: Range Query on High Cardinality
```
Problem: Slow queries or reduced accuracy
Solution: Use lower cardinality fields for range queries, or use seconds instead of milliseconds for timestamps
```
### Error 6: Insert vs Upsert Confusion
```
Problem: Updates not reflecting in index
Solution: Use upsert() to overwrite existing vectors, not insert()
```
### Error 7: Missing Bindings
```
Problem: "VECTORIZE_INDEX is not defined"
Solution: Add a "vectorize" binding to wrangler.jsonc ([[vectorize]] in wrangler.toml)
```
### Error 8: Namespace vs Metadata Confusion
```
Problem: Unclear when to use namespace vs metadata filtering
Solution:
- Namespace: Partition key, applied BEFORE metadata filters
- Metadata: Flexible key-value filtering within namespace
```
### Error 9: V2 Async Mutation Timing (NEW in V2)
```
Problem: Inserted vectors not immediately queryable
Solution: V2 mutations are asynchronous - vectors may take a few seconds to be reflected
- Use mutationId to track mutation status
- Check env.VECTORIZE_INDEX.describe() for the last processed mutation (processedUpToMutation)
```
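Since V2 mutations are asynchronous, a small polling helper can wait until a mutation has been reflected. A sketch assuming `describe()` reports a `processedUpToMutation` ID; note mutation IDs are opaque, so the equality check only works when the target is the most recent mutation you issued:

```typescript
// Poll a describe() function until a given mutation has been processed.
interface IndexInfo {
  processedUpToMutation?: string;
}

async function waitForMutation(
  describe: () => Promise<IndexInfo>,
  targetMutationId: string,
  { intervalMs = 500, maxAttempts = 20 } = {}
): Promise<boolean> {
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    const info = await describe();
    if (info.processedUpToMutation === targetMutationId) return true;
    await new Promise((resolve) => setTimeout(resolve, intervalMs));
  }
  return false; // timed out; the mutation may still be pending
}
```

In a Worker you would pass `() => env.VECTORIZE_INDEX.describe()` and the `mutationId` returned by `insert()`/`upsert()`. Injecting `describe` as a parameter also keeps the helper testable outside a Worker.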
### Error 10: V1 returnMetadata Boolean (BREAKING in V2)
```
Problem: "returnMetadata must be 'all', 'indexed', or 'none'"
Solution: V2 changed returnMetadata from boolean to string enum:
- ❌ V1: { returnMetadata: true }
- ✅ V2: { returnMetadata: 'all' }
```
---
## V2 Migration Checklist
**If migrating from V1 to V2**:
1. ✅ Update wrangler to 3.71.0+ (`npm install -g wrangler@latest`)
2. ✅ Create new V2 index (can't upgrade V1 → V2)
3. ✅ Create metadata indexes BEFORE inserting vectors
4. ✅ Update `returnMetadata` boolean → string enum ('all', 'indexed', 'none')
5. ✅ Handle async mutations (expect `mutationId` in responses)
6. ✅ Test with V2 limits (topK up to 100, 5M vectors per index)
7. ✅ Update error handling for async behavior
**V1 Deprecation**:
- After December 2024: Cannot create new V1 indexes
- Existing V1 indexes: Continue to work
- Pass the `--deprecated-v1` flag to `wrangler vectorize` commands for V1 operations
---
## Official Documentation
- **Vectorize V2 Docs**: https://developers.cloudflare.com/vectorize/
- **V2 Changelog**: https://developers.cloudflare.com/vectorize/platform/changelog/
- **V1 to V2 Migration**: https://developers.cloudflare.com/vectorize/reference/transition-vectorize-legacy/
- **Metadata Filtering**: https://developers.cloudflare.com/vectorize/reference/metadata-filtering/
- **Workers AI Models**: https://developers.cloudflare.com/workers-ai/models/
---
**Status**: Production Ready ✅ (Vectorize V2 GA - September 2024)
**Last Updated**: 2025-11-22
**Token Savings**: ~70%
**Errors Prevented**: 10 (includes V2 breaking changes)

plugin.lock.json Normal file

@@ -0,0 +1,89 @@
{
"$schema": "internal://schemas/plugin.lock.v1.json",
"pluginId": "gh:jezweb/claude-skills:skills/cloudflare-vectorize",
"normalized": {
"repo": null,
"ref": "refs/tags/v20251128.0",
"commit": "de2ea268bcafcbbf8ae0be57c6a3d873b29e4cdb",
"treeHash": "327aa5de3888099b1a554dc7477712aa47d13777e8b2664713fccd69ea27170f",
"generatedAt": "2025-11-28T10:18:56.801059Z",
"toolVersion": "publish_plugins.py@0.2.0"
},
"origin": {
"remote": "git@github.com:zhongweili/42plugin-data.git",
"branch": "master",
"commit": "aa1497ed0949fd50e99e70d6324a29c5b34f9390",
"repoRoot": "/Users/zhongweili/projects/openmind/42plugin-data"
},
"manifest": {
"name": "cloudflare-vectorize",
"description": "Build semantic search and RAG applications with Cloudflare Vectorize, a globally distributed vector database. Supports Workers AI and OpenAI embeddings, metadata filtering with 10 indexes, and namespace partitioning. Use when: creating vector indexes, querying embeddings, implementing semantic search or RAG, configuring metadata filters, or troubleshooting dimension mismatches, metadata index timing, insert vs upsert confusion, or filter syntax errors.",
"version": "1.0.0"
},
"content": {
"files": [
{
"path": "README.md",
"sha256": "05ca70d55f10152936ffea58f014492f9dd671f1faec080d3acdd6712a3de3c3"
},
{
"path": "SKILL.md",
"sha256": "78734c14174cda1067c0a6dcede80ae9ecbf13c1377811ba39bc52c68a965345"
},
{
"path": "references/vector-operations.md",
"sha256": "c6d5dbf5ad52453212f1f4c4825dbc5cd03bcafd75f2ef7d8bffa36246d99e74"
},
{
"path": "references/metadata-guide.md",
"sha256": "8815b8c188d7be48d87c5fe3468f1f88d745e8bb80ed01480f05f98fb247d0bd"
},
{
"path": "references/integration-workers-ai-bge-base.md",
"sha256": "b4c270ba6ee4d097b6443321cdd5e8b4558adad4fc49e6cd05c70d6d17607d0c"
},
{
"path": "references/integration-openai-embeddings.md",
"sha256": "81d6dbdb262d7e0018d08218a02fb168beec92edc9a2ae49bed66d74c69dec99"
},
{
"path": "references/wrangler-commands.md",
"sha256": "9f0fba62708ccf5644331abfb6e5d81fa7da03237661b7869ab63f8d630177f2"
},
{
"path": "references/embedding-models.md",
"sha256": "20bc83002e6989b4951681a5ba35b382cb9159abcd9e879ebb444c2ea9f829f7"
},
{
"path": "references/index-operations.md",
"sha256": "a05090c2a2d6cdab083400232aeaf91ba6c511e1947c2ded68a0d1a8bc7a4d84"
},
{
"path": ".claude-plugin/plugin.json",
"sha256": "3616f151dec1c7f42a0aabce23aae9aedbd7fb162bdb1deb823f219f15db99b2"
},
{
"path": "templates/document-ingestion.ts",
"sha256": "2860088733e249a4f311898f942002e7437bd571790067aac6f832e29e1d50fe"
},
{
"path": "templates/metadata-filtering.ts",
"sha256": "95d4b599b0e9d992ace6619ef3dbde9d206bf35cdf29f3d6ef3368b51e1fc907"
},
{
"path": "templates/basic-search.ts",
"sha256": "e15f40238cee2227ef2189d16036adf5d771351d53a77272210ab716003f9ef0"
},
{
"path": "templates/rag-chat.ts",
"sha256": "09a9dbeb76166c23e8ff4a3e8c5a17ab8b1c5f9ad66b27c925d9ebfc94dd65e9"
}
],
"dirSha256": "327aa5de3888099b1a554dc7477712aa47d13777e8b2664713fccd69ea27170f"
},
"security": {
"scannedAt": null,
"scannerVersion": null,
"flags": []
}
}


@@ -0,0 +1,425 @@
# Embedding Models Reference
Complete guide for generating vector embeddings with Workers AI and OpenAI.
## Model Comparison
| Model | Provider | Dimensions | Metric | Cost | Performance |
|-------|----------|------------|--------|------|-------------|
| @cf/baai/bge-base-en-v1.5 | Workers AI | 768 | cosine | Free | Fast, edge-optimized |
| text-embedding-3-small | OpenAI | 1536 | cosine | $0.02/1M tokens | High quality, affordable |
| text-embedding-3-large | OpenAI | 3072 | cosine | $0.13/1M tokens | Highest accuracy |
| text-embedding-ada-002 | OpenAI (legacy) | 1536 | cosine | $0.10/1M tokens | Deprecated |
## Workers AI (@cf/baai/bge-base-en-v1.5)
**Best for**: Production apps requiring free, fast embeddings with good quality.
### Configuration
```bash
# Create index with 768 dimensions
npx wrangler vectorize create my-index \
--dimensions=768 \
--metric=cosine
```
### Wrangler Binding
```jsonc
{
"ai": {
"binding": "AI"
},
"vectorize": [
{
"binding": "VECTORIZE_INDEX",
"index_name": "my-index"
}
]
}
```
### Single Text
```typescript
const embedding = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: "Cloudflare Workers are serverless functions."
});
// embedding.data[0] is number[] with 768 dimensions
await env.VECTORIZE_INDEX.upsert([{
id: 'doc-1',
values: embedding.data[0],
metadata: { title: 'Workers Intro' }
}]);
```
### Batch Embeddings
```typescript
const texts = [
"Document 1 content",
"Document 2 content",
"Document 3 content"
];
const embeddings = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: texts // Array of strings
});
// embeddings.data is number[][] (array of 768-dim vectors)
const vectors = texts.map((text, i) => ({
id: `doc-${i}`,
values: embeddings.data[i],
metadata: { content: text }
}));
await env.VECTORIZE_INDEX.upsert(vectors);
```
### Error Handling
```typescript
try {
const embedding = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: userQuery
});
if (!embedding?.data?.[0]) {
throw new Error('No embedding returned');
}
// Use embedding
} catch (error) {
console.error('Embedding generation failed:', error);
// Fallback logic
}
```
### Limits
- **Max input length**: ~512 tokens (~2000 characters)
- **Batch size**: Up to 100 texts per request
- **Rate limits**: Generous (Workers AI scales automatically)
- **Cost**: Free!
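Given the ~2000-character input limit above, long documents need chunking before embedding. A minimal sketch with overlapping chunks so context survives the cut points (the helper name and overlap size are illustrative choices, not part of any API):

```typescript
// Split long text into chunks that fit bge-base's ~2000-character limit.
// Overlap carries some context across chunk boundaries.
function chunkText(text: string, maxChars = 2000, overlap = 200): string[] {
  if (maxChars <= overlap) throw new Error('maxChars must exceed overlap');
  const chunks: string[] = [];
  let start = 0;
  while (start < text.length) {
    chunks.push(text.slice(start, start + maxChars));
    start += maxChars - overlap;
  }
  return chunks;
}
```

Each chunk can then be embedded in batches (up to 100 texts per Workers AI request) and upserted with metadata pointing back to the source document.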
## OpenAI Embeddings
**Best for**: Higher quality embeddings, larger context windows, or specific use cases.
### API Key Setup
Store API key as environment variable:
```bash
npx wrangler secret put OPENAI_API_KEY
```
### text-embedding-3-small (1536 dimensions)
**Best for**: Cost-effective, high-quality embeddings.
#### Configuration
```bash
# Create index with 1536 dimensions
npx wrangler vectorize create my-index \
--dimensions=1536 \
--metric=cosine
```
#### Worker Code
```typescript
import OpenAI from 'openai';
export interface Env {
OPENAI_API_KEY: string;
VECTORIZE_INDEX: VectorizeIndex;
}
export default {
async fetch(request: Request, env: Env): Promise<Response> {
const openai = new OpenAI({ apiKey: env.OPENAI_API_KEY });
// Single embedding
const response = await openai.embeddings.create({
model: "text-embedding-3-small",
input: "Text to embed",
encoding_format: "float" // Default
});
await env.VECTORIZE_INDEX.upsert([{
id: 'doc-1',
values: response.data[0].embedding, // 1536 dimensions
metadata: { model: 'openai-3-small' }
}]);
return Response.json({ success: true });
}
};
```
#### Batch Embeddings
```typescript
const response = await openai.embeddings.create({
model: "text-embedding-3-small",
input: [
"Document 1",
"Document 2",
"Document 3"
]
});
const vectors = response.data.map((item, i) => ({
id: `doc-${i}`,
values: item.embedding,
metadata: { index: i }
}));
await env.VECTORIZE_INDEX.upsert(vectors);
```
### text-embedding-3-large (3072 dimensions)
**Best for**: Maximum accuracy, research, or high-stakes applications.
```bash
# Create index with 3072 dimensions
npx wrangler vectorize create high-accuracy-index \
--dimensions=3072 \
--metric=cosine
```
```typescript
const response = await openai.embeddings.create({
model: "text-embedding-3-large",
input: "Text requiring high accuracy embedding"
});
await env.VECTORIZE_INDEX.upsert([{
id: 'doc-1',
values: response.data[0].embedding, // 3072 dimensions
metadata: { model: 'openai-3-large' }
}]);
```
### OpenAI Error Handling
```typescript
try {
const response = await openai.embeddings.create({
model: "text-embedding-3-small",
input: text
});
return response.data[0].embedding;
} catch (error) {
if (error.status === 429) {
console.error('Rate limited');
// Implement retry with backoff
} else if (error.status === 401) {
console.error('Invalid API key');
} else {
console.error('OpenAI error:', error);
}
throw error;
}
```
### OpenAI Limits
- **text-embedding-3-small**: 8191 tokens input
- **text-embedding-3-large**: 8191 tokens input
- **Batch size**: Up to 2048 inputs per request
- **Rate limits**: Varies by tier (check OpenAI dashboard)
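With a 2048-inputs-per-request cap, large ingestion jobs need to be split into batches before calling `embeddings.create` in a loop. A small generic sketch:

```typescript
// Split an array into batches no larger than OpenAI's
// 2048-inputs-per-request limit.
function toBatches<T>(items: T[], batchSize = 2048): T[][] {
  const batches: T[][] = [];
  for (let i = 0; i < items.length; i += batchSize) {
    batches.push(items.slice(i, i + batchSize));
  }
  return batches;
}
```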
## Model Selection Guide
### Use Workers AI (@cf/baai/bge-base-en-v1.5) when:
✅ Building production apps with budget constraints
✅ Need fast, edge-optimized embeddings
✅ Working with English text
✅ Don't need extremely high accuracy
✅ Want zero per-request costs
### Use OpenAI text-embedding-3-small when:
✅ Need higher quality than Workers AI
✅ Budget allows ($0.02/1M tokens is affordable)
✅ Working with multilingual content
✅ Need longer context (8191 tokens)
✅ Willing to pay for better accuracy
### Use OpenAI text-embedding-3-large when:
✅ Accuracy is critical (legal, medical, research)
✅ Large budget ($0.13/1M tokens)
✅ Need best possible search quality
✅ Working with complex or nuanced content
## Embedding Best Practices
### 1. Consistent Model Usage
**Always use the SAME model for indexing and querying!**
```typescript
// ❌ Wrong: Different models
// Index with Workers AI
const indexEmbedding = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: document
});
// Query with OpenAI (WRONG!)
const queryEmbedding = await openai.embeddings.create({
model: "text-embedding-3-small",
input: query
});
// This won't work - different embedding spaces!
// ✅ Right: Same model
const indexEmbedding = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: document
});
const queryEmbedding = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: query
});
```
### 2. Text Preprocessing
```typescript
function preprocessText(text: string): string {
return text
.trim() // Remove leading/trailing whitespace
.replace(/\s+/g, ' ') // Normalize whitespace
    .slice(0, 2000);      // Truncate to bge-base's ~2000-char limit (raise for OpenAI models)
}
const embedding = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: preprocessText(rawText)
});
```
### 3. Batch for Efficiency
```typescript
// ✅ Good: Batch processing
const embeddings = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: arrayOf100Texts
});
// ❌ Bad: Individual requests
for (const text of texts) {
const embedding = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text
});
}
```
### 4. Cache Embeddings
```typescript
// Store embeddings, don't regenerate
await env.VECTORIZE_INDEX.upsert([{
id: 'doc-1',
values: embedding,
metadata: {
content: text, // Store original
model: 'bge-base-en-v1.5',
generated_at: Date.now()
}
}]);
// Later: Retrieve embedding instead of regenerating
const vectors = await env.VECTORIZE_INDEX.getByIds(['doc-1']);
const cachedEmbedding = vectors[0].values; // Reuse!
```
### 5. Handle Failures Gracefully
```typescript
async function generateEmbedding(text: string, env: Env): Promise<number[]> {
try {
const response = await env.AI.run('@cf/baai/bge-base-en-v1.5', { text });
return response.data[0];
} catch (error) {
console.error('Primary embedding failed, trying fallback');
// Fallback to OpenAI if Workers AI fails
const openai = new OpenAI({ apiKey: env.OPENAI_API_KEY });
const fallback = await openai.embeddings.create({
model: "text-embedding-3-small",
input: text
});
return fallback.data[0].embedding;
}
}
```
## Testing Embedding Quality
### Compare Similarity Scores
```typescript
// Test known similar texts
const text1 = "Cloudflare Workers are serverless functions";
const text2 = "Workers are serverless code running on Cloudflare's edge";
const text3 = "Unrelated content about cooking recipes";
const [emb1, emb2, emb3] = await Promise.all([
env.AI.run('@cf/baai/bge-base-en-v1.5', { text: text1 }),
env.AI.run('@cf/baai/bge-base-en-v1.5', { text: text2 }),
env.AI.run('@cf/baai/bge-base-en-v1.5', { text: text3 }),
]);
const similar = cosineSimilarity(emb1.data[0], emb2.data[0]); // Should be high (>0.7)
const different = cosineSimilarity(emb1.data[0], emb3.data[0]); // Should be low (<0.3)
```
### Cosine Similarity Helper
```typescript
function cosineSimilarity(a: number[], b: number[]): number {
const dotProduct = a.reduce((sum, val, i) => sum + val * b[i], 0);
const magA = Math.sqrt(a.reduce((sum, val) => sum + val * val, 0));
const magB = Math.sqrt(b.reduce((sum, val) => sum + val * val, 0));
return dotProduct / (magA * magB);
}
```
## Dimension Mismatch Debugging
```typescript
// Check actual dimensions before upserting
const embedding = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: "test"
});
console.log('Embedding dimensions:', embedding.data[0].length);
// Verify against index
const indexInfo = await fetch(
`https://api.cloudflare.com/client/v4/accounts/${accountId}/vectorize/v2/indexes/my-index`,
{ headers: { 'Authorization': `Bearer ${apiToken}` } }
);
const indexConfig = await indexInfo.json();
console.log('Index dimensions:', indexConfig.result.config.dimensions);
// Must match!
if (embedding.data[0].length !== indexConfig.result.config.dimensions) {
throw new Error('Dimension mismatch!');
}
```
## See Also
- [Vector Operations](./vector-operations.md)
- [Index Operations](./index-operations.md)
- [Workers AI Models](https://developers.cloudflare.com/workers-ai/models/#text-embeddings)
- [OpenAI Embeddings API](https://platform.openai.com/docs/guides/embeddings)


@@ -0,0 +1,364 @@
# Index Operations Guide
Complete guide for creating and managing Vectorize indexes.
## Index Configuration
### Critical Decisions (Cannot Be Changed!)
When creating an index, these settings are **permanent**:
1. **Dimensions**: Vector width (must match embedding model)
2. **Distance Metric**: How similarity is calculated
Choose carefully - you cannot change these after creation!
### Dimensions
Dimensions must match your embedding model's output:
| Model | Provider | Dimensions | Recommended Metric |
|-------|----------|------------|-------------------|
| @cf/baai/bge-base-en-v1.5 | Workers AI | 768 | cosine |
| text-embedding-3-small | OpenAI | 1536 | cosine |
| text-embedding-3-large | OpenAI | 3072 | cosine |
| text-embedding-ada-002 | OpenAI (legacy) | 1536 | cosine |
| embed-english-v3.0 | Cohere | 1024 | cosine |
**Common Mistake**: Creating an index with 1536 dimensions but using a 768-dim model!
### Distance Metrics
Choose based on your embedding model and use case:
#### Cosine Similarity (`cosine`)
- **Best for**: Normalized embeddings (most common)
- **Range**: -1 (opposite) to 1 (identical)
- **Use when**: Embeddings are L2-normalized
- **Most common choice** - works with Workers AI, OpenAI, Cohere
```bash
npx wrangler vectorize create my-index \
--dimensions=768 \
--metric=cosine
```
#### Euclidean Distance (`euclidean`)
- **Best for**: Absolute distance matters
- **Range**: 0 (identical) to ∞ (different)
- **Use when**: Magnitude of vectors is important
- **Example**: Geographic coordinates, image features
```bash
npx wrangler vectorize create geo-index \
--dimensions=2 \
--metric=euclidean
```
#### Dot Product (`dot-product`)
- **Best for**: Non-normalized embeddings
- **Range**: -∞ to ∞
- **Use when**: Embeddings are not normalized
- **Less common** - most models produce normalized embeddings
```bash
npx wrangler vectorize create sparse-index \
--dimensions=1024 \
--metric=dot-product
```
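The three metrics can be computed by hand to see how the choice changes what "close" means. A minimal sketch (these are the standard formulas, not Vectorize internals):

```typescript
// Dot product: sum of pairwise products; unbounded for non-normalized vectors.
function dot(a: number[], b: number[]): number {
  return a.reduce((sum, v, i) => sum + v * b[i], 0);
}

// Cosine: dot product normalized by magnitudes; 1 = identical direction.
function cosine(a: number[], b: number[]): number {
  return dot(a, b) / (Math.hypot(...a) * Math.hypot(...b));
}

// Euclidean: straight-line distance; 0 = identical vectors.
function euclidean(a: number[], b: number[]): number {
  return Math.hypot(...a.map((v, i) => v - b[i]));
}
```

For L2-normalized embeddings (Workers AI, OpenAI), cosine and dot product rank results identically, which is why cosine is the default recommendation.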
## Creating Indexes
### Via Wrangler CLI
```bash
npx wrangler vectorize create <name> \
--dimensions=<number> \
--metric=<metric> \
[--description="<text>"]
```
### Via REST API
```typescript
const response = await fetch(
`https://api.cloudflare.com/client/v4/accounts/${accountId}/vectorize/v2/indexes`,
{
method: 'POST',
headers: {
'Authorization': `Bearer ${apiToken}`,
'Content-Type': 'application/json',
},
body: JSON.stringify({
name: 'my-index',
description: 'Production semantic search',
config: {
dimensions: 768,
metric: 'cosine',
},
}),
}
);
```
## Metadata Indexes
**⚠️ CRITICAL TIMING**: Create metadata indexes IMMEDIATELY after creating the main index, BEFORE inserting any vectors!
### Why Timing Matters
Vectorize builds metadata indexes **only for vectors inserted AFTER** the metadata index was created. Vectors inserted before won't be filterable!
### Best Practice Workflow
```bash
# 1. Create main index
npx wrangler vectorize create docs-search \
--dimensions=768 \
--metric=cosine
# 2. IMMEDIATELY create all metadata indexes
npx wrangler vectorize create-metadata-index docs-search \
--property-name=category --type=string
npx wrangler vectorize create-metadata-index docs-search \
--property-name=timestamp --type=number
npx wrangler vectorize create-metadata-index docs-search \
--property-name=published --type=boolean
# 3. Verify metadata indexes exist
npx wrangler vectorize list-metadata-index docs-search
# 4. NOW safe to start inserting vectors
```
### Metadata Index Limits
- **Max 10 metadata indexes** per Vectorize index
- **String type**: First 64 bytes indexed (UTF-8 boundaries)
- **Number type**: Float64 precision
- **Boolean type**: true/false
### Choosing What to Index
Only create metadata indexes for fields you'll **filter** on:
**Good candidates**:
- `category` (string) - "docs", "tutorials", "guides"
- `language` (string) - "en", "es", "fr"
- `published_at` (number) - Unix timestamp
- `status` (string) - "published", "draft", "archived"
- `verified` (boolean) - true/false
**Bad candidates** (don't need indexes):
- `title` (string) - only for display, not filtering
- `content` (string) - stored in metadata but not filtered
- `url` (string) - unless filtering by URL prefix
## Wrangler Binding
After creating an index, bind it to your Worker:
### wrangler.jsonc
```jsonc
{
"name": "my-worker",
"main": "src/index.ts",
"vectorize": [
{
"binding": "VECTORIZE_INDEX",
"index_name": "docs-search"
}
]
}
```
### TypeScript Types
```typescript
export interface Env {
VECTORIZE_INDEX: VectorizeIndex;
}
```
## Index Management Operations
### List All Indexes
```bash
npx wrangler vectorize list
```
### Get Index Details
```bash
npx wrangler vectorize get my-index
```
**Returns**:
```json
{
"name": "my-index",
"description": "Production search",
"config": {
"dimensions": 768,
"metric": "cosine"
},
"created_on": "2024-01-15T10:30:00Z",
"modified_on": "2024-01-15T10:30:00Z"
}
```
### Get Index Info (Vector Count)
```bash
npx wrangler vectorize info my-index
```
**Returns**:
```json
{
"vectorsCount": 12543,
"lastProcessedMutation": {
"id": "abc123...",
"timestamp": "2024-01-20T14:22:00Z"
}
}
```
### Delete Index
```bash
# With confirmation
npx wrangler vectorize delete my-index
# Skip confirmation (use with caution!)
npx wrangler vectorize delete my-index --force
```
**⚠️ WARNING**: Deletion is **irreversible**! All vectors are permanently lost.
## Index Naming Best Practices
### Good Names
- `production-docs-search` - Environment + purpose
- `dev-product-recommendations` - Environment + use case
- `customer-support-rag` - Descriptive use case
- `en-knowledge-base` - Language + type
### Bad Names
- `index1` - Not descriptive
- `my_index` - Use dashes, not underscores
- `PRODUCTION` - Use lowercase
- `this-is-a-very-long-index-name-that-exceeds-limits` - Too long
### Naming Rules
- Lowercase letters and numbers only
- Dashes allowed (not underscores or spaces)
- Must start with a letter
- Max 32 characters
- No special characters
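The rules above can be condensed into a single check run before calling `wrangler vectorize create` from automation. A sketch (the regex encodes the rules as stated here; it is not an official validator):

```typescript
// Index name rules: starts with a lowercase letter; lowercase letters,
// digits, and dashes only; max 32 characters.
function isValidIndexName(name: string): boolean {
  return /^[a-z][a-z0-9-]{0,31}$/.test(name);
}
```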
## Common Patterns
### Multi-Environment Setup
```bash
# Development
npx wrangler vectorize create dev-docs-search \
--dimensions=768 --metric=cosine
# Staging
npx wrangler vectorize create staging-docs-search \
--dimensions=768 --metric=cosine
# Production
npx wrangler vectorize create prod-docs-search \
--dimensions=768 --metric=cosine
```
```jsonc
// wrangler.jsonc
{
"env": {
"dev": {
"vectorize": [
{ "binding": "VECTORIZE", "index_name": "dev-docs-search" }
]
},
"staging": {
"vectorize": [
{ "binding": "VECTORIZE", "index_name": "staging-docs-search" }
]
},
"production": {
"vectorize": [
{ "binding": "VECTORIZE", "index_name": "prod-docs-search" }
]
}
}
}
```
### Multi-Tenant with Namespaces
Instead of creating separate indexes per customer, use one index with namespaces:
```bash
# Single index for all tenants
npx wrangler vectorize create multi-tenant-index \
--dimensions=768 --metric=cosine
```
```typescript
// Insert with namespace
await env.VECTORIZE_INDEX.upsert([{
id: 'doc-1',
values: embedding,
namespace: 'customer-abc123', // Isolates by customer
metadata: { title: 'Customer document' }
}]);
// Query within namespace
const results = await env.VECTORIZE_INDEX.query(queryVector, {
topK: 5,
namespace: 'customer-abc123' // Only search this customer's data
});
```
## Troubleshooting
### "Index name already exists"
```bash
# Check existing indexes
npx wrangler vectorize list
# Delete old index if needed
npx wrangler vectorize delete old-name --force
```
### "Cannot change dimensions"
**No fix** - must create new index and re-insert all vectors.
### "Wrangler version 3.71.0 required"
```bash
# Update Wrangler
npm install -g wrangler@latest
# Or use npx
npx wrangler@latest vectorize create ...
```
## See Also
- [Wrangler Commands](./wrangler-commands.md)
- [Vector Operations](./vector-operations.md)
- [Metadata Guide](./metadata-guide.md)


# OpenAI Embeddings Integration Example
Complete working example using OpenAI embeddings (text-embedding-3-small/large) with Vectorize.
## Model Specifications
### text-embedding-3-small
- **Dimensions**: 1536
- **Metric**: cosine (recommended)
- **Max Input**: 8191 tokens (~32K characters)
- **Cost**: $0.02 per 1M tokens
- **Best for**: High-quality embeddings at affordable cost
### text-embedding-3-large
- **Dimensions**: 3072
- **Metric**: cosine (recommended)
- **Max Input**: 8191 tokens (~32K characters)
- **Cost**: $0.13 per 1M tokens
- **Best for**: Maximum accuracy
## Setup
### 1. Install OpenAI SDK
```bash
npm install openai
```
### 2. Store API Key
```bash
# Set as Cloudflare secret
npx wrangler secret put OPENAI_API_KEY
# Paste your API key when prompted
```
### 3. Create Vectorize Index
**For text-embedding-3-small**:
```bash
npx wrangler vectorize create openai-search \
--dimensions=1536 \
--metric=cosine \
--description="Semantic search with OpenAI embeddings"
```
**For text-embedding-3-large**:
```bash
npx wrangler vectorize create openai-high-accuracy \
--dimensions=3072 \
--metric=cosine
```
### 4. Create Metadata Indexes
```bash
npx wrangler vectorize create-metadata-index openai-search \
--property-name=category --type=string
npx wrangler vectorize create-metadata-index openai-search \
--property-name=timestamp --type=number
```
### 5. Configure Wrangler
**wrangler.jsonc**:
```jsonc
{
"name": "vectorize-openai-example",
"main": "src/index.ts",
"compatibility_date": "2025-10-21",
"vectorize": [
{
"binding": "VECTORIZE_INDEX",
"index_name": "openai-search"
}
],
"vars": {
"EMBEDDING_MODEL": "text-embedding-3-small"
}
}
```
**Note**: OPENAI_API_KEY is stored as a secret, not in wrangler.jsonc!
## Complete Worker Example
```typescript
import OpenAI from 'openai';
export interface Env {
OPENAI_API_KEY: string;
VECTORIZE_INDEX: VectorizeIndex;
EMBEDDING_MODEL?: string; // From wrangler.jsonc vars
}
interface Document {
id: string;
title: string;
content: string;
category?: string;
metadata?: Record<string, any>;
}
export default {
async fetch(request: Request, env: Env, ctx: ExecutionContext): Promise<Response> {
const openai = new OpenAI({
apiKey: env.OPENAI_API_KEY,
});
const embeddingModel = env.EMBEDDING_MODEL || 'text-embedding-3-small';
const url = new URL(request.url);
// CORS
if (request.method === 'OPTIONS') {
return new Response(null, {
headers: {
'Access-Control-Allow-Origin': '*',
'Access-Control-Allow-Methods': 'GET, POST, OPTIONS',
'Access-Control-Allow-Headers': 'Content-Type',
},
});
}
// INDEX DOCUMENTS
if (url.pathname === '/index' && request.method === 'POST') {
try {
const { documents } = await request.json() as { documents: Document[] };
if (!documents || !Array.isArray(documents) || documents.length === 0) {
return Response.json({ error: 'Invalid documents array' }, { status: 400 });
}
// Generate embeddings (batch)
const response = await openai.embeddings.create({
model: embeddingModel,
input: documents.map(doc => doc.content),
encoding_format: 'float',
});
// Prepare vectors
const vectors = documents.map((doc, i) => ({
id: doc.id,
values: response.data[i].embedding,
metadata: {
title: doc.title,
content: doc.content,
category: doc.category || 'general',
timestamp: Math.floor(Date.now() / 1000),
model: embeddingModel,
...doc.metadata,
},
}));
// Batch upsert (100 at a time)
const batchSize = 100;
for (let i = 0; i < vectors.length; i += batchSize) {
const batch = vectors.slice(i, i + batchSize);
await env.VECTORIZE_INDEX.upsert(batch);
}
return Response.json({
success: true,
indexed: vectors.length,
model: embeddingModel,
usage: {
prompt_tokens: response.usage.prompt_tokens,
total_tokens: response.usage.total_tokens,
},
}, {
headers: { 'Access-Control-Allow-Origin': '*' },
});
} catch (error) {
console.error('Indexing error:', error);
// Handle OpenAI-specific errors
if (error instanceof OpenAI.APIError) {
return Response.json({
error: 'OpenAI API error',
message: error.message,
status: error.status,
code: error.code,
}, { status: error.status || 500 });
}
return Response.json({
error: error instanceof Error ? error.message : 'Unknown error',
}, { status: 500 });
}
}
// SEARCH
if (url.pathname === '/search' && request.method === 'POST') {
try {
const { query, topK = 5, filter, namespace } = await request.json() as {
query: string;
topK?: number;
filter?: Record<string, any>;
namespace?: string;
};
if (!query) {
return Response.json({ error: 'Missing query' }, { status: 400 });
}
// Generate query embedding
const response = await openai.embeddings.create({
model: embeddingModel,
input: query,
encoding_format: 'float',
});
// Search Vectorize
const results = await env.VECTORIZE_INDEX.query(
response.data[0].embedding,
{
topK,
filter,
namespace,
returnMetadata: 'all',
returnValues: false,
}
);
return Response.json({
query,
model: embeddingModel,
results: results.matches.map(match => ({
id: match.id,
score: match.score,
title: match.metadata?.title,
content: match.metadata?.content,
category: match.metadata?.category,
})),
count: results.count,
usage: {
prompt_tokens: response.usage.prompt_tokens,
},
}, {
headers: { 'Access-Control-Allow-Origin': '*' },
});
} catch (error) {
console.error('Search error:', error);
if (error instanceof OpenAI.APIError) {
return Response.json({
error: 'OpenAI API error',
message: error.message,
status: error.status,
}, { status: error.status || 500 });
}
return Response.json({
error: error instanceof Error ? error.message : 'Unknown error',
}, { status: 500 });
}
}
// DEFAULT: API Documentation
return Response.json({
name: 'Vectorize + OpenAI Embeddings',
model: embeddingModel,
endpoints: {
'POST /index': {
description: 'Index documents with OpenAI embeddings',
body: {
documents: [
{
id: 'doc-1',
title: 'Document Title',
content: 'Document content (up to 8191 tokens)',
category: 'tutorials',
},
],
},
},
'POST /search': {
description: 'Semantic search',
body: {
query: 'search query',
topK: 5,
filter: { category: 'tutorials' },
},
},
},
});
},
};
```
## Usage Examples
### 1. Index Documents
```bash
curl -X POST https://your-worker.workers.dev/index \
-H "Content-Type: application/json" \
-d '{
"documents": [
{
"id": "legal-doc-1",
"title": "Terms of Service",
"content": "This Terms of Service agreement governs your use of our platform. By accessing or using the service, you agree to be bound by these terms. The service is provided as-is without warranties...",
"category": "legal",
"metadata": {
"version": "2.1",
"effective_date": "2024-01-01"
}
},
{
"id": "legal-doc-2",
"title": "Privacy Policy",
"content": "We collect and process personal data in accordance with GDPR and other applicable regulations. This policy describes what data we collect, how we use it, and your rights regarding your data...",
"category": "legal"
}
]
}'
```
### 2. Search with High Accuracy
```bash
curl -X POST https://your-worker.workers.dev/search \
-H "Content-Type: application/json" \
-d '{
"query": "What are my rights under your privacy policy?",
"topK": 3,
"filter": { "category": "legal" }
}'
```
## Cost Estimation
### text-embedding-3-small ($0.02/1M tokens)
```
1 page ≈ 500 tokens
10,000 pages = 5M tokens = $0.10
100,000 pages = 50M tokens = $1.00
1M pages = 500M tokens = $10.00
```
### text-embedding-3-large ($0.13/1M tokens)
```
10,000 pages = 5M tokens = $0.65
100,000 pages = 50M tokens = $6.50
1M pages = 500M tokens = $65.00
```
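The arithmetic above can be wrapped in a small helper. The rates are the published per-1M-token prices at the time of writing; verify current pricing before relying on the output:

```typescript
// Estimated embedding cost in USD for a given token count and model.
// Rates are per 1M tokens and may change; check OpenAI pricing.
const RATES_PER_1M: Record<string, number> = {
  'text-embedding-3-small': 0.02,
  'text-embedding-3-large': 0.13,
};

function estimateCost(totalTokens: number, model: string): number {
  const rate = RATES_PER_1M[model];
  if (rate === undefined) throw new Error(`Unknown model: ${model}`);
  return (totalTokens / 1_000_000) * rate;
}
```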
## Error Handling
### Rate Limiting
```typescript
async function generateEmbeddingWithRetry(
text: string,
openai: OpenAI,
model: string,
maxRetries = 3
): Promise<number[]> {
for (let attempt = 0; attempt < maxRetries; attempt++) {
try {
const response = await openai.embeddings.create({
model,
input: text,
});
return response.data[0].embedding;
} catch (error) {
if (error instanceof OpenAI.APIError && error.status === 429) {
// Rate limited - exponential backoff
const delay = Math.pow(2, attempt) * 1000; // 1s, 2s, 4s
console.log(`Rate limited. Retrying in ${delay}ms...`);
await new Promise(resolve => setTimeout(resolve, delay));
continue;
}
throw error;
}
}
throw new Error('Max retries exceeded');
}
```
### API Key Validation
```typescript
if (!env.OPENAI_API_KEY) {
return Response.json({
error: 'OpenAI API key not configured',
message: 'Set OPENAI_API_KEY using: npx wrangler secret put OPENAI_API_KEY',
}, { status: 500 });
}
```
### Dimension Validation
```typescript
const response = await openai.embeddings.create({
model: 'text-embedding-3-small',
input: 'test',
});
const dimensions = response.data[0].embedding.length;
console.log(`Embedding dimensions: ${dimensions}`); // Should be 1536
if (dimensions !== 1536) {
throw new Error(`Expected 1536 dimensions, got ${dimensions}`);
}
```
## Switching Between Models
### Update wrangler.jsonc
```jsonc
{
"vars": {
"EMBEDDING_MODEL": "text-embedding-3-large"
}
}
```
### Create New Index
```bash
# Create index with 3072 dimensions for text-embedding-3-large
npx wrangler vectorize create openai-large \
  --dimensions=3072 \
  --metric=cosine
```

Then point the binding at the new index in wrangler.jsonc:

```jsonc
{
  "vectorize": [
    {
      "binding": "VECTORIZE_INDEX",
      "index_name": "openai-large"
    }
  ]
}
```
## Testing Locally
```bash
# Set API key for local dev
export OPENAI_API_KEY=sk-...
# Run dev server
npx wrangler dev
# Test
curl -X POST http://localhost:8787/index \
-H "Content-Type: application/json" \
-d '{"documents":[{"id":"test","title":"Test","content":"Test content"}]}'
```
## Performance Tips
1. **Batch requests**: Up to 2048 inputs per API call
2. **Monitor usage**: Track token consumption in response
3. **Cache embeddings**: Store in Vectorize, don't regenerate
4. **Use smaller model**: text-embedding-3-small is 6.5x cheaper
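Tip 1 (batching) usually needs a generic chunking helper so no single request exceeds the 2048-input limit. A minimal sketch:

```typescript
// Split an array into fixed-size batches (the last batch may be smaller).
// 2048 is OpenAI's documented max inputs per embeddings request.
function toBatches<T>(items: T[], batchSize = 2048): T[][] {
  const batches: T[][] = [];
  for (let i = 0; i < items.length; i += batchSize) {
    batches.push(items.slice(i, i + batchSize));
  }
  return batches;
}
```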
## Migration from Workers AI
If migrating from Workers AI to OpenAI:
1. Create new index with 1536 or 3072 dimensions
2. Re-generate embeddings with OpenAI
3. Update queries to use same model
4. **Don't mix models!** Always use the same model for index and query
## See Also
- [Main Skill Documentation](../SKILL.md)
- [Embedding Models Reference](../references/embedding-models.md)
- [Workers AI Example](./workers-ai-bge-base.md)
- [OpenAI Embeddings API](https://platform.openai.com/docs/guides/embeddings)

# Workers AI Integration Example (@cf/baai/bge-base-en-v1.5)
Complete working example using Cloudflare Workers AI for embeddings with Vectorize.
## Model Specifications
- **Model**: `@cf/baai/bge-base-en-v1.5`
- **Dimensions**: 768
- **Metric**: cosine (recommended)
- **Max Input**: ~512 tokens (~2000 characters)
- **Cost**: Free
- **Latency**: ~50-200ms (edge-optimized)
## Setup
### 1. Create Vectorize Index
```bash
npx wrangler vectorize create docs-search \
--dimensions=768 \
--metric=cosine \
--description="Documentation search with Workers AI"
```
### 2. Create Metadata Indexes
```bash
npx wrangler vectorize create-metadata-index docs-search \
--property-name=category --type=string
npx wrangler vectorize create-metadata-index docs-search \
--property-name=published_at --type=number
```
### 3. Configure Wrangler
**wrangler.jsonc**:
```jsonc
{
"name": "vectorize-workers-ai-example",
"main": "src/index.ts",
"compatibility_date": "2025-10-21",
"ai": {
"binding": "AI"
},
"vectorize": [
{
"binding": "VECTORIZE_INDEX",
"index_name": "docs-search"
}
]
}
```
## Complete Worker Example
```typescript
export interface Env {
AI: Ai;
VECTORIZE_INDEX: VectorizeIndex;
}
interface Document {
id: string;
title: string;
content: string;
category?: string;
url?: string;
}
export default {
async fetch(request: Request, env: Env, ctx: ExecutionContext): Promise<Response> {
const url = new URL(request.url);
// INDEX DOCUMENTS
if (url.pathname === '/index' && request.method === 'POST') {
try {
const { documents } = await request.json() as { documents: Document[] };
if (!documents || !Array.isArray(documents)) {
return Response.json({ error: 'Invalid documents array' }, { status: 400 });
}
// Extract text for embedding
const texts = documents.map(doc => doc.content);
// Generate embeddings (batch)
const embeddings = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: texts
});
// Prepare vectors
const vectors = documents.map((doc, i) => ({
id: doc.id,
values: embeddings.data[i],
metadata: {
title: doc.title,
content: doc.content,
category: doc.category || 'general',
url: doc.url,
published_at: Math.floor(Date.now() / 1000),
},
}));
// Upsert to Vectorize
await env.VECTORIZE_INDEX.upsert(vectors);
return Response.json({
success: true,
indexed: vectors.length,
ids: vectors.map(v => v.id),
});
} catch (error) {
return Response.json({
error: error instanceof Error ? error.message : 'Unknown error',
}, { status: 500 });
}
}
// SEARCH
if (url.pathname === '/search' && request.method === 'POST') {
try {
const { query, topK = 5, filter } = await request.json() as {
query: string;
topK?: number;
filter?: Record<string, any>;
};
if (!query) {
return Response.json({ error: 'Missing query' }, { status: 400 });
}
// Generate query embedding
const queryEmbedding = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: query,
});
// Search Vectorize
const results = await env.VECTORIZE_INDEX.query(
queryEmbedding.data[0],
{
topK,
filter,
returnMetadata: 'all',
returnValues: false,
}
);
return Response.json({
query,
results: results.matches.map(match => ({
id: match.id,
score: match.score,
title: match.metadata?.title,
content: match.metadata?.content,
category: match.metadata?.category,
url: match.metadata?.url,
})),
count: results.count,
});
} catch (error) {
return Response.json({
error: error instanceof Error ? error.message : 'Unknown error',
}, { status: 500 });
}
}
// DEFAULT: API Documentation
return Response.json({
name: 'Vectorize + Workers AI Example',
endpoints: {
'POST /index': {
description: 'Index documents',
body: {
documents: [
{
id: 'doc-1',
title: 'Document Title',
content: 'Document content for embedding',
category: 'tutorials',
url: '/docs/getting-started',
},
],
},
},
'POST /search': {
description: 'Semantic search',
body: {
query: 'search query text',
topK: 5,
filter: { category: 'tutorials' },
},
},
},
});
},
};
```
## Usage Examples
### 1. Index Documents
```bash
curl -X POST https://your-worker.workers.dev/index \
-H "Content-Type: application/json" \
-d '{
"documents": [
{
"id": "workers-intro",
"title": "Introduction to Cloudflare Workers",
"content": "Cloudflare Workers allow you to deploy serverless code globally across the Cloudflare edge network. Workers run on V8 isolates providing fast cold starts.",
"category": "documentation",
"url": "/workers/getting-started"
},
{
"id": "vectorize-intro",
"title": "Introduction to Vectorize",
"content": "Vectorize is a globally distributed vector database for semantic search and AI applications. It integrates seamlessly with Workers AI for embedding generation.",
"category": "documentation",
"url": "/vectorize/getting-started"
},
{
"id": "d1-intro",
"title": "Introduction to D1",
"content": "D1 is a serverless SQL database from Cloudflare, built on SQLite. It provides familiar SQL semantics with global distribution.",
"category": "documentation",
"url": "/d1/getting-started"
}
]
}'
```
### 2. Search
```bash
curl -X POST https://your-worker.workers.dev/search \
-H "Content-Type: application/json" \
-d '{
"query": "How do I deploy serverless functions?",
"topK": 3,
"filter": { "category": "documentation" }
}'
```
**Response**:
```json
{
"query": "How do I deploy serverless functions?",
"results": [
{
"id": "workers-intro",
"score": 0.87,
"title": "Introduction to Cloudflare Workers",
"content": "Cloudflare Workers allow you to deploy...",
"category": "documentation",
"url": "/workers/getting-started"
},
{
"id": "vectorize-intro",
"score": 0.62,
"title": "Introduction to Vectorize",
"content": "Vectorize is a globally distributed...",
"category": "documentation",
"url": "/vectorize/getting-started"
}
],
"count": 2
}
```
## Performance Tips
### 1. Batch Embeddings
```typescript
// ✅ Good: Single API call for multiple texts
const embeddings = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: [text1, text2, text3, ...] // Up to 100 texts
});
// ❌ Bad: Multiple API calls
for (const text of texts) {
const embedding = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text
});
}
```
### 2. Optimize Return Data
```typescript
// Only return what you need
const results = await env.VECTORIZE_INDEX.query(queryVector, {
topK: 5,
  returnValues: false,   // Don't return 768 floats per result
  returnMetadata: 'all', // Return all metadata fields
});
```
### 3. Use Filters
```typescript
// Narrow search scope with metadata filters
const results = await env.VECTORIZE_INDEX.query(queryVector, {
topK: 5,
filter: {
category: 'documentation',
published_at: { $gte: lastWeek }
}
});
```
## Error Handling
```typescript
async function generateEmbedding(text: string, env: Env): Promise<number[]> {
try {
const response = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: text.trim().slice(0, 2000) // Truncate to model limits
});
if (!response?.data?.[0]) {
throw new Error('No embedding returned from Workers AI');
}
if (response.data[0].length !== 768) {
throw new Error(`Expected 768 dimensions, got ${response.data[0].length}`);
}
return response.data[0];
} catch (error) {
console.error('Workers AI embedding error:', error);
throw new Error(`Failed to generate embedding: ${error instanceof Error ? error.message : 'Unknown error'}`);
}
}
```
## Testing Locally
```bash
# Install dependencies
npm install
# Run dev server
npx wrangler dev
# Test indexing
curl -X POST http://localhost:8787/index \
-H "Content-Type: application/json" \
-d '{"documents":[{"id":"test-1","title":"Test","content":"Test content"}]}'
# Test search
curl -X POST http://localhost:8787/search \
-H "Content-Type: application/json" \
-d '{"query":"test","topK":5}'
```
## Deployment
```bash
# Deploy to production
npx wrangler deploy
# View logs
npx wrangler tail
```
## Common Issues
### "Embedding dimensions don't match"
- **Cause**: Index created with wrong dimensions
- **Fix**: Ensure index has 768 dimensions for bge-base-en-v1.5
### "Text too long for model"
- **Cause**: Input text exceeds ~2000 characters
- **Fix**: Truncate or chunk text before embedding
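A simple fixed-window chunker handles the "text too long" case. The 2000-character limit comes from the model notes above; the overlap value is an arbitrary choice to preserve context across chunk boundaries:

```typescript
// Split long text into overlapping chunks that each fit the
// ~2000-character input limit of bge-base-en-v1.5.
function chunkText(text: string, maxChars = 2000, overlap = 200): string[] {
  if (maxChars <= overlap) throw new Error('maxChars must exceed overlap');
  const chunks: string[] = [];
  let start = 0;
  while (start < text.length) {
    chunks.push(text.slice(start, start + maxChars));
    if (start + maxChars >= text.length) break;
    start += maxChars - overlap;
  }
  return chunks;
}
```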
### "Rate limiting"
- **Cause**: Too many concurrent requests
- **Fix**: Workers AI scales automatically, but add retry logic for safety
## See Also
- [Main Skill Documentation](../SKILL.md)
- [RAG Chat Template](../templates/rag-chat.ts)
- [Embedding Models Reference](../references/embedding-models.md)
- [Workers AI Docs](https://developers.cloudflare.com/workers-ai/models/bge-base-en-v1.5/)

# Metadata Filtering Guide
Complete reference for metadata indexes and filtering in Vectorize.
## Overview
Metadata allows you to:
- Store additional data alongside vectors (up to 10 KiB per vector)
- Filter query results based on metadata properties
- Narrow search scope without re-indexing
## Metadata Indexes
**⚠️ CRITICAL**: Metadata indexes MUST be created BEFORE inserting vectors!
Vectors inserted before a metadata index exists won't be filterable on that property.
### Creating Metadata Indexes
```bash
npx wrangler vectorize create-metadata-index <index-name> \
--property-name=<property> \
--type=<type>
```
**Types**: `string`, `number`, `boolean`
**Limits**:
- Max **10 metadata indexes** per Vectorize index
- **String**: First 64 bytes indexed (UTF-8 boundaries)
- **Number**: Float64 precision
- **Boolean**: true/false
### Example Setup
```bash
# Create index
npx wrangler vectorize create docs-search --dimensions=768 --metric=cosine
# Create metadata indexes IMMEDIATELY
npx wrangler vectorize create-metadata-index docs-search \
--property-name=category --type=string
npx wrangler vectorize create-metadata-index docs-search \
--property-name=published_at --type=number
npx wrangler vectorize create-metadata-index docs-search \
--property-name=verified --type=boolean
# Verify
npx wrangler vectorize list-metadata-index docs-search
```
## Metadata Schema
### Valid Metadata Keys
```typescript
// ✅ Valid keys
metadata: {
category: 'docs',
title: 'Getting Started',
published_at: 1704067200,
verified: true,
nested: { allowed: true }
}
// ❌ Invalid keys
metadata: {
'': 'value', // Empty key
'user.name': 'John', // Contains dot (reserved for nesting)
'$admin': true, // Starts with $
'key"quoted': 1 // Contains "
}
```
**Key restrictions**:
- Cannot be empty
- Cannot contain `.` (dot) - reserved for nested access
- Cannot contain `"` (double quote)
- Cannot start with `$` (dollar sign)
- Max 512 characters
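These restrictions can be checked before upsert with a small guard. The helper below encodes the rules above; the name is ours, not a Vectorize API:

```typescript
// Returns true if a metadata key satisfies the documented rules:
// non-empty, no '.', no '"', does not start with '$', max 512 chars.
function isValidMetadataKey(key: string): boolean {
  return (
    key.length > 0 &&
    key.length <= 512 &&
    !key.includes('.') &&
    !key.includes('"') &&
    !key.startsWith('$')
  );
}
```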
### Nested Metadata
Use dot notation for nested properties:
```typescript
// Store nested metadata
metadata: {
author: {
id: 'user123',
name: 'John Doe',
verified: true
}
}
// Filter with dot notation
filter: { 'author.verified': true }
```

Create a metadata index for the nested property using its dot-separated path:

```bash
npx wrangler vectorize create-metadata-index docs-search \
  --property-name=author.verified \
  --type=boolean
```
## Filter Operators
### Equality
```typescript
// Implicit $eq
filter: { category: 'documentation' }
// Explicit $eq
filter: { category: { $eq: 'documentation' } }
```
### Not Equals
```typescript
filter: { status: { $ne: 'archived' } }
```
### In Array
```typescript
filter: { category: { $in: ['docs', 'tutorials', 'guides'] } }
```
### Not In Array
```typescript
filter: { status: { $nin: ['archived', 'draft', 'deleted'] } }
```
### Less Than
```typescript
filter: { published_at: { $lt: 1735689600 } }
```
### Less Than or Equal
```typescript
filter: { priority: { $lte: 5 } }
```
### Greater Than
```typescript
filter: { published_at: { $gt: 1704067200 } }
```
### Greater Than or Equal
```typescript
filter: { score: { $gte: 0.8 } }
```
## Range Queries
### Number Ranges
```typescript
// Documents published in 2024
filter: {
published_at: {
$gte: 1704067200, // >= Jan 1, 2024
$lt: 1735689600 // < Jan 1, 2025
}
}
// Scores between 0.7 and 0.9
filter: {
quality_score: {
$gte: 0.7,
$lte: 0.9
}
}
```
### String Ranges (Prefix Search)
```typescript
// URLs starting with /docs/workers/
filter: {
url: {
$gte: '/docs/workers/',
    $lt: '/docs/workers0'    // '0' is the next char code after '/', bounding the prefix
}
}
// IDs starting with 'user-2024'
filter: {
id: {
$gte: 'user-2024',
$lt: 'user-2025'
}
}
```
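The prefix trick generalizes: use the prefix itself as the inclusive lower bound and bump its last character's code point for the exclusive upper bound. A sketch under that assumption (the helper name is ours, not a Vectorize API; it assumes ASCII-ish ordering, which holds for slug-style values):

```typescript
// Build a {$gte, $lt} pair that matches strings starting with `prefix`,
// by incrementing the char code of the last character.
function prefixRange(prefix: string): { $gte: string; $lt: string } {
  if (prefix.length === 0) throw new Error('prefix must be non-empty');
  const last = prefix.charCodeAt(prefix.length - 1);
  return {
    $gte: prefix,
    $lt: prefix.slice(0, -1) + String.fromCharCode(last + 1),
  };
}
```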
## Combined Filters
Multiple conditions are combined with implicit **AND**:
```typescript
filter: {
category: 'documentation', // AND
language: 'en', // AND
published: true, // AND
published_at: { $gte: 1704067200 } // AND
}
```
**No OR operator** - for OR logic, make multiple queries.
## Complex Examples
### Multi-field with Ranges
```typescript
filter: {
category: { $in: ['docs', 'tutorials'] },
language: 'en',
status: { $ne: 'archived' },
published_at: {
$gte: 1704067200,
$lt: 1735689600
},
'author.verified': true
}
```
### Boolean and String
```typescript
filter: {
published: true,
featured: false,
category: 'documentation',
language: { $in: ['en', 'es', 'fr'] }
}
```
### Nested with Range
```typescript
filter: {
'metrics.views': { $gte: 1000 },
'metrics.rating': { $gte: 4.5 },
'author.verified': true,
published_at: { $gt: Date.now() / 1000 - 86400 * 30 } // Last 30 days
}
```
## Cardinality Considerations
**Cardinality** = Number of unique values in a field
### Low Cardinality (Good for Filtering)
```typescript
// Few unique values - efficient
category: 'docs' | 'tutorials' | 'guides' // ~3-10 values
language: 'en' | 'es' | 'fr' // ~5-20 values
published: true | false // 2 values
```
### High Cardinality (Avoid in Range Queries)
```typescript
// Many unique values - can impact performance
user_id: 'uuid-v4-...' // Millions of unique values
timestamp_ms: 1704067200123 // Unique per millisecond
email: 'user@example.com' // Unique per user
```
### Performance Impact
**Range queries** on high-cardinality fields can be slow:
```typescript
// ❌ Slow: High cardinality range
filter: {
user_id: { // Millions of unique UUIDs
$gte: '00000000-0000-0000-0000-000000000000',
$lt: 'zzzzzzzz-zzzz-zzzz-zzzz-zzzzzzzzzzzz'
}
}
// ✅ Better: Low cardinality range
filter: {
published_at: { // Timestamps in seconds
$gte: 1704067200,
$lt: 1735689600
}
}
```
### Best Practices
1. **Use seconds, not milliseconds** for timestamps
2. **Categorize high-cardinality fields** (e.g., user → user_tier)
3. **Limit range span** to avoid scanning millions of values
4. **Use $eq for high cardinality**, not ranges
## Filter Size Limit
**Max 2048 bytes** (compact JSON representation)
```typescript
// Check filter size in bytes ('.length' on a string counts UTF-16 code units, not bytes)
const filterBytes = new TextEncoder().encode(JSON.stringify(filter)).length;
if (filterBytes > 2048) {
  console.error('Filter too large!');
}
```
If filter is too large:
- Split into multiple queries
- Simplify conditions
- Use namespace filtering first
## Namespace vs Metadata Filtering
### Namespace Filtering
```typescript
// Insert with namespace
await env.VECTORIZE_INDEX.upsert([{
id: 'doc-1',
values: embedding,
namespace: 'customer-abc123', // Partition key
metadata: { type: 'support' }
}]);
// Query with namespace (applied FIRST)
const results = await env.VECTORIZE_INDEX.query(queryVector, {
namespace: 'customer-abc123',
filter: { type: 'support' }
});
```
### When to Use Each
| Use Namespace | Use Metadata |
|---------------|--------------|
| Multi-tenant isolation | Fine-grained filtering |
| Customer segmentation | Category filtering |
| Environment (dev/prod) | Date ranges |
| Large partitions | Boolean flags |
| Applied BEFORE metadata | Applied AFTER namespace |
### Combined Strategy
```typescript
// Namespace: Customer isolation
namespace: 'customer-abc123'
// Metadata: Detailed filtering
filter: {
category: 'support_tickets',
status: { $ne: 'closed' },
priority: { $gte: 3 },
created_at: { $gte: Date.now() / 1000 - 86400 * 7 } // Last 7 days
}
```
## Common Patterns
### Published Content Only
```typescript
filter: {
published: true,
status: { $ne: 'archived' }
}
```
### Recent Documents
```typescript
const oneWeekAgo = Math.floor(Date.now() / 1000) - (7 * 24 * 60 * 60);
filter: {
published_at: { $gte: oneWeekAgo }
}
```
### Multi-Language Support
```typescript
filter: {
language: { $in: ['en', 'es', 'fr'] },
published: true
}
```
### Verified Authors Only
```typescript
filter: {
'author.verified': true,
'author.active': true
}
```
### Time-Based Content
```typescript
// Content from specific quarter
filter: {
published_at: {
$gte: 1704067200, // Q1 2024 start
$lt: 1711929600 // Q1 2024 end
}
}
```
## Debugging Filters
### Test Filter Syntax
```bash
npx wrangler vectorize query docs-search \
--vector="[0.1,0.2,...]" \
--filter='{"category":"docs","published":true}' \
--top-k=5
```
### Check Metadata Indexes
```bash
npx wrangler vectorize list-metadata-index docs-search
```
### Verify Metadata Structure
```typescript
const vectors = await env.VECTORIZE_INDEX.getByIds(['doc-1']);
console.log(vectors[0].metadata);
```
## Error Messages
| Error | Cause | Solution |
|-------|-------|----------|
| "Metadata property not indexed" | No metadata index for property | Create metadata index |
| "Filter exceeds 2048 bytes" | Filter JSON too large | Simplify or split queries |
| "Invalid metadata key" | Key contains `.`, `"`, or starts with `$` | Rename metadata key |
| "Filter must be non-empty object" | Empty filter `{}` | Remove filter or add conditions |
## See Also
- [Vector Operations](./vector-operations.md)
- [Wrangler Commands](./wrangler-commands.md)
- [Index Operations](./index-operations.md)
- [Official Docs](https://developers.cloudflare.com/vectorize/reference/metadata-filtering/)

# Vector Operations Guide
Complete guide for inserting, querying, updating, and deleting vectors in Vectorize.
## Insert vs Upsert
**Critical difference**:
- **insert()**: Keeps the FIRST vector if ID already exists
- **upsert()**: Overwrites with the LATEST vector if ID already exists
**Use upsert() for updates!**
```typescript
// ❌ Wrong: Updates won't work!
await env.VECTORIZE_INDEX.insert([
{ id: 'doc-1', values: newEmbedding, metadata: { version: 2 } }
]);
// If doc-1 exists, this does nothing!
// ✅ Right: Use upsert for updates
await env.VECTORIZE_INDEX.upsert([
{ id: 'doc-1', values: newEmbedding, metadata: { version: 2 } }
]);
// This WILL update doc-1
```
## Vector Format
```typescript
interface VectorizeVector {
id: string; // Unique identifier
values: number[] | Float32Array | Float64Array; // Embedding
namespace?: string; // Partition key (optional)
metadata?: Record<string, any>; // Filterable data (optional)
}
```
### ID Guidelines
- **String** type
- **Unique** within namespace
- **Descriptive**: `doc-123`, `user-456-profile`, `chunk-789`
- **Avoid special chars**: Use alphanumeric + dashes
- **Max length**: 64 bytes per ID (check the Vectorize limits page for current values)
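A small sanitizer following these guidelines (the function and its slug rules are our own convention, not a Vectorize requirement):

```typescript
// Normalize an arbitrary string into a slug-style vector ID:
// lowercase alphanumerics and dashes, trimmed, capped at 64 chars.
function toVectorId(raw: string): string {
  return raw
    .toLowerCase()
    .replace(/[^a-z0-9-]+/g, '-')
    .replace(/^-+|-+$/g, '')
    .slice(0, 64);
}
```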
### Values (Embeddings)
Accepted types:
- `number[]` - Most common (from AI APIs)
- `Float32Array` - Memory efficient
- `Float64Array` - High precision (stored as Float32)
**Must match index dimensions exactly!**
```typescript
// ❌ Wrong dimensions
await env.VECTORIZE_INDEX.upsert([{
id: '1',
values: [0.1, 0.2] // Index expects 768!
}]);
// Error: "Vector dimensions do not match"
// ✅ Correct dimensions
await env.VECTORIZE_INDEX.upsert([{
id: '1',
values: embedding.data[0] // 768 dimensions
}]);
```
## Inserting Vectors
### Single Vector
```typescript
await env.VECTORIZE_INDEX.upsert([{
id: 'doc-1',
values: [0.1, 0.2, 0.3, ...], // 768 dims
metadata: {
title: 'Getting Started',
category: 'docs'
}
}]);
```
### Batch Insert (Recommended)
```typescript
const vectors = documents.map((doc, i) => ({
id: `doc-${doc.id}`,
values: embeddings.data[i],
metadata: {
title: doc.title,
content: doc.content,
category: doc.category
}
}));
// Insert in batches of 100-1000
const batchSize = 100;
for (let i = 0; i < vectors.length; i += batchSize) {
const batch = vectors.slice(i, i + batchSize);
await env.VECTORIZE_INDEX.upsert(batch);
}
```
### With Namespace
```typescript
await env.VECTORIZE_INDEX.upsert([{
id: 'ticket-123',
values: embedding,
namespace: 'customer-abc', // Isolate by customer
metadata: { type: 'support_ticket' }
}]);
```
## Querying Vectors
### Basic Query
```typescript
const results = await env.VECTORIZE_INDEX.query(queryVector, {
topK: 5,
});
// Returns: { matches: [...], count: number }
```
### Query with Options
```typescript
const results = await env.VECTORIZE_INDEX.query(queryVector, {
topK: 10, // Return top 10 matches
returnValues: false, // Don't return vector values (saves bandwidth)
returnMetadata: 'all', // Return all metadata
namespace: 'customer-abc', // Query specific namespace
filter: { // Metadata filtering
category: 'documentation',
published: true
}
});
```
### Query Parameters
| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| `topK` | number | 10 | Number of results to return (max 100; max 20 when `returnValues` is true or `returnMetadata` is `'all'`) |
| `returnValues` | boolean | false | Include vector values in response |
| `returnMetadata` | string | `'none'` | `'none'`, `'indexed'`, or `'all'` |
| `namespace` | string | undefined | Query specific namespace only |
| `filter` | object | undefined | Metadata filter conditions |
### Query Results
```typescript
interface VectorizeMatches {
matches: Array<{
id: string;
score: number; // Similarity score (higher = closer for cosine/dot-product; lower = closer for euclidean)
values?: number[]; // If returnValues: true
metadata?: Record<string, any>; // If returnMetadata: 'all' or 'indexed'
namespace?: string;
}>;
count: number; // Total matches returned
}
```
### Score Interpretation
**Cosine metric** (most common):
- `1.0` = Identical vectors
- `0.9-1.0` = Nearly identical
- `0.5-0.9` = Similar
- `0.0-0.5` = Somewhat related
- `< 0.0` = Opposite direction
- `-1.0` = Completely opposite
**Euclidean metric** (distance; lower = more similar):
- `0.0` = Identical
- `< 1.0` = Very similar
- `1.0-10.0` = Similar
- `> 10.0` = Different
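For the cosine metric, the returned score is the cosine similarity of the query and stored vectors. If you want to sanity-check scores or re-rank a handful of results locally, it can be computed directly. A minimal sketch:

```typescript
// Cosine similarity between two vectors of equal length.
// Matches the score range described above: 1 = same direction, -1 = opposite.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("Vector dimensions do not match");
  let dot = 0, normA = 0, normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}
```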
## Metadata Filtering in Queries
See [Metadata Guide](./metadata-guide.md) for complete reference.
### Common Patterns
```typescript
// Exact match
filter: { category: 'docs' }
// Not equals
filter: { status: { $ne: 'archived' } }
// In array
filter: { category: { $in: ['docs', 'tutorials'] } }
// Range (timestamp)
filter: {
published_at: {
$gte: 1704067200,
$lt: 1735689600
}
}
// Multiple conditions (AND)
filter: {
category: 'docs',
language: 'en',
published: true
}
// Nested metadata
filter: { 'author.verified': true }
```
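Timestamp range filters like the one above are easy to get wrong when mixing millisecond `Date` values with the Unix-second timestamps stored in metadata. A hypothetical helper (`publishedBetween` is illustrative, and assumes `published_at` is stored in seconds):

```typescript
// Build a [start, end) range filter on `published_at`,
// converting JS Dates (milliseconds) to Unix seconds.
function publishedBetween(start: Date, end: Date) {
  return {
    published_at: {
      $gte: Math.floor(start.getTime() / 1000),
      $lt: Math.floor(end.getTime() / 1000),
    },
  };
}

// Usage:
// filter: { category: 'docs', ...publishedBetween(new Date('2024-01-01'), new Date('2025-01-01')) }
```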
## Retrieving Vectors
### List Vector IDs
```typescript
const response = await env.VECTORIZE_INDEX.listVectors({
limit: 100,
cursor: null, // Or cursor from previous response
});
// Returns: { vectors: [{ id: '...' }, ...], cursor: '...' }
```
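Draining every ID means following the cursor until it is exhausted. A sketch of the loop, written against a generic page function so it can stand in for `env.VECTORIZE_INDEX.listVectors` (the response shape is assumed from the example above):

```typescript
// Collect all vector IDs by following pagination cursors until none remain.
async function listAllIds(
  listPage: (opts: { limit: number; cursor?: string }) => Promise<{
    vectors: { id: string }[];
    cursor?: string;
  }>
): Promise<string[]> {
  const ids: string[] = [];
  let cursor: string | undefined;
  do {
    const page = await listPage({ limit: 100, cursor });
    ids.push(...page.vectors.map((v) => v.id));
    cursor = page.cursor; // undefined/empty when the last page is reached
  } while (cursor);
  return ids;
}
```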
### Get Specific Vectors
```typescript
const vectors = await env.VECTORIZE_INDEX.getByIds([
'doc-1',
'doc-2',
'doc-3'
]);
// Returns: Array<VectorizeVector>
```
## Deleting Vectors
### Delete by IDs
```typescript
await env.VECTORIZE_INDEX.deleteByIds([
'doc-1',
'doc-2',
'old-chunk-123'
]);
```
### Delete All Vectors for a Document
```typescript
// If using doc-{id}-chunk-{index} pattern
const docId = 'doc-123';
const allVectors = await env.VECTORIZE_INDEX.listVectors({ limit: 1000 });
const chunkIds = allVectors.vectors
.filter(v => v.id.startsWith(`${docId}-chunk-`))
.map(v => v.id);
if (chunkIds.length > 0) {
await env.VECTORIZE_INDEX.deleteByIds(chunkIds);
}
```
## Performance Tips
### Batch Operations
**Good** - Batch insert/upsert:
```typescript
await env.VECTORIZE_INDEX.upsert(arrayOf100Vectors);
```
**Bad** - Individual operations:
```typescript
for (const vector of vectors) {
await env.VECTORIZE_INDEX.upsert([vector]); // Slow!
}
```
### Optimal Batch Sizes
- **Insert/Upsert**: 100-1000 vectors per batch
- **Delete**: 100-500 IDs per batch
- **Query**: topK = 3-10 for best latency
### Return Only What You Need
```typescript
// ✅ Efficient - no vector values
const results = await env.VECTORIZE_INDEX.query(queryVector, {
topK: 5,
returnValues: false, // Saves bandwidth
returnMetadata: 'all' // Only metadata needed
});
// ❌ Wasteful - returns 768 floats per match
const results = await env.VECTORIZE_INDEX.query(queryVector, {
topK: 5,
returnValues: true, // Unnecessary if not using values
returnMetadata: 'all'
});
```
### Namespace for Multi-Tenancy
Use namespaces instead of separate indexes:
```typescript
// ✅ One index, isolated by namespace
await env.VECTORIZE_INDEX.upsert([{
id: 'doc-1',
values: embedding,
namespace: `customer-${customerId}`,
metadata: { ... }
}]);
// Query only customer's data
const results = await env.VECTORIZE_INDEX.query(queryVector, {
namespace: `customer-${customerId}`,
topK: 5
});
```
## Common Errors
### "Vector dimensions do not match"
```typescript
// Check your embedding dimensions
const embedding = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: 'test'
});
console.log(embedding.data[0].length); // Should match index dimensions (768)
```
### "Metadata property not indexed"
```typescript
// Create metadata index first!
// npx wrangler vectorize create-metadata-index my-index --property-name=category --type=string
// Then you can filter
const results = await env.VECTORIZE_INDEX.query(queryVector, {
filter: { category: 'docs' } // Now works!
});
```
### "Insert vs Upsert not working"
```typescript
// Use upsert for updates, not insert
await env.VECTORIZE_INDEX.upsert([{ // ✅ Use upsert
id: 'existing-doc',
values: newEmbedding,
metadata: { version: 2 }
}]);
```
## See Also
- [Metadata Guide](./metadata-guide.md)
- [Wrangler Commands](./wrangler-commands.md)
- [Embedding Models](./embedding-models.md)

# Wrangler Vectorize Commands Reference
Complete CLI reference for managing Cloudflare Vectorize indexes with Wrangler.
**Minimum Version Required**: Wrangler 3.71.0+
## Index Management
### create
Create a new Vectorize index.
**⚠️ CRITICAL**: Dimensions and metric cannot be changed after creation!
```bash
npx wrangler vectorize create <INDEX_NAME> \
--dimensions=<NUMBER> \
--metric=<METRIC> \
[--description=<TEXT>]
```
**Parameters**:
- `INDEX_NAME` (required): Name of the index (lowercase, alphanumeric, dashes, max 32 chars)
- `--dimensions` (required): Vector width (768 for Workers AI bge-base, 1536 for OpenAI small, 3072 for OpenAI large)
- `--metric` (required): Distance metric (`cosine`, `euclidean`, or `dot-product`)
- `--description` (optional): Human-readable description
**Examples**:
```bash
# Workers AI @cf/baai/bge-base-en-v1.5 (768 dimensions)
npx wrangler vectorize create docs-search \
--dimensions=768 \
--metric=cosine \
--description="Documentation semantic search"
# OpenAI text-embedding-3-small (1536 dimensions)
npx wrangler vectorize create product-recs \
--dimensions=1536 \
--metric=cosine
# OpenAI text-embedding-3-large (3072 dimensions)
npx wrangler vectorize create high-accuracy-index \
--dimensions=3072 \
--metric=euclidean
```
### list
List all Vectorize indexes in your account.
```bash
npx wrangler vectorize list
```
**Output**: Table with index names, dimensions, and distance metrics.
### get
Get details about a specific index.
```bash
npx wrangler vectorize get <INDEX_NAME>
```
**Returns**: Index configuration (name, dimensions, metric, description).
### info
Get additional information about an index.
```bash
npx wrangler vectorize info <INDEX_NAME>
```
**Returns**: Vector count, last processed mutation, index status.
### delete
Delete a Vectorize index (irreversible!).
```bash
npx wrangler vectorize delete <INDEX_NAME> [--force]
```
**Parameters**:
- `--force` (optional): Skip confirmation prompt
**Example**:
```bash
npx wrangler vectorize delete old-index --force
```
## Metadata Indexes
**⚠️ CRITICAL**: Create metadata indexes BEFORE inserting vectors! Vectors added before a metadata index exists won't be filterable on that property.
### create-metadata-index
Enable metadata filtering on a specific property.
```bash
npx wrangler vectorize create-metadata-index <INDEX_NAME> \
--property-name=<PROPERTY> \
--type=<TYPE>
```
**Parameters**:
- `INDEX_NAME` (required): Vectorize index name
- `--property-name` (required): Metadata field name
- `--type` (required): Data type (`string`, `number`, or `boolean`)
**Limits**:
- Max 10 metadata indexes per Vectorize index
- String indexes: First 64 bytes (UTF-8)
- Number indexes: Float64 precision
**Examples**:
```bash
# String metadata (category filtering)
npx wrangler vectorize create-metadata-index docs-search \
--property-name=category \
--type=string
# Number metadata (timestamp filtering)
npx wrangler vectorize create-metadata-index docs-search \
--property-name=published_at \
--type=number
# Boolean metadata (published status)
npx wrangler vectorize create-metadata-index docs-search \
--property-name=published \
--type=boolean
# Nested metadata (use dot notation)
npx wrangler vectorize create-metadata-index docs-search \
  --property-name=author.verified \
  --type=boolean
```
### list-metadata-index
List all metadata indexes for an index.
```bash
npx wrangler vectorize list-metadata-index <INDEX_NAME>
```
**Output**: Table with property names and types.
### delete-metadata-index
Disable metadata filtering on a property.
```bash
npx wrangler vectorize delete-metadata-index <INDEX_NAME> \
--property-name=<PROPERTY>
```
**Example**:
```bash
npx wrangler vectorize delete-metadata-index docs-search \
--property-name=category
```
## Vector Operations
### insert
Insert vectors from a file (NDJSON format).
```bash
npx wrangler vectorize insert <INDEX_NAME> \
--file=<PATH>
```
**File Format** (NDJSON - one JSON object per line):
```json
{"id":"1","values":[0.1,0.2,0.3],"metadata":{"title":"Doc 1"}}
{"id":"2","values":[0.4,0.5,0.6],"metadata":{"title":"Doc 2"}}
```
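If you generate vectors programmatically, producing NDJSON is just one `JSON.stringify` per record joined by newlines. A minimal TypeScript sketch (the `NdjsonVector` type and `toNdjson` name are illustrative):

```typescript
// Serialize vector records into the NDJSON shape shown above:
// one JSON object per line, no trailing commas or wrapping array.
interface NdjsonVector {
  id: string;
  values: number[];
  metadata?: Record<string, unknown>;
}

function toNdjson(vectors: NdjsonVector[]): string {
  return vectors.map((v) => JSON.stringify(v)).join("\n");
}
```

Write the result to a file (e.g. `vectors.ndjson`) and pass it to `wrangler vectorize insert --file=...`.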
**Example**:
```bash
npx wrangler vectorize insert docs-search --file=vectors.ndjson
```
### query
Query vectors directly from CLI.
```bash
npx wrangler vectorize query <INDEX_NAME> \
--vector="[<COMMA_SEPARATED_FLOATS>]" \
[--top-k=<NUMBER>] \
[--return-metadata=<MODE>] \
[--namespace=<NAMESPACE>] \
[--filter=<JSON>]
```
**Parameters**:
- `--vector` (required): Query vector as JSON array
- `--top-k` (optional): Number of results (default: 10)
- `--return-metadata` (optional): `none`, `indexed`, or `all` (default: `none`)
- `--namespace` (optional): Query specific namespace
- `--filter` (optional): Metadata filter as JSON string
**Examples**:
```bash
# Simple query
npx wrangler vectorize query docs-search \
--vector="[0.1,0.2,0.3,...]" \
--top-k=5 \
--return-metadata=all
# Query with filter
npx wrangler vectorize query docs-search \
--vector="[0.1,0.2,...]" \
--filter='{"category":"documentation","published":true}' \
--top-k=3
# Query specific namespace
npx wrangler vectorize query docs-search \
--vector="[0.1,0.2,...]" \
--namespace="customer-123" \
--top-k=5
```
### list-vectors
List vector IDs in a paginated manner.
```bash
npx wrangler vectorize list-vectors <INDEX_NAME> \
[--count=<NUMBER>] \
[--cursor=<CURSOR>]
```
**Parameters**:
- `--count` (optional): Vectors per page (1-1000, default: 100)
- `--cursor` (optional): Pagination cursor from previous response
**Example**:
```bash
# Get first 100 vector IDs
npx wrangler vectorize list-vectors docs-search --count=100
# Get next page (use cursor from previous response)
npx wrangler vectorize list-vectors docs-search \
--count=100 \
--cursor="abc123..."
```
### get-vectors
Fetch specific vectors by ID.
```bash
npx wrangler vectorize get-vectors <INDEX_NAME> \
--ids=<ID1,ID2,ID3>
```
**Example**:
```bash
npx wrangler vectorize get-vectors docs-search \
--ids="doc-1,doc-2,doc-3"
```
### delete-vectors
Delete vectors by ID.
```bash
npx wrangler vectorize delete-vectors <INDEX_NAME> \
--ids=<ID1,ID2,ID3>
```
**Example**:
```bash
npx wrangler vectorize delete-vectors docs-search \
--ids="old-doc-1,old-doc-2"
```
## Common Workflows
### Initial Setup
```bash
# 1. Create index
npx wrangler vectorize create my-index \
--dimensions=768 \
--metric=cosine
# 2. Create metadata indexes (BEFORE inserting!)
npx wrangler vectorize create-metadata-index my-index \
--property-name=category --type=string
npx wrangler vectorize create-metadata-index my-index \
--property-name=timestamp --type=number
# 3. Verify metadata indexes
npx wrangler vectorize list-metadata-index my-index
# 4. Now safe to insert vectors (via Worker or CLI)
```
### Bulk Data Import
```bash
# Prepare NDJSON file
cat > vectors.ndjson << 'EOF'
{"id":"doc-1","values":[0.1,0.2,0.3,...],"metadata":{"category":"docs"}}
{"id":"doc-2","values":[0.4,0.5,0.6,...],"metadata":{"category":"tutorials"}}
EOF
# Import
npx wrangler vectorize insert my-index --file=vectors.ndjson
# Verify
npx wrangler vectorize info my-index
```
### Debug / Inspect
```bash
# Check index configuration
npx wrangler vectorize get my-index
# Check vector count
npx wrangler vectorize info my-index
# List some vector IDs
npx wrangler vectorize list-vectors my-index --count=10
# Inspect specific vectors
npx wrangler vectorize get-vectors my-index --ids="doc-1,doc-2"
# Test query
npx wrangler vectorize query my-index \
--vector="[0.1,0.2,...]" \
--top-k=3 \
--return-metadata=all
```
### Cleanup
```bash
# Delete specific vectors
npx wrangler vectorize delete-vectors my-index --ids="doc-1,doc-2"
# Delete entire index (irreversible!)
npx wrangler vectorize delete my-index --force
```
## Tips & Best Practices
1. **Always use latest Wrangler**: `npx wrangler@latest vectorize ...`
2. **Create metadata indexes first**: Before any vector insertion
3. **Test with small data**: Use `--count=10` when listing/testing
4. **Batch operations**: Use Workers for bulk operations (faster than CLI)
5. **Monitor vector count**: Use `info` command to track index size
6. **Verify before delete**: Always check with `get` before `delete`
## Error Messages
| Error | Cause | Solution |
|-------|-------|----------|
| "Wrangler version 3.71.0 required" | Old Wrangler | Update: `npm install -g wrangler@latest` |
| "Vector dimensions do not match" | Wrong embedding size | Check model output dimensions |
| "Metadata property not indexed" | Metadata index missing | Create metadata index before querying |
| "Index name already exists" | Duplicate name | Use different name or delete old index |
| "Invalid filter syntax" | Malformed JSON filter | Check JSON syntax and operators |
## See Also
- [Index Operations](./index-operations.md)
- [Vector Operations](./vector-operations.md)
- [Metadata Guide](./metadata-guide.md)
- [Official Wrangler Docs](https://developers.cloudflare.com/workers/wrangler/commands/#vectorize)

**File**: `templates/basic-search.ts`
/**
* Basic Semantic Search with Cloudflare Vectorize + Workers AI
*
* Use case: Simple semantic search over documents, FAQs, or product catalog
*
* Features:
* - Workers AI embeddings (@cf/baai/bge-base-en-v1.5)
* - Vectorize query with topK results
* - Metadata filtering
* - Simple JSON API
*/
export interface Env {
VECTORIZE_INDEX: VectorizeIndex;
AI: Ai;
}
interface SearchRequest {
query: string;
topK?: number;
filter?: Record<string, any>;
namespace?: string;
}
interface SearchResult {
id: string;
score: number;
metadata: Record<string, any>;
}
export default {
async fetch(request: Request, env: Env, ctx: ExecutionContext): Promise<Response> {
// Handle CORS preflight
if (request.method === 'OPTIONS') {
return new Response(null, {
headers: {
'Access-Control-Allow-Origin': '*',
'Access-Control-Allow-Methods': 'GET, POST, OPTIONS',
'Access-Control-Allow-Headers': 'Content-Type',
},
});
}
const url = new URL(request.url);
// Route: POST /search - Semantic search endpoint
if (url.pathname === '/search' && request.method === 'POST') {
try {
const body = await request.json() as SearchRequest;
const { query, topK = 5, filter, namespace } = body;
if (!query) {
return Response.json(
{ error: 'Missing required field: query' },
{ status: 400 }
);
}
// Generate embedding for search query
const queryEmbedding = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: query,
});
// Search vector database
const results = await env.VECTORIZE_INDEX.query(queryEmbedding.data[0], {
topK,
filter,
namespace,
returnMetadata: 'all',
returnValues: false, // Save bandwidth
});
// Format results
const searchResults: SearchResult[] = results.matches.map((match) => ({
id: match.id,
score: match.score,
metadata: match.metadata || {},
}));
return Response.json({
query,
results: searchResults,
count: results.count,
}, {
headers: { 'Access-Control-Allow-Origin': '*' },
});
} catch (error) {
console.error('Search error:', error);
return Response.json(
{
error: 'Search failed',
message: error instanceof Error ? error.message : 'Unknown error',
},
{ status: 500 }
);
}
}
// Route: POST /index - Add document to index
if (url.pathname === '/index' && request.method === 'POST') {
try {
const body = await request.json() as {
id: string;
content: string;
metadata?: Record<string, any>;
namespace?: string;
};
if (!body.id || !body.content) {
return Response.json(
{ error: 'Missing required fields: id, content' },
{ status: 400 }
);
}
// Generate embedding for document
const embedding = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: body.content,
});
// Upsert vector (overwrites if exists)
await env.VECTORIZE_INDEX.upsert([
{
id: body.id,
values: embedding.data[0],
namespace: body.namespace,
metadata: {
...body.metadata,
content: body.content,
indexed_at: Date.now(),
},
},
]);
return Response.json({
success: true,
id: body.id,
message: 'Document indexed successfully',
}, {
headers: { 'Access-Control-Allow-Origin': '*' },
});
} catch (error) {
console.error('Index error:', error);
return Response.json(
{
error: 'Indexing failed',
message: error instanceof Error ? error.message : 'Unknown error',
},
{ status: 500 }
);
}
}
// Route: DELETE /index/:id - Remove document from index
if (url.pathname.startsWith('/index/') && request.method === 'DELETE') {
try {
const id = url.pathname.split('/')[2];
if (!id) {
return Response.json(
{ error: 'Missing document ID' },
{ status: 400 }
);
}
await env.VECTORIZE_INDEX.deleteByIds([id]);
return Response.json({
success: true,
id,
message: 'Document removed from index',
}, {
headers: { 'Access-Control-Allow-Origin': '*' },
});
} catch (error) {
console.error('Delete error:', error);
return Response.json(
{
error: 'Delete failed',
message: error instanceof Error ? error.message : 'Unknown error',
},
{ status: 500 }
);
}
}
// Default: API documentation
return Response.json({
name: 'Vectorize Semantic Search API',
endpoints: {
'POST /search': {
description: 'Semantic search over indexed documents',
body: {
query: 'string (required)',
topK: 'number (optional, default: 5)',
filter: 'object (optional)',
namespace: 'string (optional)',
},
example: {
query: 'How do I deploy a Worker?',
topK: 3,
filter: { category: 'documentation' },
},
},
'POST /index': {
description: 'Add or update document in index',
body: {
id: 'string (required)',
content: 'string (required)',
metadata: 'object (optional)',
namespace: 'string (optional)',
},
example: {
id: 'doc-123',
content: 'Cloudflare Workers are serverless functions...',
metadata: { category: 'documentation', author: 'Cloudflare' },
},
},
'DELETE /index/:id': {
description: 'Remove document from index',
example: 'DELETE /index/doc-123',
},
},
});
},
};
/**
* Example Usage:
*
* 1. Index a document:
*
* curl -X POST https://your-worker.workers.dev/index \
* -H "Content-Type: application/json" \
* -d '{
* "id": "doc-1",
* "content": "Cloudflare Workers allow you to deploy serverless code globally.",
* "metadata": { "category": "docs", "section": "workers" }
* }'
*
* 2. Search:
*
* curl -X POST https://your-worker.workers.dev/search \
* -H "Content-Type: application/json" \
* -d '{
* "query": "How do I deploy serverless functions?",
* "topK": 5,
* "filter": { "category": "docs" }
* }'
*
* 3. Delete:
*
* curl -X DELETE https://your-worker.workers.dev/index/doc-1
*/

/**
* Document Ingestion Pipeline for Cloudflare Vectorize
*
* Use case: Process large documents, chunk text, generate embeddings, and index
*
* Features:
* - Intelligent text chunking (sentence-based)
* - Batch embedding generation
* - Metadata tagging (doc_id, chunk_index, timestamps)
* - R2 integration for document storage (optional)
* - Progress tracking and error handling
*/
export interface Env {
VECTORIZE_INDEX: VectorizeIndex;
AI: Ai;
DOCUMENTS_BUCKET?: R2Bucket; // Optional: Store original documents
}
interface Document {
id: string;
title: string;
content: string;
url?: string;
author?: string;
category?: string;
tags?: string[];
publishedAt?: number;
[key: string]: any;
}
interface ChunkMetadata {
doc_id: string;
doc_title: string;
chunk_index: number;
total_chunks: number;
content: string;
[key: string]: any;
}
/**
* Chunk text into smaller segments while preserving sentence boundaries
*/
function chunkText(text: string, maxChunkSize = 500, overlapSize = 50): string[] {
// Split into sentences (handles . ! ? with spaces)
const sentences = text.match(/[^.!?]+[.!?]+(?:\s|$)/g) || [text];
const chunks: string[] = [];
let currentChunk = '';
for (let i = 0; i < sentences.length; i++) {
const sentence = sentences[i].trim();
// If adding this sentence exceeds max size and we have content, start new chunk
if ((currentChunk + ' ' + sentence).length > maxChunkSize && currentChunk) {
chunks.push(currentChunk.trim());
// Create overlap by including last few words
const words = currentChunk.split(' ');
const overlapWords = words.slice(-Math.floor(overlapSize / 6)); // ~6 chars/word
currentChunk = overlapWords.join(' ') + ' ' + sentence;
} else {
currentChunk += (currentChunk ? ' ' : '') + sentence;
}
}
// Add final chunk
if (currentChunk.trim()) {
chunks.push(currentChunk.trim());
}
return chunks.length > 0 ? chunks : [text];
}
/**
* Batch array into smaller arrays of specified size
*/
function batchArray<T>(array: T[], batchSize: number): T[][] {
const batches: T[][] = [];
for (let i = 0; i < array.length; i += batchSize) {
batches.push(array.slice(i, i + batchSize));
}
return batches;
}
export default {
async fetch(request: Request, env: Env, ctx: ExecutionContext): Promise<Response> {
// Handle CORS
if (request.method === 'OPTIONS') {
return new Response(null, {
headers: {
'Access-Control-Allow-Origin': '*',
'Access-Control-Allow-Methods': 'GET, POST, DELETE, OPTIONS',
'Access-Control-Allow-Headers': 'Content-Type',
},
});
}
const url = new URL(request.url);
// Route: POST /ingest - Process and index document(s)
if (url.pathname === '/ingest' && request.method === 'POST') {
try {
const body = await request.json() as {
documents: Document[];
chunkSize?: number;
overlapSize?: number;
namespace?: string;
storeInR2?: boolean;
};
const {
documents,
chunkSize = 500,
overlapSize = 50,
namespace,
storeInR2 = false,
} = body;
if (!documents || !Array.isArray(documents) || documents.length === 0) {
return Response.json(
{ error: 'Missing or invalid field: documents (non-empty array)' },
{ status: 400 }
);
}
const results = {
success: true,
processed: 0,
totalChunks: 0,
errors: [] as string[],
documentDetails: [] as any[],
};
// Process each document
for (const doc of documents) {
try {
if (!doc.id || !doc.content) {
results.errors.push(`Document missing id or content: ${JSON.stringify(doc)}`);
continue;
}
// Optional: Store original document in R2
if (storeInR2 && env.DOCUMENTS_BUCKET) {
await env.DOCUMENTS_BUCKET.put(
`documents/${doc.id}.json`,
JSON.stringify(doc),
{
httpMetadata: { contentType: 'application/json' },
customMetadata: { title: doc.title, indexed_at: Date.now().toString() },
}
);
}
// Chunk the document
const chunks = chunkText(doc.content, chunkSize, overlapSize);
// Generate embeddings for all chunks (batch)
const embeddings = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: chunks,
});
// Prepare vectors with metadata
const vectors = chunks.map((chunk, index) => ({
id: `${doc.id}-chunk-${index}`,
values: embeddings.data[index],
namespace,
metadata: {
doc_id: doc.id,
doc_title: doc.title,
chunk_index: index,
total_chunks: chunks.length,
content: chunk,
url: doc.url,
author: doc.author,
category: doc.category,
tags: doc.tags,
published_at: doc.publishedAt,
indexed_at: Date.now(),
} as ChunkMetadata,
}));
// Upsert in batches (100 vectors at a time)
const vectorBatches = batchArray(vectors, 100);
for (const batch of vectorBatches) {
await env.VECTORIZE_INDEX.upsert(batch);
}
results.processed++;
results.totalChunks += chunks.length;
results.documentDetails.push({
id: doc.id,
title: doc.title,
chunks: chunks.length,
});
} catch (error) {
const errorMsg = `Failed to process document ${doc.id}: ${
error instanceof Error ? error.message : 'Unknown error'
}`;
console.error(errorMsg);
results.errors.push(errorMsg);
}
}
const statusCode = results.errors.length > 0 ? 207 : 200; // 207 Multi-Status
return Response.json(results, {
status: statusCode,
headers: { 'Access-Control-Allow-Origin': '*' },
});
} catch (error) {
console.error('Ingest error:', error);
return Response.json(
{
error: 'Ingestion failed',
message: error instanceof Error ? error.message : 'Unknown error',
},
{ status: 500 }
);
}
}
// Route: POST /ingest/url - Fetch and ingest from URL (requires Firecrawl or similar)
if (url.pathname === '/ingest/url' && request.method === 'POST') {
try {
const body = await request.json() as {
url: string;
id?: string;
category?: string;
namespace?: string;
};
if (!body.url) {
return Response.json({ error: 'Missing required field: url' }, { status: 400 });
}
// Fetch content (simple fetch - for production use Firecrawl or similar)
const response = await fetch(body.url);
const html = await response.text();
// Simple text extraction (production would use proper HTML parsing)
const text = html
.replace(/<script[^>]*>[\s\S]*?<\/script>/gi, '')
.replace(/<style[^>]*>[\s\S]*?<\/style>/gi, '')
.replace(/<[^>]+>/g, ' ')
.replace(/\s+/g, ' ')
.trim();
// Create document from fetched content
const doc: Document = {
id: body.id || `url-${Date.now()}`,
title: body.url,
content: text,
url: body.url,
category: body.category || 'web-page',
publishedAt: Date.now(),
};
// Re-use the /ingest logic
const ingestResponse = await this.fetch(
new Request(new URL('/ingest', request.url), {
method: 'POST',
headers: { 'Content-Type': 'application/json' },
body: JSON.stringify({
documents: [doc],
namespace: body.namespace,
}),
}),
env,
ctx
);
return ingestResponse;
} catch (error) {
console.error('URL ingest error:', error);
return Response.json(
{
error: 'URL ingestion failed',
message: error instanceof Error ? error.message : 'Unknown error',
},
{ status: 500 }
);
}
}
// Route: DELETE /documents/:id - Delete all chunks for a document
if (url.pathname.startsWith('/documents/') && request.method === 'DELETE') {
try {
const docId = url.pathname.split('/')[2];
if (!docId) {
return Response.json({ error: 'Missing document ID' }, { status: 400 });
}
// List all vector IDs (need to find chunks for this doc)
// Note: This is inefficient for large indexes. Better to maintain a separate index of doc -> chunk mappings
const allVectors = await env.VECTORIZE_INDEX.listVectors({ limit: 1000 });
const chunkIds = allVectors.vectors
.filter((v) => v.id.startsWith(`${docId}-chunk-`))
.map((v) => v.id);
if (chunkIds.length === 0) {
return Response.json(
{ error: 'Document not found', id: docId },
{ status: 404 }
);
}
// Delete in batches
const idBatches = batchArray(chunkIds, 100);
for (const batch of idBatches) {
await env.VECTORIZE_INDEX.deleteByIds(batch);
}
// Optional: Delete from R2 if exists
if (env.DOCUMENTS_BUCKET) {
await env.DOCUMENTS_BUCKET.delete(`documents/${docId}.json`);
}
return Response.json({
success: true,
id: docId,
chunksDeleted: chunkIds.length,
}, {
headers: { 'Access-Control-Allow-Origin': '*' },
});
} catch (error) {
console.error('Delete error:', error);
return Response.json(
{
error: 'Delete failed',
message: error instanceof Error ? error.message : 'Unknown error',
},
{ status: 500 }
);
}
}
// Default: API documentation
return Response.json({
name: 'Document Ingestion Pipeline API',
endpoints: {
'POST /ingest': {
description: 'Process and index documents with chunking',
body: {
documents: [
{
id: 'string (required)',
title: 'string (required)',
content: 'string (required)',
url: 'string (optional)',
author: 'string (optional)',
category: 'string (optional)',
tags: ['array (optional)'],
publishedAt: 'number (optional)',
},
],
chunkSize: 'number (optional, default: 500)',
overlapSize: 'number (optional, default: 50)',
namespace: 'string (optional)',
storeInR2: 'boolean (optional, default: false)',
},
},
'POST /ingest/url': {
description: 'Fetch and ingest document from URL',
body: {
url: 'string (required)',
id: 'string (optional)',
category: 'string (optional)',
namespace: 'string (optional)',
},
},
'DELETE /documents/:id': {
description: 'Delete all chunks for a document',
example: 'DELETE /documents/doc-123',
},
},
});
},
};
/**
* Example Usage:
*
* 1. Ingest a single document:
*
* curl -X POST https://your-worker.workers.dev/ingest \
* -H "Content-Type: application/json" \
* -d '{
* "documents": [{
* "id": "cloudflare-workers-intro",
* "title": "Introduction to Cloudflare Workers",
* "content": "Very long document content here...",
* "category": "documentation",
* "author": "Cloudflare",
* "tags": ["workers", "serverless", "edge-computing"]
* }],
* "chunkSize": 500,
* "overlapSize": 50
* }'
*
* 2. Ingest from URL:
*
* curl -X POST https://your-worker.workers.dev/ingest/url \
* -H "Content-Type: application/json" \
* -d '{
* "url": "https://developers.cloudflare.com/workers/",
* "category": "documentation"
* }'
*
* 3. Delete document:
*
* curl -X DELETE https://your-worker.workers.dev/documents/cloudflare-workers-intro
*/

/**
* Advanced Metadata Filtering Examples for Cloudflare Vectorize
*
* Use case: Multi-tenant apps, complex filtering, range queries, nested metadata
*
* Features:
* - All filter operators ($eq, $ne, $in, $nin, $lt, $lte, $gt, $gte)
* - Nested metadata with dot notation
* - Namespace-based isolation
* - Combined filters (implicit AND)
* - Range queries on numbers and strings
* - Performance optimization tips
*/
export interface Env {
VECTORIZE_INDEX: VectorizeIndex;
AI: Ai;
}
interface FilterExample {
name: string;
description: string;
filter: Record<string, any>;
namespace?: string;
}
export default {
async fetch(request: Request, env: Env, ctx: ExecutionContext): Promise<Response> {
// Handle CORS
if (request.method === 'OPTIONS') {
return new Response(null, {
headers: {
'Access-Control-Allow-Origin': '*',
'Access-Control-Allow-Methods': 'GET, POST, OPTIONS',
'Access-Control-Allow-Headers': 'Content-Type',
},
});
}
const url = new URL(request.url);
// Route: GET /examples - Show all filter examples
if (url.pathname === '/examples' && request.method === 'GET') {
const examples: FilterExample[] = [
{
name: 'Equality (implicit)',
description: 'Find vectors with exact category match',
filter: { category: 'documentation' },
},
{
name: 'Equality (explicit)',
description: 'Explicit $eq operator',
filter: { category: { $eq: 'documentation' } },
},
{
name: 'Not Equals',
description: 'Exclude archived documents',
filter: { status: { $ne: 'archived' } },
},
{
name: 'In Array',
description: 'Match any of multiple categories',
filter: { category: { $in: ['docs', 'tutorials', 'guides'] } },
},
{
name: 'Not In Array',
description: 'Exclude multiple statuses',
filter: { status: { $nin: ['archived', 'draft', 'deleted'] } },
},
{
name: 'Greater Than',
description: 'Documents published after date',
filter: { published_at: { $gt: 1704067200 } }, // Jan 1, 2024
},
{
name: 'Less Than or Equal',
description: 'Documents published before or on date',
filter: { published_at: { $lte: 1735689600 } }, // Jan 1, 2025
},
{
name: 'Range Query (numbers)',
description: 'Documents published in 2024',
filter: {
published_at: {
$gte: 1704067200, // >= Jan 1, 2024
$lt: 1735689600, // < Jan 1, 2025
},
},
},
{
name: 'Range Query (strings - prefix search)',
description: 'URLs starting with /docs/workers/',
filter: {
url: {
$gte: '/docs/workers/',
$lt: '/docs/workers0', // '0' sorts immediately after '/', bounding the prefix
},
},
},
{
name: 'Nested Metadata',
description: 'Filter by nested author ID',
filter: { 'author.id': 'user123' },
},
{
name: 'Combined Filters (AND)',
description: 'Multiple conditions (implicit AND)',
filter: {
category: 'docs',
language: 'en',
published: true,
published_at: { $gte: 1704067200 },
},
},
{
name: 'Multi-tenant (namespace)',
description: 'Isolate by customer ID using namespace',
namespace: 'customer-abc123',
filter: { type: 'support_ticket' },
},
{
name: 'Boolean Filter',
description: 'Published documents only',
filter: { published: true },
},
{
name: 'Complex Multi-field',
description: 'Docs in English, published in 2024, not archived',
filter: {
category: { $in: ['docs', 'tutorials'] },
language: 'en',
status: { $ne: 'archived' },
published_at: { $gte: 1704067200, $lt: 1735689600 },
'author.verified': true,
},
},
];
return Response.json({ examples });
}
// Route: POST /search/filtered - Execute filtered search
if (url.pathname === '/search/filtered' && request.method === 'POST') {
try {
const body = await request.json() as {
query: string;
exampleName?: string;
filter?: Record<string, any>;
namespace?: string;
topK?: number;
};
const { query, exampleName, filter, namespace, topK = 5 } = body;
if (!query) {
return Response.json({ error: 'Missing required field: query' }, { status: 400 });
}
// If exampleName provided, use pre-defined filter
let finalFilter = filter;
let finalNamespace = namespace;
if (exampleName) {
const examplesResponse = await this.fetch(
new Request(new URL('/examples', request.url)),
env,
ctx
);
const { examples } = (await examplesResponse.json()) as { examples: FilterExample[] };
const example = examples.find((ex) => ex.name === exampleName);
if (example) {
finalFilter = example.filter;
finalNamespace = example.namespace || namespace;
}
}
// Generate embedding
const embedding = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: query,
});
// Query with filter
const results = await env.VECTORIZE_INDEX.query(embedding.data[0], {
topK,
filter: finalFilter,
namespace: finalNamespace,
returnMetadata: 'all',
returnValues: false,
});
return Response.json({
query,
filter: finalFilter,
namespace: finalNamespace,
results: results.matches.map((m) => ({
id: m.id,
score: m.score,
metadata: m.metadata,
})),
count: results.count,
}, {
headers: { 'Access-Control-Allow-Origin': '*' },
});
} catch (error) {
console.error('Filtered search error:', error);
return Response.json(
{
error: 'Filtered search failed',
message: error instanceof Error ? error.message : 'Unknown error',
},
{ status: 500 }
);
}
}
// Route: POST /seed - Seed example data with rich metadata
if (url.pathname === '/seed' && request.method === 'POST') {
try {
// Sample documents with diverse metadata
const sampleDocs = [
{
content: 'Cloudflare Workers are serverless functions that run on the edge.',
metadata: {
category: 'documentation',
language: 'en',
status: 'published',
published_at: 1704153600, // Jan 2, 2024
published: true,
url: '/docs/workers/intro',
author: { id: 'user123', name: 'John Doe', verified: true },
tags: ['workers', 'serverless', 'edge'],
},
},
{
content: 'Vectorize is a globally distributed vector database.',
metadata: {
category: 'documentation',
language: 'en',
status: 'published',
published_at: 1720310400, // Jul 7, 2024
published: true,
url: '/docs/vectorize/intro',
author: { id: 'user456', name: 'Jane Smith', verified: true },
tags: ['vectorize', 'database', 'ai'],
},
},
{
content: 'D1 is Cloudflare\'s serverless SQL database.',
metadata: {
category: 'tutorials',
language: 'en',
status: 'draft',
published_at: 1735603200, // Dec 31, 2024
published: false,
url: '/tutorials/d1/getting-started',
author: { id: 'user123', name: 'John Doe', verified: true },
tags: ['d1', 'database', 'sql'],
},
},
{
content: 'R2 provides S3-compatible object storage without egress fees.',
metadata: {
category: 'guides',
language: 'en',
status: 'published',
published_at: 1712880000, // Apr 12, 2024
published: true,
url: '/docs/r2/overview',
author: { id: 'user789', name: 'Bob Wilson', verified: false },
tags: ['r2', 'storage', 'object-storage'],
},
},
{
content: 'Workers KV is a key-value store for edge applications.',
metadata: {
category: 'documentation',
language: 'en',
status: 'archived',
published_at: 1640995200, // Jan 1, 2022
published: true,
url: '/docs/kv/intro',
author: { id: 'user456', name: 'Jane Smith', verified: true },
tags: ['kv', 'storage', 'edge'],
},
},
];
// Generate embeddings
const texts = sampleDocs.map((doc) => doc.content);
const embeddings = await env.AI.run('@cf/baai/bge-base-en-v1.5', { text: texts });
// Prepare vectors
const vectors = sampleDocs.map((doc, i) => ({
id: `sample-${i + 1}`,
values: embeddings.data[i],
metadata: {
content: doc.content,
...doc.metadata,
indexed_at: Date.now(),
},
}));
// Upsert all
await env.VECTORIZE_INDEX.upsert(vectors);
return Response.json({
success: true,
message: 'Seeded 5 sample documents with rich metadata',
count: vectors.length,
}, {
headers: { 'Access-Control-Allow-Origin': '*' },
});
} catch (error) {
console.error('Seed error:', error);
return Response.json(
{
error: 'Seeding failed',
message: error instanceof Error ? error.message : 'Unknown error',
},
{ status: 500 }
);
}
}
// Default: API documentation
return Response.json({
name: 'Metadata Filtering Examples API',
endpoints: {
'GET /examples': {
description: 'List all filter examples with syntax',
},
'POST /search/filtered': {
description: 'Execute filtered vector search',
body: {
query: 'string (required)',
exampleName: 'string (optional) - use pre-defined filter',
filter: 'object (optional) - custom filter',
namespace: 'string (optional)',
topK: 'number (optional, default: 5)',
},
example: {
query: 'serverless database',
exampleName: 'Range Query (numbers)',
},
},
'POST /seed': {
description: 'Seed database with example documents',
note: 'Creates 5 sample documents with rich metadata for testing',
},
},
filterOperators: {
$eq: 'Equals',
$ne: 'Not equals',
$in: 'In array',
$nin: 'Not in array',
$lt: 'Less than',
$lte: 'Less than or equal',
$gt: 'Greater than',
$gte: 'Greater than or equal',
},
notes: {
'Metadata Keys': 'Cannot be empty, contain dots (.), quotes ("), or start with $',
'Filter Size': 'Max 2048 bytes (compact JSON)',
'Cardinality': 'High cardinality in range queries can impact performance',
'Namespace': 'Applied BEFORE metadata filters',
},
});
},
};
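The notes above mention a 2048-byte limit on compact-JSON filters. That limit can be checked client-side before querying; a minimal sketch, assuming the compact `JSON.stringify` encoding is what Vectorize counts (the helper names are illustrative, not part of the Vectorize API):

```typescript
// Hypothetical helpers: measure a filter's compact-JSON byte size against
// Vectorize's documented 2048-byte filter limit before sending the query.
const MAX_FILTER_BYTES = 2048;

export function filterByteSize(filter: Record<string, unknown>): number {
  // TextEncoder gives UTF-8 byte length, not UTF-16 code units
  return new TextEncoder().encode(JSON.stringify(filter)).length;
}

export function assertFilterFits(filter: Record<string, unknown>): void {
  const size = filterByteSize(filter);
  if (size > MAX_FILTER_BYTES) {
    throw new Error(`Filter is ${size} bytes; Vectorize limit is ${MAX_FILTER_BYTES}`);
  }
}
```

If a filter exceeds the limit, the tips at the end of this file suggest splitting it into multiple queries.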
/**
* Example Usage:
*
* 1. Seed example data:
*
* curl -X POST https://your-worker.workers.dev/seed
*
* 2. List filter examples:
*
* curl https://your-worker.workers.dev/examples
*
* 3. Search with pre-defined filter:
*
* curl -X POST https://your-worker.workers.dev/search/filtered \
* -H "Content-Type: application/json" \
* -d '{
* "query": "database storage",
* "exampleName": "Range Query (numbers)"
* }'
*
* 4. Search with custom filter:
*
* curl -X POST https://your-worker.workers.dev/search/filtered \
* -H "Content-Type: application/json" \
* -d '{
* "query": "edge computing",
* "filter": {
* "category": { "$in": ["docs", "tutorials"] },
* "language": "en",
* "status": { "$ne": "archived" },
* "author.verified": true
* },
* "topK": 3
* }'
*
* Performance Tips:
*
* 1. Low Cardinality for Range Queries:
* ✅ Good: published_at (timestamps in seconds, not milliseconds)
* ❌ Bad: user_id (millions of unique values in range)
*
* 2. Namespace First:
* Use namespace for partition key (customer_id, tenant_id)
* Then use metadata filters for finer-grained filtering
*
* 3. Filter Size:
* Keep filters under 2048 bytes
* If hitting limit, split into multiple queries
*
* 4. Indexed Metadata:
* Create metadata indexes BEFORE inserting vectors:
* npx wrangler vectorize create-metadata-index my-index \
* --property-name=category --type=string
*/
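The string range trick in the "prefix search" example above can be generated rather than hand-written. A sketch, assuming Vectorize compares strings lexicographically in the same way JavaScript does (the function name is illustrative):

```typescript
// Hypothetical helper: build a { $gte, $lt } range filter matching exactly
// the strings that start with `prefix`. '\uffff' sorts after every other
// UTF-16 code unit, so [prefix, prefix + '\uffff') bounds the prefix range.
export function prefixRange(prefix: string): { $gte: string; $lt: string } {
  return { $gte: prefix, $lt: prefix + '\uffff' };
}

// Usage sketch:
//   env.VECTORIZE_INDEX.query(vector, {
//     topK: 5,
//     filter: { url: prefixRange('/docs/workers/') },
//   });
```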
// ── templates/rag-chat.ts (new file, 351 lines) ──
/**
* RAG (Retrieval Augmented Generation) Chatbot
* with Cloudflare Vectorize + Workers AI
*
* Use case: Q&A chatbot that retrieves relevant context before generating answers
*
* Features:
* - Semantic search over knowledge base
* - Context-aware LLM responses
* - Source citations
* - Conversation history support
* - Streaming responses (optional)
*/
export interface Env {
VECTORIZE_INDEX: Vectorize; // V2 binding type ("VectorizeIndex" is the deprecated V1 type)
AI: Ai;
}
interface ChatRequest {
question: string;
conversationHistory?: Array<{ role: string; content: string }>;
topK?: number;
filter?: Record<string, any>;
namespace?: string;
}
interface ChatResponse {
answer: string;
sources: Array<{
id: string;
title: string;
score: number;
excerpt: string;
}>;
context: string;
}
export default {
async fetch(request: Request, env: Env, ctx: ExecutionContext): Promise<Response> {
// Handle CORS
if (request.method === 'OPTIONS') {
return new Response(null, {
headers: {
'Access-Control-Allow-Origin': '*',
'Access-Control-Allow-Methods': 'GET, POST, OPTIONS',
'Access-Control-Allow-Headers': 'Content-Type',
},
});
}
const url = new URL(request.url);
// Route: POST /chat - RAG chatbot endpoint
if (url.pathname === '/chat' && request.method === 'POST') {
try {
const body = await request.json() as ChatRequest;
const {
question,
conversationHistory = [],
topK = 3,
filter,
namespace,
} = body;
if (!question) {
return Response.json(
{ error: 'Missing required field: question' },
{ status: 400 }
);
}
// Step 1: Generate embedding for user question
const questionEmbedding = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: question,
});
// Step 2: Search vector database for relevant context
const searchResults = await env.VECTORIZE_INDEX.query(
questionEmbedding.data[0],
{
topK,
filter,
namespace,
returnMetadata: 'all',
returnValues: false,
}
);
// Step 3: Build context from retrieved documents
const contextParts: string[] = [];
const sources: ChatResponse['sources'] = [];
for (const match of searchResults.matches) {
const metadata = match.metadata || {};
const title = metadata.title || metadata.id || match.id;
const content = typeof metadata.content === 'string' ? metadata.content : '';
// Truncate for the source-citation excerpt (~500 chars); the full
// content still goes into the LLM context below
const excerpt =
content.length > 500 ? content.slice(0, 497) + '...' : content;
contextParts.push(`[${title}]\n${content}`);
sources.push({
id: match.id,
title,
score: match.score,
excerpt,
});
}
const context = contextParts.join('\n\n---\n\n');
// Step 4: Build conversation with context
const messages = [
{
role: 'system',
content: `You are a helpful AI assistant. Answer questions based on the following context. If the context doesn't contain enough information to answer the question, say so honestly.
Context:
${context}`,
},
...conversationHistory,
{
role: 'user',
content: question,
},
];
// Step 5: Generate answer with LLM
const aiResponse = await env.AI.run('@cf/meta/llama-3-8b-instruct', {
messages,
});
const answer = aiResponse.response || 'Sorry, I could not generate a response.';
// Return response with sources
return Response.json({
answer,
sources,
context: context.slice(0, 1000), // Include truncated context for debugging
} as ChatResponse, {
headers: { 'Access-Control-Allow-Origin': '*' },
});
} catch (error) {
console.error('Chat error:', error);
return Response.json(
{
error: 'Chat failed',
message: error instanceof Error ? error.message : 'Unknown error',
},
{ status: 500 }
);
}
}
// Route: POST /chat/stream - Streaming RAG responses
if (url.pathname === '/chat/stream' && request.method === 'POST') {
try {
const body = await request.json() as ChatRequest;
const { question, topK = 3, filter, namespace } = body;
if (!question) {
return Response.json(
{ error: 'Missing required field: question' },
{ status: 400 }
);
}
// Retrieve context (same as above)
const questionEmbedding = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: question,
});
const searchResults = await env.VECTORIZE_INDEX.query(
questionEmbedding.data[0],
{ topK, filter, namespace, returnMetadata: 'all', returnValues: false }
);
const contextParts = searchResults.matches.map(
(m) => `[${m.metadata?.title || m.id}]\n${m.metadata?.content || ''}`
);
const context = contextParts.join('\n\n---\n\n');
// Stream LLM response
const stream = await env.AI.run('@cf/meta/llama-3-8b-instruct', {
messages: [
{
role: 'system',
content: `Answer based on context:\n\n${context}`,
},
{ role: 'user', content: question },
],
stream: true,
});
return new Response(stream, {
headers: {
'Content-Type': 'text/event-stream',
'Access-Control-Allow-Origin': '*',
},
});
} catch (error) {
console.error('Stream error:', error);
return Response.json(
{
error: 'Streaming failed',
message: error instanceof Error ? error.message : 'Unknown error',
},
{ status: 500 }
);
}
}
// Route: POST /ingest - Add knowledge base content
if (url.pathname === '/ingest' && request.method === 'POST') {
try {
const body = await request.json() as {
documents: Array<{
id: string;
title: string;
content: string;
metadata?: Record<string, any>;
}>;
namespace?: string;
};
if (!body.documents || !Array.isArray(body.documents)) {
return Response.json(
{ error: 'Missing or invalid field: documents (array)' },
{ status: 400 }
);
}
// Generate embeddings for all documents
const texts = body.documents.map((doc) => doc.content);
const embeddings = await env.AI.run('@cf/baai/bge-base-en-v1.5', {
text: texts,
});
// Prepare vectors for upsert
const vectors = body.documents.map((doc, i) => ({
id: doc.id,
values: embeddings.data[i],
namespace: body.namespace,
metadata: {
title: doc.title,
content: doc.content,
...doc.metadata,
indexed_at: Date.now(),
},
}));
// Batch upsert
await env.VECTORIZE_INDEX.upsert(vectors);
return Response.json({
success: true,
count: vectors.length,
message: `Successfully indexed ${vectors.length} documents`,
}, {
headers: { 'Access-Control-Allow-Origin': '*' },
});
} catch (error) {
console.error('Ingest error:', error);
return Response.json(
{
error: 'Ingestion failed',
message: error instanceof Error ? error.message : 'Unknown error',
},
{ status: 500 }
);
}
}
// Default: API documentation
return Response.json({
name: 'RAG Chatbot API',
endpoints: {
'POST /chat': {
description: 'Ask questions with context retrieval',
body: {
question: 'string (required)',
conversationHistory: 'array (optional)',
topK: 'number (optional, default: 3)',
filter: 'object (optional)',
namespace: 'string (optional)',
},
example: {
question: 'How do I deploy a Cloudflare Worker?',
topK: 3,
filter: { category: 'documentation' },
},
},
'POST /chat/stream': {
description: 'Streaming responses',
body: 'Same as /chat',
},
'POST /ingest': {
description: 'Add documents to knowledge base',
body: {
documents: [
{
id: 'doc-1',
title: 'Document Title',
content: 'Document content...',
metadata: { category: 'docs' },
},
],
namespace: 'string (optional)',
},
},
},
});
},
};
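The `/chat` handler forwards `conversationHistory` into the prompt unbounded, so a long-running conversation can crowd retrieved context out of the model's window. One mitigation is to keep only the most recent turns; a sketch (the 6-message default is an illustrative choice, not a model requirement):

```typescript
// Hypothetical helper: cap conversationHistory before building the prompt.
// Keeping only the last few turns leaves room in the context window for the
// retrieved documents and the system prompt.
type ChatMessage = { role: string; content: string };

export function trimHistory(history: ChatMessage[], maxMessages = 6): ChatMessage[] {
  // slice(-n) keeps the n most recent messages, preserving order
  return history.length <= maxMessages ? history : history.slice(-maxMessages);
}
```

A caller would apply this between parsing the request body and assembling `messages`, e.g. `...trimHistory(conversationHistory)`.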
/**
* Example Usage:
*
* 1. Ingest knowledge base:
*
* curl -X POST https://your-worker.workers.dev/ingest \
* -H "Content-Type: application/json" \
* -d '{
* "documents": [
* {
* "id": "workers-intro",
* "title": "Introduction to Workers",
* "content": "Cloudflare Workers allow you to deploy serverless code globally...",
* "metadata": { "category": "docs", "section": "workers" }
* }
* ]
* }'
*
* 2. Ask a question:
*
* curl -X POST https://your-worker.workers.dev/chat \
* -H "Content-Type: application/json" \
* -d '{
* "question": "How do I deploy serverless code?",
* "topK": 3,
* "filter": { "category": "docs" }
* }'
*
* 3. Streaming response:
*
* curl -X POST https://your-worker.workers.dev/chat/stream \
* -H "Content-Type: application/json" \
* -d '{ "question": "What is a Worker?" }'
*/
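Embedding models such as bge-base-en-v1.5 accept a limited input length (on the order of 512 tokens), so long documents should be split into chunks before calling `/ingest`. A sketch of paragraph-aligned chunking, using a character budget as a rough proxy for tokens (the 1500-char default is illustrative):

```typescript
// Hypothetical pre-processing step for /ingest: split text into chunks no
// longer than maxChars, preferring paragraph boundaries and hard-splitting
// only paragraphs that exceed the budget on their own.
export function chunkText(text: string, maxChars = 1500): string[] {
  const chunks: string[] = [];
  let current = '';
  for (const para of text.split(/\n{2,}/)) {
    if (para.length > maxChars) {
      // Flush the running chunk, then hard-split the oversized paragraph
      if (current) { chunks.push(current); current = ''; }
      for (let i = 0; i < para.length; i += maxChars) {
        chunks.push(para.slice(i, i + maxChars));
      }
    } else if (current && current.length + 2 + para.length > maxChars) {
      chunks.push(current);
      current = para;
    } else {
      current = current ? current + '\n\n' + para : para;
    }
  }
  if (current) chunks.push(current);
  return chunks;
}
```

Each chunk would then be ingested as its own document (e.g. `id: 'doc-1#chunk-0'`), so citations can point back to the specific passage that matched.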