gh-cwensel-arcaneum/commands/models.md at b923023331732db3b4c05012ed4abff8c74fda27

Files

Zhongwei Li b923023331 Initial commit

2025-11-29 18:17:12 +08:00

description, argument-hint

description	argument-hint
Manage embedding models	<list> [options]

Manage and view available embedding models for vector search.

Subcommands:

Options:

Examples:

/models list
/models list --json

Execution:

cd ${CLAUDE_PLUGIN_ROOT}
arc models $ARGUMENTS

Available Models:

The list command shows:

Current Models:

For Documents/PDFs:

For Source Code:

For General Use:

Model Selection Tips:

Match content type:
- PDFs/docs → stella or modernbert
- Source code → jina-code
- Mixed → stella or bge
Consider dimensions:
- Higher dimensions (1024D) = better quality, more storage
- Lower dimensions (384D, 768D) = faster, less storage
Backend matters:
- fastembed: Faster, optimized, limited models
- sentence-transformers: More models, HuggingFace ecosystem
Collection consistency:
- Use same model for all documents in a collection
- Cannot mix dimensions in one vector space

Downloading Models:

Models auto-download on first use (~1-2GB):

Pre-download for offline use:

python -c "from sentence_transformers import SentenceTransformer; SentenceTransformer('jinaai/jina-embeddings-v2-base-code')"

Related Commands:

Implementation: