Files
gh-giuseppe-trisciuoglio-de…/skills/aws-sdk-java-v2-bedrock/references/model-reference.md
2025-11-29 18:28:34 +08:00

8.9 KiB

Model Reference

Supported Foundation Models

Amazon Models

Amazon Titan Text

Model ID: amazon.titan-text-express-v1

  • Description: High-quality text generation model
  • Context Window: Up to 8K tokens
  • Languages: English, Spanish, French, German, Italian, Portuguese

Payload Format:

{
    "inputText": "Your prompt here",
    "textGenerationConfig": {
        "maxTokenCount": 512,
        "temperature": 0.7,
        "topP": 0.9
    }
}

Response Format:

{
    "results": [{
        "outputText": "Generated text"
    }]
}

Amazon Titan Text Lite

Model ID: amazon.titan-text-lite-v1

  • Description: Cost-effective text generation model
  • Context Window: Up to 4K tokens
  • Use Case: Simple text generation tasks

Amazon Titan Embeddings

Model ID: amazon.titan-embed-text-v1

  • Description: High-quality text embeddings
  • Context Window: 8K tokens
  • Output: 1024-dimensional vector

Payload Format:

{
    "inputText": "Your text here"
}

Response Format:

{
    "embedding": [0.1, -0.2, 0.3, ...]
}

Amazon Titan Image Generator

Model ID: amazon.titan-image-generator-v1

  • Description: High-quality image generation
  • Image Size: 512x512, 1024x1024
  • Use Case: Text-to-image generation

Payload Format:

{
    "taskType": "TEXT_IMAGE",
    "textToImageParams": {
        "text": "Your description"
    },
    "imageGenerationConfig": {
        "numberOfImages": 1,
        "quality": "standard",
        "cfgScale": 8.0,
        "height": 512,
        "width": 512,
        "seed": 12345
    }
}

Anthropic Models

Claude 3.5 Sonnet

Model ID: anthropic.claude-3-5-sonnet-20241022-v2:0

  • Description: High-performance model for complex reasoning, analysis, and creative tasks
  • Context Window: 200K tokens
  • Languages: Multiple languages supported
  • Use Case: Code generation, complex analysis, creative writing, research
  • Features: Tool use, function calling, JSON mode

Payload Format:

{
    "anthropic_version": "bedrock-2023-05-31",
    "max_tokens": 1000,
    "messages": [{
        "role": "user",
        "content": "Your message"
    }]
}

Response Format:

{
    "content": [{
        "text": "Response content"
    }],
    "usage": {
        "input_tokens": 10,
        "output_tokens": 20
    }
}

Claude 3.5 Haiku

Model ID: anthropic.claude-3-5-haiku-20241022-v2:0

  • Description: Fast and affordable model for real-time applications
  • Context Window: 200K tokens
  • Use Case: Real-time applications, chatbots, quick responses
  • Features: Tool use, function calling, JSON mode

Claude 3 Opus

Model ID: anthropic.claude-3-opus-20240229-v1:0

  • Description: Most capable model
  • Context Window: 200K tokens
  • Use Case: Complex reasoning, analysis

Claude 3 Sonnet (Legacy)

Model ID: anthropic.claude-3-sonnet-20240229-v1:0

  • Description: Previous generation model
  • Context Window: 200K tokens
  • Use Case: General purpose applications

Meta Models

Llama 3.1 70B

Model ID: meta.llama3-1-70b-instruct-v1:0

  • Description: Latest generation large open-source model
  • Context Window: 128K tokens
  • Use Case: General purpose instruction following, complex reasoning
  • Features: Improved instruction following, larger context window

Llama 3.1 8B

Model ID: meta.llama3-1-8b-instruct-v1:0

  • Description: Latest generation small fast model
  • Context Window: 8K tokens
  • Use Case: Fast inference, lightweight applications

Llama 3 70B

Model ID: meta.llama3-70b-instruct-v1:0

  • Description: Previous generation large open-source model
  • Context Window: 8K tokens
  • Use Case: General purpose instruction following

Payload Format:

{
    "prompt": "[INST] Your prompt here [/INST]",
    "max_gen_len": 512,
    "temperature": 0.7,
    "top_p": 0.9
}

Response Format:

{
    "generation": "Generated text"
}

Llama 3 8B

Model ID: meta.llama3-8b-instruct-v1:0

  • Description: Smaller, faster version
  • Context Window: 8K tokens
  • Use Case: Fast inference, lightweight applications

Stability AI Models

Stable Diffusion XL

Model ID: stability.stable-diffusion-xl-v1

  • Description: High-quality image generation
  • Image Size: Up to 1024x1024
  • Use Case: Text-to-image generation, art creation

Payload Format:

{
    "text_prompts": [{
        "text": "Your description"
    }],
    "style_preset": "photographic",
    "seed": 12345,
    "cfg_scale": 10,
    "steps": 50
}

Response Format:

{
    "artifacts": [{
        "base64": "base64-encoded-image-data",
        "finishReason": "SUCCESS"
    }]
}

Other Models

Cohere Command

Model ID: cohere.command-text-v14

  • Description: Text generation model
  • Context Window: 128K tokens
  • Use Case: Content generation, summarization

Mistral Models

Model ID: mistral.mistral-7b-instruct-v0:2

  • Description: High-performing open-source model
  • Context Window: 32K tokens
  • Use Case: Instruction following, code generation

Model ID: mistral.mixtral-8x7b-instruct-v0:1

  • Description: Mixture of experts model
  • Context Window: 32K tokens
  • Use Case: Complex reasoning tasks

Model Selection Guide

Use Case Recommendations

Use Case Recommended Models Notes
General Chat/Chatbots Claude 3.5 Haiku, Llama 3 8B Fast response times
Content Creation Claude 3.5 Sonnet, Cohere Creative, coherent outputs
Code Generation Claude 3.5 Sonnet, Llama 3.1 70B Excellent understanding
Analysis & Reasoning Claude 3 Opus, Claude 3.5 Sonnet Complex reasoning
Real-time Applications Claude 3.5 Haiku, Titan Lite Fast inference
Cost-sensitive Apps Titan Lite, Claude 3.5 Haiku Lower cost per token
High Quality Claude 3 Opus, Claude 3.5 Sonnet Premium quality

Performance Characteristics

Model Speed Cost Quality Context Window
Claude 3 Opus Slow High Excellent 200K
Claude 3.5 Sonnet Medium Medium Excellent 200K
Claude 3.5 Haiku Fast Low Good 200K
Claude 3 Sonnet (Legacy) Medium Medium Good 200K
Llama 3.1 70B Medium Medium Good 128K
Llama 3.1 8B Fast Low Fair 8K
Llama 3 70B Medium Medium Good 8K
Llama 3 8B Fast Low Fair 8K
Titan Express Fast Medium Good 8K
Titan Lite Fast Low Fair 4K

Model Comparison Matrix

Feature Claude 3 Llama 3 Titan Stability
Streaming
Tool Use
Image Generation
Embeddings
Multiple Languages
Context Window 200K 8K 8K N/A
Open Source

Model Configuration Templates

Text Generation Template

private static JSONObject createTextGenerationPayload(String modelId, String prompt) {
    JSONObject payload = new JSONObject();

    if (modelId.startsWith("anthropic.claude")) {
        payload.put("anthropic_version", "bedrock-2023-05-31");
        payload.put("max_tokens", 1000);
        payload.put("messages", new JSONObject[]{new JSONObject()
            .put("role", "user")
            .put("content", prompt)
        });
    } else if (modelId.startsWith("meta.llama")) {
        payload.put("prompt", "[INST] " + prompt + " [/INST]");
        payload.put("max_gen_len", 512);
    } else if (modelId.startsWith("amazon.titan")) {
        payload.put("inputText", prompt);
        payload.put("textGenerationConfig", new JSONObject()
            .put("maxTokenCount", 512)
            .put("temperature", 0.7)
        );
    }

    return payload;
}

Image Generation Template

private static JSONObject createImageGenerationPayload(String modelId, String prompt) {
    JSONObject payload = new JSONObject();

    if (modelId.equals("amazon.titan-image-generator-v1")) {
        payload.put("taskType", "TEXT_IMAGE");
        payload.put("textToImageParams", new JSONObject().put("text", prompt));
        payload.put("imageGenerationConfig", new JSONObject()
            .put("numberOfImages", 1)
            .put("quality", "standard")
            .put("height", 512)
            .put("width", 512)
        );
    } else if (modelId.equals("stability.stable-diffusion-xl-v1")) {
        payload.put("text_prompts", new JSONObject[]{new JSONObject().put("text", prompt)});
        payload.put("style_preset", "photographic");
        payload.put("steps", 50);
        payload.put("cfg_scale", 10);
    }

    return payload;
}