Files
gh-jezweb-claude-skills-ski…/references/images-guide.md
2025-11-30 08:25:12 +08:00

3.5 KiB

Images Guide (DALL-E 3 & GPT-Image-1)

Last Updated: 2025-10-25

Best practices for image generation and editing with OpenAI's Images API.


DALL-E 3 Generation

Size Selection

Size Use Case
1024x1024 Profile pictures, icons, square posts
1024x1536 Portrait photos, vertical ads
1536x1024 Landscape photos, banners
1024x1792 Tall portraits, mobile wallpapers
1792x1024 Wide banners, desktop wallpapers

Quality Settings

standard: Normal quality, faster, cheaper

  • Use for: Prototyping, high-volume generation, quick iterations

hd: High definition, finer details, more expensive

  • Use for: Final production images, marketing materials, print

Style Options

vivid: Hyper-real, dramatic, high-contrast

  • Use for: Marketing, advertising, eye-catching visuals

natural: More realistic, less dramatic

  • Use for: Product photos, realistic scenes, professional content

Prompting Best Practices

Be Specific

❌ "A cat"
✅ "A white siamese cat with striking blue eyes, sitting on a wooden table, golden hour lighting, professional photography"

Include Art Style

✅ "Oil painting of a sunset in the style of Claude Monet"
✅ "3D render of a futuristic city, Pixar animation style"
✅ "Professional product photo with studio lighting"

Specify Lighting

- "Golden hour lighting"
- "Soft studio lighting from the left"
- "Dramatic shadows"
- "Bright natural daylight"

Composition Details

- "Shallow depth of field"
- "Wide angle lens"
- "Centered composition"
- "Rule of thirds"

GPT-Image-1 Editing

Input Fidelity

low: More creative freedom

  • Use for: Major transformations, style changes

medium: Balance (default)

  • Use for: Most editing tasks

high: Stay close to original

  • Use for: Subtle edits, preserving details

Common Editing Tasks

  1. Background Removal
formData.append('prompt', 'Remove the background, keep only the product');
formData.append('format', 'png');
formData.append('background', 'transparent');
  1. Color Correction
formData.append('prompt', 'Increase brightness and saturation, make colors more vibrant');
  1. Object Removal
formData.append('prompt', 'Remove the person from the background');
  1. Compositing
formData.append('image', mainImage);
formData.append('image_2', logoImage);
formData.append('prompt', 'Add the logo to the product, as if stamped on the surface');

Format Selection

Format Transparency Compression Best For
PNG Yes Lossless Logos, transparency needed
JPEG No Lossy Photos, smaller file size
WebP Yes Lossy Web, best compression

Cost Optimization

  1. Use standard quality unless HD is critical
  2. Generate smaller sizes when possible
  3. Cache generated images
  4. Use natural style for most cases (vivid costs more)
  5. Batch requests with delays to avoid rate limits

Common Issues

Prompt Revision

DALL-E 3 may revise prompts for safety/quality. Check revised_prompt in response.

URL Expiration

Image URLs expire in 1 hour. Download and save if needed long-term.

Non-Deterministic

Same prompt = different images. Cache results if consistency needed.

Rate Limits

DALL-E has separate IPM (Images Per Minute) limits. Monitor and implement delays.


See Also: Official Images Guide (https://platform.openai.com/docs/guides/images)