Initial commit
This commit is contained in:
153
references/images-guide.md
Normal file
153
references/images-guide.md
Normal file
@@ -0,0 +1,153 @@
|
||||
# Images Guide (DALL-E 3 & GPT-Image-1)
|
||||
|
||||
**Last Updated**: 2025-10-25
|
||||
|
||||
Best practices for image generation and editing with OpenAI's Images API.
|
||||
|
||||
---
|
||||
|
||||
## DALL-E 3 Generation
|
||||
|
||||
### Size Selection
|
||||
|
||||
| Size | Use Case |
|
||||
|------|----------|
|
||||
| 1024x1024 | Profile pictures, icons, square posts |
|
||||
| 1024x1536 | Portrait photos, vertical ads |
|
||||
| 1536x1024 | Landscape photos, banners |
|
||||
| 1024x1792 | Tall portraits, mobile wallpapers |
|
||||
| 1792x1024 | Wide banners, desktop wallpapers |
|
||||
|
||||
### Quality Settings
|
||||
|
||||
**standard**: Normal quality, faster, cheaper
|
||||
- Use for: Prototyping, high-volume generation, quick iterations
|
||||
|
||||
**hd**: High definition, finer details, more expensive
|
||||
- Use for: Final production images, marketing materials, print
|
||||
|
||||
### Style Options
|
||||
|
||||
**vivid**: Hyper-real, dramatic, high-contrast
|
||||
- Use for: Marketing, advertising, eye-catching visuals
|
||||
|
||||
**natural**: More realistic, less dramatic
|
||||
- Use for: Product photos, realistic scenes, professional content
|
||||
|
||||
---
|
||||
|
||||
## Prompting Best Practices
|
||||
|
||||
### Be Specific
|
||||
|
||||
```
|
||||
❌ "A cat"
|
||||
✅ "A white siamese cat with striking blue eyes, sitting on a wooden table, golden hour lighting, professional photography"
|
||||
```
|
||||
|
||||
### Include Art Style
|
||||
|
||||
```
|
||||
✅ "Oil painting of a sunset in the style of Claude Monet"
|
||||
✅ "3D render of a futuristic city, Pixar animation style"
|
||||
✅ "Professional product photo with studio lighting"
|
||||
```
|
||||
|
||||
### Specify Lighting
|
||||
|
||||
```
|
||||
- "Golden hour lighting"
|
||||
- "Soft studio lighting from the left"
|
||||
- "Dramatic shadows"
|
||||
- "Bright natural daylight"
|
||||
```
|
||||
|
||||
### Composition Details
|
||||
|
||||
```
|
||||
- "Shallow depth of field"
|
||||
- "Wide angle lens"
|
||||
- "Centered composition"
|
||||
- "Rule of thirds"
|
||||
```
|
||||
|
||||
---
|
||||
|
||||
## GPT-Image-1 Editing
|
||||
|
||||
### Input Fidelity
|
||||
|
||||
**low**: More creative freedom
|
||||
- Use for: Major transformations, style changes
|
||||
|
||||
**medium**: Balance (default)
|
||||
- Use for: Most editing tasks
|
||||
|
||||
**high**: Stay close to original
|
||||
- Use for: Subtle edits, preserving details
|
||||
|
||||
### Common Editing Tasks
|
||||
|
||||
1. **Background Removal**
|
||||
```typescript
|
||||
formData.append('prompt', 'Remove the background, keep only the product');
|
||||
formData.append('format', 'png');
|
||||
formData.append('background', 'transparent');
|
||||
```
|
||||
|
||||
2. **Color Correction**
|
||||
```typescript
|
||||
formData.append('prompt', 'Increase brightness and saturation, make colors more vibrant');
|
||||
```
|
||||
|
||||
3. **Object Removal**
|
||||
```typescript
|
||||
formData.append('prompt', 'Remove the person from the background');
|
||||
```
|
||||
|
||||
4. **Compositing**
|
||||
```typescript
|
||||
formData.append('image', mainImage);
|
||||
formData.append('image_2', logoImage);
|
||||
formData.append('prompt', 'Add the logo to the product, as if stamped on the surface');
|
||||
```
|
||||
|
||||
---
|
||||
|
||||
## Format Selection
|
||||
|
||||
| Format | Transparency | Compression | Best For |
|
||||
|--------|--------------|-------------|----------|
|
||||
| PNG | Yes | Lossless | Logos, transparency needed |
|
||||
| JPEG | No | Lossy | Photos, smaller file size |
|
||||
| WebP | Yes | Lossy | Web, best compression |
|
||||
|
||||
---
|
||||
|
||||
## Cost Optimization
|
||||
|
||||
1. Use standard quality unless HD is critical
|
||||
2. Generate smaller sizes when possible
|
||||
3. Cache generated images
|
||||
4. Use natural style for most cases (vivid costs more)
|
||||
5. Batch requests with delays to avoid rate limits
|
||||
|
||||
---
|
||||
|
||||
## Common Issues
|
||||
|
||||
### Prompt Revision
|
||||
DALL-E 3 may revise prompts for safety/quality. Check `revised_prompt` in response.
|
||||
|
||||
### URL Expiration
|
||||
Image URLs expire in 1 hour. Download and save if needed long-term.
|
||||
|
||||
### Non-Deterministic
|
||||
Same prompt = different images. Cache results if consistency needed.
|
||||
|
||||
### Rate Limits
|
||||
DALL-E has separate IPM (Images Per Minute) limits. Monitor and implement delays.
|
||||
|
||||
---
|
||||
|
||||
**See Also**: Official Images Guide (https://platform.openai.com/docs/guides/images)
|
||||
Reference in New Issue
Block a user