gh-dotclaude-marketplace-pl…/commands/yt-transcribe.md

---
model: claude-sonnet-4-5
allowed-tools: Task, Bash, Read, Write, WebFetch
argument-hint: <video-url> [--output-format format] [--summary] [--language lang]
description: Transcribe a YouTube video and extract key insights
---

# YouTube Transcribe Command

Transcribe a YouTube video, fetch the transcript, and extract key insights and summaries.

## Arguments

**$1 (Required)**: `video-url`

The full YouTube URL to transcribe. Example: `https://www.youtube.com/watch?v=dQw4w9WgXcQ`

**Optional flags**:

- `--output-format` - Output format: `text` (default), `markdown`, `json`, `vtt`
- `--summary` - Generate a concise summary alongside full transcript
- `--language` - Specific language code (e.g., `en`, `es`, `fr`). Default: auto-detect
- `--timestamps` - Include timestamps in output
- `--speakers` - Identify and separate speaker segments

## Examples

```bash
/yt-transcribe "https://www.youtube.com/watch?v=VIDEO_ID"
/yt-transcribe "https://www.youtube.com/watch?v=VIDEO_ID" --summary --format markdown
/yt-transcribe "https://www.youtube.com/watch?v=VIDEO_ID" --language es --timestamps
/yt-transcribe "https://www.youtube.com/watch?v=VIDEO_ID" --summary --speakers
```

## How It Works

1. **Video Analysis** - Uses the video-analyzer agent to inspect the video
2. **Transcript Fetching** - Uses the transcript-fetcher agent to get the best available transcript
3. **Processing** - Uses the transcript-processor agent to format and optionally summarize
4. **Output** - Delivers transcript in your specified format

## Output

The command returns:
- Complete transcript with timestamps (if requested)
- Speaker identification and segmentation (if requested)
- Summary and key points (if requested)
- Metadata including duration, language, source quality

## Tips

- For videos without captions, the system will attempt speech-to-text transcription
- Add `--summary` to get a concise overview alongside the full transcript
- Use `--speakers` to identify who's speaking and when
- Markdown format is best for reading; JSON is best for downstream processing

---

Invoke the transcript-fetcher agent to retrieve the transcript, then use the transcript-processor agent to format and analyze it.