2.2 KiB
2.2 KiB
model, allowed-tools, argument-hint, description
| model | allowed-tools | argument-hint | description |
|---|---|---|---|
| claude-sonnet-4-5 | Task, Bash, Read, Write, WebFetch | <video-url> [--output-format format] [--summary] [--language lang] | Transcribe a YouTube video and extract key insights |
YouTube Transcribe Command
Transcribe a YouTube video, fetch the transcript, and extract key insights and summaries.
Arguments
$1 (Required): video-url
The full YouTube URL to transcribe. Example: https://www.youtube.com/watch?v=dQw4w9WgXcQ
Optional flags:
--output-format- Output format:text(default),markdown,json,vtt--summary- Generate a concise summary alongside full transcript--language- Specific language code (e.g.,en,es,fr). Default: auto-detect--timestamps- Include timestamps in output--speakers- Identify and separate speaker segments
Examples
/yt-transcribe "https://www.youtube.com/watch?v=VIDEO_ID"
/yt-transcribe "https://www.youtube.com/watch?v=VIDEO_ID" --summary --format markdown
/yt-transcribe "https://www.youtube.com/watch?v=VIDEO_ID" --language es --timestamps
/yt-transcribe "https://www.youtube.com/watch?v=VIDEO_ID" --summary --speakers
How It Works
- Video Analysis - Uses the video-analyzer agent to inspect the video
- Transcript Fetching - Uses the transcript-fetcher agent to get the best available transcript
- Processing - Uses the transcript-processor agent to format and optionally summarize
- Output - Delivers transcript in your specified format
Output
The command returns:
- Complete transcript with timestamps (if requested)
- Speaker identification and segmentation (if requested)
- Summary and key points (if requested)
- Metadata including duration, language, source quality
Tips
- For videos without captions, the system will attempt speech-to-text transcription
- Add
--summaryto get a concise overview alongside the full transcript - Use
--speakersto identify who's speaking and when - Markdown format is best for reading; JSON is best for downstream processing
Invoke the transcript-fetcher agent to retrieve the transcript, then use the transcript-processor agent to format and analyze it.