Agent skill
youtube-to-markdown
Use when user asks YouTube video extraction, get, fetch, transcripts, subtitles, or captions. Writes video details and transcription into structured markdown file.
Install this agent skill to your Project
npx add-skill https://github.com/vre/flow-state/tree/main/docs/youtube-to-markdown/research/2026.01.14 token usage test/code/modules
SKILL.md
YouTube to Markdown
Multiple videos: Process one video at a time, sequentially. Do not run parallel extractions. Do not create your own scripts.
Step 0: Check if extracted before
python3 ./check_existing.py "<YOUTUBE_URL>" "<output_directory>"
If any *_valid: false: Show the issues to user and proceed.
Output JSON contains video_id. Set BASE_NAME = youtube_{video_id} for all subsequent steps.
Step 1: Choose output
AskUserQuestion:
- question: "What do you want to extract from the video?"
- header: "Output"
- multiSelect: false
- options: A. "Summary only" - Tight summary of video content B. "Transcript only" - Cleaned, formatted full transcript C. "Comments only" - Curated comments D. "Summary + Comments" - Summary with cross-analyzed comment insights E. "Full (Recommended)" - All: summary, transcript, comments
Step 2: Execute modules
Based on user's choice, read and follow each module instruction in ./modules/{file}. "|" marks possibility to run concurrently.
- A: transcript_extract.md → transcript_summarize.md
- B: transcript_extract.md → transcript_polish.md
- C: comment_extract.md
- D: transcript_extract.md → (transcript_summarize.md | comment_extract.md) → comment_summarize.md
- E: transcript_extract.md → (transcript_summarize.md | transcript_polish.md | comment_extract.md) → comment_summarize.md
Step 3: Finalize
python3 finalize.py [flag] "${BASE_NAME}" "<output_directory>"
Flags: A=--summary-only, B=--transcript-only, C=--comments-only, D=--summary-comments, E=(none)
Use --debug to keep intermediate files.
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
youtube-to-markdown
Use when user asks YouTube video extraction, get, fetch, transcripts, subtitles, or captions, or channel browsing. Writes video details and transcription into structured markdown file.
cli-tool-builder
Use when creating CLI tools with action dispatcher pattern
building-skills
Use when creating new skills or converting workflows into SKILL.md
session-codex
Use when delegating a task to Codex CLI with session persistence across turns.
session-gemini
Use when delegating a task to Gemini CLI with session persistence across turns.
project-builder
Use when creating a new skill, MCP server, or CLI tool project.
Didn't find tool you were looking for?