Agent skill
producer
Install this agent skill to your Project
npx add-skill https://github.com/GoogleCloudPlatform/vertex-ai-creative-studio/tree/main/experiments/mcp-genmedia/sample-agents/geminicli/sample_extensions/google-genmedia/skills/producer
SKILL.md
GenMedia Producer Skill
You are a highly capable media production assistant. Use this skill when asked to help with storyboarding, podcast creation, or complex multi-step media workflows using the Google GenMedia MCP servers.
Core Audio Production Workflow
- Script Preparation: Remove markdown formatting (*, #) and replace structure with spoken language.
- Generation: Use `chirp_tts` to generate audio. For long text, split into <5000 byte chunks.
- Assembly: Use the `avtool` (ffmpeg) `concat` filter to assemble mixed-source audio.
  - Example: `ffmpeg -y -i file1.wav -i file2.wav -filter_complex "[0:0][1:0]concat=n=2:v=0:a=1[out]" -map "[out]" final_audio.wav`
  - NEVER use `-c copy` or the concat demuxer for mixed sources.
- Bumpers: Create 5-second intro/outro music using `lyria_generate_music` (with the `lyria-3-clip-preview` model), trim it, and apply a 1-second `afade`.
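The script preparation, chunking, and assembly steps above can be sketched in Python. This is a minimal illustration, not part of the skill itself: the helper names (`strip_markdown`, `chunk_by_bytes`, `concat_command`) are hypothetical, sentence-boundary splitting is an assumed strategy, and the chunk size is assumed to be measured in UTF-8 bytes. Rewriting document structure into spoken language still needs editorial judgment, so only the marker stripping is automated here.

```python
import re

MAX_CHUNK_BYTES = 4999  # assumption: stay strictly under the 5000-byte limit


def strip_markdown(text: str) -> str:
    """Remove '#' heading markers and '*' markers so the script is plain text."""
    text = re.sub(r"^[ \t]*#+[ \t]*", "", text, flags=re.MULTILINE)
    return text.replace("*", "").strip()


def chunk_by_bytes(text: str, limit: int = MAX_CHUNK_BYTES) -> list[str]:
    """Greedily pack sentences into chunks of at most `limit` UTF-8 bytes.

    A sentence that does not fit is packed word by word instead.
    A single word longer than `limit` is not split further.
    """
    chunks: list[str] = []
    current = ""
    for sentence in re.split(r"(?<=[.!?])\s+", text.strip()):
        candidate = f"{current} {sentence}".strip()
        if len(candidate.encode("utf-8")) <= limit:
            current = candidate
            continue
        if current:
            chunks.append(current)
        current = ""
        for word in sentence.split():
            candidate = f"{current} {word}".strip()
            if len(candidate.encode("utf-8")) <= limit:
                current = candidate
            else:
                if current:
                    chunks.append(current)
                current = word
    if current:
        chunks.append(current)
    return chunks


def concat_command(inputs: list[str], output: str) -> list[str]:
    """Build ffmpeg arguments that use the concat filter (audio only).

    The filter decodes and re-encodes, which is why it tolerates inputs
    with different codecs or sample rates.
    """
    args = ["ffmpeg", "-y"]
    for path in inputs:
        args += ["-i", path]
    pads = "".join(f"[{i}:0]" for i in range(len(inputs)))
    graph = f"{pads}concat=n={len(inputs)}:v=0:a=1[out]"
    return args + ["-filter_complex", graph, "-map", "[out]", output]
```

Each chunk would be sent to `chirp_tts` separately, and the resulting files passed to `concat_command` in order. For two inputs the generated arguments match the example command above.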
Storyboarding
For videos longer than 8 seconds, construct a scene-by-scene narrative that can be segmented into 5-8 second clips.
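As a rough sketch of the segmentation arithmetic, the function below splits a target runtime into clip lengths that all fall in the 5-8 second range. The name `plan_clips` and the whole-second granularity are assumptions for illustration; the source does not say which exact durations Veo accepts, so check the planned lengths against the model's supported values.

```python
import math


def plan_clips(total_seconds: int, min_len: int = 5, max_len: int = 8) -> list[int]:
    """Split a runtime into near-equal whole-second clips within [min_len, max_len]."""
    if total_seconds <= max_len:
        return [total_seconds]  # short enough for a single clip
    count = math.ceil(total_seconds / max_len)  # fewest clips that respect max_len
    base, extra = divmod(total_seconds, count)
    lengths = [base + 1] * extra + [base] * (count - extra)
    if min(lengths) < min_len:
        # Using more clips would only make them shorter, so no valid split exists.
        raise ValueError(f"{total_seconds}s cannot be split into {min_len}-{max_len}s clips")
    return lengths


print(plan_clips(30))  # [8, 8, 7, 7]
```

Some runtimes have no valid split (9 seconds is too long for one clip and too short for two), in which case the storyboard length needs adjusting before generation.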
Veo Video Generation
- If a request times out, retry once. If it fails again, reduce the `duration` parameter and inform the user.
- For voiceovers, ensure the video total runtime matches the audio duration (use `ffmpeg_get_media_info`).
- The `bucket` parameter must be a full GCS URI (gs://...).
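The retry rule and the bucket check can be sketched as follows. Everything here is illustrative: `generate` is a hypothetical callable standing in for the Veo tool call, `TimeoutError` is an assumed timeout signal, and the 2-second step and 4-second floor are assumptions, since the source only says to reduce `duration`.

```python
from typing import Callable


def validate_bucket(bucket: str) -> str:
    """Reject values that are not a full gs:// URI."""
    prefix = "gs://"
    if not bucket.startswith(prefix) or len(bucket) == len(prefix):
        raise ValueError(f"bucket must be a full GCS URI (gs://...), got {bucket!r}")
    return bucket


def generate_with_retry(
    generate: Callable[[int], str],  # hypothetical stand-in for the Veo call
    duration: int,
    step: int = 2,          # assumption: how much to shorten after two timeouts
    min_duration: int = 4,  # assumption: shortest duration worth trying
    notify: Callable[[str], None] = print,
) -> str:
    """Try once, retry once on timeout, then try a shorter duration."""
    last_error = None
    for _ in range(2):  # first attempt plus exactly one retry
        try:
            return generate(duration)
        except TimeoutError as exc:
            last_error = exc
    shorter = max(min_duration, duration - step)
    if shorter >= duration:
        raise last_error  # nothing left to reduce
    notify(f"Timed out twice at {duration}s; retrying at {shorter}s.")
    return generate(shorter)
```

The `notify` hook is where the agent would tell the user that the clip was shortened.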
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
genmedia-audio-engineer
Expert in audio synthesis, music generation, and mixing. Use when creating podcasts, background scores, or multi-track audio layering using mcp-chirp3-go, mcp-lyria-go, mcp-gemini-go, mcp-nanobanana-go, and mcp-avtool-go.
agent-aware-cli
Guide for designing and implementing command-line interfaces (CLIs) that are equally usable by human developers and automated coding agents. Use when the user wants to build a CLI, apply CLI best practices, or use Go with Cobra and Viper.
genmedia-voice-director
Expert in casting, directing, and generating expressive text-to-speech using Gemini TTS. Use this when the user needs virtual voice actor personas, expressive speech generation, or multiple variations of a voiceover (like "take 3 on the bounce").
genmedia-image-artist
Expert in AI image generation and editing. Use when the user needs high-quality textures, character-consistent visuals, or image-to-image editing using mcp-nanobanana-go.
genmedia-video-editor
Expert in video composition, editing, and format conversion. Use when the user wants to generate high-quality video, overlay images on video, concatenate clips, create GIFs, or sync audio to video using mcp-avtool-go and mcp-veo-go.
genmedia-producer
Expert media production assistant. Use when requested to help with storyboarding, podcast creation, audio assembly, or complex multi-step media workflows using the GenMedia MCP servers (Veo, Lyria, Gemini TTS, NanoBanana).