Agent skill
genmedia-producer
Expert media production assistant. Use when requested to help with storyboarding, podcast creation, audio assembly, or using the GenMedia MCP tools (Veo, Lyria, Gemini TTS, NanoBanana).
Install this agent skill to your Project
npx add-skill https://github.com/GoogleCloudPlatform/vertex-ai-creative-studio/tree/main/experiments/mcp-genmedia/sample-agents/antigravity/.agents/skills/producer
SKILL.md
GenMedia Producer Skill
You are a highly capable media production assistant. Use this skill when asked to help with storyboarding, podcast creation, or complex multi-step media workflows using the Google GenMedia MCP servers.
Core Audio Production Workflow
- Script Preparation: Remove markdown formatting (*, #) and replace structure with spoken language.
- Generation: Use `chirp_tts` to generate audio. For long text, split into <5000 byte chunks.
- Assembly: Use the `avtool` (ffmpeg) `concat` filter to assemble mixed-source audio.
  - Example: `ffmpeg -y -i file1.wav -i file2.wav -filter_complex "[0:0][1:0]concat=n=2:v=0:a=1[out]" -map "[out]" final_audio.wav`
  - NEVER use `-c copy` or concat demuxer for mixed sources.
- Bumpers: Create 5-second intro/outro music using `lyria_generate_music` (with the `lyria-3-clip-preview` model), trim it, and apply a 1-second `afade`.
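The script preparation, chunking, and assembly steps above can be sketched in Python. This is a minimal illustration, not part of the skill itself: the helper names `strip_markdown`, `chunk_text`, and `build_concat_command` are hypothetical, splitting on whitespace is an assumed strategy, and the byte limit is assumed to be measured on UTF-8 text. The command builder only reproduces the ffmpeg example from the list for any number of inputs; it does not call the MCP tools.

```python
import re

MAX_CHUNK_BYTES = 5000  # each chunk must stay strictly below this size


def strip_markdown(script: str) -> str:
    """Remove '#' and '*' markers so the script reads as plain spoken text."""
    # Drop heading markers and bullet asterisks at the start of a line.
    text = re.sub(r"^[ \t]*(?:#+|\*)[ \t]+", "", script, flags=re.MULTILINE)
    # Drop any remaining emphasis asterisks.
    return text.replace("*", "").strip()


def chunk_text(text: str, max_bytes: int = MAX_CHUNK_BYTES) -> list[str]:
    """Split on whitespace into chunks whose UTF-8 size is below max_bytes.

    A single word that is itself too large is kept as its own oversized chunk.
    """
    chunks, current = [], ""
    for word in text.split():
        candidate = f"{current} {word}" if current else word
        if len(candidate.encode("utf-8")) < max_bytes:
            current = candidate
        else:
            if current:
                chunks.append(current)
            current = word
    if current:
        chunks.append(current)
    return chunks


def build_concat_command(inputs: list[str], output: str) -> list[str]:
    """Build the ffmpeg concat-filter command for mixed-source audio files."""
    cmd = ["ffmpeg", "-y"]
    for path in inputs:
        cmd += ["-i", path]
    streams = "".join(f"[{i}:0]" for i in range(len(inputs)))
    graph = f"{streams}concat=n={len(inputs)}:v=0:a=1[out]"
    return cmd + ["-filter_complex", graph, "-map", "[out]", output]
```

Each chunk returned by `chunk_text` would be sent to the text-to-speech tool separately, and the resulting files passed in order to `build_concat_command`. Returning the command as an argument list avoids shell-quoting problems with the filter graph if it is later run with `subprocess.run`.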
Storyboarding
For videos longer than 8 seconds, construct a scene-by-scene narrative that can be segmented into 5-8 second clips.
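One way to turn a target runtime into clip lengths is sketched below. It assumes whole-second durations and prefers the fewest clips possible; the function name `plan_clips` and both of those choices are illustrative assumptions rather than part of the skill.

```python
import math

MIN_CLIP, MAX_CLIP = 5, 8  # clip length bounds in seconds, from the guidance above


def plan_clips(total_seconds: int) -> list[int]:
    """Split a runtime into the fewest clips that each last 5 to 8 seconds."""
    count = max(1, math.ceil(total_seconds / MAX_CLIP))
    base, extra = divmod(total_seconds, count)
    if base < MIN_CLIP:
        raise ValueError(
            f"{total_seconds}s cannot be split into {MIN_CLIP}-{MAX_CLIP}s clips"
        )
    # The first `extra` clips take one additional second so the total is exact.
    return [base + 1] * extra + [base] * (count - extra)
```

For example, `plan_clips(30)` returns `[8, 8, 7, 7]`, which gives four scenes to storyboard. A runtime such as 9 seconds raises `ValueError`, because two clips of at least 5 seconds each would already exceed it.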
Veo Video Generation
- If a request times out, retry once. If it fails again, reduce the `duration` parameter and inform the user.
- For voiceovers, ensure the video total runtime matches the audio duration (use `ffmpeg_get_media_info`).
- The `bucket` parameter must be a full GCS URI (gs://...).
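The retry rule and the bucket check above could be wrapped in small helpers like these. This is a sketch under stated assumptions: `generate` is a hypothetical callable standing in for the Veo tool call, it is assumed to raise `TimeoutError` on a timeout, and the caller chooses the reduced duration. Neither helper reflects the actual MCP server interface.

```python
def is_full_gcs_uri(bucket: str) -> bool:
    """Return True when bucket looks like gs://<name> with an optional path."""
    prefix = "gs://"
    name = bucket[len(prefix):].split("/", 1)[0]
    return bucket.startswith(prefix) and bool(name)


def generate_with_fallback(generate, duration: int, reduced_duration: int):
    """Call generate(duration), retry once on timeout, then use a shorter clip.

    `generate` is a hypothetical stand-in for the video generation call.
    Returns (result, duration_used, notice); notice is a message to relay to
    the user, or None when the original duration succeeded.
    """
    for _ in range(2):  # first attempt plus one retry
        try:
            return generate(duration), duration, None
        except TimeoutError:
            pass
    notice = f"Timed out twice at {duration}s; generated {reduced_duration}s instead."
    return generate(reduced_duration), reduced_duration, notice
```

Returning the notice alongside the result keeps the "inform the user" step explicit, so the caller cannot silently deliver a shorter clip than requested.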