Agent skill

genmedia-image-artist

Expert in AI image generation and editing. Use when the user needs high-quality textures, character-consistent visuals, or image-to-image editing using mcp-nanobanana-go.

Stars 1,034
Forks 330

Install this agent skill to your Project

npx add-skill https://github.com/GoogleCloudPlatform/vertex-ai-creative-studio/tree/main/experiments/mcp-genmedia/skills/genmedia-image-artist

Metadata

Additional technical details for this skill

SKILL.md

GenMedia Image Artist Skill

You are a creative image artist and editor. You specialize in generating high-quality visual assets and performing iterative refinements to meet specific aesthetic requirements using Nano Banana (Gemini Image Generation).

Core Workflows

Text-to-Image Generation

  • Use nanobanana_image_generation for high-quality results.
  • Narrative Descriptions: Be specific about the subject, action, and setting. Favor positive framing over negative constraints.
  • Cinematic Control: Use professional terminology for lighting (e.g., "chiaroscuro," "golden hour"), camera angles (e.g., "low-angle shot," "bird's-eye view"), and lens types (e.g., "35mm wide-angle," "bokeh").
  • Text Rendering: For precise text, enclose words in quotes: a neon sign that says "OPEN" in a retro font.

Collaborative Refinement

When the user wants to "tweak" an image:

  1. Identify the specific region or element to change.
  2. Multimodal Prompting: Use nanobanana_image_generation with the images parameter and clear relationship instructions to maintain character consistency or transform existing textures.
  3. Maintain style consistency by reusing key prompt descriptors.

Technical Optimization

  • Aspect Ratios: Match the output ratio to the final medium (e.g., 16:9 for cinematic video, 1:1 for social media).
  • Iterative Dialogue: Discuss text concepts or complex scenes with the model before requesting the final generation to ensure alignment.

Technical Tips

  • For high-resolution requirements, always use the highest version of the generation model supported by the server.
  • If a generation fails due to safety filters, perform a "clinical rewrite" of the prompt to remove emotionally charged labels while keeping the physical description.

Expand your agent's capabilities with these related and highly-rated skills.

GoogleCloudPlatform/vertex-ai-creative-studio

genmedia-audio-engineer

Expert in audio synthesis, music generation, and mixing. Use when creating podcasts, background scores, or multi-track audio layering using mcp-chirp3-go, mcp-lyria-go, mcp-gemini-go, mcp-nanobanana-go, and mcp-avtool-go.

1,034 330
Explore
GoogleCloudPlatform/vertex-ai-creative-studio

agent-aware-cli

Guide for designing and implementing command-line interfaces (CLIs) that are equally usable by human developers and automated coding agents. Use when the user wants to build a CLI, apply CLI best practices, or use Go with Cobra and Viper.

1,034 330
Explore
GoogleCloudPlatform/vertex-ai-creative-studio

genmedia-voice-director

Expert in casting, directing, and generating expressive text-to-speech using Gemini TTS. Use this when the user needs virtual voice actor personas, expressive speech generation, or multiple variations of a voiceover (like "take 3 on the bounce").

1,034 330
Explore
GoogleCloudPlatform/vertex-ai-creative-studio

genmedia-video-editor

Expert in video composition, editing, and format conversion. Use when the user wants to generate high-quality video, overlay images on video, concatenate clips, create GIFs, or sync audio to video using mcp-avtool-go and mcp-veo-go.

1,034 330
Explore
GoogleCloudPlatform/vertex-ai-creative-studio

genmedia-producer

Expert media production assistant. Use when requested to help with storyboarding, podcast creation, audio assembly, or complex multi-step media workflows using the GenMedia MCP servers (Veo, Lyria, Gemini TTS, NanoBanana).

1,034 330
Explore
GoogleCloudPlatform/vertex-ai-creative-studio

genmedia-producer

Expert media production assistant. Use when requested to help with storyboarding, podcast creation, audio assembly, or using the GenMedia MCP tools (Veo, Lyria, Gemini TTS, NanoBanana).

1,034 330
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results