Agent skill
qwen-edit
AI image editing prompting patterns for Qwen-Image-Edit. Use when editing photos while preserving identity, reframing cropped images, changing clothing or accessories, adjusting poses, applying style transfers, or character transformations. Provides prompt patterns, parameter tuning, and examples.
Install this agent skill to your Project
npx add-skill https://github.com/digitalsamba/claude-code-video-toolkit/tree/main/.claude/skills/qwen-edit
SKILL.md
Qwen-Image-Edit Skill
AI-powered image editing using Qwen-Image-Edit-2511 via RunPod serverless.
Status: Evolving. Learnings are captured as we experiment.
When to Use This Skill
Use when the user wants to:
- Edit/transform photos while preserving identity
- Reframe cropped images (fix cut-off heads, etc.)
- Change clothing, add accessories
- Change pose (arm positions, hand placement)
- Apply style transfers (cyberpunk, anime, oil painting)
- Adjust lighting/color grading
- Add/remove objects
- Character transformations (Bond, Neo, etc.)
When NOT to Use
- Background replacement (single image) - produces cut-out artifacts and halos
- Face swapping - cannot preserve identity from a reference image
- Outpainting - cannot extend the canvas reliably
Use With Care
- Multi-image compositing - CAN work with explicit identity anchors (see examples.md for prompt patterns). Requires describing distinctive features (hair texture/color, ethnicity, outfit) and using guidance ~2.0
- Camera angle changes - Inconsistent results. Vertical angles (low/high) work better than rotational (three-quarter view)
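The identity anchors described above can be assembled programmatically. The following is a minimal sketch, assuming the bracketed template from this skill's multi-image composite command; `IdentityAnchors`, `composite_prompt`, and the sample values are hypothetical names and data used only for illustration, not part of the toolkit.

```python
from dataclasses import dataclass


@dataclass
class IdentityAnchors:
    """Distinctive features to restate in the prompt (hypothetical helper)."""
    ethnicity: str
    gender: str
    hair: str
    outfit: str


def composite_prompt(subject: IdentityAnchors, scene: str) -> str:
    """Fill the skill's template: subject from image 1, scene from image 2."""
    return (
        f"The {subject.ethnicity} {subject.gender} with {subject.hair} "
        f"from first image is now in {scene} from second image. "
        f"Same {subject.hair}, {subject.outfit}."
    )


def composite_negative() -> str:
    """Negative prompt copied from the skill's multi-image example."""
    return (
        "different ethnicity, different hair color, "
        "different face shape, generic stock photo"
    )


if __name__ == "__main__":
    # Illustrative values only; describe the actual person in your photo.
    anchors = IdentityAnchors(
        "East Asian", "woman", "short wavy black hair", "navy blazer"
    )
    print(composite_prompt(anchors, "the office"))
    print(composite_negative())
```

Repeating the hair description in both sentences is a deliberate choice in this sketch: it restates the feature the model is most likely to drift on. Pair the result with guidance around 2.0, as noted above.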
Quick Reference
# Basic edit
python tools/image_edit.py --input photo.jpg --prompt "Add sunglasses"
# With negative prompt (recommended)
python tools/image_edit.py --input photo.jpg \
  --prompt "Reframe as portrait with full head visible" \
  --negative "blur, distortion, artifacts"
# Style transfer
python tools/image_edit.py --input photo.jpg --style cyberpunk
# Background (use cautiously - often fails)
python tools/image_edit.py --input photo.jpg --background office
# Higher quality
python tools/image_edit.py --input photo.jpg --prompt "..." --steps 16 --guidance 3.0
# Multi-image composite (identity-preserving)
python tools/image_edit.py --input person.jpg background.jpg \
  --prompt "The [ethnicity] [gender] with [hair description] from first image is now in [scene] from second image. Same [features], [outfit]." \
  --negative "different ethnicity, different hair color, different face shape, generic stock photo" \
  --steps 16 --guidance 2.0
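When the commands above are driven from a script rather than typed by hand, building the argument list in one place avoids shell-quoting mistakes in long prompts. The following is a rough sketch, not part of the toolkit: `build_edit_command` is a hypothetical helper, it uses only the flags shown in this Quick Reference, and it assumes the tool accepts them in any combination. It only assembles the command; it does not call the RunPod endpoint.

```python
import shlex
from typing import Optional, Sequence

# Tool path as given in this skill.
TOOL = ["python", "tools/image_edit.py"]


def build_edit_command(
    inputs: Sequence[str],
    prompt: Optional[str] = None,
    negative: Optional[str] = None,
    style: Optional[str] = None,
    background: Optional[str] = None,
    steps: Optional[int] = None,
    guidance: Optional[float] = None,
) -> list[str]:
    """Return an argv list for image_edit.py (hypothetical helper)."""
    if not inputs:
        raise ValueError("at least one input image is required")
    # Multiple inputs follow a single --input flag, as in the composite example.
    cmd = [*TOOL, "--input", *inputs]
    # Only flags that were explicitly set are emitted, so the tool's own
    # defaults (not documented here) stay in effect otherwise.
    options = [
        ("--prompt", prompt),
        ("--negative", negative),
        ("--style", style),
        ("--background", background),
        ("--steps", steps),
        ("--guidance", guidance),
    ]
    for flag, value in options:
        if value is not None:
            cmd += [flag, str(value)]
    return cmd


if __name__ == "__main__":
    cmd = build_edit_command(
        ["photo.jpg"],
        prompt="Reframe as portrait with full head visible",
        negative="blur, distortion, artifacts",
    )
    # shlex.join quotes arguments so the line can be pasted into a shell.
    print(shlex.join(cmd))
```

The returned list can be passed directly to `subprocess.run(cmd, check=True)`, which sidesteps shell quoting entirely.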
Key Files
- prompting.md - Prompt patterns and structure
- examples.md - Good/bad examples from experiments
- parameters.md - Tuning steps, guidance, negative prompts
Tool Location
tools/image_edit.py - CLI wrapper for RunPod endpoint
Related Docs
- docs/qwen-edit-patterns.md - Character transformation patterns
- .ai_dev/qwen-edit-research.md - Research notes