Agent skill
ai-visual-generation
Install this agent skill to your Project
npx add-skill https://github.com/Gaku52/claude-code-skills/tree/main/07-ai/ai-visual-generation
SKILL.md
日本語版
AI Visual Generation
AI is revolutionizing image and video production. This skill covers all aspects of AI visual generation — from Stable Diffusion, DALL-E, and Midjourney to video generation (Sora) and 3D modeling.
Target Audience
- Creators looking to learn AI image and video generation technologies
- Engineers integrating AI visual generation into their products
- Those interested in AI art and design
Prerequisites
- Foundational AI/ML concepts
- Basic knowledge of image processing
Learning Guide
00-fundamentals — Image Generation AI Fundamentals
| # | File | Description |
|---|
01-image — Image Generation
| # | File | Description |
|---|
02-video — Video Generation
| # | File | Description |
|---|
03-3d — 3D Generation
| # | File | Description |
|---|
Quick Reference
AI Image Generation Service Comparison:
Midjourney: Highest quality, Discord-based
DALL-E 3: Easy API integration, ChatGPT integration
Stable Diffusion: Open source, fully customizable
Adobe Firefly: Commercially safe, Adobe integration
Flux: Latest open model, high quality
References
- Rombach, R. et al. "High-Resolution Image Synthesis with Latent Diffusion Models." CVPR, 2022.
- OpenAI. "DALL-E 3." openai.com, 2024.
- Stability AI. "Stable Diffusion." stability.ai, 2024.
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
computer-science-fundamentals
A comprehensive guide covering the fundamentals of computer science. From hardware internals and data representation to algorithms, data structures, computation theory, programming paradigms, and software engineering basics — a systematic guide to all the CS foundations every engineer needs.
operating-system-guide
programming-language-fundamentals
algorithm-and-data-structures
linux-cli-mastery
aws-cloud-guide
Didn't find tool you were looking for?