Agent skill

gif-sticker-maker

Convert photos (people, pets, objects, logos) into 4 animated GIF stickers with captions. Use when: user wants to create cartoon stickers, GIF expressions, emoji packs, animated avatars, or convert photos to Funko Pop / Pop Mart blind box style animations. Triggers: sticker, GIF, cartoon, emoji, expression pack, avatar animation.

Stars 19
Forks 4

Install this agent skill to your Project

npx add-skill https://github.com/x-cmd/skill/tree/main/data/minimax/gif-sticker-maker

Metadata

Additional technical details for this skill

style
Funko Pop / Pop Mart
sources
[
    "MiniMax Image Generation API",
    "MiniMax Video Generation API"
]
version
1.2
category
creative-tools
output count
4
output format
GIF

SKILL.md

GIF Sticker Maker

Convert user photos into 4 animated GIF stickers (Funko Pop / Pop Mart style).

Style Spec

  • Funko Pop / Pop Mart blind box 3D figurine
  • C4D / Octane rendering quality
  • White background, soft studio lighting
  • Caption: black text + white outline, bottom of image

Prerequisites

Before starting any generation step, ensure:

  1. Python venv is activated with dependencies from requirements.txt installed
  2. MINIMAX_API_KEY is exported (e.g. export MINIMAX_API_KEY='your-key')
  3. ffmpeg is available on PATH (for Step 3 GIF conversion)

If any prerequisite is missing, set it up first. Do NOT proceed to generation without all three.

Workflow

Step 0: Collect Captions

Ask user (in their language):

"Would you like to customize the captions for your stickers, or use the defaults?"

  • Custom: Collect 4 short captions (1–3 words). Actions auto-match caption meaning.
  • Default: Look up captions table by detected user language. Never mix languages.

Step 1: Generate 4 Static Sticker Images

Tool: scripts/minimax_image.py

  1. Analyze the user's photo — identify subject type (person / animal / object / logo).
  2. For each of the 4 stickers, build a prompt from image-prompt-template.txt by filling {action} and {caption}.
  3. If subject is a person: pass --subject-ref <user_photo_path> so the generated figurine preserves the person's actual facial likeness.
  4. Generate (all 4 are independent — run concurrently):
bash
python3 scripts/minimax_image.py "<prompt>" -o output/sticker_hi.png --ratio 1:1 --subject-ref <photo>
python3 scripts/minimax_image.py "<prompt>" -o output/sticker_laugh.png --ratio 1:1 --subject-ref <photo>
python3 scripts/minimax_image.py "<prompt>" -o output/sticker_cry.png --ratio 1:1 --subject-ref <photo>
python3 scripts/minimax_image.py "<prompt>" -o output/sticker_love.png --ratio 1:1 --subject-ref <photo>

--subject-ref only works for person subjects (API limitation: type=character). For animals/objects/logos, omit the flag and rely on text description.

Step 2: Animate Each Image → Video

Tool: scripts/minimax_video.py with --image flag (image-to-video mode)

For each sticker image, build a prompt from video-prompt-template.txt, then:

bash
python3 scripts/minimax_video.py "<prompt>" --image output/sticker_hi.png -o output/sticker_hi.mp4
python3 scripts/minimax_video.py "<prompt>" --image output/sticker_laugh.png -o output/sticker_laugh.mp4
python3 scripts/minimax_video.py "<prompt>" --image output/sticker_cry.png -o output/sticker_cry.mp4
python3 scripts/minimax_video.py "<prompt>" --image output/sticker_love.png -o output/sticker_love.mp4

All 4 calls are independent — run concurrently.

Step 3: Convert Videos → GIF

Tool: scripts/convert_mp4_to_gif.py

bash
python3 scripts/convert_mp4_to_gif.py output/sticker_hi.mp4 output/sticker_laugh.mp4 output/sticker_cry.mp4 output/sticker_love.mp4

Outputs GIF files alongside each MP4 (e.g. sticker_hi.gif).

Step 4: Deliver

Output format (strict order):

  1. Brief status line (e.g. "4 stickers created:")
  2. <deliver_assets> block with all GIF files
  3. NO text after deliver_assets
xml
<deliver_assets>
<item><path>output/sticker_hi.gif</path></item>
<item><path>output/sticker_laugh.gif</path></item>
<item><path>output/sticker_cry.gif</path></item>
<item><path>output/sticker_love.gif</path></item>
</deliver_assets>

Default Actions

# Action Filename ID Animation
1 Happy waving hi Wave hand, slight head tilt
2 Laughing hard laugh Shake with laughter, eyes squint
3 Crying tears cry Tears stream, body trembles
4 Heart gesture love Heart hands, eyes sparkle

See references/captions.md for multilingual caption defaults.

Rules

  • Detect user's language, all outputs follow it
  • Captions MUST come from captions.md matching user's language column — never mix languages
  • All image prompts must be in English regardless of user language (only caption text is localized)
  • <deliver_assets> must be LAST in response, no text after

Expand your agent's capabilities with these related and highly-rated skills.

x-cmd/skill

pufferlib

High-performance reinforcement learning framework optimized for speed and scale. Use when you need fast parallel training, vectorized environments, multi-agent systems, or integration with game environments (Atari, Procgen, NetHack). Achieves 2-10x speedups over standard implementations. For quick prototyping or standard algorithm implementations with extensive documentation, use stable-baselines3 instead.

19 4
Explore
x-cmd/skill

fluidsim

Framework for computational fluid dynamics simulations using Python. Use when running fluid dynamics simulations including Navier-Stokes equations (2D/3D), shallow water equations, stratified flows, or when analyzing turbulence, vortex dynamics, or geophysical flows. Provides pseudospectral methods with FFT, HPC support, and comprehensive output analysis.

19 4
Explore
x-cmd/skill

metabolomics-workbench-database

Access NIH Metabolomics Workbench via REST API (4,200+ studies). Query metabolites, RefMet nomenclature, MS/NMR data, m/z searches, study metadata, for metabolomics and biomarker discovery.

19 4
Explore
x-cmd/skill

geniml

This skill should be used when working with genomic interval data (BED files) for machine learning tasks. Use for training region embeddings (Region2Vec, BEDspace), single-cell ATAC-seq analysis (scEmbed), building consensus peaks (universes), or any ML-based analysis of genomic regions. Applies to BED file collections, scATAC-seq data, chromatin accessibility datasets, and region-based genomic feature learning.

19 4
Explore
x-cmd/skill

zinc-database

Access ZINC (230M+ purchasable compounds). Search by ZINC ID/SMILES, similarity searches, 3D-ready structures for docking, analog discovery, for virtual screening and drug discovery.

19 4
Explore
x-cmd/skill

astropy

Comprehensive Python library for astronomy and astrophysics. This skill should be used when working with astronomical data including celestial coordinates, physical units, FITS files, cosmological calculations, time systems, tables, world coordinate systems (WCS), and astronomical data analysis. Use when tasks involve coordinate transformations, unit conversions, FITS file manipulation, cosmological distance calculations, time scale conversions, or astronomical data processing.

19 4
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results