Agent skill

gif-sticker-maker

Convert photos (people, pets, objects, logos) into 4 animated GIF stickers with captions. Use when: user wants to create cartoon stickers, GIF expressions, emoji packs, animated avatars, or convert photos to Funko Pop / Pop Mart blind box style animations. Triggers: sticker, GIF, cartoon, emoji, expression pack, avatar animation.

Stars 1,273
Forks 120

Install this agent skill in your project

npx add-skill https://github.com/poco-ai/poco-claw/tree/main/backend/assets/skills/gif-sticker-maker

Metadata

Additional technical details for this skill

style: Funko Pop / Pop Mart
sources: MiniMax Image Generation API, MiniMax Video Generation API
version: 1.2
category: creative-tools
output count: 4
output format: GIF

SKILL.md

GIF Sticker Maker

Convert user photos into 4 animated GIF stickers (Funko Pop / Pop Mart style).

Style Spec

  • Funko Pop / Pop Mart blind box 3D figurine
  • C4D / Octane rendering quality
  • White background, soft studio lighting
  • Caption: black text + white outline, bottom of image

Prerequisites

Before starting any generation step, ensure:

  1. Python venv is activated with dependencies from requirements.txt installed
  2. MINIMAX_API_KEY is exported (e.g. export MINIMAX_API_KEY='your-key')
  3. ffmpeg is available on PATH (for Step 3 GIF conversion)

If any prerequisite is missing, set it up first. Do NOT proceed to generation without all three.
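The three prerequisites can be checked programmatically before any generation step. The following is a minimal sketch, not part of the skill's own scripts: the function name `missing_prerequisites` and its injectable parameters are hypothetical, and it only checks that a venv is active, not that everything in requirements.txt is installed.

```python
import os
import shutil
import sys


def missing_prerequisites(env=None, which=shutil.which, in_venv=None):
    """Return the names of prerequisites that are not satisfied (hypothetical helper)."""
    env = os.environ if env is None else env
    if in_venv is None:
        # Inside an activated venv, sys.prefix differs from sys.base_prefix.
        in_venv = sys.prefix != sys.base_prefix

    missing = []
    if not in_venv:
        missing.append("python venv")
    if not env.get("MINIMAX_API_KEY"):
        missing.append("MINIMAX_API_KEY")
    if which("ffmpeg") is None:
        missing.append("ffmpeg")
    return missing
```

Calling `missing_prerequisites()` with no arguments inspects the real environment; an empty list means it is safe to continue to Step 0.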

Workflow

Step 0: Collect Captions

Ask the user (in their language):

"Would you like to customize the captions for your stickers, or use the defaults?"

  • Custom: Collect 4 short captions (1–3 words each). Choose each sticker's action to match the meaning of its caption.
  • Default: Look up the default captions in references/captions.md for the user's detected language. Never mix languages.
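For the custom path, the caption constraints (exactly 4 captions, 1–3 words each) can be enforced before any prompt is built. This is a rough sketch with a hypothetical `validate_captions` helper. It counts words by splitting on whitespace, which is an assumption that does not hold for languages written without spaces (for example Chinese or Japanese), so treat it as a starting point only.

```python
def validate_captions(captions):
    """Return stripped captions, or raise ValueError if the Step 0 limits are violated."""
    cleaned = [caption.strip() for caption in captions]
    if len(cleaned) != 4:
        raise ValueError(f"expected 4 captions, got {len(cleaned)}")
    for caption in cleaned:
        # Whitespace-based word count: an assumption, see the note above.
        word_count = len(caption.split())
        if not 1 <= word_count <= 3:
            raise ValueError(f"caption {caption!r} has {word_count} words; expected 1-3")
    return cleaned
```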

Step 1: Generate 4 Static Sticker Images

Tool: scripts/minimax_image.py

  1. Analyze the user's photo — identify subject type (person / animal / object / logo).
  2. For each of the 4 stickers, build a prompt from image-prompt-template.txt by filling {action} and {caption}.
  3. If subject is a person: pass --subject-ref <user_photo_path> so the generated figurine preserves the person's actual facial likeness.
  4. Generate (all 4 are independent — run concurrently):
```bash
python3 scripts/minimax_image.py "<prompt>" -o output/sticker_hi.png --ratio 1:1 --subject-ref <photo>
python3 scripts/minimax_image.py "<prompt>" -o output/sticker_laugh.png --ratio 1:1 --subject-ref <photo>
python3 scripts/minimax_image.py "<prompt>" -o output/sticker_cry.png --ratio 1:1 --subject-ref <photo>
python3 scripts/minimax_image.py "<prompt>" -o output/sticker_love.png --ratio 1:1 --subject-ref <photo>
```

Note: --subject-ref only works for person subjects (API limitation: type=character). The commands above show the person case; for animals, objects, and logos, omit the flag and rely on the text description in the prompt.
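The prompt filling and the conditional --subject-ref flag can be sketched as follows. The helper names `fill_template` and `image_command` are hypothetical. The command-line flags are the ones shown in this step, and the contents of image-prompt-template.txt are not reproduced here, so the template is passed in as a plain string.

```python
STICKER_IDS = ["hi", "laugh", "cry", "love"]  # filename IDs from the Default Actions table


def fill_template(template, action, caption):
    """Substitute the {action} and {caption} placeholders in a prompt template."""
    return template.replace("{action}", action).replace("{caption}", caption)


def image_command(prompt, sticker_id, subject_type, photo_path):
    """Build the argument list for one scripts/minimax_image.py call."""
    cmd = [
        "python3", "scripts/minimax_image.py", prompt,
        "-o", f"output/sticker_{sticker_id}.png",
        "--ratio", "1:1",
    ]
    # Only person subjects get the reference photo; other subject types
    # rely on the text description alone.
    if subject_type == "person":
        cmd += ["--subject-ref", photo_path]
    return cmd
```

Building argument lists rather than shell strings avoids quoting problems when a prompt contains spaces or quotes.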

Step 2: Animate Each Image → Video

Tool: scripts/minimax_video.py with --image flag (image-to-video mode)

For each sticker image, build a prompt from video-prompt-template.txt, then:

```bash
python3 scripts/minimax_video.py "<prompt>" --image output/sticker_hi.png -o output/sticker_hi.mp4
python3 scripts/minimax_video.py "<prompt>" --image output/sticker_laugh.png -o output/sticker_laugh.mp4
python3 scripts/minimax_video.py "<prompt>" --image output/sticker_cry.png -o output/sticker_cry.mp4
python3 scripts/minimax_video.py "<prompt>" --image output/sticker_love.png -o output/sticker_love.mp4
```

All 4 calls are independent — run concurrently.
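One possible way to run the four independent calls concurrently is a small thread pool around `subprocess.run`. This is a sketch under assumptions: `video_command` and `run_concurrently` are hypothetical helpers, and the `runner` parameter exists only so the launch step can be swapped out, for example in a dry run.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor


def video_command(prompt, sticker_id):
    """Build the argument list for one image-to-video call."""
    return [
        "python3", "scripts/minimax_video.py", prompt,
        "--image", f"output/sticker_{sticker_id}.png",
        "-o", f"output/sticker_{sticker_id}.mp4",
    ]


def run_concurrently(commands, runner=None):
    """Run each command in its own thread; return exit codes in input order."""
    if runner is None:
        runner = lambda cmd: subprocess.run(cmd).returncode
    if not commands:
        return []
    with ThreadPoolExecutor(max_workers=len(commands)) as pool:
        # pool.map yields results in the same order as the inputs.
        return list(pool.map(runner, commands))
```

A non-zero entry in the returned list identifies which sticker failed, so only that call needs to be retried.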

Step 3: Convert Videos → GIF

Tool: scripts/convert_mp4_to_gif.py

```bash
python3 scripts/convert_mp4_to_gif.py output/sticker_hi.mp4 output/sticker_laugh.mp4 output/sticker_cry.mp4 output/sticker_love.mp4
```

The script writes one GIF next to each input MP4, with the same base name (e.g. output/sticker_hi.mp4 → output/sticker_hi.gif).
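Because each GIF shares its MP4's base name, the expected outputs can be derived and verified before delivery. The helpers below are a hypothetical sketch and do not reflect how convert_mp4_to_gif.py works internally.

```python
from pathlib import Path


def expected_gif_paths(mp4_paths):
    """Map each MP4 path to the GIF path expected alongside it."""
    return [Path(p).with_suffix(".gif").as_posix() for p in mp4_paths]


def missing_gifs(mp4_paths):
    """Return the expected GIF paths that do not exist on disk."""
    return [g for g in expected_gif_paths(mp4_paths) if not Path(g).is_file()]
```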

Step 4: Deliver

Output format (strict order):

  1. Brief status line (e.g. "4 stickers created:")
  2. <deliver_assets> block with all GIF files
  3. NO text after deliver_assets
```xml
<deliver_assets>
<item><path>output/sticker_hi.gif</path></item>
<item><path>output/sticker_laugh.gif</path></item>
<item><path>output/sticker_cry.gif</path></item>
<item><path>output/sticker_love.gif</path></item>
</deliver_assets>
```
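The delivery message can be assembled from the list of GIF paths so that the status line comes first and nothing follows the block. In this sketch the function names are hypothetical and paths are inserted verbatim, on the assumption that they contain no XML special characters.

```python
def deliver_assets_block(paths):
    """Render the <deliver_assets> block, one <item> per path."""
    items = [f"<item><path>{path}</path></item>" for path in paths]
    return "\n".join(["<deliver_assets>", *items, "</deliver_assets>"])


def build_response(status_line, paths):
    """Status line first, then the block, with nothing after it."""
    return f"{status_line}\n{deliver_assets_block(paths)}"
```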

Default Actions

| # | Action | Filename ID | Animation |
|---|--------|-------------|-----------|
| 1 | Happy waving | hi | Wave hand, slight head tilt |
| 2 | Laughing hard | laugh | Shake with laughter, eyes squint |
| 3 | Crying tears | cry | Tears stream, body trembles |
| 4 | Heart gesture | love | Heart hands, eyes sparkle |

See references/captions.md for multilingual caption defaults.

Rules

  • Detect the user's language; all user-facing output follows it
  • Captions MUST come from the column of captions.md that matches the user's language; never mix languages
  • All image prompts must be in English regardless of the user's language (only the caption text is localized)
  • <deliver_assets> must be LAST in response, no text after
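The last rule is easy to verify mechanically before a response is sent. The check below is a minimal, hypothetical sketch: it ignores trailing whitespace and requires exactly one `<deliver_assets>` block at the very end of the response.

```python
def deliver_assets_is_last(response):
    """True if the response contains one <deliver_assets> block and ends with it."""
    text = response.rstrip()
    return text.count("<deliver_assets>") == 1 and text.endswith("</deliver_assets>")
```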
