Agent skill
meme-generation
Generate real meme images by picking a template and overlaying text with Pillow. Produces actual .png meme files.
Install this agent skill to your Project
npx add-skill https://github.com/NousResearch/hermes-agent/tree/main/optional-skills/creative/meme-generation
Metadata
Additional technical details for this skill
- hermes
-
{ "tags": [ "creative", "memes", "humor", "images" ], "category": "creative", "related_skills": [ "ascii-art", "generative-widgets" ] }
SKILL.md
Meme Generation
Generate actual meme images from a topic. Picks a template, writes captions, and renders a real .png file with text overlay.
When to Use
- User asks you to make or generate a meme
- User wants a meme about a specific topic, situation, or frustration
- User says "meme this" or similar
Available Templates
The script supports any of the ~100 popular imgflip templates by name or ID, plus 10 curated templates with hand-tuned text positioning.
Curated Templates (custom text placement)
| ID | Name | Fields | Best for |
|---|---|---|---|
this-is-fine |
This is Fine | top, bottom | chaos, denial |
drake |
Drake Hotline Bling | reject, approve | rejecting/preferring |
distracted-boyfriend |
Distracted Boyfriend | distraction, current, person | temptation, shifting priorities |
two-buttons |
Two Buttons | left, right, person | impossible choice |
expanding-brain |
Expanding Brain | 4 levels | escalating irony |
change-my-mind |
Change My Mind | statement | hot takes |
woman-yelling-at-cat |
Woman Yelling at Cat | woman, cat | arguments |
one-does-not-simply |
One Does Not Simply | top, bottom | deceptively hard things |
grus-plan |
Gru's Plan | step1-3, realization | plans that backfire |
batman-slapping-robin |
Batman Slapping Robin | robin, batman | shutting down bad ideas |
Dynamic Templates (from imgflip API)
Any template not in the curated list can be used by name or imgflip ID. These get smart default text positioning (top/bottom for 2-field, evenly spaced for 3+). Search with:
python "$SKILL_DIR/scripts/generate_meme.py" --search "disaster"
Procedure
Mode 1: Classic Template (default)
- Read the user's topic and identify the core dynamic (chaos, dilemma, preference, irony, etc.)
- Pick the template that best matches. Use the "Best for" column, or search with
--search. - Write short captions for each field (8-12 words max per field, shorter is better).
- Find the skill's script directory:
SKILL_DIR=$(dirname "$(find ~/.hermes/skills -path '*/meme-generation/SKILL.md' 2>/dev/null | head -1)") - Run the generator:
bash
python "$SKILL_DIR/scripts/generate_meme.py" <template_id> /tmp/meme.png "caption 1" "caption 2" ... - Return the image with
MEDIA:/tmp/meme.png
Mode 2: Custom AI Image (when image_generate is available)
Use this when no classic template fits, or when the user wants something original.
- Write the captions first.
- Use
image_generateto create a scene that matches the meme concept. Do NOT include any text in the image prompt — text will be added by the script. Describe only the visual scene. - Find the generated image path from the image_generate result URL. Download it to a local path if needed.
- Run the script with
--imageto overlay text, choosing a mode:- Overlay (text directly on image, white with black outline):
bash
python "$SKILL_DIR/scripts/generate_meme.py" --image /path/to/scene.png /tmp/meme.png "top text" "bottom text" - Bars (black bars above/below with white text — cleaner, always readable):
bash
python "$SKILL_DIR/scripts/generate_meme.py" --image /path/to/scene.png --bars /tmp/meme.png "top text" "bottom text"
--barswhen the image is busy/detailed and text would be hard to read on top of it. - Overlay (text directly on image, white with black outline):
- Verify with vision (if
vision_analyzeis available): Check the result looks good:If the vision model flags issues (text hard to read, bad placement, etc.), try the other mode (switch between overlay and bars) or regenerate the scene.vision_analyze(image_url="/tmp/meme.png", question="Is the text legible and well-positioned? Does the meme work visually?") - Return the image with
MEDIA:/tmp/meme.png
Examples
"debugging production at 2 AM":
python generate_meme.py this-is-fine /tmp/meme.png "SERVERS ARE ON FIRE" "This is fine"
"choosing between sleep and one more episode":
python generate_meme.py drake /tmp/meme.png "Getting 8 hours of sleep" "One more episode at 3 AM"
"the stages of a Monday morning":
python generate_meme.py expanding-brain /tmp/meme.png "Setting an alarm" "Setting 5 alarms" "Sleeping through all alarms" "Working from bed"
Listing Templates
To see all available templates:
python generate_meme.py --list
Pitfalls
- Keep captions SHORT. Memes with long text look terrible.
- Match the number of text arguments to the template's field count.
- Pick the template that fits the joke structure, not just the topic.
- Do not generate hateful, abusive, or personally targeted content.
- The script caches template images in
scripts/.cache/after first download.
Verification
The output is correct if:
- A .png file was created at the output path
- Text is legible (white with black outline) on the template
- The joke lands — caption matches the template's intended structure
- File can be delivered via MEDIA: path
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
agentmail
Give the agent its own dedicated email inbox via AgentMail. Send, receive, and manage email autonomously using agent-owned email addresses (e.g. hermes-agent@agentmail.to).
base
Query Base (Ethereum L2) blockchain data with USD pricing — wallet balances, token info, transaction details, gas analysis, contract inspection, whale detection, and live network stats. Uses Base RPC + CoinGecko. No API key required.
solana
Query Solana blockchain data with USD pricing — wallet balances, token portfolios with values, transaction details, NFTs, whale detection, and live network stats. Uses Solana RPC + CoinGecko. No API key required.
one-three-one-rule
Structured decision-making framework for technical proposals and trade-off analysis. When the user faces a choice between multiple approaches (architecture decisions, tool selection, refactoring strategies, migration paths), this skill produces a 1-3-1 format: one clear problem statement, three distinct options with pros/cons, and one concrete recommendation with definition of done and implementation plan. Use when the user asks for a "1-3-1", says "give me options", or needs help choosing between competing approaches.
fastmcp
Build, test, inspect, install, and deploy MCP servers with FastMCP in Python. Use when creating a new MCP server, wrapping an API or database as MCP tools, exposing resources or prompts, or preparing a FastMCP server for Claude Code, Cursor, or HTTP deployment.
qdrant-vector-search
High-performance vector similarity search engine for RAG and semantic search. Use when building production RAG systems requiring fast nearest neighbor search, hybrid search with filtering, or scalable vector storage with Rust-powered performance.
Didn't find tool you were looking for?