Agent skill
evaluate-diagram
Install this agent skill to your Project
npx add-skill https://github.com/llmsresearch/paperbanana/tree/main/.claude/skills/evaluate-diagram
SKILL.md
Evaluate Diagram
Evaluate a generated diagram against a human reference using PaperBanana's VLM-as-Judge scoring.
Instructions
- $ARGUMENTS[0] is the path to the generated image.
- $ARGUMENTS[1] is the path to the human reference image.
- Ask the user for:
  - Source context: the methodology text (or a file path to read it from). If the user provides a file path, read that file to get the text.
  - Figure caption: a description of what the diagram communicates.
- Call the MCP tool evaluate_diagram with:
  - generated_path: the generated image path
  - reference_path: the reference image path
  - context: the methodology text content
  - caption: the figure caption
- Present the evaluation scores to the user. Scores cover 4 dimensions: Faithfulness, Conciseness, Readability, and Aesthetics.
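The argument-gathering steps above can be sketched as follows. This is a minimal illustration, not part of PaperBanana: the helper names resolve_context and build_evaluate_args are hypothetical, and only the four keys (generated_path, reference_path, context, caption) come from the instructions. The sketch assumes that a context value naming an existing file should be read, and that anything else is already the methodology text.

```python
from pathlib import Path


def resolve_context(context_or_path: str) -> str:
    """Return the methodology text; if the value names an existing file, read it."""
    candidate = Path(context_or_path)
    try:
        if candidate.is_file():
            return candidate.read_text(encoding="utf-8")
    except OSError:
        # Long free text may not be a valid path on some platforms; treat it as text.
        pass
    return context_or_path


def build_evaluate_args(generated_path: str, reference_path: str,
                        context_or_path: str, caption: str) -> dict:
    """Assemble the four arguments named in the instructions (hypothetical helper)."""
    return {
        "generated_path": generated_path,
        "reference_path": reference_path,
        "context": resolve_context(context_or_path),
        "caption": caption,
    }


args = build_evaluate_args(
    "output.png",
    "reference.png",
    "We train a two-stage model.",  # inline text, not a file path
    "Overview of the pipeline",
)
```

The resulting dictionary is what would be passed to the evaluate_diagram tool; how the call itself is made depends on the MCP client in use.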
CLI Fallback
If the MCP tool is not available, fall back to the CLI:
paperbanana evaluate --generated <generated-img> --reference <reference-img> --context <context-file> --caption "<caption>"
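When scripting the fallback, building the command as an argument list avoids quoting problems with captions that contain spaces. The sketch below uses only the flags shown in the command above; the function name build_cli_command and the file name method.txt are hypothetical, and the command is assembled but not executed.

```python
import shlex


def build_cli_command(generated: str, reference: str,
                      context_file: str, caption: str) -> list[str]:
    """Return the fallback command as an argument list (hypothetical helper)."""
    return [
        "paperbanana", "evaluate",
        "--generated", generated,
        "--reference", reference,
        "--context", context_file,  # the source shows a context file here, not raw text
        "--caption", caption,
    ]


cmd = build_cli_command(
    "output.png", "reference.png", "method.txt", "Overview of the pipeline"
)
# shlex.join renders the list as a shell-safe string for display or logging.
print(shlex.join(cmd))
```

To actually run it, the list can be passed to subprocess.run(cmd, check=True), assuming the paperbanana CLI is installed and on PATH.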
Example
/evaluate-diagram output.png reference.png