Agent skills
google-image-search

Agent skill

google-image-search

Search and download images via Google Custom Search API with LLM-powered selection. This skill should be used when finding images for articles, presentations, research documents, or enriching Obsidian notes with relevant visuals. Supports simple queries, batch processing from JSON config, automatic config generation from terms, and full note enrichment with automatic image insertion below headings.

View SKILL.md on GitHub Repository

Stars 74

Forks 17

Install this agent skill to your Project

npx add-skill https://github.com/glebis/claude-skills/tree/main/google-image-search

SKILL.md

Google Image Search Skill

Search for images using Google Custom Search API with intelligent scoring and LLM-based selection.

When to Use

Finding images to illustrate technical articles or research
Adding visuals to presentations
Enriching Obsidian notes with relevant images
Batch image search for multiple topics
Generating image search configs from plain text lists

Requirements

Google Custom Search API key and Search Engine ID
OpenRouter API key (for LLM selection)
llm CLI installed at /opt/homebrew/bin/llm

Store credentials in .env:

Google-Custom-Search-JSON-API-KEY=your_key
Google-Custom-Search-CX=your_cx
OPENROUTER_API_KEY=your_openrouter_key

Modes of Operation

1. Simple Query

Search for a single term:

bash

python3 ~/.claude/skills/google-image-search/scripts/google_image_search.py \
  --query "neural interface wearable device" \
  --output-dir ./images \
  --num-results 5

2. Batch Processing

Process multiple queries from JSON config:

bash

python3 ~/.claude/skills/google-image-search/scripts/google_image_search.py \
  --config image_queries.json \
  --output-dir ./images \
  --llm-select

3. Generate Config from Terms

Create JSON config from a list of terms using LLM:

bash

python3 ~/.claude/skills/google-image-search/scripts/google_image_search.py \
  --generate-config \
  --terms "AlterEgo wearable" "sEMG electrodes" "BCI headset" \
  --output my_queries.json

4. Enrich Obsidian Note

Extract visual terms from note, find images, and insert below headings:

bash

python3 ~/.claude/skills/google-image-search/scripts/google_image_search.py \
  --enrich-note ~/Brains/brain/Research/neural-interfaces.md

This mode:

Detects Obsidian vault and attachments folder
Uses LLM to extract visual-worthy terms from note
Searches for images for each term
Downloads best images to attachments folder
Inserts image embeds below relevant headings
Creates backup before modifying note

Key Options

Option	Description
`--query TEXT`	Simple single query
`--config FILE`	JSON config for batch
`--generate-config`	Generate config from `--terms`
`--enrich-note FILE`	Enrich Obsidian note
`--output-dir DIR`	Where to save images
`--urls-only`	Return URLs only, no download
`--llm-select`	Use LLM to pick best image (default: on)
`--no-llm-select`	Disable LLM selection
`--num-results N`	Results per query (default: 5)
`--dry-run`	Show what would be done

JSON Config Format

Each entry supports:

json

{
  "id": "unique-id",
  "heading": "Display Heading",
  "description": "Context for what image to find",
  "query": "Google search query",
  "numResults": 5,
  "selectionCriteria": "What makes a good image",
  "requiredTerms": ["must", "have"],
  "optionalTerms": ["bonus", "terms"],
  "excludeTerms": ["stock", "clipart"],
  "preferredHosts": ["official-site.com"],
  "selectionCount": 2
}

See references/api_config_reference.md for full documentation.

Scoring System

Images are scored based on:

Required terms: -80 if missing, +30 if all present
Optional terms: +5 per match
Exclude terms: -50 per match
Preferred hosts: +25 if trusted, -5 if unknown
MIME type: +5 for PNG/JPEG, -10 for GIF
Resolution: +10 for high res, -10 for low res
File size: -5 if very small

LLM Selection

After scoring, LLM picks the best image from top candidates based on:

Title and URL metadata
Scoring reasons
Selection criteria

The LLM evaluates authenticity, clarity, and relevance for technical audiences.

Obsidian Integration

When in an Obsidian vault:

Auto-detects vault root via .obsidian folder
Uses configured attachments folder (default: Attachments)
Generates Obsidian-style embeds: ![[image.png|alt text]]
Creates backup before modifying notes

Script Files

File	Purpose
`google_image_search.py`	Main entry point
`api.py`	Google Custom Search API
`config.py`	Credentials and config handling
`download.py`	Image download with magic bytes
`evaluate.py`	Keyword-based scoring
`llm_select.py`	LLM selection and term extraction
`obsidian.py`	Vault detection and enrichment
`output.py`	Markdown output generation

Maintainer

glebis Core maintainer

Source details

Full Name: glebis/claude-skills
Branch: main
Path in repo: google-image-search

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

glebis/claude-skills

tdd

This skill should be used when the user wants to implement features or fix bugs using test-driven development. Enforces the RED-GREEN-REFACTOR cycle with vertical slicing, context isolation between test writing and implementation, human checkpoints, and auto-test feedback loops. Uses multi-agent orchestration with the Task tool for architecturally enforced context isolation. Supports Jest, Vitest, pytest, Go test, cargo test, PHPUnit, and RSpec.

74 17

Explore

glebis/claude-skills

brand-agency

Applies Agency brand colors and typography to artifacts including presentations, SVG graphics, documents, and web interfaces. This skill should be used when brand colors, visual formatting, neobrutalism style, or Agency design standards apply. Keywords - branding, corporate identity, visual identity, styling, brand colors, typography, visual formatting, visual design, neobrutalism.

74 17

Explore

glebis/claude-skills

github-gist

Publish files or Obsidian notes as GitHub Gists. Use when user wants to share code/notes publicly, create quick shareable snippets, or publish markdown to GitHub. Triggers include "publish as gist", "create gist", "share on github", "make a gist from this".

74 17

Explore

glebis/claude-skills

chrome-history

Query Chrome browsing history with natural language. Filter by date range, article type, keywords, and specific sites.

74 17

Explore

glebis/claude-skills

wispr-analytics

This skill should be used when analyzing Wispr Flow voice dictation history for self-reflection, work patterns, mental health insights, or productivity analytics. Triggered by requests like "/wispr-analytics", "analyze my dictations", "what did I dictate today", "wispr reflection", or any request to review voice dictation patterns. Supports modes - technical (coding/work), soft (communication), trends (volume/frequency), mental (sentiment/energy/rumination).

74 17

Explore

glebis/claude-skills

granola

This skill should be used when importing, listing, or exporting Granola meeting recordings and transcripts. Queries Granola's local cache and API to list meetings, extract transcripts, and export to Obsidian notes in Fathom-compatible format.

74 17

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Google Image Search Skill

When to Use

Requirements

Modes of Operation

1. Simple Query

2. Batch Processing

3. Generate Config from Terms

4. Enrich Obsidian Note

Key Options

JSON Config Format

Scoring System

LLM Selection

Obsidian Integration

Script Files

Recommended Agent Skills

tdd

brand-agency

github-gist

chrome-history

wispr-analytics

granola