Agent skill

blackbox

Delegate coding tasks to the Blackbox AI CLI agent, a multi-model agent with a built-in judge that runs each task through multiple LLMs and picks the best result. Requires the blackbox CLI and a Blackbox AI API key.

Stars 56,643

Forks 7,481

Install this agent skill to your Project

npx add-skill https://github.com/NousResearch/hermes-agent/tree/main/optional-skills/autonomous-ai-agents/blackbox

Metadata

Additional technical details for this skill

hermes: { "tags": [ "Coding-Agent", "Blackbox", "Multi-Agent", "Judge", "Multi-Model" ], "related_skills": [ "claude-code", "codex", "hermes-agent" ] }

SKILL.md

Blackbox CLI

Delegate coding tasks to Blackbox AI via the Hermes terminal. Blackbox is a multi-model coding agent CLI that dispatches tasks to multiple LLMs (Claude, Codex, Gemini, Blackbox Pro) and uses a judge to select the best implementation.

The CLI is open-source (GPL-3.0, TypeScript, forked from Gemini CLI) and supports interactive sessions, non-interactive one-shots, checkpointing, MCP, and vision model switching.

Prerequisites

Node.js 20+ installed
Blackbox CLI installed: npm install -g @blackboxai/cli

Or install from source:

git clone https://github.com/blackboxaicode/cli.git
cd cli && npm install && npm install -g .

API key from app.blackbox.ai/dashboard
Configured: run blackbox configure and enter your API key
Use pty=true in terminal calls — Blackbox CLI is an interactive terminal app
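
The prerequisites above can be verified before any task is delegated. The following is a minimal sketch in Python, not part of the skill itself: the function names `node_major` and `missing_prerequisites` are hypothetical, and the check only covers the Node.js version and whether a `blackbox` executable is on PATH. It does not verify that an API key has been configured.

```python
import re
import shutil
import subprocess
from typing import List, Optional

MIN_NODE_MAJOR = 20  # taken from the "Node.js 20+" prerequisite


def node_major(version_output: str) -> Optional[int]:
    """Extract the major version from `node --version` output such as 'v20.11.0'."""
    match = re.match(r"v?(\d+)\.", version_output.strip())
    return int(match.group(1)) if match else None


def missing_prerequisites() -> List[str]:
    """Return a list of problems; an empty list means both checks passed."""
    problems = []
    node = shutil.which("node")
    if node is None:
        problems.append("node not found on PATH")
    else:
        out = subprocess.run(
            [node, "--version"], capture_output=True, text=True
        ).stdout
        major = node_major(out)
        if major is None or major < MIN_NODE_MAJOR:
            found = out.strip() or "unknown"
            problems.append(f"Node.js {MIN_NODE_MAJOR}+ required, found {found}")
    if shutil.which("blackbox") is None:
        problems.append("blackbox CLI not found on PATH")
    return problems
```

If `missing_prerequisites()` returns a non-empty list, report the problems to the user instead of starting a Blackbox session.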

One-Shot Tasks

terminal(command="blackbox --prompt 'Add JWT authentication with refresh tokens to the Express API'", workdir="/path/to/project", pty=true)

For quick scratch work:

terminal(command="cd $(mktemp -d) && git init && blackbox --prompt 'Build a REST API for todos with SQLite'", pty=true)

Background Mode (Long Tasks)

For tasks that take minutes, use background mode so you can monitor progress:

# Start in background with PTY
terminal(command="blackbox --prompt 'Refactor the auth module to use OAuth 2.0'", workdir="~/project", background=true, pty=true)
# Returns session_id

# Monitor progress
process(action="poll", session_id="<id>")
process(action="log", session_id="<id>")

# Send input if Blackbox asks a question
process(action="submit", session_id="<id>", data="yes")

# Kill if needed
process(action="kill", session_id="<id>")
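
The monitoring pattern above amounts to a poll loop with a timeout. The sketch below shows one way to structure it in Python. The source does not specify what `process(action="poll", ...)` returns, so the loop takes a caller-supplied `is_finished` callable (a hypothetical wrapper around the poll call) rather than assuming a return shape. On timeout it only reports `False`; it never kills the session.

```python
import time
from typing import Callable


def wait_until_done(
    is_finished: Callable[[], bool],
    interval_s: float = 5.0,
    timeout_s: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Poll until is_finished() returns True or roughly timeout_s has elapsed.

    Returns True if the task finished and False on timeout.
    The caller decides what to do next; nothing is killed here.
    """
    waited = 0.0
    while True:
        if is_finished():
            return True
        if waited >= timeout_s:
            return False
        sleep(interval_s)
        waited += interval_s
```

The `sleep` parameter exists only so the loop can be exercised without real delays; in normal use the default `time.sleep` applies.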

Checkpoints & Resume

Blackbox CLI has built-in checkpoint support for pausing and resuming tasks:

# After a task completes, Blackbox shows a checkpoint tag
# Resume with a follow-up task:
terminal(command="blackbox --resume-checkpoint 'task-abc123-2026-03-06' --prompt 'Now add rate limiting to the endpoints'", workdir="~/project", pty=true)

Session Commands

During an interactive session, use these commands:

Command	Effect
`/compress`	Shrink conversation history to save tokens
`/clear`	Wipe history and start fresh
`/stats`	View current token usage
`Ctrl+C`	Cancel current operation

PR Reviews

Clone to a temp directory to avoid modifying the working tree:

terminal(command="REVIEW=$(mktemp -d) && git clone https://github.com/user/repo.git $REVIEW && cd $REVIEW && gh pr checkout 42 && blackbox --prompt 'Review this PR against main. Check for bugs, security issues, and code quality.'", pty=true)

Parallel Work

Spawn multiple Blackbox instances for independent tasks:

terminal(command="blackbox --prompt 'Fix the login bug'", workdir="/tmp/issue-1", background=true, pty=true)
terminal(command="blackbox --prompt 'Add unit tests for auth'", workdir="/tmp/issue-2", background=true, pty=true)

# Monitor all
process(action="list")

Multi-Model Mode

Blackbox's distinguishing feature is running the same task through multiple models and judging the results. Choose the models with blackbox configure: selecting multiple providers enables the Chairman/judge workflow, in which the CLI evaluates the outputs from the different models and picks the best one.

Key Flags

Flag or command	Effect
`--prompt "task"`	Non-interactive one-shot execution
`--resume-checkpoint "tag"`	Resume from a saved checkpoint
`--yolo`	Auto-approve all actions and model switches
`blackbox session`	Start interactive chat session
`blackbox configure`	Change settings, providers, models
`blackbox info`	Display system information
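
Prompts that contain apostrophes or other shell metacharacters will break a hand-quoted command string. As an illustration, the sketch below assembles a command from the three flags in the table using Python's `shlex.join`, which quotes each argument. The helper name `build_blackbox_command` is hypothetical, and the order in which the flags are emitted is an assumption; the source does not say whether the CLI is order-sensitive.

```python
import shlex
from typing import Optional


def build_blackbox_command(
    prompt: str,
    checkpoint: Optional[str] = None,
    yolo: bool = False,
) -> str:
    """Build a shell-safe `blackbox` one-shot command string."""
    parts = ["blackbox"]
    if checkpoint:
        # Resume from a checkpoint tag shown after an earlier task
        parts += ["--resume-checkpoint", checkpoint]
    if yolo:
        # Auto-approves all actions and model switches; use deliberately
        parts.append("--yolo")
    parts += ["--prompt", prompt]
    # shlex.join quotes every argument for a POSIX shell
    return shlex.join(parts)
```

The returned string can be passed as the `command` argument of a `terminal(...)` call, still with `pty=true`.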

Vision Support

Blackbox automatically detects images in the input and can switch to a vision-capable model for multimodal analysis. The VLM (vision-language model) switch modes are:

"once" — Switch model for current query only
"session" — Switch for entire session
"persist" — Stay on current model (no switch)

Token Limits

Control token usage via .blackboxcli/settings.json:

{
  "sessionTokenLimit": 32000
}
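
To set the limit without overwriting other settings, the file can be updated in place. This is a minimal sketch with two assumptions that the source does not confirm: that `.blackboxcli/settings.json` is resolved relative to the project directory, and that the file is plain JSON without comments. The function name `set_session_token_limit` is hypothetical.

```python
import json
from pathlib import Path


def set_session_token_limit(project_dir: str, limit: int) -> Path:
    """Set sessionTokenLimit in <project_dir>/.blackboxcli/settings.json.

    Existing keys in the file are preserved; the file and its directory
    are created if they do not exist yet.
    """
    settings_path = Path(project_dir) / ".blackboxcli" / "settings.json"
    settings = {}
    if settings_path.exists():
        settings = json.loads(settings_path.read_text(encoding="utf-8"))
    settings["sessionTokenLimit"] = limit
    settings_path.parent.mkdir(parents=True, exist_ok=True)
    settings_path.write_text(json.dumps(settings, indent=2) + "\n", encoding="utf-8")
    return settings_path
```

For example, `set_session_token_limit("/path/to/project", 32000)` writes the same value as the JSON fragment above.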

Rules

Always use pty=true: Blackbox CLI is an interactive terminal app and will hang without a PTY
Use workdir: keep the agent focused on the right directory
Background for long tasks: use background=true and monitor with the process tool
Don't interfere: monitor with poll/log, and don't kill a session just because it is slow
Report results: after completion, check what changed and summarize for the user
Credits cost money: Blackbox uses a credit-based system, and multi-model mode consumes credits faster
Check prerequisites: verify that the blackbox CLI is installed before attempting delegation

Maintainer

NousResearch Core maintainer

Source details

Full Name: NousResearch/hermes-agent
Branch: main
Path in repo: optional-skills/autonomous-ai-agents/blackbox
License: MIT License
Topics: ai claude-code anthropic claude ai-agents clawdbot llm openclaw ai-agent codex chatgpt moltbot openai hermes hermes-agent nous-research
