Agent skill

browser

Web browser automation with AI-optimized snapshots for claude-flow agents

View SKILL.md on GitHub Repository

Stars 31,446

Forks 3,514

Install this agent skill to your Project

npx add-skill https://github.com/ruvnet/ruflo/tree/main/v3/@claude-flow/cli/.claude/skills/browser

SKILL.md

Browser Automation Skill

Web browser automation using agent-browser with AI-optimized snapshots. Reduces context by 93% using element refs (@e1, @e2) instead of full DOM.

Core Workflow

bash

# 1. Navigate to page
agent-browser open <url>

# 2. Get accessibility tree with element refs
agent-browser snapshot -i    # -i = interactive elements only

# 3. Interact using refs from snapshot
agent-browser click @e2
agent-browser fill @e3 "text"

# 4. Re-snapshot after page changes
agent-browser snapshot -i

Quick Reference

Navigation

Command	Description
`open <url>`	Navigate to URL
`back`	Go back
`forward`	Go forward
`reload`	Reload page
`close`	Close browser

Snapshots (AI-Optimized)

Command	Description
`snapshot`	Full accessibility tree
`snapshot -i`	Interactive elements only (buttons, links, inputs)
`snapshot -c`	Compact (remove empty elements)
`snapshot -d 3`	Limit depth to 3 levels
`screenshot [path]`	Capture screenshot (base64 if no path)

Interaction

Command	Description
`click <sel>`	Click element
`fill <sel> <text>`	Clear and fill input
`type <sel> <text>`	Type with key events
`press <key>`	Press key (Enter, Tab, etc.)
`hover <sel>`	Hover element
`select <sel> <val>`	Select dropdown option
`check/uncheck <sel>`	Toggle checkbox
`scroll <dir> [px]`	Scroll page

Get Info

Command	Description
`get text <sel>`	Get text content
`get html <sel>`	Get innerHTML
`get value <sel>`	Get input value
`get attr <sel> <attr>`	Get attribute
`get title`	Get page title
`get url`	Get current URL

Wait

Command	Description
`wait <selector>`	Wait for element
`wait <ms>`	Wait milliseconds
`wait --text "text"`	Wait for text
`wait --url "pattern"`	Wait for URL
`wait --load networkidle`	Wait for load state

Sessions

Command	Description
`--session <name>`	Use isolated session
`session list`	List active sessions

Selectors

Element Refs (Recommended)

bash

# Get refs from snapshot
agent-browser snapshot -i
# Output: button "Submit" [ref=e2]

# Use ref to interact
agent-browser click @e2

CSS Selectors

bash

agent-browser click "#submit"
agent-browser fill ".email-input" "test@test.com"

Semantic Locators

bash

agent-browser find role button click --name "Submit"
agent-browser find label "Email" fill "test@test.com"
agent-browser find testid "login-btn" click

Examples

Login Flow

bash

agent-browser open https://example.com/login
agent-browser snapshot -i
agent-browser fill @e2 "user@example.com"
agent-browser fill @e3 "password123"
agent-browser click @e4
agent-browser wait --url "**/dashboard"

Form Submission

bash

agent-browser open https://example.com/contact
agent-browser snapshot -i
agent-browser fill @e1 "John Doe"
agent-browser fill @e2 "john@example.com"
agent-browser fill @e3 "Hello, this is my message"
agent-browser click @e4
agent-browser wait --text "Thank you"

Data Extraction

bash

agent-browser open https://example.com/products
agent-browser snapshot -i
# Iterate through product refs
agent-browser get text @e1  # Product name
agent-browser get text @e2  # Price
agent-browser get attr @e3 href  # Link

Multi-Session (Swarm)

bash

# Session 1: Navigator
agent-browser --session nav open https://example.com
agent-browser --session nav state save auth.json

# Session 2: Scraper (uses same auth)
agent-browser --session scrape state load auth.json
agent-browser --session scrape open https://example.com/data
agent-browser --session scrape snapshot -i

Integration with Claude Flow

MCP Tools

All browser operations are available as MCP tools with browser/ prefix:

browser/open
browser/snapshot
browser/click
browser/fill
browser/screenshot
etc.

Memory Integration

bash

# Store successful patterns
npx @claude-flow/cli memory store --namespace browser-patterns --key "login-flow" --value "snapshot->fill->click->wait"

# Retrieve before similar task
npx @claude-flow/cli memory search --query "login automation"

Hooks

bash

# Pre-browse hook (get context)
npx @claude-flow/cli hooks pre-edit --file "browser-task.ts"

# Post-browse hook (record success)
npx @claude-flow/cli hooks post-task --task-id "browse-1" --success true

Tips

Always use snapshots - They're optimized for AI with refs
Prefer -i flag - Gets only interactive elements, smaller output
Use refs, not selectors - More reliable, deterministic
Re-snapshot after navigation - Page state changes
Use sessions for parallel work - Each session is isolated

Maintainer

ruvnet Core maintainer

Source details

Full Name: ruvnet/ruflo
Branch: main
Path in repo: v3/@claude-flow/cli/.claude/skills/browser
License: MIT License
Topics: claude-code anthropic-claude claude-code-skills mcp-server agents model-context-protocol codex agentic-workflow agentic-ai autonomous-agents multi-agent-systems ai-tools multi-agent agentic-framework ai-assistant huggingface swarm agentic-rag agentic-engineering swarm-intelligence

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

ruvnet/ruflo

add-model-descriptions

Add descriptions for new models from the HuggingFace router to chat-ui configuration. Use when new models are released on the router and need descriptions added to prod.yaml and dev.yaml. Triggers on requests like "add new model descriptions", "update models from router", "sync models", or when explicitly invoking /add-model-descriptions.

31,446 3,514

Explore

ruvnet/ruflo

agent-swarm-pr

Agent skill for swarm-pr - invoke with $agent-swarm-pr

31,446 3,514

Explore

ruvnet/ruflo

agent-neural-network

Agent skill for neural-network - invoke with $agent-neural-network

31,446 3,514

Explore

ruvnet/ruflo

agent-performance-analyzer

Agent skill for performance-analyzer - invoke with $agent-performance-analyzer

31,446 3,514

Explore

ruvnet/ruflo

agent-researcher

Agent skill for researcher - invoke with $agent-researcher

31,446 3,514

Explore

ruvnet/ruflo

V3 Memory Unification

Unify 6+ memory systems into AgentDB with HNSW indexing for 150x-12,500x search improvements. Implements ADR-006 (Unified Memory Service) and ADR-009 (Hybrid Memory Backend).

31,446 3,514

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Browser Automation Skill

Core Workflow

Quick Reference

Navigation

Snapshots (AI-Optimized)

Interaction

Get Info

Wait

Sessions

Selectors

Element Refs (Recommended)

CSS Selectors

Semantic Locators

Examples

Login Flow

Form Submission

Data Extraction

Multi-Session (Swarm)

Integration with Claude Flow

MCP Tools

Memory Integration

Hooks

Tips

Recommended Agent Skills

add-model-descriptions

agent-swarm-pr

agent-neural-network

agent-performance-analyzer

agent-researcher

V3 Memory Unification