Agent skill

dogfood

Systematic exploratory QA testing of web applications — find bugs, capture evidence, and generate structured reports


Stars 56,643

Forks 7,481

Install this agent skill to your Project

npx add-skill https://github.com/NousResearch/hermes-agent/tree/main/skills/dogfood

Metadata

Additional technical details for this skill

hermes: { "tags": [ "qa", "testing", "browser", "web", "dogfood" ], "related_skills": [] }

SKILL.md

Dogfood: Systematic Web Application QA Testing

Overview

This skill guides you through systematic exploratory QA testing of web applications using the browser toolset. You will navigate the application, interact with elements, capture evidence of issues, and produce a structured bug report.

Prerequisites

Browser toolset must be available (browser_navigate, browser_snapshot, browser_click, browser_type, browser_vision, browser_console, browser_scroll, browser_back, browser_press)
A target URL and testing scope from the user

Inputs

The user provides:

Target URL — the entry point for testing
Scope — what areas/features to focus on (or "full site" for comprehensive testing)
Output directory (optional) — where to save screenshots and the report (default: ./dogfood-output)

Workflow

Follow this 5-phase systematic workflow:

Phase 1: Plan

Create the output directory structure:

{output_dir}/
├── screenshots/       # Evidence screenshots
└── report.md          # Final report (generated in Phase 5)

Identify the testing scope based on user input.
Build a rough sitemap by planning which pages and features to test:
- Landing/home page
- Navigation links (header, footer, sidebar)
- Key user flows (sign up, login, search, checkout, etc.)
- Forms and interactive elements
- Edge cases (empty states, error pages, 404s)
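
The directory layout described at the start of this phase could be created with a few lines of Python. This is a minimal sketch, not part of the skill itself: the helper name `prepare_output_dir` is hypothetical, and it assumes the agent has local file system access.

```python
from pathlib import Path


def prepare_output_dir(output_dir: str = "./dogfood-output") -> Path:
    """Create {output_dir}/screenshots/ and return the root path.

    report.md is deliberately not created here; it is generated in Phase 5.
    """
    root = Path(output_dir)
    # parents=True creates the root as well; exist_ok=True makes reruns safe.
    (root / "screenshots").mkdir(parents=True, exist_ok=True)
    return root
```

Calling `prepare_output_dir()` with no argument falls back to the default `./dogfood-output` location listed under Inputs.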

Phase 2: Explore

For each page or feature in your plan:

Navigate to the page:

browser_navigate(url="https://example.com/page")

Take a snapshot to understand the DOM structure:
```
browser_snapshot()
```
Check the console for JavaScript errors:
```
browser_console(clear=true)
```
Do this after every navigation and after every significant interaction. Silent JS errors are high-value findings.
Take an annotated screenshot to visually assess the page and identify interactive elements:
```
browser_vision(question="Describe the page layout, identify any visual issues, broken elements, or accessibility concerns", annotate=true)
```
The annotate=true flag overlays numbered [N] labels on interactive elements. Each [N] maps to ref @eN for subsequent browser commands.
Test interactive elements systematically:
- Click buttons and links: browser_click(ref="@eN")
- Fill forms: browser_type(ref="@eN", text="test input")
- Test keyboard navigation: browser_press(key="Tab"), browser_press(key="Enter")
- Scroll through content: browser_scroll(direction="down")
- Test form validation with invalid inputs
- Test empty submissions
After each interaction, check for:
- Console errors: browser_console()
- Visual changes: browser_vision(question="What changed after the interaction?")
- Expected vs actual behavior
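
For the invalid-input and empty-submission checks above, it can help to keep a small, labeled set of probe strings. The sketch below is illustrative only: the function name `edge_case_inputs` and the specific values are assumptions, not something the skill prescribes. Each value would be passed as the `text` argument of `browser_type`.

```python
from typing import Dict


def edge_case_inputs(max_len: int = 5000) -> Dict[str, str]:
    """Return labeled probe strings for form fields (illustrative, not exhaustive)."""
    return {
        "empty": "",                            # empty submission
        "whitespace_only": "   ",               # often slips past "required" checks
        "very_long": "a" * max_len,             # very long text
        "special_characters": "<>&\"'%;()",     # markup and quoting characters
        "unicode": "Zoë 日本語",                 # non-ASCII text
        "malformed_email": "user@@example",     # invalid format for email fields
    }


# Print each probe with its label so it can be copied into a test step.
for label, text in edge_case_inputs(max_len=20).items():
    print(f"{label}: {text!r}")
```

Recording the label alongside each finding makes the steps to reproduce easier to write in Phase 3.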

Phase 3: Collect Evidence

For every issue found:

Take a screenshot showing the issue:
```
browser_vision(question="Capture and describe the issue visible on this page", annotate=false)
```
Save the screenshot_path from the response — you will reference it in the report.
Record the details:
- URL where the issue occurs
- Steps to reproduce
- Expected behavior
- Actual behavior
- Console errors (if any)
- Screenshot path
Classify the issue using the issue taxonomy (see references/issue-taxonomy.md):
- Severity: Critical / High / Medium / Low
- Category: Functional / Visual / Accessibility / Console / UX / Content
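
The fields recorded in this phase map naturally onto a small record type. The following is a sketch under stated assumptions: the class name `Issue` and its field names are hypothetical, and only the severity and category values come from the lists above (the contents of references/issue-taxonomy.md are not reproduced here).

```python
from dataclasses import dataclass, field
from typing import List, Optional

SEVERITIES = ("Critical", "High", "Medium", "Low")
CATEGORIES = ("Functional", "Visual", "Accessibility", "Console", "UX", "Content")


@dataclass
class Issue:
    title: str
    url: str                        # URL where the issue occurs
    steps_to_reproduce: List[str]
    expected: str
    actual: str
    severity: str
    category: str
    console_errors: List[str] = field(default_factory=list)
    screenshot_path: Optional[str] = None  # screenshot_path from browser_vision

    def __post_init__(self) -> None:
        # Reject values outside the taxonomy early, before they reach the report.
        if self.severity not in SEVERITIES:
            raise ValueError(f"unknown severity: {self.severity!r}")
        if self.category not in CATEGORIES:
            raise ValueError(f"unknown category: {self.category!r}")
```

Validating at construction time keeps typos such as "Hi" or "Functionnal" from skewing the counts in Phase 4.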

Phase 4: Categorize

Review all collected issues.
De-duplicate — merge issues that are the same bug manifesting in different places.
Assign final severity and category to each issue.
Sort by severity (Critical first, then High, Medium, Low).
Count issues by severity and category for the executive summary.
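
The steps above can be sketched as a single function. This is a rough illustration, not the skill's own logic: it assumes issues are plain dicts with `title`, `severity`, `category`, and `urls` keys, and it treats two issues as duplicates when their titles match after trimming and lowercasing. Real de-duplication usually needs human judgment.

```python
from collections import Counter

SEVERITY_ORDER = {"Critical": 0, "High": 1, "Medium": 2, "Low": 3}


def categorize(issues):
    """De-duplicate, sort by severity, and count issues.

    Returns (ordered_issues, counts_by_severity, counts_by_category).
    """
    merged = {}
    for issue in issues:
        key = issue["title"].strip().lower()  # assumption: same title = same bug
        if key in merged:
            kept = merged[key]
            # Record every place the bug showed up.
            kept["urls"] = sorted(set(kept["urls"]) | set(issue["urls"]))
            # Keep the most severe rating seen for the merged issue.
            if SEVERITY_ORDER[issue["severity"]] < SEVERITY_ORDER[kept["severity"]]:
                kept["severity"] = issue["severity"]
        else:
            merged[key] = {**issue, "urls": list(issue["urls"])}

    # sorted() is stable, so issues of equal severity keep discovery order.
    ordered = sorted(merged.values(), key=lambda i: SEVERITY_ORDER[i["severity"]])
    by_severity = Counter(i["severity"] for i in ordered)
    by_category = Counter(i["category"] for i in ordered)
    return ordered, by_severity, by_category
```

The two counters feed directly into the executive summary of the report.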

Phase 5: Report

Generate the final report using the template at templates/dogfood-report-template.md.

The report must include:

Executive summary with total issue count, breakdown by severity, and testing scope
Per-issue sections with:
- Issue number and title
- Severity and category badges
- URL where observed
- Description of the issue
- Steps to reproduce
- Expected vs actual behavior
- Screenshot references (use MEDIA:<screenshot_path> for inline images)
- Console errors if relevant
Summary table of all issues
Testing notes — what was tested, what was not, any blockers

Save the report to {output_dir}/report.md.
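
As a rough illustration of the required report contents, the sketch below renders a simplified Markdown report from a list of issue dicts. It does not reproduce templates/dogfood-report-template.md, whose contents are not shown here; the function name `render_report`, the dict keys, and the heading layout are all assumptions.

```python
from collections import Counter


def render_report(target_url, scope, issues):
    """Render a simplified report; `issues` is a list of dicts (hypothetical shape)."""
    counts = Counter(issue["severity"] for issue in issues)
    lines = [
        "# Dogfood QA Report",
        "",
        "## Executive Summary",
        f"- Target: {target_url}",
        f"- Scope: {scope}",
        f"- Total issues: {len(issues)}",
    ]
    for severity in ("Critical", "High", "Medium", "Low"):
        lines.append(f"- {severity}: {counts.get(severity, 0)}")

    for number, issue in enumerate(issues, start=1):
        lines += [
            "",
            f"## Issue {number}: {issue['title']}",
            f"- Severity: {issue['severity']}",
            f"- Category: {issue['category']}",
            f"- URL: {issue['url']}",
        ]
        # MEDIA:<screenshot_path> lets the screenshot render inline.
        if issue.get("screenshot_path"):
            lines.append(f"MEDIA:{issue['screenshot_path']}")
    return "\n".join(lines) + "\n"
```

The returned string would then be written to `{output_dir}/report.md`; the remaining required sections (steps to reproduce, expected vs actual behavior, summary table, testing notes) are omitted here for brevity.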

Tools Reference

| Tool | Purpose |
| --- | --- |
| `browser_navigate` | Go to a URL |
| `browser_snapshot` | Get DOM text snapshot (accessibility tree) |
| `browser_click` | Click an element by ref (`@eN`) or text |
| `browser_type` | Type into an input field |
| `browser_scroll` | Scroll up/down on the page |
| `browser_back` | Go back in browser history |
| `browser_press` | Press a keyboard key |
| `browser_vision` | Screenshot + AI analysis; use `annotate=true` for element labels |
| `browser_console` | Get JS console output and errors |

Tips

Always check browser_console() after navigating and after significant interactions. Silent JS errors are among the most valuable findings.
Use annotate=true with browser_vision when you need to reason about interactive element positions or when the snapshot refs are unclear.
Test with both valid and invalid inputs — form validation bugs are common.
Scroll through long pages — content below the fold may have rendering issues.
Test navigation flows — click through multi-step processes end-to-end.
Check responsive behavior: note any layout issues visible in screenshots, such as overlapping, clipped, or overflowing elements.
Don't forget edge cases: empty states, very long text, special characters, rapid clicking.
When reporting screenshots to the user, include MEDIA:<screenshot_path> so they can see the evidence inline.

Maintainer

NousResearch Core maintainer

Source details

Full Name: NousResearch/hermes-agent
Branch: main
Path in repo: skills/dogfood
License: MIT License
Topics: ai claude-code anthropic claude ai-agents clawdbot llm openclaw ai-agent codex chatgpt moltbot openai hermes hermes-agent nous-research

