Agent skill
web_navigator
Strategic patterns for effective browser automation — research, forms, and data extraction.
Install this agent skill to your Project
npx add-skill https://github.com/siddsachar/Thoth/tree/main/bundled_skills/web_navigator
SKILL.md
When the user asks you to browse a website, fill out a form, extract data from a page, or perform any multi-step browser interaction, apply these strategic patterns:
Research Patterns
- Snapshot First, Act Second — After every navigation or interaction, always read the snapshot before deciding your next action. Never chain clicks blindly.
- Progressive Disclosure — Start with the visible content. If you need more, scroll down and take another snapshot. Don't assume content below the fold exists or doesn't.
- Multi-Tab Research — When comparing options across sites (prices, reviews, specs), open each source in a separate tab. Gather all data first, then synthesise. This avoids losing context by navigating away.
- Read URL for Bulk Text — If you only need the text content of a page (no interaction needed), use
read_urlinstead of the browser. Reserve the browser for when you need to click, scroll, or interact.
Form Filling
- Survey the Form — Before filling anything, take a snapshot to understand all the fields. Plan the fill order based on what you see.
- Type Carefully — Use
browser_typewith the correct ref for each field. After filling critical fields (payment, addresses), snapshot to verify the values took. - Handle Dropdowns and Selects — Click the dropdown first, wait for the snapshot showing options, then click the desired option. Don't try to type into select elements.
- Confirm Before Submit — Always snapshot and summarise what you've filled in before clicking a submit button. Let the user verify.
Data Extraction
- Structured Extraction — When extracting tabular data (product listings, search results, comparison tables), present it in a clean markdown table or structured format.
- Pagination — If the data spans multiple pages, mention how many pages there are and ask whether to continue after the first page. Don't silently paginate through 50 pages.
- Save Long Results — For large extractions, offer to save results to a workspace file or memory rather than dumping everything into the chat.
Error Recovery
- Stale Refs — If a click fails or doesn't produce the expected result, take a fresh snapshot. Page state may have changed (dynamic content, overlays, redirects).
- Pop-ups and Overlays — Cookie banners, newsletter pop-ups, and chat widgets are common. Look for dismiss/close buttons in the snapshot and clear them before proceeding with the main task.
- Login Walls — If a login page appears unexpectedly, tell the user immediately and ask them to log in via the visible browser window. Snapshot after they confirm they're done.
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
deep_research
Perform multi-source research on a topic and produce a structured report.
self_reflection
Periodically review memory for contradictions, gaps, and stale information.
meeting_notes
Structure raw meeting notes into actionable minutes with follow-ups.
knowledge_base
Manage the user's personal knowledge base — knowledge graph, documents, and wiki vault.
daily_briefing
Compile a morning briefing with weather, calendar, and news headlines.
data_analyst
Analyses datasets with professional rigour — statistical summaries, clear narratives, and well-chosen visualisations.
Didn't find tool you were looking for?