Agent skill

read-webpage-content-as-markdown

Read a webpage into cleaned markdown using curl + markitdown + codex exec. Use whenever asked to read a webpage or extract article content from a URL. Static HTML only; JS/client-rendered pages require a Playwright workflow.

Stars 163
Forks 31

Install this agent skill to your Project

npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/development/read-webpage-content-as-markdown

SKILL.md

Read Webpage Content as Markdown

Use:

bash
scripts/read-webpage-content-as-markdown.sh [--navlinks] <url> [output_md]

Notes:

  • Uses curl (static HTML only); JavaScript is not executed.
  • Temp artifacts are stored under /tmp.
  • Output includes YAML frontmatter: source_url, accessed_at, commands.
  • Output path defaults to /tmp/read-webpage-content-as-markdown.<timestamp>.md; relative output paths are written under /tmp/.
  • --navlinks keeps only topic-relevant navigation links (e.g., in-page table of contents); it drops site-wide menus and unrelated links.
  • If the script reports JS/client rendering, retry with Playwright.

Didn't find tool you were looking for?

Be as detailed as possible for better results