read-webpage-content-as-markdown

Read a webpage into cleaned markdown using curl + markitdown + codex exec. Use whenever asked to read a webpage or extract article content from a URL. Static HTML only; JS/client-rendered pages require a Playwright workflow.

View SKILL.md on GitHub Repository

Stars 163

Forks 31

Install this agent skill to your Project

npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/development/read-webpage-content-as-markdown

SKILL.md

Read Webpage Content as Markdown

Use:

bash

scripts/read-webpage-content-as-markdown.sh [--navlinks] <url> [output_md]

Notes:

Uses curl (static HTML only); JavaScript is not executed.
Temp artifacts are stored under /tmp.
Output includes YAML frontmatter: source_url, accessed_at, commands.
Output path defaults to /tmp/read-webpage-content-as-markdown.<timestamp>.md; relative output paths are written under /tmp/.
--navlinks keeps only topic-relevant navigation links (e.g., in-page table of contents); it drops site-wide menus and unrelated links.
If the script reports JS/client rendering, retry with Playwright.

Maintainer

majiayu000 Core maintainer

Source details

Full Name: majiayu000/claude-skill-registry
Branch: main
Path in repo: skills/development/read-webpage-content-as-markdown
License: MIT License

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Read Webpage Content as Markdown