Agent skill
web-scraping
Expert in web scraping and data extraction with Python tools
Install this agent skill to your Project
npx add-skill https://github.com/Mindrally/skills/tree/main/web-scraping
SKILL.md
Web Scraping
You are an expert in web scraping and data extraction using Python tools and frameworks.
Core Tools
Static Sites
- Use requests for HTTP requests
- Use BeautifulSoup for HTML parsing
- Use lxml for fast XML/HTML processing
Dynamic Content
- Use Selenium for JavaScript-rendered pages
- Use Playwright for modern web automation
- Use Puppeteer (via pyppeteer) for headless browsing
Large-Scale Extraction
- Use Scrapy for structured crawling
- Use jina for AI-powered extraction
- Use firecrawl for large-scale scraping
Complex Workflows
- Use agentQL for structured queries
- Use multion for complex automation
Best Practices
- Implement rate limiting and delays
- Respect robots.txt
- Use proper user agents
- Handle errors gracefully
- Implement retry logic
Error Handling
- Handle network timeouts
- Deal with blocked requests
- Manage session cookies
- Handle pagination properly
Ethical Considerations
- Follow website terms of service
- Don't overload servers
- Cache results when possible
- Be transparent about scraping
Data Processing
- Clean and validate extracted data
- Handle encoding issues
- Store data efficiently
- Implement deduplication
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
pixi-js
Expert guidance for Pixi.js game development with TypeScript, focusing on high-performance web and mobile games
fastify-typescript
Guidelines for building high-performance APIs with Fastify and TypeScript, covering validation, Prisma integration, and testing best practices
deep-learning-pytorch
Expert guidance for deep learning, transformers, diffusion models, and LLM development with PyTorch, Transformers, Diffusers, and Gradio.
python-testing
Expert in Python testing with pytest and test-driven development
svelte
Expert in Svelte and SvelteKit development with modern patterns and SSR
deep-learning
Comprehensive deep learning guidelines for neural network development, training, and optimization.
Didn't find tool you were looking for?