Agent skill
selenium
Provides comprehensive guidance for Selenium WebDriver including browser automation, element location, waits, and test frameworks. Use when the user asks about Selenium, needs to automate web browsers, write Selenium tests, or work with Selenium WebDriver.
Install this agent skill to your Project
npx add-skill https://github.com/partme-ai/full-stack-skills/tree/main/skills/testing-skills/selenium
SKILL.md
When to use this skill
Use this skill whenever the user wants to:
- Write or maintain browser automation tests with Selenium WebDriver
- Locate elements using CSS selectors, IDs, or relative locators
- Implement explicit and implicit waits for robust test execution
- Run tests in headless mode or across browsers via Selenium Grid
- Integrate Selenium tests into CI/CD pipelines
How to use this skill
Workflow
- Set up the environment: install browser drivers (ChromeDriver/GeckoDriver) or use Selenium 4 Manager
- Write test scripts: navigate, find elements, interact, and assert
- Add waits and error handling: use explicit waits instead of
sleep - Run in CI: configure headless mode or Grid; generate reports
1. Basic Test (Python)
from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
driver = webdriver.Chrome()
driver.get("https://example.com/login")
# Explicit wait for element
wait = WebDriverWait(driver, 10)
username = wait.until(EC.presence_of_element_located((By.ID, "username")))
username.send_keys("testuser")
driver.find_element(By.ID, "password").send_keys("secret")
driver.find_element(By.CSS_SELECTOR, "button[type='submit']").click()
# Assert login success
assert "Dashboard" in driver.title
driver.quit()
2. Basic Test (Java)
WebDriver driver = new ChromeDriver();
driver.get("https://example.com/login");
WebDriverWait wait = new WebDriverWait(driver, Duration.ofSeconds(10));
WebElement username = wait.until(ExpectedConditions.presenceOfElementLocated(By.id("username")));
username.sendKeys("testuser");
driver.findElement(By.id("password")).sendKeys("secret");
driver.findElement(By.cssSelector("button[type='submit']")).click();
assertTrue(driver.getTitle().contains("Dashboard"));
driver.quit();
3. Headless Mode
from selenium.webdriver.chrome.options import Options
options = Options()
options.add_argument("--headless")
options.add_argument("--no-sandbox")
driver = webdriver.Chrome(options=options)
Best Practices
- Use explicit waits (
WebDriverWait) instead oftime.sleepfor reliable tests - Prefer ID, CSS selector, or relative locators over fragile XPath expressions
- Keep test cases independent and repeatable; capture screenshots on failure
- Store sensitive data (URLs, credentials) in configuration files or environment variables
- Use Selenium Grid for parallel execution; ensure browser and driver versions match
Resources
- Official documentation: https://www.selenium.dev/documentation/
- Selenium Grid: https://www.selenium.dev/documentation/grid/
Keywords
selenium, WebDriver, browser automation, E2E, end-to-end testing, headless, Selenium Grid, ChromeDriver, explicit wait, CSS selector
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
ocrmypdf-batch
OCRmyPDF batch processing skill — process multiple PDFs, Docker automation, shell scripting, and CI/CD integration. Use when the user needs to OCR many PDFs, set up automated OCR pipelines, or integrate OCR into workflows.
ocrmypdf-optimize
OCRmyPDF optimization skill — compress PDFs, configure PDF/A output, JBIG2 encoding, and lossless optimization. Use when the user needs to reduce PDF file size, create archival PDF/A files, or optimize OCR output.
ocrmypdf-image
OCRmyPDF image processing skill — deskew, rotate, clean, despeckle, remove border from scanned documents. Use when the user needs to improve scanned PDF quality, fix skewed pages, remove noise, or clean up scanned documents before OCR.
ocrmypdf-api
OCRmyPDF Python API and plugin skill — use OCRmyPDF programmatically from Python, integrate with applications, and extend with plugins (EasyOCR, PaddleOCR, AppleOCR). Use when the user needs to call OCRmyPDF from Python code, build OCR pipelines, or use alternative OCR engines.
ocrmypdf
OCRmyPDF core skill — add searchable OCR text layer to scanned PDFs, convert images to searchable PDFs, support 100+ languages via Tesseract. Use when the user needs to OCR a PDF, make a scanned PDF searchable, or extract text from scanned documents.
svelte
Guides Svelte and SvelteKit development including reactive components, stores, transitions, lifecycle hooks, SSR, file-based routing, and deployment. Use when the user needs to build Svelte components, create SvelteKit applications, implement reactivity patterns, or configure Svelte with Vite.
Didn't find tool you were looking for?