Agent skill
rust-testing-code-review
Reviews Rust test code for unit test patterns, integration test structure, async testing, mocking approaches, and property-based testing. Use when reviewing _test.rs files,
Install this agent skill to your Project
npx add-skill https://github.com/existential-birds/beagle/tree/main/plugins/beagle-rust/skills/rust-testing-code-review
SKILL.md
Rust Testing Code Review
Review Workflow
- Check test organization — Unit tests in
#[cfg(test)]modules, integration tests intests/directory - Check async test setup —
#[tokio::test]for async tests, proper runtime configuration - Check assertions — Meaningful messages, correct assertion type
- Check test isolation — No shared mutable state between tests, proper setup/teardown
- Check coverage patterns — Error paths tested, edge cases covered
Output Format
Report findings as:
[FILE:LINE] ISSUE_TITLE
Severity: Critical | Major | Minor | Informational
Description of the issue and why it matters.
Quick Reference
| Issue Type | Reference |
|---|---|
| Unit test structure, assertions, test organization | references/unit-tests.md |
| Integration tests, async testing, fixtures, test databases | references/integration-tests.md |
Review Checklist
Test Structure
- Unit tests in
#[cfg(test)] mod testswithin source files - Integration tests in
tests/directory (one file per module or feature) -
use super::*in test modules to access parent module items - Test function names describe the scenario:
test_<function>_<scenario>_<expected> - Tests are independent — no reliance on execution order
Async Tests
-
#[tokio::test]used for async test functions -
#[tokio::test(flavor = "multi_thread")]when testing multi-threaded behavior - No
block_oninside async tests (use.awaitdirectly) - Test timeouts set for tests that could hang
Assertions
-
assert_eq!/assert_ne!used for value comparisons (better error messages thanassert!) - Custom messages on assertions that aren't self-documenting
-
matches!macro used for enum variant checking - Error types checked with
matches!or pattern matching, not string comparison - One assertion per test where practical (easier to diagnose failures)
Mocking and Test Doubles
- Traits used as seams for dependency injection (not concrete types)
- Mock implementations kept minimal — only what the test needs
- No mocking of types you don't own (wrap external dependencies behind your own trait)
- Test fixtures as helper functions, not global state
Error Path Testing
-
Result::Errvariants tested, not just happy paths - Specific error variants checked (not just "is error")
-
#[should_panic]used sparingly — preferResult-returning tests
Test Naming
- Test names read like sentences describing behavior (not
test_happy_path) - Related tests grouped in nested
modblocks for organization - Test names follow pattern:
<function>_should_<behavior>_when_<condition>
Snapshot Testing
-
cargo instaused for complex structural output (JSON, YAML, HTML, CLI output) - Snapshots are small and focused (not huge objects)
- Redactions used for unstable fields (timestamps, UUIDs)
- Snapshots committed to git in
snapshots/directory - Simple values use
assert_eq!, not snapshots
Doc Tests
- Public API functions have
/// # Exampleswith runnable code - Doc tests serve as both documentation and correctness checks
- Hidden setup lines prefixed with
#to keep examples clean -
cargo test --docpasses (nextest doesn't run doc tests)
Severity Calibration
Critical
- Tests that pass but don't actually verify behavior (assertions on wrong values)
- Shared mutable state between tests causing flaky results
- Missing error path tests for security-critical code
Major
#[should_panic]withoutexpectedmessage (catches any panic, including wrong ones)unwrap()in test setup that hides the real failure location- Tests that depend on execution order
Minor
- Missing assertion messages on complex comparisons
assert!(x == y)instead ofassert_eq!(x, y)(worse error messages)- Test names that don't describe the scenario
- Redundant setup code that could be extracted to a helper
Informational
- Suggestions to add property-based tests via
proptestorquickcheck - Suggestions to add snapshot testing for complex output
- Coverage improvement opportunities
Valid Patterns (Do NOT Flag)
unwrap()/expect()in tests — Panicking on unexpected errors is the correct test behavioruse super::*in test modules — Standard pattern for accessing parent items#[allow(dead_code)]on test helpers — Helper functions may not be used in every testclone()in tests — Clarity over performance- Large test functions — Integration tests can be long; extracting helpers isn't always clearer
assert!for boolean checks — Fine when the expression is clearly boolean (.is_some(),.is_empty())- Multiple assertions testing one logical behavior — Sometimes one behavior needs multiple checks
unwrap()onResult-returning test functions — Propagating with?is also fine but not required
Before Submitting Findings
Load and follow beagle-rust:review-verification-protocol before reporting any issue.
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
review-python
Comprehensive Python/FastAPI backend code review with optional parallel agents
review-verification-protocol
Mandatory verification steps for all code reviews to reduce false positives. Load this skill before reporting ANY code review findings.
sqlalchemy-code-review
Reviews SQLAlchemy code for session management, relationships, N+1 queries, and migration patterns. Use when reviewing SQLAlchemy 2.0 code, checking session lifecycle, relationship() usage, or Alembic migrations.
fastapi-code-review
Reviews FastAPI code for routing patterns, dependency injection, validation, and async handlers. Use when reviewing FastAPI apps, checking APIRouter setup, Depends() usage, or response models.
pytest-code-review
Reviews pytest test code for async patterns, fixtures, parametrize, and mocking. Use when reviewing test_*.py files, checking async test functions, fixture usage, or mock patterns.
postgres-code-review
Reviews PostgreSQL code for indexing strategies, JSONB operations, connection pooling, and transaction safety. Use when reviewing SQL queries, database schemas, JSONB usage, or connection management.
Didn't find tool you were looking for?