Agent skill

analyzing-test-quality

Automatically activated when user asks about test quality, code coverage, test reliability, test maintainability, or wants to analyze their test suite. Provides framework-agnostic test quality analysis and improvement recommendations. Does NOT provide framework-specific patterns - use jest-testing or playwright-testing for those.


Install this agent skill to your Project

npx add-skill https://github.com/C0ntr0lledCha0s/claude-code-plugin-automations/tree/main/testing-expert/skills/analyzing-test-quality

SKILL.md

Analyzing Test Quality

You are an expert in test quality analysis with deep knowledge of testing principles, patterns, and metrics that apply across all testing frameworks.

Your Capabilities

Quality Metrics: Coverage, mutation score, test effectiveness
Test Patterns: AAA, GWT, fixtures, factories, page objects
Anti-Patterns: Flaky tests, test pollution, over-mocking
Maintainability: DRY, readability, test organization
Reliability: Determinism, isolation, independence
Coverage Analysis: Statement, branch, function, line coverage

When to Use This Skill

Claude should automatically invoke this skill when:

The user asks about test quality or test effectiveness
Code coverage reports or metrics are discussed
Test reliability or flakiness is mentioned
Test organization or refactoring is needed
General test improvement is requested

How to Use This Skill

Accessing Resources

Use {baseDir} to reference files in this skill directory:

Scripts: {baseDir}/scripts/
Documentation: {baseDir}/references/
Templates: {baseDir}/assets/

Available Resources

This skill includes ready-to-use resources in {baseDir}:

references/quality-checklist.md - Printable test quality checklist with scoring guide
assets/quality-report.template.md - Complete template for test quality assessment reports
scripts/calculate-metrics.sh - Calculates test metrics (test count, ratios, patterns, assertions)

Test Quality Dimensions

1. Correctness

Tests accurately verify intended behavior:

Tests match requirements
Assertions are complete
Edge cases are covered
Error scenarios are tested

2. Readability

Tests are easy to understand:

Clear naming (what is being tested)
Proper structure (AAA/GWT pattern)
Minimal setup noise
Self-documenting code

3. Maintainability

Tests are easy to modify:

DRY with appropriate helpers
Focused tests (single responsibility)
Proper abstraction level
Clear dependencies

4. Reliability

Tests produce consistent results:

No timing dependencies
Proper isolation
Deterministic data
Independent execution
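
As a minimal sketch of removing a timing dependency, the function below takes its clock as a parameter so a test can supply a fixed time. The `Session` type and `isExpired` function are hypothetical names invented for this illustration.

```typescript
// Hypothetical domain type, used only for illustration.
interface Session {
  createdAt: number; // epoch milliseconds
  ttlMs: number;     // time to live in milliseconds
}

// The clock is a parameter, so tests never depend on the real time.
function isExpired(session: Session, now: () => number = Date.now): boolean {
  return now() - session.createdAt >= session.ttlMs;
}

// A fixed clock makes the result identical on every run.
const fixedNow = () => 1_000_000;
const session: Session = { createdAt: 1_000_000 - 5_000, ttlMs: 10_000 };

// 5 seconds old with a 10 second TTL: not expired.
const freshResult = isExpired(session, fixedNow);
// Exactly 10 seconds old: expired (the boundary is inclusive).
const expiredResult = isExpired(session, () => 1_000_000 + 5_000);
```

Production code can keep calling `isExpired(session)` with the default clock, while tests pass their own.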

5. Speed

Tests run efficiently:

Appropriate test pyramid
Efficient setup/teardown
Proper mocking strategy
Parallel execution

Test Quality Checklist

Structure

Uses AAA (Arrange-Act-Assert) or GWT pattern
One logical assertion per test
Descriptive test names
Proper describe/context nesting
Appropriate setup/teardown

Coverage

Happy path scenarios
Error/edge cases
Boundary conditions
Integration points
Security scenarios

Reliability

No timing dependencies
Proper async handling
Isolated tests (no shared state)
Deterministic data
Order-independent

Maintainability

Reusable fixtures/factories
Clear variable naming
Focused assertions
Appropriate abstraction
No magic numbers/strings

Common Anti-Patterns

Test Pollution

typescript

// BAD: Shared mutable state
let count = 0;
beforeEach(() => count++);

// GOOD: Reset in setup
let count: number;
beforeEach(() => { count = 0; });
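
The effect of shared state can be sketched without a test framework. In the example below, the `Cart` class and the two runner functions are hypothetical; they only model what happens when tests share one object versus receiving a fresh one, which is what a resetting `beforeEach` achieves.

```typescript
// Hypothetical system under test: a cart that accumulates items.
class Cart {
  private items: string[] = [];
  add(item: string): number {
    this.items.push(item);
    return this.items.length;
  }
}

// Each "test" receives a cart and reports the size after adding one item.
type CartTest = (cart: Cart) => number;
const addsApple: CartTest = (cart) => cart.add("apple");
const addsPear: CartTest = (cart) => cart.add("pear");

// Polluted: one cart shared by all tests, so results depend on order.
function runShared(tests: CartTest[]): number[] {
  const shared = new Cart();
  return tests.map((t) => t(shared));
}

// Isolated: a fresh cart per test.
function runIsolated(tests: CartTest[]): number[] {
  return tests.map((t) => t(new Cart()));
}

const sharedResults = runShared([addsApple, addsPear]);     // [1, 2]
const isolatedResults = runIsolated([addsApple, addsPear]); // [1, 1]
```

With the shared cart, the second test sees a size of 2 only because the first test ran before it; reordering or running it alone changes the outcome.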

Over-Mocking

Mocking too much hides bugs and makes tests brittle.

typescript

// BAD: Mock everything - test only verifies mocks
// Jest
jest.mock('./dep1');
jest.mock('./dep2');
jest.mock('./dep3');

// Vitest
vi.mock('./dep1');
vi.mock('./dep2');
vi.mock('./dep3');

// GOOD: Mock boundaries only
// Mock external services, keep internal logic real
jest.mock('./api'); // External service only (vi.mock('./api') in Vitest)
// Test actual business logic

Flaky Assertions

typescript

// BAD: Timing dependent
await delay(100);
expect(element).toBeVisible();

// GOOD: Wait for condition
// Testing Library
await waitFor(() => expect(element).toBeVisible());

// Playwright
await expect(element).toBeVisible();

Mystery Guest

typescript

// BAD: Hidden dependencies
test('should process', () => {
  const result = process(); // Uses global data
  expect(result).toBe(42);
});

// GOOD: Explicit setup
test('should process input', () => {
  const input = createInput({ value: 21 });
  const result = process(input);
  expect(result).toBe(42);
});
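
The `createInput` helper in the GOOD example is not defined in the snippet. One possible shape, assuming a hypothetical `ProcessInput` type and a stand-in `processInput` function, is a factory with defaults that each test overrides explicitly:

```typescript
// Hypothetical input shape; the original snippet does not define it.
interface ProcessInput {
  value: number;
  multiplier: number;
}

// Factory with defaults; a test overrides only the fields it cares about,
// so the data that drives the assertion is visible in the test body.
function createInput(overrides: Partial<ProcessInput> = {}): ProcessInput {
  return { value: 0, multiplier: 2, ...overrides };
}

// Hypothetical stand-in for the function under test.
function processInput(input: ProcessInput): number {
  return input.value * input.multiplier;
}

const result = processInput(createInput({ value: 21 })); // 42
```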

Assertion Roulette

typescript

// BAD: Multiple unrelated assertions
test('should work', () => {
  expect(user.name).toBe('John');
  expect(items.length).toBe(3);
  expect(total).toBe(100);
});

// GOOD: Focused assertions
test('should set user name', () => {
  expect(user.name).toBe('John');
});

test('should have correct item count', () => {
  expect(items).toHaveLength(3);
});

Mutation Testing

Mutation testing validates test effectiveness by modifying code and checking if tests catch the changes.

Concept

Mutants are created by modifying source code (changing operators, values, etc.)
Tests run against each mutant
Killed mutants = tests caught the change (good!)
Survived mutants = tests missed the change (weak tests)
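
The killed/survived distinction can be illustrated by hand, without a mutation tool. The sketch below uses a hypothetical `isMinor` function, one hand-written mutant, and two test suites modelled as plain functions; a real tool such as Stryker generates and runs mutants automatically.

```typescript
type Predicate = (age: number) => boolean;

// Original code and a hand-written mutant with the boundary operator changed.
const isMinor: Predicate = (age) => age < 18;
const isMinorMutant: Predicate = (age) => age <= 18;

// A suite is modelled as a function that returns true when all checks pass.
type Suite = (impl: Predicate) => boolean;

// Weak suite: never probes the boundary value 18.
const weakSuite: Suite = (impl) => impl(10) === true && impl(30) === false;

// Strong suite: adds the boundary case.
const strongSuite: Suite = (impl) =>
  impl(10) === true && impl(30) === false && impl(18) === false;

// A mutant is killed when a suite passes on the original but fails on the mutant.
function kills(suite: Suite, original: Predicate, mutant: Predicate): boolean {
  return suite(original) && !suite(mutant);
}

const weakKills = kills(weakSuite, isMinor, isMinorMutant);     // false: mutant survives
const strongKills = kills(strongSuite, isMinor, isMinorMutant); // true: mutant killed
```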

Stryker Setup

bash

# Install Stryker
npm install -D @stryker-mutator/core

# For specific frameworks
npm install -D @stryker-mutator/jest-runner      # Jest
npm install -D @stryker-mutator/vitest-runner    # Vitest
npm install -D @stryker-mutator/mocha-runner     # Mocha

# Initialize configuration
npx stryker init

Stryker Configuration

javascript

// stryker.conf.js
module.exports = {
  packageManager: 'npm',
  reporters: ['html', 'clear-text', 'progress'],
  testRunner: 'jest',
  coverageAnalysis: 'perTest',

  // What to mutate
  mutate: [
    'src/**/*.ts',
    '!src/**/*.test.ts',
    '!src/**/*.spec.ts',
  ],

  // Mutation types to use
  mutator: {
    excludedMutations: [
      'StringLiteral', // Skip string mutations
    ],
  },

  // Thresholds
  thresholds: {
    high: 80,
    low: 60,
    break: 50, // Fail CI if below this
  },
};

Interpreting Results

Mutation score: 81.4% (85.4% of covered code)
Killed: 170 | Survived: 30 | Timeout: 5 | No coverage: 10

High score (>80%): Tests are effective
Medium score (60-80%): Some weak areas
Low score (<60%): Tests need significant improvement
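
The sample counts can be turned into scores with a few lines of arithmetic. This sketch assumes Stryker's documented metric definitions, in which timeouts count as detected and no-coverage mutants count as undetected; the function and type names are hypothetical.

```typescript
interface MutationCounts {
  killed: number;
  survived: number;
  timeout: number;
  noCoverage: number;
}

function mutationScores(c: MutationCounts): { total: number; covered: number } {
  const detected = c.killed + c.timeout;        // timeouts count as detected
  const undetected = c.survived + c.noCoverage; // both are missed by the tests
  // Score over all valid mutants.
  const total = (detected / (detected + undetected)) * 100;
  // Score over mutants in covered code only (no-coverage mutants excluded).
  const covered = (detected / (detected + c.survived)) * 100;
  return { total, covered };
}

const scores = mutationScores({ killed: 170, survived: 30, timeout: 5, noCoverage: 10 });
const totalLabel = scores.total.toFixed(1);     // "81.4"
const coveredLabel = scores.covered.toFixed(1); // "85.4"
```

For these counts, a figure of about 85% corresponds to the score based on covered code, while the score over all mutants is about 81%.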

Common Surviving Mutations

Boundary mutations: < changed to <=

typescript

// Mutation survives if tests don't check boundary
if (value < 10) { ... }  // Changed to: value <= 10

Arithmetic mutations: + changed to -

typescript

// Mutation survives if result isn't precisely checked
return a + b;  // Changed to: a - b

Boolean mutations: && changed to ||

typescript

// Mutation survives if both conditions aren't tested
if (a && b) { ... }  // Changed to: a || b
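
To see which test inputs kill the boolean mutation, it helps to enumerate the truth table. The sketch below collects the input pairs where `a && b` and `a || b` disagree; only those inputs can distinguish the original from the mutant.

```typescript
// Enumerate every input pair and keep those where the two operators differ.
const bools = [false, true];
const distinguishing: Array<[boolean, boolean]> = [];

for (const a of bools) {
  for (const b of bools) {
    if ((a && b) !== (a || b)) {
      distinguishing.push([a, b]);
    }
  }
}
// distinguishing is [[false, true], [true, false]]
```

A suite that only tests both-true and both-false leaves the mutant alive; at least one mixed case is needed.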

CI Integration

yaml

# GitHub Actions
- name: Run mutation tests
  run: npx stryker run

- name: Upload Stryker report
  uses: actions/upload-artifact@v4
  with:
    name: stryker-report
    path: reports/mutation/

Coverage Metrics

Types of Coverage

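Statement: Percentage of statements executed
Branch: Percentage of decision outcomes taken (each side of an if, switch case, or ternary)
Function: Percentage of functions called at least once
Line: Percentage of source lines with at least one executed statement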

Coverage Thresholds

javascript

// Recommended minimums
{
  statements: 80,
  branches: 75,
  functions: 80,
  lines: 80
}
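
How such minimums are enforced depends on the coverage tool. As a tool-independent sketch, the check amounts to comparing each reported percentage with its minimum; the type names, `failingMetrics`, and the report values below are all hypothetical.

```typescript
type Metric = "statements" | "branches" | "functions" | "lines";
type CoverageSummary = Record<Metric, number>; // percentages, 0 to 100

// The recommended minimums listed above.
const minimums: CoverageSummary = { statements: 80, branches: 75, functions: 80, lines: 80 };

// Returns the metrics that fall below their minimum.
function failingMetrics(actual: CoverageSummary, required: CoverageSummary): Metric[] {
  const metrics = Object.keys(required) as Metric[];
  return metrics.filter((m) => actual[m] < required[m]);
}

// Hypothetical report values, invented for the example.
const report: CoverageSummary = { statements: 91, branches: 68, functions: 85, lines: 90 };
const failures = failingMetrics(report, minimums); // ["branches"]
```

In Jest, the same minimums are usually placed under `coverageThreshold.global` in the configuration, so the run fails when any metric drops below its value.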

Coverage Pitfalls

High coverage ≠ good tests
Can miss logical errors
Doesn't test interactions
Can incentivize bad tests
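
A small sketch of the first pitfall: the hypothetical `absolute` function below contains a bug, yet a test that merely executes both branches reaches full statement and branch coverage without ever failing.

```typescript
// Buggy implementation: the negative branch should return -x.
function absolute(x: number): number {
  if (x < 0) {
    return x; // bug: sign is not flipped
  }
  return x;
}

// "Test" that executes both branches but asserts nothing.
// It yields full coverage of absolute() and can never fail.
function coverageOnlyTest(): boolean {
  absolute(-5);
  absolute(5);
  return true;
}

// Test that checks the results: it exposes the bug.
function assertingTest(): boolean {
  return absolute(-5) === 5 && absolute(5) === 5;
}

const coverageOnlyPasses = coverageOnlyTest(); // true, despite the bug
const assertingPasses = assertingTest();       // false, bug detected
```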

Mutation Testing

Concept

Mutation testing modifies code to check if tests catch the changes:

Tests should fail when code is mutated
Surviving mutants indicate weak tests
Higher kill rate = better tests

Types of Mutations

Arithmetic operators (+, -, *, /)
Comparison operators (<, >, ==)
Boolean operators (&&, ||, !)
Return values
Constants

Test Pyramid

Unit Tests (Base)

Fast execution
Isolated components
High coverage
Many tests

Integration Tests (Middle)

Component interactions
Database/API calls
Moderate coverage
Medium quantity

E2E Tests (Top)

Full user flows
Real browser
Critical paths only
Few tests

Analysis Workflow

When analyzing test quality:

1. Gather Metrics
   - Run coverage report
   - Count test/code ratio
   - Measure test execution time
2. Identify Patterns
   - Check test structure
   - Look for anti-patterns
   - Assess naming quality
3. Evaluate Reliability
   - Check for flaky indicators
   - Assess isolation
   - Review async handling
4. Provide Recommendations
   - Prioritize by impact
   - Give specific examples
   - Include code samples

Examples

Example 1: Coverage Analysis

When analyzing coverage:

Run coverage tool
Identify uncovered lines
Prioritize critical paths
Suggest test cases

Example 2: Reliability Audit

When auditing for reliability:

Search for timing patterns
Check shared state usage
Review async assertions
Identify order dependencies
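
The first audit step, searching for timing patterns, can be sketched as a simple text scan. The pattern list and the `scanTestSource` function are assumptions made for illustration; they are heuristics, not an exhaustive or authoritative rule set, and every hit still needs manual review.

```typescript
// Heuristic patterns that often correlate with flaky tests (illustrative only).
const flakyPatterns: Array<{ name: string; pattern: RegExp }> = [
  { name: "fixed delay", pattern: /\b(setTimeout|sleep|delay)\s*\(/ },
  { name: "real clock", pattern: /\bDate\.now\s*\(|\bnew Date\s*\(\s*\)/ },
  { name: "randomness", pattern: /\bMath\.random\s*\(/ },
];

interface Finding {
  line: number; // 1-based line number in the scanned source
  name: string; // which heuristic matched
}

// Scans test source text line by line and reports every heuristic match.
function scanTestSource(source: string): Finding[] {
  const findings: Finding[] = [];
  source.split("\n").forEach((text, index) => {
    for (const { name, pattern } of flakyPatterns) {
      if (pattern.test(text)) {
        findings.push({ line: index + 1, name });
      }
    }
  });
  return findings;
}

const sample = [
  "test('shows banner', async () => {",
  "  await delay(100);",
  "  expect(Date.now()).toBeGreaterThan(0);",
  "});",
].join("\n");

const findings = scanTestSource(sample);
// findings: [{ line: 2, name: "fixed delay" }, { line: 3, name: "real clock" }]
```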

Important Notes

Quality is more important than quantity
Coverage is a starting point, not a goal
Fast feedback enables TDD
Readable tests serve as documentation
Test maintenance cost should be low

Maintainer

C0ntr0lledCha0s Core maintainer

Source details

Full Name: C0ntr0lledCha0s/claude-code-plugin-automations
Branch: main
Path in repo: testing-expert/skills/analyzing-test-quality
License: MIT License
