Agent skill

dummy-dataset

Generate realistic dummy datasets for testing with customizable columns, constraints, and output formats (CSV, JSON, SQL, Python script). Use when creating test data, building mock datasets, or generating sample data for development and demos.

View SKILL.md on GitHub Repository

Stars 9,823

Forks 1,082

Install this agent skill to your Project

npx add-skill https://github.com/phuryn/pm-skills/tree/main/pm-execution/skills/dummy-dataset

SKILL.md

Dummy Dataset Generation

Generate realistic dummy datasets for testing with customizable columns, constraints, and output formats (CSV, JSON, SQL, Python script). Creates executable scripts or direct data files for immediate use.

Use when: Creating test data, generating sample datasets, building realistic mock data for development, or populating test environments.

Arguments:

$PRODUCT: The product or system name
$DATASET_TYPE: Type of data (e.g., customer feedback, transactions, user profiles)
$ROWS: Number of rows to generate (default: 100)
$COLUMNS: Specific columns or fields to include
$FORMAT: Output format (CSV, JSON, SQL, Python script)
$CONSTRAINTS: Additional constraints or business rules

Step-by-Step Process

Identify dataset type - Understand the data domain
Define column specifications - Names, data types, and value ranges
Determine row count - How many sample records needed
Select output format - CSV, JSON, SQL INSERT, or Python script
Apply realistic patterns - Ensure data looks authentic and valid
Add business constraints - Respect business logic and relationships
Generate or script data - Create executable output
Validate output - Ensure data quality and completeness

Template: Python Script Output

python

import csv
import json
from datetime import datetime, timedelta
import random

# Configuration
ROWS = $ROWS
FILENAME = "$DATASET_TYPE.csv"

# Column definitions with realistic value generators
columns = {
    "id": "auto-increment",
    "name": "first_last_name",
    "email": "email",
    "created_at": "timestamp",
    # Add more columns...
}

def generate_dataset():
    """Generate realistic dummy dataset"""
    data = []
    for i in range(1, ROWS + 1):
        record = {
            "id": f"U{i:06d}",
            # Generate values based on column definitions
        }
        data.append(record)
    return data

def save_as_csv(data, filename):
    """Save dataset as CSV"""
    with open(filename, 'w', newline='') as f:
        writer = csv.DictWriter(f, fieldnames=data[0].keys())
        writer.writeheader()
        writer.writerows(data)

if __name__ == "__main__":
    dataset = generate_dataset()
    save_as_csv(dataset, FILENAME)
    print(f"Generated {len(dataset)} records in {FILENAME}")

Example Dataset Specification

Dataset Type: Customer Feedback

Columns:

feedback_id (auto-increment, U001, U002...)
customer_name (realistic names)
email (valid email format)
feedback_date (dates last 90 days)
rating (1-5 stars)
category (Bug, Feature Request, Complaint, Praise)
text (realistic feedback)
product (electronics, clothing, home)

Constraints:

Ratings skewed: 40% 5-star, 30% 4-star, 20% 3-star, 10% 1-2 star
Bug category only with ratings 1-3
Feature requests only with ratings 3-5
Email domains realistic (gmail, yahoo, company.com)

Output Deliverables

Ready-to-execute Python script OR direct data file
CSV file with proper headers and formatting
JSON file with valid structure and types
SQL INSERT statements for database population
Data validation and constraint compliance
Realistic, business-appropriate values
Documentation of data generation logic
Quick-start instructions for using the dataset

Output Formats

CSV: Flat tabular format, easy to import into spreadsheets and databases

JSON: Nested structure, ideal for APIs and NoSQL databases

SQL: INSERT statements, directly executable on relational databases

Python Script: Executable generator for custom or large datasets

Maintainer

phuryn Core maintainer

Source details

Full Name: phuryn/pm-skills
Branch: main
Path in repo: pm-execution/skills/dummy-dataset
License: MIT License
Topics: agent-skills agentic-skills claude-code-marketplace claude-code-plugins product-management agent-skill-repository claude-cowork-plugin

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

phuryn/pm-skills

ab-test-analysis

Analyze A/B test results with statistical significance, sample size validation, confidence intervals, and ship/extend/stop recommendations. Use when evaluating experiment results, checking if a test reached significance, interpreting split test data, or deciding whether to ship a variant.

9,823 1,082

Explore

phuryn/pm-skills

cohort-analysis

Perform cohort analysis on user engagement data — retention curves, feature adoption trends, and segment-level insights. Use when analyzing user retention by cohort, studying feature adoption over time, investigating churn patterns, or identifying engagement trends.

9,823 1,082

Explore

phuryn/pm-skills

sql-queries

Generate SQL queries from natural language descriptions. Supports BigQuery, PostgreSQL, MySQL, and other dialects. Reads database schemas from uploaded diagrams or documentation. Use when writing SQL, building data reports, exploring databases, or translating business questions into queries.

9,823 1,082

Explore

phuryn/pm-skills

swot-analysis

Perform a detailed SWOT analysis — strengths, weaknesses, opportunities, and threats with actionable recommendations. Use when doing strategic assessment, competitive analysis, or evaluating a product or business position.

9,823 1,082

Explore

phuryn/pm-skills

product-strategy

Create a comprehensive product strategy using the 9-section Product Strategy Canvas — vision, segments, costs, value propositions, trade-offs, metrics, growth, capabilities, and defensibility. Use when building a product strategy, creating a strategic plan, or defining product direction.

9,823 1,082

Explore

phuryn/pm-skills

pricing-strategy

Analyze and design pricing strategies including pricing models, competitive pricing analysis, willingness-to-pay estimation, and price elasticity. Use when setting prices, evaluating pricing models, preparing for a pricing change, or comparing freemium vs paid approaches.

9,823 1,082

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Dummy Dataset Generation

Step-by-Step Process

Template: Python Script Output

Example Dataset Specification

Output Deliverables

Output Formats

Recommended Agent Skills

ab-test-analysis

cohort-analysis

sql-queries

swot-analysis

product-strategy

pricing-strategy