Agent skill

engineering-features-for-machine-learning

Execute create, select, and transform features to improve machine learning model performance. Handles feature scaling, encoding, and importance analysis. Use when asked to "engineer features" or "select features". Trigger with relevant phrases based on skill purpose.

Stars 1,415
Forks 109

Install this agent skill to your Project

npx add-skill https://github.com/foryourhealth111-pixel/Vibe-Skills/tree/main/bundled/skills/engineering-features-for-machine-learning

SKILL.md

Feature Engineering Toolkit

This skill provides automated assistance for feature engineering toolkit tasks.

Overview

This skill provides automated assistance for feature engineering toolkit tasks. This skill enables Claude to leverage the feature-engineering-toolkit plugin to enhance machine learning models. It automates the process of creating new features, selecting the most relevant ones, and transforming existing features to better suit the model's needs. Use this skill to improve the accuracy, efficiency, and interpretability of machine learning models.

How It Works

  1. Analyzing Requirements: Claude analyzes the user's request and identifies the specific feature engineering task required.
  2. Generating Code: Claude generates Python code using the feature-engineering-toolkit plugin to perform the requested task. This includes data validation and error handling.
  3. Executing Task: The generated code is executed, creating, selecting, or transforming features as requested.
  4. Providing Insights: Claude provides performance metrics and insights related to the feature engineering process, such as the importance of newly created features or the impact of transformations on model performance.

When to Use This Skill

This skill activates when you need to:

  • Create new features from existing data to improve model accuracy.
  • Select the most relevant features from a dataset to reduce model complexity and improve efficiency.
  • Transform features to better suit the assumptions of a machine learning model (e.g., scaling, normalization, encoding).

Examples

Example 1: Improving Model Accuracy

User request: "Create new features from the existing 'age' and 'income' columns to improve the accuracy of a customer churn prediction model."

The skill will:

  1. Generate code to create interaction terms between 'age' and 'income' (e.g., age * income, age / income).
  2. Execute the code and evaluate the impact of the new features on model performance.

Example 2: Reducing Model Complexity

User request: "Select the top 10 most important features from the dataset to reduce the complexity of a fraud detection model."

The skill will:

  1. Generate code to calculate feature importance using a suitable method (e.g., Random Forest, SelectKBest).
  2. Execute the code and select the top 10 features based on their importance scores.

Best Practices

  • Data Validation: Always validate the input data to ensure it is clean and consistent before performing feature engineering.
  • Feature Scaling: Scale numerical features to prevent features with larger ranges from dominating the model.
  • Encoding Categorical Features: Encode categorical features appropriately (e.g., one-hot encoding, label encoding) to make them suitable for machine learning models.

Integration

This skill integrates with the feature-engineering-toolkit plugin, providing a seamless way to create, select, and transform features for machine learning models. It can be used in conjunction with other Claude Code skills to build complete machine learning pipelines.

Prerequisites

  • Appropriate file access permissions
  • Required dependencies installed

Instructions

  1. Invoke this skill when the trigger conditions are met
  2. Provide necessary context and parameters
  3. Review the generated output
  4. Apply modifications as needed

Output

The skill produces structured output relevant to the task.

Error Handling

  • Invalid input: Prompts for correction
  • Missing dependencies: Lists required components
  • Permission errors: Suggests remediation steps

Resources

  • Project documentation
  • Related skills and commands

Expand your agent's capabilities with these related and highly-rated skills.

foryourhealth111-pixel/Vibe-Skills

pufferlib

This skill should be used when working with reinforcement learning tasks including high-performance RL training, custom environment development, vectorized parallel simulation, multi-agent systems, or integration with existing RL environments (Gymnasium, PettingZoo, Atari, Procgen, etc.). Use this skill for implementing PPO training, creating PufferEnv environments, optimizing RL performance, or developing policies with CNNs/LSTMs.

1,415 109
Explore
foryourhealth111-pixel/Vibe-Skills

fluidsim

Framework for computational fluid dynamics simulations using Python. Use when running fluid dynamics simulations including Navier-Stokes equations (2D/3D), shallow water equations, stratified flows, or when analyzing turbulence, vortex dynamics, or geophysical flows. Provides pseudospectral methods with FFT, HPC support, and comprehensive output analysis.

1,415 109
Explore
foryourhealth111-pixel/Vibe-Skills

metabolomics-workbench-database

Access NIH Metabolomics Workbench via REST API (4,200+ studies). Query metabolites, RefMet nomenclature, MS/NMR data, m/z searches, study metadata, for metabolomics and biomarker discovery.

1,415 109
Explore
foryourhealth111-pixel/Vibe-Skills

build-error-resolver

Compatibility alias for build-specific error resolution. Use this when VCO routes to build-error-resolver but the upstream agent is unavailable in the current runtime.

1,415 109
Explore
foryourhealth111-pixel/Vibe-Skills

geniml

This skill should be used when working with genomic interval data (BED files) for machine learning tasks. Use for training region embeddings (Region2Vec, BEDspace), single-cell ATAC-seq analysis (scEmbed), building consensus peaks (universes), or any ML-based analysis of genomic regions. Applies to BED file collections, scATAC-seq data, chromatin accessibility datasets, and region-based genomic feature learning.

1,415 109
Explore
foryourhealth111-pixel/Vibe-Skills

zinc-database

Access ZINC (230M+ purchasable compounds). Search by ZINC ID/SMILES, similarity searches, 3D-ready structures for docking, analog discovery, for virtual screening and drug discovery.

1,415 109
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results