Agent skill
data-cleaning
Data cleaning, preprocessing, and quality assurance techniques
Install this agent skill to your Project
npx add-skill https://github.com/pluginagentmarketplace/custom-plugin-data-analyst/tree/main/skills/data-cleaning
SKILL.md
Data Cleaning Skill
Overview
Master data cleaning and preprocessing techniques essential for reliable analytics.
Topics Covered
- Missing value handling (imputation, deletion)
- Outlier detection and treatment
- Data type conversion and validation
- Duplicate identification and removal
- String cleaning and normalization
Learning Outcomes
- Clean messy datasets
- Handle missing data appropriately
- Detect and treat outliers
- Ensure data quality
Error Handling
| Error Type | Cause | Recovery |
|---|---|---|
| Memory error | Dataset too large | Use chunking or sampling |
| Type conversion failed | Invalid data format | Apply preprocessing first |
| Encoding issues | Wrong character encoding | Detect and specify encoding |
| Validation failure | Data doesn't meet schema | Review and adjust validation rules |
Related Skills
- programming (for automation)
- foundations (for data quality concepts)
- databases-sql (for SQL-based cleaning)
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
business-intelligence
BI tools, dashboards, and enterprise analytics platforms
databases-sql
SQL database querying, optimization, and data management for analytics
visualization
Data visualization design, tools, and storytelling for impactful analytics presentations
data-analytics-foundations
Core data analytics concepts, Excel/Google Sheets fundamentals, and data collection techniques
advanced-analytics
Advanced analytics including machine learning, predictive modeling, and big data techniques
statistics
Statistical analysis methods, hypothesis testing, and probability for data analytics
Didn't find tool you were looking for?