Agent skill

ml-engineer

Machine learning engineer expert for PyTorch, scikit-learn, model evaluation, and MLOps

View SKILL.md on GitHub Repository

Stars 16,576

Forks 2,073

Install this agent skill to your Project

npx add-skill https://github.com/RightNow-AI/openfang/tree/main/crates/openfang-skills/bundled/ml-engineer

SKILL.md

Machine Learning Engineer

A machine learning practitioner with deep expertise in model development, training infrastructure, evaluation methodology, and production deployment. This skill provides guidance for building ML systems end-to-end using PyTorch for deep learning, scikit-learn for classical ML, and MLOps practices that ensure models are reproducible, monitored, and maintainable in production environments.

Key Principles

Start with a strong baseline using simple models and solid feature engineering before reaching for complex architectures; a well-tuned logistic regression often outperforms a poorly configured neural network
Evaluate models with metrics that align with business objectives, not just accuracy; precision, recall, F1, and AUC-ROC each tell different stories about model behavior on imbalanced data
Version everything: datasets, code, hyperparameters, and model artifacts; reproducibility is the foundation of trustworthy ML systems
Design training pipelines to be idempotent and resumable; checkpointing, deterministic seeding, and configuration files enable reliable experimentation
Monitor models in production for data drift, prediction drift, and performance degradation; a model that was accurate at deployment time can silently degrade as input distributions shift

Techniques

Structure PyTorch training with a clear pattern: define nn.Module subclass, configure DataLoader with proper num_workers and pin_memory, implement the training loop with optimizer.zero_grad(), loss.backward(), and optimizer.step()
Build scikit-learn pipelines with Pipeline and ColumnTransformer to chain preprocessing (scaling, encoding, imputation) with model fitting, ensuring that all transformations are fit on training data only
Perform hyperparameter tuning with GridSearchCV or RandomizedSearchCV using cross-validation; for expensive models, use Optuna or Bayesian optimization to search efficiently
Compute evaluation metrics on held-out test sets: classification_report for precision/recall/F1 per class, roc_auc_score for ranking quality, and confusion_matrix for error analysis
Engineer features systematically: log transforms for skewed distributions, interaction terms for feature combinations, target encoding for high-cardinality categoricals, and temporal features for time-series data
Track experiments with MLflow or Weights and Biases: log hyperparameters, metrics, artifacts, and model versions for every run

Common Patterns

Train-Validate-Test Split: Use stratified splitting (80/10/10) to maintain class distribution; never touch the test set during development, only for final evaluation
Learning Rate Schedule: Use warmup followed by cosine annealing or reduce-on-plateau for training stability; sudden large learning rates cause divergence in deep networks
Ensemble Methods: Combine predictions from diverse models (gradient boosting + neural network + linear model) to improve robustness and reduce variance
Model Registry: Promote models through stages (staging, production, archived) in MLflow Model Registry with approval gates and automated validation checks

Pitfalls to Avoid

Do not evaluate on the training set or leak test data into preprocessing; this produces overly optimistic metrics that do not reflect real-world performance
Do not train models without understanding the data: check for class imbalance, missing values, duplicates, and label noise before building any model
Do not deploy models without a rollback plan; maintain the previous model version in production so you can revert quickly if the new model underperforms
Do not treat feature engineering as a one-time task; as the domain evolves and new data sources become available, revisit and expand the feature set regularly

Maintainer

RightNow-AI Core maintainer

Source details

Full Name: RightNow-AI/openfang
Branch: main
Path in repo: crates/openfang-skills/bundled/ml-engineer
License: Apache License 2.0
Topics: mcp ai-agents llm openclaw rust agent-framework open-source operating-system

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

RightNow-AI/openfang

predictor-hand-skill

Expert knowledge for AI forecasting — superforecasting principles, signal taxonomy, confidence calibration, reasoning chains, and accuracy tracking

16,576 2,073

Explore

RightNow-AI/openfang

researcher-hand-skill

Expert knowledge for AI deep research — methodology, source evaluation, search optimization, cross-referencing, synthesis, and citation formats

16,576 2,073

Explore

RightNow-AI/openfang

lead-hand-skill

Expert knowledge for AI lead generation — web research, enrichment, scoring, deduplication, and report generation

16,576 2,073

Explore

RightNow-AI/openfang

collector-hand-skill

Expert knowledge for AI intelligence collection — OSINT methodology, entity extraction, knowledge graphs, change detection, and sentiment analysis

16,576 2,073

Explore

RightNow-AI/openfang

infisical-sync-skill

Expert knowledge for the Infisical Sync Hand — Infisical API reference, vault operations, error patterns, security guidance

16,576 2,073

Explore

RightNow-AI/openfang

browser-automation

Playwright-based browser automation patterns for autonomous web interaction

16,576 2,073

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Machine Learning Engineer

Key Principles

Techniques

Common Patterns

Pitfalls to Avoid

Recommended Agent Skills

predictor-hand-skill

researcher-hand-skill

lead-hand-skill

collector-hand-skill

infisical-sync-skill

browser-automation