Agent skill

funsloth-train

Generate Unsloth training notebooks and scripts. Use when the user wants to create a training notebook, configure fine-tuning parameters, or set up SFT/DPO/GRPO training.

View SKILL.md on GitHub Repository

Stars 5

Forks 0

Install this agent skill to your Project

npx add-skill https://github.com/chrisvoncsefalvay/funsloth/tree/main/skills/funsloth-train

SKILL.md

Unsloth Training Notebook Generator

Generate training notebooks for fine-tuning with Unsloth.

Quick Start

Copy and customize the template notebook:

notebooks/sft_template.ipynb

Or use a training script directly:

bash

python scripts/train_sft.py  # Supervised fine-tuning
python scripts/train_dpo.py  # Direct preference optimization
python scripts/train_grpo.py # Group relative policy optimization

Configuration Modes

Ask the user which mode they prefer:

Sensible defaults - Production-ready notebook with recommended settings
Guide me - Walk through each option with explanations
Leave it empty - Notebook with ipywidgets for runtime configuration

Mode 1: Sensible Defaults

Use these production-ready defaults:

Parameter	Default	Reasoning
Model	`unsloth/llama-3.1-8b-unsloth-bnb-4bit`	Good balance
Max seq length	2048	Covers most use cases
Load in 4-bit	True	70% VRAM reduction
LoRA rank	16	Good trade-off
Batch size	2	Works on 8GB+ VRAM
Gradient accumulation	4	Effective batch of 8
Learning rate	2e-4	Unsloth recommended
Epochs	1	Often sufficient

Mode 2: Guide Me

Ask questions in order. See MODEL_SELECTION.md for model options and TRAINING_METHODS.md for technique details.

Key Questions

Model family: Llama, Qwen, Gemma, Phi, Mistral, DeepSeek?
Model size: Based on VRAM (see HARDWARE_GUIDE.md)
Training technique: SFT, DPO, GRPO, ORPO, KTO?
Quantization: 4-bit (recommended), 8-bit, 16-bit?
LoRA rank: 8, 16, 32, 64?
Sequence length: 512, 1024, 2048, 4096?
Batch size: 1, 2, 4, 8?
Learning rate: 1e-5, 5e-5, 2e-4, 5e-4?
Training duration: 1 epoch, 3 epochs, or specific steps?

Mode 3: ipywidgets

Generate a notebook with interactive configuration widgets. Users select options at runtime.

Notebook Structure

Generate notebooks with these sections:

Title and Overview - What the notebook does
Installation - Install Unsloth
Imports and GPU Check - Verify environment
Configuration - All tunable parameters
Load Model - FastLanguageModel.from_pretrained()
Apply LoRA - FastLanguageModel.get_peft_model()
Load Dataset - Format-appropriate loading
Training - SFTTrainer/DPOTrainer/GRPOTrainer
Save Model - LoRA adapter + merged model
Test Inference - Quick verification

After Generation

Ask where to run training:

Hugging Face Jobs - Cloud GPUs (funsloth-hfjobs)
RunPod - Flexible GPU rentals (funsloth-runpod)
Local - Your own GPU (funsloth-local)

Context to Pass

yaml

notebook_path: "./training_notebook.ipynb"
model_name: "unsloth/llama-3.1-8b-unsloth-bnb-4bit"
dataset_name: "mlabonne/FineTome-100k"
technique: "SFT"
lora_rank: 16
max_seq_length: 2048
batch_size: 2
learning_rate: 2e-4
num_epochs: 1

Bundled Resources

notebooks/sft_template.ipynb - Ready-to-use SFT template
scripts/train_sft.py - SFT script template
scripts/train_dpo.py - DPO script template
scripts/train_grpo.py - GRPO script template
references/MODEL_SELECTION.md - Model recommendations
references/HARDWARE_GUIDE.md - VRAM requirements
references/TRAINING_METHODS.md - SFT vs DPO vs GRPO

Maintainer

chrisvoncsefalvay Core maintainer

Source details

Full Name: chrisvoncsefalvay/funsloth
Branch: main
Path in repo: skills/funsloth-train

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

chrisvoncsefalvay/funsloth

funsloth-upload

Generate comprehensive model cards and upload fine-tuned models to Hugging Face Hub with professional documentation

5 0

Explore

chrisvoncsefalvay/funsloth

funsloth-hfjobs

Training manager for Hugging Face Jobs - launch fine-tuning on HF cloud GPUs with optional WandB monitoring

5 0

Explore

chrisvoncsefalvay/funsloth

funsloth-local

Training manager for local GPU training - validate CUDA, manage GPU selection, monitor progress, handle checkpoints

5 0

Explore

chrisvoncsefalvay/funsloth

funsloth-check

Validate datasets for Unsloth fine-tuning. Use when the user wants to check a dataset, analyze tokens, calculate Chinchilla optimality, or prepare data for training.

5 0

Explore

chrisvoncsefalvay/funsloth

funsloth-runpod

Training manager for RunPod GPU instances - configure pods, launch training, monitor progress, retrieve checkpoints

5 0

Explore

mattpocock/skills

edit-article

Edit and improve articles by restructuring sections, improving clarity, and tightening prose. Use when user wants to edit, revise, or improve an article draft.

111,310 9,758

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Unsloth Training Notebook Generator

Quick Start

Configuration Modes

Mode 1: Sensible Defaults

Mode 2: Guide Me

Key Questions

Mode 3: ipywidgets

Notebook Structure

After Generation

Context to Pass

Bundled Resources

Recommended Agent Skills

funsloth-upload

funsloth-hfjobs

funsloth-local

funsloth-check

funsloth-runpod

edit-article