Agent skill

llm

Large Language Model development, training, fine-tuning, and deployment best practices.

View SKILL.md on GitHub Repository

Stars 48

Forks 4

Install this agent skill to your Project

npx add-skill https://github.com/Mindrally/skills/tree/main/llm

SKILL.md

LLM Development

You are an expert in Large Language Model development, training, and fine-tuning.

Core Principles

Understand transformer architectures deeply
Implement efficient training strategies
Apply proper evaluation methodologies
Optimize for inference performance

Model Architecture

Attention Mechanisms

Implement self-attention correctly
Use multi-head attention patterns
Apply positional encodings appropriately
Understand context length limitations

Tokenization

Choose appropriate tokenizers (BPE, SentencePiece)
Handle special tokens properly
Manage vocabulary size trade-offs
Implement proper padding and truncation

Fine-Tuning Techniques

Parameter-Efficient Methods

Use LoRA for efficient adaptation
Apply P-tuning for prompt optimization
Implement adapter layers
Use prefix tuning when appropriate

Full Fine-Tuning

Manage learning rates carefully
Implement proper warmup schedules
Use gradient checkpointing for memory
Apply regularization appropriately

Training Infrastructure

Distributed Training

Use DeepSpeed for large models
Implement FSDP for memory efficiency
Handle gradient synchronization
Manage checkpoint saving/loading

Memory Optimization

Apply gradient accumulation
Use mixed precision training
Implement activation checkpointing
Optimize batch sizes dynamically

Evaluation

Use appropriate metrics (perplexity, BLEU, etc.)
Implement proper benchmark evaluation
Handle evaluation at scale
Track metrics during training

Deployment

Optimize models for inference (quantization, pruning)
Implement efficient serving solutions
Handle batched inference
Monitor production performance

Project Structure

Organize configs in YAML files
Separate data processing from training
Implement experiment tracking
Version control models and configs

Maintainer

Mindrally Core maintainer

Source details

Full Name: Mindrally/skills
Branch: main
Path in repo: llm
License: Apache License 2.0

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

Mindrally/skills

pixi-js

Expert guidance for Pixi.js game development with TypeScript, focusing on high-performance web and mobile games

48 4

Explore

Mindrally/skills

fastify-typescript

Guidelines for building high-performance APIs with Fastify and TypeScript, covering validation, Prisma integration, and testing best practices

48 4

Explore

Mindrally/skills

deep-learning-pytorch

Expert guidance for deep learning, transformers, diffusion models, and LLM development with PyTorch, Transformers, Diffusers, and Gradio.

48 4

Explore

Mindrally/skills

python-testing

Expert in Python testing with pytest and test-driven development

48 4

Explore

Mindrally/skills

svelte

Expert in Svelte and SvelteKit development with modern patterns and SSR

48 4

Explore

Mindrally/skills

deep-learning

Comprehensive deep learning guidelines for neural network development, training, and optimization.

48 4

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

LLM Development

Core Principles

Model Architecture

Attention Mechanisms

Tokenization

Fine-Tuning Techniques

Parameter-Efficient Methods

Full Fine-Tuning

Training Infrastructure

Distributed Training

Memory Optimization

Evaluation

Deployment

Project Structure

Recommended Agent Skills

pixi-js

fastify-typescript

deep-learning-pytorch

python-testing

svelte

deep-learning