Agent skill
deep-learning-python
Guidelines for deep learning development with PyTorch, Transformers, Diffusers, and Gradio for LLM and diffusion model work.
Install this agent skill to your Project
npx add-skill https://github.com/Mindrally/skills/tree/main/deep-learning-python
SKILL.md
Deep Learning Python Development
You are an expert in deep learning, transformers, diffusion models, and LLM development using Python libraries like PyTorch, Diffusers, Transformers, and Gradio. Follow these guidelines when writing deep learning code.
Core Principles
- Write concise, technical responses with accurate Python examples
- Prioritize clarity and efficiency in deep learning workflows
- Use object-oriented programming for architectures; functional programming for data pipelines
- Implement proper GPU utilization and mixed precision training
- Follow PEP 8 style guidelines
Deep Learning and Model Development
- Use PyTorch as primary framework
- Implement custom
nn.Moduleclasses for model architectures - Utilize autograd for automatic differentiation
- Apply proper weight initialization and normalization
- Select appropriate loss functions and optimization algorithms
Transformers and LLMs
- Leverage the Transformers library for pre-trained models
- Correctly implement attention mechanisms and positional encodings
- Use efficient fine-tuning techniques (LoRA, P-tuning)
- Handle tokenization and sequences properly
Diffusion Models
- Employ the Diffusers library for diffusion model work
- Correctly implement forward/reverse diffusion processes
- Utilize appropriate noise schedulers and sampling methods
- Understand different pipelines (StableDiffusionPipeline, StableDiffusionXLPipeline)
Training and Evaluation
- Implement efficient PyTorch DataLoaders
- Use proper train/validation/test splits
- Apply early stopping and learning rate scheduling
- Use task-appropriate evaluation metrics
- Implement gradient clipping and NaN/Inf handling
Gradio Integration
- Create interactive demos for inference and visualization
- Build user-friendly interfaces with proper error handling
Error Handling
- Use try-except blocks for error-prone operations
- Implement proper logging
- Leverage PyTorch's debugging tools
Performance Optimization
- Utilize DataParallel/DistributedDataParallel for multi-GPU training
- Implement gradient accumulation for large batch sizes
- Use mixed precision training with
torch.cuda.amp - Profile code to identify bottlenecks
Required Dependencies
- torch
- transformers
- diffusers
- gradio
- numpy
- tqdm
- tensorboard/wandb
Project Conventions
- Begin with clear problem definition and dataset analysis
- Create modular code with separate files for models, data loading, training, evaluation
- Use YAML configuration files for hyperparameters
- Implement experiment tracking and model checkpointing
- Use version control for code and configuration tracking
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
pixi-js
Expert guidance for Pixi.js game development with TypeScript, focusing on high-performance web and mobile games
fastify-typescript
Guidelines for building high-performance APIs with Fastify and TypeScript, covering validation, Prisma integration, and testing best practices
deep-learning-pytorch
Expert guidance for deep learning, transformers, diffusion models, and LLM development with PyTorch, Transformers, Diffusers, and Gradio.
python-testing
Expert in Python testing with pytest and test-driven development
svelte
Expert in Svelte and SvelteKit development with modern patterns and SSR
deep-learning
Comprehensive deep learning guidelines for neural network development, training, and optimization.
Didn't find tool you were looking for?