Agent skills
Skills you can use with AI coding agents, indexed from public GitHub repositories.
-
grpo-rl-training
Expert guidance for GRPO/RL fine-tuning with TRL for reasoning and task-specific model training
Orchestra-Research/AI-Research-SKILLs 6,644
-
slime-rl-training
Provides guidance for LLM post-training with RL using slime, a Megatron+SGLang framework. Use when training GLM models, implementing custom data generation workflows, or needing tight Megatron-LM integration for RL scaling.
Orchestra-Research/AI-Research-SKILLs 6,644
-
openrlhf-training
High-performance RLHF framework with Ray+vLLM acceleration. Use for PPO, GRPO, RLOO, DPO training of large models (7B-70B+). Built on Ray, vLLM, ZeRO-3. 2× faster than DeepSpeedChat with distributed architecture and GPU resource sharing.
Orchestra-Research/AI-Research-SKILLs 6,644
-
torchforge-rl-training
Provides guidance for PyTorch-native agentic RL using torchforge, Meta's library separating infra from algorithms. Use when you want clean RL abstractions, easy algorithm experimentation, or scalable training with Monarch and TorchTitan.
Orchestra-Research/AI-Research-SKILLs 6,644
-
simpo-training
Simple Preference Optimization for LLM alignment. Reference-free alternative to DPO with better performance (+6.4 points on AlpacaEval 2.0). No reference model needed, more efficient than DPO. Use for preference alignment when want simpler, faster training than DPO/PPO.
Orchestra-Research/AI-Research-SKILLs 6,644
-
verl-rl-training
Provides guidance for training LLMs with reinforcement learning using verl (Volcano Engine RL). Use when implementing RLHF, GRPO, PPO, or other RL algorithms for LLM post-training at scale with flexible infrastructure backends.
Orchestra-Research/AI-Research-SKILLs 6,644
-
fine-tuning-with-trl
Fine-tune LLMs using reinforcement learning with TRL - SFT for instruction tuning, DPO for preference alignment, PPO/GRPO for reward optimization, and reward model training. Use when need RLHF, align model with preferences, or train from human feedback. Works with HuggingFace Transformers.
Orchestra-Research/AI-Research-SKILLs 6,644
-
rust-wasm
Master WebAssembly with Rust - wasm-pack, wasm-bindgen, and browser integration
pluginagentmarketplace/custom-plugin-rust 1
-
rust-docker
Master Docker containerization for Rust applications
pluginagentmarketplace/custom-plugin-rust 1
-
async-programming
Master Rust async/await with Tokio
pluginagentmarketplace/custom-plugin-rust 1
-
rust-macros
Master Rust macros - declarative and procedural macros
pluginagentmarketplace/custom-plugin-rust 1
-
trait-generics
Master Rust traits, generics, and type system
pluginagentmarketplace/custom-plugin-rust 1
-
ownership-borrowing
Master Rust's ownership, borrowing, and lifetime system
pluginagentmarketplace/custom-plugin-rust 1
-
rust-testing
Master Rust testing - unit tests, integration tests, mocking, and TDD
pluginagentmarketplace/custom-plugin-rust 1
-
error-handling
Master Rust's Result, Option, and error handling patterns
pluginagentmarketplace/custom-plugin-rust 1
-
rust-cli
Build professional CLI applications with clap and TUI
pluginagentmarketplace/custom-plugin-rust 1
-
cargo-ecosystem
Master Cargo, testing, and Rust development tools
pluginagentmarketplace/custom-plugin-rust 1
-
rust-performance
Master Rust performance - profiling, benchmarking, and optimization
pluginagentmarketplace/custom-plugin-rust 1
-
rust-concurrency
Master Rust concurrency - threads, channels, and parallel iterators
pluginagentmarketplace/custom-plugin-rust 1
-
how-to-publish-paks
A practical guide for creating and publishing high-quality Agent Skills (paks) to the Paks registry. Covers SKILL.md format, frontmatter structure, content writing best practices, validation, versioning, and publishing workflow.
stakpak/community-paks 3
-
beads-issue-tracker
Guide for using Beads (bd), a dependency-aware issue tracker for AI agents. Issues chained together like beads.
stakpak/community-paks 3
-
vllm-deployment
Deploy vLLM for high-performance LLM inference. Covers Docker CPU/GPU deployments and cloud VM provisioning with OpenAI-compatible API endpoints.
stakpak/community-paks 3
-
dockerization
Official Stakpak application containerization standard operating procedure, a step-by-step guidline to properly dockerize applications. This is a rule book curated by the Stakpak Team.
stakpak/community-paks 3
-
simple-deployment-on-vm
How to do simple but secure deployments using virtual machines on different cloud providers
stakpak/community-paks 3