Sponsored by

Find leads on Reddit on auto pilot

Agent skills

Skills you can use with AI coding agents, indexed from public GitHub repositories.

Latest Most Popular

grpo-rl-training Expert guidance for GRPO/RL fine-tuning with TRL for reasoning and task-specific model training
Orchestra-Research/AI-Research-SKILLs 6,644
slime-rl-training Provides guidance for LLM post-training with RL using slime, a Megatron+SGLang framework. Use when training GLM models, implementing custom data generation workflows, or needing tight Megatron-LM integration for RL scaling.
Orchestra-Research/AI-Research-SKILLs 6,644
openrlhf-training High-performance RLHF framework with Ray+vLLM acceleration. Use for PPO, GRPO, RLOO, DPO training of large models (7B-70B+). Built on Ray, vLLM, ZeRO-3. 2× faster than DeepSpeedChat with distributed architecture and GPU resource sharing.
Orchestra-Research/AI-Research-SKILLs 6,644
torchforge-rl-training Provides guidance for PyTorch-native agentic RL using torchforge, Meta's library separating infra from algorithms. Use when you want clean RL abstractions, easy algorithm experimentation, or scalable training with Monarch and TorchTitan.
Orchestra-Research/AI-Research-SKILLs 6,644
simpo-training Simple Preference Optimization for LLM alignment. Reference-free alternative to DPO with better performance (+6.4 points on AlpacaEval 2.0). No reference model needed, more efficient than DPO. Use for preference alignment when want simpler, faster training than DPO/PPO.
Orchestra-Research/AI-Research-SKILLs 6,644
verl-rl-training Provides guidance for training LLMs with reinforcement learning using verl (Volcano Engine RL). Use when implementing RLHF, GRPO, PPO, or other RL algorithms for LLM post-training at scale with flexible infrastructure backends.
Orchestra-Research/AI-Research-SKILLs 6,644
fine-tuning-with-trl Fine-tune LLMs using reinforcement learning with TRL - SFT for instruction tuning, DPO for preference alignment, PPO/GRPO for reward optimization, and reward model training. Use when need RLHF, align model with preferences, or train from human feedback. Works with HuggingFace Transformers.
Orchestra-Research/AI-Research-SKILLs 6,644
rust-wasm Master WebAssembly with Rust - wasm-pack, wasm-bindgen, and browser integration
pluginagentmarketplace/custom-plugin-rust 1
rust-docker Master Docker containerization for Rust applications
pluginagentmarketplace/custom-plugin-rust 1
async-programming Master Rust async/await with Tokio
pluginagentmarketplace/custom-plugin-rust 1
rust-macros Master Rust macros - declarative and procedural macros
pluginagentmarketplace/custom-plugin-rust 1
trait-generics Master Rust traits, generics, and type system
pluginagentmarketplace/custom-plugin-rust 1
ownership-borrowing Master Rust's ownership, borrowing, and lifetime system
pluginagentmarketplace/custom-plugin-rust 1
rust-testing Master Rust testing - unit tests, integration tests, mocking, and TDD
pluginagentmarketplace/custom-plugin-rust 1
error-handling Master Rust's Result, Option, and error handling patterns
pluginagentmarketplace/custom-plugin-rust 1
rust-cli Build professional CLI applications with clap and TUI
pluginagentmarketplace/custom-plugin-rust 1
cargo-ecosystem Master Cargo, testing, and Rust development tools
pluginagentmarketplace/custom-plugin-rust 1
rust-performance Master Rust performance - profiling, benchmarking, and optimization
pluginagentmarketplace/custom-plugin-rust 1
rust-concurrency Master Rust concurrency - threads, channels, and parallel iterators
pluginagentmarketplace/custom-plugin-rust 1
how-to-publish-paks A practical guide for creating and publishing high-quality Agent Skills (paks) to the Paks registry. Covers SKILL.md format, frontmatter structure, content writing best practices, validation, versioning, and publishing workflow.
stakpak/community-paks 3
beads-issue-tracker Guide for using Beads (bd), a dependency-aware issue tracker for AI agents. Issues chained together like beads.
stakpak/community-paks 3
vllm-deployment Deploy vLLM for high-performance LLM inference. Covers Docker CPU/GPU deployments and cloud VM provisioning with OpenAI-compatible API endpoints.
stakpak/community-paks 3
dockerization Official Stakpak application containerization standard operating procedure, a step-by-step guidline to properly dockerize applications. This is a rule book curated by the Stakpak Team.
stakpak/community-paks 3
simple-deployment-on-vm How to do simple but secure deployments using virtual machines on different cloud providers
stakpak/community-paks 3