Topic: ai-automation
2,090 skills in this topic.
-
macro-systems
Expert skill for designing and implementing macro systems including hygienic macros, procedural macros, and macro expansion. Supports pattern-based macros, quasi-quotation, and hygiene management.
a5c-ai/babysitter 514
-
tensorrt-optimization
NVIDIA TensorRT model optimization and deployment. Convert models to TensorRT engines, configure optimization profiles and precision modes, apply INT8 calibration, analyze kernel fusion, generate custom plugins, and profile inference performance.
a5c-ai/babysitter 514
-
stencil-convolution
Expert skill for optimized stencil and convolution pattern implementations on GPU. Design tiled stencil algorithms with halos, implement 2D/3D convolution kernels, optimize boundary condition handling, apply temporal blocking techniques, generate separable filter implementations, and profile stencil memory bandwidth.
a5c-ai/babysitter 514
-
parallel-patterns
GPU parallel algorithm design patterns and implementations. Implement parallel reduction, scan/prefix sum, histogram, parallel sort algorithms, stream compaction, and work-efficient patterns optimized for specific GPU architectures.
a5c-ai/babysitter 514
-
opencl-runtime
Cross-vendor OpenCL runtime management and kernel development. Query platforms/devices, generate portable OpenCL C kernel code, handle vendor-specific extensions, manage contexts and command queues, compile and cache programs.
a5c-ai/babysitter 514
-
nvenc-nvdec
NVIDIA hardware video encoding/decoding integration. Configure NVENC encoding parameters, set up NVDEC decoding pipelines, handle codec configurations, integrate with CUDA for pre/post processing, and manage video memory surfaces.
a5c-ai/babysitter 514
-
nsight-profiler
Expert skill for NVIDIA Nsight Systems and Nsight Compute profiling tools. Configure profiling sessions, analyze kernel reports, and interpret occupancy metrics, roofline model data, memory bandwidth bottlenecks, and warp execution efficiency.
a5c-ai/babysitter 514
-
nccl-communication
NVIDIA Collective Communications Library integration for multi-GPU operations. Initialize NCCL communicators, execute collective operations, configure communication topologies, profile collective performance, and support RCCL for AMD compatibility.
a5c-ai/babysitter 514
-
hip-rocm
AMD HIP and ROCm ecosystem for cross-platform GPU development. Execute hipify conversion tools, generate HIP-compatible kernel code, handle CUDA/HIP API differences, configure ROCm toolchain, and profile with rocprof.
a5c-ai/babysitter 514
-
gpu-memory-analysis
Specialized skill for GPU memory hierarchy analysis and optimization. Analyze memory access patterns, detect bank conflicts, optimize cache utilization, profile global memory bandwidth, and generate optimized memory access code patterns.
a5c-ai/babysitter 514
-
gpu-benchmarking
Expert skill for automated GPU performance benchmarking and regression detection. Design micro-benchmarks, measure kernel execution time with CUDA events, calculate achieved vs theoretical performance, generate comparison reports, detect regressions in CI/CD, and profile power/thermal characteristics.
a5c-ai/babysitter 514
-
cutlass-triton
High-performance kernel template libraries and DSLs. Generate CUTLASS GEMM configurations, implement Triton kernel definitions, configure epilogue operations, tune tile sizes and warp arrangements, and benchmark against cuBLAS.
a5c-ai/babysitter 514
-
cuda-toolkit
Deep integration with the NVIDIA CUDA Toolkit for kernel development, compilation, and debugging. Execute nvcc compilation with optimization flag analysis, generate and validate CUDA kernel code, analyze PTX/SASS assembly output, and configure execution parameters.
a5c-ai/babysitter 514
-
cuda-graphs
Expert skill for CUDA Graph capture and optimization for reduced launch overhead. Capture CUDA operations into graphs, instantiate and execute graph instances, update graph node parameters, profile graph vs stream execution, design graph-friendly kernel patterns, and optimize launch latency for inference.
a5c-ai/babysitter 514
-
cuda-debugging
Expert skill for GPU debugging using CUDA-GDB and NVIDIA Compute Sanitizer. Detect memory errors, race conditions, and uninitialized memory access; validate atomic operations; analyze kernel synchronization issues; and generate debugging reports with recommendations.
a5c-ai/babysitter 514
-
cublas-cudnn
Expert integration with NVIDIA GPU-accelerated math libraries. Configure cuBLAS tensor core operations, generate cuBLAS GEMM calls, integrate cuDNN layers, handle algorithm selection, and support mixed-precision operations.
a5c-ai/babysitter 514
-
zbrush
ZBrush skill for decimation and normal map baking.
a5c-ai/babysitter 514
-
unreal-development
Unreal Engine integration skill for C++/Blueprint development, actor lifecycle management, plugin development, and editor automation. Enables LLMs to interact with Unreal Editor through MCP servers for level manipulation, Blueprint generation, and automated workflows.
a5c-ai/babysitter 514
-
unreal-metasounds
Unreal Engine MetaSounds skill for procedural audio, real-time synthesis, and advanced audio graphs.
a5c-ai/babysitter 514
-
unreal-materials
Unreal Engine Material Editor skill for PBR workflows, material instances, shader complexity, and material functions.
a5c-ai/babysitter 514
-
unreal-lumen
Unreal Engine Lumen skill for global illumination, reflections, and dynamic lighting.
a5c-ai/babysitter 514
-
unreal-gamesframework
Unreal Engine Gameplay Ability System (GAS) skill for attributes, abilities, and gameplay effects.
a5c-ai/babysitter 514
-
unreal-control-rig
Unreal Engine Control Rig skill for procedural animation, IK chains, and runtime rig modifications.
a5c-ai/babysitter 514
-
unreal-chaos
Unreal Engine Chaos physics skill for destruction, vehicle simulation, and cloth physics.
a5c-ai/babysitter 514