Agent skill
Edge Deployment Skill
ML model optimization and deployment on robot edge devices (Jetson, embedded)
Stars
163
Forks
31
Install this agent skill to your Project
npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/skills/other/edge-deployment
SKILL.md
Edge Deployment Skill
Overview
Expert skill for optimizing and deploying machine learning models on robot edge devices including NVIDIA Jetson and embedded systems.
Capabilities
- Configure TensorRT optimization for NVIDIA Jetson
- Set up ONNX model conversion and optimization
- Implement INT8 and FP16 quantization
- Configure DeepStream for video analytics
- Set up CUDA graph optimization
- Implement model pruning and distillation
- Configure DLA (Deep Learning Accelerator) deployment
- Set up multi-stream inference
- Implement ROS2 inference nodes
- Profile and benchmark on target hardware
Target Processes
- nn-model-optimization.js
- object-detection-pipeline.js
- rl-robot-control.js
- field-testing-validation.js
Dependencies
- TensorRT
- ONNX Runtime
- NVIDIA Jetson SDK
- DeepStream
Usage Context
This skill is invoked when processes require deploying ML models on edge devices with optimized inference performance.
Output Artifacts
- TensorRT engine files
- ONNX optimized models
- Quantization configurations
- DeepStream pipeline configs
- Inference benchmark reports
- ROS2 inference node implementations
Didn't find tool you were looking for?