Agent skill
huggingface-hub
Hugging Face Hub CLI (hf) — search, download, and upload models and datasets, manage repos, query datasets with SQL, deploy inference endpoints, manage Spaces and buckets.
Install this agent skill to your Project
npx add-skill https://github.com/NousResearch/hermes-agent/tree/main/skills/mlops/huggingface-hub
SKILL.md
Hugging Face CLI (hf) Reference Guide
The hf command is the modern command-line interface for interacting with the Hugging Face Hub, providing tools to manage repositories, models, datasets, and Spaces.
IMPORTANT: The
hfcommand replaces the now deprecatedhuggingface-clicommand.
Quick Start
- Installation:
curl -LsSf https://hf.co/cli/install.sh | bash -s - Help: Use
hf --helpto view all available functions and real-world examples. - Authentication: Recommended via
HF_TOKENenvironment variable or the--tokenflag.
Core Commands
General Operations
hf download REPO_ID: Download files from the Hub.hf upload REPO_ID: Upload files/folders (recommended for single-commit).hf upload-large-folder REPO_ID LOCAL_PATH: Recommended for resumable uploads of large directories.hf sync: Sync files between a local directory and a bucket.hf env/hf version: View environment and version details.
Authentication (hf auth)
login/logout: Manage sessions using tokens from huggingface.co/settings/tokens.list/switch: Manage and toggle between multiple stored access tokens.whoami: Identify the currently logged-in account.
Repository Management (hf repos)
create/delete: Create or permanently remove repositories.duplicate: Clone a model, dataset, or Space to a new ID.move: Transfer a repository between namespaces.branch/tag: Manage Git-like references.delete-files: Remove specific files using patterns.
Specialized Hub Interactions
Datasets & Models
- Datasets:
hf datasets list,info, andparquet(list parquet URLs). - SQL Queries:
hf datasets sql SQL— Execute raw SQL via DuckDB against dataset parquet URLs. - Models:
hf models listandinfo. - Papers:
hf papers list— View daily papers.
Discussions & Pull Requests (hf discussions)
- Manage the lifecycle of Hub contributions:
list,create,info,comment,close,reopen, andrename. diff: View changes in a PR.merge: Finalize pull requests.
Infrastructure & Compute
- Endpoints: Deploy and manage Inference Endpoints (
deploy,pause,resume,scale-to-zero,catalog). - Jobs: Run compute tasks on HF infrastructure. Includes
hf jobs uvfor running Python scripts with inline dependencies andstatsfor resource monitoring. - Spaces: Manage interactive apps. Includes
dev-modeandhot-reloadfor Python files without full restarts.
Storage & Automation
- Buckets: Full S3-like bucket management (
create,cp,mv,rm,sync). - Cache: Manage local storage with
list,prune(remove detached revisions), andverify(checksum checks). - Webhooks: Automate workflows by managing Hub webhooks (
create,watch,enable/disable). - Collections: Organize Hub items into collections (
add-item,update,list).
Advanced Usage & Tips
Global Flags
--format json: Produces machine-readable output for automation.-q/--quiet: Limits output to IDs only.
Extensions & Skills
- Extensions: Extend CLI functionality via GitHub repositories using
hf extensions install REPO_ID. - Skills: Manage AI assistant skills with
hf skills add.
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
agentmail
Give the agent its own dedicated email inbox via AgentMail. Send, receive, and manage email autonomously using agent-owned email addresses (e.g. hermes-agent@agentmail.to).
base
Query Base (Ethereum L2) blockchain data with USD pricing — wallet balances, token info, transaction details, gas analysis, contract inspection, whale detection, and live network stats. Uses Base RPC + CoinGecko. No API key required.
solana
Query Solana blockchain data with USD pricing — wallet balances, token portfolios with values, transaction details, NFTs, whale detection, and live network stats. Uses Solana RPC + CoinGecko. No API key required.
one-three-one-rule
Structured decision-making framework for technical proposals and trade-off analysis. When the user faces a choice between multiple approaches (architecture decisions, tool selection, refactoring strategies, migration paths), this skill produces a 1-3-1 format: one clear problem statement, three distinct options with pros/cons, and one concrete recommendation with definition of done and implementation plan. Use when the user asks for a "1-3-1", says "give me options", or needs help choosing between competing approaches.
fastmcp
Build, test, inspect, install, and deploy MCP servers with FastMCP in Python. Use when creating a new MCP server, wrapping an API or database as MCP tools, exposing resources or prompts, or preparing a FastMCP server for Claude Code, Cursor, or HTTP deployment.
qdrant-vector-search
High-performance vector similarity search engine for RAG and semantic search. Use when building production RAG systems requiring fast nearest neighbor search, hybrid search with filtering, or scalable vector storage with Rust-powered performance.
Didn't find tool you were looking for?