Agent skill
rag-chunking-strategy
Document chunking with multiple strategies including semantic, recursive, and fixed-size chunking
Install this agent skill to your Project
npx add-skill https://github.com/a5c-ai/babysitter/tree/main/library/specializations/ai-agents-conversational/skills/rag-chunking-strategy
SKILL.md
RAG Chunking Strategy Skill
Capabilities
- Implement multiple document chunking strategies
- Configure semantic chunking based on content boundaries
- Set up recursive character text splitting
- Design fixed-size chunking with overlap
- Implement document-aware chunking (markdown, code, etc.)
- Optimize chunk sizes for retrieval quality
Target Processes
- rag-pipeline-implementation
- chunking-strategy-design
Implementation Details
Chunking Strategies
- RecursiveCharacterTextSplitter: Hierarchical splitting with separators
- SemanticChunker: Embedding-based semantic boundaries
- TokenTextSplitter: Token-aware splitting
- MarkdownHeaderTextSplitter: Structure-aware markdown splitting
- CodeSplitter: Language-aware code chunking
Configuration Options
- Chunk size (characters or tokens)
- Chunk overlap percentage
- Separator hierarchy
- Embedding model for semantic chunking
- Document type detection
Best Practices
- Match chunk size to embedding model limits
- Use appropriate overlap for context preservation
- Test retrieval quality with different strategies
- Consider document structure in strategy selection
Dependencies
- langchain-text-splitters
- sentence-transformers (for semantic chunking)
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
gsd-tools
Central utility skill for GSD operations. Provides config parsing, slug generation, timestamps, path operations, and orchestrates calls to other specialized skills. Acts as the unified entry point that the original gsd-tools.cjs provided via its lib/ modules (commands, config, core, init).
model-profile-resolution
Resolve model profile (quality/balanced/budget) at orchestration start and map agents to specific models. Enables cost/quality tradeoffs by selecting appropriate AI models for each agent role.
verification-suite
Plan structure validation, phase completeness checks, reference integrity verification, and artifact existence confirmation. Provides the structured verification layer ensuring GSD artifacts are well-formed and complete.
state-management
STATE.md reading, writing, and field-level updates. Provides cross-session state persistence via .planning/STATE.md with structured fields for current task, completed phases, blockers, decisions, and quick tasks.
git-integration
Git commit patterns, formats, and conventions for GSD methodology. Provides atomic commits per task, structured commit messages, planning file commits, branch management, and milestone tag operations.
frontmatter-parsing
YAML frontmatter parsing and manipulation for .planning/ documents. Provides read, write, update, query, and validation operations on frontmatter blocks in GSD markdown artifacts.
Didn't find tool you were looking for?