Agent skill
checkpoint-mode
Pause for review every N tasks - selective autonomy pattern
Install this agent skill to your Project
npx add-skill https://github.com/asklokesh/loki-mode/tree/main/agent-skills/checkpoint-mode
SKILL.md
Checkpoint Mode Skill
Overview
Implements selective autonomy - shorter bursts of autonomous work with feedback loops.
Research Source: "Use Agents or Be Left Behind" by Tim Dettmers
Philosophy
"More than 90% of code should be written by agents, but iteratively design systems with shorter bursts of autonomy with feedback loops." — Tim Dettmers, 2026
Problem with Perpetual Autonomy:
- Can waste resources on wrong approach
- No opportunity for course correction
- User feels disconnected from progress
Solution:
- Pause after N tasks or M minutes
- Generate summary of accomplishments
- Wait for explicit approval to continue
When to Use
Use Checkpoint Mode For:
- Novel projects where approach may need adjustment
- High-cost operations (expensive API calls, cloud resources)
- Learning phases where user wants to guide direction
- Regulated environments requiring audit trail
Use Perpetual Mode For:
- Well-defined PRDs with clear requirements
- Established patterns with high confidence
- Overnight builds where interruption isn't desired
- CI/CD pipelines requiring full automation
Configuration
# Enable checkpoint mode
LOKI_AUTONOMY_MODE=checkpoint
# Pause frequency
LOKI_CHECKPOINT_FREQUENCY=10 # tasks
LOKI_CHECKPOINT_TIME=60 # minutes
# Always pause after these phases
LOKI_CHECKPOINT_PHASES="architecture,deployment"
Checkpoint Workflow
[Work on 10 tasks] → [Pause] → [Generate Summary] → [Wait for Approval]
↓
[User reviews and approves]
↓
[Resume work]
On Checkpoint:
-
Generate Summary
markdown# Checkpoint Summary ## Tasks Completed (10) - Implemented POST /api/todos endpoint - Added unit tests (95% coverage) - Set up CI/CD pipeline - ... ## Next Actions - Deploy to staging - Run integration tests - Security audit ## Resources Used - 15 minutes elapsed - 3 Haiku agents, 2 Sonnet agents - Estimated cost: $0.45 -
Create Approval Signal
bash# System writes: .loki/signals/CHECKPOINT_SUMMARY_2026-01-14-10-30.md # User reviews and creates: .loki/signals/CHECKPOINT_APPROVED -
Wait for Approval
- Orchestrator pauses execution
- Monitors for approval signal
- Resumes when signal detected
Agent Instructions (Orchestrator)
When LOKI_AUTONOMY_MODE=checkpoint:
completed_tasks = load_completed_tasks()
tasks_since_checkpoint = completed_tasks - last_checkpoint_count
if tasks_since_checkpoint >= CHECKPOINT_FREQUENCY:
# Pause and generate summary
summary = generate_checkpoint_summary()
write_signal("CHECKPOINT_SUMMARY", summary)
# Wait for approval
log_info("Waiting for checkpoint approval...")
while not signal_exists("CHECKPOINT_APPROVED"):
sleep(5)
# Resume work
remove_signal("CHECKPOINT_APPROVED")
log_info("Checkpoint approved. Resuming work...")
last_checkpoint_count = completed_tasks
Comparison with Other Modes
| Mode | Best For | Approval Frequency | Use Case |
|---|---|---|---|
| Perpetual | Overnight builds | Never | Fully automated CI/CD |
| Checkpoint | Novel projects | Every 10 tasks | Learning new domain |
| Supervised | Critical systems | Every task | Production deployments |
Metrics
Track checkpoint effectiveness:
{
"checkpoint_id": "cp-2026-01-14-001",
"tasks_completed": 10,
"time_elapsed_minutes": 15,
"approval_time_seconds": 45,
"course_corrections": 0,
"user_satisfaction": "approved_without_changes"
}
Storage: .loki/metrics/checkpoint-mode/
References
references/production-patterns.md- HN production insights- timdettmers.com/use-agents-or-be-left-behind
Version: 1.0.0
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
prompt-optimization
Applies prompt repetition to improve accuracy for non-reasoning LLMs
loki-mode
Launch Loki Mode autonomous SDLC agent. Handles PRD-to-deployment with minimal human intervention. Invoke for multi-phase development tasks, bug fixing campaigns, or full product builds.
verl-rl-training
Provides guidance for training LLMs with reinforcement learning using verl (Volcano Engine RL). Use when implementing RLHF, GRPO, PPO, or other RL algorithms for LLM post-training at scale with flexible infrastructure backends.
openrlhf-training
High-performance RLHF framework with Ray+vLLM acceleration. Use for PPO, GRPO, RLOO, DPO training of large models (7B-70B+). Built on Ray, vLLM, ZeRO-3. 2× faster than DeepSpeedChat with distributed architecture and GPU resource sharing.
gguf-quantization
GGUF format and llama.cpp quantization for efficient CPU/GPU inference. Use when deploying models on consumer hardware, Apple Silicon, or when needing flexible quantization from 2-8 bit without GPU requirements.
Claude Code Guide
Master guide for using Claude Code effectively. Includes configuration templates, prompting strategies "Thinking" keywords, debugging techniques, and best practices for interacting with the agent.
Didn't find tool you were looking for?