Agent skill

ralph-log-events

Event schemas for ml-ralph log.jsonl. Use when logging any event.

View SKILL.md on GitHub Repository

Stars 33

Forks 3

Install this agent skill to your Project

npx add-skill https://github.com/pentoai/ml-ralph/tree/main/.claude/skills/ralph-log-events

SKILL.md

Ralph Log Events

Append events to .ml-ralph/log.jsonl. One JSON per line. Never edit, only append.

Event Types

Phase Transitions

jsonl

{"ts":"...","type":"phase","phase":"UNDERSTAND","summary":"Exploring data and researching prior work"}
{"ts":"...","type":"phase","phase":"STRATEGIZE","summary":"Evaluating 4 potential approaches"}
{"ts":"...","type":"phase","phase":"EXECUTE","summary":"Running minimal test of H-003"}
{"ts":"...","type":"phase","phase":"REFLECT","summary":"Analyzing unexpected results"}

Thinking & Mental Models

jsonl

{"ts":"...","type":"thinking","subject":"Why is precision low?","thoughts":"Looking at false positives, I notice...","conclusion":"Model confuses X with Y because..."}
{"ts":"...","type":"mental_model","domain":"feature importance","belief":"Temporal features dominate","confidence":"high","evidence":["EDA analysis","H-002 ablation"]}

Research

jsonl

{"ts":"...","type":"research","source":"Kaggle 1st place solution","url":"https://...","key_insights":["Feature X crucial","Avoided Y because..."],"relevance":"directly applicable"}
{"ts":"...","type":"research","source":"arXiv paper","url":"https://...","key_insights":["This loss handles imbalance better"],"relevance":"inspirational"}

Path Analysis & Hypotheses

jsonl

{"ts":"...","type":"path_analysis","paths":[{"id":"A","description":"Gradient boosting","expected":"AUC ~0.78","learning_value":"high"},{"id":"B","description":"Neural net","expected":"AUC ~0.80","learning_value":"medium"}],"chosen":"A","rationale":"Best learning-to-effort ratio"}
{"ts":"...","type":"hypothesis","id":"H-001","hypothesis":"Time features improve AUC by 5%","expected":"0.75 → 0.80","rationale":"EDA showed temporal patterns"}

Experiments & Results

jsonl

{"ts":"...","type":"experiment","hypothesis_id":"H-001","metrics":{"auc":0.77,"precision":0.65},"observations":"Improvement on recent data but degradation on older samples","surprises":"Temporal features hurt pre-2020 data"}

Learnings & Decisions

jsonl

{"ts":"...","type":"learning","insight":"Distribution shifted in 2020 - models need explicit handling","source":"H-001 analysis"}
{"ts":"...","type":"decision","hypothesis_id":"H-001","action":"iterate","reason":"Partial success - need time-aware normalization","next_step":"Research distribution shift handling"}

Strategic Retreat

jsonl

{"ts":"...","type":"strategic_retreat","trigger":"3 experiments with <1% improvement","action":"Returning to UNDERSTAND","focus":"Re-examine errors and search literature"}

PRD & Kanban Updates

jsonl

{"ts":"...","type":"prd_updated","field":"success_criteria","change":"Added temporal stability requirement","reason":"Discovered distribution shift"}
{"ts":"...","type":"kanban_updated","changes":"Completed T-007, moved T-008 to focus, added T-012","reason":"Shift understood, ready to implement fix"}

Status

jsonl

{"ts":"...","type":"status","status":"paused","reason":"Need user input on constraint change"}
{"ts":"...","type":"status","status":"blocked","reason":"Hit apparent ceiling after 3 approaches","approaches_tried":["Autoencoder","IsolationForest","OCSVM"],"user_decision_needed":"Continue exploring or accept 0.90 AUC?"}
{"ts":"...","type":"status","status":"complete","reason":"All criteria met","evidence":{"auc":0.92,"threshold":0.90}}

Devil's Advocate

jsonl

{"ts":"...","type":"devils_advocate","conclusion":"AUC ceiling at 0.90 is fundamental","attacks":[{"question":"What would prove me wrong?","answer":"...","addressed":false}],"survived":false,"next_action":"continue_working"}

Limitation Claims

jsonl

{"ts":"...","type":"limitation_claim","claim":"73% of attacks are structurally indistinguishable","devils_advocate_id":"DA-003","validation_attempts":[{"method":"...","result":"..."}],"confidence":"high","user_approved":false}

See RALPH.md for when and how to use these events.

Event Reference

Type	Required Fields
`phase`	phase, summary
`thinking`	subject, thoughts, conclusion
`mental_model`	domain, belief, confidence, evidence
`research`	source, key_insights, relevance
`path_analysis`	paths, chosen, rationale
`hypothesis`	id, hypothesis, expected, rationale
`experiment`	hypothesis_id, metrics, observations
`learning`	insight, source
`decision`	hypothesis_id, action, reason
`strategic_retreat`	trigger, action, focus
`prd_updated`	field, change, reason
`kanban_updated`	changes, reason
`status`	status, reason (+ evidence if complete, + approaches_tried if blocked)
`devils_advocate`	conclusion, attacks, survived, next_action
`limitation_claim`	claim, devils_advocate_id, validation_attempts, confidence, user_approved

Querying (jq)

bash

# Current mental models
jq -s '[.[] | select(.type=="mental_model")]' .ml-ralph/log.jsonl

# All research
jq -s '[.[] | select(.type=="research")]' .ml-ralph/log.jsonl

# All hypotheses
jq -s '[.[] | select(.type=="hypothesis")]' .ml-ralph/log.jsonl

# All learnings
jq -s '[.[] | select(.type=="learning")] | .[].insight' .ml-ralph/log.jsonl

# Experiment results for H-001
jq -s '[.[] | select(.type=="experiment" and .hypothesis_id=="H-001")]' .ml-ralph/log.jsonl

# Strategic retreats (signs of being stuck)
jq -s '[.[] | select(.type=="strategic_retreat")]' .ml-ralph/log.jsonl

# Kanban evolution
jq -s '[.[] | select(.type=="kanban_updated")]' .ml-ralph/log.jsonl

Maintainer

pentoai Core maintainer

Source details

Full Name: pentoai/ml-ralph
Branch: main
Path in repo: .claude/skills/ralph-log-events
Topics: claude-code agents python codex machine-learning aicoding ralph wandb

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

pentoai/ml-ralph

ralph-file-schemas

Schema reference for ml-ralph state files (prd.json, kanban.json). Use when reading or writing these files.

33 3

Explore

pentoai/ml-ralph

ml-ralph

REQUIRED first step for ANY ML task. When user describes an ML problem, goal, experiment, or model improvement — ALWAYS invoke this skill BEFORE exploring code or planning. Triggers: ml-ralph, create prd, ml project, kaggle, implement model, improve model, train model, better model, new approach, experiment.

33 3

Explore

pentoai/ml-ralph

ml-ralph

Create ML project PRDs. Triggers: ml-ralph, create prd, ml project, kaggle.

33 3

Explore

davila7/claude-code-templates

verl-rl-training

Provides guidance for training LLMs with reinforcement learning using verl (Volcano Engine RL). Use when implementing RLHF, GRPO, PPO, or other RL algorithms for LLM post-training at scale with flexible infrastructure backends.

23,776 2,298

Explore

davila7/claude-code-templates

openrlhf-training

High-performance RLHF framework with Ray+vLLM acceleration. Use for PPO, GRPO, RLOO, DPO training of large models (7B-70B+). Built on Ray, vLLM, ZeRO-3. 2× faster than DeepSpeedChat with distributed architecture and GPU resource sharing.

23,776 2,298

Explore

davila7/claude-code-templates

gguf-quantization

GGUF format and llama.cpp quantization for efficient CPU/GPU inference. Use when deploying models on consumer hardware, Apple Silicon, or when needing flexible quantization from 2-8 bit without GPU requirements.

23,776 2,298

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Ralph Log Events

Event Types

Phase Transitions

Thinking & Mental Models

Research

Path Analysis & Hypotheses

Experiments & Results

Learnings & Decisions

Strategic Retreat

PRD & Kanban Updates

Status

Devil's Advocate

Limitation Claims

Event Reference

Querying (jq)

Recommended Agent Skills

ralph-file-schemas

ml-ralph

ml-ralph

verl-rl-training

openrlhf-training

gguf-quantization