video-delivery-coach

Analyzes your video recordings before you publish. Evaluates voice (pace, pitch, volume), facial expressions (emotions, eye contact, smiles), and content (filler words, structure), and helps you improve your Hinglish YouTube delivery over time.

Install this agent skill in your project

npx add-skill https://github.com/drshailesh88/integrated_content_OS/tree/main/skills/cardiology/video-delivery-coach

SKILL.md

Video Delivery Coach

Get better at video, video by video. This skill analyzes your recordings before you publish, identifying areas for improvement.

WHAT IT DOES

Analysis Type	Metrics	Tool Used
Voice	Speech rate (WPM), pitch variation, volume consistency	Librosa + Faster-Whisper
Facial	Emotion timeline, eye contact frequency, smile frequency	OpenCV + DeepFace + MediaPipe
Content	Transcription, filler words, structure	Faster-Whisper + Claude
Overall	5-dimension score (1-5 each, max 25)	Claude analysis

SCORING RUBRIC

Dimension	Score 1	Score 5
Content & Organization	Disorganized, unclear	Logical, well-structured
Delivery & Vocal Quality	Monotone, many fillers	Clear, varied, engaging
Body Language & Eye Contact	No eye contact, stiff	Direct gaze, natural movement
Audience Engagement	Boring, loses attention	Captivating, maintains interest
Language & Clarity	Grammar issues, unclear	Clear, impactful, professional

Total Score Interpretation:

5-9: Needs significant improvement
10-14: Developing skills
15-18: Competent speaker
19-22: Proficient speaker
23-25: Outstanding speaker
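
The banding above can be written as a small lookup. This is a minimal sketch, not the skill's actual code: the function name `interpret_total` and the dictionary layout are hypothetical, and only the band boundaries and labels come from the list above.

```python
# Hypothetical helper: maps five 1-5 dimension scores to the bands listed above.
BANDS = [
    (5, 9, "Needs significant improvement"),
    (10, 14, "Developing skills"),
    (15, 18, "Competent speaker"),
    (19, 22, "Proficient speaker"),
    (23, 25, "Outstanding speaker"),
]


def interpret_total(scores):
    """Return (total, band label) for a dict of five dimension scores."""
    if len(scores) != 5:
        raise ValueError("expected exactly five dimension scores")
    for name, value in scores.items():
        if not 1 <= value <= 5:
            raise ValueError(f"{name}: score must be between 1 and 5")
    total = sum(scores.values())
    for low, high, label in BANDS:
        if low <= total <= high:
            return total, label
    raise AssertionError("unreachable: valid totals always fall in 5-25")


example = {
    "Content & Organization": 4,
    "Delivery & Vocal Quality": 3,
    "Body Language & Eye Contact": 4,
    "Audience Engagement": 4,
    "Language & Clarity": 3,
}
print(interpret_total(example))  # (18, 'Competent speaker')
```

Because each dimension is scored from 1 to 5, the lowest possible total is 5, which is why the first band starts there.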

TRIGGERS

Use this skill when you say:

"Analyze my video recording"
"How was my delivery?"
"Review my video before upload"
"Check my presentation"
"Coach my speaking"

USAGE

In Claude Code (Recommended)

"Analyze my video at /path/to/recording.mp4"

"Coach my delivery on the latest YouTube recording"

"What can I improve in this video?"

CLI Mode

bash

# Basic analysis
python scripts/analyze_video.py --video "/path/to/video.mp4"

# Full analysis with all features
python scripts/analyze_video.py --video "/path/to/video.mp4" --full

# Voice only (faster)
python scripts/analyze_video.py --video "/path/to/video.mp4" --voice-only

# Save report
python scripts/analyze_video.py --video "/path/to/video.mp4" --output ~/reports/

OUTPUT FORMAT

Quick Summary

┌────────────────────────────────────────┐
│     VIDEO DELIVERY ANALYSIS            │
│     recording_2025_01_15.mp4           │
├────────────────────────────────────────┤
│  OVERALL SCORE: 18/25 (Competent)      │
│                                        │
│  Content & Organization:    4/5        │
│  Delivery & Vocal Quality:  3/5        │
│  Body Language & Eye Contact: 4/5      │
│  Audience Engagement:       4/5        │
│  Language & Clarity:        3/5        │
└────────────────────────────────────────┘

Detailed Report

markdown

# Video Delivery Analysis

**File:** recording_2025_01_15.mp4
**Duration:** 12:34
**Date:** 2025-01-15

---

## VOICE ANALYSIS

| Metric | Value | Target | Assessment |
|--------|-------|--------|------------|
| Speech Rate | 145 WPM | 120-160 | ✅ Good |
| Pitch Variation | 42.3 Hz | >30 Hz | ✅ Engaging |
| Volume Consistency | 0.08 | <0.15 | ✅ Steady |

**Filler Words Detected:**
- "um" - 8 times
- "you know" - 5 times
- "basically" - 3 times

**Recommendation:** Reduce "um" usage. Try pausing instead.

---

## FACIAL ANALYSIS

| Metric | Value | Assessment |
|--------|-------|------------|
| Eye Contact Frequency | 72% | ✅ Good |
| Smile Frequency | 35% | ⚠️ Could increase |

**Emotion Timeline:**
- 0:00-2:00: Neutral (intro)
- 2:00-8:00: Happy/Engaged (main content)
- 8:00-10:00: Serious (data presentation)
- 10:00-12:34: Happy (conclusion)

**Recommendation:** More smiles during technical sections.

---

## CONTENT ANALYSIS

**Strengths:**
- Clear opening hook
- Good use of clinical examples
- Strong call-to-action

**Areas for Improvement:**
- Could use more pauses after key points
- Consider adding more Hinglish transitions
- Section on side effects could be more structured

---

## OVERALL FEEDBACK

**What You Did Well:**
1. Excellent pace - not too fast, not too slow
2. Good eye contact with camera
3. Clinical examples were relatable

**What to Improve:**
1. Reduce filler words (especially "um")
2. Add more smiles during technical explanations
3. Pause after key statistics for emphasis

**Score: 18/25 - Competent Speaker**
You're delivering solid content with room for refinement.

HINGLISH-SPECIFIC ANALYSIS

This skill is calibrated for Hinglish content:

Feature	What It Checks
Code-switching	Natural Hindi ↔ English transitions
Pace adjustment	Slower for English technical terms
Cultural markers	Use of "ji", "beta", "aapko bata doon"
Engagement phrases	"Dekho", "Suniye", "Samjhe?"
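
One way such phrase checks could work is a whole-word count over the transcript. The sketch below is an illustration only: it assumes the transcript is romanized (Latin-script) text, and the function name `count_phrases` is hypothetical. The phrase lists are taken from the table above.

```python
import re
from collections import Counter

# Phrases from the table above; matching is case-insensitive.
ENGAGEMENT_PHRASES = ["dekho", "suniye", "samjhe"]
CULTURAL_MARKERS = ["ji", "beta", "aapko bata doon"]


def count_phrases(transcript, phrases):
    """Count whole-word occurrences of each phrase in a romanized transcript."""
    text = transcript.lower()
    counts = Counter()
    for phrase in phrases:
        # \b keeps short markers such as "ji" from matching inside longer words
        pattern = r"\b" + re.escape(phrase) + r"\b"
        counts[phrase] = len(re.findall(pattern, text))
    return counts


sample = "Dekho ji, statins are safe. Samjhe? Aapko bata doon, dekho, dose matters."
print(count_phrases(sample, ENGAGEMENT_PHRASES))
print(count_phrases(sample, CULTURAL_MARKERS))
```

A plain count like this says nothing about whether a code-switch sounds natural; that judgment would still rest with the Claude analysis step.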

COMPARING OVER TIME

Track your improvement across recordings:

┌─────────────────────────────────────────────────────┐
│  PROGRESS TRACKER (Last 5 Videos)                   │
├─────────────────────────────────────────────────────┤
│  Video           │ Score │ Main Improvement         │
│  ─────────────────────────────────────────────────  │
│  Jan 10          │ 15/25 │ Baseline                 │
│  Jan 15          │ 18/25 │ Better eye contact       │
│  Jan 20          │ 17/25 │ Fewer filler words       │
│  Jan 25          │ 19/25 │ More varied pace         │
│  Jan 30          │ 21/25 │ Natural Hinglish flow    │
└─────────────────────────────────────────────────────┘
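
The tracker above boils down to comparing each score with the previous one and with the baseline. A minimal sketch, assuming the history is kept as a list of (label, score) pairs; that layout and the function name `score_changes` are hypothetical, while the scores are copied from the table.

```python
# Scores copied from the tracker above; the data layout is an assumption.
history = [
    ("Jan 10", 15),
    ("Jan 15", 18),
    ("Jan 20", 17),
    ("Jan 25", 19),
    ("Jan 30", 21),
]


def score_changes(entries):
    """Return (label, score, change vs previous, change vs baseline) rows."""
    if not entries:
        return []
    rows = []
    baseline = entries[0][1]
    previous = None
    for label, score in entries:
        step = 0 if previous is None else score - previous
        rows.append((label, score, step, score - baseline))
        previous = score
    return rows


for label, score, step, total in score_changes(history):
    print(f"{label}: {score}/25  ({step:+d} vs previous, {total:+d} vs baseline)")
```

Reporting both differences makes a dip such as Jan 20 (one point below the previous video, still two above the baseline) easier to read in context.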

INTEGRATION

With Your Workflow

Record Video → Analyze with video-delivery-coach → Fix issues → Re-record (optional) → Publish

Feeds Into:

youtube-script-master - Script adjustments based on delivery feedback
Personal improvement tracking

DEPENDENCIES

bash

# Core (required)
pip install anthropic python-dotenv rich

# Voice analysis
pip install librosa moviepy faster-whisper

# Facial analysis (optional - for full analysis)
pip install opencv-python mediapipe deepface tf-keras

# Note: tf-keras is heavy (~500MB). Skip for voice-only mode.

API KEYS NEEDED

Key	Purpose	Status
ANTHROPIC_API_KEY	Final analysis and coaching	Already have

MODES

Voice-Only Mode (Lightweight)

bash

python scripts/analyze_video.py --video file.mp4 --voice-only

Requires: librosa, moviepy, faster-whisper
Analyzes: speech rate, pitch, volume, transcription, filler words
Skips: facial analysis (faster, lighter)
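
Speech rate is the simplest of these metrics to reason about. The sketch below assumes it is computed as transcript words divided by total recording minutes, pauses included; the skill's script may define it differently. The function names are hypothetical, and the 120-160 WPM target comes from the sample reports in this document.

```python
TARGET_WPM = (120, 160)  # target range shown in the sample reports


def speech_rate_wpm(transcript, duration_seconds):
    """Words per minute over the whole recording, pauses included."""
    if duration_seconds <= 0:
        raise ValueError("duration must be positive")
    word_count = len(transcript.split())  # whitespace-separated tokens
    return word_count / (duration_seconds / 60)


def assess_rate(wpm, target=TARGET_WPM):
    """Classify a speech rate against the target range."""
    low, high = target
    if wpm < low:
        return "slow"
    if wpm > high:
        return "fast"
    return "in range"


# 290 words spoken over 2 minutes
transcript = " ".join(["word"] * 290)
wpm = speech_rate_wpm(transcript, duration_seconds=120)
print(wpm, assess_rate(wpm))  # 145.0 in range
```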

Full Mode (Comprehensive)

bash

python scripts/analyze_video.py --video file.mp4 --full

Requires: all dependencies, including OpenCV, DeepFace, and MediaPipe
Analyzes: everything, including facial expressions
Trade-off: slower but complete

HOW CLAUDE SHOULD USE THIS SKILL

When the user asks to analyze a video:

Step 1: Check if video file exists

python

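import os
import sys

video_path = "/path/to/video.mp4"  # path supplied by the user

# Exit early with a clear message if the file is missing
if not os.path.isfile(video_path):
    sys.exit(f"Video file not found: {video_path}")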

Step 2: Run analysis

bash

python scripts/analyze_video.py --video "/path/to/video.mp4"
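
Steps 1 and 2 can also be driven from Python. This is a sketch under stated assumptions: it uses only the flags shown under CLI Mode, assumes `--output` can be combined with a mode flag, and assumes the script prints its report to stdout. The helper names `build_command` and `run_analysis` are hypothetical.

```python
import subprocess
import sys
from pathlib import Path


def build_command(video, mode="basic", output_dir=None):
    """Assemble the analyze_video.py call from the flags shown under CLI Mode."""
    cmd = [sys.executable, "scripts/analyze_video.py", "--video", str(video)]
    if mode == "full":
        cmd.append("--full")
    elif mode == "voice-only":
        cmd.append("--voice-only")
    elif mode != "basic":
        raise ValueError(f"unknown mode: {mode}")
    if output_dir is not None:
        # Assumption: --output may be combined with a mode flag
        cmd += ["--output", str(output_dir)]
    return cmd


def run_analysis(video, mode="basic", output_dir=None):
    """Run the script and return its stdout (assumed to hold the report)."""
    video = Path(video)
    if not video.is_file():
        raise FileNotFoundError(f"Video file not found: {video}")
    result = subprocess.run(
        build_command(video, mode, output_dir),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout


print(build_command("/path/to/video.mp4", mode="voice-only")[1:])
# ['scripts/analyze_video.py', '--video', '/path/to/video.mp4', '--voice-only']
```

Passing the command as a list instead of a shell string avoids quoting problems with paths that contain spaces.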

Step 3: Present results

Show quick summary first
Offer detailed breakdown if requested
Provide actionable recommendations

Step 4: Track progress

Compare with previous analyses
Note improvements
Identify persistent issues
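
"Persistent" can be made concrete as "flagged in several of the most recent analyses". The sketch below is one possible definition, not the skill's own logic: the function name, the `window` and `min_hits` parameters, and the issue labels are all hypothetical.

```python
from collections import Counter


def persistent_issues(analyses, window=3, min_hits=2):
    """Issues flagged in at least `min_hits` of the last `window` analyses."""
    recent = analyses[-window:]
    counts = Counter()
    for issues in recent:
        counts.update(set(issues))  # count each issue once per video
    return sorted(issue for issue, hits in counts.items() if hits >= min_hits)


# One list of flagged issues per analyzed video, oldest first
past = [
    ["filler words", "low eye contact"],
    ["filler words", "few smiles"],
    ["few smiles"],
    ["filler words", "few smiles"],
]
print(persistent_issues(past))  # ['few smiles', 'filler words']
```

Here "low eye contact" drops out because it appears only in the oldest video, which falls outside the three-video window.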

SAMPLE OUTPUT

=== VIDEO DELIVERY ANALYSIS ===
File: hinglish_statin_video.mp4
Duration: 15:23

VOICE METRICS:
├── Speech Rate: 138 WPM (Target: 120-160) ✅
├── Pitch Variation: 38.5 Hz ✅ Natural variation
└── Volume: Consistent ✅

FILLER WORDS:
├── "um": 12 occurrences
├── "basically": 8 occurrences
└── "you know": 5 occurrences

FACIAL METRICS:
├── Eye Contact: 68% ✅ Good
├── Smiles: 28% ⚠️ Below target (40%)
└── Dominant Emotion: Engaged

CONTENT SCORE:
├── Content & Organization: 4/5
├── Delivery & Vocal Quality: 3/5
├── Body Language: 4/5
├── Engagement: 4/5
└── Language & Clarity: 4/5

TOTAL: 19/25 (Proficient Speaker)

TOP 3 IMPROVEMENTS:
1. Replace "um" with pauses
2. Smile more during technical explanations
3. Slow down slightly when explaining statistics

HINGLISH NOTES:
✅ Natural code-switching
✅ Good use of "aapko batata hoon"
⚠️ Consider more "samjhe?" checks for engagement

NOTES

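Privacy: Audio and video processing runs locally and the video file is never uploaded; the final Claude analysis step calls the Anthropic API (see API KEYS NEEDED)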
Speed: Voice-only takes ~1 min, full analysis takes ~3-5 min
File types: Supports MP4, MOV, AVI, MKV
Duration: Works best with 5-30 minute videos
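
The file-type and duration notes above can be turned into a quick pre-flight check. A minimal sketch: the function name `input_warnings` is hypothetical, the duration is assumed to be known already, and the limits are the ones listed in the notes.

```python
from pathlib import Path

SUPPORTED = {".mp4", ".mov", ".avi", ".mkv"}  # file types listed above
BEST_MINUTES = (5, 30)  # duration range listed above


def input_warnings(video_path, duration_minutes):
    """Return a list of warnings; an empty list means the input looks fine."""
    warnings = []
    suffix = Path(video_path).suffix.lower()
    if suffix not in SUPPORTED:
        warnings.append(f"unsupported file type: {suffix or 'none'}")
    low, high = BEST_MINUTES
    if not low <= duration_minutes <= high:
        warnings.append(
            f"duration {duration_minutes} min is outside the {low}-{high} min range"
        )
    return warnings


print(input_warnings("recording_2025_01_15.mp4", 12.5))  # []
print(input_warnings("talk.webm", 45))
```

The duration check is a warning and not an error because the notes describe 5-30 minutes as the range that works best, not as a hard limit.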

This skill helps you improve your delivery over time - not by judging, but by giving you objective data to work with.

Maintainer

drshailesh88 Core maintainer

Source details

Full Name: drshailesh88/integrated_content_OS
Branch: main
Path in repo: skills/cardiology/video-delivery-coach
