Agent skills
ensemble-content-scorer

Agent skill

ensemble-content-scorer

Multi-model consensus scoring for content ideas. Scores the same idea with Claude, GPT-4o, Gemini, and Grok in parallel, then aggregates for a balanced verdict. Reduces single-model bias and improves viral predictions.

View SKILL.md on GitHub Repository

Stars 2

Forks 0

Install this agent skill to your Project

npx add-skill https://github.com/drshailesh88/integrated_content_OS/tree/main/skills/cardiology/ensemble-content-scorer

SKILL.md

Ensemble Content Scorer

Wisdom of crowds, but for AI. This skill scores your content ideas using multiple AI models, then aggregates for consensus. More reliable than single-model predictions.

WHAT IT DOES

                Content Idea
                     │
    ┌────────────────┼────────────────┐
    │                │                │
    ▼                ▼                ▼
[Claude]        [GPT-4o]         [Gemini]
  Score            Score            Score
    │                │                │
    └────────────────┼────────────────┘
                     │
                     ▼
            [Aggregator (Claude)]
                     │
                     ▼
         Consensus Score + Verdict

WHY MULTI-MODEL?

Single Model	Ensemble
May have biases	Biases cancel out
One perspective	Multiple perspectives
Black box score	Transparent reasoning
May miss nuances	Catches different angles

TRIGGERS

Use this skill when you say:

"Score this content idea"
"Is this topic worth pursuing?"
"Rate my video concept"
"Predict if this will go viral"
"Ensemble score: [topic]"

USAGE

In Claude Code (Recommended)

"Ensemble score: Statins myth-busting for Indian audience"

"Score this video idea: Why your LDL target depends on your risk"

"Rate these ideas and rank them:
1. GLP-1 agonists explained
2. Heart attack warning signs
3. Is coconut oil heart-healthy?"

CLI Mode

bash

# Score single idea
python scripts/score_content.py --idea "Statins myth-busting for Indian audience"

# Score multiple ideas
python scripts/score_content.py --ideas "GLP-1 explained" "Statin myths" "CAC scoring"

# Use specific models
python scripts/score_content.py --idea "Topic" --models claude,gpt4o,gemini

SCORING DIMENSIONS

Each model scores on these dimensions (1-10):

Dimension	What It Measures
Relevance	How relevant to target audience (Indian patients/doctors)
Novelty	How fresh is the angle? Been covered before?
Expertise Match	Does it match your expertise as interventional cardiologist?
Engagement Potential	Will it capture and hold attention?
Share-ability	Will people share this? Controversy potential?
Evergreen Factor	Will this be relevant in 6 months?

Total Score: 0-60

OUTPUT FORMAT

markdown

# ENSEMBLE CONTENT SCORE

**Idea:** Statins myth-busting for Indian audience - why most "side effects" aren't real

**Date:** 2025-01-01

---

## INDIVIDUAL MODEL SCORES

### Claude (Anthropic)
| Dimension | Score | Reasoning |
|-----------|-------|-----------|
| Relevance | 9/10 | High - statins widely prescribed in India, misinformation common |
| Novelty | 7/10 | Topic covered before, but Indian-specific angle is fresher |
| Expertise | 9/10 | Perfect for interventional cardiologist |
| Engagement | 8/10 | Controversial enough to spark discussion |
| Shareability | 8/10 | Will trigger debates |
| Evergreen | 9/10 | Statin myths persist |
| **Total** | **50/60** | |

### GPT-4o (OpenAI)
| Dimension | Score | Reasoning |
|-----------|-------|-----------|
| Relevance | 9/10 | Very relevant for Indian audience |
| Novelty | 6/10 | Many statin videos exist |
| Expertise | 10/10 | Perfect fit |
| Engagement | 9/10 | Myth-busting format works |
| Shareability | 8/10 | Good controversy factor |
| Evergreen | 8/10 | Will stay relevant |
| **Total** | **50/60** | |

### Gemini (Google)
| Dimension | Score | Reasoning |
|-----------|-------|-----------|
| Relevance | 8/10 | Good for health-conscious Indians |
| Novelty | 7/10 | Indian angle adds freshness |
| Expertise | 9/10 | Great fit |
| Engagement | 7/10 | Educational more than viral |
| Shareability | 7/10 | Moderate share potential |
| Evergreen | 9/10 | Long-lasting relevance |
| **Total** | **47/60** | |

---

## CONSENSUS SCORE

| Model | Total Score |
|-------|-------------|
| Claude | 50/60 |
| GPT-4o | 50/60 |
| Gemini | 47/60 |
| **Average** | **49/60 (81.7%)** |
| **Std Dev** | 1.7 (High Consensus) |

---

## VERDICT

🟢 **STRONG PURSUE** (Score: 49/60, Consensus: High)

All models agree this is a strong content idea. The combination of:
- High relevance to your audience
- Perfect expertise match
- Good controversy factor
- Evergreen potential

Makes this a priority topic for your content calendar.

---

## RECOMMENDATIONS

1. **Angle Enhancement**: Focus on the "nocebo effect" - most statin "side effects" are psychosomatic
2. **Hook Suggestion**: "90% of statin side effects aren't real - here's the data"
3. **Format**: 12-15 minute deep dive with studies
4. **Hinglish Tip**: Use "side effect ka drama" for relatability

---

## DISSENT ANALYSIS

- **Gemini** scored lower on engagement (7 vs 8-9)
- Suggests: May need stronger hook to maximize viral potential
- Consider: Adding patient testimonial or counter-narrative

SCORING TIERS

Score Range	Verdict	Action
50-60	🟢 STRONG PURSUE	High priority, create immediately
40-49	🟡 WORTH PURSUING	Good idea, add to calendar
30-39	🟠 NEEDS REFINEMENT	Has potential, needs angle work
20-29	🔴 RECONSIDER	Weak idea, low priority
0-19	⛔ SKIP	Not worth the effort

CONSENSUS INTERPRETATION

Std Deviation	Interpretation
< 3	High consensus - models agree
3-5	Moderate consensus - some disagreement
> 5	Low consensus - divisive idea (may be worth exploring!)

INTEGRATION

Enhances:

viral-content-predictor - More reliable predictions
youtube-script-master - Validate topics before scripting
content-repurposer - Know which content to repurpose

Workflow:

Idea Generation → Ensemble Score → [High Score?] → Create Content
                         ↓
                   [Low Score?] → Refine or Skip

MODELS USED

Model	Provider	Cost	Notes
Claude Sonnet	Anthropic	Subscription	Your primary
GPT-4o	OpenAI	API	Strong analysis
Gemini Pro	Google	FREE	Good for fact-checking
Grok	xAI	API	Twitter trend awareness

Minimum required: 2 models (Claude + one other) Recommended: 3+ models for robust consensus

DEPENDENCIES

python

anthropic>=0.18.0
openai>=1.0.0           # For GPT-4o
google-generativeai>=0.3.0  # For Gemini
python-dotenv>=1.0.0
rich>=13.0.0

API KEYS NEEDED

Key	Purpose	Status
ANTHROPIC_API_KEY	Claude	Already have
OPENAI_API_KEY	GPT-4o	Already have
GOOGLE_API_KEY	Gemini	Already have
XAI_API_KEY	Grok (optional)	Already have

BATCH SCORING

For scoring multiple ideas at once:

bash

python scripts/score_content.py --batch \
    --ideas "GLP-1 for heart failure" \
            "Statin myth-busting" \
            "CAC scoring guide" \
            "Why LDL matters" \
            "Exercise for heart health"

Output:

| Rank | Idea | Score | Verdict |
|------|------|-------|---------|
| 1 | Statin myth-busting | 49/60 | 🟢 STRONG PURSUE |
| 2 | GLP-1 for heart failure | 45/60 | 🟡 WORTH PURSUING |
| 3 | CAC scoring guide | 42/60 | 🟡 WORTH PURSUING |
| 4 | Why LDL matters | 38/60 | 🟠 NEEDS REFINEMENT |
| 5 | Exercise for heart health | 35/60 | 🟠 NEEDS REFINEMENT |

NOTES

Speed: ~30 seconds for single idea (parallel API calls)
Cost: Minimal - short prompts to each model
Reliability: Consensus typically more accurate than single model
When to ignore: If YOU have strong conviction, trust your expertise

This skill helps you invest your time in content that's more likely to succeed.

Maintainer

drshailesh88 Core maintainer

Source details

Full Name: drshailesh88/integrated_content_OS
Branch: main
Path in repo: skills/cardiology/ensemble-content-scorer

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

drshailesh88/integrated_content_OS

pufferlib

This skill should be used when working with reinforcement learning tasks including high-performance RL training, custom environment development, vectorized parallel simulation, multi-agent systems, or integration with existing RL environments (Gymnasium, PettingZoo, Atari, Procgen, etc.). Use this skill for implementing PPO training, creating PufferEnv environments, optimizing RL performance, or developing policies with CNNs/LSTMs.

2 0

Explore

drshailesh88/integrated_content_OS

fluidsim

Framework for computational fluid dynamics simulations using Python. Use when running fluid dynamics simulations including Navier-Stokes equations (2D/3D), shallow water equations, stratified flows, or when analyzing turbulence, vortex dynamics, or geophysical flows. Provides pseudospectral methods with FFT, HPC support, and comprehensive output analysis.

2 0

Explore

drshailesh88/integrated_content_OS

metabolomics-workbench-database

Access NIH Metabolomics Workbench via REST API (4,200+ studies). Query metabolites, RefMet nomenclature, MS/NMR data, m/z searches, study metadata, for metabolomics and biomarker discovery.

2 0

Explore

drshailesh88/integrated_content_OS

geniml

This skill should be used when working with genomic interval data (BED files) for machine learning tasks. Use for training region embeddings (Region2Vec, BEDspace), single-cell ATAC-seq analysis (scEmbed), building consensus peaks (universes), or any ML-based analysis of genomic regions. Applies to BED file collections, scATAC-seq data, chromatin accessibility datasets, and region-based genomic feature learning.

2 0

Explore

drshailesh88/integrated_content_OS

zinc-database

Access ZINC (230M+ purchasable compounds). Search by ZINC ID/SMILES, similarity searches, 3D-ready structures for docking, analog discovery, for virtual screening and drug discovery.

2 0

Explore

drshailesh88/integrated_content_OS

astropy

Comprehensive Python library for astronomy and astrophysics. This skill should be used when working with astronomical data including celestial coordinates, physical units, FITS files, cosmological calculations, time systems, tables, world coordinate systems (WCS), and astronomical data analysis. Use when tasks involve coordinate transformations, unit conversions, FITS file manipulation, cosmological distance calculations, time scale conversions, or astronomical data processing.

2 0

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Ensemble Content Scorer

WHAT IT DOES

WHY MULTI-MODEL?

TRIGGERS

USAGE

In Claude Code (Recommended)

CLI Mode

SCORING DIMENSIONS

OUTPUT FORMAT

SCORING TIERS

CONSENSUS INTERPRETATION

INTEGRATION

Enhances:

Workflow:

MODELS USED

DEPENDENCIES

API KEYS NEEDED

BATCH SCORING

NOTES

Recommended Agent Skills

pufferlib

fluidsim

metabolomics-workbench-database

geniml

zinc-database

astropy