Agent skills
research-paper-extractor

Agent skill

research-paper-extractor

Stars 2

Forks 0

Install this agent skill to your Project

npx add-skill https://github.com/drshailesh88/integrated_content_OS/tree/main/skills/cardiology/research-paper-extractor

SKILL.md

Research Paper Extractor

Extract text from cardiology research paper PDFs - FREE, runs locally.

Cost: ZERO

Text extraction: pdfplumber (free, local)
Structuring: You ask me (Claude) in this conversation - you're already paying for the subscription

No API calls. No extra costs.

How It Works

STEP 1: Extract text (free, local)
python scripts/extract_paper.py trial.pdf --output trial.md

STEP 2: Ask Claude (your existing subscription)
"Read trial.md and structure this for my content workflow"

DONE - No extra cost.

Quick Start

Install (one time)

bash

pip3 install pdfplumber

Extract text from PDF

bash

# Save to file
python scripts/extract_paper.py paper.pdf --output extracted.md

# Just first 5 pages (faster)
python scripts/extract_paper.py paper.pdf --pages 5 --output extracted.md

Then ask Claude Code

After extracting, just tell me:

"Read /path/to/extracted.md and give me:

Study design, population, intervention

Primary/secondary endpoints with HR, CI, p-values

Safety data and conclusions

Content angles for YouTube, Twitter, Newsletter"

I'll structure it for your content workflow.

Example Workflow

bash

# 1. Download PDF from NEJM/JACC/Lancet

# 2. Extract text
python scripts/extract_paper.py ~/Downloads/declare-timi-58.pdf --output declare.md

# 3. In Claude Code:
#    "Read declare.md and structure the trial data.
#     Give me content angles for my YouTube channel."

Output you'll get from me:

DECLARE-TIMI 58 Summary:

Study: RCT, N=17,160, T2DM with CV risk
Intervention: Dapagliflozin 10mg vs placebo
Duration: 4.2 years median follow-up

Primary (MACE): HR 0.93 (0.84-1.03), p=0.17 - Non-inferior, not superior
Key Secondary (CV death/HF hosp): HR 0.83 (0.73-0.95), p=0.005 ✓

Content Angles:
🎬 YouTube: "SGLT2 inhibitors: The HF story hidden in a 'negative' trial"
🐦 Twitter: "DECLARE: Primary endpoint NS, but NNT 111 for HF hosp. Bury the lede much?"
📧 Newsletter: "Why 'negative' trials often have positive stories"

Why This Approach?

Approach	Cost
❌ Anthropic API per extraction	~$0.05-0.15 per paper
❌ OpenAI API per extraction	~$0.05-0.20 per paper
✅ This approach	$0 - uses your subscription

You're already paying for Claude Code. Use it.

Integration with Your Skills

After I structure the data, you can use it with:

cardiology-trial-editorial → Write 500-word editorial
x-post-creator-skill → Generate tweets with accurate stats
youtube-script-master → Script with verified data
cardiology-newsletter-writer → Deep dive newsletter

Limitations

Works best with native PDFs (not scanned images)
Very long papers: use --pages 10 to extract key sections
Tables may need manual review

Zero cost. Maximum utility. Uses what you already pay for.

Maintainer

drshailesh88 Core maintainer

Source details

Full Name: drshailesh88/integrated_content_OS
Branch: main
Path in repo: skills/cardiology/research-paper-extractor

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

drshailesh88/integrated_content_OS

pufferlib

This skill should be used when working with reinforcement learning tasks including high-performance RL training, custom environment development, vectorized parallel simulation, multi-agent systems, or integration with existing RL environments (Gymnasium, PettingZoo, Atari, Procgen, etc.). Use this skill for implementing PPO training, creating PufferEnv environments, optimizing RL performance, or developing policies with CNNs/LSTMs.

2 0

Explore

drshailesh88/integrated_content_OS

fluidsim

Framework for computational fluid dynamics simulations using Python. Use when running fluid dynamics simulations including Navier-Stokes equations (2D/3D), shallow water equations, stratified flows, or when analyzing turbulence, vortex dynamics, or geophysical flows. Provides pseudospectral methods with FFT, HPC support, and comprehensive output analysis.

2 0

Explore

drshailesh88/integrated_content_OS

metabolomics-workbench-database

Access NIH Metabolomics Workbench via REST API (4,200+ studies). Query metabolites, RefMet nomenclature, MS/NMR data, m/z searches, study metadata, for metabolomics and biomarker discovery.

2 0

Explore

drshailesh88/integrated_content_OS

geniml

This skill should be used when working with genomic interval data (BED files) for machine learning tasks. Use for training region embeddings (Region2Vec, BEDspace), single-cell ATAC-seq analysis (scEmbed), building consensus peaks (universes), or any ML-based analysis of genomic regions. Applies to BED file collections, scATAC-seq data, chromatin accessibility datasets, and region-based genomic feature learning.

2 0

Explore

drshailesh88/integrated_content_OS

zinc-database

Access ZINC (230M+ purchasable compounds). Search by ZINC ID/SMILES, similarity searches, 3D-ready structures for docking, analog discovery, for virtual screening and drug discovery.

2 0

Explore

drshailesh88/integrated_content_OS

astropy

Comprehensive Python library for astronomy and astrophysics. This skill should be used when working with astronomical data including celestial coordinates, physical units, FITS files, cosmological calculations, time systems, tables, world coordinate systems (WCS), and astronomical data analysis. Use when tasks involve coordinate transformations, unit conversions, FITS file manipulation, cosmological distance calculations, time scale conversions, or astronomical data processing.

2 0

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Research Paper Extractor

Cost: ZERO

How It Works

Quick Start

Install (one time)

Extract text from PDF

Then ask Claude Code

Example Workflow

Why This Approach?

Integration with Your Skills

Limitations

Recommended Agent Skills

pufferlib

fluidsim

metabolomics-workbench-database

geniml

zinc-database

astropy